Solved: We are about to run a query on this emp table to find the, C/C++ Programming

Assume a table Emp(ssn, name, salary) of employee records, where ssn is the primary key. The total size of the table is 34,560MB. The table (i.e., the records of the table) is stored in a heap file in 2KB pages (all full of records) on a single disk drive. In all questions below that involve indices, assume that the number of leaf pages for B-trees and the number of buckets for hash indices are both equal to the number of pages required to store the table in the heap.
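The page count that every later cost estimate depends on follows directly from the two sizes above. A minimal sketch, assuming the simplification 1 MB = 1,000 KB that the question allows; the function name heap_pages and its parameters are illustrative, not part of the original problem:

```python
# Sizes taken from the problem statement.
TABLE_SIZE_MB = 34_560
PAGE_SIZE_KB = 2


def heap_pages(table_size_mb: int, page_size_kb: int, kb_per_mb: int = 1_000) -> int:
    """Return the number of full pages needed to store the table.

    kb_per_mb defaults to 1,000 (the simplification the question allows);
    pass 1_024 to see the effect of binary megabytes instead.
    """
    total_kb = table_size_mb * kb_per_mb
    return total_kb // page_size_kb


if __name__ == "__main__":
    print(heap_pages(TABLE_SIZE_MB, PAGE_SIZE_KB))  # 17280000
```

Under the same assumption, this is also the number of B-tree leaf pages and hash buckets, since the problem sets them equal to the number of heap pages.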

(a) We are about to run a query on this Emp table to find the name of the employee with a given ssn, say 1000; i.e., in SQL, "select name from Emp where ssn=1000". In the worst case, how long will this operation take? Express your answer both in number of disk accesses (I/Os) and in hours. The disk drive has the following characteristics: average seek time is 8 msecs, average rotational delay is 1 msec, and average transfer time is 1 msec per 2KB block, so the total time to locate and transfer a disk block of data is 10 msecs.
(b) Assume that in addition to storing the table as described above, we also have available a B-tree built for this table on ssn, which is the search key of the B-tree. A data entry in the tree is a pair (ssn, RID of a data record in the heap). What is the approximate cost (in number of disk accesses) of executing the query in (a) if we use the B-tree index?
(c) Now assume that we have a hash index on ssn for the Emp table. A data entry in the hash index is a pair (ssn, RID of a data record in the heap). What is the approximate cost (in number of disk accesses) of executing the query in (a) if we use the hash index?
(d) Now, consider this query: find the maximum salary in the Emp table; i.e., in SQL, "select max(salary) from Emp". Assuming no indexing of any kind, i.e., we just have the records of the table in the heap, what is the cost of this query (in number of disk accesses)?
(e) Assume we have available a B-tree on Emp.salary. A data entry in the tree is a pair (salary, RID of a data record in the heap). What is the approximate cost (in number of disk accesses) of executing the query in (d) if we use this B-tree index?
(f) Assume we have available a hash index on Emp.salary. A data entry in the hash index is a pair (salary, RID of a data record in the heap). Is the hash index useful for computing the query in (d)? If not, explain why. If yes, i.e., you think it is better to use the hash index instead of the heap, explain how you would perform the search and give the approximate cost (in number of disk accesses) of executing the query with the hash index.
(g) Now, consider this update: insert a new employee record (1000, "mike", 100). Assuming no indexing of any kind, i.e., we just have the records of the table in the heap, what is the approximate cost of this operation (in number of disk accesses)?
(h) For the operation in (g), and assuming there is a B-tree as described in (e), what is the approximate cost (in number of disk accesses) of executing the operation if we use the B-tree?
(i) For the operation in (g), and assuming there is a hash index as described in (f), what is the approximate cost (in number of disk accesses) of executing the operation if we use the hash index?
(j) For questions (b) and (c): what is the approximate cost (in number of disk accesses) of executing the query in (a) with the corresponding index if the indices were built following Alternative 1, where the records are stored in the index itself instead of in the heap file?

Note

Assume the cost of everything else besides disk accesses is negligible.
For question (a), to simplify calculations, assume 1 MB = 1,000 KB.
Your answers to cost-related questions must be plain numbers (e.g., 5, 90, 90.56) that include no formulas or computation of any kind. For example, the following types of answers will automatically get zero with no further consideration: log base 2 of some N; cube of N square divided by log base 2 of N cube multiplied by log base 10 N; big O(log(N)); etc. However, you must explain how you arrived at your final (number) answer by indicating the steps.
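As a worked illustration of the kind of step-by-step arithmetic the notes ask for, the sketch below estimates the cost of reading every heap page once, which is what an unindexed search for a single ssn needs in the worst case and what an unindexed max(salary) always needs. It assumes 1 MB = 1,000 KB and charges every page read the full 10 msecs given in the problem; the names full_scan_cost and block_access_ms are illustrative only, and this is not the paywalled expert solution:

```python
# Figures from the problem statement.
TABLE_SIZE_MB = 34_560
PAGE_SIZE_KB = 2
KB_PER_MB = 1_000          # simplification allowed for question (a)

SEEK_MS = 8                # average seek time
ROTATIONAL_DELAY_MS = 1    # average rotational delay
TRANSFER_MS = 1            # transfer time for one 2KB block


def block_access_ms() -> int:
    """Time to locate and transfer one block: seek + rotation + transfer."""
    return SEEK_MS + ROTATIONAL_DELAY_MS + TRANSFER_MS


def full_scan_cost(num_pages: int, ms_per_page: int) -> tuple[int, float]:
    """Return (disk accesses, hours) for reading every page exactly once."""
    total_ms = num_pages * ms_per_page
    hours = total_ms / (1_000 * 3_600)  # msecs -> seconds -> hours
    return num_pages, hours


if __name__ == "__main__":
    pages = TABLE_SIZE_MB * KB_PER_MB // PAGE_SIZE_KB
    ios, hours = full_scan_cost(pages, block_access_ms())
    print(f"{ios} I/Os, {hours} hours")  # 17280000 I/Os, 48.0 hours
```

The index-based questions cannot be computed this way from the given data alone, because the problem does not state the size of an index entry and therefore the B-tree fan-out; any height estimate has to state that assumption explicitly.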

Reference No:- TGS01478327
