class: center, middle, inverse, title-slide # Database Querying with SQL ## Searching efficiently ### Ben Baumer ### SDS 192April 22nd, 2020(
http://beanumber.github.io/sds192/lectures/mdsr_sql_05-keys.html
) --- class: center, middle, inverse # Keys and Indexes --- background-image: url(http://static.libsyn.com/p/assets/c/2/1/b/c21be1062e52949b/Unused-Indexes-vs-Foreign-Keys.png) background-size: contain --- ## Keys and Indexes .footnote[https://en.wikipedia.org/wiki/Primary_key] - Keys: - `PRIMARY KEY`: unique, non-`NULL`, only one per table - `UNIQUE KEY`: unique, may include `NULL` - [`FOREIGN KEY`](https://en.wikipedia.org/wiki/Foreign_key): references primary key in another table -- - Indexes: - No constraints, just about speed - Take up space on disk - Will they be used? -- - A `PRIMARY KEY` is always indexed --- background-image: url(https://i.imgur.com/pDq0n.png) background-size: contain --- ## A DB without indexes is like Dory .center[data:image/s3,"s3://crabby-images/b5b83/b5b83ac1c868031618f2411087bf947299ffdaec" alt=""] --- ## An index is a lookup table .center[data:image/s3,"s3://crabby-images/7bbd8/7bbd8b50b653c2298d0c3ed75b5a1f66281631f7" alt=""] .footnote[https://en.wikipedia.org/wiki/Database_index] --- ## Consider a book index... .pull-left[ data:image/s3,"s3://crabby-images/d8547/d8547a89e7f5c23a8432665f51b874df2de62f10" alt="" ] -- .pull-right[ - Helps us avoid checking each row (i.e., [table scan](https://en.wikipedia.org/wiki/Full_table_scan)) - Takes time to build! - Good index will reduce search time from `\(O(n)\)` to `\(O(log(n))\)` .footnote[https://en.wikipedia.org/wiki/Big_O_notation] ] --- ## What would be a good index for this class? .pull-left[ data:image/s3,"s3://crabby-images/b9596/b95965f8257ec5759d89041573007ed3b557cf21" alt="" ] .pull-right[ - 60 students - last name? - home institution? - first name? - class year? ] --- ## So why not just build indexes on all the columns? .pull-left[ data:image/s3,"s3://crabby-images/ed56c/ed56c9f7eb825df86b007f14ea0a1fb676074e26" alt="" ] .pull-right[ - Takes up space on disk - Takes time to build - Slows down inserts - Not always that big of an improvement - What if index is length `\(n\)`? Or 1? ] --- background-image: url(http://static.libsyn.com/p/assets/c/2/1/b/c21be1062e52949b/Unused-Indexes-vs-Foreign-Keys.png) background-size: contain