MVCC (Multiversion Concurrency Control):
Transaction Isolation Levels:
Row-Level Locking:
Common Use Cases:
\list                   -- list databases
\l                      -- shorthand for \list
\connect database_name  -- connect to a database
\dt                     -- list tables in the current database
\d table                -- describe a table (columns, indexes, constraints)
\di                     -- list indexes
MVCC is PostgreSQL’s approach to handling concurrent transactions without traditional locking. Instead of locking rows, PostgreSQL creates multiple versions of each row, allowing readers and writers to work simultaneously without blocking each other.
Core Concept: When a row is updated or deleted, PostgreSQL doesn’t immediately remove the old version. Instead, it creates a new version while keeping the old one, allowing concurrent transactions to see the appropriate version based on their transaction snapshot.
Key Components:
Tuple header fields:
- xmin: Transaction ID that created this version
- xmax: Transaction ID that deleted/updated this version (0 if still current)
- cmin/cmax: Command IDs within the transaction
- ctid: Physical location (page, offset)

Snapshot fields:
- xmin: Oldest active transaction
- xmax: First not-yet-assigned transaction ID
- xip_list: List of active transactions at snapshot time

PostgreSQL determines tuple visibility using these rules:
-- A tuple is visible if:
-- 1. xmin is committed AND xmin < snapshot.xmax AND xmin NOT IN snapshot.xip_list
-- 2. xmax is 0 OR xmax is not committed OR xmax >= snapshot.xmax OR xmax IN snapshot.xip_list
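These two rules transcribe almost directly into code. Below is a deliberately simplified Python sketch (real visibility checking in PostgreSQL also consults hint bits, subtransactions, and the commit log):

```python
# Simplified MVCC visibility check, mirroring the two rules above.
# 'committed' is the set of committed transaction IDs.

def tuple_is_visible(xmin, xmax, committed, snap_xmax, xip_list):
    # Rule 1: the inserting transaction must be visible to the snapshot
    inserted_visible = (xmin in committed
                        and xmin < snap_xmax
                        and xmin not in xip_list)
    if not inserted_visible:
        return False
    # Rule 2: the deleting/updating transaction must NOT be visible
    return (xmax == 0
            or xmax not in committed
            or xmax >= snap_xmax
            or xmax in xip_list)

committed = {100}
snap_xmax, xip = 105, {101}
assert tuple_is_visible(100, 0, committed, snap_xmax, xip)        # committed insert
assert not tuple_is_visible(101, 0, committed, snap_xmax, xip)    # in-progress insert
assert tuple_is_visible(100, 101, committed, snap_xmax, xip)      # deleter in-progress
assert not tuple_is_visible(100, 101, {100, 101}, snap_xmax, set())  # deleter committed
```

The assertions reproduce the session example that follows: a row inserted by an in-progress transaction is invisible, and a deletion only hides the row once its transaction is committed and outside the snapshot.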
Example:
-- Session 1: Transaction ID = 100
BEGIN;
INSERT INTO users (id, name) VALUES (1, 'Alice');
-- Creates tuple: xmin=100, xmax=0
-- Session 2: Transaction ID = 101 (starts before Session 1 commits)
BEGIN;
SELECT * FROM users WHERE id = 1;
-- Returns nothing because xmin=100 is not committed yet
-- Session 1 commits
COMMIT;
-- Session 2 (still in transaction)
SELECT * FROM users WHERE id = 1;
-- Still returns nothing because transaction snapshot was taken before xid=100 committed
-- Session 2 commits and starts new transaction
COMMIT;
BEGIN;
SELECT * FROM users WHERE id = 1;
-- Now returns Alice because xid=100 is in the past and committed
Note: Different transaction isolation levels use these visibility rules differently - Read Committed takes a new snapshot per statement, while Repeatable Read uses a single snapshot for the entire transaction.
When rows are updated or deleted, old versions become “dead tuples” that are invisible to all transactions.
VACUUM Process:
-- Manual vacuum
VACUUM users;
-- Vacuum with detailed info
VACUUM VERBOSE users;
-- Aggressive vacuum: rewrites the entire table to reclaim space
-- (requires an ACCESS EXCLUSIVE lock, blocking reads and writes)
VACUUM FULL users;
-- Analyze table statistics while vacuuming
VACUUM ANALYZE users;
PostgreSQL uses 32-bit transaction IDs, which wrap around after ~4 billion transactions. If not managed, this can lead to data loss as old transactions appear “in the future.”
Why Wraparound is Dangerous:
Transaction IDs are circular:
... → 2 billion → 3 billion → 4 billion → 0 → 1 → 2 → ...
If XID wraps without freezing:
- Old committed data (XID=100) suddenly looks "uncommitted" (XID in future)
- Rows become invisible, causing apparent data loss
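The circular comparison can be modeled in a few lines of Python (an illustrative model, not PostgreSQL's actual C implementation): XID `a` is "in the past" of `b` only when the wrap-around distance from `a` to `b` is less than 2^31, half of the 32-bit space.

```python
# Circular XID comparison: 'a' precedes 'b' only when the modular
# distance from a to b is less than 2^31 (half of the 32-bit ID space).

def xid_precedes(a, b):
    return a != b and (b - a) % (1 << 32) < (1 << 31)

# Normal case: 100 happened before 200
assert xid_precedes(100, 200)

# Wraparound case: an ID just before the wrap still precedes a small ID after it
assert xid_precedes(2**32 - 10, 20)

# Danger: once 'b' is more than 2^31 ahead, XID 100 no longer looks like
# the past -- its rows would appear uncommitted ("in the future")
assert not xid_precedes(100, 100 + 2**31)
```

The last assertion is exactly the wraparound failure mode: without freezing, rows created by old transactions stop being visible.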
Monitoring Wraparound Risk:
-- Check age of oldest unfrozen transaction per database
SELECT datname, age(datfrozenxid) as xid_age,
pg_size_pretty(pg_database_size(datname)) as size
FROM pg_database
ORDER BY age(datfrozenxid) DESC;
-- Warning thresholds:
-- < 200M: Safe (green)
-- 200M-1B: Autovacuum will be aggressive (yellow)
-- 1B-2B: Critical, manual intervention needed (red)
-- > 2B: Emergency, database will shut down (black)
-- Check per-table age (relfrozenxid lives in pg_class, not pg_stat_user_tables)
SELECT n.nspname AS schemaname, c.relname AS tablename,
       age(c.relfrozenxid) AS xid_age,
       pg_size_pretty(pg_total_relation_size(c.oid)) AS size
FROM pg_class c
JOIN pg_namespace n ON n.oid = c.relnamespace
WHERE c.relkind = 'r'
ORDER BY age(c.relfrozenxid) DESC
LIMIT 20;
Prevention Strategies:
-- 1. Ensure autovacuum is running
SHOW autovacuum; -- Should be 'on'
-- 2. Check autovacuum freeze settings
SHOW autovacuum_freeze_max_age; -- Default: 200 million
-- 3. For critical tables, reduce freeze age
ALTER TABLE important_table SET (
autovacuum_freeze_max_age = 100000000, -- 100M instead of 200M
autovacuum_freeze_min_age = 5000000 -- 5M
);
-- 4. Manual freeze for large tables during low-traffic periods
VACUUM FREEZE users;
-- 5. Aggressive vacuum if approaching limits
VACUUM FREEZE VERBOSE large_table;
Emergency Response (if age > 1 billion):
-- 1. Check which tables are problematic (relfrozenxid comes from pg_class)
SELECT n.nspname AS schemaname, c.relname AS tablename, age(c.relfrozenxid)
FROM pg_class c
JOIN pg_namespace n ON n.oid = c.relnamespace
WHERE c.relkind = 'r' AND age(c.relfrozenxid) > 1000000000
ORDER BY age(c.relfrozenxid) DESC;
-- 2. Freeze them immediately (may take hours for large tables)
VACUUM FREEZE VERBOSE problematic_table;
-- 3. Monitor progress (join pg_class for relfrozenxid,
--    pg_stat_user_tables for vacuum statistics)
SELECT s.schemaname, s.relname AS tablename,
       age(c.relfrozenxid) AS xid_age,
       s.n_dead_tup,
       s.last_vacuum,
       s.last_autovacuum
FROM pg_stat_user_tables s
JOIN pg_class c ON c.oid = s.relid
WHERE s.relname = 'problematic_table';
SELECT schemaname, relname AS tablename,
       pg_size_pretty(pg_total_relation_size(relid)) AS size,
       n_dead_tup, n_live_tup,
       ROUND(n_dead_tup * 100.0 / NULLIF(n_live_tup + n_dead_tup, 0), 2) AS dead_ratio
FROM pg_stat_user_tables
WHERE n_dead_tup > 1000
ORDER BY n_dead_tup DESC;
-- For high-write tables
ALTER TABLE users SET (
autovacuum_vacuum_scale_factor = 0.05, -- Vacuum at 5% dead tuples
autovacuum_analyze_scale_factor = 0.02 -- Analyze at 2% changes
);
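These scale factors feed a simple formula documented for autovacuum: a vacuum is triggered once dead tuples exceed `autovacuum_vacuum_threshold + autovacuum_vacuum_scale_factor * reltuples`. A quick sketch of the arithmetic:

```python
# Dead-tuple count at which autovacuum kicks in for a table.
# Defaults: base threshold 50, scale factor 0.2 (20% of the table).

def vacuum_trigger(reltuples, scale_factor=0.2, base_threshold=50):
    return base_threshold + scale_factor * reltuples

# A 1M-row table: default settings wait for ~200k dead tuples, while
# scale_factor=0.05 (as in the ALTER TABLE above) reacts four times sooner.
assert vacuum_trigger(1_000_000) == 200_050
assert vacuum_trigger(1_000_000, scale_factor=0.05) == 50_050
```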
Keep Transactions Short: Long-running transactions prevent VACUUM from cleaning up dead tuples and from advancing the freeze horizon.
SELECT pid, usename, datname, state,
age(backend_xid) as xid_age,
age(backend_xmin) as xmin_age,
now() - xact_start as duration
FROM pg_stat_activity
WHERE backend_xid IS NOT NULL OR backend_xmin IS NOT NULL
ORDER BY GREATEST(age(backend_xid), age(backend_xmin)) DESC;
PostgreSQL implements the four transaction isolation levels defined by the SQL standard, though READ UNCOMMITTED behaves like READ COMMITTED. Understanding MVCC is essential for understanding how these isolation levels work.
Snapshot Timing - The key difference between isolation levels:
Key Insight: Snapshots are taken at the first query, not at BEGIN. Transactions starting with non-query statements (SET, LOCK TABLE) delay snapshot creation until the first actual query.
BEGIN TRANSACTION ISOLATION LEVEL READ UNCOMMITTED;
-- In PostgreSQL, this behaves exactly like READ COMMITTED
-- No dirty reads are possible
Behavior: Identical to READ COMMITTED in PostgreSQL. PostgreSQL doesn’t support dirty reads.
Each statement within a transaction sees a snapshot of committed data at the time the statement began.
BEGIN TRANSACTION ISOLATION LEVEL READ COMMITTED;
-- Session 1
BEGIN;
UPDATE accounts SET balance = balance - 100 WHERE id = 1;
-- Session 2 (Read Committed)
BEGIN;
SELECT balance FROM accounts WHERE id = 1;
-- Returns OLD value (before Session 1 update)
-- Session 1 commits
COMMIT;
-- Session 2 (same transaction, new statement)
SELECT balance FROM accounts WHERE id = 1;
-- Returns NEW value (after Session 1 update)
-- Each statement gets fresh snapshot!
COMMIT;
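The "fresh snapshot per statement" behavior above can be modeled with a toy versioned store. This is a deliberately simplified sketch; the `Store` class and its fields are illustrative, not PostgreSQL internals:

```python
# Toy MVCC store: each row version records the XID that created it (xmin)
# and the XID that superseded it (xmax, 0 while current).

class Store:
    def __init__(self):
        self.versions = []
        self.committed = set()

    def write(self, xid, value):
        for v in self.versions:
            if v["xmax"] == 0:
                v["xmax"] = xid          # supersede the current version
        self.versions.append({"value": value, "xmin": xid, "xmax": 0})

    def snapshot(self):
        return set(self.committed)       # XIDs visible at snapshot time

    def read(self, snap):
        for v in self.versions:
            if v["xmin"] in snap and (v["xmax"] == 0 or v["xmax"] not in snap):
                return v["value"]

store = Store()
store.write(100, 1000); store.committed.add(100)   # initial balance

rr_snap = store.snapshot()                          # Repeatable Read: one snapshot

store.write(101, 900); store.committed.add(101)     # concurrent update commits

# Read Committed takes a fresh snapshot per statement -> sees the new value;
# Repeatable Read keeps reusing its first snapshot -> still sees the old one.
assert store.read(store.snapshot()) == 900
assert store.read(rr_snap) == 1000
```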
Issues:
- Non-repeatable reads: the same query can return different results within one transaction
- Phantom reads: a repeated query can see rows added by concurrently committed transactions
- Lost updates when read-modify-write logic is used without row locks
Use Cases:
- The default level; a good fit for most OLTP workloads that don't need cross-statement consistency
Transaction sees a snapshot of the database as of the first query in the transaction.
BEGIN TRANSACTION ISOLATION LEVEL REPEATABLE READ;
-- Session 1
BEGIN ISOLATION LEVEL REPEATABLE READ;
SELECT balance FROM accounts WHERE id = 1;
-- Returns: 1000
-- Session 2
BEGIN;
UPDATE accounts SET balance = balance - 100 WHERE id = 1;
COMMIT;
-- Session 1 (same transaction)
SELECT balance FROM accounts WHERE id = 1;
-- Still returns: 1000 (snapshot isolation!)
-- Try to update
UPDATE accounts SET balance = balance + 50 WHERE id = 1;
-- ERROR: could not serialize access due to concurrent update
COMMIT;
Behavior:
- All statements see the snapshot taken at the transaction's first query
- Updating a row that a concurrent transaction changed after the snapshot raises a serialization error, and the transaction must be retried
- Phantom reads are prevented (stronger than the SQL standard requires)
Write Skew Example:
-- Two doctors on call, at least one must be on-call
-- Session 1
BEGIN ISOLATION LEVEL REPEATABLE READ;
SELECT COUNT(*) FROM doctors WHERE on_call = true; -- Returns 2
UPDATE doctors SET on_call = false WHERE id = 1;
-- Session 2 (concurrent)
BEGIN ISOLATION LEVEL REPEATABLE READ;
SELECT COUNT(*) FROM doctors WHERE on_call = true; -- Returns 2
UPDATE doctors SET on_call = false WHERE id = 2;
-- Both commit successfully
-- Result: 0 doctors on call! (write skew anomaly)
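The anomaly can be reproduced in a few lines of Python mimicking snapshot isolation: each session validates the invariant against its own private snapshot, updates a different row, and commits without any write conflict.

```python
# Write skew: each transaction checks the invariant against its own
# snapshot, updates a *different* row, and commits without conflict.

on_call = {1: True, 2: True}        # committed state: doctor_id -> on_call

snap1 = dict(on_call)               # Session 1's snapshot
snap2 = dict(on_call)               # Session 2's snapshot

if sum(snap1.values()) >= 2:        # Session 1: "someone else is on call"
    on_call[1] = False

if sum(snap2.values()) >= 2:        # Session 2: stale snapshot also passes
    on_call[2] = False

# Both commits succeed, yet "at least one doctor on call" is violated
assert sum(on_call.values()) == 0
```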
Use Cases:
- Reports and aggregations that need a consistent view across multiple queries
- Logical backups (pg_dump runs in Repeatable Read)
Strongest isolation level. Transactions execute as if they ran serially, one after another.
BEGIN TRANSACTION ISOLATION LEVEL SERIALIZABLE;
-- Same write skew scenario
-- Session 1
BEGIN ISOLATION LEVEL SERIALIZABLE;
SELECT COUNT(*) FROM doctors WHERE on_call = true; -- Returns 2
UPDATE doctors SET on_call = false WHERE id = 1;
COMMIT; -- Success
-- Session 2
BEGIN ISOLATION LEVEL SERIALIZABLE;
SELECT COUNT(*) FROM doctors WHERE on_call = true; -- Returns 2
UPDATE doctors SET on_call = false WHERE id = 2;
COMMIT; -- ERROR: could not serialize access due to read/write dependencies
Implementation: Uses Serializable Snapshot Isolation (SSI)
Performance Cost:
- Tracks read/write dependencies with predicate locks, adding CPU and memory overhead
- Higher abort rate under contention; applications must implement retry logic
Use Cases:
- Workloads with invariants spanning multiple rows (like the on-call example above), where write skew is unacceptable
| Isolation Level | Dirty Read | Non-Repeatable Read | Phantom Read | Write Skew | Serialization Anomaly |
|---|---|---|---|---|---|
| Read Uncommitted | No* | Yes | Yes | Yes | Yes |
| Read Committed | No | Yes | Yes | Yes | Yes |
| Repeatable Read | No | No | No** | Yes | Yes |
| Serializable | No | No | No | No | No |
*PostgreSQL doesn't support dirty reads
**PostgreSQL prevents phantom reads in Repeatable Read (stronger than the SQL standard)
1. **Lost Updates from Read-Modify-Write**:
```sql
-- CORRECT: Use atomic operations
BEGIN;
UPDATE accounts SET balance = balance + 10 WHERE id = 1;
COMMIT;

-- OR: Use SELECT FOR UPDATE (see SELECT FOR UPDATE section)
BEGIN;
SELECT balance FROM accounts WHERE id = 1 FOR UPDATE; -- Locks row
-- Calculate new balance
UPDATE accounts SET balance = 110 WHERE id = 1;
COMMIT;
```
**See also**: [SELECT FOR UPDATE](#select-for-update) for row-level locking strategies
2. **Forgetting Retry Logic for Repeatable Read/Serializable**:
```python
# WRONG: No retry logic
def transfer(from_id, to_id, amount):
    conn.execute("BEGIN ISOLATION LEVEL SERIALIZABLE")
    # ... transfer logic ...
    conn.execute("COMMIT")  # May raise SerializationError!

# CORRECT: With exponential backoff
def transfer_with_retry(from_id, to_id, amount, max_attempts=3):
    for attempt in range(max_attempts):
        try:
            conn.execute("BEGIN ISOLATION LEVEL SERIALIZABLE")
            # ... transfer logic ...
            conn.execute("COMMIT")
            return  # Success
        except SerializationError:
            if attempt == max_attempts - 1:
                raise
            time.sleep(0.1 * (2 ** attempt))
```
3. **Holding Transactions Open During Long Reads**:
```sql
-- GOOD: Keep transactions short
SELECT * FROM large_table; -- Outside transaction
BEGIN;
UPDATE small_table SET status = 'done';
COMMIT;
```
4. **Deadlocks from Inconsistent Lock Ordering**:
```sql
-- Transaction 1
BEGIN;
UPDATE accounts SET balance = balance - 100 WHERE id = 1;
UPDATE accounts SET balance = balance + 100 WHERE id = 2;
COMMIT;
-- Transaction 2 (concurrent) - DEADLOCK RISK!
BEGIN;
UPDATE accounts SET balance = balance - 50 WHERE id = 2; -- Different order!
UPDATE accounts SET balance = balance + 50 WHERE id = 1;
COMMIT;
-- SOLUTION: Always lock in same order
BEGIN;
SELECT balance FROM accounts WHERE id IN (1, 2) ORDER BY id FOR UPDATE;
-- Now safe to update in any order
COMMIT;
5. **Enforcing Invariants Only in Application Code**:
```sql
-- SOLUTION: Enforce constraints in the database
ALTER TABLE accounts ADD CONSTRAINT positive_balance CHECK (balance >= 0);
```
6. **Mixing Isolation Levels Without Understanding**:
```sql
-- Transaction 1: Read Committed
BEGIN;
UPDATE inventory SET quantity = quantity - 5 WHERE product_id = 1;
-- Transaction 2: Serializable
BEGIN ISOLATION LEVEL SERIALIZABLE;
SELECT SUM(quantity) FROM inventory; -- May see inconsistent state
COMMIT;
-- No error, but data might be inconsistent
-- SOLUTION: Use same isolation level for related operations
-- Most applications work fine with default
BEGIN; -- Implicitly READ COMMITTED
```
1. **Implement Retry Logic for Serialization Failures**:
```python
def execute_with_retry(func, max_attempts=3):
    for attempt in range(max_attempts):
        try:
            return func()
        except SerializationError:
            if attempt == max_attempts - 1:
                raise
            time.sleep(0.1 * (2 ** attempt))  # Exponential backoff
```
2. **Use Repeatable Read for Consistent Reports**:
```sql
BEGIN ISOLATION LEVEL REPEATABLE READ;
```
3. **Use Serializable for Financial Transfers with Complex Logic**:
```sql
BEGIN ISOLATION LEVEL SERIALIZABLE;
```
4. **Minimize Transaction Duration**:
```sql
-- Bad: Long transaction holds snapshot
BEGIN ISOLATION LEVEL REPEATABLE READ;
SELECT * FROM large_table; -- Takes 10 minutes
UPDATE small_table SET status = 'done';
COMMIT;
-- Good: Separate concerns
BEGIN;
UPDATE small_table SET status = 'done';
COMMIT;
-- Run long query outside transaction
SELECT * FROM large_table;
```
-- Monitor commits, rollbacks, and deadlocks per database
SELECT datname,
xact_commit,
xact_rollback,
deadlocks,
blk_read_time + blk_write_time as io_time
FROM pg_stat_database
WHERE datname = current_database();
SELECT FOR UPDATE locks rows returned by a SELECT query, preventing other transactions from modifying or locking them until the transaction completes. This provides an alternative to higher isolation levels for preventing lost updates and race conditions.
When to use: Row-level locking is often more efficient than Serializable isolation when you only need to protect specific rows, not enforce complex business rules across multiple queries.
SELECT * FROM table_name
WHERE condition
FOR UPDATE;
PostgreSQL provides four row-level locking modes:
Strongest lock: Prevents other transactions from UPDATE, DELETE, SELECT FOR UPDATE, or SELECT FOR SHARE.
-- Session 1
BEGIN;
SELECT * FROM accounts WHERE id = 1 FOR UPDATE;
-- Acquires exclusive lock on row
-- Session 2
BEGIN;
UPDATE accounts SET balance = balance + 100 WHERE id = 1;
-- BLOCKS until Session 1 commits or rolls back
SELECT * FROM accounts WHERE id = 1 FOR UPDATE;
-- BLOCKS
SELECT * FROM accounts WHERE id = 1 FOR SHARE;
-- BLOCKS
SELECT * FROM accounts WHERE id = 1; -- Without FOR...
-- SUCCEEDS (reads are not blocked by MVCC)
Use Case: When you need to read and then update rows
-- Withdraw money safely
BEGIN;
SELECT balance FROM accounts WHERE id = 1 FOR UPDATE;
-- Check if balance >= withdrawal amount
UPDATE accounts SET balance = balance - 100 WHERE id = 1;
COMMIT;
Weaker lock: Like FOR UPDATE but allows SELECT FOR KEY SHARE locks (for foreign key checks).
-- Session 1
BEGIN;
SELECT * FROM users WHERE id = 1 FOR NO KEY UPDATE;
-- Session 2
BEGIN;
-- This succeeds (foreign key check)
INSERT INTO orders (user_id, amount) VALUES (1, 100);
-- This blocks (actual update)
UPDATE users SET name = 'Bob' WHERE id = 1;
Use Case: When you want to lock a row for update but still allow foreign key references
BEGIN;
SELECT * FROM users WHERE id = 1 FOR NO KEY UPDATE;
UPDATE users SET last_login = now() WHERE id = 1;
COMMIT;
Shared lock: Multiple transactions can hold FOR SHARE locks simultaneously. Prevents UPDATE/DELETE but allows other FOR SHARE locks.
-- Session 1
BEGIN;
SELECT * FROM products WHERE id = 1 FOR SHARE;
-- Session 2
BEGIN;
SELECT * FROM products WHERE id = 1 FOR SHARE;
-- SUCCEEDS (multiple shared locks allowed)
UPDATE products SET stock = stock - 1 WHERE id = 1;
-- BLOCKS (can't update while shared lock exists)
Use Case: When multiple transactions need to ensure a row doesn’t change but don’t plan to modify it
-- Order processing: ensure product isn't deleted while processing
BEGIN;
SELECT * FROM products WHERE id = 1 FOR SHARE;
-- Process order...
INSERT INTO order_items (order_id, product_id, quantity) VALUES (123, 1, 5);
COMMIT;
Weakest lock: Allows FOR NO KEY UPDATE but blocks FOR UPDATE and DELETE.
-- Session 1
BEGIN;
SELECT * FROM users WHERE id = 1 FOR KEY SHARE;
-- Session 2
BEGIN;
-- Succeeds (no key update)
UPDATE users SET last_login = now() WHERE id = 1;
-- Blocks (key update)
UPDATE users SET id = 2 WHERE id = 1;
-- Blocks (delete)
DELETE FROM users WHERE id = 1;
Use Case: Automatically used by PostgreSQL for foreign key checks
Returns an error immediately if the lock cannot be acquired.
-- Session 1
BEGIN;
SELECT * FROM accounts WHERE id = 1 FOR UPDATE;
-- Session 2
BEGIN;
SELECT * FROM accounts WHERE id = 1 FOR UPDATE NOWAIT;
-- ERROR: could not obtain lock on row in relation "accounts"
-- Returns immediately instead of waiting
Use Case: Avoid blocking in user-facing applications
try:
cursor.execute("""
SELECT * FROM queue
WHERE status = 'pending'
LIMIT 1
FOR UPDATE NOWAIT
""")
row = cursor.fetchone()
if row:
process(row)
except LockNotAvailable:
# Try another queue item or return
pass
Skip rows that are already locked by other transactions.
-- Session 1
BEGIN;
SELECT * FROM queue WHERE id IN (1, 2, 3) FOR UPDATE;
-- Locks rows 1, 2, 3
-- Session 2
BEGIN;
SELECT * FROM queue WHERE id IN (1, 2, 3, 4, 5)
FOR UPDATE SKIP LOCKED;
-- Returns only rows 4, 5 (skips locked rows 1, 2, 3)
Use Case: Implementing job queues with multiple workers
-- Worker process
BEGIN;
SELECT * FROM jobs
WHERE status = 'pending'
ORDER BY created_at
LIMIT 10
FOR UPDATE SKIP LOCKED;
-- Process jobs...
UPDATE jobs SET status = 'completed' WHERE id = ANY($1);
COMMIT;
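The work-distribution effect of SKIP LOCKED can be sketched in plain Python (a toy model, not a database client): a worker walks the queue and skips rows another worker already holds instead of blocking on them.

```python
# Each worker claims up to 'limit' unclaimed jobs, skipping locked ones.

def claim_jobs(pending, locked, limit):
    claimed = []
    for job_id in pending:
        if job_id in locked:     # held by another worker
            continue             # SKIP LOCKED: move on instead of blocking
        locked.add(job_id)
        claimed.append(job_id)
        if len(claimed) == limit:
            break
    return claimed

locked = set()
worker_a = claim_jobs([1, 2, 3, 4, 5], locked, 3)
worker_b = claim_jobs([1, 2, 3, 4, 5], locked, 3)

assert worker_a == [1, 2, 3]     # first worker grabs the head of the queue
assert worker_b == [4, 5]        # second worker skips locked rows, no waiting
```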
Lock only rows from specified tables in a join.
SELECT u.*, o.*
FROM users u
JOIN orders o ON u.id = o.user_id
WHERE u.id = 1
FOR UPDATE OF u;
-- Locks only the users row, not the orders row
| Held ↓ / Requested → | FOR KEY SHARE | FOR SHARE | FOR NO KEY UPDATE | FOR UPDATE |
|---|---|---|---|---|
| FOR KEY SHARE | ✓ | ✓ | ✓ | ✗ |
| FOR SHARE | ✓ | ✓ | ✗ | ✗ |
| FOR NO KEY UPDATE | ✓ | ✗ | ✗ | ✗ |
| FOR UPDATE | ✗ | ✗ | ✗ | ✗ |
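The matrix can be encoded as a small conflict table; the sketch below mirrors the rows above (mode names are PostgreSQL's, the data structure is illustrative):

```python
# Which requested lock modes conflict with a lock already held on the row.

CONFLICTS = {
    "FOR KEY SHARE":     {"FOR UPDATE"},
    "FOR SHARE":         {"FOR NO KEY UPDATE", "FOR UPDATE"},
    "FOR NO KEY UPDATE": {"FOR SHARE", "FOR NO KEY UPDATE", "FOR UPDATE"},
    "FOR UPDATE":        {"FOR KEY SHARE", "FOR SHARE",
                          "FOR NO KEY UPDATE", "FOR UPDATE"},
}

def compatible(held, requested):
    return requested not in CONFLICTS[held]

assert compatible("FOR KEY SHARE", "FOR NO KEY UPDATE")   # FK checks coexist
assert not compatible("FOR SHARE", "FOR NO KEY UPDATE")
assert not compatible("FOR UPDATE", "FOR KEY SHARE")      # strongest blocks all
```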
BEGIN;
-- Lock both accounts in consistent order to prevent deadlocks
SELECT balance FROM accounts
WHERE id IN (1, 2)
ORDER BY id
FOR UPDATE;
-- Perform transfer
UPDATE accounts SET balance = balance - 100 WHERE id = 1;
UPDATE accounts SET balance = balance + 100 WHERE id = 2;
COMMIT;
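The same ordering discipline applies to any locking scheme. A minimal in-process analogy using Python threads (illustrative; `threading.Lock` stands in for the row lock, and sorted acquisition plays the role of `ORDER BY id FOR UPDATE`):

```python
import threading

# Deadlock avoidance: always acquire "row" locks in ascending id order.

balances = {1: 1000, 2: 1000}
row_locks = {1: threading.Lock(), 2: threading.Lock()}

def transfer(src, dst, amount):
    first, second = sorted((src, dst))       # consistent global order
    with row_locks[first], row_locks[second]:
        balances[src] -= amount
        balances[dst] += amount

# Opposite-direction transfers could deadlock with naive ordering;
# with sorted acquisition both complete.
t1 = threading.Thread(target=transfer, args=(1, 2, 100))
t2 = threading.Thread(target=transfer, args=(2, 1, 50))
t1.start(); t2.start(); t1.join(); t2.join()

assert balances == {1: 950, 2: 1050}
assert sum(balances.values()) == 2000        # money conserved
```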
BEGIN;
-- Check and reserve inventory
SELECT stock FROM products
WHERE id = 123
FOR UPDATE;
-- If stock is sufficient
UPDATE products
SET stock = stock - 5
WHERE id = 123 AND stock >= 5;
-- Check if update succeeded (IF/RAISE are PL/pgSQL; in application code,
-- check the UPDATE's affected row count instead)
IF NOT FOUND THEN
    RAISE EXCEPTION 'Insufficient stock';
END IF;
COMMIT;
-- Worker loop (PL/pgSQL-style pseudocode; assumes the selected row is
-- available as "job")
LOOP
BEGIN;
-- Get next available job
SELECT * FROM jobs
WHERE status = 'pending'
ORDER BY priority DESC, created_at
LIMIT 1
FOR UPDATE SKIP LOCKED;
-- If no job found, wait and retry
IF NOT FOUND THEN
COMMIT;
PERFORM pg_sleep(1);
CONTINUE;
END IF;
-- Mark as processing
UPDATE jobs SET status = 'processing', started_at = now()
WHERE id = job.id;
COMMIT;
-- Process job (outside transaction)
PERFORM process_job(job);
-- Mark as complete
UPDATE jobs SET status = 'completed', completed_at = now()
WHERE id = job.id;
END LOOP;
-- Instead of SELECT FOR UPDATE, use version field
BEGIN;
SELECT id, balance, version FROM accounts WHERE id = 1;
-- Update with version check
UPDATE accounts
SET balance = balance - 100,
version = version + 1
WHERE id = 1 AND version = $old_version;
-- Check if update succeeded (PL/pgSQL-style; in application code,
-- check the affected row count)
IF NOT FOUND THEN
RAISE EXCEPTION 'Concurrent modification detected';
END IF;
COMMIT;
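The version-check pattern is easy to mirror in application code. A hedged sketch (a dictionary stands in for the row; names are illustrative):

```python
# Optimistic concurrency control: the write succeeds only if the version
# read earlier is still current (compare-and-swap on the version column).

def optimistic_update(row, read_version, new_balance):
    if row["version"] != read_version:
        return False                  # concurrent modification detected
    row["balance"] = new_balance
    row["version"] += 1
    return True

account = {"id": 1, "balance": 1000, "version": 7}
v = account["version"]                # two clients both read version 7

assert optimistic_update(account, v, 900)       # first writer wins
assert not optimistic_update(account, v, 950)   # second must re-read and retry
assert account == {"id": 1, "balance": 900, "version": 8}
```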
1. **Always Lock Rows in a Consistent Order**:
```sql
-- Good: Always same order
BEGIN;
SELECT * FROM accounts WHERE id IN (1, 2) ORDER BY id FOR UPDATE;
-- Operations...
COMMIT;
```
2. **Keep Locked Transactions Short**:
```sql
-- Bad: Lock held during external API call
BEGIN;
SELECT * FROM orders WHERE id = 1 FOR UPDATE;
-- Call external payment API (takes 3 seconds)
UPDATE orders SET status = 'paid' WHERE id = 1;
COMMIT;
-- Good: Lock only for database operations
BEGIN;
SELECT * FROM orders WHERE id = 1 FOR UPDATE;
UPDATE orders SET status = 'processing' WHERE id = 1;
COMMIT;
-- Call external API
UPDATE orders SET status = 'paid' WHERE id = 1;
```
3. **Fail Fast with NOWAIT in User-Facing Flows**:
```sql
BEGIN;
SELECT * FROM seats WHERE id = $seat_id FOR UPDATE NOWAIT;
UPDATE seats SET reserved_by = $user_id WHERE id = $seat_id;
COMMIT;

-- Handle lock error gracefully
EXCEPTION WHEN lock_not_available THEN
    RETURN 'Seat is being reserved by another user';
```
4. **Use SKIP LOCKED for Job Queues**:
```sql
-- Multiple workers can process queue concurrently
SELECT * FROM tasks
WHERE status = 'pending'
ORDER BY priority DESC
LIMIT 10
FOR UPDATE SKIP LOCKED;
```
5. **Use the Weakest Lock That Suffices**:
```sql
-- Better: FOR SHARE if you're not updating
SELECT * FROM users WHERE id = 1 FOR SHARE;

-- Best: No lock if you're just reading
SELECT * FROM users WHERE id = 1;
```
6. **Monitor Lock Waits**:
```sql
-- See blocked queries
SELECT blocked_locks.pid AS blocked_pid,
blocked_activity.usename AS blocked_user,
blocking_locks.pid AS blocking_pid,
blocking_activity.usename AS blocking_user,
blocked_activity.query AS blocked_statement,
blocking_activity.query AS blocking_statement
FROM pg_catalog.pg_locks blocked_locks
JOIN pg_catalog.pg_stat_activity blocked_activity ON blocked_activity.pid = blocked_locks.pid
JOIN pg_catalog.pg_locks blocking_locks
ON blocking_locks.locktype = blocked_locks.locktype
AND blocking_locks.database IS NOT DISTINCT FROM blocked_locks.database
AND blocking_locks.relation IS NOT DISTINCT FROM blocked_locks.relation
AND blocking_locks.pid != blocked_locks.pid
JOIN pg_catalog.pg_stat_activity blocking_activity ON blocking_activity.pid = blocking_locks.pid
WHERE NOT blocked_locks.granted;
```