In addition to Weibo, there is also WeChat
Please pay attention
WeChat public account
Shulou
2025-01-29 Update From: SLTechnology News&Howtos shulou NAV: SLTechnology News&Howtos > Internet Technology >
Share
Shulou(Shulou.com)06/03 Report--
The following to understand to help MySQL achieve Oracl advanced analysis function method, I believe that everyone will benefit a lot after reading, the text in the essence is not much, hope to help MySQL achieve Oracl advanced analysis function method this short content is what you want.
Oracle supports some unique syntax and functions, which is more or less troublesome to programmers when porting to MySQL. Here are some examples of special uses of Oracle and how to use aggregators to accomplish the same function. Of course, these methods are not limited to MySQL, but are also supported for all other databases.
1. Recursive statement
A) select employee_id,first_name,last_name,manager_id
From hr.employees
Start with employee_id=102
Connect by prior employee_id = manager_id
A
one
= connect ("orcl")
two
= A1.query@x ("select employee_id, first_name, last_name, manager_id from hr.employees")
three
= A2.keys (EMPLOYEE_ID)
four
= A2.select@1 (EMPLOYEE_ID==102)
five
= A2.switch (MANAGER_ID, A2)
six
= A2.nodes (MANAGER_ID, A4)
seven
= (A4 | A6) .new (EMPLOYEE_ID, FIRST_NAME, LAST_NAME, MANAGER_ID.EMPLOYEE_ID:MANAGER_ID)
(1) A3 sets the key of sequence table A2
(2) A4 selection of starting employees
(3) A5 converts the MANAGER_ID value in A2 into a record for recursion
(4) A6 acquires all child nodes of the initial employee
B) select employee_id, first_name,last_name,manager_id
From hr.employees
Start with employee_id=104
Connect by prior manager_id = employee_id
A
one
= connect ("orcl")
two
= A1.query@x ("select employee_id, first_name, last_name, manager_id from hr.employees")
three
= A2.keys (EMPLOYEE_ID)
four
= A2.switch (MANAGER_ID, A2)
five
= A2.select@1 (EMPLOYEE_ID==104)
six
= A5.prior (MANAGER_ID)
seven
= A6.new (EMPLOYEE_ID, FIRST_NAME, LAST_NAME, MANAGER_ID.EMPLOYEE_ID:MANAGER_ID)
(1) A6 acquires all parent nodes of the initial employee
C) select employee_id,last_name,manager_id,sys_connect_by_path (last_name,'/') path from hr.employees
Start with employee_id=102
Connect by prior employee_id = manager_id
A
one
= connect ("orcl")
two
= A1.query@x ("select employee_id, last_name, manager_id,null path from hr.employees")
three
= A2.keys (EMPLOYEE_ID)
four
= A2.select@1 (EMPLOYEE_ID==102)
five
= A2.switch (MANAGER_ID, A2)
six
= A2.nodes (MANAGER_ID, A4)
seven
= A4 | A6
eight
= A7.run (PATH=if (EMPLOYEE_ID==102, "/" + LAST_NAME, MANAGER_ID.PATH+ "/" + LAST_NAME))
nine
= A7.new (EMPLOYEE_ID, LAST_NAME, MANAGER_ID.EMPLOYEE_ID:MANAGER_ID, PATH)
(1) since the parent node of each record in A7 is in front of this node, A8 can modify the PATH value of each record in A7 in turn.
2. Nested aggregation function
Select avg (max (salary)) avg_max, avg (min (salary)) avg_min
From hr.employees
Group by department_id
A
one
= connect ("orcl")
two
= A1.query@x ("select * from hr.employees")
three
= A2.groups (DEPARTMENT_ID;max (SALARY): M1, min (SALARY): m2)
four
= A3.group (; ~ .avg (M1): avg_max,~.avg (m2): avg_min)
(1) A1.query in A2 can also be changed to A1.cursor.
3. Aggregation analysis functions FIRST and LAST
SELECT department_id
MIN (salary) KEEP (DENSE_RANK FIRST ORDER BY commission_pct) worst
MAX (salary) KEEP (DENSE_RANK LAST ORDER BY commission_pct) best
FROM hr.employees
GROUP BY department_id
ORDER BY department_id
A
one
= connect ("orcl")
two
= A1.query@x ("select * from hr.employees order by department_id,commission_pct")
three
= A2.group@o (DEPARTMENT_ID)
four
= A3.new (DEPARTMENT_ID,~.minp@a (ifn (COMMISSION_PCT,2)) .min (SALARY): worst, ~ .maxp @ a (ifn (COMMISSION_PCT,2)) .max (SALARY): best)
five
= A4.sort (ifn (DEPARTMENT_ID,power (21,332)
(1) if A2 is sorted by DEPARTMENT_ID, then A3 can be grouped with group@o.
(2) FIRST/LAST takes the first / last group sorted, while null ranks last when Oracle is sorted, so the last group that LAST will get is the group in which the null value is located. Null is excluded when maxp/minp calculates all rows with maximum / minimum values, so it is maximum when A4 uses ifn (COMMISSION_PCT,2) to guarantee null values.
(3) in A5, DEPARTMENT_ID=null uses power larger than all DEPARTMENT_ID (2pj32) to ensure that this line is at the end of the row.
If the amount of data is large, you can also use cursors.
A
one
= connect ("orcl")
two
= A1.cursor@x ("select * from hr.employees")
three
= A2.groups (DEPARTMENT_ID; min ([if (COMMISSION_PCT,2), SALARY]): M1, max ([if (COMMISSION_PCT,2), SALARY]): m2)
four
= A3.new (DEPARTMENT_ID, M1 (2): worst, m2 (2): best)
five
= A4.sort (ifn (DEPARTMENT_ID,power (21,332)
(1) in A3, min ([if (COMMISSION_PCT,2), SALARY]) calculates the minimum SALARY value of COMMISSION_PCT, that is, the minimum value of SALARY when COMMISSION_PCT ranks first, and max is similar.
4. Proportion function ratio_to_report
A) SELECT last_name, salary, RATIO_TO_REPORT (salary) OVER () AS rr
FROM hr.employees
WHERE job_id = 'PU_CLERK'
ORDER BY last_name
A
one
= connect ("orcl")
two
= A1.query@x ("select last_name,salary from hr.employees where job_id='PU_CLERK'order by last_name")
three
= A2.sum (SALARY)
four
= A2.new (LAST_NAME, SALARY, SALARY/A3:RR)
B) SELECT department_id,last_name, salary, RATIO_TO_REPORT (salary) OVER (partition by department_id) AS rr
FROM hr.employees
WHERE department_id in (20pl 60)
ORDER BY department_id,last_name
A
one
= connect ("orcl")
two
= A1.query@x ("select department_id,last_name,salary from hr.employees where department_id in (20je 60) order by department_id,last_name")
three
= A2.groups@o (DEPARTMENT_ID;sum (SALARY): sum)
four
= A2.switch (DEPARTMENT_ID, A3)
five
= A2.new (DEPARTMENT_ID.DEPARTMENT_ID:DEPARTMENT_ID, LAST_NAME, SALARY, SALARY/DEPARTMENT_ID.sum:RR)
(1) if A2 is sorted by DEPARTMENT_ID, then A3 can be grouped and aggregated with groups@o.
5. Multiple grouping
SELECT department_id, job_id, sum (salary) total
FROM hr.employees
WHERE department_id in (30,50)
GROUP BY grouping sets (department_id, job_id), department_id)
A
one
= connect ("orcl")
two
= A1.query@x ("select department_id, job_id, salary from hr.employees where department_Id in (30pm 50) order by department_id, job_id")
three
= A2.groups@o (DEPARTMENT_ID, JOB_ID; sum (SALARY): TOTAL)
four
= A3.group@o (DEPARTMENT_ID, null:JOB_ID; ~ .sum (TOTAL): TOTAL)
five
= [A3magA4] .merge (DEPARTMENT_ID, ifn (JOB_ID,fill ("z", 10)
(1) because A3 and A4 are ordered to DEPARTMENT_ID, A5 can be merge,ifn (JOB_ID,fill ("z", 10)) to ensure that JOB_ID is null.
You can also use cursors.
A
one
= connect ("orcl")
two
= A1.cursor@x ("select department_id,job_id,sum (salary) total from hr.employees where department_id in (30pm 50) group by department_id,job_id order by department_id,job_id")
three
= A2.group (DEPARTMENT_ID)
four
= A3. (~ .insert (0, ~ .groups @ o (DEPARTMENT_ID, null:JOB_ID;sum (TOTAL): TOTAL)
five
= A4.fetch ()
six
= A5.conj ()
(1) A2.group in A3 requires A2 to order DEPARTMENT_ID.
(2) A4 sums each group of A3 and inserts the result at the end of the group.
It can also be piped.
A
one
= connect ("orcl")
two
= A1.cursor@x ("select department_id, job_id, salary from hr.employees where department_Id in (30pm 50) order by department_id, job_id")
three
= channel () .group@o (DEPARTMENT_ID, JOB_ID; sum (SALARY): TOTAL)
four
> A2.push (A3)
five
= channel () .groups@o (DEPARTMENT_ID, null:JOB_ID; sum (TOTAL): TOTAL)
six
> A3.push (A5)
seven
= A3.fetch ()
eight
For A2,1000
nine
= A3.result () | A5.result ()
ten
= A9.sort (DEPARTMENT_ID)
(1) A3 creates pipes and appends grouping summation
(2) A4 pushes the data in A2 to A3. Note that this action is performed only if the data is actually fetched in A2.
(3) A5 creates pipes and appends grouping summation
(4) A6 pushes A3 results to A5, and the data in A2 can also be pushed to A5 directly, but it will increase the time complexity.
(5) A7 retains A3 data
(6) read A2 in a loop, taking only 1000 entries at a time to reduce memory footprint
(7) A10 sorts the data in A3 and A5. Because the algorithm is stable, the data whose JOB_ID is null ranks last.
After reading this article on how to help MySQL implement Oracl advanced analysis functions, many readers will want to know more about it. If you need more industry information, you can follow our industry information section.
Welcome to subscribe "Shulou Technology Information " to get latest news, interesting things and hot topics in the IT industry, and controls the hottest and latest Internet news, technology news and IT industry trends.
Views: 0
*The comments in the above article only represent the author's personal views and do not represent the views and positions of this website. If you have more insights, please feel free to contribute and share.
Continue with the installation of the previous hadoop.First, install zookooper1. Decompress zookoope
"Every 5-10 years, there's a rare product, a really special, very unusual product that's the most un
© 2024 shulou.com SLNews company. All rights reserved.