In addition to Weibo, there is also WeChat
Please pay attention
WeChat public account
Shulou
2025-02-27 Update From: SLTechnology News&Howtos shulou NAV: SLTechnology News&Howtos > Servers >
Share
Shulou(Shulou.com)06/03 Report--
First, code risk control and online service configuration security control.
1. Make a reasonable audit of the code online through our AOS system, and increase control from all aspects of research and development, testing, product, director, operation and maintenance, so as to achieve code security.
2. Through our puppet control, online files or system configuration need to be modified, which need to be reviewed by relevant personnel in order to increase online security.
3. Through our puppet control, software needs to be installed online, which needs to be audited by relevant personnel in order to increase online security.
Second, find problems
1. Collect various metrics on the server through zabbix, such as system load, business downtime, whether the business status is good or not, and report to the police by SMS and email. (the first method of alarm)
2. Show whether the state of each business is good, whether the program is down or not, and the system load is normal through grafana+ influxdb, and call the police through the 24-hour monitoring of the NOC group. (the second method of alarm)
3. Collect log information through kibana+spark+es, and show problematic interfaces and slow interfaces through log filtering and filtering. For example, the error of 5XX occurs within 5 minutes, and the url of top10. Call the police through the 24-hour personnel monitoring of the noc group. (the third method of alarm)
4. Through our smokeping network monitoring, we can detect the network connection of each computer room used by the company. Be able to determine whether network problems have an impact on the business.
III. Analysis of problems
1. Collect log information through kibana+es, find out slow interface and relevance through log filtering and filtering, and find out possible problems through a large amount of data, and analyze them.
2. Make a reasonable business architecture scheme through the large amount of log information of kibana+es and the control of the overall business architecture. Make the business more reasonable and superior.
Fourth, deal with problems
1. After receiving text messages and phone calls, find out the specific matters of the problem through grafana+ influxdb, and quickly find the interface of the problem and the root cause of the problem through kibana+es.
2. After receiving the alarm, quickly find the root cause of the problem through grafana+ influxdb, kibana+spark+es, smokeping and kibana+es.
3. Determine whether there is a problem with dependent resources through grafana+ influxdb observation.
V. summing up the problems afterwards
1. Make a disaster recovery and emergency plan, and if there are problems, you can restore the business at the first moment and ensure the stable operation of the business.
2. Analyze and improve the problems each time. So that the same type of problem won't happen again next time.
VI. Automation of operation and maintenance
1. Through the automatic configuration of our puppet, we can reduce manual operation, avoid misoperation and increase the management and control of personnel, thus increasing the security of the online server.
2. Through our cmdb, we can quickly query the server hardware configuration, domain name ownership, server administrator and so on.
3. Through our rt transaction tracking management, we can quickly locate the important operation information that has been carried out on the server.
4. Through our sip system, you can view all the servers and domain names under the current business that the administrator is responsible for to facilitate batch authorization of users.
5, through our AOS code online, reduce the operation of personnel to avoid misoperation of personnel.
6. Through our docker platform, we can make better use of server hardware resources and reduce product cost.
Welcome to subscribe "Shulou Technology Information " to get latest news, interesting things and hot topics in the IT industry, and controls the hottest and latest Internet news, technology news and IT industry trends.
Views: 0
*The comments in the above article only represent the author's personal views and do not represent the views and positions of this website. If you have more insights, please feel free to contribute and share.
Continue with the installation of the previous hadoop.First, install zookooper1. Decompress zookoope
"Every 5-10 years, there's a rare product, a really special, very unusual product that's the most un
© 2024 shulou.com SLNews company. All rights reserved.