In addition to Weibo, there is also WeChat
Please pay attention
WeChat public account
Shulou
2025-03-30 Update From: SLTechnology News&Howtos shulou NAV: SLTechnology News&Howtos > Servers >
Share
Shulou(Shulou.com)05/31 Report--
In this article, the editor introduces in detail "what are the main work contents of linux operation and maintenance", the content is detailed, the steps are clear, and the details are handled properly. I hope this article "what are the main work contents of linux operation and maintenance" can help you solve your doubts? let's follow the editor's ideas to learn new knowledge.
Linux operation and maintenance work: 1, service monitoring; 2, service fault management; 3, service capacity management; 4, service performance optimization; 5, service global traffic scheduling; 6, service task scheduling; 7, service security; 8, service automatic release and deployment; 9, service cluster management; 10, database management, and so on.
The operating environment of this tutorial: linux5.9.8 system, Dell G3 computer.
The main work of Linux operation and maintenance
Linux operation and maintenance as a position with the largest number of people and the highest salary among many jobs, this paper focuses on the occupation of Linux operation and maintenance, which is jointly written by Ma GE Education and enthusiasts, an institution specializing in the study and career development of Linux operation and maintenance.
Internet Linux operation and maintenance work, with service as the center, with stability, security and efficiency as the three basic points, ensures that the company's Internet business can provide high-quality services for users in 7 × 24 hours. The responsibility of operation and maintenance covers the life cycle of the product from design to release, operation and maintenance, change and upgrade, and offline.
The responsibilities of operation and maintenance are important and extensive throughout the product life cycle, but the responsibilities of operation and maintenance engineers are not limited to this part of the work, but also need to summarize the problems encountered in the work. Extract relevant technical directions, research and development-related tools and platforms to support / optimize business development and improve the efficiency of operation and maintenance. Related technical work mainly includes:
Service monitoring technology: including the research and development and application of monitoring platform, the guarantee of service monitoring accuracy, real-time and comprehensiveness
Service fault management: including service fault plan design, automatic execution, fault summary and feedback to the product / system design level for optimization to improve product stability.
Service capacity management: measuring the capacity of services, planning the construction of computer rooms for services, capacity expansion, migration, etc.
Service performance optimization: improve service performance and response speed and improve user experience from all directions, including network optimization, operating system optimization, application optimization, client optimization, etc.
Service global traffic scheduling: the traffic of access services, which is allocated among computer rooms according to capacity and service status.
Service task scheduling: scheduling trigger and status monitoring of various scheduled / non-scheduled tasks of the service
Service security: including service access security, anti-attack, access control, etc.
Data transmission technology: including the development and application of P2P and other transmission technologies, as well as the solution of long-distance big data transmission and other problems
Automatic release and deployment of services: research and development of deployment platforms / tools, and the use of platforms / tools to achieve secure and efficient release services
Service cluster management: including service server management, large-scale cluster management, etc.
Service cost optimization: reduce the resources used by service operation as much as possible, and reduce the service operation cost.
Database management (DBA): by designing, developing, and managing high-performance database clusters, database services are made more stable, more efficient, and easier to manage.
Platform development: development and management of docker-like platforms, and service access technology
Development, Optimization and access of distributed Storage platform
And so on, all the work related to service quality, efficiency, cost, security and so on, as well as the technology, components, tools and platforms involved are in the technical category of operation and maintenance. Doing a good job in each technical direction and completing the research and development of the corresponding components, tools and platforms can play a positive role in fulfilling the responsibilities of operation and maintenance and have a key impact on the development of the business.
Classification of Linux operation and maintenance work
Operation and maintenance work in more directions, with the continuous development of the scale of business, the more mature Internet companies, the more detailed the division of operation and maintenance positions. At present, many large Internet companies only have system operation and maintenance in the start-up period, and their work is gradually subdivided with the requirements of model and service quality. In general, the classification of work and responsibilities of the operation and maintenance team (see figure 1-1) are as follows.
2.1-Application Operation and maintenance (SRE): application Operation and maintenance is responsible for online service changes, service status monitoring, service disaster recovery and data backup, as well as routine troubleshooting and emergency handling of services. The responsibilities are as follows: design review, service management, resource management, routine inspection, pre-plan management, data backup.
2.2-system Operation and maintenance (SYS): responsible for the construction of IDC, network, CDN and basic services (LVS, NTP, DNS); responsible for asset management, server selection, delivery and maintenance, responsibilities are as follows: IDC data center construction, network construction, LVS load balance and SNAT construction, CDN planning and construction, server selection, delivery and maintenance, kernel selection and OS related maintenance, asset management, basic service construction.
2.3-database operation and maintenance (DBA): database operation and maintenance is responsible for data storage scheme design, database table design, index design and SQL optimization, database change, monitoring, backup, high availability design, etc. The detailed work is as follows: design review, capacity planning, data backup and disaster preparedness, database monitoring, database security, database high availability and performance optimization, automation system construction, operation and maintenance research and development, operation and maintenance platform, monitoring system, automatic deployment system.
2.4-Operation and maintenance Security (SEC): operation and maintenance Security is responsible for network, system and business security reinforcement, routine security scanning, penetration testing, security tools and system research and development, and emergency handling of security incidents, the work is as follows: safety system establishment, security training, risk assessment, security construction, safety compliance, emergency response.
Daily use of software and skills by Linux operation and maintenance staff
The operation and maintenance platforms and tools used by operation and maintenance engineers include:
Web servers: apache, tomcat, nginx, lighttpd
Monitoring: nagios, ganglia, cacti, zabbix
Automatic deployment: ansible, sshpt, salt
Configuration management: puppet, cfengine
Load balancing: lvs, haproxy, nginx
Transmission tools: scribe, flume
Backup tools: rsync, wget
Database: mysql, oracle, sqlserver
Distributed platforms: hdfs, mapreduce, spark, storm, hive
Distributed databases: hbase, cassandra, redis, MongoDB
Containers: lxc, docker
Virtualization: openstack, xen, kvm
Security: kerberos, selinux, acl, iptables
Problem tracking: netstat, top, tcpdump, last
Operation and maintenance is based on technology and provides higher quality service through technical guarantee products. The responsibilities of operation and maintenance work and their position in the business determine that operation and maintenance engineers need to have more extensive knowledge and in-depth technical capabilities:
Solid basic computer knowledge, including computer system architecture, operating system, network technology, etc.
General applications need to understand the operating system, network, security, storage, CDN,DB, etc., and know its related principles.
Programming ability, from the development of operation and maintenance tools to the development of large-scale operation and maintenance system / platform, requires good programming ability.
Data analysis ability: be able to sort out and analyze the data of the system, find problems and find solutions.
Rich system knowledge, including system tools, typical system architecture, common platform selection, etc.
Ability to make comprehensive use of tools and platforms.
After reading this, the article "what is the main work of linux operation and maintenance" has been introduced. If you want to master the knowledge points of this article, you still need to practice and use it yourself. If you want to know more about related articles, you are welcome to follow the industry information channel.
Welcome to subscribe "Shulou Technology Information " to get latest news, interesting things and hot topics in the IT industry, and controls the hottest and latest Internet news, technology news and IT industry trends.
Views: 0
*The comments in the above article only represent the author's personal views and do not represent the views and positions of this website. If you have more insights, please feel free to contribute and share.
Continue with the installation of the previous hadoop.First, install zookooper1. Decompress zookoope
"Every 5-10 years, there's a rare product, a really special, very unusual product that's the most un
© 2024 shulou.com SLNews company. All rights reserved.