
Tru64 UNIX Troubleshooting: Diagnosing and Correcting System Problems
Ebook · 518 pages · 5 hours


About this ebook

Dealing with system problems—from user login failures to server crashes—is a critical part of a system administrator's job. A down system can cost a business thousands of dollars per minute. But there is little or no information available on how to troubleshoot and correct system problems; in most cases, these skills are learned in an ad-hoc manner, usually in the pressure-cooker environment of a crisis. This is the first book to address this lack of information.

The authors (both experienced Tru64 UNIX support engineers for Compaq) systematically present the techniques and tools needed to find and fix system problems. The first part of the book presents the general principles and techniques needed in system troubleshooting. These principles and techniques are useful not only for UNIX system administrators, but for anyone who needs to find and fix system problems. After this foundation, the authors describe troubleshooting tools used in the UNIX environment. The remainder of the book covers specific areas of the Tru64 UNIX operating system in detail, listing common problems, their causes, how to detect them, and how to correct them. Each chapter includes a "Before You Call Support" section that details the most important things to check and correct before it's necessary to call Compaq technical support. The authors also include decision trees to help the reader systematically isolate particular problem types.

· "Before You Call Tech Support" sections
· Tables and diagrams for quick access to precise data
· Decision trees to help choose the best way to troubleshoot a particular problem
Language: English
Release date: Dec 10, 2002
ISBN: 9780080519746
Author

Martin Moore

Martin Moore leads the UNIX Expert Support Team at Hewlett-Packard Corporation, a team he has headed since 1997, when it was part of Compaq Computer Corporation (now HP). He has been a UNIX system administrator and consultant since 1992, and is an expert in UNIX security, performance tuning, and Tru64 UNIX internals.


    Book preview

    Tru64 UNIX Troubleshooting - Martin Moore


    1

    Introduction

    Problems are only opportunities in work clothes.

    —Henry J. Kaiser

    1.1 Tru64 UNIX History

    The operating system that we discuss in this book has had a varied and interesting history. Recent years have seen some significant developments, culminating in the decision to merge key elements of Tru64 UNIX, such as the Advanced File System (AdvFS) and TruCluster, with the HP-UX kernel to create a powerful new enterprise UNIX. The merger of Compaq Computer Corporation (Compaq) with the Hewlett-Packard Company (HP) is sure to make the future of this product interesting as the two UNIX powerhouses combine resources to create exciting new synergies.

    Tru64 UNIX began as an attempt to commercialize the Mach kernel, developed by researchers at Carnegie Mellon University, into the foundation for a new UNIX system. Nothing in the Mach kernel required the UNIX interface, but the researchers saw the benefit of taking a high-performance microkernel and providing POSIX operating system services around it. Although UNIX originally began as a small and easily portable system, by the late 1980s it had begun to move away from those roots and toward the monolithic structure typical of many operating systems of the era, in which all of the system’s functions are provided by a single, large kernel program.

    A group known as the Open Software Foundation (OSF), making an attempt to reverse this trend, adopted the Mach kernel as its core. As a founding member of OSF, Digital Equipment Corporation (Digital) took much of the work done by the group and decided to commercialize it into a product running on a hot new processor they were developing at the time—the 64-bit Alpha processor. The operating system that Digital developed, based on the work done by the OSF, was named DEC OSF/1. The new operating system, designed from the ground up to be 64-bit, continued to evolve over time. More advanced features, such as symmetric multiprocessing (SMP), the Advanced File System (AdvFS), the Logical Storage Manager (LSM), and clustering, were added.

    Around the same time, it became clear that Digital was the only member of the OSF to adopt the OSF/1 core for its UNIX operating system. The decision was made to change the product’s name to Digital UNIX (at version 3.2C) in order to break away from OSF roots and emphasize its UNIX character. Before long, clusters of large Turbolaser 8000-series systems were shipping with this powerful and flexible operating system. Later, when Compaq acquired Digital, it decided to rename the product again to Tru64 UNIX (at version 4.0E) in order to better represent its 64-bit qualities and to eliminate the old Digital name.

    The Tru64 development team began an incredible overhaul of the product in version 5.0. This major version and its successors introduced support for single system image (SSI) clustering and nonuniform memory architecture (NUMA) systems. Some of the important new features added to support SSI clustering were a new device-naming standard and a cluster file system. Tru64 UNIX was enhanced to support the global server (GS) series, supporting up to 64GB of physical memory and 32 processors. With the V5.1A release, the product gained the additional capability of local-area network (LAN) interconnect clustering in place of memory channel, providing much greater flexibility in designing and fielding new clusters. In the future, there will be additional support for the even more powerful Marvel Alpha platform. As of this writing, these systems seem sure to push the limits again for memory and processor support in the operating system.

    1.2 Scope

    This book is written for the average Tru64 UNIX system administrator who has a tough problem to solve and isn’t sure how to troubleshoot it. We attempt to distill our years of experience in troubleshooting Tru64 UNIX problems into a disciplined approach. There are always tricks and techniques that can be used in any particular situation, and we will provide those whenever appropriate. The overriding goal of this book, however, is to instruct the reader in basic troubleshooting principles and techniques. From that foundation, specific practices follow naturally.

    It is assumed that the reader is familiar with basic Tru64 UNIX system administration. Because of space constraints, this text is not meant as a comprehensive reference, and the reader is referred to the on-line reference pages or Tru64 UNIX documentation for in-depth coverage of a particular tool or option. The reader can also look for additional references in Appendix B of this book for more information on a particular topic.

    This book discusses Tru64 UNIX version 5.1A, although in many cases the material is sufficiently general to be fully applicable to other versions. If a technique or feature applies only to a specific version of the operating system, that will be so noted.

    1.3 Organization

    This book is organized into 12 chapters, which cover the following topics:

    1. Introduction. This includes a brief overview of the history of Tru64 UNIX and its current standing. We also describe the scope of the book and what we hope the reader will gain from it.

    2. Principles and Techniques. This chapter is the foundation of the book. We present techniques for detecting problems and isolating their root causes. The material in this chapter is primarily of general applicability, not limited simply to Tru64 UNIX.

    3. Tools and Resources. This chapter focuses on tools available for system reporting, error detection, and isolation in Tru64 UNIX, including parts of the operating system (e.g., Event Manager, Compaq Analyze, collect, and sys_check).

    4. User Accounts and Security. This chapter discusses common problems with individual user accounts, such as login failures, password problems, enhanced security features, and shell environment problems.

    5. System Failures. This chapter covers system failures such as system crashes, system hangs, and boot failures. Regarding system crashes, we do not provide a detailed course in crash dump analysis; this would be overly long, beyond most readers, and of little help except for those administrators who have purchased a Tru64 UNIX source license. We do provide a way for administrators to take a quick look at crash dumps and, in many cases, determine whether the problem is a local issue (e.g., a hardware failure or full disk) or an operating system failure that needs to be referred to HP technical support.

    6. System Performance. In this chapter, we focus on identifying and relieving system performance bottlenecks. We include general system-tuning information, but our intent is to focus on responding to performance problems rather than on tuning for maximum performance. The latter is a complex issue and beyond the scope of this book.

    7. Networking. We focus on common problems with key Tru64 UNIX network services, including the Network File System (NFS), the Network Information Service (NIS), and the Berkeley Internet Name Domain (BIND). We discuss ways to determine whether a network problem is caused by hardware, but troubleshooting specific network hardware components is outside the scope of this book.

    8. Storage. Our focus here is on disk and tape issues: identifying hardware problems (but not troubleshooting specific hardware components), device-naming issues, and Logical Storage Manager (LSM) problems.

    9. File Systems. Here we cover common problems with the UFS and AdvFS file systems. We also discuss problems with the Tru64 native backup utilities such as tar, vdump, and vrestore.

    10. System Configuration. This chapter discusses problems that arise in various aspects of system configuration: installing or upgrading the operating system, installing patches, and configuring the kernel.

    11. System Administration. This chapter covers problems in several areas of system administration, including licensing, printing, and job scheduling.

    12. Display Problems. This chapter covers common problems with graphics consoles and X Windows displays. The discussion is limited to isolating and correcting such problems, and is not intended as a comprehensive treatment of computer graphics or X Windows.

    2

    Principles and Techniques

    When you have eliminated the impossible, whatever remains, however improbable, must be the truth.

    —Sherlock Holmes

    2.1 The Troubleshooting Process

    Every computer system, no matter how reliable, well-designed, or well-managed, will inevitably encounter problems. When a problem occurs, it is up to the system administrator to identify and correct the problem—and then, in most cases, to provide explanations to users or management. This chapter provides a basic set of troubleshooting principles and techniques that will apply to any problem situation. These principles and techniques lay a foundation for the Tru64 UNIX troubleshooting specifics in the remainder of the book.

    Troubleshooting computer problems has never been a well-defined subject. College classes and training courses provide a great deal of information on the theory and operation of computers, but they tend to focus on what happens when the system works, not when it breaks. Some computer manuals provide troubleshooting information, but it’s usually of a specific nature, focusing on the responses to specific errors. Such specific information is useful and necessary (in fact, it forms the basis for the later chapters in this book), but it doesn’t address the process of troubleshooting.

    In our experience as technical support providers, we’ve found that troubleshooting ability seems to be independent of academic achievement, computer skills, or experience as a system administrator. One of the interesting things about training new support personnel is finding that some people just seem to get it. They have an intuitive knack for getting to the root of a problem and finding the proper solution. Other people don’t have this knack and usually have to acquire their troubleshooting skills the hard way. The troubleshooting principles and techniques in this chapter are the result of our observation of such people (both with and without the knack), as well as the practices we’ve developed in our own work.

    In general terms, the process of troubleshooting can be broken down into five components:

    1. Detection—does a problem exist?

    2. Identification—what is the problem?

    3. Analysis—what caused the problem?

    4. Correction—how can the problem be fixed?

    5. Prevention—how can the problem be avoided?

    Figure 2.1 depicts a somewhat idealized relationship among these components. In practice, not all problems encompass all five distinct steps. Some steps may overlap in troubleshooting a particular case; in other cases, some steps are unnecessary. However, it’s useful to think of each component as an independent step when troubleshooting a problem. The following sections describe each component and explain how the five components relate to each other.

    Figure 2.1 The troubleshooting process.

    2.1.1 Detection

    The first step in troubleshooting a problem is detecting that a problem has occurred. At first glance, it may seem unnecessary to list problem detection as a distinct component of the troubleshooting process. However, not all problems are equally obvious, and it is worth considering the various ways in which problems can be detected.

    In the case of a major problem affecting an entire system (e.g., a system crash), it is immediately obvious to everyone using the system that a problem exists. In such cases, we can move on to the next step: identification of the problem. Other types of problems, however, are not so immediately obvious. It’s important to detect and respond to such problems as soon as possible; early detection and resolution can prevent more severe problems from occurring. For example, if a disk is starting to experience occasional errors leading to bad-block replacements, replacing the disk will require only a little downtime. But if the problem isn’t taken care of, the disk may continue to deteriorate and will probably crash at the worst possible time.

    When a problem is first detected by a system administrator, troubleshooting can begin immediately. However, if a problem is first noticed by users working on the system, it must be reported to an administrator before it can be resolved. As such, it’s necessary to have a way for users to report problems quickly and easily. The exact method used is dependent on local circumstances; it might be a help desk, a pager, an electronic mailbox, or a shout of Help! over the cubicle wall. Whatever method you choose, make sure that your users know how to use it. Users must be willing to report problems as soon as they occur. The best way to promote such willingness is to have an easy-to-use problem-reporting system and, most of all, to be responsive to problem reports.

    A trickier issue is invisible problems. For example, RAID (redundant array of independent disks) storage systems are a popular method of providing storage redundancy at relatively low cost. Some RAID storage sets add an extra disk to a number of data disks, compute parity information across the data disks, and store the parity information on the extra disk. If any one disk fails, the RAID controller automatically regenerates the missing data from the remaining data chunks and the parity data. Because the data is still available, it appears that the storage set is fine, though perhaps a little slow. This is great for data availability—but not so great for problem detection.

    Because there is no immediately obvious problem, no action will be taken to correct the problem unless there is some other means of detecting it. The RAIDset will continue to operate with one failed disk, but it will now be operating with no redundancy. If a second disk in the set fails, the storage set will go down and data will be lost. If the first failure had been detected promptly, the failed disk could have been replaced and the RAIDset restored to its normal level of redundancy before another disk failed. Detecting this kind of problem can be tricky; we’ll discuss strategies for doing so later in this book.
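
    To make the parity mechanism described above concrete, the following sketch shows XOR parity in miniature. It is written in Python purely for illustration; a real RAID controller does this on fixed-size blocks in hardware or firmware, and none of the names below correspond to any actual interface.

    def xor_blocks(blocks):
        """XOR a list of equal-length byte blocks together."""
        result = bytearray(len(blocks[0]))
        for block in blocks:
            for i, b in enumerate(block):
                result[i] ^= b
        return bytes(result)

    # Three "data disks" holding one block each (toy data).
    data = [b"AAAA", b"BBBB", b"CCCC"]

    # The parity disk stores the XOR of all the data blocks.
    parity = xor_blocks(data)

    # If any one data disk fails, its block can be regenerated by
    # XOR-ing the surviving data blocks with the parity block.
    regenerated = xor_blocks([data[0], data[2], parity])
    assert regenerated == data[1]  # the "lost" block is recovered

    Because XOR is its own inverse, the same operation that computes the parity also regenerates any single missing block; this is precisely why a parity RAIDset can tolerate exactly one failed disk, and why that first failure is so easy to miss.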

    Problems are characterized by their range of effect and by their severity. Range of effect defines who or what is affected by the problem: is the whole system or a critical function down, or is the problem limited to a subset (perhaps only one) of your users or applications? Severity defines the seriousness of the problem: is the system or function completely down, partially working, or just performing poorly? These two pieces of information are extremely important when reporting a problem. Not only are they necessary for problem identification, but they also help define the urgency of resolving the problem.

    Let’s consider a hypothetical problem. The system administrator of a student file server at a major university receives a report that some students are unable to log into their accounts on the server. Other students, however, are able to log in with no problem. In addition, some—but not all—students who were already logged in are now experiencing various problems.

    This problem’s range of effect is some (as yet undefined) subset of the system’s users. Defining the exact extent of this subset of users will help to characterize the problem. For those affected, the problem is extremely severe; the system is completely unusable for them. We’ll continue to follow this problem through the remaining components of the troubleshooting process. For now, we’ll leave it and go on to discuss problem identification.

    2.1.2 Identification

    Once the existence of a problem has been detected, the next step is to determine exactly what the problem is. This might seem redundant; after all, doesn’t the detection or report of a problem tell you what the problem is? To a certain extent, the detection and identification phases do overlap. In some cases, the problem report will contain all information needed to identify the problem. In other cases, though, more work must be done to determine exactly what the problem is—and identifying the problem is necessary in order to correct it. This, in fact, is the key difference between detection and identification. Detection is simply the awareness that a problem exists, whereas identification is the minimum characterization of a problem necessary to begin correcting it.

    As an analogy, consider a visit to the doctor because you have a cough, a sore throat, and a fever. These symptoms cause you to detect that a problem exists, but they don’t identify the problem. You might have a cold, the flu, a virus, or some obscure disease that you’ve never heard of before. The doctor will consider your symptoms in diagnosing your condition, but he may also check your vital signs, ask about your recent history, and perhaps do some other tests to identify the problem accurately. Only by identifying the actual problem can it be effectively treated. It might be possible to treat just the symptoms; while this might make you feel better, it probably won’t cure the underlying condition. Similarly, when troubleshooting system problems, it may be useful to provide a workaround for a problem in order to quickly restore lost capability. However, this is just like treating only the symptoms of a medical problem. It may be helpful in the short term, but in the long term it’s important to identify and correct the real underlying problem.

    So how do you identify a problem? Identifying something—whether a problem, a car model, or an animal—is a process of comparing observed characteristics with a known model until a match is found. This definition divides the process of identification into two components: observation and comparison. Observing the characteristics of a problem begins with the initial problem report. At a minimum, the type of problem, its severity, and its range of effect must be noted. It may be that these pieces of information are sufficient to identify the problem. If not, it’s necessary to continue gathering information about the problem. The gathering of information is not necessarily a passive process. Fully characterizing the problem often requires the troubleshooter to perform some actions and observe the results.

    Once a problem is characterized, it must be matched with a known model in order to identify the problem. This may be an automatic process that you don’t even have to think about. If the problem is of a type that you recognize, your mind matches the problem characteristics to a known model in your memory; your recognition of the problem provides the identification. On the other hand, if you’re not already familiar with the problem, you’ll probably need to consult other sources to find a matching set of problem characteristics. These sources may include coworkers, records of past problems you’ve encountered, Internet resources, and books such as this one. Every system administrator should maintain a list of resources for troubleshooting information. Appendixes A and B contain some valuable resources to add to your list.

    Problem identification frequently is not a one-shot process. In many cases, the initial information won’t be sufficient to identify the problem. When this happens, identification becomes an iterative process. After consulting available resources, you determine that additional information is needed to characterize the problem completely. Using your troubleshooting resources and the techniques described in section 2.2, you gather additional information. This cycle is repeated until the problem has been completely identified. The problem may even be solved by this point: sometimes the only way to identify a problem fully is to try various corrective actions and see which one fixes it.

    Let’s continue with the hypothetical problem from the previous section. Some of the student users of a university system are unable to log in, and some who have already logged in are now having various problems. The report of this problem made the system administrator aware of the problem, but it was not enough to identify the problem. In order to identify the problem fully and begin correcting it, more information is needed.

    The first step is to characterize the range of effect. What is common among the users having problems, and what is different between those users and other users who are still working normally? After comparing a few users of both types, it becomes apparent that the problem is affecting only users with home directories on a particular disk. A quick check confirms that the directories on that disk are no longer accessible. Further checking reveals that the disk is off-line, and attempts to bring it on-line again are unsuccessful; the disk has failed. The problem has now been identified, and corrective action can begin.
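
    As an illustration of this comparison step, the following Python sketch isolates the filesystem common to the affected users but not to the unaffected ones. The user names, home directories, and mount table are invented for illustration; on a real system, the same information would come from /etc/passwd and the mount table.

    def home_filesystem(user, homes, mounts):
        """Return the mount point containing a user's home directory."""
        home = homes[user]
        # The longest matching mount point wins (e.g., /home2 over /).
        return max((m for m in mounts if home.startswith(m)), key=len)

    # Hypothetical password-file entries and mount points.
    homes = {"alice": "/home1/alice", "bob": "/home2/bob",
             "carol": "/home2/carol", "dave": "/home1/dave"}
    mounts = ["/", "/home1", "/home2"]

    affected = ["bob", "carol"]
    unaffected = ["alice", "dave"]

    suspects = ({home_filesystem(u, homes, mounts) for u in affected}
                - {home_filesystem(u, homes, mounts) for u in unaffected})
    print(suspects)  # {'/home2'} -- the filesystem to examine first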

    2.1.3 Analysis

    Once a problem has been identified, you can start working to correct it. If the problem is well understood and has a clear solution, there is probably no need to delve any further into what caused the problem. However, sometimes it’s just as important to know why the problem occurred as it is to identify the problem. In such cases, it’s frequently necessary to perform additional analysis in order to understand the root cause of the problem. This is particularly true when trying to resolve a recurring problem. It may be possible to fix the problem each time it occurs—especially if you’re getting a lot of practice at it—but it’s highly likely that there’s something else going on that causes the problem to keep returning. In the long run, it’s much better to find and address the underlying cause in order to prevent the problem from recurring.

    Even if the problem isn’t a recurring one, it may be desirable to perform additional analysis as a preventive measure to see whether any additional problems are lurking. In the university system example, the problem has been identified as a failed disk. That disk must be replaced and its files restored, regardless of whether any other problem exists. But it’s worth doing some additional analysis to see whether there might be problems extending beyond a single disk. One possibility is to check the error log for disk errors. If there are numerous errors on the failed disk but few or no errors on other disks, then the problem can reasonably be assumed to be limited to the failed disk. However, if there are a number of errors on other disks, then it’s time to look for a pattern. Are the disks all on the same bus or controller, or in the same storage cabinet? Any of these may indicate an underlying problem that, if not found, could lead to other disk problems in the near future. By rooting out and correcting the underlying problem now, those other problems can be prevented from ever happening.
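
    A sketch of this kind of pattern hunting follows. The error records here are invented for illustration; on a real system, the counts would be extracted from the error log with the reporting tools discussed in Chapter 3.

    from collections import Counter

    # Hypothetical (disk, bus, controller, error_count) records.
    errors = [("dsk3", "scsi1", "ctlr0", 42),
              ("dsk5", "scsi1", "ctlr0", 17),
              ("dsk7", "scsi2", "ctlr1", 1)]

    by_bus = Counter()
    by_controller = Counter()
    for disk, bus, ctlr, count in errors:
        by_bus[bus] += count
        by_controller[ctlr] += count

    # A heavy concentration of errors on one bus or controller points
    # to an underlying problem there, rather than in any single disk.
    print(by_bus.most_common(1))         # [('scsi1', 59)]
    print(by_controller.most_common(1))  # [('ctlr0', 59)]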

    In some cases, additional analysis is needed to refine the identification of a problem and determine the proper corrective action. For example, Tru64 UNIX panics if the root file system can’t be mounted at system startup. When this occurs, the system console prints the panic message vfs_mountroot: cannot mount root. This message is quite clear as to what the problem is: the root file system won’t mount. This appears to be a specific, well-defined problem. However, there are no fewer than nine possible causes—each with a different solution—for this error:

    1. The root file system is corrupted.

    2. The /etc/fstab entry for the root file system is incorrect.

    3. The root file system is AdvFS, but AdvFS isn’t built into the kernel.

    4. The root file system appears to be an LSM volume, but LSM isn’t built into the kernel.

    5. The /etc/fdmns link to the root file domain is pointing to the wrong device.

    6. The root file system is an AdvFS domain with more than one volume.

    7. The hardware configuration has changed, causing the device numbers to change.

    8. The console firmware is outdated and incompatible with the current operating system version.

    9. The root device has a nonzero logical unit number (a limitation in Tru64 UNIX versions earlier than V5.0).

    In such cases, additional analysis is needed to determine which of the several possibilities caused the problem. This, in turn, will determine the appropriate corrective action.
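
    By way of illustration, the following sketch automates two of these checks: the root entry in /etc/fstab (cause 2) and the /etc/fdmns link for the root domain (cause 5). The domain name root_domain is the usual default but may differ on a given system, and these simplified checks are no substitute for examining the system directly.

    import os

    def root_fstab_entry(fstab="/etc/fstab"):
        """Return the /etc/fstab fields whose mount point is '/'."""
        with open(fstab) as f:
            for line in f:
                fields = line.split()
                if (fields and not fields[0].startswith("#")
                        and len(fields) >= 2 and fields[1] == "/"):
                    return fields
        return None

    def dangling_fdmns_links(domain_dir="/etc/fdmns/root_domain"):
        """List links in the root AdvFS domain that point at nothing."""
        bad = []
        for name in os.listdir(domain_dir):
            target = os.path.realpath(os.path.join(domain_dir, name))
            if not os.path.exists(target):
                bad.append((name, target))  # wrong or missing device?
        return bad

    print(root_fstab_entry())
    print(dangling_fdmns_links())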

    2.1.4 Correction

    Once a problem has been identified, it must be corrected. Correction is the heart of the troubleshooting process, and it is the focus of most of the detailed information in this book. As a component of the troubleshooting process, correction includes any action taken to restore or replace lost functionality. This may include finding a workaround, as well as actually fixing the problem. There are times when a full solution may be too expensive, time-consuming, or risky to implement immediately; in such cases, a workaround can serve as a temporary measure until the real problem can be corrected.
