Documente Academic
Documente Profesional
Documente Cultură
RELIABILITY MAINTENANCE
By Definition
Maintenance is ensuring that physical assets continue to do what the users want them to do.
RELIABILITY DEFINED
FAILURE simply means the inability of an equipment to perform its required function. The failure of a component is viewed as terminating its life on the other hand RELIABILITY is the probability that no failure will occur throughout a prescribed operating period.
MODULE 1
INTRODUCTION :
Morale declines and standard drops Spares and budget grows on maintenance
Start here
A BELIEF THAT All Parts will wear Backlog grows and PM is missed
NO
NO NO
NO
YES
NO PROBLEM!
Is it ok to fail?
I really dont see anything wrong with failure if we accept them positively
Henry Ford
(1863 - 1947)
Soichiro Honda
(1906 - 1991)
What people see of my Success is only 1 percent But what they dont see is 99%w/c are my failures
Today, Honda Corporation employs over 100,000 people in the USA and Japan, and is one of the world's largest automobile companies.
CAN WE REALLY ELIMINATE FAILURES ? An equipment will compose of the following Electronic parts Electrical parts Mechanical parts
(100,000 pcs) ( 30,000 pcs) (5000 pcs)
1) What exact part will fail ? 2) When will that part fail ?
But we have around 100 similar machines & 10 types of equipment Each equipment have around more than 130,000 components in it We only have 5 maintenance craftspeople per shift for all our equipment How do we know which parts will fail, what machine and when ? Can we accept the fact that failures are really meant to happen after all ?
Maintenance will only focus on failed parts that will stop the equipment from running & likely ignore failures of secondary functions
Inspections are added from time to time increasing the amount of work for the maintenance Maintenance are measured by how fast they perform their repair
FACT 1
Equipments do not fail, there are some parts on the equipment that had failed, once we have identified the failed part and replace it then the machine will be running again.
FACT 2
Although we might be using some statistics & history records as a baseline, the fact still remains, we do not know exactly which parts are going to fail and when it will fail precisely, but we certainly know that one day our car will run dead, our computer will stop working and our equipment will stop working due to an event of a failure or breakdown . . . . .
Making equipment more reliable is about extending the life & the time between failure (MTBF) as well as preventing failures by replacing of part & components. This is what maintenance is all about . . . . .
FAILURE
( TIP OF THE ICEBERG )
FRACTURE HUMAN ERROR CORROSION LOOSE BOLTS VIBRATION LOOSENESS DEFORMATION MISALIGNMENT DIRT / DUST LEAKAGE TEMPERATURE FATIGUE ABRASION CONTAMINATION LUBRICATION ENVIRONMENT
DETERIORATION
Failure Line
Accelerated Deterioration
Time-Based Condition-Based Failed State / Run To Fail
Point 1 Point 2
Point 3 Point 4
TIME
It is also borne out by the machine operator who says that every time maintenance works on it over the weekend, it takes up to Wednesday to get it going again
Reference page 143 RCM by John Moubrey
It is the belief that led to the idea that the more often an item is overhauled, the less likely it is to fail . . .
Schedule Overhauls / Preventive Maintenance increases Overall failures by introducing Infant Mortality into otherwise stable system
Resulting schedules are used for all similar assets again, without considering that different consequences apply in different operating context. This results in large number of schedules which are wasted , not because they are wrong in the technical sense, but in reality, they achieve nothing
What did Stanley Nowlan and the late Howard Heap Discovered 2 discoveries evolved which created a change in the evolution and thinking of the maintenance system worldwide . . . . .
First, scheduled maintenance has little or no effect on the reliability of a complex item unless the item has a dominant failure mode. Second, there are many items for which there is no effective form of scheduled maintenance.
UNDERSTANDING BREAKDOWN
HARD FACTS ABOUT EQUIPMENT FAILURES
Not all failures will constitute a downtime Failure occur in 3 pattern, Infant Mortality, Random Failure & Age-Related Failures, and most of the failures we encounter is either random or infant mortality failures Increasing the amount of Preventive Maintenance activities on the equipment will likewise increase the chances of Infant Mortality Failures & that the only way to reduce Infant Mortality Failure is to reduce the amount of work in our PM Not all failures can be eliminated, the best that maintenance can actually do is to control the timing of failure and that reducing the consequences of failure is more feasible rather than trying to eliminate the failure itself
UNDERSTANDING BREAKDOWN
HARD FACTS ABOUT EQUIPMENT FAILURES
Preventive Maintenance can only capture wear out or age-related failures. When failure is random in nature, this is when PM is at weakest point and likewise not feasible to use All failures are not created equal, yet all failures will have their degree of consequences. Hence, the degree of maintenance requirements should be based upon the consequences of failure itself. When failure has little or minor consequences it is a good decision to allow the failure to occur
1 Failure / Mo MACHINE 6
1 Failure / Mo MACHINE 7
No Failures MACHINE 8
No Failures MACHINE 9
1 Failure / Mo MACHINE 10
9 Failures / Mo
8 Failures / Mo 1 Failure / Mo
No Failures
No Failures
Will these 10 equipments have the same amount of PM required ? Which machines will require the greater amount of maintenance ? Should we follow the specs or we apply common sense on maintenance ?
Ex : 100 failures encountered on a ball bearing for a span of 9 years & distribution is as ff 20 10
4 5
15
2
10
3
5
6
15
7
10
8
10
9
PERIOD OR LIFE
1
2
0
3
0
4
0
5
2
6
1
7
0
8
94
9
PERIOD OR LIFE
CONCLUSION : Failure distribution is almost age-related, for this case the best period to perform replacement is on the 8 month
There is a belief that all items have a life and that installing a new part before the life is reach will automatically restore it to its original basic condition = FALSE
And most maintenance only focus on the 3rd type of failure, and neglecting to understand that infant mortality failures & random failures occur more frequently than wear out failures
RANDOM FAILURES
BATHTUB CURVE
INFANT MORTALITY
Occurrences of random and infant mortality failures are more frequent than wear out failures WEAR OUT FAILURES
MODULE 2
REACTIVE MAINTENANCE
PREVENTIVE MAINTENANCE
PREDICTIVE MAINTENANCE
PROACTIVE MAINTENANCE
REACTIVE MAINTENANCE :
Maintenance is done at a point when there is repair or actual breakdown It occurs when repair action is taken on a problem only when the problem results in machines failure. Unplanned downtime, in its simplest definition, breakdown maintenance simply means fixing it when it fails
Run-to fail
Run-to destruction
Reactive Maintenance
Band-Aid Maintenance
No Scheduled Maintenance
Firefighting
REACTIVE MAINTENANCE :
If aint broke dont fix it, when it breaks will fix it
A purely reactive maintenance strategy ignores opportunities to influence equipment reliability and survivability Justifiable in particular instances if :
- Does not produce critical delays - Does not sacrifice peoples safety - Does not significantly increase costs - With redundant functions of standby
RUN TO FAIL If failure is evident and does not affect safety or environment, or if it hidden but does not affect safety or environment then default decision is No Scheduled Mtce
RUN TO FAIL MAINTENANCE IS VALID IF :
- A suitable scheduled tasks cannot be found for hidden function - A costs effective preventive tasks cannot be found for failures w/c have operational or non-operational consequences
NO
WILL THE BREAKDOWN BE MORE COSTLY THAN THE TASKS OF PREVENTING THE FAILURE ITSELF ?
NO
IS THE EQUIPMENT IN THE CRITICAL PATH IN MANUFACTURING OR CONSIDERED A BOTTLENECK EQUIPMENT OR PROCESS ?
NO
IS BACK-UP EQUIPMENT UNAVAILABLE ?
NO
WILL THE BREAKDOWN ADVERSELY AFFECT DELIVERY OR CUSTOMER SERVICE OR PROVIDE ANY DELAYS ?
NO
WILL THE BREAKDOWN FURTHER DAMAGE THE EQUIPMENT OR PROVIDE SECONDARY DAMAGES ?
Overstock inventories that can accommodate the repair time itself When the consequences of failure and the cost or repair is minimal
PREVENTIVE MAINTENANCE :
Preventive Maintenance is simply performing maintenance on a fixed interval w/c may be in the form of time, number of strokes or frequency
Calendar-Based
Stroke-Based
Time-Based
Running Hours
Scheduled-Restoration / Overhaul
PREVENTIVE MAINTENANCE :
Also known as Time-Based or Calendar
Based Maintenance
Maintenance activities are performed on
a calendar or fix operating schedule in order to extend the life of the equipment and prevent failures
Maintenance is performed without regard
to equipment condition
Assumes that the condition of the machine
and the need for maintenance is correlated with time which means that the item can be expected to operate reliably for an amount of time and is expected to wear out
A failure rate and history records are used
PREVENTIVE MAINTENANCE :
Stress cause an asset to deteriorate by lowering its resistance, exposure to stress includes output, distance traveled, operating cycles, calendar time and running time
These parts will survive this defined age Ex. 98 % of impellers were replaced after the end of 2 years
The part or component will have a normal rate of wear, TPM term will be natural deterioration. A more technical term will be normal fatigue Fatigue happens when the stress exceeds the strength of the material of the spare part or component Application of Preventive Maintenance tasks will only be worth doing and feasible to parts that will have a normal wear or deterioration
Why don't PMs significantly reduce the amount of reactive maintenance being performed in your plant? The answer is simple. PMs were designed around the theory that equipment failures are directly related to the age of the equipment. Since only 20 percent of equipment failures fit this pattern that means that 80 percent of equipment failures are not being effectively managed by doing time-based PMs.
PREDICTIVE MAINTENANCE :
Predictive Maintenance aids in detective potential failures in equipment with the aid of specialized instruments. Maintenance is based on the condition of the equipment which differentiate it from Preventive Mtce
Condition-Based Maintenance
On-Condition Tasks
Reliability-Based Maintenance
A person is gifted with 5 senses which are sense of smell, touch, taste, hear, sight. He can use these senses to detect problems on the equipment. Condition-Based Monitoring checks the condition of an equipment through the use of sophisticated measuring instruments with precision accuracy. Predictive Maintenance instruments are a higher form of the human senses
CBM tasks entails checking for potential failures, so that action can be taken to prevent the functional failure or to avoid the consequences of a functional failure
P-F INTERVAL
When to used CBM technique ?
P-F INTERVAL :
Is the interval between the emergence of the Potential Failure and its decay into a Functional Failure
POTENTIAL FAILURE : Is defined as an identifiable physical condition which indicates that a functional failure is either about to occur or is in the process of occurring FUNCTIONAL FAILURE : Is defined as the inability of an item to meet a specific performance standard
Increase in Noise Pressure change Flow rate change Lubricant contamination Wall thickness decrement Rate of corrosion Leak detection Crack detection
Overhauls performed on a fixed interval Overhauls to be performed if there is a whether Time-Based or Running hours potential failure detected Preventive Maintenance is performed when the machine is stopped Parts are being replaced on fixed-interval, after it reached its specific time or running hours Predictive Maintenance can be perform while the machine is running Parts are only replaced if a specific potential failure is present, if nothing is wrong, then no replacement takes place More cost effective than preventive since part is utilized almost all of its entire life span Parts with potential failures replaced
Parts are being utilized based on the frequency of replacement, parts will be replaced even when good, to conform
Possibility of replacing good parts
Cannot detect exact location of problem Infra-red cameras can detect the exact location of the temperature rise
PROACTIVE MAINTENANCE :
- Proactive Maintenance is about analyzing why failures occur so that its recurrence is finally eliminated, and thereby extending the life of the part or component - Proactive Maintenance is when maintenance or a group of cross-functional team analyzes the failure with analytical techniques such as Root Cause Failure Analysis, FMEA, Kepner Tregoe, P-M Analysis, Fault-Tree Analysis etc. are used to better understand why the failure occurred in the first place. - In Preventive Maintenance we replace the part that we think is in the process of wearing out. Our thinking is that replacing the part will bring the equipment back to its original condition, we have not taken into account the need to analyze further why a certain part keeps on failing.
Trouble shooting is no longer an effective strategy. In todays competitive world, the Analysts find real solutions . . . .
PROACTIVE MAINTENANCE :
REDESIGN or MODIFICATION
- Includes changing the specification of a component - Adding a new item - Replacing an entire machine with a different type - Relocating a machine - Change in process or procedure which affects operation
1980s
1900s
1920s
1930s
1940s
1950s
1970s
PROACTIVE MAINTENANCE :
OPERATIONAL & NON-OPERATIONAL CONSEQUENCES - Reduce the no. of times failure occurs - Reduce or eliminate the consequences of a failure (example thru redundancy) - Preventive tasks is costs effective hence alternate solution is to re-design FACTORS CONSIDERED IN REDESIGN : 1. Does the failure involved major operational consequences ? 2. Is the cost or scheduled / or Breakdown maintenance high ? 3. Are there specific costs which can be eliminated by the design change ? 4. Does the design have no harmful effects which can be generated afterwards ? 5. Is there an economic trade off study on expected cost savings ? 6. Is the asset to stay or to be used for a long time or will it be decommissioned ?
40 - 50 %
Predictive Maintenance
20 - 30 %
10 - 15 %
Reactive Maintenance
Scheduled Overhauls Schedule Discards Outage Schedules Level 1 Time-Based Maintenance Band-Aid Maintenance Stroke-Based/Running Hrs Breakdown Maintenance Scheduled and Fix Intervals Run to Fail / Destruction Is your company adopting No Scheduled Maintenance
P-M Analysis Root Cause Failure Analysis Failure Mode & Effect Analysis Level 3 Failure Analysis Condition-Based Maintenance Use of Diagnostic Tools Specialized Equipment Predict Eminent Failure Early Alert / Detection
Reliability-Centred Maintenance ?
MODULE 3
MTBF simply means the average time between failures. It is based on historical data or estimated by vendors and is use as a benchmark for reliability
MTBF =
OPERATING TIME
NUMBER OF FAILURE
WHERE : OPERATING TIME = LOADING TIME - MACHINE RELATED DOWNTIME LOADING TIME = AVAILABLE TIME - NON-MACHINE RELATED DOWNTIME
AVAILABLE TIME = 168 hrs
NMDT
40 hrs
MDT
72 hrs (6x)
OPERATING TIME
MTBF VARIATIONS
MTBF can be computed on the following basis : MTBF BY CRITICAL COMPONENT To determine on an average when a particular critical component will fail MTBF BY SUB-ASSEMBLY To determine which sub-assembly fails frequently on a machine MTBF BY MACHINE To determine the MTBF of a particular machine MTBF BY GROUP OF MACHINES To determine the machine w/ the lowest MTBF and perform improvements MTBF BY PROCESS OR LINE To determine which equipment fails frequently and identify the bottleneck area in a process
MTBF MTTF
A MTTR
B MTTR
Point where a new part is installed Time to repair Point where the 1st failure occurs
Point where the new part will fail again Point where the 2nd failure occurs
To determine the frequency of replacement for parts which have symmetrical or linear failures, not recommended for parts that fail randomly (Patterns D, E and F)
For failures that keeps on repeating itself over and over, the best strategy will be to address the real root cause of the problem and prevent it from recurring on its own again
MTTR DEFINED
MTTR is defined as the average time required to repair the equipment divided by the Breakdown Occurrence When the system fails, and it will fail, how easy will it be to recover?" Repair Time
MTTR =
Breakdown Occurrence
MACHINE DOWNTIME Endorse Machine to operator
MACHINE STOPS
Downtime means the total amount of time the asset would normally be out of service from the time it fails until it is fully operational
MTTR
MTTR varies from one company to another, hence, there must be a clear understanding on what MTTR constitutes
MTTR DEFINED
MTTR (Mean Time To Repair) is the average time required to repair a component Other terms used is Mean Time To Restore or Mean Time To Recover MTTR trend will be the lower or the shorter the time to repair the better. Improving the MTTR means shortening the time to repair the machine
MTTR DEFINED
MTTR (Mean Time To Repair) is the average time required to perform corrective maintenance or repair on all of the removable items in a product or system. MTTR analyzes how long repairs & maintenance tasks will take in the event of a system failure MTTR may be defined as the time it will take to bring a failed system back to its available or operating status again.
If an Ethernet card in your computer fails and takes 3 hrs to purchase and install a new card the MTTR for your computer will be 3 hrs but the Ethernet card is still broken and may never be repaired hence the MTTR for the Ethernet card is forever
UNDERSTANDING MTTR
A true and correct MTTR starts at the time of failure and continues until the system is operational again, regardless if a system part or component will be available or not
Legend :
Knowledge & Skill not Satisfactory (0 points) Knowledge Satisfactory ( 0.50 points) Skill Satisfactory ( 0.75 points) Knowledge and Skill both Satisfactory
Training Attended Classification No. Knowledge / Skill Item Yes No SAM BOB
1 Basic Machine Function
2 Machine Specs, Parts and Function 3 Knowledge in Actual Set-up and Conversion 4 Basic Lubrication Knowledge 5 Basic Repair and Troubleshooting 8 Failure Mode and Effect Analysis 9 Root Cause Failure Analysis 10 P-M Analysis 11 MTBA Snapshot and Analysis 12 Sequence Of Events Analysis 13 Knowledge and use on FRL's 14 Knowledge and use on Pipings and Connectors
15 Knowledge and use of Cylinders 16 Knowledge and use on Filtration 17 Knowledge and use on Speed Controllers 18 Leaks and Seals 19 Bearing Failures and Causes 20 Sensors Technology 21 Motors and Pumps
OTHERS
22 Screws and Fasteners 23 Spare Parts Management 24 RCM and OER Strategy 25 Maintenance Indices and Measurements 26 Knowledge on Vibration Monitoring
27 Principles of Heat and Thermography 28 Oil Analysis and Tribology 29 Ultrasonic Monitoring 30 CMMS Structure and System
S5-03
Total Points
Module 4
Root Cause Failure Analysis is trying to UNDERSTAND why something went wrong . . . . .
Root Cause Failure Analysis identifies the basic source or origin of the problem so that recurrence of the problem may be prevented
RCFA provides a methodology for investigating, categorizing and eliminating the root cause of incidents w/ safety, quality, reliability & manufacturing process consequences . . .
Identifying the Root Cause Failure Analysis event allows us to explain the WHAT, HOW and WHY of the failure
Proper Root Cause Analysis identifies the basic source or the origin of the problem . . . .
Every system, spares or components failure happens for a reason. There are specific succession of events that lead to a failure. RCFA follows the cause and effect path from the final failure back to its origin The root cause analysis methodology provides specific & solid foundation for preventing the recurrence of the problem or failure
Root cause analysis is a tool to better explain what happened, to determine how it happened and to better understand why it happen . . . . .
Root Cause Analysis separates the facts from hearsay. RCFA is not about trial and error and seeing what works and not
While there are many techniques in analyzing a problem which provide a quick answer, it does not mean that the answer is correct everytime. A true and meaningful Root Cause Failure Analysis takes the time to prove that what we say is fact & supports our hypothesis with evidence before we spend our money to improve the design of the equipment
When the facts are backed up by evidence & science and they are separated from the fiction we now have a better understanding as to the real Root cause of the problem
The group decided to install a surveillance camera to know who was stealing the money
Stolen by someone
Stolen by something
The video surveillance indicates that the customers entering the car wash hence, their hypothesis that customers was not paying was disregarded The owner try to simulate the Machine by placing some coins in them and the machine was then working properly so Change Machine Malfunction was not the problem, It is clear to them that someone is stealing the money but who . . .
Thats a bird sitting on the change slot of the machine and it had to go down into the machine but why ?
Thats 3 quarters he has in his beak, another amazing thing is that it was not just one bird but several of them
Once they identify the thieves, they found over $ 4,000.00 in the roof the the car wash and more under a nearby tree, therefore, the case of the stolen money was solved thanks to Root Cause Analysis . . .
Kingdom is Lost
Why is the kingdom lost ? Why is the king killed ? Why did the king fell of the horse ?
If the king is not killed then the kingdom had not been captured ? If the horseshoe did not come off the king might not fell on the ground and might not have been killed The groomsman might have prevented the king from riding the horse due to a missing nail and its implications If the kings horse shoe nail was complete then it might not have come of at all
Level 2
King is Killed
King fell of the horse
Level 3
Level 4
Level 5
Level 6
Level 7
If the city have been defended even if the king was dead then it might not have been captured ?
(1)
(2)
(3)
(4)
(5)
(6)
(7)
Determine the problem and ask why to determine the sequence of events in these sample
PROBLEM
Layer 1
PHYSICAL CAUSE
Layer 2
How did the incident occurred ? The Physics of the incident. This usually explains how the failure had occurred, example a bearing failed due to fatigue, this mostly explains the metallurgical factor why the failure occur What is the error committed that lead to the physical cause ? Either someone did something wrong or did the wrong thing We asked what caused the person to commit this mistake These are the management system weaknesses. These includes training, policies, procedures & specifications. People make decision based on these and if the system is flawed, the decision will be in error and will be the triggering mechanism that causes the mechanical failure to occur
HUMAN CAUSE
Layer 3
LATENT CAUSE
In RCFA Analysis a Logic Tree is used to work through a failure The failure event is placed on top followed by all failure modes or possible causes of breakdowns
Each of the causes are hypothesis that needs to be verified so that HYPOTHESIS VERIFY HYPOTHESIS we have an understanding on w/c of the causes actually led to the DETERMINE PHYSICAL ROOTS & VERIFY problem
DETERMINE HUMAN ROOTS & VERIFY DETERMINE LATENT ROOTS & VERIFY
The next step consists of determining and verifying the physical roots, human roots and latent roots behind the failure. The final cause will always have to do with the latent cause of failures
Root Cause
In performing Root Cause Failure Analysis, we are interested to know the real cause of a particular failure by verifying each hypothesis until we reach the final cause of the failure . . . . .
IN-DEPT ANALYSIS
ISHIKAWA / FISHBONE WHY-WHY ANALYSIS BRAINSTORMING PARETO ANALYSIS FMEA / FMECA FAULT TREE ANALYSIS
RCFA
PHYSICAL CAUSE
HUMAN CAUSE
LATENT CAUSE
Root Cause Failure Analysis will always be based upon pure evidence and takes the time to verify each failure mode to determine the real cause of the problem. RCFA only concludes once the latent cause had been identified
P-M ANALYSIS
PROCESS MAPPING FAILURE ANALYSIS
These techniques mostly concludes on the physical and human causes only
RCFA WORKSHOP 1 :
CAUSE STUDY :
A pump was declared failed since it was not discharging fluid at all. The pump failed due to a failure of the bearing. The maintenance decided to perform a Root Cause Analysis on the failed bearing to determine the real cause of the problem and have the failed bearing analyzed on a metallurgical laboratory. Arrange the causes in sequence to determine the real root cause of the problem
INSTRUCTION :
Brainstorm and analyze the case study and rearrange the set of cards and prepare a RCFA Logic Tree Diagram
Clues :
There are 6 or 7 levels in the logic tree Metallurgical lab report indicates that the bearing failed due to fatigue w/c is a a type of wear The last level (Bottom part) will be the real root cause of the problem
The pump may fail for a variety of reasons, in this case it is evident to the mtce that the cause of the pump to fulfill its function of discharge fluid is bearing failure.
since the part had evidently failed and production is up and running again but the question is asked, Did the problem go away ? No, it will recur again on a given time
at the failed bearing, he then takes a look on failure history and data of the pump, and conclude that a different type of bearing more heavy duty be installed. We would then get a heavy duty bearing and install it with the new design and again the question is asked, Did the problem go away ?
Pump Failure
(No discharge at all)
Functional Failure
Bearing Failure
Failure Mode
Valve Is Shut
Failure Mode
The bearing may fail on a variety of reasons, such as dirt entry or ingression which may have caused the accelerated wear of the bearing. All are probable causes and are still considered as hypothesis. Hence, to distinguished the facts from hearsay the bearing was sent to a metallurgical lab for further analysis to determine how did the bearing failed to fulfill its function.
LEVEL 3 : WEAR DUE TO FATIGUE
The bearing had been analyzed and reviewed by metallurgist and the report concluded that there is strong evidence of FATIGUE, now the other probable causes had been therefore eliminated we ask ourselves how can fatigue occur on the bearing ?
Pump Failure
(No discharge at all)
Functional Failure
Bearing Failure
Failure Mode
Valve Is Shut
Failure Mode
Dirt / Debris
LEVEL 3
Lack of Lubrication
Overloading
Wear
Have the bearing analyze for its metallurgical lab on why it failed
Adhesive
Abrasive
Erosive
Fatigue
Corrosive
How
Lubrication in the bearing was checked and found out it is sufficient Vibration monitoring shows there is no indication of overloading The only possibility left was Dirt/Debris and Wear and so the team decided to have the bearing test on a metallurgical laboratory
In Level 4 of our analysis we ask ourselves How can Fatigue occur on the bearing ? We hypothesize that it can come from high vibration. We check our vibration monitoring records and we are certain that there is evidence of excessive vibration. Excessive amplitude from our vibration data supports our hypothesis that fatigue occur on the bearing due to high or excessive vibration
LEVEL 5 : MISALIGNMENT
As we dig deeper into the root cause, again we hypothesize, How can we have excessive vibration? Possibilities is that it can come from imbalance, resonance and misalignment Again the vibration analyst verifies his vibration records and find out the resonance and imbalance is not a major cause for the excessive vibration. We called the maintenance who aligned the pump to align it again and we observe his practices. From our observation we are certain that he does not know how to align the pump properly
We asked the mechanic if he had been trained in the proper alignment and he said that he was never trained in how to align, there was no procedure for the alignment and how frequent it should be performed People often misalign because they were never trained in proper alignment practices, no procedure exists outlining alignment as a required practice with specification or the current alignment equipment we are using is worn our or inadequate for the application
Pump Failure
(No discharge at all)
Functional Failure
Bearing Failure
Failure Mode
Valve Is Shut
Failure Mode
Dirt / Debris
LEVEL 3
Lack of Lubrication
Overloading
Wear
Have the bearing analyze for its metallurgical lab on why it failed
Adhesive
LEVEL 4
Abrasive
Erosive
Corrosive How
How
LEVEL 5 LEVEL 6
Imbalance
Misalignment
Resonance
How
No Procedure
No Training
No Alignment Tools
The maintenance will merely change or replace the bearing. If this part fails frequently then boss makes sure that there is enough stock in the warehouse department
FROM A PREDICTIVE MAINTENANCE VIEWPOINT
Our CBM group can warn the operation of an impending failure to occur bought about by excessive vibration in the pump. Although the failure is predicted, the problem still does not seem to go away
FROM AN ENGINEERING VIEWPOINT
Modify or change the bearing with a more heavy duty and put it in service. In short we conclude at once to change out the bearings with a New Design
FROM A CONTINUOUS IMPROVEMENT VIEWPOINT
Brainstorming teams gather together with past history and data performance of the pump and sees a variety of causes, however they are not certain which is the real cause so they all agreed that it was due to the change in the lubricant
FROM AN OPERATIONS VIEWPOINT
Hold countless hours of meeting blaming the maintenance for not doing their job
FROM TOP MANAGEMENT VIEWPOINT
We penalize the culprits and even threathen to cut off their 13 month pay if the same problem arises in the future, or get another guy that can do the job better.
MODULE 5
LESSONS ON RELIABILITY
LESSON # 1 ON RELIABILITY
Focus must be on RELIABILITY & not cost, because if RELIABILITY starts to improve COST will definitely go down, there will be times that focusing on COST will tend to hurt RELIABILITY, it cannot be the other way around. Having a low cost maintenance is a consequence of good maintenance practice
The goal of any maintenance is to improve equipments reliability, once reliability starts to improve cost goes down & its not the other way around. Cutting cost on maintenance will definitely not improve reliability. Reducing cost had been a focus for most maintenance managers and that perhaps, we need to learn from the lessons of history. Cost must be studied thoroughly not just based from its initial cost but on the entire life cycle cost of the equipment . . . . .
LESSON # 2 ON RELIABILITY
Never ever accept failures in your plant. Trouble shooting is no longer an effective strategy. In todays competitive world, the analysts finds real solutions to the problems
When we get really good at doing things then something is wrong because we are doing it much often, but when we expect a different result from the same tasks we are doing then this is simple not possible, the Chinese called this INSANITY . . . . . The new paradigm is that FAILURES MUST NOT BE ACCEPTED it can be eliminated if we know the right tools to address them. The true job of maintenance is to eliminate failures & not fixing them all the time . . . . .
LESSON # 3 ON RELIABILITY
The best time to address a problem is when it is small. It is very hard to advance to any form of specialized maintenance activities and improvement efforts if equipment's Basic Condition had not been well established. Always remember our equipment is a shared responsibility for both operators & maintenance people, a lesson we must all learn from the Japanese.
Performing maintenance on the equipment is not the sole responsibility of the maintenance department, this should be a shared responsibility for operations and maintenance . . . . .
LESSON # 4 ON RELIABILITY
In a REACTIVE ENVIRONMENT, we always complain that we lack manpower resources to address failures, but once equipment starts to improve we always wonder where they have been in the first place . . .
In reality maintenance is not outnumbered, they are just too busy working with breakdowns. Maintenance is not measured by how fast we repair but on how we are able to eliminate the failure itself
LESSON # 5 ON RELIABILITY
Every failure has a specific set of consequences, being PROACTIVE has something to do about reducing or eliminating the consequences of failure to a minimum rather that completely eliminating the failure itself . . . .
The best maintenance strategy to adopt will always have to be based upon the consequences of the failure itself The first thing to ask in the event of a failure will be what is the consequences of the failure if it occurs on its own and will the failure be acceptable to the user or not . . . .
LESSON # 6 ON RELIABILITY
A question on why industry remain reactive may lead to a thousand reasons or more & those who fear that improving reliability may lead to elimination of jobs are right only to the point where they resist change. Increasing reliability is not achieved by cutting manpower nor are they contrasting goals. Increasing reliability means slowly getting out of the repair business so that new doors will open to maintenance function
The best positions in industry always belong to the maintenance function, however, most industries groomed their people to be mechanics rather than being a maintenance. Always be proud that you belong to the maintenance function . . . .
POSITIONS ON MAINTENANCE
Vibration Analyst Thermographer Ultrasonic Analyst Technical Trainer Oil / Lube Analyst Reliability Expert
Maintenance Positions
Fractographer
CMMS Specialists
Preventive Maintenance
Failure Analyst
LESSON # 7 ON RELIABILITY
The real mission of the maintenance department is to provide reliable physical assets & excellent support for its customers by reducing and eliminating the need for maintenance. Do not confuse maintenance as synonymous to repair, these 2 are entirely different. The distinction between a true blooded maintenance & a mechanic is a maintenance uses more of his brain than his hand while a mechanic uses his hand much of the time. Let us treat our people as maintenance & not as mere mechanics
LESSON # 8 ON RELIABILITY
There is no silver bullet program or strategy that can transform a plants reliability overnight all will start with its basic foundation and that is by EDUCATION and this is the most most powerful weapon to change the mindset of our people
Reliability is not a program with an end but a culture without an end, its the same as any continuous improvement philosophy . . . .
LESSON # 9 ON RELIABILITY
Always remember that in any Reliability Improvement Initiative, the focus must be on the people provide them with the skills they need & these skills will be used to improve their equipment. People will improve their machines and it is not the other way around
The saying that the companies greatest asset is its people is not always true in the real world of manufacturing. What is correct is that, the right people will be the companies greatest asset. There are people who wants to learn and there are people who never learn