Transcription

2005 AACE International TransactionsEST.04A Methodology for Estimating Engineering DetailsMr. James D. Whiteside II, PEWhile historical data supports the idea that engineering is a percent of TIC, the impact of factored engineering to a project is unclear to project managers. Too many project managers leavethis clause unchallenged and expose the project to unrealizedgrowth risks in cost and schedule. Factored engineering can: Bury a significant amount of contingency;Hide behind growing scope and construction costs;Leverage project profit to the front of the project; andAvoid discussing the true cost of engineering services.neering hours are discussed as they relate to direct labor hours.Direct labor does not include construction management. It onlyincludes the hours that directly contribute to the asset under construction.There should be no expectation that all of the engineeringdisciplines can be modeled by a single equation. A lot of work isnecessary to take a data set and apply several data analysis techniques until something begins to deliver consistent results. A relational database removes much of this drudgery.FILTERING DATAThere is a strong correlation between engineering hours andconstruction hours. This correlation allows a project team to accurately forecast final project costs and schedule before the finalinvoice to the engineering contractor is paid and before the project experiences the first project over-run forecast. Engineering issolely responsible for controlling the scope of the project. Sinceconstruction hours grow three-fold for every additional hour ofengineering, there is no acceptable explanation for project overrun being forecasted when construction is halfway completed.The only explanation for a late over-run forecast is that the correlation between engineering and construction is being misunderstood or ignored.This paper introduces a concept called an estimate triangle.Estimate performance can be improved through better estimatingof engineering, by directly calculating engineering hours frommaterial quantities, and by cross-checking with a correlation fromdirect labor hours. The focus of this paper will be on calculatingengineering hours from material quantities.STUDY BASISThe techniques used for engineering analysis and modeldevelopment are discussed in detail in "Developing EstimatingModels" [1]. The data for this paper is drawn from over one hundred completed refining and petrochemical projects ranging from 100,000 to 500,000,000. Absolutely no cost data was used. Theestimating methodology is completely based on material quantities and labor hours. Only discipline engineering hours such asthose shown in table 1 are analyzed in this paper. Discipline engi-Of the hundred projects that comprise the engineeringmodel, hundreds more were discarded. There are a couple ofapproaches to filtering data. One approach is to manually discarddata that is out-of-bounds. For example, if a similar group of projects all have about 20,000 hours of engineering and one has 500hours, then the low value should be initially removed from thedata analysis. The removed data point may be part of a differentanalysis, or it may be an anomaly.Another approach is to statistically discard data. In the case ofpiping, study results have determined that direct-labor hour perpound is a strong correlation. Using statistics, data is discardedthat does not fall within one standard deviation of the averagehour per pound. Similarly, data filters can be applied to theremaining accounts.IDENTIFYING THE PROBLEMFigure 1,"Percent of Total Installed Cost vs. Project Size inMillions of Dollars," shows the result of applying a simple statistical average across the entire project data population. In any givenproject size, engineering is close to both the industry average andthe expected 18 percent of TIC for refining projects.Figure 2 is the same data set showing the variation of the datafor each project size. The ninetieth percentile data point is represented in P90. Ninety percent of the data is below this point. Theother percentile marks are handled similarly. The column forprojects less than one million dollars appears to depict an engi-EST.04.1

2005 AACE International TransactionsTable 1—Engineering Disciplines.pline engineering hours. Each wavelet (1-5) demonstrates thatorganizations improve efficiency as projects grow in size.Organization growth in staffing occurs in step changes, represented by each wavelet. For example, staffing a procurement specialist will also require part-time staffing of contract administrators,expediters, and warehouse personnel. Most organizations can notarbitrarily use individuals for a few hours. Once a position isstaffed, the cost is incurred by the project whether or not there isenough work for that person. This is inefficient.The French mathematician Joseph Fourier (1786-1830)demonstrated that the most general form of periodic waves couldbe built as a summation of simpler harmonic waves. A Fourierprogram was used to correlate total engineering hours to totaldirect labor hours. Applying the Fourier principle in data analysis,as in figure 3, demonstrates that there is a strong behavioral pattern between engineering and direct labor hours. The overallanalysis indicates that there are strong correlations in the lesseraccounts.ESTIMATE TRIANGLEThis paper introduces a concept called an estimate triangle.The triangle shows the balance between materials, direct laborand engineering in the estimate. If a person is given any oneaccount, the other two accounts should be easily calculated. Thebasis of the data analysis makes use of the strong and direct correlation between three accounts: Figure 1—Percent of Total Installed Cost Vs. Project Size inMillions of Dollars.Figure 2—Percent of Total Installed Cost Vs. Project Size inMillions of Dollars (Data Variation Display).neering group that is out-of-control, thus resulting in a wide variance in the engineering hours required for a project.Figure 3, "Engineering Hours vs Direct Construction LaborHours," is a graph of one hundred and 50 projects where eachwavelet is a project size class. The graph is an expansion of thefirst column (Projects 1MM) of figure 2 and demonstrates thatthere is a direct correlation between direct labor hours and disci-Material to direct labor hours;Direct labor hours to engineering hours; andMaterial to engineering hours.Given the plentiful labor data from completed projects,direct labor hours are easily calculated from material quantitiesand labor productivity (figure 4, side A). In examining the detaileddata for projects that perform poorly, the balance in the estimatetriangle is still satisfied. However, compared to the funding estimate, poor performance is due to inadequate estimating techniques in the engineering and construction accounts. The estimate details for each side of the estimate triangle must be asrobust as the other two sides.Estimating has traditionally had a very robust side A, an adequate side B, but a poor side C. Estimating engineering using apercent of TIC makes side C collapse. Estimate performance canbe improved through better estimating of engineering and bycross-checking between two sides of the triangle.There is another benefit to developing an estimate of engineering in detail. Since direct labor and engineering hours havea strong correlation, the efficiency between direct labor and engineering can be analyzed.Engineering can now be analyzed in the same manner thatproductivity is analyzed for direct labor. Engineering productivityhas historically been elusive because there were no readily published data. If collected project data can be transformed into engineering models, then a set of engineering efficiencies and pro-EST.04.2

2005 AACE International Transactionsductivity indices can also be formulated. In order to reduce engineering costs, the project scope must also be reduced.PROBLEM SOLVING METHODOLOGYEngineering is a complex and daunting system to explain.Fortunately, engineers are organized, methodical, slow to change,and constantly evolving (improving) successful systems.Therefore, they are highly predictable. Estimating engineeringhours is a quantification of the activity hours. It is not the why orhow of what engineers do. Once a mathematical model is successful, it will be stable for a long time. The model can be evolvedalong with changes in engineering.There is no shortcut to decide which analysis tool works best.This is left to trial and error and the experience of the analyst. Theprocedure used to break down complex systems has three majorsteps. These steps are designed to produce quicker regression Figure 3—Engineering Hours Vs. Direct Construction Laborresults and to break down complex problems. Understanding sim- Hours.pler things may provide insight to more difficult issues. These arethe steps that determined the order of processing data and how tocorrelate engineering hours to material quantities.I. Divide the problem into simpler ones.1.2.3.4.5.6.Find the accounts that have the strongest correlations withthe least amount of analysis work.Evaluate data at the highest level possible before workingdown to the next level.Evaluate the accounts with the most data.Correlate quantities to direct labor hours.Correlate direct labor hours to engineering hours.Correlate quantities to engineering hours.II. De-ccouple the easy issues from the complex ones.1.Find a different correlation if a correlation do not yield atFigure 4—Estimate Triangle.least a 0.75 R2 (goodness-of-fit) value.2. Subdivide the data set into ranges, groups, etc.CONVERGENCE3. Examine accounts for interdependencies (process/equipment, steel/concrete, electrical/instrumentation).For an equation to be accepted as model for estimating engineering there has to be three equations for each relationship onthe estimate triangle. Rather than publish equations for eachIII. Evaluate the predictive model.accounts, which would be highly proprietary, the chart of correlation results will be provided.1. Build objectivity by dividing data into two sets, one for regresMechanical, electrical, instrumentation and concretesion and one for testing.accounts responded well to simple trend lines. Very little out-of2. Check for convergence because all three correlations mustbound data was discarded. Piping and steel accounts requiredconverge to the same R2 goodness-of-fit.spectral analysis. Process account required building a modified3. Test for predictability. Success is defined when the predictiveyield calculation. Mechanical and "Steel 30 Tons" are examplesmodel is within ten percent of actual data across the entire(table 2) that did not converge. The subjects will be covered inregression data set and within fifteen percent of test data forascending order of complexity.any given actual data point.EST.04.3

2005 AACE International TransactionsTable 2—Convergence Results.EST.04.4

2005 AACE International TransactionsFigure 5Figure 6—Engineering Hours Vs. Steel Tonnage.Figure 7—Delineation Factor (DF) TechniqueDirect Labor:Hours / Engineering Hours Vs. Total Equipment.PIPING ENGINEERINGMECHANICAL, CONCRETE, ELECTRICAL,INSTRUMENTATION ENGINEERINGFor mechanical, instrumentation, electrical, and concretedata, a simple trend line produced a reasonable average.Correlating hours to the total mechanical equipment count didnot produce a convergence (table 2, Mechanical 1). Mechanical2 shows improvement to the correlation when the mechanicalaccount was correlated separately to various equipment classes,pumps, exchangers, vessels, etc. It would seem reasonable that theelectrical account would have a better correlation on an hour toweight basis. In this group, a goodness of fit (R2) between 0.5 and0.7 is not significant enough to detract from using the simpleregressions as averages to calculate engineering hours. Missingany one of these accounts by thirty percent will not change thetotal engineering hours by more than five percent. Based on thecollected data, the correlations for these accounts are strongenough to determine that each side of the estimate triangle is satisfied.Analyzing piping over all sizes produced a satisfactory hour toweight correlation. There is, however, a significant improvementin the correlation when piping is broken into various average pipesizes. Figure 5A, "Hours Vs. Weight (All Pipe Diameters)" and figure 5B "Hours Vs. Weight (6-inch Average Pipe Diameter)," compare the difference between the piping regression of all diametersand of six-inch average pipe diameter.Analyzing piping based on length produced a larger standarddeviation. The "Length Vs Engineering Hours" column in table 2shows the correlation for piping. In nearly all cases, the weightbasis (columns A and C) produced better results than the lengthbasis. One reason the weight basis works better is that a joint ofpipe is installed at approximately the same rate per pound as thesame joint of pipe with a valve attached. For example, a valveweighs as much as a joint of pipe and is about twenty times shorter. The joint of pipe may take 1.6 hours per foot and the valve maytake 11 hours to completely install. Using the length basis, thevalve adds less than a foot to the length of pipe or less than 1.6hours to the installation calculation. However, the pipe and thevalve both are installed at approximately weight per hour.EST.04.5

2005 AACE International TransactionsSTEEL ENGINEERINGSteel presented a problem during analysis. At first thereappeared to be no correlation between direct labor and engineering hours. It is unclear in figure 6, "Engineering Hours Vs. SteelTonnage," if this is a scattered data plot or there are two differentpopulation trends (A and B) in the data. All simple correlationcombinations of labor hours, engineering hours, equipmentcount, and steel tonnage produced similar plots.All the study data is collected in short tons. Since a short tonis 0.907 metric tons (1,000 kg), the results and figures are essentially the same.A delineation factor (DF) technique was applied to decide ifthe data bifurcates or if there are parallel data trends. Bifurcationis a discipline of chaos theory that deals with nonlinear phenomena. It means that the system splits from one state into two possible states. Systems that increase exponentially in complexity, likesteel, tend to bifurcate. For example, small tonnages of steel aredesigned piece by piece. On the high end, like open bay, multiplefloor structures, steel is designed by advanced engineering computing models.A DF technique exaggerates discontinuities in a data set. Thepoint of discontinuity is where the data should be divided into separate data sets. A plot is made of a simple value against a complexvalue, the delineation factor. The complex value (DF) is a function of a principle value and an associated value(s). In this case,the DF was obtained by dividing the associated value (direct laborhours for steel) by the principal value (engineering hours for steel)and plotted against simple value (total equipment).The intersection of lines A and B in figure 7, "DelineationFactor (DF) Technique," indicate that the division in data population is approximately 10 tons of steel. The clear field represented by the oval (C) in figure 7 provides further graphical proof thatthere are two different states of steel design. If there had been datain field, then another approach to analyzing steel would have hadto be taken. Clearly, the steel data represents two different populations for this data sample.Apply the delineation technique to different equipment subsets until all the major nodes are found. The next equipment subset would be to drop data with less than ten pieces of equipment.The result is shown in figure 8, "Equipment to Steel ComplexityBranches (Engineering Hours Vs. Steel Tonnage)." Each complexity of steel to equipment produces good convergence and correlation. Steel accounts seem to split at 5, 10, 20, 75 and 100pieces of equipment. Equipment to steel complexity differencesare represented by each branch in Figure 8.PROCESS ENGINEERINGengineers because the software that is used for process simulationcan easily perform hydraulic calculations. The hours are accounted in the process engineering account regardless of who performsthe calculations.Of the entire project data set, it became obvious that therewas no single universal regression to cover all equipment configurations for all size projects. The data population was therefore subdivided according to project characteristics.Project CharacteristicsThere are three types of projects. Large capital projects haveat least forty pieces of equipment, and all equipment groups aretypically present. Re-vamps, work on existing process units thatmay or may not be running during construction, are divided intotwo groups: 20 to 40 pieces of equipment and 5 to 20 pieces ofequipment. Small projects with a half dozen pieces of equipmentand some equipment groups present may not have equipment.For a given project type (large, revamp, small), a correlationwas developed for each equipment type as it related to the totalhours in process engineering. Miscellaneous equipment count isadded to the equipment account total that closely resembles themiscellaneous equipment's principle configuration. For example,the process specification for an exchanger is developed by processengineers, but the mechanical design is typically performed bythe manufacturer. If the miscellaneous equipment is similar to anexchanger, then add it to the exchanger total.Weighted CalculationFigure 9, "Equipment Count Vs. Process Hours," is an example of where the equipment count to process hours was a simplestraight-line regression. For any project, process hours are thesummation of five equipment equations and one piping equation.Each of these products is weighted by the R2 value for the equipment account divided by the total R2 of all the equipmentaccounts.Pr ocess hours f (n) (R 2 (n) R 2 ) for n 1 to 6Table 3, "Weighted Calculation," shows that for any givenproject, the process hours are calculated within ten percent of theactual hours. The predicted process hours are ten percent higherthan the actual hours for nearly the entire set of projects. Theranges in the delta column are important inputs to running aMonte Carlo risk analysis. Table 3 also demonstrates that not allof the accounts need to be populated to calculate accurate estimates of process hours. This is the benefit of a weighted calculation.This is the most difficult category to estimate. Examining 95completed projects uncovered six drivers to process engineeringCALCULATING ENGINEERINGhours. The six drivers that are significant to the calculation ofprocess hours are pipe weight, exchangers, pumps, compressors,The process of estimating engineering begins by developingvessels and furnaces. Process engineering for piping does not a study to produce the regression equations for all sides of theseem that it should be included as one of the six drivers. However, Estimating Triangle. This means that there will be at least thirtymost engineering contractors assign the hydraulics to processEST.04.6

2005 AACE International TransactionsTable 3—Weighted Calculation.equations to describe the data. Once this is done, programautomation can be used.Estimating engineering starts with estimating the number ofpieces of equipment (by major account) and material quantities.Estimating quantities from an analogous project is a good start forconceptual estimates. This data is simply entered into the studyequations to calculate engineering hours by discipline. This isgood for immediate and near-term answers. As new data isacquired into the database, the study equations will updated andreplace by newer ones.The best solution is to use automation to create new regressions based on the techniques documented in this study. Newlyacquired data is fed int