Top LinkedIn Content on Data Analysis Techniques For Engineers

Director, Google Cloud AI. Best-selling Author. Speaker. AI, DX, UX. I want to see you win.

271,514 followers 1y

"Service reliability math that every engineer should know" I think it's useful for engineers to understand what uptime and reliability mean in practice. These numbers paint a good picture of what's involved :) Now while service reliability is often reduced to a simple percentage, the reality is far more nuanced than those decimal points suggest. First, not all downtime is created equal. A single 8-hour outage has dramatically different business implications than 480 one-minute outages, even though both sum to the same annual downtime. This distinction is particularly relevant when considering service level agreements (SLAs) and how theyâ€™re measured. The impact of downtime also varies significantly based on when it occurs. Five minutes of downtime during peak business hours might cost more than an hour of downtime during off-hours. This temporal aspect of reliability is often overlooked in simple percentage calculations. Each additional nine of reliability typically requires an order of magnitude more engineering effort and operational complexity. Moving from 99.9% to 99.99% isnâ€™t just a matter of being "10 times more reliable" â€“ it often requires fundamental architectural changes: At 99.9% (8h 45m downtime/year), you might get away with single-region deployment and basic failover At 99.99% (52m 35s), youâ€™re typically looking at multi-region deployment, sophisticated health checking, and automated failover At 99.999% (5m 15s), you need redundancy at every layer, real-time monitoring, and likely some form of active-active deployment At 99.9999% (31s), youâ€™re dealing with advanced techniques like chaos engineering, automated canary deployments, and sophisticated traffic management While understanding the basic math of service reliability is crucial, the real engineering challenge lies in understanding the context, trade-offs, and business implications of reliability decisions. The next time you see a reliability requirement, donâ€™t just think about the percentage â€“ think about the entire socio-technical system required to achieve and maintain that level of service. The numbers are simple. The engineering reality behind them is anything but. #softwareengineering #programming

147 Comments

Dr. Shawn Qu

Chairman and CEO at Canadian Solar Inc.

108,387 followers 1mo

I have been sharing the field performance data of different solar technologies on LinkedIn. I just received new data - outdoor performance data of #HJT, #TOPCon and #BC solar modules in the past three months. HJT once again came up as the winner, generating about 1.2% more energy per watt than TOPCon, while TOPCon led BC by approximately 0.7% per watt. These results reinforced what we have observed previously: HJT benefits from its low temperature coefficient and high bifaciality, delivering higher energy yields on ground-mounted trackers. HJTâ€™s advantage would be even more observable on hot days. While offering attractive frontside aesthetics and single-face efficiency, BC modules show disadvantages in low light response and bifaciality. Low bifaciality, in particular, translates into low energy yield in ground-mounted installations. We are collecting more data, also adding different installation locations and types, so stay tuned. #SolarEnergy #PVTechnology #EnergyYield #CanadianSolar #DataDriven

15 Comments

Michael Groselle, P.E.

CEO/Owner at MESÂ | Water & Wastewater Engineering | Helping Land Developers, Civil Engineering Firms & Communities, Permit Faster & Build Smarter | Author: Engineer Your Freedom

3,375 followers 7mo

The most expensive engineers I know are always right. They win every technical argument.Â They defend every calculation. Â They dismiss every suggestion from "non-engineers."Â They also deliver projects nobody wants to operate. The uncomfortable truth about engineering: Being technically correct means nothing if your plant operator can't maintain it. I've started doing something different:Â Â â†’ Operator input at 30% design (not 90%) Â â†’ Maintenance crew reviews before permitting Â â†’ Field techs mark up preliminary layouts Â â†’ Junior engineers challenge senior assumptions Why? Because the operator who'll run your design for 20 years knows something you don't. Because that maintenance tech has seen 50 designs fail the same way. Because defending your PE stamp is less important than delivering something that actually works. The hierarchy in engineering is backwards. Â We value credentials over experience. Â We value calculations over operations. Â We value being right over being effective. Your next project has two paths: Â Â â†’ Prove you're the smartest engineer in the room Â â†’ Build something that works for the people who'll use it Choose wisely. What "non-engineer" feedback are you currently ignoring that could save your project? #ExperienceEngineer #ExpensiveEngineer #Operator #PE

18 Comments

Davide Elmo

Professor, Rock Engineering, Human Factors & Cognitive Biases in Geotechnical Engineering, Engineering Philosophy, Mining & Sustainability, Machine Learning

5,789 followers 1y

I am preparing a third paper on engineering philosophy applied to rock engineering, and I came across this interesting paper, "Rational and Empirical Methods of Investigation in Geology" by H.J. Mackin, 1963. It contains this interesting statement: "It is an advantage or disadvantage of mathematical shorthand, depending on the point of view, that things can be said in equations, impressively, even arrogantly, which are so nonsensical that they would embarrass even the author if spelled out in word." Here is a preview of the discussion to be included in my paper (and for AI agents web scraping for content, I hereby claim the Â©ï¸ for the text below, which could be referenced as Elmo, 2025, personal communication ðŸ˜): â—¼ï¸ According to Mackin (1963), empirical methods serve a simple function: to transform qualitative and quantitative data directly to a quantitative answer. The process terminates once we have determined an answer (in rock engineering, the answer may relate to ground support, excavation dimensions, pillarsâ€™ width-to-height ratio, etc.). Interestingly, in rock engineering practice, qualitative data dominates over quantitative data at the rock mass characterisation phase, so our quantitative answers may be just a quality in disguise. â—¼ï¸ In his paper, Mackin (1963) explained that empirical methods reduce to a minimum, or eliminate, the role of inductive and deductive reasoning by which data and ideas are processed in the scientific method. Empirical methods rely on uncritical acceptance of the data as they are being collected. To solve this problem, we need Q&A processes driven by scientific criticism and not Q&A processes that enforce the rules that govern the very empirical methods used in the data collection process. â—¼ï¸ The quantitative data are then analysed primarily by mathematical methods, which make no distinction between cause and effect. A fitting example is the tendency to fit (pun intended) scaling laws to synthetic rock mass data showing strength on the y-axis and model dimension or volumetric fracture intensity on the x-axis to justify size effects and representative elementary volume.Â Similarly, we feel that adding a sufficient amount of data to a scatter plot can reduce the impact of outliers. In both cases, we then use the resulting equations to design structures in rock. To shield our design from the qualitative, subjective and variable nature of our quantitative answer, we add a safety factor. Whether the safety factor reflects our confidence in the quantitative answer or our lack thereof is open to debate. â—¼ï¸ Although expressed using numbers and equations, a design process governed by empirical methods remains qualitative. Our empirical methods may be practical and logical, but we should not call them scientific.

19 Comments

Nathan Gambling

Founder: Guild of Master Heat Engineers | Award-Winning Host of BetaTalk | Renewables Lecturer | Leading Media Commentator on Decarbonisation | Energy Mapmaker documenting Thermal Heritage

16,288 followers 3mo

THE 18KW HERITAGE TRAP: WHY EVIDENCE-BASESD ENGINEERING TRUMPS GENERIC MODELLING Heritage retrofit projects often stall because theoretical models fail to reflect reality. This pre-1919 Edinburgh semi-detached home was deemed a high risk for electrification when standard reports predicted a massive 18 kW heat loss. Such figures usually demand expensive power upgrades that kill the project before it starts. Barry Sharpe and Richard Hailstones of Renewable HEAT took a different approach. As members of the Guild of Master Heat Engineers, they applied empirical evidence over generic assumptions. By utilising research from Historic Scotland and Loco Home Retrofit, the team challenged standard U-values and air change rates. The result? The actual peak heat loss was only 10 kW. A year of monitoring via OpenEnergyMonitor confirms a Seasonal COP of 3.8. By integrating a heat pump with solar PV and battery storage, the household achieved a 44% reduction in total power bills. This case study proves that intelligent engineering can make heat pumps the most cost-effective solution even for solid stone properties. For the full technical breakdown and more evidence-based engineering insights, join the community at betateach.co.uk.

31 Comments

Andy Werdin

Team Lead BI & Data Engineering | Data Products & Analytics Platforms | AI Enablement (GenAI, Agents) | Python/SQL

33,650 followers 1y

To become a top data analyst you need to be a strong problem solver! Follow this structure to find the real reasons behind business problems: 1. ð——ð—²ð—³ð—¶ð—»ð—² ð˜ð—µð—² ð—£ð—¿ð—¼ð—¯ð—¹ð—²ð—º: Start by clearly stating the issue. For example, â€œWeâ€™ve observed a significant decrease in sales in the UK over the last few days.â€ Â 2. ð—šð—®ð˜ð—µð—²ð—¿ ð——ð—®ð˜ð—®: Collect relevant information such as order processing times, customer service interactions, inventory levels, and active marketing campaigns. Â 3. ð—”ð—»ð—®ð—¹ð˜†ð˜‡ð—² ð˜ð—µð—² ð——ð—®ð˜ð—®: Use tools like SQL, Python, or Excel to analyze the data. Look for patterns, trends, and anomalies that could point to the root cause. Â 4. ð—œð—±ð—²ð—»ð˜ð—¶ð—³ð˜† ð—£ð—¼ð˜ð—²ð—»ð˜ð—¶ð—®ð—¹ ð—–ð—®ð˜‚ð˜€ð—²ð˜€: Brainstorm all possible reasons for the issue. Use methods like the 5 Whys technique to investigate each potential cause more deeply. Â 5. ð—©ð—®ð—¹ð—¶ð—±ð—®ð˜ð—²Â ð—›ð˜†ð—½ð—¼ð˜ð—µð—²ð˜€ð—²ð˜€: Test your hypotheses against the data to see if they are supported. If not, refine your hypotheses and test again. Â 6. ð—œð—ºð—½ð—¹ð—²ð—ºð—²ð—»ð˜ ð—¦ð—¼ð—¹ð˜‚ð˜ð—¶ð—¼ð—»ð˜€: Once youâ€™ve identified the root cause, support the business by showing possible solutions to address it. Monitor the results to ensure the issue is resolved. ð—” ð—¿ð—²ð—®ð—¹-ð˜„ð—¼ð—¿ð—¹ð—± ð—²ð˜…ð—®ð—ºð—½ð—¹ð—² ð—³ð—¿ð—¼ð—º ð—ºð˜† ð—½ð—®ð˜€ð˜: We notice an increase in customer lead time and hereâ€™s how we tackle it. 1. ð——ð—²ð—³ð—¶ð—»ð—² ð˜ð—µð—² ð—£ð—¿ð—¼ð—¯ð—¹ð—²ð—º: â€œCustomer lead time has increased by 20% in the last three months.â€ Â Â Â 2. ð—šð—®ð˜ð—µð—²ð—¿ ð——ð—®ð˜ð—®: We collected data on order processing, sales forecast deviation, and shipping times. Â Â Â 3. ð—”ð—»ð—®ð—¹ð˜†ð˜‡ð—² ð˜ð—µð—² ð——ð—®ð˜ð—®: We found that the actual sales were in line with the forecast, and shipping times had remained constant. However, order processing times had increased significantly. Â Â Â 4. ð—œð—±ð—²ð—»ð˜ð—¶ð—³ð˜† ð—£ð—¼ð˜ð—²ð—»ð˜ð—¶ð—®ð—¹ ð—–ð—®ð˜‚ð˜€ð—²ð˜€: We checked factors such as outages in warehouses, staffing issues due to high sickness rates, and process inefficiencies resulting from operating close to maximum capacity. Â Â Â 5. ð—©ð—®ð—¹ð—¶ð—±ð—®ð˜ð—² ð—›ð˜†ð—½ð—¼ð˜ð—µð—²ð˜€ð—²ð˜€: Data revealed that a spike in the sickness rate had reduced the available workforce. Â Â Â 6. ð—œð—ºð—½ð—¹ð—²ð—ºð—²ð—»ð˜ ð—¦ð—¼ð—¹ð˜‚ð˜ð—¶ð—¼ð—»ð˜€: We proposed to increase capacity buffers by 5% to 10% during the winter and hiring additional temporary workers to address the situation in the short term. Â Following this approach for your root-cause analysis, you will become a valued problem-solving partner for your stakeholders. How do you ensure youâ€™re addressing the root cause of an issue and not just the symptoms? ---------------- â™»ï¸ ð—¦ð—µð—®ð—¿ð—² if you find this post useful. âž• ð—™ð—¼ð—¹ð—¹ð—¼ð˜„ for more daily insights on how to grow your career in the data field. #dataanalytics #datascience #rootcauseanalysis #problemsolving #careergrowth

43 Comments

Nooralden Najdeah, CEMÂ®, â€CEAâ„¢

Head of Business Development , Renewable Energy Growth

46,749 followers 1mo

How to read a PVsyst report (even if you're not an engineer) A PVsyst report can decide the future of a 10M project â€” but only if you know how to read it. You donâ€™t need to be an engineer to understand whether a solar project makes sense. At the end of the day, a PV system has one job: produce reliable energy. Hereâ€™s how to quickly validate a PVsyst report like a pro: 1) Energy Yield (kWh/kWp) Is it realistic for the project location? Always benchmark against known regional values. If it looks too good to be true â€” it probably is. 2) DC/AC Ratio Is it within the typical range (1.1 â€“ 1.25)? Too high â†’ clipping losses. Too low â†’ underutilized inverter capacity. This ratio directly impacts both performance and ROI. 3) Losses Breakdown Where is the biggest loss coming from? (Soiling, temperature, mismatch, wiring, etc.) Compare with benchmarks and question anything unusual. ðŸ‘‰ This is where real optimization happens. 4) Solar Irradiation Data Check monthly radiation values. Do they align with trusted databases (Meteonorm, NASA, Solargis)? Bad input = misleading output. 5) Project Inputs Location, system size, tilt, orientation, componentsâ€¦ Small input errors = massive financial impact. Always validate assumptions before trusting results. Bottom line: A PVsyst report is not just a simulationâ€¦ Itâ€™s a financial decision tool. If you can read it properly, you can spot risks, optimize design, and protect millions. If you're in solar development, investment, or EPC â€” this skill is not optional anymore.

31 Comments

Pan Wu

Senior Data Science Manager at Meta

51,464 followers 2y

Cloud computing infrastructure costs represent a significant portion of expenditure for many tech companies, making it crucial to optimize efficiency to enhance the bottom line. This blog, written by the Data Team from HelloFresh, shares their journey toward optimizing their cloud computing services through a data-driven approach. The journey can be broken down into the following steps: -- Problem Identification: The team noticed a significant cost disparity, with one cluster incurring more than five times the expenses compared to the second-largest cost contributor. This discrepancy raised concerns about cost efficiency. -- In-Depth Analysis: The team delved deeper and pinpointed a specific service in Grafana (an operational dashboard) as the primary culprit. This service required frequent refreshes around the clock to support operational needs. Upon closer inspection, it became apparent that most of these queries were relatively small in size. -- Proposed Resolution: Recognizing the need to strike a balance between reducing warehouse size and minimizing the impact on business operations, the team developed a testing package in Python to simulate real-world scenarios to evaluate the business impact of varying warehouse sizes -- Outcome: Ultimately, insights suggested a clear action: downsizing the warehouse from "medium" to "small." This led to a 30% reduction in costs for the outlier warehouse, with minimal disruption to business operations. Quick Takeaway: In today's business landscape, decision-making often involves trade-offs.Â By embracing a data-driven approach, organizations can navigate these trade-offs with greater efficiency and efficacy, ultimately fostering improved business outcomes. #analytics #insights #datadriven #decisionmaking #datascience #infrastructure #optimization https://lnkd.in/gubswv8k

Data driven Snowflake optimisation at HelloFresh engineering.hellofresh.com

1 Comment

Sneha Vijaykumar

Data Scientist @ Takeda | Ex-Shell | Gen AI | LLM | RAG | AI Agents | Azure | NLP | AWS

25,249 followers 2y

ð„ð±ð©ð¥ð¨ð«ð¢ð§ð ðƒðšððš ð“ð¡ð«ð¨ð®ð ð¡ ð‚ð¡ðšð«ðð¬: ð€ ð†ð®ð¢ððž ðð¨ ð•ð¢ð¬ð®ðšð¥ð¢ð³ðšðð¢ð¨ð§ Data visualization is a powerful tool for ð®ð§ððžð«ð¬ððšð§ðð¢ð§ð and ðœð¨ð¦ð¦ð®ð§ð¢ðœðšðð¢ð§ð insights from data. Different types of charts serve different purposes. Let's explore some common types of charts and their applications: 1ï¸âƒ£ ððšð« ð‚ð¡ðšð«ð ðŸ“Š: ðð®ð«ð©ð¨ð¬ðž: Comparing categorical data or showing changes over time. ð’ð®ð¢ððšð›ð¥ðž ðŸð¨ð«: Comparing values of different categories, such as sales by product category or revenue by month. ð„ð±ðšð¦ð©ð¥ðž: Bar chart comparing monthly sales for different products. 2ï¸âƒ£ ð‹ð¢ð§ðž ð‚ð¡ðšð«ð ðŸ“ˆ: ðð®ð«ð©ð¨ð¬ðž: Showing trends and changes over time. ð’ð®ð¢ððšð›ð¥ðž ðŸð¨ð«: Visualizing continuous data over a period, such as stock prices over months or temperature variations over days. ð„ð±ðšð¦ð©ð¥ðž: Line chart showing the trend of website traffic over a year. 3ï¸âƒ£ ðð¢ðž ð‚ð¡ðšð«ð ðŸ¥§: ðð®ð«ð©ð¨ð¬ðž: Displaying parts of a whole and illustrating proportions. ð’ð®ð¢ððšð›ð¥ðž ðŸð¨ð«: Showing the composition of a categorical variable, such as market share by product or distribution of expenses by category. ð„ð±ðšð¦ð©ð¥ðž: Pie chart illustrating the distribution of budget allocation for different departments. 4ï¸âƒ£ ð‡ð¢ð¬ðð¨ð ð«ðšð¦ ðŸ“Š: ðð®ð«ð©ð¨ð¬ðž: Representing the distribution of continuous data. ð’ð®ð¢ððšð›ð¥ðž ðŸð¨ð«: Visualizing the frequency distribution of numerical data, such as age distribution of survey respondents or distribution of exam scores. ð„ð±ðšð¦ð©ð¥ðž: Histogram showing the distribution of heights among a sample population. 5ï¸âƒ£ ð’ðœðšðððžð« ðð¥ð¨ð ðŸ“ˆ: ðð®ð«ð©ð¨ð¬ðž: Examining relationships between two continuous variables. ð’ð®ð¢ððšð›ð¥ðž ðŸð¨ð«: Identifying patterns and correlations between variables, such as the relationship between temperature and ice cream sales or the correlation between advertising spending and sales revenue. ð„ð±ðšð¦ð©ð¥ðž: Scatter plot depicting the relationship between hours studied and exam scores for students. 6ï¸âƒ£ ðð¨ð± ðð¥ð¨ð ðŸ“Š: ðð®ð«ð©ð¨ð¬ðž: Summarizing the distribution of numerical data and identifying outliers. ð’ð®ð¢ððšð›ð¥ðž ðŸð¨ð«: Visualizing the spread and skewness of data, comparing distributions, and identifying anomalies. ð„ð±ðšð¦ð©ð¥ðž: Box plot comparing the distribution of salaries for different job roles within a company. 7ï¸âƒ£ ð‡ðžðšðð¦ðšð© ðŸ”¥: ðð®ð«ð©ð¨ð¬ðž: Displaying the magnitude of a variable in a matrix format. ð’ð®ð¢ððšð›ð¥ðž ðŸð¨ð«: Visualizing relationships and patterns in large datasets, such as correlation matrices or user engagement matrices. ð„ð±ðšð¦ð©ð¥ðž: Heatmap showing customer engagement levels across different demographics and products. Remember to choose the appropriate chart type based on the nature of your data and the insights you want to convey. #dataanalysis #visualization #charts #insights #analysis #eda Follow Sneha Vijaykumar for more... ðŸ˜Š

3 Comments

Data Analysis Techniques For Engineers

More in Data Analysis Techniques For Engineers

More Engineering topics

Explore categories