Skip to main content
Big data analytics in environmental sustainability supply chain management

Research Article

Big data analytics in environmental sustainability supply chain management

Abstract

This paper examines the integration of Big Data Analytics (BDA) in Environmental Sustainability Supply Chain Management (ESSCM) within the aviation and automotive industries, given their data-intensive nature. Using a multidisciplinary approach, the research employs an analytical framework to assess companies' performance in specific areas, applying the RBV framework to understand how these sectors balance environmental responsibility with BDA implementations. The methodology includes a comprehensive literature review in environmental sustainability and BDA, along with case studies focused on prominent automotive and aviation industries. Incorporating text analytics, the dissertation explores the extent of BDA utilisation in enhancing environmental sustainability within intricate supply chain systems. The study is motivated by the increasing significance of sustainable practises in the modern business landscape, coupled with the growing influence of BDA in shaping decision-making processes. Recognizing the need for a comprehensive understanding of how BDA can contribute to ESSCM within the aviation and automotive industries, this research seeks to bridge the gap in existing literature and offer valuable insights for companies navigating the intersection of technology, environmental responsibility, and supply chain dynamics. By providing a unique approach to advancing the discourse on technology and environmental responsibility, the dissertation underscores its potential to guide these industries in analysing key areas of performance and contributes to the existing literature to provide insights for further in-depth exploration.

Keywords:

  • Keyword: Big Data Analytics (BDA)
  • Keyword: Environmental Sustainability
  • Keyword: Supply Chain Management (SCM)
  • Keyword: Text Mining
  • Keyword: Resource-Based View (RBV)
  • Keyword: Automotive Industry
  • Keyword: Airline Industry

How to Cite:

Sharma, A., (2026) “Big data analytics in environmental sustainability supply chain management”, Essex Student Journal 17(1). doi: https://doi.org/10.5526/esj.457

3e0ec7e4-f1ce-444c-9a68-61b51e4675e2

Acknowledgements

I express my gratitude towards my supervisor, Dr. Debabrata Ghosh, whose advice, unwavering support, and important insights have played a pivotal role in facilitating this research endeavour. The enormous impact of their expertise, patience, and encouragement on the progress of my paper cannot be overstated.

I extend my sincere appreciation for their mentorship, constructive feedback, and dedication that made this work possible. I would like to express my appreciation to Dr. Fay Aljibory for her valuable support, motivation, and contributions that have enhanced the scholarly pursuit of this research endeavour.

Secondly, I express gratitude to my family for their unwavering support and understanding throughout my academic endeavours. Their support has been my pillar of strength.

Finally, I would like to express my gratitude to those individuals who have made direct or indirect contributions to this paper

1. Introduction

1.1 Background

To fully comprehend the significance of this study, a comprehensive understanding of its subject matter and potential benefits is essential for organizations and individuals seeking valuable insights. The exploration of Big Data Analytics (BDA) and its applications in environmental sustainability within Supply Chain Management (SCM) establishes the foundation for addressing critical concerns. BDA, categorized into five Vs (value, volume, variety, veracity, and velocity), contributes to practical ideas, sustainable development goals, enhanced eco-effective performance, and competitive advantages (Tao et al., 2018). The implementation of two fundamental BDA viewpoints enhances operational excellence: the collection of significant data from internal and external contexts (Frank et al., 2019) and the integration of big data for real-time adjustments. In recent years, sustainable supply chain management (SSCM) has incorporated environmental, social, and economic objectives within a company's supply chain operations. As organizations aspire to achieve sustainability outcomes across environmental, social, and economic dimensions (Carter and Rogers, 2008; Seuring and Muller, 2008), this study focuses on SSCM's environmental dimension in the influential automotive and aviation industries, with the aim of elucidating challenges and unveiling discoveries.

The environment faces severe damage from natural and human-induced causes in the 20th century. Environmental degradation, driven by industrialization, economic expansion, and resource pursuit, gives rise to numerous issues threatening Earth's ecosystems. The combustion of fossil fuels, particularly oil, coal, and gas, contributes to climate change and greenhouse gas (GHG) emissions, resulting in global warming and its associated consequences (Das and Sharma, 2023). Furthermore, the transportation sector plays a substantial role in GHG emissions and air pollution; elevated emissions impact air quality, expedite climate change, and necessitate sustainable transportation alternatives (Parpieva et al., 2024). Therefore, this study focuses on two major industries, automotive and aviation, which are increasingly compelled to address environmental concerns and adopt sustainable practices. The discussion on these transportation industries encompasses an overview of the environmental causes and effects in each sector, emphasizing technological advancements through BDA. For this study, major factors contributing to environmental concerns and BDA implementation are examined using text analytics.

1.2 Research Rationale

The proficient application of big data analysis can augment supply chain sustainability by providing insights into the environmental impact of operations (Wang et al., 2020). Companies can identify areas for improvement and reduce their environmental footprint by analyzing data on energy use, carbon emissions, waste creation, and other environmental issues. This study connects significant industries, providing a comprehensive understanding of global transportation businesses by considering various aspects of environmental hazards and BDA uses. However, a limited number of scholarly papers explore the significance of BDA and its technologies for insightful conclusions. For this section of the study, secondary source data, derived from company databases and scholarly papers, constructs a conceptual framework and conclusion covering topics such as fuel usage, emissions, logistics, material sources, technological transformations, advancements, technical adaptations, and product lifecycle information. The study assesses BDA implementation using text mining and analytics to evaluate supply chain environmental sustainability. Organizations commit to minimizing their environmental impact and making informed decisions by collecting and analysing vast volumes of data. Integrating big data (BD) and analytics into the supply chain is crucial for firms to measure and assess their real-time environmental impact, promoting sustainability.

1.3 Research Aim, Objective, and Question

1.3.1 Research Aim

The primary aim of this paper is to conduct a thorough examination of the application of BDA in the context of ESSCM. The overarching objective of this study is to investigate how companies utilize BDA in the supply chains of the aerospace and automotive industries to enhance environmental sustainability. The ultimate focus is on understanding how these industries employ BDA to identify concerns related to environmental sustainability while integrating the Resource-Based View (RBV) framework comprehensively. While applying this theoretical framework, the study aims to provide a distinctive approach by introducing text analytics methods to construct case studies and derive insights through visualization techniques. These insights are intended to be valuable for both companies and scholarly researchers in further exploring operational aspects.

1.3.2 Research Objective

This research is based on the below-mentioned objectives:

  1. To examine the integration of BDA and ESSCM in general and in automotive and aerospace companies’ performance in the supply chain and environmental sustainability.

  2. To evaluate the industry’s effectiveness in using big data analysis or its techniques for the sustainable development of nature in the supply chain.

  3. To employ big data’s text mining analysis to build company-based case studies for examining a company's database (reports).

  4. To discuss the correlation between BDA and the environmental sustainability of the two industries through their relationship and time series analysis.

1.3.3 Research Question

As this paper focuses on BDA and ESSCM, considering the transportation industry, the main question that this study addresses is: To what extent are companies implementing BDA and addressing environmental concerns?

To enhance the study, this paper will also discuss:

RQ1 - To what extent is BDA being used and talked about globally and in the Scholarly world?

RQ2 - How is big data and BDA being helpful to improve the environmental sustainability of supply chains?

RQ3 - How could Text Analytics be useful to build case studies and evaluate results related to the company’s initiative towards environmental concerns?

RQ4 - What challenges, limitations, and concerns are associated with the utilization of BDA and text analytics?

The research question involves two key industries – Automotive and Aviation, which broadens the scope of the study and enables a thorough examination of the use of BDA in various supply chain situations.

1.4 Methodology

The utilisation of BDA has become significant in industries' efforts to reduce their carbon footprint and ensure environmental responsibility. Data analytics has the potential to transform SCM in the aviation and automotive industries, improving efficiency, environmental impact, and sustainability. BDA enables the processing and analysis of large data sets. The advancement of technology and the increasing complexity of supply chains have led to the accumulation of substantial data within businesses. This study gained a comprehensive understanding of BDA technology, while implementing text analytics, and derive insights through predictive analysis, time series analysis, correlation analysis, and data visualisation using R programming, python, and Tableau. These methods generated new opportunities and provide future researchers with a different perspective. The alignment of strategies between the objectives and research questions is crucial for understanding the role of BDA in promoting environmental sustainability in the automotive and aviation industries. This approach ensures a seamless progression in this study as they explore an intricate network of inquiries, goals, and practical implications. This scholarly endeavour concludes with a thorough and insightful investigation that improves intellectual engagement. The rest of the paper is organized as follows: Section 2 presents the Literature Review, Section 3 the Methodology of the study, Section 4 presents Finding and Analysis through numerical values and data visualisation, Section 5 is about Discussion based on section 4, and final Section 6 discusses Limitations of the paper.

2. Literature Review

BDA has gained considerable popularity due to its diverse applications, but a precise definition remains elusive. BDA refers to technologies and architectures designed to extract economic value from large amounts of data, achieved by rapidly capturing, analysing, and discovering data patterns (Mikalef et al., 2018). Companies employ innovative technological methods for enhanced information processing, enabling better decision-making and a competitive advantage (Hofman, 2018; Gunasekaran et al., 2017; Rialti et al., 2019). Industries like aviation and automotive face increasing pressure to address environmental concerns and embrace sustainable practices, prioritising environmental sustainability in their corporate strategies. Industrialists recognise that considering only cost, quality, and productivity is no longer sufficient for competitiveness (Sureeyatanapas and Yang, 2021). Therefore, this literature review investigates the intricate relationship between BDA and ESSCM within the aviation and automotive industries. It aims to clarify the implementation of BDA in ESSCM and critically evaluate whether BDA aligns with and promotes environmental sustainability in these industries through the use of resource-based view lenses while applying text analytics.

Figure 1.1: Literature Review Model

Source: Developed by the researcher

2.1 Definition of BDA

In adherence to Moore's Law, wherein the number of transistors on microchips doubles roughly every two years (Tardi, 2020), the Big Data (BD) paradigm has continually evolved. The term BD, initially used by John R. Mashey, predates the Fourth Industrial Revolution (Bag et al., 2020). BD has expanded from the 3Vs to as many as 8Vs, leading to confusion. Doug Laney introduced the foundational 3Vs—Volume, Velocity, and Variety (Lanely, 2001). Later, “Veracity” and “Value” were added (Motau & Boursier, 2016; Singh & Marr, 2015). Some researchers extended it further, but this study considers five: volume, velocity, variety, veracity, and value (Dahdouh, 2020).

A diagram illustrating the 5 V’s of Big Data. At the centre is a pentagon labelled ‘The 5 V’s of Big Data.’ Surrounding it are five coloured ovals: Volume (red): Terabytes, Records/Arch, Transactions, Tables, Files Velocity (purple): Batch, Real/near-time, Processes, Streams Value (blue): Statistical, Events, Correlations, Hypothetical Veracity (light blue): Trustworthiness, Authenticity, Origin/Reputation, Availability, Accountability Variety (green): Structured, Semi-Structured, Unstructured, Probabilistic.

Figure 2.1.1: 5 Characteristics of Big Data

Source: (Dahdouh, 2020)

BD is better defined by its analytical processes rather than its size (Agaba, 2014). BDA involves gathering, arranging, and analysing extensive, continuously generated data to uncover insights for decision-making. However, organisations face challenges such as unreliable sources, data privacy issues, and the need for both technical expertise and human skills (Wang et al., 2019; Hazen, 2014). Advanced analytics methods such as data mining, predictive analytics, and text mining play a crucial role (Bank, 2022). Large amounts of unstructured data complicate machine learning, requiring lengthy data processing (Mergel, 2019; Muller, 2018; Nagar, 2021; Nasiri, 2020; Zhou, 2018).

2.2 ESSCM

Environmental sustainability entails the judicious utilisation of Earth's resources to meet current demands while safeguarding future generations (Golicic, 2013). SCM strategically coordinates diverse facets within a firm and across the supply chain to enhance performance (Mentzer et al., 2001). Environmental supply chain practices encompass eco-conscious endeavours throughout a product’s lifecycle. Acquaye (2014) stresses the need for firms to adopt practices that are economically, environmentally, and socially sustainable (Schaltegger et al., 2008).

In sustainability, management strategies incorporate assessments of product life cycles and their impact on carbon reduction (Acquaye et al., 2011; Koh et al., 2013; Weber & Peters, 2009). Factors such as GHG emissions, CO2 levels, and waste management are critical. Ritchie’s (2020) sector-specific analysis emphasises the complexity of global emissions and the need for holistic decarbonization approaches.

A circular chart showing global greenhouse gas emissions by sector for 2016. The largest portion is Energy at 73.2%, which is broken into subsectors including energy use in industry, transport, and buildings. Agriculture, Forestry & Land Use accounts for 18.4%, Industry for 5.2%, and Waste for 3.2%. Surrounding rings detail subsectors such as road transport, aviation, residential buildings, commercial buildings, chemicals, cement, livestock and manure, crop burning, and wastewater. The chart is produced by Our World in Data.

Figure 2.2.1: Global Greenhouse Gas Emissions by Sector Chart

Source: Ritchie (2020)

The WEF (2023) highlights that the US and EU account for 37% of cumulative global GHG emissions since 1850. Intra-organizational models focus on ecological raw materials, waste reduction, and energy conservation (Rao, 2002). Zhu, Crotty, and Sarkis (2008) compared ESSCM practices in the automotive industry in China and the UK. Mollenkopf et al. (2010) classified energy efficiency initiatives and examined adoption barriers.

2.3 Integrated Literature on Correlation of BDA and ESSCM:

The adoption of digital tools enhances business capabilities. BDA enables effective management of vast data for competitive advantage (Pettit et al., 2019). It strengthens SCM by improving production efficiency and reducing delays between manufacturers and users. Sensors, trackers, cloud computing, and warehouse digitization facilitate real-time SCM. Using descriptive, predictive, and prescriptive analytics, BDA improves purchasing and supply management (Min, 2010; Tumpa et al., 2019). BDA impacts economic, social, ethical, and legal aspects (Shubbak, 2013).

Economically, BDA supports sustainable firm performance and broader growth by enabling more informed, efficient decision-making, while also introducing a digital divide where unequal access to analytical capabilities deepens existing economic inequalities (Ertz et al., 2024; Cuquet and Fensel, 2018; Custers et al., 2017).

Socially, BDA can enhance public services and citizen engagement but simultaneously raises concerns about discrimination and privacy infringements, as data-driven systems may reproduce or intensify underlying biases (Cuquet et al., 2017; Custers et al., 2017).

Ethically, debates focus on privacy, autonomy, accountability, and the need for transparent data practices to uphold public trust and support responsible data governance (Custers et al., 2017; Guan & Zhou, 2017).

Legally, BDA challenges existing regulatory frameworks on data protection, intellectual property, accountability, and human rights, underlining the urgency of updating legal standards to protect individuals in an increasingly data-driven environment (Guan and Zhou, 2017; Cuquet et al., 2017; Custers et al., 2017).

Robust models are vital for scenario forecasting, influencing environmental, social, and economic supply chain performance (Jeble & Dubey, 2017; Rao, 2002). Combining diverse data types enhances SCM performance (Zhong et al., 2016; Fernando et al., 2018). BDA allows environmentally informed decisions, though research lacks clarity on how it directly affects green supply chain integration (Song et al., 2017).

2.4 BDA and ESSCM in Automotive Industry

Data shapes vehicle development, manufacturing, services, and connected technologies (Luckow, 2015). The industry’s digitalization, driven by connected devices, amplifies demand for data analytics. Studies emphasise BDA’s role in cost reduction, consumer behaviour, corporate environmental management, and barrier identification (Zhang et al., 2017; Beier, 2022).

BDA improves SCM through real-time data collection and predictive analysis (Giacosa et al., 2021), enhancing logistics, reducing waste, and improving energy efficiency (Chung, 2021). Big data warehouses support integration of diverse data sources (Silva, 2021). ML and BDA aid in optimising logistics and transportation (Zhu, 2021). BDA balances efficiency and environmental goals (Singh, 2022; Giacosa, 2022).

Use cases include traffic prediction, autonomous driving, telematics data, and data lake architectures (Dremel, 2018). Structural challenges remain, but BDA contributes significantly to addressing environmental concerns in the automotive sector.

2.5 BDA and ESSCM in Airline Industry

The airline industry employs BDA to enhance ESSCM practices under environmental pressures (Savchenko, 2023). IoT applications improve operations, communication, and sustainability. Technologies like BD, additive manufacturing, and blockchain support greener SCM (Ramirez-Peña, 2020).

BDA improves fuel efficiency, reduces emissions, and streamlines maintenance, saving costs while supporting environmental responsibility (Aydin, 2019). Motivations include regulations, safety, efficiency, and sustainability (Burmester, 2018). It also drives innovation in biofuels and electric propulsion.

BD strengthens aviation safety and predictive models (Barak & Dahooei, 2018; Ni et al., 2019; Douglas, 2014). It also improves service levels (Cao et al., 2015; Kim & Shin, 2016; Park & Pan, 2018). However, barriers include lack of expertise, high costs, skills shortages, and privacy concerns (Hashem et al., 2015; Izzo, 2019; Mohamed, 2021). Despite challenges, technological advancement is crucial for addressing aviation’s environmental concerns.

2.6 Text Mining and Text Analytics

This section employs text mining and analytics to extract insights from textual data such as social media, corporate documents, and news articles (Mohamed, 2021). Text mining extracts valuable insights from large datasets and is particularly relevant as much data exists in text form (Sánchez, 2008).

Text mining is widely used in analysing literature on SCM (Mergel, 2019; Thomas, 2015; Park, 2018). Tao (2020) outlines steps such as cleaning, reduction, association analysis, classification, and clustering, supported by metadata for context. Visual representations, such as time series and correlation analysis, was an aid for this study (Tao et al., 2020).

A diagram comparing text preprocessing and text analysis steps. On the left, under ‘Text preprocessing,’ three blue boxes list processes: Normalization (lower casing, punctuation removal, stopword removal, rare word removal, spelling correction, stemming, lemmatization, etc.); Noise removal (HTML/XML removal, metadata removal, text file header removal, etc.); and Tokenization (unigram, bigram, etc.). On the right, under ‘Text analysis,’ three blue boxes describe methods: Word-level analysis (simple counting, term frequency, inverse document frequency, TF-IDF, etc.); **Word association analysis (context similarity, correlated occurrence); and Advanced text analysis (sentiment analysis, text classification, topic modelling). An arrow shows preprocessing feeding into analysis.”

Figure 2.6.1: Text Analysis Framework

Source: Tao, Yang, and Feng (2020)

2.7 Theoretical Lens

Various theories support research on BDA and sustainability, including the Triple Bottom Line, panopticon, and Natural Resource-Based View (Seele, 2016; Mageto, 2021; Khatib, 2022). However, the Resource-Based View (RBV) is most relevant, explaining how BDA contributes to competitive advantage (Wamba et al., 2017). RBV is widely applied in related research (Dubey & Singh, 2019; Zhang, 2022; Vitari, 2020).

This study combines RBV with text analytics to assess how firms exploit environmental resources. RBV suggests competitive advantage can stem from deriving insights through text analytics to improve decision-making and innovation (Lubis, 2022).

Conceptual diagram illustrating the integration of analytical methods through a Resource-Based Theory Lens. Central circle labelled "Resource Based Theory Lens" connects to three surrounding circles: "Big Data Analytics," "Text Analytics," and "ESSCM." Arrows from these three feed into a rectangular box labelled "Building Case Studies," "Data Utilisation," and "Obtaining Insights," showing how analytical approaches contribute to practical outcomes.

Figure 2.7.1: Conceptual Framework

Source: Developed by the researcher

2.8 Methodological Approaches of Previous Researchers and the Current Study

Studies employ a variety of methodologies to examine BDA and ESSCM. For example, Bag et al. (2020) applied a hybrid BWM and DEMATEL method to explore BDA-driven sustainable supply chains, while Dubey et al. (2017) examined big data and predictive analytics adoption through survey-based methods. Zhao et al. (2017) used empirical case studies to analyse BDA impacts in Chinese manufacturing. In another approach, Sharma and Panwar (2020) studied the Green IoT framework to reduce energy consumption and pollution in supply chains, while Yousefi and Tosarkani (2022) applied cognitive map modelling to evaluate risk management strategies.

Text mining has also been applied, such as Meyer et al. (2021) who investigated supply chain resilience during the pandemic using sentiment analysis. Machine learning and regression-based models have been used to forecast sustainability outcomes (Asha et al., 2022; Sathyan et al., 2020). DEA models and maturity frameworks have been adopted to measure airline supply chain efficiency (Rachman & Arviansysh, 2019; Hausladen & Schosser, 2020).

This section uniquely combines BDA, ESSCM, and text analytics within the RBV framework, positioning it as distinct from prior research.

Drawing from these arguments, the following hypotheses are proposed:

  • H0: No significant correlation exists between the use of BDA tools and environmental sustainability in the automotive and airline industries.

  • H1: A significant positive correlation exists between the use of BDA tools and environmental sustainability in the automotive and airline industries.

2.9 Summary

This study identified the implications of BDA for environmentally sustainable supply chain management, highlighting features, trends, and challenges. BDA assists firms in responding to increasing data demands and competitive pressures, though challenges remain in data storage, complexity, adaptability, privacy, and return on investment (Goyal et al., 2020). Cultural barriers, skills shortages, and high costs further hinder adoption, yet analytics implementation is becoming increasingly inevitable.

The literature reflects diverse approaches across industries, with applications ranging from logistics optimization to carbon footprint reduction. While numerous studies exist on BDA in automotive industries, research on BDA and ESSCM in the airline industry is relatively limited, representing a critical gap. Thus, this section of the study investigates the roles of BDA and ESSCM in both aviation and automotive industries, supported by RBV and text analytics, contributing to a deeper understanding of digital sustainability transitions.

3. Methodology

3.1 Research Philosophy, Approach, and Design

This study adopts a positivist research philosophy, which emphasises empirical observation and objective measurement. Positivism is appropriate because the research focuses on quantifiable data extracted from corporate disclosures rather than subjective interpretation (Phair & Warren, 2021).

A deductive approach was employed, moving from established theories such as the RBV towards hypothesis testing. The aim was not to generate new theory but to examine whether patterns in corporate reporting confirm or challenge existing claims about the role of BDA in sustainable supply chain management (Snieder & Larner, 2009).

The research design is quantitative and based on secondary data. Annual and sustainability reports (2019–2022) were analysed using text mining and keyword frequency methods. These documents were selected because they are systematically prepared, publicly available, and comparable across firms and industries.

The study applies a cross-sectional time horizon, focusing on a defined period of four years. While this does not capture long-term trajectories, it allows for comparison across firms and industries during a period of heightened environmental and digitalisation pressures.

Finally, the study’s techniques include text analytics, keyword frequency analysis, correlation testing, and data visualisation. These methods enable transparent and replicable analysis of how BDA and environmental sustainability are represented in corporate disclosures, providing a basis for evaluating their alignment under the RBV framework.

3.2 Company Selection Process

The selection of companies in the airline and automotive sectors is a crucial aspect of the methodology employed in this paper. The following criteria were used to select companies for this study:

  1. The company must be a global leader in the airline or automotive industry.

  2. The company must have a strong commitment to sustainability.

  3. The company must have implemented BDA in its operations.

  4. The company must have publicly available annual and sustainability reports.

This reflects the rigorous approach of the study in this research. While selecting the companies, a list of the top 10 companies in both the automotive and airline industries was considered. From this list, Delta Airlines, United Airlines, Emirates, Tesla, Toyota, and Volkswagen were selected for further analysis. These companies were chosen due to various principal factors that collectively enhance the comprehensiveness and importance of this research, which are discussed in sections 3.2.1 and 3.2.2 for both industries.

  1. Automotive Companies Selection

  1. Technological Innovation: Innovation supports sustainable development through redesigning products, processes, and structures (OECD, 2009). Tesla, Toyota, and Volkswagen were selected for their varied approaches to innovation. Tesla ranked as the most valuable automotive brand in 2023, Toyota is the world’s largest automaker by production volume, and Volkswagen is second (Statista, 2023).

  2. Sustainability Initiatives: Each company pursues distinct sustainability strategies. Toyota prioritises hybrid technology and targets net carbon neutrality by 2050 (Toyota Motor Corporation, 2023). Tesla emphasises electric vehicles and aims to accelerate the transition to sustainable energy (Tesla, 2023). Volkswagen focuses on emission reduction and electric mobility, with a target of net-zero carbon by 2050 (Volkswagen Group, 2023).

3.2.2 Airline Companies Selection

  1. Global Presence: Delta, United, and Emirates were chosen for their strong international operations. Delta is the largest airline by revenue, United ranks third, and Emirates leads in international passenger kilometres (Statista, 2023). Their inclusion provides a global view of BDA and ESSCM adoption.

  2. Operational Diversity: Delta and United are legacy carriers with extensive domestic and international networks, while Emirates specialises in long-haul routes. Their diversity enables a comparative analysis of BDA across different operational models. Sustainability goals include net-zero emissions by 2050 for Delta and United, and a 50% reduction in emissions by 2050 for Emirates (Delta Airlines, 2023; United Airlines, 2023; Emirates, 2023).

3.3 Data collection

This study relies on secondary data from annual and sustainability reports (2019–2022) of Volkswagen, Tesla, Toyota, Delta, United, and Emirates. These reports provide information on sustainability initiatives, CSR activities, digitalisation, and growth patterns. Although reports do not explicitly detail BDA, they provide consistent insights into sustainability and digitalisation practices.

The collected data undergoes preprocessing and refinement to ensure accuracy and alignment with research objectives. The study applies below quantitative techniques:

  1. Text Analytics: Automatic extraction of patterns and keyword frequencies from unstructured text (Anandarajan, Hill, and Nolan, 2018).

  2. Correlation Analysis: Measures the strength and direction of relationships between BDA and sustainability variables.

  3. Data Visualisation: Word clouds and bar charts illustrate connections between BDA and sustainability keywords.

This approach facilitates the evaluation of technological expansion and environmental responsibility in the automotive and airline industries.

Table 3.3.1: Variables Used in Analysis

Variable Type Sub-Themes Operationalisation Source
Independent Variables (BDA) Data & Analytics, Technology & Software, AI & ML, Cybersecurity, Robotics, etc. (10 sub-themes) Frequency counts of related keywords in company reports (2019–2022) Annual & sustainability reports
Dependent Variables (ESSCM) Sustainability & Conservation, Energy & Power, Supply Chain & Procurement, Waste & Recycling, etc. (13 sub-themes) Frequency counts of related keywords in company reports (2019–2022) Annual & sustainability reports
Control / Contextual Industry Type (Automotive vs Airline) Sector categorisation of companies Statista / Company reports

Source: Developed by the researcher.

A flowchart illustrating the process of text analytics. On the left, a grey rectangle labelled “Data” connects to “Keyword Refinement,” which is linked to two smaller boxes: “BDA Text Data” above and “ESSCM Text Data” below. The process flows into a central dark diamond labelled “Text Analytics.” From there, three paths branch out: Upward to “Visualisation” via “Descriptive Analysis.” Rightward to “Structuring Data.” Downward to “Frequency,” which leads to another “Visualisation” box.

Figure 3.1: Process Flow Chart Framework

Source: Developed by the researcher.

4. Findings and Analysis

This section presents the findings and analysis of the study, examining the relationship between BDA and ESSCM across two industries: automotive (Volkswagen, Tesla, Toyota) and airlines (Delta, Emirates, United). The objective is to demonstrate how companies report on digitalisation and sustainability practices, and whether patterns of BDA adoption correlate positively with environmental sustainability efforts.

The section proceeds in a layered manner. First, the text mining procedure is outlined, describing how unstructured corporate reports were refined into structured datasets of keyword frequencies. Second, Wordclouds provide a visual introduction to reporting emphasis. Third, heatmaps quantify keyword frequencies by sub-theme, followed by radar charts to compare industry strengths across domains. Fourth, stacked bar charts show how domain contributions shift over time, and trend lines examine year-on-year dynamics for each company. Finally, scatterplots test correlations across industries, confirming or rejecting the research hypothesis. Company-specific results are available in the appendices, while this section focuses on industry-level insights.

This sequence is guided by the Findings and Analysis Framework (Figure 4.1.1), which divides the section into three stages: refinement, analysis, and findings. The framework demonstrates how the study moves systematically from data preparation, to analysis, and finally to interpretation.

This layeed approach follows recommendations from Hastig and Sodhi (2020) and Tavana and Shaabani (2022), who emphasise the value of combining text mining with structured visualisation methods to analyse sustainability reporting.

A vertical flowchart showing three main stages: Refinement Process (oval shape) with arrows pointing to it from “Selection of Data” and “Defining Selected Data.” To the right, a description reads: “Discusses refinement details and process of refining the keywords.” Data Analysis (oval shape) below, connected downward from Refinement Process. Arrows point to it from “Analysis Technique,” “Summarising Numbers,” and “Visualizing Data.” To the right, a description reads: “Describes the procedure and numbers found out through the analysis – Company Specific and Industry Specific.” Findings (oval shape) at the bottom, connected downward from Data Analysis. Arrows point to it from “Interpretation,” “Comparative Discussion,” and “Summarising.” To the right, a description reads: “Discusses the concluding statements of the findings and data analysis.”

Figure 4.1: Findings and Analysis Framework

Source: Developed by the Researcher

4.1 Clarification on Frequency Counts

It must be noted that while annual and sustainability reports were the source of data, this study did not qualitatively interpret how each keyword was discussed. For example, when “sustainability” appeared, the analysis did not distinguish whether it referred to emissions, social responsibility, or renewable energy. Instead, the study relied strictly on frequency counts, transforming them into charts such as heatmaps and radar diagrams. This approach allowed statistical analysis while avoiding subjective interpretation.

4.2 Text Mining Procedure

Before presenting visualisations, it is essential to explain how the data was generated. The raw material comprised annual and sustainability reports from 2019–2022 for the six selected companies. These reports are extensive, often covering topics beyond technology and sustainability. The challenge was to extract only the keywords relevant to BDA and ESSCM sub-themes, ensuring comparability across firms and industries.

The text mining procedure followed three structured steps:

  1. Keyword Extraction and Counting

  • Annual and sustainability reports were merged, converted from PDF into text, and uploaded into Google Spreadsheets.

  • A sheet named “Data” contained the raw text, while a second sheet “Output” listed the defined keywords vertically.

  • The COUNTIF formula was applied to calculate the number of times each keyword appeared. This approach is consistent with Manjunath and Viswanathan (2019), who identify spreadsheets as a low-cost but effective means for text-based frequency analysis, and with Tao et al. (2020), who note the role of text analytics in structuring unstructured corporate data.”

  • This method was chosen after testing alternatives: initial attempts with R Studio and libraries such as pdftools were less precise, while Google Spreadsheets offered accuracy and transparency. This approach has been recommended by Manjunath and Viswanathan (2019) as an efficient, low-cost solution for text-based frequency analysis.

  1. Data Preparation and Structuring

  • Raw keyword counts were unstructured and therefore reformatted in Microsoft Excel.

  • Separate sheets were created for BDA and ESSCM, organised by sub-themes (10 for BDA and 13 for ESSCM).

  • Company-specific totals were computed, followed by aggregation at the industry level.

  1. Final Dataset for Analysis

  • The final dataset allowed keyword frequencies to be examined at three levels: company-specific, sector-specific, and industry-level.

  • This dataset provided the numerical foundation for all subsequent visualisations: Wordclouds, heatmaps, radar charts, stacked bar charts, trend lines, and correlation scatterplots.

This process ensured that the results are replicable and transparent, transforming unstructured text into structured frequency tables.

Refer to the table for the number of keywords considered.

Table 4.2.1: Numbers

Number of Themes 2
Total Number of Sub-themes 23
Number of BDA Sub-themes 10
Number of ESSCM Sub-themes 13
Number of BDA Keywords 63
Number of ESSCM Keywords 63

Table 4.2.2: BDA Underlying Keywords

Keywords
Theme: BDA
Sub-theme: Underlying Keywords:
Artificial Intelligence and Machine Learning ML (with spaces behind and ahead), Algorithm, Analytical, Analytics, Artificial, Machine, Machine Learning
Data and Analytics Analysis, Application, Big Data, Data, Data Management, Data Mining, Database, Data-Driven, dataset, dataSets, Predict, Prediction, Predictive, Processing, Information, Mining, Mathematic, Maths
Technology and Software AWS (with spaces behind and ahead), Blockchain, Cloud, Computer, Digital, Electronic, Technology, Tool, Virtual, Software, Java, Program, Programming
Cybersecurity and Security Cyber, Cybersecurity, Security, Securities, Surveillance
Business and Strategy Business Intelligence, Strategy, Innovation
Ethics and Privacy Ethical, Privacy
Robotics and Automation Robot, Robotics
Technical and Technique Technical, Techniques, Method
Integration and Connectivity Integration
Visualization and Data Volume Visualisation, Visual, Volume

Source: Developed by the researcher.

Table 4.2.3: ESSCM Underlying Keywords

Keywords
Theme: ESSCM
Sub-theme: Underlying Keywords:
Environmental Concerns Carbon, CO2, Combustion, Emission, Emissions, Greenhouse, Pollution
Sustainability and Conservation Conservation, Eco, Eco Friendly, Eco-Friendly, Environmental, Sustainable, Renewable, Renewal, Sustainability
Energy and Power Electric, Energy, Power plant, Renewable, Power
Nature and Environment Earth, Natural, Nature, Planet, Plant
Transport and Logistics Transport, Logistics
Social and Corporate Responsibility Corporate, Social, EPA, ESG
Supply Chain and Procurement Procurement, Supplier, Suppliers, Supply, Supply Chain, SCM, SSCM
Waste and Recycling Recycle, Waste, Wastage
Harm and Hazard Harm, Harmful, Hazard
Resource Management Material, Resources
Agriculture and Farming Farmer, Farmers
Gas and Nitrogen Gas, Nitrogen
Production and Process Production, Process

Source: Developed by the researcher.

The finalised frequency data is as below:

Excel sheet showing keyword frequency for Environmental Sustainability and Big Data Analytics across three time ranges.

Figure 4.1: Frequency Numbers by Sub-themes

4.3 Wordclouds: Initial Keyword Impressions

Wordclouds provide a first visual impression of which keywords dominate reporting.

  • Automotive Wordcloud: The largest terms are environment, emission, sustainability, energy, and data. This indicates that car manufacturers integrate sustainability with digitalisation.

Colourful word cloud on the automative industry. The largest words include “Environment,” “Emission,” “Emissions,” “Material,” “Energy,” “Sustainability,” and “Supplier.” Other words relate to environmental sustainability, big data, and technology, such as “Carbon,” “Process,” “Production,” “CO2,” “Renewable,” “Technology,” and “Supply Chain.”

Figure 4.2: Wordcloud of Automotive Industry – BDA and ESSCM

  • Airline Wordcloud: The largest terms are fuel, corporate, sustainability, with relatively smaller BDA-related terms. This shows that airlines prioritise environmental compliance and reporting, with less emphasis on advanced analytics.

A colourful word cloud featuring terms related to airline. The largest and most prominent word in the centre is “Material.” Other notable words include “Environment,” “Program,” “Sustainability,” “Emissions,” “Corporate,” “Fuel,” “Process,” “Supplier,” “Data,” and “Emission.” Words are displayed in various colors such as green, orange, blue, and purple, and vary in size to indicate importance or frequency. The overall theme emphasizes environmental strategies, energy, supply chain, technology, and carbon reduction.

Figure 4.3: Wordcloud of Airline – BDA and ESSCM

Interpretation of Wordclouds: Together, these Wordclouds highlight an important point: both industries recognise the importance of environmental and sustainability-related themes as central to their corporate strategies. However, BDA appears more embedded in the automotive industry than in airlines, albeit still secondary to the environmental agenda. In contrast, airlines present sustainability as a compliance-driven and risk-avoidance priority, where BDA plays a less visible role. These findings reflect broader industry narratives; as noted by Luckow (2015) and Giacosa et al. (2021), automotive firms increasingly embed data analytics into environmental reporting, while airlines focus more on regulatory and compliance-related sustainability disclosures (Aydin, 2019)

4.4 Heatmaps: Frequency Distributions

Heatmaps quantify how often keywords were mentioned across sub-themes and years.

  • Automotive BDA: Emphasis is strongest in Data & Analytics and Technology & Software. Toyota peaked at 1,257 mentions in 2020, Volkswagen at 1,117 in 2021. AI & ML remained consistently low (<100 annually). This pattern reflects wider post-COVID digitalisation trends in automotive supply chains (Chung, 2021; Silva, 2021).

  • Automotive ESSCM: Sustainability & Conservation dominated (Toyota reported 2,031 mentions in 2019), with Supply Chain & Procurement increasing sharply (VW: 444 in 2019 → 986 in 2022).

  • Airline BDA: Much lower intensity; Delta and United averaged ~800 mentions in Data & Analytics, Emirates fewer (~600).

  • Airline ESSCM: Strongest emphasis on Sustainability (~1,200 per year for Delta). Emirates grew in Energy & Power from 56 (2019) to 188 (2022). This stability reflects regulatory and operational pressures in aviation that enforce sustained sustainability practices (Ramirez-Peña et al., 2020; Savchenko, 2023).

  • Four heatmaps comparing automotive and airline BDA and ESSCM domains across years with color-coded values.

Figure 4.4: Four-panel heatmap (Auto BDA, Auto ESSCM, Airline BDA, Airline ESSCM)

These results confirm that the automotive sector is BDA-heavy, while airlines are ESSCM-heavy.

4.5 Radar Charts: Cross-Industry Profiles

Radar charts enable cross-industry comparison by mapping domain strengths.

  • BDA Radar: Automotive firms extend furthest in Data & Analytics and Technology & Software, while airlines lag significantly.

Radar chart titled “Industry Radar – BDA” comparing Automotive (blue) and Airline (orange) sectors across eight domains: Data and Analytics, Artificial Intelligence and Machine Learning, Visualization and Data Volume, Integration and Connectivity, Technical and Technique, Robotics and Automation, Ethics and Privacy, and Business and Strategy. Automotive shows higher values in Data and Analytics and Business and Strategy, while Airline has lower overall values.

Figure 4.5: Industry Radar – BDA

  • ESSCM Radar: Airlines extend further in Energy & Power and Sustainability & Conservation, while automotive firms contribute more in Supply Chain & Procurement.

Radar chart comparing Automotive and Airline ESSCM domains; Automotive scores higher in most areas like environmental concerns.

Figure 4.6: Industry Radar – ESSCM

Interpretation: These comparisons confirm that the two industries complement each other: one prioritises digital innovation, the other regulatory sustainability. This divergence aligns with Savchenko (2023), who highlights that aviation sustainability practices are largely driven by regulation, while automotive firms pursue innovation cycles enabled by BDA (Zhang et al., 2017)

4.6 Stacked Bar Charts: Contributions Over Time

Stacked bar charts show how domain contributions evolved between 2019 and 2022.

  • BDA Contribution: Automotive shows sharp post-2020 growth in Data & Analytics and Technology. Airlines remain stable and comparatively low.

Stacked bar chart showing BDA domain contributions for Automotive and Airline from 2019–2022.

Figure 4.6: Industry-level BDA Domain Contribution Over Time

  • ESSCM Contribution: Airlines consistently prioritise Sustainability. Automotive fluctuates: Toyota declines, Volkswagen’s Supply Chain more than doubles.

Stacked bar chart showing ESSCM domain contributions for Automotive and Airline from 2019–2022.

Figure 4.7: Industry-level ESSCM Domain Contribution Over Time

Synthesis of Visual Analysis

The numerical results highlight three central insights:

  1. Magnitude Differences → Automotive companies often exceed 1,000 mentions annually in BDA, while airlines remain in the 600–800 range.

  2. ESSCM Focus → Airlines report 1,000+ sustainability mentions annually, making them stronger in ESSCM despite lower BDA intensity.

  3. Temporal Patterns → Automotive investments surged after 2020 (e.g., Volkswagen’s Supply Chain doubled from 444 → 986), while airlines grew more modestly but steadily.

These contrasting trajectories are consistent with Zhu et al. (2008), who observed that sustainability and supply chain priorities differ across industries depending on regulatory context and market dynamics.

4.7 Trend Analysis: Yearly Dynamics

Line charts provide a clearer view of yearly movement.

  • Airlines: Delta shows stability in both BDA (~3,000–3,400) and ESSCM (~6,000–7,000). Emirates grows incrementally, while United surges dramatically in 2022 (~3,500 BDA; ~5,000 ESSCM). This surge reflects wider aviation pressures, including ICAO’s CORSIA framework and the EU Emissions Trading System, which enforce sustainability commitments (Burmester, 2018; Ritchie, 2020).

Two line charts showing BDA and ESSCM trends (2019–2022) for Delta, Emirates, and United.

Figure 4.8: Airline BDA/ESSCM Trends

  • Automotive: Volkswagen peaks in BDA (~3,600 in 2021) and ESSCM (~9,000 in 2021), then declines. Toyota remains steady in BDA but declines in ESSCM (12,000 → 8,000). Tesla is stable in BDA but declines in ESSCM (9,500 → 7,500). This reflects broader sectoral transitions where firms shift emphasis from traditional sustainability to EV and technology innovation (Beier, 2022; Singh, 2022).

  • Two line charts showing BDA and ESSCM trends (2019–2022) for Volkswagen, Toyota, and Tesla.

Figure 4.9: Automotive BDA/ESSCM Trends

This confirms airlines as steady and compliance-driven, while automotive firms are volatile and innovation-driven.

4.8 Industry-Level Scatterplot Findings

Scatterplots were created to compare average values of automotive vs airline practices across both BDA and ESSC domains.

  • BDA Comparison

The scatterplot demonstrated a very strong positive correlation (r ≈ 0.97) between the two industries in BDA adoption. This indicates that increases in BDA-related activities within the automotive industry are closely mirrored by corresponding increases in the airline industry.

Scatter plot comparing Automotive vs Airline BDA values by domain; strong positive correlation (0.97) across three domains.

Figure 4.10: Automotive BDA vs Airline BDA

  • For example, in Data & Analytics, Toyota and Volkswagen consistently reported over 1,000 references annually, while Delta and United averaged ~800 mentions.

  • Technology & Software also displayed a clear parallel, with automotive firms frequently exceeding 600–1,100 mentions, while airlines such as Emirates and United remained closer to 300–500 mentions.

  • AI & ML showed the lowest overall values, but trends were still aligned: automotive firms averaged 50–70 mentions per year, while airlines reported fewer than 50 annually.

Interpretation: The near-perfect correlation indicates that even though the automotive industry operates at a higher scale of BDA integration, the pattern of adoption is highly synchronised with the airline sector.

  • ESSC Comparison

For ESSC practices, the scatterplot revealed a moderate positive correlation (r ≈ 0.69). This suggests some alignment between the industries, but with greater variability.

Scatter plot comparing Automotive vs Airline ESSC values by domain.

Figure 4.11: Automotive ESSCM vs Airline ESSCM

  • In Sustainability and Conservation, Toyota reported 2,031 mentions in 2019 but dropped to 1,074 in 2022, showing volatility. By contrast, Delta reported a steadier ~1,200 mentions annually.

  • Energy & Power was a central focus for Tesla (1,185 mentions in 2019; 836 in 2022), while Delta maintained consistent values around 600 mentions annually.

  • Supply Chain and Procurement grew more sharply in automotive, with Volkswagen rising from 444 in 2019 to 986 in 2022, whereas airlines like Delta ranged between 804–936 mentions.

The weaker correlation highlights that while both industries are committed to sustainability, their trajectories differ: automotive firms fluctuate with strategic shifts and innovation cycles, whereas airlines show steadier, compliance-driven emphasis. These strong positive correlations support the argument that BDA acts as an enabling resource for sustainability, consistent with findings in Dubey and Singh (2019) and Wamba et al. (2017).

4.9 Conclusion

This section has demonstrated how automotive and airline industries report BDA and ESSCM practices. The text mining procedure converted unstructured corporate reports into structured keyword datasets, forming the foundation for all analysis. Wordclouds revealed industry narratives, heatmaps quantified frequency intensity, radar charts compared profiles, stacked bar charts showed contribution dynamics, line charts traced yearly trajectories, and scatterplots confirmed correlations.

The findings confirm distinct sectoral orientations: automotive companies are BDA-heavy, volatile, and innovation-driven, while airlines are ESSCM-heavy, stable, and compliance-oriented. These results support the RBV perspective that digital resources such as BDA can provide firms with a valuable capability that enhances environmental sustainability outcomes (Wamba et al., 2017; Dubey & Singh, 2019).

In the next section, we will discuss in detail how the findings align with the Hypothesis and answers research questions.

5. Discussion

This section interprets the findings presented in section 4 in relation to the research questions, the hypothesis, and the Resource-Based View (RBV). Whereas the findings section presented descriptive evidence, here the focus is on explaining what these results mean, how they align with or diverge from the literature, and how they contribute to theory and practice.

The discussion is structured around the four research questions (RQ1–RQ4) and the central hypothesis. Each subsection integrates the results with existing scholarship and RBV, before moving on to contributions, implications, and methodological limitations.

5.1 Hypothesis Testing

The hypothesis testing confirmed expectations:

H0 (no significant correlation) was rejected.

H1 (a significant positive correlation exists between BDA and ESSCM) was supported.

A near-perfect BDA correlation (r = 0.97) and moderate ESSCM correlation (r = 0.69) demonstrate a systematic association between digitalisation and sustainability practices. While prior studies (Wamba et al., 2017; Dubey & Singh, 2019) relied on surveys and qualitative case studies, this research contributes by using text mining of corporate reports to provide quantitative confirmation. This methodological extension strengthens the evidence base for the BDA–sustainability link, addressing calls for more empirical validation in the literature.

5.2 Research Question Discussion

RQ1 – Extent of BDA Usage

Findings reveal that automotive firms embed BDA strategically in domains such as Data & Analytics and Technology, while airlines adopt analytics mainly for compliance. This partly aligns with Luckow (2015) and Giacosa (2021), but challenges their framing of digitalisation as a universal enabler. This paper’s results highlight sectoral divergence: despite similar regulatory pressures, airlines lag in embedding BDA as a VRIN resource. This extension underscores that BDA adoption is not uniform but shaped by industry-specific drivers—innovation cycles in automotive versus compliance in aviation.

From an RBV perspective, this suggests that automotive firms are beginning to treat BDA as a valuable and rare resource that underpins innovation and competitive advantage. Airlines, however, have yet to exploit its full potential, limiting their ability to leverage analytics as a VRIN (Valuable, Rare, Inimitable, Non-substitutable) resource.

Answer: BDA is widely referenced across both industries, but automotive firms embed it more intensively and strategically, while airlines primarily adopt it for operational compliance.

RQ2 – How BDA Supports ESSCM

The results show strong alignment between Data & Analytics and Sustainability & Conservation, suggesting that higher BDA intensity translates into greater sustainability emphasis. This confirms Min (2010), Zhong et al. (2016), and Fernando et al. (2018), but crucially addresses the ambiguity raised by Song et al. (2017), who argued that the mechanisms were unclear. By offering statistical confirmation rather than qualitative inference, this study extends prior work and demonstrates how BDA enables measurable sustainability outcomes, not just efficiency gains.

From the RBV lens, this integration of BDA into sustainability practices demonstrates its role as a resource that is valuable and difficult to imitate. Firms such as Tesla, which showed particularly strong AI–Sustainability correlations, exemplify how proprietary analytical capabilities can generate sustainability outcomes that competitors may struggle to replicate.

Answer: BDA supports ESSCM by enhancing operational efficiency, reducing environmental impact, and strengthening sustainability reporting. Automotive firms lead in this integration, but airlines also show positive alignment, particularly in energy and emissions management.

RQ3 – Usefulness of Text Analytics

The text mining approach demonstrated that text analytics can successfully transform unstructured corporate reports into structured, comparable datasets. By applying keyword frequencies, the study was able to identify patterns in reporting emphasis and statistically test correlations across industries.

This supports the claims of Tao et al. (2020) and Mergel (2019), who argued that text mining provides scalable, replicable insights from large textual datasets. It also builds on Meyer et al. (2021), who applied sentiment analysis to supply chain resilience reporting. Unlike qualitative case studies, text analytics provides consistency and comparability across firms and industries.

Theoretically, text analytics itself can be conceptualised as a dynamic capability under RBV. It equips firms (and researchers) with the ability to continuously restructure and interpret information, extracting value from otherwise unmanageable volumes of text.

Answer: Text analytics is an effective tool for building case studies and evaluating corporate initiatives, offering replicable and scalable insights into BDA–ESSCM integration.

RQ4 – Challenges, Limitations, and Concerns

Challenges remain. Airlines’ weaker BDA adoption reflects structural barriers such as cost and legacy systems (Hashem et al., 2015; Mohamed, 2021), while volatility in Toyota’s ESSCM reporting suggests symbolic disclosure rather than substantive strategy, echoing Shubbak’s (2013) critique of ‘greenwashing.’ The keyword-frequency approach further underlines that emphasis may not equal action. From an RBV perspective, this shows that while BDA is valuable and rare, its uneven distribution across firms creates winners and laggards—Tesla and Toyota gain competitive edge, whereas Emirates and United risk falling behind. Leaders such as Tesla and Toyota exploit it effectively, while laggards like Emirates and United are disadvantaged by weaker capabilities.

Answer: Key challenges include cost, expertise, and reporting consistency. Methodologically, text mining captures emphasis but not context, necessitating complementary qualitative approaches.

5.3 Theoretical Integration

The findings align with RBV but also extend it. Traditionally, RBV applied to tangible or human resources; here, results demonstrate that digital capabilities such as BDA and text analytics also function as VRIN resources. Automotive firms leverage analytics as a rare and inimitable capability, while airlines illustrate the risks of underutilisation. Moreover, text mining itself can be conceptualised as a dynamic capability, enabling firms to continuously restructure and interpret sustainability information. This theoretical extension shows that RBV remains highly relevant in the digital era but must now incorporate intangible data-driven capabilities. Wordclouds and correlations revealed strategic differences: automotive companies frame BDA as transformative, while airlines frame sustainability as compliance.

This extends existing literature by showing that RBV applies not only to tangible assets but also to digital knowledge capabilities (Seele, 2016; Mageto, 2021).

5.4 Contributions to Literature

This section contributes in three ways. First, it confirms prior claims that BDA enhances supply chain efficiency and sustainability (Giacosa et al., 2021; Chung, 2021). Second, it extends existing research by applying text mining to corporate disclosures, providing quantitative confirmation of BDA–ESSCM alignment, where most prior studies relied on surveys or case studies. Third, it addresses a major gap by offering one of the first comparative analyses of BDA–ESSCM integration in airlines and automotive firms, demonstrating synchronisation in BDA adoption (r ≈ 0.97) but divergence in sustainability emphasis (r ≈ 0.69). This dual contribution—methodological and sectoral—positions the study as a novel step in bridging digitalisation and environmental sustainability research.

5.5 Practical Implications

The findings suggest clear actions for practice. Automotive firms should embed BDA directly into sustainability strategies, moving beyond efficiency to innovation-led environmental outcomes. Airlines must shift from compliance-driven reporting to operational integration of analytics to avoid lagging behind competitors. Regulators should standardise reporting metrics across industries to reduce symbolic disclosure and ensure comparability. Managers across both sectors must prioritise investment in analytics skills and infrastructure to fully exploit BDA as a VRIN resource. Importantly, risks accompany inaction: firms that underutilise BDA risk reputational greenwashing, while overreliance on symbolic disclosure undermines competitiveness. These implications stress that analytics is not only a technical tool but a strategic enabler of sustainable supply chain transformation.

5.6 Conclusion

This section has interpreted the results in light of RQ1–RQ4, the hypothesis, and RBV. The findings confirm that BDA adoption is positively correlated with ESSCM, but with distinct orientations: automotive firms lead in analytics integration, while airlines focus on sustainability compliance. The section also outlined theoretical and practical contributions, acknowledged methodological boundaries, and positioned the research within RBV.

The next section concludes the thesis, providing an overall synthesis, extended limitations, and directions for future research.

6. Conclusion

This section concludes the study by synthesising the key findings, reflecting on the study’s limitations, and outlining directions for future research. Whereas section 5 provided detailed discussion and interpretation, this section takes a broader view, drawing the study together and positioning it within the wider research landscape.

6.1 Overall Conclusion

This research set out to investigate the relationship between Big Data Analytics (BDA) and Environmental Sustainability in Supply Chain Management (ESSCM) within the automotive and airline industries. The study confirmed that BDA adoption and sustainability practices are positively aligned, though the form and intensity of this alignment differ across sectors.

The findings show that automotive firms are more focused on integrating analytics into innovation and operational processes, whereas airlines emphasise sustainability practices largely in response to compliance and regulatory pressures. Despite these differences in orientation, both industries demonstrate that digital capabilities and sustainability goals are increasingly interconnected rather than separate domains.

The study also demonstrated the value of text mining as a methodological approach for transforming unstructured corporate reports into structured, comparable insights. This methodological contribution highlights how emerging analytical tools can advance both academic research and managerial practice by enabling systematic monitoring of sustainability emphasis.

From a theoretical standpoint, the results reinforce the Resource-Based View (RBV), confirming that BDA can be understood as a VRIN (Valuable, Rare, Inimitable, Non-substitutable) resource. Furthermore, text analytics is positioned as a dynamic capability, enabling firms to adapt and respond to evolving sustainability demands through better use of data.

Overall, the thesis provides strong evidence that BDA is not only a technological enabler but also a strategic driver of environmental sustainability. The integration of analytics into sustainability reporting and practice enhances both compliance and competitiveness, marking a shift towards a more data-driven approach to sustainable supply chain management.

6.2 Limitations

This study’s scope was deliberately defined to balance comparability with depth. The following boundaries should be recognised:

  1. Data Source: The analysis relied on annual and sustainability reports, which ensured consistency across firms but excluded informal or internal practices not disclosed publicly. Corporate reporting may also reflect reputational motives.

  2. Keyword Frequency Approach: The use of keyword counts prioritised replicability and comparability across firms. While this allowed statistical testing, it did not capture the context or meaning of terms, such as whether “sustainability” referred to emissions, CSR, or energy management.

  3. Timeframe: The period of 2019–2022 was chosen to capture recent trends, including the effects of COVID-19. While timely, it represents a snapshot rather than a long-term trajectory.

  4. Sample Focus: Six leading firms were purposively selected to reflect industry leaders whose strategies often set wider benchmarks. This provides depth but limits generalisability to smaller companies or other geographies.

  5. Quantitative Emphasis: The study’s reliance on frequency counts and correlations ensured objectivity but excluded qualitative insights such as managerial intent or cultural framing.

These boundaries reflect deliberate methodological choices rather than weaknesses, but they highlight areas where future research could extend the work.

6.3 Future Research Directions

Building on these boundaries, several avenues for future investigation are recommended:

  1. Expanding Data Sources: Future studies could integrate other disclosures such as press releases, investor briefings, or stakeholder reports, alongside annual filings, to capture richer perspectives. Social media or news datasets may also offer insights into external perceptions of BDA–ESSCM practices.

  2. Beyond Keyword Counts: More advanced methods such as natural language processing (NLP), topic modelling, or sentiment analysis could capture not just the frequency but also the framing and tone of sustainability discourse.

  3. Longer Time Horizons: Extending the timeframe would enable the study of long-term sustainability transitions, including pre-COVID baselines and future trajectories. Forecasting models could test whether BDA adoption predicts subsequent sustainability performance.

  4. Broader Industry Coverage: Extending analysis to other resource-intensive industries such as shipping, logistics, or energy would test whether the observed BDA–ESSCM correlation is sector-specific or more universal.

  5. Causal Linkages: Future work could go beyond correlation to examine causality between BDA investment and measurable environmental outcomes such as carbon reduction or supply chain resilience.

  6. Theoretical Diversification: While RBV provided a strong lens, combining it with Institutional Theory or Stakeholder Theory could reveal how external pressures and societal expectations interact with internal capabilities.

  7. Mixed-Methods Approaches: Combining text mining with interviews or surveys of managers would enrich the findings, allowing both statistical comparability and qualitative interpretation of organisational intent.

6.4 Final Remarks

This paper demonstrated that Big Data Analytics is a critical enabler of sustainable supply chain management. By applying text mining to corporate reports, it showed that analytics and sustainability are not isolated but positively correlated, albeit with different emphases across industries. The findings contribute to literature, theory, and practice by demonstrating BDA as a strategic VRIN resource, highlighting text analytics as a dynamic capability, and offering evidence for regulators and practitioners.

While the study focused on two industries and a defined timeframe, it provides a foundation for future research into digital sustainability transitions. As global pressures on environmental performance intensify, the integration of BDA into ESSCM will be increasingly central to firms’ ability to compete, comply, and innovate.

References:

Abbasi, M. and Nilsson, F., 2012. Themes and challenges in making supply chains environmentally sustainable. Supply Chain Management: An International Journal, 17(5), pp.517-530.

Acquaye, A., Genovese, A., Barrett, J. and Lenny Koh, S.C., 2014. Benchmarking carbon emissions performance in supply chains. Supply Chain Management: An International Journal, 19(3), pp.306-321.

Acquaye, A.A., Duffy, A.P. and Basu, B., 2011. Stochastic hybrid embodied CO2-eq analysis: An application to the Irish apartment building sector. Energy and Buildings, 43(6), pp.1295-1303.

Anandarajan, M., Hill, C. and Nolan, T., 2019. Practical text analytics: Maximizing the Value of Text Data. Springer.

Aydın, U., Karadayı, M.A., Ülengin, F. and Ülengin, K.B., 2019. Enhanced performance assessment of airlines with integrated balanced scorecard, network-based superefficiency DEA and PCA methods. Multiple Criteria Decision Making, 25, pp.225-247.

Bag, S., Wood, L.C., Xu, L., Dhamija, P. and Kayikci, Y., 2020. Big data analytics as an operational excellence approach to enhance sustainable supply chain performance. Resources, Conservation and Recycling, 153, p.104559.

Barak, S. and Dahooei, J.H., 2018. A novel hybrid fuzzy DEA-Fuzzy MADM method for airlines safety evaluation. Journal of Air Transport Management, 73, pp.134-149.

Beier, G., Kiefer, J. and Knopf, J., 2022. Potentials of big data for corporate environmental management: A case study from the German automotive industry. Journal of Industrial Ecology, 26(1), pp.336-349.

Cao, Q., Lv, J. and Zhang, J., 2015. Productivity efficiency analysis of the airlines in China after deregulation. Journal of Air Transport Management, 42, pp.135-140.

Carter, C.R. and Rogers, D.S., 2008. A framework of sustainable supply chain management: moving toward new theory. International Journal of Physical Distribution & Logistics Management, 38(5), pp.360-387.

Cuquet, M. and Fensel, A. (2018). The societal impact of big data: A research roadmap for Europe. Technology in Society, 54, pp.74–86.

Custers, B., La Fors, K., Jozwiak, M., Esther, K., Bachlechner, D., Friedewald, M. and Aguzzi, S. (2017). Lists of Ethical, Legal, Societal and Economic Issues of Big Data Technologies. [online] papers.ssrn.com.

Ertz, M., Imen Latrous, Ahlem Dakhlaoui and Sun, S. (2024). The impact of Big Data Analytics on firm sustainable performance. Corporate Social Responsibility and Environmental Management.

Guan, P. and Zhou, W. (2017). Business Analytics Generated Data Brokerage: Law, Ethical and Social Issues. Communications in Computer and Information Science, volume 759(CCIS), pp.167–175.

Chung, S.H., 2021. Applications of smart technologies in logistics and transport: A review. Transportation Research Part E: Logistics and Transportation Review, 153, p.102455.

Dahdouh, K., Dakkak, A., Oughdir, L. and Ibriz, A., 2020. Improving online education using big data technologies. The Role of Technology in Education, p.15.

Douglas, C.C., 2014. An open framework for dynamic big-data-driven application systems (DBDDAS) development. Procedia Computer Science, 29, pp.1246-1255.

Dubey, R. and Singh, T., 2019. Big data analytics and environmental sustainability: Resource-based view insights. Journal of Cleaner Production, 231, pp.262-273.

Das, A. and Sharma, A. (2023). Climate change and the energy sector. Elsevier eBooks, (9780323908757), pp.1–6.

Parpieva, U., Najmutdinova, G., Khaydarov, I., Zakirova, G. and Karimova, N. (2024). Issues of environmental changes and transportation. BIO Web of Conferences, [online] 141(04015), p.04015.

Fernando, Y., Jabbour, C.J.C. and Wah, W.X., 2018. Pursuing green growth in technology firms through the connection between big data analytics and business model innovation. Journal of Cleaner Production, 188, pp.463-472.

Giacosa, E., Culasso, F. and Crocco, E., 2021. How big data analytics is changing the automotive industry. Technological Forecasting and Social Change, 162, p.120382.

Giacosa, E., Culasso, F. and Crocco, E., 2022. Customer agility in the modern automotive sector: how lead management shapes agile digital companies. Technological Forecasting and Social Change, 175, p.121362.

Golicic, S.L. and Smith, C.D., 2013. A meta-analysis of environmentally sustainable supply chain management practices and firm performance. Journal of Supply Chain Management, 49(2), pp.78-95.

Hashem, I.A.T., Yaqoob, I., Anuar, N.B., Mokhtar, S., Gani, A. and Ullah Khan, S., 2015. The rise of “big data” on cloud computing: Review and open research issues. Information Systems, 47, pp.98-115.

Hazen, B.T., Skipper, J.B., Ezell, J.D. and Boone, C.A., 2016. Big data and predictive analytics for supply chain sustainability: A theory-driven research agenda. Computers & Industrial Engineering, 101, pp.592-598.

Laney, D., 2001. META Delta. Application Delivery Strategies, 949(4), pp.1-4.

Luckow, A., Kennedy, K., Manhardt, F., Djerekarov, E., Vorster, B. and Apon, A., 2015. Automotive big data: Applications, workloads, and infrastructures. 2015 IEEE International Conference on Big Data (Big Data), pp.1201-1210.

Mageto, J., 2021. Natural resource-based view of the firm and environmental sustainability. Business Strategy and the Environment, 30(5), pp.2425-2439.

Mentzer, J.T., DeWitt, W., Keebler, J.S., Min, S., Nix, N.W., Smith, C.D. and Zacharia, Z.G., 2001. Defining supply chain management. Journal of Business Logistics, 22(2), pp.1-25.

Mergel, I., Edelmann, N. and Haug, N., 2019. Defining digital transformation: Results from expert interviews. Government Information Quarterly, 36(4), p.101385.

Meyer, A., Walter, W. and Seuring, S., 2021. The impact of the coronavirus pandemic on supply chains and their sustainability: a text mining approach. Frontiers in Sustainability, 2, p.1.

Min, H., 2010. Artificial intelligence in supply chain management: theory and applications. International Journal of Logistics: Research and Applications, 13(1), pp.13-39.

Mohamed, H.A.E.S. and Al-Azab, M.R., 2021. Big Data Analytics in airlines: Opportunities and challenges. Journal of Association of Arab Universities for Tourism and Hospitality, 21(4), pp.77-112.

Motau, B. and Boursier, C., 2016. Big Data: The five Vs. Journal of Information Technology Research, 9(3), pp.56-65.

OECD, 2009. Sustainable manufacturing and eco-innovation: Framework, practices, and measurement. Paris: OECD Publishing.

Phair, C. and Warren, K., 2021. Positivism in social science research. Research Methods Review, 8(2), pp.112-119.

Ramirez-Peña, M., Mayuet, P.F., Vazquez-Martinez, J.M. and Batista, M., 2020. Sustainability in the aerospace, naval, and automotive supply chain 4.0: A descriptive review. Materials, 13(24), p.5625.

Rao, P., 2002. Greening the supply chain: a new initiative in South East Asia. International Journal of Operations & Production Management, 22(6), pp.632-655.

Ritchie, H., 2020. Sector by sector: where do global greenhouse gas emissions come from? Our World in Data. Available at: https://ourworldindata.org/ghg-emissions-by-sector.

Savchenko, L., Ovdiienko, O., Boichenko, M. and Mostenska, T., 2023. Assessment of the relationship between CSR environmental goals implementation and financial/operational indicators of airlines. Scientific Journal of Silesian University of Technology. Series Transport, 119.

Schaltegger, S., Bennett, M., Burritt, R.L. and Jasch, C., 2008. Environmental management accounting (EMA) as a support for cleaner production. Springer Netherlands.

Seele, P., 2016. Digitization as a driver of environmental sustainability: The panopticon as a model for managing transparency. Business Strategy and the Environment, 25(7), pp.413-425.

Seuring, S. and Müller, M., 2008. From a literature review to a conceptual framework for sustainable supply chain management. Journal of Cleaner Production, 16(15), pp.1699-1710.

Shubbak, M., 2013. Self-driving cars: Legal, social, and ethical aspects. SSRN. Available at: https://ssrn.com/abstract=2931847.

Silva, N., Barros, J., Santos, M.Y., Costa, C., Cortez, P., Carvalho, M.S. and Goncalves, J.N., 2021. Advancing logistics 4.0 with the implementation of a big data warehouse: A demonstration case for the automotive industry. Electronics, 10(18), p.2221.

Snieder, R. and Larner, K., 2009. The art of being a scientist: A guide for graduate students and their mentors. Cambridge University Press.

Song, M., Cen, L., Zheng, Z., Fisher, R., Liang, X., Wang, Y. and Huisingh, D., 2017. How would big data support societal development and environmental sustainability? Journal of Cleaner Production, 142, pp.489-500.

Tao, D., Yang, P. and Feng, H., 2020. Utilization of text mining as a big data analysis tool for food science and nutrition. Comprehensive Reviews in Food Science and Food Safety, 19(2), pp.875-894.

Tavana, M., Shaabani, A., Raeesi Vanani, I. and Kumar Gangadhari, R., 2022. A review of digital transformation on supply chain process management using text mining. Processes, 10(5), p.842.

Toyota Motor Corporation, 2023. Sustainability Report 2023. Available at: https://global.toyota.

Volkswagen Group, 2023. Sustainability Report 2023. Available at: https://www.volkswagen-group.com/.

Wamba, S.F., Gunasekaran, A., Akter, S., Ren, S.J., Dubey, R. and Childe, S.J., 2017. Big data analytics and firm performance: Effects on profitability, revenue, and cost. Journal of Business Research, 70, pp.356-365.

Wang, L., Ding, J., Pan, L., Cao, D., Jiang, H. and Ding, X., 2019. Artificial intelligence facilitates drug design in the big data era. Chemometrics and Intelligent Laboratory Systems, 194, p.103850.

World Economic Forum, 2023. Global per capita emissions explained. Available at: https://www.weforum.org/agenda/2023/05/global-per-capita-emissions-explained-charts/.

Zhang, Y., Ren, S., Liu, Y. and Si, S., 2017. A big data analytics architecture for cleaner manufacturing and maintenance processes of complex products. Journal of Cleaner Production, 142, pp.626-641.

Zhong, R.Y., Xu, C., Chen, C. and Huang, G.Q., 2016. Big data analytics for physical internet-based intelligent manufacturing shop floors. International Journal of Production Research, 55(9), pp.2610-2621.

Zhu, Q., Crotty, J. and Sarkis, J., 2008. A cross-country empirical comparison of environmental supply chain management practices in the automotive industry. Asian Business & Management, 7(4), pp.467-488.

Appendices

Appendix A: Raw Data Structure:

  1. Keyword Counts:

Spreadsheet showing keyword counts; top keywords include “Digital” (123), “Data” (106), “Application” (30), and “Analysis” (29).

  1. Data Sheet obtained in Step 2 – Unstructured Data

Excel sheet showing keyword frequency for Volkswagen reports (2019–2022). Left table lists Big Data Analytics terms; right table lists Environmental Sustainability terms. Counts vary by year, with high values for “Data” and “Environment.”

  1. Automotive Industry-wise Data Structure:

Excel sheet showing keyword frequency for automotive data (Volkswagen, Toyota, Tesla) from 2019–2022. Left table lists Big Data Analytics terms; right table lists Environmental Sustainability terms. High counts include “Digital,” “Data,” “Carbon,” and “Environment.”

  1. Airline Industry-wise Data Structure:

Excel sheet showing keyword frequency for airline data (Delta, Emirates, United) from 2019–2022. Left table lists Big Data Analytics terms; right table lists Environmental Sustainability terms. High counts include “Data,” “Digital,” “Carbon,” and “Environment.”

  1. Data Filtering

=VLOOKUP($A10,'B:\ RESOURCES\[Considered Keywords.xlsx]Tesla'!$A$1:$E$66,COLUMN(D9),0)

=SUM(B12:B29)

Appendix B: Company-Level Visualisations

  1. WordCloud: Airline

install.packages("wordcloud")

library(wordcloud)

data <- data.frame(

Term = c(

"ML", "Algorithm", "Analysis", "Analytical", "Analytics",

"Application", "Artificial", "AWS", "Big Data", "Blockchain",

"Bot", "Business Intelligence", "Cloud", "Cluster", "Computer",

"Cyber", "Cybersecurity", "Data", "Data Management", "Data Mining",

"Database", "Data-Driven", "dataSet", "dataSets", "Digital",

"Driven", "Electronic", "Ethical", "Evaluat", "Information",

"Innovation", "Integration", "Intelligence", "Java", "Machine",

"Machine Learning", "Mathematic", "Maths", "Method", "Mining",

"Predict", "Prediction", "Predictive", "Privacy", "Processing",

"Program", "Programming", "Robot", "Robotics", "Science", "Securities",

"Security", "Software", "Strategy", "Surveillance", "Technical",

"Technique", "Techniques", "Technolog", "Technology", "Tool",

"Virtual", "Visualisation", "Volume", "Carbon", "Climate", "CO2",

"Combustion", "Conservation", "Consumption", "Corporate", "Earth",

"Eco", "EcoFriendly", "Eco-Friendly", "Economy", "Electric", "Emission",

"Emissions", "Energy", "Environment", "Environmental", "Sustainable",

"Transport", "EPA", "ESG", "Farmer", "Farmers", "Footprint", "Fuel",

"Gas", "Green", "Greenhouse", "Harm", "Harmful", "Hazard", "Logistics",

"Material", "Mitigation", "Natural", "Nature", "Nitrogen", "Planet",

"Plant", "Pollut", "Pollution", "Power plant", "Process", "Procurement",

"Production", "Protection", "Recycle", "Reduce", "Renewable", "Renewal",

"Resources", "Reuse", "SCM", "Social", "Solar", "SSCM", "Supplier",

"Suppliers", "Supply", "Supply Chain", "Wastage", "Sustainability", "Waste"

),

Freq = c(

0, 25, 654, 32, 49, 345, 35, 4, 8, 20, 0, 0, 69, 15, 166, 169, 98, 2451, 4, 0, 71, 12, 0, 0, 869, 198, 234, 98, 1158, 5668, 345, 168, 52, 7, 136, 10, 0, 0, 1053, 246, 227, 7, 10, 243, 149, 3653, 22, 16, 9, 153, 915, 1110, 580, 1363, 1, 308, 139, 125, 1372, 802, 360, 112, 0, 414,

1487, 1306, 1327, 93, 119, 877, 2620, 16, 1, 1, 14, 588, 1111, 3507, 3212, 1752, 3616, 2445, 1814, 1175, 88, 1384, 59, 56, 352, 2606, 485, 859, 257, 287, 55, 325, 290, 4881, 78, 292, 574, 44, 555, 720, 81, 41, 14, 2034, 355, 1042, 615, 220, 1356, 527, 93, 462, 101, 4, 1423, 68, 0, 1561, 1190, 1061, 683, 0, 3234, 1207

)

)

set.seed(123)

wordcloud(words = data$Term, freq = data$Freq, scale = c(3, 0.5), colors = brewer.pal(3, "Dark2"))

filtered_data <- data[data$Term != "Information", ]

set.seed(123)

par(plt = c(0.1, 1, 0.1, 1))

wordcloud(

words = filtered_data$Term,

freq = filtered_data$Freq,

scale = c(1.5, 0.5),

colors = brewer.pal(8, "Dark2"),

min.freq = 1, # Minimum frequency for inclusion

random.order = FALSE, # Disable random word order

rot.per = 0, # Disable word rotation

max.words = 1500 # Adjust the maximum number of words displayed

)

  1. WordCloud Automotive:

library(tm)

library(wordcloud)

data <- data.frame(

Term = c(

"ML", "Algorithm", "Analysis", "Analytical", "Analytics",

"Application", "Artificial", "AWS", "Big Data", "Blockchain",

"Bot", "Business Intelligence", "Cloud", "Cluster", "Computer",

"Cyber", "Cybersecurity", "Data", "Data Management", "Data Mining",

"Database", "Data-Driven", "dataSet", "dataSets", "Digital",

"Driven", "Electronic", "Ethical", "Evaluat", "Innovation",

"Integration", "Intelligence", "Java", "Machine",

"Machine Learning", "Mathematic", "Maths", "Method", "Mining",

"Predict", "Prediction", "Predictive", "Privacy", "Processing",

"Program", "Programming", "Robot", "Robotics", "Science", "Securities",

"Security", "Software", "Strategy", "Surveillance", "Technical",

"Technique", "Techniques", "Technolog", "Technology", "Tool",

"Virtual", "Visualisation", "Volume", "Carbon", "Climate", "CO2",

"Combustion", "Conservation", "Consumption", "Corporate", "Earth",

"Eco", "EcoFriendly", "Eco-Friendly", "Economy", "Electric", "Emission",

"Emissions", "Energy", "Environment", "Environmental", "Sustainable",

"Transport", "EPA", "ESG", "Farmer", "Farmers", "Footprint", "Fuel",

"Gas", "Green", "Greenhouse", "Harm", "Harmful", "Hazard", "Logistics",

"Material", "Mitigation", "Natural", "Nature", "Nitrogen", "Planet",

"Plant", "Pollut", "Pollution", "Power plant", "Process", "Procurement",

"Production", "Protection", "Recycle", "Reduce", "Renewable", "Renewal",

"Resources", "Reuse", "SCM", "Social", "Solar", "SSCM", "Supplier",

"Suppliers", "Supply", "Supply Chain", "Wastage", "Sustainability", "Waste"

),

Freq = c(

0, 79, 868, 32, 51, 630, 126, 12, 62, 67, 7, 0, 176, 32, 122, 205, 92, 4075, 13, 0, 136, 40, 0, 0, 1483, 443, 396, 131, 1089, 416, 270, 114, 14, 384, 28, 0, 0, 1245, 655, 166, 4, 13, 168, 289, 2762, 47, 130, 56, 157, 896, 2453, 1304, 2167, 6, 623, 102, 88, 2349, 998, 534, 167, 1, 1286,

2148, 1503, 3116, 296, 235, 798, 2799, 134, 28, 0, 30, 1072, 2941, 6059, 5360, 4555, 6708, 4980, 1895, 1315, 374, 338, 8, 4, 564, 1405, 1079, 1049, 389, 784, 159, 327, 511, 5468, 188, 474, 874, 83, 302, 1774, 382, 232, 68, 4115, 580, 4206, 1252, 520, 2088, 947, 159, 975, 211, 0, 1582, 3429, 0, 3320, 2460, 3037, 2097, 0, 3973, 1486

)

)

set.seed(123)

par(plt = c(0.1, 1, 0.1, 1))

wordcloud(

words = data$Term,

freq = data$Freq,

scale = c(1.5, 0.5), # Increase the scaling for larger words

colors = brewer.pal(20, "Dark2"),

min.freq = 1,

random.order = FALSE,

rot.per = 0,

max.words = 500 # You can adjust this value to display more words

)

  1. Trend Analysis Airline-

import pandas as pd

import matplotlib.pyplot as plt

# Create a DataFrame from your new data

data = {

'Name': ['Delta', 'Delta', 'Delta', 'Delta', 'Emirates', 'Emirates', 'Emirates', 'Emirates', 'United', 'United', 'United', 'United'],

'Year': [2019, 2020, 2021, 2022, 2019, 2020, 2021, 2022, 2019, 2020, 2021, 2022],

'BDA_Count': [2980, 3309, 2992, 2985, 1336, 1461, 1445, 1366, 1516, 1744, 1741, 3685],

'ESSCM_Count': [8712, 9187, 8904, 8488, 1782, 2267, 2323, 2650, 2372, 2640, 2928, 6447]

}

df = pd.DataFrame(data)

# Plot the trends for each company and category

fig, axes = plt.subplots(nrows=2, ncols=1, figsize=(10, 8))

for company, color in zip(df['Name'].unique(), ['blue', 'orange', 'green']):

company_data = df[df['Name'] == company]

axes[0].plot(company_data['Year'], company_data['BDA_Count'], label=company, marker='o', linestyle='-', color=color)

axes[1].plot(company_data['Year'], company_data['ESSCM_Count'], label=company, marker='o', linestyle='-', color=color)

axes[0].set_title('BDA Trends')

axes[1].set_title('ESSCM Trends')

axes[0].legend()

axes[1].legend()

plt.tight_layout()

plt.show()

  1. Trend Analysis Automotive-

import pandas as pd

import matplotlib.pyplot as plt

# Create a DataFrame from your data

data = {

'Company': ['Volkswagen', 'Volkswagen', 'Volkswagen', 'Volkswagen', 'Toyota', 'Toyota', 'Toyota', 'Toyota', 'Tesla', 'Tesla', 'Tesla', 'Tesla'],

'Year': [2019, 2020, 2021, 2022, 2019, 2020, 2021, 2022, 2019, 2020, 2021, 2022],

'BDA_Count': [2096, 2153, 3694, 2352, 3064, 3159, 2996, 2937, 2968, 3014, 2935, 2896],

'SupplyChain_Count': [4727, 5392, 8807, 7086, 12702, 9748, 10122, 7672, 9574, 8621, 7804, 7940]

}

df = pd.DataFrame(data)

# Plot the trends for each company and category

fig, axes = plt.subplots(nrows=2, ncols=1, figsize=(10, 8))

for company, color in zip(df['Company'].unique(), ['blue', 'orange', 'green']):

company_data = df[df['Company'] == company]

axes[0].plot(company_data['Year'], company_data['BDA_Count'], label=company, marker='o', linestyle='-', color=color)

axes[1].plot(company_data['Year'], company_data['SupplyChain_Count'], label=company, marker='o', linestyle='-', color=color)

axes[0].set_title('BDA Trends')

axes[1].set_title('ESSCM Trends')

axes[0].legend()

axes[1].legend()

plt.tight_layout()

plt.show()

  1. Company-wise Heatmaps –

Heatmap of United’s BDA domains (2019–2022); Data & Analytics dominates, Tech & Software rises sharply in 2022.Heatmap of Emirates’ ESSC domains (2019–2022); Sustainability & Conservation and Environmental Concerns dominate overall.Heatmap of Emirates’ BDA domains (2019–2022); Data & Analytics dominates, Tech & Software remains strong throughout.Heatmap of Delta’s ESSC domains (2019–2022); Environmental Concerns and Sustainability dominate across all years.Heatmap of Delta’s BDA domains (2019–2022); Data & Analytics dominates, Tech & Software and Cybersecurity remain strong.Heatmap of Toyota’s ESSC domains (2019–2022); Environmental Concerns and Sustainability dominate across all years.Heatmap of Toyota’s BDA domains (2019–2022); Data & Analytics dominates, Tech & Software and Cybersecurity remain strong.Heatmap of Tesla’s ESSC domains (2019–2022); Environmental Concerns, Energy & Power, and Supply Chain dominate overall.Heatmap of Tesla’s BDA domains (2019–2022); Data & Analytics dominates, Tech & Software and Cybersecurity remain strong.Heatmap of Volkswagen’s ESSC domains (2019–2022); Environmental Concerns and Sustainability dominate, peaking in 2021.Heatmap of Volkswagen’s BDA domains (2019–2022); Tech & Software and Data & Analytics peak sharply in 2021.Heatmap of United’s ESSC domains (2019–2022); Social & Corporate Responsibility and Sustainability dominate overall.

  1. Company-wise Stacked charts –

Stacked bar chart of United’s BDA domains (2019–2022); Data & Analytics and Tech & Software dominate, rising in 2022.Stacked bar chart of United’s BDA domains (2019–2022); Data & Analytics and Tech & Software dominate, rising in 2022.Stacked bar chart of domain values from 2019–2022; largest segments are Data & Analytics, Tech & Software, and Cybersecurity.Stacked bar chart of Toyota domain values from 2019–2022; Data & Analytics, Tech & Software, and Cybersecurity dominate.Stacked bar chart of Tesla domain values from 2019–2022; Data & Analytics, Tech & Software, and Cybersecurity are largest segments.Stacked bar chart of Volkswagen domain values from 2019–2022; Data & Analytics and Tech & Software dominate, peaking in 2021.

  1. Company-wise Individual Radar for Airline and Automotive Industry:

Radar chart comparing Emirates BDA (blue) and ESSC (orange) from 2019–2022; ESSC values are consistently higher, peaking in 2022.Radar chart comparing Delta BDA (blue) and ESSC (orange) from 2019–2022; ESSC values are much higher across all years, peaking in 2020.Radar chart comparing Toyota BDA (blue) and ESSC (orange) from 2019–2022; ESSC values are consistently higher, peaking in 2019.Radar chart comparing Tesla BDA (blue) and ESSC (orange) from 2019–2022; ESSC values are higher throughout, peaking in 2019.Radar chart comparing Volkswagen BDA (blue) and ESSC (orange) from 2019–2022; ESSC values are higher, peaking in 2021.Radar chart comparing United BDA (blue) and ESSC (orange) from 2019–2022; ESSC values are higher each year, peaking in 2022.

Appendix C: Data Sources – Company Reports (2019–2022)

  1. Automotive Firms

  1. Airline Firms

©Akanksha Sharma. This article is licensed under a Creative Commons Attribution 4.0 International Licence (CC BY).

Share

Author details

Downloads

Information

Competing Interests

  • no competing interests

Metrics

  • Views: 21
  • Downloads: 2

Citation

Download RIS Download BibTeX

File Checksums

(MD5)
  • HTML: 13a2f5645ce85b34136b937b9c6336a4
  • PDF: b44238a32119b164019ea20d01a7bc72
  • Word document: 2c17ef9a0eac26908f2d8bd0bcb49a8e

Table of Contents