British Airways Virtual Internship

Project Information

  • Category: Data Science / NLP
  • Client: British Airways (Forage)
  • Tech Stack: Python, Scikit-Learn, BeautifulSoup
  • Key Tasks: Web Scraping, Sentiment Analysis, Modeling
  • Notebook: View on Kaggle

Analyzing Customer Reviews & Predicting Buying Behavior

This project aimed to analyze customer satisfaction and predict customer booking behavior for British Airways. The workflow involved web scraping reviews, performing sentiment analysis, and building a predictive model to identify potential customers.

British Airways Certificate

Interactive Notebook Preview

Task 1: Customer Satisfaction Analysis

Situation & Task

The airline sought to understand customer sentiment using reviews from Skytrax. The goal was to scrape, clean, and analyze this unstructured data.

Action:
  • Web Scraping: Collected 3,607 reviews using Python and BeautifulSoup.
  • NLP Pipeline: Cleaned text, removed stopwords, and performed Sentiment Analysis.
  • Visualization: Created Word Clouds to highlight frequent topics like "Service", "Food", and "Seat Comfort".
Result:

The analysis revealed mixed sentiments. Key pain points identified included customer service delays and food quality, while flight comfort was generally praised.


Task 2: Predictive Modeling of Customer Bookings

Situation & Task

To compete with low-cost carriers, the airline needed to predict which customers are most likely to book flights based on historical data.

Action:
  • Preprocessing: Handled missing values and encoded categorical variables (e.g., Flight Day, Origin).
  • Modeling: Trained a Random Forest Classifier due to its robustness with tabular data.
  • Optimization: Fine-tuned hyperparameters to maximize predictive accuracy.
Result:
  • Model Accuracy: Achieved 84.68% on the test set.
  • Key Features: The most important predictors were Purchase Lead Time, Length of Stay, and Flight Hour.

Conclusion

This project successfully demonstrated how data science can drive strategic decisions. The sentiment analysis provided a roadmap for service improvement, while the predictive model offers a tool for targeted marketing, potentially increasing booking conversion rates.