Inspiration
We were inspired by the scale and real-world impact of Kimberly-Clark’s challenge: selecting optimal mill locations using only public data. It felt like solving a multi-million-dollar logistics puzzle — with data as our only tool.
What it does
Our model identifies the Top 20 high-demand metro areas in the U.S. and recommends the best 3 mill locations that maximize demand coverage while minimizing logistics costs.
How we built it
- Collected data from Census, ZIP DB, BLS, and ACS
- Modeled demand per ZIP based on households, income, and size
- Clustered ZIPs into segments (Premium, Mass Market, Low Income)
- Focused on Mass Market ZIPs in top cities
- Scored mill locations using a hybrid of cost, distance, and zoning factors
Challenges we ran into
- Merging messy datasets from different sources
- Defining what “optimal” means for site selection
- Creating visuals that clearly communicate complex analysis
- Balancing accuracy with performance in large-scale ZIP-level processing
Accomplishments that we're proud of
- Built a complete pipeline from raw data to strategic recommendation
- Created a clear visual framework that simplifies complex decisions
- Recommended 3 sites covering 87% of high-demand ZIPs
What we learned
- How to translate demographic data into real business strategy
- Importance of clustering for market segmentation
- How to evaluate trade-offs between logistics cost and demand reach
- Data storytelling is just as important as the model itself
What's next for Axis Of Insight
We aim to enhance our model with real-time logistics data and broader infrastructure insights. Our goal is to develop a scalable site selection toolkit that businesses can use for strategic expansion, combining data science, geospatial analysis, and intuitive visualizations to drive smarter, location-based decisions.
Built With
- bls
- built-with:-python
- excel
- geopandas
- jupyter
- matplotlib
- numpy
- pandas
- powerpoint
- scikit-learn
- seaborn
- shapely
Log in or sign up for Devpost to join the conversation.