Hi 👋 I'm Nazih Kalo.-image

Hi 👋 I'm Nazih Kalo.

I'm a San Francisco based Data Scientist / Engineer, currently working at CyberConnect helping build decentralized social tooling for the future

In my free time time, you can catch me training in FitnessSF, reading about geopolitics, ML, & zero-knowledge applications, building Dune dashboards or exploring the beautiful Bay Area.

Hi 👋 I'm Nazih Kalo.-image
about-me-image

About me

From the wild world of crypto to the dynamic domain of data science, my passion for exploring the cutting edge has led me down an exciting path of discovery. I've worked across multiple layers of the data stack; as a data scientist, data engineer, product analyst and a bit of frontend development. I enjoy exploring the power of AI, macro & behavioral economics, and incentive models to uncover insights that can help the companies I work with thrive. Whether I'm building data pipelines or developing machine learning models, I always keep an eye on the latest trends in the world of crypto and web3. When I'm not busy tinkering with data, I'm usually immersing myself in educational YouTube channels on obscure topics or soaking up the vibrant Bay Area culture. With over half a decade of experience under my belt, I'm eager to take on new challenges and continue pushing the boundaries of what's possible with data.

  • Location:San Francisco, USA
  • Age:27
  • Nationality:French / Lebanese
  • Interests:Health, Macro Economics, ZKML
  • Study:University of California Berkeley / UChicago
  • Employment:CyberConnect

Education

MSc Data Science

MSc Data Science

University of Chicago•June 2020

Relevant Coursework: Advanced ML, Deep Learning, NLP, Big Data, Data Engineering

Awards: Facebook Hackathon 2019 – WebBuilder ChatBot - 1st Place Prize

B.A Economics

B.A Economics

University of California, Berkeley•December 2017

Certification: Certificate in Entrepreneurship & Technology | UC Berkeley, IEOR Department

Work

Head of Data

Head of Data

CyberConnect•May 2022 - Present
  1. Built all data pipelines, including indexing & decoding on/off-chain data from multiple chains using Airflow/Spark/dbt
  2. Developed nft & wallet recommendation engines, leveraging wallet trading/minting history to power follow/content suggestions
  3. Maintained all internal/external dashboards (incl. dune, internal), retention/growth insights, & analytics for partners on link3.to
Product Manager -> Data Engineer

Product Manager -> Data Engineer

Scale AI•September 2020 - May 2022
  1. Built & maintained data pipelines for the company's largest data extraction/scraping project, scraping 12M+ products from ~5000 ecommerce sites. Extracted data was parsed, categorized/normalized to fit into customers’ taxonomy.
  2. Developed internal Payout Optimizer to dynamically adjust payout functions to hit target rates; reduced pay variance by ~50% and led to $90k savings/month
  3. Deployed self-hosted data cataloging tool (Amundsen), improving data discovery across the company & significantly reducing analytics team onboarding time. Extracted & linked Snowflake, dbt, BigQuery, Tableau, & Salesforce metadata.
  4. Reduced LiDAR labeling time 34% through 1) optimizing ML pre-labels in product, 2) developing a new labeling pipeline (isolating 2D/3D labeling stages). New 2D labeling pipeline reduced computer spec requirement & increased labor pool.
Product Analyst

Product Analyst

Hive AI•June 2020 - September 2020
  1. Product lead for company’s new ML based text-moderation product; scope included dataset management, model training/deployment, post-training optimization, and monitoring/maintenance of SLAs
  2. Collaborated with the ML team to develop a human-assisted/in-the-loop model auditing system to identify model deficiencies and error patterns in production data. Improved model F-1 score by 24% with minimal additional training data.
Operations Internship

Operations Internship

Apple•January 2018 - December 2018
  1. Built data pipelines integrating internal & vendor data to reduce spend forecasts latency from 168 to 24hrs
  2. Managed data for $50M budget for iPhone XR dev builds and identified $1M fraudulent invoices through my analysis.

Skills

Relative self-rating 😇 of my skills by domain

Spoken languages
English
French
Arabic
Data Engineering
SQL
Python
DBT
Spark
Airflow/Dagster
Data Science / ML
NLP
Parametric Models
Computer Vision
Cloud / Platforms
AWS
GCP
Backend development
Node.js
Golang
Databases
Relational (Postgres/Mysql/TimescaleDB/Snowflake/DeltaLake)
Graph (Neo4j)
NoSQL (MongoDB)
Frontend development
GraphQL
React
Typescript

Working with Nazih made me rethink my two system theory of the mind. He is a true genius and a great person to work with.

-- Daniel Kahneman

Freedom granted only when it is known beforehand that its effects will be beneficial is not freedom.

-- Friedrich August von Hayek

Be less curious about people and more curious about ideas

-- Marie Curie

An approximate answer to the right problem is worth a good deal more than an exact answer to an approximate problem.

-- John Tukey

Get in touch.

Feel free to reach out with any of the mediums below. I am always open to new opportunities, collaborations or just to chat about data science, machine learning, or anything else really :)

Github
nazihkalo
© Copyright 2023 Nazih Kalo