Business Data Analysis Part 2 - Basic Statistics

: "From environment setup to practical business projects, a data science journey to find answers for e-commerce sales through statistics (38 lectures in total)" Go beyond simply learning how to use analysis tools and learn how to process log data from actual e-commerce fields using statistical processes. After building a solid foundation in core theories—from setting up the development environment to descriptive statistics, probability distributions, and hypothesis testing—you will move on to comparative analysis of actual user app dwell time and purchase conversion rates, and even quantify performance contribution through multiple linear regression analysis! Systematically master the core data scientist competencies of 'data-driven decision-making and analysis automation' through practical projects.

1 learners are taking this course

Level Basic

Course period Unlimited

Big Data
Big Data
Python
Python
Data Engineering
Data Engineering
Big Data
Big Data
Python
Python
Data Engineering
Data Engineering

What you will gain after the course

  • * Ability to build an optimal data analysis and statistical practice environment: The ability to flawlessly set up a Python development environment and essential libraries for efficient statistical analysis and data science.

  • * E-commerce domain-centric data design and outlier detection technology: The technology of defining the correct population and sample for analysis purposes within raw large-scale log data, and sophisticatedly detecting and refining outliers using statistical techniques.

  • * 'Test Automation' pipeline to maximize analysis efficiency in the field: Know-how to drastically increase practical productivity by using Python to automate the pre-data testing process, which is repeated every time before analysis after establishing null and alternative hypotheses.

  • * Statistical hypothesis testing skills as the foundation of A/B testing: Practical capabilities to scientifically verify the significance of core business metrics—such as differences in app session time per user or final purchase conversion rates—by correcting common misconceptions about P-values and minimizing errors.

  • * Quantifying business performance contribution through multiple linear regression analysis: A differentiated data science capability that goes beyond simple correlation analysis to infer causality and utilizes multiple linear regression models to quantify the influence of key variables contributing to sales growth into sophisticated numerical values.

  • * Business literacy to link statistical results to decision-making: The insight to translate complex statistical analysis results and confidence interval estimates into business language that management and collaborating departments can immediately understand and act upon, rather than leaving them confined to ambiguous formulas.

Course Introduction



: "From environment setup to practical business projects, a data science journey to find answers for e-commerce sales through statistics (Total 38 lectures)"

Beyond simply learning how to use analysis tools, you will learn how to process log data from actual e-commerce operations using statistical methods. After building a solid foundation in core theories—from setting up the development environment to descriptive statistics, probability distributions, and hypothesis testing—you will move on to practical projects: comparing user app dwell time and purchase conversion rates, and quantifying performance contribution through multiple linear regression analysis! You will systematically master the core competency of a data scientist—'data-driven decision-making and analysis automation capabilities'—through hands-on business projects.

 Key points unique to this lecture

* Descriptive statistics tailored to the e-commerce domain: This is not about simply memorizing formulas. Starting from how to define actual populations and samples in e-commerce log data, you will develop the insight to detect outliers in raw industry data and interpret data patterns from a business perspective.

* Mastering Hypothesis Testing, the Foundation of A/B Testing: Build a solid foundation of hypothesis testing principles, from common misconceptions and truths about P-values to Type I and Type II errors. Through this, you will perfectly master the most frequently used testing techniques in the field, such as testing differences in app dwell time by user and differences in final purchase conversion rates.

* Analysis Automation Practice to Maximize On-site Efficiency: To ensure you don't get exhausted by repetitive analysis processes, we conduct hands-on practice in building a pipeline that automatically performs the 'pre-data testing' process after establishing null and alternative hypotheses, elevating practical productivity to the next level.

* Quantifying Performance Through Multiple Linear Regression: Go beyond correlation analysis and delve deep into causal relationships. Develop data science capabilities to precisely quantify the key factors contributing to actual sales and performance among various business variables using multiple linear regression models.

* Cultivating Business Literacy in Statistics: We go beyond simple statistical software outputs or numerical calculations. You will acquire the leadership capability to accurately estimate confidence intervals based on analysis results and clearly translate them into business language that actual decision-makers can understand.

 

 


📱 Curriculum & Project Preview


✒ Section 1. Starting Data Analysis and Setting Up the Environment (Lectures 1 ~ 3)

Before diving into the actual analysis, you will understand the overall roadmap of data science and perfectly set up the Python development environment and essential libraries for efficient statistical analysis and data handling.

Key Learning: Course roadmap and overview, analysis environment setup, essential library installation and operation check


✒Section 2. Basic Statistical Theory and Data Types (Lectures 4–7)

Learn the overview of statistics, which serves as the backbone of data science, and the concepts of population and sample. In particular, you will learn how to precisely define the population and sample that fit actual analytical purposes within raw e-commerce log data.

Core Skills: Overview of Descriptive Statistics, Relationship between Population and Sample, Identifying Numerical/Categorical Data Types, Defining Samples based on E-commerce Logs

 

✒ Section 3.Understanding Descriptive Statistics and Data Distribution (Lessons 8 ~ 11)

Learn about central tendency and dispersion to summarize data characteristics and understand the shape of data distributions. Master the core logic for statistically detecting outliers that reduce analysis reliability within vast amounts of structured data.

Core Skills: Mean, Median, Mode (Central Tendency), Variance, Standard Deviation (Dispersion), Skewness, Kurtosis (Distribution Shape), Outlier Detection Techniques


✒ Section 4.Probability Theory and Central Limit Theorem (Lectures 12 ~ 15)

Establish the concept of probability, which serves as the foundation of inferential statistics, and understand the difference between discrete and continuous probability distributions. Clearly master the 'Central Limit Theorem,' a core theory of data science that defines the distribution of sample statistics.

Key Learning: Basic concepts of probability, discrete/continuous probability distributions, and the core milestone of data analysis, the 'Central Limit Theorem'


✒ Section 5. Principles of Statistical Estimation and Hypothesis Testing (Lectures 16 ~ 21)

You will learn point estimation and interval estimation (calculating confidence intervals and margins of error) to predict a population through samples. You will grasp the true meaning of P-value, which is most easily misused in practical data analysis, and establish the mathematical and business principles of hypothesis testing to minimize Type I and Type II errors.

Core Skills: Calculating confidence intervals and optimal sample sizes, guide to establishing null/alternative hypotheses, how to interpret P-values, Type I and Type II errors

Processing Logic


✒ Section 6. Key Testing Techniques and Correlation/Regression Analysis Practice (Lectures 22–28)

Learn the theories of core statistical testing techniques (t-test, ANOVA, Chi-square test, etc.) used according to the type and purpose of data, as well as correlation analysis and linear regression analysis to identify relationships between variables. Afterwards, you will conduct hands-on practice to implement these directly using Python data analysis tools.

Core Skills: Key statistical testing techniques by objective, correlation analysis algorithms, simple linear regression analysis, and hands-on practice using Python analysis tools


✒Section 7. [Project] E-commerce Business Scenarios and Analysis Automation (Lectures 29 ~ 34)

We are starting an e-commerce data analysis project that reflects real-world business concerns. We will load and refine raw, large-scale log data and build a 'pre-data verification automation' pipeline that validates the basic assumptions of the data before performing statistical analysis.

Key Learning: Understanding business background and loading log data, descriptive statistics-based data cleaning, probabilistic sampling and size estimation, and programming for pre-data validation process automation


✒Section 8.[Project] Hypothesis Testing and Quantifying Performance Contribution (Lectures 35 ~ 38)

Based on the previously refined actual user data, we statistically verify (A/B test) the 'difference in app dwell time' and 'difference in final purchase conversion rate.' Furthermore, through multiple linear regression analysis, we precisely quantify the key variables that contributed to actual sales and performance and estimate business confidence intervals.

Key Learning: Testing differences in app dwell time by user, testing differences in final purchase conversion rates, business interpretation of P-values and confidence intervals, and calculating performance contribution based on multiple linear regression models


✒ Introduction to the Knowledge Sharer

Jaeseong Yoon (Lead Data Analysis Instructor at Like Lion)


Development Experience
• Developed and launched SKT "Island Adventure" mobile content
• Developed and launched KT "Quiz Soccer" mobile content
• Launched SK "Mobile Real Estate Agent"
• Developed iPhone "Hanja-tong" app
• Developed iPhone "Health Training" app
• Developed KT/SK Japanese Namco "Tales of Commons" content
• Developed KT mini-games (Yageum Yageum Land Grab, Aladdin's Magic Lamp, Mystery Block Detective Agency, BUZZ and BUZZ)

Teaching Experience
I am a veteran instructor with 19 years of experience in teaching and development, catering to current employees of famous domestic companies and job seekers at institutions such as Samsung Multi Campus, Busan IT Industry Promotion Agency, Jeonju IT & Cultural Industry Promotion Agency, Incheon IT Industry Promotion Agency, Korea Radio Promotion Association, SK C&C, T Academy, Korea Institute for Robot Industry Advancement, Daejeon ETRI, Samsung Electronics, nica Education Center, Korea Productivity Center, Hanwha S&C, Samsung Electronics, LG Electronics, and SK C&C.

Teaching Fields
I teach in fields such as Java, Android, Frameworks, Databases, UML, iPhone, Big Data processing and analysis, Python, IoT, data analysis using R/Python, Deep Learning, Machine Learning AI, and Spark. I structure my lectures to explain concepts as easily as possible by incorporating my diverse experiences, and I create examples to help students apply them in practice. Since this is not an offline class, please use the Q&A for anything you don't understand.

Recommended for
these people

Who is this course right for?

  • - Aspiring data scientists who need a differentiated practical statistics portfolio: Those who want to move beyond cliché examples like Titanic survival prediction or Iris classification and complete a high-level inferential statistics portfolio that covers hypothesis formulation, probability sampling, A/B test verification, and regression modeling based on actual e-commerce log data.

  • - Students or self-taught learners who crave 'practical data literacy' over mathematical proofs: Those who want to learn how to apply complex theories encountered only in books—such as descriptive statistics, probability distributions, hypothesis testing, and P-values—to actual e-commerce user data, and develop the insight to precisely interpret statistical results into business language.

  • - Working data analysts who want to maximize the productivity of repetitive analytical tasks: Those who are tired of the data cleaning and preliminary statistical testing processes that repeat every time they set a hypothesis, and want to improve practical efficiency by building an 'automated preliminary data testing process' pipeline using Python.

  • - Those with a strong interest in the e-commerce domain and sophisticated A/B testing: Those who want to go beyond general structured data to statistically verify core e-commerce business metrics—such as differences in app dwell time per user and final purchase conversion rates—and find science-based answers for revenue growth.

  • - Marketers and service planners who want to integrate data analysis capabilities into practical decision-making: Working professionals who want to go beyond simple numerical aggregation or vague correlation analysis to create a powerful logical basis for persuading management by sophisticatedly quantifying key variables that contribute to actual sales and performance through multiple linear regression analysis.

Need to know before starting?

  • An understanding of basic Python syntax (variables, conditional statements, loops, functions, etc.) is required.

  • This course is optimized for those who have at least some exposure to basic grammar and want to learn how to apply it to practical data analysis and statistics, rather than for complete beginners learning Python for the very first time.

  • Practical applications of core libraries required for statistical analysis (Numpy, Pandas, Scipy, etc.) are covered step-by-step through hands-on practice within the lectures (Lessons 27–28 and the project section), so it is perfectly fine even if you have no prior experience with full-scale data analysis projects.

  • Instead of complex and tedious mathematical proofs of statistical formulas, this course focuses on the practical process of handling e-commerce business data and correctly interpreting statistical results, so you can successfully complete it as long as you have the passion.

Hello
This is softcampus

16,238

Learners

833

Reviews

595

Answers

4.7

Rating

49

Courses

Soft Campus is an education center that supports online and offline lectures and content sales.

For inquiries regarding AI-related fields and the purchase of various lectures and content, please contact us at raputa@nate.com or by phone at 02-553-0824.

Thank you.

More

Curriculum

All

38 lectures ∙ (15hr 56min)

Published: 
Last updated: 

Reviews

Not enough reviews.
Please write a valuable review that helps everyone!

softcampus's other courses

Check out other courses by the instructor!

Similar courses

Explore other courses in the same field!

Limited time deal

$29.70

50%

$59.40