4.1 Activity Performance Analysis

4.1 Activity Performance Analysis#

Activity Result Distributions#

plot_test_results_updated(data_detail)

_images/a0a4cf8903c2f066496601b8026187818fffd4770070be78325602588ace3a48.png

Insights

The majority of students have achieved an ‘Optimal’ result on their best attempt, as indicated by the largest portion of the pie chart. However, there is still over 10% of activities that resulted in a pass or suboptimal outcome, suggesting that there is room for improvement and support.

In conclusion, the majority of students on the platform are performing well, with a significant proportion achieving optimal results. However, there is always room for improvement. By focusing on the segments that are not achieving optimal scores and understanding their needs, the platform can further enhance its overall effectiveness and student success rates.

Course-Wise Activity Result Distributions#

The Stacked 100% Bar Chart illustrates the proportional distribution of test results across different courses. Each course’s bar is normalized to sum up to 100%, making it easier to compare the distribution of test results across courses.

plot_stacked_bar_chart(data_detail)

_images/1dae36384b7db38f05df460a90ecef94451c15f077fde01982ba25515ade68e1.png

The majority of students across all courses have “Optimal” results. As the course level advances, there’s a slight increase in students who failed, indicating potential areas for additional support mechanisms. Further analysis might be beneficial to understand the specific challenges faced by students in the “Pass (Not Optimal)” category. For instance, are there specific topics or modules within the courses where students consistently struggle?

Find most difficult scripts and questions#

The find_difficult_interactive function is designed to help educators and course designers identify levels where students often face difficulties. By leveraging user interaction, this tool provides a flexible approach to hone in on specific areas of interest and concern.

How to Use: Course Dropdown: Select a course of interest from the dropdown menu.

Threshold Slider: Adjust the slider to set the minimum number of attempts for a level to be considered in the analysis. This helps in filtering out levels with very few attempts which might skew the failure rate.

Num Records Slider: Determine how many of the top challenging levels you wish to see. Slide to choose anywhere from 1 to 30 records.

Start Year and End Year Dropdowns: Choose a range of school years for which you’d like to analyze the data.

After setting your preferences, the function will display a dataframe that showcases the most challenging levels based on the specified criteria.

Note

this function removes levels with the name “survey” as they typically represent preliminary assessments or questionnaires that aren’t necessarily indicative of the actual course content difficulty.

Show code cell source Hide code cell source

def find_difficult(data, threshold=100, num_records=10):
    data = data[~data['level_name'].str.contains('survey', case=False, na=False)].copy()
    data['test_result_category'] = data['best_result'].apply(categorize_test_result)
    
    # Calculating failure rate and total attempts for each script and level combination
    agg_dict = {
        'test_result_category': lambda x: (x == 'Failed').sum() / len(x),
        'level_name': 'size'
    }
    
    script_fail_rates = data.groupby(['script_name', 'level_name']).agg(agg_dict)
    script_fail_rates.columns = ['fail_rate', 'total_attempts']
    script_fail_rates.reset_index(inplace=True)
    
    filtered_scripts = script_fail_rates[script_fail_rates['total_attempts'] > threshold]
    sorted_scripts = filtered_scripts.sort_values(by=['fail_rate', 'total_attempts'], ascending=False)
    
    return sorted_scripts.head(num_records)

def find_difficult_interactive(course_name='CS Fundamentals (K-5)', threshold=100, num_records=10, start_date='2015-16', end_date='2021-22'):
    # Reverse mapping from the friendly course name to its abbreviated code
    reverse_course_name_map = {v: k for k, v in course_name_map.items()}
    course_code = reverse_course_name_map[course_name]
    
    filtered_data = data_detail[(data_detail['course_name'] == course_code) & 
                                (data_detail['school_year'] >= start_date) & 
                                (data_detail['school_year'] <= end_date)]
    
    result = find_difficult(filtered_data, threshold, num_records)
    
    return result

course_dropdown = widgets.Dropdown(options=list(course_name_map.values()), description='Course:')
threshold_slider = widgets.IntSlider(min=0, max=500, step=10, value=100, description='Threshold:')
num_records_slider = widgets.IntSlider(min=1, max=30, step=1, value=10, description='Num Records:')
start_year_dropdown = widgets.Dropdown(options=sorted(data_detail['school_year'].unique().tolist()), description='Start Year:')
end_year_dropdown = widgets.Dropdown(options=sorted(data_detail['school_year'].unique().tolist()), description='End Year:')
# Set the maximum column width to a higher value
pd.set_option('display.max_colwidth', 180)

interact(find_difficult_interactive, course_name=course_dropdown, threshold=threshold_slider, num_records=num_records_slider, start_date=start_year_dropdown, end_date=end_year_dropdown);

import panel as pn
pn.extension()

def find_difficult_interactive(course_name='CS Fundamentals (K-5)', threshold=100, num_records=10, start_date='2015-16', end_date='2021-22'):
    # Reverse mapping from the friendly course name to its abbreviated code
    reverse_course_name_map = {v: k for k, v in course_name_map.items()}
    course_code = reverse_course_name_map[course_name]
    
    filtered_data = data_detail[(data_detail['course_name'] == course_code) & 
                                (data_detail['school_year'] >= start_date) & 
                                (data_detail['school_year'] <= end_date)]
    
    result = find_difficult(filtered_data, threshold, num_records)
    
    return result

course_dropdown = pn.widgets.Select(options=list(course_name_map.values()), name='Course')
threshold_slider = pn.widgets.IntSlider(start=0, end=500, step=10, value=100, name='Threshold')
num_records_slider = pn.widgets.IntSlider(start=1, end=30, step=1, value=10, name='Num Records')
start_year_dropdown = pn.widgets.Select(options=sorted(data_detail['school_year'].unique().tolist()), name='Start Year')
end_year_dropdown = pn.widgets.Select(options=sorted(data_detail['school_year'].unique().tolist()), name='End Year')

@pn.depends(course_name=course_dropdown.param.value,
            threshold=threshold_slider.param.value,
            num_records=num_records_slider.param.value,
            start_date=start_year_dropdown.param.value,
            end_date=end_year_dropdown.param.value)
def find_difficult_interactive_wrapper(course_name, threshold, num_records, start_date, end_date):
    result = find_difficult_interactive(course_name, threshold, num_records, start_date, end_date)
    return result

widgets = pn.Column(course_dropdown,
                    threshold_slider,
                    num_records_slider,
                    start_year_dropdown,
                    end_year_dropdown)

pn.Row(widgets, find_difficult_interactive_wrapper)

Identify Challenging Topics: The primary goal is to identify which levels or topics students are struggling with the most. High failure rates can indicate that the material is too challenging, or perhaps not explained clearly enough.
Course Design Improvements: By spotting these challenging areas, educators can focus on refining course content, providing additional resources, or even restructuring the lessons for better clarity.
Targeted Support: Tutors and educators can offer additional support or tutorials on these identified levels to assist students in overcoming these challenges.
Data-Driven Decisions: Instead of relying on anecdotal evidence or assumptions, this tool provides a data-driven approach to understand student performance and behavior.