Functions

Python programming

Functions

Reproducibility

This lesson explains how functions work in Python, from calling built-in functions with arguments to defining your own reusable functions to organize and simplify code.

Authors

Noor Sohail

Will Gammerdinger

Published

March 16, 2026

Keywords

Define functions, Arguments

Approximate time: 60 minutes

Learning objectives

In this lesson, we will:

Describe and utilize functions in Python
Modify default behavior of a function using arguments
Identify Python-specific sources of obtaining more information about functions
Demonstrate how to create user-defined functions in Python

Overview of lesson

Functions allow you to bundle a task so that you can use it repeatedly. In a more complex project, you might write a function to clean a dataset or run a standard calculation. Then, you can call that function on any multitude of datasets without having to rewrite the code. This keeps your code shorter, clearer and easier to fix. It is much easier to make avoidable mistakes when you are copy-pasting code! Functions help you avoid all that by letting you write the code once and to reuse it as many times as you want.

In this lesson, you will learn how to call built‑in functions and define your own so you can start building a small “toolbox” tailored to your own needs.

What are functions?

A key feature of Python is functions. Functions are “self contained” modules of code that accomplish a specific task. Functions usually receive some sort of data structure (value, list, dataframe etc.), process it and return a result.

We have actually been using several functions already!

print(): takes in a value and prints it to the console
len(): takes in a value and returns its length
type(): takes in a value and returns its type

A function is generally called by using the name of the function followed by parentheses:

# DO NOT RUN
# Example syntax for using a function
function_name(input)

The input(s) are called arguments, which can include:

The object (any data structure) on which the function carries out a task
Specifications that alter the way the function operates (e.g. options, arguments)

Functions can potentially take several arguments. If you don’t specify a required argument when calling the function, you will either receive an error or the function will fall back on using a default value.

The defaults represent standard values that the author of the function specified as being “good enough in standard cases”. An example would be the sep argument in the print() function, which has a default value of a single space. This means that if you use print() to print multiple values, they will be separated by a single space by default, as we saw previously.

Base Python functions

Since Python is frequently used for statistical computing, many of its base functions involve mathematical operations. For example, the sum() function receives a vector of numbers and returns its sum:

# Define a list of numbers
numbers = [2, 2.23, 0.77, 10, 40.2]

# Take the sum of the number list and assign the sum to total 
total = sum(numbers)

# Print total
print(total)

55.2

Another example is the round() function, which rounds a number to a specified number of decimal places:

# Assign a value to num
num = 3.14159

# Round num and assign the rounded number to round_num
rounded_num = round(num)

# Print rounded_num
print(rounded_num)

The output we receive is 3, which is num rounded to the single digit; this is the default behavior of the round() function. However, if we want to round to a specific number of decimal places, we would need to specify more information, likely in the form of an argument.

Seeking help with functions

If you want to know more about the arguments that a function can take in, Python’s built-in help() function is a great first step. help() provides documentation for any function you pass to it. For example, help(round) will give you information about the round() function, including its arguments and usage.

help(round)

Help on built-in function round in module builtins:

round(number, ndigits=None)
    Round a number to a given precision in decimal digits.

    The return value is an integer if ndigits is omitted or None.  Otherwise
    the return value has the same type as the number.  ndigits may be negative.

The result shows us that the round() function can take two arguments: number (the number to be rounded) and ndigits (the number of decimal places to round to). The ndigits argument has a default value of 0, which is why we got 3 when we ran the example without specifying it.

Exercise 1

Round the num variable to 2 decimal places using the round() function.
What happens when you specify a negative value for the ndigits argument in the round() function? Try it out and explain the output.
What happens if you apply the round() function to the numbers list? Try it out and explain the output.
Apply the sorted() function to the numbers list. What does it do? What are the arguments for the sorted() function? Use the help() function to find out.

Methods for data structures

When we were working with lists earlier, we made use of the list.append() function. This is an example of a method, which is a function that is associated with a specific data structure. In this case, the append() method is associated with lists and allows us to add an element to the end of the list.

Lists are not the only data structure that has innate methods. For example, strings have a method called upper(), which converts all characters in the string to uppercase:

# Create a string containing "hello world" and assign it to my_string
my_string = "hello world"

# Use the upper() method to make my_string uppercase
my_string.upper()

'HELLO WORLD'

Each data structure has its own set of methods that are designed to perform specific tasks related to that data structure. To find out what methods are available for a particular data structure, you can use the dir() function. For example, dir(list) will show you all the methods that are available for lists.

Double underscore methods

You may notice that there are many functions that follow the structure __method__ when we view the output for dir(). These are called dunder methods (short for “double underscore”) and they are special methods . They are typically used to define the behavior of data structures in certain scenarios.

We will not be covering dunder methods as they relate closely to classes which we are not covering in this workshop. You can ignore them for now.

If you scroll to the bottom of the output for dir(list), you will see that there are many methods available for lists:

# Show the methods available for lists
dir(list)

['__add__',
 '__class__',
 '__class_getitem__',
 '__contains__',
 '__delattr__',
 '__delitem__',
 '__dir__',
 '__doc__',
 '__eq__',
 '__format__',
 '__ge__',
 '__getattribute__',
 '__getitem__',
 '__getstate__',
 '__gt__',
 '__hash__',
 '__iadd__',
 '__imul__',
 '__init__',
 '__init_subclass__',
 '__iter__',
 '__le__',
 '__len__',
 '__lt__',
 '__mul__',
 '__ne__',
 '__new__',
 '__reduce__',
 '__reduce_ex__',
 '__repr__',
 '__reversed__',
 '__rmul__',
 '__setattr__',
 '__setitem__',
 '__sizeof__',
 '__str__',
 '__subclasshook__',
 'append',
 'clear',
 'copy',
 'count',
 'extend',
 'index',
 'insert',
 'pop',
 'remove',
 'reverse',
 'sort']

We can now see that there are many methods available for lists, including list.sort(), list.reverse(), etc. Let us try out the list.sort method on the numbers list.

# Apply the sort() method to numbers 
numbers.sort()

# Print out numbers
print(numbers)

[0.77, 2, 2.23, 10, 40.2]

In the exercise, we used the sorted() function to sort this list in both ascending and descending order. However, there is also a list.sort() method that can be used to sort a list in place. The difference between the two is that sorted() returns a new sorted list, while list.sort() modifies the original list.

Even though there are slight differences between these two functions, they both ultimately accomplish the same objective of sorting the list. This is a good example of how there can be multiple functions that accomplish the same task, but they may have different arguments or operate in slightly different ways.

Most importantly, this is another reminder that there is no single “right” way to do something in Python!

Additionally, these methods are not limited to numerical data. For example, we can use the sort() method to sort a list of strings in alphabetical order:

# List of strings
twice = ["Momo", "Sana", "Jihyo", "Mina", "Nayeon",
         "Chaeryeong", "Dahyun", "Jeongyeon", "Tzuyu"]

# Sort the list of strings in alphabetical order
twice.sort()
print(twice)

['Chaeryeong', 'Dahyun', 'Jeongyeon', 'Jihyo', 'Mina', 'Momo', 'Nayeon', 'Sana', 'Tzuyu']

User-defined functions

One of the great strengths of Python is the user’s ability to add functions. Sometimes there is a small task (or series of tasks) you need completed and you find yourself having to repeat it multiple times. In these types of situations, it can be helpful to create your own custom function. The structure of a function is given below:

# Example syntax for defining a function
# DO NOT RUN
# Define the function name and arguments
def function_name(argument1, argument2, ...):
    # Code block that defines what the function does
    ...
    # Return statement
    return output

When you define the function you will need to provide the list of arguments required (inputs and/or options to modify behavior of the function). The argument(s) can be any type of object (like a scalar, a matrix, a dataframe, a list, a logical, etc.), and it’s not necessary to define the type of object.

Then, indented under the function definition, you will write the code that carries out whatever task the function is designed to do. This is where the function is executing code on the arguments supplied.

Finally, you can return the value of the object from the function, which means to pass the value determined by the function into the global environment. A very important fact to understand about functions is that objects that are created within the function are only local to the environment of the function – they don’t exist outside of the function.

Creating a function

Let’s try creating a simple function for an example. This function will take a numeric value as input, and return the squared value.

# Create a function called square_it which takes the value x as input 
def square_it(x):
    # Squares the value of x and assigns it to the object called square
    square = x * x
    # Returns the value of square to the console
    return square

Now, we can use this function like any other base Python functions. We first type out the name of the function, add the parentheses and provide a numeric value x inside the parentheses:

# Run the square_it function on the number 5
square_it(5)

Pretty simple, right? In this case, the function only ran a single line of code, but you could have many lines of code to get obtain the final results that you need to return to the user.

Exercise 2

Write a function called multiply_it, which takes two inputs: a numeric value x and a numeric value y. The function will return the product of these two numeric values, which is x * y.

For example, multiply_it(x=4, y=6) will return output 24.

Exercise 3

Create a function, temp_conv(), to convert the temperature in Fahrenheit (input) to the temperature in Kelvin (output). Let’s perform a two-step calculation:

First, convert from Fahrenheit to Celsius, then
Then, convert from Celsius to Kelvin

# The formula for Celsius to Fahrenheit: 
temp_c = (temp_f - 32) * 5 / 9

# The formula for Celsius to Kelvin
temp_k = temp_c + 273.15

Test your function. If your input is 70, the result of temp_conv(70) should be 294.2611.

Now we want to round the temperature in Kelvin (output of temp_conv()) to a single decimal place. Use the round() function with the newly-created temp_conv() function to achieve this in one line of code.

If your input is 70, the output should now be 294.3.

Next Lesson >>

Back to Schedule

Reuse

CC-BY-4.0

--- title: "Functions" description: | This lesson explains how functions work in Python, from calling built-in functions with arguments to defining your own reusable functions to organize and simplify code. author: - Noor Sohail - Will Gammerdinger date: "2026-03-16" categories: - Python programming - Functions - Reproducibility keywords: - Define functions - Arguments license: "CC-BY-4.0" editor_options: markdown: wrap: 72 --- ```{python} #| label: load_libraries_data #| echo: false # Load libraries and data ``` Approximate time: 60 minutes ## Learning objectives In this lesson, we will: - Describe and utilize functions in Python - Modify default behavior of a function using arguments - Identify Python-specific sources of obtaining more information about functions - Demonstrate how to create user-defined functions in Python ## Overview of lesson Functions allow you to bundle a task so that you can use it repeatedly. In a more complex project, you might write a function to clean a dataset or run a standard calculation. Then, you can call that function on any multitude of datasets without having to rewrite the code. This keeps your code shorter, clearer and easier to fix. It is much easier to make avoidable mistakes when you are copy-pasting code! Functions help you avoid all that by letting you **write the code once and to reuse it as many times as you want.** In this lesson, you will learn how to call built‑in functions and define your own so you can start building a small “toolbox” tailored to your own needs. ## What are functions? A key feature of Python is functions. Functions are “self contained” modules of code that accomplish a specific task. Functions usually receive some sort of data structure (value, list, dataframe etc.), process it and return a result. We have actually been using several functions already! - `print()`: takes in a value and prints it to the console - `len()`: takes in a value and returns its length - `type()`: takes in a value and returns its type A function is generally called by using the name of the function followed by parentheses: ```{python} #| label: function_call #| eval: false # DO NOT RUN # Example syntax for using a function function_name(input) ``` The input(s) are called **arguments**, which can include: - The object (any data structure) on which the function carries out a task - Specifications that alter the way the function operates (e.g. options, arguments) Functions can potentially take several arguments. If you don’t specify a required argument when calling the function, you will either receive an error or the function will fall back on using a default value. The **defaults** represent standard values that the author of the function specified as being “good enough in standard cases”. An example would be the `sep` argument in the `print()` function, which has a default value of a single space. This means that if you use `print()` to print multiple values, they will be separated by a single space by default, as we saw previously. ### Base Python functions Since Python is frequently used for statistical computing, many of its base functions involve mathematical operations. For example, the `sum()` function receives a vector of numbers and returns its sum: ```{python} #| label: sum_example # Define a list of numbers numbers = [2, 2.23, 0.77, 10, 40.2] # Take the sum of the number list and assign the sum to total total = sum(numbers) # Print total print(total) ``` Another example is the `round()` function, which rounds a number to a specified number of decimal places: ```{python} #| label: round_example # Assign a value to num num = 3.14159 # Round num and assign the rounded number to round_num rounded_num = round(num) # Print rounded_num print(rounded_num) ``` The output we receive is `3`, which is num rounded to the single digit; this is the default behavior of the `round()` function. However, if we want to round to a specific number of decimal places, we would need to specify more information, likely in the form of an argument. ### Seeking help with functions If you want to know more about the arguments that a function can take in, **Python’s built-in `help()` function** is a great first step. `help()` provides documentation for any function you pass to it. For example, `help(round)` will give you information about the `round()` function, including its arguments and usage. ```{python} #| label: help_round help(round) ``` The result shows us that the `round()` function can take two arguments: `number` (the number to be rounded) and `ndigits` (the number of decimal places to round to). The `ndigits` argument has a default value of `0`, which is why we got `3` when we ran the example without specifying it. :::{.callout-tip} # [**Exercise 1**](06_functions-Answer_key.qmd#exercise-1) 1. Round the `num` variable to 2 decimal places using the `round()` function. 2. What happens when you specify a negative value for the `ndigits` argument in the `round()` function? Try it out and explain the output. 3. What happens if you apply the `round()` function to the `numbers` list? Try it out and explain the output. 4. Apply the `sorted()` function to the `numbers` list. What does it do? What are the arguments for the `sorted()` function? Use the `help()` function to find out. ::: ## Methods for data structures When we were working with lists earlier, we made use of the `list.append()` function. This is an example of a **method**, which is a function that is associated with a specific data structure. In this case, the `append()` method is associated with lists and allows us to add an element to the end of the list. Lists are not the only data structure that has innate methods. For example, strings have a method called `upper()`, which converts all characters in the string to uppercase: ```{python} #| label: string_upper # Create a string containing "hello world" and assign it to my_string my_string = "hello world" # Use the upper() method to make my_string uppercase my_string.upper() ``` Each data structure has its own set of methods that are designed to perform specific tasks related to that data structure. To find out what methods are available for a particular data structure, you can use the `dir()` function. For example, `dir(list)` will show you all the methods that are available for lists. ::: callout-note # Double underscore methods You may notice that there are many functions that follow the structure `__method__` when we view the output for `dir()`. These are called **dunder methods** (short for “double underscore”) and they are special methods . They are typically used to define the behavior of data structures in certain scenarios. We will not be covering dunder methods as they relate closely to `classes` which we are not covering in this workshop. You can ignore them for now. ::: If you scroll to the bottom of the output for `dir(list)`, you will see that there are many methods available for lists: ```{python} #| label: dir_list # Show the methods available for lists dir(list) ``` We can now see that there are many methods available for lists, including `list.sort()`, `list.reverse()`, etc. Let us try out the `list.sort` method on the `numbers` list. ```{python} #| label: list_sort # Apply the sort() method to numbers numbers.sort() # Print out numbers print(numbers) ``` In the exercise, we used the `sorted()` function to sort this list in both ascending and descending order. However, there is also a `list.sort()` method that can be used to sort a list in place. The difference between the two is that `sorted()` returns a new sorted list, while `list.sort()` modifies the original list. Even though there are slight differences between these two functions, they both ultimately accomplish the same objective of sorting the list. This is a good example of how there can be multiple functions that accomplish the same task, but they may have different arguments or operate in slightly different ways. **Most importantly, this is another reminder that there is no single “right” way to do something in Python!** Additionally, these methods are not limited to numerical data. For example, we can use the `sort()` method to sort a list of strings in alphabetical order: ```{python} #| label: string_sort # List of strings twice = ["Momo", "Sana", "Jihyo", "Mina", "Nayeon", "Chaeryeong", "Dahyun", "Jeongyeon", "Tzuyu"] # Sort the list of strings in alphabetical order twice.sort() print(twice) ``` ## User-defined functions One of the great strengths of Python is the user’s ability to add functions. Sometimes there is a small task (or series of tasks) you need completed and you find yourself having to repeat it multiple times. In these types of situations, it can be helpful to create your own custom function. The **structure of a function is given below**: ```{python} #| label: function_syntax #| eval: false # Example syntax for defining a function # DO NOT RUN # Define the function name and arguments def function_name(argument1, argument2, ...): # Code block that defines what the function does ... # Return statement return output ``` When you **define the function** you will need to provide the **list of arguments** required (inputs and/or options to modify behavior of the function). The argument(s) can be any type of object (like a scalar, a matrix, a dataframe, a list, a logical, etc.), and it’s not necessary to define the type of object. Then, indented under the function definition, you will write the code that carries out whatever task the function is designed to do. This is where the function is **executing code on the arguments supplied**. Finally, you can **`return` the value of the object from the function**, which means to pass the value determined by the function into the global environment. A very important fact to understand about functions is that objects that are created within the function are only local to the environment of the function – they don’t exist outside of the function. ### Creating a function Let’s try creating a simple function for an example. This function will take a numeric value as input, and return the squared value. ```{python} #| label: function_example # Create a function called square_it which takes the value x as input def square_it(x): # Squares the value of x and assigns it to the object called square square = x * x # Returns the value of square to the console return square ``` Now, we can use this function like any other base Python functions. We first type out the name of the function, add the parentheses and provide a numeric value `x` inside the parentheses: ```{python} #| label: function_usage # Run the square_it function on the number 5 square_it(5) ``` Pretty simple, right? In this case, the function only ran a single line of code, but you could have many lines of code to get obtain the final results that you need to `return` to the user. :::{.callout-tip} # [**Exercise 2**](06_functions-Answer_key.qmd#exercise-2) 1. Write a function called `multiply_it`, which takes two inputs: a numeric value `x` and a numeric value `y`. The function will return the product of these two numeric values, which is `x * y`. For example, `multiply_it(x=4, y=6)` will return output `24`. ::: :::{.callout-tip} # [**Exercise 3**](06_functions-Answer_key.qmd#exercise-3) 1. Create a function, `temp_conv()`, to convert the temperature in Fahrenheit (input) to the temperature in Kelvin (output). Let’s perform a two-step calculation: - First, convert from Fahrenheit to Celsius, then - Then, convert from Celsius to Kelvin ```{python} #| label: temp_conv #| eval: false # The formula for Celsius to Fahrenheit: temp_c = (temp_f - 32) * 5 / 9 # The formula for Celsius to Kelvin temp_k = temp_c + 273.15 ``` Test your function. If your input is 70, the result of `temp_conv(70)` should be 294.2611. 2. Now we want to round the temperature in Kelvin (output of `temp_conv()`) to a single decimal place. Use the `round()` function with the newly-created `temp_conv()` function to achieve this in one line of code. If your input is 70, the output should now be 294.3. ::: *** [Next Lesson >>](07_libraries.qmd) [Back to Schedule](../schedule/schedule.qmd)