Insert data from a SQL table into a Python pandas dataframe

Applies to: SQL Server Azure SQL Database Azure SQL Managed Instance SQL database in Microsoft Fabric

This article describes how to insert SQL data into a pandas dataframe using the mssql-python driver in Python. The rows and columns of data contained within the dataframe can be used for further data exploration.

Prerequisites

SQL Server for Windows or for Linux

Azure SQL Database

SQL database in Fabric

Azure SQL Managed Instance
SQL Server Management Studio for restoring the sample database to Azure SQL Managed Instance.

Azure Data Studio. To install, see Azure Data Studio.
Restore sample database to get sample data used in this article.

Verify restored database

You can verify that the restored database exists by querying the Person.CountryRegion table:

USE AdventureWorks;
SELECT * FROM Person.CountryRegion;

Install Python packages

Download and Install Azure Data Studio.

Install mssql-python

pip install mssql-python

apk add libtool krb5-libs krb5-dev
pip install mssql-python

apt-get install -y libltdl7 libkrb5-3 libgssapi-krb5-2
pip install mssql-python

dnf install -y libtool-ltdl krb5-libs
pip install mssql-python

zypper install -y libltdl7 libkrb5-3 libgssapi-krb5-2
pip install mssql-python

zypper install -y libltdl7
pip install mssql-python

brew install openssl
pip install mssql-python

Install other packages

Install the pandas package using Azure Data Studio:

In your Azure Data Studio notebook, select Manage Packages.
In the Manage Packages pane, select the Add new tab.
Enter pandas, select Search, then select Install.

Insert data

Use the following script to select data from Person.CountryRegion table and insert into a dataframe. Edit the connection string variables: 'server', 'database', 'username', and 'password' to connect to SQL.

To create a new notebook:

In Azure Data Studio, select File, select New Notebook.
In the notebook, select kernel Python3, select the +code.
Paste code in notebook, select Run All.

from mssql_python import connect
import pandas as pd
# Some other example server values are
# server = 'localhost\sqlexpress' # for a named instance
# server = 'myserver,port' # to specify an alternate port
server = 'servername' 
database = 'AdventureWorks' 
username = 'yourusername' 
password = 'yourpassword'
connection_string = f'Server={server};Database={database};UID={username};PWD={password};TrustServerCertificate=yes;'
conn = connect(connection_string)
# select 26 rows from SQL table to insert in dataframe.
query = "SELECT [CountryRegionCode], [Name] FROM Person.CountryRegion;"
df = pd.read_sql(query, conn)
print(df.head(26))
conn.close()

Output

The print command in the preceding script displays the rows of data from the pandas dataframe df.

CountryRegionCode                 Name
0                 AF          Afghanistan
1                 AL              Albania
2                 DZ              Algeria
3                 AS       American Samoa
4                 AD              Andorra
5                 AO               Angola
6                 AI             Anguilla
7                 AQ           Antarctica
8                 AG  Antigua and Barbuda
9                 AR            Argentina
10                AM              Armenia
11                AW                Aruba
12                AU            Australia
13                AT              Austria
14                AZ           Azerbaijan
15                BS         Bahamas, The
16                BH              Bahrain
17                BD           Bangladesh
18                BB             Barbados
19                BY              Belarus
20                BE              Belgium
21                BZ               Belize
22                BJ                Benin
23                BM              Bermuda
24                BT               Bhutan
25                BO              Bolivia

Next steps

Insert Python dataframe into SQL

Feedback

Was this page helpful?

Last updated on 2026-02-10