Skip to main content
U.S. flag

An official website of the United States government

Official websites use .gov
A .gov website belongs to an official government organization in the United States.

Secure .gov websites use HTTPS
A lock ( ) or https:// means you’ve safely connected to the .gov website. Share sensitive information only on official, secure websites.

Exercise 3: Finding a potential ligand or inhibitor for your protein

A button to download the slides in PDF format for this section

Background

In life sciences research, chemical compounds that interact with biomolecules are critical for understanding biological processes and for developing biochemical reagents and/or therapeutic drugs. 

For example:
  • Exposing a cell to an inhibitor for an enzyme can block the biological pathway and provide a clue to the enzyme's importance in the cell's function.
  • Using fluorescence microscopy, a flurescently-tagged ligand for an intracellular receptor can indicate which portion of the cell the protein resides.
  • Performing biological activity assays with various structural analogs for an enzyme's normal substrate can identify a new strong-binding reagant to serve as either an inhibitor or probe. 
  • Particularly strong enzyme inhibitors that also have bioavailable properties may be able to serve as targeted therapeutic medications in disorders caused by the enzyme's dysfunction.

PubChem contains not only chemical information, but biological assay studies with data as well. In 2004, the PubChem project was originally developed to serve as a data warehouse for the NIH Molecular Libraries Program. This initiative was created for large-scale screening of small molecules that could be optimized as chemical probes to study the functions of genes, cells, and biochemical pathways in health and disease. It was also intended to help validate new drug targets, which could then move into the drug-development pipeline. 

The PubChem BioAssay database was created to house information about in vivo, in vitro and biochemical bioactivity experiments measuring the impact of small-molecules and inhibitory-RNAs (RNAi) along with the screening data from submitting organizations.  BioAssay contains a collection of bioactivity and toxicity data that has greatly supported research in fields such as medicinal chemistry, drug discovery, pharmaceutical genomics and informatics research. 

Searching for bioassay-related information in PubChem

While you can certainly type in key words such as gene symbols or experimental method types into the search box to find related bioassay information, using standard Entrez Search Syntax allows you to really customize and retrieve more specifically what you are looking for using Indexed Filters and Fields.  

Example Search Strategies:

 Type of assay

"fluorescence resonance energy transfer"[Detection Method]
"flow cytometry"[Detection Method] 
"mass spectrometry"[Detection Method] 

 Organism or cell-line

"hl 60"[Cell Line] 
"escherichia coli k 12"[Organism]
"candida albicans"[Organism]

 Type of experiment

cellbased[Filter] 
biochemical[Filter]
or more specifically:  invivo[Filter]  or  invitro[Filter]

 The target of the experiment:
    gene symbols or gene or protein names

"c kit"[Gene Symbol] 

Learn more about using Entrez Indices and Filters in PubChem


Finding a potential inhibitor- KIT Example

Proto-oncogenes are genes that code for proteins that help regulate cell growth and differentiation. The human kit protooncogene encodes a receptor tyrosine kinase which is a potentially important drug target because atypical kit protooncogene's gene expression leads to diseases like gastrointestinal tumors, mast cell disease, and leukemia.  

We'd like to:
  • Learn more about the kit protooncogene - in NCBI's Gene database, but also in PubChem's Gene pages
  • Find chemicals that were tested for activity against this gene product
  • Narrow the list down to one that might be a good inhibitor to use for our own studies
  • Identify one with chemical vendor information

To learn more about the human kit protooncogene:

1. Start your search on the PubChem Homepage and type in: kit protooncogene
2. Click on the Best Match result to get to the PubChem Gene page for the human kit protooncogene.
3. Take a look over at the Contents (on the top-right) to see and be able to quickly jump to sections within the record


To find compounds tested against the human c-kit gene product:

4. Click on Chemicals and Bioactivities section, to the Tested Compounds table
5. Download the table as a CSV file and once downloaded, you can open the file in Excel, Google sheets, R, Python's Pandas, etc. and see all the detailed information provided including corresponding PubMed identifiers (PMIDs), if you want to look more into any particular row's data.
6. Open the CSV file and sort the data based on "acname". Pay attention to measures of inhibition (Ki or IC50 values) or substrate binding (KD)
7. If you don't want to download a CSV file, you can click View More Rows & Detials for more information
8. Sort structures by activity value
Select a compound to learn more about!


Find the PubChem Compound record for the proposed inhibitor and link to a vendor:

9. Click on a relevant CID for more information, including how to buy the compound 
10. Click on Chemical Vendors
11. CLick on Purchasable Chemical ID for more information

Congratulations on finding a reagent. Contact the vendor and good luck with your  experiments!


Take-away Message

  • You can leverage fundamental search strategies we've learned and your knowledge of PubChem BioAssay to gain exhaustive information about your target protein or gene, including information on potential inhibitors!
  • PubChem Text Search, PubChem Gene pagesPubChem BioAssay, and PubChem Compound were key resources for this exercise. Review PubChem sponsored documentation for more information

Last Reviewed: November 12, 2022