This repository contains the analysis code for the manuscript titled below.
Hicks C, Gardner J, Chundi A, Camarda ND, Pham U, Dhar S, Rodriguez H, Jassal C, Bazan JF, Eiger DS, Rajagopal S. Identifying conserved G protein-coupled receptor signaling networks independent of heterotrimeric G proteins and β-arrestins. Submitted to Proceedings of the National Academy of Sciences (PNAS) (July 2025).
Our goal was to discover conserved signaling pathways that persist when canonical heterotrimeric G protein and β-arrestin signaling are genetically or pharmacologically disabled. The resulting analyses revealed a conserved signaling network independent of both G proteins and β-arrestins, suggesting alternative scaffolds maintain GPCR signaling. These findings help redefine GPCR pharmacology and may guide the design of biased ligands with fewer off-target effects.
- Data preprocessing: Standardizes and normalizes mass spectrometry and phosphoproteomic datasets from multiple GPCR studies (Rajagopal, Kruse, Rockman, von Zastrow).
- Identifier harmonization: Maps protein accessions to UniProtKB and Entrez Gene identifiers for integration across species and platforms.
- Signal extraction: Applies log2 transformation, variance stabilization, and outlier filtering to extract phosphorylation changes.
- Pathway and network analysis: Runs functional enrichment (Gene Ontology, KEGG) and protein–protein interaction analysis (STRINGdb) to identify non-canonical GPCR signaling modules.
- Visualization: Provides functions to generate heatmaps, volcano plots, and network diagrams of significant pathway activations.
Data from Rajagopal, Kruse, Rockman, and Von Zastrow datasets were preprocessed and mapped to human-readable gene names for further analysis. This involved filtering, log2 transformation, and outlier removal. Protein accessions were mapped using UniProtKB and Entrez Gene identifiers for subsequent analyses in Gene Ontology/KEGG and STRINGdb.
We employed Fisher's method for combined p-value calculation using the "metap" R package. This involved converting two-sided p-values to one-sided, grouping by accession, and correcting the resultant combined Fisher p-values for multiple hypothesis testing.
The "clusterProfiler" and "STRINGdb" R packages facilitated GO/KEGG enrichment and STRINGdb protein-protein interaction, network visualization, and clustering analyses. We set specific thresholds for enrichment and interaction scores and used the FastGreedy algorithm for clustering.
The repository includes several custom R functions to aid in data analysis and visualization:
plot_fc: For generating waterfall plots, showcasing protein expression data.plot_volcano: For creating volcano plots, highlighting significant protein expressions.
generate_intervals,find_good_intervals,generate_quant_intervals: Functions for interval generation in data analysis.generate_my_mapped_proteins: Maps gene names to STRINGdb IDs and summarizes data.make_tbl_data: Saves data into Excel files for further analysis.plot_stringdb_interactions,generate_interaction_enrichment_tables: Functions for STRINGdb interaction analysis and visualization.
remove_fig_number,p_friend: Assist in text and data formatting.make_vector_genes: Creates a list of unique gene symbols and corresponding Entrez IDs.make_naming_tribble_waterfall,make_naming_tribble_volcano: Utility functions for naming and organizing plot data.generate_significant_hits_interaction_network,generate_signif_hits_enrichment_tables,generate_pert_pathways: For analyzing and visualizing significant interaction networks.
easy_plot_fc,easy_plot_volcano: Simplified functions for quick and efficient generation of waterfall and volcano plots.
To use these functions, clone this repository and source the functions in your R environment. Example usage for each function is provided within the function documentation. The usage order of the main executing scripts was:
load_and_process_data.Rfigure2B-D.Rfigures_2E-3-4.Rtable_1.R
Contributions to this codebase are welcome. Please feel free to fork the repository, make changes, and submit pull requests. For bugs, questions, or suggestions, please open an issue in the repository.
This project represents a collaborative effort to advance the understanding of GPCR signaling pathways. We hope this codebase will be a valuable resource for researchers in the field.