Protein Crystallization & X-ray Crystallography Services Company

We support our clients’ drug discovery projects with custom protein crystallization and protein X-ray structure determination services. Our proximity to the 4th-generation synchrotron MAX IV ensures priority access for our clients’ projects and enables the collection of high-quality X-ray data.

State-of-the-Art Services Facilities: Crystallization & Protein X-ray Crystallography

SARomics Biostructures’ protein X-ray crystallography services technology platform is designed to accelerate our clients’ structural biology and drug discovery projects. Our location near the BioMax and MicroMax beamlines at the 4th-generation MAX IV synchrotron laboratory enables us to offer premium high-throughput crystallography services.

Our services are also enhanced by high-precision, high-throughput liquid-handling imaging robotics for crystallization screening, along with predefined protocols that standardize settings, rules, and procedures to ensure consistency, reproducibility, and reliability in sample handling and crystallization. This technical base enables us to establish initial crystallization conditions and rapidly optimize them. A list of FAQs on protein crystallization and structure determination service project is presented below.

Please see below for more details about our services.

Our FastLane™ libraries of off-the-shelf drug target protein structures power our services!

FastLane premium library for accelerated protein crystallography

Premium FastLane™ library

FastLane standard library of proteins

Standard FastLane™ library

SARomics Biostructures’ Protein X-Ray Crystallography Services

Björn Walse & Raymond Kimbung discussing a protein crystallization experiment
Contact us to discuss your project!

Gene-to-Structure Services

Our comprehensive gene-to-structure services pipeline leverages our FastLane™ Standard library of almost 500 drug target constructs, which can be cloned, expressed, purified, and crystallized using verified protocols. In addition, we offer custom gene-to-structure services for proteins not included in our FastLane™ Standard library.
Our services include:

  • Cloning, expression, purification, and biophysical characterization of the protein.
  • Polydispersity assessment by dynamic light scattering (DLS).
  • Folding assessment by circular dichroism (CD) spectroscopy.
  • Stability assessment by differential scanning fluorimetry (DSF).
  • If required, additional assessments of protein folding may be conducted using our NMR spectroscopy services.
  • Protein crystallization screens, which include thousands of conditions, are used to identify initial crystallization conditions.
  • Synchrotron X-ray data collection, followed by crystal structure determination of the protein using molecular replacement or SeMet-labeled protein.

The details of our protein characterization, crystallization, and structure determination are provided in our FAQ section below.


Why Choose SARomics Biostructures CRO Services?

  • With over 20 years of expertise across a broad range of drug-target classes, we provide protein crystallization and high-throughput X-ray crystallography services to support our clients’ drug discovery projects. We are offering access to 600+ off-the-shelf FastLane protein structures for fast co-crystallization with clients’ small-molecule ligands and therapeutic antibodies.
  • If the protein of your interest is not in our library, you can benefit from our custom protein production and gene-to-structure services.
  • We have been serving the majority of the top pharmaceutical companies and many of the world’s most innovative biotech firms, as well as academic groups across North America, South America, Asia, Europe, and Australia.
  • Our publications list, featuring, among others, 8 Nature papers and over 10 papers in other high-ranking journals, demonstrates the company’s proven expertise and the high standard of the projects we handle.

FAQs: Sample Preparation, Crystallization, and Structure Determination

To streamline and maximize the speed and efficiency of our services, our staff adheres to specific protocols throughout all crystallization and protein X-ray structure determination experiments. To ensure the quality of client samples, we have also developed general guidelines which may be downloaded for your convenience. A product sheet for one of our crystallization-grade Protein Shop proteins demonstrates the required quality for crystallization-grade proteins.

Below is a detailed description of the various stages, including sample preparation, crystallization, and subsequent crystal structure determination. On our educational site, you may find more information on the basics and techniques of protein crystallization.

Assuming the protein has been cloned, expressed, and purified, we can begin planning the crystallization experiments. The following steps are involved in the procedure:

  • Sample characterization
  • Preparation for screening and choosing commercial screens
  • Monitoring crystallization drops and marking the most promising conditions
  • Optimization of positive hits using in-house designed screens

Before running crystallization screens, we always begin with quality control (QC) of the protein sample. Such controls aim to remove factors that can prevent protein crystallization. Among these factors are protein flexibility, instability, polydispersity, time-dependent denaturation, and degradation. QC is an essential step that prevents the loss of precious material and time in crystallization trials that yield no crystals. Our sample characterization and crystallization experiments follow predefined protocols that standardize settings and procedures to ensure consistency, reproducibility, and reliability.

After expressing and purifying the protein, or upon receiving it from a client, we employ various biophysical techniques to control the sample’s quality. The primary aims of this stage include:

  • Evaluating the protein’s state in solution, e.g., the presence of aggregates, denatured material, polydispersity, or any impurities.
  • It is also essential to assess stability across various buffers, pH values, and salt concentrations commonly used for crystallization screening.
  • It is desirable to verify the protein’s stability at room temperature and in cold storage, since crystallization trials are typically conducted at two temperatures.

We use the following biophysical techniques for sample characterization:

  • Differential scanning fluorimetry (DSF) is used to assess protein stability across different buffers and temperatures.
  • Size-exclusion chromatography (SEC) is used to remove potential aggregates.
  • Dynamic light scattering (DLS) and mass spectrometry (MS) are used to assess sample monodispersity.
  • For quality control and to detect partially unfolded regions or regions of high flexibility, NMR spectroscopy is used via HSQC fingerprint analysis.

Screening and optimization of the crystallization conditions for a new protein can take as little as 3-6 weeks, though some samples may require more time. Please note that the process is substantially accelerated if the protein is found in our FastLane™ structure libraries. The libraries contain 600+ proteins ready for crystallization (Premium list) or for expression, purification, and crystallization according to verified protocols (Standard list). However, even when the protein’s structure is known from previous studies, it is often necessary to perform new screening experiments to obtain high-quality crystals. Slight changes in expression, purification, or characterization protocols, or the presence of new additives in the buffer, such as ligands for co-crystallization to obtain a protein-ligand complex, may also require modifications to the crystallization conditions. In addition, in some experiments, like crystallographic fragment screening, hundreds of stable crystals with well-reproducible crystallization conditions are required.

Hundreds of screens, such as those available at Hampton Research, are used initially. We routinely use high-throughput, high-precision liquid-handling robotics for the initial screens and during subsequent optimization of newly identified crystallization conditions. Typically, approximately 5 mg of protein at a concentration of around 10 mg/ml is required at the initial stage. Lower protein concentrations can be used if solubility is an issue.

The initial crystallization conditions identified during screening almost always require further optimization. During optimization, all volumes mentioned above can be adjusted based on the size of the crystals being produced and the evaporation rate of the crystallization solution. Alternative temperatures can also be explored during this stage. By varying incubation time and temperature, crystal nucleation and growth rates can be adjusted, potentially improving crystal quality and yield (e.g., for crystallographic fragment screening). Well-formed crystals typically range from 0.5 to 2 mm in size, depending on the protein and crystallization conditions. We use automated, temperature-controlled plate hotels designed for high-throughput protein crystallization and monitoring. The crystallization plates are incubated at different temperatures and monitored using an imaging system, such as the Formulatrix Rock Imager.

The large number of buffer conditions used in the screening reflects the numerous factors and their various combinations that may affect protein solubility and crystallization. The most common protocols rely on the variation of factors, such as:

  • Type of buffer and its pH
  • Ionic strength
  • The presence of various salts in the solution
  • The presence of ligands (co-factors, substrate analogs, inhibitors)
  • The type of precipitant used (polyethylene glycol (PEG), ammonium sulfate, etc.)

Protein-Ligand Complex Co-Crystallization

Protein-ligand co-crystallization is extensively used in structure-based drug design (SBDD). Co-crystallization with a ligand of interest usually starts after establishing the crystallization conditions for the apo-protein. There are two primary methods for obtaining a ligand complex: (1) adding a ligand solution to the crystallization drop containing crystals and soaking the crystals with the ligand, and (2) co-crystallization with the ligand. Soaking may work because protein crystals contain a relatively high percentage of solvent (crystallization buffer), allowing the ligand to diffuse into the crystal. If the ligand-binding site within the crystal is exposed and accessible to solvent, the ligand will bind to it. On the other hand, if the site is deeply buried within the protein or obscured by crystal contacts, it becomes inaccessible to ligands.

There are many cases where ligand soaking cracks the crystal, resulting in poor X-ray diffraction. In such cases, we need to use the second method, co-crystallization with a ligand, which can be applied in different variations. The simplest approach is to mix the ligand solution with the protein prior to preparing crystallization drops. However, ligand binding may trigger a conformational transition in the protein, which may require reoptimization of the crystallization conditions. In some cases, co-crystallization may be combined with seeding, which may help in obtaining crystals of the complex. We also need to note that in all these experiments, ligand affinity is an important factor to consider.

More information, with detailed discussion and guidelines for the successful generation of protein–ligand complex crystals, can be found in this excellent publication. Some practical details can also be found on the Hampton Research website.

More than 100 years have passed since William H. Bragg and William L. Bragg (father and son) explained the diffraction patterns obtained by irradiating crystals with X-rays and derived Bragg’s law. A few years later, in 1915, they were awarded the Nobel Prize in physics. In 1926, the first crystals of an enzyme, urease, were reported by James B. Sumner, which can be considered the starting point for structural biology.

For many years, low-intensity laboratory X-ray sources available during the early stages of protein X-ray crystallography made collecting high-quality diffraction data very slow and, for small, weakly diffracting crystals, entirely impossible. Much time had to be spent optimizing crystallization conditions to obtain larger crystals suitable for the X-ray sources of that time. However, since the mid-1990s, with the introduction of synchrotrons, enormous advances have fundamentally changed how we do X-ray crystallography. With modern 4th-generation synchrotrons capable of generating high-brilliance microfocus beams, advanced robotics and automation, fast detectors, and high-speed diffraction data processing, high-throughput X-ray crystallography has become routine. Today, we can determine protein structures using much smaller crystals with greater efficiency, at a much higher speed, and at a considerable reduction in cost.

The process of protein structure determination by X-ray crystallography includes five main stages:

  1. Crystallization (initial screening and crystal hit optimization)
  2. Crystal harvesting, testing, and X-ray data collection
  3. Obtaining phases, calculating the electron density map, and building the structural model
  4. Refinement and overall quality assessment against Protein Data Bank criteria
  5. Ligand-protein complex structure validation and quality assessment

In the text above, we discussed crystallization. Below, we will address the remaining topics.

2. Crystal Testing and the “Synchrotron-Only” Approach to Data Collection

In our earlier blog post, we discussed how AstraZeneca’s team decided to switch from combining a laboratory X-ray source (for preliminary crystal tests) and synchrotron radiation for data collection to a “synchrotron-only” approach. At SARomics Biostructures, both crystal testing and diffraction data collection are primarily conducted at the protein crystallography beamlines BioMax and MicroMax at the 4th-generation MAX IV synchrotron in Lund. With a synchrotron in proximity, we enjoy quick access to beamlines, eliminating the need to ship crystals over long distances to remote synchrotrons and the risk of crystal damage during transport. In a note below, we discuss the benefits of synchrotron X-ray crystallography.

Generally, to minimize radiation damage from modern high-intensity synchrotron beams, data collection must be conducted at cryogenic temperatures. To this end, after harvesting from drops, crystals are transferred to a cryo-solution and flash-frozen at liquid nitrogen temperature. The cryo-solution prevents ice formation around a crystal when it is mounted in a special loop before insertion into an X-ray beam. Ice interferes with data collection (ice crystals scatter X-rays) and may also damage the protein crystals, resulting in poor data quality. However, as discussed in our blog article, there are instances in which room-temperature protein crystallography can be advantageous.

3. Phase Determination and Electron Density Calculation

Calculating an electron density map, into which the structural model will be built, requires phase information. X-ray detectors register only the intensities of the diffracted waves; the phase information is lost during this process and must be retrieved using other methods. The method of heavy-atom replacement for solving the phase problem was first developed by J. C. Kendrew et al., who reported the structure of myoglobin in 1958, and M. F. Perutz et al., who published the structure of hemoglobin in 1960. Both scientists were awarded the Nobel Prize in Chemistry in 1962.

Currently, heavy-atom replacement is rarely used. Instead, molecular replacement is the most popular method. Molecular replacement requires an experimental 3D structure of the protein or a close homolog (typically with sequence identity above 40-50%) to calculate initial phases and an electron density map. At times, computer-modeled structures (e.g., from AlphaFold) may also be used to solve the phase problem. However, this requires that the conformation of the computed structure (especially for multidomain proteins) match that of the crystal structure being studied. If molecular replacement fails, de novo structure determination must be applied. Typically, a selenomethionine (SeMet)-labeled protein sample, which we can produce upon the client’s request, enables structure determination using synchrotron radiation and anomalous X-ray scattering.

Once the phases are available, an electron density map revealing the positions of the atoms in the crystal can be calculated. An atomic model of the structure can be built at this stage, e.g., using the protein’s amino acid sequence.

Please visit our educational site for more information on basic principles of protein crystallography.

4. Refinement and Model Overall Quality Assessment

The initial model built into the electron density map must be examined and further adjusted to better fit the experimental data. This process is called refinement. An improved model will provide better phases and allow calculation of a better map that may reveal new details (e.g., missing loop density, missing density for parts of a ligand, or new solvent molecules). The crystallographic R-factor and R-free are primarily used to monitor refinement progress and to assess the overall agreement between the model and the experimental data. However, we should keep in mind that the resolution of the X-ray data ultimately determines both the quality and level of detail in the interpretation of the structure.

Generally, a higher resolution allows more reliable model interpretation, whereas for lower-resolution models (e.g., under 2.5 Å), the interpretation of the electron density map may be more subjective. However, higher resolution does not automatically guarantee good model quality. There are many examples of high-resolution structures with poor overall quality. For this reason, in addition to the R-factor and resolution, the PDB provides other factors for quality assessment. They include the Ramachandran plot and deviations of bond lengths and bond angles from their ideal values, collectively referred to as overall geometry (stereochemistry) criteria. Our educational site provides a detailed discussion on X-ray crystallographic structure validation and quality criteria assessment.

5. Ligand-Protein Complex Structure Validation

The correct interpretation of the electron density observed in a protein’s binding site can sometimes be challenging. There have been numerous instances in which the density at the binding site was actually due to water molecules, buffer components, or precipitant molecules. Therefore, it is essential to validate the quality of the atomic model of the ligand-protein complex. This validation is crucial for all subsequent steps, especially in computational methods used in structure-based drug discovery projects, which depend heavily on the quality of the 3D structure. Global statistics, such as resolution, the R-factor, and model geometry discussed above, ensure that the structure is well-refined and meets the general quality criteria set by the PDB. However, even well-refined protein structures may contain problematic regions with poor electron density, resulting in poor local model quality. In the case of a bound ligand (small molecule), poor electron density may result from partial occupancy of the binding site as well as conformational flexibility of the ligand in the crystal. Therefore, it is important to inspect the bound ligand using so-called real-space (in contrast to reciprocal space) quality indicators.

  • One such indicator is the quality of the electron density used to build the ligand. We need to ensure that the electron density covers all ligand atoms and that there is no unaccounted-for density within the binding site. The presence of unaccounted-for density, e.g., in crystallographic fragment screening, may indicate that the bound molecule is not what we expected to find there.
    The fit of the model to the electron density can be assessed by examining the Real Space Correlation Coefficient (RSCC) and the Real Space R-Value (RSR). These values are calculated per residue and can be obtained from the PDB validation report. RSCC is generally higher for higher-resolution and lower-B-factor models – a perfect fit of the model to the electron density would yield an RSCC of 1.0. The lower the RSCC, the greater the uncertainty in the ligand model’s position relative to the electron density. Generally, values between 0.8 and 1.0 are considered satisfactory.
  • On some occasions, model bias may also result in spurious electron density, which is mistakenly interpreted as evidence of ligand presence. If in doubt, we can verify the presence of the ligand by first removing it from the coordinate file, running a few cycles of refinement, and then calculating a so-called omit electron density map (a map with the ligand molecule omitted from the coordinate file). If the ligand is present in the binding site at a high occupancy, an omit map will clearly reveal its density.
  • In addition, we need to examine the interactions between the ligand and the amino acid side chains within the protein-binding site. For example, chemically unrealistic interactions or atom-atom distances that are too short reveal potential problems with the ligand’s positioning.
  • We also need to ensure that the ligand model complies with the expected stereochemical parameters of the chemical structure.
  • The atomic displacement factors (B-factors or temperature factors) of the ligand should be compared to the average B-factors of the protein atoms within the binding site. Too-high ligand temperature factors should also alert us to potential problems.

A detailed discussion of ligand validation in macromolecular structures determined by X-ray crystallography in the PDB can be found in the excellent paper by Smart et al. (2018) and Pozharski et al. (2017).


What Are The Benefits of Synchrotrons in Crystallography & Drug Discovery?

  • With no further time-consuming crystallization-condition optimizations, the high beam brilliance enables high-throughput X-ray data collection from much smaller crystals than before.
  • A brighter, more focused beam yields considerably higher-resolution data than laboratory X-ray sources.
  • Tunable wavelengths can be used for phasing by multiple-wavelength anomalous scattering (MAD).
  • Synchrotrons enable time-resolved studies and serial crystallography.
  • High-throughput Crystallographic fragment screening is only possible at synchrotrons.
  • The revival of room-temperature crystallography has important applications in structure-based drug design.

An examination of Protein Data Bank (PDB) statistics clearly demonstrates the central role of X-ray crystallography in structural biology. It shows that, out of over 250,000 deposited structures, around 15,000 have been determined by NMR spectroscopy, over 200,000 by X-ray crystallography, and more than 33,000 by Cryo-EM. These differences indicate that while X-ray crystallography still dominates structural biology, Cryo-EM is catching up, especially for larger proteins and protein complexes.

The arrival of modern 4th-generation synchrotrons with higher radiation brightness and coherence is likely to enhance further synchrotrons’ capabilities for studying a wider range of biological processes, pushing the boundaries of X-ray crystallography applications in drug discovery. A recent study by the AstraZeneca team on the impact of synchrotron radiation on drug discovery emphasized a shift from using a “combined laboratory X-ray source-synchrotron” to a “synchrotron-only” approach for the company’s drug discovery projects.

On the image: the MAX IV Synchrotron Laboratory in Lund, Sweden

Highlights of Synchrotron Crystallography

Among the highlights of synchrotron crystallography was the determination of the ribosome’s three-dimensional structure. Three researchers, Ada Yonath, Venki Ramakrishnan, and Thomas Steitz, were awarded the Nobel Prize in Chemistry in 2009 for their work on the structure of the ribosome. The Nobel Foundation report on the scientific background of the prize stresses the role of new synchrotron technologies, such as the “introduction of CCD area-detectors for precise and automated analysis of x-ray diffraction patterns and tunable synchrotron radiation sources for optimal use of anomalous scattering for phase determination” in the project’s success. It should be noted that the bacterial ribosome is probably the largest known drug target, as many antibiotics inhibit bacterial protein synthesis.

Case Study: Do We Need Protein X-ray Crystallography in AI Drug Discovery?

A recent excellent paper, “Prospective de novo drug design with deep interactome learning,” by the group of Professor Gisbert Schneider at ETH Zurich, with a contribution from SARomics Biostructures, demonstrates that X-ray crystallography remains necessary for ligand-binding verification.

Prospective de novo drug design with deep interactome learning.
Atz K, Cotos L, Isert C, Håkansson MFocht D, Hilleke M, Nippa DF, Iff M, Ledergerber J, Schiebroek CCG, Romeo V, Hiss JA, Merk D, Schneider P, Kuhn B, Grether U, Schneider G (2024). Nat Commun. 15, 3408.

In this work, new ligands targeting the binding site of the human peroxisome proliferator-activated receptor subtype gamma (PPARgamma, a protein from our FastLane™ Premium library ) were generated. The ligand’s binding mode was subsequently confirmed by the crystal structure of the protein-ligand complex provided by the SARomics team. The structure shows that the ligand effectively interacts with the receptor in a canonical binding mode.

AI-based drug discovery, protein crystallization set up   at SARomics
Related Services
Antibody structure determination by protein X-ray crystallography
Antibody Structure
Protein NMR (BioNMR) spectroscopy services
BioNMR Spectroscopy
Fragment screening services
Fragment Screening & Drug Design