This archive contains data files the Python 3 code needed to reproduce the analysis done for the preprint "Proteome-scale quantification of the interactions driving condensate formation of intrinsically disordered proteins". Large amounts of the data are Illumina DNA sequencing experiments, where the fastq-file are currently not included due to their large file size. Instead, these files are included as sequencing counts of identical merged reads (CSV-files). Additionally, data recorded on the Fida 1 instrument (Fidabio, Denmark) is included and consists of fluorescence recordings over time at the end of a 1 or 0.5 m long microfluidic channel (Ø 75 µm) presented in CSV-files. Additional data file types include raw microscopy images (Leica SP8 confocal microscope, Germany). The scripts are made using Python 3.11.7 with Jupyter notebooks and uses packages: Numpy (v2.3.5), Pandas (v2.3.3), Matplotlib (v3.10.8), Seaborn (v0.13.2), Scipy (v1.16.3), Scikit-learn (v1.8.0), Biopython (v1.86), Uncertainties (v3.2.3), Emcee (v3.1.6), Metapredict (v3.0.1), and localCIDER (v0.1.21). Two-costum Python packages have been created, CPmD and fidapy, and are included in the uploaded files. CPmD is used to run most of the analysis code and fidapy is used to analyse Fida 1 data. Both packages have README files describing installation and basic usage. All figures are included as Scalable Vector Graphics (SVG) files and can be opened using Inkscape. © Technical University of Denmark
Norrild et al. (Thu,) studied this question.