Key points are not available for this paper at this time.
We established a protocol for the prediction of the coding sequences of unidentified human genes based on the double selection and sequence analysis of cDNA clones with inserts carrying unreported 5'-terminal sequences and with insert sizes corresponding to nearly full-length transcripts. By applying the protocol, cDNA clones with inserts longer than 2 kb were isolated from a cDNA library of human immature myeloid cell line KG-1, and the coding sequences of 40 new genes were predicted. A computer search of the sequences indicated that 20 genes contained sequences similar to known genes in the GenBank/EMBL databases. The sequences of the remaining 20 genes were entirely new, and characteristic protein motifs or domains were identified in 32 genes. Other sequence features noted were that the coding sequences of 23 genes were followed by relatively long stretches of 3'-untranslated sequences and that 5 genes contained repetitive sequences in their 3'-untranslated regions. The chromosomal location of these genes has been determined. By increasing the scale of the above analysis, the coding sequences of many unidentified genes can be predicted.
Building similarity graph...
Analyzing shared references across papers
Loading...
Nobuhiko Nomura
University of Tsukuba
N. Miyajima
Kazusa DNA Research Institute
Takashi Sazuka
Nagoya University
DNA Research
Nagoya University
Nippon Medical School
Kazusa DNA Research Institute
Building similarity graph...
Analyzing shared references across papers
Loading...
Nomura et al. (Sat,) studied this question.
synapsesocial.com/papers/6a0f3b0b11edbd3546bddaab — DOI: https://doi.org/10.1093/dnares/1.1.27