; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS002941 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS002941
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionTrigger_N domain-containing protein
Genome locationscaffold595_1:201383..205647
RNA-Seq ExpressionMS002941
SyntenyMS002941
Gene Ontology termsGO:0000413 - protein peptidyl-prolyl isomerization (biological process)
GO:0015031 - protein transport (biological process)
GO:0043335 - protein unfolding (biological process)
GO:0051083 - 'de novo' cotranslational protein folding (biological process)
GO:0061077 - chaperone-mediated protein folding (biological process)
GO:0003755 - peptidyl-prolyl cis-trans isomerase activity (molecular function)
GO:0043022 - ribosome binding (molecular function)
GO:0044183 - protein folding chaperone (molecular function)
InterPro domainsIPR005215 - Trigger factor
IPR008881 - Trigger factor, ribosome-binding, bacterial
IPR036611 - Trigger factor ribosome-binding domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008460020.1 PREDICTED: uncharacterized protein LOC103498959 [Cucumis melo]2.6e-9276.33Show/hide
Query:  ASATATVTNLA-SEYRRARFIKVPVKGYCRSVLACPNSGGVEFDNVQNGLCRRSSFSCSPHKLGSRFLSRPTSIASSGSLEAAITDYKGNAIALKNAKIV
        A+ATATVTN+A SE+ R  F KV VKGYC + L C N GGVEFDNVQ+GLCRRSSFS S H++GSRFLS+PTSIASSG LEAA+TDYKGNAI LKNAK+V
Subjt:  ASATATVTNLA-SEYRRARFIKVPVKGYCRSVLACPNSGGVEFDNVQNGLCRRSSFSCSPHKLGSRFLSRPTSIASSGSLEAAITDYKGNAIALKNAKIV

Query:  VESEDENKIQLRVDLSGDETQKIFDQVLTNLARSAPPMPGFRRQKGGSTMIFLPIHVPKSFLLEVLGEERVTKFVIQEILNSTMADYAKKAIHFFLLTPR
        VESE+ENKIQLRVDL+GDETQK+FDQVLTNLARSAPPMPGFR+QKGG T      +VPKSFLLEVLG++RVTKFVIQEILNSTMADYAKK          
Subjt:  VESEDENKIQLRVDLSGDETQKIFDQVLTNLARSAPPMPGFRRQKGGSTMIFLPIHVPKSFLLEVLGEERVTKFVIQEILNSTMADYAKKAIHFFLLTPR

Query:  QENVNVKDKKVNTTQTADELKLLFTPGKEFGFNAILELESANSED
         EN+NVKD KVNTTQTADELK+LF PGKEFGFNAILELESA+  +
Subjt:  QENVNVKDKKVNTTQTADELKLLFTPGKEFGFNAILELESANSED

XP_011656737.1 uncharacterized protein LOC101212225 [Cucumis sativus]3.1e-9376.23Show/hide
Query:  ASATATVTNLASEYRRARFIKVPVKGYCRSVLACPNSGGVEFDNVQNGLCRRSSFSCSPHKLGSRFLSRPTSIASSGSLEAAITDYKGNAIALKNAKIVV
        A+ATATVTN+ASE+ R  F KVPVKGYC + L C N GGVEFDNV++GLCRRSSFS S HK+GSRFLS+PTSIASSG LEAAITDYKGN I LKNAK+VV
Subjt:  ASATATVTNLASEYRRARFIKVPVKGYCRSVLACPNSGGVEFDNVQNGLCRRSSFSCSPHKLGSRFLSRPTSIASSGSLEAAITDYKGNAIALKNAKIVV

Query:  ESEDENKIQLRVDLSGDETQKIFDQVLTNLARSAPPMPGFRRQKGGSTMIFLPIHVPKSFLLEVLGEERVTKFVIQEILNSTMADYAKKAIHFFLLTPRQ
        ESE+ENKIQLRVDL+GDETQK+FDQVLTNLARSAPPMPGFR+QKGG T      +VPKSFLLEVLG++RVTKF+IQEILNSTM DYAKK           
Subjt:  ESEDENKIQLRVDLSGDETQKIFDQVLTNLARSAPPMPGFRRQKGGSTMIFLPIHVPKSFLLEVLGEERVTKFVIQEILNSTMADYAKKAIHFFLLTPRQ

Query:  ENVNVKDKKVNTTQTADELKLLFTPGKEFGFNAILELESANSED
        EN+NVKDKKV+TTQTADELK+LF PGKEFGFNAILELESA+  +
Subjt:  ENVNVKDKKVNTTQTADELKLLFTPGKEFGFNAILELESANSED

XP_022155406.1 uncharacterized protein LOC111022553 [Momordica charantia]9.6e-11190.24Show/hide
Query:  MASATATVTNLASEYRRARFIKVPVKGYCRSVLACPNSGGVEFDNVQNGLCRRSSFSCSPHKLGSRFLSRPTSIASSGSLEAAITDYKGNAIALKNAKIV
        MASATATVTNLASEYRRA FIKVPVKGYCRSVLAC NSGGVEFDNVQNGLCRRSSFSCSPHKLGSRFLSRPTSIASSG LEAAITDYKGNAIALKNAKIV
Subjt:  MASATATVTNLASEYRRARFIKVPVKGYCRSVLACPNSGGVEFDNVQNGLCRRSSFSCSPHKLGSRFLSRPTSIASSGSLEAAITDYKGNAIALKNAKIV

Query:  VESEDENKIQLRVDLSGDETQKIFDQVLTNLARSAPPMPGFRRQKGGSTMIFLPIHVPKSFLLEVLGEERVTKFVIQEILNSTMADYAKKAIHFFLLTPR
        VESEDEN IQLRVDLSGDETQKIFDQVLTNLARSAPPMPGFR QKGG T      +VPKSFLLEVLGEERVTKFVIQEILNSTMADYAKK          
Subjt:  VESEDENKIQLRVDLSGDETQKIFDQVLTNLARSAPPMPGFRRQKGGSTMIFLPIHVPKSFLLEVLGEERVTKFVIQEILNSTMADYAKKAIHFFLLTPR

Query:  QENVNVKDKKVNTTQTADELKLLFTPGKEFGFNAILELESANSEDE
         ENVNVKDKKVNTTQTADELKLLFTPGKEFGFNAILELESANSEDE
Subjt:  QENVNVKDKKVNTTQTADELKLLFTPGKEFGFNAILELESANSEDE

XP_022960370.1 uncharacterized protein LOC111461115 [Cucurbita moschata]7.7e-8475.31Show/hide
Query:  MASATATVTNLASEYRRARFIKVPVKGYCRSVLACPNSGGVEFDNVQNGLCRRSSFSCSPHKLGSRFLSRPTSIASSGSLEAAITDYKGNAIALKNAKIV
        MASATATV N+ASE+RR  F KVPV G  R+ L C N GGVEF+NVQ+ LC RSS S SP  +GSRFLSRPT+IASSG LEAAITDYKG AI LKNAKIV
Subjt:  MASATATVTNLASEYRRARFIKVPVKGYCRSVLACPNSGGVEFDNVQNGLCRRSSFSCSPHKLGSRFLSRPTSIASSGSLEAAITDYKGNAIALKNAKIV

Query:  VESEDENKIQLRVDLSGDETQKIFDQVLTNLARSAPPMPGFRRQKGGSTMIFLPIHVPKSFLLEVLGEERVTKFVIQEILNSTMADYAKKAIHFFLLTPR
        VESEDENKIQLRVDL+GDETQK+FDQVLTNLARSAP MPGFR+QKGG T      +VPKSFLLEVLG++RVTKFVIQEILNSTM DYAKK          
Subjt:  VESEDENKIQLRVDLSGDETQKIFDQVLTNLARSAPPMPGFRRQKGGSTMIFLPIHVPKSFLLEVLGEERVTKFVIQEILNSTMADYAKKAIHFFLLTPR

Query:  QENVNVKDKKVNTTQTADELKLLFTPGKEFGFNAILELE
         EN+ VKDK VNTTQT DELK LF PGKEFGFNAILELE
Subjt:  QENVNVKDKKVNTTQTADELKLLFTPGKEFGFNAILELE

XP_038906537.1 uncharacterized protein LOC120092511 [Benincasa hispida]9.7e-9577.87Show/hide
Query:  ASATATVTNLASEYRRARFIKVPVKGYCRSVLACPNSGGVEFDNVQNGLCRRSSFSCSPHKLGSRFLSRPTSIASSGSLEAAITDYKGNAIALKNAKIVV
        A+ATAT TN+ASE+R   F +VPVKGYCR+ L C N GGVEFDNVQ+GLCRRSSFS S HK+GSRFLS+PTSIASSG LEAAITDYKGNAI LKNAK+VV
Subjt:  ASATATVTNLASEYRRARFIKVPVKGYCRSVLACPNSGGVEFDNVQNGLCRRSSFSCSPHKLGSRFLSRPTSIASSGSLEAAITDYKGNAIALKNAKIVV

Query:  ESEDENKIQLRVDLSGDETQKIFDQVLTNLARSAPPMPGFRRQKGGSTMIFLPIHVPKSFLLEVLGEERVTKFVIQEILNSTMADYAKKAIHFFLLTPRQ
        ESE+ENKIQLRVDL+GDETQK+FDQVLTNLARSAPPMPGFRRQKGG T      +VPKSFLLEVLG++RV KFVIQEILNSTMADYAKK           
Subjt:  ESEDENKIQLRVDLSGDETQKIFDQVLTNLARSAPPMPGFRRQKGGSTMIFLPIHVPKSFLLEVLGEERVTKFVIQEILNSTMADYAKKAIHFFLLTPRQ

Query:  ENVNVKDKKVNTTQTADELKLLFTPGKEFGFNAILELESANSED
        EN+NVKDKKVNTTQTADELK+LF+PGKEFGFNAILELESA+  +
Subjt:  ENVNVKDKKVNTTQTADELKLLFTPGKEFGFNAILELESANSED

TrEMBL top hitse value%identityAlignment
A0A0A0K988 Trigger_N domain-containing protein1.5e-9376.23Show/hide
Query:  ASATATVTNLASEYRRARFIKVPVKGYCRSVLACPNSGGVEFDNVQNGLCRRSSFSCSPHKLGSRFLSRPTSIASSGSLEAAITDYKGNAIALKNAKIVV
        A+ATATVTN+ASE+ R  F KVPVKGYC + L C N GGVEFDNV++GLCRRSSFS S HK+GSRFLS+PTSIASSG LEAAITDYKGN I LKNAK+VV
Subjt:  ASATATVTNLASEYRRARFIKVPVKGYCRSVLACPNSGGVEFDNVQNGLCRRSSFSCSPHKLGSRFLSRPTSIASSGSLEAAITDYKGNAIALKNAKIVV

Query:  ESEDENKIQLRVDLSGDETQKIFDQVLTNLARSAPPMPGFRRQKGGSTMIFLPIHVPKSFLLEVLGEERVTKFVIQEILNSTMADYAKKAIHFFLLTPRQ
        ESE+ENKIQLRVDL+GDETQK+FDQVLTNLARSAPPMPGFR+QKGG T      +VPKSFLLEVLG++RVTKF+IQEILNSTM DYAKK           
Subjt:  ESEDENKIQLRVDLSGDETQKIFDQVLTNLARSAPPMPGFRRQKGGSTMIFLPIHVPKSFLLEVLGEERVTKFVIQEILNSTMADYAKKAIHFFLLTPRQ

Query:  ENVNVKDKKVNTTQTADELKLLFTPGKEFGFNAILELESANSED
        EN+NVKDKKV+TTQTADELK+LF PGKEFGFNAILELESA+  +
Subjt:  ENVNVKDKKVNTTQTADELKLLFTPGKEFGFNAILELESANSED

A0A1S3CBN2 uncharacterized protein LOC1034989591.3e-9276.33Show/hide
Query:  ASATATVTNLA-SEYRRARFIKVPVKGYCRSVLACPNSGGVEFDNVQNGLCRRSSFSCSPHKLGSRFLSRPTSIASSGSLEAAITDYKGNAIALKNAKIV
        A+ATATVTN+A SE+ R  F KV VKGYC + L C N GGVEFDNVQ+GLCRRSSFS S H++GSRFLS+PTSIASSG LEAA+TDYKGNAI LKNAK+V
Subjt:  ASATATVTNLA-SEYRRARFIKVPVKGYCRSVLACPNSGGVEFDNVQNGLCRRSSFSCSPHKLGSRFLSRPTSIASSGSLEAAITDYKGNAIALKNAKIV

Query:  VESEDENKIQLRVDLSGDETQKIFDQVLTNLARSAPPMPGFRRQKGGSTMIFLPIHVPKSFLLEVLGEERVTKFVIQEILNSTMADYAKKAIHFFLLTPR
        VESE+ENKIQLRVDL+GDETQK+FDQVLTNLARSAPPMPGFR+QKGG T      +VPKSFLLEVLG++RVTKFVIQEILNSTMADYAKK          
Subjt:  VESEDENKIQLRVDLSGDETQKIFDQVLTNLARSAPPMPGFRRQKGGSTMIFLPIHVPKSFLLEVLGEERVTKFVIQEILNSTMADYAKKAIHFFLLTPR

Query:  QENVNVKDKKVNTTQTADELKLLFTPGKEFGFNAILELESANSED
         EN+NVKD KVNTTQTADELK+LF PGKEFGFNAILELESA+  +
Subjt:  QENVNVKDKKVNTTQTADELKLLFTPGKEFGFNAILELESANSED

A0A6J1DRL2 uncharacterized protein LOC1110225534.7e-11190.24Show/hide
Query:  MASATATVTNLASEYRRARFIKVPVKGYCRSVLACPNSGGVEFDNVQNGLCRRSSFSCSPHKLGSRFLSRPTSIASSGSLEAAITDYKGNAIALKNAKIV
        MASATATVTNLASEYRRA FIKVPVKGYCRSVLAC NSGGVEFDNVQNGLCRRSSFSCSPHKLGSRFLSRPTSIASSG LEAAITDYKGNAIALKNAKIV
Subjt:  MASATATVTNLASEYRRARFIKVPVKGYCRSVLACPNSGGVEFDNVQNGLCRRSSFSCSPHKLGSRFLSRPTSIASSGSLEAAITDYKGNAIALKNAKIV

Query:  VESEDENKIQLRVDLSGDETQKIFDQVLTNLARSAPPMPGFRRQKGGSTMIFLPIHVPKSFLLEVLGEERVTKFVIQEILNSTMADYAKKAIHFFLLTPR
        VESEDEN IQLRVDLSGDETQKIFDQVLTNLARSAPPMPGFR QKGG T      +VPKSFLLEVLGEERVTKFVIQEILNSTMADYAKK          
Subjt:  VESEDENKIQLRVDLSGDETQKIFDQVLTNLARSAPPMPGFRRQKGGSTMIFLPIHVPKSFLLEVLGEERVTKFVIQEILNSTMADYAKKAIHFFLLTPR

Query:  QENVNVKDKKVNTTQTADELKLLFTPGKEFGFNAILELESANSEDE
         ENVNVKDKKVNTTQTADELKLLFTPGKEFGFNAILELESANSEDE
Subjt:  QENVNVKDKKVNTTQTADELKLLFTPGKEFGFNAILELESANSEDE

A0A6J1H8V8 uncharacterized protein LOC1114611153.7e-8475.31Show/hide
Query:  MASATATVTNLASEYRRARFIKVPVKGYCRSVLACPNSGGVEFDNVQNGLCRRSSFSCSPHKLGSRFLSRPTSIASSGSLEAAITDYKGNAIALKNAKIV
        MASATATV N+ASE+RR  F KVPV G  R+ L C N GGVEF+NVQ+ LC RSS S SP  +GSRFLSRPT+IASSG LEAAITDYKG AI LKNAKIV
Subjt:  MASATATVTNLASEYRRARFIKVPVKGYCRSVLACPNSGGVEFDNVQNGLCRRSSFSCSPHKLGSRFLSRPTSIASSGSLEAAITDYKGNAIALKNAKIV

Query:  VESEDENKIQLRVDLSGDETQKIFDQVLTNLARSAPPMPGFRRQKGGSTMIFLPIHVPKSFLLEVLGEERVTKFVIQEILNSTMADYAKKAIHFFLLTPR
        VESEDENKIQLRVDL+GDETQK+FDQVLTNLARSAP MPGFR+QKGG T      +VPKSFLLEVLG++RVTKFVIQEILNSTM DYAKK          
Subjt:  VESEDENKIQLRVDLSGDETQKIFDQVLTNLARSAPPMPGFRRQKGGSTMIFLPIHVPKSFLLEVLGEERVTKFVIQEILNSTMADYAKKAIHFFLLTPR

Query:  QENVNVKDKKVNTTQTADELKLLFTPGKEFGFNAILELE
         EN+ VKDK VNTTQT DELK LF PGKEFGFNAILELE
Subjt:  QENVNVKDKKVNTTQTADELKLLFTPGKEFGFNAILELE

A0A6J1KVH6 uncharacterized protein LOC1114975771.4e-8372.8Show/hide
Query:  ASATATVTNLASEYRRARFIKVPVKGYCRSVLACPNSGGVEFDNVQNGLCRRSSFSCSPHKLGSRFLSRPTSIASSGSLEAAITDYKGNAIALKNAKIVV
        A+ATATV N+ASE+RR  F KVPV G  R+ L C N GGVEF+NVQ+ LC RSS S S   +GSRFLSRPT+IASSG LEAAITDYKG AI LKNAKIVV
Subjt:  ASATATVTNLASEYRRARFIKVPVKGYCRSVLACPNSGGVEFDNVQNGLCRRSSFSCSPHKLGSRFLSRPTSIASSGSLEAAITDYKGNAIALKNAKIVV

Query:  ESEDENKIQLRVDLSGDETQKIFDQVLTNLARSAPPMPGFRRQKGGSTMIFLPIHVPKSFLLEVLGEERVTKFVIQEILNSTMADYAKKAIHFFLLTPRQ
        ESEDENKIQLRV+L+GDETQK+FDQVLTNLARSAPPMPGFR+QKGG T      +VPKSFLLEVLG++RVTKFVIQEILNSTMADYAKK           
Subjt:  ESEDENKIQLRVDLSGDETQKIFDQVLTNLARSAPPMPGFRRQKGGSTMIFLPIHVPKSFLLEVLGEERVTKFVIQEILNSTMADYAKKAIHFFLLTPRQ

Query:  ENVNVKDKKVNTTQTADELKLLFTPGKEFGFNAILELE-----SANSEDE
        EN+ VKDK VNTTQT DELK+LF PGKEFGFNAILELE      + SEDE
Subjt:  ENVNVKDKKVNTTQTADELKLLFTPGKEFGFNAILELE-----SANSEDE

SwissProt top hitse value%identityAlignment
Q3M725 Trigger factor2.8e-0424.65Show/hide
Query:  KIVVESEDENKIQLRVDLSGDETQKIFDQVLTNLARSAPPMPGFRRQKGGSTMIFLPIHVPKSFLLEVLGEERVTKFVIQEILNSTMADYAKKAIHFFLL
        K+  E    ++I L ++++ + TQK ++QV+ NL+R+   +PGFR+ K           VP+  LL+ LG+  +    ++E+L   +    K+     + 
Subjt:  KIVVESEDENKIQLRVDLSGDETQKIFDQVLTNLARSAPPMPGFRRQKGGSTMIFLPIHVPKSFLLEVLGEERVTKFVIQEILNSTMADYAKKAIHFFLL

Query:  TPRQENVNVKDKKVNTTQTADELKLLFTPGKEFGFNAILELE
         PR  +            + D+L   + PG+   F A +++E
Subjt:  TPRQENVNVKDKKVNTTQTADELKLLFTPGKEFGFNAILELE

Q8YQX9 Trigger factor2.8e-0424.65Show/hide
Query:  KIVVESEDENKIQLRVDLSGDETQKIFDQVLTNLARSAPPMPGFRRQKGGSTMIFLPIHVPKSFLLEVLGEERVTKFVIQEILNSTMADYAKKAIHFFLL
        K+  E    ++I L ++++ + TQK ++QV+ NL+R+   +PGFR+ K           VP+  LL+ LG+  +    ++E+L   +    K+     + 
Subjt:  KIVVESEDENKIQLRVDLSGDETQKIFDQVLTNLARSAPPMPGFRRQKGGSTMIFLPIHVPKSFLLEVLGEERVTKFVIQEILNSTMADYAKKAIHFFLL

Query:  TPRQENVNVKDKKVNTTQTADELKLLFTPGKEFGFNAILELE
         PR  +            + D+L   + PG+   F A +++E
Subjt:  TPRQENVNVKDKKVNTTQTADELKLLFTPGKEFGFNAILELE

Arabidopsis top hitse value%identityAlignment
AT2G30695.1 FUNCTIONS IN: molecular_function unknown; INVOLVED IN: protein folding, protein transport; LOCATED IN: chloroplast stroma, chloroplast; EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Trigger factor, ribosome-binding, bacterial (InterPro:IPR008881); Has 253 Blast hits to 253 proteins in 72 species: Archae - 0; Bacteria - 138; Metazoa - 0; Fungi - 0; Plants - 40; Viruses - 0; Other Eukaryotes - 75 (source: NCBI BLink).9.9e-2136.62Show/hide
Query:  VESEDENKIQLRVDLSGDETQKIFDQVLTNLARSAPPMPGFRRQKGGSTMIFLPIHVPKSFLLEVLGEERVTKFVIQEILNSTMADYAKKAIHFFLLTPR
        VE+E  N++++ V +SG++TQ +F+ V   +  +A P+PGFRR KGG T      ++PK  LLE+LG  +V K VI++++NS + DY K           
Subjt:  VESEDENKIQLRVDLSGDETQKIFDQVLTNLARSAPPMPGFRRQKGGSTMIFLPIHVPKSFLLEVLGEERVTKFVIQEILNSTMADYAKKAIHFFLLTPR

Query:  QENVNVKDKKVNTTQTADELKLLFTPGKEFGFNAILELESAN
        QE++ V  K++   Q+ ++L+  F PG+ F F+A ++L+ A+
Subjt:  QENVNVKDKKVNTTQTADELKLLFTPGKEFGFNAILELESAN

AT2G30695.2 FUNCTIONS IN: molecular_function unknown; INVOLVED IN: protein folding, protein transport; LOCATED IN: chloroplast; EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Trigger factor, ribosome-binding, bacterial (InterPro:IPR008881); Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink).9.9e-2136.62Show/hide
Query:  VESEDENKIQLRVDLSGDETQKIFDQVLTNLARSAPPMPGFRRQKGGSTMIFLPIHVPKSFLLEVLGEERVTKFVIQEILNSTMADYAKKAIHFFLLTPR
        VE+E  N++++ V +SG++TQ +F+ V   +  +A P+PGFRR KGG T      ++PK  LLE+LG  +V K VI++++NS + DY K           
Subjt:  VESEDENKIQLRVDLSGDETQKIFDQVLTNLARSAPPMPGFRRQKGGSTMIFLPIHVPKSFLLEVLGEERVTKFVIQEILNSTMADYAKKAIHFFLLTPR

Query:  QENVNVKDKKVNTTQTADELKLLFTPGKEFGFNAILELESAN
        QE++ V  K++   Q+ ++L+  F PG+ F F+A ++L+ A+
Subjt:  QENVNVKDKKVNTTQTADELKLLFTPGKEFGFNAILELESAN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
TCCAGCCGAAAAATGGCGTCCGCAACCGCAACGGTAACCAATCTCGCCTCAGAATACCGGCGGGCAAGATTTATCAAAGTTCCTGTTAAAGGCTATTGCCGCAGTGTTCT
GGCCTGTCCGAACTCCGGAGGCGTTGAATTCGATAATGTACAAAATGGGCTGTGCAGGAGGTCGTCATTTTCTTGTAGTCCTCACAAATTGGGTTCCAGATTTTTGTCCA
GACCGACTTCAATAGCTAGTTCAGGTAGTTTGGAGGCAGCCATCACAGATTACAAAGGCAATGCAATAGCCTTAAAAAATGCTAAGATAGTTGTAGAGTCTGAAGATGAA
AACAAGATACAGCTTCGAGTGGACTTGAGTGGGGACGAGACACAAAAAATTTTCGATCAGGTTTTGACAAATTTGGCCCGCTCCGCGCCGCCAATGCCAGGATTTCGTAG
GCAGAAAGGAGGTAGTACCATGATTTTCCTCCCCATACACGTCCCAAAAAGCTTCCTATTAGAAGTCCTTGGTGAGGAGCGTGTCACAAAGTTTGTCATACAAGAAATAT
TGAACTCAACCATGGCAGATTATGCAAAGAAGGCAATTCATTTTTTTCTGTTGACTCCAAGACAGGAAAATGTAAATGTGAAGGACAAGAAGGTTAACACAACACAAACA
GCAGATGAACTGAAACTGTTGTTTACTCCAGGGAAAGAGTTTGGATTCAATGCCATACTTGAGCTTGAATCTGCTAATTCAGAAGATGAA
mRNA sequenceShow/hide mRNA sequence
TCCAGCCGAAAAATGGCGTCCGCAACCGCAACGGTAACCAATCTCGCCTCAGAATACCGGCGGGCAAGATTTATCAAAGTTCCTGTTAAAGGCTATTGCCGCAGTGTTCT
GGCCTGTCCGAACTCCGGAGGCGTTGAATTCGATAATGTACAAAATGGGCTGTGCAGGAGGTCGTCATTTTCTTGTAGTCCTCACAAATTGGGTTCCAGATTTTTGTCCA
GACCGACTTCAATAGCTAGTTCAGGTAGTTTGGAGGCAGCCATCACAGATTACAAAGGCAATGCAATAGCCTTAAAAAATGCTAAGATAGTTGTAGAGTCTGAAGATGAA
AACAAGATACAGCTTCGAGTGGACTTGAGTGGGGACGAGACACAAAAAATTTTCGATCAGGTTTTGACAAATTTGGCCCGCTCCGCGCCGCCAATGCCAGGATTTCGTAG
GCAGAAAGGAGGTAGTACCATGATTTTCCTCCCCATACACGTCCCAAAAAGCTTCCTATTAGAAGTCCTTGGTGAGGAGCGTGTCACAAAGTTTGTCATACAAGAAATAT
TGAACTCAACCATGGCAGATTATGCAAAGAAGGCAATTCATTTTTTTCTGTTGACTCCAAGACAGGAAAATGTAAATGTGAAGGACAAGAAGGTTAACACAACACAAACA
GCAGATGAACTGAAACTGTTGTTTACTCCAGGGAAAGAGTTTGGATTCAATGCCATACTTGAGCTTGAATCTGCTAATTCAGAAGATGAA
Protein sequenceShow/hide protein sequence
SSRKMASATATVTNLASEYRRARFIKVPVKGYCRSVLACPNSGGVEFDNVQNGLCRRSSFSCSPHKLGSRFLSRPTSIASSGSLEAAITDYKGNAIALKNAKIVVESEDE
NKIQLRVDLSGDETQKIFDQVLTNLARSAPPMPGFRRQKGGSTMIFLPIHVPKSFLLEVLGEERVTKFVIQEILNSTMADYAKKAIHFFLLTPRQENVNVKDKKVNTTQT
ADELKLLFTPGKEFGFNAILELESANSEDE