CuGenDBv2

MC07g0073 (gene) of Bitter gourd (Dali-11) v1 genome

Gene ID: MC07g0073
Organism: Momordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Description: Trigger_N domain-containing protein
Genome location: MC07:952355..956782
RNA-Seq Expression: MC07g0073
Synteny: MC07g0073
Gene Ontology terms:
GO:0000413 - protein peptidyl-prolyl isomerization (biological process)
GO:0015031 - protein transport (biological process)
GO:0043335 - protein unfolding (biological process)
GO:0051083 - 'de novo' cotranslational protein folding (biological process)
GO:0061077 - chaperone-mediated protein folding (biological process)
GO:0003755 - peptidyl-prolyl cis-trans isomerase activity (molecular function)
GO:0043022 - ribosome binding (molecular function)
GO:0044183 - protein folding chaperone (molecular function)
InterPro domains:
IPR005215 - Trigger factor
IPR008881 - Trigger factor, ribosome-binding, bacterial
IPR036611 - Trigger factor ribosome-binding domain superfamily
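
The genome location field above uses a `chromosome:start..end` form. As a minimal sketch, it could be split into its parts as follows. The function name `parse_location` is hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re


def parse_location(loc: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    m = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", loc)
    if m is None:
        raise ValueError(f"unrecognized location string: {loc!r}")
    return m.group(1), int(m.group(2)), int(m.group(3))


chrom, start, end = parse_location("MC07:952355..956782")

# Assumption: 1-based inclusive coordinates, so the span is end - start + 1.
span = end - start + 1
print(chrom, start, end, span)
```

If the coordinates were 0-based or half-open instead, the span would differ by one.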


Homology
GenBank top hits (e value, % identity, alignment)
XP_008460020.1 PREDICTED: uncharacterized protein LOC103498959 [Cucumis melo] | e value: 1.52e-123 | % identity: 78.75
Query:  ASATATVTNLAS-EYRRASFIKVPVKGYCRSVLACSNSGGVEFDNVQNGLCRRSSFSCSPHKLGSRFLSRPTSIASSGLEAAITDYKGNAIALKNAKIVV
        A+ATATVTN+AS E+ R  F KV VKGYC + L C N GGVEFDNVQ+GLCRRSSFS S H++GSRFLS+PTSIASSGLEAA+TDYKGNAI LKNAK+VV
Subjt:  ASATATVTNLAS-EYRRASFIKVPVKGYCRSVLACSNSGGVEFDNVQNGLCRRSSFSCSPHKLGSRFLSRPTSIASSGLEAAITDYKGNAIALKNAKIVV

Query:  ESEDENLIQLRVDLSGDETQKIFDQVLTNLARSAPPMPGFRMQKGGKTSNVPKSFLLEVLGEERVTKFVIQEILNSTMADYAKKAKVHFFLLTPRQENVN
        ESE+EN IQLRVDL+GDETQK+FDQVLTNLARSAPPMPGFR QKGGKTSNVPKSFLLEVLG++RVTKFVIQEILNSTMADYAKK            EN+N
Subjt:  ESEDENLIQLRVDLSGDETQKIFDQVLTNLARSAPPMPGFRMQKGGKTSNVPKSFLLEVLGEERVTKFVIQEILNSTMADYAKKAKVHFFLLTPRQENVN

Query:  VKDKKVNTTQTADELKLLFTPGKEFGFNAILELESANSED
        VKD KVNTTQTADELK+LF PGKEFGFNAILELESA+  +
Subjt:  VKDKKVNTTQTADELKLLFTPGKEFGFNAILELESANSED

XP_011656737.1 uncharacterized protein LOC101212225 [Cucumis sativus] | e value: 8.89e-125 | % identity: 78.66
Query:  ASATATVTNLASEYRRASFIKVPVKGYCRSVLACSNSGGVEFDNVQNGLCRRSSFSCSPHKLGSRFLSRPTSIASSGLEAAITDYKGNAIALKNAKIVVE
        A+ATATVTN+ASE+ R  F KVPVKGYC + L C N GGVEFDNV++GLCRRSSFS S HK+GSRFLS+PTSIASSGLEAAITDYKGN I LKNAK+VVE
Subjt:  ASATATVTNLASEYRRASFIKVPVKGYCRSVLACSNSGGVEFDNVQNGLCRRSSFSCSPHKLGSRFLSRPTSIASSGLEAAITDYKGNAIALKNAKIVVE

Query:  SEDENLIQLRVDLSGDETQKIFDQVLTNLARSAPPMPGFRMQKGGKTSNVPKSFLLEVLGEERVTKFVIQEILNSTMADYAKKAKVHFFLLTPRQENVNV
        SE+EN IQLRVDL+GDETQK+FDQVLTNLARSAPPMPGFR QKGGKTSNVPKSFLLEVLG++RVTKF+IQEILNSTM DYAKK            EN+NV
Subjt:  SEDENLIQLRVDLSGDETQKIFDQVLTNLARSAPPMPGFRMQKGGKTSNVPKSFLLEVLGEERVTKFVIQEILNSTMADYAKKAKVHFFLLTPRQENVNV

Query:  KDKKVNTTQTADELKLLFTPGKEFGFNAILELESANSED
        KDKKV+TTQTADELK+LF PGKEFGFNAILELESA+  +
Subjt:  KDKKVNTTQTADELKLLFTPGKEFGFNAILELESANSED

XP_022155406.1 uncharacterized protein LOC111022553 [Momordica charantia] | e value: 1.22e-151 | % identity: 95
Query:  MASATATVTNLASEYRRASFIKVPVKGYCRSVLACSNSGGVEFDNVQNGLCRRSSFSCSPHKLGSRFLSRPTSIASSGLEAAITDYKGNAIALKNAKIVV
        MASATATVTNLASEYRRASFIKVPVKGYCRSVLACSNSGGVEFDNVQNGLCRRSSFSCSPHKLGSRFLSRPTSIASSGLEAAITDYKGNAIALKNAKIVV
Subjt:  MASATATVTNLASEYRRASFIKVPVKGYCRSVLACSNSGGVEFDNVQNGLCRRSSFSCSPHKLGSRFLSRPTSIASSGLEAAITDYKGNAIALKNAKIVV

Query:  ESEDENLIQLRVDLSGDETQKIFDQVLTNLARSAPPMPGFRMQKGGKTSNVPKSFLLEVLGEERVTKFVIQEILNSTMADYAKKAKVHFFLLTPRQENVN
        ESEDENLIQLRVDLSGDETQKIFDQVLTNLARSAPPMPGFRMQKGGKTSNVPKSFLLEVLGEERVTKFVIQEILNSTMADYAKK            ENVN
Subjt:  ESEDENLIQLRVDLSGDETQKIFDQVLTNLARSAPPMPGFRMQKGGKTSNVPKSFLLEVLGEERVTKFVIQEILNSTMADYAKKAKVHFFLLTPRQENVN

Query:  VKDKKVNTTQTADELKLLFTPGKEFGFNAILELESANSED
        VKDKKVNTTQTADELKLLFTPGKEFGFNAILELESANSED
Subjt:  VKDKKVNTTQTADELKLLFTPGKEFGFNAILELESANSED

XP_023513632.1 uncharacterized protein LOC111778179 isoform X2 [Cucurbita pepo subsp. pepo] | e value: 1.95e-113 | % identity: 77.78
Query:  MASATATVTNLASEYRRASFIKVPVKGYCRSVLACSNSGGVEFDNVQNGLCRRSSFSCSPHKLGSRFLSRPTSIASSGLEAAITDYKGNAIALKNAKIVV
        MASATATV N+A E+RR  F KVPV G  R+ L C N GGVEF+NVQ+ LC RSS S SP  +GSRFLSRPT+IASSGLEAAITDYKG AI LKNAKIVV
Subjt:  MASATATVTNLASEYRRASFIKVPVKGYCRSVLACSNSGGVEFDNVQNGLCRRSSFSCSPHKLGSRFLSRPTSIASSGLEAAITDYKGNAIALKNAKIVV

Query:  ESEDENLIQLRVDLSGDETQKIFDQVLTNLARSAPPMPGFRMQKGGKTSNVPKSFLLEVLGEERVTKFVIQEILNSTMADYAKKAKVHFFLLTPRQENVN
        ESEDE++IQLRVDL+GDETQK+FDQVLTNLARSAPPMPGFR QKGGKTSNVPKSFLLEVLG++RVTKFVIQEILNSTMADYAKKA            N+ 
Subjt:  ESEDENLIQLRVDLSGDETQKIFDQVLTNLARSAPPMPGFRMQKGGKTSNVPKSFLLEVLGEERVTKFVIQEILNSTMADYAKKAKVHFFLLTPRQENVN

Query:  VKDKKVNTTQTADELKLLFTPGKEFGFNAILELE
        VKDK VNTTQT DELK+LF PGKEFGFNAILELE
Subjt:  VKDKKVNTTQTADELKLLFTPGKEFGFNAILELE

XP_038906537.1 uncharacterized protein LOC120092511 [Benincasa hispida] | e value: 2.49e-126 | % identity: 79.92
Query:  ASATATVTNLASEYRRASFIKVPVKGYCRSVLACSNSGGVEFDNVQNGLCRRSSFSCSPHKLGSRFLSRPTSIASSGLEAAITDYKGNAIALKNAKIVVE
        A+ATAT TN+ASE+R   F +VPVKGYCR+ L C N GGVEFDNVQ+GLCRRSSFS S HK+GSRFLS+PTSIASSGLEAAITDYKGNAI LKNAK+VVE
Subjt:  ASATATVTNLASEYRRASFIKVPVKGYCRSVLACSNSGGVEFDNVQNGLCRRSSFSCSPHKLGSRFLSRPTSIASSGLEAAITDYKGNAIALKNAKIVVE

Query:  SEDENLIQLRVDLSGDETQKIFDQVLTNLARSAPPMPGFRMQKGGKTSNVPKSFLLEVLGEERVTKFVIQEILNSTMADYAKKAKVHFFLLTPRQENVNV
        SE+EN IQLRVDL+GDETQK+FDQVLTNLARSAPPMPGFR QKGGKTSNVPKSFLLEVLG++RV KFVIQEILNSTMADYAKK            EN+NV
Subjt:  SEDENLIQLRVDLSGDETQKIFDQVLTNLARSAPPMPGFRMQKGGKTSNVPKSFLLEVLGEERVTKFVIQEILNSTMADYAKKAKVHFFLLTPRQENVNV

Query:  KDKKVNTTQTADELKLLFTPGKEFGFNAILELESANSED
        KDKKVNTTQTADELK+LF+PGKEFGFNAILELESA+  +
Subjt:  KDKKVNTTQTADELKLLFTPGKEFGFNAILELESANSED

TrEMBL top hits (e value, % identity, alignment)
A0A0A0K988 Trigger_N domain-containing protein | e value: 4.31e-125 | % identity: 78.66
Query:  ASATATVTNLASEYRRASFIKVPVKGYCRSVLACSNSGGVEFDNVQNGLCRRSSFSCSPHKLGSRFLSRPTSIASSGLEAAITDYKGNAIALKNAKIVVE
        A+ATATVTN+ASE+ R  F KVPVKGYC + L C N GGVEFDNV++GLCRRSSFS S HK+GSRFLS+PTSIASSGLEAAITDYKGN I LKNAK+VVE
Subjt:  ASATATVTNLASEYRRASFIKVPVKGYCRSVLACSNSGGVEFDNVQNGLCRRSSFSCSPHKLGSRFLSRPTSIASSGLEAAITDYKGNAIALKNAKIVVE

Query:  SEDENLIQLRVDLSGDETQKIFDQVLTNLARSAPPMPGFRMQKGGKTSNVPKSFLLEVLGEERVTKFVIQEILNSTMADYAKKAKVHFFLLTPRQENVNV
        SE+EN IQLRVDL+GDETQK+FDQVLTNLARSAPPMPGFR QKGGKTSNVPKSFLLEVLG++RVTKF+IQEILNSTM DYAKK            EN+NV
Subjt:  SEDENLIQLRVDLSGDETQKIFDQVLTNLARSAPPMPGFRMQKGGKTSNVPKSFLLEVLGEERVTKFVIQEILNSTMADYAKKAKVHFFLLTPRQENVNV

Query:  KDKKVNTTQTADELKLLFTPGKEFGFNAILELESANSED
        KDKKV+TTQTADELK+LF PGKEFGFNAILELESA+  +
Subjt:  KDKKVNTTQTADELKLLFTPGKEFGFNAILELESANSED

A0A1S3CBN2 uncharacterized protein LOC103498959 | e value: 7.37e-124 | % identity: 78.75
Query:  ASATATVTNLAS-EYRRASFIKVPVKGYCRSVLACSNSGGVEFDNVQNGLCRRSSFSCSPHKLGSRFLSRPTSIASSGLEAAITDYKGNAIALKNAKIVV
        A+ATATVTN+AS E+ R  F KV VKGYC + L C N GGVEFDNVQ+GLCRRSSFS S H++GSRFLS+PTSIASSGLEAA+TDYKGNAI LKNAK+VV
Subjt:  ASATATVTNLAS-EYRRASFIKVPVKGYCRSVLACSNSGGVEFDNVQNGLCRRSSFSCSPHKLGSRFLSRPTSIASSGLEAAITDYKGNAIALKNAKIVV

Query:  ESEDENLIQLRVDLSGDETQKIFDQVLTNLARSAPPMPGFRMQKGGKTSNVPKSFLLEVLGEERVTKFVIQEILNSTMADYAKKAKVHFFLLTPRQENVN
        ESE+EN IQLRVDL+GDETQK+FDQVLTNLARSAPPMPGFR QKGGKTSNVPKSFLLEVLG++RVTKFVIQEILNSTMADYAKK            EN+N
Subjt:  ESEDENLIQLRVDLSGDETQKIFDQVLTNLARSAPPMPGFRMQKGGKTSNVPKSFLLEVLGEERVTKFVIQEILNSTMADYAKKAKVHFFLLTPRQENVN

Query:  VKDKKVNTTQTADELKLLFTPGKEFGFNAILELESANSED
        VKD KVNTTQTADELK+LF PGKEFGFNAILELESA+  +
Subjt:  VKDKKVNTTQTADELKLLFTPGKEFGFNAILELESANSED

A0A6J1DRL2 uncharacterized protein LOC111022553 | e value: 5.92e-152 | % identity: 95
Query:  MASATATVTNLASEYRRASFIKVPVKGYCRSVLACSNSGGVEFDNVQNGLCRRSSFSCSPHKLGSRFLSRPTSIASSGLEAAITDYKGNAIALKNAKIVV
        MASATATVTNLASEYRRASFIKVPVKGYCRSVLACSNSGGVEFDNVQNGLCRRSSFSCSPHKLGSRFLSRPTSIASSGLEAAITDYKGNAIALKNAKIVV
Subjt:  MASATATVTNLASEYRRASFIKVPVKGYCRSVLACSNSGGVEFDNVQNGLCRRSSFSCSPHKLGSRFLSRPTSIASSGLEAAITDYKGNAIALKNAKIVV

Query:  ESEDENLIQLRVDLSGDETQKIFDQVLTNLARSAPPMPGFRMQKGGKTSNVPKSFLLEVLGEERVTKFVIQEILNSTMADYAKKAKVHFFLLTPRQENVN
        ESEDENLIQLRVDLSGDETQKIFDQVLTNLARSAPPMPGFRMQKGGKTSNVPKSFLLEVLGEERVTKFVIQEILNSTMADYAKK            ENVN
Subjt:  ESEDENLIQLRVDLSGDETQKIFDQVLTNLARSAPPMPGFRMQKGGKTSNVPKSFLLEVLGEERVTKFVIQEILNSTMADYAKKAKVHFFLLTPRQENVN

Query:  VKDKKVNTTQTADELKLLFTPGKEFGFNAILELESANSED
        VKDKKVNTTQTADELKLLFTPGKEFGFNAILELESANSED
Subjt:  VKDKKVNTTQTADELKLLFTPGKEFGFNAILELESANSED

A0A6J1H8V8 uncharacterized protein LOC111461115 | e value: 1.45e-112 | % identity: 77.78
Query:  MASATATVTNLASEYRRASFIKVPVKGYCRSVLACSNSGGVEFDNVQNGLCRRSSFSCSPHKLGSRFLSRPTSIASSGLEAAITDYKGNAIALKNAKIVV
        MASATATV N+ASE+RR  F KVPV G  R+ L C N GGVEF+NVQ+ LC RSS S SP  +GSRFLSRPT+IASSGLEAAITDYKG AI LKNAKIVV
Subjt:  MASATATVTNLASEYRRASFIKVPVKGYCRSVLACSNSGGVEFDNVQNGLCRRSSFSCSPHKLGSRFLSRPTSIASSGLEAAITDYKGNAIALKNAKIVV

Query:  ESEDENLIQLRVDLSGDETQKIFDQVLTNLARSAPPMPGFRMQKGGKTSNVPKSFLLEVLGEERVTKFVIQEILNSTMADYAKKAKVHFFLLTPRQENVN
        ESEDEN IQLRVDL+GDETQK+FDQVLTNLARSAP MPGFR QKGGKTSNVPKSFLLEVLG++RVTKFVIQEILNSTM DYAKK            EN+ 
Subjt:  ESEDENLIQLRVDLSGDETQKIFDQVLTNLARSAPPMPGFRMQKGGKTSNVPKSFLLEVLGEERVTKFVIQEILNSTMADYAKKAKVHFFLLTPRQENVN

Query:  VKDKKVNTTQTADELKLLFTPGKEFGFNAILELE
        VKDK VNTTQT DELK LF PGKEFGFNAILELE
Subjt:  VKDKKVNTTQTADELKLLFTPGKEFGFNAILELE

A0A6J1KVH6 uncharacterized protein LOC111497577 | e value: 1.81e-111 | % identity: 77.25
Query:  ASATATVTNLASEYRRASFIKVPVKGYCRSVLACSNSGGVEFDNVQNGLCRRSSFSCSPHKLGSRFLSRPTSIASSGLEAAITDYKGNAIALKNAKIVVE
        A+ATATV N+ASE+RR  F KVPV G  R+ L C N GGVEF+NVQ+ LC RSS S S   +GSRFLSRPT+IASSGLEAAITDYKG AI LKNAKIVVE
Subjt:  ASATATVTNLASEYRRASFIKVPVKGYCRSVLACSNSGGVEFDNVQNGLCRRSSFSCSPHKLGSRFLSRPTSIASSGLEAAITDYKGNAIALKNAKIVVE

Query:  SEDENLIQLRVDLSGDETQKIFDQVLTNLARSAPPMPGFRMQKGGKTSNVPKSFLLEVLGEERVTKFVIQEILNSTMADYAKKAKVHFFLLTPRQENVNV
        SEDEN IQLRV+L+GDETQK+FDQVLTNLARSAPPMPGFR QKGGKTSNVPKSFLLEVLG++RVTKFVIQEILNSTMADYAKK            EN+ V
Subjt:  SEDENLIQLRVDLSGDETQKIFDQVLTNLARSAPPMPGFRMQKGGKTSNVPKSFLLEVLGEERVTKFVIQEILNSTMADYAKKAKVHFFLLTPRQENVNV

Query:  KDKKVNTTQTADELKLLFTPGKEFGFNAILELE
        KDK VNTTQT DELK+LF PGKEFGFNAILELE
Subjt:  KDKKVNTTQTADELKLLFTPGKEFGFNAILELE

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT2G30695.1 FUNCTIONS IN: molecular_function unknown; INVOLVED IN: protein folding, protein transport; LOCATED IN: chloroplast stroma, chloroplast; EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Trigger factor, ribosome-binding, bacterial (InterPro:IPR008881); Has 253 Blast hits to 253 proteins in 72 species: Archae - 0; Bacteria - 138; Metazoa - 0; Fungi - 0; Plants - 40; Viruses - 0; Other Eukaryotes - 75 (source: NCBI BLink). | e value: 2.3e-22 | % identity: 38.41
Query:  VESEDENLIQLRVDLSGDETQKIFDQVLTNLARSAPPMPGFRMQKGGKTSNVPKSFLLEVLGEERVTKFVIQEILNSTMADYAKKAKVHFFLLTPRQENV
        VE+E  N +++ V +SG++TQ +F+ V   +  +A P+PGFR  KGGKT N+PK  LLE+LG  +V K VI++++NS + DY K            QE++
Subjt:  VESEDENLIQLRVDLSGDETQKIFDQVLTNLARSAPPMPGFRMQKGGKTSNVPKSFLLEVLGEERVTKFVIQEILNSTMADYAKKAKVHFFLLTPRQENV

Query:  NVKDKKVNTTQTADELKLLFTPGKEFGFNAILELESAN
         V  K++   Q+ ++L+  F PG+ F F+A ++L+ A+
Subjt:  NVKDKKVNTTQTADELKLLFTPGKEFGFNAILELESAN

AT2G30695.2 FUNCTIONS IN: molecular_function unknown; INVOLVED IN: protein folding, protein transport; LOCATED IN: chloroplast; EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Trigger factor, ribosome-binding, bacterial (InterPro:IPR008881); Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink). | e value: 2.3e-22 | % identity: 38.41
Query:  VESEDENLIQLRVDLSGDETQKIFDQVLTNLARSAPPMPGFRMQKGGKTSNVPKSFLLEVLGEERVTKFVIQEILNSTMADYAKKAKVHFFLLTPRQENV
        VE+E  N +++ V +SG++TQ +F+ V   +  +A P+PGFR  KGGKT N+PK  LLE+LG  +V K VI++++NS + DY K            QE++
Subjt:  VESEDENLIQLRVDLSGDETQKIFDQVLTNLARSAPPMPGFRMQKGGKTSNVPKSFLLEVLGEERVTKFVIQEILNSTMADYAKKAKVHFFLLTPRQENV

Query:  NVKDKKVNTTQTADELKLLFTPGKEFGFNAILELESAN
         V  K++   Q+ ++L+  F PG+ F F+A ++L+ A+
Subjt:  NVKDKKVNTTQTADELKLLFTPGKEFGFNAILELESAN


Sequences
CDS sequence
ATGGCGTCCGCAACCGCAACGGTAACCAATCTCGCCTCAGAATACCGGCGGGCAAGCTTTATCAAAGTTCCTGTTAAAGGCTACTGCCGCAGTGTTCTGGCCTGTTCGAA
CTCCGGAGGCGTTGAATTCGATAATGTACAAAATGGGCTGTGCAGGAGGTCGTCATTTTCTTGTAGTCCTCACAAATTGGGTTCCAGATTTTTGTCCAGACCGACTTCAA
TAGCTAGTTCAGGTTTGGAGGCAGCCATCACAGATTACAAAGGCAATGCAATAGCCTTAAAAAATGCTAAGATAGTTGTAGAGTCTGAAGATGAAAACTTGATACAGCTT
CGAGTGGACTTGAGTGGGGACGAGACACAAAAAATTTTCGATCAGGTTTTGACAAATTTGGCCCGTTCCGCACCGCCAATGCCAGGATTTCGTATGCAAAAAGGAGGGAA
AACATCAAATGTCCCAAAAAGCTTCCTATTAGAAGTCCTTGGTGAGGAGCGTGTCACAAAGTTTGTCATACAAGAAATATTGAACTCAACCATGGCAGATTATGCAAAGA
AGGCAAAAGTTCATTTTTTTCTGTTGACTCCAAGACAGGAAAATGTAAATGTGAAGGACAAGAAGGTTAACACAACACAAACAGCAGATGAACTGAAACTGTTGTTTACT
CCAGGAAAAGAGTTTGGATTCAATGCCATACTTGAGCTTGAATCTGCTAATTCAGAAGAT
mRNA sequence
CGACAATAGAGGAAAGGAGGGAAAGAAACCAACCGAGAACGACGTCATATTGGGACTCGGATAAGTCTCTGCTCTGCGAGAACTGAGAACACAAGAGAGAAACTTCCAGC
CGAAAAATGGCGTCCGCAACCGCAACGGTAACCAATCTCGCCTCAGAATACCGGCGGGCAAGCTTTATCAAAGTTCCTGTTAAAGGCTACTGCCGCAGTGTTCTGGCCTG
TTCGAACTCCGGAGGCGTTGAATTCGATAATGTACAAAATGGGCTGTGCAGGAGGTCGTCATTTTCTTGTAGTCCTCACAAATTGGGTTCCAGATTTTTGTCCAGACCGA
CTTCAATAGCTAGTTCAGGTTTGGAGGCAGCCATCACAGATTACAAAGGCAATGCAATAGCCTTAAAAAATGCTAAGATAGTTGTAGAGTCTGAAGATGAAAACTTGATA
CAGCTTCGAGTGGACTTGAGTGGGGACGAGACACAAAAAATTTTCGATCAGGTTTTGACAAATTTGGCCCGTTCCGCACCGCCAATGCCAGGATTTCGTATGCAAAAAGG
AGGGAAAACATCAAATGTCCCAAAAAGCTTCCTATTAGAAGTCCTTGGTGAGGAGCGTGTCACAAAGTTTGTCATACAAGAAATATTGAACTCAACCATGGCAGATTATG
CAAAGAAGGCAAAAGTTCATTTTTTTCTGTTGACTCCAAGACAGGAAAATGTAAATGTGAAGGACAAGAAGGTTAACACAACACAAACAGCAGATGAACTGAAACTGTTG
TTTACTCCAGGAAAAGAGTTTGGATTCAATGCCATACTTGAGCTTGAATCTGCTAATTCAGAAGAT
Protein sequence
MASATATVTNLASEYRRASFIKVPVKGYCRSVLACSNSGGVEFDNVQNGLCRRSSFSCSPHKLGSRFLSRPTSIASSGLEAAITDYKGNAIALKNAKIVVESEDENLIQL
RVDLSGDETQKIFDQVLTNLARSAPPMPGFRMQKGGKTSNVPKSFLLEVLGEERVTKFVIQEILNSTMADYAKKAKVHFFLLTPRQENVNVKDKKVNTTQTADELKLLFT
PGKEFGFNAILELESANSED
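
The CDS and protein sequences listed above can be cross-checked by translating the CDS. The following is a minimal sketch that assumes the standard nuclear genetic code (NCBI translation table 1); the record does not say which table was used. The helper name `translate` is hypothetical, and only the first 33 nucleotides of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code (assumed: NCBI translation table 1),
# with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence codon by codon; 'X' marks unknown codons."""
    cds = cds.upper().replace("\n", "").replace(" ", "")
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    return "".join(
        CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, usable, 3)
    )


# First 33 nucleotides of the CDS sequence listed in this record.
cds_prefix = "ATGGCGTCCGCAACCGCAACGGTAACCAATCTC"
print(translate(cds_prefix))  # MASATATVTNL
```

The translated prefix matches the first 11 residues of the listed protein sequence. To check the whole record, join the wrapped CDS lines into one string before passing them to `translate`.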