; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc07g01730 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc07g01730
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionNatterin-3 like
Genome locationchr7:1366746..1367930
RNA-Seq ExpressionMoc07g01730
SyntenyMoc07g01730
Gene Ontology termsNA
InterPro domainsIPR004991 - Aerolysin-like toxin
IPR008998 - Agglutinin domain
IPR036242 - Agglutinin domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022155413.1 uncharacterized protein LOC111022561 [Momordica charantia]2.2e-10867.94Show/hide
Query:  DDEIYALSTLVDWDSLFILPKHVAFKGDNGQYLSFSSPYLKFSNSDVKDSSVVQEIFPTKDGNIRIKNDDSGKFWYRDPNWICATST-VKSDNPNNLFWP
        +D   +L  +VDWDS++ILP+HVAFKG+NG+YL F  P L+FS+SD+KDSSV  EIFPTKDGNI I++D+SGKFW RDP+WI A S    S++PN LFWP
Subjt:  DDEIYALSTLVDWDSLFILPKHVAFKGDNGQYLSFSSPYLKFSNSDVKDSSVVQEIFPTKDGNIRIKNDDSGKFWYRDPNWICATST-VKSDNPNNLFWP

Query:  VKIDNQNLALRNLGNNHFCQRLTIEGKTSCLNAGAVTITKESRMEVVEIVISRTIDNVEYRLNDARIYGQKVVSMAKGTAVNKTKETDIVTFQFKYEEKN
        VK+++  +ALRNLGN+ FC+RLTIEGK  CLNA A ++T E+RM V EIV+SRTIDNVEYRLNDARIYGQK+VSMAKG A+N TKETDIVTF+F YE K 
Subjt:  VKIDNQNLALRNLGNNHFCQRLTIEGKTSCLNAGAVTITKESRMEVVEIVISRTIDNVEYRLNDARIYGQKVVSMAKGTAVNKTKETDIVTFQFKYEEKN

Query:  KKNWSSSVSSKIGVSVTTNFQVGVPMVAKGKIEVSAEMETEYEWGETHKNKNVKEFTYPVIVPPMSKVKIHAVIKQGMCQVPFSYHR
        K NW+S++S+ IG  VTT FQ GVP+V KGKIEVSAE+ + YEWGETHK+KN  E  YPV VPP+S+VKI+AV+KQGMCQVPFSY R
Subjt:  KKNWSSSVSSKIGVSVTTNFQVGVPMVAKGKIEVSAEMETEYEWGETHKNKNVKEFTYPVIVPPMSKVKIHAVIKQGMCQVPFSYHR

XP_022157620.1 uncharacterized protein LOC111024281 isoform X1 [Momordica charantia]4.0e-13181.7Show/hide
Query:  IVAKAEREVELLPMVTIMKADDEIYALSTLVDWDSLFILPKHVAFKGDNGQYLSFSSPYLKFSNSDVKDSSVVQEIFPTKDGNIRIKNDDSGKFWYRDPN
        IVAKAEREVE+LPMVTI KADDEIYALSTLVDWDSLFILPKHVAFKGDNGQYLSFSSPYLKFS+SDVK+SSVVQEIFPTKDGNI IKNDDS         
Subjt:  IVAKAEREVELLPMVTIMKADDEIYALSTLVDWDSLFILPKHVAFKGDNGQYLSFSSPYLKFSNSDVKDSSVVQEIFPTKDGNIRIKNDDSGKFWYRDPN

Query:  WICATSTVKSDNPNNLFWPVKIDNQNLALRNLGNNHFCQRLTIEGKTSCLNAGAVTITKESRMEVVEIVISRTIDNVEYRLNDARIYGQKVVSMAKGTAV
                                        GNNHFCQRLTIEGKTSCL+AGA+TITKESRMEVVE+VISRTIDNVEYR+NDA+IYGQKVVSM KGTAV
Subjt:  WICATSTVKSDNPNNLFWPVKIDNQNLALRNLGNNHFCQRLTIEGKTSCLNAGAVTITKESRMEVVEIVISRTIDNVEYRLNDARIYGQKVVSMAKGTAV

Query:  NKTKETDIVTFQFKYEEKNKKNWSSSVSSKIGVSVTTNFQVGVPMVAKGKIEVSAEMETEYEWGETHKNKNVKEFTYPVIVPPMSKVKIHAVIKQGMCQV
        NKTKETD VTFQFKYEEKNKKNWSSSVSSKIGVSVTTNFQVGVPMVAKGKIEVSAEMETEYEWGETHK+KNVKEFTYPVIVPPMSKVKIHA+IKQGMCQV
Subjt:  NKTKETDIVTFQFKYEEKNKKNWSSSVSSKIGVSVTTNFQVGVPMVAKGKIEVSAEMETEYEWGETHKNKNVKEFTYPVIVPPMSKVKIHAVIKQGMCQV

Query:  PFSYHR
        PFSY R
Subjt:  PFSYHR

XP_022157621.1 uncharacterized protein LOC111024281 isoform X2 [Momordica charantia]9.6e-12578.76Show/hide
Query:  IVAKAEREVELLPMVTIMKADDEIYALSTLVDWDSLFILPKHVAFKGDNGQYLSFSSPYLKFSNSDVKDSSVVQEIFPTKDGNIRIKNDDSGKFWYRDPN
        IVAKAEREVE+LPMVTI KADDEIYALSTLVDWDSLFILPKHVAFKGDNGQYLSFSSPYLKFS+SDVK+SSVVQEIFPTKD                   
Subjt:  IVAKAEREVELLPMVTIMKADDEIYALSTLVDWDSLFILPKHVAFKGDNGQYLSFSSPYLKFSNSDVKDSSVVQEIFPTKDGNIRIKNDDSGKFWYRDPN

Query:  WICATSTVKSDNPNNLFWPVKIDNQNLALRNLGNNHFCQRLTIEGKTSCLNAGAVTITKESRMEVVEIVISRTIDNVEYRLNDARIYGQKVVSMAKGTAV
                                        GNNHFCQRLTIEGKTSCL+AGA+TITKESRMEVVE+VISRTIDNVEYR+NDA+IYGQKVVSM KGTAV
Subjt:  WICATSTVKSDNPNNLFWPVKIDNQNLALRNLGNNHFCQRLTIEGKTSCLNAGAVTITKESRMEVVEIVISRTIDNVEYRLNDARIYGQKVVSMAKGTAV

Query:  NKTKETDIVTFQFKYEEKNKKNWSSSVSSKIGVSVTTNFQVGVPMVAKGKIEVSAEMETEYEWGETHKNKNVKEFTYPVIVPPMSKVKIHAVIKQGMCQV
        NKTKETD VTFQFKYEEKNKKNWSSSVSSKIGVSVTTNFQVGVPMVAKGKIEVSAEMETEYEWGETHK+KNVKEFTYPVIVPPMSKVKIHA+IKQGMCQV
Subjt:  NKTKETDIVTFQFKYEEKNKKNWSSSVSSKIGVSVTTNFQVGVPMVAKGKIEVSAEMETEYEWGETHKNKNVKEFTYPVIVPPMSKVKIHAVIKQGMCQV

Query:  PFSYHR
        PFSY R
Subjt:  PFSYHR

XP_022157622.1 uncharacterized protein LOC111024282 [Momordica charantia]8.7e-13477.35Show/hide
Query:  IVAKAEREVELLPMVTIM--KADDEIYALSTLVDWDSLFILPKHVAFKGDNGQYLSFSSPYLKFSNSDVKDSSVVQEIFPTKDGNIRIKNDDSGKFWYRD
        +VAKA+R V ++P++  +  KADD+IYALST+VDWDS+FILPKHVAFKGDNGQYLSFSSPYLKFS+S + DSSVVQEIFPTKDGNIRIKND+SG+FWYRD
Subjt:  IVAKAEREVELLPMVTIM--KADDEIYALSTLVDWDSLFILPKHVAFKGDNGQYLSFSSPYLKFSNSDVKDSSVVQEIFPTKDGNIRIKNDDSGKFWYRD

Query:  PNWICATST-VKSDNPNNLFWPVKIDNQNLALRNLGNNHFCQRLTIEGKTSCLNAGAVTITKESRMEVVEIVISRTIDNVEYRLNDARIYGQKVVSMAKG
        PNWI ATST  KS+NPNNLFWP KID++ LALRNLGNNHFC RL++EGKT CLN+  V IT E+ ++  E VIS+TIDNVEYRLNDARIYGQKVVSMAKG
Subjt:  PNWICATST-VKSDNPNNLFWPVKIDNQNLALRNLGNNHFCQRLTIEGKTSCLNAGAVTITKESRMEVVEIVISRTIDNVEYRLNDARIYGQKVVSMAKG

Query:  TAVNKTKETDIVTFQFKYEEKNKKNWSSSVSSKIGVSVTTNFQVGVPMVAKGKIEVSAEMETEYEWGETHKNKNVKEFTYPVIVPPMSKVKIHAVIKQGM
         AVNKTKETD VTF+F YEEK KKNWSSSVSSKIGVSV++ FQVG+P V +GKIEV AEM  EYEWG THK+KNV+EFTYPV VPPMS+VKI+AVIKQG+
Subjt:  TAVNKTKETDIVTFQFKYEEKNKKNWSSSVSSKIGVSVTTNFQVGVPMVAKGKIEVSAEMETEYEWGETHKNKNVKEFTYPVIVPPMSKVKIHAVIKQGM

Query:  CQVPFSYHR
        CQVPFSY R
Subjt:  CQVPFSYHR

XP_022157628.1 uncharacterized protein LOC111024289 [Momordica charantia]1.5e-194100Show/hide
Query:  MAVLHVSRVFFPPNSIVAKAEREVELLPMVTIMKADDEIYALSTLVDWDSLFILPKHVAFKGDNGQYLSFSSPYLKFSNSDVKDSSVVQEIFPTKDGNIR
        MAVLHVSRVFFPPNSIVAKAEREVELLPMVTIMKADDEIYALSTLVDWDSLFILPKHVAFKGDNGQYLSFSSPYLKFSNSDVKDSSVVQEIFPTKDGNIR
Subjt:  MAVLHVSRVFFPPNSIVAKAEREVELLPMVTIMKADDEIYALSTLVDWDSLFILPKHVAFKGDNGQYLSFSSPYLKFSNSDVKDSSVVQEIFPTKDGNIR

Query:  IKNDDSGKFWYRDPNWICATSTVKSDNPNNLFWPVKIDNQNLALRNLGNNHFCQRLTIEGKTSCLNAGAVTITKESRMEVVEIVISRTIDNVEYRLNDAR
        IKNDDSGKFWYRDPNWICATSTVKSDNPNNLFWPVKIDNQNLALRNLGNNHFCQRLTIEGKTSCLNAGAVTITKESRMEVVEIVISRTIDNVEYRLNDAR
Subjt:  IKNDDSGKFWYRDPNWICATSTVKSDNPNNLFWPVKIDNQNLALRNLGNNHFCQRLTIEGKTSCLNAGAVTITKESRMEVVEIVISRTIDNVEYRLNDAR

Query:  IYGQKVVSMAKGTAVNKTKETDIVTFQFKYEEKNKKNWSSSVSSKIGVSVTTNFQVGVPMVAKGKIEVSAEMETEYEWGETHKNKNVKEFTYPVIVPPMS
        IYGQKVVSMAKGTAVNKTKETDIVTFQFKYEEKNKKNWSSSVSSKIGVSVTTNFQVGVPMVAKGKIEVSAEMETEYEWGETHKNKNVKEFTYPVIVPPMS
Subjt:  IYGQKVVSMAKGTAVNKTKETDIVTFQFKYEEKNKKNWSSSVSSKIGVSVTTNFQVGVPMVAKGKIEVSAEMETEYEWGETHKNKNVKEFTYPVIVPPMS

Query:  KVKIHAVIKQGMCQVPFSYHREPMFLGRDNKSFIIYMMDFSLV
        KVKIHAVIKQGMCQVPFSYHREPMFLGRDNKSFIIYMMDFSLV
Subjt:  KVKIHAVIKQGMCQVPFSYHREPMFLGRDNKSFIIYMMDFSLV

TrEMBL top hitse value%identityAlignment
A0A6J1DMW2 uncharacterized protein LOC1110225611.0e-10867.94Show/hide
Query:  DDEIYALSTLVDWDSLFILPKHVAFKGDNGQYLSFSSPYLKFSNSDVKDSSVVQEIFPTKDGNIRIKNDDSGKFWYRDPNWICATST-VKSDNPNNLFWP
        +D   +L  +VDWDS++ILP+HVAFKG+NG+YL F  P L+FS+SD+KDSSV  EIFPTKDGNI I++D+SGKFW RDP+WI A S    S++PN LFWP
Subjt:  DDEIYALSTLVDWDSLFILPKHVAFKGDNGQYLSFSSPYLKFSNSDVKDSSVVQEIFPTKDGNIRIKNDDSGKFWYRDPNWICATST-VKSDNPNNLFWP

Query:  VKIDNQNLALRNLGNNHFCQRLTIEGKTSCLNAGAVTITKESRMEVVEIVISRTIDNVEYRLNDARIYGQKVVSMAKGTAVNKTKETDIVTFQFKYEEKN
        VK+++  +ALRNLGN+ FC+RLTIEGK  CLNA A ++T E+RM V EIV+SRTIDNVEYRLNDARIYGQK+VSMAKG A+N TKETDIVTF+F YE K 
Subjt:  VKIDNQNLALRNLGNNHFCQRLTIEGKTSCLNAGAVTITKESRMEVVEIVISRTIDNVEYRLNDARIYGQKVVSMAKGTAVNKTKETDIVTFQFKYEEKN

Query:  KKNWSSSVSSKIGVSVTTNFQVGVPMVAKGKIEVSAEMETEYEWGETHKNKNVKEFTYPVIVPPMSKVKIHAVIKQGMCQVPFSYHR
        K NW+S++S+ IG  VTT FQ GVP+V KGKIEVSAE+ + YEWGETHK+KN  E  YPV VPP+S+VKI+AV+KQGMCQVPFSY R
Subjt:  KKNWSSSVSSKIGVSVTTNFQVGVPMVAKGKIEVSAEMETEYEWGETHKNKNVKEFTYPVIVPPMSKVKIHAVIKQGMCQVPFSYHR

A0A6J1DTL3 uncharacterized protein LOC111024281 isoform X12.0e-13181.7Show/hide
Query:  IVAKAEREVELLPMVTIMKADDEIYALSTLVDWDSLFILPKHVAFKGDNGQYLSFSSPYLKFSNSDVKDSSVVQEIFPTKDGNIRIKNDDSGKFWYRDPN
        IVAKAEREVE+LPMVTI KADDEIYALSTLVDWDSLFILPKHVAFKGDNGQYLSFSSPYLKFS+SDVK+SSVVQEIFPTKDGNI IKNDDS         
Subjt:  IVAKAEREVELLPMVTIMKADDEIYALSTLVDWDSLFILPKHVAFKGDNGQYLSFSSPYLKFSNSDVKDSSVVQEIFPTKDGNIRIKNDDSGKFWYRDPN

Query:  WICATSTVKSDNPNNLFWPVKIDNQNLALRNLGNNHFCQRLTIEGKTSCLNAGAVTITKESRMEVVEIVISRTIDNVEYRLNDARIYGQKVVSMAKGTAV
                                        GNNHFCQRLTIEGKTSCL+AGA+TITKESRMEVVE+VISRTIDNVEYR+NDA+IYGQKVVSM KGTAV
Subjt:  WICATSTVKSDNPNNLFWPVKIDNQNLALRNLGNNHFCQRLTIEGKTSCLNAGAVTITKESRMEVVEIVISRTIDNVEYRLNDARIYGQKVVSMAKGTAV

Query:  NKTKETDIVTFQFKYEEKNKKNWSSSVSSKIGVSVTTNFQVGVPMVAKGKIEVSAEMETEYEWGETHKNKNVKEFTYPVIVPPMSKVKIHAVIKQGMCQV
        NKTKETD VTFQFKYEEKNKKNWSSSVSSKIGVSVTTNFQVGVPMVAKGKIEVSAEMETEYEWGETHK+KNVKEFTYPVIVPPMSKVKIHA+IKQGMCQV
Subjt:  NKTKETDIVTFQFKYEEKNKKNWSSSVSSKIGVSVTTNFQVGVPMVAKGKIEVSAEMETEYEWGETHKNKNVKEFTYPVIVPPMSKVKIHAVIKQGMCQV

Query:  PFSYHR
        PFSY R
Subjt:  PFSYHR

A0A6J1DTV1 uncharacterized protein LOC1110242897.3e-195100Show/hide
Query:  MAVLHVSRVFFPPNSIVAKAEREVELLPMVTIMKADDEIYALSTLVDWDSLFILPKHVAFKGDNGQYLSFSSPYLKFSNSDVKDSSVVQEIFPTKDGNIR
        MAVLHVSRVFFPPNSIVAKAEREVELLPMVTIMKADDEIYALSTLVDWDSLFILPKHVAFKGDNGQYLSFSSPYLKFSNSDVKDSSVVQEIFPTKDGNIR
Subjt:  MAVLHVSRVFFPPNSIVAKAEREVELLPMVTIMKADDEIYALSTLVDWDSLFILPKHVAFKGDNGQYLSFSSPYLKFSNSDVKDSSVVQEIFPTKDGNIR

Query:  IKNDDSGKFWYRDPNWICATSTVKSDNPNNLFWPVKIDNQNLALRNLGNNHFCQRLTIEGKTSCLNAGAVTITKESRMEVVEIVISRTIDNVEYRLNDAR
        IKNDDSGKFWYRDPNWICATSTVKSDNPNNLFWPVKIDNQNLALRNLGNNHFCQRLTIEGKTSCLNAGAVTITKESRMEVVEIVISRTIDNVEYRLNDAR
Subjt:  IKNDDSGKFWYRDPNWICATSTVKSDNPNNLFWPVKIDNQNLALRNLGNNHFCQRLTIEGKTSCLNAGAVTITKESRMEVVEIVISRTIDNVEYRLNDAR

Query:  IYGQKVVSMAKGTAVNKTKETDIVTFQFKYEEKNKKNWSSSVSSKIGVSVTTNFQVGVPMVAKGKIEVSAEMETEYEWGETHKNKNVKEFTYPVIVPPMS
        IYGQKVVSMAKGTAVNKTKETDIVTFQFKYEEKNKKNWSSSVSSKIGVSVTTNFQVGVPMVAKGKIEVSAEMETEYEWGETHKNKNVKEFTYPVIVPPMS
Subjt:  IYGQKVVSMAKGTAVNKTKETDIVTFQFKYEEKNKKNWSSSVSSKIGVSVTTNFQVGVPMVAKGKIEVSAEMETEYEWGETHKNKNVKEFTYPVIVPPMS

Query:  KVKIHAVIKQGMCQVPFSYHREPMFLGRDNKSFIIYMMDFSLV
        KVKIHAVIKQGMCQVPFSYHREPMFLGRDNKSFIIYMMDFSLV
Subjt:  KVKIHAVIKQGMCQVPFSYHREPMFLGRDNKSFIIYMMDFSLV

A0A6J1DUY9 uncharacterized protein LOC1110242824.2e-13477.35Show/hide
Query:  IVAKAEREVELLPMVTIM--KADDEIYALSTLVDWDSLFILPKHVAFKGDNGQYLSFSSPYLKFSNSDVKDSSVVQEIFPTKDGNIRIKNDDSGKFWYRD
        +VAKA+R V ++P++  +  KADD+IYALST+VDWDS+FILPKHVAFKGDNGQYLSFSSPYLKFS+S + DSSVVQEIFPTKDGNIRIKND+SG+FWYRD
Subjt:  IVAKAEREVELLPMVTIM--KADDEIYALSTLVDWDSLFILPKHVAFKGDNGQYLSFSSPYLKFSNSDVKDSSVVQEIFPTKDGNIRIKNDDSGKFWYRD

Query:  PNWICATST-VKSDNPNNLFWPVKIDNQNLALRNLGNNHFCQRLTIEGKTSCLNAGAVTITKESRMEVVEIVISRTIDNVEYRLNDARIYGQKVVSMAKG
        PNWI ATST  KS+NPNNLFWP KID++ LALRNLGNNHFC RL++EGKT CLN+  V IT E+ ++  E VIS+TIDNVEYRLNDARIYGQKVVSMAKG
Subjt:  PNWICATST-VKSDNPNNLFWPVKIDNQNLALRNLGNNHFCQRLTIEGKTSCLNAGAVTITKESRMEVVEIVISRTIDNVEYRLNDARIYGQKVVSMAKG

Query:  TAVNKTKETDIVTFQFKYEEKNKKNWSSSVSSKIGVSVTTNFQVGVPMVAKGKIEVSAEMETEYEWGETHKNKNVKEFTYPVIVPPMSKVKIHAVIKQGM
         AVNKTKETD VTF+F YEEK KKNWSSSVSSKIGVSV++ FQVG+P V +GKIEV AEM  EYEWG THK+KNV+EFTYPV VPPMS+VKI+AVIKQG+
Subjt:  TAVNKTKETDIVTFQFKYEEKNKKNWSSSVSSKIGVSVTTNFQVGVPMVAKGKIEVSAEMETEYEWGETHKNKNVKEFTYPVIVPPMSKVKIHAVIKQGM

Query:  CQVPFSYHR
        CQVPFSY R
Subjt:  CQVPFSYHR

A0A6J1DYR0 uncharacterized protein LOC111024281 isoform X24.7e-12578.76Show/hide
Query:  IVAKAEREVELLPMVTIMKADDEIYALSTLVDWDSLFILPKHVAFKGDNGQYLSFSSPYLKFSNSDVKDSSVVQEIFPTKDGNIRIKNDDSGKFWYRDPN
        IVAKAEREVE+LPMVTI KADDEIYALSTLVDWDSLFILPKHVAFKGDNGQYLSFSSPYLKFS+SDVK+SSVVQEIFPTKD                   
Subjt:  IVAKAEREVELLPMVTIMKADDEIYALSTLVDWDSLFILPKHVAFKGDNGQYLSFSSPYLKFSNSDVKDSSVVQEIFPTKDGNIRIKNDDSGKFWYRDPN

Query:  WICATSTVKSDNPNNLFWPVKIDNQNLALRNLGNNHFCQRLTIEGKTSCLNAGAVTITKESRMEVVEIVISRTIDNVEYRLNDARIYGQKVVSMAKGTAV
                                        GNNHFCQRLTIEGKTSCL+AGA+TITKESRMEVVE+VISRTIDNVEYR+NDA+IYGQKVVSM KGTAV
Subjt:  WICATSTVKSDNPNNLFWPVKIDNQNLALRNLGNNHFCQRLTIEGKTSCLNAGAVTITKESRMEVVEIVISRTIDNVEYRLNDARIYGQKVVSMAKGTAV

Query:  NKTKETDIVTFQFKYEEKNKKNWSSSVSSKIGVSVTTNFQVGVPMVAKGKIEVSAEMETEYEWGETHKNKNVKEFTYPVIVPPMSKVKIHAVIKQGMCQV
        NKTKETD VTFQFKYEEKNKKNWSSSVSSKIGVSVTTNFQVGVPMVAKGKIEVSAEMETEYEWGETHK+KNVKEFTYPVIVPPMSKVKIHA+IKQGMCQV
Subjt:  NKTKETDIVTFQFKYEEKNKKNWSSSVSSKIGVSVTTNFQVGVPMVAKGKIEVSAEMETEYEWGETHKNKNVKEFTYPVIVPPMSKVKIHAVIKQGMCQV

Query:  PFSYHR
        PFSY R
Subjt:  PFSYHR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGTCACCTTGGAGCACCAATGGCTGTCCTCCACGTGTCCAGGGTATTCTTTCCCCCAAATAGTATCGTTGCAAAAGCTGAGCGTGAAGTTGAGCTTCTACCA
ATGGTTACAATTATGAAGGCTGATGATGAAATTTATGCACTCTCAACACTCGTTGATTGGGACTCCCTCTTTATACTTCCAAAACATGTGGCCTTCAAGGGAGAT
AATGGCCAATATTTGAGCTTTTCTAGTCCATACTTGAAGTTTTCGAACTCGGATGTTAAAGACTCATCAGTTGTGCAAGAGATCTTTCCTACAAAAGATGGTAAT
ATTCGTATAAAGAATGACGACTCGGGTAAGTTTTGGTATCGTGATCCTAACTGGATATGCGCCACATCGACCGTCAAGAGCGACAATCCCAACAACTTATTTTGG
CCCGTCAAAATCGACAACCAAAACCTGGCTCTTCGCAACTTAGGTAATAATCATTTCTGCCAGAGACTGACCATCGAGGGAAAAACGAGTTGTCTCAATGCTGGT
GCGGTTACTATTACCAAAGAATCGCGCATGGAAGTGGTAGAGATTGTAATTTCGAGGACCATTGATAACGTGGAGTATCGACTTAACGATGCTCGAATTTACGGT
CAAAAGGTGGTGTCAATGGCGAAAGGAACTGCTGTCAACAAAACAAAAGAAACTGACATAGTAACTTTTCAATTCAAGTACGAGGAAAAAAATAAGAAAAACTGG
AGTTCCTCTGTCTCGTCCAAGATTGGTGTGAGTGTGACTACAAACTTTCAAGTTGGGGTTCCTATGGTTGCAAAAGGGAAGATTGAAGTTTCAGCTGAGATGGAG
ACTGAATATGAATGGGGAGAAACCCATAAGAATAAAAATGTGAAGGAATTTACTTACCCTGTAATTGTTCCTCCAATGTCAAAAGTGAAGATTCATGCTGTCATA
AAGCAAGGCATGTGCCAAGTTCCTTTCTCCTACCACCGCGAACCGATGTTCTTAGGACGGGACAACAAGTCGTTCATCATTTACATGATGGACTTCTCACTGGTG
TAA
mRNA sequenceShow/hide mRNA sequence
ATGGGTCACCTTGGAGCACCAATGGCTGTCCTCCACGTGTCCAGGGTATTCTTTCCCCCAAATAGTATCGTTGCAAAAGCTGAGCGTGAAGTTGAGCTTCTACCA
ATGGTTACAATTATGAAGGCTGATGATGAAATTTATGCACTCTCAACACTCGTTGATTGGGACTCCCTCTTTATACTTCCAAAACATGTGGCCTTCAAGGGAGAT
AATGGCCAATATTTGAGCTTTTCTAGTCCATACTTGAAGTTTTCGAACTCGGATGTTAAAGACTCATCAGTTGTGCAAGAGATCTTTCCTACAAAAGATGGTAAT
ATTCGTATAAAGAATGACGACTCGGGTAAGTTTTGGTATCGTGATCCTAACTGGATATGCGCCACATCGACCGTCAAGAGCGACAATCCCAACAACTTATTTTGG
CCCGTCAAAATCGACAACCAAAACCTGGCTCTTCGCAACTTAGGTAATAATCATTTCTGCCAGAGACTGACCATCGAGGGAAAAACGAGTTGTCTCAATGCTGGT
GCGGTTACTATTACCAAAGAATCGCGCATGGAAGTGGTAGAGATTGTAATTTCGAGGACCATTGATAACGTGGAGTATCGACTTAACGATGCTCGAATTTACGGT
CAAAAGGTGGTGTCAATGGCGAAAGGAACTGCTGTCAACAAAACAAAAGAAACTGACATAGTAACTTTTCAATTCAAGTACGAGGAAAAAAATAAGAAAAACTGG
AGTTCCTCTGTCTCGTCCAAGATTGGTGTGAGTGTGACTACAAACTTTCAAGTTGGGGTTCCTATGGTTGCAAAAGGGAAGATTGAAGTTTCAGCTGAGATGGAG
ACTGAATATGAATGGGGAGAAACCCATAAGAATAAAAATGTGAAGGAATTTACTTACCCTGTAATTGTTCCTCCAATGTCAAAAGTGAAGATTCATGCTGTCATA
AAGCAAGGCATGTGCCAAGTTCCTTTCTCCTACCACCGCGAACCGATGTTCTTAGGACGGGACAACAAGTCGTTCATCATTTACATGATGGACTTCTCACTGGTG
TAA
Protein sequenceShow/hide protein sequence
MGHLGAPMAVLHVSRVFFPPNSIVAKAEREVELLPMVTIMKADDEIYALSTLVDWDSLFILPKHVAFKGDNGQYLSFSSPYLKFSNSDVKDSSVVQEIFPTKDGN
IRIKNDDSGKFWYRDPNWICATSTVKSDNPNNLFWPVKIDNQNLALRNLGNNHFCQRLTIEGKTSCLNAGAVTITKESRMEVVEIVISRTIDNVEYRLNDARIYG
QKVVSMAKGTAVNKTKETDIVTFQFKYEEKNKKNWSSSVSSKIGVSVTTNFQVGVPMVAKGKIEVSAEMETEYEWGETHKNKNVKEFTYPVIVPPMSKVKIHAVI
KQGMCQVPFSYHREPMFLGRDNKSFIIYMMDFSLV