; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc07g10580 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc07g10580
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionReverse transcriptase
Genome locationchr7:8103851..8110562
RNA-Seq ExpressionMoc07g10580
SyntenyMoc07g10580
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0090304 - nucleic acid metabolic process (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0016740 - transferase activity (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR001969 - Aspartic peptidase, active site
IPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022156067.1 uncharacterized protein LOC111023035 [Momordica charantia]3.4e-10788.79Show/hide
Query:  QSSRGHHHLVQRQTVSPVCPSCKKSHAGPCWTGKRICYRCQKEGHFARECLMTGSNTQALGQRIPVTAAAQGGTHRARVFALTRGDVEHAEAVVTGTVLV
        QSS GH H VQRQT  P CPSCKK+HAGPCW GKRIC+RCQKEGHFAREC MTGSNTQALGQ+ P  AAAQGGT RARVFALTRGDVEHAEAVVTGT+LV
Subjt:  QSSRGHHHLVQRQTVSPVCPSCKKSHAGPCWTGKRICYRCQKEGHFARECLMTGSNTQALGQRIPVTAAAQGGTHRARVFALTRGDVEHAEAVVTGTVLV

Query:  FSMPAYALFDSGSSHSFIASTFVRHADLELESLGFLLSVSTPSGSVLVTSQVVKGGQLSFDGQALEVKLIQLDMQDFDVILGMDWLAANWANIDCSKKEV
         SMPAYALFDSGSSHSFIASTFV+HADLELESLGFLLSVSTPSGSVLVTSQVVKGGQLSFDGQ LEVKLIQLDMQDFDVILGMDWLAAN ANI+CSKKEV
Subjt:  FSMPAYALFDSGSSHSFIASTFVRHADLELESLGFLLSVSTPSGSVLVTSQVVKGGQLSFDGQALEVKLIQLDMQDFDVILGMDWLAANWANIDCSKKEV

Query:  SFRLPSGQNFIFKGAKAGVPKVV
        +FRLPSGQNF FKG KAGVP+VV
Subjt:  SFRLPSGQNFIFKGAKAGVPKVV

XP_022156328.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111023249 [Momordica charantia]4.1e-12981.79Show/hide
Query:  QSSRGHHHLVQRQTVSPVCPSCKKSHAGPCWTGKRICYRCQKEGHFARECLMTGSNTQALGQRIPVTAAAQGGTHRARVFALTRGDVEHAEAVVTGTVLV
        QSSRGH H  QRQT  PVCPSCKK+HA PCW GK+IC++CQKEGHF RECLMTGSNTQAL Q+ P   A QGGT  ARVFALTRGDVEHAEAVVTGT+L+
Subjt:  QSSRGHHHLVQRQTVSPVCPSCKKSHAGPCWTGKRICYRCQKEGHFARECLMTGSNTQALGQRIPVTAAAQGGTHRARVFALTRGDVEHAEAVVTGTVLV

Query:  FSMPAYALFDSGSSHSFIASTFVRHADLELESLGFLLSVSTPSGSVLVTSQVVKGGQLSFDGQALEVKLIQLDMQDFDVILGMDWLAANWANIDCSKKEV
         S+PAYALFDSGSSHSFIASTFVRHADLELES GF LSVSTPSGSVLVTSQVVKGGQLSF GQ LEV LIQL+MQDFDVILGMDWLAAN ANI+CSKKEV
Subjt:  FSMPAYALFDSGSSHSFIASTFVRHADLELESLGFLLSVSTPSGSVLVTSQVVKGGQLSFDGQALEVKLIQLDMQDFDVILGMDWLAANWANIDCSKKEV

Query:  SFRLPSGQNFIFKGAKAGVPKVVSALKASHLLQRGAWAYLASVVDVRKVVPSIEAVRVVNEFTDVFSEDFPGLPPSRKVDFYETLCYEEVP
        SF L SGQNF FKG KAGVP+VVSALKAS+LLQRG WAYLASVVD RKVVPSIE VRVVNEFTDVF ED PGLPP R+VDF    C E +P
Subjt:  SFRLPSGQNFIFKGAKAGVPKVVSALKASHLLQRGAWAYLASVVDVRKVVPSIEAVRVVNEFTDVFSEDFPGLPPSRKVDFYETLCYEEVP

XP_022156992.1 uncharacterized protein LOC111023821 [Momordica charantia]1.2e-13687.29Show/hide
Query:  QSSRGHHHLVQRQTVSPVCPSCKKSHAGPCWTGKRICYRCQKEGHFARECLMTGSNTQALGQRIPVTAAAQGGTHRARVFALTRGDVEHAEAVVTGTVLV
        Q SRGH H VQRQT  PVCPSCKKSH GPCW GK ICYRCQKEGHFAREC MTG NTQ LGQRIPVT AAQGGTHRARVFALTRGDV HAEAVV GTVLV
Subjt:  QSSRGHHHLVQRQTVSPVCPSCKKSHAGPCWTGKRICYRCQKEGHFARECLMTGSNTQALGQRIPVTAAAQGGTHRARVFALTRGDVEHAEAVVTGTVLV

Query:  FSMPAYALFDSGSSHSFIASTFVRHADLELESLGFLLSVSTPSGSVLVTSQVVKGGQLSFDGQALEVKLIQLDMQDFDVILGMDWLAANWANIDCSKKEV
         SMPAYALFDS SSHSFIASTFVRHADLELESLGFLLSVSTPSGSVLVTSQ+VKGGQLSFDGQ LEVKLIQLDMQDFDVILGMDWLAAN ANIDCSKKE 
Subjt:  FSMPAYALFDSGSSHSFIASTFVRHADLELESLGFLLSVSTPSGSVLVTSQVVKGGQLSFDGQALEVKLIQLDMQDFDVILGMDWLAANWANIDCSKKEV

Query:  SFRLPSGQNFIFKGAKAGVPKVVSALKASHLLQRGAWAYLASVVDVRKVVPSIEAVRVVNEFTDVFSEDFPGLPPSRKVDFYETLCYEEVP
        SFRLPS QNF FKG KA VP+VVSALKASH LQRGAWAYLASVVD RKVVPSIEAVRVVNEFTDVF ED PGLPPSR+VDF    C E +P
Subjt:  SFRLPSGQNFIFKGAKAGVPKVVSALKASHLLQRGAWAYLASVVDVRKVVPSIEAVRVVNEFTDVFSEDFPGLPPSRKVDFYETLCYEEVP

XP_022157413.1 uncharacterized protein LOC111024114 [Momordica charantia]4.9e-11482.33Show/hide
Query:  QSSRGHHHLVQRQTVSPVCPSCKKSHAGPCWTGKRICYRCQKEGHFARECLMTGSNTQALGQRIPVTAAAQGGTHRARVFALTRGDVEHAEAVVTGTVLV
        QSSRGH H VQRQT  PVCPSCKK+HAGPCW GKRIC+RCQK                      P  AAAQGGT RARVFALTRGDVEHAEAVVTGT+LV
Subjt:  QSSRGHHHLVQRQTVSPVCPSCKKSHAGPCWTGKRICYRCQKEGHFARECLMTGSNTQALGQRIPVTAAAQGGTHRARVFALTRGDVEHAEAVVTGTVLV

Query:  FSMPAYALFDSGSSHSFIASTFVRHADLELESLGFLLSVSTPSGSVLVTSQVVKGGQLSFDGQALEVKLIQLDMQDFDVILGMDWLAANWANIDCSKKEV
         SMPAYALFDSGSSHSFIASTFVRHADLELESLGFLLSVSTPSGSVLV SQVVKGGQLSFDGQ  EVKLIQLDMQDFDVILGMDWLAAN ANI+CSKKEV
Subjt:  FSMPAYALFDSGSSHSFIASTFVRHADLELESLGFLLSVSTPSGSVLVTSQVVKGGQLSFDGQALEVKLIQLDMQDFDVILGMDWLAANWANIDCSKKEV

Query:  SFRLPSGQNFIFKGAKAGVPKVVSALKASHLLQRGAWAYLASVVDVRKVVPSIEAVRVVNEFTDVF
        SFRLPSGQNF FK  K GVP+VVSALKA++LLQRGAWAYLASVVD RKVVPSIEAVRVVNEFTDVF
Subjt:  SFRLPSGQNFIFKGAKAGVPKVVSALKASHLLQRGAWAYLASVVDVRKVVPSIEAVRVVNEFTDVF

XP_022158750.1 uncharacterized protein LOC111025215 [Momordica charantia]7.2e-12691.76Show/hide
Query:  QSSRGHHHLVQRQTVSPVCPSCKKSHAGPCWTGKRICYRCQKEGHFARECLMTGSNTQALGQRIPVTAAAQGGTHRARVFALTRGDVEHAEAVVTGTVLV
        Q SR H H VQRQT  PVCPSCKKSHAGPCW GKRICYRCQKEGHFAREC MTGSNTQALGQRIP TAAAQGGTHRARVFALTRGDVE+AEAVVT TVLV
Subjt:  QSSRGHHHLVQRQTVSPVCPSCKKSHAGPCWTGKRICYRCQKEGHFARECLMTGSNTQALGQRIPVTAAAQGGTHRARVFALTRGDVEHAEAVVTGTVLV

Query:  FSMPAYALFDSGSSHSFIASTFVRHADLELESLGFLLSVSTPSGSVLVTSQVVKGGQLSFDGQALEVKLIQLDMQDFDVILGMDWLAANWANIDCSKKEV
         SMPAYALFDSGSSHSFIASTFV HADLELESLGFLLSVSTPSGSVLVTSQVVKGGQLSFDGQ LEVKLIQLDMQDFDVILGMDWLAAN ANIDCSKK+V
Subjt:  FSMPAYALFDSGSSHSFIASTFVRHADLELESLGFLLSVSTPSGSVLVTSQVVKGGQLSFDGQALEVKLIQLDMQDFDVILGMDWLAANWANIDCSKKEV

Query:  SFRLPSGQNFIFKGAKAGVPKVVSALKASHLLQRGAWAYLASVVDVRKVVPSIEA
        SFRLPSGQNF FKG KAGVP+VV ALKASHLLQRGAWAYLASVVD RKVVPSIEA
Subjt:  SFRLPSGQNFIFKGAKAGVPKVVSALKASHLLQRGAWAYLASVVDVRKVVPSIEA

TrEMBL top hitse value%identityAlignment
A0A6J1DQB9 Reverse transcriptase2.0e-12981.79Show/hide
Query:  QSSRGHHHLVQRQTVSPVCPSCKKSHAGPCWTGKRICYRCQKEGHFARECLMTGSNTQALGQRIPVTAAAQGGTHRARVFALTRGDVEHAEAVVTGTVLV
        QSSRGH H  QRQT  PVCPSCKK+HA PCW GK+IC++CQKEGHF RECLMTGSNTQAL Q+ P   A QGGT  ARVFALTRGDVEHAEAVVTGT+L+
Subjt:  QSSRGHHHLVQRQTVSPVCPSCKKSHAGPCWTGKRICYRCQKEGHFARECLMTGSNTQALGQRIPVTAAAQGGTHRARVFALTRGDVEHAEAVVTGTVLV

Query:  FSMPAYALFDSGSSHSFIASTFVRHADLELESLGFLLSVSTPSGSVLVTSQVVKGGQLSFDGQALEVKLIQLDMQDFDVILGMDWLAANWANIDCSKKEV
         S+PAYALFDSGSSHSFIASTFVRHADLELES GF LSVSTPSGSVLVTSQVVKGGQLSF GQ LEV LIQL+MQDFDVILGMDWLAAN ANI+CSKKEV
Subjt:  FSMPAYALFDSGSSHSFIASTFVRHADLELESLGFLLSVSTPSGSVLVTSQVVKGGQLSFDGQALEVKLIQLDMQDFDVILGMDWLAANWANIDCSKKEV

Query:  SFRLPSGQNFIFKGAKAGVPKVVSALKASHLLQRGAWAYLASVVDVRKVVPSIEAVRVVNEFTDVFSEDFPGLPPSRKVDFYETLCYEEVP
        SF L SGQNF FKG KAGVP+VVSALKAS+LLQRG WAYLASVVD RKVVPSIE VRVVNEFTDVF ED PGLPP R+VDF    C E +P
Subjt:  SFRLPSGQNFIFKGAKAGVPKVVSALKASHLLQRGAWAYLASVVDVRKVVPSIEAVRVVNEFTDVFSEDFPGLPPSRKVDFYETLCYEEVP

A0A6J1DR22 uncharacterized protein LOC1110230351.6e-10788.79Show/hide
Query:  QSSRGHHHLVQRQTVSPVCPSCKKSHAGPCWTGKRICYRCQKEGHFARECLMTGSNTQALGQRIPVTAAAQGGTHRARVFALTRGDVEHAEAVVTGTVLV
        QSS GH H VQRQT  P CPSCKK+HAGPCW GKRIC+RCQKEGHFAREC MTGSNTQALGQ+ P  AAAQGGT RARVFALTRGDVEHAEAVVTGT+LV
Subjt:  QSSRGHHHLVQRQTVSPVCPSCKKSHAGPCWTGKRICYRCQKEGHFARECLMTGSNTQALGQRIPVTAAAQGGTHRARVFALTRGDVEHAEAVVTGTVLV

Query:  FSMPAYALFDSGSSHSFIASTFVRHADLELESLGFLLSVSTPSGSVLVTSQVVKGGQLSFDGQALEVKLIQLDMQDFDVILGMDWLAANWANIDCSKKEV
         SMPAYALFDSGSSHSFIASTFV+HADLELESLGFLLSVSTPSGSVLVTSQVVKGGQLSFDGQ LEVKLIQLDMQDFDVILGMDWLAAN ANI+CSKKEV
Subjt:  FSMPAYALFDSGSSHSFIASTFVRHADLELESLGFLLSVSTPSGSVLVTSQVVKGGQLSFDGQALEVKLIQLDMQDFDVILGMDWLAANWANIDCSKKEV

Query:  SFRLPSGQNFIFKGAKAGVPKVV
        +FRLPSGQNF FKG KAGVP+VV
Subjt:  SFRLPSGQNFIFKGAKAGVPKVV

A0A6J1DTA8 uncharacterized protein LOC1110241142.4e-11482.33Show/hide
Query:  QSSRGHHHLVQRQTVSPVCPSCKKSHAGPCWTGKRICYRCQKEGHFARECLMTGSNTQALGQRIPVTAAAQGGTHRARVFALTRGDVEHAEAVVTGTVLV
        QSSRGH H VQRQT  PVCPSCKK+HAGPCW GKRIC+RCQK                      P  AAAQGGT RARVFALTRGDVEHAEAVVTGT+LV
Subjt:  QSSRGHHHLVQRQTVSPVCPSCKKSHAGPCWTGKRICYRCQKEGHFARECLMTGSNTQALGQRIPVTAAAQGGTHRARVFALTRGDVEHAEAVVTGTVLV

Query:  FSMPAYALFDSGSSHSFIASTFVRHADLELESLGFLLSVSTPSGSVLVTSQVVKGGQLSFDGQALEVKLIQLDMQDFDVILGMDWLAANWANIDCSKKEV
         SMPAYALFDSGSSHSFIASTFVRHADLELESLGFLLSVSTPSGSVLV SQVVKGGQLSFDGQ  EVKLIQLDMQDFDVILGMDWLAAN ANI+CSKKEV
Subjt:  FSMPAYALFDSGSSHSFIASTFVRHADLELESLGFLLSVSTPSGSVLVTSQVVKGGQLSFDGQALEVKLIQLDMQDFDVILGMDWLAANWANIDCSKKEV

Query:  SFRLPSGQNFIFKGAKAGVPKVVSALKASHLLQRGAWAYLASVVDVRKVVPSIEAVRVVNEFTDVF
        SFRLPSGQNF FK  K GVP+VVSALKA++LLQRGAWAYLASVVD RKVVPSIEAVRVVNEFTDVF
Subjt:  SFRLPSGQNFIFKGAKAGVPKVVSALKASHLLQRGAWAYLASVVDVRKVVPSIEAVRVVNEFTDVF

A0A6J1DTE5 uncharacterized protein LOC1110238215.8e-13787.29Show/hide
Query:  QSSRGHHHLVQRQTVSPVCPSCKKSHAGPCWTGKRICYRCQKEGHFARECLMTGSNTQALGQRIPVTAAAQGGTHRARVFALTRGDVEHAEAVVTGTVLV
        Q SRGH H VQRQT  PVCPSCKKSH GPCW GK ICYRCQKEGHFAREC MTG NTQ LGQRIPVT AAQGGTHRARVFALTRGDV HAEAVV GTVLV
Subjt:  QSSRGHHHLVQRQTVSPVCPSCKKSHAGPCWTGKRICYRCQKEGHFARECLMTGSNTQALGQRIPVTAAAQGGTHRARVFALTRGDVEHAEAVVTGTVLV

Query:  FSMPAYALFDSGSSHSFIASTFVRHADLELESLGFLLSVSTPSGSVLVTSQVVKGGQLSFDGQALEVKLIQLDMQDFDVILGMDWLAANWANIDCSKKEV
         SMPAYALFDS SSHSFIASTFVRHADLELESLGFLLSVSTPSGSVLVTSQ+VKGGQLSFDGQ LEVKLIQLDMQDFDVILGMDWLAAN ANIDCSKKE 
Subjt:  FSMPAYALFDSGSSHSFIASTFVRHADLELESLGFLLSVSTPSGSVLVTSQVVKGGQLSFDGQALEVKLIQLDMQDFDVILGMDWLAANWANIDCSKKEV

Query:  SFRLPSGQNFIFKGAKAGVPKVVSALKASHLLQRGAWAYLASVVDVRKVVPSIEAVRVVNEFTDVFSEDFPGLPPSRKVDFYETLCYEEVP
        SFRLPS QNF FKG KA VP+VVSALKASH LQRGAWAYLASVVD RKVVPSIEAVRVVNEFTDVF ED PGLPPSR+VDF    C E +P
Subjt:  SFRLPSGQNFIFKGAKAGVPKVVSALKASHLLQRGAWAYLASVVDVRKVVPSIEAVRVVNEFTDVFSEDFPGLPPSRKVDFYETLCYEEVP

A0A6J1DWP4 uncharacterized protein LOC1110252153.5e-12691.76Show/hide
Query:  QSSRGHHHLVQRQTVSPVCPSCKKSHAGPCWTGKRICYRCQKEGHFARECLMTGSNTQALGQRIPVTAAAQGGTHRARVFALTRGDVEHAEAVVTGTVLV
        Q SR H H VQRQT  PVCPSCKKSHAGPCW GKRICYRCQKEGHFAREC MTGSNTQALGQRIP TAAAQGGTHRARVFALTRGDVE+AEAVVT TVLV
Subjt:  QSSRGHHHLVQRQTVSPVCPSCKKSHAGPCWTGKRICYRCQKEGHFARECLMTGSNTQALGQRIPVTAAAQGGTHRARVFALTRGDVEHAEAVVTGTVLV

Query:  FSMPAYALFDSGSSHSFIASTFVRHADLELESLGFLLSVSTPSGSVLVTSQVVKGGQLSFDGQALEVKLIQLDMQDFDVILGMDWLAANWANIDCSKKEV
         SMPAYALFDSGSSHSFIASTFV HADLELESLGFLLSVSTPSGSVLVTSQVVKGGQLSFDGQ LEVKLIQLDMQDFDVILGMDWLAAN ANIDCSKK+V
Subjt:  FSMPAYALFDSGSSHSFIASTFVRHADLELESLGFLLSVSTPSGSVLVTSQVVKGGQLSFDGQALEVKLIQLDMQDFDVILGMDWLAANWANIDCSKKEV

Query:  SFRLPSGQNFIFKGAKAGVPKVVSALKASHLLQRGAWAYLASVVDVRKVVPSIEA
        SFRLPSGQNF FKG KAGVP+VV ALKASHLLQRGAWAYLASVVD RKVVPSIEA
Subjt:  SFRLPSGQNFIFKGAKAGVPKVVSALKASHLLQRGAWAYLASVVDVRKVVPSIEA

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTCCTTCGTCCCATCACCATGGTCGCTGCATAAGTACTTGTTCTCAAGGAACCAACTACTTATGCAGCGGCGGTCAGGTGTGTGTTGGTTATGGACAAATGTCTCG
AGGAGCCTCAGTCTCAGCAGGTGATGGGCTCCAGCTCGGGGGTCAAGAGGAAATTTGCATCGTTCTCCTCCAGTCCTCGAGAGGACACCATCATCTTGTGCAAAGGCAGA
CTGTTTCTCCGGTGTGCCCCTCTTGTAAGAAGAGCCATGCTGGGCCGTGTTGGACGGGAAAAAGAATATGTTACAGGTGTCAGAAGGAAGGACATTTCGCAAGGGAGTGT
CTGATGACCGGTTCGAATACCCAAGCTTTAGGCCAGAGGATCCCTGTAACGGCGGCAGCTCAAGGTGGGACGCATAGGGCGCGCGTCTTCGCTCTTACCAGGGGGGATGT
TGAGCATGCCGAGGCGGTGGTCACAGGGACTGTTTTAGTGTTTAGTATGCCTGCATACGCATTATTTGACTCTGGATCTAGTCATTCTTTCATTGCTTCTACCTTTGTTC
GACATGCGGACCTAGAGCTAGAATCGTTAGGCTTTTTGTTGTCGGTATCCACACCGTCAGGATCTGTGTTGGTCACTAGTCAAGTGGTGAAAGGAGGCCAGCTCTCCTTT
GATGGTCAGGCCTTGGAGGTGAAATTAATCCAACTGGATATGCAGGATTTCGATGTGATACTAGGCATGGATTGGTTAGCTGCCAACTGGGCTAATATTGATTGCTCGAA
GAAGGAAGTTAGCTTTCGCTTGCCCTCCGGACAAAACTTTATCTTTAAAGGAGCTAAGGCTGGGGTCCCAAAGGTGGTGTCGGCATTGAAGGCCAGCCATCTCCTCCAGC
GTGGTGCCTGGGCCTATTTGGCTAGCGTCGTGGATGTAAGGAAGGTTGTGCCAAGCATTGAGGCGGTTCGTGTGGTTAATGAGTTCACTGACGTGTTCTCTGAGGACTTC
CCCGGCTTGCCTCCGTCTCGCAAAGTGGACTTCTATGAGACCTTGTGCTACGAAGAGGTACCCGTCAAAATTTTGACAAAAGAAACCAAGTTGTTGAGGAACCGGACGAT
TCGCTTGGTTAAGGTTTTGTGGAGAAACCACCAAGTGGAAGAAGCTACTTGGGAGCGGGAGGACGATATCAAGGCGAGATACCCTAAACTGTTGGAACAGTCAACTTTCG
GGGACGAAACCGCCCCCCCTTTGCTTTCCGGCGACCCAGCCTTCCTCCGACGACGTCTTCAGATCCGGCGACACCCAGCAGGCCGCGGCCCTTATCCGGCGTTTCTTTTG
CGTGAGACGTCGTGGCGGTTCTCCGTGAACGGTAGGTCCCACGAGTGGCGCACGGGTTCCACGAGCAGTGGCGGCGCGACCTTCCCCTTGGCGGCGTTCGAGCAGCGGCG
GTCGCCGACTCCCGGCGACTCCCCGACGTTCCGACGGCCTGTAGCAGCGGCGGCGCGCCCCGGCGATCTGCAACCTCGCGGCGGCGCACGGCGAACGAGCGGTACAGCAG
CGGAACGGCGGCGGAATTTGTACGTTACAGCGGTGTTTAGGACGTTTGGCGGTGACCCACATCCGTTCGAAGCTCGATTTAGGCTACCCACACCTTGGCAAGGTGAGGTC
GACGGAAAACTTAAGGGCACGGTTGTGGTCGATATACTTGGAATGCGAGCTGTCGTGCACCGCTGTGAGGAGGGGGCCGCGCCGCCGTTGTACCTGTTACGAAACGCAGC
CGCTGGAGCGCCGGGAGTCGCCGGCCGTCGCTGCTCGAACGCCCCCAAGGGGAAGGTCGTGCCGCCGCTGCTCATGCAACCCGTGCGCCACTCGTCGGACCTACCGTTCG
CGGAGAACCGCCGCGACGTCGCACGCAAAAGAAACGCCGAAGGAGGGCCGCGGCCTGCTGGGTGTCGCCGGATCTGA
mRNA sequenceShow/hide mRNA sequence
ATGTCTCCTTCGTCCCATCACCATGGTCGCTGCATAAGTACTTGTTCTCAAGGAACCAACTACTTATGCAGCGGCGGTCAGGTGTGTGTTGGTTATGGACAAATGTCTCG
AGGAGCCTCAGTCTCAGCAGGTGATGGGCTCCAGCTCGGGGGTCAAGAGGAAATTTGCATCGTTCTCCTCCAGTCCTCGAGAGGACACCATCATCTTGTGCAAAGGCAGA
CTGTTTCTCCGGTGTGCCCCTCTTGTAAGAAGAGCCATGCTGGGCCGTGTTGGACGGGAAAAAGAATATGTTACAGGTGTCAGAAGGAAGGACATTTCGCAAGGGAGTGT
CTGATGACCGGTTCGAATACCCAAGCTTTAGGCCAGAGGATCCCTGTAACGGCGGCAGCTCAAGGTGGGACGCATAGGGCGCGCGTCTTCGCTCTTACCAGGGGGGATGT
TGAGCATGCCGAGGCGGTGGTCACAGGGACTGTTTTAGTGTTTAGTATGCCTGCATACGCATTATTTGACTCTGGATCTAGTCATTCTTTCATTGCTTCTACCTTTGTTC
GACATGCGGACCTAGAGCTAGAATCGTTAGGCTTTTTGTTGTCGGTATCCACACCGTCAGGATCTGTGTTGGTCACTAGTCAAGTGGTGAAAGGAGGCCAGCTCTCCTTT
GATGGTCAGGCCTTGGAGGTGAAATTAATCCAACTGGATATGCAGGATTTCGATGTGATACTAGGCATGGATTGGTTAGCTGCCAACTGGGCTAATATTGATTGCTCGAA
GAAGGAAGTTAGCTTTCGCTTGCCCTCCGGACAAAACTTTATCTTTAAAGGAGCTAAGGCTGGGGTCCCAAAGGTGGTGTCGGCATTGAAGGCCAGCCATCTCCTCCAGC
GTGGTGCCTGGGCCTATTTGGCTAGCGTCGTGGATGTAAGGAAGGTTGTGCCAAGCATTGAGGCGGTTCGTGTGGTTAATGAGTTCACTGACGTGTTCTCTGAGGACTTC
CCCGGCTTGCCTCCGTCTCGCAAAGTGGACTTCTATGAGACCTTGTGCTACGAAGAGGTACCCGTCAAAATTTTGACAAAAGAAACCAAGTTGTTGAGGAACCGGACGAT
TCGCTTGGTTAAGGTTTTGTGGAGAAACCACCAAGTGGAAGAAGCTACTTGGGAGCGGGAGGACGATATCAAGGCGAGATACCCTAAACTGTTGGAACAGTCAACTTTCG
GGGACGAAACCGCCCCCCCTTTGCTTTCCGGCGACCCAGCCTTCCTCCGACGACGTCTTCAGATCCGGCGACACCCAGCAGGCCGCGGCCCTTATCCGGCGTTTCTTTTG
CGTGAGACGTCGTGGCGGTTCTCCGTGAACGGTAGGTCCCACGAGTGGCGCACGGGTTCCACGAGCAGTGGCGGCGCGACCTTCCCCTTGGCGGCGTTCGAGCAGCGGCG
GTCGCCGACTCCCGGCGACTCCCCGACGTTCCGACGGCCTGTAGCAGCGGCGGCGCGCCCCGGCGATCTGCAACCTCGCGGCGGCGCACGGCGAACGAGCGGTACAGCAG
CGGAACGGCGGCGGAATTTGTACGTTACAGCGGTGTTTAGGACGTTTGGCGGTGACCCACATCCGTTCGAAGCTCGATTTAGGCTACCCACACCTTGGCAAGGTGAGGTC
GACGGAAAACTTAAGGGCACGGTTGTGGTCGATATACTTGGAATGCGAGCTGTCGTGCACCGCTGTGAGGAGGGGGCCGCGCCGCCGTTGTACCTGTTACGAAACGCAGC
CGCTGGAGCGCCGGGAGTCGCCGGCCGTCGCTGCTCGAACGCCCCCAAGGGGAAGGTCGTGCCGCCGCTGCTCATGCAACCCGTGCGCCACTCGTCGGACCTACCGTTCG
CGGAGAACCGCCGCGACGTCGCACGCAAAAGAAACGCCGAAGGAGGGCCGCGGCCTGCTGGGTGTCGCCGGATCTGA
Protein sequenceShow/hide protein sequence
MSPSSHHHGRCISTCSQGTNYLCSGGQVCVGYGQMSRGASVSAGDGLQLGGQEEICIVLLQSSRGHHHLVQRQTVSPVCPSCKKSHAGPCWTGKRICYRCQKEGHFAREC
LMTGSNTQALGQRIPVTAAAQGGTHRARVFALTRGDVEHAEAVVTGTVLVFSMPAYALFDSGSSHSFIASTFVRHADLELESLGFLLSVSTPSGSVLVTSQVVKGGQLSF
DGQALEVKLIQLDMQDFDVILGMDWLAANWANIDCSKKEVSFRLPSGQNFIFKGAKAGVPKVVSALKASHLLQRGAWAYLASVVDVRKVVPSIEAVRVVNEFTDVFSEDF
PGLPPSRKVDFYETLCYEEVPVKILTKETKLLRNRTIRLVKVLWRNHQVEEATWEREDDIKARYPKLLEQSTFGDETAPPLLSGDPAFLRRRLQIRRHPAGRGPYPAFLL
RETSWRFSVNGRSHEWRTGSTSSGGATFPLAAFEQRRSPTPGDSPTFRRPVAAAARPGDLQPRGGARRTSGTAAERRRNLYVTAVFRTFGGDPHPFEARFRLPTPWQGEV
DGKLKGTVVVDILGMRAVVHRCEEGAAPPLYLLRNAAAGAPGVAGRRCSNAPKGKVVPPLLMQPVRHSSDLPFAENRRDVARKRNAEGGPRPAGCRRI