; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc03g16740 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc03g16740
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionGag protease polyprotein
Genome locationchr3:11145256..11146464
RNA-Seq ExpressionMoc03g16740
SyntenyMoc03g16740
Gene Ontology termsGO:0006807 - nitrogen compound metabolic process (biological process)
GO:0043170 - macromolecule metabolic process (biological process)
GO:0044238 - primary metabolic process (biological process)
GO:0016787 - hydrolase activity (molecular function)
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0033513.1 gag protease polyprotein [Cucumis melo var. makuwa]2.6e-7445.31Show/hide
Query:  MSIEAKYLRDFKKYNPRSFDGLFVDPTLKEAWLLSMETVFCYMSCPEEQKVQCAVFMLKDDALLWWESAER-----------------------------
        +S EAK+LRDF+KYNP +FDG   DPT  + WL S+ET+F YM CPE+QKVQCAVFML D    WWE+ ER                             
Subjt:  MSIEAKYLRDFKKYNPRSFDGLFVDPTLKEAWLLSMETVFCYMSCPEEQKVQCAVFMLKDDALLWWESAER-----------------------------

Query:  -KQAEFLNLKQGNRSVEQYEREFTKLSRFAPKLVDIEAKKYEQFIMGLKDEIQGFVAVLSPPDYAIALRAAA------------LIDNRSTSGS--QTNQ
         K+ EFLNL+QG+ +VEQYE EF  LSRFAP+++  EA + ++F+ GL+ +IQG V    P  +A ALR A              +   STSG   +  Q
Subjt:  -KQAEFLNLKQGNRSVEQYEREFTKLSRFAPKLVDIEAKKYEQFIMGLKDEIQGFVAVLSPPDYAIALRAAA------------LIDNRSTSGS--QTNQ

Query:  AHVQRQQLNRQQGPSYE---------------KPVCSTCGKRHWGQCLSGTRVCFKCGQERHVGLNCLQRNTAGGVSQPLNSNQAITGNSTQQQGKVFAT
          V   Q N + G  +                KP+C+TCGK H G+CL GTR CFKC QE H    C  R T    +Q         G S   QG+VFAT
Subjt:  AHVQRQQLNRQQGPSYE---------------KPVCSTCGKRHWGQCLSGTRVCFKCGQERHVGLNCLQRNTAGGVSQPLNSNQAITGNSTQQQGKVFAT

Query:  TRQEAENSNAVVTGTLPILRRYGLILFDSGSTHSFISASFVNQAKLELEPLGYVLSVSTPAGIIMLASETVKA
         + EAE +  VVTGTLP+L  Y L+LFDSGS+HSFIS++FV  A+LE+EPL +VLSVSTP+G  ML+ E VKA
Subjt:  TRQEAENSNAVVTGTLPILRRYGLILFDSGSTHSFISASFVNQAKLELEPLGYVLSVSTPAGIIMLASETVKA

KAA0042480.1 pol protein [Cucumis melo var. makuwa]2.3e-7547.13Show/hide
Query:  MSIEAKYLRDFKKYNPRSFDGLFVDPTLKEAWLLSMETVFCYMSCPEEQKVQCAVFMLKDDALLWWESAER-----KQAEFLNLKQGNRSVEQYEREFTK
        +S+EAK+LRDF+KYNP +FDG   DPT  + WL S+ET+F YM CPE+QKVQCAVFML D    WWE+ ER     K+ EFLNL+QG+ +VEQY+ +F  
Subjt:  MSIEAKYLRDFKKYNPRSFDGLFVDPTLKEAWLLSMETVFCYMSCPEEQKVQCAVFMLKDDALLWWESAER-----KQAEFLNLKQGNRSVEQYEREFTK

Query:  LSRFAPKLVDIEAKKYEQFIMGLKDEIQGFVAVLSPPDYAIALRAAA------------LIDNRSTSGS--QTNQAHVQRQQLNRQQGPSYE--------
        LSRFA +++  EA + ++F+ GL+ +IQGFV    P  +  ALR A             +    STSG   +  Q  +   Q N + G  +         
Subjt:  LSRFAPKLVDIEAKKYEQFIMGLKDEIQGFVAVLSPPDYAIALRAAA------------LIDNRSTSGS--QTNQAHVQRQQLNRQQGPSYE--------

Query:  -------KPVCSTCGKRHWGQCLSGTRVCFKCGQERHVGLNCLQRNTAGGVSQPLNSNQAITGNSTQQQGKVFATTRQEAENSNAVVTGTLPILRRYGLI
               KP+C+TCGK H G CL GTR CFKC QE H    C  R T    +Q         G     QGKVFAT + EAE +  VVTGTLP+L  Y L+
Subjt:  -------KPVCSTCGKRHWGQCLSGTRVCFKCGQERHVGLNCLQRNTAGGVSQPLNSNQAITGNSTQQQGKVFATTRQEAENSNAVVTGTLPILRRYGLI

Query:  LFDSGSTHSFISASFVNQAKLELEPLGYVLSVSTPAGIIMLASETVKA
        LFDSGS+HSFIS++FV  A+LE+EPL +VLSVSTP+G  ML+ + VKA
Subjt:  LFDSGSTHSFISASFVNQAKLELEPLGYVLSVSTPAGIIMLASETVKA

KAA0062245.1 pol protein [Cucumis melo var. makuwa]4.0e-7546.96Show/hide
Query:  MSIEAKYLRDFKKYNPRSFDGLFVDPTLKEAWLLSMETVFCYMSCPEEQKVQCAVFMLKDDALLWWESAERKQA--EFLNLKQGNRSVEQYEREFTKLSR
        +S EAK+LRDF+KYNP +FDG   DPT  + WL S+ET+F YM CPE QKVQCAVFML D    WWE+ ER     EFLNL+QG+ +VEQY+ EF  LSR
Subjt:  MSIEAKYLRDFKKYNPRSFDGLFVDPTLKEAWLLSMETVFCYMSCPEEQKVQCAVFMLKDDALLWWESAERKQA--EFLNLKQGNRSVEQYEREFTKLSR

Query:  FAPKLVDIEAKKYEQFIMGLKDEIQGFVAVLSPPDYAIALRAAA--LIDNRSTSGSQTNQAHVQRQQLNRQQGP--------------------------
        FAP+++  EA   ++F+ GL+ +IQG V    P  +A ALR A    +  R+ S     +     Q+   +Q P                          
Subjt:  FAPKLVDIEAKKYEQFIMGLKDEIQGFVAVLSPPDYAIALRAAA--LIDNRSTSGSQTNQAHVQRQQLNRQQGP--------------------------

Query:  -SYEKPVCSTCGKRHWGQCLSGTRVCFKCGQERHVGLNCLQRNTAGGVSQPLNSNQAITGNSTQQQGKVFATTRQEAENSNAVVTGTLPILRRYGLILFD
         +  KP+C+TCGK H G+CL GTR CFKC QE H    C  R T    +Q         G     QG+VFAT + EAE +  VVTGTLP+L  Y L+LFD
Subjt:  -SYEKPVCSTCGKRHWGQCLSGTRVCFKCGQERHVGLNCLQRNTAGGVSQPLNSNQAITGNSTQQQGKVFATTRQEAENSNAVVTGTLPILRRYGLILFD

Query:  SGSTHSFISASFVNQAKLELEPLGYVLSVSTPAGIIMLASETVKA
        SGS+HSFIS++FV  A+LE+EPL +VLSVSTP+G  ML+ E VKA
Subjt:  SGSTHSFISASFVNQAKLELEPLGYVLSVSTPAGIIMLASETVKA

KAA0067481.1 pol protein [Cucumis melo var. makuwa]1.2e-7445.43Show/hide
Query:  MSIEAKYLRDFKKYNPRSFDGLFVDPTLKEAWLLSMETVFCYMSCPEEQKVQCAVFMLKDDALLWWESAER-----------------------------
        +S EAK+LRDF+KYNP +FDG   DPT  + WL S+ET+FCYM CPE+QKVQCAVFML D    WWE+ ER                             
Subjt:  MSIEAKYLRDFKKYNPRSFDGLFVDPTLKEAWLLSMETVFCYMSCPEEQKVQCAVFMLKDDALLWWESAER-----------------------------

Query:  -KQAEFLNLKQGNRSVEQYEREFTKLSRFAPKLVDIEAKKYEQFIMGLKDEIQGFVAVLSPPDYAIALRAAA--LIDNRSTS------GSQTNQAHVQRQ
         K+ EFLNL+QG+ +VEQY+ EF  LSRFAP+++  EA + ++F+ GL+ +IQG V    P  +A ALR A    +  R+ S      GS + Q     Q
Subjt:  -KQAEFLNLKQGNRSVEQYEREFTKLSRFAPKLVDIEAKKYEQFIMGLKDEIQGFVAVLSPPDYAIALRAAA--LIDNRSTS------GSQTNQAHVQRQ

Query:  Q--------------LNRQQGPSYE-------KPVCSTCGKRHWGQCLSGTRVCFKCGQERHVGLNCLQRNTAGGVSQPLNSNQAITGNSTQQQGKVFAT
        Q                R Q   +E       KP+C+TCGK H G+CL GTR CFKC QE H    C  R T    +Q         G     QGKVFAT
Subjt:  Q--------------LNRQQGPSYE-------KPVCSTCGKRHWGQCLSGTRVCFKCGQERHVGLNCLQRNTAGGVSQPLNSNQAITGNSTQQQGKVFAT

Query:  TRQEAENSNAVVTGTLPILRRYGLILFDSGSTHSFISASFVNQAKLELEPLGYVLSVSTPAGIIMLASETVK
         + EAE +  VVTGTLP+L  Y L+LFDSGS+HSFIS++FV  A+LE+EPL +VLSVSTP+G  ML+ E VK
Subjt:  TRQEAENSNAVVTGTLPILRRYGLILFDSGSTHSFISASFVNQAKLELEPLGYVLSVSTPAGIIMLASETVK

TYK01613.1 pol protein [Cucumis melo var. makuwa]3.4e-7445.04Show/hide
Query:  MSIEAKYLRDFKKYNPRSFDGLFVDPTLKEAWLLSMETVFCYMSCPEEQKVQCAVFMLKDDALLWWESAER-----------------------------
        +S EAK+LRDF+KYNP +FDG   DPT  + WL S+ET+F YM CPE+QKVQCAVFML D    WWE+ ER                             
Subjt:  MSIEAKYLRDFKKYNPRSFDGLFVDPTLKEAWLLSMETVFCYMSCPEEQKVQCAVFMLKDDALLWWESAER-----------------------------

Query:  -KQAEFLNLKQGNRSVEQYEREFTKLSRFAPKLVDIEAKKYEQFIMGLKDEIQGFVAVLSPPDYAIALRAAA--LIDNRSTSGSQTNQAHVQRQQLNRQQ
         K+ EFLNL+QG+ +VEQY+ EF  LSRFAP+++  EA + ++F+ GL+ +IQG V    P  +A ALR A    +  R+ S     +     Q+   +Q
Subjt:  -KQAEFLNLKQGNRSVEQYEREFTKLSRFAPKLVDIEAKKYEQFIMGLKDEIQGFVAVLSPPDYAIALRAAA--LIDNRSTSGSQTNQAHVQRQQLNRQQ

Query:  GP---------------SYE------------KPVCSTCGKRHWGQCLSGTRVCFKCGQERHVGLNCLQRNTAGGVSQPLNSNQAITGNSTQQQGKVFAT
         P               S++            KP+C+TCGK H G+CL GTR CFKC QE H    C  R T  G++Q    NQ   G     QG+VFAT
Subjt:  GP---------------SYE------------KPVCSTCGKRHWGQCLSGTRVCFKCGQERHVGLNCLQRNTAGGVSQPLNSNQAITGNSTQQQGKVFAT

Query:  TRQEAENSNAVVTGTLPILRRYGLILFDSGSTHSFISASFVNQAKLELEPLGYVLSVSTPAGIIMLASETVKA
         R EAE +  VVTGTLP+L  Y L+LFDSGS+HSFIS++FV+ A+LE+EPL +VLSVSTP+G  ML+ E VKA
Subjt:  TRQEAENSNAVVTGTLPILRRYGLILFDSGSTHSFISASFVNQAKLELEPLGYVLSVSTPAGIIMLASETVKA

TrEMBL top hitse value%identityAlignment
A0A5A7SSS7 Gag protease polyprotein1.3e-7445.31Show/hide
Query:  MSIEAKYLRDFKKYNPRSFDGLFVDPTLKEAWLLSMETVFCYMSCPEEQKVQCAVFMLKDDALLWWESAER-----------------------------
        +S EAK+LRDF+KYNP +FDG   DPT  + WL S+ET+F YM CPE+QKVQCAVFML D    WWE+ ER                             
Subjt:  MSIEAKYLRDFKKYNPRSFDGLFVDPTLKEAWLLSMETVFCYMSCPEEQKVQCAVFMLKDDALLWWESAER-----------------------------

Query:  -KQAEFLNLKQGNRSVEQYEREFTKLSRFAPKLVDIEAKKYEQFIMGLKDEIQGFVAVLSPPDYAIALRAAA------------LIDNRSTSGS--QTNQ
         K+ EFLNL+QG+ +VEQYE EF  LSRFAP+++  EA + ++F+ GL+ +IQG V    P  +A ALR A              +   STSG   +  Q
Subjt:  -KQAEFLNLKQGNRSVEQYEREFTKLSRFAPKLVDIEAKKYEQFIMGLKDEIQGFVAVLSPPDYAIALRAAA------------LIDNRSTSGS--QTNQ

Query:  AHVQRQQLNRQQGPSYE---------------KPVCSTCGKRHWGQCLSGTRVCFKCGQERHVGLNCLQRNTAGGVSQPLNSNQAITGNSTQQQGKVFAT
          V   Q N + G  +                KP+C+TCGK H G+CL GTR CFKC QE H    C  R T    +Q         G S   QG+VFAT
Subjt:  AHVQRQQLNRQQGPSYE---------------KPVCSTCGKRHWGQCLSGTRVCFKCGQERHVGLNCLQRNTAGGVSQPLNSNQAITGNSTQQQGKVFAT

Query:  TRQEAENSNAVVTGTLPILRRYGLILFDSGSTHSFISASFVNQAKLELEPLGYVLSVSTPAGIIMLASETVKA
         + EAE +  VVTGTLP+L  Y L+LFDSGS+HSFIS++FV  A+LE+EPL +VLSVSTP+G  ML+ E VKA
Subjt:  TRQEAENSNAVVTGTLPILRRYGLILFDSGSTHSFISASFVNQAKLELEPLGYVLSVSTPAGIIMLASETVKA

A0A5A7TGL9 Reverse transcriptase1.1e-7547.13Show/hide
Query:  MSIEAKYLRDFKKYNPRSFDGLFVDPTLKEAWLLSMETVFCYMSCPEEQKVQCAVFMLKDDALLWWESAER-----KQAEFLNLKQGNRSVEQYEREFTK
        +S+EAK+LRDF+KYNP +FDG   DPT  + WL S+ET+F YM CPE+QKVQCAVFML D    WWE+ ER     K+ EFLNL+QG+ +VEQY+ +F  
Subjt:  MSIEAKYLRDFKKYNPRSFDGLFVDPTLKEAWLLSMETVFCYMSCPEEQKVQCAVFMLKDDALLWWESAER-----KQAEFLNLKQGNRSVEQYEREFTK

Query:  LSRFAPKLVDIEAKKYEQFIMGLKDEIQGFVAVLSPPDYAIALRAAA------------LIDNRSTSGS--QTNQAHVQRQQLNRQQGPSYE--------
        LSRFA +++  EA + ++F+ GL+ +IQGFV    P  +  ALR A             +    STSG   +  Q  +   Q N + G  +         
Subjt:  LSRFAPKLVDIEAKKYEQFIMGLKDEIQGFVAVLSPPDYAIALRAAA------------LIDNRSTSGS--QTNQAHVQRQQLNRQQGPSYE--------

Query:  -------KPVCSTCGKRHWGQCLSGTRVCFKCGQERHVGLNCLQRNTAGGVSQPLNSNQAITGNSTQQQGKVFATTRQEAENSNAVVTGTLPILRRYGLI
               KP+C+TCGK H G CL GTR CFKC QE H    C  R T    +Q         G     QGKVFAT + EAE +  VVTGTLP+L  Y L+
Subjt:  -------KPVCSTCGKRHWGQCLSGTRVCFKCGQERHVGLNCLQRNTAGGVSQPLNSNQAITGNSTQQQGKVFATTRQEAENSNAVVTGTLPILRRYGLI

Query:  LFDSGSTHSFISASFVNQAKLELEPLGYVLSVSTPAGIIMLASETVKA
        LFDSGS+HSFIS++FV  A+LE+EPL +VLSVSTP+G  ML+ + VKA
Subjt:  LFDSGSTHSFISASFVNQAKLELEPLGYVLSVSTPAGIIMLASETVKA

A0A5A7TVK7 Gag protease polyprotein1.6e-7445.31Show/hide
Query:  MSIEAKYLRDFKKYNPRSFDGLFVDPTLKEAWLLSMETVFCYMSCPEEQKVQCAVFMLKDDALLWWESAER-----------------------------
        +S EAK+LRDF+KYNP +FDG   DPT  + WL S+ET+F YM CPE+QKVQCAVFML D    WWE+ ER                             
Subjt:  MSIEAKYLRDFKKYNPRSFDGLFVDPTLKEAWLLSMETVFCYMSCPEEQKVQCAVFMLKDDALLWWESAER-----------------------------

Query:  -KQAEFLNLKQGNRSVEQYEREFTKLSRFAPKLVDIEAKKYEQFIMGLKDEIQGFVAVLSPPDYAIALRAAA--LIDNRSTS------GSQTNQAHVQRQ
         K+ EFLN +QG+ +VEQY+ EF  LSRFAP+++  EA K ++F+ GL+ +IQG V    P  +A ALR A    +  R+ S      GS + Q     Q
Subjt:  -KQAEFLNLKQGNRSVEQYEREFTKLSRFAPKLVDIEAKKYEQFIMGLKDEIQGFVAVLSPPDYAIALRAAA--LIDNRSTS------GSQTNQAHVQRQ

Query:  Q--------------LNRQQGPSYE-------KPVCSTCGKRHWGQCLSGTRVCFKCGQERHVGLNCLQRNTAGGVSQPLNSNQAITGNSTQQQGKVFAT
        Q                R Q   +E       KP+C+TCGK H G+CL GTR+CFKC QE H    CL R T    +Q         G     QG+VFAT
Subjt:  Q--------------LNRQQGPSYE-------KPVCSTCGKRHWGQCLSGTRVCFKCGQERHVGLNCLQRNTAGGVSQPLNSNQAITGNSTQQQGKVFAT

Query:  TRQEAENSNAVVTGTLPILRRYGLILFDSGSTHSFISASFVNQAKLELEPLGYVLSVSTPAGIIMLASETVKA
         + EAE +  VVTGTLP+L  Y L+LFDSGS+HSFIS++FV  A+LE+EPL +VLSVSTP+G  ML+ E VKA
Subjt:  TRQEAENSNAVVTGTLPILRRYGLILFDSGSTHSFISASFVNQAKLELEPLGYVLSVSTPAGIIMLASETVKA

A0A5A7V8L8 Pol protein1.9e-7546.96Show/hide
Query:  MSIEAKYLRDFKKYNPRSFDGLFVDPTLKEAWLLSMETVFCYMSCPEEQKVQCAVFMLKDDALLWWESAERKQA--EFLNLKQGNRSVEQYEREFTKLSR
        +S EAK+LRDF+KYNP +FDG   DPT  + WL S+ET+F YM CPE QKVQCAVFML D    WWE+ ER     EFLNL+QG+ +VEQY+ EF  LSR
Subjt:  MSIEAKYLRDFKKYNPRSFDGLFVDPTLKEAWLLSMETVFCYMSCPEEQKVQCAVFMLKDDALLWWESAERKQA--EFLNLKQGNRSVEQYEREFTKLSR

Query:  FAPKLVDIEAKKYEQFIMGLKDEIQGFVAVLSPPDYAIALRAAA--LIDNRSTSGSQTNQAHVQRQQLNRQQGP--------------------------
        FAP+++  EA   ++F+ GL+ +IQG V    P  +A ALR A    +  R+ S     +     Q+   +Q P                          
Subjt:  FAPKLVDIEAKKYEQFIMGLKDEIQGFVAVLSPPDYAIALRAAA--LIDNRSTSGSQTNQAHVQRQQLNRQQGP--------------------------

Query:  -SYEKPVCSTCGKRHWGQCLSGTRVCFKCGQERHVGLNCLQRNTAGGVSQPLNSNQAITGNSTQQQGKVFATTRQEAENSNAVVTGTLPILRRYGLILFD
         +  KP+C+TCGK H G+CL GTR CFKC QE H    C  R T    +Q         G     QG+VFAT + EAE +  VVTGTLP+L  Y L+LFD
Subjt:  -SYEKPVCSTCGKRHWGQCLSGTRVCFKCGQERHVGLNCLQRNTAGGVSQPLNSNQAITGNSTQQQGKVFATTRQEAENSNAVVTGTLPILRRYGLILFD

Query:  SGSTHSFISASFVNQAKLELEPLGYVLSVSTPAGIIMLASETVKA
        SGS+HSFIS++FV  A+LE+EPL +VLSVSTP+G  ML+ E VKA
Subjt:  SGSTHSFISASFVNQAKLELEPLGYVLSVSTPAGIIMLASETVKA

A0A5A7VPI8 Reverse transcriptase5.6e-7545.43Show/hide
Query:  MSIEAKYLRDFKKYNPRSFDGLFVDPTLKEAWLLSMETVFCYMSCPEEQKVQCAVFMLKDDALLWWESAER-----------------------------
        +S EAK+LRDF+KYNP +FDG   DPT  + WL S+ET+FCYM CPE+QKVQCAVFML D    WWE+ ER                             
Subjt:  MSIEAKYLRDFKKYNPRSFDGLFVDPTLKEAWLLSMETVFCYMSCPEEQKVQCAVFMLKDDALLWWESAER-----------------------------

Query:  -KQAEFLNLKQGNRSVEQYEREFTKLSRFAPKLVDIEAKKYEQFIMGLKDEIQGFVAVLSPPDYAIALRAAA--LIDNRSTS------GSQTNQAHVQRQ
         K+ EFLNL+QG+ +VEQY+ EF  LSRFAP+++  EA + ++F+ GL+ +IQG V    P  +A ALR A    +  R+ S      GS + Q     Q
Subjt:  -KQAEFLNLKQGNRSVEQYEREFTKLSRFAPKLVDIEAKKYEQFIMGLKDEIQGFVAVLSPPDYAIALRAAA--LIDNRSTS------GSQTNQAHVQRQ

Query:  Q--------------LNRQQGPSYE-------KPVCSTCGKRHWGQCLSGTRVCFKCGQERHVGLNCLQRNTAGGVSQPLNSNQAITGNSTQQQGKVFAT
        Q                R Q   +E       KP+C+TCGK H G+CL GTR CFKC QE H    C  R T    +Q         G     QGKVFAT
Subjt:  Q--------------LNRQQGPSYE-------KPVCSTCGKRHWGQCLSGTRVCFKCGQERHVGLNCLQRNTAGGVSQPLNSNQAITGNSTQQQGKVFAT

Query:  TRQEAENSNAVVTGTLPILRRYGLILFDSGSTHSFISASFVNQAKLELEPLGYVLSVSTPAGIIMLASETVK
         + EAE +  VVTGTLP+L  Y L+LFDSGS+HSFIS++FV  A+LE+EPL +VLSVSTP+G  ML+ E VK
Subjt:  TRQEAENSNAVVTGTLPILRRYGLILFDSGSTHSFISASFVNQAKLELEPLGYVLSVSTPAGIIMLASETVK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATGGAGACTTTGCAGACACTTGTTCAAACAGCTGTCTCTAATCAAATGGCACAATTGACTCAGGATCGAGGGAGCATGTCAATAGAAGCTAAATATCTGCGAGATTT
TAAGAAGTACAATCCTCGCTCTTTTGACGGACTATTTGTAGATCCAACATTAAAAGAGGCTTGGTTGTTGTCGATGGAGACCGTCTTTTGTTATATGAGTTGTCCGGAGG
AACAAAAAGTGCAGTGTGCTGTCTTTATGCTAAAAGATGATGCCCTTCTGTGGTGGGAGTCTGCAGAAAGGAAACAAGCTGAATTTCTGAACCTGAAGCAAGGCAATAGA
TCAGTGGAGCAATATGAGAGAGAATTCACAAAACTGTCCCGTTTTGCCCCTAAGTTAGTAGACATAGAGGCTAAGAAGTATGAACAATTTATTATGGGTTTGAAGGATGA
GATTCAAGGCTTTGTAGCAGTCCTCTCTCCACCAGACTATGCTATAGCACTTCGAGCAGCTGCATTGATTGACAATCGTTCAACAAGTGGGTCCCAAACGAACCAAGCTC
ATGTTCAAAGACAGCAACTTAATCGACAACAAGGCCCTAGTTACGAAAAACCAGTATGCAGTACTTGTGGGAAGCGTCATTGGGGGCAATGTTTGTCGGGAACCAGAGTA
TGTTTTAAATGTGGCCAGGAAAGGCACGTGGGATTAAATTGCCTTCAGAGAAATACCGCAGGTGGTGTAAGCCAACCCTTGAATTCCAACCAGGCAATTACAGGAAATTC
AACTCAACAGCAGGGTAAGGTGTTTGCCACTACACGTCAAGAAGCTGAGAACTCAAATGCTGTAGTGACAGGTACGCTACCTATTCTTCGTCGCTATGGATTGATATTGT
TTGATTCAGGTTCTACACACTCTTTTATATCTGCCTCATTTGTGAATCAAGCTAAGTTAGAGTTGGAACCATTAGGATATGTGTTATCAGTTTCCACCCCGGCAGGGATC
ATTATGTTAGCTAGTGAGACAGTAAAAGCCGGTTAG
mRNA sequenceShow/hide mRNA sequence
ATGATGGAGACTTTGCAGACACTTGTTCAAACAGCTGTCTCTAATCAAATGGCACAATTGACTCAGGATCGAGGGAGCATGTCAATAGAAGCTAAATATCTGCGAGATTT
TAAGAAGTACAATCCTCGCTCTTTTGACGGACTATTTGTAGATCCAACATTAAAAGAGGCTTGGTTGTTGTCGATGGAGACCGTCTTTTGTTATATGAGTTGTCCGGAGG
AACAAAAAGTGCAGTGTGCTGTCTTTATGCTAAAAGATGATGCCCTTCTGTGGTGGGAGTCTGCAGAAAGGAAACAAGCTGAATTTCTGAACCTGAAGCAAGGCAATAGA
TCAGTGGAGCAATATGAGAGAGAATTCACAAAACTGTCCCGTTTTGCCCCTAAGTTAGTAGACATAGAGGCTAAGAAGTATGAACAATTTATTATGGGTTTGAAGGATGA
GATTCAAGGCTTTGTAGCAGTCCTCTCTCCACCAGACTATGCTATAGCACTTCGAGCAGCTGCATTGATTGACAATCGTTCAACAAGTGGGTCCCAAACGAACCAAGCTC
ATGTTCAAAGACAGCAACTTAATCGACAACAAGGCCCTAGTTACGAAAAACCAGTATGCAGTACTTGTGGGAAGCGTCATTGGGGGCAATGTTTGTCGGGAACCAGAGTA
TGTTTTAAATGTGGCCAGGAAAGGCACGTGGGATTAAATTGCCTTCAGAGAAATACCGCAGGTGGTGTAAGCCAACCCTTGAATTCCAACCAGGCAATTACAGGAAATTC
AACTCAACAGCAGGGTAAGGTGTTTGCCACTACACGTCAAGAAGCTGAGAACTCAAATGCTGTAGTGACAGGTACGCTACCTATTCTTCGTCGCTATGGATTGATATTGT
TTGATTCAGGTTCTACACACTCTTTTATATCTGCCTCATTTGTGAATCAAGCTAAGTTAGAGTTGGAACCATTAGGATATGTGTTATCAGTTTCCACCCCGGCAGGGATC
ATTATGTTAGCTAGTGAGACAGTAAAAGCCGGTTAG
Protein sequenceShow/hide protein sequence
MMETLQTLVQTAVSNQMAQLTQDRGSMSIEAKYLRDFKKYNPRSFDGLFVDPTLKEAWLLSMETVFCYMSCPEEQKVQCAVFMLKDDALLWWESAERKQAEFLNLKQGNR
SVEQYEREFTKLSRFAPKLVDIEAKKYEQFIMGLKDEIQGFVAVLSPPDYAIALRAAALIDNRSTSGSQTNQAHVQRQQLNRQQGPSYEKPVCSTCGKRHWGQCLSGTRV
CFKCGQERHVGLNCLQRNTAGGVSQPLNSNQAITGNSTQQQGKVFATTRQEAENSNAVVTGTLPILRRYGLILFDSGSTHSFISASFVNQAKLELEPLGYVLSVSTPAGI
IMLASETVKAG