; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0039703 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0039703
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase
Genome locationchr2:48591139..48597535
RNA-Seq ExpressionLag0039703
SyntenyLag0039703
Gene Ontology termsGO:0006807 - nitrogen compound metabolic process (biological process)
GO:0043170 - macromolecule metabolic process (biological process)
GO:0044238 - primary metabolic process (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0016787 - hydrolase activity (molecular function)
InterPro domainsIPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0032380.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]8.9e-7046.99Show/hide
Query:  TKGGLNMIQMQRRWLELVKDYDCEINYHLDKANVVVDALSKRAAGVASLVSTQIPLLAELDRVEIELAVADVPALLAQLSVVPNLRQQIAEAQRKDPEIV
        T+  LNM    RRWLELVKDYDCEI YH  KANVV DALS++ +  A+L++ Q PL  +L+R EI ++V  V   LAQL+V   LRQ+I +AQ  DP +V
Subjt:  TKGGLNMIQMQRRWLELVKDYDCEINYHLDKANVVVDALSKRAAGVASLVSTQIPLLAELDRVEIELAVADVPALLAQLSVVPNLRQQIAEAQRKDPEIV

Query:  K---------------------LWRR----------------PAGLLQPLEIPVWKWENVSMDFIVGLSRTLKGNTVIWVVVDRLTKSSHFILGKATYLV
        +                     L+ R                PAGLLQPL +P WKWE VSMDFI GL RTL+G TVIWVVVDRLTKS+HF+ GK+TY+V
Subjt:  K---------------------LWRR----------------PAGLLQPLEIPVWKWENVSMDFIVGLSRTLKGNTVIWVVVDRLTKSSHFILGKATYLV

Query:  EKWTELYLKGIVRL----------------------------------------------QLNQILEDMLMACVMDFGGSWENHLYLIEFAYNNRYQAMI
         KW +LY+  IVRL                                              +LNQ+LEDML AC ++F GSW++HL+L+EFAYNN +QA I
Subjt:  EKWTELYLKGIVRL----------------------------------------------QLNQILEDMLMACVMDFGGSWENHLYLIEFAYNNRYQAMI

Query:  GLAPYEALYGKKCRSPVHWDEVGEKALLGPEV
        G+AP+EALYGK CR P+ W EVGE+ L+GPE+
Subjt:  GLAPYEALYGKKCRSPVHWDEVGEKALLGPEV

KAA0043555.1 pol protein [Cucumis melo var. makuwa]2.3e-7045.66Show/hide
Query:  LNMIQMQRRWLELVKDYDCEINYHLDKANVVVDALSKRAAGVASLVSTQIPLLAELDRVEIELAVADVPALLAQLSVVPNLRQQIAEAQRKDPEIVKLW-
        LNM   QRRWLELVKDYDCEI YH  KANVV DALS++ +  A+L++ Q PL  +L++ EI ++V  V   LAQL+V P LRQ+I +AQR DP +V+   
Subjt:  LNMIQMQRRWLELVKDYDCEINYHLDKANVVVDALSKRAAGVASLVSTQIPLLAELDRVEIELAVADVPALLAQLSVVPNLRQQIAEAQRKDPEIVKLW-

Query:  ------------------------------------------------------RRPAGLLQPLEIPVWKWENVSMDFIVGLSRTLKGNTVIWVVVDRLT
                                                              ++PAGLLQPL IP WKWENVSMDFI GL RTL+G TVIWVVVDRLT
Subjt:  ------------------------------------------------------RRPAGLLQPLEIPVWKWENVSMDFIVGLSRTLKGNTVIWVVVDRLT

Query:  KSSHFILGKATYLVEKWTELYLKGIVRL----------------------------------------------QLNQILEDMLMACVMDFGGSWENHLY
        KS+HF+ GK+TY   KW +LYL  IVRL                                              +LNQ+LEDML AC ++F GSW++HL+
Subjt:  KSSHFILGKATYLVEKWTELYLKGIVRL----------------------------------------------QLNQILEDMLMACVMDFGGSWENHLY

Query:  LIEFAYNNRYQAMIGLAPYEALYGKKCRSPVHWDEVGEKALLGPEV
        L+EFAYNN +QA IG+AP+EALYGK CRSP+ W EVGE+ L+GPE+
Subjt:  LIEFAYNNRYQAMIGLAPYEALYGKKCRSPVHWDEVGEKALLGPEV

KAA0059071.1 pol protein [Cucumis melo var. makuwa]8.9e-7044.6Show/hide
Query:  TKGGLNMIQMQRRWLELVKDYDCEINYHLDKANVVVDALSKRAAGVASLVSTQIPLLAELDRVEIELAVADVPALLAQLSVVPNLRQQIAEAQRKDPEIV
        T+  LNM   QRRWLELVKDYDCEI YH  KANVV DALS++ +  A+L++ Q PL  +L+R EI ++V  V   LAQL+V P LRQ+I +AQ  DP +V
Subjt:  TKGGLNMIQMQRRWLELVKDYDCEINYHLDKANVVVDALSKRAAGVASLVSTQIPLLAELDRVEIELAVADVPALLAQLSVVPNLRQQIAEAQRKDPEIV

Query:  --------------------------------------------------------KLW----------RRPAGLLQPLEIPVWKWENVSMDFIVGLSRT
                                                                K++          ++PAGLLQPL IP WKWEN+SMDFI GL RT
Subjt:  --------------------------------------------------------KLW----------RRPAGLLQPLEIPVWKWENVSMDFIVGLSRT

Query:  LKGNTVIWVVVDRLTKSSHFILGKATYLVEKWTELYLKGIVRL----------------------------------------------QLNQILEDMLM
        LKG TVIWVVVDRLTKS+HF+ GK+TY   KW +LY+  IVRL                                              +LNQ+LEDML 
Subjt:  LKGNTVIWVVVDRLTKSSHFILGKATYLVEKWTELYLKGIVRL----------------------------------------------QLNQILEDMLM

Query:  ACVMDFGGSWENHLYLIEFAYNNRYQAMIGLAPYEALYGKKCRSPVHWDEVGEKALLGPEV
        AC ++F GSW++HL+L+EFAYNN YQA IG+AP+EALYGK CRSPV W EVGE+ L+GPE+
Subjt:  ACVMDFGGSWENHLYLIEFAYNNRYQAMIGLAPYEALYGKKCRSPVHWDEVGEKALLGPEV

KAA0061618.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]6.8e-7045.71Show/hide
Query:  TKGGLNMIQMQRRWLELVKDYDCEINYHLDKANVVVDALSKRAAGVASLVSTQIPLLAELDRVEIELAVADVPALLAQLSVVPNLRQQIAEAQRKDPEIV
        T+  LNM   QRRWLELVKDYDCEI YH  KANVV DALS++ +  A+L++ Q PL  +L+R EI ++V  V   LAQL+V P LRQ+I +AQ  DP +V
Subjt:  TKGGLNMIQMQRRWLELVKDYDCEINYHLDKANVVVDALSKRAAGVASLVSTQIPLLAELDRVEIELAVADVPALLAQLSVVPNLRQQIAEAQRKDPEIV

Query:  K---------------------LWRR----------------------------------PAGLLQPLEIPVWKWENVSMDFIVGLSRTLKGNTVIWVVV
        +                     L+ R                                  PAGLLQPL IP WKWENVSMDFI GL RTL+G TVIWVVV
Subjt:  K---------------------LWRR----------------------------------PAGLLQPLEIPVWKWENVSMDFIVGLSRTLKGNTVIWVVV

Query:  DRLTKSSHFILGKATYLVEKWTELYLKGIVRL----------------------------------------------QLNQILEDMLMACVMDFGGSWE
        DRLTKS+HF+ GK+TY   KW +LY+  IVRL                                              ++NQ+LEDML AC ++F GSW+
Subjt:  DRLTKSSHFILGKATYLVEKWTELYLKGIVRL----------------------------------------------QLNQILEDMLMACVMDFGGSWE

Query:  NHLYLIEFAYNNRYQAMIGLAPYEALYGKKCRSPVHWDEVGEKALLGPEV
        +HL+L+EFAYNN YQA IG+AP+EALY K CRSPV W EVGE+ L+GPE+
Subjt:  NHLYLIEFAYNNRYQAMIGLAPYEALYGKKCRSPVHWDEVGEKALLGPEV

KAA0066365.1 pol protein [Cucumis melo var. makuwa]4.0e-7044.75Show/hide
Query:  TKGGLNMIQMQRRWLELVKDYDCEINYHLDKANVVVDALSKRAAGVASLVSTQIPLLAELDRVEIELAVADVPALLAQLSVVPNLRQQIAEAQRKDPEIV
        T+  LNM   QRRWLELVKDYDCEI YH  KANVV DALS++ A  A+L++ Q PLL + +R EI +AV +V A LAQL+V P LRQ+I  AQ  DP + 
Subjt:  TKGGLNMIQMQRRWLELVKDYDCEINYHLDKANVVVDALSKRAAGVASLVSTQIPLLAELDRVEIELAVADVPALLAQLSVVPNLRQQIAEAQRKDPEIV

Query:  KLWR-------------------------------------------------------------------RPAGLLQPLEIPVWKWENVSMDFIVGLSR
        +  R                                                                   RPAGLLQPL +P WKWE+VSMDFI GL +
Subjt:  KLWR-------------------------------------------------------------------RPAGLLQPLEIPVWKWENVSMDFIVGLSR

Query:  TLKGNTVIWVVVDRLTKSSHFILGKATYLVEKWTELYLKGIVRL----------------------------------------------QLNQILEDML
        TL+G TVIWVVVDRL KS+HF+ GK+TY   KW +LY+  IVRL                                              +LNQILEDML
Subjt:  TLKGNTVIWVVVDRLTKSSHFILGKATYLVEKWTELYLKGIVRL----------------------------------------------QLNQILEDML

Query:  MACVMDFGGSWENHLYLIEFAYNNRYQAMIGLAPYEALYGKKCRSPVHWDEVGEKALLGPEV
         ACV++F GSW++HL+L+EFAYNN YQA IG+AP+EALYGK CRSPV W EVGE+ +LGPE+
Subjt:  MACVMDFGGSWENHLYLIEFAYNNRYQAMIGLAPYEALYGKKCRSPVHWDEVGEKALLGPEV

TrEMBL top hitse value%identityAlignment
A0A5A7SNL9 Reverse transcriptase4.3e-7046.99Show/hide
Query:  TKGGLNMIQMQRRWLELVKDYDCEINYHLDKANVVVDALSKRAAGVASLVSTQIPLLAELDRVEIELAVADVPALLAQLSVVPNLRQQIAEAQRKDPEIV
        T+  LNM    RRWLELVKDYDCEI YH  KANVV DALS++ +  A+L++ Q PL  +L+R EI ++V  V   LAQL+V   LRQ+I +AQ  DP +V
Subjt:  TKGGLNMIQMQRRWLELVKDYDCEINYHLDKANVVVDALSKRAAGVASLVSTQIPLLAELDRVEIELAVADVPALLAQLSVVPNLRQQIAEAQRKDPEIV

Query:  K---------------------LWRR----------------PAGLLQPLEIPVWKWENVSMDFIVGLSRTLKGNTVIWVVVDRLTKSSHFILGKATYLV
        +                     L+ R                PAGLLQPL +P WKWE VSMDFI GL RTL+G TVIWVVVDRLTKS+HF+ GK+TY+V
Subjt:  K---------------------LWRR----------------PAGLLQPLEIPVWKWENVSMDFIVGLSRTLKGNTVIWVVVDRLTKSSHFILGKATYLV

Query:  EKWTELYLKGIVRL----------------------------------------------QLNQILEDMLMACVMDFGGSWENHLYLIEFAYNNRYQAMI
         KW +LY+  IVRL                                              +LNQ+LEDML AC ++F GSW++HL+L+EFAYNN +QA I
Subjt:  EKWTELYLKGIVRL----------------------------------------------QLNQILEDMLMACVMDFGGSWENHLYLIEFAYNNRYQAMI

Query:  GLAPYEALYGKKCRSPVHWDEVGEKALLGPEV
        G+AP+EALYGK CR P+ W EVGE+ L+GPE+
Subjt:  GLAPYEALYGKKCRSPVHWDEVGEKALLGPEV

A0A5A7TQ69 Reverse transcriptase1.1e-7045.66Show/hide
Query:  LNMIQMQRRWLELVKDYDCEINYHLDKANVVVDALSKRAAGVASLVSTQIPLLAELDRVEIELAVADVPALLAQLSVVPNLRQQIAEAQRKDPEIVKLW-
        LNM   QRRWLELVKDYDCEI YH  KANVV DALS++ +  A+L++ Q PL  +L++ EI ++V  V   LAQL+V P LRQ+I +AQR DP +V+   
Subjt:  LNMIQMQRRWLELVKDYDCEINYHLDKANVVVDALSKRAAGVASLVSTQIPLLAELDRVEIELAVADVPALLAQLSVVPNLRQQIAEAQRKDPEIVKLW-

Query:  ------------------------------------------------------RRPAGLLQPLEIPVWKWENVSMDFIVGLSRTLKGNTVIWVVVDRLT
                                                              ++PAGLLQPL IP WKWENVSMDFI GL RTL+G TVIWVVVDRLT
Subjt:  ------------------------------------------------------RRPAGLLQPLEIPVWKWENVSMDFIVGLSRTLKGNTVIWVVVDRLT

Query:  KSSHFILGKATYLVEKWTELYLKGIVRL----------------------------------------------QLNQILEDMLMACVMDFGGSWENHLY
        KS+HF+ GK+TY   KW +LYL  IVRL                                              +LNQ+LEDML AC ++F GSW++HL+
Subjt:  KSSHFILGKATYLVEKWTELYLKGIVRL----------------------------------------------QLNQILEDMLMACVMDFGGSWENHLY

Query:  LIEFAYNNRYQAMIGLAPYEALYGKKCRSPVHWDEVGEKALLGPEV
        L+EFAYNN +QA IG+AP+EALYGK CRSP+ W EVGE+ L+GPE+
Subjt:  LIEFAYNNRYQAMIGLAPYEALYGKKCRSPVHWDEVGEKALLGPEV

A0A5A7V003 Reverse transcriptase4.3e-7044.6Show/hide
Query:  TKGGLNMIQMQRRWLELVKDYDCEINYHLDKANVVVDALSKRAAGVASLVSTQIPLLAELDRVEIELAVADVPALLAQLSVVPNLRQQIAEAQRKDPEIV
        T+  LNM   QRRWLELVKDYDCEI YH  KANVV DALS++ +  A+L++ Q PL  +L+R EI ++V  V   LAQL+V P LRQ+I +AQ  DP +V
Subjt:  TKGGLNMIQMQRRWLELVKDYDCEINYHLDKANVVVDALSKRAAGVASLVSTQIPLLAELDRVEIELAVADVPALLAQLSVVPNLRQQIAEAQRKDPEIV

Query:  --------------------------------------------------------KLW----------RRPAGLLQPLEIPVWKWENVSMDFIVGLSRT
                                                                K++          ++PAGLLQPL IP WKWEN+SMDFI GL RT
Subjt:  --------------------------------------------------------KLW----------RRPAGLLQPLEIPVWKWENVSMDFIVGLSRT

Query:  LKGNTVIWVVVDRLTKSSHFILGKATYLVEKWTELYLKGIVRL----------------------------------------------QLNQILEDMLM
        LKG TVIWVVVDRLTKS+HF+ GK+TY   KW +LY+  IVRL                                              +LNQ+LEDML 
Subjt:  LKGNTVIWVVVDRLTKSSHFILGKATYLVEKWTELYLKGIVRL----------------------------------------------QLNQILEDMLM

Query:  ACVMDFGGSWENHLYLIEFAYNNRYQAMIGLAPYEALYGKKCRSPVHWDEVGEKALLGPEV
        AC ++F GSW++HL+L+EFAYNN YQA IG+AP+EALYGK CRSPV W EVGE+ L+GPE+
Subjt:  ACVMDFGGSWENHLYLIEFAYNNRYQAMIGLAPYEALYGKKCRSPVHWDEVGEKALLGPEV

A0A5A7V223 Reverse transcriptase3.3e-7045.71Show/hide
Query:  TKGGLNMIQMQRRWLELVKDYDCEINYHLDKANVVVDALSKRAAGVASLVSTQIPLLAELDRVEIELAVADVPALLAQLSVVPNLRQQIAEAQRKDPEIV
        T+  LNM   QRRWLELVKDYDCEI YH  KANVV DALS++ +  A+L++ Q PL  +L+R EI ++V  V   LAQL+V P LRQ+I +AQ  DP +V
Subjt:  TKGGLNMIQMQRRWLELVKDYDCEINYHLDKANVVVDALSKRAAGVASLVSTQIPLLAELDRVEIELAVADVPALLAQLSVVPNLRQQIAEAQRKDPEIV

Query:  K---------------------LWRR----------------------------------PAGLLQPLEIPVWKWENVSMDFIVGLSRTLKGNTVIWVVV
        +                     L+ R                                  PAGLLQPL IP WKWENVSMDFI GL RTL+G TVIWVVV
Subjt:  K---------------------LWRR----------------------------------PAGLLQPLEIPVWKWENVSMDFIVGLSRTLKGNTVIWVVV

Query:  DRLTKSSHFILGKATYLVEKWTELYLKGIVRL----------------------------------------------QLNQILEDMLMACVMDFGGSWE
        DRLTKS+HF+ GK+TY   KW +LY+  IVRL                                              ++NQ+LEDML AC ++F GSW+
Subjt:  DRLTKSSHFILGKATYLVEKWTELYLKGIVRL----------------------------------------------QLNQILEDMLMACVMDFGGSWE

Query:  NHLYLIEFAYNNRYQAMIGLAPYEALYGKKCRSPVHWDEVGEKALLGPEV
        +HL+L+EFAYNN YQA IG+AP+EALY K CRSPV W EVGE+ L+GPE+
Subjt:  NHLYLIEFAYNNRYQAMIGLAPYEALYGKKCRSPVHWDEVGEKALLGPEV

A0A5A7VKS7 Reverse transcriptase1.9e-7044.75Show/hide
Query:  TKGGLNMIQMQRRWLELVKDYDCEINYHLDKANVVVDALSKRAAGVASLVSTQIPLLAELDRVEIELAVADVPALLAQLSVVPNLRQQIAEAQRKDPEIV
        T+  LNM   QRRWLELVKDYDCEI YH  KANVV DALS++ A  A+L++ Q PLL + +R EI +AV +V A LAQL+V P LRQ+I  AQ  DP + 
Subjt:  TKGGLNMIQMQRRWLELVKDYDCEINYHLDKANVVVDALSKRAAGVASLVSTQIPLLAELDRVEIELAVADVPALLAQLSVVPNLRQQIAEAQRKDPEIV

Query:  KLWR-------------------------------------------------------------------RPAGLLQPLEIPVWKWENVSMDFIVGLSR
        +  R                                                                   RPAGLLQPL +P WKWE+VSMDFI GL +
Subjt:  KLWR-------------------------------------------------------------------RPAGLLQPLEIPVWKWENVSMDFIVGLSR

Query:  TLKGNTVIWVVVDRLTKSSHFILGKATYLVEKWTELYLKGIVRL----------------------------------------------QLNQILEDML
        TL+G TVIWVVVDRL KS+HF+ GK+TY   KW +LY+  IVRL                                              +LNQILEDML
Subjt:  TLKGNTVIWVVVDRLTKSSHFILGKATYLVEKWTELYLKGIVRL----------------------------------------------QLNQILEDML

Query:  MACVMDFGGSWENHLYLIEFAYNNRYQAMIGLAPYEALYGKKCRSPVHWDEVGEKALLGPEV
         ACV++F GSW++HL+L+EFAYNN YQA IG+AP+EALYGK CRSPV W EVGE+ +LGPE+
Subjt:  MACVMDFGGSWENHLYLIEFAYNNRYQAMIGLAPYEALYGKKCRSPVHWDEVGEKALLGPEV

SwissProt top hitse value%identityAlignment
Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein4.2e-0638.95Show/hide
Query:  VPALLAQLSVV---PNLRQQIAEAQRK--DPEIVKLWR-RPAGLLQPLEIPVWKWENVSMDFIVGLSRTLKGNTVIWVVVDRLTKSSHFILGKAT
        V   LA++S +   P L+  I +  R     +++K  R R  GLLQPL I   +W ++SMDF+ GL  T     +I VVVDR +K +HFI  + T
Subjt:  VPALLAQLSVV---PNLRQQIAEAQRK--DPEIVKLWR-RPAGLLQPLEIPVWKWENVSMDFIVGLSRTLKGNTVIWVVVDRLTKSSHFILGKAT

Q99315 Transposon Ty3-G Gag-Pol polyprotein4.2e-0638.95Show/hide
Query:  VPALLAQLSVV---PNLRQQIAEAQRK--DPEIVKLWR-RPAGLLQPLEIPVWKWENVSMDFIVGLSRTLKGNTVIWVVVDRLTKSSHFILGKAT
        V   LA++S +   P L+  I +  R     +++K  R R  GLLQPL I   +W ++SMDF+ GL  T     +I VVVDR +K +HFI  + T
Subjt:  VPALLAQLSVV---PNLRQQIAEAQRK--DPEIVKLWR-RPAGLLQPLEIPVWKWENVSMDFIVGLSRTLKGNTVIWVVVDRLTKSSHFILGKAT

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCATGGAAGACTTTGTGGAAACCTTGGAGATTCAACGTTATTGGAGTTGAAGCAAAGATCCGCAAGAAGTTTCTCCATATTATTTCGAGGGTCTAAGCTAAATCCTCT
CTCCCGAGCAAGATTCAACACGAGACGAGATCATCCAATCAATGGCCACTCCCACCAAAGGTCTCTAGTCGGAATTAGGGTTAAAATCTCACCAATTGGACTCCCTTTGG
CCGTAGCATATTGTCGCAACGTCATCGCGACGCGGAGTGGCCGACTTCCTTCCCTAAGTGACGGCAGCGTGGGCGGCAAAACTAAGGCATCGAACGGAACTGGCAGCGAC
GATGGTTTTCGAGTGGCAGCGATGGAGTTGAGGTGTGAGTTTTTGGAGGAAGAAGAAAATGAGTTTGGGTGGAGGTCACAATGGGAAGCTTGGATGGTGATTGGAGAGAG
CAATGGTAGGGAGTTGAGGAAGAAGGGAGAGGGAGAGGGAAGTGGAGAATTATGGAAAATTGGTAGCGTCGAGACGCTAAGGAGGCAACGTCTCGATGCTACCTCTCGGC
TGAATGTGGATTTGGATTGGTACACCGAGCTCGATAGCACACCTTATCCTAGAGGCTATAAAAAGAGACTAAGAGCTTCAACAAAAGGGGGATTGAATATGATTCAAATG
CAAAGGAGATGGTTGGAGTTGGTGAAGGACTATGACTGTGAAATCAATTATCACCTCGATAAAGCTAATGTTGTAGTAGATGCTCTTAGCAAGAGAGCAGCAGGTGTGGC
GTCGTTGGTTTCGACTCAGATTCCCTTGCTAGCAGAGTTGGATCGGGTAGAGATCGAGTTAGCAGTGGCGGATGTTCCAGCTTTGTTGGCTCAGCTGTCTGTGGTCCCTA
ATTTAAGGCAGCAAATCGCTGAAGCACAGAGGAAAGATCCTGAGATTGTCAAGCTTTGGAGAAGACCTGCTGGGTTGTTGCAACCATTAGAAATCCCAGTCTGGAAGTGG
GAAAATGTGTCCATGGACTTCATTGTGGGTCTGTCGAGGACACTAAAGGGCAACACAGTGATCTGGGTTGTAGTTGATCGTTTAACCAAGTCATCTCATTTCATTCTTGG
TAAGGCCACTTATCTAGTGGAAAAGTGGACAGAGTTGTACCTGAAGGGGATTGTGAGACTTCAATTGAATCAAATTCTTGAGGATATGCTCATGGCTTGTGTCATGGACT
TTGGAGGTAGTTGGGAGAATCATTTGTACCTGATCGAGTTTGCCTACAACAATAGGTATCAGGCTATGATTGGCCTGGCTCCCTATGAAGCCTTGTATGGAAAGAAGTGC
AGGTCCCCAGTCCATTGGGATGAAGTTGGTGAAAAGGCCCTATTAGGCCCAGAGGTCTAA
mRNA sequenceShow/hide mRNA sequence
ATGCATGGAAGACTTTGTGGAAACCTTGGAGATTCAACGTTATTGGAGTTGAAGCAAAGATCCGCAAGAAGTTTCTCCATATTATTTCGAGGGTCTAAGCTAAATCCTCT
CTCCCGAGCAAGATTCAACACGAGACGAGATCATCCAATCAATGGCCACTCCCACCAAAGGTCTCTAGTCGGAATTAGGGTTAAAATCTCACCAATTGGACTCCCTTTGG
CCGTAGCATATTGTCGCAACGTCATCGCGACGCGGAGTGGCCGACTTCCTTCCCTAAGTGACGGCAGCGTGGGCGGCAAAACTAAGGCATCGAACGGAACTGGCAGCGAC
GATGGTTTTCGAGTGGCAGCGATGGAGTTGAGGTGTGAGTTTTTGGAGGAAGAAGAAAATGAGTTTGGGTGGAGGTCACAATGGGAAGCTTGGATGGTGATTGGAGAGAG
CAATGGTAGGGAGTTGAGGAAGAAGGGAGAGGGAGAGGGAAGTGGAGAATTATGGAAAATTGGTAGCGTCGAGACGCTAAGGAGGCAACGTCTCGATGCTACCTCTCGGC
TGAATGTGGATTTGGATTGGTACACCGAGCTCGATAGCACACCTTATCCTAGAGGCTATAAAAAGAGACTAAGAGCTTCAACAAAAGGGGGATTGAATATGATTCAAATG
CAAAGGAGATGGTTGGAGTTGGTGAAGGACTATGACTGTGAAATCAATTATCACCTCGATAAAGCTAATGTTGTAGTAGATGCTCTTAGCAAGAGAGCAGCAGGTGTGGC
GTCGTTGGTTTCGACTCAGATTCCCTTGCTAGCAGAGTTGGATCGGGTAGAGATCGAGTTAGCAGTGGCGGATGTTCCAGCTTTGTTGGCTCAGCTGTCTGTGGTCCCTA
ATTTAAGGCAGCAAATCGCTGAAGCACAGAGGAAAGATCCTGAGATTGTCAAGCTTTGGAGAAGACCTGCTGGGTTGTTGCAACCATTAGAAATCCCAGTCTGGAAGTGG
GAAAATGTGTCCATGGACTTCATTGTGGGTCTGTCGAGGACACTAAAGGGCAACACAGTGATCTGGGTTGTAGTTGATCGTTTAACCAAGTCATCTCATTTCATTCTTGG
TAAGGCCACTTATCTAGTGGAAAAGTGGACAGAGTTGTACCTGAAGGGGATTGTGAGACTTCAATTGAATCAAATTCTTGAGGATATGCTCATGGCTTGTGTCATGGACT
TTGGAGGTAGTTGGGAGAATCATTTGTACCTGATCGAGTTTGCCTACAACAATAGGTATCAGGCTATGATTGGCCTGGCTCCCTATGAAGCCTTGTATGGAAAGAAGTGC
AGGTCCCCAGTCCATTGGGATGAAGTTGGTGAAAAGGCCCTATTAGGCCCAGAGGTCTAA
Protein sequenceShow/hide protein sequence
MHGRLCGNLGDSTLLELKQRSARSFSILFRGSKLNPLSRARFNTRRDHPINGHSHQRSLVGIRVKISPIGLPLAVAYCRNVIATRSGRLPSLSDGSVGGKTKASNGTGSD
DGFRVAAMELRCEFLEEEENEFGWRSQWEAWMVIGESNGRELRKKGEGEGSGELWKIGSVETLRRQRLDATSRLNVDLDWYTELDSTPYPRGYKKRLRASTKGGLNMIQM
QRRWLELVKDYDCEINYHLDKANVVVDALSKRAAGVASLVSTQIPLLAELDRVEIELAVADVPALLAQLSVVPNLRQQIAEAQRKDPEIVKLWRRPAGLLQPLEIPVWKW
ENVSMDFIVGLSRTLKGNTVIWVVVDRLTKSSHFILGKATYLVEKWTELYLKGIVRLQLNQILEDMLMACVMDFGGSWENHLYLIEFAYNNRYQAMIGLAPYEALYGKKC
RSPVHWDEVGEKALLGPEV