; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0011331 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0011331
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase
Genome locationchr1:21809381..21816594
RNA-Seq ExpressionLag0011331
SyntenyLag0011331
Gene Ontology termsNA
InterPro domainsIPR041373 - Reverse transcriptase, RNase H-like domain
IPR041588 - Integrase zinc-binding domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0032714.1 DNA/RNA polymerases superfamily protein [Cucumis melo var. makuwa]4.6e-14950.79Show/hide
Query:  KTHEEHQRIILQTLRDKQLYAKFSKCEFWLKQVVFLGHVVSADGVSVRIRIASAHVSYTPSVSVDPQKVEVVVNWETPASATEVRSFLGLAGYYGRFVED
        ++HEEH RI+LQTLRDKQLYAKFSKCEFWL+QVVF GHVVSA G                 VSVDPQKVE +VNWE P SATEVRSFLGLAG Y RF+ED
Subjt:  KTHEEHQRIILQTLRDKQLYAKFSKCEFWLKQVVFLGHVVSADGVSVRIRIASAHVSYTPSVSVDPQKVEVVVNWETPASATEVRSFLGLAGYYGRFVED

Query:  FSRLALLLTALTRNNAKFEWSDK-----------------------WKEYVIYCYASRQGLGCVLMQEGKVIAYASRQLKKHECDYATHDLELTTIVLAL
        FSRLAL LTALTR N KFEWSDK                        K YVIYC  SR GLGCVLM++G VIAYASRQLK+HEC+Y THDLEL  +VLAL
Subjt:  FSRLALLLTALTRNNAKFEWSDK-----------------------WKEYVIYCYASRQGLGCVLMQEGKVIAYASRQLKKHECDYATHDLELTTIVLAL

Query:  KIRRHYLFSEKCPIFTDHKCLKYIFDKKELNLRQRRW---------------------------KSRLPKSALCGTRATLLSELRGFRAVVTVESSGSLL
        KI RHYLF EKC IFTDHK LKYIFD+KELNLRQRRW                           KSRLPKSALCG R  LL+ELRG +AVVT E SGSLL
Subjt:  KIRRHYLFSEKCPIFTDHKCLKYIFDKKELNLRQRRW---------------------------KSRLPKSALCGTRATLLSELRGFRAVVTVESSGSLL

Query:  AQFQVRSSLVAEIVRRQPKDR---------------------------------------------------------STKMYKTLKKTYWWPGMKREIA
        AQFQVRSSLV EIV+RQ +D                                                          STKMY+TLKKTYWW GMK+EIA
Subjt:  AQFQVRSSLVAEIVRRQPKDR---------------------------------------------------------STKMYKTLKKTYWWPGMKREIA

Query:  EYVDKCLICQQVKPVRKSAGGLHNPLPVLEWK----------------------------LTKTERFIPIKVTSTLDQLTKLYVDKIVSQCGVPVSIVSD
        EYVD+CLICQQVKPVR+  GG  NPLPV EWK                            LTKT +FIPIK+TSTLDQL +LYVDKIVSQ GVPVSIVSD
Subjt:  EYVDKCLICQQVKPVRKSAGGLHNPLPVLEWK----------------------------LTKTERFIPIKVTSTLDQLTKLYVDKIVSQCGVPVSIVSD

Query:  RDPRFTSKFWP-----------------------------------------------------------------------------------------
        RDPRFTSKFWP                                                                                         
Subjt:  RDPRFTSKFWP-----------------------------------------------------------------------------------------

Query:  -NKRQRDLEFQVGDQVFLKLSPWKGVIRFGRK
         +KR+R+LEFQVGDQVFLKLSPW+GVIRFGRK
Subjt:  -NKRQRDLEFQVGDQVFLKLSPWKGVIRFGRK

KAA0036676.1 DNA/RNA polymerases superfamily protein [Cucumis melo var. makuwa]1.7e-15151.33Show/hide
Query:  KTHEEHQRIILQTLRDKQLYAKFSKCEFWLKQVVFLGHVVSADGVSVRIRIASAHVSYTPSVSVDPQKVEVVVNWETPASATEVRSFLGLAGYYGRFVED
        ++HEEH RI+LQTLR+KQLYAKFSKCEFWL+QVVFLGHVVSA G                 VSVDPQKVE VVNWE P SATEVRSFLGLAGYY RF+ED
Subjt:  KTHEEHQRIILQTLRDKQLYAKFSKCEFWLKQVVFLGHVVSADGVSVRIRIASAHVSYTPSVSVDPQKVEVVVNWETPASATEVRSFLGLAGYYGRFVED

Query:  FSRLALLLTALTRNNAKFEWSDK-----------------------WKEYVIYCYASRQGLGCVLMQEGKVIAYASRQLKKHECDYATHDLELTTIVLAL
        FSRLAL LTALTR N KFEWSDK                        K+YVIYC ASR GLGCVLMQ+G VIAYASRQLK+HEC+Y THDLEL  +VLAL
Subjt:  FSRLALLLTALTRNNAKFEWSDK-----------------------WKEYVIYCYASRQGLGCVLMQEGKVIAYASRQLKKHECDYATHDLELTTIVLAL

Query:  KIRRHYLFSEKCPIFTDHKCLKYIFDKKELNLRQRRW---------------------------KSRLPKSALCGTRATLLSELRGFRAVVTVESSGSLL
        KI RHYLF EKC IFTDHK LKYIFD+KELNLRQRRW                           KSRLPKSALCG R  LL+ELRG +AVVT E SGSLL
Subjt:  KIRRHYLFSEKCPIFTDHKCLKYIFDKKELNLRQRRW---------------------------KSRLPKSALCGTRATLLSELRGFRAVVTVESSGSLL

Query:  AQFQVRSSLVAEIVRRQPKDR---------------------------------------------------------STKMYKTLKKTYWWPGMKREIA
        AQFQVRSSLV EIVRRQ +D                                                          STKMY+TLKKTYWW GMK+EIA
Subjt:  AQFQVRSSLVAEIVRRQPKDR---------------------------------------------------------STKMYKTLKKTYWWPGMKREIA

Query:  EYVDKCLICQQVKPVRKSAGGLHNPLPVLEWK----------------------------LTKTERFIPIKVTSTLDQLTKLYVDKIVSQCGVPVSIVSD
        EYVD+CLICQQVKPVR+  GG  NPLPV EWK                            LTKT RFIPIK+TSTLDQL +LYVDKIVSQ GVPVSIVSD
Subjt:  EYVDKCLICQQVKPVRKSAGGLHNPLPVLEWK----------------------------LTKTERFIPIKVTSTLDQLTKLYVDKIVSQCGVPVSIVSD

Query:  RDPRFTSKFWP-----------------------------------------------------------------------------------------
        RDPRFTSKFWP                                                                                         
Subjt:  RDPRFTSKFWP-----------------------------------------------------------------------------------------

Query:  ------NKRQRDLEFQVGDQVFLKLSPWKGVIRFGRK
              +KR+R+LEFQVGDQVFLKLSPW+GVIRFGRK
Subjt:  ------NKRQRDLEFQVGDQVFLKLSPWKGVIRFGRK

KAA0041108.1 reverse transcriptase [Cucumis melo var. makuwa]1.7e-14849.1Show/hide
Query:  KTHEEHQRIILQTLRDKQLYAKFSKCEFWLKQVVFLGHVVSADGVSVRIRIASAHVSYTPSVSVDPQKVEVVVNWETPASATEVRSFLGLAGYYGRFVED
        ++HEEH RI+LQTLR+KQLYAKFSKCEFWL+QVVFLGHVVSA G                 VSVDPQKVE VVNWE P SATEVRSFLGLAGYY RF+ED
Subjt:  KTHEEHQRIILQTLRDKQLYAKFSKCEFWLKQVVFLGHVVSADGVSVRIRIASAHVSYTPSVSVDPQKVEVVVNWETPASATEVRSFLGLAGYYGRFVED

Query:  FSRLALLLTALTRNNAKFEWSDK-----------------------WKEYVIYCYASRQGLGCVLMQEGKVIAYASRQLKKHECDYATHDLELTTIVLAL
        FSRLAL LTALTR N KFEWSDK                        K+YVIYC ASR GLGCVLMQ+G VIAYASRQLK+HEC+Y THDLEL  +VLAL
Subjt:  FSRLALLLTALTRNNAKFEWSDK-----------------------WKEYVIYCYASRQGLGCVLMQEGKVIAYASRQLKKHECDYATHDLELTTIVLAL

Query:  KIRRHYLFSEKCPIFTDHKCLKYIFDKKELNLRQRRW---------------------------KSRLPKSALCGTRATLLSELRGFRAVVTVESSGSLL
        KI RHYLF EKC IFTDHK LKYIFD+KELNLRQRRW                           KSRLPKSALCG R  LL+ELRG +AVVT E SGSLL
Subjt:  KIRRHYLFSEKCPIFTDHKCLKYIFDKKELNLRQRRW---------------------------KSRLPKSALCGTRATLLSELRGFRAVVTVESSGSLL

Query:  AQFQVRSSLVAEIVRRQPKDR---------------------------------------------------------STKMYKTLKKTYWWPGMKREIA
        AQFQVRSSLV EIVRRQ +D                                                          STKMY+TLKKTYWW GMK+EIA
Subjt:  AQFQVRSSLVAEIVRRQPKDR---------------------------------------------------------STKMYKTLKKTYWWPGMKREIA

Query:  EYVDKCLICQQVKPVRKSAGGLHNPLPVLEWK----------------------------LTKTERFIPIKVTSTLDQLTKLYVDKIVSQCGVPVSIVSD
        EYVD+CLICQQVKPVR+  GG  NPLPV EWK                            LTKT RFIPIK+TSTLDQL +LYVDKIVSQ GVPVSIVSD
Subjt:  EYVDKCLICQQVKPVRKSAGGLHNPLPVLEWK----------------------------LTKTERFIPIKVTSTLDQLTKLYVDKIVSQCGVPVSIVSD

Query:  RDPRFTSKFWP-----------------------------------------------------------------------------------------
        RDPRFTSKFWP                                                                                         
Subjt:  RDPRFTSKFWP-----------------------------------------------------------------------------------------

Query:  ----------------------------------NKRQRDLEFQVGDQVFLKLSPWKGVIRFGRKE
                                          +KR+R+LEFQVGDQVFLKLSPW+GVIRFGRK+
Subjt:  ----------------------------------NKRQRDLEFQVGDQVFLKLSPWKGVIRFGRKE

KAA0050493.1 DNA/RNA polymerases superfamily protein [Cucumis melo var. makuwa]3.0e-14849.17Show/hide
Query:  KTHEEHQRIILQTLRDKQLYAKFSKCEFWLKQVVFLGHVVSADGVSVRIRIASAHVSYTPSVSVDPQKVEVVVNWETPASATEVRSFLGLAGYYGRFVED
        ++HEEH RI+LQTLR+KQLYAKFSKCEFWL+QVVFLGHVVSA G                 VSVDPQKVE VVNWE P SATEVRSFLGLAGYY RF+ED
Subjt:  KTHEEHQRIILQTLRDKQLYAKFSKCEFWLKQVVFLGHVVSADGVSVRIRIASAHVSYTPSVSVDPQKVEVVVNWETPASATEVRSFLGLAGYYGRFVED

Query:  FSRLALLLTALTRNNAKFEWSDK-----------------------WKEYVIYCYASRQGLGCVLMQEGKVIAYASRQLKKHECDYATHDLELTTIVLAL
        FSRLAL LTALTR N KFEWSDK                        K+YVIYC ASR GLGCVLMQ+G VIAYASRQLK+HEC+Y THDLEL  +VLAL
Subjt:  FSRLALLLTALTRNNAKFEWSDK-----------------------WKEYVIYCYASRQGLGCVLMQEGKVIAYASRQLKKHECDYATHDLELTTIVLAL

Query:  KIRRHYLFSEKCPIFTDHKCLKYIFDKKELNLRQRRW---------------------------KSRLPKSALCGTRATLLSELRGFRAVVTVESSGSLL
        KI RHYLF EKC IFTDHK LKYIFD+KELNLRQRRW                           KSRLPKSALCG R  LL+ELRG +AVVT E SGSLL
Subjt:  KIRRHYLFSEKCPIFTDHKCLKYIFDKKELNLRQRRW---------------------------KSRLPKSALCGTRATLLSELRGFRAVVTVESSGSLL

Query:  AQFQVRSSLVAEIVRRQPKDR---------------------------------------------------------STKMYKTLKKTYWWPGMKREIA
        AQFQVRSSLV EIVRRQ +D                                                          STKMY+TLKKTYWW GMK+EIA
Subjt:  AQFQVRSSLVAEIVRRQPKDR---------------------------------------------------------STKMYKTLKKTYWWPGMKREIA

Query:  EYVDKCLICQQVKPVRKSAGGLHNPLPVLEWK----------------------------LTKTERFIPIKVTSTLDQLTKLYVDKIVSQCGVPVSIVSD
        EYVD+CLICQQVKPVR+  GG  NPLPV EWK                            LTKT RFIPIK+TSTLDQL +LYVDKIVSQ GVPVSIVSD
Subjt:  EYVDKCLICQQVKPVRKSAGGLHNPLPVLEWK----------------------------LTKTERFIPIKVTSTLDQLTKLYVDKIVSQCGVPVSIVSD

Query:  RDPRFTSKFWP-----------------------------------------------------------------------------------------
        RDPRFTSKFWP                                                                                         
Subjt:  RDPRFTSKFWP-----------------------------------------------------------------------------------------

Query:  ----------------------------------NKRQRDLEFQVGDQVFLKLSPWKGVIRFGRK
                                          +KR+R+LEFQVGDQVFLKLSPW+GVIRFGRK
Subjt:  ----------------------------------NKRQRDLEFQVGDQVFLKLSPWKGVIRFGRK

KAA0050527.1 DNA/RNA polymerases superfamily protein [Cucumis melo var. makuwa]3.0e-14849.17Show/hide
Query:  KTHEEHQRIILQTLRDKQLYAKFSKCEFWLKQVVFLGHVVSADGVSVRIRIASAHVSYTPSVSVDPQKVEVVVNWETPASATEVRSFLGLAGYYGRFVED
        ++HEEH RI+LQTLR+KQLYAKFSKCEFWL+QVVFLGHVVSA G                 VSVDPQKVE VVNWE P SATEVRSFLGLAGYY RF+ED
Subjt:  KTHEEHQRIILQTLRDKQLYAKFSKCEFWLKQVVFLGHVVSADGVSVRIRIASAHVSYTPSVSVDPQKVEVVVNWETPASATEVRSFLGLAGYYGRFVED

Query:  FSRLALLLTALTRNNAKFEWSDK-----------------------WKEYVIYCYASRQGLGCVLMQEGKVIAYASRQLKKHECDYATHDLELTTIVLAL
        FSRLAL LTALTR N KFEWSDK                        K+YVIYC ASR GLGCVLMQ+G VIAYASRQLK+HEC+Y THDLEL  +VLAL
Subjt:  FSRLALLLTALTRNNAKFEWSDK-----------------------WKEYVIYCYASRQGLGCVLMQEGKVIAYASRQLKKHECDYATHDLELTTIVLAL

Query:  KIRRHYLFSEKCPIFTDHKCLKYIFDKKELNLRQRRW---------------------------KSRLPKSALCGTRATLLSELRGFRAVVTVESSGSLL
        KI RHYLF EKC IFTDHK LKYIFD+KELNLRQRRW                           KSRLPKSALCG R  LL+ELRG +AVVT E SGSLL
Subjt:  KIRRHYLFSEKCPIFTDHKCLKYIFDKKELNLRQRRW---------------------------KSRLPKSALCGTRATLLSELRGFRAVVTVESSGSLL

Query:  AQFQVRSSLVAEIVRRQPKDR---------------------------------------------------------STKMYKTLKKTYWWPGMKREIA
        AQFQVRSSLV EIVRRQ +D                                                          STKMY+TLKKTYWW GMK+EIA
Subjt:  AQFQVRSSLVAEIVRRQPKDR---------------------------------------------------------STKMYKTLKKTYWWPGMKREIA

Query:  EYVDKCLICQQVKPVRKSAGGLHNPLPVLEWK----------------------------LTKTERFIPIKVTSTLDQLTKLYVDKIVSQCGVPVSIVSD
        EYVD+CLICQQVKPVR+  GG  NPLPV EWK                            LTKT RFIPIK+TSTLDQL +LYVDKIVSQ GVPVSIVSD
Subjt:  EYVDKCLICQQVKPVRKSAGGLHNPLPVLEWK----------------------------LTKTERFIPIKVTSTLDQLTKLYVDKIVSQCGVPVSIVSD

Query:  RDPRFTSKFWP-----------------------------------------------------------------------------------------
        RDPRFTSKFWP                                                                                         
Subjt:  RDPRFTSKFWP-----------------------------------------------------------------------------------------

Query:  ----------------------------------NKRQRDLEFQVGDQVFLKLSPWKGVIRFGRK
                                          +KR+R+LEFQVGDQVFLKLSPW+GVIRFGRK
Subjt:  ----------------------------------NKRQRDLEFQVGDQVFLKLSPWKGVIRFGRK

TrEMBL top hitse value%identityAlignment
A0A5A7SNE6 DNA/RNA polymerases superfamily protein2.2e-14950.79Show/hide
Query:  KTHEEHQRIILQTLRDKQLYAKFSKCEFWLKQVVFLGHVVSADGVSVRIRIASAHVSYTPSVSVDPQKVEVVVNWETPASATEVRSFLGLAGYYGRFVED
        ++HEEH RI+LQTLRDKQLYAKFSKCEFWL+QVVF GHVVSA G                 VSVDPQKVE +VNWE P SATEVRSFLGLAG Y RF+ED
Subjt:  KTHEEHQRIILQTLRDKQLYAKFSKCEFWLKQVVFLGHVVSADGVSVRIRIASAHVSYTPSVSVDPQKVEVVVNWETPASATEVRSFLGLAGYYGRFVED

Query:  FSRLALLLTALTRNNAKFEWSDK-----------------------WKEYVIYCYASRQGLGCVLMQEGKVIAYASRQLKKHECDYATHDLELTTIVLAL
        FSRLAL LTALTR N KFEWSDK                        K YVIYC  SR GLGCVLM++G VIAYASRQLK+HEC+Y THDLEL  +VLAL
Subjt:  FSRLALLLTALTRNNAKFEWSDK-----------------------WKEYVIYCYASRQGLGCVLMQEGKVIAYASRQLKKHECDYATHDLELTTIVLAL

Query:  KIRRHYLFSEKCPIFTDHKCLKYIFDKKELNLRQRRW---------------------------KSRLPKSALCGTRATLLSELRGFRAVVTVESSGSLL
        KI RHYLF EKC IFTDHK LKYIFD+KELNLRQRRW                           KSRLPKSALCG R  LL+ELRG +AVVT E SGSLL
Subjt:  KIRRHYLFSEKCPIFTDHKCLKYIFDKKELNLRQRRW---------------------------KSRLPKSALCGTRATLLSELRGFRAVVTVESSGSLL

Query:  AQFQVRSSLVAEIVRRQPKDR---------------------------------------------------------STKMYKTLKKTYWWPGMKREIA
        AQFQVRSSLV EIV+RQ +D                                                          STKMY+TLKKTYWW GMK+EIA
Subjt:  AQFQVRSSLVAEIVRRQPKDR---------------------------------------------------------STKMYKTLKKTYWWPGMKREIA

Query:  EYVDKCLICQQVKPVRKSAGGLHNPLPVLEWK----------------------------LTKTERFIPIKVTSTLDQLTKLYVDKIVSQCGVPVSIVSD
        EYVD+CLICQQVKPVR+  GG  NPLPV EWK                            LTKT +FIPIK+TSTLDQL +LYVDKIVSQ GVPVSIVSD
Subjt:  EYVDKCLICQQVKPVRKSAGGLHNPLPVLEWK----------------------------LTKTERFIPIKVTSTLDQLTKLYVDKIVSQCGVPVSIVSD

Query:  RDPRFTSKFWP-----------------------------------------------------------------------------------------
        RDPRFTSKFWP                                                                                         
Subjt:  RDPRFTSKFWP-----------------------------------------------------------------------------------------

Query:  -NKRQRDLEFQVGDQVFLKLSPWKGVIRFGRK
         +KR+R+LEFQVGDQVFLKLSPW+GVIRFGRK
Subjt:  -NKRQRDLEFQVGDQVFLKLSPWKGVIRFGRK

A0A5A7T3K4 Reverse transcriptase8.1e-15251.33Show/hide
Query:  KTHEEHQRIILQTLRDKQLYAKFSKCEFWLKQVVFLGHVVSADGVSVRIRIASAHVSYTPSVSVDPQKVEVVVNWETPASATEVRSFLGLAGYYGRFVED
        ++HEEH RI+LQTLR+KQLYAKFSKCEFWL+QVVFLGHVVSA G                 VSVDPQKVE VVNWE P SATEVRSFLGLAGYY RF+ED
Subjt:  KTHEEHQRIILQTLRDKQLYAKFSKCEFWLKQVVFLGHVVSADGVSVRIRIASAHVSYTPSVSVDPQKVEVVVNWETPASATEVRSFLGLAGYYGRFVED

Query:  FSRLALLLTALTRNNAKFEWSDK-----------------------WKEYVIYCYASRQGLGCVLMQEGKVIAYASRQLKKHECDYATHDLELTTIVLAL
        FSRLAL LTALTR N KFEWSDK                        K+YVIYC ASR GLGCVLMQ+G VIAYASRQLK+HEC+Y THDLEL  +VLAL
Subjt:  FSRLALLLTALTRNNAKFEWSDK-----------------------WKEYVIYCYASRQGLGCVLMQEGKVIAYASRQLKKHECDYATHDLELTTIVLAL

Query:  KIRRHYLFSEKCPIFTDHKCLKYIFDKKELNLRQRRW---------------------------KSRLPKSALCGTRATLLSELRGFRAVVTVESSGSLL
        KI RHYLF EKC IFTDHK LKYIFD+KELNLRQRRW                           KSRLPKSALCG R  LL+ELRG +AVVT E SGSLL
Subjt:  KIRRHYLFSEKCPIFTDHKCLKYIFDKKELNLRQRRW---------------------------KSRLPKSALCGTRATLLSELRGFRAVVTVESSGSLL

Query:  AQFQVRSSLVAEIVRRQPKDR---------------------------------------------------------STKMYKTLKKTYWWPGMKREIA
        AQFQVRSSLV EIVRRQ +D                                                          STKMY+TLKKTYWW GMK+EIA
Subjt:  AQFQVRSSLVAEIVRRQPKDR---------------------------------------------------------STKMYKTLKKTYWWPGMKREIA

Query:  EYVDKCLICQQVKPVRKSAGGLHNPLPVLEWK----------------------------LTKTERFIPIKVTSTLDQLTKLYVDKIVSQCGVPVSIVSD
        EYVD+CLICQQVKPVR+  GG  NPLPV EWK                            LTKT RFIPIK+TSTLDQL +LYVDKIVSQ GVPVSIVSD
Subjt:  EYVDKCLICQQVKPVRKSAGGLHNPLPVLEWK----------------------------LTKTERFIPIKVTSTLDQLTKLYVDKIVSQCGVPVSIVSD

Query:  RDPRFTSKFWP-----------------------------------------------------------------------------------------
        RDPRFTSKFWP                                                                                         
Subjt:  RDPRFTSKFWP-----------------------------------------------------------------------------------------

Query:  ------NKRQRDLEFQVGDQVFLKLSPWKGVIRFGRK
              +KR+R+LEFQVGDQVFLKLSPW+GVIRFGRK
Subjt:  ------NKRQRDLEFQVGDQVFLKLSPWKGVIRFGRK

A0A5A7TDR2 Reverse transcriptase8.4e-14949.1Show/hide
Query:  KTHEEHQRIILQTLRDKQLYAKFSKCEFWLKQVVFLGHVVSADGVSVRIRIASAHVSYTPSVSVDPQKVEVVVNWETPASATEVRSFLGLAGYYGRFVED
        ++HEEH RI+LQTLR+KQLYAKFSKCEFWL+QVVFLGHVVSA G                 VSVDPQKVE VVNWE P SATEVRSFLGLAGYY RF+ED
Subjt:  KTHEEHQRIILQTLRDKQLYAKFSKCEFWLKQVVFLGHVVSADGVSVRIRIASAHVSYTPSVSVDPQKVEVVVNWETPASATEVRSFLGLAGYYGRFVED

Query:  FSRLALLLTALTRNNAKFEWSDK-----------------------WKEYVIYCYASRQGLGCVLMQEGKVIAYASRQLKKHECDYATHDLELTTIVLAL
        FSRLAL LTALTR N KFEWSDK                        K+YVIYC ASR GLGCVLMQ+G VIAYASRQLK+HEC+Y THDLEL  +VLAL
Subjt:  FSRLALLLTALTRNNAKFEWSDK-----------------------WKEYVIYCYASRQGLGCVLMQEGKVIAYASRQLKKHECDYATHDLELTTIVLAL

Query:  KIRRHYLFSEKCPIFTDHKCLKYIFDKKELNLRQRRW---------------------------KSRLPKSALCGTRATLLSELRGFRAVVTVESSGSLL
        KI RHYLF EKC IFTDHK LKYIFD+KELNLRQRRW                           KSRLPKSALCG R  LL+ELRG +AVVT E SGSLL
Subjt:  KIRRHYLFSEKCPIFTDHKCLKYIFDKKELNLRQRRW---------------------------KSRLPKSALCGTRATLLSELRGFRAVVTVESSGSLL

Query:  AQFQVRSSLVAEIVRRQPKDR---------------------------------------------------------STKMYKTLKKTYWWPGMKREIA
        AQFQVRSSLV EIVRRQ +D                                                          STKMY+TLKKTYWW GMK+EIA
Subjt:  AQFQVRSSLVAEIVRRQPKDR---------------------------------------------------------STKMYKTLKKTYWWPGMKREIA

Query:  EYVDKCLICQQVKPVRKSAGGLHNPLPVLEWK----------------------------LTKTERFIPIKVTSTLDQLTKLYVDKIVSQCGVPVSIVSD
        EYVD+CLICQQVKPVR+  GG  NPLPV EWK                            LTKT RFIPIK+TSTLDQL +LYVDKIVSQ GVPVSIVSD
Subjt:  EYVDKCLICQQVKPVRKSAGGLHNPLPVLEWK----------------------------LTKTERFIPIKVTSTLDQLTKLYVDKIVSQCGVPVSIVSD

Query:  RDPRFTSKFWP-----------------------------------------------------------------------------------------
        RDPRFTSKFWP                                                                                         
Subjt:  RDPRFTSKFWP-----------------------------------------------------------------------------------------

Query:  ----------------------------------NKRQRDLEFQVGDQVFLKLSPWKGVIRFGRKE
                                          +KR+R+LEFQVGDQVFLKLSPW+GVIRFGRK+
Subjt:  ----------------------------------NKRQRDLEFQVGDQVFLKLSPWKGVIRFGRKE

A0A5A7U2V7 Reverse transcriptase1.4e-14849.17Show/hide
Query:  KTHEEHQRIILQTLRDKQLYAKFSKCEFWLKQVVFLGHVVSADGVSVRIRIASAHVSYTPSVSVDPQKVEVVVNWETPASATEVRSFLGLAGYYGRFVED
        ++HEEH RI+LQTLR+KQLYAKFSKCEFWL+QVVFLGHVVSA G                 VSVDPQKVE VVNWE P SATEVRSFLGLAGYY RF+ED
Subjt:  KTHEEHQRIILQTLRDKQLYAKFSKCEFWLKQVVFLGHVVSADGVSVRIRIASAHVSYTPSVSVDPQKVEVVVNWETPASATEVRSFLGLAGYYGRFVED

Query:  FSRLALLLTALTRNNAKFEWSDK-----------------------WKEYVIYCYASRQGLGCVLMQEGKVIAYASRQLKKHECDYATHDLELTTIVLAL
        FSRLAL LTALTR N KFEWSDK                        K+YVIYC ASR GLGCVLMQ+G VIAYASRQLK+HEC+Y THDLEL  +VLAL
Subjt:  FSRLALLLTALTRNNAKFEWSDK-----------------------WKEYVIYCYASRQGLGCVLMQEGKVIAYASRQLKKHECDYATHDLELTTIVLAL

Query:  KIRRHYLFSEKCPIFTDHKCLKYIFDKKELNLRQRRW---------------------------KSRLPKSALCGTRATLLSELRGFRAVVTVESSGSLL
        KI RHYLF EKC IFTDHK LKYIFD+KELNLRQRRW                           KSRLPKSALCG R  LL+ELRG +AVVT E SGSLL
Subjt:  KIRRHYLFSEKCPIFTDHKCLKYIFDKKELNLRQRRW---------------------------KSRLPKSALCGTRATLLSELRGFRAVVTVESSGSLL

Query:  AQFQVRSSLVAEIVRRQPKDR---------------------------------------------------------STKMYKTLKKTYWWPGMKREIA
        AQFQVRSSLV EIVRRQ +D                                                          STKMY+TLKKTYWW GMK+EIA
Subjt:  AQFQVRSSLVAEIVRRQPKDR---------------------------------------------------------STKMYKTLKKTYWWPGMKREIA

Query:  EYVDKCLICQQVKPVRKSAGGLHNPLPVLEWK----------------------------LTKTERFIPIKVTSTLDQLTKLYVDKIVSQCGVPVSIVSD
        EYVD+CLICQQVKPVR+  GG  NPLPV EWK                            LTKT RFIPIK+TSTLDQL +LYVDKIVSQ GVPVSIVSD
Subjt:  EYVDKCLICQQVKPVRKSAGGLHNPLPVLEWK----------------------------LTKTERFIPIKVTSTLDQLTKLYVDKIVSQCGVPVSIVSD

Query:  RDPRFTSKFWP-----------------------------------------------------------------------------------------
        RDPRFTSKFWP                                                                                         
Subjt:  RDPRFTSKFWP-----------------------------------------------------------------------------------------

Query:  ----------------------------------NKRQRDLEFQVGDQVFLKLSPWKGVIRFGRK
                                          +KR+R+LEFQVGDQVFLKLSPW+GVIRFGRK
Subjt:  ----------------------------------NKRQRDLEFQVGDQVFLKLSPWKGVIRFGRK

A0A5D3BHI1 Reverse transcriptase1.4e-14849.17Show/hide
Query:  KTHEEHQRIILQTLRDKQLYAKFSKCEFWLKQVVFLGHVVSADGVSVRIRIASAHVSYTPSVSVDPQKVEVVVNWETPASATEVRSFLGLAGYYGRFVED
        ++HEEH RI+LQTLR+KQLYAKFSKCEFWL+QVVFLGHVVSA G                 VSVDPQKVE VVNWE P SATEVRSFLGLAGYY RF+ED
Subjt:  KTHEEHQRIILQTLRDKQLYAKFSKCEFWLKQVVFLGHVVSADGVSVRIRIASAHVSYTPSVSVDPQKVEVVVNWETPASATEVRSFLGLAGYYGRFVED

Query:  FSRLALLLTALTRNNAKFEWSDK-----------------------WKEYVIYCYASRQGLGCVLMQEGKVIAYASRQLKKHECDYATHDLELTTIVLAL
        FSRLAL LTALTR N KFEWSDK                        K+YVIYC ASR GLGCVLMQ+G VIAYASRQLK+HEC+Y THDLEL  +VLAL
Subjt:  FSRLALLLTALTRNNAKFEWSDK-----------------------WKEYVIYCYASRQGLGCVLMQEGKVIAYASRQLKKHECDYATHDLELTTIVLAL

Query:  KIRRHYLFSEKCPIFTDHKCLKYIFDKKELNLRQRRW---------------------------KSRLPKSALCGTRATLLSELRGFRAVVTVESSGSLL
        KI RHYLF EKC IFTDHK LKYIFD+KELNLRQRRW                           KSRLPKSALCG R  LL+ELRG +AVVT E SGSLL
Subjt:  KIRRHYLFSEKCPIFTDHKCLKYIFDKKELNLRQRRW---------------------------KSRLPKSALCGTRATLLSELRGFRAVVTVESSGSLL

Query:  AQFQVRSSLVAEIVRRQPKDR---------------------------------------------------------STKMYKTLKKTYWWPGMKREIA
        AQFQVRSSLV EIVRRQ +D                                                          STKMY+TLKKTYWW GMK+EIA
Subjt:  AQFQVRSSLVAEIVRRQPKDR---------------------------------------------------------STKMYKTLKKTYWWPGMKREIA

Query:  EYVDKCLICQQVKPVRKSAGGLHNPLPVLEWK----------------------------LTKTERFIPIKVTSTLDQLTKLYVDKIVSQCGVPVSIVSD
        EYVD+CLICQQVKPVR+  GG  NPLPV EWK                            LTKT RFIPIK+TSTLDQL +LYVDKIVSQ GVPVSIVSD
Subjt:  EYVDKCLICQQVKPVRKSAGGLHNPLPVLEWK----------------------------LTKTERFIPIKVTSTLDQLTKLYVDKIVSQCGVPVSIVSD

Query:  RDPRFTSKFWP-----------------------------------------------------------------------------------------
        RDPRFTSKFWP                                                                                         
Subjt:  RDPRFTSKFWP-----------------------------------------------------------------------------------------

Query:  ----------------------------------NKRQRDLEFQVGDQVFLKLSPWKGVIRFGRK
                                          +KR+R+LEFQVGDQVFLKLSPW+GVIRFGRK
Subjt:  ----------------------------------NKRQRDLEFQVGDQVFLKLSPWKGVIRFGRK

SwissProt top hitse value%identityAlignment
P04323 Retrovirus-related Pol polyprotein from transposon 17.61.5e-2528.75Show/hide
Query:  HEEHQRIILQTLRDKQLYAKFSKCEFWLKQVVFLGHVVSADGVSVRIRIASAHVSYTPSVSVDPQKVEVVVNWETPASATEVRSFLGLAGYYGRFVEDFS
        H +   ++ + L    L  +  KCEF  ++  FLGHV++ DG                 +  +P+K+E +  +  P    E+++FLGL GYY +F+ +F+
Subjt:  HEEHQRIILQTLRDKQLYAKFSKCEFWLKQVVFLGHVVSADGVSVRIRIASAHVSYTPSVSVDPQKVEVVVNWETPASATEVRSFLGLAGYYGRFVEDFS

Query:  RLALLLTALTRNNAKFEWS------------------------DKWKEYVIYCYASRQGLGCVLMQEGKVIAYASRQLKKHECDYATHDLELTTIVLALK
         +A  +T   + N K + +                        D  K++ +   AS   LG VL Q+G  ++Y SR L +HE +Y+T + EL  IV A K
Subjt:  RLALLLTALTRNNAKFEWS------------------------DKWKEYVIYCYASRQGLGCVLMQEGKVIAYASRQLKKHECDYATHDLELTTIVLALK

Query:  IRRHYLFSEKCPIFTDHKCLKYIFDKKELNLRQRRWKSRL
          RHYL      I +DH+ L +++  K+ N +  RW+ +L
Subjt:  IRRHYLFSEKCPIFTDHKCLKYIFDKKELNLRQRRWKSRL

P10401 Retrovirus-related Pol polyprotein from transposon gypsy8.4e-2130.4Show/hide
Query:  HEEHQRIILQTLRDKQLYAKFSKCEFWLKQVVFLGHVVSADGVSVRIRIASAHVSYTPSVSVDPQKVEVVVNWETPASATEVRSFLGLAGYYGRFVEDFS
        H  H   +L+ L D  +     K  F+ + V +LG +VS DG                    DP+KV+ +  +  P    +VRSFLGLA YY  F++DF+
Subjt:  HEEHQRIILQTLRDKQLYAKFSKCEFWLKQVVFLGHVVSADGVSVRIRIASAHVSYTPSVSVDPQKVEVVVNWETPASATEVRSFLGLAGYYGRFVEDFS

Query:  RLALLLTAL----------------------TRNNA-------------KFEWSDKWKEYVIYCYASRQGLGCVLMQEGKVIAYASRQLKKHECDYATHD
         +A  +T +                      T+ NA               ++ D  K + +   AS  G+G VL QEG+ I   SR LK+ E +YAT++
Subjt:  RLALLLTAL----------------------TRNNA-------------KFEWSDKWKEYVIYCYASRQGLGCVLMQEGKVIAYASRQLKKHECDYATHD

Query:  LELTTIVLALKIRRHYLF-SEKCPIFTDHKCLKYIFDKKELNLRQRRWKS
         EL  IV AL   +++L+ S +  IFTDH+ L +    +  N + +RWKS
Subjt:  LELTTIVLALKIRRHYLF-SEKCPIFTDHKCLKYIFDKKELNLRQRRWKS

P20825 Retrovirus-related Pol polyprotein from transposon 2973.3e-2530Show/hide
Query:  HEEHQRIILQTLRDKQLYAKFSKCEFWLKQVVFLGHVVSADGVSVRIRIASAHVSYTPSVSVDPQKVEVVVNWETPASATEVRSFLGLAGYYGRFVEDFS
        H    +++   L D  L  +  KCEF  K+  FLGH+V+ DG                 +  +P KV+ +V++  P    E+R+FLGL GYY +F+ +++
Subjt:  HEEHQRIILQTLRDKQLYAKFSKCEFWLKQVVFLGHVVSADGVSVRIRIASAHVSYTPSVSVDPQKVEVVVNWETPASATEVRSFLGLAGYYGRFVEDFS

Query:  RLALLLTALTRNNAKFEWS------------------------DKWKEYVIYCYASRQGLGCVLMQEGKVIAYASRQLKKHECDYATHDLELTTIVLALK
         +A  +T+  +   K +                          D  K++V+   AS   LG VL Q G  I++ SR L  HE +Y+  + EL  IV A K
Subjt:  RLALLLTALTRNNAKFEWS------------------------DKWKEYVIYCYASRQGLGCVLMQEGKVIAYASRQLKKHECDYATHDLELTTIVLALK

Query:  IRRHYLFSEKCPIFTDHKCLKYIFDKKELNLRQRRWKSRL
          RHYL   +  I +DH+ L+++ + KE   +  RW+ RL
Subjt:  IRRHYLFSEKCPIFTDHKCLKYIFDKKELNLRQRRWKSRL

Q8I7P9 Retrovirus-related Pol polyprotein from transposon opus4.9e-2128.91Show/hide
Query:  THEEHQRIILQTLRDKQLYAKFSKCEFWLKQVVFLGHVVSADGVSVRIRIASAHVSYTPSVSVDPQKVEVVVNWETPASATEVRSFLGLAGYYGRFVEDF
        TH ++ R++L +L    L     K  F   QV FLG++V+ADG                 +  DP+KV  +     P S  E++ FLG+  YY +F++D+
Subjt:  THEEHQRIILQTLRDKQLYAKFSKCEFWLKQVVFLGHVVSADGVSVRIRIASAHVSYTPSVSVDPQKVEVVVNWETPASATEVRSFLGLAGYYGRFVEDF

Query:  SRLALLLTALTRN---NAKFEWSDK-------------------------------WKEYVIYCYASRQGLGCVLMQE----GKVIAYASRQLKKHECDY
        +++A  LT LTR    N K   S K                                K + +   AS   +G VL Q+     + IAY SR L K E +Y
Subjt:  SRLALLLTALTRN---NAKFEWSDK-------------------------------WKEYVIYCYASRQGLGCVLMQE----GKVIAYASRQLKKHECDY

Query:  ATHDLELTTIVLALKIRRHYLF-SEKCPIFTDHKCLKYIFDKKELNLRQRRWKSRL
        AT + E+  I+ +L   R YL+ +    ++TDH+ L +    +  N + +RWK+R+
Subjt:  ATHDLELTTIVLALKIRRHYLF-SEKCPIFTDHKCLKYIFDKKELNLRQRRWKSRL

Q99315 Transposon Ty3-G Gag-Pol polyprotein1.8e-1521.87Show/hide
Query:  HEEHQRIILQTLRDKQLYAKFSKCEFWLKQVVFLGHVVSADGVSVRIRIASAHVSYTPSVSVDPQKVEVVVNWETPASATEVRSFLGLAGYYGRFVEDFS
        H +H   +L+ L+++ L  K  KC+F  ++  FLG+ +                     ++    K   + ++ TP +  + + FLG+  YY RF+ + S
Subjt:  HEEHQRIILQTLRDKQLYAKFSKCEFWLKQVVFLGHVVSADGVSVRIRIASAHVSYTPSVSVDPQKVEVVVNWETPASATEVRSFLGLAGYYGRFVEDFS

Query:  RLALLLTALTRNNAKFEWS-------DKWKE----------------YVIYCYASRQGLGCVLMQEGK------VIAYASRQLKKHECDYATHDLELTTI
        ++A  +     +  K +W+       DK K+                Y +   AS+ G+G VL +         V+ Y S+ L+  + +Y   +LEL  I
Subjt:  RLALLLTALTRNNAKFEWS-------DKWKE----------------YVIYCYASRQGLGCVLMQEGK------VIAYASRQLKKHECDYATHDLELTTI

Query:  VLALKIRRHYLFSEKCPIFTDHKCLKYIFDKKELNLRQRR---------------------------------------------WKSRLPKSALCGTRA
        + AL   R+ L  +   + TDH  L  + +K E   R +R                                             WKS      LC    
Subjt:  VLALKIRRHYLFSEKCPIFTDHKCLKYIFDKKELNLRQRR---------------------------------------------WKSRLPKSALCGTRA

Query:  TLLSEL-------RGFRAVVTVESSGSLLAQFQVRSSLVAEIVRRQPK--------DRSTKMYK-------------TLKK---TYWWPGMKREIAEYVD
          + EL           A  + +    L   F+   SL  E++  Q +        +   ++Y              TL K    Y+WP ++  I +Y+ 
Subjt:  TLLSEL-------RGFRAVVTVESSGSLLAQFQVRSSLVAEIVRRQPK--------DRSTKMYK-------------TLKK---TYWWPGMKREIAEYVD

Query:  KCLICQQVKPVRKSAGGLHNPLPVLE--W--------------------------KLTKTERFIPIKVTSTLDQLTKLYVDKIVSQCGVPVSIVSDRDPR
         C+ CQ +K  R    GL  PLP+ E  W                          + +K   FI  + T    QL  L    I S  G P +I SDRD R
Subjt:  KCLICQQVKPVRKSAGGLHNPLPVLE--W--------------------------KLTKTERFIPIKVTSTLDQLTKLYVDKIVSQCGVPVSIVSDRDPR

Query:  FTS
         T+
Subjt:  FTS

Arabidopsis top hitse value%identityAlignment
ATMG00860.1 DNA/RNA polymerases superfamily protein3.2e-1534.09Show/hide
Query:  HQRIILQTLRDKQLYAKFSKCEFWLKQVVFLGHVVSADGVSVRIRIASAHVSYTPSVSVDPQKVEVVVNWETPASATEVRSFLGLAGYYGRFVEDFSRLA
        H  ++LQ     Q YA   KC F   Q+ +LGH                H+     VS DP K+E +V W  P + TE+R FLGL GYY RFV+++ ++ 
Subjt:  HQRIILQTLRDKQLYAKFSKCEFWLKQVVFLGHVVSADGVSVRIRIASAHVSYTPSVSVDPQKVEVVVNWETPASATEVRSFLGLAGYYGRFVEDFSRLA

Query:  LLLTALTRNNAKFEWSDKWKEYVIYCYASRQG
          LT L + N     S KW E     + + +G
Subjt:  LLLTALTRNNAKFEWSDKWKEYVIYCYASRQG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAAACCCATGAGGAACATCAGAGGATTATTCTACAGACACTACGTGATAAACAGTTGTACGCTAAGTTCAGCAAATGTGAGTTCTGGTTGAAACAAGTAGTGTTCTT
AGGGCATGTAGTTTCAGCGGACGGAGTTAGTGTTAGGATCCGAATTGCTTCCGCTCATGTATCGTACACTCCTTCAGTTAGTGTTGATCCACAGAAAGTGGAAGTTGTTG
TCAACTGGGAGACACCAGCTAGTGCAACAGAGGTACGTAGTTTCCTAGGCCTAGCAGGATACTACGGACGTTTTGTTGAGGATTTCTCACGATTAGCATTACTCTTGACA
GCTTTGACAAGGAATAATGCTAAGTTTGAGTGGTCAGATAAATGGAAGGAATATGTGATCTATTGTTACGCTTCGAGGCAAGGATTAGGTTGTGTGCTTATGCAGGAAGG
AAAGGTAATAGCTTATGCTTCAAGGCAGTTGAAGAAGCATGAGTGTGATTACGCTACCCATGATCTTGAGCTAACAACAATTGTTTTAGCACTGAAGATCCGGAGACATT
ATTTATTTAGTGAGAAGTGCCCTATTTTCACAGATCATAAGTGTCTGAAGTATATCTTTGATAAAAAAGAACTGAATCTGAGACAGAGGCGATGGAAGTCGAGACTTCCG
AAGAGTGCCTTGTGTGGTACTCGAGCAACCTTGCTAAGTGAGTTAAGAGGTTTCAGGGCAGTTGTGACTGTAGAGAGTTCAGGGAGTCTTTTAGCTCAATTTCAGGTTAG
GTCTTCTCTAGTAGCAGAGATTGTGAGAAGACAACCAAAGGATAGAAGCACCAAGATGTACAAAACTCTGAAGAAAACTTATTGGTGGCCTGGTATGAAGCGAGAGATTG
CTGAATATGTTGATAAATGTTTGATCTGCCAACAGGTTAAACCAGTGAGAAAGAGCGCAGGAGGACTCCATAATCCACTGCCAGTGCTGGAGTGGAAACTCACCAAGACA
GAGCGGTTTATACCAATTAAAGTGACGTCTACATTAGATCAGCTAACTAAGTTATACGTCGACAAGATTGTGAGTCAATGTGGGGTGCCAGTGTCCATAGTTTCAGATAG
GGATCCAAGGTTTACTTCTAAGTTTTGGCCTAATAAGCGACAAAGAGACTTAGAATTTCAGGTTGGAGATCAAGTTTTCTTGAAGTTGTCTCCGTGGAAAGGTGTTATTC
GCTTTGGGAGGAAAGAGGAAAACAGAGAACATAATTTTCGTGTAAATCTGAAGGGTCTCTGGTTTTCTTTGGTTTTTCAGCCGTTCTGGGGTGTACGCCTTGAATTAGAA
TCTCAACTTGGTATGGGTATTGAAACTGGAAAATCTGGAAAAAGTGTGGAATGTGATTTAATTCTAGAGTTTAGAATTCATGAGCATAGAAAGACAAGACTATTTAAGGT
TCTGATGCTCAGGTTTGGTTTGTTTGATAGGCCAAGACGAAATAGGCCGAGTCTACATACCAAGGAACTAGCCAAGGCTGCACCCGAGGCCATTGTAGTCCCTATAGTTA
GGGAATTTTACGCAAACATGACGAAAGGATCCACTACTTCCTTTGTTAGGGGTAAAATGATTCTTTTCGACTCGGCCTCCATTAACCAATTCTTTGGCCTTCCCAATATT
TATCGGGATGGGTACAATGATTACGCAAAGGAGAAAGCGGCTCTGTTATTCGCTATAGCGACCGGTCATAGTGTAGACAGCTGCTGGGGTCGTGTGGGACCCCGTGAGGA
GATTAGTCATCCTGCAGCTGTGATTGATGGGAATTTCATCTCGATTAGACTTAGGGAGCCTAGACCTAGGATAGCCCGCCCCCCACCTCAACCCCAGCAGGAAGAACCAG
AAGAAGAGCTGCATGCTCAAGGAGAGCAGCCCCACTGA
mRNA sequenceShow/hide mRNA sequence
ATGAAAACCCATGAGGAACATCAGAGGATTATTCTACAGACACTACGTGATAAACAGTTGTACGCTAAGTTCAGCAAATGTGAGTTCTGGTTGAAACAAGTAGTGTTCTT
AGGGCATGTAGTTTCAGCGGACGGAGTTAGTGTTAGGATCCGAATTGCTTCCGCTCATGTATCGTACACTCCTTCAGTTAGTGTTGATCCACAGAAAGTGGAAGTTGTTG
TCAACTGGGAGACACCAGCTAGTGCAACAGAGGTACGTAGTTTCCTAGGCCTAGCAGGATACTACGGACGTTTTGTTGAGGATTTCTCACGATTAGCATTACTCTTGACA
GCTTTGACAAGGAATAATGCTAAGTTTGAGTGGTCAGATAAATGGAAGGAATATGTGATCTATTGTTACGCTTCGAGGCAAGGATTAGGTTGTGTGCTTATGCAGGAAGG
AAAGGTAATAGCTTATGCTTCAAGGCAGTTGAAGAAGCATGAGTGTGATTACGCTACCCATGATCTTGAGCTAACAACAATTGTTTTAGCACTGAAGATCCGGAGACATT
ATTTATTTAGTGAGAAGTGCCCTATTTTCACAGATCATAAGTGTCTGAAGTATATCTTTGATAAAAAAGAACTGAATCTGAGACAGAGGCGATGGAAGTCGAGACTTCCG
AAGAGTGCCTTGTGTGGTACTCGAGCAACCTTGCTAAGTGAGTTAAGAGGTTTCAGGGCAGTTGTGACTGTAGAGAGTTCAGGGAGTCTTTTAGCTCAATTTCAGGTTAG
GTCTTCTCTAGTAGCAGAGATTGTGAGAAGACAACCAAAGGATAGAAGCACCAAGATGTACAAAACTCTGAAGAAAACTTATTGGTGGCCTGGTATGAAGCGAGAGATTG
CTGAATATGTTGATAAATGTTTGATCTGCCAACAGGTTAAACCAGTGAGAAAGAGCGCAGGAGGACTCCATAATCCACTGCCAGTGCTGGAGTGGAAACTCACCAAGACA
GAGCGGTTTATACCAATTAAAGTGACGTCTACATTAGATCAGCTAACTAAGTTATACGTCGACAAGATTGTGAGTCAATGTGGGGTGCCAGTGTCCATAGTTTCAGATAG
GGATCCAAGGTTTACTTCTAAGTTTTGGCCTAATAAGCGACAAAGAGACTTAGAATTTCAGGTTGGAGATCAAGTTTTCTTGAAGTTGTCTCCGTGGAAAGGTGTTATTC
GCTTTGGGAGGAAAGAGGAAAACAGAGAACATAATTTTCGTGTAAATCTGAAGGGTCTCTGGTTTTCTTTGGTTTTTCAGCCGTTCTGGGGTGTACGCCTTGAATTAGAA
TCTCAACTTGGTATGGGTATTGAAACTGGAAAATCTGGAAAAAGTGTGGAATGTGATTTAATTCTAGAGTTTAGAATTCATGAGCATAGAAAGACAAGACTATTTAAGGT
TCTGATGCTCAGGTTTGGTTTGTTTGATAGGCCAAGACGAAATAGGCCGAGTCTACATACCAAGGAACTAGCCAAGGCTGCACCCGAGGCCATTGTAGTCCCTATAGTTA
GGGAATTTTACGCAAACATGACGAAAGGATCCACTACTTCCTTTGTTAGGGGTAAAATGATTCTTTTCGACTCGGCCTCCATTAACCAATTCTTTGGCCTTCCCAATATT
TATCGGGATGGGTACAATGATTACGCAAAGGAGAAAGCGGCTCTGTTATTCGCTATAGCGACCGGTCATAGTGTAGACAGCTGCTGGGGTCGTGTGGGACCCCGTGAGGA
GATTAGTCATCCTGCAGCTGTGATTGATGGGAATTTCATCTCGATTAGACTTAGGGAGCCTAGACCTAGGATAGCCCGCCCCCCACCTCAACCCCAGCAGGAAGAACCAG
AAGAAGAGCTGCATGCTCAAGGAGAGCAGCCCCACTGA
Protein sequenceShow/hide protein sequence
MKTHEEHQRIILQTLRDKQLYAKFSKCEFWLKQVVFLGHVVSADGVSVRIRIASAHVSYTPSVSVDPQKVEVVVNWETPASATEVRSFLGLAGYYGRFVEDFSRLALLLT
ALTRNNAKFEWSDKWKEYVIYCYASRQGLGCVLMQEGKVIAYASRQLKKHECDYATHDLELTTIVLALKIRRHYLFSEKCPIFTDHKCLKYIFDKKELNLRQRRWKSRLP
KSALCGTRATLLSELRGFRAVVTVESSGSLLAQFQVRSSLVAEIVRRQPKDRSTKMYKTLKKTYWWPGMKREIAEYVDKCLICQQVKPVRKSAGGLHNPLPVLEWKLTKT
ERFIPIKVTSTLDQLTKLYVDKIVSQCGVPVSIVSDRDPRFTSKFWPNKRQRDLEFQVGDQVFLKLSPWKGVIRFGRKEENREHNFRVNLKGLWFSLVFQPFWGVRLELE
SQLGMGIETGKSGKSVECDLILEFRIHEHRKTRLFKVLMLRFGLFDRPRRNRPSLHTKELAKAAPEAIVVPIVREFYANMTKGSTTSFVRGKMILFDSASINQFFGLPNI
YRDGYNDYAKEKAALLFAIATGHSVDSCWGRVGPREEISHPAAVIDGNFISIRLREPRPRIARPPPQPQQEEPEEELHAQGEQPH