; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0038645 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0038645
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr2:22208111..22209638
RNA-Seq ExpressionLag0038645
SyntenyLag0038645
Gene Ontology termsNA
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_023878301.1 uncharacterized protein LOC111990748 [Quercus suber]3.7e-11647.94Show/hide
Query:  GDRNTKWFHIKTNERRKTNKIRGLIDDSGNLAVEDEDMERVVNSYFGAIFQTSHPNMEDIEKILETIPRTITEQQNDRLLATFSRTEIQEVVKNMNPTKA
        GDRNTK+FH + +ERRK N I G+ D+ G     +E + +   SYF  I+ +SHP+   IE++ E IP  +TE+ N+ L+  F++ E+   +K ++P KA
Subjt:  GDRNTKWFHIKTNERRKTNKIRGLIDDSGNLAVEDEDMERVVNSYFGAIFQTSHPNMEDIEKILETIPRTITEQQNDRLLATFSRTEIQEVVKNMNPTKA

Query:  PWPDGMQAIFYQKYWDIVGEDVCNTCMQILNDGASLRAINKTYIVLIPKVKDPEVMKDFRPISLYIVIYKIVAKVLANRLKSVLDQIISPSQSAFVPGRL
        P PDGM A+F+QKYW IVG +V +  + +LN    +  +NKT I LIPK  +P+ M DFRPISL  V+YK+++K+LANRLK +L  IIS +QSAF   RL
Subjt:  PWPDGMQAIFYQKYWDIVGEDVCNTCMQILNDGASLRAINKTYIVLIPKVKDPEVMKDFRPISLYIVIYKIVAKVLANRLKSVLDQIISPSQSAFVPGRL

Query:  ITDNVILGFECIHSIKSKRVGKFGVVGLKLDMSKTYDRVEWIYLRKIMNKMGFSPGWIELIMGCVESVGFQVLLNGTPRVEFKPRRGLRQGDPLSPYLFL
        ITDNV++ FE +H +  K  GK G + +KLDMSK +DRVEW ++ K+M +MGF   W +L+M C+ SV + +L+NG       P RGLRQGDPLSP LFL
Subjt:  ITDNVILGFECIHSIKSKRVGKFGVVGLKLDMSKTYDRVEWIYLRKIMNKMGFSPGWIELIMGCVESVGFQVLLNGTPRVEFKPRRGLRQGDPLSPYLFL

Query:  ICAEGLSSLLNQAESQRELTGLRINKYSPSISHLFYADDSLLFFKATLSDCRTIKSILNTYERASGQTINYEKSLFMVSPHTCSNIKHEISNTLKVKHKE
        +CAEGLS+L+NQA   + +TG+ IN+  P ++HLF+ADDS+LF KA   +C  ++SIL  YE ASGQ IN +KS    SP+T    + EI N L      
Subjt:  ICAEGLSSLLNQAESQRELTGLRINKYSPSISHLFYADDSLLFFKATLSDCRTIKSILNTYERASGQTINYEKSLFMVSPHTCSNIKHEISNTLKVKHKE

Query:  SLGQYLGLPSQNARNKREVFSSIKERVWKALQGFGG
           +YLGLPS   R+K +VF+ +KE+V   L G+ G
Subjt:  SLGQYLGLPSQNARNKREVFSSIKERVWKALQGFGG

XP_023881891.1 uncharacterized protein LOC111994244 [Quercus suber]3.0e-11848.97Show/hide
Query:  MGDRNTKWFHIKTNERRKTNKIRGLIDDSGNLAVEDEDMERVVNSYFGAIFQTSHPNMEDIEKILETIPRTITEQQNDRLLATFSRTEIQEVVKNMNPTK
        +GDRNTK+FH K ++RR+ N I G++D++GN     E + +V  SYF  I+ +S P    I ++L+ IP T+TE+ N  L+  F+R EI+  +  M+PTK
Subjt:  MGDRNTKWFHIKTNERRKTNKIRGLIDDSGNLAVEDEDMERVVNSYFGAIFQTSHPNMEDIEKILETIPRTITEQQNDRLLATFSRTEIQEVVKNMNPTK

Query:  APWPDGMQAIFYQKYWDIVGEDVCNTCMQILNDGASLRAINKTYIVLIPKVKDPEVMKDFRPISLYIVIYKIVAKVLANRLKSVLDQIISPSQSAFVPGR
        AP PDGM AIF+QKYW+IVG D+    + +LN   S+  INKT I L+PK+K+P  M DFRPISL  V+YK+++KVLANRLK++L QIIS +QSAF+ GR
Subjt:  APWPDGMQAIFYQKYWDIVGEDVCNTCMQILNDGASLRAINKTYIVLIPKVKDPEVMKDFRPISLYIVIYKIVAKVLANRLKSVLDQIISPSQSAFVPGR

Query:  LITDNVILGFECIHSIKSKRVGKFGVVGLKLDMSKTYDRVEWIYLRKIMNKMGFSPGWIELIMGCVESVGFQVLLNGTPRVEFKPRRGLRQGDPLSPYLF
        LITDNV++ FE +H ++ K+ GK G   +KLDMSK YDRVEW +++++M KMGF   WI+L+M C+ SV + +L+NG       P RGLRQGDP+SPY+F
Subjt:  LITDNVILGFECIHSIKSKRVGKFGVVGLKLDMSKTYDRVEWIYLRKIMNKMGFSPGWIELIMGCVESVGFQVLLNGTPRVEFKPRRGLRQGDPLSPYLF

Query:  LICAEGLSSLLNQAESQRELTGLRINKYSPSISHLFYADDSLLFFKATLSDCRTIKSILNTYERASGQTINYEKSLFMVSPHTCSNIKHEISNTLKVKHK
        L+CA+G SSLLN    +  ++G+ I +  P I+HLF+ADDSLLF KA   +C+T+  IL  YE ASGQ IN +KS    S +T    + E+   L     
Subjt:  LICAEGLSSLLNQAESQRELTGLRINKYSPSISHLFYADDSLLFFKATLSDCRTIKSILNTYERASGQTINYEKSLFMVSPHTCSNIKHEISNTLKVKHK

Query:  ESLGQYLGLPSQNARNKREVFSSIKERVWKALQGF
            +YLGLPS   ++K E+F+ +KERV + L G+
Subjt:  ESLGQYLGLPSQNARNKREVFSSIKERVWKALQGF

XP_023908235.1 uncharacterized protein LOC112019924 [Quercus suber]1.1e-11549.54Show/hide
Query:  GDRNTKWFHIKTNERRKTNKIRGLIDDSGNLAVEDEDMERVVNSYFGAIFQTSHPNMEDIEKILETIPRTITEQQNDRLLATFSRTEIQEVVKNMNPTKA
        GD+NTK+FH K ++RR+ N I+G+    GN   E E++  V   YF  +F     N + +E+ L  +PR +T + +D L + F+  E++  +  M PTKA
Subjt:  GDRNTKWFHIKTNERRKTNKIRGLIDDSGNLAVEDEDMERVVNSYFGAIFQTSHPNMEDIEKILETIPRTITEQQNDRLLATFSRTEIQEVVKNMNPTKA

Query:  PWPDGMQAIFYQKYWDIVGEDVCNTCMQILNDGASLRAINKTYIVLIPKVKDPEVMKDFRPISLYIVIYKIVAKVLANRLKSVLDQIISPSQSAFVPGRL
        P PDGM A+FYQK+W +VG+ V +  +  LN+G     IN TYIVLIPKVK+PE M DFRPISL  VIYKI++KVL NRLK VL  IISP+QSAFVPGRL
Subjt:  PWPDGMQAIFYQKYWDIVGEDVCNTCMQILNDGASLRAINKTYIVLIPKVKDPEVMKDFRPISLYIVIYKIVAKVLANRLKSVLDQIISPSQSAFVPGRL

Query:  ITDNVILGFECIHSIKSKRVGKFGVVGLKLDMSKTYDRVEWIYLRKIMNKMGFSPGWIELIMGCVESVGFQVLLNGTPRVEFKPRRGLRQGDPLSPYLFL
        ITDNV+L +E +H++ S++ GK G + LKLD+SK YDRVEW +L+ IM ++GF   WIE +M CV +  F VL+NG P    +P RG+RQGDPLSPYLFL
Subjt:  ITDNVILGFECIHSIKSKRVGKFGVVGLKLDMSKTYDRVEWIYLRKIMNKMGFSPGWIELIMGCVESVGFQVLLNGTPRVEFKPRRGLRQGDPLSPYLFL

Query:  ICAEGLSSLLNQAESQRELTGLRINKYSPSISHLFYADDSLLFFKATLSDCRTIKSILNTYERASGQTINYEKSLFMVSPHTCSNIKHEISNTLKVKHKE
        +C EG +SLL++A +   L G+ I + +P I++L +ADDSL+F + +  +  TI  IL  Y +ASGQ+IN EKS    S +     K E    L VK   
Subjt:  ICAEGLSSLLNQAESQRELTGLRINKYSPSISHLFYADDSLLFFKATLSDCRTIKSILNTYERASGQTINYEKSLFMVSPHTCSNIKHEISNTLKVKHKE

Query:  SLGQYLGLPSQNARNKREVFSSIKERVWKALQGFGG
            YLGLP+   R+K   FS IK+RVWK LQG+ G
Subjt:  SLGQYLGLPSQNARNKREVFSSIKERVWKALQGFGG

XP_030923330.1 uncharacterized protein LOC115950239 [Quercus lobata]4.0e-11851.15Show/hide
Query:  GDRNTKWFHIKTNERRKTNKIRGLIDDSGNLAVEDEDMERVVNSYFGAIFQTSHPNMEDIEKILETIPRTITEQQNDRLLATFSRTEIQEVVKNMNPTKA
        GDRNTK+FH K ++RR+ N IRG+ +  G      E++ +V   YF  +FQ      + +E+ L+ +   +TE   + L   F+  E+Q  +  M PTKA
Subjt:  GDRNTKWFHIKTNERRKTNKIRGLIDDSGNLAVEDEDMERVVNSYFGAIFQTSHPNMEDIEKILETIPRTITEQQNDRLLATFSRTEIQEVVKNMNPTKA

Query:  PWPDGMQAIFYQKYWDIVGEDVCNTCMQILNDGASLRAINKTYIVLIPKVKDPEVMKDFRPISLYIVIYKIVAKVLANRLKSVLDQIISPSQSAFVPGRL
        P PDGM A+FYQK+W IVG+ V +  +  LN+G  L  IN T IVLIPKV++PE M +FRPISL  VIYKI++KVLANRLK VL QIIS +QSAFVPGRL
Subjt:  PWPDGMQAIFYQKYWDIVGEDVCNTCMQILNDGASLRAINKTYIVLIPKVKDPEVMKDFRPISLYIVIYKIVAKVLANRLKSVLDQIISPSQSAFVPGRL

Query:  ITDNVILGFECIHSIKSKRVGKFGVVGLKLDMSKTYDRVEWIYLRKIMNKMGFSPGWIELIMGCVESVGFQVLLNGTPRVEFKPRRGLRQGDPLSPYLFL
        ITDNV++ +E +H++ +++ GK G V LKLD+SK YDRVEW +L+ IM KMGF  GWIE +M CV +  F +L+NG P    +P RG+RQGDP+SPYLFL
Subjt:  ITDNVILGFECIHSIKSKRVGKFGVVGLKLDMSKTYDRVEWIYLRKIMNKMGFSPGWIELIMGCVESVGFQVLLNGTPRVEFKPRRGLRQGDPLSPYLFL

Query:  ICAEGLSSLLNQAESQRELTGLRINKYSPSISHLFYADDSLLFFKATLSDCRTIKSILNTYERASGQTINYEKSLFMVSPHTCSNIKHEISNTLKVKHKE
        +CAEGL++LLN+AE    +TG+ I + +P I++L +ADDSLLF +AT ++  TI  IL  YERASGQ+IN EKS    S +T    K +I   L VK  +
Subjt:  ICAEGLSSLLNQAESQRELTGLRINKYSPSISHLFYADDSLLFFKATLSDCRTIKSILNTYERASGQTINYEKSLFMVSPHTCSNIKHEISNTLKVKHKE

Query:  SLGQYLGLPSQNARNKREVFSSIKERVWKALQGFGG
           +YLGLP+   R K   FS +K+RVWK LQG+ G
Subjt:  SLGQYLGLPSQNARNKREVFSSIKERVWKALQGFGG

XP_030941688.1 uncharacterized protein LOC115966628 [Quercus lobata]4.1e-11550.33Show/hide
Query:  GDRNTKWFHIKTNERRKTNKIRGLIDDSGNLAVEDEDMERVVNSYFGAIFQTSHPNMEDIEKILETIPRTITEQQNDRLLATFSRTEIQEVVKNMNPTKA
        GD+N+K+FH K ++RR+ N I+G+ D   N   E ED+  V  +YFG +F  S    + I + L  +P  +T+     L   F+  EI+E +  M PTKA
Subjt:  GDRNTKWFHIKTNERRKTNKIRGLIDDSGNLAVEDEDMERVVNSYFGAIFQTSHPNMEDIEKILETIPRTITEQQNDRLLATFSRTEIQEVVKNMNPTKA

Query:  PWPDGMQAIFYQKYWDIVGEDVCNTCMQILNDGASLRAINKTYIVLIPKVKDPEVMKDFRPISLYIVIYKIVAKVLANRLKSVLDQIISPSQSAFVPGRL
        P PDGM A+FYQK+W IVG++V +  +   N G     IN T IVLIPKV  PE M DFRPISL  VIYKI++KVLANRLK VL  IIS +QSAFVPG L
Subjt:  PWPDGMQAIFYQKYWDIVGEDVCNTCMQILNDGASLRAINKTYIVLIPKVKDPEVMKDFRPISLYIVIYKIVAKVLANRLKSVLDQIISPSQSAFVPGRL

Query:  ITDNVILGFECIHSIKSKRVGKFGVVGLKLDMSKTYDRVEWIYLRKIMNKMGFSPGWIELIMGCVESVGFQVLLNGTPRVEFKPRRGLRQGDPLSPYLFL
        ITDNV++  + +HS++ +R GK G++ LKLD+SK YDRVEW +L+ IM K+GF   WI  +M CV +  F VL+NG P     P RGLRQGDPLSPYLFL
Subjt:  ITDNVILGFECIHSIKSKRVGKFGVVGLKLDMSKTYDRVEWIYLRKIMNKMGFSPGWIELIMGCVESVGFQVLLNGTPRVEFKPRRGLRQGDPLSPYLFL

Query:  ICAEGLSSLLNQAESQRELTGLRINKYSPSISHLFYADDSLLFFKATLSDCRTIKSILNTYERASGQTINYEKSLFMVSPHTCSNIKHEISNTLKVKHKE
        +CAEG SSLL QAE +  L G+ I K +P ISHL +ADDSLLF +AT  +   +  IL TY  ASGQ IN EKS  + S +T    K     TL VK  E
Subjt:  ICAEGLSSLLNQAESQRELTGLRINKYSPSISHLFYADDSLLFFKATLSDCRTIKSILNTYERASGQTINYEKSLFMVSPHTCSNIKHEISNTLKVKHKE

Query:  SLGQYLGLPSQNARNKREVFSSIKERVWKALQGFGGGRQGLETKFIRKAGE
            YLGLP+   R K + F+ IK+RVWK LQG+ G       K + +AG+
Subjt:  SLGQYLGLPSQNARNKREVFSSIKERVWKALQGFGGGRQGLETKFIRKAGE

TrEMBL top hitse value%identityAlignment
A0A2N9E7R2 Reverse transcriptase domain-containing protein1.3e-12250.23Show/hide
Query:  GDRNTKWFHIKTNERRKTNKIRGLIDDSGNLAVEDEDMERVVNSYFGAIFQTSHPNMEDIEKILETIPRTITEQQNDRLLATFSRTEIQEVVKNMNPTKA
        GDRNT++FH + ++RR+ N+I G+ +++G    + E++  ++  Y+  IFQTSHP++  IE+ +  +P+ +T   N+ L   F+  E++E +K M P KA
Subjt:  GDRNTKWFHIKTNERRKTNKIRGLIDDSGNLAVEDEDMERVVNSYFGAIFQTSHPNMEDIEKILETIPRTITEQQNDRLLATFSRTEIQEVVKNMNPTKA

Query:  PWPDGMQAIFYQKYWDIVGEDVCNTCMQILNDGASLRAINKTYIVLIPKVKDPEVMKDFRPISLYIVIYKIVAKVLANRLKSVLDQIISPSQSAFVPGRL
        P PDG+  +FYQ++W +VG+DV    +  LN G  L +IN T+I LIPKVK+PE + +FRPISL  VIYK+V+KV+ANRLK +L  IIS SQSAFVPGRL
Subjt:  PWPDGMQAIFYQKYWDIVGEDVCNTCMQILNDGASLRAINKTYIVLIPKVKDPEVMKDFRPISLYIVIYKIVAKVLANRLKSVLDQIISPSQSAFVPGRL

Query:  ITDNVILGFECIHSIKSKRVGKFGVVGLKLDMSKTYDRVEWIYLRKIMNKMGFSPGWIELIMGCVESVGFQVLLNGTPRVEFKPRRGLRQGDPLSPYLFL
        ITDNV++ FE +H + + ++G+ G + LKLDMSK YDRVEW YL  IM KMGF P W+ +IM C+ +V + +L+NG P    KP RGLRQGDPLSPYLFL
Subjt:  ITDNVILGFECIHSIKSKRVGKFGVVGLKLDMSKTYDRVEWIYLRKIMNKMGFSPGWIELIMGCVESVGFQVLLNGTPRVEFKPRRGLRQGDPLSPYLFL

Query:  ICAEGLSSLLNQAESQRELTGLRINKYSPSISHLFYADDSLLFFKATLSDCRTIKSILNTYERASGQTINYEKSLFMVSPHTCSNIKHEISNTLKVKHKE
        +CAEGL SL++QA  Q E+ G+ + +  P I+HLF+ADDSLLF KAT  DC  ++ IL  YERASGQ IN +K+    S  T S  +  I NTL V   +
Subjt:  ICAEGLSSLLNQAESQRELTGLRINKYSPSISHLFYADDSLLFFKATLSDCRTIKSILNTYERASGQTINYEKSLFMVSPHTCSNIKHEISNTLKVKHKE

Query:  SLGQYLGLPSQNARNKREVFSSIKERVWKALQGF
           +YLGLPS   +N+   F+ +KERVW  L G+
Subjt:  SLGQYLGLPSQNARNKREVFSSIKERVWKALQGF

A0A2N9FMJ0 Reverse transcriptase domain-containing protein4.1e-12150.69Show/hide
Query:  GDRNTKWFHIKTNERRKTNKIRGLIDDSGNLAVEDEDMERVVNSYFGAIFQTSHPNMEDIEKILETIPRTITEQQNDRLLATFSRTEIQEVVKNMNPTKA
        GD+NT++FH + ++RR+ N+I GL D++G      E++  +   Y+ +IFQTS P  + I+  +  +P  I++  ND L+  FS  E+ + +K M P  A
Subjt:  GDRNTKWFHIKTNERRKTNKIRGLIDDSGNLAVEDEDMERVVNSYFGAIFQTSHPNMEDIEKILETIPRTITEQQNDRLLATFSRTEIQEVVKNMNPTKA

Query:  PWPDGMQAIFYQKYWDIVGEDVCNTCMQILNDGASLRAINKTYIVLIPKVKDPEVMKDFRPISLYIVIYKIVAKVLANRLKSVLDQIISPSQSAFVPGRL
        P PDG+  +FYQKYW +VG +V +  +  LN G  L +IN TYI LIPKVK+PE + +FRPISL  VIYK+V+KV+ANRLK +L +IIS SQSAFVPGRL
Subjt:  PWPDGMQAIFYQKYWDIVGEDVCNTCMQILNDGASLRAINKTYIVLIPKVKDPEVMKDFRPISLYIVIYKIVAKVLANRLKSVLDQIISPSQSAFVPGRL

Query:  ITDNVILGFECIHSIKSKRVGKFGVVGLKLDMSKTYDRVEWIYLRKIMNKMGFSPGWIELIMGCVESVGFQVLLNGTPRVEFKPRRGLRQGDPLSPYLFL
        ITDN+++ FE +H + S R+G+ G + LKLDMSK YDRVEW++L KIM KMGF   +I LIM C++SV + +L+NG P    KP RGLRQGDPLSPYLFL
Subjt:  ITDNVILGFECIHSIKSKRVGKFGVVGLKLDMSKTYDRVEWIYLRKIMNKMGFSPGWIELIMGCVESVGFQVLLNGTPRVEFKPRRGLRQGDPLSPYLFL

Query:  ICAEGLSSLLNQAESQRELTGLRINKYSPSISHLFYADDSLLFFKATLSDCRTIKSILNTYERASGQTINYEKSLFMVSPHTCSNIKHEISNTLKVKHKE
        + AEGL+SL++QA S+  + G+ + +  P I+HLF+ADDSLLF +ATL DC  I+ IL+TYE+ASGQ +N +K+    S  T  +I+  I  TL V    
Subjt:  ICAEGLSSLLNQAESQRELTGLRINKYSPSISHLFYADDSLLFFKATLSDCRTIKSILNTYERASGQTINYEKSLFMVSPHTCSNIKHEISNTLKVKHKE

Query:  SLGQYLGLPSQNARNKREVFSSIKERVWKALQGF
           +YLGLPS   RN+   FS IKE+VW  L+G+
Subjt:  SLGQYLGLPSQNARNKREVFSSIKERVWKALQGF

A0A2N9FNH6 Reverse transcriptase domain-containing protein3.6e-12551.15Show/hide
Query:  GDRNTKWFHIKTNERRKTNKIRGLIDDSGNLAVEDEDMERVVNSYFGAIFQTSHPNMEDIEKILETIPRTITEQQNDRLLATFSRTEIQEVVKNMNPTKA
        GDRNT++FH   ++R+K N I GL D +G+LA     M  +V +YF  IF+TS+P+   I +++ ++ +T+T++ ND LLA F+  EI+  +  M+PTKA
Subjt:  GDRNTKWFHIKTNERRKTNKIRGLIDDSGNLAVEDEDMERVVNSYFGAIFQTSHPNMEDIEKILETIPRTITEQQNDRLLATFSRTEIQEVVKNMNPTKA

Query:  PWPDGMQAIFYQKYWDIVGEDVCNTCMQILNDGASLRAINKTYIVLIPKVKDPEVMKDFRPISLYIVIYKIVAKVLANRLKSVLDQIISPSQSAFVPGRL
        P PDGM A+FYQK+W IVG+DV N  ++ L+ G  L+++N T+I LIPK+  PE+M  FRPISL  V+YKI++KVLANRLK+VLD IIS +QSAFVPGRL
Subjt:  PWPDGMQAIFYQKYWDIVGEDVCNTCMQILNDGASLRAINKTYIVLIPKVKDPEVMKDFRPISLYIVIYKIVAKVLANRLKSVLDQIISPSQSAFVPGRL

Query:  ITDNVILGFECIHSIKSKRVGKFGVVGLKLDMSKTYDRVEWIYLRKIMNKMGFSPGWIELIMGCVESVGFQVLLNGTPRVEFKPRRGLRQGDPLSPYLFL
        ITDN+++ FE +H +K+KR G+   + +KLDMSK YDRVEW +L  +M K+GF   W+ LIM C+ SV + V+LNG P    KP RG+RQGDPLSPYLFL
Subjt:  ITDNVILGFECIHSIKSKRVGKFGVVGLKLDMSKTYDRVEWIYLRKIMNKMGFSPGWIELIMGCVESVGFQVLLNGTPRVEFKPRRGLRQGDPLSPYLFL

Query:  ICAEGLSSLLNQAESQRELTGLRINKYSPSISHLFYADDSLLFFKATLSDCRTIKSILNTYERASGQTINYEKSLFMVSPHTCSNIKHEISNTLKVKHKE
        ICAEGL++LL QAE    + GL I +  P ISHLF+ADDSLLF +A + +C+ + +IL+TYE+ASGQ +N+EK+    S +T  +++H I   L+     
Subjt:  ICAEGLSSLLNQAESQRELTGLRINKYSPSISHLFYADDSLLFFKATLSDCRTIKSILNTYERASGQTINYEKSLFMVSPHTCSNIKHEISNTLKVKHKE

Query:  SLGQYLGLPSQNARNKREVFSSIKERVWKALQGFGG
         LG+YLGLP    R K++ F  IK+++ K L G+ G
Subjt:  SLGQYLGLPSQNARNKREVFSSIKERVWKALQGFGG

A0A2N9ITS3 Reverse transcriptase domain-containing protein2.9e-12250Show/hide
Query:  GDRNTKWFHIKTNERRKTNKIRGLIDDSGNLAVEDEDMERVVNSYFGAIFQTSHPNMEDIEKILETIPRTITEQQNDRLLATFSRTEIQEVVKNMNPTKA
        GDRNT++FH + ++RR+ N+I G+ +++G    + E++  ++  Y+ +IFQTSHP++  IE+ +  +P+ +T   N+ L   F+  E++E +K M P KA
Subjt:  GDRNTKWFHIKTNERRKTNKIRGLIDDSGNLAVEDEDMERVVNSYFGAIFQTSHPNMEDIEKILETIPRTITEQQNDRLLATFSRTEIQEVVKNMNPTKA

Query:  PWPDGMQAIFYQKYWDIVGEDVCNTCMQILNDGASLRAINKTYIVLIPKVKDPEVMKDFRPISLYIVIYKIVAKVLANRLKSVLDQIISPSQSAFVPGRL
        P PDG+  +FYQ++W +VG+DV    +  LN G  L +IN T+I LIPKVK+PE + +FRPISL  VIYK+V+KV+ANRLK +L  IIS SQSAFVPGRL
Subjt:  PWPDGMQAIFYQKYWDIVGEDVCNTCMQILNDGASLRAINKTYIVLIPKVKDPEVMKDFRPISLYIVIYKIVAKVLANRLKSVLDQIISPSQSAFVPGRL

Query:  ITDNVILGFECIHSIKSKRVGKFGVVGLKLDMSKTYDRVEWIYLRKIMNKMGFSPGWIELIMGCVESVGFQVLLNGTPRVEFKPRRGLRQGDPLSPYLFL
        ITDNV++ FE +H + + ++G+ G + LKLDMSK YDRVEW YL  IM KMGF P W+ +IM C+ +V + +L+NG P    KP RGLRQGDPLSPYLFL
Subjt:  ITDNVILGFECIHSIKSKRVGKFGVVGLKLDMSKTYDRVEWIYLRKIMNKMGFSPGWIELIMGCVESVGFQVLLNGTPRVEFKPRRGLRQGDPLSPYLFL

Query:  ICAEGLSSLLNQAESQRELTGLRINKYSPSISHLFYADDSLLFFKATLSDCRTIKSILNTYERASGQTINYEKSLFMVSPHTCSNIKHEISNTLKVKHKE
        +CAEGL SL++QA  Q E+ G+ + +  P I+HLF+ADDSLLF KAT  DC  ++ IL  YERASGQ IN +K+    S  T +  +  I NTL V   +
Subjt:  ICAEGLSSLLNQAESQRELTGLRINKYSPSISHLFYADDSLLFFKATLSDCRTIKSILNTYERASGQTINYEKSLFMVSPHTCSNIKHEISNTLKVKHKE

Query:  SLGQYLGLPSQNARNKREVFSSIKERVWKALQGF
           +YLGLPS   +N+   F+ +KERVW  L G+
Subjt:  SLGQYLGLPSQNARNKREVFSSIKERVWKALQGF

A0A2N9J3U0 Reverse transcriptase domain-containing protein3.6e-12551.15Show/hide
Query:  GDRNTKWFHIKTNERRKTNKIRGLIDDSGNLAVEDEDMERVVNSYFGAIFQTSHPNMEDIEKILETIPRTITEQQNDRLLATFSRTEIQEVVKNMNPTKA
        GDRNT++FH   ++R+K N I GL D +G+LA     M  +V +YF  IF+TS+P+   I +++ ++ +T+T++ ND LLA F+  EI+  +  M+PTKA
Subjt:  GDRNTKWFHIKTNERRKTNKIRGLIDDSGNLAVEDEDMERVVNSYFGAIFQTSHPNMEDIEKILETIPRTITEQQNDRLLATFSRTEIQEVVKNMNPTKA

Query:  PWPDGMQAIFYQKYWDIVGEDVCNTCMQILNDGASLRAINKTYIVLIPKVKDPEVMKDFRPISLYIVIYKIVAKVLANRLKSVLDQIISPSQSAFVPGRL
        P PDGM A+FYQK+W IVG+DV N  ++ L+ G  L+++N T+I LIPK+  PE+M  FRPISL  V+YKI++KVLANRLK+VLD IIS +QSAFVPGRL
Subjt:  PWPDGMQAIFYQKYWDIVGEDVCNTCMQILNDGASLRAINKTYIVLIPKVKDPEVMKDFRPISLYIVIYKIVAKVLANRLKSVLDQIISPSQSAFVPGRL

Query:  ITDNVILGFECIHSIKSKRVGKFGVVGLKLDMSKTYDRVEWIYLRKIMNKMGFSPGWIELIMGCVESVGFQVLLNGTPRVEFKPRRGLRQGDPLSPYLFL
        ITDN+++ FE +H +K+KR G+   + +KLDMSK YDRVEW +L  +M K+GF   W+ LIM C+ SV + V+LNG P    KP RG+RQGDPLSPYLFL
Subjt:  ITDNVILGFECIHSIKSKRVGKFGVVGLKLDMSKTYDRVEWIYLRKIMNKMGFSPGWIELIMGCVESVGFQVLLNGTPRVEFKPRRGLRQGDPLSPYLFL

Query:  ICAEGLSSLLNQAESQRELTGLRINKYSPSISHLFYADDSLLFFKATLSDCRTIKSILNTYERASGQTINYEKSLFMVSPHTCSNIKHEISNTLKVKHKE
        ICAEGL++LL QAE    + GL I +  P ISHLF+ADDSLLF +A + +C+ + +IL+TYE+ASGQ +N+EK+    S +T  +++H I   L+     
Subjt:  ICAEGLSSLLNQAESQRELTGLRINKYSPSISHLFYADDSLLFFKATLSDCRTIKSILNTYERASGQTINYEKSLFMVSPHTCSNIKHEISNTLKVKHKE

Query:  SLGQYLGLPSQNARNKREVFSSIKERVWKALQGFGG
         LG+YLGLP    R K++ F  IK+++ K L G+ G
Subjt:  SLGQYLGLPSQNARNKREVFSSIKERVWKALQGFGG

SwissProt top hitse value%identityAlignment
O00370 LINE-1 retrotransposable element ORF2 protein5.9e-4026.06Show/hide
Query:  ERRKTNKIRGLIDDSGNLAVEDEDMERVVNSYFGAIFQTSHPNMEDIEKILE--TIPRTITEQQNDRLLATFSRTEIQEVVKNMNPTKAPWPDGMQAIFY
        ++R+ N+I  + +D G++  +  +++  +  Y+  ++     N+E+++  L+  T+PR + +++ + L    + +EI  ++ ++   K+P PDG  A FY
Subjt:  ERRKTNKIRGLIDDSGNLAVEDEDMERVVNSYFGAIFQTSHPNMEDIEKILE--TIPRTITEQQNDRLLATFSRTEIQEVVKNMNPTKAPWPDGMQAIFY

Query:  QKYWDIVGEDVCNTCMQILNDGASLRAINKTYIVLIPKV-KDPEVMKDFRPISLYIVIYKIVAKVLANRLKSVLDQIISPSQSAFVPGRLITDNVILGFE
        Q+Y + +   +      I  +G    +  +  I+LIPK  +D    ++FRPISL  +  KI+ K+LANR++  + ++I   Q  F+PG     N+     
Subjt:  QKYWDIVGEDVCNTCMQILNDGASLRAINKTYIVLIPKV-KDPEVMKDFRPISLYIVIYKIVAKVLANRLKSVLDQIISPSQSAFVPGRLITDNVILGFE

Query:  CIHSIKSKRVGKFGVVGLKLDMSKTYDRVEWIYLRKIMNKMGFSPGWIELIMGCVESVGFQVLLNGTPRVEFKPRRGLRQGDPLSPYLFLICAEGLSSLL
         I  I   R      V + +D  K +D+++  ++ K +NK+G    ++++I    +     ++LNG     F  + G RQG PLSP LF I  E L+  +
Subjt:  CIHSIKSKRVGKFGVVGLKLDMSKTYDRVEWIYLRKIMNKMGFSPGWIELIMGCVESVGFQVLLNGTPRVEFKPRRGLRQGDPLSPYLFLICAEGLSSLL

Query:  NQAESQRELTGLRINKYSPSISHLFYADDSLLFFKATLSDCRTIKSILNTYERASGQTINYEKS---LFMVSPHTCSNIKHEISNTLKVKHKESLGQYLG
         Q   ++E+ G+++ K    +S   +ADD +++ +  +   + +  +++ + + SG  IN +KS   L+  +  T S I  E+  T+  K  + LG  L 
Subjt:  NQAESQRELTGLRINKYSPSISHLFYADDSLLFFKATLSDCRTIKSILNTYERASGQTINYEKS---LFMVSPHTCSNIKHEISNTLKVKHKESLGQYLG

Query:  LPSQN--ARNKREVFSSIKE--RVWK
           ++    N + +   IKE    WK
Subjt:  LPSQN--ARNKREVFSSIKE--RVWK

P08548 LINE-1 reverse transcriptase homolog2.1e-3726.8Show/hide
Query:  TNERRKTNKIRGLIDDSGNLAVEDEDMERVVNSYFGAIFQTSHPNMEDIEKILET--IPRTITEQQNDRLLATFSRTEIQEVVKNMNPTKAPWPDGMQAI
        T ++R  + I  + + +  +  +  ++++++N Y+  ++   + N+++I++ LE   +PR +++++ + L    S +EI   ++N+   K+P PDG  + 
Subjt:  TNERRKTNKIRGLIDDSGNLAVEDEDMERVVNSYFGAIFQTSHPNMEDIEKILET--IPRTITEQQNDRLLATFSRTEIQEVVKNMNPTKAPWPDGMQAI

Query:  FYQKYWDIVGEDVCNTCMQILNDGASLRAINKTYIVLIPKV-KDPEVMKDFRPISLYIVIYKIVAKVLANRLKSVLDQIISPSQSAFVPGRLITDNVILG
        FYQ + + +   + N    I  +G       +  I LIPK  KDP   +++RPISL  +  KI+ K+L NR++  + +II   Q  F+PG     N+   
Subjt:  FYQKYWDIVGEDVCNTCMQILNDGASLRAINKTYIVLIPKV-KDPEVMKDFRPISLYIVIYKIVAKVLANRLKSVLDQIISPSQSAFVPGRLITDNVILG

Query:  FECIHSIKSKRVGKFGVVGLKLDMSKTYDRVEWIYLRKIMNKMGFSPGWIELIMGCVESVGFQVLLNGTPRVEFKPRRGLRQGDPLSPYLFLICAEGLSS
           I  I +K   K  ++ L +D  K +D ++  ++ + + K+G    +++LI          ++LNG     F  R G RQG PLSP LF I  E L+ 
Subjt:  FECIHSIKSKRVGKFGVVGLKLDMSKTYDRVEWIYLRKIMNKMGFSPGWIELIMGCVESVGFQVLLNGTPRVEFKPRRGLRQGDPLSPYLFLICAEGLSS

Query:  LLNQAESQRELTGLRINKYSPSISHLFYADDSLLFFKATLSDCRTIKSILNTYERASGQTINYEKSLFMVSPHTCSN-----IKHEISNTLKVKHKESLG
         + +   ++ + G+ I   S  I    +ADD +++ + T      +  ++  Y   SG  IN  KS+  +  +T +N     +K  I  T+  K  + LG
Subjt:  LLNQAESQRELTGLRINKYSPSISHLFYADDSLLFFKATLSDCRTIKSILNTYERASGQTINYEKSLFMVSPHTCSN-----IKHEISNTLKVKHKESLG

Query:  QYL
         YL
Subjt:  QYL

P11369 LINE-1 retrotransposable element ORF2 protein4.5e-4027.83Show/hide
Query:  NKIRGLIDDSGNLAVEDEDMERVVNSYFGAIFQTSHPNMEDIEKILE--TIPRTITEQQNDRLLATFSRTEIQEVVKNMNPTKAPWPDGMQAIFYQKYWD
        NKIR   ++ G++  + E+++  + S++  ++ T   N+++++K L+   +P+ + + Q D L +  S  EI+ V+ ++   K+P PDG  A FYQ + +
Subjt:  NKIRGLIDDSGNLAVEDEDMERVVNSYFGAIFQTSHPNMEDIEKILE--TIPRTITEQQNDRLLATFSRTEIQEVVKNMNPTKAPWPDGMQAIFYQKYWD

Query:  IVGEDVCNTCMQILNDGASLRAINKTYIVLIPK-VKDPEVMKDFRPISLYIVIYKIVAKVLANRLKSVLDQIISPSQSAFVPGRLITDNVILGFECIHSI
         +   +     +I  +G    +  +  I LIPK  KDP  +++FRPISL  +  KI+ K+LANR++  +  II P Q  F+PG     N+      IH I
Subjt:  IVGEDVCNTCMQILNDGASLRAINKTYIVLIPK-VKDPEVMKDFRPISLYIVIYKIVAKVLANRLKSVLDQIISPSQSAFVPGRLITDNVILGFECIHSI

Query:  KSKRVGKFGVVGLKLDMSKTYDRVEWIYLRKIMNKMGFSPGWIELIMGCVESVGFQVLLNGTPRVEFKPRRGLRQGDPLSPYLFLICAEGLSSLLNQAES
           +     ++   LD  K +D+++  ++ K++ + G    ++ +I          + +NG        + G RQG PLSPYLF I  E L+  + Q   
Subjt:  KSKRVGKFGVVGLKLDMSKTYDRVEWIYLRKIMNKMGFSPGWIELIMGCVESVGFQVLLNGTPRVEFKPRRGLRQGDPLSPYLFLICAEGLSSLLNQAES

Query:  QRELTGLRINKYSPSISHLFYADDSLLFFKATLSDCRTIKSILNTYERASGQTINYEKSLFMVSPHTCSNIKHEISNTLKVKHKESLGQYLGLPSQNA--
        Q+E+ G++I K    IS L  ADD +++     +  R + +++N++    G  IN  KS+  +        + EI  T       +  +YLG+       
Subjt:  QRELTGLRINKYSPSISHLFYADDSLLFFKATLSDCRTIKSILNTYERASGQTINYEKSLFMVSPHTCSNIKHEISNTLKVKHKESLGQYLGLPSQNA--

Query:  ----RNKREVFSSIKE--RVWKAL
            +N + +   IKE  R WK L
Subjt:  ----RNKREVFSSIKE--RVWKAL

P14381 Transposon TX1 uncharacterized 149 kDa protein1.2e-3728.95Show/hide
Query:  DRNTKWFHIKTNERRKTNKIRGLIDDSGNLAVEDEDMERVVNSYFGAIFQTSHPNMEDIEKILETIPRTITEQQNDRLLATFSRTEIQEVVKNMNPTKAP
        DR +++F+    ++    +I  L  + G    + E +     S++  +F     + +  E++ + +P  ++E++ +RL    +  E+ + ++ M   K+P
Subjt:  DRNTKWFHIKTNERRKTNKIRGLIDDSGNLAVEDEDMERVVNSYFGAIFQTSHPNMEDIEKILETIPRTITEQQNDRLLATFSRTEIQEVVKNMNPTKAP

Query:  WPDGMQAIFYQKYWDIVGEDVCNTCMQILNDGASLRAINKTYIVLIPKVKDPEVMKDFRPISLYIVIYKIVAKVLANRLKSVLDQIISPSQSAFVPGRLI
          DG+   F+Q +WD +G D      +    G    +  +  + L+PK  D  ++K++RP+SL    YKIVAK ++ RLKSVL ++I P QS  VPGR I
Subjt:  WPDGMQAIFYQKYWDIVGEDVCNTCMQILNDGASLRAINKTYIVLIPKVKDPEVMKDFRPISLYIVIYKIVAKVLANRLKSVLDQIISPSQSAFVPGRLI

Query:  TDNVILGFECIHSIKSKRVGKFGVVGLKLDMSKTYDRVEWIYLRKIMNKMGFSPGWIELIMGCVESVGFQVLLNGTPRVEFKPRRGLRQGDPLSPYLFLI
         DNV L  + +H   ++R G   +  L LD  K +DRV+  YL   +    F P ++  +     S    V +N +        RG+RQG PLS  L+ +
Subjt:  TDNVILGFECIHSIKSKRVGKFGVVGLKLDMSKTYDRVEWIYLRKIMNKMGFSPGWIELIMGCVESVGFQVLLNGTPRVEFKPRRGLRQGDPLSPYLFLI

Query:  CAEGLSSLLNQAESQRELTGLRINKYSPSISHLFYADDSLLFFKATLSDCRTIKSILNTYERASGQTINYEKS
          E    LL     ++ LTGL + +    +    YADD +L  +  L D    +     Y  AS   IN+ KS
Subjt:  CAEGLSSLLNQAESQRELTGLRINKYSPSISHLFYADDSLLFFKATLSDCRTIKSILNTYERASGQTINYEKS

P92555 Uncharacterized mitochondrial protein AtMg012503.8e-1552.94Show/hide
Query:  LLNGTPRVEFKPRRGLRQGDPLSPYLFLICAEGLSSLLNQAESQRELTGLRINKYSPSISHLFYADDS
        ++NG P+    P RGLRQGDPLSPYLF++C E LS L  +A+ Q  L G+R++  SP I+HL +ADD+
Subjt:  LLNGTPRVEFKPRRGLRQGDPLSPYLFLICAEGLSSLLNQAESQRELTGLRINKYSPSISHLFYADDS

Arabidopsis top hitse value%identityAlignment
AT1G43760.1 DNAse I-like superfamily protein4.1e-1228.81Show/hide
Query:  GDRNTKWFH--IKTNERRKTNKIRGLIDDSGNLAVED-EDMERVVNSYFGAIFQTSHPNM--EDIEKILETIPRTITEQQNDRLLATFSRTEIQEVVKNM
        GD NT++FH  I  N+ +   K   + DD   + VE+   ++ ++ +Y+  +  +    +  + +++I +  P    +    RL A  S  EI   V  M
Subjt:  GDRNTKWFH--IKTNERRKTNKIRGLIDDSGNLAVED-EDMERVVNSYFGAIFQTSHPNM--EDIEKILETIPRTITEQQNDRLLATFSRTEIQEVVKNM

Query:  NPTKAPWPDGMQAIFYQKYWDIVGEDVCNTCMQILNDGASLRAINKTYIVLIPKVKDPEVMKDFRPISLYIVIYKIV
           KAP PD   A F+ + W +V +       +    G  L+  N T I LIPKV   + +  FRP+S   V+YKI+
Subjt:  NPTKAPWPDGMQAIFYQKYWDIVGEDVCNTCMQILNDGASLRAINKTYIVLIPKVKDPEVMKDFRPISLYIVIYKIV

AT4G20520.1 RNA binding;RNA-directed DNA polymerases7.4e-1439.76Show/hide
Query:  LANRLKSVLDQIISPSQSAFVPGRLITDNVILGFECIHSIKSKRVGKFGVVGLKLDMSKTYDRVEWIYLRKIMNKMGFSPGWI
        +  RLK ++  +I P+Q++F+PGR+ TDN++   E +HS++ K+ G  G + LKLD+ K YDR+ W YL   +   GF   W+
Subjt:  LANRLKSVLDQIISPSQSAFVPGRLITDNVILGFECIHSIKSKRVGKFGVVGLKLDMSKTYDRVEWIYLRKIMNKMGFSPGWI

ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase)2.7e-1652.94Show/hide
Query:  LLNGTPRVEFKPRRGLRQGDPLSPYLFLICAEGLSSLLNQAESQRELTGLRINKYSPSISHLFYADDS
        ++NG P+    P RGLRQGDPLSPYLF++C E LS L  +A+ Q  L G+R++  SP I+HL +ADD+
Subjt:  LLNGTPRVEFKPRRGLRQGDPLSPYLFLICAEGLSSLLNQAESQRELTGLRINKYSPSISHLFYADDS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGGGACCGAAATACTAAGTGGTTCCACATAAAAACCAATGAGAGGAGGAAAACTAATAAAATAAGGGGTTTAATTGATGATTCTGGCAATCTGGCGGTGGAAGATGA
GGATATGGAAAGGGTGGTGAATAGTTACTTTGGGGCGATATTCCAAACATCTCATCCCAACATGGAGGACATTGAAAAGATCTTGGAGACGATTCCTAGGACTATAACAG
AGCAGCAAAATGATAGGTTGTTGGCAACATTTTCTCGGACAGAGATTCAGGAGGTGGTAAAGAATATGAACCCAACGAAAGCTCCATGGCCTGACGGTATGCAGGCAATA
TTTTATCAAAAATACTGGGATATTGTGGGTGAGGATGTCTGTAACACATGTATGCAGATCCTAAATGATGGTGCCTCTTTGAGAGCGATTAATAAGACTTACATAGTTTT
AATTCCTAAGGTGAAAGATCCGGAAGTGATGAAGGATTTTCGACCGATTAGTCTATACATTGTGATATATAAGATAGTAGCTAAAGTGCTAGCAAATCGGTTGAAATCAG
TCCTTGACCAGATAATCTCTCCAAGTCAGTCTGCCTTTGTGCCCGGTAGATTAATTACAGACAACGTCATCCTGGGTTTCGAATGTATTCACTCGATTAAGAGTAAGCGA
GTTGGGAAGTTCGGAGTGGTTGGATTAAAATTGGATATGAGCAAAACTTACGACAGAGTTGAGTGGATTTATCTTCGAAAAATTATGAACAAAATGGGTTTTAGTCCAGG
ATGGATTGAGTTGATCATGGGATGTGTAGAATCAGTGGGCTTTCAAGTTCTTCTCAATGGTACACCGAGAGTTGAGTTCAAGCCTAGACGAGGTCTCCGCCAAGGGGATC
CTCTATCCCCTTACTTATTCCTTATCTGTGCTGAAGGGCTTTCGAGCCTCCTAAATCAAGCTGAGTCTCAAAGGGAGTTAACAGGTTTGCGCATTAATAAGTATTCTCCA
TCAATTTCTCACTTGTTTTATGCAGATGACAGTTTGTTGTTTTTCAAAGCAACTTTGAGTGATTGTAGGACTATCAAGAGTATTCTAAATACCTATGAGAGAGCTTCGGG
ACAAACTATAAACTATGAGAAATCGTTGTTTATGGTTAGCCCTCATACTTGCTCCAATATCAAACATGAGATTAGTAATACTTTAAAAGTCAAGCATAAGGAAAGTCTAG
GTCAATATCTTGGACTACCATCTCAGAATGCGAGGAATAAACGGGAAGTGTTCAGCAGTATCAAGGAGCGGGTATGGAAAGCGCTGCAAGGTTTTGGTGGGGGTCGACAG
GGATTGGAAACAAAATTCATTAGAAAAGCTGGAGAAGGCTATGCTTCAACAAAGTGCAAGGAGGACTAG
mRNA sequenceShow/hide mRNA sequence
ATGGGGGACCGAAATACTAAGTGGTTCCACATAAAAACCAATGAGAGGAGGAAAACTAATAAAATAAGGGGTTTAATTGATGATTCTGGCAATCTGGCGGTGGAAGATGA
GGATATGGAAAGGGTGGTGAATAGTTACTTTGGGGCGATATTCCAAACATCTCATCCCAACATGGAGGACATTGAAAAGATCTTGGAGACGATTCCTAGGACTATAACAG
AGCAGCAAAATGATAGGTTGTTGGCAACATTTTCTCGGACAGAGATTCAGGAGGTGGTAAAGAATATGAACCCAACGAAAGCTCCATGGCCTGACGGTATGCAGGCAATA
TTTTATCAAAAATACTGGGATATTGTGGGTGAGGATGTCTGTAACACATGTATGCAGATCCTAAATGATGGTGCCTCTTTGAGAGCGATTAATAAGACTTACATAGTTTT
AATTCCTAAGGTGAAAGATCCGGAAGTGATGAAGGATTTTCGACCGATTAGTCTATACATTGTGATATATAAGATAGTAGCTAAAGTGCTAGCAAATCGGTTGAAATCAG
TCCTTGACCAGATAATCTCTCCAAGTCAGTCTGCCTTTGTGCCCGGTAGATTAATTACAGACAACGTCATCCTGGGTTTCGAATGTATTCACTCGATTAAGAGTAAGCGA
GTTGGGAAGTTCGGAGTGGTTGGATTAAAATTGGATATGAGCAAAACTTACGACAGAGTTGAGTGGATTTATCTTCGAAAAATTATGAACAAAATGGGTTTTAGTCCAGG
ATGGATTGAGTTGATCATGGGATGTGTAGAATCAGTGGGCTTTCAAGTTCTTCTCAATGGTACACCGAGAGTTGAGTTCAAGCCTAGACGAGGTCTCCGCCAAGGGGATC
CTCTATCCCCTTACTTATTCCTTATCTGTGCTGAAGGGCTTTCGAGCCTCCTAAATCAAGCTGAGTCTCAAAGGGAGTTAACAGGTTTGCGCATTAATAAGTATTCTCCA
TCAATTTCTCACTTGTTTTATGCAGATGACAGTTTGTTGTTTTTCAAAGCAACTTTGAGTGATTGTAGGACTATCAAGAGTATTCTAAATACCTATGAGAGAGCTTCGGG
ACAAACTATAAACTATGAGAAATCGTTGTTTATGGTTAGCCCTCATACTTGCTCCAATATCAAACATGAGATTAGTAATACTTTAAAAGTCAAGCATAAGGAAAGTCTAG
GTCAATATCTTGGACTACCATCTCAGAATGCGAGGAATAAACGGGAAGTGTTCAGCAGTATCAAGGAGCGGGTATGGAAAGCGCTGCAAGGTTTTGGTGGGGGTCGACAG
GGATTGGAAACAAAATTCATTAGAAAAGCTGGAGAAGGCTATGCTTCAACAAAGTGCAAGGAGGACTAG
Protein sequenceShow/hide protein sequence
MGDRNTKWFHIKTNERRKTNKIRGLIDDSGNLAVEDEDMERVVNSYFGAIFQTSHPNMEDIEKILETIPRTITEQQNDRLLATFSRTEIQEVVKNMNPTKAPWPDGMQAI
FYQKYWDIVGEDVCNTCMQILNDGASLRAINKTYIVLIPKVKDPEVMKDFRPISLYIVIYKIVAKVLANRLKSVLDQIISPSQSAFVPGRLITDNVILGFECIHSIKSKR
VGKFGVVGLKLDMSKTYDRVEWIYLRKIMNKMGFSPGWIELIMGCVESVGFQVLLNGTPRVEFKPRRGLRQGDPLSPYLFLICAEGLSSLLNQAESQRELTGLRINKYSP
SISHLFYADDSLLFFKATLSDCRTIKSILNTYERASGQTINYEKSLFMVSPHTCSNIKHEISNTLKVKHKESLGQYLGLPSQNARNKREVFSSIKERVWKALQGFGGGRQ
GLETKFIRKAGEGYASTKCKED