; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0012842 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0012842
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionLINE-1 retrotransposable element ORF2 protein
Genome locationchr1:44865269..44867332
RNA-Seq ExpressionLag0012842
SyntenyLag0012842
Gene Ontology termsNA
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0037436.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]3.5e-8932.58Show/hide
Query:  MVSNLNWCPISGSEATSLIQPFSELEVFQNLKAMGHNKSPGPDGFTVEFFKKSWIDIKPSVMEVFYDFYQKDIINCNVNETYITLIPKKNTALQIQDFRS
        ++ NLNW PI  S   +L   F E E+ + L +   NK+PGPDG+T+EF KK W  +K  V++VF DF++K IIN NVN TYI LI KK+     +D+RS
Subjt:  MVSNLNWCPISGSEATSLIQPFSELEVFQNLKAMGHNKSPGPDGFTVEFFKKSWIDIKPSVMEVFYDFYQKDIINCNVNETYITLIPKKNTALQIQDFRS

Query:  ISLTTVLYRLIAKTLAERLKSTLQGTVSENQLAFVKGRQITDAILVANEVVDFWKCSHTRGFVIKLDIEKAFDKICWNFIDKILAFKGYPITWH------
        ISL T LY++IAK +A RLKSTL  ++S NQLAFVKGRQIT AIL+ANE VDFW  S T+G+V+KLDIEKAFDK+ W+FID +L  K YP+ W       
Subjt:  ISLTTVLYRLIAKTLAERLKSTLQGTVSENQLAFVKGRQITDAILVANEVVDFWKCSHTRGFVIKLDIEKAFDKICWNFIDKILAFKGYPITWH------

Query:  -------------------------------------------------------------------------DILLFVQDNDAMLGNLFNIIKVFELSS
                                                                                 DIL+F++D++  + +L N I +FE +S
Subjt:  -------------------------------------------------------------------------DILLFVQDNDAMLGNLFNIIKVFELSS

Query:  GLNINFSKSSITGINVEDVRVAQIAANWGCPMAQFPIPYLGSPLGIRLSDLFFG---MGKIKVVASLW----LIGAWWLRLLRLGVWAYSRLELQIAHFK
        GL IN SKSSI+ +N+   R A++A  W  P    P  YL  PLG + +   F    M KI+   S W    +     L L++  + +    +L I  FK
Subjt:  GLNINFSKSSITGINVEDVRVAQIAANWGCPMAQFPIPYLGSPLGIRLSDLFFG---MGKIKVVASLW----LIGAWWLRLLRLGVWAYSRLELQIAHFK

Query:  S-NGFG----------------------------DSFMRRLL----------------------CGSESSQQNIQPIDKEL-------FQLRRDSL----
        + +G G                            +   R  L                      C       N++ I+  L       FQ  +DSL    
Subjt:  S-NGFG----------------------------DSFMRRLL----------------------CGSESSQQNIQPIDKEL-------FQLRRDSL----

Query:  -------------PP--------RAPWTSILKQASLFLANTVWNLKDGSKISFWHDSWTDHGPLYQAIPRLFALSSRKNMTVSDAWDHTTSSWKLYPRRP
                     PP        +APW  I+K+   F  ++ W +KDG  +SFWH +W   G L    PRL+ LS  +N T+ + WD  T+ W L+PRRP
Subjt:  -------------PP--------RAPWTSILKQASLFLANTVWNLKDGSKISFWHDSWTDHGPLYQAIPRLFALSSRKNMTVSDAWDHTTSSWKLYPRRP

Query:  LFEREALVQAECASSFPTPALTGGADHMLWNPDSKGRFSVASARHFYCNLSSPDRPQPH---SIIFSNLWKALIPKKDQILYLDSYLQKIK-----YNRQ
        L +RE  +            L GG ++M+W P+ KG ++ ASA+    +    +RP P+      +  LWK+ +PKK +      + + I        R 
Subjt:  LFEREALVQAECASSFPTPALTGGADHMLWNPDSKGRFSVASARHFYCNLSSPDRPQPH---SIIFSNLWKALIPKKDQILYLDSYLQKIK-----YNRQ

Query:  ASTYFQ
         +TY Q
Subjt:  ASTYFQ

KAA0039770.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]3.2e-9034.67Show/hide
Query:  MVSNLNWCPISGSEATSLIQPFSELEVFQNLKAMGHNKSPGPDGFTVEFFKKSWIDIKPSVMEVFYDFYQKDIINCNVNETYITLIPKKNTALQIQDFRS
        ++ NLNW PIS ++A +L   F+E E+ + L A  +NKSPGPDGFT+EF+K +W  +K  ++ +F DF+   IIN  VN T I LI KK    +  D+R 
Subjt:  MVSNLNWCPISGSEATSLIQPFSELEVFQNLKAMGHNKSPGPDGFTVEFFKKSWIDIKPSVMEVFYDFYQKDIINCNVNETYITLIPKKNTALQIQDFRS

Query:  ISLTTVLYRLIAKTLAERLKSTLQGTVSENQLAFVKGRQITDAILVANEVVDFWKCSHTRGFVIKLDIEKAFDKICWNFIDKILAFKGYPITWH------
        ISLTT +Y+LIAK +AERLK TL  TV+ENQ+AFVKGRQI DAILVANE +D+W+    +GFVIKLDIEKAFDK+ W FID +L  KGYP  W       
Subjt:  ISLTTVLYRLIAKTLAERLKSTLQGTVSENQLAFVKGRQITDAILVANEVVDFWKCSHTRGFVIKLDIEKAFDKICWNFIDKILAFKGYPITWH------

Query:  -----------------------------------------------------------------------DILLFVQDNDAMLGNLFNIIKVFELSSGL
                                                                               DILLFV+D++  + NL NII +F+L+SGL
Subjt:  -----------------------------------------------------------------------DILLFVQDNDAMLGNLFNIIKVFELSSGL

Query:  NINFSKSSITGINVEDVRVAQIAANWGCPMAQFPIPYLGSPLG-----------------IRLSDLFFGM----GKIKVV-ASLWLIGAWWLRLLRLGVW
        +IN +KS+I+ INV+  R  QIA+ WG      PI YLG PLG                  +L+   + M    GKI ++ +SL  +  + L + +  V 
Subjt:  NINFSKSSITGINVEDVRVAQIAANWGCPMAQFPIPYLGSPLG-----------------IRLSDLFFGM----GKIKVV-ASLWLIGAWWLRLLRLGVW

Query:  AYSRLE---------------------------------LQIAHFKSNGFG--DSFMRRLLCGSESSQQNIQPIDKELFQLRRDSLP-------PRAPWT
            +E                                 L I+  K   F     ++ R +   E S    + I+ +   L +  +P        R+PW 
Subjt:  AYSRLE---------------------------------LQIAHFKSNGFG--DSFMRRLLCGSESSQQNIQPIDKELFQLRRDSLP-------PRAPWT

Query:  SILKQASLFLANTVWNLKDGSKISFWHDSWTDHGPLYQAIPRLFALSSRKNMTVSDAWDHTTSSWKLYPRRPLFEREALVQAECASSFPTPALTGGADHM
        SI K    F  +  W +K+G   SFWH  W  + PL    PRL+ALS+ K  ++ D W++T   W L PRR L E E  + AE  +S        G D  
Subjt:  SILKQASLFLANTVWNLKDGSKISFWHDSWTDHGPLYQAIPRLFALSSRKNMTVSDAWDHTTSSWKLYPRRPLFEREALVQAECASSFPTPALTGGADHM

Query:  LWNPDSKGRFSVASARHFYCNLSSPDR---PQPHSIIFSNLWKALIPKK
        +W  +S G ++VAS +     L  P++          F NLWK  IPKK
Subjt:  LWNPDSKGRFSVASARHFYCNLSSPDR---PQPHSIIFSNLWKALIPKK

KAA0046762.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]1.9e-9033.33Show/hide
Query:  VSNLNWCPISGSEATSLIQPFSELEVFQNLKAMGHNKSPGPDGFTVEFFKKSWIDIKPSVMEVFYDFYQKDIINCNVNETYITLIPKKNTALQIQDFRSI
        + NL W PI  SE  +L  PF E E+   + ++   K+PGPDGF + FFK  W  +K  +M++F DFY K +IN N+N TYI LIPKK      +DFR I
Subjt:  VSNLNWCPISGSEATSLIQPFSELEVFQNLKAMGHNKSPGPDGFTVEFFKKSWIDIKPSVMEVFYDFYQKDIINCNVNETYITLIPKKNTALQIQDFRSI

Query:  SLTTVLYRLIAKTLAERLKSTLQGTVSENQLAFVKGRQITDAILVANEVVDFWKCSHTRGFVIKLDIEKAFDKICWNFIDKILAFKGYPITWH-------
        SLTT +Y++IAKTL+ RLK++L  T+SENQLAFVK RQITDAIL+ANE VDFWK    +GF++KLDIEKAFD + W+FID +L  K +PI W        
Subjt:  SLTTVLYRLIAKTLAERLKSTLQGTVSENQLAFVKGRQITDAILVANEVVDFWKCSHTRGFVIKLDIEKAFDKICWNFIDKILAFKGYPITWH-------

Query:  ------------------------------------------------------------------------DILLFVQDNDAMLGNLFNIIKVFELSSG
                                                                                DILLF++DND  L NL   + +FE +SG
Subjt:  ------------------------------------------------------------------------DILLFVQDNDAMLGNLFNIIKVFELSSG

Query:  LNINFSKSSITGINVEDVRVAQIAANWGCPMAQFPIPYLGSPLGIR-LSDLFFGMGKIKVVASL--W----LIGAWWLRLLRLGVWAYSRLELQIAHFKS
        L IN  KS++  +NV ++R  + A+ WG      P+ YLG PLG    S LF+   + K+   L  W    +     L L++  + +    +L +    S
Subjt:  LNINFSKSSITGINVEDVRVAQIAANWGCPMAQFPIPYLGSPLGIR-LSDLFFGMGKIKVVASL--W----LIGAWWLRLLRLGVWAYSRLELQIAHFKS

Query:  NGFG--DSFMRRLLCGSESSQQNIQPID-----------------------------------------KELFQLRRDSLPP------------RAPWTS
              + F R  L    +S +    I+                                         + L Q +   + P            +APW S
Subjt:  NGFG--DSFMRRLLCGSESSQQNIQPID-----------------------------------------KELFQLRRDSLPP------------RAPWTS

Query:  ILKQASLFLANTVWNLKDGSKISFWHDSWTDHGPLYQAIPRLFALSSRKNMTVSDAWDHTTSSWKLYPRRPLFEREALVQAECASSFPTPALTGGADHML
        I+     F +N  W+L +G +ISFW+ +W+  G L  A PRLFALS  K++TV DAW+   + W +  RR L +RE    A+     P P    G+    
Subjt:  ILKQASLFLANTVWNLKDGSKISFWHDSWTDHGPLYQAIPRLFALSSRKNMTVSDAWDHTTSSWKLYPRRPLFEREALVQAECASSFPTPALTGGADHML

Query:  WNPDSKGRFSVASARHFYCNLSSPDRPQPHSIIFSNLWKALIPKK
        W PDSK  FS+ASA+             P S +   +WK+ IP K
Subjt:  WNPDSKGRFSVASARHFYCNLSSPDRPQPHSIIFSNLWKALIPKK

TYK00226.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]4.4e-9233.77Show/hide
Query:  MVSNLNWCPISGSEATSLIQPFSELEVFQNLKAMGHNKSPGPDGFTVEFFKKSWIDIKPSVMEVFYDFYQKDIINCNVNETYITLIPKKNTALQIQDFRS
        ++ NLNW PI  S   +L   F E E+ + L +   NK+PGPDG+T+EF KK W  +K  V++VF DF++K IIN NVN TYI LI KK+     +D+RS
Subjt:  MVSNLNWCPISGSEATSLIQPFSELEVFQNLKAMGHNKSPGPDGFTVEFFKKSWIDIKPSVMEVFYDFYQKDIINCNVNETYITLIPKKNTALQIQDFRS

Query:  ISLTTVLYRLIAKTLAERLKSTLQGTVSENQLAFVKGRQITDAILVANEVVDFWKCSHTRGFVIKLDIEKAFDKICWNFIDKILAFKGYPITWH------
        ISL T LY++IAK +A RLKSTL  ++S NQLAFVKGRQIT AIL+ANE VDFW  S T+G+V+KLDIEKAFDK+ W+FID +L  K YP+ W       
Subjt:  ISLTTVLYRLIAKTLAERLKSTLQGTVSENQLAFVKGRQITDAILVANEVVDFWKCSHTRGFVIKLDIEKAFDKICWNFIDKILAFKGYPITWH------

Query:  -------------------------------------------------------------------------DILLFVQDNDAMLGNLFNIIKVFELSS
                                                                                 DIL+F++D++  + +L N I +FE +S
Subjt:  -------------------------------------------------------------------------DILLFVQDNDAMLGNLFNIIKVFELSS

Query:  GLNINFSKSSITGINVEDVRVAQIAANWGCPMAQFPIPYLGSPLGIRLSDLFFG---MGKIKVVASLW----LIGAWWLRLLRLGVWAYSRLELQIAHFK
        GL IN SKSSI+ +N+   R A++A  W  P    P  YL  PLG + +   F    M KI+   S W    +     L L++  + +    +L I  FK
Subjt:  GLNINFSKSSITGINVEDVRVAQIAANWGCPMAQFPIPYLGSPLGIRLSDLFFG---MGKIKVVASLW----LIGAWWLRLLRLGVWAYSRLELQIAHFK

Query:  S-NGFG---DSFMRRLL----------------------CGSESSQQNIQPIDKEL-------FQLRRDSL-----------------PP--------RA
        + +G G   +   R  L                      C       N++ I+  L       FQ  +DSL                 PP        +A
Subjt:  S-NGFG---DSFMRRLL----------------------CGSESSQQNIQPIDKEL-------FQLRRDSL-----------------PP--------RA

Query:  PWTSILKQASLFLANTVWNLKDGSKISFWHDSWTDHGPLYQAIPRLFALSSRKNMTVSDAWDHTTSSWKLYPRRPLFEREALVQAECASSFPTPALTGGA
        PW  I+K+   F  ++ W +KDG  +SFWH +W   G L    PRL+ LS  +N T+ + WD  T+ W L+PRRPL +RE  +            L GG 
Subjt:  PWTSILKQASLFLANTVWNLKDGSKISFWHDSWTDHGPLYQAIPRLFALSSRKNMTVSDAWDHTTSSWKLYPRRPLFEREALVQAECASSFPTPALTGGA

Query:  DHMLWNPDSKGRFSVASARHFYCNLSSPDRPQPH---SIIFSNLWKALIPKKDQILYLDSYLQKIK-----YNRQASTYFQ
        ++M+W P+ KG ++ ASA+    +    +RP P+      +  LWK+ +PKK +      + + I        R  +TY Q
Subjt:  DHMLWNPDSKGRFSVASARHFYCNLSSPDRPQPH---SIIFSNLWKALIPKKDQILYLDSYLQKIK-----YNRQASTYFQ

XP_016902461.1 PREDICTED: LINE-1 retrotransposable element ORF2 protein [Cucumis melo]1.9e-9034.67Show/hide
Query:  MVSNLNWCPISGSEATSLIQPFSELEVFQNLKAMGHNKSPGPDGFTVEFFKKSWIDIKPSVMEVFYDFYQKDIINCNVNETYITLIPKKNTALQIQDFRS
        ++ NLNW PIS ++A +L   F+E E+ + L A  +NKSPGPDGFT+EF+K +W  +K  ++ +F DF+   IIN  VN T I LI KK    +  D+R 
Subjt:  MVSNLNWCPISGSEATSLIQPFSELEVFQNLKAMGHNKSPGPDGFTVEFFKKSWIDIKPSVMEVFYDFYQKDIINCNVNETYITLIPKKNTALQIQDFRS

Query:  ISLTTVLYRLIAKTLAERLKSTLQGTVSENQLAFVKGRQITDAILVANEVVDFWKCSHTRGFVIKLDIEKAFDKICWNFIDKILAFKGYPITWH------
        ISLTT +Y+LIAK +AERLK TL  TV+ENQ+AFVKGRQI DAILVANE +D+W+    +GFVIKLDIEKAFDK+ W FID +L  KGYP  W       
Subjt:  ISLTTVLYRLIAKTLAERLKSTLQGTVSENQLAFVKGRQITDAILVANEVVDFWKCSHTRGFVIKLDIEKAFDKICWNFIDKILAFKGYPITWH------

Query:  -----------------------------------------------------------------------DILLFVQDNDAMLGNLFNIIKVFELSSGL
                                                                               DILLFV+D++  + NL NII +F+L+SGL
Subjt:  -----------------------------------------------------------------------DILLFVQDNDAMLGNLFNIIKVFELSSGL

Query:  NINFSKSSITGINVEDVRVAQIAANWGCPMAQFPIPYLGSPLG-----------------IRLSDLFFGM----GKIKVV-ASLWLIGAWWLRLLRLGVW
        +IN +KS+I+ INV+  R  QIA+ WG      PI YLG PLG                  +L+   + M    GKI ++ +SL  +  + L + ++ V 
Subjt:  NINFSKSSITGINVEDVRVAQIAANWGCPMAQFPIPYLGSPLG-----------------IRLSDLFFGM----GKIKVV-ASLWLIGAWWLRLLRLGVW

Query:  AYSRLE---------------------------------LQIAHFKSNGFG--DSFMRRLLCGSESSQQNIQPIDKELFQLRRDSLP-------PRAPWT
            +E                                 L I+  K   F     ++ R +   E S    + I+ +   L +  +P        R+PW 
Subjt:  AYSRLE---------------------------------LQIAHFKSNGFG--DSFMRRLLCGSESSQQNIQPIDKELFQLRRDSLP-------PRAPWT

Query:  SILKQASLFLANTVWNLKDGSKISFWHDSWTDHGPLYQAIPRLFALSSRKNMTVSDAWDHTTSSWKLYPRRPLFEREALVQAECASSFPTPALTGGADHM
        SI K    F  +  W +K+G   SFWH  W  + PL    PRL+ALS+ K  ++ D W++T   W L PRR L E E  + AE  +S        G D  
Subjt:  SILKQASLFLANTVWNLKDGSKISFWHDSWTDHGPLYQAIPRLFALSSRKNMTVSDAWDHTTSSWKLYPRRPLFEREALVQAECASSFPTPALTGGADHM

Query:  LWNPDSKGRFSVASARHFYCNLSSPDR---PQPHSIIFSNLWKALIPKK
        +W  +S G ++VAS +     L  P++          F NLWK  IPKK
Subjt:  LWNPDSKGRFSVASARHFYCNLSSPDR---PQPHSIIFSNLWKALIPKK

TrEMBL top hitse value%identityAlignment
A0A1S4E2K5 LINE-1 retrotransposable element ORF2 protein9.0e-9134.67Show/hide
Query:  MVSNLNWCPISGSEATSLIQPFSELEVFQNLKAMGHNKSPGPDGFTVEFFKKSWIDIKPSVMEVFYDFYQKDIINCNVNETYITLIPKKNTALQIQDFRS
        ++ NLNW PIS ++A +L   F+E E+ + L A  +NKSPGPDGFT+EF+K +W  +K  ++ +F DF+   IIN  VN T I LI KK    +  D+R 
Subjt:  MVSNLNWCPISGSEATSLIQPFSELEVFQNLKAMGHNKSPGPDGFTVEFFKKSWIDIKPSVMEVFYDFYQKDIINCNVNETYITLIPKKNTALQIQDFRS

Query:  ISLTTVLYRLIAKTLAERLKSTLQGTVSENQLAFVKGRQITDAILVANEVVDFWKCSHTRGFVIKLDIEKAFDKICWNFIDKILAFKGYPITWH------
        ISLTT +Y+LIAK +AERLK TL  TV+ENQ+AFVKGRQI DAILVANE +D+W+    +GFVIKLDIEKAFDK+ W FID +L  KGYP  W       
Subjt:  ISLTTVLYRLIAKTLAERLKSTLQGTVSENQLAFVKGRQITDAILVANEVVDFWKCSHTRGFVIKLDIEKAFDKICWNFIDKILAFKGYPITWH------

Query:  -----------------------------------------------------------------------DILLFVQDNDAMLGNLFNIIKVFELSSGL
                                                                               DILLFV+D++  + NL NII +F+L+SGL
Subjt:  -----------------------------------------------------------------------DILLFVQDNDAMLGNLFNIIKVFELSSGL

Query:  NINFSKSSITGINVEDVRVAQIAANWGCPMAQFPIPYLGSPLG-----------------IRLSDLFFGM----GKIKVV-ASLWLIGAWWLRLLRLGVW
        +IN +KS+I+ INV+  R  QIA+ WG      PI YLG PLG                  +L+   + M    GKI ++ +SL  +  + L + ++ V 
Subjt:  NINFSKSSITGINVEDVRVAQIAANWGCPMAQFPIPYLGSPLG-----------------IRLSDLFFGM----GKIKVV-ASLWLIGAWWLRLLRLGVW

Query:  AYSRLE---------------------------------LQIAHFKSNGFG--DSFMRRLLCGSESSQQNIQPIDKELFQLRRDSLP-------PRAPWT
            +E                                 L I+  K   F     ++ R +   E S    + I+ +   L +  +P        R+PW 
Subjt:  AYSRLE---------------------------------LQIAHFKSNGFG--DSFMRRLLCGSESSQQNIQPIDKELFQLRRDSLP-------PRAPWT

Query:  SILKQASLFLANTVWNLKDGSKISFWHDSWTDHGPLYQAIPRLFALSSRKNMTVSDAWDHTTSSWKLYPRRPLFEREALVQAECASSFPTPALTGGADHM
        SI K    F  +  W +K+G   SFWH  W  + PL    PRL+ALS+ K  ++ D W++T   W L PRR L E E  + AE  +S        G D  
Subjt:  SILKQASLFLANTVWNLKDGSKISFWHDSWTDHGPLYQAIPRLFALSSRKNMTVSDAWDHTTSSWKLYPRRPLFEREALVQAECASSFPTPALTGGADHM

Query:  LWNPDSKGRFSVASARHFYCNLSSPDR---PQPHSIIFSNLWKALIPKK
        +W  +S G ++VAS +     L  P++          F NLWK  IPKK
Subjt:  LWNPDSKGRFSVASARHFYCNLSSPDR---PQPHSIIFSNLWKALIPKK

A0A5A7T9I7 LINE-1 retrotransposable element ORF2 protein1.7e-8934.37Show/hide
Query:  VSNLNWCPISGSEATSLIQPFSELEVFQNLKAMGHNKSPGPDGFTVEFFKKSWIDIKPSVMEVFYDFYQKDIINCNVNETYITLIPKKNTALQIQDFRSI
        + NL+WCPIS   +  L +PF+E E++  LK+   NK+PGPDG+ ++F +KSW  +K ++ ++F DF+   IIN  VNET ITLI KK       DFR I
Subjt:  VSNLNWCPISGSEATSLIQPFSELEVFQNLKAMGHNKSPGPDGFTVEFFKKSWIDIKPSVMEVFYDFYQKDIINCNVNETYITLIPKKNTALQIQDFRSI

Query:  SLTTVLYRLIAKTLAERLKSTLQGTVSENQLAFVKGRQITDAILVANEVVDFWKCSHTRGFVIKLDIEKAFDKICWNFIDKILAFKGYPITWH-------
        SLTT +Y+LIAKTLA+RLK TL  T+SE+Q+AFVKGRQIT+AIL+ANE +DFW+    RGFVIKLDIEKAFDK+ W FID +L  K Y   W        
Subjt:  SLTTVLYRLIAKTLAERLKSTLQGTVSENQLAFVKGRQITDAILVANEVVDFWKCSHTRGFVIKLDIEKAFDKICWNFIDKILAFKGYPITWH-------

Query:  ------------------------------------------------------------------------DILLFVQDNDAMLGNLFNIIKVFELSSG
                                                                                DIL+FV+D D  + NL  I+ +FE +SG
Subjt:  ------------------------------------------------------------------------DILLFVQDNDAMLGNLFNIIKVFELSSG

Query:  LNINFSKSSITGINVEDVRVAQIAANWGCPMAQFPIPYLGSPLGIRLSDLFFG---MGKIKVVASLW------------LIGA-------WWLRLLRLGV
        LNIN SKS+I  INV   R   IA +WG      P  YLG PLG R S   F    + KI+   S W            LI +       + + + ++  
Subjt:  LNINFSKSSITGINVEDVRVAQIAANWGCPMAQFPIPYLGSPLGIRLSDLFFG---MGKIKVVASLW------------LIGA-------WWLRLLRLGV

Query:  WAYSRLELQIAHF----KSNGFGDSFMR--RLLCGSESSQQNIQPIDKELFQL----------RRDSLPPR------------------------APWTS
            ++E    +F     SNG   S +R  +++   E     I  ++   F L           +D L  R                        +PW +
Subjt:  WAYSRLELQIAHF----KSNGFGDSFMR--RLLCGSESSQQNIQPIDKELFQL----------RRDSLPPR------------------------APWTS

Query:  ILKQASLFLANTVWNLKDGSKISFWHDSWTDHGPLYQAIPRLFALSSRKNMTVSDAWDHTTSSWKLYPRRPLFEREALVQAECASSFPTPALTGGADHML
        + +  S F  N  W + DG  ISFW D+W  + PL  A+PRLFALS+ K  +V + W+ +++ W L+  RPL + E  +     +S PTP    G    L
Subjt:  ILKQASLFLANTVWNLKDGSKISFWHDSWTDHGPLYQAIPRLFALSSRKNMTVSDAWDHTTSSWKLYPRRPLFEREALVQAECASSFPTPALTGGADHML

Query:  WNPDSKGRFSVASARHFYCNLSSPDRPQP-HSIIFSNLWKALIPKK
        WN +S   F  AS +       +P  P   H  ++  LWK   PKK
Subjt:  WNPDSKGRFSVASARHFYCNLSSPDRPQP-HSIIFSNLWKALIPKK

A0A5A7TTK1 LINE-1 retrotransposable element ORF2 protein9.0e-9133.33Show/hide
Query:  VSNLNWCPISGSEATSLIQPFSELEVFQNLKAMGHNKSPGPDGFTVEFFKKSWIDIKPSVMEVFYDFYQKDIINCNVNETYITLIPKKNTALQIQDFRSI
        + NL W PI  SE  +L  PF E E+   + ++   K+PGPDGF + FFK  W  +K  +M++F DFY K +IN N+N TYI LIPKK      +DFR I
Subjt:  VSNLNWCPISGSEATSLIQPFSELEVFQNLKAMGHNKSPGPDGFTVEFFKKSWIDIKPSVMEVFYDFYQKDIINCNVNETYITLIPKKNTALQIQDFRSI

Query:  SLTTVLYRLIAKTLAERLKSTLQGTVSENQLAFVKGRQITDAILVANEVVDFWKCSHTRGFVIKLDIEKAFDKICWNFIDKILAFKGYPITWH-------
        SLTT +Y++IAKTL+ RLK++L  T+SENQLAFVK RQITDAIL+ANE VDFWK    +GF++KLDIEKAFD + W+FID +L  K +PI W        
Subjt:  SLTTVLYRLIAKTLAERLKSTLQGTVSENQLAFVKGRQITDAILVANEVVDFWKCSHTRGFVIKLDIEKAFDKICWNFIDKILAFKGYPITWH-------

Query:  ------------------------------------------------------------------------DILLFVQDNDAMLGNLFNIIKVFELSSG
                                                                                DILLF++DND  L NL   + +FE +SG
Subjt:  ------------------------------------------------------------------------DILLFVQDNDAMLGNLFNIIKVFELSSG

Query:  LNINFSKSSITGINVEDVRVAQIAANWGCPMAQFPIPYLGSPLGIR-LSDLFFGMGKIKVVASL--W----LIGAWWLRLLRLGVWAYSRLELQIAHFKS
        L IN  KS++  +NV ++R  + A+ WG      P+ YLG PLG    S LF+   + K+   L  W    +     L L++  + +    +L +    S
Subjt:  LNINFSKSSITGINVEDVRVAQIAANWGCPMAQFPIPYLGSPLGIR-LSDLFFGMGKIKVVASL--W----LIGAWWLRLLRLGVWAYSRLELQIAHFKS

Query:  NGFG--DSFMRRLLCGSESSQQNIQPID-----------------------------------------KELFQLRRDSLPP------------RAPWTS
              + F R  L    +S +    I+                                         + L Q +   + P            +APW S
Subjt:  NGFG--DSFMRRLLCGSESSQQNIQPID-----------------------------------------KELFQLRRDSLPP------------RAPWTS

Query:  ILKQASLFLANTVWNLKDGSKISFWHDSWTDHGPLYQAIPRLFALSSRKNMTVSDAWDHTTSSWKLYPRRPLFEREALVQAECASSFPTPALTGGADHML
        I+     F +N  W+L +G +ISFW+ +W+  G L  A PRLFALS  K++TV DAW+   + W +  RR L +RE    A+     P P    G+    
Subjt:  ILKQASLFLANTVWNLKDGSKISFWHDSWTDHGPLYQAIPRLFALSSRKNMTVSDAWDHTTSSWKLYPRRPLFEREALVQAECASSFPTPALTGGADHML

Query:  WNPDSKGRFSVASARHFYCNLSSPDRPQPHSIIFSNLWKALIPKK
        W PDSK  FS+ASA+             P S +   +WK+ IP K
Subjt:  WNPDSKGRFSVASARHFYCNLSSPDRPQPHSIIFSNLWKALIPKK

A0A5D3BPP1 LINE-1 retrotransposable element ORF2 protein2.1e-9233.77Show/hide
Query:  MVSNLNWCPISGSEATSLIQPFSELEVFQNLKAMGHNKSPGPDGFTVEFFKKSWIDIKPSVMEVFYDFYQKDIINCNVNETYITLIPKKNTALQIQDFRS
        ++ NLNW PI  S   +L   F E E+ + L +   NK+PGPDG+T+EF KK W  +K  V++VF DF++K IIN NVN TYI LI KK+     +D+RS
Subjt:  MVSNLNWCPISGSEATSLIQPFSELEVFQNLKAMGHNKSPGPDGFTVEFFKKSWIDIKPSVMEVFYDFYQKDIINCNVNETYITLIPKKNTALQIQDFRS

Query:  ISLTTVLYRLIAKTLAERLKSTLQGTVSENQLAFVKGRQITDAILVANEVVDFWKCSHTRGFVIKLDIEKAFDKICWNFIDKILAFKGYPITWH------
        ISL T LY++IAK +A RLKSTL  ++S NQLAFVKGRQIT AIL+ANE VDFW  S T+G+V+KLDIEKAFDK+ W+FID +L  K YP+ W       
Subjt:  ISLTTVLYRLIAKTLAERLKSTLQGTVSENQLAFVKGRQITDAILVANEVVDFWKCSHTRGFVIKLDIEKAFDKICWNFIDKILAFKGYPITWH------

Query:  -------------------------------------------------------------------------DILLFVQDNDAMLGNLFNIIKVFELSS
                                                                                 DIL+F++D++  + +L N I +FE +S
Subjt:  -------------------------------------------------------------------------DILLFVQDNDAMLGNLFNIIKVFELSS

Query:  GLNINFSKSSITGINVEDVRVAQIAANWGCPMAQFPIPYLGSPLGIRLSDLFFG---MGKIKVVASLW----LIGAWWLRLLRLGVWAYSRLELQIAHFK
        GL IN SKSSI+ +N+   R A++A  W  P    P  YL  PLG + +   F    M KI+   S W    +     L L++  + +    +L I  FK
Subjt:  GLNINFSKSSITGINVEDVRVAQIAANWGCPMAQFPIPYLGSPLGIRLSDLFFG---MGKIKVVASLW----LIGAWWLRLLRLGVWAYSRLELQIAHFK

Query:  S-NGFG---DSFMRRLL----------------------CGSESSQQNIQPIDKEL-------FQLRRDSL-----------------PP--------RA
        + +G G   +   R  L                      C       N++ I+  L       FQ  +DSL                 PP        +A
Subjt:  S-NGFG---DSFMRRLL----------------------CGSESSQQNIQPIDKEL-------FQLRRDSL-----------------PP--------RA

Query:  PWTSILKQASLFLANTVWNLKDGSKISFWHDSWTDHGPLYQAIPRLFALSSRKNMTVSDAWDHTTSSWKLYPRRPLFEREALVQAECASSFPTPALTGGA
        PW  I+K+   F  ++ W +KDG  +SFWH +W   G L    PRL+ LS  +N T+ + WD  T+ W L+PRRPL +RE  +            L GG 
Subjt:  PWTSILKQASLFLANTVWNLKDGSKISFWHDSWTDHGPLYQAIPRLFALSSRKNMTVSDAWDHTTSSWKLYPRRPLFEREALVQAECASSFPTPALTGGA

Query:  DHMLWNPDSKGRFSVASARHFYCNLSSPDRPQPH---SIIFSNLWKALIPKKDQILYLDSYLQKIK-----YNRQASTYFQ
        ++M+W P+ KG ++ ASA+    +    +RP P+      +  LWK+ +PKK +      + + I        R  +TY Q
Subjt:  DHMLWNPDSKGRFSVASARHFYCNLSSPDRPQPH---SIIFSNLWKALIPKKDQILYLDSYLQKIK-----YNRQASTYFQ

A0A5D3DM72 LINE-1 retrotransposable element ORF2 protein1.5e-9034.67Show/hide
Query:  MVSNLNWCPISGSEATSLIQPFSELEVFQNLKAMGHNKSPGPDGFTVEFFKKSWIDIKPSVMEVFYDFYQKDIINCNVNETYITLIPKKNTALQIQDFRS
        ++ NLNW PIS ++A +L   F+E E+ + L A  +NKSPGPDGFT+EF+K +W  +K  ++ +F DF+   IIN  VN T I LI KK    +  D+R 
Subjt:  MVSNLNWCPISGSEATSLIQPFSELEVFQNLKAMGHNKSPGPDGFTVEFFKKSWIDIKPSVMEVFYDFYQKDIINCNVNETYITLIPKKNTALQIQDFRS

Query:  ISLTTVLYRLIAKTLAERLKSTLQGTVSENQLAFVKGRQITDAILVANEVVDFWKCSHTRGFVIKLDIEKAFDKICWNFIDKILAFKGYPITWH------
        ISLTT +Y+LIAK +AERLK TL  TV+ENQ+AFVKGRQI DAILVANE +D+W+    +GFVIKLDIEKAFDK+ W FID +L  KGYP  W       
Subjt:  ISLTTVLYRLIAKTLAERLKSTLQGTVSENQLAFVKGRQITDAILVANEVVDFWKCSHTRGFVIKLDIEKAFDKICWNFIDKILAFKGYPITWH------

Query:  -----------------------------------------------------------------------DILLFVQDNDAMLGNLFNIIKVFELSSGL
                                                                               DILLFV+D++  + NL NII +F+L+SGL
Subjt:  -----------------------------------------------------------------------DILLFVQDNDAMLGNLFNIIKVFELSSGL

Query:  NINFSKSSITGINVEDVRVAQIAANWGCPMAQFPIPYLGSPLG-----------------IRLSDLFFGM----GKIKVV-ASLWLIGAWWLRLLRLGVW
        +IN +KS+I+ INV+  R  QIA+ WG      PI YLG PLG                  +L+   + M    GKI ++ +SL  +  + L + +  V 
Subjt:  NINFSKSSITGINVEDVRVAQIAANWGCPMAQFPIPYLGSPLG-----------------IRLSDLFFGM----GKIKVV-ASLWLIGAWWLRLLRLGVW

Query:  AYSRLE---------------------------------LQIAHFKSNGFG--DSFMRRLLCGSESSQQNIQPIDKELFQLRRDSLP-------PRAPWT
            +E                                 L I+  K   F     ++ R +   E S    + I+ +   L +  +P        R+PW 
Subjt:  AYSRLE---------------------------------LQIAHFKSNGFG--DSFMRRLLCGSESSQQNIQPIDKELFQLRRDSLP-------PRAPWT

Query:  SILKQASLFLANTVWNLKDGSKISFWHDSWTDHGPLYQAIPRLFALSSRKNMTVSDAWDHTTSSWKLYPRRPLFEREALVQAECASSFPTPALTGGADHM
        SI K    F  +  W +K+G   SFWH  W  + PL    PRL+ALS+ K  ++ D W++T   W L PRR L E E  + AE  +S        G D  
Subjt:  SILKQASLFLANTVWNLKDGSKISFWHDSWTDHGPLYQAIPRLFALSSRKNMTVSDAWDHTTSSWKLYPRRPLFEREALVQAECASSFPTPALTGGADHM

Query:  LWNPDSKGRFSVASARHFYCNLSSPDR---PQPHSIIFSNLWKALIPKK
        +W  +S G ++VAS +     L  P++          F NLWK  IPKK
Subjt:  LWNPDSKGRFSVASARHFYCNLSSPDR---PQPHSIIFSNLWKALIPKK

SwissProt top hitse value%identityAlignment
O00370 LINE-1 retrotransposable element ORF2 protein1.7e-1430Show/hide
Query:  ISGSEATSLIQPFSELEVFQNLKAMGHNKSPGPDGFTVEFFKKSWIDIKPSVMEVFYDFYQKDIINCNVNETYITLIPKK-NTALQIQDFRSISLTTVLY
        ++  E  SL +P +  E+   + ++   KSPGPDGFT EF+++   ++ P ++++F    ++ I+  +  E  I LIPK      + ++FR ISL  +  
Subjt:  ISGSEATSLIQPFSELEVFQNLKAMGHNKSPGPDGFTVEFFKKSWIDIKPSVMEVFYDFYQKDIINCNVNETYITLIPKK-NTALQIQDFRSISLTTVLY

Query:  RLIAKTLAERLKSTLQGTVSENQLAFVKGRQ----ITDAILVANEVVDFWKCSHTRGFVIKLDIEKAFDKICWNFIDKIL
        +++ K LA R++  ++  +  +Q+ F+ G Q    I  +I V   +      +H    +I +D EKAFDKI   F+ K L
Subjt:  RLIAKTLAERLKSTLQGTVSENQLAFVKGRQ----ITDAILVANEVVDFWKCSHTRGFVIKLDIEKAFDKICWNFIDKIL

P08548 LINE-1 reverse transcriptase homolog1.9e-1328.33Show/hide
Query:  ISGSEATSLIQPFSELEVFQNLKAMGHNKSPGPDGFTVEFFKKSWIDIKPSVMEVFYDFYQKDIINCNVNETYITLIPKK-NTALQIQDFRSISLTTVLY
        +S  E   L +P S  E+   ++ +   KSPGPDGFT EF++    ++ P ++ +F +  ++ I+     E  ITLIPK      + +++R ISL  +  
Subjt:  ISGSEATSLIQPFSELEVFQNLKAMGHNKSPGPDGFTVEFFKKSWIDIKPSVMEVFYDFYQKDIINCNVNETYITLIPKK-NTALQIQDFRSISLTTVLY

Query:  RLIAKTLAERLKSTLQGTVSENQLAFVKGRQ----ITDAILVANEVVDFWKCSHTRGFVIKLDIEKAFDKICWNFIDKIL
        +++ K L  R++  ++  +  +Q+ F+ G Q    I  +I V   +       H    ++ +D EKAFD I   F+ + L
Subjt:  RLIAKTLAERLKSTLQGTVSENQLAFVKGRQ----ITDAILVANEVVDFWKCSHTRGFVIKLDIEKAFDKICWNFIDKIL

P11369 LINE-1 retrotransposable element ORF2 protein1.2e-1530.39Show/hide
Query:  ISGSEATSLIQPFSELEVFQNLKAMGHNKSPGPDGFTVEFFKKSWIDIKPSVMEVFYDFYQKDIINCNVNETYITLIPK-KNTALQIQDFRSISLTTVLY
        ++  +   L  P S  E+   + ++   KSPGPDGF+ EF++    D+ P + ++F+    +  +  +  E  ITLIPK +    +I++FR ISL  +  
Subjt:  ISGSEATSLIQPFSELEVFQNLKAMGHNKSPGPDGFTVEFFKKSWIDIKPSVMEVFYDFYQKDIINCNVNETYITLIPK-KNTALQIQDFRSISLTTVLY

Query:  RLIAKTLAERLKSTLQGTVSENQLAFVKGRQITDAILVANEVVDFW-KCSHTRGFVIKLDIEKAFDKICWNFIDKILAFKG
        +++ K LA R++  ++  +  +Q+ F+ G Q    I  +  V+ +  K       +I LD EKAFDKI   F+ K+L   G
Subjt:  RLIAKTLAERLKSTLQGTVSENQLAFVKGRQITDAILVANEVVDFW-KCSHTRGFVIKLDIEKAFDKICWNFIDKILAFKG

P14381 Transposon TX1 uncharacterized 149 kDa protein1.5e-1831.93Show/hide
Query:  ISGSEATSLIQPFSELEVFQNLKAMGHNKSPGPDGFTVEFFKKSWIDIKPSVMEVFYDFYQKDIINCNVNETYITLIPKKNTALQIQDFRSISLTTVLYR
        +S      L  P +  E+ Q L+ M HNKSPG DG T+EFF+  W  + P    V  + ++K  +  +     ++L+PKK     I+++R +SL +  Y+
Subjt:  ISGSEATSLIQPFSELEVFQNLKAMGHNKSPGPDGFTVEFFKKSWIDIKPSVMEVFYDFYQKDIINCNVNETYITLIPKKNTALQIQDFRSISLTTVLYR

Query:  LIAKTLAERLKSTLQGTVSENQLAFVKGRQITDAILVANEVVDFWKCSHTRGFVIKLDIEKAFDKI
        ++AK ++ RLKS L   +  +Q   V GR I D + +  +++ F + +      + LD EKAFD++
Subjt:  LIAKTLAERLKSTLQGTVSENQLAFVKGRQITDAILVANEVVDFWKCSHTRGFVIKLDIEKAFDKI

P16423 Retrovirus-related Pol polyprotein from type-2 retrotransposable element R2DM2.8e-0427.81Show/hide
Query:  QNLKA--MGHNKSPGPDGFTVEFFKKSWIDIKPSVMEVFYDFYQKDIINC-----NVNETYITLIPKKNTALQIQDFRSISLTTVLYRLIAKTLAERLKS
        Q+L+A  +  + SPGPDG T +  ++    I   +M +        I+ C     ++       IPK  TA + QDFR IS+ +VL R +   LA RL S
Subjt:  QNLKA--MGHNKSPGPDGFTVEFFKKSWIDIKPSVMEVFYDFYQKDIINC-----NVNETYITLIPKKNTALQIQDFRSISLTTVLYRLIAKTLAERLKS

Query:  TLQGTVSENQLAFVKGRQITDAILVANEVVDFWKCSHTRGFVIKLDIEKAFDKICWNFIDKILAFKGYP
        ++       Q  F+      D   + + V+          ++  LD+ KAFD +    I   L   G P
Subjt:  TLQGTVSENQLAFVKGRQITDAILVANEVVDFWKCSHTRGFVIKLDIEKAFDKICWNFIDKILAFKGYP

Arabidopsis top hitse value%identityAlignment
AT1G43760.1 DNAse I-like superfamily protein8.2e-1240.45Show/hide
Query:  SELEVFQNLKAMGHNKSPGPDGFTVEFFKKSWIDIKPSVMEVFYDFYQKDIINCNVNETYITLIPKKNTALQIQDFRSISLTTVLYRLI
        S+ E+   + AM  NK+PGPD FT EFF +SW  +K S +    +F++   +    N T ITLIPK     Q+  FR +S  TV+Y++I
Subjt:  SELEVFQNLKAMGHNKSPGPDGFTVEFFKKSWIDIKPSVMEVFYDFYQKDIINCNVNETYITLIPKKNTALQIQDFRSISLTTVLYRLI

AT4G20520.1 RNA binding;RNA-directed DNA polymerases2.7e-0733.33Show/hide
Query:  LAERLKSTLQGTVSENQLAFVKGRQITDAILVANEVV-DFWKCSHTRGF-VIKLDIEKAFDKICWNFIDKILAFKGYPITW
        + ERLK  +   +   Q +F+ GR  TD I+   E V    +    +G+ ++KLD+EKA+D+I W++++  L   G+P  W
Subjt:  LAERLKSTLQGTVSENQLAFVKGRQITDAILVANEVV-DFWKCSHTRGF-VIKLDIEKAFDKICWNFIDKILAFKGYPITW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTCTCAAATCTAAATTGGTGCCCTATTTCTGGTTCTGAGGCGACATCTTTGATTCAACCTTTTTCTGAGCTAGAAGTCTTTCAGAATCTAAAAGCGATGGGTCATAA
TAAGTCCCCTGGCCCGGATGGATTCACTGTTGAATTCTTTAAAAAGTCCTGGATCGATATCAAGCCCTCAGTGATGGAAGTGTTCTATGACTTCTATCAGAAGGATATCA
TCAACTGTAATGTTAATGAAACCTACATTACTTTGATACCTAAGAAGAATACAGCTTTGCAAATTCAGGACTTTAGATCCATTAGTCTCACCACAGTTCTTTATCGCCTT
ATCGCTAAGACTCTTGCAGAAAGGCTTAAAAGTACTCTCCAAGGGACAGTATCTGAAAATCAACTAGCATTTGTTAAAGGTCGTCAAATTACTGATGCTATTTTGGTGGC
TAATGAAGTTGTTGATTTCTGGAAATGTTCTCACACGAGAGGATTCGTTATAAAGCTGGATATTGAAAAGGCTTTTGACAAGATCTGTTGGAACTTCATTGATAAGATTC
TCGCTTTCAAGGGATACCCTATTACCTGGCATGACATTCTCCTCTTTGTTCAAGATAACGATGCTATGTTAGGCAACCTGTTCAACATCATCAAAGTATTTGAACTATCT
TCGGGTCTCAATATAAACTTTAGCAAATCCTCTATAACGGGTATCAACGTGGAGGATGTCAGAGTTGCTCAAATCGCCGCCAATTGGGGATGCCCAATGGCCCAATTCCC
CATTCCTTATTTAGGCTCTCCCTTGGGGATAAGATTATCAGATCTTTTCTTTGGCATGGGCAAGATCAAAGTAGTAGCATCCCTTTGGTTAATTGGGGCATGGTGGCTGC
GCCTATTGAGGCTGGGGGTTTGGGCTTATTCAAGACTAGAATTACAAATAGCGCATTTCAAGTCAAATGGCTTTGGAGATTCTTTCATGAGGAGACTTCTATGTGGAAGC
GAGTCATCTCAGCAAAATATACAACCCATAGACAAGGAGCTCTTCCAACTCAGGCGCGATTCACTTCCTCCCCGAGCGCCATGGACATCAATTCTCAAGCAAGCCTCTCT
TTTTCTGGCAAATACAGTTTGGAATCTTAAGGATGGCAGTAAAATCTCTTTTTGGCATGACTCTTGGACCGATCATGGGCCATTATATCAAGCCATCCCTCGCCTTTTTG
CGCTGTCCAGTAGGAAAAATATGACTGTCTCGGATGCCTGGGACCACACCACTTCTTCATGGAAGCTTTATCCTAGACGCCCTTTGTTCGAAAGGGAGGCCTTGGTTCAG
GCGGAATGTGCTTCCTCATTTCCAACTCCTGCATTAACTGGAGGGGCGGACCACATGCTTTGGAACCCAGACTCCAAGGGTCGCTTCTCCGTTGCTTCAGCTAGGCATTT
CTATTGCAATCTATCATCGCCAGATCGGCCACAGCCTCATTCCATTATTTTCTCAAATCTATGGAAGGCCCTGATTCCAAAAAAAGATCAAATTCTTTATTTGGACAGTT
ATTTACAGAAGATTAAATACAACAGACAGGCTTCAACATATTTTCAAATCCACTGCTCTAAACCCGAGCTGTTGCCCCCTTTGCCATGCAGGCTCAGATGA
mRNA sequenceShow/hide mRNA sequence
ATGGTCTCAAATCTAAATTGGTGCCCTATTTCTGGTTCTGAGGCGACATCTTTGATTCAACCTTTTTCTGAGCTAGAAGTCTTTCAGAATCTAAAAGCGATGGGTCATAA
TAAGTCCCCTGGCCCGGATGGATTCACTGTTGAATTCTTTAAAAAGTCCTGGATCGATATCAAGCCCTCAGTGATGGAAGTGTTCTATGACTTCTATCAGAAGGATATCA
TCAACTGTAATGTTAATGAAACCTACATTACTTTGATACCTAAGAAGAATACAGCTTTGCAAATTCAGGACTTTAGATCCATTAGTCTCACCACAGTTCTTTATCGCCTT
ATCGCTAAGACTCTTGCAGAAAGGCTTAAAAGTACTCTCCAAGGGACAGTATCTGAAAATCAACTAGCATTTGTTAAAGGTCGTCAAATTACTGATGCTATTTTGGTGGC
TAATGAAGTTGTTGATTTCTGGAAATGTTCTCACACGAGAGGATTCGTTATAAAGCTGGATATTGAAAAGGCTTTTGACAAGATCTGTTGGAACTTCATTGATAAGATTC
TCGCTTTCAAGGGATACCCTATTACCTGGCATGACATTCTCCTCTTTGTTCAAGATAACGATGCTATGTTAGGCAACCTGTTCAACATCATCAAAGTATTTGAACTATCT
TCGGGTCTCAATATAAACTTTAGCAAATCCTCTATAACGGGTATCAACGTGGAGGATGTCAGAGTTGCTCAAATCGCCGCCAATTGGGGATGCCCAATGGCCCAATTCCC
CATTCCTTATTTAGGCTCTCCCTTGGGGATAAGATTATCAGATCTTTTCTTTGGCATGGGCAAGATCAAAGTAGTAGCATCCCTTTGGTTAATTGGGGCATGGTGGCTGC
GCCTATTGAGGCTGGGGGTTTGGGCTTATTCAAGACTAGAATTACAAATAGCGCATTTCAAGTCAAATGGCTTTGGAGATTCTTTCATGAGGAGACTTCTATGTGGAAGC
GAGTCATCTCAGCAAAATATACAACCCATAGACAAGGAGCTCTTCCAACTCAGGCGCGATTCACTTCCTCCCCGAGCGCCATGGACATCAATTCTCAAGCAAGCCTCTCT
TTTTCTGGCAAATACAGTTTGGAATCTTAAGGATGGCAGTAAAATCTCTTTTTGGCATGACTCTTGGACCGATCATGGGCCATTATATCAAGCCATCCCTCGCCTTTTTG
CGCTGTCCAGTAGGAAAAATATGACTGTCTCGGATGCCTGGGACCACACCACTTCTTCATGGAAGCTTTATCCTAGACGCCCTTTGTTCGAAAGGGAGGCCTTGGTTCAG
GCGGAATGTGCTTCCTCATTTCCAACTCCTGCATTAACTGGAGGGGCGGACCACATGCTTTGGAACCCAGACTCCAAGGGTCGCTTCTCCGTTGCTTCAGCTAGGCATTT
CTATTGCAATCTATCATCGCCAGATCGGCCACAGCCTCATTCCATTATTTTCTCAAATCTATGGAAGGCCCTGATTCCAAAAAAAGATCAAATTCTTTATTTGGACAGTT
ATTTACAGAAGATTAAATACAACAGACAGGCTTCAACATATTTTCAAATCCACTGCTCTAAACCCGAGCTGTTGCCCCCTTTGCCATGCAGGCTCAGATGA
Protein sequenceShow/hide protein sequence
MVSNLNWCPISGSEATSLIQPFSELEVFQNLKAMGHNKSPGPDGFTVEFFKKSWIDIKPSVMEVFYDFYQKDIINCNVNETYITLIPKKNTALQIQDFRSISLTTVLYRL
IAKTLAERLKSTLQGTVSENQLAFVKGRQITDAILVANEVVDFWKCSHTRGFVIKLDIEKAFDKICWNFIDKILAFKGYPITWHDILLFVQDNDAMLGNLFNIIKVFELS
SGLNINFSKSSITGINVEDVRVAQIAANWGCPMAQFPIPYLGSPLGIRLSDLFFGMGKIKVVASLWLIGAWWLRLLRLGVWAYSRLELQIAHFKSNGFGDSFMRRLLCGS
ESSQQNIQPIDKELFQLRRDSLPPRAPWTSILKQASLFLANTVWNLKDGSKISFWHDSWTDHGPLYQAIPRLFALSSRKNMTVSDAWDHTTSSWKLYPRRPLFEREALVQ
AECASSFPTPALTGGADHMLWNPDSKGRFSVASARHFYCNLSSPDRPQPHSIIFSNLWKALIPKKDQILYLDSYLQKIKYNRQASTYFQIHCSKPELLPPLPCRLR