; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0000342 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0000342
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionLINE-1 retrotransposable element ORF2 protein
Genome locationchr4:4675662..4677640
RNA-Seq ExpressionLag0000342
SyntenyLag0000342
Gene Ontology termsNA
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0037445.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]2.0e-13246.37Show/hide
Query:  MQEIRMWKQRCKKTWLKEGDENTTFFHK------------------------------------------GTTSEGWMVSNLNWCPISGPEATSLIQPFS
        ++E + W QR KK WL+EGDEN++FFH+                                           T S+   + NL+W PI   E   L  PF 
Subjt:  MQEIRMWKQRCKKTWLKEGDENTTFFHK------------------------------------------GTTSEGWMVSNLNWCPISGPEATSLIQPFS

Query:  ELEVFQNLKAMGHNKSPGPDGFTVEFFKKSWIAIRPSVMEVFRDFFQKGIINCNVNETYITLIPKKNKAMQIQDFRPISLTTVLYRLIAKTLAERLKSTL
        E E+   + ++   K+PGPDGF + FFK  W  ++  +M++F+DF+ KG+IN N+N TYI LIPKK      +DFRPISLTT +Y++IAKTL+ RLK+ L
Subjt:  ELEVFQNLKAMGHNKSPGPDGFTVEFFKKSWIAIRPSVMEVFRDFFQKGIINCNVNETYITLIPKKNKAMQIQDFRPISLTTVLYRLIAKTLAERLKSTL

Query:  QGTISENQLAFVKGRQITDAILVANEAIDFWKCSRTRGFVIKLDIEKAFDKICWNFVDKILAFKGYPITW--------------------PKRKIKAERG
          TIS NQLAFVK RQITDAIL+ANEA+DFWK  + +GF++KLDIEKAFD + W+F+D +L  K +P  W                    P+ +IKA RG
Subjt:  QGTISENQLAFVKGRQITDAILVANEAIDFWKCSRTRGFVIKLDIEKAFDKICWNFVDKILAFKGYPITW--------------------PKRKIKAERG

Query:  IRQGDPISPFIFVLAMDYLSRILQSAEQKGLVKGCSLNS-VSVSHLLFADDILLFVQDNDAMLGNLFNIIKVFELSSGLNINFSKSSITGINVEDSRVAQ
        +RQGDP+SPF+FV+AMDYLSR+L   E  G +KG SLNS  ++SH+LFADDILLF++DND  L NL   + +FE +SGL IN  KS++  +NV  +R  +
Subjt:  IRQGDPISPFIFVLAMDYLSRILQSAEQKGLVKGCSLNS-VSVSHLLFADDILLFVQDNDAMLGNLFNIIKVFELSSGLNINFSKSSITGINVEDSRVAQ

Query:  IAANWGCPTTQFPIPYLGSPLGGNPSSSSFWANTVDKIHRKLDSWRYSYISKGGRLTLIRATLSGIPNYLLSIFKAPQSVCFSIDKIIRSFLWHGHDQSS
         A+ WG      P+ YLG PLGGNP S+ FW N  DKI +KL++W+Y+ ISKGGRLTLI++TLS +P Y LS+F+ P   C +I+K  R FLW G++ S 
Subjt:  IAANWGCPTTQFPIPYLGSPLGGNPSSSSFWANTVDKIHRKLDSWRYSYISKGGRLTLIRATLSGIPNYLLSIFKAPQSVCFSIDKIIRSFLWHGHDQSS

Query:  SIPLVSWDKVAAPIEAGVWAYSRL
           L++W KV+   E G    SRL
Subjt:  SIPLVSWDKVAAPIEAGVWAYSRL

KAA0039770.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]2.6e-13548.09Show/hide
Query:  QEIRMWKQRCKKTWLKEGDENTTFFHK------------------------------------------GTTSEGWMVSNLNWCPISGPEATSLIQPFSE
        +E ++W Q+ K+ W+ EGDENT+FFHK                                          G     W++ NLNW PIS  +A +L   F+E
Subjt:  QEIRMWKQRCKKTWLKEGDENTTFFHK------------------------------------------GTTSEGWMVSNLNWCPISGPEATSLIQPFSE

Query:  LEVFQNLKAMGHNKSPGPDGFTVEFFKKSWIAIRPSVMEVFRDFFQKGIINCNVNETYITLIPKKNKAMQIQDFRPISLTTVLYRLIAKTLAERLKSTLQ
         E+ + L A  +NKSPGPDGFT+EF+K +W  ++  ++ +FRDF    IIN  VN T I LI KK K  +  D+RPISLTT +Y+LIAK +AERLK TL 
Subjt:  LEVFQNLKAMGHNKSPGPDGFTVEFFKKSWIAIRPSVMEVFRDFFQKGIINCNVNETYITLIPKKNKAMQIQDFRPISLTTVLYRLIAKTLAERLKSTLQ

Query:  GTISENQLAFVKGRQITDAILVANEAIDFWKCSRTRGFVIKLDIEKAFDKICWNFVDKILAFKGYPITW--------------------PKRKIKAERGI
         T++ENQ+AFVKGRQI DAILVANEAID+W+  + +GFVIKLDIEKAFDK+ W F+D +L  KGYP  W                    P+ KI+  RGI
Subjt:  GTISENQLAFVKGRQITDAILVANEAIDFWKCSRTRGFVIKLDIEKAFDKICWNFVDKILAFKGYPITW--------------------PKRKIKAERGI

Query:  RQGDPISPFIFVLAMDYLSRILQSAEQKGLVKGCSL-NSVSVSHLLFADDILLFVQDNDAMLGNLFNIIKVFELSSGLNINFSKSSITGINVEDSRVAQI
        RQGDPISPFIFVLAMDY+SR+L S  +K  +KG  L  +++++HLLFADDILLFV+D++  + NL NII +F+L+SGL+IN +KS+I+ INV+ SR  QI
Subjt:  RQGDPISPFIFVLAMDYLSRILQSAEQKGLVKGCSL-NSVSVSHLLFADDILLFVQDNDAMLGNLFNIIKVFELSSGLNINFSKSSITGINVEDSRVAQI

Query:  AANWGCPTTQFPIPYLGSPLGGNPSSSSFWANTVDKIHRKLDSWRYSYISKGGRLTLIRATLSGIPNYLLSIFKAPQSVCFSIDKIIRSFLWHGHDQSSS
        A+ WG  T   PI YLG PLGG   + +FW N  +KI++KL SW+YS +SKGG++TLI+++L+ +P Y LSIFKAP S C +I+K  R+FLW    ++  
Subjt:  AANWGCPTTQFPIPYLGSPLGGNPSSSSFWANTVDKIHRKLDSWRYSYISKGGRLTLIRATLSGIPNYLLSIFKAPQSVCFSIDKIIRSFLWHGHDQSSS

Query:  IPLVSWDKVAAPIEAGVWAYSRLE
        + LV+W K+ +  E G    SRL+
Subjt:  IPLVSWDKVAAPIEAGVWAYSRLE

KAA0041367.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]3.3e-13547.71Show/hide
Query:  QEIRMWKQRCKKTWLKEGDENTTFFHK------------------------------------------GTTSEGWMVSNLNWCPISGPEATSLIQPFSE
        +E ++W Q+ K+ W+ EGDENT+FFHK                                          G     W++ NL+W PIS  +A +L   F+E
Subjt:  QEIRMWKQRCKKTWLKEGDENTTFFHK------------------------------------------GTTSEGWMVSNLNWCPISGPEATSLIQPFSE

Query:  LEVFQNLKAMGHNKSPGPDGFTVEFFKKSWIAIRPSVMEVFRDFFQKGIINCNVNETYITLIPKKNKAMQIQDFRPISLTTVLYRLIAKTLAERLKSTLQ
         E+   L A  +NKSPGPDGFT+EF+K +W  ++  ++ +FRDF    IIN  VN T I LI KK K  +  D+RPISLTT +Y+LIAK +AERLK TL 
Subjt:  LEVFQNLKAMGHNKSPGPDGFTVEFFKKSWIAIRPSVMEVFRDFFQKGIINCNVNETYITLIPKKNKAMQIQDFRPISLTTVLYRLIAKTLAERLKSTLQ

Query:  GTISENQLAFVKGRQITDAILVANEAIDFWKCSRTRGFVIKLDIEKAFDKICWNFVDKILAFKGYPITW--------------------PKRKIKAERGI
         T++ENQ+AFVKGRQI DAILVANEAID+W+  + +GFVIKLDIEKAFDK+ W F+D +L  KGYP  W                    P+ KI+  RGI
Subjt:  GTISENQLAFVKGRQITDAILVANEAIDFWKCSRTRGFVIKLDIEKAFDKICWNFVDKILAFKGYPITW--------------------PKRKIKAERGI

Query:  RQGDPISPFIFVLAMDYLSRILQSAEQKGLVKGCSL-NSVSVSHLLFADDILLFVQDNDAMLGNLFNIIKVFELSSGLNINFSKSSITGINVEDSRVAQI
        RQGDPISPFIFVLAMDY+SR+L S  +K  +KG  +  +++++HLLFADDILLFV+D++  + NL NII +F+L+SGL+IN +KS+I+ INV+ +R  QI
Subjt:  RQGDPISPFIFVLAMDYLSRILQSAEQKGLVKGCSL-NSVSVSHLLFADDILLFVQDNDAMLGNLFNIIKVFELSSGLNINFSKSSITGINVEDSRVAQI

Query:  AANWGCPTTQFPIPYLGSPLGGNPSSSSFWANTVDKIHRKLDSWRYSYISKGGRLTLIRATLSGIPNYLLSIFKAPQSVCFSIDKIIRSFLWHGHDQSSS
        A+ WG  T   PI YLG PLGG  ++ +FW N  +KI++KL SW+YS +SKGG++TLI+++L+ +P Y LSIFKAP S C +I+K  R+FLW    ++  
Subjt:  AANWGCPTTQFPIPYLGSPLGGNPSSSSFWANTVDKIHRKLDSWRYSYISKGGRLTLIRATLSGIPNYLLSIFKAPQSVCFSIDKIIRSFLWHGHDQSSS

Query:  IPLVSWDKVAAPIEAGVWAYSRLE
        + LV+W K+ +P E G    SRL+
Subjt:  IPLVSWDKVAAPIEAGVWAYSRLE

KAA0046762.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]3.7e-13445.99Show/hide
Query:  MQEIRMWKQRCKKTWLKEGDENTTFFHK------------------------------------------GTTSEGWMVSNLNWCPISGPEATSLIQPFS
        ++E + W QR KK WL+EGDEN++FFH+                                           T S+ + + NL W PI   E  +L  PF 
Subjt:  MQEIRMWKQRCKKTWLKEGDENTTFFHK------------------------------------------GTTSEGWMVSNLNWCPISGPEATSLIQPFS

Query:  ELEVFQNLKAMGHNKSPGPDGFTVEFFKKSWIAIRPSVMEVFRDFFQKGIINCNVNETYITLIPKKNKAMQIQDFRPISLTTVLYRLIAKTLAERLKSTL
        E E+   + ++   K+PGPDGF + FFK  W  ++  +M++F+DF+ KG+IN N+N TYI LIPKK      +DFRPISLTT +Y++IAKTL+ RLK++L
Subjt:  ELEVFQNLKAMGHNKSPGPDGFTVEFFKKSWIAIRPSVMEVFRDFFQKGIINCNVNETYITLIPKKNKAMQIQDFRPISLTTVLYRLIAKTLAERLKSTL

Query:  QGTISENQLAFVKGRQITDAILVANEAIDFWKCSRTRGFVIKLDIEKAFDKICWNFVDKILAFKGYPITW--------------------PKRKIKAERG
          TISENQLAFVK RQITDAIL+ANEA+DFWK  + +GF++KLDIEKAFD + W+F+D +L  K +PI W                    P+ +IKA RG
Subjt:  QGTISENQLAFVKGRQITDAILVANEAIDFWKCSRTRGFVIKLDIEKAFDKICWNFVDKILAFKGYPITW--------------------PKRKIKAERG

Query:  IRQGDPISPFIFVLAMDYLSRILQSAEQKGLVKGCSLNS-VSVSHLLFADDILLFVQDNDAMLGNLFNIIKVFELSSGLNINFSKSSITGINVEDSRVAQ
        +RQGDP+SPF+FV+AMDYLSR+L   E  G +KG S +S  ++SH+LFADDILLF++DND  L NL   + +FE +SGL IN  KS++  +NV + R  +
Subjt:  IRQGDPISPFIFVLAMDYLSRILQSAEQKGLVKGCSLNS-VSVSHLLFADDILLFVQDNDAMLGNLFNIIKVFELSSGLNINFSKSSITGINVEDSRVAQ

Query:  IAANWGCPTTQFPIPYLGSPLGGNPSSSSFWANTVDKIHRKLDSWRYSYISKGGRLTLIRATLSGIPNYLLSIFKAPQSVCFSIDKIIRSFLWHGHDQSS
         A+ WG  +   P+ YLG PLGGNP S  FW+N  +KI +KL++W+Y+ ISKGGRLTLI++TLS +P Y LS+F+AP   C +I+K  R+FLW G++ S 
Subjt:  IAANWGCPTTQFPIPYLGSPLGGNPSSSSFWANTVDKIHRKLDSWRYSYISKGGRLTLIRATLSGIPNYLLSIFKAPQSVCFSIDKIIRSFLWHGHDQSS

Query:  SIPLVSWDKVAAPIEAGVWAYSRL
           L++W KV      G    SR+
Subjt:  SIPLVSWDKVAAPIEAGVWAYSRL

XP_016902461.1 PREDICTED: LINE-1 retrotransposable element ORF2 protein [Cucumis melo]7.4e-13547.9Show/hide
Query:  QEIRMWKQRCKKTWLKEGDENTTFFHK------------------------------------------GTTSEGWMVSNLNWCPISGPEATSLIQPFSE
        +E ++W Q+ K+ W+ EGDENT+FFHK                                          G     W++ NLNW PIS  +A +L   F+E
Subjt:  QEIRMWKQRCKKTWLKEGDENTTFFHK------------------------------------------GTTSEGWMVSNLNWCPISGPEATSLIQPFSE

Query:  LEVFQNLKAMGHNKSPGPDGFTVEFFKKSWIAIRPSVMEVFRDFFQKGIINCNVNETYITLIPKKNKAMQIQDFRPISLTTVLYRLIAKTLAERLKSTLQ
         E+ + L A  +NKSPGPDGFT+EF+K +W  ++  ++ +FRDF    IIN  VN T I LI KK K  +  D+RPISLTT +Y+LIAK +AERLK TL 
Subjt:  LEVFQNLKAMGHNKSPGPDGFTVEFFKKSWIAIRPSVMEVFRDFFQKGIINCNVNETYITLIPKKNKAMQIQDFRPISLTTVLYRLIAKTLAERLKSTLQ

Query:  GTISENQLAFVKGRQITDAILVANEAIDFWKCSRTRGFVIKLDIEKAFDKICWNFVDKILAFKGYPITW--------------------PKRKIKAERGI
         T++ENQ+AFVKGRQI DAILVANEAID+W+  + +GFVIKLDIEKAFDK+ W F+D +L  KGYP  W                    P+ KI+  RGI
Subjt:  GTISENQLAFVKGRQITDAILVANEAIDFWKCSRTRGFVIKLDIEKAFDKICWNFVDKILAFKGYPITW--------------------PKRKIKAERGI

Query:  RQGDPISPFIFVLAMDYLSRILQSAEQKGLVKGCSL-NSVSVSHLLFADDILLFVQDNDAMLGNLFNIIKVFELSSGLNINFSKSSITGINVEDSRVAQI
        RQGDPISPFIFVLAMDY+SR+L S  +K  +KG  L  +++++HLLFADDILLFV+D++  + NL NII +F+L+SGL+IN +KS+I+ INV+ SR  QI
Subjt:  RQGDPISPFIFVLAMDYLSRILQSAEQKGLVKGCSL-NSVSVSHLLFADDILLFVQDNDAMLGNLFNIIKVFELSSGLNINFSKSSITGINVEDSRVAQI

Query:  AANWGCPTTQFPIPYLGSPLGGNPSSSSFWANTVDKIHRKLDSWRYSYISKGGRLTLIRATLSGIPNYLLSIFKAPQSVCFSIDKIIRSFLWHGHDQSSS
        A+ WG  T   PI YLG PLGG   + +FW N  +KI++KL SW+YS +SKGG++TLI+++L+ +P Y LSIFK P S C +I+K  R+FLW    ++  
Subjt:  AANWGCPTTQFPIPYLGSPLGGNPSSSSFWANTVDKIHRKLDSWRYSYISKGGRLTLIRATLSGIPNYLLSIFKAPQSVCFSIDKIIRSFLWHGHDQSSS

Query:  IPLVSWDKVAAPIEAGVWAYSRLE
        + LV+W K+ +  E G    SRL+
Subjt:  IPLVSWDKVAAPIEAGVWAYSRLE

TrEMBL top hitse value%identityAlignment
A0A1S4E2K5 LINE-1 retrotransposable element ORF2 protein3.6e-13547.9Show/hide
Query:  QEIRMWKQRCKKTWLKEGDENTTFFHK------------------------------------------GTTSEGWMVSNLNWCPISGPEATSLIQPFSE
        +E ++W Q+ K+ W+ EGDENT+FFHK                                          G     W++ NLNW PIS  +A +L   F+E
Subjt:  QEIRMWKQRCKKTWLKEGDENTTFFHK------------------------------------------GTTSEGWMVSNLNWCPISGPEATSLIQPFSE

Query:  LEVFQNLKAMGHNKSPGPDGFTVEFFKKSWIAIRPSVMEVFRDFFQKGIINCNVNETYITLIPKKNKAMQIQDFRPISLTTVLYRLIAKTLAERLKSTLQ
         E+ + L A  +NKSPGPDGFT+EF+K +W  ++  ++ +FRDF    IIN  VN T I LI KK K  +  D+RPISLTT +Y+LIAK +AERLK TL 
Subjt:  LEVFQNLKAMGHNKSPGPDGFTVEFFKKSWIAIRPSVMEVFRDFFQKGIINCNVNETYITLIPKKNKAMQIQDFRPISLTTVLYRLIAKTLAERLKSTLQ

Query:  GTISENQLAFVKGRQITDAILVANEAIDFWKCSRTRGFVIKLDIEKAFDKICWNFVDKILAFKGYPITW--------------------PKRKIKAERGI
         T++ENQ+AFVKGRQI DAILVANEAID+W+  + +GFVIKLDIEKAFDK+ W F+D +L  KGYP  W                    P+ KI+  RGI
Subjt:  GTISENQLAFVKGRQITDAILVANEAIDFWKCSRTRGFVIKLDIEKAFDKICWNFVDKILAFKGYPITW--------------------PKRKIKAERGI

Query:  RQGDPISPFIFVLAMDYLSRILQSAEQKGLVKGCSL-NSVSVSHLLFADDILLFVQDNDAMLGNLFNIIKVFELSSGLNINFSKSSITGINVEDSRVAQI
        RQGDPISPFIFVLAMDY+SR+L S  +K  +KG  L  +++++HLLFADDILLFV+D++  + NL NII +F+L+SGL+IN +KS+I+ INV+ SR  QI
Subjt:  RQGDPISPFIFVLAMDYLSRILQSAEQKGLVKGCSL-NSVSVSHLLFADDILLFVQDNDAMLGNLFNIIKVFELSSGLNINFSKSSITGINVEDSRVAQI

Query:  AANWGCPTTQFPIPYLGSPLGGNPSSSSFWANTVDKIHRKLDSWRYSYISKGGRLTLIRATLSGIPNYLLSIFKAPQSVCFSIDKIIRSFLWHGHDQSSS
        A+ WG  T   PI YLG PLGG   + +FW N  +KI++KL SW+YS +SKGG++TLI+++L+ +P Y LSIFK P S C +I+K  R+FLW    ++  
Subjt:  AANWGCPTTQFPIPYLGSPLGGNPSSSSFWANTVDKIHRKLDSWRYSYISKGGRLTLIRATLSGIPNYLLSIFKAPQSVCFSIDKIIRSFLWHGHDQSSS

Query:  IPLVSWDKVAAPIEAGVWAYSRLE
        + LV+W K+ +  E G    SRL+
Subjt:  IPLVSWDKVAAPIEAGVWAYSRLE

A0A5A7TI93 LINE-1 retrotransposable element ORF2 protein1.6e-13547.71Show/hide
Query:  QEIRMWKQRCKKTWLKEGDENTTFFHK------------------------------------------GTTSEGWMVSNLNWCPISGPEATSLIQPFSE
        +E ++W Q+ K+ W+ EGDENT+FFHK                                          G     W++ NL+W PIS  +A +L   F+E
Subjt:  QEIRMWKQRCKKTWLKEGDENTTFFHK------------------------------------------GTTSEGWMVSNLNWCPISGPEATSLIQPFSE

Query:  LEVFQNLKAMGHNKSPGPDGFTVEFFKKSWIAIRPSVMEVFRDFFQKGIINCNVNETYITLIPKKNKAMQIQDFRPISLTTVLYRLIAKTLAERLKSTLQ
         E+   L A  +NKSPGPDGFT+EF+K +W  ++  ++ +FRDF    IIN  VN T I LI KK K  +  D+RPISLTT +Y+LIAK +AERLK TL 
Subjt:  LEVFQNLKAMGHNKSPGPDGFTVEFFKKSWIAIRPSVMEVFRDFFQKGIINCNVNETYITLIPKKNKAMQIQDFRPISLTTVLYRLIAKTLAERLKSTLQ

Query:  GTISENQLAFVKGRQITDAILVANEAIDFWKCSRTRGFVIKLDIEKAFDKICWNFVDKILAFKGYPITW--------------------PKRKIKAERGI
         T++ENQ+AFVKGRQI DAILVANEAID+W+  + +GFVIKLDIEKAFDK+ W F+D +L  KGYP  W                    P+ KI+  RGI
Subjt:  GTISENQLAFVKGRQITDAILVANEAIDFWKCSRTRGFVIKLDIEKAFDKICWNFVDKILAFKGYPITW--------------------PKRKIKAERGI

Query:  RQGDPISPFIFVLAMDYLSRILQSAEQKGLVKGCSL-NSVSVSHLLFADDILLFVQDNDAMLGNLFNIIKVFELSSGLNINFSKSSITGINVEDSRVAQI
        RQGDPISPFIFVLAMDY+SR+L S  +K  +KG  +  +++++HLLFADDILLFV+D++  + NL NII +F+L+SGL+IN +KS+I+ INV+ +R  QI
Subjt:  RQGDPISPFIFVLAMDYLSRILQSAEQKGLVKGCSL-NSVSVSHLLFADDILLFVQDNDAMLGNLFNIIKVFELSSGLNINFSKSSITGINVEDSRVAQI

Query:  AANWGCPTTQFPIPYLGSPLGGNPSSSSFWANTVDKIHRKLDSWRYSYISKGGRLTLIRATLSGIPNYLLSIFKAPQSVCFSIDKIIRSFLWHGHDQSSS
        A+ WG  T   PI YLG PLGG  ++ +FW N  +KI++KL SW+YS +SKGG++TLI+++L+ +P Y LSIFKAP S C +I+K  R+FLW    ++  
Subjt:  AANWGCPTTQFPIPYLGSPLGGNPSSSSFWANTVDKIHRKLDSWRYSYISKGGRLTLIRATLSGIPNYLLSIFKAPQSVCFSIDKIIRSFLWHGHDQSSS

Query:  IPLVSWDKVAAPIEAGVWAYSRLE
        + LV+W K+ +P E G    SRL+
Subjt:  IPLVSWDKVAAPIEAGVWAYSRLE

A0A5A7TTK1 LINE-1 retrotransposable element ORF2 protein1.8e-13445.99Show/hide
Query:  MQEIRMWKQRCKKTWLKEGDENTTFFHK------------------------------------------GTTSEGWMVSNLNWCPISGPEATSLIQPFS
        ++E + W QR KK WL+EGDEN++FFH+                                           T S+ + + NL W PI   E  +L  PF 
Subjt:  MQEIRMWKQRCKKTWLKEGDENTTFFHK------------------------------------------GTTSEGWMVSNLNWCPISGPEATSLIQPFS

Query:  ELEVFQNLKAMGHNKSPGPDGFTVEFFKKSWIAIRPSVMEVFRDFFQKGIINCNVNETYITLIPKKNKAMQIQDFRPISLTTVLYRLIAKTLAERLKSTL
        E E+   + ++   K+PGPDGF + FFK  W  ++  +M++F+DF+ KG+IN N+N TYI LIPKK      +DFRPISLTT +Y++IAKTL+ RLK++L
Subjt:  ELEVFQNLKAMGHNKSPGPDGFTVEFFKKSWIAIRPSVMEVFRDFFQKGIINCNVNETYITLIPKKNKAMQIQDFRPISLTTVLYRLIAKTLAERLKSTL

Query:  QGTISENQLAFVKGRQITDAILVANEAIDFWKCSRTRGFVIKLDIEKAFDKICWNFVDKILAFKGYPITW--------------------PKRKIKAERG
          TISENQLAFVK RQITDAIL+ANEA+DFWK  + +GF++KLDIEKAFD + W+F+D +L  K +PI W                    P+ +IKA RG
Subjt:  QGTISENQLAFVKGRQITDAILVANEAIDFWKCSRTRGFVIKLDIEKAFDKICWNFVDKILAFKGYPITW--------------------PKRKIKAERG

Query:  IRQGDPISPFIFVLAMDYLSRILQSAEQKGLVKGCSLNS-VSVSHLLFADDILLFVQDNDAMLGNLFNIIKVFELSSGLNINFSKSSITGINVEDSRVAQ
        +RQGDP+SPF+FV+AMDYLSR+L   E  G +KG S +S  ++SH+LFADDILLF++DND  L NL   + +FE +SGL IN  KS++  +NV + R  +
Subjt:  IRQGDPISPFIFVLAMDYLSRILQSAEQKGLVKGCSLNS-VSVSHLLFADDILLFVQDNDAMLGNLFNIIKVFELSSGLNINFSKSSITGINVEDSRVAQ

Query:  IAANWGCPTTQFPIPYLGSPLGGNPSSSSFWANTVDKIHRKLDSWRYSYISKGGRLTLIRATLSGIPNYLLSIFKAPQSVCFSIDKIIRSFLWHGHDQSS
         A+ WG  +   P+ YLG PLGGNP S  FW+N  +KI +KL++W+Y+ ISKGGRLTLI++TLS +P Y LS+F+AP   C +I+K  R+FLW G++ S 
Subjt:  IAANWGCPTTQFPIPYLGSPLGGNPSSSSFWANTVDKIHRKLDSWRYSYISKGGRLTLIRATLSGIPNYLLSIFKAPQSVCFSIDKIIRSFLWHGHDQSS

Query:  SIPLVSWDKVAAPIEAGVWAYSRL
           L++W KV      G    SR+
Subjt:  SIPLVSWDKVAAPIEAGVWAYSRL

A0A5D3BUZ3 LINE-1 retrotransposable element ORF2 protein9.8e-13346.37Show/hide
Query:  MQEIRMWKQRCKKTWLKEGDENTTFFHK------------------------------------------GTTSEGWMVSNLNWCPISGPEATSLIQPFS
        ++E + W QR KK WL+EGDEN++FFH+                                           T S+   + NL+W PI   E   L  PF 
Subjt:  MQEIRMWKQRCKKTWLKEGDENTTFFHK------------------------------------------GTTSEGWMVSNLNWCPISGPEATSLIQPFS

Query:  ELEVFQNLKAMGHNKSPGPDGFTVEFFKKSWIAIRPSVMEVFRDFFQKGIINCNVNETYITLIPKKNKAMQIQDFRPISLTTVLYRLIAKTLAERLKSTL
        E E+   + ++   K+PGPDGF + FFK  W  ++  +M++F+DF+ KG+IN N+N TYI LIPKK      +DFRPISLTT +Y++IAKTL+ RLK+ L
Subjt:  ELEVFQNLKAMGHNKSPGPDGFTVEFFKKSWIAIRPSVMEVFRDFFQKGIINCNVNETYITLIPKKNKAMQIQDFRPISLTTVLYRLIAKTLAERLKSTL

Query:  QGTISENQLAFVKGRQITDAILVANEAIDFWKCSRTRGFVIKLDIEKAFDKICWNFVDKILAFKGYPITW--------------------PKRKIKAERG
          TIS NQLAFVK RQITDAIL+ANEA+DFWK  + +GF++KLDIEKAFD + W+F+D +L  K +P  W                    P+ +IKA RG
Subjt:  QGTISENQLAFVKGRQITDAILVANEAIDFWKCSRTRGFVIKLDIEKAFDKICWNFVDKILAFKGYPITW--------------------PKRKIKAERG

Query:  IRQGDPISPFIFVLAMDYLSRILQSAEQKGLVKGCSLNS-VSVSHLLFADDILLFVQDNDAMLGNLFNIIKVFELSSGLNINFSKSSITGINVEDSRVAQ
        +RQGDP+SPF+FV+AMDYLSR+L   E  G +KG SLNS  ++SH+LFADDILLF++DND  L NL   + +FE +SGL IN  KS++  +NV  +R  +
Subjt:  IRQGDPISPFIFVLAMDYLSRILQSAEQKGLVKGCSLNS-VSVSHLLFADDILLFVQDNDAMLGNLFNIIKVFELSSGLNINFSKSSITGINVEDSRVAQ

Query:  IAANWGCPTTQFPIPYLGSPLGGNPSSSSFWANTVDKIHRKLDSWRYSYISKGGRLTLIRATLSGIPNYLLSIFKAPQSVCFSIDKIIRSFLWHGHDQSS
         A+ WG      P+ YLG PLGGNP S+ FW N  DKI +KL++W+Y+ ISKGGRLTLI++TLS +P Y LS+F+ P   C +I+K  R FLW G++ S 
Subjt:  IAANWGCPTTQFPIPYLGSPLGGNPSSSSFWANTVDKIHRKLDSWRYSYISKGGRLTLIRATLSGIPNYLLSIFKAPQSVCFSIDKIIRSFLWHGHDQSS

Query:  SIPLVSWDKVAAPIEAGVWAYSRL
           L++W KV+   E G    SRL
Subjt:  SIPLVSWDKVAAPIEAGVWAYSRL

A0A5D3DM72 LINE-1 retrotransposable element ORF2 protein1.2e-13548.09Show/hide
Query:  QEIRMWKQRCKKTWLKEGDENTTFFHK------------------------------------------GTTSEGWMVSNLNWCPISGPEATSLIQPFSE
        +E ++W Q+ K+ W+ EGDENT+FFHK                                          G     W++ NLNW PIS  +A +L   F+E
Subjt:  QEIRMWKQRCKKTWLKEGDENTTFFHK------------------------------------------GTTSEGWMVSNLNWCPISGPEATSLIQPFSE

Query:  LEVFQNLKAMGHNKSPGPDGFTVEFFKKSWIAIRPSVMEVFRDFFQKGIINCNVNETYITLIPKKNKAMQIQDFRPISLTTVLYRLIAKTLAERLKSTLQ
         E+ + L A  +NKSPGPDGFT+EF+K +W  ++  ++ +FRDF    IIN  VN T I LI KK K  +  D+RPISLTT +Y+LIAK +AERLK TL 
Subjt:  LEVFQNLKAMGHNKSPGPDGFTVEFFKKSWIAIRPSVMEVFRDFFQKGIINCNVNETYITLIPKKNKAMQIQDFRPISLTTVLYRLIAKTLAERLKSTLQ

Query:  GTISENQLAFVKGRQITDAILVANEAIDFWKCSRTRGFVIKLDIEKAFDKICWNFVDKILAFKGYPITW--------------------PKRKIKAERGI
         T++ENQ+AFVKGRQI DAILVANEAID+W+  + +GFVIKLDIEKAFDK+ W F+D +L  KGYP  W                    P+ KI+  RGI
Subjt:  GTISENQLAFVKGRQITDAILVANEAIDFWKCSRTRGFVIKLDIEKAFDKICWNFVDKILAFKGYPITW--------------------PKRKIKAERGI

Query:  RQGDPISPFIFVLAMDYLSRILQSAEQKGLVKGCSL-NSVSVSHLLFADDILLFVQDNDAMLGNLFNIIKVFELSSGLNINFSKSSITGINVEDSRVAQI
        RQGDPISPFIFVLAMDY+SR+L S  +K  +KG  L  +++++HLLFADDILLFV+D++  + NL NII +F+L+SGL+IN +KS+I+ INV+ SR  QI
Subjt:  RQGDPISPFIFVLAMDYLSRILQSAEQKGLVKGCSL-NSVSVSHLLFADDILLFVQDNDAMLGNLFNIIKVFELSSGLNINFSKSSITGINVEDSRVAQI

Query:  AANWGCPTTQFPIPYLGSPLGGNPSSSSFWANTVDKIHRKLDSWRYSYISKGGRLTLIRATLSGIPNYLLSIFKAPQSVCFSIDKIIRSFLWHGHDQSSS
        A+ WG  T   PI YLG PLGG   + +FW N  +KI++KL SW+YS +SKGG++TLI+++L+ +P Y LSIFKAP S C +I+K  R+FLW    ++  
Subjt:  AANWGCPTTQFPIPYLGSPLGGNPSSSSFWANTVDKIHRKLDSWRYSYISKGGRLTLIRATLSGIPNYLLSIFKAPQSVCFSIDKIIRSFLWHGHDQSSS

Query:  IPLVSWDKVAAPIEAGVWAYSRLE
        + LV+W K+ +  E G    SRL+
Subjt:  IPLVSWDKVAAPIEAGVWAYSRLE

SwissProt top hitse value%identityAlignment
O00370 LINE-1 retrotransposable element ORF2 protein2.5e-3226.41Show/hide
Query:  EATSLIQPFSELEVFQNLKAMGHNKSPGPDGFTVEFFKKSWIAIRPSVMEVFRDFFQKGIINCNVNETYITLIPKKNK-AMQIQDFRPISLTTVLYRLIA
        E  SL +P +  E+   + ++   KSPGPDGFT EF+++    + P ++++F+   ++GI+  +  E  I LIPK  +   + ++FRPISL  +  +++ 
Subjt:  EATSLIQPFSELEVFQNLKAMGHNKSPGPDGFTVEFFKKSWIAIRPSVMEVFRDFFQKGIINCNVNETYITLIPKKNK-AMQIQDFRPISLTTVLYRLIA

Query:  KTLAERLKSTLQGTISENQLAFVKGRQITDAILVANEAIDFWKCSRTRGFV-IKLDIEKAFDKICWNFVDKILAFKGYPITWPK----------------
        K LA R++  ++  I  +Q+ F+ G Q    I  +   I     ++ +  V I +D EKAFDKI   F+ K L   G    + K                
Subjt:  KTLAERLKSTLQGTISENQLAFVKGRQITDAILVANEAIDFWKCSRTRGFV-IKLDIEKAFDKICWNFVDKILAFKGYPITWPK----------------

Query:  -RKIKA---ERGIRQGDPISPFIFVLAMDYLSRILQSAEQKGLVKGCSLNSVSVSHLLFADDILLFVQDNDAMLGNLFNIIKVFELSSGLNINFSKSSIT
         +K++A   + G RQG P+SP +F + ++ L+R ++  ++   +KG  L    V   LFADD+++++++      NL  +I  F   SG  IN  KS   
Subjt:  -RKIKA---ERGIRQGDPISPFIFVLAMDYLSRILQSAEQKGLVKGCSLNSVSVSHLLFADDILLFVQDNDAMLGNLFNIIKVFELSSGLNINFSKSSIT

Query:  GINVEDSRVAQIAANWGCPTTQFPIPYLGSPLGGNPSS--SSFWANTVDKIHRKLDSWRYSYISKGGRLTLIRATLSGIPNYLLSI--FKAPQSVCFSID
          N      +QI            I YLG  L  +        +   + +I    + W+    S  GR+ +++  +     Y  +    K P +    ++
Subjt:  GINVEDSRVAQIAANWGCPTTQFPIPYLGSPLGGNPSS--SSFWANTVDKIHRKLDSWRYSYISKGGRLTLIRATLSGIPNYLLSI--FKAPQSVCFSID

Query:  KIIRSFLWH
        K    F+W+
Subjt:  KIIRSFLWH

P08548 LINE-1 reverse transcriptase homolog1.2e-3126.51Show/hide
Query:  ISGPEATSLIQPFSELEVFQNLKAMGHNKSPGPDGFTVEFFKKSWIAIRPSVMEVFRDFFQKGIINCNVNETYITLIPKKNK-AMQIQDFRPISLTTVLY
        +S  E   L +P S  E+   ++ +   KSPGPDGFT EF++     + P ++ +F++  ++GI+     E  ITLIPK  K   + +++RPISL  +  
Subjt:  ISGPEATSLIQPFSELEVFQNLKAMGHNKSPGPDGFTVEFFKKSWIAIRPSVMEVFRDFFQKGIINCNVNETYITLIPKKNK-AMQIQDFRPISLTTVLY

Query:  RLIAKTLAERLKSTLQGTISENQLAFVKGRQITDAILVANEAID-FWKCSRTRGFVIKLDIEKAFDKICWNFVDKILAFKGYPITWPK------------
        +++ K L  R++  ++  I  +Q+ F+ G Q    I  +   I    K       ++ +D EKAFD I   F+ + L   G   T+ K            
Subjt:  RLIAKTLAERLKSTLQGTISENQLAFVKGRQITDAILVANEAID-FWKCSRTRGFVIKLDIEKAFDKICWNFVDKILAFKGYPITWPK------------

Query:  --------RKIKAERGIRQGDPISPFIFVLAMDYLSRILQSAEQKGLVKGCSLNSVSVSHLLFADDILLFVQDNDAMLGNLFNIIKVFELSSGLNINFSK
                +      G RQG P+SP +F + M+ L+  ++  E+K  +KG  + S  +   LFADD+++++++       L  +IK +   SG  IN  K
Subjt:  --------RKIKAERGIRQGDPISPFIFVLAMDYLSRILQSAEQKGLVKGCSLNSVSVSHLLFADDILLFVQDNDAMLGNLFNIIKVFELSSGLNINFSK

Query:  SSITGINVEDSRVAQIAANWGCPTTQFP--IPYLGSPLGGNPSS--SSFWANTVDKIHRKLDSWRYSYISKGGRLTLIRATL--SGIPNYLLSIFKAPQS
        S        ++  A+       P T  P  + YLG  L  +        +     +I   ++ W+    S  GR+ +++ ++    I N+     KAP S
Subjt:  SSITGINVEDSRVAQIAANWGCPTTQFP--IPYLGSPLGGNPSS--SSFWANTVDKIHRKLDSWRYSYISKGGRLTLIRATL--SGIPNYLLSIFKAPQS

Query:  VCFSIDKIIRSFLWH
            ++KII  F+W+
Subjt:  VCFSIDKIIRSFLWH

P11369 LINE-1 retrotransposable element ORF2 protein2.6e-2925.77Show/hide
Query:  IRMWKQRCKKTWLKEGDENTTFFHKGTTSEGWMVSNLNWCPISGPEATSLIQPFSELEVFQNLKAMGHNKSPGPDGFTVEFFKKSWIAIRPSVMEVFRDF
        IR + +R   T L+  DE   F  +      + V  LN       +   L  P S  E+   + ++   KSPGPDGF+ EF++     + P + ++F   
Subjt:  IRMWKQRCKKTWLKEGDENTTFFHKGTTSEGWMVSNLNWCPISGPEATSLIQPFSELEVFQNLKAMGHNKSPGPDGFTVEFFKKSWIAIRPSVMEVFRDF

Query:  FQKGIINCNVNETYITLIPKKNK-AMQIQDFRPISLTTVLYRLIAKTLAERLKSTLQGTISENQLAFVKGRQITDAILVANEAIDFW-KCSRTRGFVIKL
          +G +  +  E  ITLIPK  K   +I++FRPISL  +  +++ K LA R++  ++  I  +Q+ F+ G Q    I  +   I +  K       +I L
Subjt:  FQKGIINCNVNETYITLIPKKNK-AMQIQDFRPISLTTVLYRLIAKTLAERLKSTLQGTISENQLAFVKGRQITDAILVANEAIDFW-KCSRTRGFVIKL

Query:  DIEKAFDKICWNFVDKILAFKG------------YPITWPKRKIKAER--------GIRQGDPISPFIFVLAMDYLSRILQSAEQKGLVKGCSLNSVSVS
        D EKAFDKI   F+ K+L   G            Y       K+  E+        G RQG P+SP++F + ++ L+R ++  ++   +KG  +    V 
Subjt:  DIEKAFDKICWNFVDKILAFKG------------YPITWPKRKIKAER--------GIRQGDPISPFIFVLAMDYLSRILQSAEQKGLVKGCSLNSVSVS

Query:  HLLFADDILLFVQDNDAMLGNLFNIIKVFELSSGLNINFSKSSITGINVEDSRVAQIAANWGCPTTQFPIPYLGSPLGGNPSS--SSFWANTVDKIHRKL
          L ADD+++++ D       L N+I  F    G  IN +KS             +I            I YLG  L           + +   +I   L
Subjt:  HLLFADDILLFVQDNDAMLGNLFNIIKVFELSSGLNINFSKSSITGINVEDSRVAQIAANWGCPTTQFPIPYLGSPLGGNPSS--SSFWANTVDKIHRKL

Query:  DSWRYSYISKGGRLTLIRATL--SGIPNYLLSIFKAPQSVCFSIDKIIRSFLWH
          W+    S  GR+ +++  +    I  +     K P      ++  I  F+W+
Subjt:  DSWRYSYISKGGRLTLIRATL--SGIPNYLLSIFKAPQSVCFSIDKIIRSFLWH

P14381 Transposon TX1 uncharacterized 149 kDa protein1.1e-2928.78Show/hide
Query:  PFSELEVFQNLKAMGHNKSPGPDGFTVEFFKKSWIAIRPSVMEVFRDFFQKGIINCNVNETYITLIPKKNKAMQIQDFRPISLTTVLYRLIAKTLAERLK
        P +  E+ Q L+ M HNKSPG DG T+EFF+  W  + P    V  + F+KG +  +     ++L+PKK     I+++RP+SL +  Y+++AK ++ RLK
Subjt:  PFSELEVFQNLKAMGHNKSPGPDGFTVEFFKKSWIAIRPSVMEVFRDFFQKGIINCNVNETYITLIPKKNKAMQIQDFRPISLTTVLYRLIAKTLAERLK

Query:  STLQGTISENQLAFVKGRQITDAILVANEAIDFWKCSRTRGFVIKLDIEKAFDKICWNFVDKIL-------AFKGY------------PITWP-KRKIKA
        S L   I  +Q   V GR I D + +  + + F + +      + LD EKAFD++   ++   L        F GY             I W     +  
Subjt:  STLQGTISENQLAFVKGRQITDAILVANEAIDFWKCSRTRGFVIKLDIEKAFDKICWNFVDKIL-------AFKGY------------PITWP-KRKIKA

Query:  ERGIRQGDPISPFIFVLAMDYLSRILQSAEQKGLVKGCSLNSVSVSHLLFADDILLFVQDNDAMLGNLFNIIKVFELSSGLNINFSKSS--ITG-INVED
         RG+RQG P+S  ++ LA++    +L+      ++K   +  V  +   +ADD++L  QD    L       +V+  +S   IN+SKSS  + G + V+ 
Subjt:  ERGIRQGDPISPFIFVLAMDYLSRILQSAEQKGLVKGCSLNSVSVSHLLFADDILLFVQDNDAMLGNLFNIIKVFELSSGLNINFSKSS--ITG-INVED

Query:  SRVAQIAANWGCPTTQFPIPYLGSPLGGN--PSSSSFWANTVDKIHRKLDSWR--YSYISKGGRLTLIRATLSGIPNYLLSIFKAPQSVCFSIDKIIRSF
           A    +W        I YLG  L     P S +F     + +  +L  W+     +S  GR  +I   ++    Y L      Q     I + +  F
Subjt:  SRVAQIAANWGCPTTQFPIPYLGSPLGGN--PSSSSFWANTVDKIHRKLDSWR--YSYISKGGRLTLIRATLSGIPNYLLSIFKAPQSVCFSIDKIIRSF

Query:  LWHG-HDQS---SSIPL
        LW G H  S   SS+PL
Subjt:  LWHG-HDQS---SSIPL

Q03274 Retrovirus-related Pol polyprotein from type-1 retrotransposable element R2 (Fragment)1.5e-1628Show/hide
Query:  SPGPDGFTVEFFKKSWIAIRPSVMEVFRDFFQKGIINCNVNETY----ITLIPKKNKAMQIQDFRPISLTTVLYRLIAKTLAERLKSTLQGTISENQLAF
        +PG DG TV+   ++ +          R+F Q  ++  +V   +     TLIPK        ++RPI++ + L RL+ + LA+RL++ ++   ++   A 
Subjt:  SPGPDGFTVEFFKKSWIAIRPSVMEVFRDFFQKGIINCNVNETY----ITLIPKKNKAMQIQDFRPISLTTVLYRLIAKTLAERLKSTLQGTISENQLAF

Query:  VKGRQITDAILVANEAIDFWKCSRTRGFVIKLDIEKAFDKICWNFVDKILAFKGYP------ITW---------------PKRKIKAERGIRQGDPISPF
        + G  +   +L  +  I   +  R    V+ LD+ KAFD +  + + + L   G        IT                  RKI   RG++QGDP+SPF
Subjt:  VKGRQITDAILVANEAIDFWKCSRTRGFVIKLDIEKAFDKICWNFVDKILAFKGYP------ITW---------------PKRKIKAERGIRQGDPISPF

Query:  IFVLAMDYLSRILQSAEQKGLVKGCSLNSVSVSHLLFADDILLFVQDNDAMLGNLFNIIKVFELSSGLNINFSKS
        +F   +D L   LQS    G+  G ++    +  L FADD+LL ++DND +L      +  F    G+++N  KS
Subjt:  IFVLAMDYLSRILQSAEQKGLVKGCSLNSVSVSHLLFADDILLFVQDNDAMLGNLFNIIKVFELSSGLNINFSKS

Arabidopsis top hitse value%identityAlignment
AT1G43760.1 DNAse I-like superfamily protein3.3e-1630.11Show/hide
Query:  WKQRCKKTWLKEGDENTTFFHKGTTS-------------EGWMVSNLN---------WCPISGPEATSL----------IQPF--------------SEL
        ++Q+ +  WL++GD NT FFHK   +             +   V N+          +  + G ++  L          I PF              S+ 
Subjt:  WKQRCKKTWLKEGDENTTFFHKGTTS-------------EGWMVSNLN---------WCPISGPEATSL----------IQPF--------------SEL

Query:  EVFQNLKAMGHNKSPGPDGFTVEFFKKSWIAIRPSVMEVFRDFFQKGIINCNVNETYITLIPKKNKAMQIQDFRPISLTTVLYRLI
        E+   + AM  NK+PGPD FT EFF +SW  ++ S +   ++FF+ G +    N T ITLIPK     Q+  FRP+S  TV+Y++I
Subjt:  EVFQNLKAMGHNKSPGPDGFTVEFFKKSWIAIRPSVMEVFRDFFQKGIINCNVNETYITLIPKKNKAMQIQDFRPISLTTVLYRLI

AT4G20520.1 RNA binding;RNA-directed DNA polymerases6.7e-0932.32Show/hide
Query:  LAERLKSTLQGTISENQLAFVKGRQITDAILVANEAIDFWKCSRTRG----FVIKLDIEKAFDKICWNFVDKILAFKGYPITWPKRKIKAERGIRQGDP
        + ERLK  +   I   Q +F+ GR  TD I+   EA+   +  R +G     ++KLD+EKA+D+I W++++  L   G+P  W     ++  G R+  P
Subjt:  LAERLKSTLQGTISENQLAFVKGRQITDAILVANEAIDFWKCSRTRG----FVIKLDIEKAFDKICWNFVDKILAFKGYPITWPKRKIKAERGIRQGDP

ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase)3.9e-0943.55Show/hide
Query:  PKRKIKAERGIRQGDPISPFIFVLAMDYLSRILQSAEQKGLVKG--CSLNSVSVSHLLFADD
        P+  +   RG+RQGDP+SP++F+L  + LS + + A+++G + G   S NS  ++HLLFADD
Subjt:  PKRKIKAERGIRQGDPISPFIFVLAMDYLSRILQSAEQKGLVKG--CSLNSVSVSHLLFADD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAAGAGATCAGGATGTGGAAGCAAAGATGCAAGAAAACCTGGCTTAAGGAAGGAGATGAAAATACCACCTTTTTCCATAAAGGCACCACTTCTGAAGGATGGATGGT
CTCAAATTTAAATTGGTGCCCTATTTCTGGTCCTGAGGCGACATCTTTGATCCAGCCTTTTTCAGAGCTAGAAGTATTTCAGAATCTAAAAGCAATGGGTCATAACAAGT
CCCCTGGCCCGGATGGATTCACAGTTGAATTCTTTAAAAAGTCCTGGATTGCTATCAGGCCTTCAGTTATGGAAGTGTTCCGTGACTTCTTTCAGAAGGGTATCATCAAC
TGTAATGTTAATGAAACCTACATTACTTTGATACCAAAGAAGAATAAAGCTATGCAAATTCAAGACTTTAGACCCATTAGCCTCACCACAGTTCTCTATCGCCTTATCGC
TAAGACTCTTGCAGAAAGGCTTAAAAGTACTCTTCAAGGGACAATATCTGAGAATCAACTGGCATTTGTAAAAGGTCGTCAAATTACTGATGCTATTTTGGTGGCTAATG
AAGCTATTGATTTTTGGAAATGCTCTCGCACGAGAGGATTTGTTATAAAGCTAGATATTGAAAAGGCTTTTGACAAGATCTGTTGGAACTTCGTTGATAAGATTCTTGCT
TTCAAGGGATACCCTATCACCTGGCCCAAGAGGAAAATCAAGGCCGAAAGAGGCATTCGTCAAGGAGATCCGATCTCCCCTTTTATCTTTGTCCTAGCTATGGACTATCT
TAGTCGAATTCTTCAGTCGGCTGAGCAAAAGGGGCTTGTTAAGGGTTGTTCTCTCAACTCCGTCTCTGTCTCTCATCTTCTATTCGCAGATGACATTCTCCTTTTTGTTC
AAGATAACGATGCTATGTTAGGCAACCTGTTCAACATCATCAAAGTATTTGAGCTCTCTTCGGGTCTCAATATAAACTTCAGCAAATCCTCTATAACGGGTATCAACGTG
GAGGATTCCAGAGTTGCTCAAATTGCCGCCAATTGGGGATGCCCAACGACCCAATTTCCCATTCCTTATTTAGGCTCTCCCTTGGGGGGTAATCCATCATCGTCTTCGTT
CTGGGCTAATACGGTTGATAAGATTCATCGTAAATTGGATAGTTGGCGCTATTCCTATATTTCTAAAGGAGGAAGACTAACCTTGATAAGAGCAACTTTAAGTGGCATTC
CCAACTACTTGTTATCTATCTTTAAAGCTCCGCAATCGGTTTGCTTTAGTATAGATAAGATTATCAGATCTTTCCTTTGGCATGGGCATGACCAAAGTAGTAGCATCCCT
TTGGTTAGCTGGGATAAGGTGGCTGCGCCTATTGAGGCGGGGGTTTGGGCTTATTCAAGACTAGAATCACAAATAGCGCATTTCAAGTCAAATGGCTTTGGAGATTCTTT
CATGAGGAGACTTCTCTGTGGAAGCGAGTCATTTCAGCAAAATATACAACCCAAAGACAGGGAGCTCTCCCAACTCAGACGCGATACACTTCCTCCCGAGCGCCATGGAC
ATCAATTCTCAAGCAAGCATCATCTTTTCTGGCAAATACAGCTTGGAATCTTAAGGATGGCAGTAAAATCTCTTTTTGGCATGACTCTTGGACCGATCATGGGCCATTAC
ATCAAGCCCTCCCTCGGCTCTTTGCGCTGTCCAGTAGGAAAGATATGA
mRNA sequenceShow/hide mRNA sequence
ATGCAAGAGATCAGGATGTGGAAGCAAAGATGCAAGAAAACCTGGCTTAAGGAAGGAGATGAAAATACCACCTTTTTCCATAAAGGCACCACTTCTGAAGGATGGATGGT
CTCAAATTTAAATTGGTGCCCTATTTCTGGTCCTGAGGCGACATCTTTGATCCAGCCTTTTTCAGAGCTAGAAGTATTTCAGAATCTAAAAGCAATGGGTCATAACAAGT
CCCCTGGCCCGGATGGATTCACAGTTGAATTCTTTAAAAAGTCCTGGATTGCTATCAGGCCTTCAGTTATGGAAGTGTTCCGTGACTTCTTTCAGAAGGGTATCATCAAC
TGTAATGTTAATGAAACCTACATTACTTTGATACCAAAGAAGAATAAAGCTATGCAAATTCAAGACTTTAGACCCATTAGCCTCACCACAGTTCTCTATCGCCTTATCGC
TAAGACTCTTGCAGAAAGGCTTAAAAGTACTCTTCAAGGGACAATATCTGAGAATCAACTGGCATTTGTAAAAGGTCGTCAAATTACTGATGCTATTTTGGTGGCTAATG
AAGCTATTGATTTTTGGAAATGCTCTCGCACGAGAGGATTTGTTATAAAGCTAGATATTGAAAAGGCTTTTGACAAGATCTGTTGGAACTTCGTTGATAAGATTCTTGCT
TTCAAGGGATACCCTATCACCTGGCCCAAGAGGAAAATCAAGGCCGAAAGAGGCATTCGTCAAGGAGATCCGATCTCCCCTTTTATCTTTGTCCTAGCTATGGACTATCT
TAGTCGAATTCTTCAGTCGGCTGAGCAAAAGGGGCTTGTTAAGGGTTGTTCTCTCAACTCCGTCTCTGTCTCTCATCTTCTATTCGCAGATGACATTCTCCTTTTTGTTC
AAGATAACGATGCTATGTTAGGCAACCTGTTCAACATCATCAAAGTATTTGAGCTCTCTTCGGGTCTCAATATAAACTTCAGCAAATCCTCTATAACGGGTATCAACGTG
GAGGATTCCAGAGTTGCTCAAATTGCCGCCAATTGGGGATGCCCAACGACCCAATTTCCCATTCCTTATTTAGGCTCTCCCTTGGGGGGTAATCCATCATCGTCTTCGTT
CTGGGCTAATACGGTTGATAAGATTCATCGTAAATTGGATAGTTGGCGCTATTCCTATATTTCTAAAGGAGGAAGACTAACCTTGATAAGAGCAACTTTAAGTGGCATTC
CCAACTACTTGTTATCTATCTTTAAAGCTCCGCAATCGGTTTGCTTTAGTATAGATAAGATTATCAGATCTTTCCTTTGGCATGGGCATGACCAAAGTAGTAGCATCCCT
TTGGTTAGCTGGGATAAGGTGGCTGCGCCTATTGAGGCGGGGGTTTGGGCTTATTCAAGACTAGAATCACAAATAGCGCATTTCAAGTCAAATGGCTTTGGAGATTCTTT
CATGAGGAGACTTCTCTGTGGAAGCGAGTCATTTCAGCAAAATATACAACCCAAAGACAGGGAGCTCTCCCAACTCAGACGCGATACACTTCCTCCCGAGCGCCATGGAC
ATCAATTCTCAAGCAAGCATCATCTTTTCTGGCAAATACAGCTTGGAATCTTAAGGATGGCAGTAAAATCTCTTTTTGGCATGACTCTTGGACCGATCATGGGCCATTAC
ATCAAGCCCTCCCTCGGCTCTTTGCGCTGTCCAGTAGGAAAGATATGA
Protein sequenceShow/hide protein sequence
MQEIRMWKQRCKKTWLKEGDENTTFFHKGTTSEGWMVSNLNWCPISGPEATSLIQPFSELEVFQNLKAMGHNKSPGPDGFTVEFFKKSWIAIRPSVMEVFRDFFQKGIIN
CNVNETYITLIPKKNKAMQIQDFRPISLTTVLYRLIAKTLAERLKSTLQGTISENQLAFVKGRQITDAILVANEAIDFWKCSRTRGFVIKLDIEKAFDKICWNFVDKILA
FKGYPITWPKRKIKAERGIRQGDPISPFIFVLAMDYLSRILQSAEQKGLVKGCSLNSVSVSHLLFADDILLFVQDNDAMLGNLFNIIKVFELSSGLNINFSKSSITGINV
EDSRVAQIAANWGCPTTQFPIPYLGSPLGGNPSSSSFWANTVDKIHRKLDSWRYSYISKGGRLTLIRATLSGIPNYLLSIFKAPQSVCFSIDKIIRSFLWHGHDQSSSIP
LVSWDKVAAPIEAGVWAYSRLESQIAHFKSNGFGDSFMRRLLCGSESFQQNIQPKDRELSQLRRDTLPPERHGHQFSSKHHLFWQIQLGILRMAVKSLFGMTLGPIMGHY
IKPSLGSLRCPVGKI