CuGenDBv2

Lag0000156 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0000156
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Transposon TX1 uncharacterized 149 kDa protein
Genome location: chr4:635277..639672
RNA-Seq Expression: Lag0000156
Synteny: Lag0000156
Gene Ontology terms: NA
InterPro domains: IPR000477 - Reverse transcriptase domain
IPR043502 - DNA/RNA polymerase superfamily
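
The genome location above uses a `chrom:start..end` notation. As a minimal sketch, it can be split into its parts and the span length computed as below. The function names are hypothetical, and the length assumes 1-based, inclusive coordinates, which this record does not state explicitly.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into chromosome, start and end."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


def span_length(start: int, end: int) -> int:
    # Assumption: coordinates are 1-based and inclusive on both ends.
    return end - start + 1


chrom, start, end = parse_location("chr4:635277..639672")
print(chrom, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption the gene spans 4396 bp on chr4.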


Homology
GenBank top hits | e value | % identity
CAN65484.1 hypothetical protein VITISV_029474 [Vitis vinifera] | 2.4e-116 | 31.84
Query:  GKFDYQASPEKALQIIVPWLEKYQVLEIGKKGQYVDCNIIKSLWSGRNISWAFLEASGSSGGIIIMWNDPSIVTTDITKGMFSISILLSLADGYNFWITG
        G+F  +    + +  +   L+   + E   K +  D   + S+W+ RN  WA L A G+SGGI+I+W+   +   ++  G FS+SI  +L    + W++ 
Subjt:  GKFDYQASPEKALQIIVPWLEKYQVLEIGKKGQYVDCNIIKSLWSGRNISWAFLEASGSSGGIIIMWNDPSIVTTDITKGMFSISILLSLADGYNFWITG

Query:  VYGPNKYRERPSFWQEI----------YC------------------------------------------PKAFSISSYFINLEARKLD----------
        VYGPN    R  FW E+          +C                                            +F+ S+  +N   ++LD          
Subjt:  VYGPNKYRERPSFWQEI----------YC------------------------------------------PKAFSISSYFINLEARKLD----------

Query:  -----------RPTSDHYPLMLTMGSGKWGPSPFRFENMWLKHRSFLPLIDYWWKNTPLRGRPGHRFIMKLKDLKVRLRSWNKDVFGCNTTKRNQLLTEI
                   R TSDH+P++L     KWGP+PFRFENMWL+H SF      WW+     G  GH+F+ KL+ +K +L+ WNK  FG  + ++  +L+++
Subjt:  -----------RPTSDHYPLMLTMGSGKWGPSPFRFENMWLKHRSFLPLIDYWWKNTPLRGRPGHRFIMKLKDLKVRLRSWNKDVFGCNTTKRNQLLTEI

Query:  SILDSIEETGNISPIQLSQRRSLKAEILNLVAFEEQRWKQKCKNRWIEEGDNNTSYFHRILVAKKRKNTISEILSIHGQSLQCEDEIIKEFIDFYKALYK
           DS+E+ G +S   L+QR   K E+  L+  EE  W+QK + +W++EGD N+ +FH++   ++ +  I E+ + +G  +   + I +E + +++ LY 
Subjt:  SILDSIEETGNISPIQLSQRRSLKAEILNLVAFEEQRWKQKCKNRWIEEGDNNTSYFHRILVAKKRKNTISEILSIHGQSLQCEDEIIKEFIDFYKALYK

Query:  KEDVLGAFTDDIDWNPLQSHLVVDLEGSFTETKL------LGSDKSPGSDGFTSEFFKKCWNILKDDIMRVFQDFFKNGIINASLNETYICLIPKVD-AR
                 + +DW+P+       LE  FTE ++      +  DK+PG DGFT   F+ CW ++K+D+++VF +F ++GIIN S N ++I L+PK   +R
Subjt:  KEDVLGAFTDDIDWNPLQSHLVVDLEGSFTETKL------LGSDKSPGSDGFTSEFFKKCWNILKDDIMRVFQDFFKNGIINASLNETYICLIPKVD-AR

Query:  TVADFRPISLTTCLYKIIARVLYERLKKVLPFTIREHQQAFVEGSDGFTSEFFKKCWNILKDDIMRVFQDFFKNGIINASLNETYICLIPKVDARTVADF
         ++DFRPISL T LYKIIA+VL  R++ VL  TI   Q AFV+G               + D ++        N I++         ++ K+D     D 
Subjt:  TVADFRPISLTTCLYKIIARVLYERLKKVLPFTIREHQQAFVEGSDGFTSEFFKKCWNILKDDIMRVFQDFFKNGIINASLNETYICLIPKVDARTVADF

Query:  RPISLTTCLYKIIARVLYERLKKVLPFTIREHQQAFVEAKGFGNKWRSWIRGCISSANYSILINGRPRGKIHASRGLRQGDPLSPFLFIMIMDSFSRLLT
                       V ++ L  VL            E KGFG +WR W+RGC+SS ++++L+NG  +G + ASRGLRQGDPLSPFLF ++ D  SR+L 
Subjt:  RPISLTTCLYKIIARVLYERLKKVLPFTIREHQQAFVEAKGFGNKWRSWIRGCISSANYSILINGRPRGKIHASRGLRQGDPLSPFLFIMIMDSFSRLLT

Query:  KADSDGLIKGFRTDPNSPSINHIQFADDTILFTQFEDDLSAKSMFKVVEAFEQASGHNINLHKTEILGINVEHSFFWRL
        KA+   +++GF+   N   ++H+QFADDTI F+   ++    ++  V+  F   SG  +NL K+ I GIN+E +   RL
Subjt:  KADSDGLIKGFRTDPNSPSINHIQFADDTILFTQFEDDLSAKSMFKVVEAFEQASGHNINLHKTEILGINVEHSFFWRL

CAN68838.1 hypothetical protein VITISV_030956 [Vitis vinifera] | 8.1e-117 | 32.58
Query:  KGQYVDCNIIKSLWSGRNISWAFLEASGSSGGIIIMWNDPSIVTTDITKGMFSISILLSLADGYNFWITGVYGPNKYRERPSFWQEI----------YC-
        K +  D   + S+W+ RN  WA L A G+SGGI+I+W+   +   ++  G FS+SI  +L    + W++ VYGPN    R   W E+          +C 
Subjt:  KGQYVDCNIIKSLWSGRNISWAFLEASGSSGGIIIMWNDPSIVTTDITKGMFSISILLSLADGYNFWITGVYGPNKYRERPSFWQEI----------YC-

Query:  -----------------------------------------PKAFSISSYFINLEARKLD---------------------RPTSDHYPLMLTMGSGKWG
                                                   +F+ S+  +N   ++LD                     R TSDH+P++L     KWG
Subjt:  -----------------------------------------PKAFSISSYFINLEARKLD---------------------RPTSDHYPLMLTMGSGKWG

Query:  PSPFRFENMWLKHRSFLPLIDYWWKNTPLRGRPGHRFIMKLKDLKVRLRSWNKDVFGCNTTKRNQLLTEISILDSIEETGNISPIQLSQRRSLKAEILNL
        P+PFRFENMWL+H SF      WW+     G  GH+F+ KL+ +K +L+ WNK  FG  + ++  +L+ +   DS+E+ G +S   L+QR   K E+  L
Subjt:  PSPFRFENMWLKHRSFLPLIDYWWKNTPLRGRPGHRFIMKLKDLKVRLRSWNKDVFGCNTTKRNQLLTEISILDSIEETGNISPIQLSQRRSLKAEILNL

Query:  VAFEEQRWKQKCKNRWIEEGDNNTSYFHRILVAKKRKNTISEILSIHGQSLQCEDEIIKEFIDFYKALYKKEDVLGAFTDDIDWNPLQSHLVVDLEGSFT
        +  EE  W+QK + +W++EGD N+ +FH++   ++ +  I E+ + +GQ +   + I +E + +++ LY          + +DW+P+     V LE  FT
Subjt:  VAFEEQRWKQKCKNRWIEEGDNNTSYFHRILVAKKRKNTISEILSIHGQSLQCEDEIIKEFIDFYKALYKKEDVLGAFTDDIDWNPLQSHLVVDLEGSFT

Query:  ETKL------LGSDKSPGSDGFTSEFFKKCWNILKDDIMRVFQDFFKNGIINASLNETYICLIPKVD-ARTVADFRPISLTTCLYKIIARVLYERLKKVL
        E ++      +  DK+PG DGFT   F+ CW ++K+D+++VF +F ++GIIN S N ++I L+PK   +R ++DFRPISL T LYKIIA+VL  R+++VL
Subjt:  ETKL------LGSDKSPGSDGFTSEFFKKCWNILKDDIMRVFQDFFKNGIINASLNETYICLIPKVD-ARTVADFRPISLTTCLYKIIARVLYERLKKVL

Query:  PFTIREHQQAFVEGSDGFTSEFFKKCWNILKDDIMRVFQDFFKNGIINASLNETYICLIPKVDARTVADFRPISLTTCLYKIIARVLYERLKKVLPFTIR
          TI   Q AFV+G               + D ++        N I++         ++ K+D     D                V ++ L  V+     
Subjt:  PFTIREHQQAFVEGSDGFTSEFFKKCWNILKDDIMRVFQDFFKNGIINASLNETYICLIPKVDARTVADFRPISLTTCLYKIIARVLYERLKKVLPFTIR

Query:  EHQQAFVEAKGFGNKWRSWIRGCISSANYSILINGRPRGKIHASRGLRQGDPLSPFLFIMIMDSFSRLLTKADSDGLIKGFRTDPNSPSINHIQFADDTI
               E KGFG +WR W+RGC+SS ++++L+NG  +G + ASRGLRQGDPLSPFLF ++ D  SR+L KA+   +++GF+   N   ++H+QFADDTI
Subjt:  EHQQAFVEAKGFGNKWRSWIRGCISSANYSILINGRPRGKIHASRGLRQGDPLSPFLFIMIMDSFSRLLTKADSDGLIKGFRTDPNSPSINHIQFADDTI

Query:  LFTQFEDDLSAKSMFKVVEAFEQASGHNINLHKTEILGINVEHSFFWRL
         F+   ++    ++  V+  F   SG  +NL K+ I GIN+E +   RL
Subjt:  LFTQFEDDLSAKSMFKVVEAFEQASGHNINLHKTEILGINVEHSFFWRL

RVW13148.1 Transposon TX1 uncharacterized 149 kDa protein [Vitis vinifera] | 2.5e-118 | 34.91
Query:  KGQYVDCNIIKSLWSGRNISWAFLEASGSSGGIIIMWNDPSIVTTDITKGMFSISILLSLADGYNFWITGVYGPNKYRERPSFWQEIYCPKAFSISSYFI
        K +  D   + S+W+ RN  W  L ASG+SGGI+I+W+  ++   ++  G FS+S+  SL      WI+ VYGPN    R  FW E++    + +    I
Subjt:  KGQYVDCNIIKSLWSGRNISWAFLEASGSSGGIIIMWNDPSIVTTDITKGMFSISILLSLADGYNFWITGVYGPNKYRERPSFWQEIYCPKAFSISSYFI

Query:  NLEARKLDRPTSDHYPLMLTMGSGKWGPSPFRFENMWLKHRSFLPLIDYWWKNTPLRGRPGHRFIMKLKDLKVRLRSWNKDVFGCNTTKRNQLLTEISIL
                R TSDH+P+++      WGP+PFRFENMWL+H +F      WW      G  GH+F+ +L+ +K +L+ WNK  FG    K+  +L +++  
Subjt:  NLEARKLDRPTSDHYPLMLTMGSGKWGPSPFRFENMWLKHRSFLPLIDYWWKNTPLRGRPGHRFIMKLKDLKVRLRSWNKDVFGCNTTKRNQLLTEISIL

Query:  DSIEETGNISPIQLSQRRSLKAEILNLVAFEEQRWKQKCKNRWIEEGDNNTSYFHRILVAKKRKNTISEILSIHGQSLQCEDEIIKEFIDFYKALYKKED
        D+IE+ G ++P  LSQR S K E+  L+  EE  W+QK K +W++EGD N+ ++H++   ++ +  I E+ +  G  L+  + I +E + +++ LY    
Subjt:  DSIEETGNISPIQLSQRRSLKAEILNLVAFEEQRWKQKCKNRWIEEGDNNTSYFHRILVAKKRKNTISEILSIHGQSLQCEDEIIKEFIDFYKALYKKED

Query:  VLGAFTDDIDWNPLQSHLVVDLEGSFTETKL------LGSDKSPGSDGFTSEFFKKCWNILKDDIMRVFQDFFKNGIINASLNETYICLIPKVD-ARTVA
              + +DW+P+     + L+  FTE ++      L  DK+PG DGFT   F++CW+++K+D++RVF +F ++GIIN S N ++I LIPK   ++ ++
Subjt:  VLGAFTDDIDWNPLQSHLVVDLEGSFTETKL------LGSDKSPGSDGFTSEFFKKCWNILKDDIMRVFQDFFKNGIINASLNETYICLIPKVD-ARTVA

Query:  DFRPISLTTCLYKIIARVLYERLKKVLPFTIREHQQAFVEGSDGFTSEFFKKCWNILKDDIMRVFQDFFKNGIINASLNETYICLIPKVDARTVADFRPI
        DFRPISL T LYKIIA+VL  RL+ VL  TI   Q AFV+G                               I++A L                      
Subjt:  DFRPISLTTCLYKIIARVLYERLKKVLPFTIREHQQAFVEGSDGFTSEFFKKCWNILKDDIMRVFQDFFKNGIINASLNETYICLIPKVDARTVADFRPI

Query:  SLTTCLYKIIARVLYERLKKVLPFTIREHQQAFVEAKGFGNKWRSWIRGCISSANYSILINGRPRGKIHASRGLRQGDPLSPFLFIMIMDSFSRLLTKAD
                 IA  + +R +               E KGF  +WR W+ GC+SS +Y+IL+NG  +G + ASRGLRQGDPLSPFLF ++ D  SR+L +A+
Subjt:  SLTTCLYKIIARVLYERLKKVLPFTIREHQQAFVEAKGFGNKWRSWIRGCISSANYSILINGRPRGKIHASRGLRQGDPLSPFLFIMIMDSFSRLLTKAD

Query:  SDGLIKGFRTDPNSPSINHIQFADDTILFTQFEDDLSAKSMFKVVEAFEQASGHNINLHKTEILGINVEHSFFWRL
           +++GFR   N   ++H+QFADDTI F+   ++   +++  ++  F   SG  +NL+K+ I GIN++ +   RL
Subjt:  SDGLIKGFRTDPNSPSINHIQFADDTILFTQFEDDLSAKSMFKVVEAFEQASGHNINLHKTEILGINVEHSFFWRL

RVW65579.1 Transposon TX1 uncharacterized 149 kDa protein [Vitis vinifera] | 1.6e-117 | 33.99
Query:  EKYQVLEIGK-KGQYVDCNIIKSLWSGRNISWAFLEASGSSGGIIIMWNDPSIVTTDITKGMFSISILLSLADGYNFWITGVYGPNKYRERPSFWQEIY-
        EK  V+ I + K +  D  ++ S+WS RN  WA L ASG+SGGI+I+W+   +   ++  G FS+SI  ++    + W++ VYGPN    R  FW E+  
Subjt:  EKYQVLEIGK-KGQYVDCNIIKSLWSGRNISWAFLEASGSSGGIIIMWNDPSIVTTDITKGMFSISILLSLADGYNFWITGVYGPNKYRERPSFWQEIY-

Query:  --------------------------------CPKAF---------------SISSYFINLE----ARKLD---------------------RPTSDHYP
                                        C K F               S+S  + N++     ++LD                     R TSDH+P
Subjt:  --------------------------------CPKAF---------------SISSYFINLE----ARKLD---------------------RPTSDHYP

Query:  LMLTMGSGKWGPSPFRFENMWLKHRSFLPLIDYWWKNTPLRGRPGHRFIMKLKDLKVRLRSWNKDVFGCNTTKRNQLLTEISILDSIEETGNISPIQLSQ
        ++L     KWGP+PFRFENMWL+H SF      WW      G  GH+F+ KL+ +K +L+ WNK  FG  + K+  +L  ++  DS+E+ G +S   L Q
Subjt:  LMLTMGSGKWGPSPFRFENMWLKHRSFLPLIDYWWKNTPLRGRPGHRFIMKLKDLKVRLRSWNKDVFGCNTTKRNQLLTEISILDSIEETGNISPIQLSQ

Query:  RRSLKAEILNLVAFEEQRWKQKCKNRWIEEGDNNTSYFHRILVAKKRKNTISEILSIHGQSLQCEDEIIKEFIDFYKALYKKEDVLGAFTDDIDWNPLQS
        R   K E+  L+  EE  W+QK + +W+++GD N+ +FH++   ++ +  I E+ +  G  L   + I +E + +++ LY          + +DW+P+  
Subjt:  RRSLKAEILNLVAFEEQRWKQKCKNRWIEEGDNNTSYFHRILVAKKRKNTISEILSIHGQSLQCEDEIIKEFIDFYKALYKKEDVLGAFTDDIDWNPLQS

Query:  HLVVDLEGSFTETKL------LGSDKSPGSDGFTSEFFKKCWNILKDDIMRVFQDFFKNGIINASLNETYICLIPKVD-ARTVADFRPISLTTCLYKIIA
             LE  FTE ++      +  DK+PG DGFT   F+ CW+++K+D++RVF +F ++GIIN S N ++I L+PK   +R ++DFRPISL T LYKIIA
Subjt:  HLVVDLEGSFTETKL------LGSDKSPGSDGFTSEFFKKCWNILKDDIMRVFQDFFKNGIINASLNETYICLIPKVD-ARTVADFRPISLTTCLYKIIA

Query:  RVLYERLKKVLPFTIREHQQAFVEGSDGFTSEFFKKCWNILKDDIMRVFQDFFKNGIINASLNETYICLIPKVDARTVADFRPISLTTCLYKIIARVLYE
        +VL  RL+ VL  TI   Q AFV+G               + D ++        N I++         ++ K+D     D                V ++
Subjt:  RVLYERLKKVLPFTIREHQQAFVEGSDGFTSEFFKKCWNILKDDIMRVFQDFFKNGIINASLNETYICLIPKVDARTVADFRPISLTTCLYKIIARVLYE

Query:  RLKKVLPFTIREHQQAFVEAKGFGNKWRSWIRGCISSANYSILINGRPRGKIHASRGLRQGDPLSPFLFIMIMDSFSRLLTKADSDGLIKGFRTDPNSPS
         L  VL            E KGF  +WR W+RGC+SS +Y++L+NG  +G + ASRGLRQGDPLSPFLF ++ D  SR+L KA+   +++GFR   N   
Subjt:  RLKKVLPFTIREHQQAFVEAKGFGNKWRSWIRGCISSANYSILINGRPRGKIHASRGLRQGDPLSPFLFIMIMDSFSRLLTKADSDGLIKGFRTDPNSPS

Query:  INHIQFADDTILF--TQFEDDLSAKSMFKVVEAFEQASGHNINLHKTEILGINVEHSFFWRLLLI
        ++H+QFADDTI F  T+ ED ++ KS+  V   F   SG  +NL K+ I GIN+E +   RL ++
Subjt:  INHIQFADDTILF--TQFEDDLSAKSMFKVVEAFEQASGHNINLHKTEILGINVEHSFFWRLLLI

RVW90400.1 Transposon TX1 uncharacterized 149 kDa protein [Vitis vinifera] | 2.2e-122 | 36.01
Query:  EKYQVLEIGK-KGQYVDCNIIKSLWSGRNISWAFLEASGSSGGIIIMWNDPSIVTTDITKGMFSISILLSLADGYNFWITGVYGPNKYRERPSFWQE---
        EK  V+ I + K +  D  ++ S+WS RN  WA L ASG+SGGI+I+W+   +   ++  G FS+SI  ++    + W++ VYGPN    R  FW E   
Subjt:  EKYQVLEIGK-KGQYVDCNIIKSLWSGRNISWAFLEASGSSGGIIIMWNDPSIVTTDITKGMFSISILLSLADGYNFWITGVYGPNKYRERPSFWQE---

Query:  -------IYCPKAFSISSYFINLEARK---------LDRPTSDHYPLMLTMGSGKWGPSPFRFENMWLKHRSFLPLIDYWWKNTPLRGRPGHRFIMKLKD
               +  P    +  +  + E  +         L R TSDH+P++L     KWGP+PFRFENMWL+H SF      WW      G  GH+F+ KL+ 
Subjt:  -------IYCPKAFSISSYFINLEARK---------LDRPTSDHYPLMLTMGSGKWGPSPFRFENMWLKHRSFLPLIDYWWKNTPLRGRPGHRFIMKLKD

Query:  LKVRLRSWNKDVFGCNTTKRNQLLTEISILDSIEETGNISPIQLSQRRSLKAEILNLVAFEEQRWKQKCKNRWIEEGDNNTSYFHRILVAKKRKNTISEI
        +K +L+ WNK  FG  + K+  +L  ++  DS+E+ G +S  +L QR   K E+  L+  EE  W+QK + +W++EGD N+ +FH++   ++ +  I E+
Subjt:  LKVRLRSWNKDVFGCNTTKRNQLLTEISILDSIEETGNISPIQLSQRRSLKAEILNLVAFEEQRWKQKCKNRWIEEGDNNTSYFHRILVAKKRKNTISEI

Query:  LSIHGQSLQCEDEIIKEFIDFYKALYKKEDVLGAFTDDIDWNPLQSHLVVDLEGSFTETKL------LGSDKSPGSDGFTSEFFKKCWNILKDDIMRVFQ
         +  G  L   + I +E + +++ LY          + +DW+P+       LE  FTE ++      +  DK+PG DGFT   F+ CW+++K+D++RVF 
Subjt:  LSIHGQSLQCEDEIIKEFIDFYKALYKKEDVLGAFTDDIDWNPLQSHLVVDLEGSFTETKL------LGSDKSPGSDGFTSEFFKKCWNILKDDIMRVFQ

Query:  DFFKNGIINASLNETYICLIPKVD-ARTVADFRPISLTTCLYKIIARVLYERLKKVLPFTIREHQQAFVEGSDGFTSEFFKKCWNILKDDIMRVFQDFFK
        +F ++GIIN S N ++I LIPK   +R ++D+RPISL T LYKIIA+VL  RL+ VL  TI   Q AFV+G               + D ++        
Subjt:  DFFKNGIINASLNETYICLIPKVD-ARTVADFRPISLTTCLYKIIARVLYERLKKVLPFTIREHQQAFVEGSDGFTSEFFKKCWNILKDDIMRVFQDFFK

Query:  NGIINASLNETYICLIPKVDARTVADFRPISLTTCLYKIIARVLYERLKKVLPFTIREHQQAFVEAKGFGNKWRSWIRGCISSANYSILINGRPRGKIHA
        N I++         ++ K+D     D                V ++ L  VL            E KGF  +WR W+RGC+SS +Y++L+NG  +G + A
Subjt:  NGIINASLNETYICLIPKVDARTVADFRPISLTTCLYKIIARVLYERLKKVLPFTIREHQQAFVEAKGFGNKWRSWIRGCISSANYSILINGRPRGKIHA

Query:  SRGLRQGDPLSPFLFIMIMDSFSRLLTKADSDGLIKGFRTDPNSPSINHIQFADDTILF--TQFEDDLSAKSMFKVVEAFEQASGHNINLHKTEILGINV
        SRGLRQGDPLSPFLF ++ D  SR+L KA+   +++GFR   N   ++H+QFADDTI F  T+ ED ++ KS+  V   F   SG  +NL K+ I GIN+
Subjt:  SRGLRQGDPLSPFLFIMIMDSFSRLLTKADSDGLIKGFRTDPNSPSINHIQFADDTILF--TQFEDDLSAKSMFKVVEAFEQASGHNINLHKTEILGINV

Query:  EHSFFWRLLLI
        E +   RL ++
Subjt:  EHSFFWRLLLI

TrEMBL top hits | e value | % identity
A0A438BQB2 Transposon TX1 uncharacterized 149 kDa protein | 1.2e-118 | 34.91
Query:  KGQYVDCNIIKSLWSGRNISWAFLEASGSSGGIIIMWNDPSIVTTDITKGMFSISILLSLADGYNFWITGVYGPNKYRERPSFWQEIYCPKAFSISSYFI
        K +  D   + S+W+ RN  W  L ASG+SGGI+I+W+  ++   ++  G FS+S+  SL      WI+ VYGPN    R  FW E++    + +    I
Subjt:  KGQYVDCNIIKSLWSGRNISWAFLEASGSSGGIIIMWNDPSIVTTDITKGMFSISILLSLADGYNFWITGVYGPNKYRERPSFWQEIYCPKAFSISSYFI

Query:  NLEARKLDRPTSDHYPLMLTMGSGKWGPSPFRFENMWLKHRSFLPLIDYWWKNTPLRGRPGHRFIMKLKDLKVRLRSWNKDVFGCNTTKRNQLLTEISIL
                R TSDH+P+++      WGP+PFRFENMWL+H +F      WW      G  GH+F+ +L+ +K +L+ WNK  FG    K+  +L +++  
Subjt:  NLEARKLDRPTSDHYPLMLTMGSGKWGPSPFRFENMWLKHRSFLPLIDYWWKNTPLRGRPGHRFIMKLKDLKVRLRSWNKDVFGCNTTKRNQLLTEISIL

Query:  DSIEETGNISPIQLSQRRSLKAEILNLVAFEEQRWKQKCKNRWIEEGDNNTSYFHRILVAKKRKNTISEILSIHGQSLQCEDEIIKEFIDFYKALYKKED
        D+IE+ G ++P  LSQR S K E+  L+  EE  W+QK K +W++EGD N+ ++H++   ++ +  I E+ +  G  L+  + I +E + +++ LY    
Subjt:  DSIEETGNISPIQLSQRRSLKAEILNLVAFEEQRWKQKCKNRWIEEGDNNTSYFHRILVAKKRKNTISEILSIHGQSLQCEDEIIKEFIDFYKALYKKED

Query:  VLGAFTDDIDWNPLQSHLVVDLEGSFTETKL------LGSDKSPGSDGFTSEFFKKCWNILKDDIMRVFQDFFKNGIINASLNETYICLIPKVD-ARTVA
              + +DW+P+     + L+  FTE ++      L  DK+PG DGFT   F++CW+++K+D++RVF +F ++GIIN S N ++I LIPK   ++ ++
Subjt:  VLGAFTDDIDWNPLQSHLVVDLEGSFTETKL------LGSDKSPGSDGFTSEFFKKCWNILKDDIMRVFQDFFKNGIINASLNETYICLIPKVD-ARTVA

Query:  DFRPISLTTCLYKIIARVLYERLKKVLPFTIREHQQAFVEGSDGFTSEFFKKCWNILKDDIMRVFQDFFKNGIINASLNETYICLIPKVDARTVADFRPI
        DFRPISL T LYKIIA+VL  RL+ VL  TI   Q AFV+G                               I++A L                      
Subjt:  DFRPISLTTCLYKIIARVLYERLKKVLPFTIREHQQAFVEGSDGFTSEFFKKCWNILKDDIMRVFQDFFKNGIINASLNETYICLIPKVDARTVADFRPI

Query:  SLTTCLYKIIARVLYERLKKVLPFTIREHQQAFVEAKGFGNKWRSWIRGCISSANYSILINGRPRGKIHASRGLRQGDPLSPFLFIMIMDSFSRLLTKAD
                 IA  + +R +               E KGF  +WR W+ GC+SS +Y+IL+NG  +G + ASRGLRQGDPLSPFLF ++ D  SR+L +A+
Subjt:  SLTTCLYKIIARVLYERLKKVLPFTIREHQQAFVEAKGFGNKWRSWIRGCISSANYSILINGRPRGKIHASRGLRQGDPLSPFLFIMIMDSFSRLLTKAD

Query:  SDGLIKGFRTDPNSPSINHIQFADDTILFTQFEDDLSAKSMFKVVEAFEQASGHNINLHKTEILGINVEHSFFWRL
           +++GFR   N   ++H+QFADDTI F+   ++   +++  ++  F   SG  +NL+K+ I GIN++ +   RL
Subjt:  SDGLIKGFRTDPNSPSINHIQFADDTILFTQFEDDLSAKSMFKVVEAFEQASGHNINLHKTEILGINVEHSFFWRL

A0A438G038 Transposon TX1 uncharacterized 149 kDa protein | 7.9e-118 | 33.99
Query:  EKYQVLEIGK-KGQYVDCNIIKSLWSGRNISWAFLEASGSSGGIIIMWNDPSIVTTDITKGMFSISILLSLADGYNFWITGVYGPNKYRERPSFWQEIY-
        EK  V+ I + K +  D  ++ S+WS RN  WA L ASG+SGGI+I+W+   +   ++  G FS+SI  ++    + W++ VYGPN    R  FW E+  
Subjt:  EKYQVLEIGK-KGQYVDCNIIKSLWSGRNISWAFLEASGSSGGIIIMWNDPSIVTTDITKGMFSISILLSLADGYNFWITGVYGPNKYRERPSFWQEIY-

Query:  --------------------------------CPKAF---------------SISSYFINLE----ARKLD---------------------RPTSDHYP
                                        C K F               S+S  + N++     ++LD                     R TSDH+P
Subjt:  --------------------------------CPKAF---------------SISSYFINLE----ARKLD---------------------RPTSDHYP

Query:  LMLTMGSGKWGPSPFRFENMWLKHRSFLPLIDYWWKNTPLRGRPGHRFIMKLKDLKVRLRSWNKDVFGCNTTKRNQLLTEISILDSIEETGNISPIQLSQ
        ++L     KWGP+PFRFENMWL+H SF      WW      G  GH+F+ KL+ +K +L+ WNK  FG  + K+  +L  ++  DS+E+ G +S   L Q
Subjt:  LMLTMGSGKWGPSPFRFENMWLKHRSFLPLIDYWWKNTPLRGRPGHRFIMKLKDLKVRLRSWNKDVFGCNTTKRNQLLTEISILDSIEETGNISPIQLSQ

Query:  RRSLKAEILNLVAFEEQRWKQKCKNRWIEEGDNNTSYFHRILVAKKRKNTISEILSIHGQSLQCEDEIIKEFIDFYKALYKKEDVLGAFTDDIDWNPLQS
        R   K E+  L+  EE  W+QK + +W+++GD N+ +FH++   ++ +  I E+ +  G  L   + I +E + +++ LY          + +DW+P+  
Subjt:  RRSLKAEILNLVAFEEQRWKQKCKNRWIEEGDNNTSYFHRILVAKKRKNTISEILSIHGQSLQCEDEIIKEFIDFYKALYKKEDVLGAFTDDIDWNPLQS

Query:  HLVVDLEGSFTETKL------LGSDKSPGSDGFTSEFFKKCWNILKDDIMRVFQDFFKNGIINASLNETYICLIPKVD-ARTVADFRPISLTTCLYKIIA
             LE  FTE ++      +  DK+PG DGFT   F+ CW+++K+D++RVF +F ++GIIN S N ++I L+PK   +R ++DFRPISL T LYKIIA
Subjt:  HLVVDLEGSFTETKL------LGSDKSPGSDGFTSEFFKKCWNILKDDIMRVFQDFFKNGIINASLNETYICLIPKVD-ARTVADFRPISLTTCLYKIIA

Query:  RVLYERLKKVLPFTIREHQQAFVEGSDGFTSEFFKKCWNILKDDIMRVFQDFFKNGIINASLNETYICLIPKVDARTVADFRPISLTTCLYKIIARVLYE
        +VL  RL+ VL  TI   Q AFV+G               + D ++        N I++         ++ K+D     D                V ++
Subjt:  RVLYERLKKVLPFTIREHQQAFVEGSDGFTSEFFKKCWNILKDDIMRVFQDFFKNGIINASLNETYICLIPKVDARTVADFRPISLTTCLYKIIARVLYE

Query:  RLKKVLPFTIREHQQAFVEAKGFGNKWRSWIRGCISSANYSILINGRPRGKIHASRGLRQGDPLSPFLFIMIMDSFSRLLTKADSDGLIKGFRTDPNSPS
         L  VL            E KGF  +WR W+RGC+SS +Y++L+NG  +G + ASRGLRQGDPLSPFLF ++ D  SR+L KA+   +++GFR   N   
Subjt:  RLKKVLPFTIREHQQAFVEAKGFGNKWRSWIRGCISSANYSILINGRPRGKIHASRGLRQGDPLSPFLFIMIMDSFSRLLTKADSDGLIKGFRTDPNSPS

Query:  INHIQFADDTILF--TQFEDDLSAKSMFKVVEAFEQASGHNINLHKTEILGINVEHSFFWRLLLI
        ++H+QFADDTI F  T+ ED ++ KS+  V   F   SG  +NL K+ I GIN+E +   RL ++
Subjt:  INHIQFADDTILF--TQFEDDLSAKSMFKVVEAFEQASGHNINLHKTEILGINVEHSFFWRLLLI

A0A438I181 Transposon TX1 uncharacterized 149 kDa protein | 1.1e-122 | 36.01
Query:  EKYQVLEIGK-KGQYVDCNIIKSLWSGRNISWAFLEASGSSGGIIIMWNDPSIVTTDITKGMFSISILLSLADGYNFWITGVYGPNKYRERPSFWQE---
        EK  V+ I + K +  D  ++ S+WS RN  WA L ASG+SGGI+I+W+   +   ++  G FS+SI  ++    + W++ VYGPN    R  FW E   
Subjt:  EKYQVLEIGK-KGQYVDCNIIKSLWSGRNISWAFLEASGSSGGIIIMWNDPSIVTTDITKGMFSISILLSLADGYNFWITGVYGPNKYRERPSFWQE---

Query:  -------IYCPKAFSISSYFINLEARK---------LDRPTSDHYPLMLTMGSGKWGPSPFRFENMWLKHRSFLPLIDYWWKNTPLRGRPGHRFIMKLKD
               +  P    +  +  + E  +         L R TSDH+P++L     KWGP+PFRFENMWL+H SF      WW      G  GH+F+ KL+ 
Subjt:  -------IYCPKAFSISSYFINLEARK---------LDRPTSDHYPLMLTMGSGKWGPSPFRFENMWLKHRSFLPLIDYWWKNTPLRGRPGHRFIMKLKD

Query:  LKVRLRSWNKDVFGCNTTKRNQLLTEISILDSIEETGNISPIQLSQRRSLKAEILNLVAFEEQRWKQKCKNRWIEEGDNNTSYFHRILVAKKRKNTISEI
        +K +L+ WNK  FG  + K+  +L  ++  DS+E+ G +S  +L QR   K E+  L+  EE  W+QK + +W++EGD N+ +FH++   ++ +  I E+
Subjt:  LKVRLRSWNKDVFGCNTTKRNQLLTEISILDSIEETGNISPIQLSQRRSLKAEILNLVAFEEQRWKQKCKNRWIEEGDNNTSYFHRILVAKKRKNTISEI

Query:  LSIHGQSLQCEDEIIKEFIDFYKALYKKEDVLGAFTDDIDWNPLQSHLVVDLEGSFTETKL------LGSDKSPGSDGFTSEFFKKCWNILKDDIMRVFQ
         +  G  L   + I +E + +++ LY          + +DW+P+       LE  FTE ++      +  DK+PG DGFT   F+ CW+++K+D++RVF 
Subjt:  LSIHGQSLQCEDEIIKEFIDFYKALYKKEDVLGAFTDDIDWNPLQSHLVVDLEGSFTETKL------LGSDKSPGSDGFTSEFFKKCWNILKDDIMRVFQ

Query:  DFFKNGIINASLNETYICLIPKVD-ARTVADFRPISLTTCLYKIIARVLYERLKKVLPFTIREHQQAFVEGSDGFTSEFFKKCWNILKDDIMRVFQDFFK
        +F ++GIIN S N ++I LIPK   +R ++D+RPISL T LYKIIA+VL  RL+ VL  TI   Q AFV+G               + D ++        
Subjt:  DFFKNGIINASLNETYICLIPKVD-ARTVADFRPISLTTCLYKIIARVLYERLKKVLPFTIREHQQAFVEGSDGFTSEFFKKCWNILKDDIMRVFQDFFK

Query:  NGIINASLNETYICLIPKVDARTVADFRPISLTTCLYKIIARVLYERLKKVLPFTIREHQQAFVEAKGFGNKWRSWIRGCISSANYSILINGRPRGKIHA
        N I++         ++ K+D     D                V ++ L  VL            E KGF  +WR W+RGC+SS +Y++L+NG  +G + A
Subjt:  NGIINASLNETYICLIPKVDARTVADFRPISLTTCLYKIIARVLYERLKKVLPFTIREHQQAFVEAKGFGNKWRSWIRGCISSANYSILINGRPRGKIHA

Query:  SRGLRQGDPLSPFLFIMIMDSFSRLLTKADSDGLIKGFRTDPNSPSINHIQFADDTILF--TQFEDDLSAKSMFKVVEAFEQASGHNINLHKTEILGINV
        SRGLRQGDPLSPFLF ++ D  SR+L KA+   +++GFR   N   ++H+QFADDTI F  T+ ED ++ KS+  V   F   SG  +NL K+ I GIN+
Subjt:  SRGLRQGDPLSPFLFIMIMDSFSRLLTKADSDGLIKGFRTDPNSPSINHIQFADDTILF--TQFEDDLSAKSMFKVVEAFEQASGHNINLHKTEILGINV

Query:  EHSFFWRLLLI
        E +   RL ++
Subjt:  EHSFFWRLLLI

A5BCI7 Reverse transcriptase domain-containing protein | 1.1e-116 | 31.84
Query:  GKFDYQASPEKALQIIVPWLEKYQVLEIGKKGQYVDCNIIKSLWSGRNISWAFLEASGSSGGIIIMWNDPSIVTTDITKGMFSISILLSLADGYNFWITG
        G+F  +    + +  +   L+   + E   K +  D   + S+W+ RN  WA L A G+SGGI+I+W+   +   ++  G FS+SI  +L    + W++ 
Subjt:  GKFDYQASPEKALQIIVPWLEKYQVLEIGKKGQYVDCNIIKSLWSGRNISWAFLEASGSSGGIIIMWNDPSIVTTDITKGMFSISILLSLADGYNFWITG

Query:  VYGPNKYRERPSFWQEI----------YC------------------------------------------PKAFSISSYFINLEARKLD----------
        VYGPN    R  FW E+          +C                                            +F+ S+  +N   ++LD          
Subjt:  VYGPNKYRERPSFWQEI----------YC------------------------------------------PKAFSISSYFINLEARKLD----------

Query:  -----------RPTSDHYPLMLTMGSGKWGPSPFRFENMWLKHRSFLPLIDYWWKNTPLRGRPGHRFIMKLKDLKVRLRSWNKDVFGCNTTKRNQLLTEI
                   R TSDH+P++L     KWGP+PFRFENMWL+H SF      WW+     G  GH+F+ KL+ +K +L+ WNK  FG  + ++  +L+++
Subjt:  -----------RPTSDHYPLMLTMGSGKWGPSPFRFENMWLKHRSFLPLIDYWWKNTPLRGRPGHRFIMKLKDLKVRLRSWNKDVFGCNTTKRNQLLTEI

Query:  SILDSIEETGNISPIQLSQRRSLKAEILNLVAFEEQRWKQKCKNRWIEEGDNNTSYFHRILVAKKRKNTISEILSIHGQSLQCEDEIIKEFIDFYKALYK
           DS+E+ G +S   L+QR   K E+  L+  EE  W+QK + +W++EGD N+ +FH++   ++ +  I E+ + +G  +   + I +E + +++ LY 
Subjt:  SILDSIEETGNISPIQLSQRRSLKAEILNLVAFEEQRWKQKCKNRWIEEGDNNTSYFHRILVAKKRKNTISEILSIHGQSLQCEDEIIKEFIDFYKALYK

Query:  KEDVLGAFTDDIDWNPLQSHLVVDLEGSFTETKL------LGSDKSPGSDGFTSEFFKKCWNILKDDIMRVFQDFFKNGIINASLNETYICLIPKVD-AR
                 + +DW+P+       LE  FTE ++      +  DK+PG DGFT   F+ CW ++K+D+++VF +F ++GIIN S N ++I L+PK   +R
Subjt:  KEDVLGAFTDDIDWNPLQSHLVVDLEGSFTETKL------LGSDKSPGSDGFTSEFFKKCWNILKDDIMRVFQDFFKNGIINASLNETYICLIPKVD-AR

Query:  TVADFRPISLTTCLYKIIARVLYERLKKVLPFTIREHQQAFVEGSDGFTSEFFKKCWNILKDDIMRVFQDFFKNGIINASLNETYICLIPKVDARTVADF
         ++DFRPISL T LYKIIA+VL  R++ VL  TI   Q AFV+G               + D ++        N I++         ++ K+D     D 
Subjt:  TVADFRPISLTTCLYKIIARVLYERLKKVLPFTIREHQQAFVEGSDGFTSEFFKKCWNILKDDIMRVFQDFFKNGIINASLNETYICLIPKVDARTVADF

Query:  RPISLTTCLYKIIARVLYERLKKVLPFTIREHQQAFVEAKGFGNKWRSWIRGCISSANYSILINGRPRGKIHASRGLRQGDPLSPFLFIMIMDSFSRLLT
                       V ++ L  VL            E KGFG +WR W+RGC+SS ++++L+NG  +G + ASRGLRQGDPLSPFLF ++ D  SR+L 
Subjt:  RPISLTTCLYKIIARVLYERLKKVLPFTIREHQQAFVEAKGFGNKWRSWIRGCISSANYSILINGRPRGKIHASRGLRQGDPLSPFLFIMIMDSFSRLLT

Query:  KADSDGLIKGFRTDPNSPSINHIQFADDTILFTQFEDDLSAKSMFKVVEAFEQASGHNINLHKTEILGINVEHSFFWRL
        KA+   +++GF+   N   ++H+QFADDTI F+   ++    ++  V+  F   SG  +NL K+ I GIN+E +   RL
Subjt:  KADSDGLIKGFRTDPNSPSINHIQFADDTILFTQFEDDLSAKSMFKVVEAFEQASGHNINLHKTEILGINVEHSFFWRL

A5CAA2 Reverse transcriptase domain-containing protein | 3.9e-117 | 32.58
Query:  KGQYVDCNIIKSLWSGRNISWAFLEASGSSGGIIIMWNDPSIVTTDITKGMFSISILLSLADGYNFWITGVYGPNKYRERPSFWQEI----------YC-
        K +  D   + S+W+ RN  WA L A G+SGGI+I+W+   +   ++  G FS+SI  +L    + W++ VYGPN    R   W E+          +C 
Subjt:  KGQYVDCNIIKSLWSGRNISWAFLEASGSSGGIIIMWNDPSIVTTDITKGMFSISILLSLADGYNFWITGVYGPNKYRERPSFWQEI----------YC-

Query:  -----------------------------------------PKAFSISSYFINLEARKLD---------------------RPTSDHYPLMLTMGSGKWG
                                                   +F+ S+  +N   ++LD                     R TSDH+P++L     KWG
Subjt:  -----------------------------------------PKAFSISSYFINLEARKLD---------------------RPTSDHYPLMLTMGSGKWG

Query:  PSPFRFENMWLKHRSFLPLIDYWWKNTPLRGRPGHRFIMKLKDLKVRLRSWNKDVFGCNTTKRNQLLTEISILDSIEETGNISPIQLSQRRSLKAEILNL
        P+PFRFENMWL+H SF      WW+     G  GH+F+ KL+ +K +L+ WNK  FG  + ++  +L+ +   DS+E+ G +S   L+QR   K E+  L
Subjt:  PSPFRFENMWLKHRSFLPLIDYWWKNTPLRGRPGHRFIMKLKDLKVRLRSWNKDVFGCNTTKRNQLLTEISILDSIEETGNISPIQLSQRRSLKAEILNL

Query:  VAFEEQRWKQKCKNRWIEEGDNNTSYFHRILVAKKRKNTISEILSIHGQSLQCEDEIIKEFIDFYKALYKKEDVLGAFTDDIDWNPLQSHLVVDLEGSFT
        +  EE  W+QK + +W++EGD N+ +FH++   ++ +  I E+ + +GQ +   + I +E + +++ LY          + +DW+P+     V LE  FT
Subjt:  VAFEEQRWKQKCKNRWIEEGDNNTSYFHRILVAKKRKNTISEILSIHGQSLQCEDEIIKEFIDFYKALYKKEDVLGAFTDDIDWNPLQSHLVVDLEGSFT

Query:  ETKL------LGSDKSPGSDGFTSEFFKKCWNILKDDIMRVFQDFFKNGIINASLNETYICLIPKVD-ARTVADFRPISLTTCLYKIIARVLYERLKKVL
        E ++      +  DK+PG DGFT   F+ CW ++K+D+++VF +F ++GIIN S N ++I L+PK   +R ++DFRPISL T LYKIIA+VL  R+++VL
Subjt:  ETKL------LGSDKSPGSDGFTSEFFKKCWNILKDDIMRVFQDFFKNGIINASLNETYICLIPKVD-ARTVADFRPISLTTCLYKIIARVLYERLKKVL

Query:  PFTIREHQQAFVEGSDGFTSEFFKKCWNILKDDIMRVFQDFFKNGIINASLNETYICLIPKVDARTVADFRPISLTTCLYKIIARVLYERLKKVLPFTIR
          TI   Q AFV+G               + D ++        N I++         ++ K+D     D                V ++ L  V+     
Subjt:  PFTIREHQQAFVEGSDGFTSEFFKKCWNILKDDIMRVFQDFFKNGIINASLNETYICLIPKVDARTVADFRPISLTTCLYKIIARVLYERLKKVLPFTIR

Query:  EHQQAFVEAKGFGNKWRSWIRGCISSANYSILINGRPRGKIHASRGLRQGDPLSPFLFIMIMDSFSRLLTKADSDGLIKGFRTDPNSPSINHIQFADDTI
               E KGFG +WR W+RGC+SS ++++L+NG  +G + ASRGLRQGDPLSPFLF ++ D  SR+L KA+   +++GF+   N   ++H+QFADDTI
Subjt:  EHQQAFVEAKGFGNKWRSWIRGCISSANYSILINGRPRGKIHASRGLRQGDPLSPFLFIMIMDSFSRLLTKADSDGLIKGFRTDPNSPSINHIQFADDTI

Query:  LFTQFEDDLSAKSMFKVVEAFEQASGHNINLHKTEILGINVEHSFFWRL
         F+   ++    ++  V+  F   SG  +NL K+ I GIN+E +   RL
Subjt:  LFTQFEDDLSAKSMFKVVEAFEQASGHNINLHKTEILGINVEHSFFWRL

SwissProt top hits | e value | % identity
O00370 LINE-1 retrotransposable element ORF2 protein | 5.2e-26 | 24.84
Query:  TKRNQLLTEISILDSIEETGNISPIQLSQRRSLKAEILNLVAFEEQRWKQKC--KNRWIEEGDNNTSY-FHRILVAKKRKNTISEILSIHGQSLQCEDEI
        +K + L +++  L+  E+T +    + S+R+ +      L   E Q+  QK      W  E  N       R++  K+ KN I  I +  G       EI
Subjt:  TKRNQLLTEISILDSIEETGNISPIQLSQRRSLKAEILNLVAFEEQRWKQKC--KNRWIEEGDNNTSY-FHRILVAKKRKNTISEILSIHGQSLQCEDEI

Query:  IKEFIDFYKALYKKE----DVLGAFTDDIDWNPLQSHLVVDLEGSFTETKL------LGSDKSPGSDGFTSEFFKKCWNILKDDIMRVFQDFFKNGIINA
             ++YK LY  +    + +  F D      L    V  L    T +++      L + KSPG DGFT+EF+++    L   ++++FQ   K GI+  
Subjt:  IKEFIDFYKALYKKE----DVLGAFTDDIDWNPLQSHLVVDLEGSFTETKL------LGSDKSPGSDGFTSEFFKKCWNILKDDIMRVFQDFFKNGIINA

Query:  SLNETYICLIPKV--DARTVADFRPISLTTCLYKIIARVLYERLKKVLPFTIREHQQAFVEGSDGFTSEFFKKCWNILKDDIMRVFQDFFKNGIINASLN
        S  E  I LIPK   D     +FRPISL     KI+ ++L  R+++ +   I   Q  F+ G  G+ +   +K  N+++               IN + +
Subjt:  SLNETYICLIPKV--DARTVADFRPISLTTCLYKIIARVLYERLKKVLPFTIREHQQAFVEGSDGFTSEFFKKCWNILKDDIMRVFQDFFKNGIINASLN

Query:  ETYICLIPKVDARTVADFRPISLTTCLYKIIARVLYERLKKVLPFTIREHQQAFVEAKGFGNKWRSWIRGCISSANYSILINGRPRGKIHASRGLRQGDP
        + ++  I  +DA    D           KI    + + L K+                G    +   IR        +I++NG+         G RQG P
Subjt:  ETYICLIPKVDARTVADFRPISLTTCLYKIIARVLYERLKKVLPFTIREHQQAFVEAKGFGNKWRSWIRGCISSANYSILINGRPRGKIHASRGLRQGDP

Query:  LSPFLFIMIMDSFSRLLTKADSDGLIKGFRTDPNSPSINHIQFADDTILFTQFEDDLSAKSMFKVVEAFEQASGHNINLHKTE
        LSP LF ++++  +R + +      IKG +       ++   FADD I++ +    +SA+++ K++  F + SG+ IN+ K++
Subjt:  LSPFLFIMIMDSFSRLLTKADSDGLIKGFRTDPNSPSINHIQFADDTILFTQFEDDLSAKSMFKVVEAFEQASGHNINLHKTE

P08548 LINE-1 reverse transcriptase homolog | 6.0e-22 | 25.89
Query:  NQLLTEISILDSIEETGNISPIQLSQRRSLKAEILNLVAFEEQRWKQKC--KNRWIEEGDNNTSYFHRILVAKKR-KNTISEILSIHGQSLQCEDEIIKE
        N L+  +  L+  EE  N  P +  +   ++AE+  +   E +R  Q+      W  E  N        L  KKR K+ IS I + + +      EI K 
Subjt:  NQLLTEISILDSIEETGNISPIQLSQRRSLKAEILNLVAFEEQRWKQKC--KNRWIEEGDNNTSYFHRILVAKKR-KNTISEILSIHGQSLQCEDEIIKE

Query:  FIDFYKALY--KKEDV--LGAFTDDIDWNPLQSHLVVDLEGSFTETKL------LGSDKSPGSDGFTSEFFKKCWNILKDDIMRVFQDFFKNGIINASLN
          ++YK LY  K E++  +  + +      L    V  L    + +++      L   KSPG DGFTSEF++     L   ++ +FQ+  K GI+  +  
Subjt:  FIDFYKALY--KKEDV--LGAFTDDIDWNPLQSHLVVDLEGSFTETKL------LGSDKSPGSDGFTSEFFKKCWNILKDDIMRVFQDFFKNGIINASLN

Query:  ETYICLIPK--VDARTVADFRPISLTTCLYKIIARVLYERLKKVLPFTIREHQQAFVEGSDGFTSEFFKKCWNILKDDIMRVFQDFFKNGIINASLNETY
        E  I LIPK   D     ++RPISL     KI+ ++L  R+++ +   I   Q  F+ GS G+ +   +K  N+++               IN   N+ +
Subjt:  ETYICLIPK--VDARTVADFRPISLTTCLYKIIARVLYERLKKVLPFTIREHQQAFVEGSDGFTSEFFKKCWNILKDDIMRVFQDFFKNGIINASLNETY

Query:  ICLIPKVDARTVADFRPISLTTCLYKIIARVLYERLKKVLPFTIREHQQAFVEAKGFGNKWRSWIRGCISSANYSILINGRPRGKIHASRGLRQGDPLSP
        + L   +DA    D            I    +   LKK+                G    +   I    S    +I++NG          G RQG PLSP
Subjt:  ICLIPKVDARTVADFRPISLTTCLYKIIARVLYERLKKVLPFTIREHQQAFVEAKGFGNKWRSWIRGCISSANYSILINGRPRGKIHASRGLRQGDPLSP

Query:  FLFIMIMDSFSRLLTKADSDGLIKGFRTDPNSPSINHIQFADDTILFTQFEDDLSAKSMFKVVEAFEQASGHNINLHKT
         LF ++M+  +  + +   +  IKG      S  I    FADD I++ +   D + K + +V++ +   SG+ IN HK+
Subjt:  FLFIMIMDSFSRLLTKADSDGLIKGFRTDPNSPSINHIQFADDTILFTQFEDDLSAKSMFKVVEAFEQASGHNINLHKT

P11369 LINE-1 retrotransposable element ORF2 protein | 1.1e-18 | 23.82
Query:  TTKRNQLLTEISILDSIEETGNISPIQLSQRRSLKAEILNLVA----FEEQRWKQKCK--NRWIEEGDNNTSY-FHRILVAKKRKNTISEILSIHGQSLQ
        T   + L T +  L+  E          S +RS + EI+ L       E +R  Q+      W  E  N       R+    + K  I++I +  G    
Subjt:  TTKRNQLLTEISILDSIEETGNISPIQLSQRRSLKAEILNLVA----FEEQRWKQKCK--NRWIEEGDNNTSY-FHRILVAKKRKNTISEILSIHGQSLQ

Query:  CEDEIIKEFIDFYKALYKKE----DVLGAFTDDIDWNPLQSHLVVDLEGSFTETKL------LGSDKSPGSDGFTSEFFKKCWNILKDDIMRVFQDFFKN
          +EI      FYK LY  +    D +  F D      L    V  L    +  ++      L + KSPG DGF++EF++     L   + ++F      
Subjt:  CEDEIIKEFIDFYKALYKKE----DVLGAFTDDIDWNPLQSHLVVDLEGSFTETKL------LGSDKSPGSDGFTSEFFKKCWNILKDDIMRVFQDFFKN

Query:  GIINASLNETYICLIPK--VDARTVADFRPISLTTCLYKIIARVLYERLKKVLPFTIREHQQAFVEGSDGFTSEFFKKCWNILKDDIMRVFQDFFKNGII
        G +  S  E  I LIPK   D   + +FRPISL     KI+ ++L  R+++ +   I   Q  F+ G  G+ +   +K  N++                I
Subjt:  GIINASLNETYICLIPK--VDARTVADFRPISLTTCLYKIIARVLYERLKKVLPFTIREHQQAFVEGSDGFTSEFFKKCWNILKDDIMRVFQDFFKNGII

Query:  NASLNETYICLIPKVDARTVADFRPISLTTCLYKIIARVLYERLKKVLPFTIREHQQAFVEAKGFGNKWRSWIRGCISSANYSILINGRPRGKIHASRGL
        N   ++ +  +I  +DA    D                      K   PF I+      +E  G    + + I+   S    +I +NG     I    G 
Subjt:  NASLNETYICLIPKVDARTVADFRPISLTTCLYKIIARVLYERLKKVLPFTIREHQQAFVEAKGFGNKWRSWIRGCISSANYSILINGRPRGKIHASRGL

Query:  RQGDPLSPFLFIMIMDSFSRLLTKADSDGLIKGFRTDPNSPSINHIQFADDTILFTQFEDDLSAKSMFKVVEAFEQASGHNINLHKT
        RQG PLSP+LF ++++  +R + +      IKG +       I+ +  ADD I++   +   S + +  ++ +F +  G+ IN +K+
Subjt:  RQGDPLSPFLFIMIMDSFSRLLTKADSDGLIKGFRTDPNSPSINHIQFADDTILFTQFEDDLSAKSMFKVVEAFEQASGHNINLHKT

P14381 Transposon TX1 uncharacterized 149 kDa protein | 6.2e-19 | 23.98
Query:  DNNTSYFHRILVAKKRKNTISEILSIHGQSLQCEDEIIKEFIDFYKALYKKEDVLGAFTDDIDWNPL---------QSHLVVDLEGSFTETKLLGSDKSP
        D  + +F+ +   K  +  I+ + +  G  L+  + I      FY+ L+  + +     +++ W+ L         +    + L+      +L+  +KSP
Subjt:  DNNTSYFHRILVAKKRKNTISEILSIHGQSLQCEDEIIKEFIDFYKALYKKEDVLGAFTDDIDWNPL---------QSHLVVDLEGSFTETKLLGSDKSP

Query:  GSDGFTSEFFKKCWNILKDDIMRVFQDFFKNGIINASLNETYICLIPKV-DARTVADFRPISLTTCLYKIIARVLYERLKKVLPFTIREHQQAFVEGSDG
        G DG T EFF+  W+ L  D  RV  + FK G +  S     + L+PK  D R + ++RP+SL +  YKI+A+ +  RLK VL   I   Q   V G   
Subjt:  GSDGFTSEFFKKCWNILKDDIMRVFQDFFKNGIINASLNETYICLIPKV-DARTVADFRPISLTTCLYKIIARVLYERLKKVLPFTIREHQQAFVEGSDG

Query:  FTSEFFKKCWNILKDDIMRVFQDFFKNGIINASLNETYICLIPKVDARTVADFRPISLTTCLYKIIARVLYERLKKVLPFTIREHQQAFVEAKGFGNKWR
        F + F       L  D++   +   + G+  A L+        +VD                                     ++    ++A  FG ++ 
Subjt:  FTSEFFKKCWNILKDDIMRVFQDFFKNGIINASLNETYICLIPKVDARTVADFRPISLTTCLYKIIARVLYERLKKVLPFTIREHQQAFVEAKGFGNKWR

Query:  SWIRGCISSANYSILINGRPRGKIHASRGLRQGDPLSPFLFIMIMDSFSRLLTKADSDGLIKGFRTDPNSPSINHIQFADDTILFTQFEDDLSAKSMFKV
         +++   +SA   + IN      +   RG+RQG PLS  L+ + ++ F  LL K  +  ++K    +P+   +    +ADD IL  Q   DL      + 
Subjt:  SWIRGCISSANYSILINGRPRGKIHASRGLRQGDPLSPFLFIMIMDSFSRLLTKADSDGLIKGFRTDPNSPSINHIQFADDTILFTQFEDDLSAKSMFKV

Query:  VEAFEQASGHNINLHKT
         E +  AS   IN  K+
Subjt:  VEAFEQASGHNINLHKT

P92555 Uncharacterized mitochondrial protein AtMg01250 (e value: 6.0e-14, % identity: 54.41)
Query:  LINGRPRGKIHASRGLRQGDPLSPFLFIMIMDSFSRLLTKADSDGLIKGFRTDPNSPSINHIQFADDT
        +ING P+G +  SRGLRQGDPLSP+LFI+  +  S L  +A   G + G R   NSP INH+ FADDT
Subjt:  LINGRPRGKIHASRGLRQGDPLSPFLFIMIMDSFSRLLTKADSDGLIKGFRTDPNSPSINHIQFADDT

Arabidopsis top hits (e value, % identity, alignment)
AT1G43760.1 DNAse I-like superfamily protein (e value: 6.4e-19, % identity: 25.8)
Query:  SDHYPLMLTMGS-GKWGPSPFRFENMWLKHRSFLPLIDYWWKNTPLRGRPGHRFIM--KLKDLKVRLRSWNKDVFGCNTTKRNQLLTEISILDSIEETGN
        SDH P ++ + +  K     FR+ +    H +FL  +   W+     G   H F +   LK  K   +  N+  FG    K  + L  +  + S +   N
Subjt:  SDHYPLMLTMGS-GKWGPSPFRFENMWLKHRSFLPLIDYWWKNTPLRGRPGHRFIM--KLKDLKVRLRSWNKDVFGCNTTKRNQLLTEISILDSIEETGN

Query:  ISPIQLSQRRSLKAEILNLVAFEEQRWKQKCKNRWIEEGDNNTSYFHRILVAKKRKNTISEILSIHGQSLQCEDEIIKEFIDFYKALYKKE-DVLGAFTD
         S          + +     A  E  ++QK + +W+++GD NT +FH++++A + KN I  +       ++   ++ +  + +Y  L   + D+L    D
Subjt:  ISPIQLSQRRSLKAEILNLVAFEEQRWKQKCKNRWIEEGDNNTSYFHRILVAKKRKNTISEILSIHGQSLQCEDEIIKEFIDFYKALYKKE-DVLGAFTD

Query:  DI----DWNPLQSH--LVVDLEGSFTETKLLGS------DKSPGSDGFTSEFFKKCWNILKDDIMRVFQDFFKNGIINASLNETYICLIPKV-DARTVAD
         +    D +P + +  L   L    ++ ++  +      +K+PG D FT+EFF + W ++KD  +   ++FF+ G +    N T I LIPKV     ++ 
Subjt:  DI----DWNPLQSH--LVVDLEGSFTETKLLGS------DKSPGSDGFTSEFFKKCWNILKDDIMRVFQDFFKNGIINASLNETYICLIPKV-DARTVAD

Query:  FRPISLTTCLYKII
        FRP+S  T +YKII
Subjt:  FRPISLTTCLYKII

ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase) (e value: 4.3e-15, % identity: 54.41)
Query:  LINGRPRGKIHASRGLRQGDPLSPFLFIMIMDSFSRLLTKADSDGLIKGFRTDPNSPSINHIQFADDT
        +ING P+G +  SRGLRQGDPLSP+LFI+  +  S L  +A   G + G R   NSP INH+ FADDT
Subjt:  LINGRPRGKIHASRGLRQGDPLSPFLFIMIMDSFSRLLTKADSDGLIKGFRTDPNSPSINHIQFADDT
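
The % identity values reported for each hit can be checked against the alignment itself. The sketch below assumes the usual BLAST midline convention (a letter marks an identical residue, "+" a conservative substitution, a space a mismatch) and assumes that % identity is the number of identical positions divided by the alignment length. The function name percent_identity is illustrative and not part of the database. It is applied to the ATMG01250.1 alignment shown above.

```python
def percent_identity(query: str, midline: str) -> float:
    """Percent identity from a BLAST-style midline (assumed convention:
    letters = identical residues, '+' = conservative, ' ' = mismatch)."""
    if len(query) != len(midline):
        raise ValueError("query and midline must have the same length")
    identical = sum(1 for ch in midline if ch.isalpha())
    return round(100.0 * identical / len(query), 2)


# Query and midline copied from the ATMG01250.1 alignment above.
query = "LINGRPRGKIHASRGLRQGDPLSPFLFIMIMDSFSRLLTKADSDGLIKGFRTDPNSPSINHIQFADDT"
midline = "+ING P+G +  SRGLRQGDPLSP+LFI+  +  S L  +A   G + G R   NSP INH+ FADDT"

identity = percent_identity(query, midline)
print(identity)
```

With 37 identical positions over 68 aligned residues, this reproduces the 54.41 listed for the hit.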


Sequences
CDS sequence
ATGGAAAAGAAAGAAGAGAAGGGGGCGGTTATCAAATTAGACATCGAGAAGGCCTTTGATAAGGTGAATTGGATCTATTTGGACAATATTCTTGAAGCCAAAGGCTTTGG
CAACAAATGGAGATCATGGATTCGAGGTTGTGTCTCATCATCCAACTTCTCCAATCTCATCAACGGTCGTCCTAGAGGTAAAATCTTAGCCTCGAGGGGCCTTCGTCAGG
GTGATCCCTTGTCTCCATTTCTATTCATCATGATTATGGATAGTTTTAGCCGTATGTTGACTAAAGCCGAAAATGAAGGTTTTATCAATGGCTTTCGGGTTGGTCATGGG
CGATTATGCATTAACCATATCCAATTCGCTGATGACACAATTCTTTTCTCTAATTTATCTGATGATTCATTCCTTCCGAACCTTCTTAATGTGGTGAGGGCATTCGAAAA
GGTTTCCGGTCACAACATTAATATGCATAAATCTGTATTGGCATTAATGTCTCCCCAACAGATCTTGAAGAGCTACGTGTGCGGAGGATTGGTCGAAATTGCAAAGAAGA
CAGCCGCTCAATTGGATCTCATGGAAGCCAGCATCCATGTTAGAGAAAACACCTCTGGTTTCATCCCCGCTGATATAGGCCTTCCTTCATCATCATCGAAGCCCATTATC
ATTCAGATTGACCCATTCTTTGATGCCGATAAGTGTGTGGGATCCATCGAGGCTTCACGTACAAGGATGTCGATGCCGATCACTAGAGCATCGTTGGAAGGAATCCTGGG
GCCGTACCCTCGAGAAGGTGCCATCTCTCTTCCCCTATCTCCCACCGACAAGTGCCCTGCCCCACACCCTCCCGATAGCCCATTTACCAGCCCCACTACTGACCCCCCTC
ATAGCCCAATAGATAACTCTCCACATTCTAGCCCAAAAGCCTCATCCCCTCTCAATAACACACCCAACAGCCCAAACCCAAATCTTCTCCTACCAAACTCGTCGTCGGTG
AGCAACTCCCCCGATAAATTGGTCATATCCATTAACCATCATCAAACCTACCTCGTTCCTGGGCATAAATTCACAACCTCCCTTCCCATCATGGACACCGACGACGAATA
TGCATCCAACCCTCTTCCCCTATCCACTGCCCCGCCTTCGCCGTTGGCTAACAAACCCTCCTCTCCCACCCCTATACACCTAGCCAACACAGAGATTGGCACCATTCTCA
TGGAAGACCATGAAGCTCAAGATGGAAAATTTGACTATCAGGCATCTCCAGAAAAGGCCCTTCAAATTATAGTCCCTTGGCTGGAAAAATACCAGGTATTGGAGATTGGC
AAAAAAGGGCAGTACGTCGACTGCAATATTATTAAGTCTCTCTGGAGCGGCAGAAACATCAGTTGGGCTTTCTTGGAAGCTTCAGGCTCTTCGGGTGGTATCATCATTAT
GTGGAATGATCCTTCGATTGTCACCACAGACATTACGAAAGGTATGTTCTCCATATCCATCCTTCTTTCTCTCGCTGACGGCTACAACTTTTGGATCACAGGCGTATATG
GCCCCAACAAATATAGGGAGAGACCATCTTTTTGGCAGGAAATATACTGTCCCAAGGCATTCTCGATAAGTTCATATTTCATAAATCTGGAGGCCAGAAAACTTGACCGA
CCAACGTCCGATCACTACCCATTGATGCTTACTATGGGCAGTGGTAAATGGGGGCCATCCCCTTTCCGTTTTGAGAATATGTGGTTGAAACACCGATCTTTCTTGCCCTT
GATTGATTACTGGTGGAAGAATACTCCTCTGAGAGGCAGGCCGGGCCACAGATTTATCATGAAATTGAAAGATTTGAAAGTGAGACTTCGCTCTTGGAATAAAGATGTCT
TTGGGTGTAACACTACCAAAAGAAACCAATTGTTGACAGAGATCTCTATCCTTGATAGTATAGAAGAAACAGGTAATATCTCCCCAATTCAACTCTCGCAGCGTAGATCT
TTGAAAGCCGAAATCCTTAACCTCGTTGCCTTCGAGGAACAAAGATGGAAGCAAAAATGTAAAAATAGATGGATAGAGGAGGGTGACAACAACACTAGTTATTTTCACCG
TATTTTGGTGGCCAAGAAAAGGAAAAATACCATCTCAGAAATCCTCTCTATTCATGGTCAAAGCCTTCAATGTGAAGATGAGATCATCAAAGAATTTATAGATTTTTACA
AAGCCTTATACAAGAAGGAAGATGTATTGGGAGCCTTTACAGATGACATTGATTGGAATCCTTTACAAAGCCACCTTGTTGTGGATTTGGAAGGGTCATTCACAGAGACT
AAACTTCTAGGTAGCGACAAATCGCCAGGATCGGATGGTTTTACATCAGAATTCTTTAAAAAATGTTGGAACATCCTCAAAGACGACATTATGAGAGTGTTCCAAGATTT
TTTCAAGAATGGTATTATCAATGCTAGCCTCAACGAGACTTACATTTGCTTGATTCCGAAGGTTGATGCTCGCACAGTGGCAGATTTTCGTCCTATCAGTCTCACTACTT
GTTTATACAAAATTATAGCTCGGGTTCTTTATGAGAGATTAAAGAAAGTTTTGCCCTTTACCATTAGAGAACACCAACAGGCTTTCGTGGAAGGATCGGATGGTTTTACA
TCAGAATTCTTTAAAAAATGTTGGAACATCCTCAAAGACGACATTATGAGAGTGTTCCAAGATTTTTTCAAGAATGGTATTATCAATGCTAGCCTCAACGAGACTTACAT
TTGCTTGATTCCGAAGGTTGATGCTCGCACAGTGGCAGATTTTCGTCCTATCAGTCTCACTACTTGTTTATACAAAATTATAGCTCGGGTTCTTTATGAGAGATTAAAGA
AAGTTTTGCCCTTTACCATTAGAGAACACCAACAGGCTTTCGTGGAAGCCAAAGGATTCGGTAACAAGTGGAGATCCTGGATTAGAGGATGCATTTCATCAGCAAATTAT
TCAATCCTCATAAATGGCCGCCCTCGAGGTAAAATCCATGCTTCTCGGGGATTAAGACAAGGAGATCCACTGTCTCCTTTTCTTTTTATCATGATTATGGATAGTTTTAG
TCGATTGCTCACCAAAGCTGACTCCGATGGGCTTATTAAAGGTTTCAGAACTGATCCAAATTCCCCAAGCATCAACCACATCCAATTCGCCGATGACACCATTTTATTCA
CTCAGTTTGAAGATGATTTATCGGCAAAATCCATGTTCAAAGTGGTGGAGGCTTTTGAACAAGCATCTGGACATAACATTAATCTTCATAAAACTGAAATTCTGGGCATA
AATGTGGAGCATAGTTTTTTTTGGAGACTTTTGCTCATCAATCTGGATGCAAATTGA
mRNA sequence
ATGGAAAAGAAAGAAGAGAAGGGGGCGGTTATCAAATTAGACATCGAGAAGGCCTTTGATAAGGTGAATTGGATCTATTTGGACAATATTCTTGAAGCCAAAGGCTTTGG
CAACAAATGGAGATCATGGATTCGAGGTTGTGTCTCATCATCCAACTTCTCCAATCTCATCAACGGTCGTCCTAGAGGTAAAATCTTAGCCTCGAGGGGCCTTCGTCAGG
GTGATCCCTTGTCTCCATTTCTATTCATCATGATTATGGATAGTTTTAGCCGTATGTTGACTAAAGCCGAAAATGAAGGTTTTATCAATGGCTTTCGGGTTGGTCATGGG
CGATTATGCATTAACCATATCCAATTCGCTGATGACACAATTCTTTTCTCTAATTTATCTGATGATTCATTCCTTCCGAACCTTCTTAATGTGGTGAGGGCATTCGAAAA
GGTTTCCGGTCACAACATTAATATGCATAAATCTGTATTGGCATTAATGTCTCCCCAACAGATCTTGAAGAGCTACGTGTGCGGAGGATTGGTCGAAATTGCAAAGAAGA
CAGCCGCTCAATTGGATCTCATGGAAGCCAGCATCCATGTTAGAGAAAACACCTCTGGTTTCATCCCCGCTGATATAGGCCTTCCTTCATCATCATCGAAGCCCATTATC
ATTCAGATTGACCCATTCTTTGATGCCGATAAGTGTGTGGGATCCATCGAGGCTTCACGTACAAGGATGTCGATGCCGATCACTAGAGCATCGTTGGAAGGAATCCTGGG
GCCGTACCCTCGAGAAGGTGCCATCTCTCTTCCCCTATCTCCCACCGACAAGTGCCCTGCCCCACACCCTCCCGATAGCCCATTTACCAGCCCCACTACTGACCCCCCTC
ATAGCCCAATAGATAACTCTCCACATTCTAGCCCAAAAGCCTCATCCCCTCTCAATAACACACCCAACAGCCCAAACCCAAATCTTCTCCTACCAAACTCGTCGTCGGTG
AGCAACTCCCCCGATAAATTGGTCATATCCATTAACCATCATCAAACCTACCTCGTTCCTGGGCATAAATTCACAACCTCCCTTCCCATCATGGACACCGACGACGAATA
TGCATCCAACCCTCTTCCCCTATCCACTGCCCCGCCTTCGCCGTTGGCTAACAAACCCTCCTCTCCCACCCCTATACACCTAGCCAACACAGAGATTGGCACCATTCTCA
TGGAAGACCATGAAGCTCAAGATGGAAAATTTGACTATCAGGCATCTCCAGAAAAGGCCCTTCAAATTATAGTCCCTTGGCTGGAAAAATACCAGGTATTGGAGATTGGC
AAAAAAGGGCAGTACGTCGACTGCAATATTATTAAGTCTCTCTGGAGCGGCAGAAACATCAGTTGGGCTTTCTTGGAAGCTTCAGGCTCTTCGGGTGGTATCATCATTAT
GTGGAATGATCCTTCGATTGTCACCACAGACATTACGAAAGGTATGTTCTCCATATCCATCCTTCTTTCTCTCGCTGACGGCTACAACTTTTGGATCACAGGCGTATATG
GCCCCAACAAATATAGGGAGAGACCATCTTTTTGGCAGGAAATATACTGTCCCAAGGCATTCTCGATAAGTTCATATTTCATAAATCTGGAGGCCAGAAAACTTGACCGA
CCAACGTCCGATCACTACCCATTGATGCTTACTATGGGCAGTGGTAAATGGGGGCCATCCCCTTTCCGTTTTGAGAATATGTGGTTGAAACACCGATCTTTCTTGCCCTT
GATTGATTACTGGTGGAAGAATACTCCTCTGAGAGGCAGGCCGGGCCACAGATTTATCATGAAATTGAAAGATTTGAAAGTGAGACTTCGCTCTTGGAATAAAGATGTCT
TTGGGTGTAACACTACCAAAAGAAACCAATTGTTGACAGAGATCTCTATCCTTGATAGTATAGAAGAAACAGGTAATATCTCCCCAATTCAACTCTCGCAGCGTAGATCT
TTGAAAGCCGAAATCCTTAACCTCGTTGCCTTCGAGGAACAAAGATGGAAGCAAAAATGTAAAAATAGATGGATAGAGGAGGGTGACAACAACACTAGTTATTTTCACCG
TATTTTGGTGGCCAAGAAAAGGAAAAATACCATCTCAGAAATCCTCTCTATTCATGGTCAAAGCCTTCAATGTGAAGATGAGATCATCAAAGAATTTATAGATTTTTACA
AAGCCTTATACAAGAAGGAAGATGTATTGGGAGCCTTTACAGATGACATTGATTGGAATCCTTTACAAAGCCACCTTGTTGTGGATTTGGAAGGGTCATTCACAGAGACT
AAACTTCTAGGTAGCGACAAATCGCCAGGATCGGATGGTTTTACATCAGAATTCTTTAAAAAATGTTGGAACATCCTCAAAGACGACATTATGAGAGTGTTCCAAGATTT
TTTCAAGAATGGTATTATCAATGCTAGCCTCAACGAGACTTACATTTGCTTGATTCCGAAGGTTGATGCTCGCACAGTGGCAGATTTTCGTCCTATCAGTCTCACTACTT
GTTTATACAAAATTATAGCTCGGGTTCTTTATGAGAGATTAAAGAAAGTTTTGCCCTTTACCATTAGAGAACACCAACAGGCTTTCGTGGAAGGATCGGATGGTTTTACA
TCAGAATTCTTTAAAAAATGTTGGAACATCCTCAAAGACGACATTATGAGAGTGTTCCAAGATTTTTTCAAGAATGGTATTATCAATGCTAGCCTCAACGAGACTTACAT
TTGCTTGATTCCGAAGGTTGATGCTCGCACAGTGGCAGATTTTCGTCCTATCAGTCTCACTACTTGTTTATACAAAATTATAGCTCGGGTTCTTTATGAGAGATTAAAGA
AAGTTTTGCCCTTTACCATTAGAGAACACCAACAGGCTTTCGTGGAAGCCAAAGGATTCGGTAACAAGTGGAGATCCTGGATTAGAGGATGCATTTCATCAGCAAATTAT
TCAATCCTCATAAATGGCCGCCCTCGAGGTAAAATCCATGCTTCTCGGGGATTAAGACAAGGAGATCCACTGTCTCCTTTTCTTTTTATCATGATTATGGATAGTTTTAG
TCGATTGCTCACCAAAGCTGACTCCGATGGGCTTATTAAAGGTTTCAGAACTGATCCAAATTCCCCAAGCATCAACCACATCCAATTCGCCGATGACACCATTTTATTCA
CTCAGTTTGAAGATGATTTATCGGCAAAATCCATGTTCAAAGTGGTGGAGGCTTTTGAACAAGCATCTGGACATAACATTAATCTTCATAAAACTGAAATTCTGGGCATA
AATGTGGAGCATAGTTTTTTTTGGAGACTTTTGCTCATCAATCTGGATGCAAATTGA
Protein sequence
MEKKEEKGAVIKLDIEKAFDKVNWIYLDNILEAKGFGNKWRSWIRGCVSSSNFSNLINGRPRGKILASRGLRQGDPLSPFLFIMIMDSFSRMLTKAENEGFINGFRVGHG
RLCINHIQFADDTILFSNLSDDSFLPNLLNVVRAFEKVSGHNINMHKSVLALMSPQQILKSYVCGGLVEIAKKTAAQLDLMEASIHVRENTSGFIPADIGLPSSSSKPII
IQIDPFFDADKCVGSIEASRTRMSMPITRASLEGILGPYPREGAISLPLSPTDKCPAPHPPDSPFTSPTTDPPHSPIDNSPHSSPKASSPLNNTPNSPNPNLLLPNSSSV
SNSPDKLVISINHHQTYLVPGHKFTTSLPIMDTDDEYASNPLPLSTAPPSPLANKPSSPTPIHLANTEIGTILMEDHEAQDGKFDYQASPEKALQIIVPWLEKYQVLEIG
KKGQYVDCNIIKSLWSGRNISWAFLEASGSSGGIIIMWNDPSIVTTDITKGMFSISILLSLADGYNFWITGVYGPNKYRERPSFWQEIYCPKAFSISSYFINLEARKLDR
PTSDHYPLMLTMGSGKWGPSPFRFENMWLKHRSFLPLIDYWWKNTPLRGRPGHRFIMKLKDLKVRLRSWNKDVFGCNTTKRNQLLTEISILDSIEETGNISPIQLSQRRS
LKAEILNLVAFEEQRWKQKCKNRWIEEGDNNTSYFHRILVAKKRKNTISEILSIHGQSLQCEDEIIKEFIDFYKALYKKEDVLGAFTDDIDWNPLQSHLVVDLEGSFTET
KLLGSDKSPGSDGFTSEFFKKCWNILKDDIMRVFQDFFKNGIINASLNETYICLIPKVDARTVADFRPISLTTCLYKIIARVLYERLKKVLPFTIREHQQAFVEGSDGFT
SEFFKKCWNILKDDIMRVFQDFFKNGIINASLNETYICLIPKVDARTVADFRPISLTTCLYKIIARVLYERLKKVLPFTIREHQQAFVEAKGFGNKWRSWIRGCISSANY
SILINGRPRGKIHASRGLRQGDPLSPFLFIMIMDSFSRLLTKADSDGLIKGFRTDPNSPSINHIQFADDTILFTQFEDDLSAKSMFKVVEAFEQASGHNINLHKTEILGI
NVEHSFFWRLLLINLDAN
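
The CDS and protein records above can be cross-checked by translation. The following is a minimal sketch, assuming the standard genetic code and a reading frame that starts at the first base of the fragment. The helper name translate is hypothetical. It is shown on the final line of the CDS, which is 57 bases long and so falls on a codon boundary.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1; '*' marks a stop codon.
    Trailing bases that do not fill a codon are ignored."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# Last line of the CDS sequence above.
cds_tail = "AATGTGGAGCATAGTTTTTTTTGGAGACTTTTGCTCATCAATCTGGATGCAAATTGA"
peptide = translate(cds_tail)
print(peptide.rstrip("*"))
```

The translated tail, with the terminal stop removed, matches the last line of the protein sequence.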