; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

ClCG05G011290 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene IDClCG05G011290
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
DescriptionLINE-1 retrotransposable element ORF2 protein
Genome locationCG_Chr05:13315663..13321768
RNA-Seq ExpressionClCG05G011290
SyntenyClCG05G011290
Gene Ontology termsNA
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR025558 - Domain of unknown function DUF4283
IPR026960 - Reverse transcriptase zinc-binding domain
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAN65484.1 hypothetical protein VITISV_029474 [Vitis vinifera]1.2e-21929.93Show/hide
Query:  DDWKEIAKFL--EVLYNVKVKINPLFDDKALIRVVQGKIEDLIEALGKWFNYRKFHLLFEKWNPLLHSRPSVLRSF-GGWIAIKNLPLEYWSGETFEAIG
        ++W E+ + +   +  N  V I P    K +  V   K    +  L K     +  +   +W+P       + R F  GWI ++ LP   WS    + I 
Subjt:  DDWKEIAKFL--EVLYNVKVKINPLFDDKALIRVVQGKIEDLIEALGKWFNYRKFHLLFEKWNPLLHSRPSVLRSF-GGWIAIKNLPLEYWSGETFEAIG

Query:  QHFGGLEEISIETLNLLDVSEAKIKVRKNVCGFIPATIEINNEFRGSIHLLFKDIIPMKDSSSIIKEFIGSDFK----NPIDQVHLSKVEEDETQSGQEE
          +G ++EI   TL L D+S A++K+       +PA +E+ +                  + +++   +G + +       +   ++     ET  G+  
Subjt:  QHFGGLEEISIETLNLLDVSEAKIKVRKNVCGFIPATIEINNEFRGSIHLLFKDIIPMKDSSSIIKEFIGSDFK----NPIDQVHLSKVEEDETQSGQEE

Query:  KPPKILVGSPTVRKRSKLTANGHEGKSINAAVE----SPIGVLKSMRSAKVMKKEKRKERVFGRINENIKNFQTPGIPSRDFLPSGAPTGFETIQKNRKD
        +      G  T   R ++   G E K      E    + +G     R A+++ K      V G         +  G       P G  T   T+  +R++
Subjt:  KPPKILVGSPTVRKRSKLTANGHEGKSINAAVE----SPIGVLKSMRSAKVMKKEKRKERVFGRINENIKNFQTPGIPSRDFLPSGAPTGFETIQKNRKD

Query:  SKVPLSFTEEAEKSPPISMKSNREEFPFQTLPPP--LQLPRARVFTPLQQSPAHKVSPIKSPFFPAVRRRSDL-FKPFLKHFSRGKTSFLNLWLALTNPD
         +   S  E+ + +P +       EF    L     L   R R  T    S  H+   I+S   P  R +  +  +   K    G  S  N  L   N  
Subjt:  SKVPLSFTEEAEKSPPISMKSNREEFPFQTLPPP--LQLPRARVFTPLQQSPAHKVSPIKSPFFPAVRRRSDL-FKPFLKHFSRGKTSFLNLWLALTNPD

Query:  LLEVSCANSQVPQQSGNRSALPLSSS--QIFHQSKIL----------IPGSNILFLRGSPSSYRPKPNKIQDLEEESLISISSDELNEPEEDENHLSLVL
         L   C +    +  G R + P   +  +IF  ++ +            GS ++    SP +    P+ I +   E       +         +  SL +
Subjt:  LLEVSCANSQVPQQSGNRSALPLSSS--QIFHQSKIL----------IPGSNILFLRGSPSSYRPKPNKIQDLEEESLISISSDELNEPEEDENHLSLVL

Query:  DESIEGNKSIADHLSSMLNEKIPHHLRSFVEDCGISLKGKLFSKKIVSWNIRGL-GDYSKQLAVKHLIMKTNPRIVEAIKKL------------------
        +            ++ ++    P+ +   V      L+         + ++ G+ G  S +  ++ L  +   ++ E +K L                  
Subjt:  DESIEGNKSIADHLSSMLNEKIPHHLRSFVEDCGISLKGKLFSKKIVSWNIRGL-GDYSKQLAVKHLIMKTNPRIVEAIKKL------------------

Query:  WSSRDIRWSFVEAYGRSGGLLIMWDESKISVIETLKGGYTLSVKCKTLRRKVCWVTNVYRPTDYKERRRIWPELQALAAYCTNAWCLGGDFNITRAIHER
        W++R+  W+ + A G SGG+LI+WD  K+S  E + G +++S+K      +  W++ VY P     R+  W EL  +A   +  WC+GGDFN+ R   E+
Subjt:  WSSRDIRWSFVEAYGRSGGLLIMWDESKISVIETLKGGYTLSVKCKTLRRKVCWVTNVYRPTDYKERRRIWPELQALAAYCTNAWCLGGDFNITRAIHER

Query:  VPTGRFR-GMKKFNKFIEEAHLMEIPMSNGRFTWSR-EGNIISRSLLDRFLVTKEWDKAFEGTRVFRQARIISDHFPLLLEVGALEWGPSPFRFCNSWLL
        +   R    MK F+ FI +  L+++P+ +  FTWS  + N + +  LDRFL + EW++AF  +      R  SDH+P++LE    +WGP+PFRF N WL 
Subjt:  VPTGRFR-GMKKFNKFIEEAHLMEIPMSNGRFTWSR-EGNIISRSLLDRFLVTKEWDKAFEGTRVFRQARIISDHFPLLLEVGALEWGPSPFRFCNSWLL

Query:  NSQCCSTIIRTLEAGYHQGWAGFVIFAKLRSVKTSLQQWHADSLINQRKQEAEIVEKL---DKEEQGAELEDPSSILQDLRASLKADLMDIYRKKERDLI
        +        R        GW G     KL+ VK  L+ W+  S     K++ +I+  L   D  EQ   L   S  L   RA  K +L ++  ++E    
Subjt:  NSQCCSTIIRTLEAGYHQGWAGFVIFAKLRSVKTSLQQWHADSLINQRKQEAEIVEKL---DKEEQGAELEDPSSILQDLRASLKADLMDIYRKKERDLI

Query:  QKSKLNWLHLGDENTRFFRFLA-AKKRKNLIAELVNDQGIPTNSAGCFP---------------------ANLEWQRVSVEQNNRLSSKFSRKEIRFALR
        QK+++ W+  GD N+RFF  +A  ++ +  I EL N+ G+  N++                           L+W  +S E   RL S F+ +EI  A+ 
Subjt:  QKSKLNWLHLGDENTRFFRFLA-AKKRKNLIAELVNDQGIPTNSAGCFP---------------------ANLEWQRVSVEQNNRLSSKFSRKEIRFALR

Query:  GMGKNKVGGPDGFTVEFLNKFWDRIKDDFVALFNKFHENEKLNSCVKENVICLIKEKEDAIMVKDFRPISLTTLTYKVIAKVLAERLKLVMPSIIAPTQS
         M ++K  GPDGFT+      W+ IK+D V +F +FH +  +N     + I L+ +K  +  + DFRPISL T  YK+IAKVLA R++ V+   I  TQ 
Subjt:  GMGKNKVGGPDGFTVEFLNKFWDRIKDDFVALFNKFHENEKLNSCVKENVICLIKEKEDAIMVKDFRPISLTTLTYKVIAKVLAERLKLVMPSIIAPTQS

Query:  AFIEGRQILDPILIANELVEEYRIKKKKGWILKLDLEKAFDKVDWGFLEKVLQGKNFDSKWTSWTLGCIKNPKFSIFINRRPRGRVQASRGVRQGDPLSP
        AF++GRQILD +LIANE+V+E R   ++G + K+D EKA+D V W FL+ VL+ K F  +W  W  GC+ +  F++ +N   +G V+ASRG+RQGDPLSP
Subjt:  AFIEGRQILDPILIANELVEEYRIKKKKGWILKLDLEKAFDKVDWGFLEKVLQGKNFDSKWTSWTLGCIKNPKFSIFINRRPRGRVQASRGVRQGDPLSP

Query:  FIFLLVSEVLSSLISRLHKSKKFEGFIVGKKKVHVPILQFADDILLFCKYDFDMLEALRKTIEFFEWCSGQKVNWEKSALCGLNIDDLEVKLTAAKLNCK
        F+F +V++VLS ++ +  +    EGF VG+ +  V  LQFADD + F     + +  L+  +  F   SG KVN +KS + G+N++   +   A  L+CK
Subjt:  FIFLLVSEVLSSLISRLHKSKKFEGFIVGKKKVHVPILQFADDILLFCKYDFDMLEALRKTIEFFEWCSGQKVNWEKSALCGLNIDDLEVKLTAAKLNCK

Query:  AEKLPLMYLGLPLGGHPKMIVFWQPILDKIQGKLSKWKRYNLSRGGRLTQRKTVLSNLPSSYMSIFLMPEKVVLSIERAMRNFFWEGTGGSKLNHLARWE
        A   P++YLGLPLGG+PK   FW P++++I  +L  W++  LS GGR+T  ++ L+++P  ++S+F +P  V   IER  R+F W G G  K +HL  W+
Subjt:  AEKLPLMYLGLPLGGHPKMIVFWQPILDKIQGKLSKWKRYNLSRGGRLTQRKTVLSNLPSSYMSIFLMPEKVVLSIERAMRNFFWEGTGGSKLNHLARWE

Query:  TVTKYHKDGGLRLENLKIRNLALLSKWGWRFMQESEALWCNVITSIHGEDHFQWHTNRKEVASLRSPWISISRQWQKIEAFAIFKVGDGRRITFWSDPWI
         V K    GGL    + IRN+ALL KW WR+ +E  ALW  VI SI+G     W  N     S R PW +I+  +Q+   F  F VG+G RI FW D W 
Subjt:  TVTKYHKDGGLRLENLKIRNLALLSKWGWRFMQESEALWCNVITSIHGEDHFQWHTNRKEVASLRSPWISISRQWQKIEAFAIFKVGDGRRITFWSDPWI

Query:  EDQPLKVRFPRLFKLALKTTGTVADHWDPCSS--------SWDLLFKRQLKEEELGEFLSLSSSVANKRVMVQ-PDKRIWALEGNGSFSIKSLTNHLSVA
         +QPL V++PRL ++       V D   P SS        SW+  F+R L + E+ +   L  S+    +    PDKR W L  +G F++KS    LS  
Subjt:  EDQPLKVRFPRLFKLALKTTGTVADHWDPCSS--------SWDLLFKRQLKEEELGEFLSLSSSVANKRVMVQ-PDKRIWALEGNGSFSIKSLTNHLSVA

Query:  SPIDDEL-VKGLWKSKCPRRVNILIWTMLFGSLNCAATIQRKLPYHCLSPHMCVLCRQDQEDIQHLFFGCSYASSCWSRLFGFFDLSWVMGSDFKSNVLQ
        S        K +W ++ P +V   +W +    +N    +Q + PY  LSP +C LC +  E + HLF  CS     W RLF    + WV        +  
Subjt:  SPIDDEL-VKGLWKSKCPRRVNILIWTMLFGSLNCAATIQRKLPYHCLSPHMCVLCRQDQEDIQHLFFGCSYASSCWSRLFGFFDLSWVMGSDFKSNVLQ

Query:  ILLGPRLNKEPKLLWVNAVKALLSEIWFERNQHVFNDKASNWEDRFDQARLNASSWCTLSKTFKDYSIQELALN
           G   +K   +LW NA  A++  +W ERN  +F DKA N E  +D      S W   SK FK   +  L L+
Subjt:  ILLGPRLNKEPKLLWVNAVKALLSEIWFERNQHVFNDKASNWEDRFDQARLNASSWCTLSKTFKDYSIQELALN

CAN68165.1 hypothetical protein VITISV_008538 [Vitis vinifera]5.2e-22034.9Show/hide
Query:  KIVSWNIRGLGDYSKQLAVKHLIMKTNPRIV------------EAIKKLWSSRDIRWSFVEAYGRSGGLLIMWDESKISVIETLKGGYTLSVKCKTLRRK
        KI+SWN RGLG   K+  VK  +    P IV              +  +W++R+  W+ + A G SGG+L++WD  K+   E + G +++SVK      +
Subjt:  KIVSWNIRGLGDYSKQLAVKHLIMKTNPRIV------------EAIKKLWSSRDIRWSFVEAYGRSGGLLIMWDESKISVIETLKGGYTLSVKCKTLRRK

Query:  VCWVTNVYRPTDYKERRRIWPELQALAAYCTNAWCLGGDFNITRAIHERVPTGRFR-GMKKFNKFIEEAHLMEIPMSNGRFTWSREGNIISRSLLDRFLV
          W++ VY P     R+  W EL  +    +  WC+GGDFN+ R   E++  GR    MK  + FI E  L++ P+ +  FTWS          LDRFL 
Subjt:  VCWVTNVYRPTDYKERRRIWPELQALAAYCTNAWCLGGDFNITRAIHERVPTGRFR-GMKKFNKFIEEAHLMEIPMSNGRFTWSREGNIISRSLLDRFLV

Query:  TKEWDKAFEGTRVFRQARIISDHFPLLLEVGALEWGPSPFRFCNSWLLNSQCCSTIIRTLEAGYHQGWAGFVIFAKLRSVKTSLQQWHADS---LINQRK
        + EW++ F  +      R  SDH+P++LE    +WGP+PFRF N WL +        R        GW G     KL+ +K  L++W+ ++   LI ++K
Subjt:  TKEWDKAFEGTRVFRQARIISDHFPLLLEVGALEWGPSPFRFCNSWLLNSQCCSTIIRTLEAGYHQGWAGFVIFAKLRSVKTSLQQWHADS---LINQRK

Query:  QEAEIVEKLDKEEQGAELEDPSSILQDLRASLKADLMDIYRKKERDLIQKSKLNWLHLGDENTRFFRFLA-AKKRKNLIAELVNDQGIPTNSAGCFP---
             +   D  EQ   L  P  ++Q  RA  K +L ++  ++E    QK+++ W+  GD N++ F  +A  ++ +  I  L N++G+  +++       
Subjt:  QEAEIVEKLDKEEQGAELEDPSSILQDLRASLKADLMDIYRKKERDLIQKSKLNWLHLGDENTRFFRFLA-AKKRKNLIAELVNDQGIPTNSAGCFP---

Query:  ------------------ANLEWQRVSVEQNNRLSSKFSRKEIRFALRGMGKNKVGGPDGFTVEFLNKFWDRIKDDFVALFNKFHENEKLNSCVKENVIC
                            L+W  +S E  +RL S F+ +EI  A+  M ++   GPDGFT+      WD IK+D V +F++FH +  +N     + I 
Subjt:  ------------------ANLEWQRVSVEQNNRLSSKFSRKEIRFALRGMGKNKVGGPDGFTVEFLNKFWDRIKDDFVALFNKFHENEKLNSCVKENVIC

Query:  LIKEKEDAIMVKDFRPISLTTLTYKVIAKVLAERLKLVMPSIIAPTQSAFIEGRQILDPILIANELVEEYRIKKKKGWILKLDLEKAFDKVDWGFLEKVL
        L+ +K  A  + ++RPISL T  YK+IAKVLA RL+ ++   I  TQ AF++GRQILD +LIANE+V+E +   ++G + K+D EKA+D V W FL+ V+
Subjt:  LIKEKEDAIMVKDFRPISLTTLTYKVIAKVLAERLKLVMPSIIAPTQSAFIEGRQILDPILIANELVEEYRIKKKKGWILKLDLEKAFDKVDWGFLEKVL

Query:  QGKNFDSKWTSWTLGCIKNPKFSIFINRRPRGRVQASRGVRQGDPLSPFIFLLVSEVLSSLISRLHKSKKFEGFIVGKKKVHVPILQFADDILLFCKYDF
        + K F+     W   C+ +  F+I +N   +G V+  RG+RQGDPLSPF+F +V++V S ++ R  +   FEGF VG+ +  V  LQFADD + F     
Subjt:  QGKNFDSKWTSWTLGCIKNPKFSIFINRRPRGRVQASRGVRQGDPLSPFIFLLVSEVLSSLISRLHKSKKFEGFIVGKKKVHVPILQFADDILLFCKYDF

Query:  DMLEALRKTIEFFEWCSGQKVNWEKSALCGLNIDDLEVKLTAAKLNCKAEKLPLMYLGLPLGGHPKMIVFWQPILDKIQGKLSKWKRYNLSRGGRLTQRK
        + L  L+  +  F   SG KVN +KS + G+N+    +   A  L+CKA   P++YLGLPLGG+PK   FW P++++I  +L  W++  LS GGR+T  +
Subjt:  DMLEALRKTIEFFEWCSGQKVNWEKSALCGLNIDDLEVKLTAAKLNCKAEKLPLMYLGLPLGGHPKMIVFWQPILDKIQGKLSKWKRYNLSRGGRLTQRK

Query:  TVLSNLPSSYMSIFLMPEKVVLSIERAMRNFFWEGTGGSKLNHLARWETVTKYHKDGGLRLENLKIRNLALLSKWGWRFMQESEALWCNVITSIHGEDHF
        + L+++P  ++S+F +P  V   IER  R+F W G G  K +HL  W+ V K    GGL L  + +RN ALL KW WR+ +E  ALW  VI SI+G    
Subjt:  TVLSNLPSSYMSIFLMPEKVVLSIERAMRNFFWEGTGGSKLNHLARWETVTKYHKDGGLRLENLKIRNLALLSKWGWRFMQESEALWCNVITSIHGEDHF

Query:  QWHTNRKEVASLRSPWISISRQWQKIEAFAIFKVGDGRRITFWSDPWIEDQPLKVRFPRLFKLALKTTGTVADHWDPCSS-SWDLLFKRQLKEEELGEFL
         W  N     S R PW +I++ +Q    F  F VGDG RI FW D W  DQ L VRFPRL ++ +     ++         SW+  F+R L + E+ +  
Subjt:  QWHTNRKEVASLRSPWISISRQWQKIEAFAIFKVGDGRRITFWSDPWIEDQPLKVRFPRLFKLALKTTGTVADHWDPCSS-SWDLLFKRQLKEEELGEFL

Query:  SLSSSVANKRVMVQ-PDKRIWALEGNGSFSIKSLTNHLSVASPIDDEL-VKGLWKSKCPRRVNILIWTMLFGSLNCAATIQRKLPYHCLSPHMCVLCRQD
        SL  S+ +  +    PDKR W+L  +G F++KS    LS  S +      K +W S+ P ++   +W +    +N    +Q + PY  LSP +C+LC + 
Subjt:  SLSSSVANKRVMVQ-PDKRIWALEGNGSFSIKSLTNHLSVASPIDDEL-VKGLWKSKCPRRVNILIWTMLFGSLNCAATIQRKLPYHCLSPHMCVLCRQD

Query:  QEDIQHLFFGCSYASSCWSRLFGFFDLSWVMGSDFKSNVLQILLGPRLNKEPKLLWVNAVKALLSEIWFERNQHVFNDKASNWEDRFDQARLNASSWCTL
         E + HLF  CS     W RLF    + WV        +     G   +K   +LW  A  A+L  +W ERN  +F DK+ N E+ +D     AS W + 
Subjt:  QEDIQHLFFGCSYASSCWSRLFGFFDLSWVMGSDFKSNVLQILLGPRLNKEPKLLWVNAVKALLSEIWFERNQHVFNDKASNWEDRFDQARLNASSWCTL

Query:  SKTFKDYSIQELALN
        SK FK   +  + L+
Subjt:  SKTFKDYSIQELALN

RVW16209.1 Transposon TX1 uncharacterized 149 kDa protein [Vitis vinifera]3.6e-22135.22Show/hide
Query:  FSKKIVSWNIRGLGDYSKQLAVKHLIMKTNPRIV------------EAIKKLWSSRDIRWSFVEAYGRSGGLLIMWDESKISVIETLKGGYTLSVKCKTL
        F  KI+SWN+RGLG  +K+  +K  +   NP +V              +  +W+ R+  W  + A G SGG+LI+WD   +S  E + G +++SVK    
Subjt:  FSKKIVSWNIRGLGDYSKQLAVKHLIMKTNPRIV------------EAIKKLWSSRDIRWSFVEAYGRSGGLLIMWDESKISVIETLKGGYTLSVKCKTL

Query:  RRKVCWVTNVYRPTDYKERRRIWPELQALAAYCTNAWCLGGDFNITRAIHERVPTGRFR-GMKKFNKFIEEAHLMEIPMSNGRFTWS--REGNIISRSLL
             W++ VY P     R+  W EL  +       WC+GGDFN+ R   E++        M+ F+ FI E  L++ P+ N  FTWS  +E  +  R  L
Subjt:  RRKVCWVTNVYRPTDYKERRRIWPELQALAAYCTNAWCLGGDFNITRAIHERVPTGRFR-GMKKFNKFIEEAHLMEIPMSNGRFTWS--REGNIISRSLL

Query:  DRFLVTKEWDKAFEGTRVFRQARIISDHFPLLLEVGALEWGPSPFRFCNSWLLNSQCCSTIIRTLEAGYHQ-GWAGFVIFAKLRSVKTSLQQWHADSLIN
        DRFL + EW   F         R  SDH+P++++     WGP+PFRF N WL ++       R   +G+   GW G     +L+ VK  L++W+  S   
Subjt:  DRFLVTKEWDKAFEGTRVFRQARIISDHFPLLLEVGALEWGPSPFRFCNSWLLNSQCCSTIIRTLEAGYHQ-GWAGFVIFAKLRSVKTSLQQWHADSLIN

Query:  QRKQEAEIVEKLDKEEQGAELEDPSSILQDLRASLKADLMDIYRKKERDLIQKSKLNWLHLGDENTRFFRFLA-AKKRKNLIAELVNDQGIPTNSAGCFP
                                     +L+   K +L ++  ++E    QK+K+ W+  GD N++F+  +A  ++ +  I EL N++G+   +A    
Subjt:  QRKQEAEIVEKLDKEEQGAELEDPSSILQDLRASLKADLMDIYRKKERDLIQKSKLNWLHLGDENTRFFRFLA-AKKRKNLIAELVNDQGIPTNSAGCFP

Query:  ---------------------ANLEWQRVSVEQNNRLSSKFSRKEIRFALRGMGKNKVGGPDGFTVEFLNKFWDRIKDDFVALFNKFHENEKLNSCVKEN
                               L+W  +S E   RL S F+ +EI  A+  + ++K  GPDGFT+    + WD IK+D V +F +FH +  +N     +
Subjt:  ---------------------ANLEWQRVSVEQNNRLSSKFSRKEIRFALRGMGKNKVGGPDGFTVEFLNKFWDRIKDDFVALFNKFHENEKLNSCVKEN

Query:  VICLIKEKEDAIMVKDFRPISLTTLTYKVIAKVLAERLKLVMPSIIAPTQSAFIEGRQILDPILIANELVEEYRIKKKKGWILKLDLEKAFDKVDWGFLE
         I LI +K  +  + DFRPISL T  YK+IAKVL+ RL+ V+   I  TQ AF++GRQILD +LIANE+V+E R   ++G + K+D EKA+D V W FL+
Subjt:  VICLIKEKEDAIMVKDFRPISLTTLTYKVIAKVLAERLKLVMPSIIAPTQSAFIEGRQILDPILIANELVEEYRIKKKKGWILKLDLEKAFDKVDWGFLE

Query:  KVLQGKNFDSKWTSWTLGCIKNPKFSIFINRRPRGRVQASRGVRQGDPLSPFIFLLVSEVLSSLISRLHKSKKFEGFIVGKKKVHVPILQFADDILLFCK
         +L+ K F  +W  W  GC+ +  F+I +N   +G V+ASRG+RQGDPLSPF+F LV++VLS ++ R  +    EGF VG+ +  V  LQFADD + F  
Subjt:  KVLQGKNFDSKWTSWTLGCIKNPKFSIFINRRPRGRVQASRGVRQGDPLSPFIFLLVSEVLSSLISRLHKSKKFEGFIVGKKKVHVPILQFADDILLFCK

Query:  YDFDMLEALRKTIEFFEWCSGQKVNWEKSALCGLNIDDLEVKLTAAKLNCKAEKLPLMYLGLPLGGHPKMIVFWQPILDKIQGKLSKWKRYNLSRGGRLT
           + L+ L+  +  F    G KVN  KS++ G+N+D   +   A  L+CKA   P++YLGLPLGG+PK   FW P++++I  +L  W++  LS GGR+T
Subjt:  YDFDMLEALRKTIEFFEWCSGQKVNWEKSALCGLNIDDLEVKLTAAKLNCKAEKLPLMYLGLPLGGHPKMIVFWQPILDKIQGKLSKWKRYNLSRGGRLT

Query:  QRKTVLSNLPSSYMSIFLMPEKVVLSIERAMRNFFWEGTGGSKLNHLARWETVTKYHKDGGLRLENLKIRNLALLSKWGWRFMQESEALWCNVITSIHGE
          ++ L++LPS ++S+F MP  V   IER  R+F W G G  K +HL RW+ V +    GGL L N+  RNLALL KW WR+ +E  ALW  VI SI+G 
Subjt:  QRKTVLSNLPSSYMSIFLMPEKVVLSIERAMRNFFWEGTGGSKLNHLARWETVTKYHKDGGLRLENLKIRNLALLSKWGWRFMQESEALWCNVITSIHGE

Query:  DHFQWHTNRKEVASLRSPWISISRQWQKIEAFAIFKVGDGRRITFWSDPWIEDQPLKVRFPRLFKLALKTTGTVADHWDPCSS-SWDLLFKRQLKEEELG
            W  N     S R PW +I++ +Q+      +  G+G RI FW D W  DQPL  ++PRLF++ +    +++    P     W+L F+R L + E+ 
Subjt:  DHFQWHTNRKEVASLRSPWISISRQWQKIEAFAIFKVGDGRRITFWSDPWIEDQPLKVRFPRLFKLALKTTGTVADHWDPCSS-SWDLLFKRQLKEEELG

Query:  EFLSLSSSVANKRVMVQ-PDKRIWALEGNGSFSIKSLTNHLSVASPIDDEL-VKGLWKSKCPRRVNILIWTMLFGSLNCAATIQRKLPYHCLSPHMCVLC
        +   L  S+ +  +    PD ++W L  +G FS+KS    LS +S        K +W S+ P +V   +W +    +N    +Q + PY  LSP +C+LC
Subjt:  EFLSLSSSVANKRVMVQ-PDKRIWALEGNGSFSIKSLTNHLSVASPIDDEL-VKGLWKSKCPRRVNILIWTMLFGSLNCAATIQRKLPYHCLSPHMCVLC

Query:  RQDQEDIQHLFFGCSYASSCWSRLFGFFDLSWVMGSDFKSNVLQILLGPRLNKEPKLLWVNAVKALLSEIWFERNQHVFNDKASNWEDRFDQARLNASSW
         +  E   HLF  CS     W RLF    L WV        +     G   +K   +LW  A  AL+  +W+ERN  +F DKA N E  +D     AS W
Subjt:  RQDQEDIQHLFFGCSYASSCWSRLFGFFDLSWVMGSDFKSNVLQILLGPRLNKEPKLLWVNAVKALLSEIWFERNQHVFNDKASNWEDRFDQARLNASSW

Query:  CTLSKTFKDYSIQELALN
           SK FK   +  + L+
Subjt:  CTLSKTFKDYSIQELALN

RVW64408.1 LINE-1 retrotransposable element ORF2 protein [Vitis vinifera]2.8e-22934.43Show/hide
Query:  NPDLLEVSCANSQVPQQSGNRSALPLSSSQIFHQSKI----LIPGSNILFLRGSPSSYRPKPNKIQDLEEESLISISSDELNEPEE----------DENH
        NP++L  S  +S +P +    S +PL SS +  +S      ++  S ++ +        P     ++ EE  L     D L+ PE+          +  H
Subjt:  NPDLLEVSCANSQVPQQSGNRSALPLSSSQIFHQSKI----LIPGSNILFLRGSPSSYRPKPNKIQDLEEESLISISSDELNEPEE----------DENH

Query:  LSLVLDESIEGNKSIADHLSSMLNEKIPHHLRSFVEDCGISLKGK-LFSKKIVSWNIRGLGDYSKQLAVKHLIMKTNPRIV------------EAIKKLW
         SL L E  +        +     EK     R  +   G    G  L   KI+SWN RGLG   K+  V+  +   NP IV              +  +W
Subjt:  LSLVLDESIEGNKSIADHLSSMLNEKIPHHLRSFVEDCGISLKGK-LFSKKIVSWNIRGLGDYSKQLAVKHLIMKTNPRIV------------EAIKKLW

Query:  SSRDIRWSFVEAYGRSGGLLIMWDESKISVIETLKGGYTLSVKCKTLRRKVCWVTNVYRPTDYKERRRIWPELQALAAYCTNAWCLGGDFNITRAIHERV
          + + W+ + A G SGG++I+WD SK+   E + G ++++VK  +      W+T+VY P +   R+  W ELQ L       WC+GGDFN+ R I E++
Subjt:  SSRDIRWSFVEAYGRSGGLLIMWDESKISVIETLKGGYTLSVKCKTLRRKVCWVTNVYRPTDYKERRRIWPELQALAAYCTNAWCLGGDFNITRAIHERV

Query:  PTGRFR-GMKKFNKFIEEAHLMEIPMSNGRFTWSREGNIISRSLLDRFLVTKEWDKAFEGTRVFRQARIISDHFPLLLEVGALEWGPSPFRFCNSWLLNS
           R    M+ F++FI E+ L++ P+ N  FTWS          LDRFL + EWD  F  +      R  SDH P+ LE   L+WGP+PFRF N WLL+ 
Subjt:  PTGRFR-GMKKFNKFIEEAHLMEIPMSNGRFTWSREGNIISRSLLDRFLVTKEWDKAFEGTRVFRQARIISDHFPLLLEVGALEWGPSPFRFCNSWLLNS

Query:  QCCSTIIRTLEAGYHQGWAGFVIFAKLRSVKTSLQQWHADSLINQRKQEAEIVEKLDKEEQGAELEDPSSILQDL---RASLKADLMDIYRKKERDLIQK
        +              +GW G     KL+ VK+ L++W+  +  + ++++  I+  L + +    +E   ++  DL   R   + +L D+  K+E    QK
Subjt:  QCCSTIIRTLEAGYHQGWAGFVIFAKLRSVKTSLQQWHADSLINQRKQEAEIVEKLDKEEQGAELEDPSSILQDL---RASLKADLMDIYRKKERDLIQK

Query:  SKLNWLHLGDENTRFFRFLAAKKR-KNLIAELVNDQGIPTNSAG-------CFPANL--------------EWQRVSVEQNNRLSSKFSRKEIRFALRGM
        S++ W+  GD N++FF  +A  +R +  I  L++++G   N+          F  NL              +W  +S E    L   F+ +E+R A+  +
Subjt:  SKLNWLHLGDENTRFFRFLAAKKR-KNLIAELVNDQGIPTNSAG-------CFPANL--------------EWQRVSVEQNNRLSSKFSRKEIRFALRGM

Query:  GKNKVGGPDGFTVEFLNKFWDRIKDDFVALFNKFHENEKLNSCVKENVICLIKEKEDAIMVKDFRPISLTTLTYKVIAKVLAERLKLVMPSIIAPTQSAF
         K K  GPDGFT+    + WD IK+D + +F +FH N  +N       I L+ +K  ++ + D+RPISL T  YK+IAKVL+ RL+ V+   I+ +Q AF
Subjt:  GKNKVGGPDGFTVEFLNKFWDRIKDDFVALFNKFHENEKLNSCVKENVICLIKEKEDAIMVKDFRPISLTTLTYKVIAKVLAERLKLVMPSIIAPTQSAF

Query:  IEGRQILDPILIANELVEEYRIKKKKGWILKLDLEKAFDKVDWGFLEKVLQGKNFDSKWTSWTLGCIKNPKFSIFINRRPRGRVQASRGVRQGDPLSPFI
        +EGR ILD +LIANE+V+E R   ++G + K+D EKA+D VDWGFL+ VLQ K F  KW  W  GC+ +  F+I +N   +G V+ASRG+RQGDPLSPF+
Subjt:  IEGRQILDPILIANELVEEYRIKKKKGWILKLDLEKAFDKVDWGFLEKVLQGKNFDSKWTSWTLGCIKNPKFSIFINRRPRGRVQASRGVRQGDPLSPFI

Query:  FLLVSEVLSSLISRLHKSKKFEGFIVGKKKVHVPILQFADDILLFCKYDFDMLEALRKTIEFFEWCSGQKVNWEKSALCGLNIDDLEVKLTAAKLNCKAE
        F LV++VLS ++ R  ++   EGF VG+ +  V +LQFADD + F K   + L+ L+  +  F   SG K+N EKS + G+N     +   A+  +C+  
Subjt:  FLLVSEVLSSLISRLHKSKKFEGFIVGKKKVHVPILQFADDILLFCKYDFDMLEALRKTIEFFEWCSGQKVNWEKSALCGLNIDDLEVKLTAAKLNCKAE

Query:  KLPLMYLGLPLGGHPKMIVFWQPILDKIQGKLSKWKRYNLSRGGRLTQRKTVLSNLPSSYMSIFLMPEKVVLSIERAMRNFFWEGTGGSKLNHLARWETV
        + PL YLGLPLGG+PK I FW P++++I  +L  WK+  LS GGR+T  ++ LS++PS ++S+F +P  +   IE+  RNF W G G  K +HL RWE V
Subjt:  KLPLMYLGLPLGGHPKMIVFWQPILDKIQGKLSKWKRYNLSRGGRLTQRKTVLSNLPSSYMSIFLMPEKVVLSIERAMRNFFWEGTGGSKLNHLARWETV

Query:  TKYHKDGGLRLENLKIRNLALLSKWGWRFMQESEALWCNVITSIHGEDHFQWHTNRKEVASLRSPWISISRQWQKIEAFAIFKVGDGRRITFWSDPWIED
        ++  + GGL    + +RN+ALL KW WRF +E   LW  VI SI+G     W  N     S R PW +I++ +Q+   F    VG+G RI FW D W  +
Subjt:  TKYHKDGGLRLENLKIRNLALLSKWGWRFMQESEALWCNVITSIHGEDHFQWHTNRKEVASLRSPWISISRQWQKIEAFAIFKVGDGRRITFWSDPWIED

Query:  QPLKVRFPRLFKLALKTTGTVAD-HWDPCSSSWDLLFKRQLKEEELGEFLSLSSSVANKRVMVQ-PDKRIWALEGNGSFSIKSLTNHLS-VASPIDDELV
        Q L  +F  L+++      TV++   +    +W+L F+R L + E+     L SS+++ R      D R W+L  +G F++KS    LS V++PI     
Subjt:  QPLKVRFPRLFKLALKTTGTVAD-HWDPCSSSWDLLFKRQLKEEELGEFLSLSSSVANKRVMVQ-PDKRIWALEGNGSFSIKSLTNHLS-VASPIDDELV

Query:  KGLWKSKCPRRVNILIWTMLFGSLNCAATIQRKLPYHCLSPHMCVLCRQDQEDIQHLFFGCSYASSCWSRLFGFFDLSWVMGSDFKSNVLQILLGPRLNK
        K LW SK P +V  L W +  G +N    +Q + PY  L P  C+LC+ + E I HLF  C      W++LF    L WV    F+  ++    G   + 
Subjt:  KGLWKSKCPRRVNILIWTMLFGSLNCAATIQRKLPYHCLSPHMCVLCRQDQEDIQHLFFGCSYASSCWSRLFGFFDLSWVMGSDFKSNVLQILLGPRLNK

Query:  EPKLLWVNAVKALLSEIWFERNQHVFNDKASNWEDRFDQARLNASSWCTLSKTFKDYSIQELALN
          K LW  A   L+  +W ERN+ +F DK  + E  +D     ++ W + S  F+   +  + LN
Subjt:  EPKLLWVNAVKALLSEIWFERNQHVFNDKASNWEDRFDQARLNASSWCTLSKTFKDYSIQELALN

RVW70235.1 LINE-1 retrotransposable element ORF2 protein [Vitis vinifera]6.3e-22633.94Show/hide
Query:  ALTNPDLLEVSCANSQVPQQSGNRSALPLSSSQIFHQSKILIPGSNILFLRGSPSSYRPKPNKIQDLEEESLISISSDELNEPEEDENHLSLVLDESIEG
        +L NP  + + C ++ +   S ++S  P+ +             S     +G+   + P    I  LE E+ +   +D   +  E  N LS  L    + 
Subjt:  ALTNPDLLEVSCANSQVPQQSGNRSALPLSSSQIFHQSKILIPGSNILFLRGSPSSYRPKPNKIQDLEEESLISISSDELNEPEEDENHLSLVLDESIEG

Query:  NKSIADHLSSMLNEKIPH-HLRSFVEDCGISLKGKLFSKKIVSWNIRGLGDYSKQLAVKHLIMKTNPRIV------------EAIKKLWSSRDIRWSFVE
        +     HL  +  E+    H    +   G  ++   F  KI+SWN RGLG   K+  VK  +    P +V              +  +W++R+  W+ + 
Subjt:  NKSIADHLSSMLNEKIPH-HLRSFVEDCGISLKGKLFSKKIVSWNIRGLGDYSKQLAVKHLIMKTNPRIV------------EAIKKLWSSRDIRWSFVE

Query:  AYGRSGGLLIMWDESKISVIETLKGGYTLSVKCKTLRRKVCWVTNVYRPTDYKERRRIWPELQALAAYCTNAWCLGGDFNITRAIHERVPTGRFR-GMKK
        A G SGG+LI+WD  K+S  E + G +++S+K      +  W++ VY P +   R+ +W EL  +A   +  WC+GGDFN+ R   E++   R    MK 
Subjt:  AYGRSGGLLIMWDESKISVIETLKGGYTLSVKCKTLRRKVCWVTNVYRPTDYKERRRIWPELQALAAYCTNAWCLGGDFNITRAIHERVPTGRFR-GMKK

Query:  FNKFIEEAHLMEIPMSNGRFTWSR-EGNIISRSLLDRFLVTKEWDKAFEGTRVFRQARIISDHFPLLLEVGALEWGPSPFRFCNSWLLNSQCCSTIIRTL
        F+ FI +  L+++P+ +  FTWS  + N + +  LDRFL + EW++ F  +      R  SDH+P++LE    +WGP+PFRF N WL +        R  
Subjt:  FNKFIEEAHLMEIPMSNGRFTWSR-EGNIISRSLLDRFLVTKEWDKAFEGTRVFRQARIISDHFPLLLEVGALEWGPSPFRFCNSWLLNSQCCSTIIRTL

Query:  EAGYHQGWAGFVIFAKLRSVKTSLQQWHADSLINQRKQEAEIVEKL---DKEEQGAELEDPSSILQDLRASLKADLMDIYRKKERDLIQKSKLNWLHLGD
              GW G     KL+ VK  L+ W+  S     K++ +I+  L   D  EQ   L   S  L   RA  K +L ++  ++E    QK+++ W+  GD
Subjt:  EAGYHQGWAGFVIFAKLRSVKTSLQQWHADSLINQRKQEAEIVEKL---DKEEQGAELEDPSSILQDLRASLKADLMDIYRKKERDLIQKSKLNWLHLGD

Query:  ENTRFFRFLA-AKKRKNLIAELVNDQGIPTNSAGCFP---------------------ANLEWQRVSVEQNNRLSSKFSRKEIRFALRGMGKNKVGGPDG
         N++FF  +A  ++ +  I EL N+ G   N++                           L+W  +S E   RL S F+ +EI  A+  M ++K  GPDG
Subjt:  ENTRFFRFLA-AKKRKNLIAELVNDQGIPTNSAGCFP---------------------ANLEWQRVSVEQNNRLSSKFSRKEIRFALRGMGKNKVGGPDG

Query:  FTVEFLNKFWDRIKDDFVALFNKFHENEKLNSCVKENVICLIKEKEDAIMVKDFRPISLTTLTYKVIAKVLAERLKLVMPSIIAPTQSAFIEGRQILDPI
        FT+      W+ IK+D V +F +FH +  +N     + I L+ +K  +  + DFRPISL T  YK+IAKVLA R++ V+   I  TQ AF++GRQILD +
Subjt:  FTVEFLNKFWDRIKDDFVALFNKFHENEKLNSCVKENVICLIKEKEDAIMVKDFRPISLTTLTYKVIAKVLAERLKLVMPSIIAPTQSAFIEGRQILDPI

Query:  LIANELVEEYRIKKKKGWILKLDLEKAFDKVDWGFLEKVLQGKNFDSKWTSWTLGCIKNPKFSIFINRRPRGRVQASRGVRQGDPLSPFIFLLVSEVLSS
        LIANE+V+E R   ++G + K+D EKA+D V W FL+ V++ K F  +W  W  GC+ +  F++ +N   +G V+ASRG+RQGDPLSPF+F +V++VLS 
Subjt:  LIANELVEEYRIKKKKGWILKLDLEKAFDKVDWGFLEKVLQGKNFDSKWTSWTLGCIKNPKFSIFINRRPRGRVQASRGVRQGDPLSPFIFLLVSEVLSS

Query:  LISRLHKSKKFEGFIVGKKKVHVPILQFADDILLFCKYDFDMLEALRKTIEFFEWCSGQKVNWEKSALCGLNIDDLEVKLTAAKLNCKAEKLPLMYLGLP
        ++ +  +    EGF VG+ +  V  LQFADD + F     + +  L+  +  F   SG KVN +KS + G+N++   +   A  L+CKA   P++YLGLP
Subjt:  LISRLHKSKKFEGFIVGKKKVHVPILQFADDILLFCKYDFDMLEALRKTIEFFEWCSGQKVNWEKSALCGLNIDDLEVKLTAAKLNCKAEKLPLMYLGLP

Query:  LGGHPKMIVFWQPILDKIQGKLSKWKRYNLSRGGRLTQRKTVLSNLPSSYMSIFLMPEKVVLSIERAMRNFFWEGTGGSKLNHLARWETVTKYHKDGGLR
        LGG+PK   FW P++++I  +L  W++  LS GGR+T  ++ L+++P  ++S+F +P  V   IER  R+F W G G  K +HL  W+ V K    GGL 
Subjt:  LGGHPKMIVFWQPILDKIQGKLSKWKRYNLSRGGRLTQRKTVLSNLPSSYMSIFLMPEKVVLSIERAMRNFFWEGTGGSKLNHLARWETVTKYHKDGGLR

Query:  LENLKIRNLALLSKWGWRFMQESEALWCNVITSIHGEDHFQWHTNRKEVASLRSPWISISRQWQKIEAFAIFKVGDGRRITFWSDPWIEDQPLKVRFPRL
           + IRN+ALL KW WR+ +E  ALW  VI SI+G     W  N     S R PW +I+  +Q+   F  F VG+G RI FW D W  +QPL V++PRL
Subjt:  LENLKIRNLALLSKWGWRFMQESEALWCNVITSIHGEDHFQWHTNRKEVASLRSPWISISRQWQKIEAFAIFKVGDGRRITFWSDPWIEDQPLKVRFPRL

Query:  FKLALKTTGTVADHWDPCSS--------SWDLLFKRQLKEEELGEFLSLSSSVANKRVMVQ-PDKRIWALEGNGSFSIKSL---TNHLSVASPIDDELVK
         ++       V D   P SS        SW+  F+R L + E+ +   L  S     +    PDKR W+L  +G F++KS     +  SV+ PI     K
Subjt:  FKLALKTTGTVADHWDPCSS--------SWDLLFKRQLKEEELGEFLSLSSSVANKRVMVQ-PDKRIWALEGNGSFSIKSL---TNHLSVASPIDDELVK

Query:  GLWKSKCPRRVNILIWTMLFGSLNCAATIQRKLPYHCLSPHMCVLCRQDQEDIQHLFFGCSYASSCWSRLFGFFDLSWVMGSDFKSNVLQILLGPRLNKE
         +W ++ P +V   +W +    +N    +Q + PY  LSP +C LC +  E + HLF  CS     W RLF    + WV        +     G   +K 
Subjt:  GLWKSKCPRRVNILIWTMLFGSLNCAATIQRKLPYHCLSPHMCVLCRQDQEDIQHLFFGCSYASSCWSRLFGFFDLSWVMGSDFKSNVLQILLGPRLNKE

Query:  PKLLWVNAVKALLSEIWFERNQHVFNDKASNWEDRFDQARLNASSWCTLSKTFKDYSIQELALN
          +LW NA  AL+  +W ERN  +F DKA N E  +D      S W   SK FK   +  L L+
Subjt:  PKLLWVNAVKALLSEIWFERNQHVFNDKASNWEDRFDQARLNASSWCTLSKTFKDYSIQELALN

TrEMBL top hitse value%identityAlignment
A0A438FWU5 LINE-1 retrotransposable element ORF2 protein1.3e-22934.43Show/hide
Query:  NPDLLEVSCANSQVPQQSGNRSALPLSSSQIFHQSKI----LIPGSNILFLRGSPSSYRPKPNKIQDLEEESLISISSDELNEPEE----------DENH
        NP++L  S  +S +P +    S +PL SS +  +S      ++  S ++ +        P     ++ EE  L     D L+ PE+          +  H
Subjt:  NPDLLEVSCANSQVPQQSGNRSALPLSSSQIFHQSKI----LIPGSNILFLRGSPSSYRPKPNKIQDLEEESLISISSDELNEPEE----------DENH

Query:  LSLVLDESIEGNKSIADHLSSMLNEKIPHHLRSFVEDCGISLKGK-LFSKKIVSWNIRGLGDYSKQLAVKHLIMKTNPRIV------------EAIKKLW
         SL L E  +        +     EK     R  +   G    G  L   KI+SWN RGLG   K+  V+  +   NP IV              +  +W
Subjt:  LSLVLDESIEGNKSIADHLSSMLNEKIPHHLRSFVEDCGISLKGK-LFSKKIVSWNIRGLGDYSKQLAVKHLIMKTNPRIV------------EAIKKLW

Query:  SSRDIRWSFVEAYGRSGGLLIMWDESKISVIETLKGGYTLSVKCKTLRRKVCWVTNVYRPTDYKERRRIWPELQALAAYCTNAWCLGGDFNITRAIHERV
          + + W+ + A G SGG++I+WD SK+   E + G ++++VK  +      W+T+VY P +   R+  W ELQ L       WC+GGDFN+ R I E++
Subjt:  SSRDIRWSFVEAYGRSGGLLIMWDESKISVIETLKGGYTLSVKCKTLRRKVCWVTNVYRPTDYKERRRIWPELQALAAYCTNAWCLGGDFNITRAIHERV

Query:  PTGRFR-GMKKFNKFIEEAHLMEIPMSNGRFTWSREGNIISRSLLDRFLVTKEWDKAFEGTRVFRQARIISDHFPLLLEVGALEWGPSPFRFCNSWLLNS
           R    M+ F++FI E+ L++ P+ N  FTWS          LDRFL + EWD  F  +      R  SDH P+ LE   L+WGP+PFRF N WLL+ 
Subjt:  PTGRFR-GMKKFNKFIEEAHLMEIPMSNGRFTWSREGNIISRSLLDRFLVTKEWDKAFEGTRVFRQARIISDHFPLLLEVGALEWGPSPFRFCNSWLLNS

Query:  QCCSTIIRTLEAGYHQGWAGFVIFAKLRSVKTSLQQWHADSLINQRKQEAEIVEKLDKEEQGAELEDPSSILQDL---RASLKADLMDIYRKKERDLIQK
        +              +GW G     KL+ VK+ L++W+  +  + ++++  I+  L + +    +E   ++  DL   R   + +L D+  K+E    QK
Subjt:  QCCSTIIRTLEAGYHQGWAGFVIFAKLRSVKTSLQQWHADSLINQRKQEAEIVEKLDKEEQGAELEDPSSILQDL---RASLKADLMDIYRKKERDLIQK

Query:  SKLNWLHLGDENTRFFRFLAAKKR-KNLIAELVNDQGIPTNSAG-------CFPANL--------------EWQRVSVEQNNRLSSKFSRKEIRFALRGM
        S++ W+  GD N++FF  +A  +R +  I  L++++G   N+          F  NL              +W  +S E    L   F+ +E+R A+  +
Subjt:  SKLNWLHLGDENTRFFRFLAAKKR-KNLIAELVNDQGIPTNSAG-------CFPANL--------------EWQRVSVEQNNRLSSKFSRKEIRFALRGM

Query:  GKNKVGGPDGFTVEFLNKFWDRIKDDFVALFNKFHENEKLNSCVKENVICLIKEKEDAIMVKDFRPISLTTLTYKVIAKVLAERLKLVMPSIIAPTQSAF
         K K  GPDGFT+    + WD IK+D + +F +FH N  +N       I L+ +K  ++ + D+RPISL T  YK+IAKVL+ RL+ V+   I+ +Q AF
Subjt:  GKNKVGGPDGFTVEFLNKFWDRIKDDFVALFNKFHENEKLNSCVKENVICLIKEKEDAIMVKDFRPISLTTLTYKVIAKVLAERLKLVMPSIIAPTQSAF

Query:  IEGRQILDPILIANELVEEYRIKKKKGWILKLDLEKAFDKVDWGFLEKVLQGKNFDSKWTSWTLGCIKNPKFSIFINRRPRGRVQASRGVRQGDPLSPFI
        +EGR ILD +LIANE+V+E R   ++G + K+D EKA+D VDWGFL+ VLQ K F  KW  W  GC+ +  F+I +N   +G V+ASRG+RQGDPLSPF+
Subjt:  IEGRQILDPILIANELVEEYRIKKKKGWILKLDLEKAFDKVDWGFLEKVLQGKNFDSKWTSWTLGCIKNPKFSIFINRRPRGRVQASRGVRQGDPLSPFI

Query:  FLLVSEVLSSLISRLHKSKKFEGFIVGKKKVHVPILQFADDILLFCKYDFDMLEALRKTIEFFEWCSGQKVNWEKSALCGLNIDDLEVKLTAAKLNCKAE
        F LV++VLS ++ R  ++   EGF VG+ +  V +LQFADD + F K   + L+ L+  +  F   SG K+N EKS + G+N     +   A+  +C+  
Subjt:  FLLVSEVLSSLISRLHKSKKFEGFIVGKKKVHVPILQFADDILLFCKYDFDMLEALRKTIEFFEWCSGQKVNWEKSALCGLNIDDLEVKLTAAKLNCKAE

Query:  KLPLMYLGLPLGGHPKMIVFWQPILDKIQGKLSKWKRYNLSRGGRLTQRKTVLSNLPSSYMSIFLMPEKVVLSIERAMRNFFWEGTGGSKLNHLARWETV
        + PL YLGLPLGG+PK I FW P++++I  +L  WK+  LS GGR+T  ++ LS++PS ++S+F +P  +   IE+  RNF W G G  K +HL RWE V
Subjt:  KLPLMYLGLPLGGHPKMIVFWQPILDKIQGKLSKWKRYNLSRGGRLTQRKTVLSNLPSSYMSIFLMPEKVVLSIERAMRNFFWEGTGGSKLNHLARWETV

Query:  TKYHKDGGLRLENLKIRNLALLSKWGWRFMQESEALWCNVITSIHGEDHFQWHTNRKEVASLRSPWISISRQWQKIEAFAIFKVGDGRRITFWSDPWIED
        ++  + GGL    + +RN+ALL KW WRF +E   LW  VI SI+G     W  N     S R PW +I++ +Q+   F    VG+G RI FW D W  +
Subjt:  TKYHKDGGLRLENLKIRNLALLSKWGWRFMQESEALWCNVITSIHGEDHFQWHTNRKEVASLRSPWISISRQWQKIEAFAIFKVGDGRRITFWSDPWIED

Query:  QPLKVRFPRLFKLALKTTGTVAD-HWDPCSSSWDLLFKRQLKEEELGEFLSLSSSVANKRVMVQ-PDKRIWALEGNGSFSIKSLTNHLS-VASPIDDELV
        Q L  +F  L+++      TV++   +    +W+L F+R L + E+     L SS+++ R      D R W+L  +G F++KS    LS V++PI     
Subjt:  QPLKVRFPRLFKLALKTTGTVAD-HWDPCSSSWDLLFKRQLKEEELGEFLSLSSSVANKRVMVQ-PDKRIWALEGNGSFSIKSLTNHLS-VASPIDDELV

Query:  KGLWKSKCPRRVNILIWTMLFGSLNCAATIQRKLPYHCLSPHMCVLCRQDQEDIQHLFFGCSYASSCWSRLFGFFDLSWVMGSDFKSNVLQILLGPRLNK
        K LW SK P +V  L W +  G +N    +Q + PY  L P  C+LC+ + E I HLF  C      W++LF    L WV    F+  ++    G   + 
Subjt:  KGLWKSKCPRRVNILIWTMLFGSLNCAATIQRKLPYHCLSPHMCVLCRQDQEDIQHLFFGCSYASSCWSRLFGFFDLSWVMGSDFKSNVLQILLGPRLNK

Query:  EPKLLWVNAVKALLSEIWFERNQHVFNDKASNWEDRFDQARLNASSWCTLSKTFKDYSIQELALN
          K LW  A   L+  +W ERN+ +F DK  + E  +D     ++ W + S  F+   +  + LN
Subjt:  EPKLLWVNAVKALLSEIWFERNQHVFNDKASNWEDRFDQARLNASSWCTLSKTFKDYSIQELALN

A0A438GDE7 LINE-1 retrotransposable element ORF2 protein3.1e-22633.94Show/hide
Query:  ALTNPDLLEVSCANSQVPQQSGNRSALPLSSSQIFHQSKILIPGSNILFLRGSPSSYRPKPNKIQDLEEESLISISSDELNEPEEDENHLSLVLDESIEG
        +L NP  + + C ++ +   S ++S  P+ +             S     +G+   + P    I  LE E+ +   +D   +  E  N LS  L    + 
Subjt:  ALTNPDLLEVSCANSQVPQQSGNRSALPLSSSQIFHQSKILIPGSNILFLRGSPSSYRPKPNKIQDLEEESLISISSDELNEPEEDENHLSLVLDESIEG

Query:  NKSIADHLSSMLNEKIPH-HLRSFVEDCGISLKGKLFSKKIVSWNIRGLGDYSKQLAVKHLIMKTNPRIV------------EAIKKLWSSRDIRWSFVE
        +     HL  +  E+    H    +   G  ++   F  KI+SWN RGLG   K+  VK  +    P +V              +  +W++R+  W+ + 
Subjt:  NKSIADHLSSMLNEKIPH-HLRSFVEDCGISLKGKLFSKKIVSWNIRGLGDYSKQLAVKHLIMKTNPRIV------------EAIKKLWSSRDIRWSFVE

Query:  AYGRSGGLLIMWDESKISVIETLKGGYTLSVKCKTLRRKVCWVTNVYRPTDYKERRRIWPELQALAAYCTNAWCLGGDFNITRAIHERVPTGRFR-GMKK
        A G SGG+LI+WD  K+S  E + G +++S+K      +  W++ VY P +   R+ +W EL  +A   +  WC+GGDFN+ R   E++   R    MK 
Subjt:  AYGRSGGLLIMWDESKISVIETLKGGYTLSVKCKTLRRKVCWVTNVYRPTDYKERRRIWPELQALAAYCTNAWCLGGDFNITRAIHERVPTGRFR-GMKK

Query:  FNKFIEEAHLMEIPMSNGRFTWSR-EGNIISRSLLDRFLVTKEWDKAFEGTRVFRQARIISDHFPLLLEVGALEWGPSPFRFCNSWLLNSQCCSTIIRTL
        F+ FI +  L+++P+ +  FTWS  + N + +  LDRFL + EW++ F  +      R  SDH+P++LE    +WGP+PFRF N WL +        R  
Subjt:  FNKFIEEAHLMEIPMSNGRFTWSR-EGNIISRSLLDRFLVTKEWDKAFEGTRVFRQARIISDHFPLLLEVGALEWGPSPFRFCNSWLLNSQCCSTIIRTL

Query:  EAGYHQGWAGFVIFAKLRSVKTSLQQWHADSLINQRKQEAEIVEKL---DKEEQGAELEDPSSILQDLRASLKADLMDIYRKKERDLIQKSKLNWLHLGD
              GW G     KL+ VK  L+ W+  S     K++ +I+  L   D  EQ   L   S  L   RA  K +L ++  ++E    QK+++ W+  GD
Subjt:  EAGYHQGWAGFVIFAKLRSVKTSLQQWHADSLINQRKQEAEIVEKL---DKEEQGAELEDPSSILQDLRASLKADLMDIYRKKERDLIQKSKLNWLHLGD

Query:  ENTRFFRFLA-AKKRKNLIAELVNDQGIPTNSAGCFP---------------------ANLEWQRVSVEQNNRLSSKFSRKEIRFALRGMGKNKVGGPDG
         N++FF  +A  ++ +  I EL N+ G   N++                           L+W  +S E   RL S F+ +EI  A+  M ++K  GPDG
Subjt:  ENTRFFRFLA-AKKRKNLIAELVNDQGIPTNSAGCFP---------------------ANLEWQRVSVEQNNRLSSKFSRKEIRFALRGMGKNKVGGPDG

Query:  FTVEFLNKFWDRIKDDFVALFNKFHENEKLNSCVKENVICLIKEKEDAIMVKDFRPISLTTLTYKVIAKVLAERLKLVMPSIIAPTQSAFIEGRQILDPI
        FT+      W+ IK+D V +F +FH +  +N     + I L+ +K  +  + DFRPISL T  YK+IAKVLA R++ V+   I  TQ AF++GRQILD +
Subjt:  FTVEFLNKFWDRIKDDFVALFNKFHENEKLNSCVKENVICLIKEKEDAIMVKDFRPISLTTLTYKVIAKVLAERLKLVMPSIIAPTQSAFIEGRQILDPI

Query:  LIANELVEEYRIKKKKGWILKLDLEKAFDKVDWGFLEKVLQGKNFDSKWTSWTLGCIKNPKFSIFINRRPRGRVQASRGVRQGDPLSPFIFLLVSEVLSS
        LIANE+V+E R   ++G + K+D EKA+D V W FL+ V++ K F  +W  W  GC+ +  F++ +N   +G V+ASRG+RQGDPLSPF+F +V++VLS 
Subjt:  LIANELVEEYRIKKKKGWILKLDLEKAFDKVDWGFLEKVLQGKNFDSKWTSWTLGCIKNPKFSIFINRRPRGRVQASRGVRQGDPLSPFIFLLVSEVLSS

Query:  LISRLHKSKKFEGFIVGKKKVHVPILQFADDILLFCKYDFDMLEALRKTIEFFEWCSGQKVNWEKSALCGLNIDDLEVKLTAAKLNCKAEKLPLMYLGLP
        ++ +  +    EGF VG+ +  V  LQFADD + F     + +  L+  +  F   SG KVN +KS + G+N++   +   A  L+CKA   P++YLGLP
Subjt:  LISRLHKSKKFEGFIVGKKKVHVPILQFADDILLFCKYDFDMLEALRKTIEFFEWCSGQKVNWEKSALCGLNIDDLEVKLTAAKLNCKAEKLPLMYLGLP

Query:  LGGHPKMIVFWQPILDKIQGKLSKWKRYNLSRGGRLTQRKTVLSNLPSSYMSIFLMPEKVVLSIERAMRNFFWEGTGGSKLNHLARWETVTKYHKDGGLR
        LGG+PK   FW P++++I  +L  W++  LS GGR+T  ++ L+++P  ++S+F +P  V   IER  R+F W G G  K +HL  W+ V K    GGL 
Subjt:  LGGHPKMIVFWQPILDKIQGKLSKWKRYNLSRGGRLTQRKTVLSNLPSSYMSIFLMPEKVVLSIERAMRNFFWEGTGGSKLNHLARWETVTKYHKDGGLR

Query:  LENLKIRNLALLSKWGWRFMQESEALWCNVITSIHGEDHFQWHTNRKEVASLRSPWISISRQWQKIEAFAIFKVGDGRRITFWSDPWIEDQPLKVRFPRL
           + IRN+ALL KW WR+ +E  ALW  VI SI+G     W  N     S R PW +I+  +Q+   F  F VG+G RI FW D W  +QPL V++PRL
Subjt:  LENLKIRNLALLSKWGWRFMQESEALWCNVITSIHGEDHFQWHTNRKEVASLRSPWISISRQWQKIEAFAIFKVGDGRRITFWSDPWIEDQPLKVRFPRL

Query:  FKLALKTTGTVADHWDPCSS--------SWDLLFKRQLKEEELGEFLSLSSSVANKRVMVQ-PDKRIWALEGNGSFSIKSL---TNHLSVASPIDDELVK
         ++       V D   P SS        SW+  F+R L + E+ +   L  S     +    PDKR W+L  +G F++KS     +  SV+ PI     K
Subjt:  FKLALKTTGTVADHWDPCSS--------SWDLLFKRQLKEEELGEFLSLSSSVANKRVMVQ-PDKRIWALEGNGSFSIKSL---TNHLSVASPIDDELVK

Query:  GLWKSKCPRRVNILIWTMLFGSLNCAATIQRKLPYHCLSPHMCVLCRQDQEDIQHLFFGCSYASSCWSRLFGFFDLSWVMGSDFKSNVLQILLGPRLNKE
         +W ++ P +V   +W +    +N    +Q + PY  LSP +C LC +  E + HLF  CS     W RLF    + WV        +     G   +K 
Subjt:  GLWKSKCPRRVNILIWTMLFGSLNCAATIQRKLPYHCLSPHMCVLCRQDQEDIQHLFFGCSYASSCWSRLFGFFDLSWVMGSDFKSNVLQILLGPRLNKE

Query:  PKLLWVNAVKALLSEIWFERNQHVFNDKASNWEDRFDQARLNASSWCTLSKTFKDYSIQELALN
          +LW NA  AL+  +W ERN  +F DKA N E  +D      S W   SK FK   +  L L+
Subjt:  PKLLWVNAVKALLSEIWFERNQHVFNDKASNWEDRFDQARLNASSWCTLSKTFKDYSIQELALN

A0A803P8A0 Uncharacterized protein7.6e-23336.41Show/hide
Query:  KIVSWNIRGLGDYSKQLAVKHLIMKTNPRIV------------EAIKKLWSSRDIRWSFVEAYGRSGGLLIMWDESKISVIETLKGGYTLSVKCKTLRRK
        KI++WNIRG GD  K+ A+K  I K NP +V              I  +W SR   W  + A GRSGG L++WD   ISV+++L G +++SV      ++
Subjt:  KIVSWNIRGLGDYSKQLAVKHLIMKTNPRIV------------EAIKKLWSSRDIRWSFVEAYGRSGGLLIMWDESKISVIETLKGGYTLSVKCKTLRRK

Query:  VCWVTNVYRPTDYKERRRIWPELQALAAYCTNAWCLGGDFNITRAIHERV-PTGRFRGMKKFNKFIEEAHLMEIPMSNGRFTWSREGNIISRSLLDRFLV
          W + VY P  YK R   W EL  L++ C  +WC+GGDFN+TR + E++  +   R MK F+  I E  L++  + NG FTWS    I   S LDRFL 
Subjt:  VCWVTNVYRPTDYKERRRIWPELQALAAYCTNAWCLGGDFNITRAIHERV-PTGRFRGMKKFNKFIEEAHLMEIPMSNGRFTWSREGNIISRSLLDRFLV

Query:  TKEWDKAFEGTRVFRQARIISDHFPLLLEVGALEWGPSPFRFCNSWLLNSQCCSTIIRTLEAGYHQGWAGFVIFAKLRSVKTSLQQWHADSLINQRKQEA
           W+  F   R     R++SDH P++++    +WGP PFRF N WL +           +     GW G     KL++++   ++W   +    +  + 
Subjt:  TKEWDKAFEGTRVFRQARIISDHFPLLLEVGALEWGPSPFRFCNSWLLNSQCCSTIIRTLEAGYHQGWAGFVIFAKLRSVKTSLQQWHADSLINQRKQEA

Query:  EIVEKLDKEEQGAELEDPSSILQDLRASLKADLMDIYRKKERDLIQKSKLNWLHLGDENTRFF-RFLAAKKRKNLIAELVNDQGIPTNS-----------
         +  +L   ++       +  L D R  LK +   +  ++ER +  KSK  W   GD N+RFF   L A+K +N I+ +  D G   +S           
Subjt:  EIVEKLDKEEQGAELEDPSSILQDLRASLKADLMDIYRKKERDLIQKSKLNWLHLGDENTRFF-RFLAAKKRKNLIAELVNDQGIPTNS-----------

Query:  ----------AGCFPANLEWQRVSVEQNNRLSSKFSRKEIRFALRGMGKNKVGGPDGFTVEFLNKFWDRIKDDFVALFNKFHENEKLNSCVKENVICLIK
                   G     +EWQ ++     +L   F   E+R  +     +K  GPDGF++      W+ IK++ + +F  FH   ++   + +  ICLI 
Subjt:  ----------AGCFPANLEWQRVSVEQNNRLSSKFSRKEIRFALRGMGKNKVGGPDGFTVEFLNKFWDRIKDDFVALFNKFHENEKLNSCVKENVICLIK

Query:  EKEDAIMVKDFRPISLTTLTYKVIAKVLAERLKLVMPSIIAPTQSAFIEGRQILDPILIANELVEEYRIKKKKGWILKLDLEKAFDKVDWGFLEKVLQGK
        ++ ++  VKDFRPISL T  YK+IAK LA RL+ V+   I+ TQSAF+EGRQILD +L+ANE VE+YR + KKG++LK+D EKA+D+VDWGFL+ VL+ K
Subjt:  EKEDAIMVKDFRPISLTTLTYKVIAKVLAERLKLVMPSIIAPTQSAFIEGRQILDPILIANELVEEYRIKKKKGWILKLDLEKAFDKVDWGFLEKVLQGK

Query:  NFDSKWTSWTLGCIKNPKFSIFINRRPRGRVQASRGVRQGDPLSPFIFLLVSEVLSSLISRLHKSKKFEGFIVGKKKVHVPILQFADDILLFCKYDFDML
         F  +W  W  GC+ +  FSIF+N R RG+   SRG+RQGDPLSPF+F LV++VL  ++ +  +++ F GF +GK  + +  LQFADD L F K D D L
Subjt:  NFDSKWTSWTLGCIKNPKFSIFINRRPRGRVQASRGVRQGDPLSPFIFLLVSEVLSSLISRLHKSKKFEGFIVGKKKVHVPILQFADDILLFCKYDFDML

Query:  EALRKTIEFFEWCSGQKVNWEKSALCGLNIDDLEVKLTAAKLNCKAEKLPLMYLGLPLGGHPKMIVFWQPILDKIQGKLSKWKRYNLSRGGRLTQRKTVL
        + L K +E F   SG KVN  KS L G+ + D  V   A  + C+  K P+ YLG+PLGG P+   FW+P+LDK   ++  WK   LSRGGRLT  ++VL
Subjt:  EALRKTIEFFEWCSGQKVNWEKSALCGLNIDDLEVKLTAAKLNCKAEKLPLMYLGLPLGGHPKMIVFWQPILDKIQGKLSKWKRYNLSRGGRLTQRKTVL

Query:  SNLPSSYMSIFLMPEKVVLSIERAMRNFFWEGTGGSKLNHLARWETVTKYHKDGGLRLENLKIRNLALLSKWGWRFMQESEALWCNVITSIHGEDHFQWH
        S+LP  Y+S+F +P+ V+  +E+ MR+FFWEG   +  +HL  W+ V K   +GGL +  L++RN  LL KW WRF  ES +LW  VI S +G+    W 
Subjt:  SNLPSSYMSIFLMPEKVVLSIERAMRNFFWEGTGGSKLNHLARWETVTKYHKDGGLRLENLKIRNLALLSKWGWRFMQESEALWCNVITSIHGEDHFQWH

Query:  TNRKEVASLRSPWISISRQWQKIEAFAIFKVGDGRRITFWSDPWIEDQPLKVRFPRLFKLALKTTGTV------ADHWDPCSSSWDLLFKRQLKEEELGE
        T +    S R PW+ I+  + +      FKVG+G  I FW D WI    L+ +FP L  L+     ++      A     C +SWD  F+R + + E+  
Subjt:  TNRKEVASLRSPWISISRQWQKIEAFAIFKVGDGRRITFWSDPWIEDQPLKVRFPRLFKLALKTTGTV------ADHWDPCSSSWDLLFKRQLKEEELGE

Query:  FLSLSSSVANKRVM-VQPDKRIWALEGNGSFSIKSLTNHLSVASPIDD-ELVKGLWKSKCPRRVNILIWTMLFGSLNCAATIQRKLPYHCLSPHMCVLCR
           L   + + RV+ +  D RIW  +  G FS KS  +  +     ++    K LWK++ P +V +  W +    LN  A +Q+K P+ C+SP  CV C+
Subjt:  FLSLSSSVANKRVM-VQPDKRIWALEGNGSFSIKSLTNHLSVASPIDD-ELVKGLWKSKCPRRVNILIWTMLFGSLNCAATIQRKLPYHCLSPHMCVLCR

Query:  QDQEDIQHLFFGCSYASSCWSRLFGFFDLSWVMGSDFKSNVLQILLGPRLNKEPKLLWVNAVKALLSEIWFERNQHVFNDKASNWEDRFDQARLNASSWC
           ED  HLF  CS A S W  L   F++ W M     S    ++     +     LW  AV ++L  +W ERN   F   A + E  +++ +   ++W 
Subjt:  QDQEDIQHLFFGCSYASSCWSRLFGFFDLSWVMGSDFKSNVLQILLGPRLNKEPKLLWVNAVKALLSEIWFERNQHVFNDKASNWEDRFDQARLNASSWC

Query:  TLSKTFKDYSIQEL
          +K F++ S  +L
Subjt:  TLSKTFKDYSIQEL

A0A803QEA6 Uncharacterized protein4.9e-23235.86Show/hide
Query:  QDLEEESLISISSDELNEPEEDENHLSLVLDESIEGNKSIADHLSSMLNEKIPHHLRSFVEDCGISLKGKLFSKKIVSWNIRGLGDYSKQLAVKHLIMKT
        Q LEE  L+  S D+L+E E++E   +   DE +    S ++ + SM   ++   +    ED          +++I++WNIRG GD  K+ A+K  I K 
Subjt:  QDLEEESLISISSDELNEPEEDENHLSLVLDESIEGNKSIADHLSSMLNEKIPHHLRSFVEDCGISLKGKLFSKKIVSWNIRGLGDYSKQLAVKHLIMKT

Query:  NPRIV------------EAIKKLWSSRDIRWSFVEAYGRSGGLLIMWDESKISVIETLKGGYTLSVKCKTLRRKVCWVTNVYRPTDYKERRRIWPELQAL
        NP +V              I  +W SR   W  + A GRSGG L++WD   ISV+++L G +++SV      ++  W + VY P  YK R   W EL  L
Subjt:  NPRIV------------EAIKKLWSSRDIRWSFVEAYGRSGGLLIMWDESKISVIETLKGGYTLSVKCKTLRRKVCWVTNVYRPTDYKERRRIWPELQAL

Query:  AAYCTNAWCLGGDFNITRAIHERVPTGRF-RGMKKFNKFIEEAHLMEIPMSNGRFTWS--REGNIISRSLLDRFLVTKEWDKAFEGTRVFRQARIISDHF
        ++ C  +WC+ GDFN+TR + E++ +  F R MK F+  I E  L++  + NG FTWS  R   + SR  LDRFL T  W+  F   R     RI+SDH 
Subjt:  AAYCTNAWCLGGDFNITRAIHERVPTGRF-RGMKKFNKFIEEAHLMEIPMSNGRFTWS--REGNIISRSLLDRFLVTKEWDKAFEGTRVFRQARIISDHF

Query:  PLLLEVGALEWGPSPFRFCNSWLLNSQCCSTIIRTLEAGYHQGWAGFVIFAKLRSVKTSLQQWHADSLINQRKQEAEIVEKLDKEEQGAELEDPSSILQD
        P++++    +WGP PFRF N WL +        R  +   + GW G     KL+ ++  +++W   +    R ++  +  +L   ++       +  L D
Subjt:  PLLLEVGALEWGPSPFRFCNSWLLNSQCCSTIIRTLEAGYHQGWAGFVIFAKLRSVKTSLQQWHADSLINQRKQEAEIVEKLDKEEQGAELEDPSSILQD

Query:  LRASLKADLMDIYRKKERDLIQKSKLNWLHLGDENTRFF-RFLAAKKRKNLIAELVNDQG-------------------IPTNSA--GCFPANLEWQRVS
         R  LK +   +  ++ER    KSK  W   GD N+RFF   L A+K +N I+ +  + G                   + T+ A  G     +EWQ+++
Subjt:  LRASLKADLMDIYRKKERDLIQKSKLNWLHLGDENTRFF-RFLAAKKRKNLIAELVNDQG-------------------IPTNSA--GCFPANLEWQRVS

Query:  VEQNNRLSSKFSRKEIRFALRGMGKNKVGGPDGFTVEFLNKFWDRIKDDFVALFNKFHENEKLNSCVKENVICLIKEKEDAIMVKDFRPISLTTLTYKVI
             +L   F  +E+R  +     NK  GPDGF++  L   W+ IK D + +F  FH   ++   + +  ICLI ++ ++  VKDFRPISL T  YK+I
Subjt:  VEQNNRLSSKFSRKEIRFALRGMGKNKVGGPDGFTVEFLNKFWDRIKDDFVALFNKFHENEKLNSCVKENVICLIKEKEDAIMVKDFRPISLTTLTYKVI

Query:  AKVLAERLKLVMPSIIAPTQSAFIEGRQILDPILIANELVEEYRIKKKKGWILKLDLEKAFDKVDWGFLEKVLQGKNFDSKWTSWTLGCIKNPKFSIFIN
        AK LA RL+ V+   I+ TQSAF+EGRQILD +L+ANE VE+YR + +KG++LK+D EKA+D+VDWGFL+ VL+ K F  +W  W  GC+ +  FSIFIN
Subjt:  AKVLAERLKLVMPSIIAPTQSAFIEGRQILDPILIANELVEEYRIKKKKGWILKLDLEKAFDKVDWGFLEKVLQGKNFDSKWTSWTLGCIKNPKFSIFIN

Query:  RRPRGRVQASRGVRQGDPLSPFIFLLVSEVLSSLISRLHKSKKFEGFIVGKKKVHVPILQFADDILLFCKYDFDMLEALRKTIEFFEWCSGQKVNWEKSA
         R RG+   SRG+RQGDPLSPF+F ++++VL  ++ +  +++   GF +GK  + +  LQFADD L F K D   L+ L K ++ F   SG KVN  KS 
Subjt:  RRPRGRVQASRGVRQGDPLSPFIFLLVSEVLSSLISRLHKSKKFEGFIVGKKKVHVPILQFADDILLFCKYDFDMLEALRKTIEFFEWCSGQKVNWEKSA

Query:  LCGLNIDDLEVKLTAAKLNCKAEKLPLMYLGLPLGGHPKMIVFWQPILDKIQGKLSKWKRYNLSRGGRLTQRKTVLSNLPSSYMSIFLMPEKVVLSIERA
        L G+ +++  V  +A  + C+  + P+ YLG+ LGG P+   FW+P+LDK   ++  WK   LSRGGRLT  ++VLS+LP  Y+S+F  P+ V+  +E+ 
Subjt:  LCGLNIDDLEVKLTAAKLNCKAEKLPLMYLGLPLGGHPKMIVFWQPILDKIQGKLSKWKRYNLSRGGRLTQRKTVLSNLPSSYMSIFLMPEKVVLSIERA

Query:  MRNFFWEGTGGSKLNHLARWETVTKYHKDGGLRLENLKIRNLALLSKWGWRFMQESEALWCNVITSIHGEDHFQWHTNRKEVASLRSPWISISRQWQKIE
        MR FFWEG   +  +HL  W+ V K   +GGL +  L +RN  LL KW WRF  E  +LW  VI S +G+    W T +    S R PW  IS  + +  
Subjt:  MRNFFWEGTGGSKLNHLARWETVTKYHKDGGLRLENLKIRNLALLSKWGWRFMQESEALWCNVITSIHGEDHFQWHTNRKEVASLRSPWISISRQWQKIE

Query:  AFAIFKVGDGRRITFWSDPWIEDQPLKVRFPRLFKLA----------LKTTGTVADHWDPCSSSWDLLFKRQLKEEELGEFLSLSSSVANKRVM-VQPDK
            FKVG+G RI FW D W+    L+ +FP L  ++          +   G V D    C +SWDL F+R + + E+     L   + + RV+ +  D 
Subjt:  AFAIFKVGDGRRITFWSDPWIEDQPLKVRFPRLFKLA----------LKTTGTVADHWDPCSSSWDLLFKRQLKEEELGEFLSLSSSVANKRVM-VQPDK

Query:  RIWALEGNGSFSIKSLTNHLSVASPIDDEL-VKGLWKSKCPRRVNILIWTMLFGSLNCAATIQRKLPYHCLSPHMCVLCRQDQEDIQHLFFGCSYASSCW
        R W  +  G FS KS  +  +     D+    K LWKS+ P +V +  W +    LN  A +Q+K P+  +SP  CV C+   ED  HLF  C  ASS W
Subjt:  RIWALEGNGSFSIKSLTNHLSVASPIDDEL-VKGLWKSKCPRRVNILIWTMLFGSLNCAATIQRKLPYHCLSPHMCVLCRQDQEDIQHLFFGCSYASSCW

Query:  SRLFGFFDLSWVMGSDFKSNVLQILLGPRLNKEPKLLWVNAVKALLSEIWFERNQHVFNDKASNWEDRFDQARLNASSWCTLSKTFKDYSIQEL
          L   F + W M S     ++  + G   +K    LW   V ++L  +W ERN  +F    S+ E  +D+ +   +SW   +K F++ S  +L
Subjt:  SRLFGFFDLSWVMGSDFKSNVLQILLGPRLNKEPKLLWVNAVKALLSEIWFERNQHVFNDKASNWEDRFDQARLNASSWCTLSKTFKDYSIQEL

M5WJ76 Reverse transcriptase domain-containing protein (Fragment)6.8e-22636.97Show/hide
Query:  KIVSWNIRGLGDYSKQLAVKHLIMKTNPRIV------------EAIKKLWSSRDIRWSFVEAYGRSGGLLIMWDESKISVIETLKGGYTLSVKCKTLRRK
        KI+SWNIRGLG   K+L VK  + +  P IV            + +  +W SR   W F  + GRSGG+ ++W+   +SVI+++ G +++S++ +     
Subjt:  KIVSWNIRGLGDYSKQLAVKHLIMKTNPRIV------------EAIKKLWSSRDIRWSFVEAYGRSGGLLIMWDESKISVIETLKGGYTLSVKCKTLRRK

Query:  VCWVTNVYRPTDYKERRRIWPELQALAAYCTNAWCLGGDFNITRAIHERVPTGRF-RGMKKFNKFIEEAHLMEIPMSNGRFTWS--REGNIISRSLLDRF
          W++ +Y P   +ER   W EL  L  YC + WCLGGDFN+ R   E+   GR  + M+ FN FI+E +L +  + N  FTWS  RE  +  R  LDRF
Subjt:  VCWVTNVYRPTDYKERRRIWPELQALAAYCTNAWCLGGDFNITRAIHERVPTGRF-RGMKKFNKFIEEAHLMEIPMSNGRFTWS--REGNIISRSLLDRF

Query:  LVTKEWDKAFEGTRVFRQARIISDHFPLLLEVGALEWGPSPFRFCNSWLLNSQCCSTIIRTLEAGYHQGWAGFVIFAKLRSVKTSLQQWHADSLINQRKQ
        LV+  W++ F   R     RI SDH P+ L+   ++WGPSPFRF N WL +      I          GW G+    +L+ +K+ L+ W  +   +  + 
Subjt:  LVTKEWDKAFEGTRVFRQARIISDHFPLLLEVGALEWGPSPFRFCNSWLLNSQCCSTIIRTLEAGYHQGWAGFVIFAKLRSVKTSLQQWHADSLINQRKQ

Query:  EAEIVEKLDKEEQGAELEDPSSILQDLRASLKADLMDIYRKKERDLIQKSKLNWLHLGDENTRFFRFLA-AKKRKNLIAEL-VNDQGIPTNSAGCFPANL
          E   +L   +Q    E    +L+  R +L   + D+ +K+E    Q+ K+ W   GD NT+FF  +A   +++N I +L V D G+         AN+
Subjt:  EAEIVEKLDKEEQGAELEDPSSILQDLRASLKADLMDIYRKKERDLIQKSKLNWLHLGDENTRFFRFLA-AKKRKNLIAEL-VNDQGIPTNSAGCFPANL

Query:  EWQRVSVEQNNRLSSKFSRKEIRFALRGMGKNKVGGPDGFTVEFLNKFWDRIKDDFVALFNKFHENEKLNSCVKENVICLIKEKEDAIMVKDFRPISLTT
        E + +   +    S+K        A+   GK+K  GPDGF++ F    W+ +K D + +   F ++  +N    E  ICLI +K +++ V D+RPISL T
Subjt:  EWQRVSVEQNNRLSSKFSRKEIRFALRGMGKNKVGGPDGFTVEFLNKFWDRIKDDFVALFNKFHENEKLNSCVKENVICLIKEKEDAIMVKDFRPISLTT

Query:  LTYKVIAKVLAERLKLVMPSIIAPTQSAFIEGRQILDPILIANELVEEYRIKKKKGWILKLDLEKAFDKVDWGFLEKVLQGKNFDSKWTSWTLGCIKNPK
          YKVI+KVLA  L+ V+ + I+ +Q AF++ RQILD +L+ANE+VEE R +K+KG + K+D EKA+D V+W F++ V+  K F  KW  W +GC+++  
Subjt:  LTYKVIAKVLAERLKLVMPSIIAPTQSAFIEGRQILDPILIANELVEEYRIKKKKGWILKLDLEKAFDKVDWGFLEKVLQGKNFDSKWTSWTLGCIKNPK

Query:  FSIFINRRPRGRVQASRGVRQGDPLSPFIFLLVSEVLSSLISRLHKSKKFEGFIVGKKKVHVPILQFADDILLFCKYDFDMLEALRKTIEFFEWCSGQKV
        FSI IN +PRG+ +ASRG+RQGDPLSPF+F LVS+VLS LI R        G + G  +V V  LQFADD +       +    L + ++ F   SG K+
Subjt:  FSIFINRRPRGRVQASRGVRQGDPLSPFIFLLVSEVLSSLISRLHKSKKFEGFIVGKKKVHVPILQFADDILLFCKYDFDMLEALRKTIEFFEWCSGQKV

Query:  NWEKSALCGLNIDDLEVKLTAAKLNCKAEKLPLMYLGLPLGGHPKMIVFWQPILDKIQGKLSKWKRYNLSRGGRLTQRKTVLSNLPSSYMSIFLMPEKVV
        N  KS + G+N     +   A    C+    P++YLGLPLGG+P+ + FW P+++K++ +L KWKR  LS+GGRLT  + VLS++PS YMS+F MP  V 
Subjt:  NWEKSALCGLNIDDLEVKLTAAKLNCKAEKLPLMYLGLPLGGHPKMIVFWQPILDKIQGKLSKWKRYNLSRGGRLTQRKTVLSNLPSSYMSIFLMPEKVV

Query:  LSIERAMRNFFWEGTGGSKLNHLARWETVTKYHKDGGLRLENLKIRNLALLSKWGWRFMQESEALWCNVITSIHGEDHFQWHTNRKEVASLRSPWISISR
          +E+ MRNF WEG    K  HL RWE VTK  ++GGL + +L+ R  AL +KW WRF  E+ +LW  +I S +G D            S  +PW  IS+
Subjt:  LSIERAMRNFFWEGTGGSKLNHLARWETVTKYHKDGGLRLENLKIRNLALLSKWGWRFMQESEALWCNVITSIHGEDHFQWHTNRKEVASLRSPWISISR

Query:  QWQKIEAFAIFKVGDGRRITFWSDPWIEDQPLKVRFPRLFKLALKTTGTVA----DHWDPCSSSWDLLFKRQLKEEELGEFLSLSSSVANKRVM-VQPDK
         +        F VG+G +I FW D W+++  LK  FPRL  L+ +   ++A    +H  P   +WD  F+R L E E+ E + L   + N R+   +PD+
Subjt:  QWQKIEAFAIFKVGDGRRITFWSDPWIEDQPLKVRFPRLFKLALKTTGTVA----DHWDPCSSSWDLLFKRQLKEEELGEFLSLSSSVANKRVM-VQPDK

Query:  RIWALEGNGSFSIKSLTNHLSVASPIDDELVKGLWKSKCPRRVNILIWTMLFGSLNCAATIQRKLPYHCLSPHMCVLCRQDQEDIQHLFFGCSYASSCWS
        R W +E  GSFS KS  + L   +         +WK+K P ++   +W    G +N    IQR+ P  CLSP  CVLC+++ E+I HLF  CSY+   W 
Subjt:  RIWALEGNGSFSIKSLTNHLSVASPIDDELVKGLWKSKCPRRVNILIWTMLFGSLNCAATIQRKLPYHCLSPHMCVLCRQDQEDIQHLFFGCSYASSCWS

Query:  RLFGFFDLSWVMGSD-FKSNVLQILLGPRLNKEPKLLWVNAVKALLSEIWFERNQHVFNDKAS-NWEDRFDQARLNASSWCTLSKTFKDY
        ++ G   + WV+    F+   + + +  +  K   +L    V A+   IW ERNQ +F        E+ +D+ +  AS W ++S  FKDY
Subjt:  RLFGFFDLSWVMGSD-FKSNVLQILLGPRLNKEPKLLWVNAVKALLSEIWFERNQHVFNDKAS-NWEDRFDQARLNASSWCTLSKTFKDY

SwissProt top hitse value%identityAlignment
O00370 LINE-1 retrotransposable element ORF2 protein7.0e-4223.16Show/hide
Query:  RSGGLLIMWDES--KISVIETLKGGYTLSVKCKTLRRKVCWVTNVYRPTDYKERRRIWPELQALAAYCTNAWCLGGDFNITRAIHERVPTGRF-RGMKKF
        ++G  +++ D++  K + I+  K G+ + VK  +++++   + N+Y P +    R I   L  L     +   + GDFN   +I +R    +  +  ++ 
Subjt:  RSGGLLIMWDES--KISVIETLKGGYTLSVKCKTLRRKVCWVTNVYRPTDYKERRRIWPELQALAAYCTNAWCLGGDFNITRAIHERVPTGRF-RGMKKF

Query:  NKFIEEAHLMEI-----PMSNGRFTWSREGNIISRSLLDRFLVTKEWDKAFEGTRVFRQARIISDHFPLLLEV--------GALEWGPSPFRFCNSWLLN
        N  + +  L++I     P S     +S   +  S+  +D  + +K      + T +      +SDH  + LE+         +  W  +     + W+ N
Subjt:  NKFIEEAHLMEI-----PMSNGRFTWSREGNIISRSLLDRFLVTKEWDKAFEGTRVFRQARIISDHFPLLLEV--------GALEWGPSPFRFCNSWLLN

Query:  SQCCSTII-----RTLEAGYHQGWAGFVI-----FAKLRSVKTSLQQWHADSLINQRKQEAEIVEKLDKEEQGAELEDPSSILQDLRASLKADLMDIYRK
               +        +  Y   W  F       F  L + K   ++   D+L +Q K+       L+K+EQ       +S  Q++   ++A+L +I  +
Subjt:  SQCCSTII-----RTLEAGYHQGWAGFVI-----FAKLRSVKTSLQQWHADSLINQRKQEAEIVEKLDKEEQGAELEDPSSILQDLRASLKADLMDIYRK

Query:  KERDLIQKSKLNWL--HLGDENTRFFRFLAAKKRKNLIAELVNDQGIPT-------------------------NSAGCFPANLEWQRVSVEQNNRLSSK
        K    I +S+ +W    +   +    R +  K+ KN I  + ND+G  T                              F       R++ E+   L+  
Subjt:  KERDLIQKSKLNWL--HLGDENTRFFRFLAAKKRKNLIAELVNDQGIPT-------------------------NSAGCFPANLEWQRVSVEQNNRLSSK

Query:  FSRKEIRFALRGMGKNKVGGPDGFTVEFLNKFWDRIKDDFVALFNKFHENEKL-NSCVKENVICLIKEKEDAIMVKDFRPISLTTLTYKVIAKVLAERLK
         +  EI   +  +   K  GPDGFT EF  ++ + +    + LF    +   L NS  + ++I + K   D    ++FRPISL  +  K++ K+LA R++
Subjt:  FSRKEIRFALRGMGKNKVGGPDGFTVEFLNKFWDRIKDDFVALFNKFHENEKL-NSCVKENVICLIKEKEDAIMVKDFRPISLTTLTYKVIAKVLAERLK

Query:  LVMPSIIAPTQSAFIEGRQILDPILIANELVEEY-RIKKKKGWILKLDLEKAFDKVDWGFLEKVLQGKNFDSKWTSWTLGCIKNPKFSIFINRRPRGRVQ
          +  +I   Q  FI G Q    I  +  +++   R K K   I+ +D EKAFDK+   F+ K L     D  +          P  +I +N +      
Subjt:  LVMPSIIAPTQSAFIEGRQILDPILIANELVEEY-RIKKKKGWILKLDLEKAFDKVDWGFLEKVLQGKNFDSKWTSWTLGCIKNPKFSIFINRRPRGRVQ

Query:  ASRGVRQGDPLSPFIFLLVSEVLSSLISRLHKSKKFEGFIVGKKKVHVPILQFADDILLFCKYDFDMLEALRKTIEFFEWCSGQKVNWEKSALCGLNIDD
           G RQG PLSP +F +V EVL+  I    + K+ +G  +GK++V + +  FADD++++ +      + L K I  F   SG K+N +KS     N + 
Subjt:  ASRGVRQGDPLSPFIFLLVSEVLSSLISRLHKSKKFEGFIVGKKKVHVPILQFADDILLFCKYDFDMLEALRKTIEFFEWCSGQKVNWEKSALCGLNIDD

Query:  LEVKLTAAKLNCKAEKLPLMYLGLPLGGHPKMIV--FWQPILDKIQGKLSKWKRYNLSRGGRLTQRKTVLSNLPSSYMSIFLMPEKVVLS----IERAMR
                +L        + YLG+ L    K +    ++P+L +I+   +KWK    S  GR+   K  +  LP        +P K+ ++    +E+   
Subjt:  LEVKLTAAKLNCKAEKLPLMYLGLPLGGHPKMIV--FWQPILDKIQGKLSKWKRYNLSRGGRLTQRKTVLSNLPSSYMSIFLMPEKVVLS----IERAMR

Query:  NFFWEGTGGSKLNHLARWETVTKYHKDGGLRLENLKIRNLALLSKWGWRFMQESE
         F W      K   +A+   +++ +K GG+ L + K+   A ++K  W + Q  +
Subjt:  NFFWEGTGGSKLNHLARWETVTKYHKDGGLRLENLKIRNLALLSKWGWRFMQESE

P08548 LINE-1 reverse transcriptase homolog1.3e-3225.64Show/hide
Query:  RVSVEQNNRLSSKFSRKEIRFALRGMGKNKVGGPDGFTVEFLNKFWDRIKDDFVALFNKFHENEKL-NSCVKENVICLIKEKEDAIMVKDFRPISLTTLT
        R+S ++   L+   S  EI   ++ + K K  GPDGFT EF   F + +    + LF    +   L N+  + N+  + K  +D    +++RPISL  + 
Subjt:  RVSVEQNNRLSSKFSRKEIRFALRGMGKNKVGGPDGFTVEFLNKFWDRIKDDFVALFNKFHENEKL-NSCVKENVICLIKEKEDAIMVKDFRPISLTTLT

Query:  YKVIAKVLAERLKLVMPSIIAPTQSAFIEGRQILDPILIANELVEEY-RIKKKKGWILKLDLEKAFDKVDWGFLEKVLQGKNFDSKWTSWTLGCIKNPKF
         K++ K+L  R++  +  II   Q  FI G Q    I  +  +++   ++K K   IL +D EKAFD +   F+ + L+    +  +          P  
Subjt:  YKVIAKVLAERLKLVMPSIIAPTQSAFIEGRQILDPILIANELVEEY-RIKKKKGWILKLDLEKAFDKVDWGFLEKVLQGKNFDSKWTSWTLGCIKNPKF

Query:  SIFINRRPRGRVQASRGVRQGDPLSPFIFLLVSEVLSSLISRLHKSKKFEGFIVGKKKVHVPILQFADDILLFCKYDFDMLEALRKTIEFFEWCSGQKVN
        +I +N           G RQG PLSP +F +V EVL+  I    + K  +G  +G +++ + +  FADD++++ +   D    L + I+ +   SG K+N
Subjt:  SIFINRRPRGRVQASRGVRQGDPLSPFIFLLVSEVLSSLISRLHKSKKFEGFIVGKKKVHVPILQFADDILLFCKYDFDMLEALRKTIEFFEWCSGQKVN

Query:  WEKSALCGLNIDDLEVKLTAAKLNCKAEKLPLMYLGLPLGGHPKMIV--FWQPILDKIQGKLSKWKRYNLSRGGRLTQRKTVLSNLPSSYMSIFLMPEKV
          KS       ++   K     +        + YLG+ L    K +    ++ +  +I   ++KWK    S  GR+   K  +S LP +  +   +P K 
Subjt:  WEKSALCGLNIDDLEVKLTAAKLNCKAEKLPLMYLGLPLGGHPKMIV--FWQPILDKIQGKLSKWKRYNLSRGGRLTQRKTVLSNLPSSYMSIFLMPEKV

Query:  VLS----IERAMRNFFWEGTGGSKLNHLARWETVTKYHKDGGLRLENLKI--RNLALLSKWGWRFMQE
         LS    +E+ + +F W      K   +A+   ++  +K GG+ L +L++  +++ + + W W   +E
Subjt:  VLS----IERAMRNFFWEGTGGSKLNHLARWETVTKYHKDGGLRLENLKI--RNLALLSKWGWRFMQE

P0C2F6 Putative ribonuclease H protein At1g657508.2e-3528.45Show/hide
Query:  ILDKIQGKLSKWKRYNLSRGGRLTQRKTVLSNLPSSYMSIFLMPEKVVLSIERAMRNFFWEGTGGSKLNHLARWETVTKYHKDGGLRLENLKIRNLALLS
        IL+++  ++S W+   LS  GRLT  K VLS++P   MS  L+P+ ++  +++  R F W  T   K  HL +W  V    K+GGL +   K  N AL+S
Subjt:  ILDKIQGKLSKWKRYNLSRGGRLTQRKTVLSNLPSSYMSIFLMPEKVVLSIERAMRNFFWEGTGGSKLNHLARWETVTKYHKDGGLRLENLKIRNLALLS

Query:  KWGWRFMQESEALWCNVITSIHGEDHFQWHTNRKEVASLRSPWISISRQWQKIEAFAIFKV-GDGRRITFWSDPWIEDQPLKVRFPRLFKLALKTTGTVA
        K GWR +QE  +LW  V+   +     +         S  S W SI+   + + +  +  + GDG++I FW+D W+  +PL +      +     T    
Subjt:  KWGWRFMQESEALWCNVITSIHGEDHFQWHTNRKEVASLRSPWISISRQWQKIEAFAIFKV-GDGRRITFWSDPWIEDQPLKVRFPRLFKLALKTTGTVA

Query:  DHWDPCSSSWDLLFKRQLKEEELGEFLSLSSSVANKRVMVQPDKRIWALEGNGSFSIKSLTNHLS---VASPIDDELVKGLWKSKCPRRVNILIWTMLFG
        D W P    WD  F +          L L + V +  V    D+  W    +G FS++S    L+   V  P        LWK + P RV   +W  L G
Subjt:  DHWDPCSSSWDLLFKRQLKEEELGEFLSLSSSVANKRVMVQPDKRIWALEGNGSFSIKSLTNHLS---VASPIDDELVKGLWKSKCPRRVNILIWTMLFG

Query:  SLNCAATIQRKLPYHCLSPHMCVLCRQDQEDIQHLFFGCSYASSCWSRLF------GFFDLS
        +     T + +   H  + ++C +C+   E + H+   C      W R+       GFF  S
Subjt:  SLNCAATIQRKLPYHCLSPHMCVLCRQDQEDIQHLFFGCSYASSCWSRLF------GFFDLS

P11369 LINE-1 retrotransposable element ORF2 protein8.2e-3526.48Show/hide
Query:  RVSVEQNNRLSSKFSRKEIRFALRGMGKNKVGGPDGFTVEFLNKFWDRIKDDFVALFNK-FH----ENEKLNSCVKENVICLIKEKEDAIMVKDFRPISL
        +++ +Q + L+S  S KEI   +  +   K  GPDGF+ EF   F    K+D + + +K FH    E    NS  +  +  + K ++D   +++FRPISL
Subjt:  RVSVEQNNRLSSKFSRKEIRFALRGMGKNKVGGPDGFTVEFLNKFWDRIKDDFVALFNK-FH----ENEKLNSCVKENVICLIKEKEDAIMVKDFRPISL

Query:  TTLTYKVIAKVLAERLKLVMPSIIAPTQSAFIEGRQILDPILIANELVEEY-RIKKKKGWILKLDLEKAFDKVDWGFLEKVLQGKNFDSKWTSWTLGCIK
          +  K++ K+LA R++  + +II P Q  FI G Q    I  +  ++    ++K K   I+ LD EKAFDK+   F+ KVL+       + +       
Subjt:  TTLTYKVIAKVLAERLKLVMPSIIAPTQSAFIEGRQILDPILIANELVEEY-RIKKKKGWILKLDLEKAFDKVDWGFLEKVLQGKNFDSKWTSWTLGCIK

Query:  NPKFSIFINRRPRGRVQASRGVRQGDPLSPFIFLLVSEVLSSLISRLHKSKKFEGFIVGKKKVHVPILQFADDILLFCKYDFDMLEALRKTIEFFEWCSG
         P  +I +N      +    G RQG PLSP++F +V EVL+  I    + K+ +G  +GK++V + +L  ADD++++     +    L   I  F    G
Subjt:  NPKFSIFINRRPRGRVQASRGVRQGDPLSPFIFLLVSEVLSSLISRLHKSKKFEGFIVGKKKVHVPILQFADDILLFCKYDFDMLEALRKTIEFFEWCSG

Query:  QKVNWEKSALCGLNIDDLEVKLTAAKLNCKAEKLPLMYLGLPLGGHPKMIV--FWQPILDKIQGKLSKWKRYNLSRGGRLTQRKTVLSNLPSSYMSIFLM
         K+N  KS       +    K              + YLG+ L    K +    ++ +  +I+  L +WK    S  GR+   K  +  LP +      +
Subjt:  QKVNWEKSALCGLNIDDLEVKLTAAKLNCKAEKLPLMYLGLPLGGHPKMIV--FWQPILDKIQGKLSKWKRYNLSRGGRLTQRKTVLSNLPSSYMSIFLM

Query:  PEKVVL----SIERAMRNFFWEGTGGSKLNHLARWETVTKYHKDGGLRLENLKIRNLALLSKWGWRFMQESEALWCNVITSIHGEDHFQWH
        P K+       +E A+  F W     +K   +A+   +      GG+ + +LK+   A++ K  W + ++ +    N I       H   H
Subjt:  PEKVVL----SIERAMRNFFWEGTGGSKLNHLARWETVTKYHKDGGLRLENLKIRNLALLSKWGWRFMQESEALWCNVITSIHGEDHFQWH

P14381 Transposon TX1 uncharacterized 149 kDa protein8.5e-4826.98Show/hide
Query:  NVYRPTDYKERRRIWPELQALAAYCTN--AWCLGGDFNITRAIHER-VPTGRFRGMKKFNKFIEEAHLMEIPMSNG----RFTWSR-EGNIISRSLLDRF
        NVY PT   ER R +  L A      +  A  +GGDFN T    +R VP  R        + I    L+++          FT+ R     +S+S +DR 
Subjt:  NVYRPTDYKERRRIWPELQALAAYCTN--AWCLGGDFNITRAIHER-VPTGRFRGMKKFNKFIEEAHLMEIPMSNG----RFTWSR-EGNIISRSLLDRF

Query:  LVTKEWDKAFEGTRVFRQARIISDHFPLLLEVGALEWGP-SPFRFCNSWLLNSQCCSTIIRTLEAGY----------HQGWAGFVIFAKLRSVKTSLQQW
         ++       + + +  +    SDH  + L +      P + +   N+ LL  +  +  +R    G+          +Q W    +  KL      L Q 
Subjt:  LVTKEWDKAFEGTRVFRQARIISDHFPLLLEVGALEWGP-SPFRFCNSWLLNSQCCSTIIRTLEAGY----------HQGWAGFVIFAKLRSVKTSLQQW

Query:  HADSLINQRKQEAEIV--EKLDKEEQGAELEDPSSILQDLRASLKADLMDIYRKKERDLIQKSKLNWLHLGDENTRFFRFLAAKK-RKNLIAELVNDQGI
        +  S+  QR  E E +  E LD E++ +  ED +  LQ      K  L ++ +++ R    +S++  L   D  +RFF  L  KK  +  I  L  + G 
Subjt:  HADSLINQRKQEAEIV--EKLDKEEQGAELEDPSSILQDLRASLKADLMDIYRKKERDLIQKSKLNWLHLGDENTRFFRFLAAKK-RKNLIAELVNDQGI

Query:  P-------TNSAGCFPANL-------------EWQR---VSVEQNNRLSSKFSRKEIRFALRGMGKNKVGGPDGFTVEFLNKFWDRIKDDFVALFNKFHE
        P        + A  F  NL              W     VS  +  RL +  +  E+  ALR M  NK  G DG T+EF   FWD +  DF  +  +  +
Subjt:  P-------TNSAGCFPANL-------------EWQR---VSVEQNNRLSSKFSRKEIRFALRGMGKNKVGGPDGFTVEFLNKFWDRIKDDFVALFNKFHE

Query:  NEKLNSCVKENVICLIKEKEDAIMVKDFRPISLTTLTYKVIAKVLAERLKLVMPSIIAPTQSAFIEGRQILDPILIANELVEEYRIKKKKGWILKLDLEK
          +L    +  V+ L+ +K D  ++K++RP+SL +  YK++AK ++ RLK V+  +I P QS  + GR I D + +  +L+   R        L LD EK
Subjt:  NEKLNSCVKENVICLIKEKEDAIMVKDFRPISLTTLTYKVIAKVLAERLKLVMPSIIAPTQSAFIEGRQILDPILIANELVEEYRIKKKKGWILKLDLEK

Query:  AFDKVDWGFLEKVLQGKNFDSKWTSWTLGCIKNPKFSIFINRRPRGRVQASRGVRQGDPLSPFIFLLVSEVLSSLISRLHKSKKFEGFIVGKKKVHVPIL
        AFD+VD  +L   LQ  +F  ++  +      + +  + IN      +   RGVRQG PLS  ++ L  E    L+      K+  G ++ +  + V + 
Subjt:  AFDKVDWGFLEKVLQGKNFDSKWTSWTLGCIKNPKFSIFINRRPRGRVQASRGVRQGDPLSPFIFLLVSEVLSSLISRLHKSKKFEGFIVGKKKVHVPIL

Query:  QFADDILLFCKYDFDMLEALRKTIEFFEWCSGQKVNWEKSALCGLNIDDLEVK-LTAAKLNCKAEKLPLMYLGLPLGG--HPKMIVFWQPILDKIQGKLS
         +ADD++L  + D   LE  ++  E +   S  ++NW KS+  GL    L+V  L  A  +   E   + YLG+ L    +P    F + + + +  +L 
Subjt:  QFADDILLFCKYDFDMLEALRKTIEFFEWCSGQKVNWEKSALCGLNIDDLEVK-LTAAKLNCKAEKLPLMYLGLPLGG--HPKMIVFWQPILDKIQGKLS

Query:  KWKRYNLSRGGRLTQRKTVLSNLPSS---YMSIFLMP-EKVVLSIERAMRNFFWEG
        KWK +  ++   +  R  V++ L +S   Y  I L P ++ +  I+R + +F W G
Subjt:  KWKRYNLSRGGRLTQRKTVLSNLPSS---YMSIFLMP-EKVVLSIERAMRNFFWEG

Arabidopsis top hitse value%identityAlignment
AT1G43760.1 DNAse I-like superfamily protein4.1e-2125.41Show/hide
Query:  ERRRIW----PELQALAAYCTN-AWCLGGDFNITRAIHERVPTGR----FRGMKKFNKFIEEAHLMEIPMSNGRFTWS--REGNIISRSLLDRFLVTKEW
        E  RIW    P +  L    T+    L GDF+   A  +     +     RG+++F   + ++ L++IP     +TWS  ++ N I R  LDR +   +W
Subjt:  ERRRIW----PELQALAAYCTN-AWCLGGDFNITRAIHERVPTGR----FRGMKKFNKFIEEAHLMEIPMSNGRFTWS--REGNIISRSLLDRFLVTKEW

Query:  DKAFEGTRVFRQARIISDHFPLLLEVGALEWGPSPFRFCNSWLLNSQCCSTIIRTLEAGYHQ----GWAGFVIFAKLRSVKTSLQQWHADSLINQRKQEA
          +F       +   +SDH P ++    LE  P   + C  +        T + +L   + +    G   F +   L++ K   +  +     N + +  
Subjt:  DKAFEGTRVFRQARIISDHFPLLLEVGALEWGPSPFRFCNSWLLNSQCCSTIIRTLEAGYHQ----GWAGFVIFAKLRSVKTSLQQWHADSLINQRKQEA

Query:  EIVEKLDKEEQGAELEDPSSILQDLRASLKADLMDIYRKKERDLIQKSKLNWLHLGDENTRFF-RFLAAKKRKNLI-----------------AELV---
        E ++ L+   Q   L +PS  L  +    +          E    QKS++ WL  GD NTRFF + + A + KNLI                  E++   
Subjt:  EIVEKLDKEEQGAELEDPSSILQDLRASLKADLMDIYRKKERDLIQKSKLNWLHLGDENTRFF-RFLAAKKRKNLI-----------------AELV---

Query:  ------NDQGIPTNSAGCFPANLEWQRVSVEQNNRLSSKFSRKEIRFALRGMGKNKVGGPDGFTVEFLNKFWDRIKDDFVALFNKFHENEKLNSCVKENV
              +D  I T  +     ++   R +    +RLS+  S KEI  A+  M +NK  GPD FT EF  + W  +KD  +A   +F     L        
Subjt:  ------NDQGIPTNSAGCFPANLEWQRVSVEQNNRLSSKFSRKEIRFALRGMGKNKVGGPDGFTVEFLNKFWDRIKDDFVALFNKFHENEKLNSCVKENV

Query:  ICLIKEKEDAIMVKDFRPISLTTLTYKVI
        I LI +      +  FRP+S  T+ YK+I
Subjt:  ICLIKEKEDAIMVKDFRPISLTTLTYKVI

AT3G25270.1 Ribonuclease H-like superfamily protein7.7e-1229.14Show/hide
Query:  ELVKGLWKSKCPRRVNILIWTMLFGSLNCAATIQRKLPYHCLSPHMCVLCRQDQEDIQHLFFGCSYASSCWSRLFGFFDLSWVMGSDFKSNVLQILLGPR
        E+   +WK K   ++   +W +L G+L     ++R+   H  +   C  C Q+ E  QHLFF C YA   W R  G                +++LL   
Subjt:  ELVKGLWKSKCPRRVNILIWTMLFGSLNCAATIQRKLPYHCLSPHMCVLCRQDQEDIQHLFFGCSYASSCWSRLFGFFDLSWVMGSDFKSNVLQILLGPR

Query:  L-NKEPKLLWVNAVKALLSEIWFERNQHVFNDKASNWEDRFDQARLNASSW
        L N++P+L   N    +L  +W  RNQ VF  K+ +W++   +AR +   W
Subjt:  L-NKEPKLLWVNAVKALLSEIWFERNQHVFNDKASNWEDRFDQARLNASSW

AT4G20520.1 RNA binding;RNA-directed DNA polymerases6.5e-1145.68Show/hide
Query:  LAERLKLVMPSIIAPTQSAFIEGRQILDPILIANELVEEYRIKK-KKGW-ILKLDLEKAFDKVDWGFLEKVLQGKNFDSKW
        + ERLK +M ++I P Q++FI GR   D I+   E V   R KK  KGW +LKLDLEKA+D++ W +LE  L    F   W
Subjt:  LAERLKLVMPSIIAPTQSAFIEGRQILDPILIANELVEEYRIKK-KKGW-ILKLDLEKAFDKVDWGFLEKVLQGKNFDSKW

AT4G29090.1 Ribonuclease H-like superfamily protein4.2e-2623.7Show/hide
Query:  LPSSYMSIFLMPEKVVLSIERAMRNFFWEGTGGSKLNHLARWETVTKYHKDGGLRLENLKIRNLALLSKWGWRFMQESEALWCNVITSIHGEDHFQWHTN
        LP+  M+ FL+P+ V   I   + +F+W     +K  H   W+ ++ Y  +GG+  ++++  NLALL K  WR +   E+L   V  S +      +H +
Subjt:  LPSSYMSIFLMPEKVVLSIERAMRNFFWEGTGGSKLNHLARWETVTKYHKDGGLRLENLKIRNLALLSKWGWRFMQESEALWCNVITSIHGEDHFQWHTN

Query:  RKEVASLRS----PWISISRQWQKIEAFAIFKVGDGRRITFWSDPWIEDQPLKV-----RFPRLFKLALKTTGTVADHWDPCSSSW-----DLLFKRQLK
            A L S     W SI    + +   A   VG+G  I  W   W++ +P        R P     ++ +   V+D  D     W     ++LF  +++
Subjt:  RKEVASLRS----PWISISRQWQKIEAFAIFKVGDGRRITFWSDPWIEDQPLKV-----RFPRLFKLALKTTGTVADHWDPCSSSW-----DLLFKRQLK

Query:  EEELGEFLSLSSSVANKRVMVQPDKRIWALEGNGSFSIKS----LTNHLS-------VASPIDDELVKGLWKSKCPRRVNILIWTMLFGSLNCAATIQRK
         + +GE          +R++   D   W    +G +++KS    LT  ++       V+ P  + + + +WKS+   ++   +W  L  SL  A  +  +
Subjt:  EEELGEFLSLSSSVANKRVMVQPDKRIWALEGNGSFSIKS----LTNHLS-------VASPIDDELVKGLWKSKCPRRVNILIWTMLFGSLNCAATIQRK

Query:  LPYHCLSPHMCVLCRQDQEDIQHLFFGCSYASSCWSRLFGFFDLSWVMGSDFKSNVLQIL-----LGPRLNKEPKLLWVNA---VKALLSEIWFERNQHV
           H      C+ C   +E + HL F C++A   W+       +   +G ++  ++   L     LG   N  P+  W  A   V  LL  +W  RN+ V
Subjt:  LPYHCLSPHMCVLCRQDQEDIQHLFFGCSYASSCWSRLFGFFDLSWVMGSDFKSNVLQIL-----LGPRLNKEPKLLWVNA---VKALLSEIWFERNQHV

Query:  FNDKASNWEDRFDQARLNASSW
        F  +  N ++   +A  +   W
Subjt:  FNDKASNWEDRFDQARLNASSW

ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase)2.7e-0946.97Show/hide
Query:  INRRPRGRVQASRGVRQGDPLSPFIFLLVSEVLSSLISRLHKSKKFEGFIVGKKKVHVPILQFADD
        IN  P+G V  SRG+RQGDPLSP++F+L +EVLS L  R  +  +  G  V      +  L FADD
Subjt:  INRRPRGRVQASRGVRQGDPLSPFIFLLVSEVLSSLISRLHKSKKFEGFIVGKKKVHVPILQFADD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAGTAGTGAGCTGCAAAATCAATGGAGTGTTCTTCTGTTCATGGTTTGAAAAAGGAAAATTCTCAGTAGAAGATGTTGAAGTTCGAAAATCAATCCAACTCTCAAA
ATCGCAGTTGCTTTGGAAGGTAGAACAAATCAAAGACCTCCTCAACGGTTCAAGCATTCGTTTTTTCCTAAGAGAATGGAGAGATGATTTAGGAACAATAAGAATTTTGA
AGTTAAAGACCAAGGCTGGATGGAATTTAAGATGTGTGGTTTGGCTGATTTCCAGAGGGCGATTCTATATTCATGTACCAGAAGGAGTAGCTCAACATAGATGGTCAGAG
TTCTTGAAAATGCTGAAAGGAAGTCTACAACGACTCGAAAACCACAGAGAAAGTTCAATTTTTGAGAAGGATCAACAGAAATGCTCCCACTCATCACCATTGATACAACC
AAGCTTTGCTAAAGTAGTGTCAACAAGGAAAAGTCAGTTGTTTTCCTCTGTCTGGGCATCTACAAATCTAAGTATAAGAGAGGATACTCAGATCCAAAAAGCGCTCCATG
CTTCTGCAACAATAGAACAGAGTATACTACCTCGACGAGTTGCAAAAAAGGAGAAAGACAGCAAGAAGAATTTTAAAATGAGTTCAAAGGAAACCTCAGCAAAGGACCAG
TACTATGCTCCACCCAAACAGCAATTTTGGGTTAGGAAAAACAGAGAAGTCTTCAAGGAAGACTTTAACAACCTATGGGTTCTCTCTAGGCTACTTGCCTCTGATGATTG
GAAGGAAATTGCCAAATTCTTAGAAGTCTTATACAATGTGAAAGTTAAAATCAACCCCCTATTCGATGACAAAGCTCTAATCCGAGTAGTCCAAGGGAAGATAGAAGATC
TAATTGAAGCCCTCGGCAAATGGTTTAATTATAGAAAATTTCATTTATTATTTGAAAAATGGAATCCATTACTTCACAGTAGACCCTCTGTTTTAAGAAGCTTTGGTGGA
TGGATTGCAATCAAGAACTTACCGCTGGAGTATTGGAGCGGAGAAACATTTGAAGCAATTGGGCAGCACTTTGGAGGATTGGAGGAGATAAGTATTGAAACACTGAATTT
GTTAGACGTTTCAGAGGCGAAAATTAAAGTTAGAAAGAATGTTTGTGGCTTTATTCCAGCTACAATCGAAATTAACAATGAATTTAGAGGTTCCATTCATTTACTTTTTA
AAGACATCATACCAATGAAAGATTCAAGTTCCATTATCAAGGAATTTATTGGGAGCGATTTCAAAAATCCCATTGACCAAGTACATTTGAGCAAAGTTGAAGAAGATGAA
ACCCAATCGGGGCAGGAGGAAAAGCCACCGAAAATCTTGGTCGGGAGTCCAACAGTCAGAAAAAGGTCGAAGTTGACAGCAAATGGTCACGAAGGAAAAAGCATTAATGC
AGCCGTAGAAAGCCCAATAGGCGTGCTGAAGTCGATGCGGTCTGCAAAAGTTATGAAGAAAGAGAAGAGGAAAGAGAGGGTGTTTGGCCGAATTAATGAGAATATCAAAA
ACTTCCAGACCCCTGGCATTCCGTCTAGGGATTTCCTTCCATCAGGGGCGCCCACTGGCTTCGAAACTATCCAAAAAAACCGAAAAGACTCAAAAGTTCCTTTGTCATTC
ACAGAAGAAGCCGAAAAGTCTCCCCCAATTAGCATGAAGTCCAATCGGGAAGAGTTTCCCTTCCAGACATTGCCACCCCCTCTTCAGCTCCCTCGAGCTCGCGTGTTCAC
CCCTCTTCAGCAGTCTCCAGCACACAAGGTTTCCCCGATCAAGAGCCCTTTCTTCCCAGCCGTTAGAAGGAGATCAGACCTCTTCAAACCCTTCCTAAAACACTTCAGCA
GAGGAAAAACATCGTTCCTCAACCTTTGGTTAGCTTTGACAAATCCAGATCTACTTGAGGTTAGTTGTGCAAACTCTCAAGTCCCCCAACAATCTGGTAACAGGTCAGCA
CTCCCCCTCTCTTCTTCGCAAATATTTCACCAATCAAAAATTCTAATCCCAGGATCAAATATTCTCTTCTTGCGAGGCTCCCCAAGTTCTTACCGGCCCAAGCCAAATAA
GATTCAAGACTTAGAAGAGGAATCACTCATCAGCATAAGTAGTGATGAATTGAATGAGCCAGAAGAAGATGAGAACCACTTAAGTTTGGTTTTGGATGAATCCATCGAAG
GCAACAAGAGTATAGCCGACCACCTCTCTTCAATGCTTAACGAAAAAATTCCTCATCACTTGAGATCATTTGTAGAAGATTGTGGAATTTCATTGAAAGGAAAGTTATTC
TCAAAGAAAATCGTTTCTTGGAACATTAGAGGCCTTGGAGATTATTCAAAGCAGCTGGCAGTTAAGCACCTAATTATGAAGACAAATCCAAGAATTGTCGAAGCAATCAA
GAAACTATGGAGTTCAAGAGACATCAGATGGTCATTTGTGGAAGCCTATGGCAGATCAGGAGGGTTATTAATCATGTGGGATGAAAGTAAAATATCAGTCATTGAAACAC
TCAAAGGAGGCTACACTCTTTCTGTTAAATGTAAGACCTTACGCAGAAAAGTTTGTTGGGTAACAAATGTATACAGACCAACCGATTATAAAGAAAGAAGGCGCATCTGG
CCGGAGTTACAAGCTTTGGCAGCATATTGCACAAATGCCTGGTGCCTGGGTGGGGACTTCAACATCACTAGAGCAATCCATGAAAGAGTTCCAACTGGAAGATTTAGAGG
AATGAAGAAGTTTAACAAATTCATAGAAGAGGCACATTTAATGGAAATTCCTATGAGCAATGGGCGTTTCACATGGTCGAGAGAAGGAAACATAATATCAAGATCCTTGC
TAGACAGATTTTTGGTGACAAAGGAATGGGATAAAGCATTTGAAGGCACTCGAGTCTTTAGACAGGCTCGCATTATATCAGATCACTTTCCTCTCTTACTAGAAGTTGGA
GCGTTGGAATGGGGACCTTCCCCTTTTCGTTTTTGCAACAGTTGGCTGCTCAATTCCCAGTGCTGCAGCACTATAATTAGAACTCTTGAGGCTGGATATCATCAAGGCTG
GGCTGGTTTTGTCATTTTTGCTAAGTTGAGATCAGTTAAAACCTCCCTACAACAGTGGCACGCTGACTCTTTAATAAACCAAAGAAAACAGGAAGCGGAAATTGTAGAAA
AGCTGGATAAAGAGGAGCAAGGGGCAGAATTAGAGGATCCATCCTCCATTTTACAGGACCTAAGGGCATCTTTAAAAGCTGATTTGATGGACATTTACAGAAAAAAGGAA
AGAGATTTAATCCAGAAGAGTAAGTTGAATTGGTTACATTTAGGGGACGAAAACACGAGATTCTTCCGTTTTCTCGCTGCAAAAAAGAGAAAAAATCTTATAGCAGAGCT
AGTCAATGATCAAGGCATTCCAACTAATTCTGCGGGCTGTTTTCCCGCAAATTTGGAGTGGCAAAGGGTTTCAGTTGAACAAAACAATAGGCTGTCTTCAAAATTCAGCA
GAAAAGAAATCAGATTCGCTTTAAGAGGAATGGGGAAAAATAAAGTAGGAGGTCCAGATGGGTTTACGGTGGAATTCCTTAATAAATTTTGGGATAGAATCAAAGATGAT
TTTGTTGCCCTATTCAACAAATTCCATGAGAATGAGAAGCTAAACTCCTGTGTGAAGGAGAATGTCATTTGTCTGATCAAGGAAAAAGAAGATGCCATCATGGTTAAAGA
CTTCCGGCCTATTAGCCTCACTACACTAACATATAAAGTAATTGCCAAAGTGTTAGCTGAAAGGTTGAAATTGGTTATGCCAAGCATTATAGCCCCCACCCAAAGTGCTT
TTATCGAAGGCCGACAAATCTTAGATCCAATACTCATTGCCAATGAATTGGTAGAGGAATATAGAATAAAGAAGAAAAAAGGGTGGATTCTAAAGCTTGACCTAGAAAAA
GCCTTTGACAAAGTAGATTGGGGATTTCTAGAAAAGGTTCTACAAGGCAAGAATTTCGACTCAAAATGGACCTCCTGGACTCTGGGCTGCATCAAAAACCCTAAATTCTC
CATCTTCATTAATAGAAGACCAAGAGGAAGAGTTCAAGCCTCAAGAGGAGTGAGACAAGGAGACCCTCTTTCGCCCTTCATATTCCTCCTAGTGAGTGAAGTATTGAGTA
GTCTTATTTCAAGACTTCATAAAAGCAAGAAATTCGAGGGATTTATAGTTGGAAAGAAGAAGGTACATGTTCCAATTCTTCAATTCGCAGACGATATACTCCTATTTTGT
AAGTACGACTTCGATATGTTGGAAGCCTTAAGGAAAACTATTGAGTTTTTTGAATGGTGCTCCGGACAAAAGGTGAATTGGGAAAAATCAGCCTTGTGTGGACTGAACAT
TGATGATTTAGAGGTTAAATTGACTGCTGCAAAATTAAATTGCAAGGCTGAAAAATTGCCTCTAATGTACCTTGGTTTGCCCCTAGGGGGACACCCAAAAATGATTGTCT
TTTGGCAGCCTATCCTTGACAAAATTCAAGGAAAATTGAGCAAGTGGAAAAGATATAACCTCTCAAGAGGGGGTAGACTTACTCAACGTAAGACCGTGCTTTCAAACCTC
CCCTCTTCCTACATGTCTATCTTTCTAATGCCCGAAAAGGTAGTCCTATCAATAGAAAGAGCCATGAGAAATTTTTTTTGGGAAGGCACTGGAGGAAGCAAACTGAACCA
CCTTGCCCGATGGGAAACAGTCACAAAATACCATAAGGATGGAGGCCTTAGGCTGGAAAATTTGAAAATTAGAAACTTGGCATTGCTGTCCAAGTGGGGATGGCGTTTTA
TGCAAGAATCTGAAGCTCTTTGGTGCAATGTTATCACAAGTATACACGGGGAGGACCACTTCCAATGGCATACTAACAGAAAGGAGGTTGCAAGCTTAAGAAGCCCATGG
ATAAGCATCTCAAGGCAATGGCAGAAAATAGAAGCCTTTGCAATCTTTAAAGTAGGAGATGGAAGAAGAATAACATTTTGGTCTGATCCTTGGATTGAGGATCAGCCCCT
TAAGGTCAGATTTCCAAGACTCTTCAAGCTAGCTCTCAAGACAACCGGTACAGTAGCAGATCATTGGGACCCATGTTCCTCCTCGTGGGATTTATTGTTCAAAAGACAGC
TAAAAGAGGAAGAATTAGGTGAGTTTCTGTCCCTCTCAAGCAGTGTGGCAAATAAGAGAGTTATGGTGCAGCCGGATAAAAGGATTTGGGCATTAGAAGGAAATGGTAGT
TTCTCCATAAAATCCCTAACCAATCATCTTTCAGTAGCTTCCCCAATTGACGATGAGCTGGTAAAAGGTCTATGGAAGTCCAAATGCCCTAGGAGAGTCAACATTCTGAT
TTGGACAATGCTATTTGGTTCATTAAATTGTGCTGCCACAATTCAAAGGAAACTCCCTTACCATTGCTTGTCCCCCCATATGTGTGTCCTCTGCCGCCAAGACCAAGAAG
ACATCCAGCACCTATTCTTCGGCTGCAGCTATGCTTCAAGTTGCTGGTCGAGACTATTTGGCTTCTTCGATTTGAGTTGGGTAATGGGGAGTGATTTCAAAAGCAATGTG
CTGCAAATTTTGTTGGGTCCAAGACTGAATAAGGAACCAAAGTTGTTGTGGGTCAATGCAGTCAAAGCTTTGTTATCAGAAATTTGGTTCGAAAGGAATCAACACGTATT
TAATGACAAAGCATCCAATTGGGAGGATCGGTTCGATCAGGCAAGGCTAAATGCCTCTTCCTGGTGCACTTTGTCAAAGACATTCAAGGATTATTCCATTCAAGAGCTTG
CTCTAAATTAG
mRNA sequenceShow/hide mRNA sequence
ATGGAAGTAGTGAGCTGCAAAATCAATGGAGTGTTCTTCTGTTCATGGTTTGAAAAAGGAAAATTCTCAGTAGAAGATGTTGAAGTTCGAAAATCAATCCAACTCTCAAA
ATCGCAGTTGCTTTGGAAGGTAGAACAAATCAAAGACCTCCTCAACGGTTCAAGCATTCGTTTTTTCCTAAGAGAATGGAGAGATGATTTAGGAACAATAAGAATTTTGA
AGTTAAAGACCAAGGCTGGATGGAATTTAAGATGTGTGGTTTGGCTGATTTCCAGAGGGCGATTCTATATTCATGTACCAGAAGGAGTAGCTCAACATAGATGGTCAGAG
TTCTTGAAAATGCTGAAAGGAAGTCTACAACGACTCGAAAACCACAGAGAAAGTTCAATTTTTGAGAAGGATCAACAGAAATGCTCCCACTCATCACCATTGATACAACC
AAGCTTTGCTAAAGTAGTGTCAACAAGGAAAAGTCAGTTGTTTTCCTCTGTCTGGGCATCTACAAATCTAAGTATAAGAGAGGATACTCAGATCCAAAAAGCGCTCCATG
CTTCTGCAACAATAGAACAGAGTATACTACCTCGACGAGTTGCAAAAAAGGAGAAAGACAGCAAGAAGAATTTTAAAATGAGTTCAAAGGAAACCTCAGCAAAGGACCAG
TACTATGCTCCACCCAAACAGCAATTTTGGGTTAGGAAAAACAGAGAAGTCTTCAAGGAAGACTTTAACAACCTATGGGTTCTCTCTAGGCTACTTGCCTCTGATGATTG
GAAGGAAATTGCCAAATTCTTAGAAGTCTTATACAATGTGAAAGTTAAAATCAACCCCCTATTCGATGACAAAGCTCTAATCCGAGTAGTCCAAGGGAAGATAGAAGATC
TAATTGAAGCCCTCGGCAAATGGTTTAATTATAGAAAATTTCATTTATTATTTGAAAAATGGAATCCATTACTTCACAGTAGACCCTCTGTTTTAAGAAGCTTTGGTGGA
TGGATTGCAATCAAGAACTTACCGCTGGAGTATTGGAGCGGAGAAACATTTGAAGCAATTGGGCAGCACTTTGGAGGATTGGAGGAGATAAGTATTGAAACACTGAATTT
GTTAGACGTTTCAGAGGCGAAAATTAAAGTTAGAAAGAATGTTTGTGGCTTTATTCCAGCTACAATCGAAATTAACAATGAATTTAGAGGTTCCATTCATTTACTTTTTA
AAGACATCATACCAATGAAAGATTCAAGTTCCATTATCAAGGAATTTATTGGGAGCGATTTCAAAAATCCCATTGACCAAGTACATTTGAGCAAAGTTGAAGAAGATGAA
ACCCAATCGGGGCAGGAGGAAAAGCCACCGAAAATCTTGGTCGGGAGTCCAACAGTCAGAAAAAGGTCGAAGTTGACAGCAAATGGTCACGAAGGAAAAAGCATTAATGC
AGCCGTAGAAAGCCCAATAGGCGTGCTGAAGTCGATGCGGTCTGCAAAAGTTATGAAGAAAGAGAAGAGGAAAGAGAGGGTGTTTGGCCGAATTAATGAGAATATCAAAA
ACTTCCAGACCCCTGGCATTCCGTCTAGGGATTTCCTTCCATCAGGGGCGCCCACTGGCTTCGAAACTATCCAAAAAAACCGAAAAGACTCAAAAGTTCCTTTGTCATTC
ACAGAAGAAGCCGAAAAGTCTCCCCCAATTAGCATGAAGTCCAATCGGGAAGAGTTTCCCTTCCAGACATTGCCACCCCCTCTTCAGCTCCCTCGAGCTCGCGTGTTCAC
CCCTCTTCAGCAGTCTCCAGCACACAAGGTTTCCCCGATCAAGAGCCCTTTCTTCCCAGCCGTTAGAAGGAGATCAGACCTCTTCAAACCCTTCCTAAAACACTTCAGCA
GAGGAAAAACATCGTTCCTCAACCTTTGGTTAGCTTTGACAAATCCAGATCTACTTGAGGTTAGTTGTGCAAACTCTCAAGTCCCCCAACAATCTGGTAACAGGTCAGCA
CTCCCCCTCTCTTCTTCGCAAATATTTCACCAATCAAAAATTCTAATCCCAGGATCAAATATTCTCTTCTTGCGAGGCTCCCCAAGTTCTTACCGGCCCAAGCCAAATAA
GATTCAAGACTTAGAAGAGGAATCACTCATCAGCATAAGTAGTGATGAATTGAATGAGCCAGAAGAAGATGAGAACCACTTAAGTTTGGTTTTGGATGAATCCATCGAAG
GCAACAAGAGTATAGCCGACCACCTCTCTTCAATGCTTAACGAAAAAATTCCTCATCACTTGAGATCATTTGTAGAAGATTGTGGAATTTCATTGAAAGGAAAGTTATTC
TCAAAGAAAATCGTTTCTTGGAACATTAGAGGCCTTGGAGATTATTCAAAGCAGCTGGCAGTTAAGCACCTAATTATGAAGACAAATCCAAGAATTGTCGAAGCAATCAA
GAAACTATGGAGTTCAAGAGACATCAGATGGTCATTTGTGGAAGCCTATGGCAGATCAGGAGGGTTATTAATCATGTGGGATGAAAGTAAAATATCAGTCATTGAAACAC
TCAAAGGAGGCTACACTCTTTCTGTTAAATGTAAGACCTTACGCAGAAAAGTTTGTTGGGTAACAAATGTATACAGACCAACCGATTATAAAGAAAGAAGGCGCATCTGG
CCGGAGTTACAAGCTTTGGCAGCATATTGCACAAATGCCTGGTGCCTGGGTGGGGACTTCAACATCACTAGAGCAATCCATGAAAGAGTTCCAACTGGAAGATTTAGAGG
AATGAAGAAGTTTAACAAATTCATAGAAGAGGCACATTTAATGGAAATTCCTATGAGCAATGGGCGTTTCACATGGTCGAGAGAAGGAAACATAATATCAAGATCCTTGC
TAGACAGATTTTTGGTGACAAAGGAATGGGATAAAGCATTTGAAGGCACTCGAGTCTTTAGACAGGCTCGCATTATATCAGATCACTTTCCTCTCTTACTAGAAGTTGGA
GCGTTGGAATGGGGACCTTCCCCTTTTCGTTTTTGCAACAGTTGGCTGCTCAATTCCCAGTGCTGCAGCACTATAATTAGAACTCTTGAGGCTGGATATCATCAAGGCTG
GGCTGGTTTTGTCATTTTTGCTAAGTTGAGATCAGTTAAAACCTCCCTACAACAGTGGCACGCTGACTCTTTAATAAACCAAAGAAAACAGGAAGCGGAAATTGTAGAAA
AGCTGGATAAAGAGGAGCAAGGGGCAGAATTAGAGGATCCATCCTCCATTTTACAGGACCTAAGGGCATCTTTAAAAGCTGATTTGATGGACATTTACAGAAAAAAGGAA
AGAGATTTAATCCAGAAGAGTAAGTTGAATTGGTTACATTTAGGGGACGAAAACACGAGATTCTTCCGTTTTCTCGCTGCAAAAAAGAGAAAAAATCTTATAGCAGAGCT
AGTCAATGATCAAGGCATTCCAACTAATTCTGCGGGCTGTTTTCCCGCAAATTTGGAGTGGCAAAGGGTTTCAGTTGAACAAAACAATAGGCTGTCTTCAAAATTCAGCA
GAAAAGAAATCAGATTCGCTTTAAGAGGAATGGGGAAAAATAAAGTAGGAGGTCCAGATGGGTTTACGGTGGAATTCCTTAATAAATTTTGGGATAGAATCAAAGATGAT
TTTGTTGCCCTATTCAACAAATTCCATGAGAATGAGAAGCTAAACTCCTGTGTGAAGGAGAATGTCATTTGTCTGATCAAGGAAAAAGAAGATGCCATCATGGTTAAAGA
CTTCCGGCCTATTAGCCTCACTACACTAACATATAAAGTAATTGCCAAAGTGTTAGCTGAAAGGTTGAAATTGGTTATGCCAAGCATTATAGCCCCCACCCAAAGTGCTT
TTATCGAAGGCCGACAAATCTTAGATCCAATACTCATTGCCAATGAATTGGTAGAGGAATATAGAATAAAGAAGAAAAAAGGGTGGATTCTAAAGCTTGACCTAGAAAAA
GCCTTTGACAAAGTAGATTGGGGATTTCTAGAAAAGGTTCTACAAGGCAAGAATTTCGACTCAAAATGGACCTCCTGGACTCTGGGCTGCATCAAAAACCCTAAATTCTC
CATCTTCATTAATAGAAGACCAAGAGGAAGAGTTCAAGCCTCAAGAGGAGTGAGACAAGGAGACCCTCTTTCGCCCTTCATATTCCTCCTAGTGAGTGAAGTATTGAGTA
GTCTTATTTCAAGACTTCATAAAAGCAAGAAATTCGAGGGATTTATAGTTGGAAAGAAGAAGGTACATGTTCCAATTCTTCAATTCGCAGACGATATACTCCTATTTTGT
AAGTACGACTTCGATATGTTGGAAGCCTTAAGGAAAACTATTGAGTTTTTTGAATGGTGCTCCGGACAAAAGGTGAATTGGGAAAAATCAGCCTTGTGTGGACTGAACAT
TGATGATTTAGAGGTTAAATTGACTGCTGCAAAATTAAATTGCAAGGCTGAAAAATTGCCTCTAATGTACCTTGGTTTGCCCCTAGGGGGACACCCAAAAATGATTGTCT
TTTGGCAGCCTATCCTTGACAAAATTCAAGGAAAATTGAGCAAGTGGAAAAGATATAACCTCTCAAGAGGGGGTAGACTTACTCAACGTAAGACCGTGCTTTCAAACCTC
CCCTCTTCCTACATGTCTATCTTTCTAATGCCCGAAAAGGTAGTCCTATCAATAGAAAGAGCCATGAGAAATTTTTTTTGGGAAGGCACTGGAGGAAGCAAACTGAACCA
CCTTGCCCGATGGGAAACAGTCACAAAATACCATAAGGATGGAGGCCTTAGGCTGGAAAATTTGAAAATTAGAAACTTGGCATTGCTGTCCAAGTGGGGATGGCGTTTTA
TGCAAGAATCTGAAGCTCTTTGGTGCAATGTTATCACAAGTATACACGGGGAGGACCACTTCCAATGGCATACTAACAGAAAGGAGGTTGCAAGCTTAAGAAGCCCATGG
ATAAGCATCTCAAGGCAATGGCAGAAAATAGAAGCCTTTGCAATCTTTAAAGTAGGAGATGGAAGAAGAATAACATTTTGGTCTGATCCTTGGATTGAGGATCAGCCCCT
TAAGGTCAGATTTCCAAGACTCTTCAAGCTAGCTCTCAAGACAACCGGTACAGTAGCAGATCATTGGGACCCATGTTCCTCCTCGTGGGATTTATTGTTCAAAAGACAGC
TAAAAGAGGAAGAATTAGGTGAGTTTCTGTCCCTCTCAAGCAGTGTGGCAAATAAGAGAGTTATGGTGCAGCCGGATAAAAGGATTTGGGCATTAGAAGGAAATGGTAGT
TTCTCCATAAAATCCCTAACCAATCATCTTTCAGTAGCTTCCCCAATTGACGATGAGCTGGTAAAAGGTCTATGGAAGTCCAAATGCCCTAGGAGAGTCAACATTCTGAT
TTGGACAATGCTATTTGGTTCATTAAATTGTGCTGCCACAATTCAAAGGAAACTCCCTTACCATTGCTTGTCCCCCCATATGTGTGTCCTCTGCCGCCAAGACCAAGAAG
ACATCCAGCACCTATTCTTCGGCTGCAGCTATGCTTCAAGTTGCTGGTCGAGACTATTTGGCTTCTTCGATTTGAGTTGGGTAATGGGGAGTGATTTCAAAAGCAATGTG
CTGCAAATTTTGTTGGGTCCAAGACTGAATAAGGAACCAAAGTTGTTGTGGGTCAATGCAGTCAAAGCTTTGTTATCAGAAATTTGGTTCGAAAGGAATCAACACGTATT
TAATGACAAAGCATCCAATTGGGAGGATCGGTTCGATCAGGCAAGGCTAAATGCCTCTTCCTGGTGCACTTTGTCAAAGACATTCAAGGATTATTCCATTCAAGAGCTTG
CTCTAAATTAG
Protein sequenceShow/hide protein sequence
MEVVSCKINGVFFCSWFEKGKFSVEDVEVRKSIQLSKSQLLWKVEQIKDLLNGSSIRFFLREWRDDLGTIRILKLKTKAGWNLRCVVWLISRGRFYIHVPEGVAQHRWSE
FLKMLKGSLQRLENHRESSIFEKDQQKCSHSSPLIQPSFAKVVSTRKSQLFSSVWASTNLSIREDTQIQKALHASATIEQSILPRRVAKKEKDSKKNFKMSSKETSAKDQ
YYAPPKQQFWVRKNREVFKEDFNNLWVLSRLLASDDWKEIAKFLEVLYNVKVKINPLFDDKALIRVVQGKIEDLIEALGKWFNYRKFHLLFEKWNPLLHSRPSVLRSFGG
WIAIKNLPLEYWSGETFEAIGQHFGGLEEISIETLNLLDVSEAKIKVRKNVCGFIPATIEINNEFRGSIHLLFKDIIPMKDSSSIIKEFIGSDFKNPIDQVHLSKVEEDE
TQSGQEEKPPKILVGSPTVRKRSKLTANGHEGKSINAAVESPIGVLKSMRSAKVMKKEKRKERVFGRINENIKNFQTPGIPSRDFLPSGAPTGFETIQKNRKDSKVPLSF
TEEAEKSPPISMKSNREEFPFQTLPPPLQLPRARVFTPLQQSPAHKVSPIKSPFFPAVRRRSDLFKPFLKHFSRGKTSFLNLWLALTNPDLLEVSCANSQVPQQSGNRSA
LPLSSSQIFHQSKILIPGSNILFLRGSPSSYRPKPNKIQDLEEESLISISSDELNEPEEDENHLSLVLDESIEGNKSIADHLSSMLNEKIPHHLRSFVEDCGISLKGKLF
SKKIVSWNIRGLGDYSKQLAVKHLIMKTNPRIVEAIKKLWSSRDIRWSFVEAYGRSGGLLIMWDESKISVIETLKGGYTLSVKCKTLRRKVCWVTNVYRPTDYKERRRIW
PELQALAAYCTNAWCLGGDFNITRAIHERVPTGRFRGMKKFNKFIEEAHLMEIPMSNGRFTWSREGNIISRSLLDRFLVTKEWDKAFEGTRVFRQARIISDHFPLLLEVG
ALEWGPSPFRFCNSWLLNSQCCSTIIRTLEAGYHQGWAGFVIFAKLRSVKTSLQQWHADSLINQRKQEAEIVEKLDKEEQGAELEDPSSILQDLRASLKADLMDIYRKKE
RDLIQKSKLNWLHLGDENTRFFRFLAAKKRKNLIAELVNDQGIPTNSAGCFPANLEWQRVSVEQNNRLSSKFSRKEIRFALRGMGKNKVGGPDGFTVEFLNKFWDRIKDD
FVALFNKFHENEKLNSCVKENVICLIKEKEDAIMVKDFRPISLTTLTYKVIAKVLAERLKLVMPSIIAPTQSAFIEGRQILDPILIANELVEEYRIKKKKGWILKLDLEK
AFDKVDWGFLEKVLQGKNFDSKWTSWTLGCIKNPKFSIFINRRPRGRVQASRGVRQGDPLSPFIFLLVSEVLSSLISRLHKSKKFEGFIVGKKKVHVPILQFADDILLFC
KYDFDMLEALRKTIEFFEWCSGQKVNWEKSALCGLNIDDLEVKLTAAKLNCKAEKLPLMYLGLPLGGHPKMIVFWQPILDKIQGKLSKWKRYNLSRGGRLTQRKTVLSNL
PSSYMSIFLMPEKVVLSIERAMRNFFWEGTGGSKLNHLARWETVTKYHKDGGLRLENLKIRNLALLSKWGWRFMQESEALWCNVITSIHGEDHFQWHTNRKEVASLRSPW
ISISRQWQKIEAFAIFKVGDGRRITFWSDPWIEDQPLKVRFPRLFKLALKTTGTVADHWDPCSSSWDLLFKRQLKEEELGEFLSLSSSVANKRVMVQPDKRIWALEGNGS
FSIKSLTNHLSVASPIDDELVKGLWKSKCPRRVNILIWTMLFGSLNCAATIQRKLPYHCLSPHMCVLCRQDQEDIQHLFFGCSYASSCWSRLFGFFDLSWVMGSDFKSNV
LQILLGPRLNKEPKLLWVNAVKALLSEIWFERNQHVFNDKASNWEDRFDQARLNASSWCTLSKTFKDYSIQELALN