; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0010466 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0010466
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionTy3/gypsy retrotransposon protein
Genome locationchr9:47456674..47465401
RNA-Seq ExpressionLag0010466
SyntenyLag0010466
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR001584 - Integrase, catalytic core
IPR001969 - Aspartic peptidase, active site
IPR005162 - Retrotransposon gag domain
IPR012337 - Ribonuclease H-like superfamily
IPR016197 - Chromo-like domain superfamily
IPR021109 - Aspartic peptidase domain superfamily
IPR036397 - Ribonuclease H superfamily
IPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
IPR041588 - Integrase zinc-binding domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
AFK13856.1 Ty3/gypsy retrotransposon protein [Beta vulgaris subsp. vulgaris]0.0e+0048.88Show/hide
Query:  LEGRMLELSKEMAGMRQEISGIRGSISTEVEAAVSKSVGEAMKEG---------MQAILARIEGLKSTDQAVDNRKTKGKNVIGSEEVAVEVGEGSNTVL
        LEGR+    + + GM       +    TEV +A++K   E  K G         +   L   EGL+  ++A+D R        G  EV    G G     
Subjt:  LEGRMLELSKEMAGMRQEISGIRGSISTEVEAAVSKSVGEAMKEG---------MQAILARIEGLKSTDQAVDNRKTKGKNVIGSEEVAVEVGEGSNTVL

Query:  QSHLTKSLAPVFDLRLRKLEIPVFGGASPLEWLHQVERYFTVNRLTEAEKLEAAVLCLEGVALKWHYYENKLGSFSNWEDFRRLLLRRFLAPKEGTLHER
        ++       P  + R +KL++P F    P  W+ + ER+F    LT+AEK+EAAV+ +EG AL+W+ +ENK   F NWE  +  +L +F     G+LHE+
Subjt:  QSHLTKSLAPVFDLRLRKLEIPVFGGASPLEWLHQVERYFTVNRLTEAEKLEAAVLCLEGVALKWHYYENKLGSFSNWEDFRRLLLRRFLAPKEGTLHER

Query:  FFALKQESTVAEYREKFEEFAAAVDNLDGDSLEGKFVDGLVEDIKVEMRICMPVGLRAKMEMAQKIEDKLLHLENKQTGVTLSPKVVGQSVSNYKPISY-
        + +  Q ++V EYR KF E AA +D +  + L GKF+ GL  +++ E+R+  P  L   ME+A K+E++      ++TG    P+    S+ N  P S  
Subjt:  FFALKQESTVAEYREKFEEFAAAVDNLDGDSLEGKFVDGLVEDIKVEMRICMPVGLRAKMEMAQKIEDKLLHLENKQTGVTLSPKVVGQSVSNYKPISY-

Query:  ----------GAGASARTISVNPSKLGERVRKETPGKSFEASSSSNPYKRLTESEYQNRRDRGLCFKCEEKFSPGHRCKKKELQVLVVHDGESDEAEGND
                  G+ AS ++ ++N +     V    P     +S      +RLTE E Q +R +GLCFKC+EK+  GH+C++KEL VL + D E DE EG  
Subjt:  ----------GAGASARTISVNPSKLGERVRKETPGKSFEASSSSNPYKRLTESEYQNRRDRGLCFKCEEKFSPGHRCKKKELQVLVVHDGESDEAEGND

Query:  GVAMQTVEMADNKGEDGPKEHEAELALNSMVGLGSPKTLKVKGLIRSREVVVLIDSGATHNFIATTLVKELQLPTGQTEGYGVVLGTEGSSIKGTEICRE
          A+   E   +  E+ P     E++LNS++GL +PKT+K+ GLI + EVVV+ID GATHNF++   + +L +P  ++E +GV LG +G +++GT ICR 
Subjt:  GVAMQTVEMADNKGEDGPKEHEAELALNSMVGLGSPKTLKVKGLIRSREVVVLIDSGATHNFIATTLVKELQLPTGQTEGYGVVLGTEGSSIKGTEICRE

Query:  VVLTL-GPITIMQDFLPLPLGSADVILGVQWLSTLGDVTTNHQNLLMKFMLGKSAVVLQGDPSLVRSQVTLKTLIKQLKLEKQGFLVELNALQ-GGRMEI
        V L L G + +++DFLPL LG++DVILGVQWL TLG V +N +   M F LG     L GDP+L RS+V+LK +++ L+ E  G  +E N ++ GG   I
Subjt:  VVLTL-GPITIMQDFLPLPLGSADVILGVQWLSTLGDVTTNHQNLLMKFMLGKSAVVLQGDPSLVRSQVTLKTLIKQLKLEKQGFLVELNALQ-GGRMEI

Query:  SGSEIEGWEQIPDCIRSVLTRHFSVFQPLPGLPPTRMDDHRIELLPGTAPVSVRPYRYPQCQKDEIEKLVEEMLRAGVIQPSNSAFSSPVLLVKKKDGSW
          S++E  ++IP  ++ ++ R   VF+   GLPP R  +H I L  G+ PV VRPYRYPQ QKDEIE+L++EML AG+IQPS S FSSPV+LVKKKDGSW
Subjt:  SGSEIEGWEQIPDCIRSVLTRHFSVFQPLPGLPPTRMDDHRIELLPGTAPVSVRPYRYPQCQKDEIEKLVEEMLRAGVIQPSNSAFSSPVLLVKKKDGSW

Query:  RFCVDYRALNKVTVPDKFPIPMVDELLDELHGAIVFSKIDLKAGYHQIRLKAEDIPKSAFRTHEGHYEFLVMPFGLTNAPSTFQALMNKVLRPFLRKFAL
        RFCVDYRALNK TVPDK+PIP++DELLDELHGA VFSK+DL+AGYHQI ++ ED  K+AFRTHEGHYEFLVMPFGLTNAP+TFQ+LMN+V RPFLR+F L
Subjt:  RFCVDYRALNKVTVPDKFPIPMVDELLDELHGAIVFSKIDLKAGYHQIRLKAEDIPKSAFRTHEGHYEFLVMPFGLTNAPSTFQALMNKVLRPFLRKFAL

Query:  VFFDDILIYSSSLEEHAGHLAKIMQVLHENQLVANRKKCQFALKQIEYLGHIISVDGVSADPSKIAAMEQWPIPRNIKELRGFLGLTGYYRKFVQDYGLI
        VF DDILIYS S EEH GHL  ++ +L ++ L  N+KKC+F  +++ YLGH+IS  GV+ D  K+ A+ +W +P+N++ELRGFLGLTGYYRKFV +Y  I
Subjt:  VFFDDILIYSSSLEEHAGHLAKIMQVLHENQLVANRKKCQFALKQIEYLGHIISVDGVSADPSKIAAMEQWPIPRNIKELRGFLGLTGYYRKFVQDYGLI

Query:  AHPLTQQLKKDSFAWNDDATEAFFKLKSAMVTVPVLRLPDFSKEFVVEADASGVGVGAVLMQESRPIAYFSHALPPLHRAKAVYERELMAIVMAIQKWRP
        A PLT+QLKKD+F W+  ATEAF +LKSAMV+ PVL +P+F   FVVE DASG G+GAVLMQ++RPIAY+S  L    + K+VYE+ELMAI  A+QKW+ 
Subjt:  AHPLTQQLKKDSFAWNDDATEAFFKLKSAMVTVPVLRLPDFSKEFVVEADASGVGVGAVLMQESRPIAYFSHALPPLHRAKAVYERELMAIVMAIQKWRP

Query:  YLLGRRFTVRTDQSSLKFILDQRLISGEYQKLMTKLLGYDFQVVYKPGVENTVADALSR-LPSALEFAAVSLVGGLNTSIVIEQQRLDPALAAIRSKITE
        YLLGR F VRTDQ SL++I  QR I  E+QK ++KL+GYDF++ YKPG+ N VADALSR     +E  A+  V G+  + +  +   D  L  +R ++ E
Subjt:  YLLGRRFTVRTDQSSLKFILDQRLISGEYQKLMTKLLGYDFQVVYKPGVENTVADALSR-LPSALEFAAVSLVGGLNTSIVIEQQRLDPALAAIRSKITE

Query:  GEEVPVGYSLRGNVLYYRGKLVLPEDSPTIPLLLEAFHSSPVGGHGGTLKTFQRLAKEVYWKGMKSRVKLFVSECTVCQQAKYLALAPAGLLQPLPIPER
        G   P  ++L    L ++G+ V+P  S  IP LL  +H +P+GGH G LKT+ RLA E YW+GM+  V  +V +C +CQQ K     P GLLQPLPIP  
Subjt:  GEEVPVGYSLRGNVLYYRGKLVLPEDSPTIPLLLEAFHSSPVGGHGGTLKTFQRLAKEVYWKGMKSRVKLFVSECTVCQQAKYLALAPAGLLQPLPIPER

Query:  IWEDISMDFVVGLPRAETFDSVFVVVDRLSKYAHFIPIRHPFTATGMAQTFIKEVVRLHGIPRSIVSDRDPVFTSMFWEELFRCQGTQLKRSTAYHPQTD
        +WEDISMDF+ GLP ++  D++ V+VDRLSKYAHF+ +RHPFTA  +A  F+KEVVRLHG P SIVSDRD +F S+FW+ELFR  GT LKRS+AYHPQTD
Subjt:  IWEDISMDFVVGLPRAETFDSVFVVVDRLSKYAHFIPIRHPFTATGMAQTFIKEVVRLHGIPRSIVSDRDPVFTSMFWEELFRCQGTQLKRSTAYHPQTD

Query:  GQTEVVNRGMETYLRCFAMHCPLKWVRWLCWAEYSYNTSFHSGLKASPFEVVYGRSPPSVLPYHSGESPVSEVDHQLKERDRMLAMIRERLLDAQQYMAR
        GQTE+VNR +ETYLRCF    P  W +WL WAE+SYNTS H+  K SPF+V+YGR PP V+    G++ V  ++  L++RD ++  ++  L+ AQQ M  
Subjt:  GQTEVVNRGMETYLRCFAMHCPLKWVRWLCWAEYSYNTSFHSGLKASPFEVVYGRSPPSVLPYHSGESPVSEVDHQLKERDRMLAMIRERLLDAQQYMAR

Query:  MVNEKRRDVEFDVGDWAYLKLRPYRQGTLSKTLNAKLAPRYVGPFQVEARIGAVAYRLILPEHVRVHPVFHVSQLRKALGSSFPVVP-FPQNVSSDMLFK
          +  R +VEF VGD  +L+L+PYRQ +L+K    KLAPR+ GPF V  RIGA AY+L LP   ++HPVFHVS L+K +G++ PV+P  P ++  DM   
Subjt:  MVNEKRRDVEFDVGDWAYLKLRPYRQGTLSKTLNAKLAPRYVGPFQVEARIGAVAYRLILPEHVRVHPVFHVSQLRKALGSSFPVVP-FPQNVSSDMLFK

Query:  VVPVELLGLRQSVE-DSTKLEVLIRWAESEPSEATWEDAVQLATQFPDFHLEDKVALWGPG
        V P ELL +RQ  +   T  E LI+W      EATWED   +  +FP FHLEDKV +WG G
Subjt:  VVPVELLGLRQSVE-DSTKLEVLIRWAESEPSEATWEDAVQLATQFPDFHLEDKVALWGPG

KAA0039975.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]0.0e+0047.57Show/hide
Query:  METRVENLEGRMLELSKEMAGMRQEISGIRGSISTEVEAAVSKSVGEAMKEGMQAILARI-----------EGLKSTDQAVDNRKTKGKNVIGSEEVAVE
        ++TR+E    R+  + +E+AGM++E+S +     +  E A S  +     E  Q +L  I            G+ +   A +  K KGK    S     E
Subjt:  METRVENLEGRMLELSKEMAGMRQEISGIRGSISTEVEAAVSKSVGEAMKEGMQAILARI-----------EGLKSTDQAVDNRKTKGKNVIGSEEVAVE

Query:  VGEGSNTVLQSHLTKSLAPVFDL-RLRKLEIPVFGGASPLEWLHQVERYFTVNRLTEAEKLEAAVLCLEGVALKWHYYENKLGSFSNWEDFRRLLLRRFL
                     +       D  + +K+E+PVF G  P  WL + ERYF +++LT++EK+  + +  +G AL W   + +   F++W + +  LL RF 
Subjt:  VGEGSNTVLQSHLTKSLAPVFDL-RLRKLEIPVFGGASPLEWLHQVERYFTVNRLTEAEKLEAAVLCLEGVALKWHYYENKLGSFSNWEDFRRLLLRRFL

Query:  APKEGTLHERFFALKQESTVAEYREKFEEFAAAVDNLDGDSLEGKFVDGLVEDIKVEMRICMPVGLRAKMEMAQKIEDK-LLHLENKQTGVTLSPKVVGQ
        + K+GT+  +F  +KQE TV EY   F++  A V++L    +   F++GL+  ++ E+  C P  L   ME+AQ +E++ +   E K  G +   K+ GQ
Subjt:  APKEGTLHERFFALKQESTVAEYREKFEEFAAAVDNLDGDSLEGKFVDGLVEDIKVEMRICMPVGLRAKMEMAQKIEDK-LLHLENKQTGVTLSPKVVGQ

Query:  SVSNYKPISYGAGASARTISVNPSKLGERVRKETPGKSFEASSSSNPYKRLTESEYQNRRDRGLCFKCEEKFSPGHRCKKK---ELQVLVVHDGE-----
        +    K  + G     +  +V P +    +R   P ++    S    YKRL ++E+Q R+++GLCF+C EK+S  HRC+ K   EL++ VV +G      
Subjt:  SVSNYKPISYGAGASARTISVNPSKLGERVRKETPGKSFEASSSSNPYKRLTESEYQNRRDRGLCFKCEEKFSPGHRCKKK---ELQVLVVHDGE-----

Query:  -SDEAEGNDGVAMQTVEMADNKGEDGPKEHEAELALNSMVGLGSPKTLKVKGLIRSREVVVLIDSGATHNFIATTLVKELQLPTGQTEGYGVVLGTEGSS
          DE E  +   M+  E               EL++NS+VGL  P T+KV+G +   EVV+LID GATHNF++  LV++L LP  +T  YGV+LG+ G++
Subjt:  -SDEAEGNDGVAMQTVEMADNKGEDGPKEHEAELALNSMVGLGSPKTLKVKGLIRSREVVVLIDSGATHNFIATTLVKELQLPTGQTEGYGVVLGTEGSS

Query:  IKGTEICREVVLTLGPITIMQDFLPLPLGSADVILGVQWLSTLGDVTTNHQNLLMKFMLGKSAVVLQGDPSLVRSQVTLKTLIKQLKLEKQGFLVELNAL
        ++G  +C ++ + L    I++DFLPL LG  DVILG+QWL +LG  T + +NL + FM     V ++GDPSL +++++LK +IK  + +  GFL+E  +L
Subjt:  IKGTEICREVVLTLGPITIMQDFLPLPLGSADVILGVQWLSTLGDVTTNHQNLLMKFMLGKSAVVLQGDPSLVRSQVTLKTLIKQLKLEKQGFLVELNAL

Query:  QGGRME-----ISGSEIEGWEQIPDCIRSVLTRHFSVFQPLPGLPPTRMDDHRIELLPGTAPVSVRPYRYPQCQKDEIEKLVEEMLRAGVIQPSNSAFSS
        Q   +E     +  +E+E   +    I SV+ +   VF+    LPP R  +H I L  GT P++VRPYRY   QK+E+EKLV+EML +GVI+PS S +SS
Subjt:  QGGRME-----ISGSEIEGWEQIPDCIRSVLTRHFSVFQPLPGLPPTRMDDHRIELLPGTAPVSVRPYRYPQCQKDEIEKLVEEMLRAGVIQPSNSAFSS

Query:  PVLLVKKKDGSWRFCVDYRALNKVTVPDKFPIPMVDELLDELHGAIVFSKIDLKAGYHQIRLKAEDIPKSAFRTHEGHYEFLVMPFGLTNAPSTFQALMN
        PVLLVKKKDGSWRFCVDYRA+N  T+PDKFPIP+V+EL DEL GA +FSKIDLK+GYHQIR+  EDI K+AFRTHEGHYEFLVMPFGLTNAP+TFQALMN
Subjt:  PVLLVKKKDGSWRFCVDYRALNKVTVPDKFPIPMVDELLDELHGAIVFSKIDLKAGYHQIRLKAEDIPKSAFRTHEGHYEFLVMPFGLTNAPSTFQALMN

Query:  KVLRPFLRKFALVFFDDILIYSSSLEEHAGHLAKIMQVLHENQLVANRKKCQFALKQIEYLGHIISVDGVSADPSKIAAMEQWPIPRNIKELRGFLGLTG
         + +PFLRKF LVFFDDIL+YS + EEH  H+ K++ VL  ++L AN+KKC FA ++IEYLGH+IS +GV+ DP KI A+  WP P N+KE RGFLGLTG
Subjt:  KVLRPFLRKFALVFFDDILIYSSSLEEHAGHLAKIMQVLHENQLVANRKKCQFALKQIEYLGHIISVDGVSADPSKIAAMEQWPIPRNIKELRGFLGLTG

Query:  YYRKFVQDYGLIAHPLTQQLKKDSFAWNDDATEAFFKLKSAMVTVPVLRLPDFSKEFVVEADASGVGVGAVLMQESRPIAYFSHALPPLHRAKAVYEREL
        YYR+FV+ YG IA PLTQ LKK  F W ++AT AF +LKSAM+++PVL LPDFSK+F +EADASG GVGAVL+Q+ RP+AY+SH L    R++ VYEREL
Subjt:  YYRKFVQDYGLIAHPLTQQLKKDSFAWNDDATEAFFKLKSAMVTVPVLRLPDFSKEFVVEADASGVGVGAVLMQESRPIAYFSHALPPLHRAKAVYEREL

Query:  MAIVMAIQKWRPYLLGRRFTVRTDQSSLKFILDQRLISGEYQKLMTKLLGYDFQVVYKPGVENTVADALSRLPSALEFAAVSLVGGLNTSIVIEQQRLDP
        MA+V+A+Q+WRPYLL  +F V+TDQ +LKF+LDQR+I  +YQK + KLLGY F+VVYKPGVEN  ADALSR P  ++   +S+   ++  IV ++   DP
Subjt:  MAIVMAIQKWRPYLLGRRFTVRTDQSSLKFILDQRLISGEYQKLMTKLLGYDFQVVYKPGVENTVADALSRLPSALEFAAVSLVGGLNTSIVIEQQRLDP

Query:  ALAAIRSKITEGEEVPVG-YSLRGNVLYYRGKLVLPEDSPTIPLLLEAFHSSPVGGHGGTLKTFQRLAKEVYWKGMKSRVKLFVSECTVCQQAKYLALAP
            I  +I +GEE+ V  YSL+  +L ++ +LV+ + S  IP++L+ FH+S +GGH G L+T++R+A E+YW GMK+ +K    EC VCQ++K LAL+P
Subjt:  ALAAIRSKITEGEEVPVG-YSLRGNVLYYRGKLVLPEDSPTIPLLLEAFHSSPVGGHGGTLKTFQRLAKEVYWKGMKSRVKLFVSECTVCQQAKYLALAP

Query:  AGLLQPLPIPERIWEDISMDFVVGLPRAETFDSVFVVVDRLSKYAHFIPIRHPFTATGMAQTFIKEVVRLHGIPRSIVSDRDPVFTSMFWEELFRCQGTQ
        AGLL PL IP+ IW DISMDFV GLP++  ++ + VVVDRLSKY HF+P++HP+TA  +A+ F+KE+VRLHG P SIVSDRD VF S FW ELFR  GT+
Subjt:  AGLLQPLPIPERIWEDISMDFVVGLPRAETFDSVFVVVDRLSKYAHFIPIRHPFTATGMAQTFIKEVVRLHGIPRSIVSDRDPVFTSMFWEELFRCQGTQ

Query:  LKRSTAYHPQTDGQTEVVNRGMETYLRCFAMHCPLKWVRWLCWAEYSYNTSFHSGLKASPFEVVYGRSPPSVLPYHSGESPVSEVDHQLKERDRMLAMIR
        L +STAYHPQ+DGQTEVVNRG+ETYLRCF    P +WVRWL W EY YNT+FH  +  +PF+VVYGR PP++L Y +  S  S V+  L ERD +L  +R
Subjt:  LKRSTAYHPQTDGQTEVVNRGMETYLRCFAMHCPLKWVRWLCWAEYSYNTSFHSGLKASPFEVVYGRSPPSVLPYHSGESPVSEVDHQLKERDRMLAMIR

Query:  ERLLDAQQYMARMVNEKRRDVEFDVGDWAYLKLRPYRQGTLSKTLNAKLAPRYVGPFQVEARIGAVAYRLILPEHVRVHPVFHVSQLRKALGSSFPVVPF
        E L  AQ+ M    ++KRR VEF VG++ +L++RPYRQ T+    N KLAPR+ GP+ +  RIG VAYRL LPE+ R+HPVFHVSQLRK +G      P 
Subjt:  ERLLDAQQYMARMVNEKRRDVEFDVGDWAYLKLRPYRQGTLSKTLNAKLAPRYVGPFQVEARIGAVAYRLILPEHVRVHPVFHVSQLRKALGSSFPVVPF

Query:  PQNVSSDMLFKVVPVELLGLRQSVEDSTKLEVLIRWAESEPSEATWEDAVQLATQFPDFHLEDKVALWGPGSDRPPIKYVYRRKK
         Q +  + ++K  P E +  R+    + + EVL+ W      EA+WE   ++  ++P+FHLEDKV L G  + RP IK+VY R+K
Subjt:  PQNVSSDMLFKVVPVELLGLRQSVEDSTKLEVLIRWAESEPSEATWEDAVQLATQFPDFHLEDKVALWGPGSDRPPIKYVYRRKK

KAA0040118.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]0.0e+0047.18Show/hide
Query:  METRVENLEGRMLELSKEMAGMRQEISGIRGSISTEVEAAVSKSVGEAMKEGMQAIL-----------ARIEGLKSTDQAVDNRKTKGKNVIGSEEVAVE
        ++TR+E    R+  + +E+AGM++E+S +     +  E A S  +     E  Q +L           + +  L++   A +  K KGK    S   A++
Subjt:  METRVENLEGRMLELSKEMAGMRQEISGIRGSISTEVEAAVSKSVGEAMKEGMQAIL-----------ARIEGLKSTDQAVDNRKTKGKNVIGSEEVAVE

Query:  VGEGSNTVLQSHLTKSLAPVFDL-RLRKLEIPVFGGASPLEWLHQVERYFTVNRLTEAEKLEAAVLCLEGVALKWHYYENKLGSFSNWEDFRRLLLRRFL
         G                   D  + +K+E+PVF G  P  WL + ERYF +++LT++EK+  + +  +G AL W   + +   F+NW + +  LL RF 
Subjt:  VGEGSNTVLQSHLTKSLAPVFDL-RLRKLEIPVFGGASPLEWLHQVERYFTVNRLTEAEKLEAAVLCLEGVALKWHYYENKLGSFSNWEDFRRLLLRRFL

Query:  APKEGTLHERFFALKQESTVAEYREKFEEFAAAVDNLDGDSLEGKFVDGLVEDIKVEMRICMPVGLRAKMEMAQKIEDK-LLHLENKQTG-----VTLSP
        + K+GTL  +F  +KQE TV EY   F++  A V++L    +   F++GL+  ++ E+  C P  L   ME AQ +E++ +  +E K +G     +T + 
Subjt:  APKEGTLHERFFALKQESTVAEYREKFEEFAAAVDNLDGDSLEGKFVDGLVEDIKVEMRICMPVGLRAKMEMAQKIEDK-LLHLENKQTG-----VTLSP

Query:  KVVGQSVSNYKPISYGAGAS---------ARTISVNPSKLGERVRKETPGKSFEASSSSNPYKRLTESEYQNRRDRGLCFKCEEKFSPGHRCK---KKEL
          VG++       S GA A           RTI++  S   E  R+ T             YKRL ++E+Q R+++GLCF+C EK+S  H+C+   ++EL
Subjt:  KVVGQSVSNYKPISYGAGAS---------ARTISVNPSKLGERVRKETPGKSFEASSSSNPYKRLTESEYQNRRDRGLCFKCEEKFSPGHRCK---KKEL

Query:  QVLVVHDGESD---EAEGNDGVAMQTVEMADNKGEDGPKEHEAELALNSMVGLGSPKTLKVKGLIRSREVVVLIDSGATHNFIATTLVKELQLPTGQTEG
        ++ VV D   +     E N+   +  +E+ ++           EL++NS+VGL  P T+KV+G +   EVVVLID GATHNF++  LVK+L LP  +T  
Subjt:  QVLVVHDGESD---EAEGNDGVAMQTVEMADNKGEDGPKEHEAELALNSMVGLGSPKTLKVKGLIRSREVVVLIDSGATHNFIATTLVKELQLPTGQTEG

Query:  YGVVLGTEGSSIKGTEICREVVLTLGPITIMQDFLPLPLGSADVILGVQWLSTLGDVTTNHQNLLMKFMLGKSAVVLQGDPSLVRSQVTLKTLIKQLKLE
        YGV+LG+ G++++G  +C ++ + +G   I++DFLPL LG  DVILG+QWL +LG  T + +NL M F+     V ++GDPSL +++++LK LIK  + +
Subjt:  YGVVLGTEGSSIKGTEICREVVLTLGPITIMQDFLPLPLGSADVILGVQWLSTLGDVTTNHQNLLMKFMLGKSAVVLQGDPSLVRSQVTLKTLIKQLKLE

Query:  KQGFLVELNALQGGRMEISGSEIEGWEQIPDC-----IRSVLTRHFSVFQPLPGLPPTRMDDHRIELLPGTAPVSVRPYRYPQCQKDEIEKLVEEMLRAG
          G+L+E  +LQ   +E +   +   E +        I SV+ +   VF     LPP R  +H I L  GT P++VRPYRY   QK+E+EKLV EML +G
Subjt:  KQGFLVELNALQGGRMEISGSEIEGWEQIPDC-----IRSVLTRHFSVFQPLPGLPPTRMDDHRIELLPGTAPVSVRPYRYPQCQKDEIEKLVEEMLRAG

Query:  VIQPSNSAFSSPVLLVKKKDGSWRFCVDYRALNKVTVPDKFPIPMVDELLDELHGAIVFSKIDLKAGYHQIRLKAEDIPKSAFRTHEGHYEFLVMPFGLT
        VI+PS S +SSPVLLVKKKDGSWRFCVDYRA+N  T+PDKFPIP+V+EL DEL GA +FSKIDLK+GYHQIR+  ED+ K+AFRTHEGHYEFLVMPFGLT
Subjt:  VIQPSNSAFSSPVLLVKKKDGSWRFCVDYRALNKVTVPDKFPIPMVDELLDELHGAIVFSKIDLKAGYHQIRLKAEDIPKSAFRTHEGHYEFLVMPFGLT

Query:  NAPSTFQALMNKVLRPFLRKFALVFFDDILIYSSSLEEHAGHLAKIMQVLHENQLVANRKKCQFALKQIEYLGHIISVDGVSADPSKIAAMEQWPIPRNI
        NAP+TFQALMN + +PFLRKF LVFFDDIL+YS S EEH  H+ K++ +L +N+L AN+KKC FA K+IEYLGH+IS +GV+ DP KI ++  WP P N+
Subjt:  NAPSTFQALMNKVLRPFLRKFALVFFDDILIYSSSLEEHAGHLAKIMQVLHENQLVANRKKCQFALKQIEYLGHIISVDGVSADPSKIAAMEQWPIPRNI

Query:  KELRGFLGLTGYYRKFVQDYGLIAHPLTQQLKKDSFAWNDDATEAFFKLKSAMVTVPVLRLPDFSKEFVVEADASGVGVGAVLMQESRPIAYFSHALPPL
        KE+RGFLGLTGYYR+FVQ YG IA PLTQQLKK  F WN++A  AF +LKSAM+ +PVL LPDFSK+F +EADASG GVGAVL+Q+ RPIA++SH L   
Subjt:  KELRGFLGLTGYYRKFVQDYGLIAHPLTQQLKKDSFAWNDDATEAFFKLKSAMVTVPVLRLPDFSKEFVVEADASGVGVGAVLMQESRPIAYFSHALPPL

Query:  HRAKAVYERELMAIVMAIQKWRPYLLGRRFTVRTDQSSLKFILDQRLISGEYQKLMTKLLGYDFQVVYKPGVENTVADALSRLPSALEFAAVSLVGGLNT
         RA+ VYERELMA+V+A+Q+WRPYLL  +F V+TDQ +LKF+LDQR+I  +YQK + KLLGY F+VVYKPGV+N  ADALSR P  ++   +S+   ++ 
Subjt:  HRAKAVYERELMAIVMAIQKWRPYLLGRRFTVRTDQSSLKFILDQRLISGEYQKLMTKLLGYDFQVVYKPGVENTVADALSRLPSALEFAAVSLVGGLNT

Query:  SIVIEQQRLDPALAAIRSKITEGEEV-PVGYSLRGNVLYYRGKLVLPEDSPTIPLLLEAFHSSPVGGHGGTLKTFQRLAKEVYWKGMKSRVKLFVSECTV
         ++ ++   DP    I  ++ +GEE+    YS+R  +L Y+ +LV+ + S  IP++L+ FH+S VGGH G L+T++R A E++W+GMK+ +K    EC  
Subjt:  SIVIEQQRLDPALAAIRSKITEGEEV-PVGYSLRGNVLYYRGKLVLPEDSPTIPLLLEAFHSSPVGGHGGTLKTFQRLAKEVYWKGMKSRVKLFVSECTV

Query:  CQQAKYLALAPAGLLQPLPIPERIWEDISMDFVVGLPRAETFDSVFVVVDRLSKYAHFIPIRHPFTATGMAQTFIKEVVRLHGIPRSIVSDRDPVFTSMF
        CQ++K L+L+PAGLL PL IP+ +W DISMDFV GLP++  F+ V VVVDRLSKY HF+P++HP+TA  +A+ F+KE+VRLHG P SIVSDRD +F S F
Subjt:  CQQAKYLALAPAGLLQPLPIPERIWEDISMDFVVGLPRAETFDSVFVVVDRLSKYAHFIPIRHPFTATGMAQTFIKEVVRLHGIPRSIVSDRDPVFTSMF

Query:  WEELFRCQGTQLKRSTAYHPQTDGQTEVVNRGMETYLRCFAMHCPLKWVRWLCWAEYSYNTSFHSGLKASPFEVVYGRSPPSVLPYHSGESPVSEVDHQL
        W E+FR  GT+L +STAYHPQ+DGQTEVVNRG+ETYLRCF    P  W +WL W EY YNT+F   +  +PF+VVYGR PP++L Y S  S  S V+  L
Subjt:  WEELFRCQGTQLKRSTAYHPQTDGQTEVVNRGMETYLRCFAMHCPLKWVRWLCWAEYSYNTSFHSGLKASPFEVVYGRSPPSVLPYHSGESPVSEVDHQL

Query:  KERDRMLAMIRERLLDAQQYMARMVNEKRRDVEFDVGDWAYLKLRPYRQGTLSKTLNAKLAPRYVGPFQVEARIGAVAYRLILPEHVRVHPVFHVSQLRK
        +ERD +L  +RE LL AQ+ M R  ++KRR VE+ VG++ +L++RPYRQ ++    N KLAPR+ GP+++  RIG VAYRL LPE  ++HPVFHVSQLRK
Subjt:  KERDRMLAMIRERLLDAQQYMARMVNEKRRDVEFDVGDWAYLKLRPYRQGTLSKTLNAKLAPRYVGPFQVEARIGAVAYRLILPEHVRVHPVFHVSQLRK

Query:  ALGSSFPVVPFPQNVSSDMLFKVVPVELLGLRQSVEDSTKLEVLIRWAESEPSEATWEDAVQLATQFPDFHLEDKVALWGPGSDRPPIKYVYRRKK
         +G      P  Q V  + ++K  P E +  R++V  + + EVL+ W      EA+WE   ++  ++P FHLEDKV L G G  RP IK+VY R+K
Subjt:  ALGSSFPVVPFPQNVSSDMLFKVVPVELLGLRQSVEDSTKLEVLIRWAESEPSEATWEDAVQLATQFPDFHLEDKVALWGPGSDRPPIKYVYRRKK

PWA91318.1 Ty3/gypsy retrotransposon protein [Artemisia annua]0.0e+0051.04Show/hide
Query:  DLRLRKLEIPVFGGASPLEWLHQVERYFTVNRLTEAEKLEAAVLCLEGVALKWHYYENKLGSFSNWEDFRRLLLRRFLAPKEGTLHERFFALKQESTVAE
        D R+RKL++P+F G     W+++VERYF V  +   E+L AAVLC+EG AL W  +      F  WE  +R LL RF + +EGTL+E+F A+ QE +  E
Subjt:  DLRLRKLEIPVFGGASPLEWLHQVERYFTVNRLTEAEKLEAAVLCLEGVALKWHYYENKLGSFSNWEDFRRLLLRRFLAPKEGTLHERFFALKQESTVAE

Query:  YREKFEEFAAAVDNLDGDSLEGKFVDGLVEDIKVEMRICMPVGLRAKMEMAQKIEDKLLHLENKQTGVTLSPKVVGQSVSNYKPISYGAGASARTISVNP
        Y   FE  A  +  +    +EG F+ GL  +++  +R+  P GL   M++A  I+D      NK  G      VVG+  S+YKP +    AS        
Subjt:  YREKFEEFAAAVDNLDGDSLEGKFVDGLVEDIKVEMRICMPVGLRAKMEMAQKIEDKLLHLENKQTGVTLSPKVVGQSVSNYKPISYGAGASARTISVNP

Query:  SKLGERVRKETPGKSFEASSSSNPYKRLTESEYQNRRDRGLCFKCEEKFSPGHRCKKKELQVLVVHDGESDEAEGNDGVAMQTVEMADNKGEDGPKEHEA
        S+    +  +    +   +    P++RLT++E  ++R +G+CF+C+EKF+PGH C  K LQVL+V D E D+   ++ V + ++                
Subjt:  SKLGERVRKETPGKSFEASSSSNPYKRLTESEYQNRRDRGLCFKCEEKFSPGHRCKKKELQVLVVHDGESDEAEGNDGVAMQTVEMADNKGEDGPKEHEA

Query:  ELALNSMVGLGSPKTLKVKGLIRSREVVVLIDSGATHNFIATTLVKELQLPTGQTEGYGVVLGTEGSSIKGTEICREVVLTLGPITIMQDFLPLPLGSAD
        E++LNS++G  +P+T+KV+G +  R+VVVLID GATHNF++  LVK+L        GY   LG  G       +CR+VVLTL  + ++ DF PL LGS D
Subjt:  ELALNSMVGLGSPKTLKVKGLIRSREVVVLIDSGATHNFIATTLVKELQLPTGQTEGYGVVLGTEGSSIKGTEICREVVLTLGPITIMQDFLPLPLGSAD

Query:  VILGVQWLSTLGDVTTNHQNLLMKFMLGKSAVVLQGDPSLVRSQVTLKTLIKQLKLEKQGFLVELNALQGGRMEISGSEIEGWEQIPDCIRSVLTRHFSV
        +ILG++WL TLGD+T N + L M F+     V ++GD  L R+ V+LK++++ L+ EK+GFLVE+      +++ + +     E + D I  +L+ +  V
Subjt:  VILGVQWLSTLGDVTTNHQNLLMKFMLGKSAVVLQGDPSLVRSQVTLKTLIKQLKLEKQGFLVELNALQGGRMEISGSEIEGWEQIPDCIRSVLTRHFSV

Query:  FQPLPGLPPTRMDDHRIELLPGTAPVSVRPYRYPQCQKDEIEKLVEEMLRAGVIQPSNSAFSSPVLLVKKKDGSWRFCVDYRALNKVTVPDKFPIPMVDE
        F    GLPP R  +H I L  GT P+SVRPYRYP  QK+EIEKLV+EML AGVIQPS+S +SSPVLLVKKKDGSWRFCVDYRALNK TV DKFPIP++DE
Subjt:  FQPLPGLPPTRMDDHRIELLPGTAPVSVRPYRYPQCQKDEIEKLVEEMLRAGVIQPSNSAFSSPVLLVKKKDGSWRFCVDYRALNKVTVPDKFPIPMVDE

Query:  LLDELHGAIVFSKIDLKAGYHQIRLKAEDIPKSAFRTHEGHYEFLVMPFGLTNAPSTFQALMNKVLRPFLRKFALVFFDDILIYSSSLEEHAGHLAKIMQ
        LLDELHGA +FSK+DLK+GYHQIR+K  DI K+AFRTHEG YEFLVMPFGLTNAP+TFQ+LMNKV  P+LRKF LVFFDDIL++S + +EH  HL+ ++ 
Subjt:  LLDELHGAIVFSKIDLKAGYHQIRLKAEDIPKSAFRTHEGHYEFLVMPFGLTNAPSTFQALMNKVLRPFLRKFALVFFDDILIYSSSLEEHAGHLAKIMQ

Query:  VLHENQLVANRKKCQFALKQIEYLGHIISVDGVSADPSKIAAMEQWPIPRNIKELRGFLGLTGYYRKFVQDYGLIAHPLTQQLKKDSFAWNDDATEAFFK
         L +++L AN+KKC FA  +IEYLG+I++ +GV ADPSKIAAM  WPIP+NIKELRGFLGLTGYYRKFV+ YG IA  LT+QLKKD+F WN++AT AF  
Subjt:  VLHENQLVANRKKCQFALKQIEYLGHIISVDGVSADPSKIAAMEQWPIPRNIKELRGFLGLTGYYRKFVQDYGLIAHPLTQQLKKDSFAWNDDATEAFFK

Query:  LKSAMVTVPVLRLPDFSKEFVVEADASGVGVGAVLMQESRPIAYFSHALPPLHRAKAVYERELMAIVMAIQKWRPYLLGRRFTVRTDQSSLKFILDQRLI
        LK AM  VPVL LPDF+K F+VE DASG GVGAVL+QE +PIAYFS  L                                     DQ SLK++L+QRL+
Subjt:  LKSAMVTVPVLRLPDFSKEFVVEADASGVGVGAVLMQESRPIAYFSHALPPLHRAKAVYERELMAIVMAIQKWRPYLLGRRFTVRTDQSSLKFILDQRLI

Query:  SGEYQKLMTKLLGYDFQVVYKPGVENTVADALSRLPSALEFAAVSLVGGLNTSIVIEQQRLDPALAAIRSKITEGEEVPVGYSLRGNVLYYRGKLVLPED
         GEYQK ++KL+GYDF++ Y+PG +N+ ADALSR   ++E+ AVSL        +++    DP L  +R KI E +    GY+L    L YR +LVLP  
Subjt:  SGEYQKLMTKLLGYDFQVVYKPGVENTVADALSRLPSALEFAAVSLVGGLNTSIVIEQQRLDPALAAIRSKITEGEEVPVGYSLRGNVLYYRGKLVLPED

Query:  SPTIPLLLEAFHSSPVGGHGGTLKTFQRLAKEVYWKGMKSRVKLFVSECTVCQQAKYLALAPAGLLQPLPIPERIWEDISMDFVVGLPRAETFDSVFVVV
        S  IP L   FHSS  GGH GT+KT+ RLA E+YW GM+  +   VSEC VCQ+ KY  LAP+GLLQPL +PE++W++++MDF+ GLP +E F  + VVV
Subjt:  SPTIPLLLEAFHSSPVGGHGGTLKTFQRLAKEVYWKGMKSRVKLFVSECTVCQQAKYLALAPAGLLQPLPIPERIWEDISMDFVVGLPRAETFDSVFVVV

Query:  DRLSKYAHFIPIRHPFTATGMAQTFIKEVVRLHGIPRSIVSDRDPVFTSMFWEELFRCQGTQLKRSTAYHPQTDGQTEVVNRGMETYLRCFAMHCPLKWV
        DRLSKYAHF+P+RHP+TA  +A +F++EV+RLHG+P+++V+DRD VF S FW+E+F+ QGT LKRSTAYHPQTDGQTEVVNR +ETYLRCF+   P +W 
Subjt:  DRLSKYAHFIPIRHPFTATGMAQTFIKEVVRLHGIPRSIVSDRDPVFTSMFWEELFRCQGTQLKRSTAYHPQTDGQTEVVNRGMETYLRCFAMHCPLKWV

Query:  RWLCWAEYSYNTSFHSGLKASPFEVVYGRSPPSVLPYHSGESPVSEVDHQLKERDRMLAMIRERLLDAQQYMARMVNEKRRDVEFDVGDWAYLKLRPYRQ
        RWL WAEY YNTS+H+  KA+PF+++YGR PP ++PY +  +P  EVD  L+ERDR+L  ++  LL AQQ M    +  RRDV+F VGD  +LKLRPYRQ
Subjt:  RWLCWAEYSYNTSFHSGLKASPFEVVYGRSPPSVLPYHSGESPVSEVDHQLKERDRMLAMIRERLLDAQQYMARMVNEKRRDVEFDVGDWAYLKLRPYRQ

Query:  GTLSKTLNAKLAPRYVGPFQVEARIGAVAYRLILPEHVRVHPVFHVSQLRKALGSSFPVVPFPQNVSSDMLFKVVPVELLGLRQSVEDSTK-LEVLIRWA
         ++SK LN KLAPRY GPF+V  +IGAVAYRL LPE   +HPVFHVSQL+K +G       FP+ +S+DM   V P E+LG+R+   +S +  EVLIRW 
Subjt:  GTLSKTLNAKLAPRYVGPFQVEARIGAVAYRLILPEHVRVHPVFHVSQLRKALGSSFPVVPFPQNVSSDMLFKVVPVELLGLRQSVEDSTK-LEVLIRWA

Query:  ESEPSEATWEDAVQLATQFPDFHLEDKVALWGPGSD
             E+TWE    +  QFPDFHLEDKV LW  G D
Subjt:  ESEPSEATWEDAVQLATQFPDFHLEDKVALWGPGSD

XP_010680400.1 PREDICTED: uncharacterized protein LOC104895554 isoform X1 [Beta vulgaris subsp. vulgaris]0.0e+0048.24Show/hide
Query:  MATRMETRVENLE----GRMLELSKEMAGMRQEISGIRGSI--STEVEAAVSKSVGEAMKEGMQAILARIEGLKSTDQAVDNRKTKGKNVIGSEEVAVEV
        MA   + R+E LE    G    L++ MA  +     + G +  + E    +   + +  K+  + + A +  LK+T     N  ++ +    S  +   V
Subjt:  MATRMETRVENLE----GRMLELSKEMAGMRQEISGIRGSI--STEVEAAVSKSVGEAMKEGMQAILARIEGLKSTDQAVDNRKTKGKNVIGSEEVAVEV

Query:  GEGSNTVLQSHLTKSLAP--VFDLRLRKLEIPVFGGASPLEWLHQVERYFTVNRLTEAEKLEAAVLCLEGVALKWHYYENKLGSFSNWEDFRRLLLRRFL
        G+     L  H  ++  P    + R RKL++P+FGG++P  W+ + ERY+   RL E EKLEAAV+ LE  AL W+ +E++      W++ + LLLR+F 
Subjt:  GEGSNTVLQSHLTKSLAP--VFDLRLRKLEIPVFGGASPLEWLHQVERYFTVNRLTEAEKLEAAVLCLEGVALKWHYYENKLGSFSNWEDFRRLLLRRFL

Query:  APKEGTLHERFFALKQESTVAEYREKFEEFAAAVDNLDGDSLEGKFVDGLVEDIKVEMRICMPVGLRAKMEMAQKIEDKL---LHLENKQTGVTLSP---
           +G+L+E++  ++QE +V +Y+ +F E+AA ++N+    + G+F+ GL E+IK E+ +  P+ +   M++A K E K+    +L   +T  T++P   
Subjt:  APKEGTLHERFFALKQESTVAEYREKFEEFAAAVDNLDGDSLEGKFVDGLVEDIKVEMRICMPVGLRAKMEMAQKIEDKL---LHLENKQTGVTLSP---

Query:  ---KVVGQSVSNYKPISYGAGASARTISVNPSKLGER--VRKETPGK-SFEASSSSNPYKRLTESEYQNRRDRGLCFKCEEKFSPGHRCKKKELQVLVVH
             +  + +  KP S     +      NP+    +    K T  K S++   +  P +RLTE E Q RR+ GLCF+C++K+S GHRC+KKE+ VLV+ 
Subjt:  ---KVVGQSVSNYKPISYGAGASARTISVNPSKLGER--VRKETPGK-SFEASSSSNPYKRLTESEYQNRRDRGLCFKCEEKFSPGHRCKKKELQVLVVH

Query:  DGESD------EAEGNDGVAMQTVEMADNKGEDGPKEHEAELALNSMVGLGSPKTLKVKGLIRSREVVVLIDSGATHNFIATTLVKELQLPTGQTEGYGV
        +GE D      E E ND  A  + E+              EL+LNS+VGL SP+T+K+ G+I  +EVVV++D GATHNFI+   V++L +P      +GV
Subjt:  DGESD------EAEGNDGVAMQTVEMADNKGEDGPKEHEAELALNSMVGLGSPKTLKVKGLIRSREVVVLIDSGATHNFIATTLVKELQLPTGQTEGYGV

Query:  VLGTEGSSIKGTEICREVVLTLGPITIMQDFLPLPLGSADVILGVQWLSTLGDVTTNHQNLLMKFMLGKSAVVLQGDPSLVRSQVTLKTLIKQLKLEKQG
         LGT G+ +KG   C+ V+L +  + I ++FLPL LG++D+ILGVQWL  LG VTTN ++ LMKF +G+  V LQGDPSL R++++LK +++ L++E QG
Subjt:  VLGTEGSSIKGTEICREVVLTLGPITIMQDFLPLPLGSADVILGVQWLSTLGDVTTNHQNLLMKFMLGKSAVVLQGDPSLVRSQVTLKTLIKQLKLEKQG

Query:  FLVELNALQGGRMEISGSEIEGWEQIPDCIRSVLTRHFSVFQPLPGLPPTRMDDHRIELLPGTAPVSVRPYRYPQCQKDEIEKLVEEMLRAGVIQPSNSA
         LVE+N ++  +      +IE   ++P  ++ +L ++  VF    GLPP+R  +H I L  G+ PVSVRPYRYP  QK EIE+LV++ML AG+IQPS S 
Subjt:  FLVELNALQGGRMEISGSEIEGWEQIPDCIRSVLTRHFSVFQPLPGLPPTRMDDHRIELLPGTAPVSVRPYRYPQCQKDEIEKLVEEMLRAGVIQPSNSA

Query:  FSSPVLLVKKKDGSWRFCVDYRALNKVTVPDKFPIPMVDELLDELHGAIVFSKIDLKAGYHQIRLKAEDIPKSAFRTHEGHYEFLVMPFGLTNAPSTFQA
        FSSPVLLVKKKDGSWRFCVDYRALNK TVPDK+PIP++DELLDEL+G++VFSK+DLK+GYHQIR++ EDI K+AFRTHEGHYEFLVMPFGLTNAP+TFQ+
Subjt:  FSSPVLLVKKKDGSWRFCVDYRALNKVTVPDKFPIPMVDELLDELHGAIVFSKIDLKAGYHQIRLKAEDIPKSAFRTHEGHYEFLVMPFGLTNAPSTFQA

Query:  LMNKVLRPFLRKFALVFFDDILIYSSSLEEHAGHLAKIMQVLHENQLVANRKKCQFALKQIEYLGHIISVDGVSADPSKIAAMEQWPIPRNIKELRGFLG
        LMN+V RPFLRKF LVFFDDIL+YS   E H  HL +++ +L EN L AN +KC+F  +Q+ YLGH+IS  GV+AD  KI AM +WP+P+ I+ELRGFLG
Subjt:  LMNKVLRPFLRKFALVFFDDILIYSSSLEEHAGHLAKIMQVLHENQLVANRKKCQFALKQIEYLGHIISVDGVSADPSKIAAMEQWPIPRNIKELRGFLG

Query:  LTGYYRKFVQDYGLIAHPLTQQLKKDSFAWNDDATEAFFKLKSAMVTVPVLRLPDFSKEFVVEADASGVGVGAVLMQESRPIAYFSHALPPLHRAKAVYE
        LTGYYRKF+ +Y  +A PLT QL+KDS+AW   AT+AF  LK AMV  PVL +PDFS++FV+EADASG G+GAVLMQ +RPIA++SH L P  R K++YE
Subjt:  LTGYYRKFVQDYGLIAHPLTQQLKKDSFAWNDDATEAFFKLKSAMVTVPVLRLPDFSKEFVVEADASGVGVGAVLMQESRPIAYFSHALPPLHRAKAVYE

Query:  RELMAIVMAIQKWRPYLLGRRFTVRTDQSSLKFILDQRLISGEYQKLMTKLLGYDFQVVYKPGVENTVADALSRL-PSALEFAAVSLVGGLNTSIVIEQQ
        +ELMAIVMA+QKWR YLLGRRF +RTDQ SLKFI++QR +  EYQ+ ++KL+G++F++ YKPG+ N VADALSR  P+  E  A+    G +   V  Q 
Subjt:  RELMAIVMAIQKWRPYLLGRRFTVRTDQSSLKFILDQRLISGEYQKLMTKLLGYDFQVVYKPGVENTVADALSRL-PSALEFAAVSLVGGLNTSIVIEQQ

Query:  RLDPALAAIRSKITEGEEVPV-GYSLRGNVLYYRGKLVLPEDSPTIPLLLEAFHSSPVGGHGGTLKTFQRLAKEVYWKGMKSRVKLFVSECTVCQQAKYL
        + DP +  I +++ +G+  P+ G+S+   ++ Y+G++VLP  SP    LL+ +H SP GGH G LKT+ R+A E YW GM+  V  +V +C +CQQ K  
Subjt:  RLDPALAAIRSKITEGEEVPV-GYSLRGNVLYYRGKLVLPEDSPTIPLLLEAFHSSPVGGHGGTLKTFQRLAKEVYWKGMKSRVKLFVSECTVCQQAKYL

Query:  ALAPAGLLQPLPIPERIWEDISMDFVVGLPRAETFDSVFVVVDRLSKYAHFIPIRHPFTATGMAQTFIKEVVRLHGIPRSIVSDRDPVFTSMFWEELFRC
           PAGLLQPLP P ++WEDI+MDFV GLP +   D++ VVVDR +K+AHF+ ++HPFTA  +A TFIKE+VRLHG P SI+SDRD VF S+FW+ELFR 
Subjt:  ALAPAGLLQPLPIPERIWEDISMDFVVGLPRAETFDSVFVVVDRLSKYAHFIPIRHPFTATGMAQTFIKEVVRLHGIPRSIVSDRDPVFTSMFWEELFRC

Query:  QGTQLKRSTAYHPQTDGQTEVVNRGMETYLRCFAMHCPLKWVRWLCWAEYSYNTSFHSGLKASPFEVVYGRSPPSVLPYHSGESPVSEVDHQLKERDRML
        QGT+LKRSTAYHPQTDGQ+E VN+ +ETYLRCF    P KW  WL W E+ YNTS H   K +PF+ +YGR PP ++     ++PV  +D  L+ERD +L
Subjt:  QGTQLKRSTAYHPQTDGQTEVVNRGMETYLRCFAMHCPLKWVRWLCWAEYSYNTSFHSGLKASPFEVVYGRSPPSVLPYHSGESPVSEVDHQLKERDRML

Query:  AMIRERLLDAQQYMARMVNEKRRDVEFDVGDWAYLKLRPYRQGTLSKTLNAKLAPRYVGPFQVEARIGAVAYRLILPEHVRVHPVFHVSQLRKALGSSFP
          +R  LL AQQ M    +++RRD+  +VG + YLKL+PYRQ +L++    KLA RY GP+QV  RIGAVAYRL LP   ++HPVFHVSQL+ A G+   
Subjt:  AMIRERLLDAQQYMARMVNEKRRDVEFDVGDWAYLKLRPYRQGTLSKTLNAKLAPRYVGPFQVEARIGAVAYRLILPEHVRVHPVFHVSQLRKALGSSFP

Query:  VVPFPQNVSSDMLFKVVPVELLGLRQSVEDSTK-LEVLIRWAESEPSEATWEDAVQLATQFPDFHLEDKVALWGPG----SDRPPIKYVYRRKK
            P+ ++ D+   V P  LL +R       K LEVLI+W     +EATWED   +  +FP FHLEDKV LW  G    + +PP+K+VY R++
Subjt:  VVPFPQNVSSDMLFKVVPVELLGLRQSVEDSTK-LEVLIRWAESEPSEATWEDAVQLATQFPDFHLEDKVALWGPG----SDRPPIKYVYRRKK

TrEMBL top hitse value%identityAlignment
A0A2U1PZV9 Ty3/gypsy retrotransposon protein0.0e+0051.04Show/hide
Query:  DLRLRKLEIPVFGGASPLEWLHQVERYFTVNRLTEAEKLEAAVLCLEGVALKWHYYENKLGSFSNWEDFRRLLLRRFLAPKEGTLHERFFALKQESTVAE
        D R+RKL++P+F G     W+++VERYF V  +   E+L AAVLC+EG AL W  +      F  WE  +R LL RF + +EGTL+E+F A+ QE +  E
Subjt:  DLRLRKLEIPVFGGASPLEWLHQVERYFTVNRLTEAEKLEAAVLCLEGVALKWHYYENKLGSFSNWEDFRRLLLRRFLAPKEGTLHERFFALKQESTVAE

Query:  YREKFEEFAAAVDNLDGDSLEGKFVDGLVEDIKVEMRICMPVGLRAKMEMAQKIEDKLLHLENKQTGVTLSPKVVGQSVSNYKPISYGAGASARTISVNP
        Y   FE  A  +  +    +EG F+ GL  +++  +R+  P GL   M++A  I+D      NK  G      VVG+  S+YKP +    AS        
Subjt:  YREKFEEFAAAVDNLDGDSLEGKFVDGLVEDIKVEMRICMPVGLRAKMEMAQKIEDKLLHLENKQTGVTLSPKVVGQSVSNYKPISYGAGASARTISVNP

Query:  SKLGERVRKETPGKSFEASSSSNPYKRLTESEYQNRRDRGLCFKCEEKFSPGHRCKKKELQVLVVHDGESDEAEGNDGVAMQTVEMADNKGEDGPKEHEA
        S+    +  +    +   +    P++RLT++E  ++R +G+CF+C+EKF+PGH C  K LQVL+V D E D+   ++ V + ++                
Subjt:  SKLGERVRKETPGKSFEASSSSNPYKRLTESEYQNRRDRGLCFKCEEKFSPGHRCKKKELQVLVVHDGESDEAEGNDGVAMQTVEMADNKGEDGPKEHEA

Query:  ELALNSMVGLGSPKTLKVKGLIRSREVVVLIDSGATHNFIATTLVKELQLPTGQTEGYGVVLGTEGSSIKGTEICREVVLTLGPITIMQDFLPLPLGSAD
        E++LNS++G  +P+T+KV+G +  R+VVVLID GATHNF++  LVK+L        GY   LG  G       +CR+VVLTL  + ++ DF PL LGS D
Subjt:  ELALNSMVGLGSPKTLKVKGLIRSREVVVLIDSGATHNFIATTLVKELQLPTGQTEGYGVVLGTEGSSIKGTEICREVVLTLGPITIMQDFLPLPLGSAD

Query:  VILGVQWLSTLGDVTTNHQNLLMKFMLGKSAVVLQGDPSLVRSQVTLKTLIKQLKLEKQGFLVELNALQGGRMEISGSEIEGWEQIPDCIRSVLTRHFSV
        +ILG++WL TLGD+T N + L M F+     V ++GD  L R+ V+LK++++ L+ EK+GFLVE+      +++ + +     E + D I  +L+ +  V
Subjt:  VILGVQWLSTLGDVTTNHQNLLMKFMLGKSAVVLQGDPSLVRSQVTLKTLIKQLKLEKQGFLVELNALQGGRMEISGSEIEGWEQIPDCIRSVLTRHFSV

Query:  FQPLPGLPPTRMDDHRIELLPGTAPVSVRPYRYPQCQKDEIEKLVEEMLRAGVIQPSNSAFSSPVLLVKKKDGSWRFCVDYRALNKVTVPDKFPIPMVDE
        F    GLPP R  +H I L  GT P+SVRPYRYP  QK+EIEKLV+EML AGVIQPS+S +SSPVLLVKKKDGSWRFCVDYRALNK TV DKFPIP++DE
Subjt:  FQPLPGLPPTRMDDHRIELLPGTAPVSVRPYRYPQCQKDEIEKLVEEMLRAGVIQPSNSAFSSPVLLVKKKDGSWRFCVDYRALNKVTVPDKFPIPMVDE

Query:  LLDELHGAIVFSKIDLKAGYHQIRLKAEDIPKSAFRTHEGHYEFLVMPFGLTNAPSTFQALMNKVLRPFLRKFALVFFDDILIYSSSLEEHAGHLAKIMQ
        LLDELHGA +FSK+DLK+GYHQIR+K  DI K+AFRTHEG YEFLVMPFGLTNAP+TFQ+LMNKV  P+LRKF LVFFDDIL++S + +EH  HL+ ++ 
Subjt:  LLDELHGAIVFSKIDLKAGYHQIRLKAEDIPKSAFRTHEGHYEFLVMPFGLTNAPSTFQALMNKVLRPFLRKFALVFFDDILIYSSSLEEHAGHLAKIMQ

Query:  VLHENQLVANRKKCQFALKQIEYLGHIISVDGVSADPSKIAAMEQWPIPRNIKELRGFLGLTGYYRKFVQDYGLIAHPLTQQLKKDSFAWNDDATEAFFK
         L +++L AN+KKC FA  +IEYLG+I++ +GV ADPSKIAAM  WPIP+NIKELRGFLGLTGYYRKFV+ YG IA  LT+QLKKD+F WN++AT AF  
Subjt:  VLHENQLVANRKKCQFALKQIEYLGHIISVDGVSADPSKIAAMEQWPIPRNIKELRGFLGLTGYYRKFVQDYGLIAHPLTQQLKKDSFAWNDDATEAFFK

Query:  LKSAMVTVPVLRLPDFSKEFVVEADASGVGVGAVLMQESRPIAYFSHALPPLHRAKAVYERELMAIVMAIQKWRPYLLGRRFTVRTDQSSLKFILDQRLI
        LK AM  VPVL LPDF+K F+VE DASG GVGAVL+QE +PIAYFS  L                                     DQ SLK++L+QRL+
Subjt:  LKSAMVTVPVLRLPDFSKEFVVEADASGVGVGAVLMQESRPIAYFSHALPPLHRAKAVYERELMAIVMAIQKWRPYLLGRRFTVRTDQSSLKFILDQRLI

Query:  SGEYQKLMTKLLGYDFQVVYKPGVENTVADALSRLPSALEFAAVSLVGGLNTSIVIEQQRLDPALAAIRSKITEGEEVPVGYSLRGNVLYYRGKLVLPED
         GEYQK ++KL+GYDF++ Y+PG +N+ ADALSR   ++E+ AVSL        +++    DP L  +R KI E +    GY+L    L YR +LVLP  
Subjt:  SGEYQKLMTKLLGYDFQVVYKPGVENTVADALSRLPSALEFAAVSLVGGLNTSIVIEQQRLDPALAAIRSKITEGEEVPVGYSLRGNVLYYRGKLVLPED

Query:  SPTIPLLLEAFHSSPVGGHGGTLKTFQRLAKEVYWKGMKSRVKLFVSECTVCQQAKYLALAPAGLLQPLPIPERIWEDISMDFVVGLPRAETFDSVFVVV
        S  IP L   FHSS  GGH GT+KT+ RLA E+YW GM+  +   VSEC VCQ+ KY  LAP+GLLQPL +PE++W++++MDF+ GLP +E F  + VVV
Subjt:  SPTIPLLLEAFHSSPVGGHGGTLKTFQRLAKEVYWKGMKSRVKLFVSECTVCQQAKYLALAPAGLLQPLPIPERIWEDISMDFVVGLPRAETFDSVFVVV

Query:  DRLSKYAHFIPIRHPFTATGMAQTFIKEVVRLHGIPRSIVSDRDPVFTSMFWEELFRCQGTQLKRSTAYHPQTDGQTEVVNRGMETYLRCFAMHCPLKWV
        DRLSKYAHF+P+RHP+TA  +A +F++EV+RLHG+P+++V+DRD VF S FW+E+F+ QGT LKRSTAYHPQTDGQTEVVNR +ETYLRCF+   P +W 
Subjt:  DRLSKYAHFIPIRHPFTATGMAQTFIKEVVRLHGIPRSIVSDRDPVFTSMFWEELFRCQGTQLKRSTAYHPQTDGQTEVVNRGMETYLRCFAMHCPLKWV

Query:  RWLCWAEYSYNTSFHSGLKASPFEVVYGRSPPSVLPYHSGESPVSEVDHQLKERDRMLAMIRERLLDAQQYMARMVNEKRRDVEFDVGDWAYLKLRPYRQ
        RWL WAEY YNTS+H+  KA+PF+++YGR PP ++PY +  +P  EVD  L+ERDR+L  ++  LL AQQ M    +  RRDV+F VGD  +LKLRPYRQ
Subjt:  RWLCWAEYSYNTSFHSGLKASPFEVVYGRSPPSVLPYHSGESPVSEVDHQLKERDRMLAMIRERLLDAQQYMARMVNEKRRDVEFDVGDWAYLKLRPYRQ

Query:  GTLSKTLNAKLAPRYVGPFQVEARIGAVAYRLILPEHVRVHPVFHVSQLRKALGSSFPVVPFPQNVSSDMLFKVVPVELLGLRQSVEDSTK-LEVLIRWA
         ++SK LN KLAPRY GPF+V  +IGAVAYRL LPE   +HPVFHVSQL+K +G       FP+ +S+DM   V P E+LG+R+   +S +  EVLIRW 
Subjt:  GTLSKTLNAKLAPRYVGPFQVEARIGAVAYRLILPEHVRVHPVFHVSQLRKALGSSFPVVPFPQNVSSDMLFKVVPVELLGLRQSVEDSTK-LEVLIRWA

Query:  ESEPSEATWEDAVQLATQFPDFHLEDKVALWGPGSD
             E+TWE    +  QFPDFHLEDKV LW  G D
Subjt:  ESEPSEATWEDAVQLATQFPDFHLEDKVALWGPGSD

A0A5A7TAX4 Ty3/gypsy retrotransposon protein0.0e+0047.18Show/hide
Query:  METRVENLEGRMLELSKEMAGMRQEISGIRGSISTEVEAAVSKSVGEAMKEGMQAIL-----------ARIEGLKSTDQAVDNRKTKGKNVIGSEEVAVE
        ++TR+E    R+  + +E+AGM++E+S +     +  E A S  +     E  Q +L           + +  L++   A +  K KGK    S   A++
Subjt:  METRVENLEGRMLELSKEMAGMRQEISGIRGSISTEVEAAVSKSVGEAMKEGMQAIL-----------ARIEGLKSTDQAVDNRKTKGKNVIGSEEVAVE

Query:  VGEGSNTVLQSHLTKSLAPVFDL-RLRKLEIPVFGGASPLEWLHQVERYFTVNRLTEAEKLEAAVLCLEGVALKWHYYENKLGSFSNWEDFRRLLLRRFL
         G                   D  + +K+E+PVF G  P  WL + ERYF +++LT++EK+  + +  +G AL W   + +   F+NW + +  LL RF 
Subjt:  VGEGSNTVLQSHLTKSLAPVFDL-RLRKLEIPVFGGASPLEWLHQVERYFTVNRLTEAEKLEAAVLCLEGVALKWHYYENKLGSFSNWEDFRRLLLRRFL

Query:  APKEGTLHERFFALKQESTVAEYREKFEEFAAAVDNLDGDSLEGKFVDGLVEDIKVEMRICMPVGLRAKMEMAQKIEDK-LLHLENKQTG-----VTLSP
        + K+GTL  +F  +KQE TV EY   F++  A V++L    +   F++GL+  ++ E+  C P  L   ME AQ +E++ +  +E K +G     +T + 
Subjt:  APKEGTLHERFFALKQESTVAEYREKFEEFAAAVDNLDGDSLEGKFVDGLVEDIKVEMRICMPVGLRAKMEMAQKIEDK-LLHLENKQTG-----VTLSP

Query:  KVVGQSVSNYKPISYGAGAS---------ARTISVNPSKLGERVRKETPGKSFEASSSSNPYKRLTESEYQNRRDRGLCFKCEEKFSPGHRCK---KKEL
          VG++       S GA A           RTI++  S   E  R+ T             YKRL ++E+Q R+++GLCF+C EK+S  H+C+   ++EL
Subjt:  KVVGQSVSNYKPISYGAGAS---------ARTISVNPSKLGERVRKETPGKSFEASSSSNPYKRLTESEYQNRRDRGLCFKCEEKFSPGHRCK---KKEL

Query:  QVLVVHDGESD---EAEGNDGVAMQTVEMADNKGEDGPKEHEAELALNSMVGLGSPKTLKVKGLIRSREVVVLIDSGATHNFIATTLVKELQLPTGQTEG
        ++ VV D   +     E N+   +  +E+ ++           EL++NS+VGL  P T+KV+G +   EVVVLID GATHNF++  LVK+L LP  +T  
Subjt:  QVLVVHDGESD---EAEGNDGVAMQTVEMADNKGEDGPKEHEAELALNSMVGLGSPKTLKVKGLIRSREVVVLIDSGATHNFIATTLVKELQLPTGQTEG

Query:  YGVVLGTEGSSIKGTEICREVVLTLGPITIMQDFLPLPLGSADVILGVQWLSTLGDVTTNHQNLLMKFMLGKSAVVLQGDPSLVRSQVTLKTLIKQLKLE
        YGV+LG+ G++++G  +C ++ + +G   I++DFLPL LG  DVILG+QWL +LG  T + +NL M F+     V ++GDPSL +++++LK LIK  + +
Subjt:  YGVVLGTEGSSIKGTEICREVVLTLGPITIMQDFLPLPLGSADVILGVQWLSTLGDVTTNHQNLLMKFMLGKSAVVLQGDPSLVRSQVTLKTLIKQLKLE

Query:  KQGFLVELNALQGGRMEISGSEIEGWEQIPDC-----IRSVLTRHFSVFQPLPGLPPTRMDDHRIELLPGTAPVSVRPYRYPQCQKDEIEKLVEEMLRAG
          G+L+E  +LQ   +E +   +   E +        I SV+ +   VF     LPP R  +H I L  GT P++VRPYRY   QK+E+EKLV EML +G
Subjt:  KQGFLVELNALQGGRMEISGSEIEGWEQIPDC-----IRSVLTRHFSVFQPLPGLPPTRMDDHRIELLPGTAPVSVRPYRYPQCQKDEIEKLVEEMLRAG

Query:  VIQPSNSAFSSPVLLVKKKDGSWRFCVDYRALNKVTVPDKFPIPMVDELLDELHGAIVFSKIDLKAGYHQIRLKAEDIPKSAFRTHEGHYEFLVMPFGLT
        VI+PS S +SSPVLLVKKKDGSWRFCVDYRA+N  T+PDKFPIP+V+EL DEL GA +FSKIDLK+GYHQIR+  ED+ K+AFRTHEGHYEFLVMPFGLT
Subjt:  VIQPSNSAFSSPVLLVKKKDGSWRFCVDYRALNKVTVPDKFPIPMVDELLDELHGAIVFSKIDLKAGYHQIRLKAEDIPKSAFRTHEGHYEFLVMPFGLT

Query:  NAPSTFQALMNKVLRPFLRKFALVFFDDILIYSSSLEEHAGHLAKIMQVLHENQLVANRKKCQFALKQIEYLGHIISVDGVSADPSKIAAMEQWPIPRNI
        NAP+TFQALMN + +PFLRKF LVFFDDIL+YS S EEH  H+ K++ +L +N+L AN+KKC FA K+IEYLGH+IS +GV+ DP KI ++  WP P N+
Subjt:  NAPSTFQALMNKVLRPFLRKFALVFFDDILIYSSSLEEHAGHLAKIMQVLHENQLVANRKKCQFALKQIEYLGHIISVDGVSADPSKIAAMEQWPIPRNI

Query:  KELRGFLGLTGYYRKFVQDYGLIAHPLTQQLKKDSFAWNDDATEAFFKLKSAMVTVPVLRLPDFSKEFVVEADASGVGVGAVLMQESRPIAYFSHALPPL
        KE+RGFLGLTGYYR+FVQ YG IA PLTQQLKK  F WN++A  AF +LKSAM+ +PVL LPDFSK+F +EADASG GVGAVL+Q+ RPIA++SH L   
Subjt:  KELRGFLGLTGYYRKFVQDYGLIAHPLTQQLKKDSFAWNDDATEAFFKLKSAMVTVPVLRLPDFSKEFVVEADASGVGVGAVLMQESRPIAYFSHALPPL

Query:  HRAKAVYERELMAIVMAIQKWRPYLLGRRFTVRTDQSSLKFILDQRLISGEYQKLMTKLLGYDFQVVYKPGVENTVADALSRLPSALEFAAVSLVGGLNT
         RA+ VYERELMA+V+A+Q+WRPYLL  +F V+TDQ +LKF+LDQR+I  +YQK + KLLGY F+VVYKPGV+N  ADALSR P  ++   +S+   ++ 
Subjt:  HRAKAVYERELMAIVMAIQKWRPYLLGRRFTVRTDQSSLKFILDQRLISGEYQKLMTKLLGYDFQVVYKPGVENTVADALSRLPSALEFAAVSLVGGLNT

Query:  SIVIEQQRLDPALAAIRSKITEGEEV-PVGYSLRGNVLYYRGKLVLPEDSPTIPLLLEAFHSSPVGGHGGTLKTFQRLAKEVYWKGMKSRVKLFVSECTV
         ++ ++   DP    I  ++ +GEE+    YS+R  +L Y+ +LV+ + S  IP++L+ FH+S VGGH G L+T++R A E++W+GMK+ +K    EC  
Subjt:  SIVIEQQRLDPALAAIRSKITEGEEV-PVGYSLRGNVLYYRGKLVLPEDSPTIPLLLEAFHSSPVGGHGGTLKTFQRLAKEVYWKGMKSRVKLFVSECTV

Query:  CQQAKYLALAPAGLLQPLPIPERIWEDISMDFVVGLPRAETFDSVFVVVDRLSKYAHFIPIRHPFTATGMAQTFIKEVVRLHGIPRSIVSDRDPVFTSMF
        CQ++K L+L+PAGLL PL IP+ +W DISMDFV GLP++  F+ V VVVDRLSKY HF+P++HP+TA  +A+ F+KE+VRLHG P SIVSDRD +F S F
Subjt:  CQQAKYLALAPAGLLQPLPIPERIWEDISMDFVVGLPRAETFDSVFVVVDRLSKYAHFIPIRHPFTATGMAQTFIKEVVRLHGIPRSIVSDRDPVFTSMF

Query:  WEELFRCQGTQLKRSTAYHPQTDGQTEVVNRGMETYLRCFAMHCPLKWVRWLCWAEYSYNTSFHSGLKASPFEVVYGRSPPSVLPYHSGESPVSEVDHQL
        W E+FR  GT+L +STAYHPQ+DGQTEVVNRG+ETYLRCF    P  W +WL W EY YNT+F   +  +PF+VVYGR PP++L Y S  S  S V+  L
Subjt:  WEELFRCQGTQLKRSTAYHPQTDGQTEVVNRGMETYLRCFAMHCPLKWVRWLCWAEYSYNTSFHSGLKASPFEVVYGRSPPSVLPYHSGESPVSEVDHQL

Query:  KERDRMLAMIRERLLDAQQYMARMVNEKRRDVEFDVGDWAYLKLRPYRQGTLSKTLNAKLAPRYVGPFQVEARIGAVAYRLILPEHVRVHPVFHVSQLRK
        +ERD +L  +RE LL AQ+ M R  ++KRR VE+ VG++ +L++RPYRQ ++    N KLAPR+ GP+++  RIG VAYRL LPE  ++HPVFHVSQLRK
Subjt:  KERDRMLAMIRERLLDAQQYMARMVNEKRRDVEFDVGDWAYLKLRPYRQGTLSKTLNAKLAPRYVGPFQVEARIGAVAYRLILPEHVRVHPVFHVSQLRK

Query:  ALGSSFPVVPFPQNVSSDMLFKVVPVELLGLRQSVEDSTKLEVLIRWAESEPSEATWEDAVQLATQFPDFHLEDKVALWGPGSDRPPIKYVYRRKK
         +G      P  Q V  + ++K  P E +  R++V  + + EVL+ W      EA+WE   ++  ++P FHLEDKV L G G  RP IK+VY R+K
Subjt:  ALGSSFPVVPFPQNVSSDMLFKVVPVELLGLRQSVEDSTKLEVLIRWAESEPSEATWEDAVQLATQFPDFHLEDKVALWGPGSDRPPIKYVYRRKK

A0A5A7TET8 Ty3/gypsy retrotransposon protein0.0e+0047.57Show/hide
Query:  METRVENLEGRMLELSKEMAGMRQEISGIRGSISTEVEAAVSKSVGEAMKEGMQAILARI-----------EGLKSTDQAVDNRKTKGKNVIGSEEVAVE
        ++TR+E    R+  + +E+AGM++E+S +     +  E A S  +     E  Q +L  I            G+ +   A +  K KGK    S     E
Subjt:  METRVENLEGRMLELSKEMAGMRQEISGIRGSISTEVEAAVSKSVGEAMKEGMQAILARI-----------EGLKSTDQAVDNRKTKGKNVIGSEEVAVE

Query:  VGEGSNTVLQSHLTKSLAPVFDL-RLRKLEIPVFGGASPLEWLHQVERYFTVNRLTEAEKLEAAVLCLEGVALKWHYYENKLGSFSNWEDFRRLLLRRFL
                     +       D  + +K+E+PVF G  P  WL + ERYF +++LT++EK+  + +  +G AL W   + +   F++W + +  LL RF 
Subjt:  VGEGSNTVLQSHLTKSLAPVFDL-RLRKLEIPVFGGASPLEWLHQVERYFTVNRLTEAEKLEAAVLCLEGVALKWHYYENKLGSFSNWEDFRRLLLRRFL

Query:  APKEGTLHERFFALKQESTVAEYREKFEEFAAAVDNLDGDSLEGKFVDGLVEDIKVEMRICMPVGLRAKMEMAQKIEDK-LLHLENKQTGVTLSPKVVGQ
        + K+GT+  +F  +KQE TV EY   F++  A V++L    +   F++GL+  ++ E+  C P  L   ME+AQ +E++ +   E K  G +   K+ GQ
Subjt:  APKEGTLHERFFALKQESTVAEYREKFEEFAAAVDNLDGDSLEGKFVDGLVEDIKVEMRICMPVGLRAKMEMAQKIEDK-LLHLENKQTGVTLSPKVVGQ

Query:  SVSNYKPISYGAGASARTISVNPSKLGERVRKETPGKSFEASSSSNPYKRLTESEYQNRRDRGLCFKCEEKFSPGHRCKKK---ELQVLVVHDGE-----
        +    K  + G     +  +V P +    +R   P ++    S    YKRL ++E+Q R+++GLCF+C EK+S  HRC+ K   EL++ VV +G      
Subjt:  SVSNYKPISYGAGASARTISVNPSKLGERVRKETPGKSFEASSSSNPYKRLTESEYQNRRDRGLCFKCEEKFSPGHRCKKK---ELQVLVVHDGE-----

Query:  -SDEAEGNDGVAMQTVEMADNKGEDGPKEHEAELALNSMVGLGSPKTLKVKGLIRSREVVVLIDSGATHNFIATTLVKELQLPTGQTEGYGVVLGTEGSS
          DE E  +   M+  E               EL++NS+VGL  P T+KV+G +   EVV+LID GATHNF++  LV++L LP  +T  YGV+LG+ G++
Subjt:  -SDEAEGNDGVAMQTVEMADNKGEDGPKEHEAELALNSMVGLGSPKTLKVKGLIRSREVVVLIDSGATHNFIATTLVKELQLPTGQTEGYGVVLGTEGSS

Query:  IKGTEICREVVLTLGPITIMQDFLPLPLGSADVILGVQWLSTLGDVTTNHQNLLMKFMLGKSAVVLQGDPSLVRSQVTLKTLIKQLKLEKQGFLVELNAL
        ++G  +C ++ + L    I++DFLPL LG  DVILG+QWL +LG  T + +NL + FM     V ++GDPSL +++++LK +IK  + +  GFL+E  +L
Subjt:  IKGTEICREVVLTLGPITIMQDFLPLPLGSADVILGVQWLSTLGDVTTNHQNLLMKFMLGKSAVVLQGDPSLVRSQVTLKTLIKQLKLEKQGFLVELNAL

Query:  QGGRME-----ISGSEIEGWEQIPDCIRSVLTRHFSVFQPLPGLPPTRMDDHRIELLPGTAPVSVRPYRYPQCQKDEIEKLVEEMLRAGVIQPSNSAFSS
        Q   +E     +  +E+E   +    I SV+ +   VF+    LPP R  +H I L  GT P++VRPYRY   QK+E+EKLV+EML +GVI+PS S +SS
Subjt:  QGGRME-----ISGSEIEGWEQIPDCIRSVLTRHFSVFQPLPGLPPTRMDDHRIELLPGTAPVSVRPYRYPQCQKDEIEKLVEEMLRAGVIQPSNSAFSS

Query:  PVLLVKKKDGSWRFCVDYRALNKVTVPDKFPIPMVDELLDELHGAIVFSKIDLKAGYHQIRLKAEDIPKSAFRTHEGHYEFLVMPFGLTNAPSTFQALMN
        PVLLVKKKDGSWRFCVDYRA+N  T+PDKFPIP+V+EL DEL GA +FSKIDLK+GYHQIR+  EDI K+AFRTHEGHYEFLVMPFGLTNAP+TFQALMN
Subjt:  PVLLVKKKDGSWRFCVDYRALNKVTVPDKFPIPMVDELLDELHGAIVFSKIDLKAGYHQIRLKAEDIPKSAFRTHEGHYEFLVMPFGLTNAPSTFQALMN

Query:  KVLRPFLRKFALVFFDDILIYSSSLEEHAGHLAKIMQVLHENQLVANRKKCQFALKQIEYLGHIISVDGVSADPSKIAAMEQWPIPRNIKELRGFLGLTG
         + +PFLRKF LVFFDDIL+YS + EEH  H+ K++ VL  ++L AN+KKC FA ++IEYLGH+IS +GV+ DP KI A+  WP P N+KE RGFLGLTG
Subjt:  KVLRPFLRKFALVFFDDILIYSSSLEEHAGHLAKIMQVLHENQLVANRKKCQFALKQIEYLGHIISVDGVSADPSKIAAMEQWPIPRNIKELRGFLGLTG

Query:  YYRKFVQDYGLIAHPLTQQLKKDSFAWNDDATEAFFKLKSAMVTVPVLRLPDFSKEFVVEADASGVGVGAVLMQESRPIAYFSHALPPLHRAKAVYEREL
        YYR+FV+ YG IA PLTQ LKK  F W ++AT AF +LKSAM+++PVL LPDFSK+F +EADASG GVGAVL+Q+ RP+AY+SH L    R++ VYEREL
Subjt:  YYRKFVQDYGLIAHPLTQQLKKDSFAWNDDATEAFFKLKSAMVTVPVLRLPDFSKEFVVEADASGVGVGAVLMQESRPIAYFSHALPPLHRAKAVYEREL

Query:  MAIVMAIQKWRPYLLGRRFTVRTDQSSLKFILDQRLISGEYQKLMTKLLGYDFQVVYKPGVENTVADALSRLPSALEFAAVSLVGGLNTSIVIEQQRLDP
        MA+V+A+Q+WRPYLL  +F V+TDQ +LKF+LDQR+I  +YQK + KLLGY F+VVYKPGVEN  ADALSR P  ++   +S+   ++  IV ++   DP
Subjt:  MAIVMAIQKWRPYLLGRRFTVRTDQSSLKFILDQRLISGEYQKLMTKLLGYDFQVVYKPGVENTVADALSRLPSALEFAAVSLVGGLNTSIVIEQQRLDP

Query:  ALAAIRSKITEGEEVPVG-YSLRGNVLYYRGKLVLPEDSPTIPLLLEAFHSSPVGGHGGTLKTFQRLAKEVYWKGMKSRVKLFVSECTVCQQAKYLALAP
            I  +I +GEE+ V  YSL+  +L ++ +LV+ + S  IP++L+ FH+S +GGH G L+T++R+A E+YW GMK+ +K    EC VCQ++K LAL+P
Subjt:  ALAAIRSKITEGEEVPVG-YSLRGNVLYYRGKLVLPEDSPTIPLLLEAFHSSPVGGHGGTLKTFQRLAKEVYWKGMKSRVKLFVSECTVCQQAKYLALAP

Query:  AGLLQPLPIPERIWEDISMDFVVGLPRAETFDSVFVVVDRLSKYAHFIPIRHPFTATGMAQTFIKEVVRLHGIPRSIVSDRDPVFTSMFWEELFRCQGTQ
        AGLL PL IP+ IW DISMDFV GLP++  ++ + VVVDRLSKY HF+P++HP+TA  +A+ F+KE+VRLHG P SIVSDRD VF S FW ELFR  GT+
Subjt:  AGLLQPLPIPERIWEDISMDFVVGLPRAETFDSVFVVVDRLSKYAHFIPIRHPFTATGMAQTFIKEVVRLHGIPRSIVSDRDPVFTSMFWEELFRCQGTQ

Query:  LKRSTAYHPQTDGQTEVVNRGMETYLRCFAMHCPLKWVRWLCWAEYSYNTSFHSGLKASPFEVVYGRSPPSVLPYHSGESPVSEVDHQLKERDRMLAMIR
        L +STAYHPQ+DGQTEVVNRG+ETYLRCF    P +WVRWL W EY YNT+FH  +  +PF+VVYGR PP++L Y +  S  S V+  L ERD +L  +R
Subjt:  LKRSTAYHPQTDGQTEVVNRGMETYLRCFAMHCPLKWVRWLCWAEYSYNTSFHSGLKASPFEVVYGRSPPSVLPYHSGESPVSEVDHQLKERDRMLAMIR

Query:  ERLLDAQQYMARMVNEKRRDVEFDVGDWAYLKLRPYRQGTLSKTLNAKLAPRYVGPFQVEARIGAVAYRLILPEHVRVHPVFHVSQLRKALGSSFPVVPF
        E L  AQ+ M    ++KRR VEF VG++ +L++RPYRQ T+    N KLAPR+ GP+ +  RIG VAYRL LPE+ R+HPVFHVSQLRK +G      P 
Subjt:  ERLLDAQQYMARMVNEKRRDVEFDVGDWAYLKLRPYRQGTLSKTLNAKLAPRYVGPFQVEARIGAVAYRLILPEHVRVHPVFHVSQLRKALGSSFPVVPF

Query:  PQNVSSDMLFKVVPVELLGLRQSVEDSTKLEVLIRWAESEPSEATWEDAVQLATQFPDFHLEDKVALWGPGSDRPPIKYVYRRKK
         Q +  + ++K  P E +  R+    + + EVL+ W      EA+WE   ++  ++P+FHLEDKV L G  + RP IK+VY R+K
Subjt:  PQNVSSDMLFKVVPVELLGLRQSVEDSTKLEVLIRWAESEPSEATWEDAVQLATQFPDFHLEDKVALWGPGSDRPPIKYVYRRKK

A0A5D3DB20 Ty3/gypsy retrotransposon protein0.0e+0047.21Show/hide
Query:  METRVENLEGRMLELSKEMAGMRQEISGIRGSISTEVEAAVSKSVGEAMKEGMQAIL-----------ARIEGLKSTDQAVDNRKTKGKNVIGSEEVAVE
        ++TR+E    R+  + +E+AGM++E+S +     +  E A S  +     E  Q +L           + +  L++   A +  K KGK    S   A++
Subjt:  METRVENLEGRMLELSKEMAGMRQEISGIRGSISTEVEAAVSKSVGEAMKEGMQAIL-----------ARIEGLKSTDQAVDNRKTKGKNVIGSEEVAVE

Query:  VGEGSNTVLQSHLTKSLAPVFDL-RLRKLEIPVFGGASPLEWLHQVERYFTVNRLTEAEKLEAAVLCLEGVALKWHYYENKLGSFSNWEDFRRLLLRRFL
         G                   D  + +K+E+PVF G  P  WL + ERYF +++LT++EK+  + +  +G AL W   + +   F++W + +  LL RF 
Subjt:  VGEGSNTVLQSHLTKSLAPVFDL-RLRKLEIPVFGGASPLEWLHQVERYFTVNRLTEAEKLEAAVLCLEGVALKWHYYENKLGSFSNWEDFRRLLLRRFL

Query:  APKEGTLHERFFALKQESTVAEYREKFEEFAAAVDNLDGDSLEGKFVDGLVEDIKVEMRICMPVGLRAKMEMAQKIEDK-LLHLENKQTG-----VTLSP
        + K+GTL  +F  +KQE TV EY   F++  A V++L    +   F++GL+  ++ E+  C P  L   ME AQ +E++ +   E K +G     +T + 
Subjt:  APKEGTLHERFFALKQESTVAEYREKFEEFAAAVDNLDGDSLEGKFVDGLVEDIKVEMRICMPVGLRAKMEMAQKIEDK-LLHLENKQTG-----VTLSP

Query:  KVVGQSVSNYKPISYGAGAS---------ARTISVNPSKLGERVRKETPGKSFEASSSSNPYKRLTESEYQNRRDRGLCFKCEEKFSPGHRCK---KKEL
         VVG++       S GA A           RTI++  S   E  R+ T             YKRL ++E+Q R+++GLCF+C EK+S  H+C+   ++EL
Subjt:  KVVGQSVSNYKPISYGAGAS---------ARTISVNPSKLGERVRKETPGKSFEASSSSNPYKRLTESEYQNRRDRGLCFKCEEKFSPGHRCK---KKEL

Query:  QVLVVHDGESDE----AEGNDGVAMQTVEMADNKGEDGPKEHEAELALNSMVGLGSPKTLKVKGLIRSREVVVLIDSGATHNFIATTLVKELQLPTGQTE
        ++ VV   E DE     E N+   +  +E+ ++           EL++NS+VGL  P T+KV+G +   EVV+LID GATHNF++  LVK+L LP  +T 
Subjt:  QVLVVHDGESDE----AEGNDGVAMQTVEMADNKGEDGPKEHEAELALNSMVGLGSPKTLKVKGLIRSREVVVLIDSGATHNFIATTLVKELQLPTGQTE

Query:  GYGVVLGTEGSSIKGTEICREVVLTLGPITIMQDFLPLPLGSADVILGVQWLSTLGDVTTNHQNLLMKFMLGKSAVVLQGDPSLVRSQVTLKTLIKQLKL
         YGV+LG+ G++++G  +C ++ + +G   I++DFLPL LG  DVILG+QWL +LG  T + +NL M F+     V ++GDPSL +++++LK LIK  + 
Subjt:  GYGVVLGTEGSSIKGTEICREVVLTLGPITIMQDFLPLPLGSADVILGVQWLSTLGDVTTNHQNLLMKFMLGKSAVVLQGDPSLVRSQVTLKTLIKQLKL

Query:  EKQGFLVELNALQGGRMEISGSEIEGWEQIPDC-----IRSVLTRHFSVFQPLPGLPPTRMDDHRIELLPGTAPVSVRPYRYPQCQKDEIEKLVEEMLRA
        +  G+L+E  +LQ   +E +   +   E +        I SV+ +   VF+    LPP R  +H I L  GT P++VRPYRY   QK+E+EKLV+EML +
Subjt:  EKQGFLVELNALQGGRMEISGSEIEGWEQIPDC-----IRSVLTRHFSVFQPLPGLPPTRMDDHRIELLPGTAPVSVRPYRYPQCQKDEIEKLVEEMLRA

Query:  GVIQPSNSAFSSPVLLVKKKDGSWRFCVDYRALNKVTVPDKFPIPMVDELLDELHGAIVFSKIDLKAGYHQIRLKAEDIPKSAFRTHEGHYEFLVMPFGL
        GVI+PS S +SSPVLLVKKKDGSWRFCVDYRA+N  T+PDKFPIP+V+EL DEL GA +FSKIDLK+GYHQIR+  ED+ K+AFRTHEGHYEFLVMPFGL
Subjt:  GVIQPSNSAFSSPVLLVKKKDGSWRFCVDYRALNKVTVPDKFPIPMVDELLDELHGAIVFSKIDLKAGYHQIRLKAEDIPKSAFRTHEGHYEFLVMPFGL

Query:  TNAPSTFQALMNKVLRPFLRKFALVFFDDILIYSSSLEEHAGHLAKIMQVLHENQLVANRKKCQFALKQIEYLGHIISVDGVSADPSKIAAMEQWPIPRN
        TNAP+TFQALMN + +PFLRKF LVFFDDIL+YS S EEH  H+ K++ +L +N+L AN+KKC FA K+IEYLGH+IS +GV+ DP KI ++  WP P N
Subjt:  TNAPSTFQALMNKVLRPFLRKFALVFFDDILIYSSSLEEHAGHLAKIMQVLHENQLVANRKKCQFALKQIEYLGHIISVDGVSADPSKIAAMEQWPIPRN

Query:  IKELRGFLGLTGYYRKFVQDYGLIAHPLTQQLKKDSFAWNDDATEAFFKLKSAMVTVPVLRLPDFSKEFVVEADASGVGVGAVLMQESRPIAYFSHALPP
        +KE+RGFLGLTGYYR+FVQ YG IA PLTQQLKK  F WN++A  AF +LKSAM+ +PVL LPDFSK+F +EADASG GVGAVL+Q+ RPIA++SH L  
Subjt:  IKELRGFLGLTGYYRKFVQDYGLIAHPLTQQLKKDSFAWNDDATEAFFKLKSAMVTVPVLRLPDFSKEFVVEADASGVGVGAVLMQESRPIAYFSHALPP

Query:  LHRAKAVYERELMAIVMAIQKWRPYLLGRRFTVRTDQSSLKFILDQRLISGEYQKLMTKLLGYDFQVVYKPGVENTVADALSRLPSALEFAAVSLVGGLN
          RA+ VYERELMA+V+A+Q+WRPYLL  +F V+TDQ +LKF+LDQR+I  +YQK + KLLGY F+VVYKPGV+N  ADALSR P  ++   +S+   ++
Subjt:  LHRAKAVYERELMAIVMAIQKWRPYLLGRRFTVRTDQSSLKFILDQRLISGEYQKLMTKLLGYDFQVVYKPGVENTVADALSRLPSALEFAAVSLVGGLN

Query:  TSIVIEQQRLDPALAAIRSKITEGEEV-PVGYSLRGNVLYYRGKLVLPEDSPTIPLLLEAFHSSPVGGHGGTLKTFQRLAKEVYWKGMKSRVKLFVSECT
          ++ ++   DP    I  ++ +GEE+    YS+R  +L Y+ +LV+ + S  IP++L+ FH+S VGGH G L+T++R A E++W+GMK+ +K    EC 
Subjt:  TSIVIEQQRLDPALAAIRSKITEGEEV-PVGYSLRGNVLYYRGKLVLPEDSPTIPLLLEAFHSSPVGGHGGTLKTFQRLAKEVYWKGMKSRVKLFVSECT

Query:  VCQQAKYLALAPAGLLQPLPIPERIWEDISMDFVVGLPRAETFDSVFVVVDRLSKYAHFIPIRHPFTATGMAQTFIKEVVRLHGIPRSIVSDRDPVFTSM
         CQ++K L+L+PAGLL PL IP+ +W DISMDFV GLP++  F+ V VVVDRLSKY HF+P++HP+TA  +A+ F+KE+VRLHG P SIVSDRD +F S 
Subjt:  VCQQAKYLALAPAGLLQPLPIPERIWEDISMDFVVGLPRAETFDSVFVVVDRLSKYAHFIPIRHPFTATGMAQTFIKEVVRLHGIPRSIVSDRDPVFTSM

Query:  FWEELFRCQGTQLKRSTAYHPQTDGQTEVVNRGMETYLRCFAMHCPLKWVRWLCWAEYSYNTSFHSGLKASPFEVVYGRSPPSVLPYHSGESPVSEVDHQ
        FW E+FR  GT+L +STAYHPQ+DGQTEVVNRG+ETYLRCF    P  W +WL W EY YNT+F   +  +PF+VVYGR PP++L Y S  S  S V+  
Subjt:  FWEELFRCQGTQLKRSTAYHPQTDGQTEVVNRGMETYLRCFAMHCPLKWVRWLCWAEYSYNTSFHSGLKASPFEVVYGRSPPSVLPYHSGESPVSEVDHQ

Query:  LKERDRMLAMIRERLLDAQQYMARMVNEKRRDVEFDVGDWAYLKLRPYRQGTLSKTLNAKLAPRYVGPFQVEARIGAVAYRLILPEHVRVHPVFHVSQLR
        L+ERD +L  +RE LL AQ+ M R  ++KRR VE+ VG++ +L +RPYRQ ++    N KLAPR+ GP+++  RIG VAYRL LPE  ++HPVFHVSQLR
Subjt:  LKERDRMLAMIRERLLDAQQYMARMVNEKRRDVEFDVGDWAYLKLRPYRQGTLSKTLNAKLAPRYVGPFQVEARIGAVAYRLILPEHVRVHPVFHVSQLR

Query:  KALGSSFPVVPFPQNVSSDMLFKVVPVELLGLRQSVEDSTKLEVLIRWAESEPSEATWEDAVQLATQFPDFHLEDKVALWGPGSDRPPIKYVYRRKK
        K +G      P  Q V  + ++K  P E +  R++V  + + EVL+ W      EA+WE   ++  ++P FHLEDKV L G G  RP IK+VY R+K
Subjt:  KALGSSFPVVPFPQNVSSDMLFKVVPVELLGLRQSVEDSTKLEVLIRWAESEPSEATWEDAVQLATQFPDFHLEDKVALWGPGSDRPPIKYVYRRKK

J3SDF5 Ty3/gypsy retrotransposon protein0.0e+0048.88Show/hide
Query:  LEGRMLELSKEMAGMRQEISGIRGSISTEVEAAVSKSVGEAMKEG---------MQAILARIEGLKSTDQAVDNRKTKGKNVIGSEEVAVEVGEGSNTVL
        LEGR+    + + GM       +    TEV +A++K   E  K G         +   L   EGL+  ++A+D R        G  EV    G G     
Subjt:  LEGRMLELSKEMAGMRQEISGIRGSISTEVEAAVSKSVGEAMKEG---------MQAILARIEGLKSTDQAVDNRKTKGKNVIGSEEVAVEVGEGSNTVL

Query:  QSHLTKSLAPVFDLRLRKLEIPVFGGASPLEWLHQVERYFTVNRLTEAEKLEAAVLCLEGVALKWHYYENKLGSFSNWEDFRRLLLRRFLAPKEGTLHER
        ++       P  + R +KL++P F    P  W+ + ER+F    LT+AEK+EAAV+ +EG AL+W+ +ENK   F NWE  +  +L +F     G+LHE+
Subjt:  QSHLTKSLAPVFDLRLRKLEIPVFGGASPLEWLHQVERYFTVNRLTEAEKLEAAVLCLEGVALKWHYYENKLGSFSNWEDFRRLLLRRFLAPKEGTLHER

Query:  FFALKQESTVAEYREKFEEFAAAVDNLDGDSLEGKFVDGLVEDIKVEMRICMPVGLRAKMEMAQKIEDKLLHLENKQTGVTLSPKVVGQSVSNYKPISY-
        + +  Q ++V EYR KF E AA +D +  + L GKF+ GL  +++ E+R+  P  L   ME+A K+E++      ++TG    P+    S+ N  P S  
Subjt:  FFALKQESTVAEYREKFEEFAAAVDNLDGDSLEGKFVDGLVEDIKVEMRICMPVGLRAKMEMAQKIEDKLLHLENKQTGVTLSPKVVGQSVSNYKPISY-

Query:  ----------GAGASARTISVNPSKLGERVRKETPGKSFEASSSSNPYKRLTESEYQNRRDRGLCFKCEEKFSPGHRCKKKELQVLVVHDGESDEAEGND
                  G+ AS ++ ++N +     V    P     +S      +RLTE E Q +R +GLCFKC+EK+  GH+C++KEL VL + D E DE EG  
Subjt:  ----------GAGASARTISVNPSKLGERVRKETPGKSFEASSSSNPYKRLTESEYQNRRDRGLCFKCEEKFSPGHRCKKKELQVLVVHDGESDEAEGND

Query:  GVAMQTVEMADNKGEDGPKEHEAELALNSMVGLGSPKTLKVKGLIRSREVVVLIDSGATHNFIATTLVKELQLPTGQTEGYGVVLGTEGSSIKGTEICRE
          A+   E   +  E+ P     E++LNS++GL +PKT+K+ GLI + EVVV+ID GATHNF++   + +L +P  ++E +GV LG +G +++GT ICR 
Subjt:  GVAMQTVEMADNKGEDGPKEHEAELALNSMVGLGSPKTLKVKGLIRSREVVVLIDSGATHNFIATTLVKELQLPTGQTEGYGVVLGTEGSSIKGTEICRE

Query:  VVLTL-GPITIMQDFLPLPLGSADVILGVQWLSTLGDVTTNHQNLLMKFMLGKSAVVLQGDPSLVRSQVTLKTLIKQLKLEKQGFLVELNALQ-GGRMEI
        V L L G + +++DFLPL LG++DVILGVQWL TLG V +N +   M F LG     L GDP+L RS+V+LK +++ L+ E  G  +E N ++ GG   I
Subjt:  VVLTL-GPITIMQDFLPLPLGSADVILGVQWLSTLGDVTTNHQNLLMKFMLGKSAVVLQGDPSLVRSQVTLKTLIKQLKLEKQGFLVELNALQ-GGRMEI

Query:  SGSEIEGWEQIPDCIRSVLTRHFSVFQPLPGLPPTRMDDHRIELLPGTAPVSVRPYRYPQCQKDEIEKLVEEMLRAGVIQPSNSAFSSPVLLVKKKDGSW
          S++E  ++IP  ++ ++ R   VF+   GLPP R  +H I L  G+ PV VRPYRYPQ QKDEIE+L++EML AG+IQPS S FSSPV+LVKKKDGSW
Subjt:  SGSEIEGWEQIPDCIRSVLTRHFSVFQPLPGLPPTRMDDHRIELLPGTAPVSVRPYRYPQCQKDEIEKLVEEMLRAGVIQPSNSAFSSPVLLVKKKDGSW

Query:  RFCVDYRALNKVTVPDKFPIPMVDELLDELHGAIVFSKIDLKAGYHQIRLKAEDIPKSAFRTHEGHYEFLVMPFGLTNAPSTFQALMNKVLRPFLRKFAL
        RFCVDYRALNK TVPDK+PIP++DELLDELHGA VFSK+DL+AGYHQI ++ ED  K+AFRTHEGHYEFLVMPFGLTNAP+TFQ+LMN+V RPFLR+F L
Subjt:  RFCVDYRALNKVTVPDKFPIPMVDELLDELHGAIVFSKIDLKAGYHQIRLKAEDIPKSAFRTHEGHYEFLVMPFGLTNAPSTFQALMNKVLRPFLRKFAL

Query:  VFFDDILIYSSSLEEHAGHLAKIMQVLHENQLVANRKKCQFALKQIEYLGHIISVDGVSADPSKIAAMEQWPIPRNIKELRGFLGLTGYYRKFVQDYGLI
        VF DDILIYS S EEH GHL  ++ +L ++ L  N+KKC+F  +++ YLGH+IS  GV+ D  K+ A+ +W +P+N++ELRGFLGLTGYYRKFV +Y  I
Subjt:  VFFDDILIYSSSLEEHAGHLAKIMQVLHENQLVANRKKCQFALKQIEYLGHIISVDGVSADPSKIAAMEQWPIPRNIKELRGFLGLTGYYRKFVQDYGLI

Query:  AHPLTQQLKKDSFAWNDDATEAFFKLKSAMVTVPVLRLPDFSKEFVVEADASGVGVGAVLMQESRPIAYFSHALPPLHRAKAVYERELMAIVMAIQKWRP
        A PLT+QLKKD+F W+  ATEAF +LKSAMV+ PVL +P+F   FVVE DASG G+GAVLMQ++RPIAY+S  L    + K+VYE+ELMAI  A+QKW+ 
Subjt:  AHPLTQQLKKDSFAWNDDATEAFFKLKSAMVTVPVLRLPDFSKEFVVEADASGVGVGAVLMQESRPIAYFSHALPPLHRAKAVYERELMAIVMAIQKWRP

Query:  YLLGRRFTVRTDQSSLKFILDQRLISGEYQKLMTKLLGYDFQVVYKPGVENTVADALSR-LPSALEFAAVSLVGGLNTSIVIEQQRLDPALAAIRSKITE
        YLLGR F VRTDQ SL++I  QR I  E+QK ++KL+GYDF++ YKPG+ N VADALSR     +E  A+  V G+  + +  +   D  L  +R ++ E
Subjt:  YLLGRRFTVRTDQSSLKFILDQRLISGEYQKLMTKLLGYDFQVVYKPGVENTVADALSR-LPSALEFAAVSLVGGLNTSIVIEQQRLDPALAAIRSKITE

Query:  GEEVPVGYSLRGNVLYYRGKLVLPEDSPTIPLLLEAFHSSPVGGHGGTLKTFQRLAKEVYWKGMKSRVKLFVSECTVCQQAKYLALAPAGLLQPLPIPER
        G   P  ++L    L ++G+ V+P  S  IP LL  +H +P+GGH G LKT+ RLA E YW+GM+  V  +V +C +CQQ K     P GLLQPLPIP  
Subjt:  GEEVPVGYSLRGNVLYYRGKLVLPEDSPTIPLLLEAFHSSPVGGHGGTLKTFQRLAKEVYWKGMKSRVKLFVSECTVCQQAKYLALAPAGLLQPLPIPER

Query:  IWEDISMDFVVGLPRAETFDSVFVVVDRLSKYAHFIPIRHPFTATGMAQTFIKEVVRLHGIPRSIVSDRDPVFTSMFWEELFRCQGTQLKRSTAYHPQTD
        +WEDISMDF+ GLP ++  D++ V+VDRLSKYAHF+ +RHPFTA  +A  F+KEVVRLHG P SIVSDRD +F S+FW+ELFR  GT LKRS+AYHPQTD
Subjt:  IWEDISMDFVVGLPRAETFDSVFVVVDRLSKYAHFIPIRHPFTATGMAQTFIKEVVRLHGIPRSIVSDRDPVFTSMFWEELFRCQGTQLKRSTAYHPQTD

Query:  GQTEVVNRGMETYLRCFAMHCPLKWVRWLCWAEYSYNTSFHSGLKASPFEVVYGRSPPSVLPYHSGESPVSEVDHQLKERDRMLAMIRERLLDAQQYMAR
        GQTE+VNR +ETYLRCF    P  W +WL WAE+SYNTS H+  K SPF+V+YGR PP V+    G++ V  ++  L++RD ++  ++  L+ AQQ M  
Subjt:  GQTEVVNRGMETYLRCFAMHCPLKWVRWLCWAEYSYNTSFHSGLKASPFEVVYGRSPPSVLPYHSGESPVSEVDHQLKERDRMLAMIRERLLDAQQYMAR

Query:  MVNEKRRDVEFDVGDWAYLKLRPYRQGTLSKTLNAKLAPRYVGPFQVEARIGAVAYRLILPEHVRVHPVFHVSQLRKALGSSFPVVP-FPQNVSSDMLFK
          +  R +VEF VGD  +L+L+PYRQ +L+K    KLAPR+ GPF V  RIGA AY+L LP   ++HPVFHVS L+K +G++ PV+P  P ++  DM   
Subjt:  MVNEKRRDVEFDVGDWAYLKLRPYRQGTLSKTLNAKLAPRYVGPFQVEARIGAVAYRLILPEHVRVHPVFHVSQLRKALGSSFPVVP-FPQNVSSDMLFK

Query:  VVPVELLGLRQSVE-DSTKLEVLIRWAESEPSEATWEDAVQLATQFPDFHLEDKVALWGPG
        V P ELL +RQ  +   T  E LI+W      EATWED   +  +FP FHLEDKV +WG G
Subjt:  VVPVELLGLRQSVE-DSTKLEVLIRWAESEPSEATWEDAVQLATQFPDFHLEDKVALWGPG

SwissProt top hitse value%identityAlignment
P0CT34 Transposon Tf2-1 polyprotein2.7e-13332.42Show/hide
Query:  QPLPGLPPTRMDDHRIELLPGTAPVSVRPYRYPQCQKDEIEKLVEEMLRAGVIQPSNSAFSSPVLLVKKKDGSWRFCVDYRALNKVTVPDKFPIPMVDEL
        +P+ GL      +  +EL      + +R Y  P  +   +   + + L++G+I+ S +  + PV+ V KK+G+ R  VDY+ LNK   P+ +P+P++++L
Subjt:  QPLPGLPPTRMDDHRIELLPGTAPVSVRPYRYPQCQKDEIEKLVEEMLRAGVIQPSNSAFSSPVLLVKKKDGSWRFCVDYRALNKVTVPDKFPIPMVDEL

Query:  LDELHGAIVFSKIDLKAGYHQIRLKAEDIPKSAFRTHEGHYEFLVMPFGLTNAPSTFQALMNKVLRPFLRKFALVFFDDILIYSSSLEEHAGHLAKIMQV
        L ++ G+ +F+K+DLK+ YH IR++  D  K AFR   G +E+LVMP+G++ AP+ FQ  +N +L        + + DDILI+S S  EH  H+  ++Q 
Subjt:  LDELHGAIVFSKIDLKAGYHQIRLKAEDIPKSAFRTHEGHYEFLVMPFGLTNAPSTFQALMNKVLRPFLRKFALVFFDDILIYSSSLEEHAGHLAKIMQV

Query:  LHENQLVANRKKCQFALKQIEYLGHIISVDGVSADPSKIAAMEQWPIPRNIKELRGFLGLTGYYRKFVQDYGLIAHPLTQQLKKD-SFAWNDDATEAFFK
        L    L+ N+ KC+F   Q++++G+ IS  G +     I  + QW  P+N KELR FLG   Y RKF+     + HPL   LKKD  + W    T+A   
Subjt:  LHENQLVANRKKCQFALKQIEYLGHIISVDGVSADPSKIAAMEQWPIPRNIKELRGFLGLTGYYRKFVQDYGLIAHPLTQQLKKD-SFAWNDDATEAFFK

Query:  LKSAMVTVPVLRLPDFSKEFVVEADASGVGVGAVLMQES-----RPIAYFSHALPPLHRAKAVYERELMAIVMAIQKWRPYLLG--RRFTVRTDQSSL--
        +K  +V+ PVLR  DFSK+ ++E DAS V VGAVL Q+       P+ Y+S  +       +V ++E++AI+ +++ WR YL      F + TD  +L  
Subjt:  LKSAMVTVPVLRLPDFSKEFVVEADASGVGVGAVLMQES-----RPIAYFSHALPPLHRAKAVYERELMAIVMAIQKWRPYLLG--RRFTVRTDQSSL--

Query:  KFILDQRLISGEYQKLMTKLLGYDFQVVYKPGVENTVADALSR-------LPSALEFAAVSLVGGLN-----TSIVIEQQRLDPALAAIRSKITEGEEVP
        +   +    +    +    L  ++F++ Y+PG  N +ADALSR       +P   E  +++ V  ++      + V+ +   D  L  + +   E + V 
Subjt:  KFILDQRLISGEYQKLMTKLLGYDFQVVYKPGVENTVADALSR-------LPSALEFAAVSLVGGLN-----TSIVIEQQRLDPALAAIRSKITEGEEVP

Query:  VGYSLR-GNVLYYRGKLVLPEDSPTIPLLLEAFHSSPVGGHGGTLKTFQRLAKEVYWKGMKSRVKLFVSECTVCQQAKYLALAPAGLLQPLPIPERIWED
            L+ G ++  + +++LP D+     +++ +H      H G       + +   WKG++ +++ +V  C  CQ  K     P G LQP+P  ER WE 
Subjt:  VGYSLR-GNVLYYRGKLVLPEDSPTIPLLLEAFHSSPVGGHGGTLKTFQRLAKEVYWKGMKSRVKLFVSECTVCQQAKYLALAPAGLLQPLPIPERIWED

Query:  ISMDFVVGLPRAETFDSVFVVVDRLSKYAHFIPIRHPFTATGMAQTFIKEVVRLHGIPRSIVSDRDPVFTSMFWEELFRCQGTQLKRSTAYHPQTDGQTE
        +SMDF+  LP +  ++++FVVVDR SK A  +P     TA   A+ F + V+   G P+ I++D D +FTS  W++        +K S  Y PQTDGQTE
Subjt:  ISMDFVVGLPRAETFDSVFVVVDRLSKYAHFIPIRHPFTATGMAQTFIKEVVRLHGIPRSIVSDRDPVFTSMFWEELFRCQGTQLKRSTAYHPQTDGQTE

Query:  VVNRGMETYLRCFAMHCPLKWVRWLCWAEYSYNTSFHSGLKASPFEVVYGRSP---PSVLPYHSGESPVSEVDHQLKERDRMLAMIRERLLDAQQYMARM
          N+ +E  LRC     P  WV  +   + SYN + HS  + +PFE+V+  SP   P  LP  S      + D   +E  ++   ++E L      M + 
Subjt:  VVNRGMETYLRCFAMHCPLKWVRWLCWAEYSYNTSFHSGLKASPFEVVYGRSP---PSVLPYHSGESPVSEVDHQLKERDRMLAMIRERLLDAQQYMARM

Query:  VNEKRRDV-EFDVGDWAYLKLRPYRQGTLSKTLNAKLAPRYVGPFQVEARIGAVAYRLILPEHVR--VHPVFHVSQLRK
         + K +++ EF  GD   +K    + G L K  + KLAP + GPF V  + G   Y L LP+ ++      FHVS L K
Subjt:  VNEKRRDV-EFDVGDWAYLKLRPYRQGTLSKTLNAKLAPRYVGPFQVEARIGAVAYRLILPEHVR--VHPVFHVSQLRK

P0CT35 Transposon Tf2-2 polyprotein2.7e-13332.42Show/hide
Query:  QPLPGLPPTRMDDHRIELLPGTAPVSVRPYRYPQCQKDEIEKLVEEMLRAGVIQPSNSAFSSPVLLVKKKDGSWRFCVDYRALNKVTVPDKFPIPMVDEL
        +P+ GL      +  +EL      + +R Y  P  +   +   + + L++G+I+ S +  + PV+ V KK+G+ R  VDY+ LNK   P+ +P+P++++L
Subjt:  QPLPGLPPTRMDDHRIELLPGTAPVSVRPYRYPQCQKDEIEKLVEEMLRAGVIQPSNSAFSSPVLLVKKKDGSWRFCVDYRALNKVTVPDKFPIPMVDEL

Query:  LDELHGAIVFSKIDLKAGYHQIRLKAEDIPKSAFRTHEGHYEFLVMPFGLTNAPSTFQALMNKVLRPFLRKFALVFFDDILIYSSSLEEHAGHLAKIMQV
        L ++ G+ +F+K+DLK+ YH IR++  D  K AFR   G +E+LVMP+G++ AP+ FQ  +N +L        + + DDILI+S S  EH  H+  ++Q 
Subjt:  LDELHGAIVFSKIDLKAGYHQIRLKAEDIPKSAFRTHEGHYEFLVMPFGLTNAPSTFQALMNKVLRPFLRKFALVFFDDILIYSSSLEEHAGHLAKIMQV

Query:  LHENQLVANRKKCQFALKQIEYLGHIISVDGVSADPSKIAAMEQWPIPRNIKELRGFLGLTGYYRKFVQDYGLIAHPLTQQLKKD-SFAWNDDATEAFFK
        L    L+ N+ KC+F   Q++++G+ IS  G +     I  + QW  P+N KELR FLG   Y RKF+     + HPL   LKKD  + W    T+A   
Subjt:  LHENQLVANRKKCQFALKQIEYLGHIISVDGVSADPSKIAAMEQWPIPRNIKELRGFLGLTGYYRKFVQDYGLIAHPLTQQLKKD-SFAWNDDATEAFFK

Query:  LKSAMVTVPVLRLPDFSKEFVVEADASGVGVGAVLMQES-----RPIAYFSHALPPLHRAKAVYERELMAIVMAIQKWRPYLLG--RRFTVRTDQSSL--
        +K  +V+ PVLR  DFSK+ ++E DAS V VGAVL Q+       P+ Y+S  +       +V ++E++AI+ +++ WR YL      F + TD  +L  
Subjt:  LKSAMVTVPVLRLPDFSKEFVVEADASGVGVGAVLMQES-----RPIAYFSHALPPLHRAKAVYERELMAIVMAIQKWRPYLLG--RRFTVRTDQSSL--

Query:  KFILDQRLISGEYQKLMTKLLGYDFQVVYKPGVENTVADALSR-------LPSALEFAAVSLVGGLN-----TSIVIEQQRLDPALAAIRSKITEGEEVP
        +   +    +    +    L  ++F++ Y+PG  N +ADALSR       +P   E  +++ V  ++      + V+ +   D  L  + +   E + V 
Subjt:  KFILDQRLISGEYQKLMTKLLGYDFQVVYKPGVENTVADALSR-------LPSALEFAAVSLVGGLN-----TSIVIEQQRLDPALAAIRSKITEGEEVP

Query:  VGYSLR-GNVLYYRGKLVLPEDSPTIPLLLEAFHSSPVGGHGGTLKTFQRLAKEVYWKGMKSRVKLFVSECTVCQQAKYLALAPAGLLQPLPIPERIWED
            L+ G ++  + +++LP D+     +++ +H      H G       + +   WKG++ +++ +V  C  CQ  K     P G LQP+P  ER WE 
Subjt:  VGYSLR-GNVLYYRGKLVLPEDSPTIPLLLEAFHSSPVGGHGGTLKTFQRLAKEVYWKGMKSRVKLFVSECTVCQQAKYLALAPAGLLQPLPIPERIWED

Query:  ISMDFVVGLPRAETFDSVFVVVDRLSKYAHFIPIRHPFTATGMAQTFIKEVVRLHGIPRSIVSDRDPVFTSMFWEELFRCQGTQLKRSTAYHPQTDGQTE
        +SMDF+  LP +  ++++FVVVDR SK A  +P     TA   A+ F + V+   G P+ I++D D +FTS  W++        +K S  Y PQTDGQTE
Subjt:  ISMDFVVGLPRAETFDSVFVVVDRLSKYAHFIPIRHPFTATGMAQTFIKEVVRLHGIPRSIVSDRDPVFTSMFWEELFRCQGTQLKRSTAYHPQTDGQTE

Query:  VVNRGMETYLRCFAMHCPLKWVRWLCWAEYSYNTSFHSGLKASPFEVVYGRSP---PSVLPYHSGESPVSEVDHQLKERDRMLAMIRERLLDAQQYMARM
          N+ +E  LRC     P  WV  +   + SYN + HS  + +PFE+V+  SP   P  LP  S      + D   +E  ++   ++E L      M + 
Subjt:  VVNRGMETYLRCFAMHCPLKWVRWLCWAEYSYNTSFHSGLKASPFEVVYGRSP---PSVLPYHSGESPVSEVDHQLKERDRMLAMIRERLLDAQQYMARM

Query:  VNEKRRDV-EFDVGDWAYLKLRPYRQGTLSKTLNAKLAPRYVGPFQVEARIGAVAYRLILPEHVR--VHPVFHVSQLRK
         + K +++ EF  GD   +K    + G L K  + KLAP + GPF V  + G   Y L LP+ ++      FHVS L K
Subjt:  VNEKRRDV-EFDVGDWAYLKLRPYRQGTLSKTLNAKLAPRYVGPFQVEARIGAVAYRLILPEHVR--VHPVFHVSQLRK

P0CT41 Transposon Tf2-12 polyprotein2.7e-13332.42Show/hide
Query:  QPLPGLPPTRMDDHRIELLPGTAPVSVRPYRYPQCQKDEIEKLVEEMLRAGVIQPSNSAFSSPVLLVKKKDGSWRFCVDYRALNKVTVPDKFPIPMVDEL
        +P+ GL      +  +EL      + +R Y  P  +   +   + + L++G+I+ S +  + PV+ V KK+G+ R  VDY+ LNK   P+ +P+P++++L
Subjt:  QPLPGLPPTRMDDHRIELLPGTAPVSVRPYRYPQCQKDEIEKLVEEMLRAGVIQPSNSAFSSPVLLVKKKDGSWRFCVDYRALNKVTVPDKFPIPMVDEL

Query:  LDELHGAIVFSKIDLKAGYHQIRLKAEDIPKSAFRTHEGHYEFLVMPFGLTNAPSTFQALMNKVLRPFLRKFALVFFDDILIYSSSLEEHAGHLAKIMQV
        L ++ G+ +F+K+DLK+ YH IR++  D  K AFR   G +E+LVMP+G++ AP+ FQ  +N +L        + + DDILI+S S  EH  H+  ++Q 
Subjt:  LDELHGAIVFSKIDLKAGYHQIRLKAEDIPKSAFRTHEGHYEFLVMPFGLTNAPSTFQALMNKVLRPFLRKFALVFFDDILIYSSSLEEHAGHLAKIMQV

Query:  LHENQLVANRKKCQFALKQIEYLGHIISVDGVSADPSKIAAMEQWPIPRNIKELRGFLGLTGYYRKFVQDYGLIAHPLTQQLKKD-SFAWNDDATEAFFK
        L    L+ N+ KC+F   Q++++G+ IS  G +     I  + QW  P+N KELR FLG   Y RKF+     + HPL   LKKD  + W    T+A   
Subjt:  LHENQLVANRKKCQFALKQIEYLGHIISVDGVSADPSKIAAMEQWPIPRNIKELRGFLGLTGYYRKFVQDYGLIAHPLTQQLKKD-SFAWNDDATEAFFK

Query:  LKSAMVTVPVLRLPDFSKEFVVEADASGVGVGAVLMQES-----RPIAYFSHALPPLHRAKAVYERELMAIVMAIQKWRPYLLG--RRFTVRTDQSSL--
        +K  +V+ PVLR  DFSK+ ++E DAS V VGAVL Q+       P+ Y+S  +       +V ++E++AI+ +++ WR YL      F + TD  +L  
Subjt:  LKSAMVTVPVLRLPDFSKEFVVEADASGVGVGAVLMQES-----RPIAYFSHALPPLHRAKAVYERELMAIVMAIQKWRPYLLG--RRFTVRTDQSSL--

Query:  KFILDQRLISGEYQKLMTKLLGYDFQVVYKPGVENTVADALSR-------LPSALEFAAVSLVGGLN-----TSIVIEQQRLDPALAAIRSKITEGEEVP
        +   +    +    +    L  ++F++ Y+PG  N +ADALSR       +P   E  +++ V  ++      + V+ +   D  L  + +   E + V 
Subjt:  KFILDQRLISGEYQKLMTKLLGYDFQVVYKPGVENTVADALSR-------LPSALEFAAVSLVGGLN-----TSIVIEQQRLDPALAAIRSKITEGEEVP

Query:  VGYSLR-GNVLYYRGKLVLPEDSPTIPLLLEAFHSSPVGGHGGTLKTFQRLAKEVYWKGMKSRVKLFVSECTVCQQAKYLALAPAGLLQPLPIPERIWED
            L+ G ++  + +++LP D+     +++ +H      H G       + +   WKG++ +++ +V  C  CQ  K     P G LQP+P  ER WE 
Subjt:  VGYSLR-GNVLYYRGKLVLPEDSPTIPLLLEAFHSSPVGGHGGTLKTFQRLAKEVYWKGMKSRVKLFVSECTVCQQAKYLALAPAGLLQPLPIPERIWED

Query:  ISMDFVVGLPRAETFDSVFVVVDRLSKYAHFIPIRHPFTATGMAQTFIKEVVRLHGIPRSIVSDRDPVFTSMFWEELFRCQGTQLKRSTAYHPQTDGQTE
        +SMDF+  LP +  ++++FVVVDR SK A  +P     TA   A+ F + V+   G P+ I++D D +FTS  W++        +K S  Y PQTDGQTE
Subjt:  ISMDFVVGLPRAETFDSVFVVVDRLSKYAHFIPIRHPFTATGMAQTFIKEVVRLHGIPRSIVSDRDPVFTSMFWEELFRCQGTQLKRSTAYHPQTDGQTE

Query:  VVNRGMETYLRCFAMHCPLKWVRWLCWAEYSYNTSFHSGLKASPFEVVYGRSP---PSVLPYHSGESPVSEVDHQLKERDRMLAMIRERLLDAQQYMARM
          N+ +E  LRC     P  WV  +   + SYN + HS  + +PFE+V+  SP   P  LP  S      + D   +E  ++   ++E L      M + 
Subjt:  VVNRGMETYLRCFAMHCPLKWVRWLCWAEYSYNTSFHSGLKASPFEVVYGRSP---PSVLPYHSGESPVSEVDHQLKERDRMLAMIRERLLDAQQYMARM

Query:  VNEKRRDV-EFDVGDWAYLKLRPYRQGTLSKTLNAKLAPRYVGPFQVEARIGAVAYRLILPEHVR--VHPVFHVSQLRK
         + K +++ EF  GD   +K    + G L K  + KLAP + GPF V  + G   Y L LP+ ++      FHVS L K
Subjt:  VNEKRRDV-EFDVGDWAYLKLRPYRQGTLSKTLNAKLAPRYVGPFQVEARIGAVAYRLILPEHVR--VHPVFHVSQLRK

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein4.1e-13434.02Show/hide
Query:  LPPTRMD------DHRIELLPGTAPVSVRPYRYPQCQKDEIEKLVEEMLRAGVIQPSNSAFSSPVLLVKKKDGSWRFCVDYRALNKVTVPDKFPIPMVDE
        LPP   D       H IE+ PG     ++PY   +  + EI K+V+++L    I PS S  SSPV+LV KKDG++R CVDYR LNK T+ D FP+P +D 
Subjt:  LPPTRMD------DHRIELLPGTAPVSVRPYRYPQCQKDEIEKLVEEMLRAGVIQPSNSAFSSPVLLVKKKDGSWRFCVDYRALNKVTVPDKFPIPMVDE

Query:  LLDELHGAIVFSKIDLKAGYHQIRLKAEDIPKSAFRTHEGHYEFLVMPFGLTNAPSTFQALMNKVLRPFLRKFALVFFDDILIYSSSLEEHAGHLAKIMQ
        LL  +  A +F+ +DL +GYHQI ++ +D  K+AF T  G YE+ VMPFGL NAPSTF   M    R    +F  V+ DDILI+S S EEH  HL  +++
Subjt:  LLDELHGAIVFSKIDLKAGYHQIRLKAEDIPKSAFRTHEGHYEFLVMPFGLTNAPSTFQALMNKVLRPFLRKFALVFFDDILIYSSSLEEHAGHLAKIMQ

Query:  VLHENQLVANRKKCQFALKQIEYLGHIISVDGVSADPSKIAAMEQWPIPRNIKELRGFLGLTGYYRKFVQDYGLIAHPLTQQLKKDSFAWNDDATEAFFK
         L    L+  +KKC+FA ++ E+LG+ I +  ++    K AA+  +P P+ +K+ + FLG+  YYR+F+ +   IA P+ Q    D   W +   +A  K
Subjt:  VLHENQLVANRKKCQFALKQIEYLGHIISVDGVSADPSKIAAMEQWPIPRNIKELRGFLGLTGYYRKFVQDYGLIAHPLTQQLKKDSFAWNDDATEAFFK

Query:  LKSAMVTVPVLRLPDFSKEFVVEADASGVGVGAVLMQESRP------IAYFSHALPPLHRAKAVYERELMAIVMAIQKWRPYLLGRRFTVRTDQSSLKFI
        LK+A+   PVL   +    + +  DAS  G+GAVL +          + YFS +L    +     E EL+ I+ A+  +R  L G+ FT+RTD  SL  +
Subjt:  LKSAMVTVPVLRLPDFSKEFVVEADASGVGVGAVLMQESRP------IAYFSHALPPLHRAKAVYERELMAIVMAIQKWRPYLLGRRFTVRTDQSSLKFI

Query:  LDQRLISGEYQKLMTKLLGYDFQVVYKPGVENTVADALSRLPSAL-----------EFAAVSLVGGLNTSIVIEQQRLD---------PALAAIRSKITE
         ++   +   Q+ +  L  YDF + Y  G +N VADA+SR    +            + +      L ++++I  + L           A  + + K+  
Subjt:  LDQRLISGEYQKLMTKLLGYDFQVVYKPGVENTVADALSRLPSAL-----------EFAAVSLVGGLNTSIVIEQQRLD---------PALAAIRSKITE

Query:  GEEVPVGYSLRGNVLYYRGKLVLPEDSPTIPLLLEAFHSSPVGGHGGTLKTFQRLAKEVYWKGMKSRVKLFVSECTVCQQAKYLALAPAGLLQPLPIPER
         E     YSL   ++YY+ +LV+P       + L   H+   GGH G   T  +++   YW  ++  +  ++  C  CQ  K       GLLQPLPI E 
Subjt:  GEEVPVGYSLRGNVLYYRGKLVLPEDSPTIPLLLEAFHSSPVGGHGGTLKTFQRLAKEVYWKGMKSRVKLFVSECTVCQQAKYLALAPAGLLQPLPIPER

Query:  IWEDISMDFVVGL-PRAETFDSVFVVVDRLSKYAHFIPIRHPFTATGMAQTFIKEVVRLHGIPRSIVSDRDPVFTSMFWEELFRCQGTQLKRSTAYHPQT
         W DISMDFV GL P +   + + VVVDR SK AHFI  R    AT +     + +   HG PR+I SDRD   T+  ++EL +  G +   S+A HPQT
Subjt:  IWEDISMDFVVGL-PRAETFDSVFVVVDRLSKYAHFIPIRHPFTATGMAQTFIKEVVRLHGIPRSIVSDRDPVFTSMFWEELFRCQGTQLKRSTAYHPQT

Query:  DGQTEVVNRGMETYLRCFAMHCPLKWVRWLCWAEYSYNTSFHSGLKASPFEVVYGRSP--PSVLPYHSGESPVSEVDHQLKERDRMLAMIRERLLDAQQY
        DGQ+E   + +   LR +       W  +L   E+ YN++    L  SPFE+  G  P  P++       +         K    +    +E+L  AQ  
Subjt:  DGQTEVVNRGMETYLRCFAMHCPLKWVRWLCWAEYSYNTSFHSGLKASPFEVVYGRSP--PSVLPYHSGESPVSEVDHQLKERDRMLAMIRERLLDAQQY

Query:  MARMVNEKRRDVEFDVGDWAYLKLRPYRQGTLSKTLNAKLAPRYVGPFQVEARIGAVAYRLILPEHVRVHPVFHVSQLR
        M    N++R+ +  ++GD   +    +R     K    K+   YVGPF+V  +I   AY L L  H + H V +V  L+
Subjt:  MARMVNEKRRDVEFDVGDWAYLKLRPYRQGTLSKTLNAKLAPRYVGPFQVEARIGAVAYRLILPEHVRVHPVFHVSQLR

Q99315 Transposon Ty3-G Gag-Pol polyprotein2.9e-13534.2Show/hide
Query:  LPPTRMD------DHRIELLPGTAPVSVRPYRYPQCQKDEIEKLVEEMLRAGVIQPSNSAFSSPVLLVKKKDGSWRFCVDYRALNKVTVPDKFPIPMVDE
        LPP   D       H IE+ PG     ++PY   +  + EI K+V+++L    I PS S  SSPV+LV KKDG++R CVDYR LNK T+ D FP+P +D 
Subjt:  LPPTRMD------DHRIELLPGTAPVSVRPYRYPQCQKDEIEKLVEEMLRAGVIQPSNSAFSSPVLLVKKKDGSWRFCVDYRALNKVTVPDKFPIPMVDE

Query:  LLDELHGAIVFSKIDLKAGYHQIRLKAEDIPKSAFRTHEGHYEFLVMPFGLTNAPSTFQALMNKVLRPFLRKFALVFFDDILIYSSSLEEHAGHLAKIMQ
        LL  +  A +F+ +DL +GYHQI ++ +D  K+AF T  G YE+ VMPFGL NAPSTF   M    R    +F  V+ DDILI+S S EEH  HL  +++
Subjt:  LLDELHGAIVFSKIDLKAGYHQIRLKAEDIPKSAFRTHEGHYEFLVMPFGLTNAPSTFQALMNKVLRPFLRKFALVFFDDILIYSSSLEEHAGHLAKIMQ

Query:  VLHENQLVANRKKCQFALKQIEYLGHIISVDGVSADPSKIAAMEQWPIPRNIKELRGFLGLTGYYRKFVQDYGLIAHPLTQQLKKDSFAWNDDATEAFFK
         L    L+  +KKC+FA ++ E+LG+ I +  ++    K AA+  +P P+ +K+ + FLG+  YYR+F+ +   IA P+ Q    D   W +   +A  K
Subjt:  VLHENQLVANRKKCQFALKQIEYLGHIISVDGVSADPSKIAAMEQWPIPRNIKELRGFLGLTGYYRKFVQDYGLIAHPLTQQLKKDSFAWNDDATEAFFK

Query:  LKSAMVTVPVLRLPDFSKEFVVEADASGVGVGAVLMQESRP------IAYFSHALPPLHRAKAVYERELMAIVMAIQKWRPYLLGRRFTVRTDQSSLKFI
        LK A+   PVL   +    + +  DAS  G+GAVL +          + YFS +L    +     E EL+ I+ A+  +R  L G+ FT+RTD  SL  +
Subjt:  LKSAMVTVPVLRLPDFSKEFVVEADASGVGVGAVLMQESRP------IAYFSHALPPLHRAKAVYERELMAIVMAIQKWRPYLLGRRFTVRTDQSSLKFI

Query:  LDQRLISGEYQKLMTKLLGYDFQVVYKPGVENTVADALSRLPSAL-----------EFAAVSLVGGLNTSIVIEQQRLD---------PALAAIRSKITE
         ++   +   Q+ +  L  YDF + Y  G +N VADA+SR    +            + +      L ++++I  + L           A  + + K+  
Subjt:  LDQRLISGEYQKLMTKLLGYDFQVVYKPGVENTVADALSRLPSAL-----------EFAAVSLVGGLNTSIVIEQQRLD---------PALAAIRSKITE

Query:  GEEVPVGYSLRGNVLYYRGKLVLPEDSPTIPLLLEAFHSSPVGGHGGTLKTFQRLAKEVYWKGMKSRVKLFVSECTVCQQAKYLALAPAGLLQPLPIPER
         E     YSL   ++YY+ +LV+P       + L   H+   GGH G   T  +++   YW  ++  +  ++  C  CQ  K       GLLQPLPI E 
Subjt:  GEEVPVGYSLRGNVLYYRGKLVLPEDSPTIPLLLEAFHSSPVGGHGGTLKTFQRLAKEVYWKGMKSRVKLFVSECTVCQQAKYLALAPAGLLQPLPIPER

Query:  IWEDISMDFVVGL-PRAETFDSVFVVVDRLSKYAHFIPIRHPFTATGMAQTFIKEVVRLHGIPRSIVSDRDPVFTSMFWEELFRCQGTQLKRSTAYHPQT
         W DISMDFV GL P +   + + VVVDR SK AHFI  R    AT +     + +   HG PR+I SDRD   T+  ++EL +  G +   S+A HPQT
Subjt:  IWEDISMDFVVGL-PRAETFDSVFVVVDRLSKYAHFIPIRHPFTATGMAQTFIKEVVRLHGIPRSIVSDRDPVFTSMFWEELFRCQGTQLKRSTAYHPQT

Query:  DGQTEVVNRGMETYLRCFAMHCPLKWVRWLCWAEYSYNTSFHSGLKASPFEVVYGRSP--PSVLPYHSGESPVSEVDHQLKERDRMLAMIRERLLDAQQY
        DGQ+E   + +   LR +A      W  +L   E+ YN++    L  SPFE+  G  P  P++       +         K    +    +E+L  AQ  
Subjt:  DGQTEVVNRGMETYLRCFAMHCPLKWVRWLCWAEYSYNTSFHSGLKASPFEVVYGRSP--PSVLPYHSGESPVSEVDHQLKERDRMLAMIRERLLDAQQY

Query:  MARMVNEKRRDVEFDVGDWAYLKLRPYRQGTLSKTLNAKLAPRYVGPFQVEARIGAVAYRLILPEHVRVHPVFHVSQLRK
        M    N++R+ +  ++GD   +    +R     K    K+   YVGPF+V  +I   AY L L  H + H V +V  L+K
Subjt:  MARMVNEKRRDVEFDVGDWAYLKLRPYRQGTLSKTLNAKLAPRYVGPFQVEARIGAVAYRLILPEHVRVHPVFHVSQLRK

Arabidopsis top hitse value%identityAlignment
AT1G67020.1 unknown protein1.1e-1244.59Show/hide
Query:  LRKLEIPVFGGASPLEWLHQVERYFTVNRLTEAEKLEAAVLCLEGVALKWHYYENKLGSFSNWEDFRRLLLRRF
        +R++E+PVF G+   EW  +VER+F V R  +++KL+   L LEGVALKW   E     F +W  F + LL RF
Subjt:  LRKLEIPVFGGASPLEWLHQVERYFTVNRLTEAEKLEAAVLCLEGVALKWHYYENKLGSFSNWEDFRRLLLRRF

AT3G29750.1 Eukaryotic aspartyl protease family protein2.7e-1129.44Show/hide
Query:  HRCKKKELQVLVVHDGESDEAEGNDGVAMQTVEMADNKG--EDGPKEHEAELALNSMVGLGSPKTLKVKGLIRSREVVVLIDSGATHNFIATTLVKELQL
        ++ ++ EL  L +   + D  +   GV  +  E+  +      G ++   +L  N        K ++  G I   +VVV IDSGAT NFI   L   L+L
Subjt:  HRCKKKELQVLVVHDGESDEAEGNDGVAMQTVEMADNKG--EDGPKEHEAELALNSMVGLGSPKTLKVKGLIRSREVVVLIDSGATHNFIATTLVKELQL

Query:  PTGQTEGYGVVLGTEGSSIKGTEICREVVLTLGPITIMQDFLPLPLG--SADVILGVQWLSTLGDVTTNHQNLLMKFMLGKSAVVL---QGDPSLVRSQV
        PT  T    V+LG +   I+    C  + L +  + I ++FL L L     DVILG +WLS LG+   N QN    F   +  + L     +   V ++V
Subjt:  PTGQTEGYGVVLGTEGSSIKGTEICREVVLTLGPITIMQDFLPLPLG--SADVILGVQWLSTLGDVTTNHQNLLMKFMLGKSAVVL---QGDPSLVRSQV

Query:  TLKTLIKQLKLEKQ
         +K+  +Q  +E+Q
Subjt:  TLKTLIKQLKLEKQ

AT3G30770.1 Eukaryotic aspartyl protease family protein5.2e-0731.03Show/hide
Query:  KTLKVKGLIRSREVVVLIDSGATHNFIATTLVKELQLPTGQTEGYGVVLGTEGSSIKGTEICREVVLTLGPITIMQDFLPLPL--GSADVILGV------
        K ++  G I   +VVV+IDSGAT+NFI+  L   L+LPT  T    V+LG +   I+    C  + L +  + I ++FL L L     DVILG       
Subjt:  KTLKVKGLIRSREVVVLIDSGATHNFIATTLVKELQLPTGQTEGYGVVLGTEGSSIKGTEICREVVLTLGPITIMQDFLPLPL--GSADVILGV------

Query:  --QWLSTLGDVTTNHQNLLMKFMLGKSAVVLQGDPSLVRSQVTLKTLIKQLKLEKQGFLVELNALQGGRMEISG
          QWL  L    +   N     +  K   + Q     V ++V +K+  +Q K++   +L +   L+GG   + G
Subjt:  --QWLSTLGDVTTNHQNLLMKFMLGKSAVVLQGDPSLVRSQVTLKTLIKQLKLEKQGFLVELNALQGGRMEISG

AT3G42723.1 aminoacyl-tRNA ligases;ATP binding;nucleotide binding1.3e-1029.03Show/hide
Query:  LHQVERYFTVNRLTEAEKLEAAVLCLEGVALKWHYYENKLGSFSNWEDFRRLLLRRFLAPKEGTLHERFFALKQESTVAEYREKFEEFAAAVDNLDGDSL
        L   E YF  N + E E+L+     LEG   +W  +  K  S ++W++F+ ++ R      +      +  ++QE +V EYRE+FE        L G  L
Subjt:  LHQVERYFTVNRLTEAEKLEAAVLCLEGVALKWHYYENKLGSFSNWEDFRRLLLRRFLAPKEGTLHERFFALKQESTVAEYREKFEEFAAAVDNLDGDSL

Query:  EGKFVDGLVEDIKVEMRICMPVGLRAKMEMAQKIED-KLLHLENKQTGVTLSPKV
        E  F+ GL   ++  +R   P G+   M+ AQ +E+   L +      V   PKV
Subjt:  EGKFVDGLVEDIKVEMRICMPVGLRAKMEMAQKIED-KLLHLENKQTGVTLSPKV

ATMG00860.1 DNA/RNA polymerases superfamily protein3.1e-3656.92Show/hide
Query:  HLAKIMQVLHENQLVANRKKCQFALKQIEYLG--HIISVDGVSADPSKIAAMEQWPIPRNIKELRGFLGLTGYYRKFVQDYGLIAHPLTQQLKKDSFAWN
        HL  ++Q+  ++Q  ANRKKC F   QI YLG  HIIS +GVSADP+K+ AM  WP P+N  ELRGFLGLTGYYR+FV++YG I  PLT+ LKK+S  W 
Subjt:  HLAKIMQVLHENQLVANRKKCQFALKQIEYLG--HIISVDGVSADPSKIAAMEQWPIPRNIKELRGFLGLTGYYRKFVQDYGLIAHPLTQQLKKDSFAWN

Query:  DDATEAFFKLKSAMVTVPVLRLPDFSKEFV
        + A  AF  LK A+ T+PVL LPD    FV
Subjt:  DDATEAFFKLKSAMVTVPVLRLPDFSKEFV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGACAAGAATGGAAACACGAGTAGAGAATCTTGAAGGACGAATGTTGGAATTGAGCAAGGAAATGGCCGGAATGAGGCAAGAGATTTCGGGAATACGCGGATCTAT
TTCGACCGAAGTAGAAGCTGCGGTATCGAAGAGTGTGGGTGAGGCGATGAAGGAGGGAATGCAAGCCATCCTCGCCCGGATCGAAGGGTTGAAATCCACGGATCAGGCCG
TGGATAACAGGAAAACGAAAGGGAAGAACGTTATCGGGTCGGAGGAAGTGGCTGTCGAAGTAGGGGAGGGGAGTAACACGGTTTTACAATCACACCTGACCAAGTCTCTG
GCTCCGGTGTTCGATCTTCGTCTGCGGAAACTGGAAATACCAGTGTTTGGGGGAGCAAGTCCGTTGGAATGGTTACATCAAGTAGAACGGTACTTCACCGTCAACCGACT
GACGGAAGCGGAGAAGCTGGAAGCGGCGGTACTGTGTCTCGAAGGCGTGGCGCTTAAGTGGCATTACTATGAGAACAAATTGGGTTCGTTCAGTAACTGGGAGGATTTCC
GTCGATTATTGTTACGGAGGTTTCTGGCGCCAAAAGAAGGTACGCTTCACGAACGATTCTTTGCTCTCAAGCAAGAATCTACTGTAGCTGAATATAGGGAGAAATTTGAG
GAATTTGCTGCTGCCGTAGATAACCTAGACGGCGATTCGTTGGAGGGGAAATTTGTTGATGGACTGGTTGAAGACATTAAGGTCGAGATGAGAATATGTATGCCTGTTGG
GCTTAGGGCCAAGATGGAAATGGCCCAAAAGATTGAAGATAAGCTGCTGCATCTTGAAAATAAACAGACGGGAGTTACCTTGAGTCCAAAAGTGGTGGGCCAGAGTGTTA
GTAATTACAAGCCCATATCATATGGGGCTGGGGCTAGCGCTCGTACCATCTCGGTAAACCCAAGTAAATTGGGAGAGAGGGTACGTAAAGAAACTCCAGGGAAGTCATTC
GAGGCTTCTTCTTCATCAAATCCGTATAAGCGTCTTACTGAGTCCGAATACCAGAACAGACGGGATAGGGGCCTTTGCTTCAAATGTGAGGAAAAGTTTTCTCCAGGCCA
TCGGTGTAAAAAAAAGGAGTTGCAAGTCCTAGTAGTCCATGATGGCGAGAGCGATGAAGCTGAAGGTAATGATGGAGTAGCCATGCAAACCGTGGAGATGGCAGATAATA
AGGGTGAGGACGGTCCGAAGGAGCATGAAGCGGAACTGGCGTTAAATTCCATGGTAGGGCTGGGATCGCCAAAGACATTAAAGGTCAAGGGTTTGATTCGATCTCGAGAG
GTAGTCGTGCTGATTGACAGCGGCGCTACTCACAACTTCATTGCAACAACGTTGGTGAAGGAACTCCAACTTCCGACGGGCCAAACCGAAGGGTACGGGGTCGTTTTGGG
TACTGAGGGATCTTCGATAAAAGGTACCGAGATTTGTCGAGAGGTGGTCCTGACCTTGGGACCGATCACAATAATGCAAGATTTCTTACCCTTACCGTTGGGGAGTGCGG
ATGTGATTTTGGGCGTCCAGTGGTTATCAACTTTGGGTGACGTCACTACCAATCATCAGAACTTACTGATGAAGTTCATGCTAGGAAAATCTGCCGTCGTCCTCCAAGGC
GACCCGTCGTTGGTACGTTCCCAAGTGACCCTTAAAACTCTGATTAAACAGCTCAAGCTGGAGAAGCAGGGGTTTTTGGTCGAACTCAATGCTCTTCAGGGAGGGCGGAT
GGAAATTTCCGGTAGCGAGATCGAAGGCTGGGAGCAGATTCCGGACTGCATTCGTTCGGTCCTGACCCGCCATTTTTCGGTATTCCAACCGTTACCGGGCCTCCCACCCA
CACGCATGGATGATCATCGTATTGAGCTGTTGCCAGGAACGGCTCCTGTGAGTGTCCGTCCCTATCGATACCCTCAGTGTCAAAAGGATGAGATTGAGAAGTTGGTGGAA
GAAATGTTGAGGGCTGGCGTGATTCAACCGAGCAATAGTGCTTTTTCAAGTCCCGTGTTGCTAGTGAAGAAAAAGGATGGTAGTTGGCGATTTTGTGTCGATTACAGGGC
GTTAAACAAAGTGACGGTGCCGGACAAATTTCCAATTCCCATGGTGGATGAATTACTCGATGAGCTCCACGGCGCCATTGTTTTTTCGAAGATTGACTTAAAGGCAGGGT
ACCATCAAATTCGATTAAAGGCCGAGGATATACCAAAGTCTGCTTTTCGTACCCATGAGGGGCATTATGAATTTTTGGTGATGCCGTTCGGCCTTACGAATGCGCCCTCG
ACGTTCCAGGCTCTTATGAATAAGGTATTACGTCCCTTTCTTCGGAAGTTCGCTCTGGTGTTTTTCGACGACATATTAATTTACAGTTCATCCTTGGAGGAACATGCGGG
GCATTTGGCAAAAATAATGCAAGTGCTTCATGAAAATCAGCTGGTGGCTAATCGAAAAAAGTGTCAGTTCGCTCTCAAACAGATTGAGTACCTTGGGCATATTATTTCAG
TCGATGGAGTGTCAGCTGATCCGTCCAAGATCGCTGCCATGGAACAGTGGCCAATCCCTCGGAATATTAAGGAGTTACGGGGGTTCCTCGGTTTGACGGGATATTACCGT
AAGTTTGTTCAAGATTATGGGTTAATTGCTCATCCGTTAACTCAGCAGCTAAAGAAGGACAGTTTTGCTTGGAACGATGACGCCACCGAAGCATTTTTCAAACTGAAGTC
TGCCATGGTCACGGTTCCGGTTTTGCGTTTGCCCGATTTTTCCAAGGAGTTCGTCGTGGAGGCCGATGCATCGGGAGTGGGAGTTGGGGCTGTTTTGATGCAAGAATCAC
GACCTATTGCTTATTTTAGCCATGCCCTTCCTCCGTTACACAGAGCAAAAGCTGTTTATGAGAGGGAATTGATGGCCATCGTGATGGCTATACAGAAGTGGAGGCCGTAT
CTCTTGGGCCGTCGTTTTACCGTTAGAACAGATCAGAGCAGTTTGAAATTCATCCTCGACCAACGATTAATATCCGGGGAGTATCAGAAGTTGATGACCAAGTTGTTGGG
CTATGATTTTCAGGTAGTGTACAAACCAGGCGTGGAAAATACAGTGGCTGATGCGTTGTCGCGTTTACCTAGTGCACTGGAATTCGCGGCAGTCAGTCTGGTAGGAGGCC
TTAATACGAGCATTGTGATTGAACAACAGCGCCTTGACCCAGCTTTGGCTGCCATTCGGAGTAAAATCACGGAGGGGGAGGAAGTCCCAGTGGGCTATTCCTTACGAGGG
AATGTGTTGTATTACCGTGGGAAACTAGTCCTACCGGAGGATTCACCTACGATCCCTTTATTGTTGGAGGCTTTCCATTCTTCTCCAGTGGGCGGTCATGGGGGTACTTT
AAAAACTTTCCAACGTTTAGCTAAGGAGGTATATTGGAAAGGAATGAAATCCCGAGTGAAGCTTTTTGTGAGCGAATGTACCGTTTGTCAGCAAGCTAAGTATTTGGCTC
TTGCTCCCGCGGGACTTCTTCAACCATTACCAATTCCGGAGCGTATTTGGGAGGACATTTCCATGGATTTCGTGGTGGGTTTACCCCGCGCGGAGACTTTTGATTCAGTT
TTCGTTGTGGTGGATCGGCTATCGAAGTATGCCCATTTTATACCGATCAGACATCCATTTACAGCAACAGGTATGGCGCAAACCTTTATCAAGGAAGTGGTGCGGTTGCA
CGGGATTCCACGTAGCATTGTTTCTGATCGAGACCCCGTCTTCACGAGTATGTTTTGGGAGGAACTTTTCCGCTGTCAAGGTACGCAGCTGAAGCGGAGCACGGCTTACC
ACCCTCAGACGGATGGACAAACGGAAGTGGTAAATCGAGGTATGGAGACGTATTTACGCTGTTTCGCGATGCACTGTCCATTGAAGTGGGTCCGTTGGTTGTGTTGGGCT
GAATATAGCTATAATACATCATTTCATTCGGGCCTAAAAGCTTCCCCATTTGAAGTGGTTTATGGAAGGTCTCCCCCTTCTGTTCTGCCTTACCATTCGGGGGAGTCCCC
CGTTTCAGAGGTGGACCATCAGCTCAAGGAACGCGATCGCATGTTGGCAATGATTAGGGAACGGTTATTGGATGCACAACAATATATGGCCCGCATGGTAAACGAAAAAC
GAAGAGACGTGGAATTTGACGTAGGGGATTGGGCGTATTTGAAATTAAGACCTTATCGGCAAGGAACTTTATCTAAGACATTGAATGCCAAATTAGCCCCTCGCTATGTC
GGTCCTTTCCAGGTGGAAGCAAGGATAGGCGCTGTGGCGTATCGCCTGATATTGCCCGAGCATGTTCGGGTGCATCCTGTTTTTCACGTGTCTCAGCTACGGAAGGCCTT
GGGCAGTTCATTCCCTGTAGTTCCTTTTCCTCAGAATGTGTCTTCGGATATGTTGTTCAAAGTGGTGCCGGTGGAGCTATTAGGTTTGCGTCAGTCAGTGGAAGATTCTA
CTAAGTTGGAGGTATTGATTCGCTGGGCTGAGTCGGAACCCTCGGAAGCGACATGGGAGGATGCAGTTCAACTCGCTACCCAATTTCCTGATTTTCACCTTGAGGACAAG
GTGGCGCTTTGGGGGCCGGGTAGTGATAGACCACCAATTAAGTATGTGTATAGGAGGAAAAAGCTTATAACCAGTGTTTTCGAGGCGCACTCTGGCGCGTGCCTCAGGCA
AGGTGCCTCACTGAGTGCCCAGGCAAGGCGCCTCGCCCGATTGAAGGCACTCGCCTCTCGCCAAGGGTTTGAGTTCTTGGATTTGGGCTTTAATCAAAACTCTAATTCTG
GACATCAATCCAAGCTTCGTCCTCCTCCAAGAAACAAACTGAATAGGGTTGACAGAGCCCTAATTAAATCTCAATTGAGTTCCTCCCGCATTCCTTGGTGTTTCCTCGAA
GCTACCAGATCTACTGGGGATTATCATAATGAGGAGTGTCCCTGCCTTCCTTGTAAAGATCAGAGACTTGCTCAAACAAAAATCAAAGAAAACCTCCTGGAAATGGATGC
TATGGAAAAAGCATCTCTTGGAGACAAAAATGACAGAATTCTTCAAAAACTCGTGGAACACCATCAAATGCAATCTATTGGTCCTAAAATTGTGGATGATTGGGAAGTGA
GTAGAAAGAAAGGTATAGCTCTAAAGCTGGATCTGGAAAAGGCCGTGATAAGATTGATCGAGTCTTCTAAGATGCCATTCTACAAGTCAAAGGGTTTGGTGGTGTTTGGA
GGAAAGAGATCTTTGGATGCACATCTAGCACAGGCTTTTCAATCATTATAA
mRNA sequenceShow/hide mRNA sequence
ATGGCGACAAGAATGGAAACACGAGTAGAGAATCTTGAAGGACGAATGTTGGAATTGAGCAAGGAAATGGCCGGAATGAGGCAAGAGATTTCGGGAATACGCGGATCTAT
TTCGACCGAAGTAGAAGCTGCGGTATCGAAGAGTGTGGGTGAGGCGATGAAGGAGGGAATGCAAGCCATCCTCGCCCGGATCGAAGGGTTGAAATCCACGGATCAGGCCG
TGGATAACAGGAAAACGAAAGGGAAGAACGTTATCGGGTCGGAGGAAGTGGCTGTCGAAGTAGGGGAGGGGAGTAACACGGTTTTACAATCACACCTGACCAAGTCTCTG
GCTCCGGTGTTCGATCTTCGTCTGCGGAAACTGGAAATACCAGTGTTTGGGGGAGCAAGTCCGTTGGAATGGTTACATCAAGTAGAACGGTACTTCACCGTCAACCGACT
GACGGAAGCGGAGAAGCTGGAAGCGGCGGTACTGTGTCTCGAAGGCGTGGCGCTTAAGTGGCATTACTATGAGAACAAATTGGGTTCGTTCAGTAACTGGGAGGATTTCC
GTCGATTATTGTTACGGAGGTTTCTGGCGCCAAAAGAAGGTACGCTTCACGAACGATTCTTTGCTCTCAAGCAAGAATCTACTGTAGCTGAATATAGGGAGAAATTTGAG
GAATTTGCTGCTGCCGTAGATAACCTAGACGGCGATTCGTTGGAGGGGAAATTTGTTGATGGACTGGTTGAAGACATTAAGGTCGAGATGAGAATATGTATGCCTGTTGG
GCTTAGGGCCAAGATGGAAATGGCCCAAAAGATTGAAGATAAGCTGCTGCATCTTGAAAATAAACAGACGGGAGTTACCTTGAGTCCAAAAGTGGTGGGCCAGAGTGTTA
GTAATTACAAGCCCATATCATATGGGGCTGGGGCTAGCGCTCGTACCATCTCGGTAAACCCAAGTAAATTGGGAGAGAGGGTACGTAAAGAAACTCCAGGGAAGTCATTC
GAGGCTTCTTCTTCATCAAATCCGTATAAGCGTCTTACTGAGTCCGAATACCAGAACAGACGGGATAGGGGCCTTTGCTTCAAATGTGAGGAAAAGTTTTCTCCAGGCCA
TCGGTGTAAAAAAAAGGAGTTGCAAGTCCTAGTAGTCCATGATGGCGAGAGCGATGAAGCTGAAGGTAATGATGGAGTAGCCATGCAAACCGTGGAGATGGCAGATAATA
AGGGTGAGGACGGTCCGAAGGAGCATGAAGCGGAACTGGCGTTAAATTCCATGGTAGGGCTGGGATCGCCAAAGACATTAAAGGTCAAGGGTTTGATTCGATCTCGAGAG
GTAGTCGTGCTGATTGACAGCGGCGCTACTCACAACTTCATTGCAACAACGTTGGTGAAGGAACTCCAACTTCCGACGGGCCAAACCGAAGGGTACGGGGTCGTTTTGGG
TACTGAGGGATCTTCGATAAAAGGTACCGAGATTTGTCGAGAGGTGGTCCTGACCTTGGGACCGATCACAATAATGCAAGATTTCTTACCCTTACCGTTGGGGAGTGCGG
ATGTGATTTTGGGCGTCCAGTGGTTATCAACTTTGGGTGACGTCACTACCAATCATCAGAACTTACTGATGAAGTTCATGCTAGGAAAATCTGCCGTCGTCCTCCAAGGC
GACCCGTCGTTGGTACGTTCCCAAGTGACCCTTAAAACTCTGATTAAACAGCTCAAGCTGGAGAAGCAGGGGTTTTTGGTCGAACTCAATGCTCTTCAGGGAGGGCGGAT
GGAAATTTCCGGTAGCGAGATCGAAGGCTGGGAGCAGATTCCGGACTGCATTCGTTCGGTCCTGACCCGCCATTTTTCGGTATTCCAACCGTTACCGGGCCTCCCACCCA
CACGCATGGATGATCATCGTATTGAGCTGTTGCCAGGAACGGCTCCTGTGAGTGTCCGTCCCTATCGATACCCTCAGTGTCAAAAGGATGAGATTGAGAAGTTGGTGGAA
GAAATGTTGAGGGCTGGCGTGATTCAACCGAGCAATAGTGCTTTTTCAAGTCCCGTGTTGCTAGTGAAGAAAAAGGATGGTAGTTGGCGATTTTGTGTCGATTACAGGGC
GTTAAACAAAGTGACGGTGCCGGACAAATTTCCAATTCCCATGGTGGATGAATTACTCGATGAGCTCCACGGCGCCATTGTTTTTTCGAAGATTGACTTAAAGGCAGGGT
ACCATCAAATTCGATTAAAGGCCGAGGATATACCAAAGTCTGCTTTTCGTACCCATGAGGGGCATTATGAATTTTTGGTGATGCCGTTCGGCCTTACGAATGCGCCCTCG
ACGTTCCAGGCTCTTATGAATAAGGTATTACGTCCCTTTCTTCGGAAGTTCGCTCTGGTGTTTTTCGACGACATATTAATTTACAGTTCATCCTTGGAGGAACATGCGGG
GCATTTGGCAAAAATAATGCAAGTGCTTCATGAAAATCAGCTGGTGGCTAATCGAAAAAAGTGTCAGTTCGCTCTCAAACAGATTGAGTACCTTGGGCATATTATTTCAG
TCGATGGAGTGTCAGCTGATCCGTCCAAGATCGCTGCCATGGAACAGTGGCCAATCCCTCGGAATATTAAGGAGTTACGGGGGTTCCTCGGTTTGACGGGATATTACCGT
AAGTTTGTTCAAGATTATGGGTTAATTGCTCATCCGTTAACTCAGCAGCTAAAGAAGGACAGTTTTGCTTGGAACGATGACGCCACCGAAGCATTTTTCAAACTGAAGTC
TGCCATGGTCACGGTTCCGGTTTTGCGTTTGCCCGATTTTTCCAAGGAGTTCGTCGTGGAGGCCGATGCATCGGGAGTGGGAGTTGGGGCTGTTTTGATGCAAGAATCAC
GACCTATTGCTTATTTTAGCCATGCCCTTCCTCCGTTACACAGAGCAAAAGCTGTTTATGAGAGGGAATTGATGGCCATCGTGATGGCTATACAGAAGTGGAGGCCGTAT
CTCTTGGGCCGTCGTTTTACCGTTAGAACAGATCAGAGCAGTTTGAAATTCATCCTCGACCAACGATTAATATCCGGGGAGTATCAGAAGTTGATGACCAAGTTGTTGGG
CTATGATTTTCAGGTAGTGTACAAACCAGGCGTGGAAAATACAGTGGCTGATGCGTTGTCGCGTTTACCTAGTGCACTGGAATTCGCGGCAGTCAGTCTGGTAGGAGGCC
TTAATACGAGCATTGTGATTGAACAACAGCGCCTTGACCCAGCTTTGGCTGCCATTCGGAGTAAAATCACGGAGGGGGAGGAAGTCCCAGTGGGCTATTCCTTACGAGGG
AATGTGTTGTATTACCGTGGGAAACTAGTCCTACCGGAGGATTCACCTACGATCCCTTTATTGTTGGAGGCTTTCCATTCTTCTCCAGTGGGCGGTCATGGGGGTACTTT
AAAAACTTTCCAACGTTTAGCTAAGGAGGTATATTGGAAAGGAATGAAATCCCGAGTGAAGCTTTTTGTGAGCGAATGTACCGTTTGTCAGCAAGCTAAGTATTTGGCTC
TTGCTCCCGCGGGACTTCTTCAACCATTACCAATTCCGGAGCGTATTTGGGAGGACATTTCCATGGATTTCGTGGTGGGTTTACCCCGCGCGGAGACTTTTGATTCAGTT
TTCGTTGTGGTGGATCGGCTATCGAAGTATGCCCATTTTATACCGATCAGACATCCATTTACAGCAACAGGTATGGCGCAAACCTTTATCAAGGAAGTGGTGCGGTTGCA
CGGGATTCCACGTAGCATTGTTTCTGATCGAGACCCCGTCTTCACGAGTATGTTTTGGGAGGAACTTTTCCGCTGTCAAGGTACGCAGCTGAAGCGGAGCACGGCTTACC
ACCCTCAGACGGATGGACAAACGGAAGTGGTAAATCGAGGTATGGAGACGTATTTACGCTGTTTCGCGATGCACTGTCCATTGAAGTGGGTCCGTTGGTTGTGTTGGGCT
GAATATAGCTATAATACATCATTTCATTCGGGCCTAAAAGCTTCCCCATTTGAAGTGGTTTATGGAAGGTCTCCCCCTTCTGTTCTGCCTTACCATTCGGGGGAGTCCCC
CGTTTCAGAGGTGGACCATCAGCTCAAGGAACGCGATCGCATGTTGGCAATGATTAGGGAACGGTTATTGGATGCACAACAATATATGGCCCGCATGGTAAACGAAAAAC
GAAGAGACGTGGAATTTGACGTAGGGGATTGGGCGTATTTGAAATTAAGACCTTATCGGCAAGGAACTTTATCTAAGACATTGAATGCCAAATTAGCCCCTCGCTATGTC
GGTCCTTTCCAGGTGGAAGCAAGGATAGGCGCTGTGGCGTATCGCCTGATATTGCCCGAGCATGTTCGGGTGCATCCTGTTTTTCACGTGTCTCAGCTACGGAAGGCCTT
GGGCAGTTCATTCCCTGTAGTTCCTTTTCCTCAGAATGTGTCTTCGGATATGTTGTTCAAAGTGGTGCCGGTGGAGCTATTAGGTTTGCGTCAGTCAGTGGAAGATTCTA
CTAAGTTGGAGGTATTGATTCGCTGGGCTGAGTCGGAACCCTCGGAAGCGACATGGGAGGATGCAGTTCAACTCGCTACCCAATTTCCTGATTTTCACCTTGAGGACAAG
GTGGCGCTTTGGGGGCCGGGTAGTGATAGACCACCAATTAAGTATGTGTATAGGAGGAAAAAGCTTATAACCAGTGTTTTCGAGGCGCACTCTGGCGCGTGCCTCAGGCA
AGGTGCCTCACTGAGTGCCCAGGCAAGGCGCCTCGCCCGATTGAAGGCACTCGCCTCTCGCCAAGGGTTTGAGTTCTTGGATTTGGGCTTTAATCAAAACTCTAATTCTG
GACATCAATCCAAGCTTCGTCCTCCTCCAAGAAACAAACTGAATAGGGTTGACAGAGCCCTAATTAAATCTCAATTGAGTTCCTCCCGCATTCCTTGGTGTTTCCTCGAA
GCTACCAGATCTACTGGGGATTATCATAATGAGGAGTGTCCCTGCCTTCCTTGTAAAGATCAGAGACTTGCTCAAACAAAAATCAAAGAAAACCTCCTGGAAATGGATGC
TATGGAAAAAGCATCTCTTGGAGACAAAAATGACAGAATTCTTCAAAAACTCGTGGAACACCATCAAATGCAATCTATTGGTCCTAAAATTGTGGATGATTGGGAAGTGA
GTAGAAAGAAAGGTATAGCTCTAAAGCTGGATCTGGAAAAGGCCGTGATAAGATTGATCGAGTCTTCTAAGATGCCATTCTACAAGTCAAAGGGTTTGGTGGTGTTTGGA
GGAAAGAGATCTTTGGATGCACATCTAGCACAGGCTTTTCAATCATTATAA
Protein sequenceShow/hide protein sequence
MATRMETRVENLEGRMLELSKEMAGMRQEISGIRGSISTEVEAAVSKSVGEAMKEGMQAILARIEGLKSTDQAVDNRKTKGKNVIGSEEVAVEVGEGSNTVLQSHLTKSL
APVFDLRLRKLEIPVFGGASPLEWLHQVERYFTVNRLTEAEKLEAAVLCLEGVALKWHYYENKLGSFSNWEDFRRLLLRRFLAPKEGTLHERFFALKQESTVAEYREKFE
EFAAAVDNLDGDSLEGKFVDGLVEDIKVEMRICMPVGLRAKMEMAQKIEDKLLHLENKQTGVTLSPKVVGQSVSNYKPISYGAGASARTISVNPSKLGERVRKETPGKSF
EASSSSNPYKRLTESEYQNRRDRGLCFKCEEKFSPGHRCKKKELQVLVVHDGESDEAEGNDGVAMQTVEMADNKGEDGPKEHEAELALNSMVGLGSPKTLKVKGLIRSRE
VVVLIDSGATHNFIATTLVKELQLPTGQTEGYGVVLGTEGSSIKGTEICREVVLTLGPITIMQDFLPLPLGSADVILGVQWLSTLGDVTTNHQNLLMKFMLGKSAVVLQG
DPSLVRSQVTLKTLIKQLKLEKQGFLVELNALQGGRMEISGSEIEGWEQIPDCIRSVLTRHFSVFQPLPGLPPTRMDDHRIELLPGTAPVSVRPYRYPQCQKDEIEKLVE
EMLRAGVIQPSNSAFSSPVLLVKKKDGSWRFCVDYRALNKVTVPDKFPIPMVDELLDELHGAIVFSKIDLKAGYHQIRLKAEDIPKSAFRTHEGHYEFLVMPFGLTNAPS
TFQALMNKVLRPFLRKFALVFFDDILIYSSSLEEHAGHLAKIMQVLHENQLVANRKKCQFALKQIEYLGHIISVDGVSADPSKIAAMEQWPIPRNIKELRGFLGLTGYYR
KFVQDYGLIAHPLTQQLKKDSFAWNDDATEAFFKLKSAMVTVPVLRLPDFSKEFVVEADASGVGVGAVLMQESRPIAYFSHALPPLHRAKAVYERELMAIVMAIQKWRPY
LLGRRFTVRTDQSSLKFILDQRLISGEYQKLMTKLLGYDFQVVYKPGVENTVADALSRLPSALEFAAVSLVGGLNTSIVIEQQRLDPALAAIRSKITEGEEVPVGYSLRG
NVLYYRGKLVLPEDSPTIPLLLEAFHSSPVGGHGGTLKTFQRLAKEVYWKGMKSRVKLFVSECTVCQQAKYLALAPAGLLQPLPIPERIWEDISMDFVVGLPRAETFDSV
FVVVDRLSKYAHFIPIRHPFTATGMAQTFIKEVVRLHGIPRSIVSDRDPVFTSMFWEELFRCQGTQLKRSTAYHPQTDGQTEVVNRGMETYLRCFAMHCPLKWVRWLCWA
EYSYNTSFHSGLKASPFEVVYGRSPPSVLPYHSGESPVSEVDHQLKERDRMLAMIRERLLDAQQYMARMVNEKRRDVEFDVGDWAYLKLRPYRQGTLSKTLNAKLAPRYV
GPFQVEARIGAVAYRLILPEHVRVHPVFHVSQLRKALGSSFPVVPFPQNVSSDMLFKVVPVELLGLRQSVEDSTKLEVLIRWAESEPSEATWEDAVQLATQFPDFHLEDK
VALWGPGSDRPPIKYVYRRKKLITSVFEAHSGACLRQGASLSAQARRLARLKALASRQGFEFLDLGFNQNSNSGHQSKLRPPPRNKLNRVDRALIKSQLSSSRIPWCFLE
ATRSTGDYHNEECPCLPCKDQRLAQTKIKENLLEMDAMEKASLGDKNDRILQKLVEHHQMQSIGPKIVDDWEVSRKKGIALKLDLEKAVIRLIESSKMPFYKSKGLVVFG
GKRSLDAHLAQAFQSL