; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI07G09010 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI07G09010
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionTy3/gypsy retrotransposon protein
Genome locationChr7:6849384..6854055
RNA-Seq ExpressionCSPI07G09010
SyntenyCSPI07G09010
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR000953 - Chromo/chromo shadow domain
IPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR016197 - Chromo-like domain superfamily
IPR021109 - Aspartic peptidase domain superfamily
IPR036397 - Ribonuclease H superfamily
IPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
IPR041588 - Integrase zinc-binding domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0062868.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]0.0e+0058.63Show/hide
Query:  MVQTRIEEKIETIDQEIHGIKKEIGKLLAIEKTLNNISKNM-------ERQNQLILRIMESAAQERTTMNERISELSIQTFSVKIIDEGEGSSRRESETN
        MVQTR EE++E+ +QE+ GIKKE+ K+  IE TL  +++NM       E+Q Q IL  ME  A+ER+   ER++E   Q          + SS R+ E  
Subjt:  MVQTRIEEKIETIDQEIHGIKKEIGKLLAIEKTLNNISKNM-------ERQNQLILRIMESAAQERTTMNERISELSIQTFSVKIIDEGEGSSRRESETN

Query:  NEEKKPNEEGNNDRNKFKKVEMPIFNGDDLDSWLFCAERYFLIHKLTESEKITVSTISFEGPTLNWYRSQEEREK-------------------------
        N +K   +E +NDR+KFKKVEMP+F G+D +SWLF AERYF IHKLTESEK+ VSTI F+GP LNWYR+QEEREK                         
Subjt:  NEEKKPNEEGNNDRNKFKKVEMPIFNGDDLDSWLFCAERYFLIHKLTESEKITVSTISFEGPTLNWYRSQEEREK-------------------------

Query:  RFLRIQQTTTMDEYQNLFDK---------------------WRWISVS------QKDARMIRIAQKIENREDIRREANLPRYSGGRTTNPQNS-TKTNTN
        RFLRIQQ TT++EY+NLFDK                     + WI         +  A M+R AQ +E+RE +R  ANL  Y GG+++ P ++ TK   +
Subjt:  RFLRIQQTTTMDEYQNLFDK---------------------WRWISVS------QKDARMIRIAQKIENREDIRREANLPRYSGGRTTNPQNS-TKTNTN

Query:  VNTGENKGGMNWPMRTITLRGTSKEEVPKEGPTKRLSDAEFQARKEKGLCFRCNEKYSHDHKCKAKELRELRMLVVKEDNEEYEIIEDGNNALKELNTIE
            ENK    +P+RTITL+  +  E  KEG +KRL DAEFQ R+EKGLCF+CNEKYS DHKCK +E RELRM VVK++NEE EI+E+      EL T+E
Subjt:  VNTGENKGGMNWPMRTITLRGTSKEEVPKEGPTKRLSDAEFQARKEKGLCFRCNEKYSHDHKCKAKELRELRMLVVKEDNEEYEIIEDGNNALKELNTIE

Query:  IMEEGQAIVELSMNSVVGLDNLGTMEVKGTIRGREIVVLIDCGATHNFVSERLVKELQLETKETSNYGVILGFGTAIKGKGICGAVELMLGDWKVIDEFL
        +  +  A VELS+NSVVGL++ GTM+V+GT++G+E+V+LIDCGATHNFVSE+LV  LQL  KET++YGVILG GTAI+GKGIC ++E+ + DW V ++FL
Subjt:  IMEEGQAIVELSMNSVVGLDNLGTMEVKGTIRGREIVVLIDCGATHNFVSERLVKELQLETKETSNYGVILGFGTAIKGKGICGAVELMLGDWKVIDEFL

Query:  PLELGGVDAILNMQWLYSLGIT-----------------------EALERCDLS--------EEED---LIE---------EILT--------EEESVAV
        PLELGGVD IL MQWLYSLG+T                        +L +  +S        EE D   LIE         E +T         EE +  
Subjt:  PLELGGVDAILNMQWLYSLGIT-----------------------EALERCDLS--------EEED---LIE---------EILT--------EEESVAV

Query:  VLKRFEDVFEWPETLPPWRIIEHHIHLKEGVNPVNVRPYRYAYQQKNEME---------------------------------------KALNNVTVPDK
        +L +F+D+FEWPE LPP R IEH IHLKEG NPVNVRPYRYAY QK EME                                       +ALNNVTVPDK
Subjt:  VLKRFEDVFEWPETLPPWRIIEHHIHLKEGVNPVNVRPYRYAYQQKNEME---------------------------------------KALNNVTVPDK

Query:  FPIPVIEELFDELDGAVIFTKIDLKAGYHQIRMCADDIEKTAFRTHEGHYEFMVMPFGLTNAPSTFQSLMNAIFKPYLIKFVLVFFDDILIYSKNLKAHL
        FPIPV+EELFDEL GA +FTKIDLKAGYHQIRM   DIEKTAFRTHEGHYEF+VMPFGLTNAP+TFQSLMN+IF+PYL +FVLVFFDDILIYS+NL+ HL
Subjt:  FPIPVIEELFDELDGAVIFTKIDLKAGYHQIRMCADDIEKTAFRTHEGHYEFMVMPFGLTNAPSTFQSLMNAIFKPYLIKFVLVFFDDILIYSKNLKAHL

Query:  DHIKAVLEVLRKNELYANKKKCSFAKARVEYLGHIISGGGVEVDPEKIRAIKEWPIPTNVREVRGFLGLTGYYRKFVKHYGTIAAPLTQLLKKGGFRWTE
         HI+ V  VLRK+EL+AN+KKCSF  A+VEYLGH+IS  GVEVDPEKI+AI +WP PT+VRE RGFLGLTGYYRKFV HYGT+AAPLTQLLKKGGF W  
Subjt:  DHIKAVLEVLRKNELYANKKKCSFAKARVEYLGHIISGGGVEVDPEKIRAIKEWPIPTNVREVRGFLGLTGYYRKFVKHYGTIAAPLTQLLKKGGFRWTE

Query:  ETQEAFVKLQMTMMTLPMLAMPDFSIPFEIETDASGYGLGAVLIQNQRPIAYYSHTLAVRDRVKPVYERELMAVVMTVQRWRPYLLGKRFKVKTDPRSLK
        E ++AF KL+  M+ LP+LA+P F  PFEIETDASGYG+GAVLIQN+RPIA+YSHTLA+RDR +PVYERELMAVV+ VQRWRPYLLG RF V+TD +SLK
Subjt:  ETQEAFVKLQMTMMTLPMLAMPDFSIPFEIETDASGYGLGAVLIQNQRPIAYYSHTLAVRDRVKPVYERELMAVVMTVQRWRPYLLGKRFKVKTDPRSLK

Query:  FFLEQRVIQPQYQKWISKLLGYSFEVIYKPGLENKAADALSRIPPTVHLNQLTAPNIIDVAVIKEEVNQDERLKKIRGELEEKGEQQPSKYSMKQGMLMY
        F LEQRV+QPQYQ+W++KLLGY+F+V YKPG+ENKAADALSRI PTV +  +T P  +D+ +IKEEV +D +L KI  E+      Q SK+ +  GML Y
Subjt:  FFLEQRVIQPQYQKWISKLLGYSFEVIYKPGLENKAADALSRIPPTVHLNQLTAPNIIDVAVIKEEVNQDERLKKIRGELEEKGEQQPSKYSMKQGMLMY

Query:  KNRMVISKTSKLIAMILHTFHNSVFGGHSGFLRTYKRLTGELYWEGMKQYVKKYCEECMICQRNKTLALSPAGLLTPLEVPNRVWEEISMDFVDGLPKEN
        K+R+VIS+TSKLI  ILH++H+S  GGHSGFLRTYKR++GELYW+GMK  VKKYC EC+ICQ+NKTL LSPAGLL PL +P  +W +ISMDFV+GLPK  
Subjt:  KNRMVISKTSKLIAMILHTFHNSVFGGHSGFLRTYKRLTGELYWEGMKQYVKKYCEECMICQRNKTLALSPAGLLTPLEVPNRVWEEISMDFVDGLPKEN

Query:  GFEVIFVVVDRFRLSVNMVTFLPMKHPYNAKSLAELFVKEVVRLHDFPKSIVSDIDKVFLSSFWKELFRLAGTRLNHSTAYHPQSDGQTEVVNRGVAIYL
        GFEVIFVVVDR         F+P+KHPY+AK++AELFVKEVVRLH FP SIVSD D+VFLS+FWKE+FRLAGT+LN S+AYHPQSDGQTEVVNRGV  YL
Subjt:  GFEVIFVVVDRFRLSVNMVTFLPMKHPYNAKSLAELFVKEVVRLHDFPKSIVSDIDKVFLSSFWKELFRLAGTRLNHSTAYHPQSDGQTEVVNRGVAIYL

Query:  LCFCGEKLKEWVKWIPWAKYWYNTTYQRSLGITPFQAVYGRLPPPLIYYGVRDTSNSTLDEQLKERDVALDALKEHLQVAQDKMKKYADLKRRDVQ----
         CFC +K KEWVKWI WA+YWYNTT+Q++LG+TPFQ VYGR PPPL+ YG + T N TLDEQLKERD  + +L+E+L++AQ++MKKYAD +RRD++    
Subjt:  LCFCGEKLKEWVKWIPWAKYWYNTTYQRSLGITPFQAVYGRLPPPLIYYGVRDTSNSTLDEQLKERDVALDALKEHLQVAQDKMKKYADLKRRDVQ----

Query:  --------PYRQVSLRRKRNEKLSAKYFGPYKVIERVGPVAYKLELPAQAAIHPVFH---------ENCD---DIPYLSENYEWRAMPEEVYGYLKNKAG
                PYRQ+SLRRKRNEKLSAKYFGPYK++ER+GPVAYKLELP  A IHPVFH         E+ D    I  L EN+ W+  P E   Y +NK G
Subjt:  --------PYRQVSLRRKRNEKLSAKYFGPYKVIERVGPVAYKLELPAQAAIHPVFH---------ENCD---DIPYLSENYEWRAMPEEVYGYLKNKAG

Query:  SWDVLVSWKGLPRHEATWELYEDLQQRFPDFHLEDKVNLKRESNDRPPILYQYTRKGKK
         W+V+V W GL  HE TWE Y D+  ++PDFHLEDKV+L+  SN RPPIL+QY+RK K+
Subjt:  SWDVLVSWKGLPRHEATWELYEDLQQRFPDFHLEDKVNLKRESNDRPPILYQYTRKGKK

KAA0068193.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]0.0e+0058.63Show/hide
Query:  MVQTRIEEKIETIDQEIHGIKKEIGKLLAIEKTLNNISKNM-------ERQNQLILRIMESAAQERTTMNERISELSIQTFSVKIIDEGEGSSRRESETN
        MVQTRIEE++E+ +QE+ GIKKE+ K+  IE TL  +++NM       E+Q Q IL  ME  A+ER+   ER++E   Q          + SS R+ E  
Subjt:  MVQTRIEEKIETIDQEIHGIKKEIGKLLAIEKTLNNISKNM-------ERQNQLILRIMESAAQERTTMNERISELSIQTFSVKIIDEGEGSSRRESETN

Query:  NEEKKPNEEGNNDRNKFKKVEMPIFNGDDLDSWLFCAERYFLIHKLTESEKITVSTISFEGPTLNWYRSQEEREK-------------------------
        N +K   +E +NDR+KFKKVEMP+F G+D +SWLF AERYF IHKLTESEK+ VSTI F+GP LNWYR+QEEREK                         
Subjt:  NEEKKPNEEGNNDRNKFKKVEMPIFNGDDLDSWLFCAERYFLIHKLTESEKITVSTISFEGPTLNWYRSQEEREK-------------------------

Query:  RFLRIQQTTTMDEYQNLFDK---------------------WRWISVS------QKDARMIRIAQKIENREDIRREANLPRYSGGRTTNPQNS-TKTNTN
        RFLRIQQ TT++EY+NLFDK                     + WI         +  A M+R AQ +E+RE +R  ANL  Y GG+++ P ++ TK   +
Subjt:  RFLRIQQTTTMDEYQNLFDK---------------------WRWISVS------QKDARMIRIAQKIENREDIRREANLPRYSGGRTTNPQNS-TKTNTN

Query:  VNTGENKGGMNWPMRTITLRGTSKEEVPKEGPTKRLSDAEFQARKEKGLCFRCNEKYSHDHKCKAKELRELRMLVVKEDNEEYEIIEDGNNALKELNTIE
            ENK    +P+RTITL+  +  E  KEG +KRL DAEFQ R+EKGLCF+CNEKYS DHKCK +E RELRM VVK++NEE EI+E+      EL T+E
Subjt:  VNTGENKGGMNWPMRTITLRGTSKEEVPKEGPTKRLSDAEFQARKEKGLCFRCNEKYSHDHKCKAKELRELRMLVVKEDNEEYEIIEDGNNALKELNTIE

Query:  IMEEGQAIVELSMNSVVGLDNLGTMEVKGTIRGREIVVLIDCGATHNFVSERLVKELQLETKETSNYGVILGFGTAIKGKGICGAVELMLGDWKVIDEFL
        +  +  A VELS+NSVVGL++ GTM+V+GT++G+E+V+LIDCGATHNFVSE+LV  LQL  KET++YGVILG GTAI+GKGIC ++E+ + DW V ++FL
Subjt:  IMEEGQAIVELSMNSVVGLDNLGTMEVKGTIRGREIVVLIDCGATHNFVSERLVKELQLETKETSNYGVILGFGTAIKGKGICGAVELMLGDWKVIDEFL

Query:  PLELGGVDAILNMQWLYSLGIT-----------------------EALERCDLS--------EEED---LIE---------EILT--------EEESVAV
        PLELGGVD IL MQWLYSLG+T                        +L +  +S        EE D   LIE         E +T         EE +  
Subjt:  PLELGGVDAILNMQWLYSLGIT-----------------------EALERCDLS--------EEED---LIE---------EILT--------EEESVAV

Query:  VLKRFEDVFEWPETLPPWRIIEHHIHLKEGVNPVNVRPYRYAYQQKNEME---------------------------------------KALNNVTVPDK
        +L +F+D+FEWPE LPP R IEH IHLKEG NPVNVRPYRYAY QK EME                                       +ALNNVTVPDK
Subjt:  VLKRFEDVFEWPETLPPWRIIEHHIHLKEGVNPVNVRPYRYAYQQKNEME---------------------------------------KALNNVTVPDK

Query:  FPIPVIEELFDELDGAVIFTKIDLKAGYHQIRMCADDIEKTAFRTHEGHYEFMVMPFGLTNAPSTFQSLMNAIFKPYLIKFVLVFFDDILIYSKNLKAHL
        FPIPV+EELFDEL GA +FTKIDLKAGYHQIRM   DIEKTAFRTHEGHYEF+VMPFGLTNAP+TFQSLMN+IF+PYL +FVLVFFDDILIYS+NL+ HL
Subjt:  FPIPVIEELFDELDGAVIFTKIDLKAGYHQIRMCADDIEKTAFRTHEGHYEFMVMPFGLTNAPSTFQSLMNAIFKPYLIKFVLVFFDDILIYSKNLKAHL

Query:  DHIKAVLEVLRKNELYANKKKCSFAKARVEYLGHIISGGGVEVDPEKIRAIKEWPIPTNVREVRGFLGLTGYYRKFVKHYGTIAAPLTQLLKKGGFRWTE
         HI+ V  VLRK+EL+AN+KKCSF  A+VEYLGH+IS  GVEVDPEKI+AI +WP PT+VRE RGFLGLTGYYRKFV HYGT+AAPLTQLLKKGGF W  
Subjt:  DHIKAVLEVLRKNELYANKKKCSFAKARVEYLGHIISGGGVEVDPEKIRAIKEWPIPTNVREVRGFLGLTGYYRKFVKHYGTIAAPLTQLLKKGGFRWTE

Query:  ETQEAFVKLQMTMMTLPMLAMPDFSIPFEIETDASGYGLGAVLIQNQRPIAYYSHTLAVRDRVKPVYERELMAVVMTVQRWRPYLLGKRFKVKTDPRSLK
        E ++AF KL+  M+ LP+LA+P F  PFEIETDASGYG+GAVLIQN+RPIA+YSHTLA+RDR +PVYERELMAVV+ VQRWRPYLLG RF V+TD +SLK
Subjt:  ETQEAFVKLQMTMMTLPMLAMPDFSIPFEIETDASGYGLGAVLIQNQRPIAYYSHTLAVRDRVKPVYERELMAVVMTVQRWRPYLLGKRFKVKTDPRSLK

Query:  FFLEQRVIQPQYQKWISKLLGYSFEVIYKPGLENKAADALSRIPPTVHLNQLTAPNIIDVAVIKEEVNQDERLKKIRGELEEKGEQQPSKYSMKQGMLMY
        F LEQRV+QPQYQ+W++KLLGY+F+V YKPG+ENKAADALSRI PTV +  +T P  +D+ +IKEEV +D +L KI  E+      Q SK+ +  GML Y
Subjt:  FFLEQRVIQPQYQKWISKLLGYSFEVIYKPGLENKAADALSRIPPTVHLNQLTAPNIIDVAVIKEEVNQDERLKKIRGELEEKGEQQPSKYSMKQGMLMY

Query:  KNRMVISKTSKLIAMILHTFHNSVFGGHSGFLRTYKRLTGELYWEGMKQYVKKYCEECMICQRNKTLALSPAGLLTPLEVPNRVWEEISMDFVDGLPKEN
        K+R+VIS+TSKLI  ILH++H+S  GGHSGFLRTYKR++GELYW+GMK  VKKYC EC+ICQ+NKTL LSPAGLL PL +P  +W +ISMDFV+GLPK  
Subjt:  KNRMVISKTSKLIAMILHTFHNSVFGGHSGFLRTYKRLTGELYWEGMKQYVKKYCEECMICQRNKTLALSPAGLLTPLEVPNRVWEEISMDFVDGLPKEN

Query:  GFEVIFVVVDRFRLSVNMVTFLPMKHPYNAKSLAELFVKEVVRLHDFPKSIVSDIDKVFLSSFWKELFRLAGTRLNHSTAYHPQSDGQTEVVNRGVAIYL
        GFEVIFVVVDR         F+P+KHPY+AK++AELFVKEVVRLH FP SIVSD D+VFLS+FWKE+FRLAGT+LN S+AYHPQSDGQTEVVNRGV +YL
Subjt:  GFEVIFVVVDRFRLSVNMVTFLPMKHPYNAKSLAELFVKEVVRLHDFPKSIVSDIDKVFLSSFWKELFRLAGTRLNHSTAYHPQSDGQTEVVNRGVAIYL

Query:  LCFCGEKLKEWVKWIPWAKYWYNTTYQRSLGITPFQAVYGRLPPPLIYYGVRDTSNSTLDEQLKERDVALDALKEHLQVAQDKMKKYADLKRRDVQ----
         CFC +K KEWVKWI WA+YWYNTT+Q++LG+TPFQ VYGR PPPL+ YG + T N TLDEQLKERD  + +L+E+L++AQ++MKKYAD +RRD++    
Subjt:  LCFCGEKLKEWVKWIPWAKYWYNTTYQRSLGITPFQAVYGRLPPPLIYYGVRDTSNSTLDEQLKERDVALDALKEHLQVAQDKMKKYADLKRRDVQ----

Query:  --------PYRQVSLRRKRNEKLSAKYFGPYKVIERVGPVAYKLELPAQAAIHPVFH---------ENCD---DIPYLSENYEWRAMPEEVYGYLKNKAG
                PYRQ+SLRRKRNEKLSAKYFGPYK++ER+GPVAYKLELP  A IHPVFH         E+ D    I  L EN+ W+  P E   Y +NK G
Subjt:  --------PYRQVSLRRKRNEKLSAKYFGPYKVIERVGPVAYKLELPAQAAIHPVFH---------ENCD---DIPYLSENYEWRAMPEEVYGYLKNKAG

Query:  SWDVLVSWKGLPRHEATWELYEDLQQRFPDFHLEDKVNLKRESNDRPPILYQYTRKGKK
         W+V++ W GL  HE TWE Y D+  ++PDFHLEDKV+L+  SN RPPIL+QY+RK K+
Subjt:  SWDVLVSWKGLPRHEATWELYEDLQQRFPDFHLEDKVNLKRESNDRPPILYQYTRKGKK

TYJ95763.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]0.0e+0058.3Show/hide
Query:  IETIDQEIHGIKKEIGKLLAIEKTLNNISKNM-------ERQNQLILRIMESAAQERTTMNERISELSIQTFSVKIIDEGEGSSRRE-SETNNEEKKPNE
        +E  +QEI GIKKE+ K+ AIE TL  I+KNM       E+Q Q IL  ME+ A+ER    ERI+E  IQ         G+ SS  +  ET+ E K  ++
Subjt:  IETIDQEIHGIKKEIGKLLAIEKTLNNISKNM-------ERQNQLILRIMESAAQERTTMNERISELSIQTFSVKIIDEGEGSSRRE-SETNNEEKKPNE

Query:  EGNNDRNKFKKVEMPIFNGDDLDSWLFCAERYFLIHKLTESEKITVSTISFEGPTLNWYRSQEEREK-------------------------RFLRIQQT
        E  NDR+KFKKVEMP+F G+D +SWLF AERYF IHKLTESEK+ VSTI F+GP LNWYRSQEEREK                         RFLRIQQ 
Subjt:  EGNNDRNKFKKVEMPIFNGDDLDSWLFCAERYFLIHKLTESEKITVSTISFEGPTLNWYRSQEEREK-------------------------RFLRIQQT

Query:  TTMDEYQNLFDK---------------------WRWISVS------QKDARMIRIAQKIENREDIRREANLPRYSGGR-TTNPQNSTKTNTNVNTGENKG
        TT++EY+N FDK                     + WI         +  A  +  AQ +E+RE +R  ANL  Y GG+ +       K +      E+K 
Subjt:  TTMDEYQNLFDK---------------------WRWISVS------QKDARMIRIAQKIENREDIRREANLPRYSGGR-TTNPQNSTKTNTNVNTGENKG

Query:  GMNWPMRTITLRGTSKEEVPKEGPTKRLSDAEFQARKEKGLCFRCNEKYSHDHKCKAKELRELRMLVVKEDNEEYEIIEDGNNALKELNTIEIMEEGQAI
          ++P+RTITL+  +  E+ KEG +KRL DAEFQ RKEKGLCF+CNEKYS DHKCK KE RELRM VVK DNEE EI+E+      E+   E+       
Subjt:  GMNWPMRTITLRGTSKEEVPKEGPTKRLSDAEFQARKEKGLCFRCNEKYSHDHKCKAKELRELRMLVVKEDNEEYEIIEDGNNALKELNTIEIMEEGQAI

Query:  VELSMNSVVGLDNLGTMEVKGTIRGREIVVLIDCGATHNFVSERLVKELQLETKETSNYGVILGFGTAIKGKGICGAVELMLGDWKVIDEFLPLELGGVD
        VELS+NSVVGL++ GTM+VKG+++G+E+V+LIDCGATHNFVSE++V  LQL  KET++YGVILG GTAI+GKGIC +VE+ + +W V ++FLPLELGGVD
Subjt:  VELSMNSVVGLDNLGTMEVKGTIRGREIVVLIDCGATHNFVSERLVKELQLETKETSNYGVILGFGTAIKGKGICGAVELMLGDWKVIDEFLPLELGGVD

Query:  AILNMQWLYSLGIT-----------------------EALERCDLS--------EEED---LIE-------EILTE--------EESVAVVLKRFEDVFE
         IL MQWLYSLG+T                        +L +  +S        EE D   LIE       E+ T         ++ +  +L +F DVFE
Subjt:  AILNMQWLYSLGIT-----------------------EALERCDLS--------EEED---LIE-------EILTE--------EESVAVVLKRFEDVFE

Query:  WPETLPPWRIIEHHIHLKEGVNPVNVRPYRYAYQQKNEMEK---------------------------------------ALNNVTVPDKFPIPVIEELF
        WPE LPP R IEH IHLKEG NPVNVRPYRYAY QK EMEK                                       ALNNVTVPDKFPIPV+EELF
Subjt:  WPETLPPWRIIEHHIHLKEGVNPVNVRPYRYAYQQKNEMEK---------------------------------------ALNNVTVPDKFPIPVIEELF

Query:  DELDGAVIFTKIDLKAGYHQIRMCADDIEKTAFRTHEGHYEFMVMPFGLTNAPSTFQSLMNAIFKPYLIKFVLVFFDDILIYSKNLKAHLDHIKAVLEVL
        DEL GA +FTKIDLKAGYHQIRM   DIEKTAFRTHEGHYEF+VMPFGLTNAP+TFQSLMN+IF+PYL KFVLVFFDDILIYS+N + HL H++ V  VL
Subjt:  DELDGAVIFTKIDLKAGYHQIRMCADDIEKTAFRTHEGHYEFMVMPFGLTNAPSTFQSLMNAIFKPYLIKFVLVFFDDILIYSKNLKAHLDHIKAVLEVL

Query:  RKNELYANKKKCSFAKARVEYLGHIISGGGVEVDPEKIRAIKEWPIPTNVREVRGFLGLTGYYRKFVKHYGTIAAPLTQLLKKGGFRWTEETQEAFVKLQ
        RK+EL+AN+KKCSF  A+VEYLGH+IS  GVEVDPEKI+AI  WP PTNVRE RGFLGLTGYYRKFV HYGT+AAPLTQLLKKGGF+W  E ++AF KL+
Subjt:  RKNELYANKKKCSFAKARVEYLGHIISGGGVEVDPEKIRAIKEWPIPTNVREVRGFLGLTGYYRKFVKHYGTIAAPLTQLLKKGGFRWTEETQEAFVKLQ

Query:  MTMMTLPMLAMPDFSIPFEIETDASGYGLGAVLIQNQRPIAYYSHTLAVRDRVKPVYERELMAVVMTVQRWRPYLLGKRFKVKTDPRSLKFFLEQRVIQP
          M+ LP+LA+P F  PFEIETDASGYG+GAVLIQN+RPIA+YSHTLA RDR +PVYERELMAVV+ VQRWRPYLLG RF V+TD +SLKF LEQRV+QP
Subjt:  MTMMTLPMLAMPDFSIPFEIETDASGYGLGAVLIQNQRPIAYYSHTLAVRDRVKPVYERELMAVVMTVQRWRPYLLGKRFKVKTDPRSLKFFLEQRVIQP

Query:  QYQKWISKLLGYSFEVIYKPGLENKAADALSRIPPTVHLNQLTAPNIIDVAVIKEEVNQDERLKKIRGELEEKGEQQPSKYSMKQGMLMYKNRMVISKTS
        QYQ+W++KLLGY+F+V YKPG+ENKAADALSR+ PT+  + +T P  +D+ VIKEEV +D RL KI   L    +QQ +K+++  GML YK+R+VIS++S
Subjt:  QYQKWISKLLGYSFEVIYKPGLENKAADALSRIPPTVHLNQLTAPNIIDVAVIKEEVNQDERLKKIRGELEEKGEQQPSKYSMKQGMLMYKNRMVISKTS

Query:  KLIAMILHTFHNSVFGGHSGFLRTYKRLTGELYWEGMKQYVKKYCEECMICQRNKTLALSPAGLLTPLEVPNRVWEEISMDFVDGLPKENGFEVIFVVVD
        KLI  +LH++H+S  GGHSGFLRTYKR+ GELYW+GMK  +KKYC EC+ICQRNKTL LSPAGLL PL +P  +W +ISMDFV+GLPK  GFEVIFVVVD
Subjt:  KLIAMILHTFHNSVFGGHSGFLRTYKRLTGELYWEGMKQYVKKYCEECMICQRNKTLALSPAGLLTPLEVPNRVWEEISMDFVDGLPKENGFEVIFVVVD

Query:  RFRLSVNMVTFLPMKHPYNAKSLAELFVKEVVRLHDFPKSIVSDIDKVFLSSFWKELFRLAGTRLNHSTAYHPQSDGQTEVVNRGVAIYLLCFCGEKLKE
        R         FLP+KHPY+AK++A+LFVKEVVRLH FP SIVSD D+VFLS+FWKE+FRLAGT+LN S+AYHPQSDGQTEVVNRGV +YL C C +K KE
Subjt:  RFRLSVNMVTFLPMKHPYNAKSLAELFVKEVVRLHDFPKSIVSDIDKVFLSSFWKELFRLAGTRLNHSTAYHPQSDGQTEVVNRGVAIYLLCFCGEKLKE

Query:  WVKWIPWAKYWYNTTYQRSLGITPFQAVYGRLPPPLIYYGVRDTSNSTLDEQLKERDVALDALKEHLQVAQDKMKKYADLKRRDVQ------------PY
        W+KWI WA+YWYNTT+QR+LG+TPFQ VYGR PPPL+ YG + TSN+TLDEQL+ERD  + +L+EHL++AQD+MKK AD KRRDV+            PY
Subjt:  WVKWIPWAKYWYNTTYQRSLGITPFQAVYGRLPPPLIYYGVRDTSNSTLDEQLKERDVALDALKEHLQVAQDKMKKYADLKRRDVQ------------PY

Query:  RQVSLRRKRNEKLSAKYFGPYKVIERVGPVAYKLELPAQAAIHPVFHEN------------CDDIPYLSENYEWRAMPEEVYGYLKNKAGSWDVLVSWKG
        RQ+SLRRKRNEKLSAKYFGPYK++ER+GPVAYKLELP    IHPVFH +               +  L EN+ W   P E   Y +NKA  W+V++ W+G
Subjt:  RQVSLRRKRNEKLSAKYFGPYKVIERVGPVAYKLELPAQAAIHPVFHEN------------CDDIPYLSENYEWRAMPEEVYGYLKNKAGSWDVLVSWKG

Query:  LPRHEATWELYEDLQQRFPDFHLEDKVNLKRESNDRPPILYQYTRKGKK
        L  HEATWE Y+D+  ++P+FHLEDKV+L+  SN RPPIL+QY+RK K+
Subjt:  LPRHEATWELYEDLQQRFPDFHLEDKVNLKRESNDRPPILYQYTRKGKK

TYJ96875.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]0.0e+0058.63Show/hide
Query:  MVQTRIEEKIETIDQEIHGIKKEIGKLLAIEKTLNNISKNM-------ERQNQLILRIMESAAQERTTMNERISELSIQTFSVKIIDEGEGSSRRESETN
        MVQTRIEE++E+ +QE+ GIKKE+ K+  IE TL  +++NM       E+Q Q IL  ME  A+ER+   ER++E   Q          + SS R+ E  
Subjt:  MVQTRIEEKIETIDQEIHGIKKEIGKLLAIEKTLNNISKNM-------ERQNQLILRIMESAAQERTTMNERISELSIQTFSVKIIDEGEGSSRRESETN

Query:  NEEKKPNEEGNNDRNKFKKVEMPIFNGDDLDSWLFCAERYFLIHKLTESEKITVSTISFEGPTLNWYRSQEEREK-------------------------
        N +K   +E +NDR+KFKKVEMP+F G+D +SWLF AERYF IHKLTESEK+ VSTI F+GP LNWYR+QEEREK                         
Subjt:  NEEKKPNEEGNNDRNKFKKVEMPIFNGDDLDSWLFCAERYFLIHKLTESEKITVSTISFEGPTLNWYRSQEEREK-------------------------

Query:  RFLRIQQTTTMDEYQNLFDK---------------------WRWISVS------QKDARMIRIAQKIENREDIRREANLPRYSGGRTTNPQNS-TKTNTN
        RFLRIQQ TT++EY+NLFDK                     + WI         +  A M+R AQ +E+RE +R  ANL  Y GG+++ P ++ TK   +
Subjt:  RFLRIQQTTTMDEYQNLFDK---------------------WRWISVS------QKDARMIRIAQKIENREDIRREANLPRYSGGRTTNPQNS-TKTNTN

Query:  VNTGENKGGMNWPMRTITLRGTSKEEVPKEGPTKRLSDAEFQARKEKGLCFRCNEKYSHDHKCKAKELRELRMLVVKEDNEEYEIIEDGNNALKELNTIE
            ENK    +P+RTITL+  +  E  KEG +KRL DAEFQ R+EKGLCF+CNEKYS DHKCK +E RELRM VVK++NEE EI+E+      EL T+E
Subjt:  VNTGENKGGMNWPMRTITLRGTSKEEVPKEGPTKRLSDAEFQARKEKGLCFRCNEKYSHDHKCKAKELRELRMLVVKEDNEEYEIIEDGNNALKELNTIE

Query:  IMEEGQAIVELSMNSVVGLDNLGTMEVKGTIRGREIVVLIDCGATHNFVSERLVKELQLETKETSNYGVILGFGTAIKGKGICGAVELMLGDWKVIDEFL
        +  +  A VELS+NSVVGL++ GTM+V+GT++G+E+V+LIDCGATHNFVSE+LV  LQL  KET++YGVILG GTAI+GKGIC ++E+ + DW V ++FL
Subjt:  IMEEGQAIVELSMNSVVGLDNLGTMEVKGTIRGREIVVLIDCGATHNFVSERLVKELQLETKETSNYGVILGFGTAIKGKGICGAVELMLGDWKVIDEFL

Query:  PLELGGVDAILNMQWLYSLGIT-----------------------EALERCDLS--------EEED---LIE---------EILT--------EEESVAV
        PLELGGVD IL MQWLYSLG+T                        +L +  +S        EE D   LIE         E +T         EE +  
Subjt:  PLELGGVDAILNMQWLYSLGIT-----------------------EALERCDLS--------EEED---LIE---------EILT--------EEESVAV

Query:  VLKRFEDVFEWPETLPPWRIIEHHIHLKEGVNPVNVRPYRYAYQQKNEME---------------------------------------KALNNVTVPDK
        +L +F+D+FEWPE LPP R IEH IHLKEG NPVNVRPYRYAY QK EME                                       +ALNNVTVPDK
Subjt:  VLKRFEDVFEWPETLPPWRIIEHHIHLKEGVNPVNVRPYRYAYQQKNEME---------------------------------------KALNNVTVPDK

Query:  FPIPVIEELFDELDGAVIFTKIDLKAGYHQIRMCADDIEKTAFRTHEGHYEFMVMPFGLTNAPSTFQSLMNAIFKPYLIKFVLVFFDDILIYSKNLKAHL
        FPIPV+EELFDEL GA +FTKIDLKAGYHQIRM   DIEKTAFRTHEGHYEF+VMPFGLTNAP+TFQSLMN+IF+PYL +FVLVFFDDILIYS+NL+ HL
Subjt:  FPIPVIEELFDELDGAVIFTKIDLKAGYHQIRMCADDIEKTAFRTHEGHYEFMVMPFGLTNAPSTFQSLMNAIFKPYLIKFVLVFFDDILIYSKNLKAHL

Query:  DHIKAVLEVLRKNELYANKKKCSFAKARVEYLGHIISGGGVEVDPEKIRAIKEWPIPTNVREVRGFLGLTGYYRKFVKHYGTIAAPLTQLLKKGGFRWTE
         HI+ V  VLRK+EL+AN+KKCSF  A+VEYLGH+IS  GVEVDPEKI+AI +WP PT+VRE RGFLGLTGYYRKFV HYGT+AAPLTQLLKKGGF W  
Subjt:  DHIKAVLEVLRKNELYANKKKCSFAKARVEYLGHIISGGGVEVDPEKIRAIKEWPIPTNVREVRGFLGLTGYYRKFVKHYGTIAAPLTQLLKKGGFRWTE

Query:  ETQEAFVKLQMTMMTLPMLAMPDFSIPFEIETDASGYGLGAVLIQNQRPIAYYSHTLAVRDRVKPVYERELMAVVMTVQRWRPYLLGKRFKVKTDPRSLK
        E ++AF KL+  M+ LP+LA+P F  PFEIETDASGYG+GAVLIQN+RPIA+YSHTLA+RDR +PVYERELMAVV+ VQRWRPYLLG RF V+TD +SLK
Subjt:  ETQEAFVKLQMTMMTLPMLAMPDFSIPFEIETDASGYGLGAVLIQNQRPIAYYSHTLAVRDRVKPVYERELMAVVMTVQRWRPYLLGKRFKVKTDPRSLK

Query:  FFLEQRVIQPQYQKWISKLLGYSFEVIYKPGLENKAADALSRIPPTVHLNQLTAPNIIDVAVIKEEVNQDERLKKIRGELEEKGEQQPSKYSMKQGMLMY
        F LEQRV+QPQYQ+W++KLLGY+F+V YKPG+ENKAADALSRI PTV +  +T P  +D+ +IKEEV +D +L KI  E+      Q SK+ +  GML Y
Subjt:  FFLEQRVIQPQYQKWISKLLGYSFEVIYKPGLENKAADALSRIPPTVHLNQLTAPNIIDVAVIKEEVNQDERLKKIRGELEEKGEQQPSKYSMKQGMLMY

Query:  KNRMVISKTSKLIAMILHTFHNSVFGGHSGFLRTYKRLTGELYWEGMKQYVKKYCEECMICQRNKTLALSPAGLLTPLEVPNRVWEEISMDFVDGLPKEN
        K+R+VIS+TSKLI  ILH++H+S  GGHSGFLRTYKR++GELYW+GMK  VKKYC EC+ICQ+NKTL LSPAGLL PL +P  +W +ISMDFV+GLPK  
Subjt:  KNRMVISKTSKLIAMILHTFHNSVFGGHSGFLRTYKRLTGELYWEGMKQYVKKYCEECMICQRNKTLALSPAGLLTPLEVPNRVWEEISMDFVDGLPKEN

Query:  GFEVIFVVVDRFRLSVNMVTFLPMKHPYNAKSLAELFVKEVVRLHDFPKSIVSDIDKVFLSSFWKELFRLAGTRLNHSTAYHPQSDGQTEVVNRGVAIYL
        GFEVIFVVVDR         F+P+KHPY+AK++AELFVKEVVRLH FP SIVSD D+VFLS+FWKE+FRLAGT+LN S+AYHPQSDGQTEVVNRGV +YL
Subjt:  GFEVIFVVVDRFRLSVNMVTFLPMKHPYNAKSLAELFVKEVVRLHDFPKSIVSDIDKVFLSSFWKELFRLAGTRLNHSTAYHPQSDGQTEVVNRGVAIYL

Query:  LCFCGEKLKEWVKWIPWAKYWYNTTYQRSLGITPFQAVYGRLPPPLIYYGVRDTSNSTLDEQLKERDVALDALKEHLQVAQDKMKKYADLKRRDVQ----
         CFC +K KEWVKWI WA+YWYNTT+Q++LG+TPFQ VYGR PPPL+ YG + T N TLDEQLKERD  + +L+E+L++AQ++MKKYAD +RRD++    
Subjt:  LCFCGEKLKEWVKWIPWAKYWYNTTYQRSLGITPFQAVYGRLPPPLIYYGVRDTSNSTLDEQLKERDVALDALKEHLQVAQDKMKKYADLKRRDVQ----

Query:  --------PYRQVSLRRKRNEKLSAKYFGPYKVIERVGPVAYKLELPAQAAIHPVFH---------ENCD---DIPYLSENYEWRAMPEEVYGYLKNKAG
                PYRQ+SLRRKRNEKLSAKYFGPYK++ER+GPVAYKLELP  A IHPVFH         E+ D    I  L EN+ W+  P E   Y +NK G
Subjt:  --------PYRQVSLRRKRNEKLSAKYFGPYKVIERVGPVAYKLELPAQAAIHPVFH---------ENCD---DIPYLSENYEWRAMPEEVYGYLKNKAG

Query:  SWDVLVSWKGLPRHEATWELYEDLQQRFPDFHLEDKVNLKRESNDRPPILYQYTRKGKK
         W+V++ W GL  HE TWE Y D+  ++PDFHLEDKV+L+  SN RPPIL+QY+RK K+
Subjt:  SWDVLVSWKGLPRHEATWELYEDLQQRFPDFHLEDKVNLKRESNDRPPILYQYTRKGKK

TYK22240.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]0.0e+0058.47Show/hide
Query:  MVQTRIEEKIETIDQEIHGIKKEIGKLLAIEKTLNNISKNM-------ERQNQLILRIMESAAQERTTMNERISELSIQTFSVKIIDEGEGSSRRE-SET
        MVQTRIEE++E  +QEI GIKKE+ K+ AIE TL  I+KNM       E+Q Q IL  ME+ A+ER    ERI+E  IQ         G+ SS  +  ET
Subjt:  MVQTRIEEKIETIDQEIHGIKKEIGKLLAIEKTLNNISKNM-------ERQNQLILRIMESAAQERTTMNERISELSIQTFSVKIIDEGEGSSRRE-SET

Query:  NNEEKKPNEEGNNDRNKFKKVEMPIFNGDDLDSWLFCAERYFLIHKLTESEKITVSTISFEGPTLNWYRSQEEREK------------------------
        + E K  ++E  NDR+KFKKVEMP+F G+D +SWLF AERYF IHKLTESEK+ VSTI F+GP LNWYRSQEEREK                        
Subjt:  NNEEKKPNEEGNNDRNKFKKVEMPIFNGDDLDSWLFCAERYFLIHKLTESEKITVSTISFEGPTLNWYRSQEEREK------------------------

Query:  -RFLRIQQTTTMDEYQNLFDK---------------------WRWISVS------QKDARMIRIAQKIENREDIRREANLPRYSGGR-TTNPQNSTKTNT
         RFLRIQQ TT++EY+N FDK                     + WI         +  A  +  AQ +E+RE +R  ANL  Y GG+ +       K + 
Subjt:  -RFLRIQQTTTMDEYQNLFDK---------------------WRWISVS------QKDARMIRIAQKIENREDIRREANLPRYSGGR-TTNPQNSTKTNT

Query:  NVNTGENKGGMNWPMRTITLRGTSKEEVPKEGPTKRLSDAEFQARKEKGLCFRCNEKYSHDHKCKAKELRELRMLVVKEDNEEYEIIEDGNNALKELNTI
             E+K   ++P+RTITL+  +  E+ KEG +KRL DAEFQ RKEKGLCF+CNEKYS DHKCK KE RELRM VVK DNEE EI+E+      E+   
Subjt:  NVNTGENKGGMNWPMRTITLRGTSKEEVPKEGPTKRLSDAEFQARKEKGLCFRCNEKYSHDHKCKAKELRELRMLVVKEDNEEYEIIEDGNNALKELNTI

Query:  EIMEEGQAIVELSMNSVVGLDNLGTMEVKGTIRGREIVVLIDCGATHNFVSERLVKELQLETKETSNYGVILGFGTAIKGKGICGAVELMLGDWKVIDEF
        E+       VELS+NSVVGL++ GTM+VKG+++G+E+V+LIDCGATHNFVSE++V  LQL  KET++YGVILG GTAI+GKGIC +VE+ + +W V ++F
Subjt:  EIMEEGQAIVELSMNSVVGLDNLGTMEVKGTIRGREIVVLIDCGATHNFVSERLVKELQLETKETSNYGVILGFGTAIKGKGICGAVELMLGDWKVIDEF

Query:  LPLELGGVDAILNMQWLYSLGIT-----------------------EALERCDLS--------EEED---LIE-------EILTE--------EESVAVV
        LPLELGGVD IL MQWLYSLG+T                        +L +  +S        EE D   LIE       E+ T         ++ +  +
Subjt:  LPLELGGVDAILNMQWLYSLGIT-----------------------EALERCDLS--------EEED---LIE-------EILTE--------EESVAVV

Query:  LKRFEDVFEWPETLPPWRIIEHHIHLKEGVNPVNVRPYRYAYQQKNEMEK---------------------------------------ALNNVTVPDKF
        L +F DVFEWPE LPP R IEH IHLKEG NPVNVRPYRYAY QK EMEK                                       ALNNVTVPDKF
Subjt:  LKRFEDVFEWPETLPPWRIIEHHIHLKEGVNPVNVRPYRYAYQQKNEMEK---------------------------------------ALNNVTVPDKF

Query:  PIPVIEELFDELDGAVIFTKIDLKAGYHQIRMCADDIEKTAFRTHEGHYEFMVMPFGLTNAPSTFQSLMNAIFKPYLIKFVLVFFDDILIYSKNLKAHLD
        PIPV+EELFDEL GA +FTKIDLKAGYHQIRM   DIEKTAFRTHEGHYEF+VMPFGLTNAP+TFQSLMN+IF+PYL KFVLVFFDDILIYS+N + HL 
Subjt:  PIPVIEELFDELDGAVIFTKIDLKAGYHQIRMCADDIEKTAFRTHEGHYEFMVMPFGLTNAPSTFQSLMNAIFKPYLIKFVLVFFDDILIYSKNLKAHLD

Query:  HIKAVLEVLRKNELYANKKKCSFAKARVEYLGHIISGGGVEVDPEKIRAIKEWPIPTNVREVRGFLGLTGYYRKFVKHYGTIAAPLTQLLKKGGFRWTEE
        H++ V  VLRK+EL+AN+KKCSF  A+VEYLGH+IS  GVEVDPEKI+AI  WP PTNVRE RGFLGLTGYYRKFV HYGT+AAPLTQLLKKGGF+W  E
Subjt:  HIKAVLEVLRKNELYANKKKCSFAKARVEYLGHIISGGGVEVDPEKIRAIKEWPIPTNVREVRGFLGLTGYYRKFVKHYGTIAAPLTQLLKKGGFRWTEE

Query:  TQEAFVKLQMTMMTLPMLAMPDFSIPFEIETDASGYGLGAVLIQNQRPIAYYSHTLAVRDRVKPVYERELMAVVMTVQRWRPYLLGKRFKVKTDPRSLKF
         ++AF KL+  M+ LP+LA+P F  PFEIETDASGYG+GAVLIQN+RPIA+YSHTLA RDR +PVYERELMAVV+ VQRWRPYLLG RF V+TD +SLKF
Subjt:  TQEAFVKLQMTMMTLPMLAMPDFSIPFEIETDASGYGLGAVLIQNQRPIAYYSHTLAVRDRVKPVYERELMAVVMTVQRWRPYLLGKRFKVKTDPRSLKF

Query:  FLEQRVIQPQYQKWISKLLGYSFEVIYKPGLENKAADALSRIPPTVHLNQLTAPNIIDVAVIKEEVNQDERLKKIRGELEEKGEQQPSKYSMKQGMLMYK
         LEQRV+QPQYQ+W++KLLGY+F+V YKPG+ENKAADALSR+ PT+  + +T P  +D+ VIKEEV +D RL KI   L    +QQ +K+++  GML YK
Subjt:  FLEQRVIQPQYQKWISKLLGYSFEVIYKPGLENKAADALSRIPPTVHLNQLTAPNIIDVAVIKEEVNQDERLKKIRGELEEKGEQQPSKYSMKQGMLMYK

Query:  NRMVISKTSKLIAMILHTFHNSVFGGHSGFLRTYKRLTGELYWEGMKQYVKKYCEECMICQRNKTLALSPAGLLTPLEVPNRVWEEISMDFVDGLPKENG
        +R+VIS++SKLI  +LH++H+S  GGHSGFLRTYKR+ GELYW+GMK  +KKYC EC+ICQRNKTL LSPAGLL PL +P  +W +ISMDFV+GLPK  G
Subjt:  NRMVISKTSKLIAMILHTFHNSVFGGHSGFLRTYKRLTGELYWEGMKQYVKKYCEECMICQRNKTLALSPAGLLTPLEVPNRVWEEISMDFVDGLPKENG

Query:  FEVIFVVVDRFRLSVNMVTFLPMKHPYNAKSLAELFVKEVVRLHDFPKSIVSDIDKVFLSSFWKELFRLAGTRLNHSTAYHPQSDGQTEVVNRGVAIYLL
        FEVIFVVVDR         FLP+KHPY+AK++A+LFVKEVVRLH FP SIVSD D+VFLS+FWKE+FRLAGT+LN S+AYHPQSDGQTEVVNRGV +YL 
Subjt:  FEVIFVVVDRFRLSVNMVTFLPMKHPYNAKSLAELFVKEVVRLHDFPKSIVSDIDKVFLSSFWKELFRLAGTRLNHSTAYHPQSDGQTEVVNRGVAIYLL

Query:  CFCGEKLKEWVKWIPWAKYWYNTTYQRSLGITPFQAVYGRLPPPLIYYGVRDTSNSTLDEQLKERDVALDALKEHLQVAQDKMKKYADLKRRDVQ-----
        C C +K KEW+KWI WA+YWYNTT+QR+LG+TPFQ VYGR PPPL+ YG + TSN+TLDEQL+ERD  + +L+EHL++AQD+MKK AD KRRDV+     
Subjt:  CFCGEKLKEWVKWIPWAKYWYNTTYQRSLGITPFQAVYGRLPPPLIYYGVRDTSNSTLDEQLKERDVALDALKEHLQVAQDKMKKYADLKRRDVQ-----

Query:  -------PYRQVSLRRKRNEKLSAKYFGPYKVIERVGPVAYKLELPAQAAIHPVFHEN------------CDDIPYLSENYEWRAMPEEVYGYLKNKAGS
               PYRQ+SLRRKRNEKLSAKYFGPYK++ER+GPVAYKLELP    IHPVFH +               +  L EN+ W   P E   Y +NKA  
Subjt:  -------PYRQVSLRRKRNEKLSAKYFGPYKVIERVGPVAYKLELPAQAAIHPVFHEN------------CDDIPYLSENYEWRAMPEEVYGYLKNKAGS

Query:  WDVLVSWKGLPRHEATWELYEDLQQRFPDFHLEDKVNLKRESNDRPPILYQYTRKGKK
        W+V++ W+GL  HEATWE Y+D+  ++P+FHLEDKV+L+  SN RPPIL+QY+RK K+
Subjt:  WDVLVSWKGLPRHEATWELYEDLQQRFPDFHLEDKVNLKRESNDRPPILYQYTRKGKK

TrEMBL top hitse value%identityAlignment
A0A5A7V5H5 Ty3/gypsy retrotransposon protein0.0e+0058.63Show/hide
Query:  MVQTRIEEKIETIDQEIHGIKKEIGKLLAIEKTLNNISKNM-------ERQNQLILRIMESAAQERTTMNERISELSIQTFSVKIIDEGEGSSRRESETN
        MVQTR EE++E+ +QE+ GIKKE+ K+  IE TL  +++NM       E+Q Q IL  ME  A+ER+   ER++E   Q          + SS R+ E  
Subjt:  MVQTRIEEKIETIDQEIHGIKKEIGKLLAIEKTLNNISKNM-------ERQNQLILRIMESAAQERTTMNERISELSIQTFSVKIIDEGEGSSRRESETN

Query:  NEEKKPNEEGNNDRNKFKKVEMPIFNGDDLDSWLFCAERYFLIHKLTESEKITVSTISFEGPTLNWYRSQEEREK-------------------------
        N +K   +E +NDR+KFKKVEMP+F G+D +SWLF AERYF IHKLTESEK+ VSTI F+GP LNWYR+QEEREK                         
Subjt:  NEEKKPNEEGNNDRNKFKKVEMPIFNGDDLDSWLFCAERYFLIHKLTESEKITVSTISFEGPTLNWYRSQEEREK-------------------------

Query:  RFLRIQQTTTMDEYQNLFDK---------------------WRWISVS------QKDARMIRIAQKIENREDIRREANLPRYSGGRTTNPQNS-TKTNTN
        RFLRIQQ TT++EY+NLFDK                     + WI         +  A M+R AQ +E+RE +R  ANL  Y GG+++ P ++ TK   +
Subjt:  RFLRIQQTTTMDEYQNLFDK---------------------WRWISVS------QKDARMIRIAQKIENREDIRREANLPRYSGGRTTNPQNS-TKTNTN

Query:  VNTGENKGGMNWPMRTITLRGTSKEEVPKEGPTKRLSDAEFQARKEKGLCFRCNEKYSHDHKCKAKELRELRMLVVKEDNEEYEIIEDGNNALKELNTIE
            ENK    +P+RTITL+  +  E  KEG +KRL DAEFQ R+EKGLCF+CNEKYS DHKCK +E RELRM VVK++NEE EI+E+      EL T+E
Subjt:  VNTGENKGGMNWPMRTITLRGTSKEEVPKEGPTKRLSDAEFQARKEKGLCFRCNEKYSHDHKCKAKELRELRMLVVKEDNEEYEIIEDGNNALKELNTIE

Query:  IMEEGQAIVELSMNSVVGLDNLGTMEVKGTIRGREIVVLIDCGATHNFVSERLVKELQLETKETSNYGVILGFGTAIKGKGICGAVELMLGDWKVIDEFL
        +  +  A VELS+NSVVGL++ GTM+V+GT++G+E+V+LIDCGATHNFVSE+LV  LQL  KET++YGVILG GTAI+GKGIC ++E+ + DW V ++FL
Subjt:  IMEEGQAIVELSMNSVVGLDNLGTMEVKGTIRGREIVVLIDCGATHNFVSERLVKELQLETKETSNYGVILGFGTAIKGKGICGAVELMLGDWKVIDEFL

Query:  PLELGGVDAILNMQWLYSLGIT-----------------------EALERCDLS--------EEED---LIE---------EILT--------EEESVAV
        PLELGGVD IL MQWLYSLG+T                        +L +  +S        EE D   LIE         E +T         EE +  
Subjt:  PLELGGVDAILNMQWLYSLGIT-----------------------EALERCDLS--------EEED---LIE---------EILT--------EEESVAV

Query:  VLKRFEDVFEWPETLPPWRIIEHHIHLKEGVNPVNVRPYRYAYQQKNEME---------------------------------------KALNNVTVPDK
        +L +F+D+FEWPE LPP R IEH IHLKEG NPVNVRPYRYAY QK EME                                       +ALNNVTVPDK
Subjt:  VLKRFEDVFEWPETLPPWRIIEHHIHLKEGVNPVNVRPYRYAYQQKNEME---------------------------------------KALNNVTVPDK

Query:  FPIPVIEELFDELDGAVIFTKIDLKAGYHQIRMCADDIEKTAFRTHEGHYEFMVMPFGLTNAPSTFQSLMNAIFKPYLIKFVLVFFDDILIYSKNLKAHL
        FPIPV+EELFDEL GA +FTKIDLKAGYHQIRM   DIEKTAFRTHEGHYEF+VMPFGLTNAP+TFQSLMN+IF+PYL +FVLVFFDDILIYS+NL+ HL
Subjt:  FPIPVIEELFDELDGAVIFTKIDLKAGYHQIRMCADDIEKTAFRTHEGHYEFMVMPFGLTNAPSTFQSLMNAIFKPYLIKFVLVFFDDILIYSKNLKAHL

Query:  DHIKAVLEVLRKNELYANKKKCSFAKARVEYLGHIISGGGVEVDPEKIRAIKEWPIPTNVREVRGFLGLTGYYRKFVKHYGTIAAPLTQLLKKGGFRWTE
         HI+ V  VLRK+EL+AN+KKCSF  A+VEYLGH+IS  GVEVDPEKI+AI +WP PT+VRE RGFLGLTGYYRKFV HYGT+AAPLTQLLKKGGF W  
Subjt:  DHIKAVLEVLRKNELYANKKKCSFAKARVEYLGHIISGGGVEVDPEKIRAIKEWPIPTNVREVRGFLGLTGYYRKFVKHYGTIAAPLTQLLKKGGFRWTE

Query:  ETQEAFVKLQMTMMTLPMLAMPDFSIPFEIETDASGYGLGAVLIQNQRPIAYYSHTLAVRDRVKPVYERELMAVVMTVQRWRPYLLGKRFKVKTDPRSLK
        E ++AF KL+  M+ LP+LA+P F  PFEIETDASGYG+GAVLIQN+RPIA+YSHTLA+RDR +PVYERELMAVV+ VQRWRPYLLG RF V+TD +SLK
Subjt:  ETQEAFVKLQMTMMTLPMLAMPDFSIPFEIETDASGYGLGAVLIQNQRPIAYYSHTLAVRDRVKPVYERELMAVVMTVQRWRPYLLGKRFKVKTDPRSLK

Query:  FFLEQRVIQPQYQKWISKLLGYSFEVIYKPGLENKAADALSRIPPTVHLNQLTAPNIIDVAVIKEEVNQDERLKKIRGELEEKGEQQPSKYSMKQGMLMY
        F LEQRV+QPQYQ+W++KLLGY+F+V YKPG+ENKAADALSRI PTV +  +T P  +D+ +IKEEV +D +L KI  E+      Q SK+ +  GML Y
Subjt:  FFLEQRVIQPQYQKWISKLLGYSFEVIYKPGLENKAADALSRIPPTVHLNQLTAPNIIDVAVIKEEVNQDERLKKIRGELEEKGEQQPSKYSMKQGMLMY

Query:  KNRMVISKTSKLIAMILHTFHNSVFGGHSGFLRTYKRLTGELYWEGMKQYVKKYCEECMICQRNKTLALSPAGLLTPLEVPNRVWEEISMDFVDGLPKEN
        K+R+VIS+TSKLI  ILH++H+S  GGHSGFLRTYKR++GELYW+GMK  VKKYC EC+ICQ+NKTL LSPAGLL PL +P  +W +ISMDFV+GLPK  
Subjt:  KNRMVISKTSKLIAMILHTFHNSVFGGHSGFLRTYKRLTGELYWEGMKQYVKKYCEECMICQRNKTLALSPAGLLTPLEVPNRVWEEISMDFVDGLPKEN

Query:  GFEVIFVVVDRFRLSVNMVTFLPMKHPYNAKSLAELFVKEVVRLHDFPKSIVSDIDKVFLSSFWKELFRLAGTRLNHSTAYHPQSDGQTEVVNRGVAIYL
        GFEVIFVVVDR         F+P+KHPY+AK++AELFVKEVVRLH FP SIVSD D+VFLS+FWKE+FRLAGT+LN S+AYHPQSDGQTEVVNRGV  YL
Subjt:  GFEVIFVVVDRFRLSVNMVTFLPMKHPYNAKSLAELFVKEVVRLHDFPKSIVSDIDKVFLSSFWKELFRLAGTRLNHSTAYHPQSDGQTEVVNRGVAIYL

Query:  LCFCGEKLKEWVKWIPWAKYWYNTTYQRSLGITPFQAVYGRLPPPLIYYGVRDTSNSTLDEQLKERDVALDALKEHLQVAQDKMKKYADLKRRDVQ----
         CFC +K KEWVKWI WA+YWYNTT+Q++LG+TPFQ VYGR PPPL+ YG + T N TLDEQLKERD  + +L+E+L++AQ++MKKYAD +RRD++    
Subjt:  LCFCGEKLKEWVKWIPWAKYWYNTTYQRSLGITPFQAVYGRLPPPLIYYGVRDTSNSTLDEQLKERDVALDALKEHLQVAQDKMKKYADLKRRDVQ----

Query:  --------PYRQVSLRRKRNEKLSAKYFGPYKVIERVGPVAYKLELPAQAAIHPVFH---------ENCD---DIPYLSENYEWRAMPEEVYGYLKNKAG
                PYRQ+SLRRKRNEKLSAKYFGPYK++ER+GPVAYKLELP  A IHPVFH         E+ D    I  L EN+ W+  P E   Y +NK G
Subjt:  --------PYRQVSLRRKRNEKLSAKYFGPYKVIERVGPVAYKLELPAQAAIHPVFH---------ENCD---DIPYLSENYEWRAMPEEVYGYLKNKAG

Query:  SWDVLVSWKGLPRHEATWELYEDLQQRFPDFHLEDKVNLKRESNDRPPILYQYTRKGKK
         W+V+V W GL  HE TWE Y D+  ++PDFHLEDKV+L+  SN RPPIL+QY+RK K+
Subjt:  SWDVLVSWKGLPRHEATWELYEDLQQRFPDFHLEDKVNLKRESNDRPPILYQYTRKGKK

A0A5A7VJA0 Ty3/gypsy retrotransposon protein0.0e+0058.63Show/hide
Query:  MVQTRIEEKIETIDQEIHGIKKEIGKLLAIEKTLNNISKNM-------ERQNQLILRIMESAAQERTTMNERISELSIQTFSVKIIDEGEGSSRRESETN
        MVQTRIEE++E+ +QE+ GIKKE+ K+  IE TL  +++NM       E+Q Q IL  ME  A+ER+   ER++E   Q          + SS R+ E  
Subjt:  MVQTRIEEKIETIDQEIHGIKKEIGKLLAIEKTLNNISKNM-------ERQNQLILRIMESAAQERTTMNERISELSIQTFSVKIIDEGEGSSRRESETN

Query:  NEEKKPNEEGNNDRNKFKKVEMPIFNGDDLDSWLFCAERYFLIHKLTESEKITVSTISFEGPTLNWYRSQEEREK-------------------------
        N +K   +E +NDR+KFKKVEMP+F G+D +SWLF AERYF IHKLTESEK+ VSTI F+GP LNWYR+QEEREK                         
Subjt:  NEEKKPNEEGNNDRNKFKKVEMPIFNGDDLDSWLFCAERYFLIHKLTESEKITVSTISFEGPTLNWYRSQEEREK-------------------------

Query:  RFLRIQQTTTMDEYQNLFDK---------------------WRWISVS------QKDARMIRIAQKIENREDIRREANLPRYSGGRTTNPQNS-TKTNTN
        RFLRIQQ TT++EY+NLFDK                     + WI         +  A M+R AQ +E+RE +R  ANL  Y GG+++ P ++ TK   +
Subjt:  RFLRIQQTTTMDEYQNLFDK---------------------WRWISVS------QKDARMIRIAQKIENREDIRREANLPRYSGGRTTNPQNS-TKTNTN

Query:  VNTGENKGGMNWPMRTITLRGTSKEEVPKEGPTKRLSDAEFQARKEKGLCFRCNEKYSHDHKCKAKELRELRMLVVKEDNEEYEIIEDGNNALKELNTIE
            ENK    +P+RTITL+  +  E  KEG +KRL DAEFQ R+EKGLCF+CNEKYS DHKCK +E RELRM VVK++NEE EI+E+      EL T+E
Subjt:  VNTGENKGGMNWPMRTITLRGTSKEEVPKEGPTKRLSDAEFQARKEKGLCFRCNEKYSHDHKCKAKELRELRMLVVKEDNEEYEIIEDGNNALKELNTIE

Query:  IMEEGQAIVELSMNSVVGLDNLGTMEVKGTIRGREIVVLIDCGATHNFVSERLVKELQLETKETSNYGVILGFGTAIKGKGICGAVELMLGDWKVIDEFL
        +  +  A VELS+NSVVGL++ GTM+V+GT++G+E+V+LIDCGATHNFVSE+LV  LQL  KET++YGVILG GTAI+GKGIC ++E+ + DW V ++FL
Subjt:  IMEEGQAIVELSMNSVVGLDNLGTMEVKGTIRGREIVVLIDCGATHNFVSERLVKELQLETKETSNYGVILGFGTAIKGKGICGAVELMLGDWKVIDEFL

Query:  PLELGGVDAILNMQWLYSLGIT-----------------------EALERCDLS--------EEED---LIE---------EILT--------EEESVAV
        PLELGGVD IL MQWLYSLG+T                        +L +  +S        EE D   LIE         E +T         EE +  
Subjt:  PLELGGVDAILNMQWLYSLGIT-----------------------EALERCDLS--------EEED---LIE---------EILT--------EEESVAV

Query:  VLKRFEDVFEWPETLPPWRIIEHHIHLKEGVNPVNVRPYRYAYQQKNEME---------------------------------------KALNNVTVPDK
        +L +F+D+FEWPE LPP R IEH IHLKEG NPVNVRPYRYAY QK EME                                       +ALNNVTVPDK
Subjt:  VLKRFEDVFEWPETLPPWRIIEHHIHLKEGVNPVNVRPYRYAYQQKNEME---------------------------------------KALNNVTVPDK

Query:  FPIPVIEELFDELDGAVIFTKIDLKAGYHQIRMCADDIEKTAFRTHEGHYEFMVMPFGLTNAPSTFQSLMNAIFKPYLIKFVLVFFDDILIYSKNLKAHL
        FPIPV+EELFDEL GA +FTKIDLKAGYHQIRM   DIEKTAFRTHEGHYEF+VMPFGLTNAP+TFQSLMN+IF+PYL +FVLVFFDDILIYS+NL+ HL
Subjt:  FPIPVIEELFDELDGAVIFTKIDLKAGYHQIRMCADDIEKTAFRTHEGHYEFMVMPFGLTNAPSTFQSLMNAIFKPYLIKFVLVFFDDILIYSKNLKAHL

Query:  DHIKAVLEVLRKNELYANKKKCSFAKARVEYLGHIISGGGVEVDPEKIRAIKEWPIPTNVREVRGFLGLTGYYRKFVKHYGTIAAPLTQLLKKGGFRWTE
         HI+ V  VLRK+EL+AN+KKCSF  A+VEYLGH+IS  GVEVDPEKI+AI +WP PT+VRE RGFLGLTGYYRKFV HYGT+AAPLTQLLKKGGF W  
Subjt:  DHIKAVLEVLRKNELYANKKKCSFAKARVEYLGHIISGGGVEVDPEKIRAIKEWPIPTNVREVRGFLGLTGYYRKFVKHYGTIAAPLTQLLKKGGFRWTE

Query:  ETQEAFVKLQMTMMTLPMLAMPDFSIPFEIETDASGYGLGAVLIQNQRPIAYYSHTLAVRDRVKPVYERELMAVVMTVQRWRPYLLGKRFKVKTDPRSLK
        E ++AF KL+  M+ LP+LA+P F  PFEIETDASGYG+GAVLIQN+RPIA+YSHTLA+RDR +PVYERELMAVV+ VQRWRPYLLG RF V+TD +SLK
Subjt:  ETQEAFVKLQMTMMTLPMLAMPDFSIPFEIETDASGYGLGAVLIQNQRPIAYYSHTLAVRDRVKPVYERELMAVVMTVQRWRPYLLGKRFKVKTDPRSLK

Query:  FFLEQRVIQPQYQKWISKLLGYSFEVIYKPGLENKAADALSRIPPTVHLNQLTAPNIIDVAVIKEEVNQDERLKKIRGELEEKGEQQPSKYSMKQGMLMY
        F LEQRV+QPQYQ+W++KLLGY+F+V YKPG+ENKAADALSRI PTV +  +T P  +D+ +IKEEV +D +L KI  E+      Q SK+ +  GML Y
Subjt:  FFLEQRVIQPQYQKWISKLLGYSFEVIYKPGLENKAADALSRIPPTVHLNQLTAPNIIDVAVIKEEVNQDERLKKIRGELEEKGEQQPSKYSMKQGMLMY

Query:  KNRMVISKTSKLIAMILHTFHNSVFGGHSGFLRTYKRLTGELYWEGMKQYVKKYCEECMICQRNKTLALSPAGLLTPLEVPNRVWEEISMDFVDGLPKEN
        K+R+VIS+TSKLI  ILH++H+S  GGHSGFLRTYKR++GELYW+GMK  VKKYC EC+ICQ+NKTL LSPAGLL PL +P  +W +ISMDFV+GLPK  
Subjt:  KNRMVISKTSKLIAMILHTFHNSVFGGHSGFLRTYKRLTGELYWEGMKQYVKKYCEECMICQRNKTLALSPAGLLTPLEVPNRVWEEISMDFVDGLPKEN

Query:  GFEVIFVVVDRFRLSVNMVTFLPMKHPYNAKSLAELFVKEVVRLHDFPKSIVSDIDKVFLSSFWKELFRLAGTRLNHSTAYHPQSDGQTEVVNRGVAIYL
        GFEVIFVVVDR         F+P+KHPY+AK++AELFVKEVVRLH FP SIVSD D+VFLS+FWKE+FRLAGT+LN S+AYHPQSDGQTEVVNRGV +YL
Subjt:  GFEVIFVVVDRFRLSVNMVTFLPMKHPYNAKSLAELFVKEVVRLHDFPKSIVSDIDKVFLSSFWKELFRLAGTRLNHSTAYHPQSDGQTEVVNRGVAIYL

Query:  LCFCGEKLKEWVKWIPWAKYWYNTTYQRSLGITPFQAVYGRLPPPLIYYGVRDTSNSTLDEQLKERDVALDALKEHLQVAQDKMKKYADLKRRDVQ----
         CFC +K KEWVKWI WA+YWYNTT+Q++LG+TPFQ VYGR PPPL+ YG + T N TLDEQLKERD  + +L+E+L++AQ++MKKYAD +RRD++    
Subjt:  LCFCGEKLKEWVKWIPWAKYWYNTTYQRSLGITPFQAVYGRLPPPLIYYGVRDTSNSTLDEQLKERDVALDALKEHLQVAQDKMKKYADLKRRDVQ----

Query:  --------PYRQVSLRRKRNEKLSAKYFGPYKVIERVGPVAYKLELPAQAAIHPVFH---------ENCD---DIPYLSENYEWRAMPEEVYGYLKNKAG
                PYRQ+SLRRKRNEKLSAKYFGPYK++ER+GPVAYKLELP  A IHPVFH         E+ D    I  L EN+ W+  P E   Y +NK G
Subjt:  --------PYRQVSLRRKRNEKLSAKYFGPYKVIERVGPVAYKLELPAQAAIHPVFH---------ENCD---DIPYLSENYEWRAMPEEVYGYLKNKAG

Query:  SWDVLVSWKGLPRHEATWELYEDLQQRFPDFHLEDKVNLKRESNDRPPILYQYTRKGKK
         W+V++ W GL  HE TWE Y D+  ++PDFHLEDKV+L+  SN RPPIL+QY+RK K+
Subjt:  SWDVLVSWKGLPRHEATWELYEDLQQRFPDFHLEDKVNLKRESNDRPPILYQYTRKGKK

A0A5D3B8Y6 Ty3/gypsy retrotransposon protein0.0e+0058.3Show/hide
Query:  IETIDQEIHGIKKEIGKLLAIEKTLNNISKNM-------ERQNQLILRIMESAAQERTTMNERISELSIQTFSVKIIDEGEGSSRRE-SETNNEEKKPNE
        +E  +QEI GIKKE+ K+ AIE TL  I+KNM       E+Q Q IL  ME+ A+ER    ERI+E  IQ         G+ SS  +  ET+ E K  ++
Subjt:  IETIDQEIHGIKKEIGKLLAIEKTLNNISKNM-------ERQNQLILRIMESAAQERTTMNERISELSIQTFSVKIIDEGEGSSRRE-SETNNEEKKPNE

Query:  EGNNDRNKFKKVEMPIFNGDDLDSWLFCAERYFLIHKLTESEKITVSTISFEGPTLNWYRSQEEREK-------------------------RFLRIQQT
        E  NDR+KFKKVEMP+F G+D +SWLF AERYF IHKLTESEK+ VSTI F+GP LNWYRSQEEREK                         RFLRIQQ 
Subjt:  EGNNDRNKFKKVEMPIFNGDDLDSWLFCAERYFLIHKLTESEKITVSTISFEGPTLNWYRSQEEREK-------------------------RFLRIQQT

Query:  TTMDEYQNLFDK---------------------WRWISVS------QKDARMIRIAQKIENREDIRREANLPRYSGGR-TTNPQNSTKTNTNVNTGENKG
        TT++EY+N FDK                     + WI         +  A  +  AQ +E+RE +R  ANL  Y GG+ +       K +      E+K 
Subjt:  TTMDEYQNLFDK---------------------WRWISVS------QKDARMIRIAQKIENREDIRREANLPRYSGGR-TTNPQNSTKTNTNVNTGENKG

Query:  GMNWPMRTITLRGTSKEEVPKEGPTKRLSDAEFQARKEKGLCFRCNEKYSHDHKCKAKELRELRMLVVKEDNEEYEIIEDGNNALKELNTIEIMEEGQAI
          ++P+RTITL+  +  E+ KEG +KRL DAEFQ RKEKGLCF+CNEKYS DHKCK KE RELRM VVK DNEE EI+E+      E+   E+       
Subjt:  GMNWPMRTITLRGTSKEEVPKEGPTKRLSDAEFQARKEKGLCFRCNEKYSHDHKCKAKELRELRMLVVKEDNEEYEIIEDGNNALKELNTIEIMEEGQAI

Query:  VELSMNSVVGLDNLGTMEVKGTIRGREIVVLIDCGATHNFVSERLVKELQLETKETSNYGVILGFGTAIKGKGICGAVELMLGDWKVIDEFLPLELGGVD
        VELS+NSVVGL++ GTM+VKG+++G+E+V+LIDCGATHNFVSE++V  LQL  KET++YGVILG GTAI+GKGIC +VE+ + +W V ++FLPLELGGVD
Subjt:  VELSMNSVVGLDNLGTMEVKGTIRGREIVVLIDCGATHNFVSERLVKELQLETKETSNYGVILGFGTAIKGKGICGAVELMLGDWKVIDEFLPLELGGVD

Query:  AILNMQWLYSLGIT-----------------------EALERCDLS--------EEED---LIE-------EILTE--------EESVAVVLKRFEDVFE
         IL MQWLYSLG+T                        +L +  +S        EE D   LIE       E+ T         ++ +  +L +F DVFE
Subjt:  AILNMQWLYSLGIT-----------------------EALERCDLS--------EEED---LIE-------EILTE--------EESVAVVLKRFEDVFE

Query:  WPETLPPWRIIEHHIHLKEGVNPVNVRPYRYAYQQKNEMEK---------------------------------------ALNNVTVPDKFPIPVIEELF
        WPE LPP R IEH IHLKEG NPVNVRPYRYAY QK EMEK                                       ALNNVTVPDKFPIPV+EELF
Subjt:  WPETLPPWRIIEHHIHLKEGVNPVNVRPYRYAYQQKNEMEK---------------------------------------ALNNVTVPDKFPIPVIEELF

Query:  DELDGAVIFTKIDLKAGYHQIRMCADDIEKTAFRTHEGHYEFMVMPFGLTNAPSTFQSLMNAIFKPYLIKFVLVFFDDILIYSKNLKAHLDHIKAVLEVL
        DEL GA +FTKIDLKAGYHQIRM   DIEKTAFRTHEGHYEF+VMPFGLTNAP+TFQSLMN+IF+PYL KFVLVFFDDILIYS+N + HL H++ V  VL
Subjt:  DELDGAVIFTKIDLKAGYHQIRMCADDIEKTAFRTHEGHYEFMVMPFGLTNAPSTFQSLMNAIFKPYLIKFVLVFFDDILIYSKNLKAHLDHIKAVLEVL

Query:  RKNELYANKKKCSFAKARVEYLGHIISGGGVEVDPEKIRAIKEWPIPTNVREVRGFLGLTGYYRKFVKHYGTIAAPLTQLLKKGGFRWTEETQEAFVKLQ
        RK+EL+AN+KKCSF  A+VEYLGH+IS  GVEVDPEKI+AI  WP PTNVRE RGFLGLTGYYRKFV HYGT+AAPLTQLLKKGGF+W  E ++AF KL+
Subjt:  RKNELYANKKKCSFAKARVEYLGHIISGGGVEVDPEKIRAIKEWPIPTNVREVRGFLGLTGYYRKFVKHYGTIAAPLTQLLKKGGFRWTEETQEAFVKLQ

Query:  MTMMTLPMLAMPDFSIPFEIETDASGYGLGAVLIQNQRPIAYYSHTLAVRDRVKPVYERELMAVVMTVQRWRPYLLGKRFKVKTDPRSLKFFLEQRVIQP
          M+ LP+LA+P F  PFEIETDASGYG+GAVLIQN+RPIA+YSHTLA RDR +PVYERELMAVV+ VQRWRPYLLG RF V+TD +SLKF LEQRV+QP
Subjt:  MTMMTLPMLAMPDFSIPFEIETDASGYGLGAVLIQNQRPIAYYSHTLAVRDRVKPVYERELMAVVMTVQRWRPYLLGKRFKVKTDPRSLKFFLEQRVIQP

Query:  QYQKWISKLLGYSFEVIYKPGLENKAADALSRIPPTVHLNQLTAPNIIDVAVIKEEVNQDERLKKIRGELEEKGEQQPSKYSMKQGMLMYKNRMVISKTS
        QYQ+W++KLLGY+F+V YKPG+ENKAADALSR+ PT+  + +T P  +D+ VIKEEV +D RL KI   L    +QQ +K+++  GML YK+R+VIS++S
Subjt:  QYQKWISKLLGYSFEVIYKPGLENKAADALSRIPPTVHLNQLTAPNIIDVAVIKEEVNQDERLKKIRGELEEKGEQQPSKYSMKQGMLMYKNRMVISKTS

Query:  KLIAMILHTFHNSVFGGHSGFLRTYKRLTGELYWEGMKQYVKKYCEECMICQRNKTLALSPAGLLTPLEVPNRVWEEISMDFVDGLPKENGFEVIFVVVD
        KLI  +LH++H+S  GGHSGFLRTYKR+ GELYW+GMK  +KKYC EC+ICQRNKTL LSPAGLL PL +P  +W +ISMDFV+GLPK  GFEVIFVVVD
Subjt:  KLIAMILHTFHNSVFGGHSGFLRTYKRLTGELYWEGMKQYVKKYCEECMICQRNKTLALSPAGLLTPLEVPNRVWEEISMDFVDGLPKENGFEVIFVVVD

Query:  RFRLSVNMVTFLPMKHPYNAKSLAELFVKEVVRLHDFPKSIVSDIDKVFLSSFWKELFRLAGTRLNHSTAYHPQSDGQTEVVNRGVAIYLLCFCGEKLKE
        R         FLP+KHPY+AK++A+LFVKEVVRLH FP SIVSD D+VFLS+FWKE+FRLAGT+LN S+AYHPQSDGQTEVVNRGV +YL C C +K KE
Subjt:  RFRLSVNMVTFLPMKHPYNAKSLAELFVKEVVRLHDFPKSIVSDIDKVFLSSFWKELFRLAGTRLNHSTAYHPQSDGQTEVVNRGVAIYLLCFCGEKLKE

Query:  WVKWIPWAKYWYNTTYQRSLGITPFQAVYGRLPPPLIYYGVRDTSNSTLDEQLKERDVALDALKEHLQVAQDKMKKYADLKRRDVQ------------PY
        W+KWI WA+YWYNTT+QR+LG+TPFQ VYGR PPPL+ YG + TSN+TLDEQL+ERD  + +L+EHL++AQD+MKK AD KRRDV+            PY
Subjt:  WVKWIPWAKYWYNTTYQRSLGITPFQAVYGRLPPPLIYYGVRDTSNSTLDEQLKERDVALDALKEHLQVAQDKMKKYADLKRRDVQ------------PY

Query:  RQVSLRRKRNEKLSAKYFGPYKVIERVGPVAYKLELPAQAAIHPVFHEN------------CDDIPYLSENYEWRAMPEEVYGYLKNKAGSWDVLVSWKG
        RQ+SLRRKRNEKLSAKYFGPYK++ER+GPVAYKLELP    IHPVFH +               +  L EN+ W   P E   Y +NKA  W+V++ W+G
Subjt:  RQVSLRRKRNEKLSAKYFGPYKVIERVGPVAYKLELPAQAAIHPVFHEN------------CDDIPYLSENYEWRAMPEEVYGYLKNKAGSWDVLVSWKG

Query:  LPRHEATWELYEDLQQRFPDFHLEDKVNLKRESNDRPPILYQYTRKGKK
        L  HEATWE Y+D+  ++P+FHLEDKV+L+  SN RPPIL+QY+RK K+
Subjt:  LPRHEATWELYEDLQQRFPDFHLEDKVNLKRESNDRPPILYQYTRKGKK

A0A5D3BEL2 Ty3/gypsy retrotransposon protein0.0e+0058.63Show/hide
Query:  MVQTRIEEKIETIDQEIHGIKKEIGKLLAIEKTLNNISKNM-------ERQNQLILRIMESAAQERTTMNERISELSIQTFSVKIIDEGEGSSRRESETN
        MVQTRIEE++E+ +QE+ GIKKE+ K+  IE TL  +++NM       E+Q Q IL  ME  A+ER+   ER++E   Q          + SS R+ E  
Subjt:  MVQTRIEEKIETIDQEIHGIKKEIGKLLAIEKTLNNISKNM-------ERQNQLILRIMESAAQERTTMNERISELSIQTFSVKIIDEGEGSSRRESETN

Query:  NEEKKPNEEGNNDRNKFKKVEMPIFNGDDLDSWLFCAERYFLIHKLTESEKITVSTISFEGPTLNWYRSQEEREK-------------------------
        N +K   +E +NDR+KFKKVEMP+F G+D +SWLF AERYF IHKLTESEK+ VSTI F+GP LNWYR+QEEREK                         
Subjt:  NEEKKPNEEGNNDRNKFKKVEMPIFNGDDLDSWLFCAERYFLIHKLTESEKITVSTISFEGPTLNWYRSQEEREK-------------------------

Query:  RFLRIQQTTTMDEYQNLFDK---------------------WRWISVS------QKDARMIRIAQKIENREDIRREANLPRYSGGRTTNPQNS-TKTNTN
        RFLRIQQ TT++EY+NLFDK                     + WI         +  A M+R AQ +E+RE +R  ANL  Y GG+++ P ++ TK   +
Subjt:  RFLRIQQTTTMDEYQNLFDK---------------------WRWISVS------QKDARMIRIAQKIENREDIRREANLPRYSGGRTTNPQNS-TKTNTN

Query:  VNTGENKGGMNWPMRTITLRGTSKEEVPKEGPTKRLSDAEFQARKEKGLCFRCNEKYSHDHKCKAKELRELRMLVVKEDNEEYEIIEDGNNALKELNTIE
            ENK    +P+RTITL+  +  E  KEG +KRL DAEFQ R+EKGLCF+CNEKYS DHKCK +E RELRM VVK++NEE EI+E+      EL T+E
Subjt:  VNTGENKGGMNWPMRTITLRGTSKEEVPKEGPTKRLSDAEFQARKEKGLCFRCNEKYSHDHKCKAKELRELRMLVVKEDNEEYEIIEDGNNALKELNTIE

Query:  IMEEGQAIVELSMNSVVGLDNLGTMEVKGTIRGREIVVLIDCGATHNFVSERLVKELQLETKETSNYGVILGFGTAIKGKGICGAVELMLGDWKVIDEFL
        +  +  A VELS+NSVVGL++ GTM+V+GT++G+E+V+LIDCGATHNFVSE+LV  LQL  KET++YGVILG GTAI+GKGIC ++E+ + DW V ++FL
Subjt:  IMEEGQAIVELSMNSVVGLDNLGTMEVKGTIRGREIVVLIDCGATHNFVSERLVKELQLETKETSNYGVILGFGTAIKGKGICGAVELMLGDWKVIDEFL

Query:  PLELGGVDAILNMQWLYSLGIT-----------------------EALERCDLS--------EEED---LIE---------EILT--------EEESVAV
        PLELGGVD IL MQWLYSLG+T                        +L +  +S        EE D   LIE         E +T         EE +  
Subjt:  PLELGGVDAILNMQWLYSLGIT-----------------------EALERCDLS--------EEED---LIE---------EILT--------EEESVAV

Query:  VLKRFEDVFEWPETLPPWRIIEHHIHLKEGVNPVNVRPYRYAYQQKNEME---------------------------------------KALNNVTVPDK
        +L +F+D+FEWPE LPP R IEH IHLKEG NPVNVRPYRYAY QK EME                                       +ALNNVTVPDK
Subjt:  VLKRFEDVFEWPETLPPWRIIEHHIHLKEGVNPVNVRPYRYAYQQKNEME---------------------------------------KALNNVTVPDK

Query:  FPIPVIEELFDELDGAVIFTKIDLKAGYHQIRMCADDIEKTAFRTHEGHYEFMVMPFGLTNAPSTFQSLMNAIFKPYLIKFVLVFFDDILIYSKNLKAHL
        FPIPV+EELFDEL GA +FTKIDLKAGYHQIRM   DIEKTAFRTHEGHYEF+VMPFGLTNAP+TFQSLMN+IF+PYL +FVLVFFDDILIYS+NL+ HL
Subjt:  FPIPVIEELFDELDGAVIFTKIDLKAGYHQIRMCADDIEKTAFRTHEGHYEFMVMPFGLTNAPSTFQSLMNAIFKPYLIKFVLVFFDDILIYSKNLKAHL

Query:  DHIKAVLEVLRKNELYANKKKCSFAKARVEYLGHIISGGGVEVDPEKIRAIKEWPIPTNVREVRGFLGLTGYYRKFVKHYGTIAAPLTQLLKKGGFRWTE
         HI+ V  VLRK+EL+AN+KKCSF  A+VEYLGH+IS  GVEVDPEKI+AI +WP PT+VRE RGFLGLTGYYRKFV HYGT+AAPLTQLLKKGGF W  
Subjt:  DHIKAVLEVLRKNELYANKKKCSFAKARVEYLGHIISGGGVEVDPEKIRAIKEWPIPTNVREVRGFLGLTGYYRKFVKHYGTIAAPLTQLLKKGGFRWTE

Query:  ETQEAFVKLQMTMMTLPMLAMPDFSIPFEIETDASGYGLGAVLIQNQRPIAYYSHTLAVRDRVKPVYERELMAVVMTVQRWRPYLLGKRFKVKTDPRSLK
        E ++AF KL+  M+ LP+LA+P F  PFEIETDASGYG+GAVLIQN+RPIA+YSHTLA+RDR +PVYERELMAVV+ VQRWRPYLLG RF V+TD +SLK
Subjt:  ETQEAFVKLQMTMMTLPMLAMPDFSIPFEIETDASGYGLGAVLIQNQRPIAYYSHTLAVRDRVKPVYERELMAVVMTVQRWRPYLLGKRFKVKTDPRSLK

Query:  FFLEQRVIQPQYQKWISKLLGYSFEVIYKPGLENKAADALSRIPPTVHLNQLTAPNIIDVAVIKEEVNQDERLKKIRGELEEKGEQQPSKYSMKQGMLMY
        F LEQRV+QPQYQ+W++KLLGY+F+V YKPG+ENKAADALSRI PTV +  +T P  +D+ +IKEEV +D +L KI  E+      Q SK+ +  GML Y
Subjt:  FFLEQRVIQPQYQKWISKLLGYSFEVIYKPGLENKAADALSRIPPTVHLNQLTAPNIIDVAVIKEEVNQDERLKKIRGELEEKGEQQPSKYSMKQGMLMY

Query:  KNRMVISKTSKLIAMILHTFHNSVFGGHSGFLRTYKRLTGELYWEGMKQYVKKYCEECMICQRNKTLALSPAGLLTPLEVPNRVWEEISMDFVDGLPKEN
        K+R+VIS+TSKLI  ILH++H+S  GGHSGFLRTYKR++GELYW+GMK  VKKYC EC+ICQ+NKTL LSPAGLL PL +P  +W +ISMDFV+GLPK  
Subjt:  KNRMVISKTSKLIAMILHTFHNSVFGGHSGFLRTYKRLTGELYWEGMKQYVKKYCEECMICQRNKTLALSPAGLLTPLEVPNRVWEEISMDFVDGLPKEN

Query:  GFEVIFVVVDRFRLSVNMVTFLPMKHPYNAKSLAELFVKEVVRLHDFPKSIVSDIDKVFLSSFWKELFRLAGTRLNHSTAYHPQSDGQTEVVNRGVAIYL
        GFEVIFVVVDR         F+P+KHPY+AK++AELFVKEVVRLH FP SIVSD D+VFLS+FWKE+FRLAGT+LN S+AYHPQSDGQTEVVNRGV +YL
Subjt:  GFEVIFVVVDRFRLSVNMVTFLPMKHPYNAKSLAELFVKEVVRLHDFPKSIVSDIDKVFLSSFWKELFRLAGTRLNHSTAYHPQSDGQTEVVNRGVAIYL

Query:  LCFCGEKLKEWVKWIPWAKYWYNTTYQRSLGITPFQAVYGRLPPPLIYYGVRDTSNSTLDEQLKERDVALDALKEHLQVAQDKMKKYADLKRRDVQ----
         CFC +K KEWVKWI WA+YWYNTT+Q++LG+TPFQ VYGR PPPL+ YG + T N TLDEQLKERD  + +L+E+L++AQ++MKKYAD +RRD++    
Subjt:  LCFCGEKLKEWVKWIPWAKYWYNTTYQRSLGITPFQAVYGRLPPPLIYYGVRDTSNSTLDEQLKERDVALDALKEHLQVAQDKMKKYADLKRRDVQ----

Query:  --------PYRQVSLRRKRNEKLSAKYFGPYKVIERVGPVAYKLELPAQAAIHPVFH---------ENCD---DIPYLSENYEWRAMPEEVYGYLKNKAG
                PYRQ+SLRRKRNEKLSAKYFGPYK++ER+GPVAYKLELP  A IHPVFH         E+ D    I  L EN+ W+  P E   Y +NK G
Subjt:  --------PYRQVSLRRKRNEKLSAKYFGPYKVIERVGPVAYKLELPAQAAIHPVFH---------ENCD---DIPYLSENYEWRAMPEEVYGYLKNKAG

Query:  SWDVLVSWKGLPRHEATWELYEDLQQRFPDFHLEDKVNLKRESNDRPPILYQYTRKGKK
         W+V++ W GL  HE TWE Y D+  ++PDFHLEDKV+L+  SN RPPIL+QY+RK K+
Subjt:  SWDVLVSWKGLPRHEATWELYEDLQQRFPDFHLEDKVNLKRESNDRPPILYQYTRKGKK

A0A5D3DFT1 Ty3/gypsy retrotransposon protein0.0e+0058.47Show/hide
Query:  MVQTRIEEKIETIDQEIHGIKKEIGKLLAIEKTLNNISKNM-------ERQNQLILRIMESAAQERTTMNERISELSIQTFSVKIIDEGEGSSRRE-SET
        MVQTRIEE++E  +QEI GIKKE+ K+ AIE TL  I+KNM       E+Q Q IL  ME+ A+ER    ERI+E  IQ         G+ SS  +  ET
Subjt:  MVQTRIEEKIETIDQEIHGIKKEIGKLLAIEKTLNNISKNM-------ERQNQLILRIMESAAQERTTMNERISELSIQTFSVKIIDEGEGSSRRE-SET

Query:  NNEEKKPNEEGNNDRNKFKKVEMPIFNGDDLDSWLFCAERYFLIHKLTESEKITVSTISFEGPTLNWYRSQEEREK------------------------
        + E K  ++E  NDR+KFKKVEMP+F G+D +SWLF AERYF IHKLTESEK+ VSTI F+GP LNWYRSQEEREK                        
Subjt:  NNEEKKPNEEGNNDRNKFKKVEMPIFNGDDLDSWLFCAERYFLIHKLTESEKITVSTISFEGPTLNWYRSQEEREK------------------------

Query:  -RFLRIQQTTTMDEYQNLFDK---------------------WRWISVS------QKDARMIRIAQKIENREDIRREANLPRYSGGR-TTNPQNSTKTNT
         RFLRIQQ TT++EY+N FDK                     + WI         +  A  +  AQ +E+RE +R  ANL  Y GG+ +       K + 
Subjt:  -RFLRIQQTTTMDEYQNLFDK---------------------WRWISVS------QKDARMIRIAQKIENREDIRREANLPRYSGGR-TTNPQNSTKTNT

Query:  NVNTGENKGGMNWPMRTITLRGTSKEEVPKEGPTKRLSDAEFQARKEKGLCFRCNEKYSHDHKCKAKELRELRMLVVKEDNEEYEIIEDGNNALKELNTI
             E+K   ++P+RTITL+  +  E+ KEG +KRL DAEFQ RKEKGLCF+CNEKYS DHKCK KE RELRM VVK DNEE EI+E+      E+   
Subjt:  NVNTGENKGGMNWPMRTITLRGTSKEEVPKEGPTKRLSDAEFQARKEKGLCFRCNEKYSHDHKCKAKELRELRMLVVKEDNEEYEIIEDGNNALKELNTI

Query:  EIMEEGQAIVELSMNSVVGLDNLGTMEVKGTIRGREIVVLIDCGATHNFVSERLVKELQLETKETSNYGVILGFGTAIKGKGICGAVELMLGDWKVIDEF
        E+       VELS+NSVVGL++ GTM+VKG+++G+E+V+LIDCGATHNFVSE++V  LQL  KET++YGVILG GTAI+GKGIC +VE+ + +W V ++F
Subjt:  EIMEEGQAIVELSMNSVVGLDNLGTMEVKGTIRGREIVVLIDCGATHNFVSERLVKELQLETKETSNYGVILGFGTAIKGKGICGAVELMLGDWKVIDEF

Query:  LPLELGGVDAILNMQWLYSLGIT-----------------------EALERCDLS--------EEED---LIE-------EILTE--------EESVAVV
        LPLELGGVD IL MQWLYSLG+T                        +L +  +S        EE D   LIE       E+ T         ++ +  +
Subjt:  LPLELGGVDAILNMQWLYSLGIT-----------------------EALERCDLS--------EEED---LIE-------EILTE--------EESVAVV

Query:  LKRFEDVFEWPETLPPWRIIEHHIHLKEGVNPVNVRPYRYAYQQKNEMEK---------------------------------------ALNNVTVPDKF
        L +F DVFEWPE LPP R IEH IHLKEG NPVNVRPYRYAY QK EMEK                                       ALNNVTVPDKF
Subjt:  LKRFEDVFEWPETLPPWRIIEHHIHLKEGVNPVNVRPYRYAYQQKNEMEK---------------------------------------ALNNVTVPDKF

Query:  PIPVIEELFDELDGAVIFTKIDLKAGYHQIRMCADDIEKTAFRTHEGHYEFMVMPFGLTNAPSTFQSLMNAIFKPYLIKFVLVFFDDILIYSKNLKAHLD
        PIPV+EELFDEL GA +FTKIDLKAGYHQIRM   DIEKTAFRTHEGHYEF+VMPFGLTNAP+TFQSLMN+IF+PYL KFVLVFFDDILIYS+N + HL 
Subjt:  PIPVIEELFDELDGAVIFTKIDLKAGYHQIRMCADDIEKTAFRTHEGHYEFMVMPFGLTNAPSTFQSLMNAIFKPYLIKFVLVFFDDILIYSKNLKAHLD

Query:  HIKAVLEVLRKNELYANKKKCSFAKARVEYLGHIISGGGVEVDPEKIRAIKEWPIPTNVREVRGFLGLTGYYRKFVKHYGTIAAPLTQLLKKGGFRWTEE
        H++ V  VLRK+EL+AN+KKCSF  A+VEYLGH+IS  GVEVDPEKI+AI  WP PTNVRE RGFLGLTGYYRKFV HYGT+AAPLTQLLKKGGF+W  E
Subjt:  HIKAVLEVLRKNELYANKKKCSFAKARVEYLGHIISGGGVEVDPEKIRAIKEWPIPTNVREVRGFLGLTGYYRKFVKHYGTIAAPLTQLLKKGGFRWTEE

Query:  TQEAFVKLQMTMMTLPMLAMPDFSIPFEIETDASGYGLGAVLIQNQRPIAYYSHTLAVRDRVKPVYERELMAVVMTVQRWRPYLLGKRFKVKTDPRSLKF
         ++AF KL+  M+ LP+LA+P F  PFEIETDASGYG+GAVLIQN+RPIA+YSHTLA RDR +PVYERELMAVV+ VQRWRPYLLG RF V+TD +SLKF
Subjt:  TQEAFVKLQMTMMTLPMLAMPDFSIPFEIETDASGYGLGAVLIQNQRPIAYYSHTLAVRDRVKPVYERELMAVVMTVQRWRPYLLGKRFKVKTDPRSLKF

Query:  FLEQRVIQPQYQKWISKLLGYSFEVIYKPGLENKAADALSRIPPTVHLNQLTAPNIIDVAVIKEEVNQDERLKKIRGELEEKGEQQPSKYSMKQGMLMYK
         LEQRV+QPQYQ+W++KLLGY+F+V YKPG+ENKAADALSR+ PT+  + +T P  +D+ VIKEEV +D RL KI   L    +QQ +K+++  GML YK
Subjt:  FLEQRVIQPQYQKWISKLLGYSFEVIYKPGLENKAADALSRIPPTVHLNQLTAPNIIDVAVIKEEVNQDERLKKIRGELEEKGEQQPSKYSMKQGMLMYK

Query:  NRMVISKTSKLIAMILHTFHNSVFGGHSGFLRTYKRLTGELYWEGMKQYVKKYCEECMICQRNKTLALSPAGLLTPLEVPNRVWEEISMDFVDGLPKENG
        +R+VIS++SKLI  +LH++H+S  GGHSGFLRTYKR+ GELYW+GMK  +KKYC EC+ICQRNKTL LSPAGLL PL +P  +W +ISMDFV+GLPK  G
Subjt:  NRMVISKTSKLIAMILHTFHNSVFGGHSGFLRTYKRLTGELYWEGMKQYVKKYCEECMICQRNKTLALSPAGLLTPLEVPNRVWEEISMDFVDGLPKENG

Query:  FEVIFVVVDRFRLSVNMVTFLPMKHPYNAKSLAELFVKEVVRLHDFPKSIVSDIDKVFLSSFWKELFRLAGTRLNHSTAYHPQSDGQTEVVNRGVAIYLL
        FEVIFVVVDR         FLP+KHPY+AK++A+LFVKEVVRLH FP SIVSD D+VFLS+FWKE+FRLAGT+LN S+AYHPQSDGQTEVVNRGV +YL 
Subjt:  FEVIFVVVDRFRLSVNMVTFLPMKHPYNAKSLAELFVKEVVRLHDFPKSIVSDIDKVFLSSFWKELFRLAGTRLNHSTAYHPQSDGQTEVVNRGVAIYLL

Query:  CFCGEKLKEWVKWIPWAKYWYNTTYQRSLGITPFQAVYGRLPPPLIYYGVRDTSNSTLDEQLKERDVALDALKEHLQVAQDKMKKYADLKRRDVQ-----
        C C +K KEW+KWI WA+YWYNTT+QR+LG+TPFQ VYGR PPPL+ YG + TSN+TLDEQL+ERD  + +L+EHL++AQD+MKK AD KRRDV+     
Subjt:  CFCGEKLKEWVKWIPWAKYWYNTTYQRSLGITPFQAVYGRLPPPLIYYGVRDTSNSTLDEQLKERDVALDALKEHLQVAQDKMKKYADLKRRDVQ-----

Query:  -------PYRQVSLRRKRNEKLSAKYFGPYKVIERVGPVAYKLELPAQAAIHPVFHEN------------CDDIPYLSENYEWRAMPEEVYGYLKNKAGS
               PYRQ+SLRRKRNEKLSAKYFGPYK++ER+GPVAYKLELP    IHPVFH +               +  L EN+ W   P E   Y +NKA  
Subjt:  -------PYRQVSLRRKRNEKLSAKYFGPYKVIERVGPVAYKLELPAQAAIHPVFHEN------------CDDIPYLSENYEWRAMPEEVYGYLKNKAGS

Query:  WDVLVSWKGLPRHEATWELYEDLQQRFPDFHLEDKVNLKRESNDRPPILYQYTRKGKK
        W+V++ W+GL  HEATWE Y+D+  ++P+FHLEDKV+L+  SN RPPIL+QY+RK K+
Subjt:  WDVLVSWKGLPRHEATWELYEDLQQRFPDFHLEDKVNLKRESNDRPPILYQYTRKGKK

SwissProt top hitse value%identityAlignment
P0CT34 Transposon Tf2-1 polyprotein1.1e-11329.48Show/hide
Query:  IVVLIDCGATHNFVSERLVKELQLETKETSNYGVILG--FGTAIKGKGICGAVELMLGDWKVIDEFLPLELGGVDAILNMQWLYSLGITEALERCDLSEE
        I  LID GA  N ++E  V+  +L T+  S   VI G  +   I  K I   + + L    +  EFL ++     A ++   LY   I  +  +  LS+ 
Subjt:  IVVLIDCGATHNFVSERLVKELQLETKETSNYGVILG--FGTAIKGKGICGAVELMLGDWKVIDEFLPLELGGVDAILNMQWLYSLGITEALERCDLSEE

Query:  EDLIEEILTEEESVAVVLKRFEDV-----------------FEWPET------------LPPWRIIEHHIHLKEGV--------NPVNVRPYRYAYQQKN
          +    + +E  +  + K F+D+                 FE   T            LPP ++   +  + +G+          +N  P  +  +++ 
Subjt:  EDLIEEILTEEESVAVVLKRFEDV-----------------FEWPET------------LPPWRIIEHHIHLKEGV--------NPVNVRPYRYAYQQKN

Query:  EME-----KALNNVTVPDKFPIPVIEELFDELDGAVIFTKIDLKAGYHQIRMCADDIEKTAFRTHEGHYEFMVMPFGLTNAPSTFQSLMNAIFKPYLIKF
         +      K LN    P+ +P+P+IE+L  ++ G+ IFTK+DLK+ YH IR+   D  K AFR   G +E++VMP+G++ AP+ FQ  +N I        
Subjt:  EME-----KALNNVTVPDKFPIPVIEELFDELDGAVIFTKIDLKAGYHQIRMCADDIEKTAFRTHEGHYEFMVMPFGLTNAPSTFQSLMNAIFKPYLIKF

Query:  VLVFFDDILIYSKNLKAHLDHIKAVLEVLRKNELYANKKKCSFAKARVEYLGHIISGGGVEVDPEKIRAIKEWPIPTNVREVRGFLGLTGYYRKFVKHYG
        V+ + DDILI+SK+   H+ H+K VL+ L+   L  N+ KC F +++V+++G+ IS  G     E I  + +W  P N +E+R FLG   Y RKF+    
Subjt:  VLVFFDDILIYSKNLKAHLDHIKAVLEVLRKNELYANKKKCSFAKARVEYLGHIISGGGVEVDPEKIRAIKEWPIPTNVREVRGFLGLTGYYRKFVKHYG

Query:  TIAAPLTQLLKKG-GFRWTEETQEAFVKLQMTMMTLPMLAMPDFSIPFEIETDASGYGLGAVLIQNQ-----RPIAYYSHTLAVRDRVKPVYERELMAVV
         +  PL  LLKK   ++WT    +A   ++  +++ P+L   DFS    +ETDAS   +GAVL Q        P+ YYS  ++       V ++E++A++
Subjt:  TIAAPLTQLLKKG-GFRWTEETQEAFVKLQMTMMTLPMLAMPDFSIPFEIETDASGYGLGAVLIQNQ-----RPIAYYSHTLAVRDRVKPVYERELMAVV

Query:  MTVQRWRPYLLG--KRFKVKTDPRSL--KFFLEQRVIQPQYQKWISKLLGYSFEVIYKPGLENKAADALSRIPPTVH-------------LNQLTAPNII
         +++ WR YL    + FK+ TD R+L  +   E      +  +W   L  ++FE+ Y+PG  N  ADALSRI                  +NQ++  +  
Subjt:  MTVQRWRPYLLG--KRFKVKTDPRSL--KFFLEQRVIQPQYQKWISKLLGYSFEVIYKPGLENKAADALSRIPPTVH-------------LNQLTAPNII

Query:  DVAVIKEEVNQDERLKKIRGELEEKGEQQPSKYSMKQGMLM-YKNRMVISKTSKLIAMILHTFHNSVFGGHSGFLRTYKRLTGELYWEGMKQYVKKYCEE
           V+ E  N  + L  +  E     ++      +K G+L+  K+++++   ++L   I+  +H      H G       +     W+G+++ +++Y + 
Subjt:  DVAVIKEEVNQDERLKKIRGELEEKGEQQPSKYSMKQGMLM-YKNRMVISKTSKLIAMILHTFHNSVFGGHSGFLRTYKRLTGELYWEGMKQYVKKYCEE

Query:  CMICQRNKTLALSPAGLLTPLEVPNRVWEEISMDFVDGLPKENGFEVIFVVVDRFRLSVNMVTFLPMKHPYNAKSLAELFVKEVVRLHDFPKSIVSDIDK
        C  CQ NK+    P G L P+    R WE +SMDF+  LP+ +G+  +FVVVDRF     M   +P      A+  A +F + V+     PK I++D D 
Subjt:  CMICQRNKTLALSPAGLLTPLEVPNRVWEEISMDFVDGLPKENGFEVIFVVVDRFRLSVNMVTFLPMKHPYNAKSLAELFVKEVVRLHDFPKSIVSDIDK

Query:  VFLSSFWKELFRLAGTRLNHSTAYHPQSDGQTEVVNRGVAIYLLCFCGEKLKEWVKWIPWAKYWYNTTYQRSLGITPFQAVYGRLPPPLIYYGVRDTSNS
        +F S  WK+        +  S  Y PQ+DGQTE  N+ V   L C C      WV  I   +  YN     +  +TPF+ V+ R  P L    +   S+ 
Subjt:  VFLSSFWKELFRLAGTRLNHSTAYHPQSDGQTEVVNRGVAIYLLCFCGEKLKEWVKWIPWAKYWYNTTYQRSLGITPFQAVYGRLPPPLIYYGVRDTSNS

Query:  TLDEQLKERDVALDALKEHLQVAQDKMKKYADLKRRDV---QPYRQVSLRR------KRNEKLSAKYFGPYKVIERVGPVAYKLELP
        T DE  +E       +KEHL     KMKKY D+K +++   QP   V ++R       ++ KL+  + GP+ V+++ GP  Y+L+LP
Subjt:  TLDEQLKERDVALDALKEHLQVAQDKMKKYADLKRRDV---QPYRQVSLRR------KRNEKLSAKYFGPYKVIERVGPVAYKLELP

P0CT35 Transposon Tf2-2 polyprotein1.1e-11329.48Show/hide
Query:  IVVLIDCGATHNFVSERLVKELQLETKETSNYGVILG--FGTAIKGKGICGAVELMLGDWKVIDEFLPLELGGVDAILNMQWLYSLGITEALERCDLSEE
        I  LID GA  N ++E  V+  +L T+  S   VI G  +   I  K I   + + L    +  EFL ++     A ++   LY   I  +  +  LS+ 
Subjt:  IVVLIDCGATHNFVSERLVKELQLETKETSNYGVILG--FGTAIKGKGICGAVELMLGDWKVIDEFLPLELGGVDAILNMQWLYSLGITEALERCDLSEE

Query:  EDLIEEILTEEESVAVVLKRFEDV-----------------FEWPET------------LPPWRIIEHHIHLKEGV--------NPVNVRPYRYAYQQKN
          +    + +E  +  + K F+D+                 FE   T            LPP ++   +  + +G+          +N  P  +  +++ 
Subjt:  EDLIEEILTEEESVAVVLKRFEDV-----------------FEWPET------------LPPWRIIEHHIHLKEGV--------NPVNVRPYRYAYQQKN

Query:  EME-----KALNNVTVPDKFPIPVIEELFDELDGAVIFTKIDLKAGYHQIRMCADDIEKTAFRTHEGHYEFMVMPFGLTNAPSTFQSLMNAIFKPYLIKF
         +      K LN    P+ +P+P+IE+L  ++ G+ IFTK+DLK+ YH IR+   D  K AFR   G +E++VMP+G++ AP+ FQ  +N I        
Subjt:  EME-----KALNNVTVPDKFPIPVIEELFDELDGAVIFTKIDLKAGYHQIRMCADDIEKTAFRTHEGHYEFMVMPFGLTNAPSTFQSLMNAIFKPYLIKF

Query:  VLVFFDDILIYSKNLKAHLDHIKAVLEVLRKNELYANKKKCSFAKARVEYLGHIISGGGVEVDPEKIRAIKEWPIPTNVREVRGFLGLTGYYRKFVKHYG
        V+ + DDILI+SK+   H+ H+K VL+ L+   L  N+ KC F +++V+++G+ IS  G     E I  + +W  P N +E+R FLG   Y RKF+    
Subjt:  VLVFFDDILIYSKNLKAHLDHIKAVLEVLRKNELYANKKKCSFAKARVEYLGHIISGGGVEVDPEKIRAIKEWPIPTNVREVRGFLGLTGYYRKFVKHYG

Query:  TIAAPLTQLLKKG-GFRWTEETQEAFVKLQMTMMTLPMLAMPDFSIPFEIETDASGYGLGAVLIQNQ-----RPIAYYSHTLAVRDRVKPVYERELMAVV
         +  PL  LLKK   ++WT    +A   ++  +++ P+L   DFS    +ETDAS   +GAVL Q        P+ YYS  ++       V ++E++A++
Subjt:  TIAAPLTQLLKKG-GFRWTEETQEAFVKLQMTMMTLPMLAMPDFSIPFEIETDASGYGLGAVLIQNQ-----RPIAYYSHTLAVRDRVKPVYERELMAVV

Query:  MTVQRWRPYLLG--KRFKVKTDPRSL--KFFLEQRVIQPQYQKWISKLLGYSFEVIYKPGLENKAADALSRIPPTVH-------------LNQLTAPNII
         +++ WR YL    + FK+ TD R+L  +   E      +  +W   L  ++FE+ Y+PG  N  ADALSRI                  +NQ++  +  
Subjt:  MTVQRWRPYLLG--KRFKVKTDPRSL--KFFLEQRVIQPQYQKWISKLLGYSFEVIYKPGLENKAADALSRIPPTVH-------------LNQLTAPNII

Query:  DVAVIKEEVNQDERLKKIRGELEEKGEQQPSKYSMKQGMLM-YKNRMVISKTSKLIAMILHTFHNSVFGGHSGFLRTYKRLTGELYWEGMKQYVKKYCEE
           V+ E  N  + L  +  E     ++      +K G+L+  K+++++   ++L   I+  +H      H G       +     W+G+++ +++Y + 
Subjt:  DVAVIKEEVNQDERLKKIRGELEEKGEQQPSKYSMKQGMLM-YKNRMVISKTSKLIAMILHTFHNSVFGGHSGFLRTYKRLTGELYWEGMKQYVKKYCEE

Query:  CMICQRNKTLALSPAGLLTPLEVPNRVWEEISMDFVDGLPKENGFEVIFVVVDRFRLSVNMVTFLPMKHPYNAKSLAELFVKEVVRLHDFPKSIVSDIDK
        C  CQ NK+    P G L P+    R WE +SMDF+  LP+ +G+  +FVVVDRF     M   +P      A+  A +F + V+     PK I++D D 
Subjt:  CMICQRNKTLALSPAGLLTPLEVPNRVWEEISMDFVDGLPKENGFEVIFVVVDRFRLSVNMVTFLPMKHPYNAKSLAELFVKEVVRLHDFPKSIVSDIDK

Query:  VFLSSFWKELFRLAGTRLNHSTAYHPQSDGQTEVVNRGVAIYLLCFCGEKLKEWVKWIPWAKYWYNTTYQRSLGITPFQAVYGRLPPPLIYYGVRDTSNS
        +F S  WK+        +  S  Y PQ+DGQTE  N+ V   L C C      WV  I   +  YN     +  +TPF+ V+ R  P L    +   S+ 
Subjt:  VFLSSFWKELFRLAGTRLNHSTAYHPQSDGQTEVVNRGVAIYLLCFCGEKLKEWVKWIPWAKYWYNTTYQRSLGITPFQAVYGRLPPPLIYYGVRDTSNS

Query:  TLDEQLKERDVALDALKEHLQVAQDKMKKYADLKRRDV---QPYRQVSLRR------KRNEKLSAKYFGPYKVIERVGPVAYKLELP
        T DE  +E       +KEHL     KMKKY D+K +++   QP   V ++R       ++ KL+  + GP+ V+++ GP  Y+L+LP
Subjt:  TLDEQLKERDVALDALKEHLQVAQDKMKKYADLKRRDV---QPYRQVSLRR------KRNEKLSAKYFGPYKVIERVGPVAYKLELP

P0CT36 Transposon Tf2-3 polyprotein1.1e-11329.48Show/hide
Query:  IVVLIDCGATHNFVSERLVKELQLETKETSNYGVILG--FGTAIKGKGICGAVELMLGDWKVIDEFLPLELGGVDAILNMQWLYSLGITEALERCDLSEE
        I  LID GA  N ++E  V+  +L T+  S   VI G  +   I  K I   + + L    +  EFL ++     A ++   LY   I  +  +  LS+ 
Subjt:  IVVLIDCGATHNFVSERLVKELQLETKETSNYGVILG--FGTAIKGKGICGAVELMLGDWKVIDEFLPLELGGVDAILNMQWLYSLGITEALERCDLSEE

Query:  EDLIEEILTEEESVAVVLKRFEDV-----------------FEWPET------------LPPWRIIEHHIHLKEGV--------NPVNVRPYRYAYQQKN
          +    + +E  +  + K F+D+                 FE   T            LPP ++   +  + +G+          +N  P  +  +++ 
Subjt:  EDLIEEILTEEESVAVVLKRFEDV-----------------FEWPET------------LPPWRIIEHHIHLKEGV--------NPVNVRPYRYAYQQKN

Query:  EME-----KALNNVTVPDKFPIPVIEELFDELDGAVIFTKIDLKAGYHQIRMCADDIEKTAFRTHEGHYEFMVMPFGLTNAPSTFQSLMNAIFKPYLIKF
         +      K LN    P+ +P+P+IE+L  ++ G+ IFTK+DLK+ YH IR+   D  K AFR   G +E++VMP+G++ AP+ FQ  +N I        
Subjt:  EME-----KALNNVTVPDKFPIPVIEELFDELDGAVIFTKIDLKAGYHQIRMCADDIEKTAFRTHEGHYEFMVMPFGLTNAPSTFQSLMNAIFKPYLIKF

Query:  VLVFFDDILIYSKNLKAHLDHIKAVLEVLRKNELYANKKKCSFAKARVEYLGHIISGGGVEVDPEKIRAIKEWPIPTNVREVRGFLGLTGYYRKFVKHYG
        V+ + DDILI+SK+   H+ H+K VL+ L+   L  N+ KC F +++V+++G+ IS  G     E I  + +W  P N +E+R FLG   Y RKF+    
Subjt:  VLVFFDDILIYSKNLKAHLDHIKAVLEVLRKNELYANKKKCSFAKARVEYLGHIISGGGVEVDPEKIRAIKEWPIPTNVREVRGFLGLTGYYRKFVKHYG

Query:  TIAAPLTQLLKKG-GFRWTEETQEAFVKLQMTMMTLPMLAMPDFSIPFEIETDASGYGLGAVLIQNQ-----RPIAYYSHTLAVRDRVKPVYERELMAVV
         +  PL  LLKK   ++WT    +A   ++  +++ P+L   DFS    +ETDAS   +GAVL Q        P+ YYS  ++       V ++E++A++
Subjt:  TIAAPLTQLLKKG-GFRWTEETQEAFVKLQMTMMTLPMLAMPDFSIPFEIETDASGYGLGAVLIQNQ-----RPIAYYSHTLAVRDRVKPVYERELMAVV

Query:  MTVQRWRPYLLG--KRFKVKTDPRSL--KFFLEQRVIQPQYQKWISKLLGYSFEVIYKPGLENKAADALSRIPPTVH-------------LNQLTAPNII
         +++ WR YL    + FK+ TD R+L  +   E      +  +W   L  ++FE+ Y+PG  N  ADALSRI                  +NQ++  +  
Subjt:  MTVQRWRPYLLG--KRFKVKTDPRSL--KFFLEQRVIQPQYQKWISKLLGYSFEVIYKPGLENKAADALSRIPPTVH-------------LNQLTAPNII

Query:  DVAVIKEEVNQDERLKKIRGELEEKGEQQPSKYSMKQGMLM-YKNRMVISKTSKLIAMILHTFHNSVFGGHSGFLRTYKRLTGELYWEGMKQYVKKYCEE
           V+ E  N  + L  +  E     ++      +K G+L+  K+++++   ++L   I+  +H      H G       +     W+G+++ +++Y + 
Subjt:  DVAVIKEEVNQDERLKKIRGELEEKGEQQPSKYSMKQGMLM-YKNRMVISKTSKLIAMILHTFHNSVFGGHSGFLRTYKRLTGELYWEGMKQYVKKYCEE

Query:  CMICQRNKTLALSPAGLLTPLEVPNRVWEEISMDFVDGLPKENGFEVIFVVVDRFRLSVNMVTFLPMKHPYNAKSLAELFVKEVVRLHDFPKSIVSDIDK
        C  CQ NK+    P G L P+    R WE +SMDF+  LP+ +G+  +FVVVDRF     M   +P      A+  A +F + V+     PK I++D D 
Subjt:  CMICQRNKTLALSPAGLLTPLEVPNRVWEEISMDFVDGLPKENGFEVIFVVVDRFRLSVNMVTFLPMKHPYNAKSLAELFVKEVVRLHDFPKSIVSDIDK

Query:  VFLSSFWKELFRLAGTRLNHSTAYHPQSDGQTEVVNRGVAIYLLCFCGEKLKEWVKWIPWAKYWYNTTYQRSLGITPFQAVYGRLPPPLIYYGVRDTSNS
        +F S  WK+        +  S  Y PQ+DGQTE  N+ V   L C C      WV  I   +  YN     +  +TPF+ V+ R  P L    +   S+ 
Subjt:  VFLSSFWKELFRLAGTRLNHSTAYHPQSDGQTEVVNRGVAIYLLCFCGEKLKEWVKWIPWAKYWYNTTYQRSLGITPFQAVYGRLPPPLIYYGVRDTSNS

Query:  TLDEQLKERDVALDALKEHLQVAQDKMKKYADLKRRDV---QPYRQVSLRR------KRNEKLSAKYFGPYKVIERVGPVAYKLELP
        T DE  +E       +KEHL     KMKKY D+K +++   QP   V ++R       ++ KL+  + GP+ V+++ GP  Y+L+LP
Subjt:  TLDEQLKERDVALDALKEHLQVAQDKMKKYADLKRRDV---QPYRQVSLRR------KRNEKLSAKYFGPYKVIERVGPVAYKLELP

P0CT37 Transposon Tf2-4 polyprotein1.1e-11329.48Show/hide
Query:  IVVLIDCGATHNFVSERLVKELQLETKETSNYGVILG--FGTAIKGKGICGAVELMLGDWKVIDEFLPLELGGVDAILNMQWLYSLGITEALERCDLSEE
        I  LID GA  N ++E  V+  +L T+  S   VI G  +   I  K I   + + L    +  EFL ++     A ++   LY   I  +  +  LS+ 
Subjt:  IVVLIDCGATHNFVSERLVKELQLETKETSNYGVILG--FGTAIKGKGICGAVELMLGDWKVIDEFLPLELGGVDAILNMQWLYSLGITEALERCDLSEE

Query:  EDLIEEILTEEESVAVVLKRFEDV-----------------FEWPET------------LPPWRIIEHHIHLKEGV--------NPVNVRPYRYAYQQKN
          +    + +E  +  + K F+D+                 FE   T            LPP ++   +  + +G+          +N  P  +  +++ 
Subjt:  EDLIEEILTEEESVAVVLKRFEDV-----------------FEWPET------------LPPWRIIEHHIHLKEGV--------NPVNVRPYRYAYQQKN

Query:  EME-----KALNNVTVPDKFPIPVIEELFDELDGAVIFTKIDLKAGYHQIRMCADDIEKTAFRTHEGHYEFMVMPFGLTNAPSTFQSLMNAIFKPYLIKF
         +      K LN    P+ +P+P+IE+L  ++ G+ IFTK+DLK+ YH IR+   D  K AFR   G +E++VMP+G++ AP+ FQ  +N I        
Subjt:  EME-----KALNNVTVPDKFPIPVIEELFDELDGAVIFTKIDLKAGYHQIRMCADDIEKTAFRTHEGHYEFMVMPFGLTNAPSTFQSLMNAIFKPYLIKF

Query:  VLVFFDDILIYSKNLKAHLDHIKAVLEVLRKNELYANKKKCSFAKARVEYLGHIISGGGVEVDPEKIRAIKEWPIPTNVREVRGFLGLTGYYRKFVKHYG
        V+ + DDILI+SK+   H+ H+K VL+ L+   L  N+ KC F +++V+++G+ IS  G     E I  + +W  P N +E+R FLG   Y RKF+    
Subjt:  VLVFFDDILIYSKNLKAHLDHIKAVLEVLRKNELYANKKKCSFAKARVEYLGHIISGGGVEVDPEKIRAIKEWPIPTNVREVRGFLGLTGYYRKFVKHYG

Query:  TIAAPLTQLLKKG-GFRWTEETQEAFVKLQMTMMTLPMLAMPDFSIPFEIETDASGYGLGAVLIQNQ-----RPIAYYSHTLAVRDRVKPVYERELMAVV
         +  PL  LLKK   ++WT    +A   ++  +++ P+L   DFS    +ETDAS   +GAVL Q        P+ YYS  ++       V ++E++A++
Subjt:  TIAAPLTQLLKKG-GFRWTEETQEAFVKLQMTMMTLPMLAMPDFSIPFEIETDASGYGLGAVLIQNQ-----RPIAYYSHTLAVRDRVKPVYERELMAVV

Query:  MTVQRWRPYLLG--KRFKVKTDPRSL--KFFLEQRVIQPQYQKWISKLLGYSFEVIYKPGLENKAADALSRIPPTVH-------------LNQLTAPNII
         +++ WR YL    + FK+ TD R+L  +   E      +  +W   L  ++FE+ Y+PG  N  ADALSRI                  +NQ++  +  
Subjt:  MTVQRWRPYLLG--KRFKVKTDPRSL--KFFLEQRVIQPQYQKWISKLLGYSFEVIYKPGLENKAADALSRIPPTVH-------------LNQLTAPNII

Query:  DVAVIKEEVNQDERLKKIRGELEEKGEQQPSKYSMKQGMLM-YKNRMVISKTSKLIAMILHTFHNSVFGGHSGFLRTYKRLTGELYWEGMKQYVKKYCEE
           V+ E  N  + L  +  E     ++      +K G+L+  K+++++   ++L   I+  +H      H G       +     W+G+++ +++Y + 
Subjt:  DVAVIKEEVNQDERLKKIRGELEEKGEQQPSKYSMKQGMLM-YKNRMVISKTSKLIAMILHTFHNSVFGGHSGFLRTYKRLTGELYWEGMKQYVKKYCEE

Query:  CMICQRNKTLALSPAGLLTPLEVPNRVWEEISMDFVDGLPKENGFEVIFVVVDRFRLSVNMVTFLPMKHPYNAKSLAELFVKEVVRLHDFPKSIVSDIDK
        C  CQ NK+    P G L P+    R WE +SMDF+  LP+ +G+  +FVVVDRF     M   +P      A+  A +F + V+     PK I++D D 
Subjt:  CMICQRNKTLALSPAGLLTPLEVPNRVWEEISMDFVDGLPKENGFEVIFVVVDRFRLSVNMVTFLPMKHPYNAKSLAELFVKEVVRLHDFPKSIVSDIDK

Query:  VFLSSFWKELFRLAGTRLNHSTAYHPQSDGQTEVVNRGVAIYLLCFCGEKLKEWVKWIPWAKYWYNTTYQRSLGITPFQAVYGRLPPPLIYYGVRDTSNS
        +F S  WK+        +  S  Y PQ+DGQTE  N+ V   L C C      WV  I   +  YN     +  +TPF+ V+ R  P L    +   S+ 
Subjt:  VFLSSFWKELFRLAGTRLNHSTAYHPQSDGQTEVVNRGVAIYLLCFCGEKLKEWVKWIPWAKYWYNTTYQRSLGITPFQAVYGRLPPPLIYYGVRDTSNS

Query:  TLDEQLKERDVALDALKEHLQVAQDKMKKYADLKRRDV---QPYRQVSLRR------KRNEKLSAKYFGPYKVIERVGPVAYKLELP
        T DE  +E       +KEHL     KMKKY D+K +++   QP   V ++R       ++ KL+  + GP+ V+++ GP  Y+L+LP
Subjt:  TLDEQLKERDVALDALKEHLQVAQDKMKKYADLKRRDV---QPYRQVSLRR------KRNEKLSAKYFGPYKVIERVGPVAYKLELP

P0CT41 Transposon Tf2-12 polyprotein1.1e-11329.48Show/hide
Query:  IVVLIDCGATHNFVSERLVKELQLETKETSNYGVILG--FGTAIKGKGICGAVELMLGDWKVIDEFLPLELGGVDAILNMQWLYSLGITEALERCDLSEE
        I  LID GA  N ++E  V+  +L T+  S   VI G  +   I  K I   + + L    +  EFL ++     A ++   LY   I  +  +  LS+ 
Subjt:  IVVLIDCGATHNFVSERLVKELQLETKETSNYGVILG--FGTAIKGKGICGAVELMLGDWKVIDEFLPLELGGVDAILNMQWLYSLGITEALERCDLSEE

Query:  EDLIEEILTEEESVAVVLKRFEDV-----------------FEWPET------------LPPWRIIEHHIHLKEGV--------NPVNVRPYRYAYQQKN
          +    + +E  +  + K F+D+                 FE   T            LPP ++   +  + +G+          +N  P  +  +++ 
Subjt:  EDLIEEILTEEESVAVVLKRFEDV-----------------FEWPET------------LPPWRIIEHHIHLKEGV--------NPVNVRPYRYAYQQKN

Query:  EME-----KALNNVTVPDKFPIPVIEELFDELDGAVIFTKIDLKAGYHQIRMCADDIEKTAFRTHEGHYEFMVMPFGLTNAPSTFQSLMNAIFKPYLIKF
         +      K LN    P+ +P+P+IE+L  ++ G+ IFTK+DLK+ YH IR+   D  K AFR   G +E++VMP+G++ AP+ FQ  +N I        
Subjt:  EME-----KALNNVTVPDKFPIPVIEELFDELDGAVIFTKIDLKAGYHQIRMCADDIEKTAFRTHEGHYEFMVMPFGLTNAPSTFQSLMNAIFKPYLIKF

Query:  VLVFFDDILIYSKNLKAHLDHIKAVLEVLRKNELYANKKKCSFAKARVEYLGHIISGGGVEVDPEKIRAIKEWPIPTNVREVRGFLGLTGYYRKFVKHYG
        V+ + DDILI+SK+   H+ H+K VL+ L+   L  N+ KC F +++V+++G+ IS  G     E I  + +W  P N +E+R FLG   Y RKF+    
Subjt:  VLVFFDDILIYSKNLKAHLDHIKAVLEVLRKNELYANKKKCSFAKARVEYLGHIISGGGVEVDPEKIRAIKEWPIPTNVREVRGFLGLTGYYRKFVKHYG

Query:  TIAAPLTQLLKKG-GFRWTEETQEAFVKLQMTMMTLPMLAMPDFSIPFEIETDASGYGLGAVLIQNQ-----RPIAYYSHTLAVRDRVKPVYERELMAVV
         +  PL  LLKK   ++WT    +A   ++  +++ P+L   DFS    +ETDAS   +GAVL Q        P+ YYS  ++       V ++E++A++
Subjt:  TIAAPLTQLLKKG-GFRWTEETQEAFVKLQMTMMTLPMLAMPDFSIPFEIETDASGYGLGAVLIQNQ-----RPIAYYSHTLAVRDRVKPVYERELMAVV

Query:  MTVQRWRPYLLG--KRFKVKTDPRSL--KFFLEQRVIQPQYQKWISKLLGYSFEVIYKPGLENKAADALSRIPPTVH-------------LNQLTAPNII
         +++ WR YL    + FK+ TD R+L  +   E      +  +W   L  ++FE+ Y+PG  N  ADALSRI                  +NQ++  +  
Subjt:  MTVQRWRPYLLG--KRFKVKTDPRSL--KFFLEQRVIQPQYQKWISKLLGYSFEVIYKPGLENKAADALSRIPPTVH-------------LNQLTAPNII

Query:  DVAVIKEEVNQDERLKKIRGELEEKGEQQPSKYSMKQGMLM-YKNRMVISKTSKLIAMILHTFHNSVFGGHSGFLRTYKRLTGELYWEGMKQYVKKYCEE
           V+ E  N  + L  +  E     ++      +K G+L+  K+++++   ++L   I+  +H      H G       +     W+G+++ +++Y + 
Subjt:  DVAVIKEEVNQDERLKKIRGELEEKGEQQPSKYSMKQGMLM-YKNRMVISKTSKLIAMILHTFHNSVFGGHSGFLRTYKRLTGELYWEGMKQYVKKYCEE

Query:  CMICQRNKTLALSPAGLLTPLEVPNRVWEEISMDFVDGLPKENGFEVIFVVVDRFRLSVNMVTFLPMKHPYNAKSLAELFVKEVVRLHDFPKSIVSDIDK
        C  CQ NK+    P G L P+    R WE +SMDF+  LP+ +G+  +FVVVDRF     M   +P      A+  A +F + V+     PK I++D D 
Subjt:  CMICQRNKTLALSPAGLLTPLEVPNRVWEEISMDFVDGLPKENGFEVIFVVVDRFRLSVNMVTFLPMKHPYNAKSLAELFVKEVVRLHDFPKSIVSDIDK

Query:  VFLSSFWKELFRLAGTRLNHSTAYHPQSDGQTEVVNRGVAIYLLCFCGEKLKEWVKWIPWAKYWYNTTYQRSLGITPFQAVYGRLPPPLIYYGVRDTSNS
        +F S  WK+        +  S  Y PQ+DGQTE  N+ V   L C C      WV  I   +  YN     +  +TPF+ V+ R  P L    +   S+ 
Subjt:  VFLSSFWKELFRLAGTRLNHSTAYHPQSDGQTEVVNRGVAIYLLCFCGEKLKEWVKWIPWAKYWYNTTYQRSLGITPFQAVYGRLPPPLIYYGVRDTSNS

Query:  TLDEQLKERDVALDALKEHLQVAQDKMKKYADLKRRDV---QPYRQVSLRR------KRNEKLSAKYFGPYKVIERVGPVAYKLELP
        T DE  +E       +KEHL     KMKKY D+K +++   QP   V ++R       ++ KL+  + GP+ V+++ GP  Y+L+LP
Subjt:  TLDEQLKERDVALDALKEHLQVAQDKMKKYADLKRRDV---QPYRQVSLRR------KRNEKLSAKYFGPYKVIERVGPVAYKLELP

Arabidopsis top hitse value%identityAlignment
AT3G29750.1 Eukaryotic aspartyl protease family protein8.0e-1127.49Show/hide
Query:  MRTITLRGTSKEEVPKEGPTKRLSDAEFQARKEKGLCFRCNEKYSHDHKCKAKELRELRMLVVKEDNEEYEIIEDGNNALKELNTIEIMEEGQAIVELSM
        +R++TL G   EE+  +G    L  A  +  K  G+         + ++ +  EL  L ++  K D     +++     + EL  +E  ++   + +   
Subjt:  MRTITLRGTSKEEVPKEGPTKRLSDAEFQARKEKGLCFRCNEKYSHDHKCKAKELRELRMLVVKEDNEEYEIIEDGNNALKELNTIEIMEEGQAIVELSM

Query:  NSVVGLDNLGTMEVKGTIRGREIVVLIDCGATHNFVSERLVKELQLETKETSNYGVILGFGTAIKGKGICGAVELMLGDWKVIDEFLPLELG--GVDAIL
          V+ L     M   G I   ++VV ID GAT NF+   L   L+L T  T+   V+LG    I+  G C  + L + + ++ + FL L+L    VD IL
Subjt:  NSVVGLDNLGTMEVKGTIRGREIVVLIDCGATHNFVSERLVKELQLETKETSNYGVILGFGTAIKGKGICGAVELMLGDWKVIDEFLPLELG--GVDAIL

Query:  NMQWLYSLGIT
          +WL  LG T
Subjt:  NMQWLYSLGIT

AT3G30770.1 Eukaryotic aspartyl protease family protein6.3e-0830.56Show/hide
Query:  IMEEGQAIVELSMNSVVGLDNLGTMEVKGTIRGREIVVLIDCGATHNFVSERLVKELQLETKETSNYGVILGFGTAIKGKGICGAVELMLGDWKVIDEFL
        ++E+ + I ++   S         M   G I   ++VV+ID GAT+NF+S+ L   L+L T  T+   V+LG    I+  G C  + L++ + ++ + FL
Subjt:  IMEEGQAIVELSMNSVVGLDNLGTMEVKGTIRGREIVVLIDCGATHNFVSERLVKELQLETKETSNYGVILGFGTAIKGKGICGAVELMLGDWKVIDEFL

Query:  PLELGGVD
         L+L   D
Subjt:  PLELGGVD

ATMG00860.1 DNA/RNA polymerases superfamily protein1.6e-3551.91Show/hide
Query:  LDHIKAVLEVLRKNELYANKKKCSFAKARVEYLG--HIISGGGVEVDPEKIRAIKEWPIPTNVREVRGFLGLTGYYRKFVKHYGTIAAPLTQLLKKGGFR
        ++H+  VL++  +++ YAN+KKC+F + ++ YLG  HIISG GV  DP K+ A+  WP P N  E+RGFLGLTGYYR+FVK+YG I  PLT+LLKK   +
Subjt:  LDHIKAVLEVLRKNELYANKKKCSFAKARVEYLG--HIISGGGVEVDPEKIRAIKEWPIPTNVREVRGFLGLTGYYRKFVKHYGTIAAPLTQLLKKGGFR

Query:  WTEETQEAFVKLQMTMMTLPMLAMPDFSIPF
        WTE    AF  L+  + TLP+LA+PD  +PF
Subjt:  WTEETQEAFVKLQMTMMTLPMLAMPDFSIPF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTGCAGACGAGGATTGAAGAAAAGATTGAAACGATAGACCAAGAGATTCATGGCATCAAGAAGGAGATCGGGAAACTGCTGGCGATTGAGAAAACGTTGAACAATAT
ATCGAAGAACATGGAAAGACAAAATCAATTAATATTGAGAATTATGGAATCAGCCGCACAGGAGAGAACAACGATGAACGAAAGGATATCCGAATTGTCTATACAGACTT
TTTCGGTGAAGATTATAGATGAAGGCGAAGGATCTTCGAGGCGAGAAAGCGAAACGAACAACGAAGAGAAGAAACCGAATGAAGAAGGAAACAATGATCGAAACAAATTT
AAGAAAGTGGAGATGCCTATATTCAACGGAGATGATCTCGATTCGTGGTTATTTTGCGCTGAAAGGTATTTTCTTATCCATAAACTCACTGAATCTGAAAAGATAACCGT
ATCTACTATTAGTTTCGAAGGACCGACATTGAATTGGTATCGTTCACAAGAAGAACGGGAGAAGCGATTTTTGCGTATCCAACAAACAACAACCATGGATGAATATCAAA
ATTTATTCGATAAGTGGAGATGGATTTCTGTGAGCCAAAAGGACGCCCGCATGATAAGGATAGCGCAAAAAATTGAAAACAGAGAAGATATTCGGCGTGAGGCAAATCTT
CCCAGATATTCCGGAGGAAGAACAACTAATCCACAGAATAGTACCAAGACTAATACAAATGTGAATACTGGAGAAAATAAGGGAGGAATGAATTGGCCAATGAGAACCAT
CACTTTGCGAGGGACATCGAAAGAGGAAGTTCCAAAGGAGGGTCCAACAAAACGATTGTCTGATGCTGAATTCCAAGCTCGAAAGGAGAAGGGACTGTGTTTTCGGTGTA
ATGAGAAGTATTCTCACGACCATAAATGCAAGGCTAAGGAGTTGAGGGAATTACGAATGCTGGTAGTGAAGGAGGATAATGAAGAATATGAAATCATTGAAGATGGAAAT
AATGCTTTGAAGGAGTTGAATACCATCGAAATCATGGAAGAAGGCCAAGCGATAGTAGAATTATCTATGAACTCTGTAGTTGGATTGGACAATCTAGGGACAATGGAGGT
GAAGGGAACGATACGAGGGAGAGAAATTGTGGTGTTAATAGATTGTGGAGCGACACACAACTTTGTTTCAGAACGACTTGTGAAGGAGTTACAACTGGAAACAAAAGAAA
CCTCAAACTATGGGGTAATTTTGGGTTTCGGTACTGCTATAAAAGGGAAAGGGATATGTGGAGCTGTGGAACTGATGTTAGGTGATTGGAAAGTGATTGATGAATTCTTA
CCCTTGGAGTTAGGAGGAGTAGATGCCATACTGAATATGCAGTGGCTTTATTCTTTGGGAATAACAGAAGCCTTGGAAAGATGTGATTTGTCTGAAGAAGAAGACTTGAT
CGAAGAAATATTAACTGAAGAAGAGTCGGTAGCTGTGGTGTTGAAAAGATTTGAAGATGTTTTCGAGTGGCCAGAGACTTTGCCTCCTTGGCGGATAATAGAACATCACA
TCCATTTGAAAGAGGGAGTGAATCCAGTTAATGTACGGCCATATCGATATGCTTATCAACAAAAAAATGAAATGGAGAAAGCCTTGAATAACGTGACTGTGCCTGACAAA
TTTCCAATTCCAGTAATTGAAGAGTTATTTGATGAGTTAGATGGAGCTGTTATATTTACTAAGATTGATTTGAAGGCAGGGTATCATCAGATTCGAATGTGTGCAGATGA
TATTGAGAAGACTGCGTTTAGGACTCATGAAGGTCACTATGAGTTCATGGTGATGCCATTTGGACTAACAAATGCACCATCAACTTTCCAATCATTAATGAATGCTATAT
TCAAGCCGTATCTCATAAAGTTTGTGCTGGTATTCTTTGATGATATACTGATTTACAGCAAGAATTTAAAGGCTCATTTGGATCACATCAAAGCAGTGTTAGAAGTATTG
CGAAAAAATGAATTGTATGCAAACAAAAAGAAGTGCAGCTTTGCCAAAGCAAGGGTAGAATACCTGGGGCATATCATATCTGGAGGAGGAGTAGAGGTGGATCCGGAGAA
AATTAGAGCCATCAAAGAATGGCCTATTCCAACTAACGTGAGAGAAGTTCGAGGATTTTTGGGATTGACTGGATATTATCGAAAATTTGTAAAGCATTATGGGACGATAG
CAGCACCTCTGACCCAATTACTTAAAAAAGGGGGATTTAGATGGACTGAGGAGACTCAAGAAGCCTTTGTTAAGTTGCAAATGACCATGATGACATTGCCCATGTTAGCA
ATGCCAGATTTTAGCATTCCATTTGAGATTGAAACGGATGCTTCTGGATATGGGTTGGGTGCAGTGTTAATACAAAACCAAAGACCTATTGCTTATTATAGTCATACTCT
AGCAGTTAGAGATAGAGTTAAACCAGTTTATGAAAGGGAATTGATGGCTGTAGTAATGACAGTACAGAGATGGCGTCCCTATTTATTGGGGAAGAGATTCAAGGTGAAAA
CTGATCCGCGGTCATTAAAATTCTTTCTGGAACAACGGGTGATTCAGCCCCAGTACCAAAAGTGGATATCTAAGCTTTTAGGATATTCATTTGAGGTGATATATAAGCCG
GGGTTAGAAAACAAGGCTGCTGACGCGTTGTCTAGAATACCTCCAACCGTCCATTTGAATCAACTGACAGCTCCAAACATAATTGATGTGGCAGTAATTAAAGAAGAGGT
CAATCAAGATGAGAGACTTAAGAAGATCAGAGGAGAATTAGAAGAAAAAGGTGAGCAGCAGCCCAGTAAATACTCTATGAAGCAAGGGATGTTGATGTATAAGAATCGTA
TGGTAATTTCCAAAACTTCCAAACTAATCGCTATGATCTTACATACTTTTCATAATTCGGTATTTGGAGGACATTCAGGGTTTCTACGAACATACAAGAGGTTGACTGGA
GAGTTATATTGGGAAGGGATGAAACAATATGTTAAAAAATATTGTGAAGAGTGCATGATTTGTCAAAGAAATAAAACTTTGGCATTATCTCCAGCAGGGTTGTTAACACC
CTTGGAAGTTCCTAATAGAGTGTGGGAAGAGATTTCCATGGATTTTGTAGATGGATTACCAAAAGAAAATGGGTTTGAGGTTATCTTTGTAGTGGTAGACCGTTTCAGAC
TTTCAGTAAATATGGTCACTTTTTTACCAATGAAACATCCCTATAATGCGAAGTCTTTAGCTGAATTGTTTGTTAAAGAAGTTGTGAGACTACACGACTTTCCAAAGTCC
ATAGTGTCAGATATAGATAAGGTGTTTTTGAGTTCCTTCTGGAAAGAGTTATTCAGACTAGCAGGCACAAGATTGAATCATAGTACTGCATATCATCCTCAGTCAGATGG
ACAAACAGAAGTAGTTAATCGAGGAGTAGCGATTTATTTACTATGTTTTTGTGGAGAAAAACTGAAGGAGTGGGTGAAATGGATACCATGGGCTAAATATTGGTATAACA
CTACATATCAGCGGTCGTTGGGTATCACGCCATTTCAGGCAGTATATGGCAGATTACCTCCTCCATTGATATATTATGGGGTTAGAGATACATCGAATTCAACATTGGAT
GAACAACTGAAAGAGAGAGATGTAGCTTTGGATGCTTTGAAGGAACATTTACAAGTGGCACAGGATAAGATGAAAAAATATGCTGATTTGAAGAGGAGAGACGTACAGCC
TTATCGGCAAGTATCATTGAGAAGAAAGAGGAATGAGAAGCTCTCTGCTAAATATTTTGGGCCTTATAAAGTGATTGAAAGGGTTGGGCCAGTAGCATATAAACTGGAGT
TACCTGCACAGGCAGCCATACATCCAGTTTTTCATGAGAATTGTGATGATATTCCATACTTATCAGAGAATTATGAATGGAGGGCTATGCCAGAAGAAGTGTATGGATAT
TTGAAGAATAAGGCTGGAAGTTGGGATGTGTTAGTGAGTTGGAAGGGTCTACCGAGACATGAGGCAACATGGGAGTTATATGAAGATTTGCAACAGCGATTTCCAGATTT
TCACCTTGAGGACAAGGTGAATTTGAAGAGGGAGAGTAATGATAGACCCCCGATATTGTATCAATATACTAGGAAAGGGAAGAAGGGTAGTGGCTCGTGTGTAGAATGA
mRNA sequenceShow/hide mRNA sequence
ATGGTGCAGACGAGGATTGAAGAAAAGATTGAAACGATAGACCAAGAGATTCATGGCATCAAGAAGGAGATCGGGAAACTGCTGGCGATTGAGAAAACGTTGAACAATAT
ATCGAAGAACATGGAAAGACAAAATCAATTAATATTGAGAATTATGGAATCAGCCGCACAGGAGAGAACAACGATGAACGAAAGGATATCCGAATTGTCTATACAGACTT
TTTCGGTGAAGATTATAGATGAAGGCGAAGGATCTTCGAGGCGAGAAAGCGAAACGAACAACGAAGAGAAGAAACCGAATGAAGAAGGAAACAATGATCGAAACAAATTT
AAGAAAGTGGAGATGCCTATATTCAACGGAGATGATCTCGATTCGTGGTTATTTTGCGCTGAAAGGTATTTTCTTATCCATAAACTCACTGAATCTGAAAAGATAACCGT
ATCTACTATTAGTTTCGAAGGACCGACATTGAATTGGTATCGTTCACAAGAAGAACGGGAGAAGCGATTTTTGCGTATCCAACAAACAACAACCATGGATGAATATCAAA
ATTTATTCGATAAGTGGAGATGGATTTCTGTGAGCCAAAAGGACGCCCGCATGATAAGGATAGCGCAAAAAATTGAAAACAGAGAAGATATTCGGCGTGAGGCAAATCTT
CCCAGATATTCCGGAGGAAGAACAACTAATCCACAGAATAGTACCAAGACTAATACAAATGTGAATACTGGAGAAAATAAGGGAGGAATGAATTGGCCAATGAGAACCAT
CACTTTGCGAGGGACATCGAAAGAGGAAGTTCCAAAGGAGGGTCCAACAAAACGATTGTCTGATGCTGAATTCCAAGCTCGAAAGGAGAAGGGACTGTGTTTTCGGTGTA
ATGAGAAGTATTCTCACGACCATAAATGCAAGGCTAAGGAGTTGAGGGAATTACGAATGCTGGTAGTGAAGGAGGATAATGAAGAATATGAAATCATTGAAGATGGAAAT
AATGCTTTGAAGGAGTTGAATACCATCGAAATCATGGAAGAAGGCCAAGCGATAGTAGAATTATCTATGAACTCTGTAGTTGGATTGGACAATCTAGGGACAATGGAGGT
GAAGGGAACGATACGAGGGAGAGAAATTGTGGTGTTAATAGATTGTGGAGCGACACACAACTTTGTTTCAGAACGACTTGTGAAGGAGTTACAACTGGAAACAAAAGAAA
CCTCAAACTATGGGGTAATTTTGGGTTTCGGTACTGCTATAAAAGGGAAAGGGATATGTGGAGCTGTGGAACTGATGTTAGGTGATTGGAAAGTGATTGATGAATTCTTA
CCCTTGGAGTTAGGAGGAGTAGATGCCATACTGAATATGCAGTGGCTTTATTCTTTGGGAATAACAGAAGCCTTGGAAAGATGTGATTTGTCTGAAGAAGAAGACTTGAT
CGAAGAAATATTAACTGAAGAAGAGTCGGTAGCTGTGGTGTTGAAAAGATTTGAAGATGTTTTCGAGTGGCCAGAGACTTTGCCTCCTTGGCGGATAATAGAACATCACA
TCCATTTGAAAGAGGGAGTGAATCCAGTTAATGTACGGCCATATCGATATGCTTATCAACAAAAAAATGAAATGGAGAAAGCCTTGAATAACGTGACTGTGCCTGACAAA
TTTCCAATTCCAGTAATTGAAGAGTTATTTGATGAGTTAGATGGAGCTGTTATATTTACTAAGATTGATTTGAAGGCAGGGTATCATCAGATTCGAATGTGTGCAGATGA
TATTGAGAAGACTGCGTTTAGGACTCATGAAGGTCACTATGAGTTCATGGTGATGCCATTTGGACTAACAAATGCACCATCAACTTTCCAATCATTAATGAATGCTATAT
TCAAGCCGTATCTCATAAAGTTTGTGCTGGTATTCTTTGATGATATACTGATTTACAGCAAGAATTTAAAGGCTCATTTGGATCACATCAAAGCAGTGTTAGAAGTATTG
CGAAAAAATGAATTGTATGCAAACAAAAAGAAGTGCAGCTTTGCCAAAGCAAGGGTAGAATACCTGGGGCATATCATATCTGGAGGAGGAGTAGAGGTGGATCCGGAGAA
AATTAGAGCCATCAAAGAATGGCCTATTCCAACTAACGTGAGAGAAGTTCGAGGATTTTTGGGATTGACTGGATATTATCGAAAATTTGTAAAGCATTATGGGACGATAG
CAGCACCTCTGACCCAATTACTTAAAAAAGGGGGATTTAGATGGACTGAGGAGACTCAAGAAGCCTTTGTTAAGTTGCAAATGACCATGATGACATTGCCCATGTTAGCA
ATGCCAGATTTTAGCATTCCATTTGAGATTGAAACGGATGCTTCTGGATATGGGTTGGGTGCAGTGTTAATACAAAACCAAAGACCTATTGCTTATTATAGTCATACTCT
AGCAGTTAGAGATAGAGTTAAACCAGTTTATGAAAGGGAATTGATGGCTGTAGTAATGACAGTACAGAGATGGCGTCCCTATTTATTGGGGAAGAGATTCAAGGTGAAAA
CTGATCCGCGGTCATTAAAATTCTTTCTGGAACAACGGGTGATTCAGCCCCAGTACCAAAAGTGGATATCTAAGCTTTTAGGATATTCATTTGAGGTGATATATAAGCCG
GGGTTAGAAAACAAGGCTGCTGACGCGTTGTCTAGAATACCTCCAACCGTCCATTTGAATCAACTGACAGCTCCAAACATAATTGATGTGGCAGTAATTAAAGAAGAGGT
CAATCAAGATGAGAGACTTAAGAAGATCAGAGGAGAATTAGAAGAAAAAGGTGAGCAGCAGCCCAGTAAATACTCTATGAAGCAAGGGATGTTGATGTATAAGAATCGTA
TGGTAATTTCCAAAACTTCCAAACTAATCGCTATGATCTTACATACTTTTCATAATTCGGTATTTGGAGGACATTCAGGGTTTCTACGAACATACAAGAGGTTGACTGGA
GAGTTATATTGGGAAGGGATGAAACAATATGTTAAAAAATATTGTGAAGAGTGCATGATTTGTCAAAGAAATAAAACTTTGGCATTATCTCCAGCAGGGTTGTTAACACC
CTTGGAAGTTCCTAATAGAGTGTGGGAAGAGATTTCCATGGATTTTGTAGATGGATTACCAAAAGAAAATGGGTTTGAGGTTATCTTTGTAGTGGTAGACCGTTTCAGAC
TTTCAGTAAATATGGTCACTTTTTTACCAATGAAACATCCCTATAATGCGAAGTCTTTAGCTGAATTGTTTGTTAAAGAAGTTGTGAGACTACACGACTTTCCAAAGTCC
ATAGTGTCAGATATAGATAAGGTGTTTTTGAGTTCCTTCTGGAAAGAGTTATTCAGACTAGCAGGCACAAGATTGAATCATAGTACTGCATATCATCCTCAGTCAGATGG
ACAAACAGAAGTAGTTAATCGAGGAGTAGCGATTTATTTACTATGTTTTTGTGGAGAAAAACTGAAGGAGTGGGTGAAATGGATACCATGGGCTAAATATTGGTATAACA
CTACATATCAGCGGTCGTTGGGTATCACGCCATTTCAGGCAGTATATGGCAGATTACCTCCTCCATTGATATATTATGGGGTTAGAGATACATCGAATTCAACATTGGAT
GAACAACTGAAAGAGAGAGATGTAGCTTTGGATGCTTTGAAGGAACATTTACAAGTGGCACAGGATAAGATGAAAAAATATGCTGATTTGAAGAGGAGAGACGTACAGCC
TTATCGGCAAGTATCATTGAGAAGAAAGAGGAATGAGAAGCTCTCTGCTAAATATTTTGGGCCTTATAAAGTGATTGAAAGGGTTGGGCCAGTAGCATATAAACTGGAGT
TACCTGCACAGGCAGCCATACATCCAGTTTTTCATGAGAATTGTGATGATATTCCATACTTATCAGAGAATTATGAATGGAGGGCTATGCCAGAAGAAGTGTATGGATAT
TTGAAGAATAAGGCTGGAAGTTGGGATGTGTTAGTGAGTTGGAAGGGTCTACCGAGACATGAGGCAACATGGGAGTTATATGAAGATTTGCAACAGCGATTTCCAGATTT
TCACCTTGAGGACAAGGTGAATTTGAAGAGGGAGAGTAATGATAGACCCCCGATATTGTATCAATATACTAGGAAAGGGAAGAAGGGTAGTGGCTCGTGTGTAGAATGA
Protein sequenceShow/hide protein sequence
MVQTRIEEKIETIDQEIHGIKKEIGKLLAIEKTLNNISKNMERQNQLILRIMESAAQERTTMNERISELSIQTFSVKIIDEGEGSSRRESETNNEEKKPNEEGNNDRNKF
KKVEMPIFNGDDLDSWLFCAERYFLIHKLTESEKITVSTISFEGPTLNWYRSQEEREKRFLRIQQTTTMDEYQNLFDKWRWISVSQKDARMIRIAQKIENREDIRREANL
PRYSGGRTTNPQNSTKTNTNVNTGENKGGMNWPMRTITLRGTSKEEVPKEGPTKRLSDAEFQARKEKGLCFRCNEKYSHDHKCKAKELRELRMLVVKEDNEEYEIIEDGN
NALKELNTIEIMEEGQAIVELSMNSVVGLDNLGTMEVKGTIRGREIVVLIDCGATHNFVSERLVKELQLETKETSNYGVILGFGTAIKGKGICGAVELMLGDWKVIDEFL
PLELGGVDAILNMQWLYSLGITEALERCDLSEEEDLIEEILTEEESVAVVLKRFEDVFEWPETLPPWRIIEHHIHLKEGVNPVNVRPYRYAYQQKNEMEKALNNVTVPDK
FPIPVIEELFDELDGAVIFTKIDLKAGYHQIRMCADDIEKTAFRTHEGHYEFMVMPFGLTNAPSTFQSLMNAIFKPYLIKFVLVFFDDILIYSKNLKAHLDHIKAVLEVL
RKNELYANKKKCSFAKARVEYLGHIISGGGVEVDPEKIRAIKEWPIPTNVREVRGFLGLTGYYRKFVKHYGTIAAPLTQLLKKGGFRWTEETQEAFVKLQMTMMTLPMLA
MPDFSIPFEIETDASGYGLGAVLIQNQRPIAYYSHTLAVRDRVKPVYERELMAVVMTVQRWRPYLLGKRFKVKTDPRSLKFFLEQRVIQPQYQKWISKLLGYSFEVIYKP
GLENKAADALSRIPPTVHLNQLTAPNIIDVAVIKEEVNQDERLKKIRGELEEKGEQQPSKYSMKQGMLMYKNRMVISKTSKLIAMILHTFHNSVFGGHSGFLRTYKRLTG
ELYWEGMKQYVKKYCEECMICQRNKTLALSPAGLLTPLEVPNRVWEEISMDFVDGLPKENGFEVIFVVVDRFRLSVNMVTFLPMKHPYNAKSLAELFVKEVVRLHDFPKS
IVSDIDKVFLSSFWKELFRLAGTRLNHSTAYHPQSDGQTEVVNRGVAIYLLCFCGEKLKEWVKWIPWAKYWYNTTYQRSLGITPFQAVYGRLPPPLIYYGVRDTSNSTLD
EQLKERDVALDALKEHLQVAQDKMKKYADLKRRDVQPYRQVSLRRKRNEKLSAKYFGPYKVIERVGPVAYKLELPAQAAIHPVFHENCDDIPYLSENYEWRAMPEEVYGY
LKNKAGSWDVLVSWKGLPRHEATWELYEDLQQRFPDFHLEDKVNLKRESNDRPPILYQYTRKGKKGSGSCVE