; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI03G25110 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI03G25110
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionRetrotransposon protein, putative, Ty1-copia subclass
Genome locationChr3:22352269..22356445
RNA-Seq ExpressionCSPI03G25110
SyntenyCSPI03G25110
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR025724 - GAG-pre-integrase domain
IPR036397 - Ribonuclease H superfamily
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG8482869.1 hypothetical protein CXB51_023985 [Gossypium anomalum]8.2e-24739.38Show/hide
Query:  FALWKAKIKALLGQQKSHKALLDPSELPTTLTATQKEEMKLNAYGTLILNLSDNIIRQVLEEETAYKVWMKLESLYASKDLPNKMYLREKNFTYRMDPSK
        FALW+ K++A+L Q     ALL   ++P+TLT  +K+     A   L L+LS+ I++ V++E+TA  +W +LE +  SK L +K++++++ + +R++   
Subjt:  FALWKAKIKALLGQQKSHKALLDPSELPTTLTATQKEEMKLNAYGTLILNLSDNIIRQVLEEETAYKVWMKLESLYASKDLPNKMYLREKNFTYRMDPSK

Query:  TLSENLDEFKKIVSDFKTLE---------------------------------------------------------------------DKLSDE-----
        ++ E+L  FK+I+S+ + +E                                                                     D+ +D+     
Subjt:  TLSENLDEFKKIVSDFKTLE---------------------------------------------------------------------DKLSDE-----

Query:  --------------------------------NEALKNQEKEK----DSKGKQPEVS---IVESSFTYTDALASTLDQANHVNPLGKHDWVLDSGCTYHM
                                        +E  K Q K K    + KGKQPE S    V   ++  + L ++++ +         +W+LDSGCT+HM
Subjt:  --------------------------------NEALKNQEKEK----DSKGKQPEVS---IVESSFTYTDALASTLDQANHVNPLGKHDWVLDSGCTYHM

Query:  TPFRAWFNTYREISGESVFIGNNNECNIAGIGSVTMKLKDKTVKLLRNVRHVPHLKRNLISLRMLDSLGCEYKGKCGVFQVFMGSKLVLVGE-KVNDLFI
        +P R WF TY  +S   V +GNN  C IAG+G++ +K+ D  V+ L +VR+VP LKRNLISL  LDS G  Y  + GV ++  GS +V+ G+ K   L++
Subjt:  TPFRAWFNTYREISGESVFIGNNNECNIAGIGSVTMKLKDKTVKLLRNVRHVPHLKRNLISLRMLDSLGCEYKGKCGVFQVFMGSKLVLVGE-KVNDLFI

Query:  IKGVEMIEEANTVLSLNLIEVDI---WHKRLSHISQKGLEALSKQGILP-QDICSKLSFCEHCVLGKTRKQNFTKAQHTTRGILDYIHLDLWGPASTPSL
        ++G  +  +A  V S +L + DI   WH RL H+S+ G+  LSK+G+L  Q IC KL+FCEHCV GK ++  FT+  H T   L+YIH DLWGP+  PS 
Subjt:  IKGVEMIEEANTVLSLNLIEVDI---WHKRLSHISQKGLEALSKQGILP-QDICSKLSFCEHCVLGKTRKQNFTKAQHTTRGILDYIHLDLWGPASTPSL

Query:  SGS--------------------------------------------RYLRTDNGLEFCGELFNNFCRQSGITRHKTVTYTPQQNRVAERLNKTIMERVR
         G+                                            +YLRTDNGLEFC + FN  C+  GI RH TV +TPQQN VAER+N+TI+E+VR
Subjt:  SGS--------------------------------------------RYLRTDNGLEFCGELFNNFCRQSGITRHKTVTYTPQQNRVAERLNKTIMERVR

Query:  CLLSDVILKEKFWAEAAAYVMHTLNRSPHTSL--------------------VFGCVGYAHQNKGKLKVRAAKCMFIGFTEGVKGFKLWHPTNKRFIISK
        C+LS+  L + FWAEAA+     +NRSP  ++                    +FGC  YAH N GKL+ R+ KC+F+G+  GVKG+KLW P N++ +IS+
Subjt:  CLLSDVILKEKFWAEAAAYVMHTLNRSPHTSL--------------------VFGCVGYAHQNKGKLKVRAAKCMFIGFTEGVKGFKLWHPTNKRFIISK

Query:  SSTEGNFDDTQSYTTQIEVENTGKSVQPTEEPTAIEQEQVENLSEEQDEMLEEQPDLSQYSLARDRQRRIIVPPARYAESNYISFVLN-ATVVPNDSEPS
              FD+T +    + +++        +    I  E    +S + +  +   P   QYS+A++R +R I PP +YAE++ +++ LN A  +  + EPS
Subjt:  SSTEGNFDDTQSYTTQIEVENTGKSVQPTEEPTAIEQEQVENLSEEQDEMLEEQPDLSQYSLARDRQRRIIVPPARYAESNYISFVLN-ATVVPNDSEPS

Query:  SFDEAVNSSNARQWIEAMNEEINSLNVNDTWTLASLPKGCKPIASKWIFKLKEGITKNSQPRYKARLVAKGFTQREGIDYSEIFSPVVKQTSIRLLLSLV
        ++ EA++  ++ +W+ AM EE+ SL+ N TW L  LPKG K +  KW+FK KEG     +P+YKARLVAKG++Q  G+D++++FSPVVK +SIR LL +V
Subjt:  SFDEAVNSSNARQWIEAMNEEINSLNVNDTWTLASLPKGCKPIASKWIFKLKEGITKNSQPRYKARLVAKGFTQREGIDYSEIFSPVVKQTSIRLLLSLV

Query:  AQNNLELDQLDVKTTFLHDYLEETIYMVQPKGYEVQGKEDLYYLLKKSIYGLKQSPRCWYRRFDDFITSLGFQ----------------------KNLRE
        A ++LEL+QLDVKT FLH  LEE IYM QP+G+ V  KED   LLKKS+YGLKQSPR WY+RFD F+TS  F+                      K+L  
Subjt:  AQNNLELDQLDVKTTFLHDYLEETIYMVQPKGYEVQGKEDLYYLLKKSIYGLKQSPRCWYRRFDDFITSLGFQ----------------------KNLRE

Query:  SKKILGIDIIRDRDQSTLSINQTSYCEKVIRRFNLTNARPVTLPIAHHFKLSAVNSP-SETDIEHKLQMKNVPYSQAVGSLMYLMISTRPDLSYSASLVS
        +KKILG++I+RDR  S L ++Q  Y EK++ RFN+ +A+PV+ P+A HF+LS+  SP S+ +IE+   M +VPYS AVGSLMY M+ +RPDLSY+ S VS
Subjt:  SKKILGIDIIRDRDQSTLSINQTSYCEKVIRRFNLTNARPVTLPIAHHFKLSAVNSP-SETDIEHKLQMKNVPYSQAVGSLMYLMISTRPDLSYSASLVS

Query:  KYMANSGRRHWEATKWIIRYLIWSKNARLNYQRTTEIELELIGYVDSDFAGDSDKRRSLTGYVFLYGRNLISWKAILQSIVALSTTEAEYIALSEG----
        +YMAN G+ HW+A +WI+RYL  + +  L + RT   E  +IGYVD+DFAGD D+RRSLTGYVF  G   ISWKA LQ+ VALSTTEAEY+A++E     
Subjt:  KYMANSGRRHWEATKWIIRYLIWSKNARLNYQRTTEIELELIGYVDSDFAGDSDKRRSLTGYVFLYGRNLISWKAILQSIVALSTTEAEYIALSEG----

Query:  --------------SIVKIFYDNQSAIRLSKNPQYHSRTKHIDIKYHFIREKIEAGEIQVLKVHTSENAADMLTKSVSTLKLQKCFELIG
                       I  +F D+QSAI L+K+  +H RTKHID++YHF+R+ I  G+I V K+ T EN ADM+TKS+   K + C +L+G
Subjt:  --------------SIVKIFYDNQSAIRLSKNPQYHSRTKHIDIKYHFIREKIEAGEIQVLKVHTSENAADMLTKSVSTLKLQKCFELIG

KAG8485270.1 hypothetical protein CXB51_021477 [Gossypium anomalum]6.7e-24939.91Show/hide
Query:  FALWKAKIKALLGQQKSHKALLDPSELPTTLTATQKEEMKLNAYGTLILNLSDNIIRQVLEEETAYKVWMKLESLYASKDLPNKMYLREKNFTYRMDPSK
        FALW+ K++A+L Q     ALL   ++P+TLT  +K+     A   L L+LS+ I++ V++E+TA  +W +LE +  SK L +K++++++ + +R++   
Subjt:  FALWKAKIKALLGQQKSHKALLDPSELPTTLTATQKEEMKLNAYGTLILNLSDNIIRQVLEEETAYKVWMKLESLYASKDLPNKMYLREKNFTYRMDPSK

Query:  TLSENLDEFKKIVSDFKTLE---------------------------------------------------------------------DKLSDE-----
        ++ E+L  FK+I+S+ + +E                                                                     D+ +D+     
Subjt:  TLSENLDEFKKIVSDFKTLE---------------------------------------------------------------------DKLSDE-----

Query:  -NEALKNQEKEK----DSKGKQPEVS---IVESSFTYTDALASTLDQANHVNPLGKHDWVLDSGCTYHMTPFRAWFNTYREISGESVFIGNNNECNIAGI
         +E  K Q K K    + KGKQPE S    V   ++  + L ++++ +         +W+LDSGCT+HM+P R WF TY  +S   V +GNN  C IAG+
Subjt:  -NEALKNQEKEK----DSKGKQPEVS---IVESSFTYTDALASTLDQANHVNPLGKHDWVLDSGCTYHMTPFRAWFNTYREISGESVFIGNNNECNIAGI

Query:  GSVTMKLKDKTVKLLRNVRHVPHLKRNLISLRMLDSLGCEYKGKCGVFQVFMGSKLVLVGE-KVNDLFIIKGVEMIEEANTVLSLNLIEVDI---WHKRL
        G++ +K+ D  V+ L +VR+VP LKRNLISL  LDS G  Y  + GV ++  GS +V+ G+ K   L++++G  +  +A TV S +L + DI   WH R+
Subjt:  GSVTMKLKDKTVKLLRNVRHVPHLKRNLISLRMLDSLGCEYKGKCGVFQVFMGSKLVLVGE-KVNDLFIIKGVEMIEEANTVLSLNLIEVDI---WHKRL

Query:  SHISQKGLEALSKQGILP-QDICSKLSFCEHCVLGKTRKQNFTKAQHTTRGILDYIHLDLWGPASTPSLSGS----------------------------
         H+S+ G+  LSK+G+L  Q IC KL+FCEHCV GK ++  FT+  H T+  L+YIH DLWGP+  PS  G+                            
Subjt:  SHISQKGLEALSKQGILP-QDICSKLSFCEHCVLGKTRKQNFTKAQHTTRGILDYIHLDLWGPASTPSLSGS----------------------------

Query:  ----------------RYLRTDNGLEFCGELFNNFCRQSGITRHKTVTYTPQQNRVAERLNKTIMERVRCLLSDVILKEKFWAEAAAYVMHTLNRSPHTS
                        +YLRTDNGLEFC + FN  C+  GI RH TV +TPQQN VAER+N+TIME+VRC+LS+  L + FWAEAA+     +NRSP  +
Subjt:  ----------------RYLRTDNGLEFCGELFNNFCRQSGITRHKTVTYTPQQNRVAERLNKTIMERVRCLLSDVILKEKFWAEAAAYVMHTLNRSPHTS

Query:  L-------VFGCVGYAHQNKGKLKVRAAKCMFIGFTEGVKGFKLWHPTNKRFIISKSSTEGNFDDTQSYTTQIEVENTGKSVQPTEEPTAIEQEQVENLS
        +       +FGC  YAH N GKL+ R+ KC+F+G+  GVKG+KLW P N++ +IS+      FD+T +    + ++++       +    I  E    +S
Subjt:  L-------VFGCVGYAHQNKGKLKVRAAKCMFIGFTEGVKGFKLWHPTNKRFIISKSSTEGNFDDTQSYTTQIEVENTGKSVQPTEEPTAIEQEQVENLS

Query:  EEQDEMLEEQPDLSQYSLARDRQRRIIVPPARYAESNYISFVLN-ATVVPNDSEPSSFDEAVNSSNARQWIEAMNEEINSLNVNDTWTLASLPKGCKPIA
         + +  +   P   QYS+A++R +R I PP +YAE++ +++ LN A  +  + EPS++ EA++  ++ +W+ AM EE+ SL+ N TW L  LPKG K + 
Subjt:  EEQDEMLEEQPDLSQYSLARDRQRRIIVPPARYAESNYISFVLN-ATVVPNDSEPSSFDEAVNSSNARQWIEAMNEEINSLNVNDTWTLASLPKGCKPIA

Query:  SKWIFKLKEGITKNSQPRYKARLVAKGFTQREGIDYSEIFSPVVKQTSIRLLLSLVAQNNLELDQLDVKTTFLHDYLEETIYMVQPKGYEVQGKEDLYYL
         KW+FK KEG     +P+YKARLVAKG++Q  G+D++++FSPVVK +SIR LL +VA ++LEL+QLDVKTTFLH  LEE IYM QP+G+ V  KED   L
Subjt:  SKWIFKLKEGITKNSQPRYKARLVAKGFTQREGIDYSEIFSPVVKQTSIRLLLSLVAQNNLELDQLDVKTTFLHDYLEETIYMVQPKGYEVQGKEDLYYL

Query:  LKKSIYGLKQSPRCWYRRFDDFITSLGFQ---------------------------------------------------KNLRESKKILGIDIIRDRDQ
        LKKS+YGLKQSPR WY+RFD F+TS  F+                                                   K+L  +KKILG++I+RDR  
Subjt:  LKKSIYGLKQSPRCWYRRFDDFITSLGFQ---------------------------------------------------KNLRESKKILGIDIIRDRDQ

Query:  STLSINQTSYCEKVIRRFNLTNARPVTLPIAHHFKLSAVNSP-SETDIEHKLQMKNVPYSQAVGSLMYLMISTRPDLSYSASLVSKYMANSGRRHWEATK
        S L ++Q  Y EK++ RFN+ +A+PV+ P+A HF+LS+  SP S+ +IE+   M +VPYS AVGSLMY M+ +RPDLSY+ S VS+YMAN  + HW+A +
Subjt:  STLSINQTSYCEKVIRRFNLTNARPVTLPIAHHFKLSAVNSP-SETDIEHKLQMKNVPYSQAVGSLMYLMISTRPDLSYSASLVSKYMANSGRRHWEATK

Query:  WIIRYLIWSKNARLNYQRTTEIELELIGYVDSDFAGDSDKRRSLTGYVFLYGRNLISWKAILQSIVALSTTEAEYIALSEG------------------S
        WI+RYL  + +  L + RT   E  +IGYVD+DFAGD D+RRSLTGYVF  G   ISWKA LQ+ VALSTTEAEY+A++E                    
Subjt:  WIIRYLIWSKNARLNYQRTTEIELELIGYVDSDFAGDSDKRRSLTGYVFLYGRNLISWKAILQSIVALSTTEAEYIALSEG------------------S

Query:  IVKIFYDNQSAIRLSKNPQYHSRTKHIDIKYHFIREKIEAGEIQVLKVHTSENAADMLTKSVSTLKLQKCFEL
        I+ +F D+QSAI L+K+  +H RTKHID++YHF+R+ I  G+I V K+ T EN ADM+TKS+   K + C +L
Subjt:  IVKIFYDNQSAIRLSKNPQYHSRTKHIDIKYHFIREKIEAGEIQVLKVHTSENAADMLTKSVSTLKLQKCFEL

KAG8491911.1 hypothetical protein CXB51_015266 [Gossypium anomalum]7.5e-24839.7Show/hide
Query:  FALWKAKIKALLGQQKSHKALLDPSELPTTLTATQKEEMKLNAYGTLILNLSDNIIRQVLEEETAYKVWMKLESLYASKDLPNKMYLREKNFTYRMDPSK
        FALW+ K++A+L Q     ALL   ++P+TLT  +K+     A   L L+LS+ I++ V++E+TA  +W +LE +  SK L +K++++++ + +R++   
Subjt:  FALWKAKIKALLGQQKSHKALLDPSELPTTLTATQKEEMKLNAYGTLILNLSDNIIRQVLEEETAYKVWMKLESLYASKDLPNKMYLREKNFTYRMDPSK

Query:  TLSENLDEFKKIVSDFKTLEDKLSDEN---------------------------------EALKNQEKEK------------------------------
        ++ E+L  FK+I+S+ + +E +   E+                                 ++L + +K K                              
Subjt:  TLSENLDEFKKIVSDFKTLEDKLSDEN---------------------------------EALKNQEKEK------------------------------

Query:  -DSKGKQPEVS---IVESSFTYTDALASTLDQANHVNPLGKHDWVLDSGCTYHMTPFRAWFNTYREISGESVFIGNNNECNIAGIGSVTMKLKDKTVKLL
         + KGKQPE S    V   ++  + L ++++ +         +W+LDSGCT+HM+P R WF TY  +S   V +GNN  C IAG+G++ +K+ D  V+ L
Subjt:  -DSKGKQPEVS---IVESSFTYTDALASTLDQANHVNPLGKHDWVLDSGCTYHMTPFRAWFNTYREISGESVFIGNNNECNIAGIGSVTMKLKDKTVKLL

Query:  RNVRHVPHLKRNLISLRMLDSLGCEYKGKCGVFQVFMGSKLVLVGE-KVNDLFIIKGVEMIEEANTVLSLNLIEVDI---WHKRLSHISQKGLEALSKQG
         +VR+VP LKRNLISL  LDS G  Y  + GV ++  GS +V+ G+ K   L++++G  +  +A  V S +L + DI   WH RL H+S+ G+  LSK+G
Subjt:  RNVRHVPHLKRNLISLRMLDSLGCEYKGKCGVFQVFMGSKLVLVGE-KVNDLFIIKGVEMIEEANTVLSLNLIEVDI---WHKRLSHISQKGLEALSKQG

Query:  ILP-QDICSKLSFCEHCVLGKTRKQNFTKAQHTTRGILDYIHLDLWGPASTPSLSGS-------------------------------------------
        +L  Q IC KL+FCEHCV GK ++  F +  H T+  L+YIH DLWGP+  PS  G+                                           
Subjt:  ILP-QDICSKLSFCEHCVLGKTRKQNFTKAQHTTRGILDYIHLDLWGPASTPSLSGS-------------------------------------------

Query:  -RYLRTDNGLEFCGELFNNFCRQSGITRHKTVTYTPQQNRVAERLNKTIMERVRCLLSDVILKEKFWAEAAAYVMHTLNRSPHTSL--------------
         +YLRTDNGLEFC + FN  C+  GI RH TV +TPQQN VAER+N+TIME+VRC+LS+  L + FWAEAA+     +NRSP  ++              
Subjt:  -RYLRTDNGLEFCGELFNNFCRQSGITRHKTVTYTPQQNRVAERLNKTIMERVRCLLSDVILKEKFWAEAAAYVMHTLNRSPHTSL--------------

Query:  ------VFGCVGYAHQNKGKLKVRAAKCMFIGFTEGVKGFKLWHPTNKRFIISKSSTEGNFDDTQSYTTQIEVENTGKSVQPTEEPTAIEQEQVENLSEE
              +FGC  YAH N GKL+ R+ KC+F+G+  GVKG+KLW P N++ +IS+      FD+T +    + +++        +    I  E    +S +
Subjt:  ------VFGCVGYAHQNKGKLKVRAAKCMFIGFTEGVKGFKLWHPTNKRFIISKSSTEGNFDDTQSYTTQIEVENTGKSVQPTEEPTAIEQEQVENLSEE

Query:  QDEMLEEQPDLSQYSLARDRQRRIIVPPARYAESNYISFVLN-ATVVPNDSEPSSFDEAVNSSNARQWIEAMNEEINSLNVNDTWTLASLPKGCKPIASK
         +  +   P   QYS+A++R +R I PP +YAE++ +++ LN A  +  + EPS++ EA++  ++ +W+ AM EE+ SL+ N TW L  LPKG K +  K
Subjt:  QDEMLEEQPDLSQYSLARDRQRRIIVPPARYAESNYISFVLN-ATVVPNDSEPSSFDEAVNSSNARQWIEAMNEEINSLNVNDTWTLASLPKGCKPIASK

Query:  WIFKLKEGITKNSQPRYKARLVAKGFTQREGIDYSEIFSPVVKQTSIRLLLSLVAQNNLELDQLDVKTTFLHDYLEETIYMVQPKGYEVQGKEDLYYLLK
        W+FK KEG     +P+YKARLVAKG++Q  G+D++++FSPVVK +SIR LL +VA ++LEL+QLDVKT FLH  LEE IYM QP+G+ V  KED   LLK
Subjt:  WIFKLKEGITKNSQPRYKARLVAKGFTQREGIDYSEIFSPVVKQTSIRLLLSLVAQNNLELDQLDVKTTFLHDYLEETIYMVQPKGYEVQGKEDLYYLLK

Query:  KSIYGLKQSPRCWYRRFDDFITSLGFQ---------------------------------------------------KNLRESKKILGIDIIRDRDQST
        KS+YGLKQSPR WY+RFD F+TS  F+                                                   K+L  +KKILG++I+RDR  S 
Subjt:  KSIYGLKQSPRCWYRRFDDFITSLGFQ---------------------------------------------------KNLRESKKILGIDIIRDRDQST

Query:  LSINQTSYCEKVIRRFNLTNARPVTLPIAHHFKLSAVNSPSETDIEHKLQMKNVPYSQAVGSLMYLMISTRPDLSYSASLVSKYMANSGRRHWEATKWII
        L ++Q  Y EK++ RFN+ +A+PV+ P+A HF+LS+  SP ++D E K  M +VPYS AVGSLMY M+ +RPDLSY+ S VS+YMAN G+ HW+A +WI+
Subjt:  LSINQTSYCEKVIRRFNLTNARPVTLPIAHHFKLSAVNSPSETDIEHKLQMKNVPYSQAVGSLMYLMISTRPDLSYSASLVSKYMANSGRRHWEATKWII

Query:  RYLIWSKNARLNYQRTTEIELELIGYVDSDFAGDSDKRRSLTGYVFLYGRNLISWKAILQSIVALSTTEAEYIALSEG------------------SIVK
        RYL  + +  L + RT   E  +IGYVD+DFAGD D+RRSLTGYVF  G   ISWKA LQ+ VALSTTEAEY+A++E                    I  
Subjt:  RYLIWSKNARLNYQRTTEIELELIGYVDSDFAGDSDKRRSLTGYVFLYGRNLISWKAILQSIVALSTTEAEYIALSEG------------------SIVK

Query:  IFYDNQSAIRLSKNPQYHSRTKHIDIKYHFIREKIEAGEIQVLKVHTSENAADMLTKSVSTLKLQKCFELIG
        +F D+QSAI L+K+  +H RTKHID++YHF+R+ I  G+I V K+ T EN ADM+TKS+   K + C +L+G
Subjt:  IFYDNQSAIRLSKNPQYHSRTKHIDIKYHFIREKIEAGEIQVLKVHTSENAADMLTKSVSTLKLQKCFELIG

KAG8492785.1 hypothetical protein CXB51_010233 [Gossypium anomalum]2.6e-24839.7Show/hide
Query:  FALWKAKIKALLGQQKSHKALLDPSELPTTLTATQKEEMKLNAYGTLILNLSDNIIRQVLEEETAYKVWMKLESLYASKDLPNKMYLREKNFTYRMDPSK
        FALW+ K++A+L Q     ALL   ++P+TLT  +K+     A   L L+LS+ I++ V++E+TA  +W +L+ +  SK L +K++++++ + +R++   
Subjt:  FALWKAKIKALLGQQKSHKALLDPSELPTTLTATQKEEMKLNAYGTLILNLSDNIIRQVLEEETAYKVWMKLESLYASKDLPNKMYLREKNFTYRMDPSK

Query:  TLSENLDEFKKIVSDFKTLE---------------------------------------------------------------------DKLSDE-----
        ++ E+L  FKKI+S+ + +E                                                                     D+ +D+     
Subjt:  TLSENLDEFKKIVSDFKTLE---------------------------------------------------------------------DKLSDE-----

Query:  --------------------------------NEALKNQEKEK----DSKGKQPEVS---IVESSFTYTDALASTLDQANHVNPLGKHDWVLDSGCTYHM
                                        +E  K Q K K    + KGKQPE S    V   ++  + L ++++ +         +W+LDSGCT+HM
Subjt:  --------------------------------NEALKNQEKEK----DSKGKQPEVS---IVESSFTYTDALASTLDQANHVNPLGKHDWVLDSGCTYHM

Query:  TPFRAWFNTYREISGESVFIGNNNECNIAGIGSVTMKLKDKTVKLLRNVRHVPHLKRNLISLRMLDSLGCEYKGKCGVFQVFMGSKLVLVGE-KVNDLFI
        +P R WF TY  +S   V +GNN  C IAG+G++ +K+ D  V+ L +VR+VP LKRNLISL  LDS G  Y  + GV ++  GS +++ G+ K   L++
Subjt:  TPFRAWFNTYREISGESVFIGNNNECNIAGIGSVTMKLKDKTVKLLRNVRHVPHLKRNLISLRMLDSLGCEYKGKCGVFQVFMGSKLVLVGE-KVNDLFI

Query:  IKGVEMIEEANTVLSLNLIEVDI---WHKRLSHISQKGLEALSKQGILP-QDICSKLSFCEHCVLGKTRKQNFTKAQHTTRGILDYIHLDLWGPASTPSL
        ++G  +  +A  V S +L + DI   WH RL H+S+ G+  LSK+G+L  Q IC KL+FCEHCV GK ++  FT+  H T+  L+YIH DLWGP+  PS 
Subjt:  IKGVEMIEEANTVLSLNLIEVDI---WHKRLSHISQKGLEALSKQGILP-QDICSKLSFCEHCVLGKTRKQNFTKAQHTTRGILDYIHLDLWGPASTPSL

Query:  SGS--RYLRTDNGLEFCGELFNNFCRQSGITRHKTVTYTPQQNRVAERLNKTIMERVRCLLSDVILKEKFWAEAAAYVMHTLNRSPHTSL----------
        +G   +YLRTDNGLEFC + FN  C+  GI RH TV +TPQQN VAER+N+TIME+VRC+LS+  L + FWAEAA+     +NRSP  ++          
Subjt:  SGS--RYLRTDNGLEFCGELFNNFCRQSGITRHKTVTYTPQQNRVAERLNKTIMERVRCLLSDVILKEKFWAEAAAYVMHTLNRSPHTSL----------

Query:  ----------VFGCVGYAHQNKGKLKVRAAKCMFIGFTEGVKGFKLWHPTNKRFIISKSSTEGNFDDTQSYTTQIEVENTGKSVQPTEEPTAIEQEQVEN
                  +FGC  YAH N GKL+ R+ KC+F+G+  GVKG+KLW P N++ +IS+      FD+T +    + ++++       +    I  E    
Subjt:  ----------VFGCVGYAHQNKGKLKVRAAKCMFIGFTEGVKGFKLWHPTNKRFIISKSSTEGNFDDTQSYTTQIEVENTGKSVQPTEEPTAIEQEQVEN

Query:  LSEEQDEMLEEQPDLSQYSLARDRQRRIIVPPARYAESNYISFVLN-ATVVPNDSEPSSFDEAVNSSNARQWIEAMNEEINSLNVNDTWTLASLPKGCKP
        +S + +  +   P   QYS+A++R +R I PP +YAE++ +++ LN A  +  + EPS++ EA++  ++ +W+ AM EE+ SL+ N TW L  LPKG K 
Subjt:  LSEEQDEMLEEQPDLSQYSLARDRQRRIIVPPARYAESNYISFVLN-ATVVPNDSEPSSFDEAVNSSNARQWIEAMNEEINSLNVNDTWTLASLPKGCKP

Query:  IASKWIFKLKEGITKNSQPRYKARLVAKGFTQREGIDYSEIFSPVVKQTSIRLLLSLVAQNNLELDQLDVKTTFLHDYLEETIYMVQPKGYEVQGKEDLY
        +  KW+FK KE      +P+YKARLVAKG++Q  G+D++++FSPVVK +SIR LL +VA ++LEL+QLDVKT FLH  LEE IYM QP+G+ +  KED  
Subjt:  IASKWIFKLKEGITKNSQPRYKARLVAKGFTQREGIDYSEIFSPVVKQTSIRLLLSLVAQNNLELDQLDVKTTFLHDYLEETIYMVQPKGYEVQGKEDLY

Query:  YLLKKSIYGLKQSPRCWYRRFDDFITSLGFQ---------------------------------------------------KNLRESKKILGIDIIRDR
         LLKKS+YGLKQSPR WY+RFD FITS  F+                                                   K+L  +KKILG++I+RDR
Subjt:  YLLKKSIYGLKQSPRCWYRRFDDFITSLGFQ---------------------------------------------------KNLRESKKILGIDIIRDR

Query:  DQSTLSINQTSYCEKVIRRFNLTNARPVTLPIAHHFKLSAVNSP-SETDIEHKLQMKNVPYSQAVGSLMYLMISTRPDLSYSASLVSKYMANSGRRHWEA
          S L ++Q  Y EK++ RFN+ +A+PV+ P+A HF+LS+  SP S+ +IE+   M +VPYS AVGSLMY M+ +RPDLSY+ S VS+YMAN G+ HW+A
Subjt:  DQSTLSINQTSYCEKVIRRFNLTNARPVTLPIAHHFKLSAVNSP-SETDIEHKLQMKNVPYSQAVGSLMYLMISTRPDLSYSASLVSKYMANSGRRHWEA

Query:  TKWIIRYLIWSKNARLNYQRTTEIELELIGYVDSDFAGDSDKRRSLTGYVFLYGRNLISWKAILQSIVALSTTEAEYIALSEG-----------------
         +WI+RYL  + +  L + RT   E  +IGYVD+DFAGD D+RRSLTGYVF  G   ISWKA LQ+ VALSTTEAEY+A++E                  
Subjt:  TKWIIRYLIWSKNARLNYQRTTEIELELIGYVDSDFAGDSDKRRSLTGYVFLYGRNLISWKAILQSIVALSTTEAEYIALSEG-----------------

Query:  -SIVKIFYDNQSAIRLSKNPQYHSRTKHIDIKYHFIREKIEAGEIQVLKVHTSENAADMLTKSVSTLKLQKCFELIG
          I  +F D+QSAI L+K+  +H RTKHID++YHF+R+ I  G+I V K+ T EN ADM+TKS+   K + C +L+G
Subjt:  -SIVKIFYDNQSAIRLSKNPQYHSRTKHIDIKYHFIREKIEAGEIQVLKVHTSENAADMLTKSVSTLKLQKCFELIG

KAG8496382.1 hypothetical protein CXB51_009119 [Gossypium anomalum]6.3e-24739.14Show/hide
Query:  FALWKAKIKALLGQQKSHKALLDPSELPTTLTATQKEEMKLNAYGTLILNLSDNIIRQVLEEETAYKVWMKLESLYASKDLPNKMYLREKNFTYRMDPSK
        FALW+ K++A+L Q     ALL   ++P+TLT  +K+     A   L L+LS+ I++ V++E+TA  +W +LE +  SK L +K++++++ + +R++   
Subjt:  FALWKAKIKALLGQQKSHKALLDPSELPTTLTATQKEEMKLNAYGTLILNLSDNIIRQVLEEETAYKVWMKLESLYASKDLPNKMYLREKNFTYRMDPSK

Query:  TLSENLDEFKKIVSDFKTLE---------------------------------------------------------------------DKLSDE-----
        ++ E+L  FK+I+S+ + +E                                                                     D+ +D+     
Subjt:  TLSENLDEFKKIVSDFKTLE---------------------------------------------------------------------DKLSDE-----

Query:  --------------------------------NEALKNQEKEK----DSKGKQPEVS---IVESSFTYTDALASTLDQANHVNPLGKHDWVLDSGCTYHM
                                        +E  K Q K K    + KGKQPE S    V   ++  + L ++++ +         +W+LDSGCT+HM
Subjt:  --------------------------------NEALKNQEKEK----DSKGKQPEVS---IVESSFTYTDALASTLDQANHVNPLGKHDWVLDSGCTYHM

Query:  TPFRAWFNTYREISGESVFIGNNNECNIAGIGSVTMKLKDKTVKLLRNVRHVPHLKRNLISLRMLDSLGCEYKGKCGVFQVFMGSKLVLVGE-KVNDLFI
        +P R WF TY  +S   V +GNN  C IAG+G++ +K+ D  V+ L +VR+VP LKRNLISL  LDS G  Y  + GV ++  GS +V+ G+ K   L++
Subjt:  TPFRAWFNTYREISGESVFIGNNNECNIAGIGSVTMKLKDKTVKLLRNVRHVPHLKRNLISLRMLDSLGCEYKGKCGVFQVFMGSKLVLVGE-KVNDLFI

Query:  IKGVEMIEEANTVLSLNLIEVDI---WHKRLSHISQKGLEALSKQGILP-QDICSKLSFCEHCVLGKTRKQNFTKAQHTTRGILDYIHLDLWGPASTPSL
        ++G  +  +A  V S +L + DI   WH RL H+S+ G+  LSK+G+L  Q IC KL+FCEHCV GK ++  FT+  H T+  L+YIH DLWGP+  PS 
Subjt:  IKGVEMIEEANTVLSLNLIEVDI---WHKRLSHISQKGLEALSKQGILP-QDICSKLSFCEHCVLGKTRKQNFTKAQHTTRGILDYIHLDLWGPASTPSL

Query:  SGS------------------RYLRTDNGLEFCGELFNNFCRQSGITRHKTVTYTPQQNRVAERLNKTIMERVRCLLSDVILKEKFWAEAAAYVMHTLNR
         G+                  +YLRTDNGLEFC + FN  C+  GI RH TV +TPQQN VAER+N+TIME+V+C+LS+  L + FWAEAA+     +NR
Subjt:  SGS------------------RYLRTDNGLEFCGELFNNFCRQSGITRHKTVTYTPQQNRVAERLNKTIMERVRCLLSDVILKEKFWAEAAAYVMHTLNR

Query:  SPHTSL--------------------VFGCVGYAHQNKGKLKVRAAKCMFIGFTEGVKGFKLWHPTNKRFIISKSSTEGNFDDTQSYTTQIEVENTGKSV
        SP  ++                    +FGC  YAH N GKL+ R+ KC+F+G+  GVKG+KLW P N++ +IS+      FD+T +    + ++++    
Subjt:  SPHTSL--------------------VFGCVGYAHQNKGKLKVRAAKCMFIGFTEGVKGFKLWHPTNKRFIISKSSTEGNFDDTQSYTTQIEVENTGKSV

Query:  QPTEEPTAIEQEQVENLSEEQDEMLEEQPDLSQYSLARDRQRRIIVPPARYAESNYISFVLN------ATVVPNDSEPSSFDEAVNSSNARQWIEAMNEE
           +    I  E    +S + +  +   P   QYS+A++R +R I PP +YAE++ +++ LN      A   P++ EPS++ EA++  ++ +W+ AM EE
Subjt:  QPTEEPTAIEQEQVENLSEEQDEMLEEQPDLSQYSLARDRQRRIIVPPARYAESNYISFVLN------ATVVPNDSEPSSFDEAVNSSNARQWIEAMNEE

Query:  INSLNVNDTWTLASLPKGCKPIASKWIFKLKEGITKNSQPRYKARLVAKGFTQREGIDYSEIFSPVVKQTSIRLLLSLVAQNNLELDQLDVKTTFLHDYL
        + SL+ N TW L  LPKG K +  KW+FK KEG  +  +P+YKARLVAKG++Q  G+D++++FSPVVK + IR LL +VA ++LEL+QLDVKT FLH  L
Subjt:  INSLNVNDTWTLASLPKGCKPIASKWIFKLKEGITKNSQPRYKARLVAKGFTQREGIDYSEIFSPVVKQTSIRLLLSLVAQNNLELDQLDVKTTFLHDYL

Query:  EETIYMVQPKGYEVQGKEDLYYLLKKSIYGLKQSPRCWYRRFDDFITSLGFQ------------------------------------------------
        EE IYM QP+G+ V  KED   LLKKS+YGLKQSPR WY+RFD F+TS  F+                                                
Subjt:  EETIYMVQPKGYEVQGKEDLYYLLKKSIYGLKQSPRCWYRRFDDFITSLGFQ------------------------------------------------

Query:  ---KNLRESKKILGIDIIRDRDQSTLSINQTSYCEKVIRRFNLTNARPVTLPIAHHFKLSAVNSP-SETDIEHKLQMKNVPYSQAVGSLMYLMISTRPDL
           K+L  +KKILG++I+RDR  S L ++Q  Y EK++ RFN+ +A+PV+ P+A HF+LS+  SP S+ +IE+   M +VPYS AVGSLMY M+ +RPDL
Subjt:  ---KNLRESKKILGIDIIRDRDQSTLSINQTSYCEKVIRRFNLTNARPVTLPIAHHFKLSAVNSP-SETDIEHKLQMKNVPYSQAVGSLMYLMISTRPDL

Query:  SYSASLVSKYMANSGRRHWEATKWIIRYLIWSKNARLNYQRTTEIELELIGYVDSDFAGDSDKRRSLTGYVFLYGRNLISWKAILQSIVALSTTEAEYIA
        SY+ S VS+YMAN G+ HW+A +WI+RYL  + +  L + RT   E  +IGYVD+DFAGD D+RRSLTGYVF  G   ISWKA LQ+ VALSTTEAEY+A
Subjt:  SYSASLVSKYMANSGRRHWEATKWIIRYLIWSKNARLNYQRTTEIELELIGYVDSDFAGDSDKRRSLTGYVFLYGRNLISWKAILQSIVALSTTEAEYIA

Query:  LSEG------------------SIVKIFYDNQSAIRLSKNPQYHSRTKHIDIKYHFIREKIEAGEIQVLKVHTSENAADMLTKSVSTLKLQKCFELIG
        ++E                    I  +F D+QSAI L+K+  +H RTKHID++YHF+R+ I  G+I V K+ T EN ADM+TKS+   K + C +L+G
Subjt:  LSEG------------------SIVKIFYDNQSAIRLSKNPQYHSRTKHIDIKYHFIREKIEAGEIQVLKVHTSENAADMLTKSVSTLKLQKCFELIG

TrEMBL top hitse value%identityAlignment
A0A2K3N065 Copia LTR rider2.5e-24138.67Show/hide
Query:  LGDFALWKAKIKALLGQQKSHKALLDPSELPTTLTATQKEEMKLNAYGTLILNLSDNIIRQVLEEETAYKVWMKLESLYASKDLPNKMYLREKNFTYRMD
        + DF LW+ K+KALL QQ   +AL   + +   LTA +K  M   A+  ++L+L D ++RQV +E TA  +W KLESLY +K L N++YL++  ++++M 
Subjt:  LGDFALWKAKIKALLGQQKSHKALLDPSELPTTLTATQKEEMKLNAYGTLILNLSDNIIRQVLEEETAYKVWMKLESLYASKDLPNKMYLREKNFTYRMD

Query:  PSKTLSENLDEFKKIVSDFKTLEDKLSDENEAL-------------------------------------KNQEKEK----------------------D
          K L+E LD F K++ D + ++ K+ DE++AL                                      N+ KE                       D
Subjt:  PSKTLSENLDEFKKIVSDFKTLEDKLSDENEAL-------------------------------------KNQEKEK----------------------D

Query:  SKGKQ---------------------------PE----------VSIVESSFTYTDALASTLDQANHVNPLGKHDWVLDSGCTYHMTPFRAWFNTYREIS
         KGK                            PE           +IV+  F  +D L  +   +       + +W++DSGCT+HMTP +  F    +  
Subjt:  SKGKQ---------------------------PE----------VSIVESSFTYTDALASTLDQANHVNPLGKHDWVLDSGCTYHMTPFRAWFNTYREIS

Query:  GESVFIGNNNECNIAGIGSVTMKLKDKTVKLLRNVRHVPHLKRNLISLRMLDSLGCEYKGKCGVFQVFMGSKLVLVGEKVNDLFIIKGVEMIEEANTVLS
        G SV +GNN  C IAG+GSV  KL D++++LL  VR+VP LKRNL+SL   D  G  ++G+  + +V  GSK VL G K   L+ ++   +    N V +
Subjt:  GESVFIGNNNECNIAGIGSVTMKLKDKTVKLLRNVRHVPHLKRNLISLRMLDSLGCEYKGKCGVFQVFMGSKLVLVGEKVNDLFIIKGVEMIEEANTVLS

Query:  LNLIEVDIWHKRLSHISQKGLEALSKQGILPQDICSKLSFCEHCVLGKTRKQNFTKAQHTTRGILDYIHLDLWGPASTPSLSGSRY--------------
          L + +IWH RL H+S++GL  L KQ +L  D   KL FCE CV GK+ +  F K +  T G LDYIH DLWGPA  PS SG+RY              
Subjt:  LNLIEVDIWHKRLSHISQKGLEALSKQGILPQDICSKLSFCEHCVLGKTRKQNFTKAQHTTRGILDYIHLDLWGPASTPSLSGSRY--------------

Query:  ------------------------------LRTDNGLEFCGELFNNFCRQSGITRHKTVTYTPQQNRVAERLNKTIMERVRCLLSDVILKEKFWAEAAAY
                                      LRTDNGLEFC E F+ FC  SGI RH+T   TPQQN +AER N+TI+ERVRC+L+   LK+ FWAEA + 
Subjt:  ------------------------------LRTDNGLEFCGELFNNFCRQSGITRHKTVTYTPQQNRVAERLNKTIMERVRCLLSDVILKEKFWAEAAAY

Query:  VMHTLNRSPHTSL--------------------VFGCVGYAHQNKGKLKVRAAKCMFIGFTEGVKGFKLW--HPTNKRFIISKSSTEGNFDDTQSYTTQI
          + +NR P T+L                    VFGCV YAH  + K++ RA KCMF+G+ EGVK ++LW   P +KR I S+       +     T  +
Subjt:  VMHTLNRSPHTSL--------------------VFGCVGYAHQNKGKLKVRAAKCMFIGFTEGVKGFKLW--HPTNKRFIISKSSTEGNFDDTQSYTTQI

Query:  --EVENTGKSVQPTEEPTAIEQ--------EQVENLSEEQDEMLEEQPDLSQYSLARDRQRRIIVPPARYAESNYISFVLNATVVPNDSEPSSFDEAVNS
            E + + ++  E P  +E         ++VE  +E  +E+ E   D   Y L+RDR RR+I  P R   ++ I++ L +     D EP  + E + S
Subjt:  --EVENTGKSVQPTEEPTAIEQ--------EQVENLSEEQDEMLEEQPDLSQYSLARDRQRRIIVPPARYAESNYISFVLNATVVPNDSEPSSFDEAVNS

Query:  SNARQWIEAMNEEINSLNVNDTWTLASLPKGCKPIASKWIFKLKEGITKNSQPRYKARLVAKGFTQREGIDYSEIFSPVVKQTSIRLLLSLVAQNNLELD
         N  +W++AM++E+ SL+ N TW L   P G + ++ KWIFK+KEGI   +  RYKARLVA+GFTQ+EG+D++++FSPVVK  SIR+LL++VAQ +LEL+
Subjt:  SNARQWIEAMNEEINSLNVNDTWTLASLPKGCKPIASKWIFKLKEGITKNSQPRYKARLVAKGFTQREGIDYSEIFSPVVKQTSIRLLLSLVAQNNLELD

Query:  QLDVKTTFLHDYLEETIYMVQPKGYEVQGKEDLYYLLKKSIYGLKQSPRCWYRRFDDFITSLGF------------------------------------
        Q+DVKT FL+  L+ETI M QP+GY  +GKED    LK+S+YGLKQSPR W RRFD F+  + F                                    
Subjt:  QLDVKTTFLHDYLEETIYMVQPKGYEVQGKEDLYYLLKKSIYGLKQSPRCWYRRFDDFITSLGF------------------------------------

Query:  ---------------QKNLRESKKILGIDIIRDRDQSTLSINQTSYCEKVIRRFNLTNARPVTLPIAHHFKLSAVNSPSETDIEHKLQMKNVPYSQAVGS
                        K+L  + +ILGIDI RDR +S L ++Q +Y  K++ +F ++N++PV  P    FKLS    PS TD+E +  M ++PY+  VGS
Subjt:  ---------------QKNLRESKKILGIDIIRDRDQSTLSINQTSYCEKVIRRFNLTNARPVTLPIAHHFKLSAVNSPSETDIEHKLQMKNVPYSQAVGS

Query:  LMYLMISTRPDLSYSASLVSKYMANSGRRHWEATKWIIRYLIWSKNARLNYQRTTEIELELI--GYVDSDFAGDSDKRRSLTGYVFLYGRNLISWKAILQ
        LMY M+ TRPD++Y+ SLVS+YMAN G+ HW+A KWI+RY+  S N  L Y      + + +  GYVDSD+AG  D R+S++GYVF      ISWKA LQ
Subjt:  LMYLMISTRPDLSYSASLVSKYMANSGRRHWEATKWIIRYLIWSKNARLNYQRTTEIELELI--GYVDSDFAGDSDKRRSLTGYVFLYGRNLISWKAILQ

Query:  SIVALSTTEAEYIALSE------------------GSIVKIFYDNQSAIRLSKNPQYHSRTKHIDIKYHFIREKIEAGEIQVLKVHTSENAADMLTKSVS
         +VALSTTEAEYIAL+E                  G  + +  D+QSAI LSKN  YH RTKHID++ HF+R  IE GE+QVLKV T +NAADM+TK++ 
Subjt:  SIVALSTTEAEYIALSE------------------GSIVKIFYDNQSAIRLSKNPQYHSRTKHIDIKYHFIREKIEAGEIQVLKVHTSENAADMLTKSVS

Query:  TLKLQKCFELI
        + K   C +LI
Subjt:  TLKLQKCFELI

A0A2P5W031 Uncharacterized protein1.1e-24439.47Show/hide
Query:  FALWKAKIKALLGQQKSHKALLDPSELPTTLTATQKEEMKLNAYGTLILNLSDNIIRQVLEEETAYKVWMKLESLYASKDLPNKMYLREKNFTYRMDPSK
        F+LW+ K++A+L QQ   KAL    +LP+TL+  QK++M   A+  ++L L D ++R+V +E+TA  +W++LES Y +K L N++YL+++ +  +M+   
Subjt:  FALWKAKIKALLGQQKSHKALLDPSELPTTLTATQKEEMKLNAYGTLILNLSDNIIRQVLEEETAYKVWMKLESLYASKDLPNKMYLREKNFTYRMDPSK

Query:  TLSENLDEFKKIVSDFKTLEDKLSDENE---------------------------------ALKNQEKEKDSKGKQPEVSIVE--------------SSF
         +S++LD+F  I+ D   +++K+ DE++                                 AL + E  K   GK  E +  E              SS 
Subjt:  TLSENLDEFKKIVSDFKTLEDKLSDENE---------------------------------ALKNQEKEKDSKGKQPEVSIVE--------------SSF

Query:  TY------------------------------TDALASTLDQANHVNPLGKHD---------------WVLDSGCTYHMTPFRAWFNTYREISGESVFIG
        ++                              +++     D+AN  +     D               W+LD+G T+H++  +  F+TY + SG SV +G
Subjt:  TY------------------------------TDALASTLDQANHVNPLGKHD---------------WVLDSGCTYHMTPFRAWFNTYREISGESVFIG

Query:  NNNECNIAGIGSVTMKLKDKTVKLLRNVRHVPHLKRNLISLRMLDSLGCEYKGKCGVFQVFMGSKLVLVGEKVNDLFIIKGVEMIEEANTVLS--LNLIE
        N++ C + GIG+V +K+ D  V+ L +VRH+P +K+NLISL  LD  G  Y  + GV +VF G+  V+ G     L+ + G  +   A    S  L+   
Subjt:  NNNECNIAGIGSVTMKLKDKTVKLLRNVRHVPHLKRNLISLRMLDSLGCEYKGKCGVFQVFMGSKLVLVGEKVNDLFIIKGVEMIEEANTVLS--LNLIE

Query:  VDIWHKRLSHISQKGLEALSKQGILPQDICSKLSFCEHCVLGKTRKQNFTKAQHTTRGILDYIHLDLWGPASTPSLSGSRYL------------------
          +WH RL H+S++GL  LSK+G+L      KL+FCEHCV GK  +  F+   H T+G +DY H DLWGP+ T S  G RYL                  
Subjt:  VDIWHKRLSHISQKGLEALSKQGILPQDICSKLSFCEHCVLGKTRKQNFTKAQHTTRGILDYIHLDLWGPASTPSLSGSRYL------------------

Query:  --------------------------RTDNGLEFCGELFNNFCRQSGITRHKTVTYTPQQNRVAERLNKTIMERVRCLLSDVILKEKFWAEAAAYVMHTL
                                  RTDNGLEFC   FN FC+  GI RH+TV  TPQQN VAER+N+T++ER RC+ S+  L E+FWAEA     + +
Subjt:  --------------------------RTDNGLEFCGELFNNFCRQSGITRHKTVTYTPQQNRVAERLNKTIMERVRCLLSDVILKEKFWAEAAAYVMHTL

Query:  NRSPHTSL--------------------VFGCVGYAHQNKGKLKVRAAKCMFIGFTEGVKGFKLW--HPTNKRFIISKSSTEGNFDDTQSYTTQIEVENT
        NRSP T++                    VFGC  YAH N+GKLK RA KC+F+G+ +GVKG++LW   P + +FIIS+  T       +S T   E E +
Subjt:  NRSPHTSL--------------------VFGCVGYAHQNKGKLKVRAAKCMFIGFTEGVKGFKLW--HPTNKRFIISKSSTEGNFDDTQSYTTQIEVENT

Query:  G--------KSVQ-------PTEEPTAIEQEQVENLSEEQDEMLEEQPDLSQYSLARDRQRRIIVPPARYAESNYISFVLNATVVPNDSEPSSFDEAVNS
                 K V+       PT E T+++ +QVE    ++DE  +E+P    YS+A  R +R I P  RYA  N +SF L +       EPSS++EAV  
Subjt:  G--------KSVQ-------PTEEPTAIEQEQVENLSEEQDEMLEEQPDLSQYSLARDRQRRIIVPPARYAESNYISFVLNATVVPNDSEPSSFDEAVNS

Query:  SNARQWIEAMNEEINSLNVNDTWTLASLPKGCKPIASKWIFKLKEGITKNSQPRYKARLVAKGFTQREGIDYSEIFSPVVKQTSIRLLLSLVAQNNLELD
          + QW  AM+EEI SL+ N TW L   P   K +  KW+FK KEGI      R+KARLVAKGFTQ+EGIDY+E+FSPVVK +SIR+LL++VA+++LEL+
Subjt:  SNARQWIEAMNEEINSLNVNDTWTLASLPKGCKPIASKWIFKLKEGITKNSQPRYKARLVAKGFTQREGIDYSEIFSPVVKQTSIRLLLSLVAQNNLELD

Query:  QLDVKTTFLHDYLEETIYMVQPKGYEVQGKEDLYYLLKKSIYGLKQSPRCWYRRFDDFITSLGF------------------------------------
        QLDVKT FLH  LEETIYM QP+G+ V GKED   LLKKS+YGLKQSPR WY+RFD F+   G+                                    
Subjt:  QLDVKTTFLHDYLEETIYMVQPKGYEVQGKEDLYYLLKKSIYGLKQSPRCWYRRFDDFITSLGF------------------------------------

Query:  ---------------QKNLRESKKILGIDIIRDRDQSTLSINQTSYCEKVIRRFNLTNARPVTLPIAHHFKLSAVNSPSETDIEHKLQMKNVPYSQAVGS
                        K+L  +KKILG+DI RDR    L ++Q +Y EKV++RF +  A+ V+ P+A HFKLSA  SP     E K QM ++PYS AVGS
Subjt:  ---------------QKNLRESKKILGIDIIRDRDQSTLSINQTSYCEKVIRRFNLTNARPVTLPIAHHFKLSAVNSPSETDIEHKLQMKNVPYSQAVGS

Query:  LMYLMISTRPDLSYSASLVSKYMANSGRRHWEATKWIIRYLIWSKNARLNYQRTTEIELELIGYVDSDFAGDSDKRRSLTGYVFLYGRNLISWKAILQSI
        +MY M+ TRPD+S++ S+VS+YM+  G+ HW+A KWI+RYL  S +  L Y + ++    + GYVDSD+AGD DKRRSLTGYVF Y    ISWKA+LQS 
Subjt:  LMYLMISTRPDLSYSASLVSKYMANSGRRHWEATKWIIRYLIWSKNARLNYQRTTEIELELIGYVDSDFAGDSDKRRSLTGYVFLYGRNLISWKAILQSI

Query:  VALSTTEAEYIALSEG------------------SIVKIFYDNQSAIRLSKNPQYHSRTKHIDIKYHFIREKIEAGEIQVLKVHTSENAADMLTKSVSTL
        VALSTTEAEY+AL+E                       +F D+QSAI L+KN  +H RTKHID++YHF+RE +  G+I + KV T +N ADMLTK +   
Subjt:  VALSTTEAEYIALSEG------------------SIVKIFYDNQSAIRLSKNPQYHSRTKHIDIKYHFIREKIEAGEIQVLKVHTSENAADMLTKSVSTL

Query:  KLQKCFELIG
        K + C +LIG
Subjt:  KLQKCFELIG

A0A2P5YYC3 Uncharacterized protein7.0e-24439.42Show/hide
Query:  FALWKAKIKALLGQQKSHKALLDPSELPTTLTATQKEEMKLNAYGTLILNLSDNIIRQVLEEETAYKVWMKLESLYASKDLPNKMYLREKNFTYRMDPSK
        F+LW+ K++A+L QQ   KAL    +LP+TL+  QK++M   A+  ++L L D ++R+V +E+TA  +W++LES Y +K L N++YL+++ +  +M+   
Subjt:  FALWKAKIKALLGQQKSHKALLDPSELPTTLTATQKEEMKLNAYGTLILNLSDNIIRQVLEEETAYKVWMKLESLYASKDLPNKMYLREKNFTYRMDPSK

Query:  TLSENLDEFKKIVSDFKTLEDKLSDENE---------------------------------ALKNQEKEKDSKGKQPEVSIVE--------------SSF
         +S++LD+F  I+ D   +++K+ DE++                                 AL + E  K   GK  E +  E              SS 
Subjt:  TLSENLDEFKKIVSDFKTLEDKLSDENE---------------------------------ALKNQEKEKDSKGKQPEVSIVE--------------SSF

Query:  TY------------------------------TDALASTLDQANHVNPLGKHD---------------WVLDSGCTYHMTPFRAWFNTYREISGESVFIG
        ++                              +++     D+AN  +     D               W+LD+G T+H++  +  F+TY + SG SV +G
Subjt:  TY------------------------------TDALASTLDQANHVNPLGKHD---------------WVLDSGCTYHMTPFRAWFNTYREISGESVFIG

Query:  NNNECNIAGIGSVTMKLKDKTVKLLRNVRHVPHLKRNLISLRMLDSLGCEYKGKCGVFQVFMGSKLVLVGEKVNDLFIIKGVEMIEEANTVLS--LNLIE
        N++ C + GIG+V +K+ D  V+ L +VRH+P +K+NLISL  LD  G  Y  + GV +VF G+  V+ G     L+ + G  +   A    S  L+   
Subjt:  NNNECNIAGIGSVTMKLKDKTVKLLRNVRHVPHLKRNLISLRMLDSLGCEYKGKCGVFQVFMGSKLVLVGEKVNDLFIIKGVEMIEEANTVLS--LNLIE

Query:  VDIWHKRLSHISQKGLEALSKQGILPQDICSKLSFCEHCVLGKTRKQNFTKAQHTTRGILDYIHLDLWGPASTPSLSGSRYL------------------
          +WH RL H+S++GL  LSK+G+L      KL+FCEHCV GK  +  F+   H T+G +DY H DLWGP+ T S  G RYL                  
Subjt:  VDIWHKRLSHISQKGLEALSKQGILPQDICSKLSFCEHCVLGKTRKQNFTKAQHTTRGILDYIHLDLWGPASTPSLSGSRYL------------------

Query:  --------------------------RTDNGLEFCGELFNNFCRQSGITRHKTVTYTPQQNRVAERLNKTIMERVRCLLSDVILKEKFWAEAAAYVMHTL
                                  RTDNGLEFC   FN FC+  GI RH+TV  TPQQN VAER+N+T++ER RC+ S+  L E+FWAEA     + +
Subjt:  --------------------------RTDNGLEFCGELFNNFCRQSGITRHKTVTYTPQQNRVAERLNKTIMERVRCLLSDVILKEKFWAEAAAYVMHTL

Query:  NRSPHTSL--------------------VFGCVGYAHQNKGKLKVRAAKCMFIGFTEGVKGFKLW--HPTNKRFIISKSSTEGNFDDTQSYTTQIEVENT
        NRSP T++                    VFGC  YAH N+GKLK RA KC+F+G+ +GVKG++LW   P + +FIIS+  T       +S T   E E +
Subjt:  NRSPHTSL--------------------VFGCVGYAHQNKGKLKVRAAKCMFIGFTEGVKGFKLW--HPTNKRFIISKSSTEGNFDDTQSYTTQIEVENT

Query:  G--------KSVQ-------PTEEPTAIEQEQVENLSEEQDEMLEEQPDLSQYSLARDRQRRIIVPPARYAESNYISFVLNATVVPNDSEPSSFDEAVNS
                 K V+       PT E T+++ +QVE    ++DE  +E+P    YS+A  R +R I P  RYA  N +SF L +       EPSS++EAV  
Subjt:  G--------KSVQ-------PTEEPTAIEQEQVENLSEEQDEMLEEQPDLSQYSLARDRQRRIIVPPARYAESNYISFVLNATVVPNDSEPSSFDEAVNS

Query:  SNARQWIEAMNEEINSLNVNDTWTLASLPKGCKPIASKWIFKLKEGITKNSQPRYKARLVAKGFTQREGIDYSEIFSPVVKQTSIRLLLSLVAQNNLELD
          + QW  AM+EEI SL+ N TW L   P   K +  KW+FK KEGI      R+KARLVAKGFTQ+EGIDY+E+FSPVVK +SIR+LL++VA+++LEL+
Subjt:  SNARQWIEAMNEEINSLNVNDTWTLASLPKGCKPIASKWIFKLKEGITKNSQPRYKARLVAKGFTQREGIDYSEIFSPVVKQTSIRLLLSLVAQNNLELD

Query:  QLDVKTTFLHDYLEETIYMVQPKGYEVQGKEDLYYLLKKSIYGLKQSPRCWYRRFDDFITSLGF------------------------------------
        QLDVKT FLH  LEETIYM QP+G+ V GKED   LLKKS+YGLKQSPR WY+RFD F+   G+                                    
Subjt:  QLDVKTTFLHDYLEETIYMVQPKGYEVQGKEDLYYLLKKSIYGLKQSPRCWYRRFDDFITSLGF------------------------------------

Query:  ---------------QKNLRESKKILGIDIIRDRDQSTLSINQTSYCEKVIRRFNLTNARPVTLPIAHHFKLSAVNSPSETDIEHKLQMKNVPYSQAVGS
                        K+L  +KKILG+DI RDR    L ++Q +Y EKV++RF +  A+ V+ P+A HFKLSA  SP     E K QM ++PYS AVGS
Subjt:  ---------------QKNLRESKKILGIDIIRDRDQSTLSINQTSYCEKVIRRFNLTNARPVTLPIAHHFKLSAVNSPSETDIEHKLQMKNVPYSQAVGS

Query:  LMYLMISTRPDLSYSASLVSKYMANSGRRHWEATKWIIRYLIWSKNARLNYQRTTEIELELIGYVDSDFAGDSDKRRSLTGYVFLYGRNLISWKAILQSI
        +MY M+ TRPD+S++ S+VS+YM+  G+ HW+A KWI+RYL  S +  L Y + ++    + GYVDSD+AGD DKRRSLTGYVF Y    ISWKA+LQS 
Subjt:  LMYLMISTRPDLSYSASLVSKYMANSGRRHWEATKWIIRYLIWSKNARLNYQRTTEIELELIGYVDSDFAGDSDKRRSLTGYVFLYGRNLISWKAILQSI

Query:  VALSTTEAEYIALSEG------------------SIVKIFYDNQSAIRLSKNPQYHSRTKHIDIKYHFIREKIEAGEIQVLKVHTSENAADMLTKSVSTL
        VALSTTEAEY+AL+E                       +F D+QSAI L+KN  +H RTKHID++YHF+RE +  G+I + KV T +N ADMLTK +   
Subjt:  VALSTTEAEYIALSEG------------------SIVKIFYDNQSAIRLSKNPQYHSRTKHIDIKYHFIREKIEAGEIQVLKVHTSENAADMLTKSVSTL

Query:  KLQKCFELI
        K + C +LI
Subjt:  KLQKCFELI

A0A438IR25 Retrovirus-related Pol polyprotein from transposon TNT 1-948.9e-23940.15Show/hide
Query:  DFALWKAKIKALLGQQKSHKALLDPSELPTTLTATQKEEMKLNAYGTLILNLSDNIIRQVLEEETAYKVWMKLESLYASKDLPNKMYLREKNFTYRMDPS
        DF LW+ K++ALL QQ  H ALL    LP+T+   +K ++   A+  +IL+L D ++R+V + E+  +VW+KLESLY +K L N+++ + K +T++M P 
Subjt:  DFALWKAKIKALLGQQKSHKALLDPSELPTTLTATQKEEMKLNAYGTLILNLSDNIIRQVLEEETAYKVWMKLESLYASKDLPNKMYLREKNFTYRMDPS

Query:  KTLSENLDEFKKIVSDFKTLEDKLSDENEAL-----------------------------KNQEKEKDSKGKQPEVSI------------------VESS
         ++ E+LD F KI+ D + ++  +SDE++A+                             KN +    SK K+ +  I                  ++ +
Subjt:  KTLSENLDEFKKIVSDFKTLEDKLSDENEAL-----------------------------KNQEKEKDSKGKQPEVSI------------------VESS

Query:  FTYTDA--LASTLDQANHVN----PLGKHDWVLDSGCTYHMTPFRAWFNTYREISGESVFIGNNNECNIAGIGSVTMKLKDKTVKLLRNVRHVPHLKRNL
            DA  +    D A  +N      GK +W+LDSGC++HM P +AWF  ++E  G  V +GNN  C I G G+V +K  D   ++L +VR++P LKRNL
Subjt:  FTYTDA--LASTLDQANHVN----PLGKHDWVLDSGCTYHMTPFRAWFNTYREISGESVFIGNNNECNIAGIGSVTMKLKDKTVKLLRNVRHVPHLKRNL

Query:  ISLRMLDSLGCEYKGKCGVFQVFMGSKLVLVGEKVNDLFIIKGVEMIEEANTVLSLNLIEVDIWHKRLSHISQKGLEALSKQGILPQDICSKLSFCEHCV
        ISL MLD  G  +K +    +V  GS  V+     N L+ + G  +I++A+TVL  ++    +WH+RL H+S KGL+ L KQG+L     + L FCEHCV
Subjt:  ISLRMLDSLGCEYKGKCGVFQVFMGSKLVLVGEKVNDLFIIKGVEMIEEANTVLSLNLIEVDIWHKRLSHISQKGLEALSKQGILPQDICSKLSFCEHCV

Query:  LGKTRKQNFTKAQHTTRGILDYIHLDLWGPASTPSLSGS----RYLRTDNGLEFCGELFNNFCRQSGITRHKTVTYTPQQNRVAERLNKTIMERVRCLLS
         GK  +  F KA H T+  LDYIH DLWGP+  PS+ G+    + LRTDNGLEF    FN+FC++ GI  H+TV YTPQQN +AER+N+TI+ER+RC+LS
Subjt:  LGKTRKQNFTKAQHTTRGILDYIHLDLWGPASTPSLSGS----RYLRTDNGLEFCGELFNNFCRQSGITRHKTVTYTPQQNRVAERLNKTIMERVRCLLS

Query:  DVILKEKFWAEAAAYVMHTLNRSPHTSL--------------------VFGCVGYAHQNKGKLKVRAAKCMFIGFTEGVKGFKLWHPTNKR--FIISKSS
           L + FWAEAA  V+H +NRSP ++L                    VFGC  Y H    KL+ RA KC+F+G+ +GVKG+KLW  T  +   IIS+  
Subjt:  DVILKEKFWAEAAAYVMHTLNRSPHTSL--------------------VFGCVGYAHQNKGKLKVRAAKCMFIGFTEGVKGFKLWHPTNKR--FIISKSS

Query:  TEGNFDDTQSYTTQIEVENTGKSVQPTEEPTAIEQEQVENLSE-EQDEMLEEQPD-----LSQYSLARDRQRRIIVPPARYAESNYISFVLNATVVPNDS
        T  N  D    T   +VE + +     E  T   ++  E  S+  Q+E++ E+ +     L  Y+L RDRQ+R + PP RY +    +F L+      D 
Subjt:  TEGNFDDTQSYTTQIEVENTGKSVQPTEEPTAIEQEQVENLSE-EQDEMLEEQPD-----LSQYSLARDRQRRIIVPPARYAESNYISFVLNATVVPNDS

Query:  EPSSFDEAVNSSNARQWIEAMNEEINSLNVNDTWTLASLPKGCKPIASKWIFKLKEGITKNSQPRYKARLVAKGFTQREGIDYSEIFSPVVKQTSIRLLL
        EP ++ EA+NS+   QW++A+ EE++SL  N+TW L + PK  K + SKW+FK K+G   N  PRYKARLVAKGF+Q+EG+DY+EIFSPVVK +SIRLLL
Subjt:  EPSSFDEAVNSSNARQWIEAMNEEINSLNVNDTWTLASLPKGCKPIASKWIFKLKEGITKNSQPRYKARLVAKGFTQREGIDYSEIFSPVVKQTSIRLLL

Query:  SLVAQNNLELDQLDVKTTFLHDYLEETIYMVQPKGYEVQ-------------------GKEDLYYLLKKSIYGLKQSPRCWYRR----FDDFITSLGFQK
        + VA  +LELDQLDVKT FLH  L+E IYM   K ++                      ++++ YLL   +Y       C  +R      + + +    K
Subjt:  SLVAQNNLELDQLDVKTTFLHDYLEETIYMVQPKGYEVQ-------------------GKEDLYYLLKKSIYGLKQSPRCWYRR----FDDFITSLGFQK

Query:  NLRESKKILGIDIIRDRDQSTLSINQTSYCEKVIRRFNLTNARPVTLPIAHHFKLSAVNSPSETDIEHKLQMKNVPYSQAVGSLMYLMISTRPDLSYSAS
        +L  +K+ILG++I RDR +  L ++Q SY  KV+ RF + N + V+  +  HF+LS   +P ET  E K  M+ +PY+  VGS+MY M+ +RPDL+Y+ S
Subjt:  NLRESKKILGIDIIRDRDQSTLSINQTSYCEKVIRRFNLTNARPVTLPIAHHFKLSAVNSPSETDIEHKLQMKNVPYSQAVGSLMYLMISTRPDLSYSAS

Query:  LVSKYMANSGRRHWEATKWIIRYLIWSKNARLNYQRTTEIELELIGYVDSDFAGDSDKRRSLTGYVFLYGRNLISWKAILQSIVALSTTEAEYIALSE--
        ++S+YM+  G+ HW+A KW+ +YL  +++  L Y   +++  +L G+VD+D+AG+ D ++SLTGYVF+     +SWKA LQS+VALSTTEAEY+A++E  
Subjt:  LVSKYMANSGRRHWEATKWIIRYLIWSKNARLNYQRTTEIELELIGYVDSDFAGDSDKRRSLTGYVFLYGRNLISWKAILQSIVALSTTEAEYIALSE--

Query:  -----------------GSIVKIFYDNQSAIRLSKNPQYHSRTKHIDIKYHFIREKIEAGEIQVLKVHTSENAADMLTKSVSTLKLQKCFELI
                         G +V ++ DNQSAI L+KN  +H R+KHI+++ HF+R+ I AGEI V KV T +N +DMLTKS++  K + C  LI
Subjt:  -----------------GSIVKIFYDNQSAIRLSKNPQYHSRTKHIDIKYHFIREKIEAGEIQVLKVHTSENAADMLTKSVSTLKLQKCFELI

A0A5A7UB25 Putative gag-pol polyprotein8.9e-24739.09Show/hide
Query:  NKLGDFALWKAKIKALLGQQKSHKALLDPSELPTTLTATQKEEMKLNAYGTLILNLSDNIIRQVLEEETAYKVWMKLESLYASKDLPNKMYLREKNFTYR
        N  GDF+LW+ KI+A+L Q K  K +LD   LP  +T ++K +M   AY T++L LSD ++R V E  T  ++W KLESLY +K L NK+Y++EK F Y+
Subjt:  NKLGDFALWKAKIKALLGQQKSHKALLDPSELPTTLTATQKEEMKLNAYGTLILNLSDNIIRQVLEEETAYKVWMKLESLYASKDLPNKMYLREKNFTYR

Query:  MDPSKTLSENLDEFKKIVSDFKTLEDKLSDENEAL-----------------------------------KNQEKEKDSK--------------------
        MD SK+L ENLDEF+KIV D   + +K+SDEN+A+                                   +N E +K+ K                    
Subjt:  MDPSKTLSENLDEFKKIVSDFKTLEDKLSDENEAL-----------------------------------KNQEKEKDSK--------------------

Query:  -----------------------------GKQPEVSIVESSFT--YTDA-------LASTLDQANHVNPLGKHD----WVLDSGCTYHMTPFRAWFNTYR
                                      K  E S  E++ T  Y  A        A T  ++  V  +   D    W++DSGCT+HMTP R +   ++
Subjt:  -----------------------------GKQPEVSIVESSFT--YTDA-------LASTLDQANHVNPLGKHD----WVLDSGCTYHMTPFRAWFNTYR

Query:  EISGESVFIGNNNECNIAGIGSVTMKLKDKTVKLLRNVRHVPHLKRNLISLRMLDSLGCEYKGKCGVFQVFMGSKLVLVGEKVNDLFIIKGVEMIEEANT
        ++ G  V +G+N  C++ G GSV +   D  V++L NVR+VP LKRNLISL  LD  GC  K + GV +V  GS + L G   + L++++G  +   A  
Subjt:  EISGESVFIGNNNECNIAGIGSVTMKLKDKTVKLLRNVRHVPHLKRNLISLRMLDSLGCEYKGKCGVFQVFMGSKLVLVGEKVNDLFIIKGVEMIEEANT

Query:  VLSLNLIEVD-IWHKRLSHISQKGLEALSKQGILPQDICSKLSFCEHCVLGKTRKQNFTKAQHTTRGILDYIHLDLWGPASTPSLSGSR-----------
        + S  + ++  +WHKRL+H+S++GL+ALS+QG+L      +L FCEHC++GK+ +  F K +HTT+GILDY+H DLWGP    S+ GSR           
Subjt:  VLSLNLIEVD-IWHKRLSHISQKGLEALSKQGILPQDICSKLSFCEHCVLGKTRKQNFTKAQHTTRGILDYIHLDLWGPASTPSLSGSR-----------

Query:  ---------------------------------YLRTDNGLEFCGELFNNFCRQSGITRHKTVTYTPQQNRVAERLNKTIMERVRCLLSDVILKEKFWAE
                                         YLRTDNGLEF    FN FC+  GITRH TVTYTPQQN +AER N+TIMER RCLL++  L  KFW E
Subjt:  ---------------------------------YLRTDNGLEFCGELFNNFCRQSGITRHKTVTYTPQQNRVAERLNKTIMERVRCLLSDVILKEKFWAE

Query:  AAAYVMHTLNRSPHTSL--------------------VFGCVGYAHQNKGKLKVRAAKCMFIGFTEGVKGFKLW--HPTNKRFIISKSSTEGNFDDTQ--
        AA    + +NRSP T+L                    VFGC  YAH   GKL  RA KCMFIG+ +GVKG+KLW       + IIS+  T   F++T+  
Subjt:  AAAYVMHTLNRSPHTSL--------------------VFGCVGYAHQNKGKLKVRAAKCMFIGFTEGVKGFKLW--HPTNKRFIISKSSTEGNFDDTQ--

Query:  -----------------------SYTTQIEVENTGKSVQPTEEPTAIE----QEQVENLSEEQDEMLEEQP---DLSQYSLARDRQRRIIVPPARYAESN
                                    I+++N    V   E+    E    Q Q E +  ++   +EE     DL  Y L RDR +R    P RY  ++
Subjt:  -----------------------SYTTQIEVENTGKSVQPTEEPTAIE----QEQVENLSEEQDEMLEEQP---DLSQYSLARDRQRRIIVPPARYAESN

Query:  YISFVLNATVVPNDSEPSSFDEAVNSSNARQWIEAMNEEINSLNVNDTWTLASLPKGCKPIASKWIFKLKEGITKNSQPRYKARLVAKGFTQREGIDYSE
         +++ L       ++EP +F+EA+ S + +QW +AM EE+ SL+ N TW+L   P   K I SKWI+K+K G   NS+PRYKARLVAKG+TQ+EG+D+ E
Subjt:  YISFVLNATVVPNDSEPSSFDEAVNSSNARQWIEAMNEEINSLNVNDTWTLASLPKGCKPIASKWIFKLKEGITKNSQPRYKARLVAKGFTQREGIDYSE

Query:  IFSPVVKQTSIRLLLSLVAQNNLELDQLDVKTTFLHDYLEETIYMVQPKGYEVQGKEDLYYLLKKSIYGLKQSPRCWYRRFDDFITSLGFQKN-------
        IFSPVV+ +SIRL+LS+    ++ ++Q+DV T FLH  LEE IYM QPKGYEV+GKED+   L KS+YGLKQSPR WY RFD FI   GF +N       
Subjt:  IFSPVVKQTSIRLLLSLVAQNNLELDQLDVKTTFLHDYLEETIYMVQPKGYEVQGKEDLYYLLKKSIYGLKQSPRCWYRRFDDFITSLGFQKN-------

Query:  --------------------------------------------LRESKKILGIDIIRDRDQSTLSINQTSYCEKVIRRFNLTNARPVTLPIAHHFKLSA
                                                    L E K+ILG+D+ RD+++  L+I+Q SY  K++ ++N+++++ V+ P+A HF+LS+
Subjt:  --------------------------------------------LRESKKILGIDIIRDRDQSTLSINQTSYCEKVIRRFNLTNARPVTLPIAHHFKLSA

Query:  VNSPSETDIEHKLQMKNVPYSQAVGSLMYLMISTRPDLSYSASLVSKYMANSGRRHWEATKWIIRYLIWSKNARLNYQRTTEIELELIGYVDSDFAGDSD
           P     + +++M N+PY  AVGS+MYLMI TRPDL Y+ S++S++M+N G+ HW+A KW++RYL  S +  L Y R  +    L G+ D+D+A D D
Subjt:  VNSPSETDIEHKLQMKNVPYSQAVGSLMYLMISTRPDLSYSASLVSKYMANSGRRHWEATKWIIRYLIWSKNARLNYQRTTEIELELIGYVDSDFAGDSD

Query:  KRRSLTGYVF-LYGRNLISWKAILQSIVALSTTEAEYIALSE------------GSIVK------IFYDNQSAIRLSKNPQYHSRTKHIDIKYHFIREKI
        KRRSL+G++F LYG N++SWK  LQ +VALSTTE+EYI+L E            G ++       I  D+QSAI L+KNP +H R+KHID+K+H+IR  I
Subjt:  KRRSLTGYVF-LYGRNLISWKAILQSIVALSTTEAEYIALSE------------GSIVK------IFYDNQSAIRLSKNPQYHSRTKHIDIKYHFIREKI

Query:  EAGEIQVLKVHTSENAADMLTKSVS
           +++++KVHT EN +DMLTK++S
Subjt:  EAGEIQVLKVHTSENAADMLTKSVS

SwissProt top hitse value%identityAlignment
P04146 Copia protein9.2e-10026.94Show/hide
Query:  DSKGKQPEVSIVESSFTYTDALASTLDQANHVNPLGKHDWVLDSGCTYHMTPFRAWFNTYREISGE-SVFIGNNNECNIAGIGSVTMKLKDKTVKLLRNV
        ++K K+ E  +  ++   +  +A  + + N+ + +    +VLDSG + H+    + +    E+     + +    E   A    + ++L++     L +V
Subjt:  DSKGKQPEVSIVESSFTYTDALASTLDQANHVNPLGKHDWVLDSGCTYHMTPFRAWFNTYREISGE-SVFIGNNNECNIAGIGSVTMKLKDKTVKLLRNV

Query:  RHVPHLKRNLISLRMLDSLGCEYK-GKCGVFQVFMGSKLVLVGEKVNDLFIIKGVEMIEEANTVLSLNLIEVDIWHKRLSHISQ-KGLEALSKQGILPQD
                NL+S++ L   G   +  K GV     G  +V     +N++ +I       +A ++ + +     +WH+R  HIS  K LE   K     Q 
Subjt:  RHVPHLKRNLISLRMLDSLGCEYK-GKCGVFQVFMGSKLVLVGEKVNDLFIIKGVEMIEEANTVLSLNLIEVDIWHKRLSHISQ-KGLEALSKQGILPQD

Query:  ICSKL----SFCEHCVLGKTRKQNFTKAQHTT--RGILDYIHLDLWGPASTPSLSGSR------------------------------------------
        + + L      CE C+ GK  +  F + +  T  +  L  +H D+ GP +  +L                                              
Subjt:  ICSKL----SFCEHCVLGKTRKQNFTKAQHTT--RGILDYIHLDLWGPASTPSLSGSR------------------------------------------

Query:  --YLRTDNGLEFCGELFNNFCRQSGITRHKTVTYTPQQNRVAERLNKTIMERVRCLLSDVILKEKFWAEAAAYVMHTLNRSPHTSL--------------
          YL  DNG E+       FC + GI+ H TV +TPQ N V+ER+ +TI E+ R ++S   L + FW EA     + +NR P  +L              
Subjt:  --YLRTDNGLEFCGELFNNFCRQSGITRHKTVTYTPQQNRVAERLNKTIMERVRCLLSDVILKEKFWAEAAAYVMHTLNRSPHTSL--------------

Query:  --------VFGCVGYAH--QNKGKLKVRAAKCMFIGFTEGVKGFKLWHPTNKRFII--------------------------SKSSTEGNFDDTQSYTTQ
                VFG   Y H    +GK   ++ K +F+G+     GFKLW   N++FI+                          SK S   NF +      Q
Subjt:  --------VFGCVGYAH--QNKGKLKVRAAKCMFIGFTEGVKGFKLWHPTNKRFII--------------------------SKSSTEGNFDDTQSYTTQ

Query:  IEVENTGKSVQPTE---------------EPTAIEQEQVENLSEEQD--EMLEEQPDLSQYSLARDRQRR------------------------------
         E  N  K     +               +   I Q +  N S+E D  + L++  + ++Y L   ++R+                              
Subjt:  IEVENTGKSVQPTE---------------EPTAIEQEQVENLSEEQD--EMLEEQPDLSQYSLARDRQRR------------------------------

Query:  -----------II---------VPPARYAE--SNYISFVLNATVVPNDSEPSSFDEAVNSSNARQWIEAMNEEINSLNVNDTWTLASLPKGCKPIASKWI
                   II          P   Y E  ++    VLNA  + ND  P+SFDE     +   W EA+N E+N+  +N+TWT+   P+    + S+W+
Subjt:  -----------II---------VPPARYAE--SNYISFVLNATVVPNDSEPSSFDEAVNSSNARQWIEAMNEEINSLNVNDTWTLASLPKGCKPIASKWI

Query:  FKLKEGITKNSQPRYKARLVAKGFTQREGIDYSEIFSPVVKQTSIRLLLSLVAQNNLELDQLDVKTTFLHDYLEETIYMVQPKGYEVQGKEDLYYLLKKS
        F +K     N   RYKARLVA+GFTQ+  IDY E F+PV + +S R +LSLV Q NL++ Q+DVKT FL+  L+E IYM  P+G  +    D    L K+
Subjt:  FKLKEGITKNSQPRYKARLVAKGFTQREGIDYSEIFSPVVKQTSIRLLLSLVAQNNLELDQLDVKTTFLHDYLEETIYMVQPKGYEVQGKEDLYYLLKKS

Query:  IYGLKQSPRCWYRRF------------------------------------DDFITSLG-------FQK---------NLRESKKILGIDIIRDRDQSTL
        IYGLKQ+ RCW+  F                                    DD + + G       F++         +L E K  +GI I    D+  L
Subjt:  IYGLKQSPRCWYRRF------------------------------------DDFITSLG-------FQK---------NLRESKKILGIDIIRDRDQSTL

Query:  SINQTSYCEKVIRRFNLTNARPVTLPIAHHFKLSAVNSPSETDIEHKLQMKNVPYSQAVGSLMYLMISTRPDLSYSASLVSKYMANSGRRHWEATKWIIR
        S  Q++Y +K++ +FN+ N   V+ P+        +NS  +          N P    +G LMY+M+ TRPDL+ + +++S+Y + +    W+  K ++R
Subjt:  SINQTSYCEKVIRRFNLTNARPVTLPIAHHFKLSAVNSPSETDIEHKLQMKNVPYSQAVGSLMYLMISTRPDLSYSASLVSKYMANSGRRHWEATKWIIR

Query:  YLIWSKNARLNYQRTTEIELELIGYVDSDFAGDSDKRRSLTGYVF-LYGRNLISWKAILQSIVALSTTEAEYIALSEG-------------------SIV
        YL  + + +L +++    E ++IGYVDSD+AG    R+S TGY+F ++  NLI W    Q+ VA S+TEAEY+AL E                    + +
Subjt:  YLIWSKNARLNYQRTTEIELELIGYVDSDFAGDSDKRRSLTGYVF-LYGRNLISWKAILQSIVALSTTEAEYIALSEG-------------------SIV

Query:  KIFYDNQSAIRLSKNPQYHSRTKHIDIKYHFIREKIEAGEIQVLKVHTSENAADMLTKSVSTLKLQKCFELIG
        KI+ DNQ  I ++ NP  H R KHIDIKYHF RE+++   I +  + T    AD+ TK +   +  +  + +G
Subjt:  KIFYDNQSAIRLSKNPQYHSRTKHIDIKYHFIREKIEAGEIQVLKVHTSENAADMLTKSVSTLKLQKCFELIG

P0CV72 Secreted RxLR effector protein 1611.1e-2344.19Show/hide
Query:  MKNVPYSQAVGSLMYLMISTRPDLSYSASLVSKYMANSGRRHWEATKWIIRYLIWSKNARLNYQRTTEIELELIGYVDSDFAGDSDKRRSLTGYVFLYGR
        MKNVPY  AVG++MYLM+ TRPDL+ +  ++S++ ++    HW+A K ++RYL  ++   L + R      +L+GY D+D+AGD + RRS +GY+F    
Subjt:  MKNVPYSQAVGSLMYLMISTRPDLSYSASLVSKYMANSGRRHWEATKWIIRYLIWSKNARLNYQRTTEIELELIGYVDSDFAGDSDKRRSLTGYVFLYGR

Query:  NLISWKAILQSIVALSTTEAEYIALSEGS
          +SW++  Q  VALS+TE EY+ALSE +
Subjt:  NLISWKAILQSIVALSTTEAEYIALSEGS

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-941.1e-17733.08Show/hide
Query:  FALWKAKIKALLGQQKSHKALLDPSELPTTLTATQKEEMKLNAYGTLILNLSDNIIRQVLEEETAYKVWMKLESLYASKDLPNKMYLREKNFTYRMDPSK
        F+ W+ +++ LL QQ  HK L   S+ P T+ A    ++   A   + L+LSD+++  +++E+TA  +W +LESLY SK L NK+YL+++ +   M    
Subjt:  FALWKAKIKALLGQQKSHKALLDPSELPTTLTATQKEEMKLNAYGTLILNLSDNIIRQVLEEETAYKVWMKLESLYASKDLPNKMYLREKNFTYRMDPSK

Query:  TLSENLDEFKKIVSDFKTL------EDK---------------------------LSDENEALKNQEKEKDSKGKQPEVSIVE--------SSFTYTDAL
            +L+ F  +++    L      EDK                           L D   AL   EK +     Q +  I E        SS  Y  + 
Subjt:  TLSENLDEFKKIVSDFKTL------EDK---------------------------LSDENEALKNQEKEKDSKGKQPEVSIVE--------SSFTYTDAL

Query:  A------------------------------------STLDQANHVNPLG------------------------KHDWVLDSGCTYHMTPFRAWFNTYRE
        A                                     T  Q N  N                           + +WV+D+  ++H TP R  F  Y  
Subjt:  A------------------------------------STLDQANHVNPLG------------------------KHDWVLDSGCTYHMTPFRAWFNTYRE

Query:  ISGESVFIGNNNECNIAGIGSVTMKLKDKTVKLLRNVRHVPHLKRNLISLRMLDSLGCEYKGKCGVFQVFMGSKLVLVGEKVNDLFIIKGVEMIEEANTV
            +V +GN +   IAGIG + +K       +L++VRHVP L+ NLIS   LD  G E       +++  GS ++  G     L+         E N  
Subjt:  ISGESVFIGNNNECNIAGIGSVTMKLKDKTVKLLRNVRHVPHLKRNLISLRMLDSLGCEYKGKCGVFQVFMGSKLVLVGEKVNDLFIIKGVEMIEEANTV

Query:  LSLNLIEVDIWHKRLSHISQKGLEALSKQGILPQDICSKLSFCEHCVLGKTRKQNFTKAQHTTRGILDYIHLDLWGPASTPSLSGSRY------------
         + + I VD+WHKR+ H+S+KGL+ L+K+ ++     + +  C++C+ GK  + +F  +      ILD ++ D+ GP    S+ G++Y            
Subjt:  LSLNLIEVDIWHKRLSHISQKGLEALSKQGILPQDICSKLSFCEHCVLGKTRKQNFTKAQHTTRGILDYIHLDLWGPASTPSLSGSRY------------

Query:  --------------------------------LRTDNGLEFCGELFNNFCRQSGITRHKTVTYTPQQNRVAERLNKTIMERVRCLLSDVILKEKFWAEAA
                                        LR+DNG E+    F  +C   GI   KTV  TPQ N VAER+N+TI+E+VR +L    L + FW EA 
Subjt:  --------------------------------LRTDNGLEFCGELFNNFCRQSGITRHKTVTYTPQQNRVAERLNKTIMERVRCLLSDVILKEKFWAEAA

Query:  AYVMHTLNRSPHTSL--------------------VFGCVGYAH---QNKGKLKVRAAKCMFIGFTEGVKGFKLWHPTNKRFIISKSSTEGNFDDTQSYT
            + +NRSP   L                    VFGC  +AH   + + KL  ++  C+FIG+ +   G++LW P  K+ I S+      F +++  T
Subjt:  AYVMHTLNRSPHTSL--------------------VFGCVGYAH---QNKGKLKVRAAKCMFIGFTEGVKGFKLWHPTNKRFIISKSSTEGNFDDTQSYT

Query:  TQIEVENTGKSVQP--------TEEPTAIE------QEQVENLSE--EQDEMLEEQPDLSQYSLARDRQRRII-------VPPARYAESNYISFVLNATV
             E     + P        +  PT+ E       EQ E   E  EQ E L+E  +  ++    + Q + +       V   RY  + Y+       +
Subjt:  TQIEVENTGKSVQP--------TEEPTAIE------QEQVENLSE--EQDEMLEEQPDLSQYSLARDRQRRII-------VPPARYAESNYISFVLNATV

Query:  VPNDSEPSSFDEAVNSSNARQWIEAMNEEINSLNVNDTWTLASLPKGCKPIASKWIFKL-KEGITKNSQPRYKARLVAKGFTQREGIDYSEIFSPVVKQT
        + +D EP S  E ++     Q ++AM EE+ SL  N T+ L  LPKG +P+  KW+FKL K+G  K    RYKARLV KGF Q++GID+ EIFSPVVK T
Subjt:  VPNDSEPSSFDEAVNSSNARQWIEAMNEEINSLNVNDTWTLASLPKGCKPIASKWIFKL-KEGITKNSQPRYKARLVAKGFTQREGIDYSEIFSPVVKQT

Query:  SIRLLLSLVAQNNLELDQLDVKTTFLHDYLEETIYMVQPKGYEVQGKEDLYYLLKKSIYGLKQSPRCWYRRFDDFITSLGF-------------------
        SIR +LSL A  +LE++QLDVKT FLH  LEE IYM QP+G+EV GK+ +   L KS+YGLKQ+PR WY +FD F+ S  +                   
Subjt:  SIRLLLSLVAQNNLELDQLDVKTTFLHDYLEETIYMVQPKGYEVQGKEDLYYLLKKSIYGLKQSPRCWYRRFDDFITSLGF-------------------

Query:  --------------------------------QKNLRESKKILGIDIIRDRDQSTLSINQTSYCEKVIRRFNLTNARPVTLPIAHHFKLSAVNSPSETDI
                                         K+L  +++ILG+ I+R+R    L ++Q  Y E+V+ RFN+ NA+PV+ P+A H KLS    P  T +
Subjt:  --------------------------------QKNLRESKKILGIDIIRDRDQSTLSINQTSYCEKVIRRFNLTNARPVTLPIAHHFKLSAVNSPSETDI

Query:  EHKLQMKNVPYSQAVGSLMYLMISTRPDLSYSASLVSKYMANSGRRHWEATKWIIRYLIWSKNARLNYQRTTEIELELIGYVDSDFAGDSDKRRSLTGYV
        E K  M  VPYS AVGSLMY M+ TRPD++++  +VS+++ N G+ HWEA KWI+RYL  +    L +  +  I   L GY D+D AGD D R+S TGY+
Subjt:  EHKLQMKNVPYSQAVGSLMYLMISTRPDLSYSASLVSKYMANSGRRHWEATKWIIRYLIWSKNARLNYQRTTEIELELIGYVDSDFAGDSDKRRSLTGYV

Query:  FLYGRNLISWKAILQSIVALSTTEAEYIALSE---------------GSIVK---IFYDNQSAIRLSKNPQYHSRTKHIDIKYHFIREKIEAGEIQVLKV
        F +    ISW++ LQ  VALSTTEAEYIA +E               G   K   ++ D+QSAI LSKN  YH+RTKHID++YH+IRE ++   ++VLK+
Subjt:  FLYGRNLISWKAILQSIVALSTTEAEYIALSE---------------GSIVK---IFYDNQSAIRLSKNPQYHSRTKHIDIKYHFIREKIEAGEIQVLKV

Query:  HTSENAADMLTKSVSTLKLQKCFELIG
         T+EN ADMLTK V   K + C EL+G
Subjt:  HTSENAADMLTKSVSTLKLQKCFELIG

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE12.8e-6432.15Show/hide
Query:  QPDLSQYSLARDRQRRIIVPPARYAESNYISFVLNATVVPNDSEPSSFDEAVNSSNARQWIEAMNEEINSLNVNDTWTLASLPKGCKPIAS-KWIFKLKE
        Q  L+ +S+    +  II P  +Y+          A  +  +SEP +   A+ +    +W  AM  EIN+   N TW L   P     I   +WIF  K 
Subjt:  QPDLSQYSLARDRQRRIIVPPARYAESNYISFVLNATVVPNDSEPSSFDEAVNSSNARQWIEAMNEEINSLNVNDTWTLASLPKGCKPIAS-KWIFKLKE

Query:  GITKNSQPRYKARLVAKGFTQREGIDYSEIFSPVVKQTSIRLLLSLVAQNNLELDQLDVKTTFLHDYLEETIYMVQPKGYEVQGKEDLYYLLKKSIYGLK
          +  S  RYKARLVAKG+ QR G+DY+E FSPV+K TSIR++L +    +  + QLDV   FL   L + +YM QP G+  + + +    L+K++YGLK
Subjt:  GITKNSQPRYKARLVAKGFTQREGIDYSEIFSPVVKQTSIRLLLSLVAQNNLELDQLDVKTTFLHDYLEETIYMVQPKGYEVQGKEDLYYLLKKSIYGLK

Query:  QSPRCWYRRFDDFITSLGFQKN--------LRESKKILGI-----DI----------------------IRDRDQ-------------STLSINQTSYCE
        Q+PR WY    +++ ++GF  +        L+  K I+ +     DI                      ++D ++             + L ++Q  Y  
Subjt:  QSPRCWYRRFDDFITSLGFQKN--------LRESKKILGI-----DI----------------------IRDRDQ-------------STLSINQTSYCE

Query:  KVIRRFNLTNARPVTLPIAHHFKLSAVNSPSETDIEHKLQMKNVPYSQAVGSLMYLMISTRPDLSYSASLVSKYMANSGRRHWEATKWIIRYLIWSKNAR
         ++ R N+  A+PVT P+A   KLS  +    TD           Y   VGSL YL   TRPD+SY+ + +S++M      H +A K I+RYL  + N  
Subjt:  KVIRRFNLTNARPVTLPIAHHFKLSAVNSPSETDIEHKLQMKNVPYSQAVGSLMYLMISTRPDLSYSASLVSKYMANSGRRHWEATKWIIRYLIWSKNAR

Query:  LNYQRTTEIELELIGYVDSDFAGDSDKRRSLTGYVFLYGRNLISWKAILQSIVALSTTEAEYIALSEGSIVK-------------------IFYDNQSAI
        +  ++   + L    Y D+D+AGD D   S  GY+   G + ISW +  Q  V  S+TEAEY +++  S                      I+ DN  A 
Subjt:  LNYQRTTEIELELIGYVDSDFAGDSDKRRSLTGYVFLYGRNLISWKAILQSIVALSTTEAEYIALSEGSIVK-------------------IFYDNQSAI

Query:  RLSKNPQYHSRTKHIDIKYHFIREKIEAGEIQVLKVHTSENAADMLTKSVSTLKLQKCFELIG
         L  NP +HSR KHI I YHFIR ++++G ++V+ V T +  AD LTK +S    Q     IG
Subjt:  RLSKNPQYHSRTKHIDIKYHFIREKIEAGEIQVLKVHTSENAADMLTKSVSTLKLQKCFELIG

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE22.8e-6432.77Show/hide
Query:  ATVVPNDSEPSSFDEAVNSSNARQWIEAMNEEINSLNVNDTWTLASLPKGCKPIAS-KWIFKLKEGITKNSQPRYKARLVAKGFTQREGIDYSEIFSPVV
        AT +  +SEP +   A+ +    +W +AM  EIN+   N TW L   P     I   +WIF  K+  +  S  RYKARLVAKG+ QR G+DY+E FSPV+
Subjt:  ATVVPNDSEPSSFDEAVNSSNARQWIEAMNEEINSLNVNDTWTLASLPKGCKPIAS-KWIFKLKEGITKNSQPRYKARLVAKGFTQREGIDYSEIFSPVV

Query:  KQTSIRLLLSLVAQNNLELDQLDVKTTFLHDYLEETIYMVQPKGYEVQGKEDLYYLLKKSIYGLKQSPRCWYRRFDDFITSLGFQKNLRESK--------
        K TSIR++L +    +  + QLDV   FL   L + +YM QP G+  + + D    L+K+IYGLKQ+PR WY     ++ ++GF  ++ ++         
Subjt:  KQTSIRLLLSLVAQNNLELDQLDVKTTFLHDYLEETIYMVQPKGYEVQGKEDLYYLLKKSIYGLKQSPRCWYRRFDDFITSLGFQKNLRESK--------

Query:  ------------KILGIDII----------------------------RDRDQSTLSINQTSYCEKVIRRFNLTNARPVTLPIAHHFKL---SAVNSPSE
                     I G D +                              R    L ++Q  Y   ++ R N+  A+PV  P+A   KL   S    P  
Subjt:  ------------KILGIDII----------------------------RDRDQSTLSINQTSYCEKVIRRFNLTNARPVTLPIAHHFKL---SAVNSPSE

Query:  TDIEHKLQMKNVPYSQAVGSLMYLMISTRPDLSYSASLVSKYMANSGRRHWEATKWIIRYLIWSKNARLNYQRTTEIELELIGYVDSDFAGDSDKRRSLT
        T+           Y   VGSL YL   TRPDLSY+ + +S+YM      HW A K ++RYL  + +  +  ++   + L    Y D+D+AGD+D   S  
Subjt:  TDIEHKLQMKNVPYSQAVGSLMYLMISTRPDLSYSASLVSKYMANSGRRHWEATKWIIRYLIWSKNARLNYQRTTEIELELIGYVDSDFAGDSDKRRSLT

Query:  GYVFLYGRNLISWKAILQSIVALSTTEAEYIALSEGSI-------------------VKIFYDNQSAIRLSKNPQYHSRTKHIDIKYHFIREKIEAGEIQ
        GY+   G + ISW +  Q  V  S+TEAEY +++  S                      I+ DN  A  L  NP +HSR KHI + YHFIR ++++G ++
Subjt:  GYVFLYGRNLISWKAILQSIVALSTTEAEYIALSEGSI-------------------VKIFYDNQSAIRLSKNPQYHSRTKHIDIKYHFIREKIEAGEIQ

Query:  VLKVHTSENAADMLTKSVSTLKLQKCFELIG
        V+ V T +  AD LTK +S +  Q     IG
Subjt:  VLKVHTSENAADMLTKSVSTLKLQKCFELIG

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 82.2e-5631.08Show/hide
Query:  YISFVLNATVVPNDSEPSSFDEAVNSSNARQWIEAMNEEINSLNVNDTWTLASLPKGCKPIASKWIFKLKEGITKNSQPRYKARLVAKGFTQREGIDYSE
        Y SF++    +    EPS+++EA        W  AM++EI ++    TW + +LP   KPI  KW++K+K   +  +  RYKARLVAKG+TQ+EGID+ E
Subjt:  YISFVLNATVVPNDSEPSSFDEAVNSSNARQWIEAMNEEINSLNVNDTWTLASLPKGCKPIASKWIFKLKEGITKNSQPRYKARLVAKGFTQREGIDYSE

Query:  IFSPVVKQTSIRLLLSLVAQNNLELDQLDVKTTFLHDYLEETIYMVQPKGYEVQGKEDL----YYLLKKSIYGLKQSPRCWYRRFDDFITSLGF------
         FSPV K TS++L+L++ A  N  L QLD+   FL+  L+E IYM  P GY  +  + L       LKKSIYGLKQ+ R W+ +F   +   GF      
Subjt:  IFSPVVKQTSIRLLLSLVAQNNLELDQLDVKTTFLHDYLEETIYMVQPKGYEVQGKEDL----YYLLKKSIYGLKQSPRCWYRRFDDFITSLGF------

Query:  --------------------------------------------QKNLRESKKILGIDIIRDRDQSTLSINQTSYCEKVIRRFNLTNARPVTLPIAHHFK
                                                     ++L   K  LG++I   R  + ++I Q  Y   ++    L   +P ++P+     
Subjt:  --------------------------------------------QKNLRESKKILGIDIIRDRDQSTLSINQTSYCEKVIRRFNLTNARPVTLPIAHHFK

Query:  LSAVNSPSETDIEHKLQMKNVPYSQAVGSLMYLMISTRPDLSYSASLVSKYMANSGRRHWEATKWIIRYLIWSKNARLNYQRTTEIELELIGYVDSDFAG
         SA +     D +         Y + +G LMYL I TR D+S++ + +S++       H +A   I+ Y+  +    L Y    E++L++  + D+ F  
Subjt:  LSAVNSPSETDIEHKLQMKNVPYSQAVGSLMYLMISTRPDLSYSASLVSKYMANSGRRHWEATKWIIRYLIWSKNARLNYQRTTEIELELIGYVDSDFAG

Query:  DSDKRRSLTGYVFLYGRNLISWKAILQSIVALSTTEAEYIALSEGS-------------------IVKIFYDNQSAIRLSKNPQYHSRTKHIDIKYHFIR
          D RRS  GY    G +LISWK+  Q +V+ S+ EAEY ALS  +                      +F DN +AI ++ N  +H RTKHI+   H +R
Subjt:  DSDKRRSLTGYVFLYGRNLISWKAILQSIVALSTTEAEYIALSEGS-------------------IVKIFYDNQSAIRLSKNPQYHSRTKHIDIKYHFIR

Query:  EK
        E+
Subjt:  EK

ATMG00300.1 Gag-Pol-related retrotransposon family protein5.5e-2342.74Show/hide
Query:  GVFQVFMGSKLVLVGEKVNDLFIIKGVEMIEEANTVLSLNLIEVDIWHKRLSHISQKGLEALSKQGILPQDICSKLSFCEHCVLGKTRKQNFTKAQHTTR
        GV +V  G + +L G + + L+I++G     E+N   +    E  +WH RL+H+SQ+G+E L K+G L     S L FCE C+ GKT + NF+  QHTT+
Subjt:  GVFQVFMGSKLVLVGEKVNDLFIIKGVEMIEEANTVLSLNLIEVDIWHKRLSHISQKGLEALSKQGILPQDICSKLSFCEHCVLGKTRKQNFTKAQHTTR

Query:  GILDYIHLDLWGPASTP
          LDY+H DLWG  S P
Subjt:  GILDYIHLDLWGPASTP

ATMG00710.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein1.0e-0534.94Show/hide
Query:  LNKTIMERVRCLLSDVILKEKFWAEAAAYVMHTLNRSPHTSL--------------------VFGCVGYAHQNKGKLKVRAAK
        +N+TI+E+VR +L +  L + F A+AA   +H +N+ P T++                     FGCV Y H ++GKLK RA K
Subjt:  LNKTIMERVRCLLSDVILKEKFWAEAAAYVMHTLNRSPHTSL--------------------VFGCVGYAHQNKGKLKVRAAK

ATMG00810.1 DNA/RNA polymerases superfamily protein7.7e-1730.7Show/hide
Query:  ITSLGFQKNLRESKKILGIDIIRDRDQSTLSINQTSYCEKVIRRFNLTNARPVT--LPIAHHFKLSAVNSPSETDIEHKLQMKNVPYSQAVGSLMYLMIS
        ++S    K+L      LGI I      S L ++QT Y E+++    + + +P++  LP+  +  +S    P  +D           +   VG+L YL + 
Subjt:  ITSLGFQKNLRESKKILGIDIIRDRDQSTLSINQTSYCEKVIRRFNLTNARPVT--LPIAHHFKLSAVNSPSETDIEHKLQMKNVPYSQAVGSLMYLMIS

Query:  TRPDLSYSASLVSKYMANSGRRHWEATKWIIRY--------LIWSKNARLNYQRTTEIELELIGYVDSDFAGDSDKRRSLTGYVFLYGRNLISWKAILQS
        TRPD+SY+ ++V + M       ++  K ++RY        L   KN++LN Q           + DSD+AG +  RRS TG+    G N+ISW A  Q 
Subjt:  TRPDLSYSASLVSKYMANSGRRHWEATKWIIRY--------LIWSKNARLNYQRTTEIELELIGYVDSDFAGDSDKRRSLTGYVFLYGRNLISWKAILQS

Query:  IVALSTTEAEYIALS
         V+ S+TE EY AL+
Subjt:  IVALSTTEAEYIALS

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)3.0e-1343.75Show/hide
Query:  WIEAMNEEINSLNVNDTWTLASLPKGCKPIASKWIFKLKEGITKNSQPRYKARLVAKGFTQREGIDYSEIFSPVVKQTSIRLLLSLVAQNNLELDQ
        W +AM EE+++L+ N TW L   P     +  KW+FK K   +  +  R KARLVAKGF Q EGI + E +SPVV+  +IR +L++  Q  LE+ Q
Subjt:  WIEAMNEEINSLNVNDTWTLASLPKGCKPIASKWIFKLKEGITKNSQPRYKARLVAKGFTQREGIDYSEIFSPVVKQTSIRLLLSLVAQNNLELDQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTCCTTCTCCTCCATGCTTTCTTTCATCCTGCACTAATAGAGGCCTAAACCAAAATTATACCTTTAGGCCCAACAAATTGGGAGACTTTGCATTATGGAAAGCAAA
GATCAAAGCCTTGCTTGGACAGCAAAAGTCTCATAAAGCCCTTTTAGATCCTTCAGAACTTCCAACAACCCTCACAGCAACACAAAAAGAAGAAATGAAATTAAATGCCT
ATGGAACACTAATACTAAACCTTAGTGACAACATAATTAGGCAAGTTCTAGAAGAAGAAACAGCATATAAAGTTTGGATGAAACTAGAAAGCCTATATGCCTCTAAAGAT
CTCCCAAACAAAATGTACCTAAGGGAAAAAAATTTCACATATAGAATGGATCCTTCTAAAACCTTATCTGAAAACTTAGATGAATTCAAGAAAATAGTTTCAGATTTTAA
AACCCTTGAGGACAAACTAAGTGATGAAAATGAGGCATTAAAGAATCAAGAAAAAGAAAAAGATTCTAAAGGGAAACAACCAGAAGTCTCTATAGTAGAAAGCTCTTTTA
CCTACACAGATGCCTTGGCATCAACCTTAGACCAAGCCAACCATGTCAACCCCTTAGGAAAACATGATTGGGTCCTAGACTCAGGATGCACCTACCATATGACACCTTTT
AGAGCATGGTTCAATACCTATAGAGAGATCAGTGGAGAATCTGTGTTCATAGGGAATAATAATGAATGTAACATTGCTGGAATTGGATCAGTTACCATGAAACTAAAAGA
TAAGACTGTAAAACTCCTTAGAAATGTAAGACATGTTCCTCACCTTAAAAGAAATTTAATCTCCCTAAGAATGTTAGACTCTCTAGGGTGTGAATATAAAGGAAAATGTG
GAGTTTTCCAAGTGTTTATGGGATCTAAGTTAGTCTTGGTTGGGGAAAAGGTAAATGATTTGTTCATAATAAAAGGAGTAGAAATGATAGAGGAGGCAAATACAGTTTTA
TCTCTAAACCTAATAGAAGTTGATATTTGGCATAAAAGATTGTCCCACATTAGTCAGAAGGGTCTTGAGGCACTATCTAAACAGGGCATTCTGCCTCAAGACATATGCAG
CAAGTTGTCCTTTTGTGAACACTGTGTACTAGGCAAAACAAGAAAACAAAACTTCACCAAAGCACAACACACAACAAGAGGAATCCTAGACTACATCCATTTGGATCTAT
GGGGTCCTGCATCCACTCCAAGCCTAAGTGGCTCAAGATATCTTAGAACTGACAATGGTTTGGAGTTTTGTGGAGAGTTGTTTAATAACTTTTGCAGGCAAAGTGGAATA
ACAAGACACAAAACAGTGACCTACACACCCCAACAAAACAGAGTGGCAGAAAGACTAAACAAAACTATAATGGAAAGGGTAAGGTGTCTATTATCAGATGTCATCCTAAA
AGAAAAGTTTTGGGCTGAAGCTGCTGCCTATGTTATGCACACGTTGAACAGAAGCCCTCATACCTCCTTAGTGTTCGGATGTGTAGGGTATGCTCACCAAAATAAAGGGA
AACTAAAGGTCAGGGCTGCTAAGTGTATGTTTATTGGTTTCACAGAAGGGGTAAAGGGTTTCAAGTTGTGGCATCCTACTAACAAGAGGTTTATAATCAGTAAAAGCAGC
ACTGAAGGGAACTTTGATGACACACAATCCTATACTACTCAGATTGAGGTGGAGAATACAGGAAAAAGTGTTCAACCTACTGAGGAACCTACAGCTATTGAACAAGAACA
AGTGGAGAACTTAAGTGAAGAACAAGATGAAATGCTTGAAGAACAACCTGACTTGAGCCAATATTCCCTAGCAAGAGACAGACAAAGAAGGATAATTGTCCCTCCAGCCA
GGTATGCTGAATCTAATTACATAAGTTTTGTTTTGAATGCTACTGTAGTTCCTAATGATTCAGAACCAAGTTCCTTTGATGAAGCTGTGAACAGTAGCAATGCAAGACAG
TGGATTGAAGCTATGAATGAAGAAATAAATTCACTAAATGTGAATGATACTTGGACACTAGCTTCTCTACCTAAAGGATGCAAACCAATAGCATCTAAGTGGATTTTCAA
ACTCAAAGAAGGAATTACTAAAAACTCGCAACCAAGGTACAAGGCAAGGCTGGTAGCAAAGGGTTTCACTCAAAGAGAAGGTATTGACTATTCTGAAATCTTCTCCCCTG
TAGTTAAACAAACCTCTATTAGACTTCTCCTATCCCTAGTTGCTCAAAACAACCTAGAACTGGATCAACTTGATGTAAAAACAACTTTTCTCCATGACTATCTAGAAGAG
ACAATCTATATGGTTCAACCTAAGGGTTATGAGGTTCAAGGTAAGGAAGACCTCTATTATTTACTAAAGAAGTCTATATATGGGTTAAAACAATCTCCTAGATGTTGGTA
TAGAAGATTTGATGATTTCATTACTAGTTTAGGTTTTCAAAAAAACTTAAGGGAATCTAAGAAGATCCTAGGTATAGACATCATTAGAGATAGGGACCAATCTACACTGA
GCATAAACCAAACATCCTACTGTGAAAAAGTGATAAGAAGATTCAACCTCACTAATGCTAGACCAGTTACTCTCCCTATTGCTCATCATTTTAAGCTATCAGCTGTCAAT
TCCCCTAGTGAAACTGATATAGAACACAAGCTACAAATGAAAAATGTTCCTTACAGTCAAGCAGTGGGAAGTTTAATGTACCTAATGATCTCTACAAGACCTGATCTATC
CTATTCAGCAAGCCTAGTCAGCAAATATATGGCTAATTCTGGAAGAAGACACTGGGAAGCTACTAAATGGATAATCAGATACTTAATTTGGTCAAAGAATGCTAGATTGA
ACTACCAAAGGACCACTGAAATAGAACTAGAACTAATAGGATATGTAGACTCAGACTTTGCTGGTGACAGTGATAAAAGGAGAAGCCTAACCGGTTACGTTTTTCTCTAC
GGTCGTAATCTAATAAGTTGGAAAGCAATCCTACAATCCATTGTTGCTCTCTCAACTACCGAAGCAGAATATATTGCACTATCAGAAGGTTCGATTGTTAAAATCTTTTA
TGACAACCAAAGTGCCATTCGCCTATCCAAAAATCCTCAATACCACAGCCGAACAAAGCACATAGACATAAAATATCACTTCATACGAGAGAAAATTGAAGCTGGAGAAA
TTCAAGTATTGAAAGTTCATACCTCTGAGAACGCCGCTGATATGCTCACTAAGTCGGTCTCAACATTGAAGCTTCAGAAGTGCTTCGAACTGATAGGTTTCGACCTACCT
GAAAAAGGATAG
mRNA sequenceShow/hide mRNA sequence
ATGTCTCCTTCTCCTCCATGCTTTCTTTCATCCTGCACTAATAGAGGCCTAAACCAAAATTATACCTTTAGGCCCAACAAATTGGGAGACTTTGCATTATGGAAAGCAAA
GATCAAAGCCTTGCTTGGACAGCAAAAGTCTCATAAAGCCCTTTTAGATCCTTCAGAACTTCCAACAACCCTCACAGCAACACAAAAAGAAGAAATGAAATTAAATGCCT
ATGGAACACTAATACTAAACCTTAGTGACAACATAATTAGGCAAGTTCTAGAAGAAGAAACAGCATATAAAGTTTGGATGAAACTAGAAAGCCTATATGCCTCTAAAGAT
CTCCCAAACAAAATGTACCTAAGGGAAAAAAATTTCACATATAGAATGGATCCTTCTAAAACCTTATCTGAAAACTTAGATGAATTCAAGAAAATAGTTTCAGATTTTAA
AACCCTTGAGGACAAACTAAGTGATGAAAATGAGGCATTAAAGAATCAAGAAAAAGAAAAAGATTCTAAAGGGAAACAACCAGAAGTCTCTATAGTAGAAAGCTCTTTTA
CCTACACAGATGCCTTGGCATCAACCTTAGACCAAGCCAACCATGTCAACCCCTTAGGAAAACATGATTGGGTCCTAGACTCAGGATGCACCTACCATATGACACCTTTT
AGAGCATGGTTCAATACCTATAGAGAGATCAGTGGAGAATCTGTGTTCATAGGGAATAATAATGAATGTAACATTGCTGGAATTGGATCAGTTACCATGAAACTAAAAGA
TAAGACTGTAAAACTCCTTAGAAATGTAAGACATGTTCCTCACCTTAAAAGAAATTTAATCTCCCTAAGAATGTTAGACTCTCTAGGGTGTGAATATAAAGGAAAATGTG
GAGTTTTCCAAGTGTTTATGGGATCTAAGTTAGTCTTGGTTGGGGAAAAGGTAAATGATTTGTTCATAATAAAAGGAGTAGAAATGATAGAGGAGGCAAATACAGTTTTA
TCTCTAAACCTAATAGAAGTTGATATTTGGCATAAAAGATTGTCCCACATTAGTCAGAAGGGTCTTGAGGCACTATCTAAACAGGGCATTCTGCCTCAAGACATATGCAG
CAAGTTGTCCTTTTGTGAACACTGTGTACTAGGCAAAACAAGAAAACAAAACTTCACCAAAGCACAACACACAACAAGAGGAATCCTAGACTACATCCATTTGGATCTAT
GGGGTCCTGCATCCACTCCAAGCCTAAGTGGCTCAAGATATCTTAGAACTGACAATGGTTTGGAGTTTTGTGGAGAGTTGTTTAATAACTTTTGCAGGCAAAGTGGAATA
ACAAGACACAAAACAGTGACCTACACACCCCAACAAAACAGAGTGGCAGAAAGACTAAACAAAACTATAATGGAAAGGGTAAGGTGTCTATTATCAGATGTCATCCTAAA
AGAAAAGTTTTGGGCTGAAGCTGCTGCCTATGTTATGCACACGTTGAACAGAAGCCCTCATACCTCCTTAGTGTTCGGATGTGTAGGGTATGCTCACCAAAATAAAGGGA
AACTAAAGGTCAGGGCTGCTAAGTGTATGTTTATTGGTTTCACAGAAGGGGTAAAGGGTTTCAAGTTGTGGCATCCTACTAACAAGAGGTTTATAATCAGTAAAAGCAGC
ACTGAAGGGAACTTTGATGACACACAATCCTATACTACTCAGATTGAGGTGGAGAATACAGGAAAAAGTGTTCAACCTACTGAGGAACCTACAGCTATTGAACAAGAACA
AGTGGAGAACTTAAGTGAAGAACAAGATGAAATGCTTGAAGAACAACCTGACTTGAGCCAATATTCCCTAGCAAGAGACAGACAAAGAAGGATAATTGTCCCTCCAGCCA
GGTATGCTGAATCTAATTACATAAGTTTTGTTTTGAATGCTACTGTAGTTCCTAATGATTCAGAACCAAGTTCCTTTGATGAAGCTGTGAACAGTAGCAATGCAAGACAG
TGGATTGAAGCTATGAATGAAGAAATAAATTCACTAAATGTGAATGATACTTGGACACTAGCTTCTCTACCTAAAGGATGCAAACCAATAGCATCTAAGTGGATTTTCAA
ACTCAAAGAAGGAATTACTAAAAACTCGCAACCAAGGTACAAGGCAAGGCTGGTAGCAAAGGGTTTCACTCAAAGAGAAGGTATTGACTATTCTGAAATCTTCTCCCCTG
TAGTTAAACAAACCTCTATTAGACTTCTCCTATCCCTAGTTGCTCAAAACAACCTAGAACTGGATCAACTTGATGTAAAAACAACTTTTCTCCATGACTATCTAGAAGAG
ACAATCTATATGGTTCAACCTAAGGGTTATGAGGTTCAAGGTAAGGAAGACCTCTATTATTTACTAAAGAAGTCTATATATGGGTTAAAACAATCTCCTAGATGTTGGTA
TAGAAGATTTGATGATTTCATTACTAGTTTAGGTTTTCAAAAAAACTTAAGGGAATCTAAGAAGATCCTAGGTATAGACATCATTAGAGATAGGGACCAATCTACACTGA
GCATAAACCAAACATCCTACTGTGAAAAAGTGATAAGAAGATTCAACCTCACTAATGCTAGACCAGTTACTCTCCCTATTGCTCATCATTTTAAGCTATCAGCTGTCAAT
TCCCCTAGTGAAACTGATATAGAACACAAGCTACAAATGAAAAATGTTCCTTACAGTCAAGCAGTGGGAAGTTTAATGTACCTAATGATCTCTACAAGACCTGATCTATC
CTATTCAGCAAGCCTAGTCAGCAAATATATGGCTAATTCTGGAAGAAGACACTGGGAAGCTACTAAATGGATAATCAGATACTTAATTTGGTCAAAGAATGCTAGATTGA
ACTACCAAAGGACCACTGAAATAGAACTAGAACTAATAGGATATGTAGACTCAGACTTTGCTGGTGACAGTGATAAAAGGAGAAGCCTAACCGGTTACGTTTTTCTCTAC
GGTCGTAATCTAATAAGTTGGAAAGCAATCCTACAATCCATTGTTGCTCTCTCAACTACCGAAGCAGAATATATTGCACTATCAGAAGGTTCGATTGTTAAAATCTTTTA
TGACAACCAAAGTGCCATTCGCCTATCCAAAAATCCTCAATACCACAGCCGAACAAAGCACATAGACATAAAATATCACTTCATACGAGAGAAAATTGAAGCTGGAGAAA
TTCAAGTATTGAAAGTTCATACCTCTGAGAACGCCGCTGATATGCTCACTAAGTCGGTCTCAACATTGAAGCTTCAGAAGTGCTTCGAACTGATAGGTTTCGACCTACCT
GAAAAAGGATAG
Protein sequenceShow/hide protein sequence
MSPSPPCFLSSCTNRGLNQNYTFRPNKLGDFALWKAKIKALLGQQKSHKALLDPSELPTTLTATQKEEMKLNAYGTLILNLSDNIIRQVLEEETAYKVWMKLESLYASKD
LPNKMYLREKNFTYRMDPSKTLSENLDEFKKIVSDFKTLEDKLSDENEALKNQEKEKDSKGKQPEVSIVESSFTYTDALASTLDQANHVNPLGKHDWVLDSGCTYHMTPF
RAWFNTYREISGESVFIGNNNECNIAGIGSVTMKLKDKTVKLLRNVRHVPHLKRNLISLRMLDSLGCEYKGKCGVFQVFMGSKLVLVGEKVNDLFIIKGVEMIEEANTVL
SLNLIEVDIWHKRLSHISQKGLEALSKQGILPQDICSKLSFCEHCVLGKTRKQNFTKAQHTTRGILDYIHLDLWGPASTPSLSGSRYLRTDNGLEFCGELFNNFCRQSGI
TRHKTVTYTPQQNRVAERLNKTIMERVRCLLSDVILKEKFWAEAAAYVMHTLNRSPHTSLVFGCVGYAHQNKGKLKVRAAKCMFIGFTEGVKGFKLWHPTNKRFIISKSS
TEGNFDDTQSYTTQIEVENTGKSVQPTEEPTAIEQEQVENLSEEQDEMLEEQPDLSQYSLARDRQRRIIVPPARYAESNYISFVLNATVVPNDSEPSSFDEAVNSSNARQ
WIEAMNEEINSLNVNDTWTLASLPKGCKPIASKWIFKLKEGITKNSQPRYKARLVAKGFTQREGIDYSEIFSPVVKQTSIRLLLSLVAQNNLELDQLDVKTTFLHDYLEE
TIYMVQPKGYEVQGKEDLYYLLKKSIYGLKQSPRCWYRRFDDFITSLGFQKNLRESKKILGIDIIRDRDQSTLSINQTSYCEKVIRRFNLTNARPVTLPIAHHFKLSAVN
SPSETDIEHKLQMKNVPYSQAVGSLMYLMISTRPDLSYSASLVSKYMANSGRRHWEATKWIIRYLIWSKNARLNYQRTTEIELELIGYVDSDFAGDSDKRRSLTGYVFLY
GRNLISWKAILQSIVALSTTEAEYIALSEGSIVKIFYDNQSAIRLSKNPQYHSRTKHIDIKYHFIREKIEAGEIQVLKVHTSENAADMLTKSVSTLKLQKCFELIGFDLP
EKG