; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI03G26700 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI03G26700
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionTy3/gypsy retrotransposon protein
Genome locationChr3:24145010..24151189
RNA-Seq ExpressionCSPI03G26700
SyntenyCSPI03G26700
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR001584 - Integrase, catalytic core
IPR005162 - Retrotransposon gag domain
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0032454.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]4.9e-23348.2Show/hide
Query:  MVQTRIEERMEAH-------KQELSKIPVIEEKLTTISQNMEGLQAQVEKTHQLVMIFMETVAKERLLANGKGSESPVQTTSTMKSVEGESSGSKETRND
        MVQTRIEERME         K+EL K+P IE  L  I++NME ++ Q EK  Q ++ +ME  AKER +A  + +ES +Q +   KS  G++S S +    
Subjt:  MVQTRIEERMEAH-------KQELSKIPVIEEKLTTISQNMEGLQAQVEKTHQLVMIFMETVAKERLLANGKGSESPVQTTSTMKSVEGESSGSKETRND

Query:  MSKKKVDDEGENNDQSKFKRVEMPVFNGDDPDSWLFQADRYFQIHKLTNSERLIVATISFEGPTLNWYRAQEERDKFIDWLNLKERLLLRFRFMCEGSLY
         +++K D +   ND+SKFK+VEMPVF G+DP+SWLF+A+RYFQIHKLT SE+++V+TI F+GP LNWYR+QEER+KF  W NLKERLL+RF+   EG++ 
Subjt:  MSKKKVDDEGENNDQSKFKRVEMPVFNGDDPDSWLFQADRYFQIHKLTNSERLIVATISFEGPTLNWYRAQEERDKFIDWLNLKERLLLRFRFMCEGSLY

Query:  GCFLRIQQDLSAEEYRNLFDKWMAPLADIQEKIVEETFMGGL---------------LAEMMKCAQMVEHREIVRREANLPGYSGSKSPNYPHTNAKTNT
        G FLRIQQ+ + EEYRN FDK +APL+D+++++VEETFM GL               LAE M  AQ+VE REI+R  ANL  Y G K      T  K + 
Subjt:  GCFLRIQQDLSAEEYRNLFDKWMAPLADIQEKIVEETFMGGL---------------LAEMMKCAQMVEHREIVRREANLPGYSGSKSPNYPHTNAKTNT

Query:  VANDQSFKPRE---------KRDSDSSVTKNITH----DINARQRIYECLCIR-------EHRRSNRTLYQLGGWVDEPWDYED----KGADLRQRSCRV
           ++  K            K  +   + K  T     D   + R  + LC +       +H+   +   +L  +V +  D E+    +  +      RV
Subjt:  VANDQSFKPRE---------KRDSDSSVTKNITH----DINARQRIYECLCIR-------EHRRSNRTLYQLGGWVDEPWDYED----KGADLRQRSCRV

Query:  SRLRRHTQLYIG-KIGHNLATSDQGHLQLWGHL-----------GVRNNDQRKRRLRTGETQSQWLDRHRRLSTVGTRRSGRNTGDAMAIFTRS------
        + ++ HT  Y+   I   +  +D G +++ G L           G  +N   ++ + + +   +    +  +   GT   G+   +++ I  ++      
Subjt:  SRLRRHTQLYIG-KIGHNLATSDQGHLQLWGHL-----------GVRNNDQRKRRLRTGETQSQWLDRHRRLSTVGTRRSGRNTGDAMAIFTRS------

Query:  ----------------------------------------------------NRDGLEELDHSWTDSDLGYLVECRALETRIMEYEPRAEDGLISIPDKV
                                                             R  L+ L  +W + D GYL+ECR++E  + E +   ++       K+
Subjt:  ----------------------------------------------------NRDGLEELDHSWTDSDLGYLVECRALETRIMEYEPRAEDGLISIPDKV

Query:  QAVLKQYEDMFEWPEKLPPERDIKHHIHIKGGTDPINVRLYRYAYKQKEEMEKLVDEMLSSDIIRPSTSPYSSPVLLVKKKDGSWRFCVDYRALNNVTIP
          +L Q+ D+FEWPEKLPP R I+H IH+K GT+P+NVR YRYAY QKEEMEKLV+EML S IIRPS SPYSSPVLLVKKKDGSWRFCVDYRALNNVT+P
Subjt:  QAVLKQYEDMFEWPEKLPPERDIKHHIHIKGGTDPINVRLYRYAYKQKEEMEKLVDEMLSSDIIRPSTSPYSSPVLLVKKKDGSWRFCVDYRALNNVTIP

Query:  DKFPIPMVEELFDELNGANLFSKIDLKAGYHQIRMCNQEIEKIAFRTHEGHYEFLVMPFGLMNAPATFQSLMNSIFRAYMQKFVLVFFDDILVYSQGWEE
        DKFPIP+VEELFDEL GA+LF+KIDLKAGYHQIRM + +IEK AFRTHEGHYEFLVMPFGL NAPATFQSLMNSIFR Y++KFVLVFFDDIL+YS+ WE+
Subjt:  DKFPIPMVEELFDELNGANLFSKIDLKAGYHQIRMCNQEIEKIAFRTHEGHYEFLVMPFGLMNAPATFQSLMNSIFRAYMQKFVLVFFDDILVYSQGWEE

Query:  HCQHIELVLEVLRSHILFANKKKCSFTATKLEYLGHVLSENGVEVDPEKIRSIKQWPIPTNVRKVRGFLGLTGYYRHFVQHYGSIAAPLTQLLKLGSFKW
        H +H+E+V  VLR H LFAN+KKCSF   K+EYLGH++S  GVEVDPEKI++I  WP PTNVR+ RGFLGLTGYYR FV HYG++AAPLTQLLK G FKW
Subjt:  HCQHIELVLEVLRSHILFANKKKCSFTATKLEYLGHVLSENGVEVDPEKIRSIKQWPIPTNVRKVRGFLGLTGYYRHFVQHYGSIAAPLTQLLKLGSFKW

Query:  NQEAQAAFERLQQAMMTLPTLALPDFNAPFELETDASGYGVGAVLM
        N EA+ AFE+L++AM+ LP LALP F+ PFE+ETDASGYG+GAVL+
Subjt:  NQEAQAAFERLQQAMMTLPTLALPDFNAPFELETDASGYGVGAVLM

KAA0065392.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]5.8e-23448.41Show/hide
Query:  MVQTRIEERMEAH-------KQELSKIPVIEEKLTTISQNMEGLQAQVEKTHQLVMIFMETVAKERLLANGKGSESPVQTTSTMKSVEGESSGSKETRND
        MVQTRIEERME         K+EL K+P IE  L  I++NME ++ Q EK  Q ++ +ME  AKER +   + +ES +Q + T KS  G++S S +    
Subjt:  MVQTRIEERMEAH-------KQELSKIPVIEEKLTTISQNMEGLQAQVEKTHQLVMIFMETVAKERLLANGKGSESPVQTTSTMKSVEGESSGSKETRND

Query:  MSKKKVDDEGENNDQSKFKRVEMPVFNGDDPDSWLFQADRYFQIHKLTNSERLIVATISFEGPTLNWYRAQEERDKFIDWLNLKERLLLRFRFMCEGSLY
         ++KK D +   ND+SKFK+VEMPVF G+DP+SWLF+A+RYFQIHKLT SE+++V+TI F+GP LNWYR+QEER+KF  W NLKERLL+RF+   EG++ 
Subjt:  MSKKKVDDEGENNDQSKFKRVEMPVFNGDDPDSWLFQADRYFQIHKLTNSERLIVATISFEGPTLNWYRAQEERDKFIDWLNLKERLLLRFRFMCEGSLY

Query:  GCFLRIQQDLSAEEYRNLFDKWMAPLADIQEKIVEETFMGGL---------------LAEMMKCAQMVEHREIVRREANLPGYSGSKSPNYPHTNAKTNT
        G FLRIQQ+ + EEYRN FDK +APL+D+++++VEETFM GL               LAE M  AQ+VE REI+R  ANL  Y G K      T  K + 
Subjt:  GCFLRIQQDLSAEEYRNLFDKWMAPLADIQEKIVEETFMGGL---------------LAEMMKCAQMVEHREIVRREANLPGYSGSKSPNYPHTNAKTNT

Query:  VANDQSFKPRE---------KRDSDSSVTKNITH----DINARQRIYECLCIR-------EHRRSNRTLYQLGGWVDEPWDYED----KGADLRQRSCRV
           ++  K            K  +   + K  T     D   + R  + LC +       +H+   +   +L  +V +  D E+    + A+      RV
Subjt:  VANDQSFKPRE---------KRDSDSSVTKNITH----DINARQRIYECLCIR-------EHRRSNRTLYQLGGWVDEPWDYED----KGADLRQRSCRV

Query:  SRLRRHTQLYIG-KIGHNLATSDQGHLQLWGHL-----------GVRNNDQRKRRLRTGETQSQWLDRHRRLSTVGTRRSGRNTGDAMAIFTRS------
        + ++ HT  Y+   I   +  +D G +++ G L           G  +N   ++ + + +   +    +  +   GT   G+   +++ I  ++      
Subjt:  SRLRRHTQLYIG-KIGHNLATSDQGHLQLWGHL-----------GVRNNDQRKRRLRTGETQSQWLDRHRRLSTVGTRRSGRNTGDAMAIFTRS------

Query:  ----------------------------------------------------NRDGLEELDHSWTDSDLGYLVECRALETRIMEYEPRAEDGLISIPDKV
                                                             R  L+ L  +W + D GYL+ECR++E  + E +   ++       K+
Subjt:  ----------------------------------------------------NRDGLEELDHSWTDSDLGYLVECRALETRIMEYEPRAEDGLISIPDKV

Query:  QAVLKQYEDMFEWPEKLPPERDIKHHIHIKGGTDPINVRLYRYAYKQKEEMEKLVDEMLSSDIIRPSTSPYSSPVLLVKKKDGSWRFCVDYRALNNVTIP
          +L Q+ D+FEWPEKLPP R I+H IH+K GT+P+NVR YRYAY QKEEMEKLV+EML S IIRPS SPYSSPVLLVKKKDGSWRFCVDYRALNNVT+P
Subjt:  QAVLKQYEDMFEWPEKLPPERDIKHHIHIKGGTDPINVRLYRYAYKQKEEMEKLVDEMLSSDIIRPSTSPYSSPVLLVKKKDGSWRFCVDYRALNNVTIP

Query:  DKFPIPMVEELFDELNGANLFSKIDLKAGYHQIRMCNQEIEKIAFRTHEGHYEFLVMPFGLMNAPATFQSLMNSIFRAYMQKFVLVFFDDILVYSQGWEE
        DKFPIP+VEELFDEL GA+LF+KIDLKAGYHQIRM + +IEK AFRTHEGHYEFLVMPFGL NAPATFQSLMNSIFR Y++KFVLVFFDDIL+YS+ WE+
Subjt:  DKFPIPMVEELFDELNGANLFSKIDLKAGYHQIRMCNQEIEKIAFRTHEGHYEFLVMPFGLMNAPATFQSLMNSIFRAYMQKFVLVFFDDILVYSQGWEE

Query:  HCQHIELVLEVLRSHILFANKKKCSFTATKLEYLGHVLSENGVEVDPEKIRSIKQWPIPTNVRKVRGFLGLTGYYRHFVQHYGSIAAPLTQLLKLGSFKW
        H +H+E+V  VLR H LFAN+KKCSF   K+EYLGH++S  GVEVDPEKI++I  WP PTNVR+ RGFLGLTGYYR FV HYG++AAPLTQLLK G FKW
Subjt:  HCQHIELVLEVLRSHILFANKKKCSFTATKLEYLGHVLSENGVEVDPEKIRSIKQWPIPTNVRKVRGFLGLTGYYRHFVQHYGSIAAPLTQLLKLGSFKW

Query:  NQEAQAAFERLQQAMMTLPTLALPDFNAPFELETDASGYGVGAVLM
        N EA+ AFE+L++AM+ LP LALP F+ PFE+ETDASGYG+GAVL+
Subjt:  NQEAQAAFERLQQAMMTLPTLALPDFNAPFELETDASGYGVGAVLM

TYJ96875.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]3.8e-23348.36Show/hide
Query:  MVQTRIEERMEAH-------KQELSKIPVIEEKLTTISQNMEGLQAQVEKTHQLVMIFMETVAKERLLANGKGSESPVQTTSTMKSVEGESSGSKETRND
        MVQTRIEERME+        K+EL+K+PVIE  L  +++NME ++ Q EK  Q ++ +ME  AKER +A  + +ES  Q + T+KS   ++S S++    
Subjt:  MVQTRIEERMEAH-------KQELSKIPVIEEKLTTISQNMEGLQAQVEKTHQLVMIFMETVAKERLLANGKGSESPVQTTSTMKSVEGESSGSKETRND

Query:  MSKKKVDDEGENNDQSKFKRVEMPVFNGDDPDSWLFQADRYFQIHKLTNSERLIVATISFEGPTLNWYRAQEERDKFIDWLNLKERLLLRFRFMCEGSLY
         +KK   DE  +ND+SKFK+VEMPVF G+DP+SWLF+A+RYFQIHKLT SE+++V+TI F+GP LNWYRAQEER+KF+ W NLKERLL+RF+   EG+ +
Subjt:  MSKKKVDDEGENNDQSKFKRVEMPVFNGDDPDSWLFQADRYFQIHKLTNSERLIVATISFEGPTLNWYRAQEERDKFIDWLNLKERLLLRFRFMCEGSLY

Query:  GCFLRIQQDLSAEEYRNLFDKWMAPLADIQEKIVEETFMGGL---------------LAEMMKCAQMVEHREIVRREANLPGYSGSKS--------PNYP
        G FLRIQQ+ + EEYRNLFDK +APL+D+++++VEETFM GL               LAEMM+ AQ+VE RE++R  ANL GY G KS         +Y 
Subjt:  GCFLRIQQDLSAEEYRNLFDKWMAPLADIQEKIVEETFMGGL---------------LAEMMKCAQMVEHREIVRREANLPGYSGSKS--------PNYP

Query:  HTNAKTNTV-----ANDQSFKPREKRDSDSSVTKNITHDINARQRIYECLCIR-------EHRRSNRTLYQLGGWVDEPWDYEDKGADLRQRSCRVSRLR
        H   K N           + K     ++    T     D   + R  + LC +       +H+   R   +L  +V +  D  ++   + +     + LR
Subjt:  HTNAKTNTV-----ANDQSFKPREKRDSDSSVTKNITHDINARQRIYECLCIR-------EHRRSNRTLYQLGGWVDEPWDYEDKGADLRQRSCRVSRLR

Query:  ------RHTQLYIGKIGHNLATSDQGHLQLWGHL-----------GVRNNDQRKRRLRTGETQSQWLDRHRRLSTVGTRRSGRNTGDAMAIFTR------
              + T      I   +  +D G +++ G L           G  +N   ++ + T +   +    +  +   GT   G+   +++ +  +      
Subjt:  ------RHTQLYIGKIGHNLATSDQGHLQLWGHL-----------GVRNNDQRKRRLRTGETQSQWLDRHRRLSTVGTRRSGRNTGDAMAIFTR------

Query:  ----------------------------------------------------SNRDGLEELDHSWTDSDLGYLVECRALETRIMEYEPRAEDGLISIPDK
                                                              R  L+ L  +W + D GYL+ECR++   I E     ++    I +K
Subjt:  ----------------------------------------------------SNRDGLEELDHSWTDSDLGYLVECRALETRIMEYEPRAEDGLISIPDK

Query:  VQAVLKQYEDMFEWPEKLPPERDIKHHIHIKGGTDPINVRLYRYAYKQKEEMEKLVDEMLSSDIIRPSTSPYSSPVLLVKKKDGSWRFCVDYRALNNVTI
        +  +L Q++D+FEWPEKLPP R I+H IH+K GT+P+NVR YRYAY QKEEME+LV+EML+S IIRPS SPYSSPVLLVKKKDGSWRFCVDYRALNNVT+
Subjt:  VQAVLKQYEDMFEWPEKLPPERDIKHHIHIKGGTDPINVRLYRYAYKQKEEMEKLVDEMLSSDIIRPSTSPYSSPVLLVKKKDGSWRFCVDYRALNNVTI

Query:  PDKFPIPMVEELFDELNGANLFSKIDLKAGYHQIRMCNQEIEKIAFRTHEGHYEFLVMPFGLMNAPATFQSLMNSIFRAYMQKFVLVFFDDILVYSQGWE
        PDKFPIP+VEELFDEL GA+LF+KIDLKAGYHQIRM + +IEK AFRTHEGHYEFLVMPFGL NAPATFQSLMNSIFR Y+++FVLVFFDDIL+YS+  E
Subjt:  PDKFPIPMVEELFDELNGANLFSKIDLKAGYHQIRMCNQEIEKIAFRTHEGHYEFLVMPFGLMNAPATFQSLMNSIFRAYMQKFVLVFFDDILVYSQGWE

Query:  EHCQHIELVLEVLRSHILFANKKKCSFTATKLEYLGHVLSENGVEVDPEKIRSIKQWPIPTNVRKVRGFLGLTGYYRHFVQHYGSIAAPLTQLLKLGSFK
        +H +HIE V  VLR H LFAN+KKCSF   K+EYLGH++S  GVEVDPEKI++I  WP PT+VR+ RGFLGLTGYYR FV HYG++AAPLTQLLK G F 
Subjt:  EHCQHIELVLEVLRSHILFANKKKCSFTATKLEYLGHVLSENGVEVDPEKIRSIKQWPIPTNVRKVRGFLGLTGYYRHFVQHYGSIAAPLTQLLKLGSFK

Query:  WNQEAQAAFERLQQAMMTLPTLALPDFNAPFELETDASGYGVGAVLM
        WN EA+ AFE+L++AM+ LP LALP F+ PFE+ETDASGYGVGAVL+
Subjt:  WNQEAQAAFERLQQAMMTLPTLALPDFNAPFELETDASGYGVGAVLM

TYK22240.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]3.8e-23348.2Show/hide
Query:  MVQTRIEERMEAH-------KQELSKIPVIEEKLTTISQNMEGLQAQVEKTHQLVMIFMETVAKERLLANGKGSESPVQTTSTMKSVEGESSGSKETRND
        MVQTRIEERME         K+EL K+P IE  L  I++NME ++ Q EK  Q ++ +ME  AKER +A  + +ES +Q +   KS  G++S S +    
Subjt:  MVQTRIEERMEAH-------KQELSKIPVIEEKLTTISQNMEGLQAQVEKTHQLVMIFMETVAKERLLANGKGSESPVQTTSTMKSVEGESSGSKETRND

Query:  MSKKKVDDEGENNDQSKFKRVEMPVFNGDDPDSWLFQADRYFQIHKLTNSERLIVATISFEGPTLNWYRAQEERDKFIDWLNLKERLLLRFRFMCEGSLY
         +++K D +   ND+SKFK+VEMPVF G+DP+SWLF+A+RYFQIHKLT SE+++V+TI F+GP LNWYR+QEER+KF  W NLKERLL+RF+   EG++ 
Subjt:  MSKKKVDDEGENNDQSKFKRVEMPVFNGDDPDSWLFQADRYFQIHKLTNSERLIVATISFEGPTLNWYRAQEERDKFIDWLNLKERLLLRFRFMCEGSLY

Query:  GCFLRIQQDLSAEEYRNLFDKWMAPLADIQEKIVEETFMGGL---------------LAEMMKCAQMVEHREIVRREANLPGYSGSKSPNYPHTNAKTNT
        G FLRIQQ+ + EEYRN FDK +APL+D+++++VEETFM GL               LAE M  AQ+VE REI+R  ANL  Y G K      T  K + 
Subjt:  GCFLRIQQDLSAEEYRNLFDKWMAPLADIQEKIVEETFMGGL---------------LAEMMKCAQMVEHREIVRREANLPGYSGSKSPNYPHTNAKTNT

Query:  VANDQSFKPRE---------KRDSDSSVTKNITH----DINARQRIYECLCIR-------EHRRSNRTLYQLGGWVDEPWDYED----KGADLRQRSCRV
           ++  K            K  +   + K  T     D   + R  + LC +       +H+   +   +L  +V +  D E+    +  +      RV
Subjt:  VANDQSFKPRE---------KRDSDSSVTKNITH----DINARQRIYECLCIR-------EHRRSNRTLYQLGGWVDEPWDYED----KGADLRQRSCRV

Query:  SRLRRHTQLYIG-KIGHNLATSDQGHLQLWGHL-----------GVRNNDQRKRRLRTGETQSQWLDRHRRLSTVGTRRSGRNTGDAMAIFTRS------
        + ++ HT  Y+   I   +  +D G +++ G L           G  +N   ++ + + +   +    +  +   GT   G+   +++ I  ++      
Subjt:  SRLRRHTQLYIG-KIGHNLATSDQGHLQLWGHL-----------GVRNNDQRKRRLRTGETQSQWLDRHRRLSTVGTRRSGRNTGDAMAIFTRS------

Query:  ----------------------------------------------------NRDGLEELDHSWTDSDLGYLVECRALETRIMEYEPRAEDGLISIPDKV
                                                             R  L+ L  +W + D GYL+ECR++E  + E +   ++       K+
Subjt:  ----------------------------------------------------NRDGLEELDHSWTDSDLGYLVECRALETRIMEYEPRAEDGLISIPDKV

Query:  QAVLKQYEDMFEWPEKLPPERDIKHHIHIKGGTDPINVRLYRYAYKQKEEMEKLVDEMLSSDIIRPSTSPYSSPVLLVKKKDGSWRFCVDYRALNNVTIP
          +L Q+ D+FEWPEKLPP R I+H IH+K GT+P+NVR YRYAY QKEEMEKLV+EML S IIRPS SPYSSPVLLVKKKDGSWRFCVDYRALNNVT+P
Subjt:  QAVLKQYEDMFEWPEKLPPERDIKHHIHIKGGTDPINVRLYRYAYKQKEEMEKLVDEMLSSDIIRPSTSPYSSPVLLVKKKDGSWRFCVDYRALNNVTIP

Query:  DKFPIPMVEELFDELNGANLFSKIDLKAGYHQIRMCNQEIEKIAFRTHEGHYEFLVMPFGLMNAPATFQSLMNSIFRAYMQKFVLVFFDDILVYSQGWEE
        DKFPIP+VEELFDEL GA+LF+KIDLKAGYHQIRM + +IEK AFRTHEGHYEFLVMPFGL NAPATFQSLMNSIFR Y++KFVLVFFDDIL+YS+ WE+
Subjt:  DKFPIPMVEELFDELNGANLFSKIDLKAGYHQIRMCNQEIEKIAFRTHEGHYEFLVMPFGLMNAPATFQSLMNSIFRAYMQKFVLVFFDDILVYSQGWEE

Query:  HCQHIELVLEVLRSHILFANKKKCSFTATKLEYLGHVLSENGVEVDPEKIRSIKQWPIPTNVRKVRGFLGLTGYYRHFVQHYGSIAAPLTQLLKLGSFKW
        H +H+E+V  VLR H LFAN+KKCSF   K+EYLGH++S  GVEVDPEKI++I  WP PTNVR+ RGFLGLTGYYR FV HYG++AAPLTQLLK G FKW
Subjt:  HCQHIELVLEVLRSHILFANKKKCSFTATKLEYLGHVLSENGVEVDPEKIRSIKQWPIPTNVRKVRGFLGLTGYYRHFVQHYGSIAAPLTQLLKLGSFKW

Query:  NQEAQAAFERLQQAMMTLPTLALPDFNAPFELETDASGYGVGAVLM
        N EA+ AFE+L++AM+ LP LALP F+ PFE+ETDASGYG+GAVL+
Subjt:  NQEAQAAFERLQQAMMTLPTLALPDFNAPFELETDASGYGVGAVLM

TYK30083.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]3.8e-23348.2Show/hide
Query:  MVQTRIEERMEAH-------KQELSKIPVIEEKLTTISQNMEGLQAQVEKTHQLVMIFMETVAKERLLANGKGSESPVQTTSTMKSVEGESSGSKETRND
        MVQTRIEERME         K+EL K+P IE  L  I++NME ++ Q EK  Q ++ +ME  AKER +A  + +ES +Q +   KS  G++S S +    
Subjt:  MVQTRIEERMEAH-------KQELSKIPVIEEKLTTISQNMEGLQAQVEKTHQLVMIFMETVAKERLLANGKGSESPVQTTSTMKSVEGESSGSKETRND

Query:  MSKKKVDDEGENNDQSKFKRVEMPVFNGDDPDSWLFQADRYFQIHKLTNSERLIVATISFEGPTLNWYRAQEERDKFIDWLNLKERLLLRFRFMCEGSLY
         +++K D +   ND+SKFK+VEMPVF G+DP+SWLF+A+RYFQIHKLT SE+++V+TI F+GP LNWYR+QEER+KF  W NLKERLL+RF+   EG++ 
Subjt:  MSKKKVDDEGENNDQSKFKRVEMPVFNGDDPDSWLFQADRYFQIHKLTNSERLIVATISFEGPTLNWYRAQEERDKFIDWLNLKERLLLRFRFMCEGSLY

Query:  GCFLRIQQDLSAEEYRNLFDKWMAPLADIQEKIVEETFMGGL---------------LAEMMKCAQMVEHREIVRREANLPGYSGSKSPNYPHTNAKTNT
        G FLRIQQ+ + EEYRN FDK +APL+D+++++VEETFM GL               LAE M  AQ+VE REI+R  ANL  Y G K      T  K + 
Subjt:  GCFLRIQQDLSAEEYRNLFDKWMAPLADIQEKIVEETFMGGL---------------LAEMMKCAQMVEHREIVRREANLPGYSGSKSPNYPHTNAKTNT

Query:  VANDQSFKPRE---------KRDSDSSVTKNITH----DINARQRIYECLCIR-------EHRRSNRTLYQLGGWVDEPWDYED----KGADLRQRSCRV
           ++  K            K  +   + K  T     D   + R  + LC +       +H+   +   +L  +V +  D E+    +  +      RV
Subjt:  VANDQSFKPRE---------KRDSDSSVTKNITH----DINARQRIYECLCIR-------EHRRSNRTLYQLGGWVDEPWDYED----KGADLRQRSCRV

Query:  SRLRRHTQLYIG-KIGHNLATSDQGHLQLWGHL-----------GVRNNDQRKRRLRTGETQSQWLDRHRRLSTVGTRRSGRNTGDAMAIFTRS------
        + ++ HT  Y+   I   +  +D G +++ G L           G  +N   ++ + + +   +    +  +   GT   G+   +++ I  ++      
Subjt:  SRLRRHTQLYIG-KIGHNLATSDQGHLQLWGHL-----------GVRNNDQRKRRLRTGETQSQWLDRHRRLSTVGTRRSGRNTGDAMAIFTRS------

Query:  ----------------------------------------------------NRDGLEELDHSWTDSDLGYLVECRALETRIMEYEPRAEDGLISIPDKV
                                                             R  L+ L  +W + D GYL+ECR++E  + E +   ++       K+
Subjt:  ----------------------------------------------------NRDGLEELDHSWTDSDLGYLVECRALETRIMEYEPRAEDGLISIPDKV

Query:  QAVLKQYEDMFEWPEKLPPERDIKHHIHIKGGTDPINVRLYRYAYKQKEEMEKLVDEMLSSDIIRPSTSPYSSPVLLVKKKDGSWRFCVDYRALNNVTIP
          +L Q+ D+FEWPEKLPP R I+H IH+K GT+P+NVR YRYAY QKEEMEKLV+EML S IIRPS SPYSSPVLLVKKKDGSWRFCVDYRALNNVT+P
Subjt:  QAVLKQYEDMFEWPEKLPPERDIKHHIHIKGGTDPINVRLYRYAYKQKEEMEKLVDEMLSSDIIRPSTSPYSSPVLLVKKKDGSWRFCVDYRALNNVTIP

Query:  DKFPIPMVEELFDELNGANLFSKIDLKAGYHQIRMCNQEIEKIAFRTHEGHYEFLVMPFGLMNAPATFQSLMNSIFRAYMQKFVLVFFDDILVYSQGWEE
        DKFPIP+VEELFDEL GA+LF+KIDLKAGYHQIRM + +IEK AFRTHEGHYEFLVMPFGL NAPATFQSLMNSIFR Y++KFVLVFFDDIL+YS+ WE+
Subjt:  DKFPIPMVEELFDELNGANLFSKIDLKAGYHQIRMCNQEIEKIAFRTHEGHYEFLVMPFGLMNAPATFQSLMNSIFRAYMQKFVLVFFDDILVYSQGWEE

Query:  HCQHIELVLEVLRSHILFANKKKCSFTATKLEYLGHVLSENGVEVDPEKIRSIKQWPIPTNVRKVRGFLGLTGYYRHFVQHYGSIAAPLTQLLKLGSFKW
        H +H+E+V  VLR H LFAN+KKCSF   K+EYLGH++S  GVEVDPEKI++I  WP PTNVR+ RGFLGLTGYYR FV HYG++AAPLTQLLK G FKW
Subjt:  HCQHIELVLEVLRSHILFANKKKCSFTATKLEYLGHVLSENGVEVDPEKIRSIKQWPIPTNVRKVRGFLGLTGYYRHFVQHYGSIAAPLTQLLKLGSFKW

Query:  NQEAQAAFERLQQAMMTLPTLALPDFNAPFELETDASGYGVGAVLM
        N EA+ AFE+L++AM+ LP LALP F+ PFE+ETDASGYG+GAVL+
Subjt:  NQEAQAAFERLQQAMMTLPTLALPDFNAPFELETDASGYGVGAVLM

TrEMBL top hitse value%identityAlignment
A0A5A7SMQ7 Ty3/gypsy retrotransposon protein2.4e-23348.2Show/hide
Query:  MVQTRIEERMEAH-------KQELSKIPVIEEKLTTISQNMEGLQAQVEKTHQLVMIFMETVAKERLLANGKGSESPVQTTSTMKSVEGESSGSKETRND
        MVQTRIEERME         K+EL K+P IE  L  I++NME ++ Q EK  Q ++ +ME  AKER +A  + +ES +Q +   KS  G++S S +    
Subjt:  MVQTRIEERMEAH-------KQELSKIPVIEEKLTTISQNMEGLQAQVEKTHQLVMIFMETVAKERLLANGKGSESPVQTTSTMKSVEGESSGSKETRND

Query:  MSKKKVDDEGENNDQSKFKRVEMPVFNGDDPDSWLFQADRYFQIHKLTNSERLIVATISFEGPTLNWYRAQEERDKFIDWLNLKERLLLRFRFMCEGSLY
         +++K D +   ND+SKFK+VEMPVF G+DP+SWLF+A+RYFQIHKLT SE+++V+TI F+GP LNWYR+QEER+KF  W NLKERLL+RF+   EG++ 
Subjt:  MSKKKVDDEGENNDQSKFKRVEMPVFNGDDPDSWLFQADRYFQIHKLTNSERLIVATISFEGPTLNWYRAQEERDKFIDWLNLKERLLLRFRFMCEGSLY

Query:  GCFLRIQQDLSAEEYRNLFDKWMAPLADIQEKIVEETFMGGL---------------LAEMMKCAQMVEHREIVRREANLPGYSGSKSPNYPHTNAKTNT
        G FLRIQQ+ + EEYRN FDK +APL+D+++++VEETFM GL               LAE M  AQ+VE REI+R  ANL  Y G K      T  K + 
Subjt:  GCFLRIQQDLSAEEYRNLFDKWMAPLADIQEKIVEETFMGGL---------------LAEMMKCAQMVEHREIVRREANLPGYSGSKSPNYPHTNAKTNT

Query:  VANDQSFKPRE---------KRDSDSSVTKNITH----DINARQRIYECLCIR-------EHRRSNRTLYQLGGWVDEPWDYED----KGADLRQRSCRV
           ++  K            K  +   + K  T     D   + R  + LC +       +H+   +   +L  +V +  D E+    +  +      RV
Subjt:  VANDQSFKPRE---------KRDSDSSVTKNITH----DINARQRIYECLCIR-------EHRRSNRTLYQLGGWVDEPWDYED----KGADLRQRSCRV

Query:  SRLRRHTQLYIG-KIGHNLATSDQGHLQLWGHL-----------GVRNNDQRKRRLRTGETQSQWLDRHRRLSTVGTRRSGRNTGDAMAIFTRS------
        + ++ HT  Y+   I   +  +D G +++ G L           G  +N   ++ + + +   +    +  +   GT   G+   +++ I  ++      
Subjt:  SRLRRHTQLYIG-KIGHNLATSDQGHLQLWGHL-----------GVRNNDQRKRRLRTGETQSQWLDRHRRLSTVGTRRSGRNTGDAMAIFTRS------

Query:  ----------------------------------------------------NRDGLEELDHSWTDSDLGYLVECRALETRIMEYEPRAEDGLISIPDKV
                                                             R  L+ L  +W + D GYL+ECR++E  + E +   ++       K+
Subjt:  ----------------------------------------------------NRDGLEELDHSWTDSDLGYLVECRALETRIMEYEPRAEDGLISIPDKV

Query:  QAVLKQYEDMFEWPEKLPPERDIKHHIHIKGGTDPINVRLYRYAYKQKEEMEKLVDEMLSSDIIRPSTSPYSSPVLLVKKKDGSWRFCVDYRALNNVTIP
          +L Q+ D+FEWPEKLPP R I+H IH+K GT+P+NVR YRYAY QKEEMEKLV+EML S IIRPS SPYSSPVLLVKKKDGSWRFCVDYRALNNVT+P
Subjt:  QAVLKQYEDMFEWPEKLPPERDIKHHIHIKGGTDPINVRLYRYAYKQKEEMEKLVDEMLSSDIIRPSTSPYSSPVLLVKKKDGSWRFCVDYRALNNVTIP

Query:  DKFPIPMVEELFDELNGANLFSKIDLKAGYHQIRMCNQEIEKIAFRTHEGHYEFLVMPFGLMNAPATFQSLMNSIFRAYMQKFVLVFFDDILVYSQGWEE
        DKFPIP+VEELFDEL GA+LF+KIDLKAGYHQIRM + +IEK AFRTHEGHYEFLVMPFGL NAPATFQSLMNSIFR Y++KFVLVFFDDIL+YS+ WE+
Subjt:  DKFPIPMVEELFDELNGANLFSKIDLKAGYHQIRMCNQEIEKIAFRTHEGHYEFLVMPFGLMNAPATFQSLMNSIFRAYMQKFVLVFFDDILVYSQGWEE

Query:  HCQHIELVLEVLRSHILFANKKKCSFTATKLEYLGHVLSENGVEVDPEKIRSIKQWPIPTNVRKVRGFLGLTGYYRHFVQHYGSIAAPLTQLLKLGSFKW
        H +H+E+V  VLR H LFAN+KKCSF   K+EYLGH++S  GVEVDPEKI++I  WP PTNVR+ RGFLGLTGYYR FV HYG++AAPLTQLLK G FKW
Subjt:  HCQHIELVLEVLRSHILFANKKKCSFTATKLEYLGHVLSENGVEVDPEKIRSIKQWPIPTNVRKVRGFLGLTGYYRHFVQHYGSIAAPLTQLLKLGSFKW

Query:  NQEAQAAFERLQQAMMTLPTLALPDFNAPFELETDASGYGVGAVLM
        N EA+ AFE+L++AM+ LP LALP F+ PFE+ETDASGYG+GAVL+
Subjt:  NQEAQAAFERLQQAMMTLPTLALPDFNAPFELETDASGYGVGAVLM

A0A5A7VAR4 Ty3/gypsy retrotransposon protein2.8e-23448.41Show/hide
Query:  MVQTRIEERMEAH-------KQELSKIPVIEEKLTTISQNMEGLQAQVEKTHQLVMIFMETVAKERLLANGKGSESPVQTTSTMKSVEGESSGSKETRND
        MVQTRIEERME         K+EL K+P IE  L  I++NME ++ Q EK  Q ++ +ME  AKER +   + +ES +Q + T KS  G++S S +    
Subjt:  MVQTRIEERMEAH-------KQELSKIPVIEEKLTTISQNMEGLQAQVEKTHQLVMIFMETVAKERLLANGKGSESPVQTTSTMKSVEGESSGSKETRND

Query:  MSKKKVDDEGENNDQSKFKRVEMPVFNGDDPDSWLFQADRYFQIHKLTNSERLIVATISFEGPTLNWYRAQEERDKFIDWLNLKERLLLRFRFMCEGSLY
         ++KK D +   ND+SKFK+VEMPVF G+DP+SWLF+A+RYFQIHKLT SE+++V+TI F+GP LNWYR+QEER+KF  W NLKERLL+RF+   EG++ 
Subjt:  MSKKKVDDEGENNDQSKFKRVEMPVFNGDDPDSWLFQADRYFQIHKLTNSERLIVATISFEGPTLNWYRAQEERDKFIDWLNLKERLLLRFRFMCEGSLY

Query:  GCFLRIQQDLSAEEYRNLFDKWMAPLADIQEKIVEETFMGGL---------------LAEMMKCAQMVEHREIVRREANLPGYSGSKSPNYPHTNAKTNT
        G FLRIQQ+ + EEYRN FDK +APL+D+++++VEETFM GL               LAE M  AQ+VE REI+R  ANL  Y G K      T  K + 
Subjt:  GCFLRIQQDLSAEEYRNLFDKWMAPLADIQEKIVEETFMGGL---------------LAEMMKCAQMVEHREIVRREANLPGYSGSKSPNYPHTNAKTNT

Query:  VANDQSFKPRE---------KRDSDSSVTKNITH----DINARQRIYECLCIR-------EHRRSNRTLYQLGGWVDEPWDYED----KGADLRQRSCRV
           ++  K            K  +   + K  T     D   + R  + LC +       +H+   +   +L  +V +  D E+    + A+      RV
Subjt:  VANDQSFKPRE---------KRDSDSSVTKNITH----DINARQRIYECLCIR-------EHRRSNRTLYQLGGWVDEPWDYED----KGADLRQRSCRV

Query:  SRLRRHTQLYIG-KIGHNLATSDQGHLQLWGHL-----------GVRNNDQRKRRLRTGETQSQWLDRHRRLSTVGTRRSGRNTGDAMAIFTRS------
        + ++ HT  Y+   I   +  +D G +++ G L           G  +N   ++ + + +   +    +  +   GT   G+   +++ I  ++      
Subjt:  SRLRRHTQLYIG-KIGHNLATSDQGHLQLWGHL-----------GVRNNDQRKRRLRTGETQSQWLDRHRRLSTVGTRRSGRNTGDAMAIFTRS------

Query:  ----------------------------------------------------NRDGLEELDHSWTDSDLGYLVECRALETRIMEYEPRAEDGLISIPDKV
                                                             R  L+ L  +W + D GYL+ECR++E  + E +   ++       K+
Subjt:  ----------------------------------------------------NRDGLEELDHSWTDSDLGYLVECRALETRIMEYEPRAEDGLISIPDKV

Query:  QAVLKQYEDMFEWPEKLPPERDIKHHIHIKGGTDPINVRLYRYAYKQKEEMEKLVDEMLSSDIIRPSTSPYSSPVLLVKKKDGSWRFCVDYRALNNVTIP
          +L Q+ D+FEWPEKLPP R I+H IH+K GT+P+NVR YRYAY QKEEMEKLV+EML S IIRPS SPYSSPVLLVKKKDGSWRFCVDYRALNNVT+P
Subjt:  QAVLKQYEDMFEWPEKLPPERDIKHHIHIKGGTDPINVRLYRYAYKQKEEMEKLVDEMLSSDIIRPSTSPYSSPVLLVKKKDGSWRFCVDYRALNNVTIP

Query:  DKFPIPMVEELFDELNGANLFSKIDLKAGYHQIRMCNQEIEKIAFRTHEGHYEFLVMPFGLMNAPATFQSLMNSIFRAYMQKFVLVFFDDILVYSQGWEE
        DKFPIP+VEELFDEL GA+LF+KIDLKAGYHQIRM + +IEK AFRTHEGHYEFLVMPFGL NAPATFQSLMNSIFR Y++KFVLVFFDDIL+YS+ WE+
Subjt:  DKFPIPMVEELFDELNGANLFSKIDLKAGYHQIRMCNQEIEKIAFRTHEGHYEFLVMPFGLMNAPATFQSLMNSIFRAYMQKFVLVFFDDILVYSQGWEE

Query:  HCQHIELVLEVLRSHILFANKKKCSFTATKLEYLGHVLSENGVEVDPEKIRSIKQWPIPTNVRKVRGFLGLTGYYRHFVQHYGSIAAPLTQLLKLGSFKW
        H +H+E+V  VLR H LFAN+KKCSF   K+EYLGH++S  GVEVDPEKI++I  WP PTNVR+ RGFLGLTGYYR FV HYG++AAPLTQLLK G FKW
Subjt:  HCQHIELVLEVLRSHILFANKKKCSFTATKLEYLGHVLSENGVEVDPEKIRSIKQWPIPTNVRKVRGFLGLTGYYRHFVQHYGSIAAPLTQLLKLGSFKW

Query:  NQEAQAAFERLQQAMMTLPTLALPDFNAPFELETDASGYGVGAVLM
        N EA+ AFE+L++AM+ LP LALP F+ PFE+ETDASGYG+GAVL+
Subjt:  NQEAQAAFERLQQAMMTLPTLALPDFNAPFELETDASGYGVGAVLM

A0A5D3BEL2 Ty3/gypsy retrotransposon protein1.8e-23348.36Show/hide
Query:  MVQTRIEERMEAH-------KQELSKIPVIEEKLTTISQNMEGLQAQVEKTHQLVMIFMETVAKERLLANGKGSESPVQTTSTMKSVEGESSGSKETRND
        MVQTRIEERME+        K+EL+K+PVIE  L  +++NME ++ Q EK  Q ++ +ME  AKER +A  + +ES  Q + T+KS   ++S S++    
Subjt:  MVQTRIEERMEAH-------KQELSKIPVIEEKLTTISQNMEGLQAQVEKTHQLVMIFMETVAKERLLANGKGSESPVQTTSTMKSVEGESSGSKETRND

Query:  MSKKKVDDEGENNDQSKFKRVEMPVFNGDDPDSWLFQADRYFQIHKLTNSERLIVATISFEGPTLNWYRAQEERDKFIDWLNLKERLLLRFRFMCEGSLY
         +KK   DE  +ND+SKFK+VEMPVF G+DP+SWLF+A+RYFQIHKLT SE+++V+TI F+GP LNWYRAQEER+KF+ W NLKERLL+RF+   EG+ +
Subjt:  MSKKKVDDEGENNDQSKFKRVEMPVFNGDDPDSWLFQADRYFQIHKLTNSERLIVATISFEGPTLNWYRAQEERDKFIDWLNLKERLLLRFRFMCEGSLY

Query:  GCFLRIQQDLSAEEYRNLFDKWMAPLADIQEKIVEETFMGGL---------------LAEMMKCAQMVEHREIVRREANLPGYSGSKS--------PNYP
        G FLRIQQ+ + EEYRNLFDK +APL+D+++++VEETFM GL               LAEMM+ AQ+VE RE++R  ANL GY G KS         +Y 
Subjt:  GCFLRIQQDLSAEEYRNLFDKWMAPLADIQEKIVEETFMGGL---------------LAEMMKCAQMVEHREIVRREANLPGYSGSKS--------PNYP

Query:  HTNAKTNTV-----ANDQSFKPREKRDSDSSVTKNITHDINARQRIYECLCIR-------EHRRSNRTLYQLGGWVDEPWDYEDKGADLRQRSCRVSRLR
        H   K N           + K     ++    T     D   + R  + LC +       +H+   R   +L  +V +  D  ++   + +     + LR
Subjt:  HTNAKTNTV-----ANDQSFKPREKRDSDSSVTKNITHDINARQRIYECLCIR-------EHRRSNRTLYQLGGWVDEPWDYEDKGADLRQRSCRVSRLR

Query:  ------RHTQLYIGKIGHNLATSDQGHLQLWGHL-----------GVRNNDQRKRRLRTGETQSQWLDRHRRLSTVGTRRSGRNTGDAMAIFTR------
              + T      I   +  +D G +++ G L           G  +N   ++ + T +   +    +  +   GT   G+   +++ +  +      
Subjt:  ------RHTQLYIGKIGHNLATSDQGHLQLWGHL-----------GVRNNDQRKRRLRTGETQSQWLDRHRRLSTVGTRRSGRNTGDAMAIFTR------

Query:  ----------------------------------------------------SNRDGLEELDHSWTDSDLGYLVECRALETRIMEYEPRAEDGLISIPDK
                                                              R  L+ L  +W + D GYL+ECR++   I E     ++    I +K
Subjt:  ----------------------------------------------------SNRDGLEELDHSWTDSDLGYLVECRALETRIMEYEPRAEDGLISIPDK

Query:  VQAVLKQYEDMFEWPEKLPPERDIKHHIHIKGGTDPINVRLYRYAYKQKEEMEKLVDEMLSSDIIRPSTSPYSSPVLLVKKKDGSWRFCVDYRALNNVTI
        +  +L Q++D+FEWPEKLPP R I+H IH+K GT+P+NVR YRYAY QKEEME+LV+EML+S IIRPS SPYSSPVLLVKKKDGSWRFCVDYRALNNVT+
Subjt:  VQAVLKQYEDMFEWPEKLPPERDIKHHIHIKGGTDPINVRLYRYAYKQKEEMEKLVDEMLSSDIIRPSTSPYSSPVLLVKKKDGSWRFCVDYRALNNVTI

Query:  PDKFPIPMVEELFDELNGANLFSKIDLKAGYHQIRMCNQEIEKIAFRTHEGHYEFLVMPFGLMNAPATFQSLMNSIFRAYMQKFVLVFFDDILVYSQGWE
        PDKFPIP+VEELFDEL GA+LF+KIDLKAGYHQIRM + +IEK AFRTHEGHYEFLVMPFGL NAPATFQSLMNSIFR Y+++FVLVFFDDIL+YS+  E
Subjt:  PDKFPIPMVEELFDELNGANLFSKIDLKAGYHQIRMCNQEIEKIAFRTHEGHYEFLVMPFGLMNAPATFQSLMNSIFRAYMQKFVLVFFDDILVYSQGWE

Query:  EHCQHIELVLEVLRSHILFANKKKCSFTATKLEYLGHVLSENGVEVDPEKIRSIKQWPIPTNVRKVRGFLGLTGYYRHFVQHYGSIAAPLTQLLKLGSFK
        +H +HIE V  VLR H LFAN+KKCSF   K+EYLGH++S  GVEVDPEKI++I  WP PT+VR+ RGFLGLTGYYR FV HYG++AAPLTQLLK G F 
Subjt:  EHCQHIELVLEVLRSHILFANKKKCSFTATKLEYLGHVLSENGVEVDPEKIRSIKQWPIPTNVRKVRGFLGLTGYYRHFVQHYGSIAAPLTQLLKLGSFK

Query:  WNQEAQAAFERLQQAMMTLPTLALPDFNAPFELETDASGYGVGAVLM
        WN EA+ AFE+L++AM+ LP LALP F+ PFE+ETDASGYGVGAVL+
Subjt:  WNQEAQAAFERLQQAMMTLPTLALPDFNAPFELETDASGYGVGAVLM

A0A5D3DFT1 Ty3/gypsy retrotransposon protein1.8e-23348.2Show/hide
Query:  MVQTRIEERMEAH-------KQELSKIPVIEEKLTTISQNMEGLQAQVEKTHQLVMIFMETVAKERLLANGKGSESPVQTTSTMKSVEGESSGSKETRND
        MVQTRIEERME         K+EL K+P IE  L  I++NME ++ Q EK  Q ++ +ME  AKER +A  + +ES +Q +   KS  G++S S +    
Subjt:  MVQTRIEERMEAH-------KQELSKIPVIEEKLTTISQNMEGLQAQVEKTHQLVMIFMETVAKERLLANGKGSESPVQTTSTMKSVEGESSGSKETRND

Query:  MSKKKVDDEGENNDQSKFKRVEMPVFNGDDPDSWLFQADRYFQIHKLTNSERLIVATISFEGPTLNWYRAQEERDKFIDWLNLKERLLLRFRFMCEGSLY
         +++K D +   ND+SKFK+VEMPVF G+DP+SWLF+A+RYFQIHKLT SE+++V+TI F+GP LNWYR+QEER+KF  W NLKERLL+RF+   EG++ 
Subjt:  MSKKKVDDEGENNDQSKFKRVEMPVFNGDDPDSWLFQADRYFQIHKLTNSERLIVATISFEGPTLNWYRAQEERDKFIDWLNLKERLLLRFRFMCEGSLY

Query:  GCFLRIQQDLSAEEYRNLFDKWMAPLADIQEKIVEETFMGGL---------------LAEMMKCAQMVEHREIVRREANLPGYSGSKSPNYPHTNAKTNT
        G FLRIQQ+ + EEYRN FDK +APL+D+++++VEETFM GL               LAE M  AQ+VE REI+R  ANL  Y G K      T  K + 
Subjt:  GCFLRIQQDLSAEEYRNLFDKWMAPLADIQEKIVEETFMGGL---------------LAEMMKCAQMVEHREIVRREANLPGYSGSKSPNYPHTNAKTNT

Query:  VANDQSFKPRE---------KRDSDSSVTKNITH----DINARQRIYECLCIR-------EHRRSNRTLYQLGGWVDEPWDYED----KGADLRQRSCRV
           ++  K            K  +   + K  T     D   + R  + LC +       +H+   +   +L  +V +  D E+    +  +      RV
Subjt:  VANDQSFKPRE---------KRDSDSSVTKNITH----DINARQRIYECLCIR-------EHRRSNRTLYQLGGWVDEPWDYED----KGADLRQRSCRV

Query:  SRLRRHTQLYIG-KIGHNLATSDQGHLQLWGHL-----------GVRNNDQRKRRLRTGETQSQWLDRHRRLSTVGTRRSGRNTGDAMAIFTRS------
        + ++ HT  Y+   I   +  +D G +++ G L           G  +N   ++ + + +   +    +  +   GT   G+   +++ I  ++      
Subjt:  SRLRRHTQLYIG-KIGHNLATSDQGHLQLWGHL-----------GVRNNDQRKRRLRTGETQSQWLDRHRRLSTVGTRRSGRNTGDAMAIFTRS------

Query:  ----------------------------------------------------NRDGLEELDHSWTDSDLGYLVECRALETRIMEYEPRAEDGLISIPDKV
                                                             R  L+ L  +W + D GYL+ECR++E  + E +   ++       K+
Subjt:  ----------------------------------------------------NRDGLEELDHSWTDSDLGYLVECRALETRIMEYEPRAEDGLISIPDKV

Query:  QAVLKQYEDMFEWPEKLPPERDIKHHIHIKGGTDPINVRLYRYAYKQKEEMEKLVDEMLSSDIIRPSTSPYSSPVLLVKKKDGSWRFCVDYRALNNVTIP
          +L Q+ D+FEWPEKLPP R I+H IH+K GT+P+NVR YRYAY QKEEMEKLV+EML S IIRPS SPYSSPVLLVKKKDGSWRFCVDYRALNNVT+P
Subjt:  QAVLKQYEDMFEWPEKLPPERDIKHHIHIKGGTDPINVRLYRYAYKQKEEMEKLVDEMLSSDIIRPSTSPYSSPVLLVKKKDGSWRFCVDYRALNNVTIP

Query:  DKFPIPMVEELFDELNGANLFSKIDLKAGYHQIRMCNQEIEKIAFRTHEGHYEFLVMPFGLMNAPATFQSLMNSIFRAYMQKFVLVFFDDILVYSQGWEE
        DKFPIP+VEELFDEL GA+LF+KIDLKAGYHQIRM + +IEK AFRTHEGHYEFLVMPFGL NAPATFQSLMNSIFR Y++KFVLVFFDDIL+YS+ WE+
Subjt:  DKFPIPMVEELFDELNGANLFSKIDLKAGYHQIRMCNQEIEKIAFRTHEGHYEFLVMPFGLMNAPATFQSLMNSIFRAYMQKFVLVFFDDILVYSQGWEE

Query:  HCQHIELVLEVLRSHILFANKKKCSFTATKLEYLGHVLSENGVEVDPEKIRSIKQWPIPTNVRKVRGFLGLTGYYRHFVQHYGSIAAPLTQLLKLGSFKW
        H +H+E+V  VLR H LFAN+KKCSF   K+EYLGH++S  GVEVDPEKI++I  WP PTNVR+ RGFLGLTGYYR FV HYG++AAPLTQLLK G FKW
Subjt:  HCQHIELVLEVLRSHILFANKKKCSFTATKLEYLGHVLSENGVEVDPEKIRSIKQWPIPTNVRKVRGFLGLTGYYRHFVQHYGSIAAPLTQLLKLGSFKW

Query:  NQEAQAAFERLQQAMMTLPTLALPDFNAPFELETDASGYGVGAVLM
        N EA+ AFE+L++AM+ LP LALP F+ PFE+ETDASGYG+GAVL+
Subjt:  NQEAQAAFERLQQAMMTLPTLALPDFNAPFELETDASGYGVGAVLM

A0A5D3E1V9 Ty3/gypsy retrotransposon protein1.8e-23348.2Show/hide
Query:  MVQTRIEERMEAH-------KQELSKIPVIEEKLTTISQNMEGLQAQVEKTHQLVMIFMETVAKERLLANGKGSESPVQTTSTMKSVEGESSGSKETRND
        MVQTRIEERME         K+EL K+P IE  L  I++NME ++ Q EK  Q ++ +ME  AKER +A  + +ES +Q +   KS  G++S S +    
Subjt:  MVQTRIEERMEAH-------KQELSKIPVIEEKLTTISQNMEGLQAQVEKTHQLVMIFMETVAKERLLANGKGSESPVQTTSTMKSVEGESSGSKETRND

Query:  MSKKKVDDEGENNDQSKFKRVEMPVFNGDDPDSWLFQADRYFQIHKLTNSERLIVATISFEGPTLNWYRAQEERDKFIDWLNLKERLLLRFRFMCEGSLY
         +++K D +   ND+SKFK+VEMPVF G+DP+SWLF+A+RYFQIHKLT SE+++V+TI F+GP LNWYR+QEER+KF  W NLKERLL+RF+   EG++ 
Subjt:  MSKKKVDDEGENNDQSKFKRVEMPVFNGDDPDSWLFQADRYFQIHKLTNSERLIVATISFEGPTLNWYRAQEERDKFIDWLNLKERLLLRFRFMCEGSLY

Query:  GCFLRIQQDLSAEEYRNLFDKWMAPLADIQEKIVEETFMGGL---------------LAEMMKCAQMVEHREIVRREANLPGYSGSKSPNYPHTNAKTNT
        G FLRIQQ+ + EEYRN FDK +APL+D+++++VEETFM GL               LAE M  AQ+VE REI+R  ANL  Y G K      T  K + 
Subjt:  GCFLRIQQDLSAEEYRNLFDKWMAPLADIQEKIVEETFMGGL---------------LAEMMKCAQMVEHREIVRREANLPGYSGSKSPNYPHTNAKTNT

Query:  VANDQSFKPRE---------KRDSDSSVTKNITH----DINARQRIYECLCIR-------EHRRSNRTLYQLGGWVDEPWDYED----KGADLRQRSCRV
           ++  K            K  +   + K  T     D   + R  + LC +       +H+   +   +L  +V +  D E+    +  +      RV
Subjt:  VANDQSFKPRE---------KRDSDSSVTKNITH----DINARQRIYECLCIR-------EHRRSNRTLYQLGGWVDEPWDYED----KGADLRQRSCRV

Query:  SRLRRHTQLYIG-KIGHNLATSDQGHLQLWGHL-----------GVRNNDQRKRRLRTGETQSQWLDRHRRLSTVGTRRSGRNTGDAMAIFTRS------
        + ++ HT  Y+   I   +  +D G +++ G L           G  +N   ++ + + +   +    +  +   GT   G+   +++ I  ++      
Subjt:  SRLRRHTQLYIG-KIGHNLATSDQGHLQLWGHL-----------GVRNNDQRKRRLRTGETQSQWLDRHRRLSTVGTRRSGRNTGDAMAIFTRS------

Query:  ----------------------------------------------------NRDGLEELDHSWTDSDLGYLVECRALETRIMEYEPRAEDGLISIPDKV
                                                             R  L+ L  +W + D GYL+ECR++E  + E +   ++       K+
Subjt:  ----------------------------------------------------NRDGLEELDHSWTDSDLGYLVECRALETRIMEYEPRAEDGLISIPDKV

Query:  QAVLKQYEDMFEWPEKLPPERDIKHHIHIKGGTDPINVRLYRYAYKQKEEMEKLVDEMLSSDIIRPSTSPYSSPVLLVKKKDGSWRFCVDYRALNNVTIP
          +L Q+ D+FEWPEKLPP R I+H IH+K GT+P+NVR YRYAY QKEEMEKLV+EML S IIRPS SPYSSPVLLVKKKDGSWRFCVDYRALNNVT+P
Subjt:  QAVLKQYEDMFEWPEKLPPERDIKHHIHIKGGTDPINVRLYRYAYKQKEEMEKLVDEMLSSDIIRPSTSPYSSPVLLVKKKDGSWRFCVDYRALNNVTIP

Query:  DKFPIPMVEELFDELNGANLFSKIDLKAGYHQIRMCNQEIEKIAFRTHEGHYEFLVMPFGLMNAPATFQSLMNSIFRAYMQKFVLVFFDDILVYSQGWEE
        DKFPIP+VEELFDEL GA+LF+KIDLKAGYHQIRM + +IEK AFRTHEGHYEFLVMPFGL NAPATFQSLMNSIFR Y++KFVLVFFDDIL+YS+ WE+
Subjt:  DKFPIPMVEELFDELNGANLFSKIDLKAGYHQIRMCNQEIEKIAFRTHEGHYEFLVMPFGLMNAPATFQSLMNSIFRAYMQKFVLVFFDDILVYSQGWEE

Query:  HCQHIELVLEVLRSHILFANKKKCSFTATKLEYLGHVLSENGVEVDPEKIRSIKQWPIPTNVRKVRGFLGLTGYYRHFVQHYGSIAAPLTQLLKLGSFKW
        H +H+E+V  VLR H LFAN+KKCSF   K+EYLGH++S  GVEVDPEKI++I  WP PTNVR+ RGFLGLTGYYR FV HYG++AAPLTQLLK G FKW
Subjt:  HCQHIELVLEVLRSHILFANKKKCSFTATKLEYLGHVLSENGVEVDPEKIRSIKQWPIPTNVRKVRGFLGLTGYYRHFVQHYGSIAAPLTQLLKLGSFKW

Query:  NQEAQAAFERLQQAMMTLPTLALPDFNAPFELETDASGYGVGAVLM
        N EA+ AFE+L++AM+ LP LALP F+ PFE+ETDASGYG+GAVL+
Subjt:  NQEAQAAFERLQQAMMTLPTLALPDFNAPFELETDASGYGVGAVLM

SwissProt top hitse value%identityAlignment
P04323 Retrovirus-related Pol polyprotein from transposon 17.62.8e-7442.82Show/hide
Query:  AVLKQYEDM-FEWPEKLPPERDIKHHIHIKGGTDPINVRLY-RYAYKQ--KEEMEKLVDEMLSSDIIRPSTSPYSSPVLLV-KKKDGS----WRFCVDYR
        A+L++Y D+ +   +KL      KH I+ K      N+ LY +Y+Y Q  ++E+E  + +ML+  IIR S SPY+SP+ +V KK+D S    +R  +DYR
Subjt:  AVLKQYEDM-FEWPEKLPPERDIKHHIHIKGGTDPINVRLY-RYAYKQ--KEEMEKLVDEMLSSDIIRPSTSPYSSPVLLV-KKKDGS----WRFCVDYR

Query:  ALNNVTIPDKFPIPMVEELFDELNGANLFSKIDLKAGYHQIRMCNQEIEKIAFRTHEGHYEFLVMPFGLMNAPATFQSLMNSIFRAYMQKFVLVFFDDIL
         LN +T+ D+ PIP ++E+  +L   N F+ IDL  G+HQI M  + + K AF T  GHYE+L MPFGL NAPATFQ  MN I R  + K  LV+ DDI+
Subjt:  ALNNVTIPDKFPIPMVEELFDELNGANLFSKIDLKAGYHQIRMCNQEIEKIAFRTHEGHYEFLVMPFGLMNAPATFQSLMNSIFRAYMQKFVLVFFDDIL

Query:  VYSQGWEEHCQHIELVLEVLRSHILFANKKKCSFTATKLEYLGHVLSENGVEVDPEKIRSIKQWPIPTNVRKVRGFLGLTGYYRHFVQHYGSIAAPLTQL
        V+S   +EH Q + LV E L    L     KC F   +  +LGHVL+ +G++ +PEKI +I+++PIPT  ++++ FLGLTGYYR F+ ++  IA P+T+ 
Subjt:  VYSQGWEEHCQHIELVLEVLRSHILFANKKKCSFTATKLEYLGHVLSENGVEVDPEKIRSIKQWPIPTNVRKVRGFLGLTGYYRHFVQHYGSIAAPLTQL

Query:  LK--LGSFKWNQEAQAAFERLQQAMMTLPTLALPDFNAPFELETDASGYGVGAVL
        LK  +     N E  +AF++L+  +   P L +PDF   F L TDAS   +GAVL
Subjt:  LK--LGSFKWNQEAQAAFERLQQAMMTLPTLALPDFNAPFELETDASGYGVGAVL

P20825 Retrovirus-related Pol polyprotein from transposon 2979.1e-7341.13Show/hide
Query:  KVQAVLKQYEDM-FEWPEKLPPERDIKHHIHIKGGTDPINVRLYRYAYKQKEEMEKLVDEMLSSDIIRPSTSPYSSPVLLVKKKD-----GSWRFCVDYR
        K++ +L ++ ++ ++  EKL     IKH ++    + PI  + Y  A   + E+E  V EML+  +IR S SPY+SP  +V KK        +R  +DYR
Subjt:  KVQAVLKQYEDM-FEWPEKLPPERDIKHHIHIKGGTDPINVRLYRYAYKQKEEMEKLVDEMLSSDIIRPSTSPYSSPVLLVKKKD-----GSWRFCVDYR

Query:  ALNNVTIPDKFPIPMVEELFDELNGANLFSKIDLKAGYHQIRMCNQEIEKIAFRTHEGHYEFLVMPFGLMNAPATFQSLMNSIFRAYMQKFVLVFFDDIL
         LN +TIPD++PIP ++E+  +L     F+ IDL  G+HQI M  + I K AF T  GHYE+L MPFGL NAPATFQ  MN+I R  + K  LV+ DDI+
Subjt:  ALNNVTIPDKFPIPMVEELFDELNGANLFSKIDLKAGYHQIRMCNQEIEKIAFRTHEGHYEFLVMPFGLMNAPATFQSLMNSIFRAYMQKFVLVFFDDIL

Query:  VYSQGWEEHCQHIELVLEVLRSHILFANKKKCSFTATKLEYLGHVLSENGVEVDPEKIRSIKQWPIPTNVRKVRGFLGLTGYYRHFVQHYGSIAAPLTQL
        ++S    EH   I+LV   L    L     KC F   +  +LGH+++ +G++ +P K+++I  +PIPT  +++R FLGLTGYYR F+ +Y  IA P+T  
Subjt:  VYSQGWEEHCQHIELVLEVLRSHILFANKKKCSFTATKLEYLGHVLSENGVEVDPEKIRSIKQWPIPTNVRKVRGFLGLTGYYRHFVQHYGSIAAPLTQL

Query:  LKLGSFKWNQEAQ--AAFERLQQAMMTLPTLALPDFNAPFELETDASGYGVGAVL
        LK  +    Q+ +   AFE+L+  ++  P L LPDF   F L TDAS   +GAVL
Subjt:  LKLGSFKWNQEAQ--AAFERLQQAMMTLPTLALPDFNAPFELETDASGYGVGAVL

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein3.0e-6841.18Show/hide
Query:  IKHHIHIKGGTDPINVRLYRYAYKQKEEMEKLVDEMLSSDIIRPSTSPYSSPVLLVKKKDGSWRFCVDYRALNNVTIPDKFPIPMVEELFDELNGANLFS
        +KH I IK G     ++ Y    K ++E+ K+V ++L +  I PS SP SSPV+LV KKDG++R CVDYR LN  TI D FP+P ++ L   +  A +F+
Subjt:  IKHHIHIKGGTDPINVRLYRYAYKQKEEMEKLVDEMLSSDIIRPSTSPYSSPVLLVKKKDGSWRFCVDYRALNNVTIPDKFPIPMVEELFDELNGANLFS

Query:  KIDLKAGYHQIRMCNQEIEKIAFRTHEGHYEFLVMPFGLMNAPATFQSLMNSIFRAYMQKFVLVFFDDILVYSQGWEEHCQHIELVLEVLRSHILFANKK
         +DL +GYHQI M  ++  K AF T  G YE+ VMPFGL+NAP+TF   M   FR    +FV V+ DDIL++S+  EEH +H++ VLE L++  L   KK
Subjt:  KIDLKAGYHQIRMCNQEIEKIAFRTHEGHYEFLVMPFGLMNAPATFQSLMNSIFRAYMQKFVLVFFDDILVYSQGWEEHCQHIELVLEVLRSHILFANKK

Query:  KCSFTATKLEYLGHVLSENGVEVDPEKIRSIKQWPIPTNVRKVRGFLGLTGYYRHFVQHYGSIAAPLTQLLKLGSFKWNQEAQAAFERLQQAMMTLPTLA
        KC F + + E+LG+ +    +     K  +I+ +P P  V++ + FLG+  YYR F+ +   IA P+ QL      +W ++   A E+L+ A+   P L 
Subjt:  KCSFTATKLEYLGHVLSENGVEVDPEKIRSIKQWPIPTNVRKVRGFLGLTGYYRHFVQHYGSIAAPLTQLLKLGSFKWNQEAQAAFERLQQAMMTLPTLA

Query:  LPDFNAPFELETDASGYGVGAVL
          +  A + L TDAS  G+GAVL
Subjt:  LPDFNAPFELETDASGYGVGAVL

Q8I7P9 Retrovirus-related Pol polyprotein from transposon opus3.2e-7038.27Show/hide
Query:  PDKVQAVLK----QYEDMFEWP-EKLPPERDIKHHIHIKGGTDPINVRLYRYAYKQKEEMEKLVDEMLSSDIIRPSTSPYSSPVLLVKKK-----DGSWR
        PD  Q +L     ++  +FE P   +  E  +K  I      DPI  + Y Y    + E+E+ +DE+L   IIRPS SPY+SP+ +V KK     +  +R
Subjt:  PDKVQAVLK----QYEDMFEWP-EKLPPERDIKHHIHIKGGTDPINVRLYRYAYKQKEEMEKLVDEMLSSDIIRPSTSPYSSPVLLVKKK-----DGSWR

Query:  FCVDYRALNNVTIPDKFPIPMVEELFDELNGANLFSKIDLKAGYHQIRMCNQEIEKIAFRTHEGHYEFLVMPFGLMNAPATFQSLMNSIFRAYMQKFVLV
          VD++ LN VTIPD +PIP +      L  A  F+ +DL +G+HQI M   +I K AF T  G YEFL +PFGL NAPA FQ +++ I R ++ K   V
Subjt:  FCVDYRALNNVTIPDKFPIPMVEELFDELNGANLFSKIDLKAGYHQIRMCNQEIEKIAFRTHEGHYEFLVMPFGLMNAPATFQSLMNSIFRAYMQKFVLV

Query:  FFDDILVYSQGWEEHCQHIELVLEVLRSHILFANKKKCSFTATKLEYLGHVLSENGVEVDPEKIRSIKQWPIPTNVRKVRGFLGLTGYYRHFVQHYGSIA
        + DDI+V+S+ ++ H +++ LVL  L    L  N +K  F  T++E+LG++++ +G++ DP+K+R+I + P PT+V++++ FLG+T YYR F+Q Y  +A
Subjt:  FFDDILVYSQGWEEHCQHIELVLEVLRSHILFANKKKCSFTATKLEYLGHVLSENGVEVDPEKIRSIKQWPIPTNVRKVRGFLGLTGYYRHFVQHYGSIA

Query:  APLTQLLK--LGSFKWNQEAQA----------AFERLQQAMMTLPTLALPDFNAPFELETDASGYGVGAVL
         PLT L +    + K +Q ++           +F  L+  + +   LA P F  PF L TDAS + +GAVL
Subjt:  APLTQLLK--LGSFKWNQEAQA----------AFERLQQAMMTLPTLALPDFNAPFELETDASGYGVGAVL

Q99315 Transposon Ty3-G Gag-Pol polyprotein5.2e-6840.87Show/hide
Query:  IKHHIHIKGGTDPINVRLYRYAYKQKEEMEKLVDEMLSSDIIRPSTSPYSSPVLLVKKKDGSWRFCVDYRALNNVTIPDKFPIPMVEELFDELNGANLFS
        +KH I IK G     ++ Y    K ++E+ K+V ++L +  I PS SP SSPV+LV KKDG++R CVDYR LN  TI D FP+P ++ L   +  A +F+
Subjt:  IKHHIHIKGGTDPINVRLYRYAYKQKEEMEKLVDEMLSSDIIRPSTSPYSSPVLLVKKKDGSWRFCVDYRALNNVTIPDKFPIPMVEELFDELNGANLFS

Query:  KIDLKAGYHQIRMCNQEIEKIAFRTHEGHYEFLVMPFGLMNAPATFQSLMNSIFRAYMQKFVLVFFDDILVYSQGWEEHCQHIELVLEVLRSHILFANKK
         +DL +GYHQI M  ++  K AF T  G YE+ VMPFGL+NAP+TF   M   FR    +FV V+ DDIL++S+  EEH +H++ VLE L++  L   KK
Subjt:  KIDLKAGYHQIRMCNQEIEKIAFRTHEGHYEFLVMPFGLMNAPATFQSLMNSIFRAYMQKFVLVFFDDILVYSQGWEEHCQHIELVLEVLRSHILFANKK

Query:  KCSFTATKLEYLGHVLSENGVEVDPEKIRSIKQWPIPTNVRKVRGFLGLTGYYRHFVQHYGSIAAPLTQLLKLGSFKWNQEAQAAFERLQQAMMTLPTLA
        KC F + + E+LG+ +    +     K  +I+ +P P  V++ + FLG+  YYR F+ +   IA P+ QL      +W ++   A ++L+ A+   P L 
Subjt:  KCSFTATKLEYLGHVLSENGVEVDPEKIRSIKQWPIPTNVRKVRGFLGLTGYYRHFVQHYGSIAAPLTQLLKLGSFKWNQEAQAAFERLQQAMMTLPTLA

Query:  LPDFNAPFELETDASGYGVGAVL
          +  A + L TDAS  G+GAVL
Subjt:  LPDFNAPFELETDASGYGVGAVL

Arabidopsis top hitse value%identityAlignment
AT1G67020.1 unknown protein8.6e-1033.75Show/hide
Query:  NNDQSKFKRVEMPVFNGDDPDSWLFQADRYFQIHKLTNSERLIVATISFEGPTLNWYRAQEERDKFIDWLNLKERLLLRF
        +N  S  +R+EMPVF+G     W  + +R+F++ +  +S++L +  +S EG  L W+  +    +F DW + ++RLL RF
Subjt:  NNDQSKFKRVEMPVFNGDDPDSWLFQADRYFQIHKLTNSERLIVATISFEGPTLNWYRAQEERDKFIDWLNLKERLLLRF

ATMG00850.1 DNA/RNA polymerases superfamily protein1.7e-0556.41Show/hide
Query:  QKEEMEKLVDEMLSSDIIRPSTSPYSSPVLLVKKKDGSW
        ++  ++  + EML + II+PS SPYSSPVLLV+KKDG W
Subjt:  QKEEMEKLVDEMLSSDIIRPSTSPYSSPVLLVKKKDGSW

ATMG00860.1 DNA/RNA polymerases superfamily protein4.2e-3349.61Show/hide
Query:  HIELVLEVLRSHILFANKKKCSFTATKLEYLG--HVLSENGVEVDPEKIRSIKQWPIPTNVRKVRGFLGLTGYYRHFVQHYGSIAAPLTQLLKLGSFKWN
        H+ +VL++   H  +AN+KKC+F   ++ YLG  H++S  GV  DP K+ ++  WP P N  ++RGFLGLTGYYR FV++YG I  PLT+LLK  S KW 
Subjt:  HIELVLEVLRSHILFANKKKCSFTATKLEYLG--HVLSENGVEVDPEKIRSIKQWPIPTNVRKVRGFLGLTGYYRHFVQHYGSIAAPLTQLLKLGSFKWN

Query:  QEAQAAFERLQQAMMTLPTLALPDFNAPF
        + A  AF+ L+ A+ TLP LALPD   PF
Subjt:  QEAQAAFERLQQAMMTLPTLALPDFNAPF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTATCGGAGCATGATATTGACCAATCCCTAGACAACACAACTCTTGATGGAAACCAAACAGAGGAGACAATCGTCGGTATCAACGTTGCCTTCGCTGCCGCCGTCGA
TGCTCGGATCAGCACTGCCATGAATGAGTGGTTCCACTGCCGAAGACCTCCAGACCAGCCCATCTATCATCATTTCGAGGTTAGGAACCCTCAAATTCACTCAACATTTG
AGCAATTGGAAGAGCTTCGACAACGGATTGCAACAAATACAATTAAGGCTACCTTAAGGGCAACATCCAACACTCCTGGTCATCACCTTTACCGATGGGAAAAGTTGAAC
AACAATAGCTATTTCTCATGGTCTCCGAAAGTGAAAATGATTGGCAAACCCTTACTGTACGATGCAACTGCTAAGGATATTTGGGACACAACCCAGAAACTATATTCCAA
ACGTCAAAATGCATCTCGTCTGTACACACTACAGAAACAGGTTCATGAGTGCAAAGATGACATGATTTATGACTTTCTTACTGGCCTTAACCCTAAGTTTGATGTACTTT
GCGAGCGTATACTGGGCCAAAGACCGATTCCATCCCCGATGGAGGTCTGTTCTGAAATCCGCCTTGAAGAGAATCGTACAAGTACTATGAATATTTCGGCGACTCATGCT
ATTGACTCTGCTACTTTTAATGTGAGGTCCTCTACCAGTGGCAGTGACAAGCACAATGGAAAACCAGTTTCAGTTTGTGAGCATTGCAATGAACAATGGCATACTAAAGA
TCAATATTGGAAGTTACATGGTCGTTCCCTAAGAGGTAAAAGCGCTCTTCCAACGACAAATGTAACACAAGGCGGGCTTATTCCTTCGGTCTTATTAGTGTTGATGGGAA
AGAACCCTTGGATTCTAGACTCTGGTGCCATAGATCATTTGATTGGTTCCACTAAACATTTTGTGTCCTATATTCCATGTATTGGTAACAAAACAATCAATATTGTAGAT
GACTCCTTGTCTCCCATTGTTGGGAATGGGAAGATTTCTCCCTTTAGCTTGAACTCGATGAGGATGATTGACACTGCCCGATATAGTAGGGGACTCTACCTCCTTGATGA
CGTGTGTCTGGGGACTATCCTAACTACGTCATCTGGGAAACGGTGGTTTGTAACCTTTGTTGATGATCATACCTGTCTTACTTGGGTCTTTCTCGTCTCCGACAAATCTG
AGAGTGACAATGCTCGTGAGTTCCAAAACCACACCCTCAATGAGTTTTTGTCCTCCAAAGGAAATTTCCACAAAAATTCCTATGCCTACCCCCCCCAACAAAATGGCGTA
GCCGAGCGCAAAAACCATCTCCTTTTGGAAACCCCATTAGAGTGTCTTAAAGAGTCATATTCCTCTACACACCTCATTTTTTATGTTCCTTTTCCGGAGAAATTTGTTAG
ACCTCCTATAATACACCAATATAGCATAAGGAAGGATCTGAAGGAAAAGAAGGACAAGAAAATAGTGGAGAATAATATTTTCTGTGTATTCTTGAGGGAAGAAATAGCAG
AGAGGGAGAGTAGTGTTCTACGAACTGAGCACGACGATCTGGAAAGAAGAACAATCATGGTACAAACAAGAATCGAGGAAAGGATGGAAGCGCACAAACAAGAGCTTAGT
AAAATACCGGTGATAGAGGAGAAACTCACGACGATTTCGCAAAACATGGAAGGCCTGCAGGCTCAAGTAGAAAAAACTCACCAGTTGGTAATGATATTCATGGAGACGGT
GGCAAAGGAACGACTGCTAGCAAATGGTAAGGGGAGTGAGTCACCGGTGCAAACAACCTCAACGATGAAATCGGTGGAAGGAGAGAGCTCGGGGAGCAAGGAGACCAGAA
ACGATATGTCAAAGAAGAAAGTAGACGACGAAGGAGAGAACAATGATCAGAGCAAGTTCAAAAGAGTGGAAATGCCAGTGTTTAACGGTGACGATCCCGACTCGTGGTTG
TTTCAAGCTGACAGGTATTTTCAAATACATAAGTTAACTAATTCGGAAAGGCTCATTGTAGCCACGATTAGCTTCGAAGGACCGACTCTGAATTGGTATCGAGCCCAAGA
AGAGAGAGATAAGTTTATCGATTGGTTAAACCTGAAGGAACGACTACTGTTACGTTTTCGGTTTATGTGTGAAGGGTCTCTGTATGGCTGCTTTTTACGTATACAACAAG
ATTTGAGCGCAGAAGAGTATCGAAATCTTTTCGATAAGTGGATGGCGCCATTAGCAGATATTCAGGAGAAGATTGTGGAGGAGACGTTTATGGGAGGATTATTGGCTGAG
ATGATGAAATGTGCGCAGATGGTAGAGCACCGGGAAATCGTGAGGAGGGAAGCAAATCTTCCTGGTTACTCAGGATCGAAATCCCCAAACTATCCCCATACTAACGCCAA
AACAAATACAGTTGCAAATGACCAAAGTTTCAAGCCAAGAGAGAAAAGGGACTCTGATTCAAGTGTGACGAAAAATATTACTCATGACATAAATGCAAGGCAAAGGATCT
ACGAATGTTTGTGTATAAGGGAACATCGAAGGAGTAATAGAACTTTGTATCAACTTGGTGGTTGGGTTGACGAACCCTGGGACTATGAAGATAAGGGGGCTGACCTAAGA
CAAAGAAGTTGTCGTGTTAGTCGATTGCGGCGCCACACACAACTTTATATCGGAAAGATTGGTCACAACCTTGCAACTTCCGACCAAGGACACCTCCAATTATGGGGTCA
TCTTGGGGTACGGAACAACGATCAAAGGAAAAGGCGTTTGCGAACAGGTGAAACTCAGTCTCAATGGTTGGATCGTCACCGCAGACTTTCTACCGTTGGAACTAGGAGAA
GTGGACGTAATACTGGGGATGCAATGGCTATATTCACTAGGAGTAACAGAGATGGACTGGAAGAACTTGATCATTCTTGGACTGATTCGGATCTGGGATATTTGGTTGAA
TGTCGAGCACTGGAAACAAGAATCATGGAATACGAGCCAAGAGCAGAGGATGGGCTCATCAGCATTCCAGACAAGGTGCAGGCAGTGTTAAAGCAGTACGAGGACATGTT
TGAATGGCCTGAAAAATTACCACCGGAGAGGGATATTAAACATCACATACACATCAAGGGAGGCACCGACCCAATCAACGTCAGACTTTATAGGTATGCCTACAAACAGA
AAGAAGAGATGGAGAAACTAGTGGATGAAATGCTCTCATCCGATATTATTCGCCCAAGTACCAGCCCCTACTCAAGCCCAGTCTTATTAGTAAAGAAAAAAGATGGGAGT
TGGAGGTTTTGCGTGGATTACAGAGCACTCAACAATGTCACCATTCCAGATAAGTTTCCAATTCCTATGGTTGAAGAATTGTTTGATGAATTAAATGGAGCGAACCTATT
TTCTAAAATCGATCTAAAGGCTGGTTATCACCAAATTAGAATGTGTAATCAAGAAATAGAAAAAATAGCCTTTAGGACTCATGAAGGACATTATGAATTCTTAGTAATGC
CGTTTGGACTCATGAATGCCCCGGCCACGTTTCAGTCCCTGATGAATAGTATATTTAGAGCCTACATGCAGAAATTTGTCTTGGTTTTTTTCGATGACATTCTAGTATAC
AGTCAAGGATGGGAAGAACATTGTCAGCACATAGAATTGGTATTGGAAGTCTTGAGAAGCCATATATTGTTTGCGAATAAGAAGAAGTGCAGCTTCACGGCTACTAAACT
GGAATATTTGGGGCATGTGCTGTCTGAAAATGGGGTGGAAGTGGATCCAGAAAAAATCAGATCCATCAAACAATGGCCGATTCCTACGAATGTACGTAAAGTAAGAGGAT
TTTTGGGGTTGACCGGATACTACCGTCATTTCGTACAACATTATGGGTCCATTGCAGCTCCCTTAACACAGTTATTGAAGTTGGGATCCTTTAAATGGAATCAGGAAGCT
CAAGCAGCATTTGAAAGGTTGCAGCAAGCCATGATGACCCTGCCTACTTTAGCTCTCCCAGACTTCAATGCACCGTTCGAATTGGAGACTGATGCATCTGGTTATGGGGT
GGGAGCTGTCCTGATGTAG
mRNA sequenceShow/hide mRNA sequence
ATGGTATCGGAGCATGATATTGACCAATCCCTAGACAACACAACTCTTGATGGAAACCAAACAGAGGAGACAATCGTCGGTATCAACGTTGCCTTCGCTGCCGCCGTCGA
TGCTCGGATCAGCACTGCCATGAATGAGTGGTTCCACTGCCGAAGACCTCCAGACCAGCCCATCTATCATCATTTCGAGGTTAGGAACCCTCAAATTCACTCAACATTTG
AGCAATTGGAAGAGCTTCGACAACGGATTGCAACAAATACAATTAAGGCTACCTTAAGGGCAACATCCAACACTCCTGGTCATCACCTTTACCGATGGGAAAAGTTGAAC
AACAATAGCTATTTCTCATGGTCTCCGAAAGTGAAAATGATTGGCAAACCCTTACTGTACGATGCAACTGCTAAGGATATTTGGGACACAACCCAGAAACTATATTCCAA
ACGTCAAAATGCATCTCGTCTGTACACACTACAGAAACAGGTTCATGAGTGCAAAGATGACATGATTTATGACTTTCTTACTGGCCTTAACCCTAAGTTTGATGTACTTT
GCGAGCGTATACTGGGCCAAAGACCGATTCCATCCCCGATGGAGGTCTGTTCTGAAATCCGCCTTGAAGAGAATCGTACAAGTACTATGAATATTTCGGCGACTCATGCT
ATTGACTCTGCTACTTTTAATGTGAGGTCCTCTACCAGTGGCAGTGACAAGCACAATGGAAAACCAGTTTCAGTTTGTGAGCATTGCAATGAACAATGGCATACTAAAGA
TCAATATTGGAAGTTACATGGTCGTTCCCTAAGAGGTAAAAGCGCTCTTCCAACGACAAATGTAACACAAGGCGGGCTTATTCCTTCGGTCTTATTAGTGTTGATGGGAA
AGAACCCTTGGATTCTAGACTCTGGTGCCATAGATCATTTGATTGGTTCCACTAAACATTTTGTGTCCTATATTCCATGTATTGGTAACAAAACAATCAATATTGTAGAT
GACTCCTTGTCTCCCATTGTTGGGAATGGGAAGATTTCTCCCTTTAGCTTGAACTCGATGAGGATGATTGACACTGCCCGATATAGTAGGGGACTCTACCTCCTTGATGA
CGTGTGTCTGGGGACTATCCTAACTACGTCATCTGGGAAACGGTGGTTTGTAACCTTTGTTGATGATCATACCTGTCTTACTTGGGTCTTTCTCGTCTCCGACAAATCTG
AGAGTGACAATGCTCGTGAGTTCCAAAACCACACCCTCAATGAGTTTTTGTCCTCCAAAGGAAATTTCCACAAAAATTCCTATGCCTACCCCCCCCAACAAAATGGCGTA
GCCGAGCGCAAAAACCATCTCCTTTTGGAAACCCCATTAGAGTGTCTTAAAGAGTCATATTCCTCTACACACCTCATTTTTTATGTTCCTTTTCCGGAGAAATTTGTTAG
ACCTCCTATAATACACCAATATAGCATAAGGAAGGATCTGAAGGAAAAGAAGGACAAGAAAATAGTGGAGAATAATATTTTCTGTGTATTCTTGAGGGAAGAAATAGCAG
AGAGGGAGAGTAGTGTTCTACGAACTGAGCACGACGATCTGGAAAGAAGAACAATCATGGTACAAACAAGAATCGAGGAAAGGATGGAAGCGCACAAACAAGAGCTTAGT
AAAATACCGGTGATAGAGGAGAAACTCACGACGATTTCGCAAAACATGGAAGGCCTGCAGGCTCAAGTAGAAAAAACTCACCAGTTGGTAATGATATTCATGGAGACGGT
GGCAAAGGAACGACTGCTAGCAAATGGTAAGGGGAGTGAGTCACCGGTGCAAACAACCTCAACGATGAAATCGGTGGAAGGAGAGAGCTCGGGGAGCAAGGAGACCAGAA
ACGATATGTCAAAGAAGAAAGTAGACGACGAAGGAGAGAACAATGATCAGAGCAAGTTCAAAAGAGTGGAAATGCCAGTGTTTAACGGTGACGATCCCGACTCGTGGTTG
TTTCAAGCTGACAGGTATTTTCAAATACATAAGTTAACTAATTCGGAAAGGCTCATTGTAGCCACGATTAGCTTCGAAGGACCGACTCTGAATTGGTATCGAGCCCAAGA
AGAGAGAGATAAGTTTATCGATTGGTTAAACCTGAAGGAACGACTACTGTTACGTTTTCGGTTTATGTGTGAAGGGTCTCTGTATGGCTGCTTTTTACGTATACAACAAG
ATTTGAGCGCAGAAGAGTATCGAAATCTTTTCGATAAGTGGATGGCGCCATTAGCAGATATTCAGGAGAAGATTGTGGAGGAGACGTTTATGGGAGGATTATTGGCTGAG
ATGATGAAATGTGCGCAGATGGTAGAGCACCGGGAAATCGTGAGGAGGGAAGCAAATCTTCCTGGTTACTCAGGATCGAAATCCCCAAACTATCCCCATACTAACGCCAA
AACAAATACAGTTGCAAATGACCAAAGTTTCAAGCCAAGAGAGAAAAGGGACTCTGATTCAAGTGTGACGAAAAATATTACTCATGACATAAATGCAAGGCAAAGGATCT
ACGAATGTTTGTGTATAAGGGAACATCGAAGGAGTAATAGAACTTTGTATCAACTTGGTGGTTGGGTTGACGAACCCTGGGACTATGAAGATAAGGGGGCTGACCTAAGA
CAAAGAAGTTGTCGTGTTAGTCGATTGCGGCGCCACACACAACTTTATATCGGAAAGATTGGTCACAACCTTGCAACTTCCGACCAAGGACACCTCCAATTATGGGGTCA
TCTTGGGGTACGGAACAACGATCAAAGGAAAAGGCGTTTGCGAACAGGTGAAACTCAGTCTCAATGGTTGGATCGTCACCGCAGACTTTCTACCGTTGGAACTAGGAGAA
GTGGACGTAATACTGGGGATGCAATGGCTATATTCACTAGGAGTAACAGAGATGGACTGGAAGAACTTGATCATTCTTGGACTGATTCGGATCTGGGATATTTGGTTGAA
TGTCGAGCACTGGAAACAAGAATCATGGAATACGAGCCAAGAGCAGAGGATGGGCTCATCAGCATTCCAGACAAGGTGCAGGCAGTGTTAAAGCAGTACGAGGACATGTT
TGAATGGCCTGAAAAATTACCACCGGAGAGGGATATTAAACATCACATACACATCAAGGGAGGCACCGACCCAATCAACGTCAGACTTTATAGGTATGCCTACAAACAGA
AAGAAGAGATGGAGAAACTAGTGGATGAAATGCTCTCATCCGATATTATTCGCCCAAGTACCAGCCCCTACTCAAGCCCAGTCTTATTAGTAAAGAAAAAAGATGGGAGT
TGGAGGTTTTGCGTGGATTACAGAGCACTCAACAATGTCACCATTCCAGATAAGTTTCCAATTCCTATGGTTGAAGAATTGTTTGATGAATTAAATGGAGCGAACCTATT
TTCTAAAATCGATCTAAAGGCTGGTTATCACCAAATTAGAATGTGTAATCAAGAAATAGAAAAAATAGCCTTTAGGACTCATGAAGGACATTATGAATTCTTAGTAATGC
CGTTTGGACTCATGAATGCCCCGGCCACGTTTCAGTCCCTGATGAATAGTATATTTAGAGCCTACATGCAGAAATTTGTCTTGGTTTTTTTCGATGACATTCTAGTATAC
AGTCAAGGATGGGAAGAACATTGTCAGCACATAGAATTGGTATTGGAAGTCTTGAGAAGCCATATATTGTTTGCGAATAAGAAGAAGTGCAGCTTCACGGCTACTAAACT
GGAATATTTGGGGCATGTGCTGTCTGAAAATGGGGTGGAAGTGGATCCAGAAAAAATCAGATCCATCAAACAATGGCCGATTCCTACGAATGTACGTAAAGTAAGAGGAT
TTTTGGGGTTGACCGGATACTACCGTCATTTCGTACAACATTATGGGTCCATTGCAGCTCCCTTAACACAGTTATTGAAGTTGGGATCCTTTAAATGGAATCAGGAAGCT
CAAGCAGCATTTGAAAGGTTGCAGCAAGCCATGATGACCCTGCCTACTTTAGCTCTCCCAGACTTCAATGCACCGTTCGAATTGGAGACTGATGCATCTGGTTATGGGGT
GGGAGCTGTCCTGATGTAG
Protein sequenceShow/hide protein sequence
MVSEHDIDQSLDNTTLDGNQTEETIVGINVAFAAAVDARISTAMNEWFHCRRPPDQPIYHHFEVRNPQIHSTFEQLEELRQRIATNTIKATLRATSNTPGHHLYRWEKLN
NNSYFSWSPKVKMIGKPLLYDATAKDIWDTTQKLYSKRQNASRLYTLQKQVHECKDDMIYDFLTGLNPKFDVLCERILGQRPIPSPMEVCSEIRLEENRTSTMNISATHA
IDSATFNVRSSTSGSDKHNGKPVSVCEHCNEQWHTKDQYWKLHGRSLRGKSALPTTNVTQGGLIPSVLLVLMGKNPWILDSGAIDHLIGSTKHFVSYIPCIGNKTINIVD
DSLSPIVGNGKISPFSLNSMRMIDTARYSRGLYLLDDVCLGTILTTSSGKRWFVTFVDDHTCLTWVFLVSDKSESDNAREFQNHTLNEFLSSKGNFHKNSYAYPPQQNGV
AERKNHLLLETPLECLKESYSSTHLIFYVPFPEKFVRPPIIHQYSIRKDLKEKKDKKIVENNIFCVFLREEIAERESSVLRTEHDDLERRTIMVQTRIEERMEAHKQELS
KIPVIEEKLTTISQNMEGLQAQVEKTHQLVMIFMETVAKERLLANGKGSESPVQTTSTMKSVEGESSGSKETRNDMSKKKVDDEGENNDQSKFKRVEMPVFNGDDPDSWL
FQADRYFQIHKLTNSERLIVATISFEGPTLNWYRAQEERDKFIDWLNLKERLLLRFRFMCEGSLYGCFLRIQQDLSAEEYRNLFDKWMAPLADIQEKIVEETFMGGLLAE
MMKCAQMVEHREIVRREANLPGYSGSKSPNYPHTNAKTNTVANDQSFKPREKRDSDSSVTKNITHDINARQRIYECLCIREHRRSNRTLYQLGGWVDEPWDYEDKGADLR
QRSCRVSRLRRHTQLYIGKIGHNLATSDQGHLQLWGHLGVRNNDQRKRRLRTGETQSQWLDRHRRLSTVGTRRSGRNTGDAMAIFTRSNRDGLEELDHSWTDSDLGYLVE
CRALETRIMEYEPRAEDGLISIPDKVQAVLKQYEDMFEWPEKLPPERDIKHHIHIKGGTDPINVRLYRYAYKQKEEMEKLVDEMLSSDIIRPSTSPYSSPVLLVKKKDGS
WRFCVDYRALNNVTIPDKFPIPMVEELFDELNGANLFSKIDLKAGYHQIRMCNQEIEKIAFRTHEGHYEFLVMPFGLMNAPATFQSLMNSIFRAYMQKFVLVFFDDILVY
SQGWEEHCQHIELVLEVLRSHILFANKKKCSFTATKLEYLGHVLSENGVEVDPEKIRSIKQWPIPTNVRKVRGFLGLTGYYRHFVQHYGSIAAPLTQLLKLGSFKWNQEA
QAAFERLQQAMMTLPTLALPDFNAPFELETDASGYGVGAVLM