; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh00G000060 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh00G000060
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionReverse transcriptase
Genome locationCmo_Chr00:1500247..1502312
RNA-Seq ExpressionCmoCh00G000060
SyntenyCmoCh00G000060
Gene Ontology termsGO:0006259 - DNA metabolic process (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0004518 - nuclease activity (molecular function)
GO:0016779 - nucleotidyltransferase activity (molecular function)
InterPro domainsIPR021109 - Aspartic peptidase domain superfamily
IPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022150099.1 uncharacterized protein LOC111018360 [Momordica charantia]8.6e-27369.09Show/hide
Query:  KSGWMDRPPQNNQAGTSRGPYHQRNHPTTPLQCMLCKGPHKVSYCPHRAFLTALQVSIQEINDARVESMLDKKEDQDNPRMGALKYLSALQWKVES-KEI
        KSG  D+ PQ    G SRGPY Q  +    +   LCKGPH+V+ CPHRA LTALQ S+Q  N+  VE+  +K+ED++ PRM ALK+LSA+Q +V   K  
Subjt:  KSGWMDRPPQNNQAGTSRGPYHQRNHPTTPLQCMLCKGPHKVSYCPHRAFLTALQVSIQEINDARVESMLDKKEDQDNPRMGALKYLSALQWKVES-KEI

Query:  IEKGLMFVDAAINSQPIKSTLIDSGATHNFIADQEARRLGLTIGKDPGKMKVVNSEALPIVGVSKRVLFKIGDWTGELDLVVVRMDDFDVVLGIEFLLEH
         EKGLMFVDA IN    KST++DSGATHNFI++QEARRL LTI KD GKMKVVN EALPIVGVSKRV+ K+G WTG +D VVVRMDDFDVVLG+EFL+EH
Subjt:  IEKGLMFVDAAINSQPIKSTLIDSGATHNFIADQEARRLGLTIGKDPGKMKVVNSEALPIVGVSKRVLFKIGDWTGELDLVVVRMDDFDVVLGIEFLLEH

Query:  KVIPMPLAKCLVITDRNPTVIPASIKQPGNIRMISAIQLKRGLAREEPTFMAIPLMEEATTEETVSNEIKEVLNSYADIMPESLPQTLPPRRGIDHEIEL
        KVIPMPLAKC+++T  +PTV+  SIKQPG IRMISA+QLK+GL REEPTFMAIP++E+      V  EI+ V+  Y DIMP+SLP+TLPPRRGIDHEIEL
Subjt:  KVIPMPLAKCLVITDRNPTVIPASIKQPGNIRMISAIQLKRGLAREEPTFMAIPLMEEATTEETVSNEIKEVLNSYADIMPESLPQTLPPRRGIDHEIEL

Query:  LPGVKPPAKNAYRMAPPELAKLRKQLNELLTAGFIRPTKAPYGAPVLFQKKKDGTLRLCIDYKALNKVTVRNKYPLPIISDLFDQLHGAKYFTKLDLRSG
        +PG KPPAKNAYRMAPPELA+LRKQL+ELL AGFIRP KAP+GAPVLFQKKKDGTLRLCIDY+ALNKVTVRNKYPLPII+DLFDQLHGAKYFTKLDLRSG
Subjt:  LPGVKPPAKNAYRMAPPELAKLRKQLNELLTAGFIRPTKAPYGAPVLFQKKKDGTLRLCIDYKALNKVTVRNKYPLPIISDLFDQLHGAKYFTKLDLRSG

Query:  YYQVRIAEGDEPKTTCVTRYGAFEFL---------------------------FVIVYLDDIVVYSKTLEEHKVHLKLVFDKLRQNQL------------
        YYQVRIAEGDEPKTTCVTRYGAFEFL                           FV+VYLDDIVVYS TL+EH++HL+LVFDKLRQNQL            
Subjt:  YYQVRIAEGDEPKTTCVTRYGAFEFL---------------------------FVIVYLDDIVVYSKTLEEHKVHLKLVFDKLRQNQL------------

Query:  ---------------MDSDKIKAIQEWKVPTSVSEVRSFLGLANYYRRFVEGFSRLAAPLTELLKKDHPWSWSNDCQMAFENLKTTMTRGLVLGLVDVTK
                       MD+DK+KAIQEW+VPTSV E+RSFLGLANYYRRF+EGFSR A P+TELLKK   W WS + Q AFE+LK  M +G VLGL DVTK
Subjt:  ---------------MDSDKIKAIQEWKVPTSVSEVRSFLGLANYYRRFVEGFSRLAAPLTELLKKDHPWSWSNDCQMAFENLKTTMTRGLVLGLVDVTK

Query:  PFEIETDASDFALGGVLIQEGHPIAYESRMLNDAERRYTISEKEMLAVVHCLRVWRQYLLGSQFVVKTDNTAICHFFDQPKLPAKQARW
        PFE+ETDASD+ALGGVL+Q+ HPI YESR LN+AERRYT+SEKEMLAVVHCLR WRQYLLGS FVVKTDN+AICHFF+QPKL +KQARW
Subjt:  PFEIETDASDFALGGVLIQEGHPIAYESRMLNDAERRYTISEKEMLAVVHCLRVWRQYLLGSQFVVKTDNTAICHFFDQPKLPAKQARW

XP_022975176.1 uncharacterized protein LOC111474215 [Cucurbita maxima]6.6e-27375.3Show/hide
Query:  SGWMDRPPQNNQAGTSRGPYHQRNHPTTPLQCMLCKGPHKVSYCPHRAFLTALQVSIQEINDARVESMLDKKEDQDNPRMGALKYLSALQWKVESKEIIE
        SGW DRPPQNNQAGTSRGPYHQRNHPTTPLQCMLCKGPHKVSYCPHRA LTALQVSIQE NDA+VE+MLDKKEDQDNPRMGALK+LSALQ KVE KEI+E
Subjt:  SGWMDRPPQNNQAGTSRGPYHQRNHPTTPLQCMLCKGPHKVSYCPHRAFLTALQVSIQEINDARVESMLDKKEDQDNPRMGALKYLSALQWKVESKEIIE

Query:  KGLMFVDAAINSQPIKSTLIDSGATHNFIADQEARRLGLTIGKDPGKMKVVNSEALPIVGVSKRVLFKIGDWTGELDLVVVRMDDFDVVLGIEFLLEHKV
        KGL+                                                            V FKIGDWTGELDLVV RMDDFDVVLG+EFLLEHKV
Subjt:  KGLMFVDAAINSQPIKSTLIDSGATHNFIADQEARRLGLTIGKDPGKMKVVNSEALPIVGVSKRVLFKIGDWTGELDLVVVRMDDFDVVLGIEFLLEHKV

Query:  IPMPLAKCLVITDRNPTVIPASIKQPGNIRMISAIQLKRGLAREEPTFMAIPLMEEATTEETVSNEIKEVLNSYADIMPESLPQTLPPRRGIDHEIELLP
        IPMPLAKCLV+TDRNPTVIPASIKQPGN+RMIS IQLKRGLAREE TFMAIPLME ATTEETV NEI EVLNSYADIM ESLPQTLPPRR          
Subjt:  IPMPLAKCLVITDRNPTVIPASIKQPGNIRMISAIQLKRGLAREEPTFMAIPLMEEATTEETVSNEIKEVLNSYADIMPESLPQTLPPRRGIDHEIELLP

Query:  GVKPPAKNAYRMAPPELAKLRKQLNELLTAGFIRPTKAPYGAPVLFQKKKDGTLRLCIDYKALNKVTVRNKYPLPIISDLFDQLHGAKYFTKLDLRSGYY
             AKNAYRMA PELA+LRKQL+ELL AGFIRP KAPY APVLFQKKKDGTL LCIDY+ALNKVTVRNK+PLPIISDLFDQLHGAKYFTKLDL SGYY
Subjt:  GVKPPAKNAYRMAPPELAKLRKQLNELLTAGFIRPTKAPYGAPVLFQKKKDGTLRLCIDYKALNKVTVRNKYPLPIISDLFDQLHGAKYFTKLDLRSGYY

Query:  QVRIAEGDEPKTTCVTRYGAFEFL---------------------------FVIVYLDDIVVYSKTLEEHKVHLKLVFDKLRQNQLMDSDKIKAIQEWKV
        QVRIAEGDEP+TTCVTRYGAFEFL                           FVIVYLDDIVVYS TLEEHKVHLKL+         MDSDKIKAIQEWKV
Subjt:  QVRIAEGDEPKTTCVTRYGAFEFL---------------------------FVIVYLDDIVVYSKTLEEHKVHLKLVFDKLRQNQLMDSDKIKAIQEWKV

Query:  PTSVSEVRSFLGLANYYRRFVEGFSRLAAPLTELLKKDHPWSWSNDCQMAFENLKTTMTRGLVLGLVDVTKPFEIETDASDFALGGVLIQEGHPIAYESR
        PTSVS++RSFLGLANYYRRFVEGFSR AAPL ELLKKDHPW WSNDCQMAFE+LKTTM  G VLGLVDVTKPFEIETDASDFALGGVLIQEGHPIA+ESR
Subjt:  PTSVSEVRSFLGLANYYRRFVEGFSRLAAPLTELLKKDHPWSWSNDCQMAFENLKTTMTRGLVLGLVDVTKPFEIETDASDFALGGVLIQEGHPIAYESR

Query:  MLNDAERRYTISEKEMLAVVHCLRVWRQYLLGSQFVVKTDNTAICHFFDQPKLPAKQARW
         LNDAERRY +SEK+ML VVHCLRVWRQYLLGSQFVVKTDN+ ICHFFDQPKL AKQARW
Subjt:  MLNDAERRYTISEKEMLAVVHCLRVWRQYLLGSQFVVKTDNTAICHFFDQPKLPAKQARW

XP_022975516.1 uncharacterized protein LOC111474945, partial [Cucurbita maxima]4.3e-28076.24Show/hide
Query:  SGWMDRPPQNNQAGTSRGPYHQRNHPTTPLQCMLCKGPHKVSYCPHRAFLTALQVSIQEINDARVESMLDKKEDQDNPRMGALKYLSALQWKVESKEIIE
        SGW DRPPQNNQAGTSRGPYHQRNHPTTPLQCMLCKGPHKVSYCPHRA LTALQVSIQE NDA+VE+MLDKKEDQDNPRMGALK+LSALQ KVE KEI+E
Subjt:  SGWMDRPPQNNQAGTSRGPYHQRNHPTTPLQCMLCKGPHKVSYCPHRAFLTALQVSIQEINDARVESMLDKKEDQDNPRMGALKYLSALQWKVESKEIIE

Query:  KGLMFVDAAINSQPIKSTLIDSGATHNFIADQEARRLGLTIGKDPGKMKVVNSEALPIVGVSKRVLFKIGDWTGELDLVVVRMDDFDVVLGIEFLLEHKV
        KGL+                                                            V FKIGDWTGELDLVV RMDDFDVVLG+EFLLEHKV
Subjt:  KGLMFVDAAINSQPIKSTLIDSGATHNFIADQEARRLGLTIGKDPGKMKVVNSEALPIVGVSKRVLFKIGDWTGELDLVVVRMDDFDVVLGIEFLLEHKV

Query:  IPMPLAKCLVITDRNPTVIPASIKQPGNIRMISAIQLKRGLAREEPTFMAIPLMEEATTEETVSNEIKEVLNSYADIMPESLPQTLPPRRGIDHEIELLP
        IPMPLAKCLVITDRNPTVIPASIKQP                               TTEETV NEI EVLN YADIMPESLPQTLPPRRGIDHEIEL+P
Subjt:  IPMPLAKCLVITDRNPTVIPASIKQPGNIRMISAIQLKRGLAREEPTFMAIPLMEEATTEETVSNEIKEVLNSYADIMPESLPQTLPPRRGIDHEIELLP

Query:  GVKPPAKNAYRMAPPELAKLRKQLNELLTAGFIRPTKAPYGAPVLFQKKKDGTLRLCIDYKALNKVTVRNKYPLPIISDLFDQLHGAKYFTKLDLRSGYY
        GVKPPAKNAYRMAPPELA+LRKQL+ELL AGFIRP KAPYGAPVLFQKKKDGTLRLCIDY+ALNKVTVRNKYPLPIISDLFDQLHGAKYFTKLDLRSGYY
Subjt:  GVKPPAKNAYRMAPPELAKLRKQLNELLTAGFIRPTKAPYGAPVLFQKKKDGTLRLCIDYKALNKVTVRNKYPLPIISDLFDQLHGAKYFTKLDLRSGYY

Query:  QVRIAEGDEPKTTCVTRYGAFEFL---------------------------FVIVYLDDIVVYSKTLEEHKVHLKLVFDKLRQNQL-----MDSDKIKAI
        QVRIAEGDEPKTTCVTRYGAFEFL                           FVIVYLDDIVVYS TLEEHKVHLKLVFDKLRQNQL     MDSDKIKAI
Subjt:  QVRIAEGDEPKTTCVTRYGAFEFL---------------------------FVIVYLDDIVVYSKTLEEHKVHLKLVFDKLRQNQL-----MDSDKIKAI

Query:  QEWKVPTSVSEVRSFLGLANYYRRFVEGFSRLAAPLTELLKKDHPWSWSNDCQMAFENLKTTMTRGLVLGLVDVTKPFEIETDASDFALGGVLIQEGHPI
        QEWKVPTSVS++RSFLGLANYYRRFVEGFSR AAPLTELLKKDH WSWS+DCQMAFE+LKTTMTRG VLGLVDVTKPFEIETDASDFALGGVLIQEGHPI
Subjt:  QEWKVPTSVSEVRSFLGLANYYRRFVEGFSRLAAPLTELLKKDHPWSWSNDCQMAFENLKTTMTRGLVLGLVDVTKPFEIETDASDFALGGVLIQEGHPI

Query:  AYESRMLNDAERRYTISEKEMLAVVHCLRVWRQYLLGSQFVVKTDNTAICHFFDQPKLPAKQARW
        A+ESR LNDAERRYT+SEKEMLAVVHCLRVWRQYLLGSQFVVKTDN+AICHFFDQPKL AKQARW
Subjt:  AYESRMLNDAERRYTISEKEMLAVVHCLRVWRQYLLGSQFVVKTDNTAICHFFDQPKLPAKQARW

XP_023524533.1 uncharacterized protein LOC111788429 [Cucurbita pepo subsp. pepo]0.0e+0085.88Show/hide
Query:  SGWMDRPPQNNQAGTSRGPYHQRNHPTTPLQCMLCKGPHKVSYCPHRAFLTALQVSIQEINDARVESMLDKKEDQDNPRMGALKYLSALQWKVESKEIIE
        +GW DRPPQNNQAGTSRGPYHQ+NHPTTPLQCMLCKGPHKVSYCPHRA LTALQVSIQE N+ARVE++LDKKED DNPRMGALK+LSALQ KVE KEIIE
Subjt:  SGWMDRPPQNNQAGTSRGPYHQRNHPTTPLQCMLCKGPHKVSYCPHRAFLTALQVSIQEINDARVESMLDKKEDQDNPRMGALKYLSALQWKVESKEIIE

Query:  KGLMFVDAAINSQPIKSTLIDSGATHNFIADQEARRLGLTIGKDPGKMKVVNSEALPIVGVSKRVLFKIGDWTGELDLVVVRMDDFDVVLGIEFLLEHKV
        KGLMFVDA INS+  KSTLIDSGATHNFIADQEARRLGLTIGKDPGKMK VNSEALPIVGVSK V FKIGDWTGELDLVVVRMDDFDVVLG+EFLLEHKV
Subjt:  KGLMFVDAAINSQPIKSTLIDSGATHNFIADQEARRLGLTIGKDPGKMKVVNSEALPIVGVSKRVLFKIGDWTGELDLVVVRMDDFDVVLGIEFLLEHKV

Query:  IPMPLAKCLVITDRNPTVIPASIKQPGNIRMISAIQLKRGLAREEPTFMAIPLMEEATTEETVSNEIKEVLNSYADIMPESLPQTLPPRRGIDHEIELLP
        IPMPLAKCLVITDRNPTVIPASIKQPGN+RMISAIQLKRGLAREEPTFMAIPL+EEATTEETV  EIKEVL++Y DIMPESLPQTLPPRRGIDHEIELLP
Subjt:  IPMPLAKCLVITDRNPTVIPASIKQPGNIRMISAIQLKRGLAREEPTFMAIPLMEEATTEETVSNEIKEVLNSYADIMPESLPQTLPPRRGIDHEIELLP

Query:  GVKPPAKNAYRMAPPELAKLRKQLNELLTAGFIRPTKAPYGAPVLFQKKKDGTLRLCIDYKALNKVTVRNKYPLPIISDLFDQLHGAKYFTKLDLRSGYY
        GVKPPAKNAYRMAPPELA+LRKQL+ELL AGFIRP KAPYGAPVLFQKKKDGTLRLCIDY+ALNKVTVRNKYPLPIISDLFDQLHGAKYFTKLDLRSGYY
Subjt:  GVKPPAKNAYRMAPPELAKLRKQLNELLTAGFIRPTKAPYGAPVLFQKKKDGTLRLCIDYKALNKVTVRNKYPLPIISDLFDQLHGAKYFTKLDLRSGYY

Query:  QVRIAEGDEPKTTCVTRYGAFEFL---------------------------FVIVYLDDIVVYSKTLEEHKVHLKLVFDKLRQNQL--------------
        QVRI EGDEPKTTCVTRYGAFEFL                           FVIVYLDDIVVYS TLEEHKVHLKLVFDKLRQNQL              
Subjt:  QVRIAEGDEPKTTCVTRYGAFEFL---------------------------FVIVYLDDIVVYSKTLEEHKVHLKLVFDKLRQNQL--------------

Query:  -------------MDSDKIKAIQEWKVPTSVSEVRSFLGLANYYRRFVEGFSRLAAPLTELLKKDHPWSWSNDCQMAFENLKTTMTRGLVLGLVDVTKPF
                     MDSDKIKAIQEWKVPTSVSE+RSFLGLANYYRRFVEGFSR AAPLTELLKKDHPWSWSNDCQMAFENLKTTMTRG VLGLVDVTKPF
Subjt:  -------------MDSDKIKAIQEWKVPTSVSEVRSFLGLANYYRRFVEGFSRLAAPLTELLKKDHPWSWSNDCQMAFENLKTTMTRGLVLGLVDVTKPF

Query:  EIETDASDFALGGVLIQEGHPIAYESRMLNDAERRYTISEKEMLAVVHCLRVWRQYLLGSQFVVKTDNTAICHFFDQPKLPAKQARW
        E+ETDASDFALGGVLIQEGHPIAYESR LNDAERRYT+SEKEMLAVVHCLRVWRQYLLGSQFVVKTDN+A CHFFDQPKL AKQARW
Subjt:  EIETDASDFALGGVLIQEGHPIAYESRMLNDAERRYTISEKEMLAVVHCLRVWRQYLLGSQFVVKTDNTAICHFFDQPKLPAKQARW

XP_023537907.1 uncharacterized protein LOC111798805 [Cucurbita pepo subsp. pepo]0.0e+0086.03Show/hide
Query:  SGWMDRPPQNNQAGTSRGPYHQRNHPTTPLQCMLCKGPHKVSYCPHRAFLTALQVSIQEINDARVESMLDKKEDQDNPRMGALKYLSALQWKVESKEIIE
        +GW DRPPQNNQAGTSRGPYHQ+NHPTTPLQCMLCKGPHKVSYCPHRA LTALQVSIQE N+ARVE++LDKKED DNPRMGALK+LSALQ KVE KEIIE
Subjt:  SGWMDRPPQNNQAGTSRGPYHQRNHPTTPLQCMLCKGPHKVSYCPHRAFLTALQVSIQEINDARVESMLDKKEDQDNPRMGALKYLSALQWKVESKEIIE

Query:  KGLMFVDAAINSQPIKSTLIDSGATHNFIADQEARRLGLTIGKDPGKMKVVNSEALPIVGVSKRVLFKIGDWTGELDLVVVRMDDFDVVLGIEFLLEHKV
        KGLMFVDA INS+  KSTLIDSGATHNFIADQEARRLGLTIGKDPGKMK VNSEALPIVGVSK V FKIGDWTGELDLVVVRMDDFDVVLG+EFLLEHKV
Subjt:  KGLMFVDAAINSQPIKSTLIDSGATHNFIADQEARRLGLTIGKDPGKMKVVNSEALPIVGVSKRVLFKIGDWTGELDLVVVRMDDFDVVLGIEFLLEHKV

Query:  IPMPLAKCLVITDRNPTVIPASIKQPGNIRMISAIQLKRGLAREEPTFMAIPLMEEATTEETVSNEIKEVLNSYADIMPESLPQTLPPRRGIDHEIELLP
        IPMPLAKCLVITDRNPTVIPASIKQPGN+RMISAIQLKRGLAREEPTFMAIPL+EEATTEETV  EIK+VL+SY DIMPESLPQTLPPRRGIDHEIELLP
Subjt:  IPMPLAKCLVITDRNPTVIPASIKQPGNIRMISAIQLKRGLAREEPTFMAIPLMEEATTEETVSNEIKEVLNSYADIMPESLPQTLPPRRGIDHEIELLP

Query:  GVKPPAKNAYRMAPPELAKLRKQLNELLTAGFIRPTKAPYGAPVLFQKKKDGTLRLCIDYKALNKVTVRNKYPLPIISDLFDQLHGAKYFTKLDLRSGYY
        GVKPPAKNAYRMAPPELA+LRKQL+ELL AGFIRP KAPYGAPVLFQKKKDGTLRLCIDY+ALNKVTVRNKYPLPIISDLFDQLHGAKYFTKLDLRSGYY
Subjt:  GVKPPAKNAYRMAPPELAKLRKQLNELLTAGFIRPTKAPYGAPVLFQKKKDGTLRLCIDYKALNKVTVRNKYPLPIISDLFDQLHGAKYFTKLDLRSGYY

Query:  QVRIAEGDEPKTTCVTRYGAFEFL---------------------------FVIVYLDDIVVYSKTLEEHKVHLKLVFDKLRQNQL--------------
        QVRIAEGDEPKTTCVTRYGAFEFL                           FVIVYLDDIVVYS TLEEHKVHLKLVFDKLRQNQL              
Subjt:  QVRIAEGDEPKTTCVTRYGAFEFL---------------------------FVIVYLDDIVVYSKTLEEHKVHLKLVFDKLRQNQL--------------

Query:  -------------MDSDKIKAIQEWKVPTSVSEVRSFLGLANYYRRFVEGFSRLAAPLTELLKKDHPWSWSNDCQMAFENLKTTMTRGLVLGLVDVTKPF
                     MDSDKIKAIQEWKVPTSVSE+RSFLGLANYYRRFVEGFSR AAPLTELLKKDHPWSWSNDCQMAFENLKTTMTRG VLGLVDVTKPF
Subjt:  -------------MDSDKIKAIQEWKVPTSVSEVRSFLGLANYYRRFVEGFSRLAAPLTELLKKDHPWSWSNDCQMAFENLKTTMTRGLVLGLVDVTKPF

Query:  EIETDASDFALGGVLIQEGHPIAYESRMLNDAERRYTISEKEMLAVVHCLRVWRQYLLGSQFVVKTDNTAICHFFDQPKLPAKQARW
        E+ETDASDFALGGVLIQEGHPIAYESR LNDAERRYT+SEKEMLAVVHCLRVWRQYLLGSQFVVKTDN+A CHFFDQPKL AKQARW
Subjt:  EIETDASDFALGGVLIQEGHPIAYESRMLNDAERRYTISEKEMLAVVHCLRVWRQYLLGSQFVVKTDNTAICHFFDQPKLPAKQARW

TrEMBL top hitse value%identityAlignment
A0A5D3C9P8 Reverse transcriptase3.9e-24763.27Show/hide
Query:  RPPQNNQAGTSRGPYHQRNHPTTPLQCMLCKGPHKVSYCPHRAFLTALQVSIQEIND---ARVESMLDKKEDQDNPRMGALKYLSALQWKV-ESKEIIEK
        R  Q+N   + RGP +Q N    PL C +CKGPH    CP++    A Q S+   +D    + E   D+ E+ DNPRMGALK+LS+LQ KV E+   +E+
Subjt:  RPPQNNQAGTSRGPYHQRNHPTTPLQCMLCKGPHKVSYCPHRAFLTALQVSIQEIND---ARVESMLDKKEDQDNPRMGALKYLSALQWKV-ESKEIIEK

Query:  GLMFVDAAINSQPIKSTLIDSGATHNFIADQEARRLGLTIGKDPGKMKVVNSEALPIVGVSKRVLFKIGDWTGELDLVVVRMDDFDVVLGIEFLLEHKVI
        GLM+VD  IN +P KST++DSGATHNFI + EA+RL L   KD G+MK VNS ALPI+G+ KR + ++G W+G +D VVV+MDDFDVVLG+EFLLEH+VI
Subjt:  GLMFVDAAINSQPIKSTLIDSGATHNFIADQEARRLGLTIGKDPGKMKVVNSEALPIVGVSKRVLFKIGDWTGELDLVVVRMDDFDVVLGIEFLLEHKVI

Query:  PMPLAKCLVITDRNPTVIPASIKQPGNIRMISAIQLKRGLAREEPTFMAIPLMEEATTEETVSNEIKEVLNSYADIMPESLPQTLPPRRGIDHEIELLPG
        PMPLAKCLVIT   P+V+   ++QP  ++MISA+QLK+GL+R+EPTFMAIPL     + ETV  EI  VL  Y D+MP+SLP++LPPRR IDHEIEL+PG
Subjt:  PMPLAKCLVITDRNPTVIPASIKQPGNIRMISAIQLKRGLAREEPTFMAIPLMEEATTEETVSNEIKEVLNSYADIMPESLPQTLPPRRGIDHEIELLPG

Query:  VKPPAKNAYRMAPPELAKLRKQLNELLTAGFIRPTKAPYGAPVLFQKKKDGTLRLCIDYKALNKVTVRNKYPLPIISDLFDQLHGAKYFTKLDLRSGYYQ
         KPPAKNAYRMAPPELA+LRKQL+ELL AGFIRP KAPYGAPVLFQKKKDG+LRLCIDY+ALNK+TVRNKYPLPII+DLFD+LHGAKYF+KLDLRSGYYQ
Subjt:  VKPPAKNAYRMAPPELAKLRKQLNELLTAGFIRPTKAPYGAPVLFQKKKDGTLRLCIDYKALNKVTVRNKYPLPIISDLFDQLHGAKYFTKLDLRSGYYQ

Query:  VRIAEGDEPKTTCVTRYGAFEFL---------------------------FVIVYLDDIVVYSKTLEEHKVHLKLVFDKLRQNQL---------------
        VRIAEGDEPKTTCVTRYGAFEFL                           FV+VYLDDIVVYS T+EEH+ HL+ VF KL++NQL               
Subjt:  VRIAEGDEPKTTCVTRYGAFEFL---------------------------FVIVYLDDIVVYSKTLEEHKVHLKLVFDKLRQNQL---------------

Query:  ------------MDSDKIKAIQEWKVPTSVSEVRSFLGLANYYRRFVEGFSRLAAPLTELLKKDHPWSWSNDCQMAFENLKTTMTRGLVLGLVDVTKPFE
                    M+  KI AI++W +P SVSE+RSFLGLANYYRRFVEGFS+ A+PLTELLKKD  W+W  +CQ AF+ LK  +  G +LG+ DVTKPFE
Subjt:  ------------MDSDKIKAIQEWKVPTSVSEVRSFLGLANYYRRFVEGFSRLAAPLTELLKKDHPWSWSNDCQMAFENLKTTMTRGLVLGLVDVTKPFE

Query:  IETDASDFALGGVLIQEGHPIAYESRMLNDAERRYTISEKEMLAVVHCLRVWRQYLLGSQFVVKTDNTAICHFFDQPKLPAKQARW
        +ETDASD+ALGGVL+Q GHPIAYESR LN AERRYT+SEKEMLAVVHCLR WRQYLLGS FVVKTDN+A CHFF QPKL +KQARW
Subjt:  IETDASDFALGGVLIQEGHPIAYESRMLNDAERRYTISEKEMLAVVHCLRVWRQYLLGSQFVVKTDNTAICHFFDQPKLPAKQARW

A0A6J1D906 Reverse transcriptase4.2e-27369.09Show/hide
Query:  KSGWMDRPPQNNQAGTSRGPYHQRNHPTTPLQCMLCKGPHKVSYCPHRAFLTALQVSIQEINDARVESMLDKKEDQDNPRMGALKYLSALQWKVES-KEI
        KSG  D+ PQ    G SRGPY Q  +    +   LCKGPH+V+ CPHRA LTALQ S+Q  N+  VE+  +K+ED++ PRM ALK+LSA+Q +V   K  
Subjt:  KSGWMDRPPQNNQAGTSRGPYHQRNHPTTPLQCMLCKGPHKVSYCPHRAFLTALQVSIQEINDARVESMLDKKEDQDNPRMGALKYLSALQWKVES-KEI

Query:  IEKGLMFVDAAINSQPIKSTLIDSGATHNFIADQEARRLGLTIGKDPGKMKVVNSEALPIVGVSKRVLFKIGDWTGELDLVVVRMDDFDVVLGIEFLLEH
         EKGLMFVDA IN    KST++DSGATHNFI++QEARRL LTI KD GKMKVVN EALPIVGVSKRV+ K+G WTG +D VVVRMDDFDVVLG+EFL+EH
Subjt:  IEKGLMFVDAAINSQPIKSTLIDSGATHNFIADQEARRLGLTIGKDPGKMKVVNSEALPIVGVSKRVLFKIGDWTGELDLVVVRMDDFDVVLGIEFLLEH

Query:  KVIPMPLAKCLVITDRNPTVIPASIKQPGNIRMISAIQLKRGLAREEPTFMAIPLMEEATTEETVSNEIKEVLNSYADIMPESLPQTLPPRRGIDHEIEL
        KVIPMPLAKC+++T  +PTV+  SIKQPG IRMISA+QLK+GL REEPTFMAIP++E+      V  EI+ V+  Y DIMP+SLP+TLPPRRGIDHEIEL
Subjt:  KVIPMPLAKCLVITDRNPTVIPASIKQPGNIRMISAIQLKRGLAREEPTFMAIPLMEEATTEETVSNEIKEVLNSYADIMPESLPQTLPPRRGIDHEIEL

Query:  LPGVKPPAKNAYRMAPPELAKLRKQLNELLTAGFIRPTKAPYGAPVLFQKKKDGTLRLCIDYKALNKVTVRNKYPLPIISDLFDQLHGAKYFTKLDLRSG
        +PG KPPAKNAYRMAPPELA+LRKQL+ELL AGFIRP KAP+GAPVLFQKKKDGTLRLCIDY+ALNKVTVRNKYPLPII+DLFDQLHGAKYFTKLDLRSG
Subjt:  LPGVKPPAKNAYRMAPPELAKLRKQLNELLTAGFIRPTKAPYGAPVLFQKKKDGTLRLCIDYKALNKVTVRNKYPLPIISDLFDQLHGAKYFTKLDLRSG

Query:  YYQVRIAEGDEPKTTCVTRYGAFEFL---------------------------FVIVYLDDIVVYSKTLEEHKVHLKLVFDKLRQNQL------------
        YYQVRIAEGDEPKTTCVTRYGAFEFL                           FV+VYLDDIVVYS TL+EH++HL+LVFDKLRQNQL            
Subjt:  YYQVRIAEGDEPKTTCVTRYGAFEFL---------------------------FVIVYLDDIVVYSKTLEEHKVHLKLVFDKLRQNQL------------

Query:  ---------------MDSDKIKAIQEWKVPTSVSEVRSFLGLANYYRRFVEGFSRLAAPLTELLKKDHPWSWSNDCQMAFENLKTTMTRGLVLGLVDVTK
                       MD+DK+KAIQEW+VPTSV E+RSFLGLANYYRRF+EGFSR A P+TELLKK   W WS + Q AFE+LK  M +G VLGL DVTK
Subjt:  ---------------MDSDKIKAIQEWKVPTSVSEVRSFLGLANYYRRFVEGFSRLAAPLTELLKKDHPWSWSNDCQMAFENLKTTMTRGLVLGLVDVTK

Query:  PFEIETDASDFALGGVLIQEGHPIAYESRMLNDAERRYTISEKEMLAVVHCLRVWRQYLLGSQFVVKTDNTAICHFFDQPKLPAKQARW
        PFE+ETDASD+ALGGVL+Q+ HPI YESR LN+AERRYT+SEKEMLAVVHCLR WRQYLLGS FVVKTDN+AICHFF+QPKL +KQARW
Subjt:  PFEIETDASDFALGGVLIQEGHPIAYESRMLNDAERRYTISEKEMLAVVHCLRVWRQYLLGSQFVVKTDNTAICHFFDQPKLPAKQARW

A0A6J1DK29 uncharacterized protein LOC1110218292.5e-24666.67Show/hide
Query:  KSGWMDRPPQNNQAGTSRGPYHQRNHPTTPLQCMLCKGPHKVSYCPHRAFLTALQVSIQEINDARVESMLDKKEDQDNPRMGALKYLSALQWKVES-KEI
        K G  D+ P     G SRGPY Q  +   P  C LC+GPH+V+ CPHRA LTALQ S+Q  N+  V +  +K+ED++ PRMGALK+LSA+Q +V   K  
Subjt:  KSGWMDRPPQNNQAGTSRGPYHQRNHPTTPLQCMLCKGPHKVSYCPHRAFLTALQVSIQEINDARVESMLDKKEDQDNPRMGALKYLSALQWKVES-KEI

Query:  IEKGLMFVDAAINSQPIKSTLIDSGATHNFIADQEARRLGLTIGKDPGKMKVVNSEALPIVGVSKRVLFKIGDWTGELDLVVVRMDDFDVVLGIEFLLEH
         EKGLMFVDA IN  P KS ++DSGATHNFI++QEA RL LTI KD GKMK VNSEALPIVGVSKRV  K+G WTG  D VVVRMDDFDVVLG+EFL+EH
Subjt:  IEKGLMFVDAAINSQPIKSTLIDSGATHNFIADQEARRLGLTIGKDPGKMKVVNSEALPIVGVSKRVLFKIGDWTGELDLVVVRMDDFDVVLGIEFLLEH

Query:  KVIPMPLAKCLVITDRNPTVIPASIKQPGNIRMISAIQLKRGLAREEPTFMAIPLMEEATTEETVSNEIKEVLNSYADIMPESLPQTLPPRRGIDHEIEL
        KVIPMPLAKC+++T  +PTV+ ASIKQPG IRMISA+QLK+GL REEPTFM      +      V  EI+ V+  Y DIMP+SLP+TLPPRRGIDHEIEL
Subjt:  KVIPMPLAKCLVITDRNPTVIPASIKQPGNIRMISAIQLKRGLAREEPTFMAIPLMEEATTEETVSNEIKEVLNSYADIMPESLPQTLPPRRGIDHEIEL

Query:  LPGVKPPAKNAYRMAPPELAKLRKQLNELLTAGFIRPTKAPYGAPVLFQKKKDGTLRLCIDYKALNKVTVRNKYPLPIISDLFDQLHGAKYFTKLDLRSG
        +PG KPPAKNAYRMAPP+LA+LRKQL+EL   GFIRP KAP+GA VLFQKKKDGTLRLCIDY+ALNKVTVRNKYPLPII+DLFDQLHGAKYFTKLDLRSG
Subjt:  LPGVKPPAKNAYRMAPPELAKLRKQLNELLTAGFIRPTKAPYGAPVLFQKKKDGTLRLCIDYKALNKVTVRNKYPLPIISDLFDQLHGAKYFTKLDLRSG

Query:  YYQVRIAEGDEPKTTCVTRYGAFEFL---------------------------FVIVYLDDIVVYSKTLEEHKVHLKLVFDKLRQNQL------------
        YYQVRIAEGDEPKTTCVTRYGAFEFL                           FV+VYLDDIVVYS TL+EH++HL+LVFDKLRQNQL            
Subjt:  YYQVRIAEGDEPKTTCVTRYGAFEFL---------------------------FVIVYLDDIVVYSKTLEEHKVHLKLVFDKLRQNQL------------

Query:  ---------------MDSDKIKAIQEWKVPTSVSEVRSFLGLANYYRRFVEGFSRLAAPLTELLKKDHPWSWSNDCQMAFENLKTTMTRGLVLGLVDVTK
                       MD+DK+K IQEW+VPTSV+E+RSFLGLANYYRRF+EGFSR A P+TELL K     WS + Q AFE+LK  M +G VLGL DVTK
Subjt:  ---------------MDSDKIKAIQEWKVPTSVSEVRSFLGLANYYRRFVEGFSRLAAPLTELLKKDHPWSWSNDCQMAFENLKTTMTRGLVLGLVDVTK

Query:  PFEIETDASDFALGGVLIQEGHPIAYESRMLNDAERRYTISEKEMLAVVHCLRVWRQ
        PFE+ETDASD+ALGGVL+Q+ HPIAYESR LN+AERRYT+SEKEMLAVVHCLR WRQ
Subjt:  PFEIETDASDFALGGVLIQEGHPIAYESRMLNDAERRYTISEKEMLAVVHCLRVWRQ

A0A6J1IDF7 uncharacterized protein LOC1114742153.2e-27375.3Show/hide
Query:  SGWMDRPPQNNQAGTSRGPYHQRNHPTTPLQCMLCKGPHKVSYCPHRAFLTALQVSIQEINDARVESMLDKKEDQDNPRMGALKYLSALQWKVESKEIIE
        SGW DRPPQNNQAGTSRGPYHQRNHPTTPLQCMLCKGPHKVSYCPHRA LTALQVSIQE NDA+VE+MLDKKEDQDNPRMGALK+LSALQ KVE KEI+E
Subjt:  SGWMDRPPQNNQAGTSRGPYHQRNHPTTPLQCMLCKGPHKVSYCPHRAFLTALQVSIQEINDARVESMLDKKEDQDNPRMGALKYLSALQWKVESKEIIE

Query:  KGLMFVDAAINSQPIKSTLIDSGATHNFIADQEARRLGLTIGKDPGKMKVVNSEALPIVGVSKRVLFKIGDWTGELDLVVVRMDDFDVVLGIEFLLEHKV
        KGL+                                                            V FKIGDWTGELDLVV RMDDFDVVLG+EFLLEHKV
Subjt:  KGLMFVDAAINSQPIKSTLIDSGATHNFIADQEARRLGLTIGKDPGKMKVVNSEALPIVGVSKRVLFKIGDWTGELDLVVVRMDDFDVVLGIEFLLEHKV

Query:  IPMPLAKCLVITDRNPTVIPASIKQPGNIRMISAIQLKRGLAREEPTFMAIPLMEEATTEETVSNEIKEVLNSYADIMPESLPQTLPPRRGIDHEIELLP
        IPMPLAKCLV+TDRNPTVIPASIKQPGN+RMIS IQLKRGLAREE TFMAIPLME ATTEETV NEI EVLNSYADIM ESLPQTLPPRR          
Subjt:  IPMPLAKCLVITDRNPTVIPASIKQPGNIRMISAIQLKRGLAREEPTFMAIPLMEEATTEETVSNEIKEVLNSYADIMPESLPQTLPPRRGIDHEIELLP

Query:  GVKPPAKNAYRMAPPELAKLRKQLNELLTAGFIRPTKAPYGAPVLFQKKKDGTLRLCIDYKALNKVTVRNKYPLPIISDLFDQLHGAKYFTKLDLRSGYY
             AKNAYRMA PELA+LRKQL+ELL AGFIRP KAPY APVLFQKKKDGTL LCIDY+ALNKVTVRNK+PLPIISDLFDQLHGAKYFTKLDL SGYY
Subjt:  GVKPPAKNAYRMAPPELAKLRKQLNELLTAGFIRPTKAPYGAPVLFQKKKDGTLRLCIDYKALNKVTVRNKYPLPIISDLFDQLHGAKYFTKLDLRSGYY

Query:  QVRIAEGDEPKTTCVTRYGAFEFL---------------------------FVIVYLDDIVVYSKTLEEHKVHLKLVFDKLRQNQLMDSDKIKAIQEWKV
        QVRIAEGDEP+TTCVTRYGAFEFL                           FVIVYLDDIVVYS TLEEHKVHLKL+         MDSDKIKAIQEWKV
Subjt:  QVRIAEGDEPKTTCVTRYGAFEFL---------------------------FVIVYLDDIVVYSKTLEEHKVHLKLVFDKLRQNQLMDSDKIKAIQEWKV

Query:  PTSVSEVRSFLGLANYYRRFVEGFSRLAAPLTELLKKDHPWSWSNDCQMAFENLKTTMTRGLVLGLVDVTKPFEIETDASDFALGGVLIQEGHPIAYESR
        PTSVS++RSFLGLANYYRRFVEGFSR AAPL ELLKKDHPW WSNDCQMAFE+LKTTM  G VLGLVDVTKPFEIETDASDFALGGVLIQEGHPIA+ESR
Subjt:  PTSVSEVRSFLGLANYYRRFVEGFSRLAAPLTELLKKDHPWSWSNDCQMAFENLKTTMTRGLVLGLVDVTKPFEIETDASDFALGGVLIQEGHPIAYESR

Query:  MLNDAERRYTISEKEMLAVVHCLRVWRQYLLGSQFVVKTDNTAICHFFDQPKLPAKQARW
         LNDAERRY +SEK+ML VVHCLRVWRQYLLGSQFVVKTDN+ ICHFFDQPKL AKQARW
Subjt:  MLNDAERRYTISEKEMLAVVHCLRVWRQYLLGSQFVVKTDNTAICHFFDQPKLPAKQARW

A0A6J1IEF9 uncharacterized protein LOC1114749452.1e-28076.24Show/hide
Query:  SGWMDRPPQNNQAGTSRGPYHQRNHPTTPLQCMLCKGPHKVSYCPHRAFLTALQVSIQEINDARVESMLDKKEDQDNPRMGALKYLSALQWKVESKEIIE
        SGW DRPPQNNQAGTSRGPYHQRNHPTTPLQCMLCKGPHKVSYCPHRA LTALQVSIQE NDA+VE+MLDKKEDQDNPRMGALK+LSALQ KVE KEI+E
Subjt:  SGWMDRPPQNNQAGTSRGPYHQRNHPTTPLQCMLCKGPHKVSYCPHRAFLTALQVSIQEINDARVESMLDKKEDQDNPRMGALKYLSALQWKVESKEIIE

Query:  KGLMFVDAAINSQPIKSTLIDSGATHNFIADQEARRLGLTIGKDPGKMKVVNSEALPIVGVSKRVLFKIGDWTGELDLVVVRMDDFDVVLGIEFLLEHKV
        KGL+                                                            V FKIGDWTGELDLVV RMDDFDVVLG+EFLLEHKV
Subjt:  KGLMFVDAAINSQPIKSTLIDSGATHNFIADQEARRLGLTIGKDPGKMKVVNSEALPIVGVSKRVLFKIGDWTGELDLVVVRMDDFDVVLGIEFLLEHKV

Query:  IPMPLAKCLVITDRNPTVIPASIKQPGNIRMISAIQLKRGLAREEPTFMAIPLMEEATTEETVSNEIKEVLNSYADIMPESLPQTLPPRRGIDHEIELLP
        IPMPLAKCLVITDRNPTVIPASIKQP                               TTEETV NEI EVLN YADIMPESLPQTLPPRRGIDHEIEL+P
Subjt:  IPMPLAKCLVITDRNPTVIPASIKQPGNIRMISAIQLKRGLAREEPTFMAIPLMEEATTEETVSNEIKEVLNSYADIMPESLPQTLPPRRGIDHEIELLP

Query:  GVKPPAKNAYRMAPPELAKLRKQLNELLTAGFIRPTKAPYGAPVLFQKKKDGTLRLCIDYKALNKVTVRNKYPLPIISDLFDQLHGAKYFTKLDLRSGYY
        GVKPPAKNAYRMAPPELA+LRKQL+ELL AGFIRP KAPYGAPVLFQKKKDGTLRLCIDY+ALNKVTVRNKYPLPIISDLFDQLHGAKYFTKLDLRSGYY
Subjt:  GVKPPAKNAYRMAPPELAKLRKQLNELLTAGFIRPTKAPYGAPVLFQKKKDGTLRLCIDYKALNKVTVRNKYPLPIISDLFDQLHGAKYFTKLDLRSGYY

Query:  QVRIAEGDEPKTTCVTRYGAFEFL---------------------------FVIVYLDDIVVYSKTLEEHKVHLKLVFDKLRQNQL-----MDSDKIKAI
        QVRIAEGDEPKTTCVTRYGAFEFL                           FVIVYLDDIVVYS TLEEHKVHLKLVFDKLRQNQL     MDSDKIKAI
Subjt:  QVRIAEGDEPKTTCVTRYGAFEFL---------------------------FVIVYLDDIVVYSKTLEEHKVHLKLVFDKLRQNQL-----MDSDKIKAI

Query:  QEWKVPTSVSEVRSFLGLANYYRRFVEGFSRLAAPLTELLKKDHPWSWSNDCQMAFENLKTTMTRGLVLGLVDVTKPFEIETDASDFALGGVLIQEGHPI
        QEWKVPTSVS++RSFLGLANYYRRFVEGFSR AAPLTELLKKDH WSWS+DCQMAFE+LKTTMTRG VLGLVDVTKPFEIETDASDFALGGVLIQEGHPI
Subjt:  QEWKVPTSVSEVRSFLGLANYYRRFVEGFSRLAAPLTELLKKDHPWSWSNDCQMAFENLKTTMTRGLVLGLVDVTKPFEIETDASDFALGGVLIQEGHPI

Query:  AYESRMLNDAERRYTISEKEMLAVVHCLRVWRQYLLGSQFVVKTDNTAICHFFDQPKLPAKQARW
        A+ESR LNDAERRYT+SEKEMLAVVHCLRVWRQYLLGSQFVVKTDN+AICHFFDQPKL AKQARW
Subjt:  AYESRMLNDAERRYTISEKEMLAVVHCLRVWRQYLLGSQFVVKTDNTAICHFFDQPKLPAKQARW

SwissProt top hitse value%identityAlignment
P0CT34 Transposon Tf2-1 polyprotein3.7e-6133.03Show/hide
Query:  EIKEVLNSYADIMPESLPQTLP-PRRGIDHEIELL-PGVKPPAKNAYRMAPPELAKLRKQLNELLTAGFIRPTKAPYGAPVLFQKKKDGTLRLCIDYKAL
        E+ ++   + DI  E+  + LP P +G++ E+EL     + P +N Y + P ++  +  ++N+ L +G IR +KA    PV+F  KK+GTLR+ +DYK L
Subjt:  EIKEVLNSYADIMPESLPQTLP-PRRGIDHEIELL-PGVKPPAKNAYRMAPPELAKLRKQLNELLTAGFIRPTKAPYGAPVLFQKKKDGTLRLCIDYKAL

Query:  NKVTVRNKYPLPIISDLFDQLHGAKYFTKLDLRSGYYQVRIAEGDEPKTTCVTRYGAFEFL---------------------------FVIVYLDDIVVY
        NK    N YPLP+I  L  ++ G+  FTKLDL+S Y+ +R+ +GDE K       G FE+L                            V+ Y+DDI+++
Subjt:  NKVTVRNKYPLPIISDLFDQLHGAKYFTKLDLRSGYYQVRIAEGDEPKTTCVTRYGAFEFL---------------------------FVIVYLDDIVVY

Query:  SKTLEEHKVHLKLVFDKLRQNQLMDS---------------------------DKIKAIQEWKVPTSVSEVRSFLGLANYYRRFVEGFSRLAAPLTELLK
        SK+  EH  H+K V  KL+   L+ +                           + I  + +WK P +  E+R FLG  NY R+F+   S+L  PL  LLK
Subjt:  SKTLEEHKVHLKLVFDKLRQNQLMDS---------------------------DKIKAIQEWKVPTSVSEVRSFLGLANYYRRFVEGFSRLAAPLTELLK

Query:  KDHPWSWSNDCQMAFENLKTTMTRGLVLGLVDVTKPFEIETDASDFALGGVLIQEG-----HPIAYESRMLNDAERRYTISEKEMLAVVHCLRVWRQYLL
        KD  W W+     A EN+K  +    VL   D +K   +ETDASD A+G VL Q+      +P+ Y S  ++ A+  Y++S+KEMLA++  L+ WR YL 
Subjt:  KDHPWSWSNDCQMAFENLKTTMTRGLVLGLVDVTKPFEIETDASDFALGGVLIQEG-----HPIAYESRMLNDAERRYTISEKEMLAVVHCLRVWRQYLL

Query:  GS--QFVVKTDNTAICHFFDQPKLPAKQ--ARW
         +   F + TD+  +         P  +  ARW
Subjt:  GS--QFVVKTDNTAICHFFDQPKLPAKQ--ARW

P0CT35 Transposon Tf2-2 polyprotein3.7e-6133.03Show/hide
Query:  EIKEVLNSYADIMPESLPQTLP-PRRGIDHEIELL-PGVKPPAKNAYRMAPPELAKLRKQLNELLTAGFIRPTKAPYGAPVLFQKKKDGTLRLCIDYKAL
        E+ ++   + DI  E+  + LP P +G++ E+EL     + P +N Y + P ++  +  ++N+ L +G IR +KA    PV+F  KK+GTLR+ +DYK L
Subjt:  EIKEVLNSYADIMPESLPQTLP-PRRGIDHEIELL-PGVKPPAKNAYRMAPPELAKLRKQLNELLTAGFIRPTKAPYGAPVLFQKKKDGTLRLCIDYKAL

Query:  NKVTVRNKYPLPIISDLFDQLHGAKYFTKLDLRSGYYQVRIAEGDEPKTTCVTRYGAFEFL---------------------------FVIVYLDDIVVY
        NK    N YPLP+I  L  ++ G+  FTKLDL+S Y+ +R+ +GDE K       G FE+L                            V+ Y+DDI+++
Subjt:  NKVTVRNKYPLPIISDLFDQLHGAKYFTKLDLRSGYYQVRIAEGDEPKTTCVTRYGAFEFL---------------------------FVIVYLDDIVVY

Query:  SKTLEEHKVHLKLVFDKLRQNQLMDS---------------------------DKIKAIQEWKVPTSVSEVRSFLGLANYYRRFVEGFSRLAAPLTELLK
        SK+  EH  H+K V  KL+   L+ +                           + I  + +WK P +  E+R FLG  NY R+F+   S+L  PL  LLK
Subjt:  SKTLEEHKVHLKLVFDKLRQNQLMDS---------------------------DKIKAIQEWKVPTSVSEVRSFLGLANYYRRFVEGFSRLAAPLTELLK

Query:  KDHPWSWSNDCQMAFENLKTTMTRGLVLGLVDVTKPFEIETDASDFALGGVLIQEG-----HPIAYESRMLNDAERRYTISEKEMLAVVHCLRVWRQYLL
        KD  W W+     A EN+K  +    VL   D +K   +ETDASD A+G VL Q+      +P+ Y S  ++ A+  Y++S+KEMLA++  L+ WR YL 
Subjt:  KDHPWSWSNDCQMAFENLKTTMTRGLVLGLVDVTKPFEIETDASDFALGGVLIQEG-----HPIAYESRMLNDAERRYTISEKEMLAVVHCLRVWRQYLL

Query:  GS--QFVVKTDNTAICHFFDQPKLPAKQ--ARW
         +   F + TD+  +         P  +  ARW
Subjt:  GS--QFVVKTDNTAICHFFDQPKLPAKQ--ARW

P0CT36 Transposon Tf2-3 polyprotein3.7e-6133.03Show/hide
Query:  EIKEVLNSYADIMPESLPQTLP-PRRGIDHEIELL-PGVKPPAKNAYRMAPPELAKLRKQLNELLTAGFIRPTKAPYGAPVLFQKKKDGTLRLCIDYKAL
        E+ ++   + DI  E+  + LP P +G++ E+EL     + P +N Y + P ++  +  ++N+ L +G IR +KA    PV+F  KK+GTLR+ +DYK L
Subjt:  EIKEVLNSYADIMPESLPQTLP-PRRGIDHEIELL-PGVKPPAKNAYRMAPPELAKLRKQLNELLTAGFIRPTKAPYGAPVLFQKKKDGTLRLCIDYKAL

Query:  NKVTVRNKYPLPIISDLFDQLHGAKYFTKLDLRSGYYQVRIAEGDEPKTTCVTRYGAFEFL---------------------------FVIVYLDDIVVY
        NK    N YPLP+I  L  ++ G+  FTKLDL+S Y+ +R+ +GDE K       G FE+L                            V+ Y+DDI+++
Subjt:  NKVTVRNKYPLPIISDLFDQLHGAKYFTKLDLRSGYYQVRIAEGDEPKTTCVTRYGAFEFL---------------------------FVIVYLDDIVVY

Query:  SKTLEEHKVHLKLVFDKLRQNQLMDS---------------------------DKIKAIQEWKVPTSVSEVRSFLGLANYYRRFVEGFSRLAAPLTELLK
        SK+  EH  H+K V  KL+   L+ +                           + I  + +WK P +  E+R FLG  NY R+F+   S+L  PL  LLK
Subjt:  SKTLEEHKVHLKLVFDKLRQNQLMDS---------------------------DKIKAIQEWKVPTSVSEVRSFLGLANYYRRFVEGFSRLAAPLTELLK

Query:  KDHPWSWSNDCQMAFENLKTTMTRGLVLGLVDVTKPFEIETDASDFALGGVLIQEG-----HPIAYESRMLNDAERRYTISEKEMLAVVHCLRVWRQYLL
        KD  W W+     A EN+K  +    VL   D +K   +ETDASD A+G VL Q+      +P+ Y S  ++ A+  Y++S+KEMLA++  L+ WR YL 
Subjt:  KDHPWSWSNDCQMAFENLKTTMTRGLVLGLVDVTKPFEIETDASDFALGGVLIQEG-----HPIAYESRMLNDAERRYTISEKEMLAVVHCLRVWRQYLL

Query:  GS--QFVVKTDNTAICHFFDQPKLPAKQ--ARW
         +   F + TD+  +         P  +  ARW
Subjt:  GS--QFVVKTDNTAICHFFDQPKLPAKQ--ARW

P0CT37 Transposon Tf2-4 polyprotein3.7e-6133.03Show/hide
Query:  EIKEVLNSYADIMPESLPQTLP-PRRGIDHEIELL-PGVKPPAKNAYRMAPPELAKLRKQLNELLTAGFIRPTKAPYGAPVLFQKKKDGTLRLCIDYKAL
        E+ ++   + DI  E+  + LP P +G++ E+EL     + P +N Y + P ++  +  ++N+ L +G IR +KA    PV+F  KK+GTLR+ +DYK L
Subjt:  EIKEVLNSYADIMPESLPQTLP-PRRGIDHEIELL-PGVKPPAKNAYRMAPPELAKLRKQLNELLTAGFIRPTKAPYGAPVLFQKKKDGTLRLCIDYKAL

Query:  NKVTVRNKYPLPIISDLFDQLHGAKYFTKLDLRSGYYQVRIAEGDEPKTTCVTRYGAFEFL---------------------------FVIVYLDDIVVY
        NK    N YPLP+I  L  ++ G+  FTKLDL+S Y+ +R+ +GDE K       G FE+L                            V+ Y+DDI+++
Subjt:  NKVTVRNKYPLPIISDLFDQLHGAKYFTKLDLRSGYYQVRIAEGDEPKTTCVTRYGAFEFL---------------------------FVIVYLDDIVVY

Query:  SKTLEEHKVHLKLVFDKLRQNQLMDS---------------------------DKIKAIQEWKVPTSVSEVRSFLGLANYYRRFVEGFSRLAAPLTELLK
        SK+  EH  H+K V  KL+   L+ +                           + I  + +WK P +  E+R FLG  NY R+F+   S+L  PL  LLK
Subjt:  SKTLEEHKVHLKLVFDKLRQNQLMDS---------------------------DKIKAIQEWKVPTSVSEVRSFLGLANYYRRFVEGFSRLAAPLTELLK

Query:  KDHPWSWSNDCQMAFENLKTTMTRGLVLGLVDVTKPFEIETDASDFALGGVLIQEG-----HPIAYESRMLNDAERRYTISEKEMLAVVHCLRVWRQYLL
        KD  W W+     A EN+K  +    VL   D +K   +ETDASD A+G VL Q+      +P+ Y S  ++ A+  Y++S+KEMLA++  L+ WR YL 
Subjt:  KDHPWSWSNDCQMAFENLKTTMTRGLVLGLVDVTKPFEIETDASDFALGGVLIQEG-----HPIAYESRMLNDAERRYTISEKEMLAVVHCLRVWRQYLL

Query:  GS--QFVVKTDNTAICHFFDQPKLPAKQ--ARW
         +   F + TD+  +         P  +  ARW
Subjt:  GS--QFVVKTDNTAICHFFDQPKLPAKQ--ARW

P0CT41 Transposon Tf2-12 polyprotein3.7e-6133.03Show/hide
Query:  EIKEVLNSYADIMPESLPQTLP-PRRGIDHEIELL-PGVKPPAKNAYRMAPPELAKLRKQLNELLTAGFIRPTKAPYGAPVLFQKKKDGTLRLCIDYKAL
        E+ ++   + DI  E+  + LP P +G++ E+EL     + P +N Y + P ++  +  ++N+ L +G IR +KA    PV+F  KK+GTLR+ +DYK L
Subjt:  EIKEVLNSYADIMPESLPQTLP-PRRGIDHEIELL-PGVKPPAKNAYRMAPPELAKLRKQLNELLTAGFIRPTKAPYGAPVLFQKKKDGTLRLCIDYKAL

Query:  NKVTVRNKYPLPIISDLFDQLHGAKYFTKLDLRSGYYQVRIAEGDEPKTTCVTRYGAFEFL---------------------------FVIVYLDDIVVY
        NK    N YPLP+I  L  ++ G+  FTKLDL+S Y+ +R+ +GDE K       G FE+L                            V+ Y+DDI+++
Subjt:  NKVTVRNKYPLPIISDLFDQLHGAKYFTKLDLRSGYYQVRIAEGDEPKTTCVTRYGAFEFL---------------------------FVIVYLDDIVVY

Query:  SKTLEEHKVHLKLVFDKLRQNQLMDS---------------------------DKIKAIQEWKVPTSVSEVRSFLGLANYYRRFVEGFSRLAAPLTELLK
        SK+  EH  H+K V  KL+   L+ +                           + I  + +WK P +  E+R FLG  NY R+F+   S+L  PL  LLK
Subjt:  SKTLEEHKVHLKLVFDKLRQNQLMDS---------------------------DKIKAIQEWKVPTSVSEVRSFLGLANYYRRFVEGFSRLAAPLTELLK

Query:  KDHPWSWSNDCQMAFENLKTTMTRGLVLGLVDVTKPFEIETDASDFALGGVLIQEG-----HPIAYESRMLNDAERRYTISEKEMLAVVHCLRVWRQYLL
        KD  W W+     A EN+K  +    VL   D +K   +ETDASD A+G VL Q+      +P+ Y S  ++ A+  Y++S+KEMLA++  L+ WR YL 
Subjt:  KDHPWSWSNDCQMAFENLKTTMTRGLVLGLVDVTKPFEIETDASDFALGGVLIQEG-----HPIAYESRMLNDAERRYTISEKEMLAVVHCLRVWRQYLL

Query:  GS--QFVVKTDNTAICHFFDQPKLPAKQ--ARW
         +   F + TD+  +         P  +  ARW
Subjt:  GS--QFVVKTDNTAICHFFDQPKLPAKQ--ARW

Arabidopsis top hitse value%identityAlignment
ATMG00860.1 DNA/RNA polymerases superfamily protein3.0e-1343.02Show/hide
Query:  DSDKIKAIQEWKVPTSVSEVRSFLGLANYYRRFVEGFSRLAAPLTELLKKDHPWSWSNDCQMAFENLKTTMTRGLVLGLVDVTKPF
        D  K++A+  W  P + +E+R FLGL  YYRRFV+ + ++  PLTELLKK +   W+    +AF+ LK  +T   VL L D+  PF
Subjt:  DSDKIKAIQEWKVPTSVSEVRSFLGLANYYRRFVEGFSRLAAPLTELLKKDHPWSWSNDCQMAFENLKTTMTRGLVLGLVDVTKPF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGGTAAAGAGCGGATGGATGGATAGACCTCCTCAAAACAATCAAGCGGGGACATCTCGAGGGCCTTACCATCAAAGGAACCACCCGACGACACCTTTACAATGCAT
GTTGTGTAAAGGTCCCCATAAAGTATCCTACTGCCCTCATCGAGCCTTTCTCACTGCGCTCCAAGTGTCTATTCAAGAGATCAATGACGCAAGGGTCGAATCTATGTTAG
ACAAGAAGGAAGATCAAGACAATCCCCGAATGGGCGCGCTTAAATACTTGTCAGCCCTCCAATGGAAGGTCGAATCGAAGGAGATAATAGAAAAAGGACTCATGTTCGTA
GATGCGGCAATAAATTCTCAACCGATCAAGAGCACTCTGATAGATTCAGGAGCAACCCACAACTTTATCGCCGATCAAGAAGCCCGAAGATTGGGACTCACCATAGGAAA
GGACCCGGGGAAAATGAAAGTCGTCAACTCTGAGGCCTTGCCTATTGTGGGAGTTTCCAAAAGAGTCCTCTTCAAGATCGGGGATTGGACAGGGGAGCTAGACCTTGTCG
TGGTTCGCATGGACGACTTCGACGTGGTACTGGGGATAGAGTTCCTCCTGGAACACAAAGTCATCCCAATGCCACTGGCGAAATGTTTGGTGATCACTGACCGCAATCCC
ACAGTAATACCTGCAAGCATCAAGCAACCAGGTAATATTCGAATGATCTCGGCCATACAACTAAAAAGGGGACTCGCACGAGAGGAACCCACATTTATGGCCATACCACT
GATGGAGGAAGCGACCACCGAGGAAACTGTCTCAAATGAAATCAAGGAGGTACTAAACAGTTATGCTGACATAATGCCAGAGAGCTTACCCCAAACATTACCACCTCGTC
GAGGCATTGATCACGAAATCGAACTCCTCCCCGGGGTTAAACCGCCAGCAAAGAACGCATACCGGATGGCTCCGCCCGAGCTAGCCAAATTGAGGAAACAACTGAATGAG
TTGTTGACGGCGGGATTCATCCGCCCGACAAAGGCACCTTACGGGGCCCCCGTACTATTCCAGAAGAAGAAGGATGGGACGTTGCGTCTGTGCATAGACTATAAAGCCTT
AAACAAGGTGACGGTACGCAACAAATATCCACTACCGATAATATCTGACTTGTTCGACCAGCTTCACGGGGCCAAATACTTCACGAAGTTGGACTTGCGATCTGGGTATT
ACCAAGTACGTATTGCCGAGGGGGACGAGCCCAAGACGACATGTGTAACAAGATATGGGGCCTTCGAATTCCTATTCGTCATAGTATACCTCGACGATATAGTTGTCTAT
AGCAAAACCCTGGAGGAACACAAAGTGCACTTGAAGCTAGTGTTTGACAAGCTGCGGCAGAACCAACTTATGGATAGCGACAAGATAAAAGCTATCCAAGAATGGAAAGT
TCCTACTTCCGTATCCGAGGTGCGGTCCTTCTTAGGATTAGCCAACTACTATAGGCGGTTCGTCGAAGGGTTTTCACGACTAGCCGCCCCATTGACAGAGCTGTTGAAGA
AAGACCACCCTTGGTCGTGGTCAAATGATTGTCAAATGGCCTTTGAAAATCTGAAAACAACCATGACGAGGGGTCTTGTCCTCGGGTTGGTAGATGTTACAAAGCCATTT
GAAATAGAAACAGATGCTTCCGACTTTGCTCTAGGTGGGGTCCTTATTCAGGAAGGCCACCCCATCGCTTACGAAAGTCGAATGCTCAACGATGCCGAACGGAGATACAC
AATCTCCGAGAAAGAAATGTTGGCTGTAGTCCATTGCCTTCGAGTCTGGAGACAGTATCTCTTGGGATCACAGTTCGTAGTGAAGACGGATAACACCGCCATTTGCCACT
TCTTTGATCAACCAAAATTACCGGCAAAACAAGCCCGGTGGTAG
mRNA sequenceShow/hide mRNA sequence
ATGGAGGTAAAGAGCGGATGGATGGATAGACCTCCTCAAAACAATCAAGCGGGGACATCTCGAGGGCCTTACCATCAAAGGAACCACCCGACGACACCTTTACAATGCAT
GTTGTGTAAAGGTCCCCATAAAGTATCCTACTGCCCTCATCGAGCCTTTCTCACTGCGCTCCAAGTGTCTATTCAAGAGATCAATGACGCAAGGGTCGAATCTATGTTAG
ACAAGAAGGAAGATCAAGACAATCCCCGAATGGGCGCGCTTAAATACTTGTCAGCCCTCCAATGGAAGGTCGAATCGAAGGAGATAATAGAAAAAGGACTCATGTTCGTA
GATGCGGCAATAAATTCTCAACCGATCAAGAGCACTCTGATAGATTCAGGAGCAACCCACAACTTTATCGCCGATCAAGAAGCCCGAAGATTGGGACTCACCATAGGAAA
GGACCCGGGGAAAATGAAAGTCGTCAACTCTGAGGCCTTGCCTATTGTGGGAGTTTCCAAAAGAGTCCTCTTCAAGATCGGGGATTGGACAGGGGAGCTAGACCTTGTCG
TGGTTCGCATGGACGACTTCGACGTGGTACTGGGGATAGAGTTCCTCCTGGAACACAAAGTCATCCCAATGCCACTGGCGAAATGTTTGGTGATCACTGACCGCAATCCC
ACAGTAATACCTGCAAGCATCAAGCAACCAGGTAATATTCGAATGATCTCGGCCATACAACTAAAAAGGGGACTCGCACGAGAGGAACCCACATTTATGGCCATACCACT
GATGGAGGAAGCGACCACCGAGGAAACTGTCTCAAATGAAATCAAGGAGGTACTAAACAGTTATGCTGACATAATGCCAGAGAGCTTACCCCAAACATTACCACCTCGTC
GAGGCATTGATCACGAAATCGAACTCCTCCCCGGGGTTAAACCGCCAGCAAAGAACGCATACCGGATGGCTCCGCCCGAGCTAGCCAAATTGAGGAAACAACTGAATGAG
TTGTTGACGGCGGGATTCATCCGCCCGACAAAGGCACCTTACGGGGCCCCCGTACTATTCCAGAAGAAGAAGGATGGGACGTTGCGTCTGTGCATAGACTATAAAGCCTT
AAACAAGGTGACGGTACGCAACAAATATCCACTACCGATAATATCTGACTTGTTCGACCAGCTTCACGGGGCCAAATACTTCACGAAGTTGGACTTGCGATCTGGGTATT
ACCAAGTACGTATTGCCGAGGGGGACGAGCCCAAGACGACATGTGTAACAAGATATGGGGCCTTCGAATTCCTATTCGTCATAGTATACCTCGACGATATAGTTGTCTAT
AGCAAAACCCTGGAGGAACACAAAGTGCACTTGAAGCTAGTGTTTGACAAGCTGCGGCAGAACCAACTTATGGATAGCGACAAGATAAAAGCTATCCAAGAATGGAAAGT
TCCTACTTCCGTATCCGAGGTGCGGTCCTTCTTAGGATTAGCCAACTACTATAGGCGGTTCGTCGAAGGGTTTTCACGACTAGCCGCCCCATTGACAGAGCTGTTGAAGA
AAGACCACCCTTGGTCGTGGTCAAATGATTGTCAAATGGCCTTTGAAAATCTGAAAACAACCATGACGAGGGGTCTTGTCCTCGGGTTGGTAGATGTTACAAAGCCATTT
GAAATAGAAACAGATGCTTCCGACTTTGCTCTAGGTGGGGTCCTTATTCAGGAAGGCCACCCCATCGCTTACGAAAGTCGAATGCTCAACGATGCCGAACGGAGATACAC
AATCTCCGAGAAAGAAATGTTGGCTGTAGTCCATTGCCTTCGAGTCTGGAGACAGTATCTCTTGGGATCACAGTTCGTAGTGAAGACGGATAACACCGCCATTTGCCACT
TCTTTGATCAACCAAAATTACCGGCAAAACAAGCCCGGTGGTAG
Protein sequenceShow/hide protein sequence
MEVKSGWMDRPPQNNQAGTSRGPYHQRNHPTTPLQCMLCKGPHKVSYCPHRAFLTALQVSIQEINDARVESMLDKKEDQDNPRMGALKYLSALQWKVESKEIIEKGLMFV
DAAINSQPIKSTLIDSGATHNFIADQEARRLGLTIGKDPGKMKVVNSEALPIVGVSKRVLFKIGDWTGELDLVVVRMDDFDVVLGIEFLLEHKVIPMPLAKCLVITDRNP
TVIPASIKQPGNIRMISAIQLKRGLAREEPTFMAIPLMEEATTEETVSNEIKEVLNSYADIMPESLPQTLPPRRGIDHEIELLPGVKPPAKNAYRMAPPELAKLRKQLNE
LLTAGFIRPTKAPYGAPVLFQKKKDGTLRLCIDYKALNKVTVRNKYPLPIISDLFDQLHGAKYFTKLDLRSGYYQVRIAEGDEPKTTCVTRYGAFEFLFVIVYLDDIVVY
SKTLEEHKVHLKLVFDKLRQNQLMDSDKIKAIQEWKVPTSVSEVRSFLGLANYYRRFVEGFSRLAAPLTELLKKDHPWSWSNDCQMAFENLKTTMTRGLVLGLVDVTKPF
EIETDASDFALGGVLIQEGHPIAYESRMLNDAERRYTISEKEMLAVVHCLRVWRQYLLGSQFVVKTDNTAICHFFDQPKLPAKQARW