CuGenDBv2

Sed0019163 (gene) of Chayote v1 genome

Gene ID: Sed0019163
Organism: Sechium edule (Chayote v1)
Description: Integrase catalytic domain-containing protein
Genome location: LG04:28821474..28823432
RNA-Seq Expression: Sed0019163
Synteny: Sed0019163
Gene Ontology terms:
    GO:0015074 - DNA integration (biological process)
    GO:0003676 - nucleic acid binding (molecular function)
InterPro domains:
    IPR001584 - Integrase, catalytic core
    IPR012337 - Ribonuclease H-like superfamily
    IPR025724 - GAG-pre-integrase domain
    IPR036397 - Ribonuclease H superfamily
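
The "Genome location" field packs a sequence name and a coordinate range into one string. The sketch below shows one way to split it and derive the span of the gene. The helper name `parse_location` and the `Location` type are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which this record does not state explicitly.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A parsed 'seqid:start..end' location (hypothetical helper type)."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a string such as 'LG04:28821474..28823432'."""
    match = re.fullmatch(r"\s*([^:\s]+):(\d+)\.\.(\d+)\s*", text)
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    seqid, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError(f"start is greater than end in {text!r}")
    return Location(seqid, start, end)


loc = parse_location("LG04:28821474..28823432")
print(loc.seqid, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the span for this gene comes out at 1959 bp.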


Homology
GenBank top hits (e-value, % identity, alignment)
KZV17946.1 hypothetical protein F511_10775 [Dorcoceras hygrometricum]; e-value: 1.2e-141; identity: 43.48%
Query:  MGLNDEFSTTRSQILLMDPLPPVNKAFSLIVQEEEHKG-DTNIKSNITLAATQSKTTYKGKDSKPVCKHCGLIGHTIDVCYRIHGYPDNRPVCKHCGLQG
        MGLN+ ++  R+QILLMDPLP ++K FSL+VQEE  +  +  ++  I                +P+    G     +   Y   G   ++  C HC L  
Subjt:  MGLNDEFSTTRSQILLMDPLPPVNKAFSLIVQEEEHKG-DTNIKSNITLAATQSKTTYKGKDSKPVCKHCGLIGHTIDVCYRIHGYPDNRPVCKHCGLQG

Query:  HTIDVCYKIHGYPPSNKQRKNNYKQTNDNQGSVQPENKSCKSATVAASNIESDPFQQCHDILTLLQSKLAGIKNDNGANLTQHMAGMTQ------TYDYF
        HT+D CYK+HGYPP + +    YK    ++ S   ++ S      +  N    P + C  ++  L S+L   +  NG  +T      +       TY   
Subjt:  HTIDVCYKIHGYPPSNKQRKNNYKQTNDNQGSVQPENKSCKSATVAASNIESDPFQQCHDILTLLQSKLAGIKNDNGANLTQHMAGMTQ------TYDYF

Query:  KDR-------WILDSGAAAHICHNKDIFMNLKRIDTSVILPNKDRIIITHAGSILLCGSIILDRVLYVPSFKYNLLSISALTLNDAVLVNFTTNACIIQD
                  WI+D+GA  HIC +   F++ +  +++V LPN   I +TH GS++L   I L  VL+VP FK+NLLSIS+LT     LV+F++ +C IQ 
Subjt:  KDR-------WILDSGAAAHICHNKDIFMNLKRIDTSVILPNKDRIIITHAGSILLCGSIILDRVLYVPSFKYNLLSISALTLNDAVLVNFTTNACIIQD

Query:  KRTLKMIGKGNLEQGLYVLEEVPLSAALNIVCSVRSASPSLWHRRLGHPADLPLVALKNVL--SFDANCKGAENCTICPLAKQNRLRFISNNNKSDAIFD
            K IG G     LY+L     S+    VC+   +   LWH RLGH     L  L + L  SF  N      C IC L+KQ RL FISNN+  D  FD
Subjt:  KRTLKMIGKGNLEQGLYVLEEVPLSAALNIVCSVRSASPSLWHRRLGHPADLPLVALKNVL--SFDANCKGAENCTICPLAKQNRLRFISNNNKSDAIFD

Query:  LIHVDIWGPFAHTSHLGYRYFLTIVDDFSRYTWIFLLKNKSDVLTVIPHFFKLVHTQYSKVIKCFRSDNAPELKFTEFFKKNGVEHQYSCVARPEQNSVV
        L+H+DIWGPF   +  G++YFLTIVDD SRYTW+ LLK+KS+V+ + P F +++H Q+ K IK  RSDNAPELKF+EFFK  G+   +SCV RP+QNSVV
Subjt:  LIHVDIWGPFAHTSHLGYRYFLTIVDDFSRYTWIFLLKNKSDVLTVIPHFFKLVHTQYSKVIKCFRSDNAPELKFTEFFKKNGVEHQYSCVARPEQNSVV

Query:  ERKHQHILNVGRAIYFQSKIPLDYWGECILTAVHLINRTPSKNLDWKTPYELLKKEMPNYQTLKVFGCLAYASTIREHRNKFSSRAIPSVFVGYPQGMKG
        ERKHQHILNV RA+ FQS IPL YW ECILTAV+LINRTP+  L  KTP+EL+  + P Y  L+VFGCL Y ST+   R KFS RA  S+F+GYP G KG
Subjt:  ERKHQHILNVGRAIYFQSKIPLDYWGECILTAVHLINRTPSKNLDWKTPYELLKKEMPNYQTLKVFGCLAYASTIREHRNKFSSRAIPSVFVGYPQGMKG

Query:  FKLLDLENNNIFVSRDVVFHEEVFPFQSKNTIENMPDFIMNQVL
        +KLL+L+ N +++SRDV+FHE VFPF++K+T  + P+  ++ ++
Subjt:  FKLLDLENNNIFVSRDVVFHEEVFPFQSKNTIENMPDFIMNQVL

KZV25004.1 Cysteine-rich RLK (receptor-like protein kinase) 8 [Dorcoceras hygrometricum]; e-value: 3.7e-146; identity: 43.68%
Query:  MGLNDEFSTTRSQILLMDPLPPVNKAFSLIVQEEEHKG-----------DTNIKSNITLAATQSKTTYKGKDSKPVCKHCGLIGHTIDVCYRIHGYPDNR
        MGLND ++  R+Q+L+++PLP + K F+L++QEE  +             + I SN+  +A  + +    ++SK                    G   +R
Subjt:  MGLNDEFSTTRSQILLMDPLPPVNKAFSLIVQEEEHKG-----------DTNIKSNITLAATQSKTTYKGKDSKPVCKHCGLIGHTIDVCYRIHGYPDNR

Query:  PVCKHCGLQGHTIDVCYKIHGYPPSNKQRKNNYKQTNDNQGSVQPENKSCKS----ATVAASNIESDPFQQCHDILTLLQSKLAGIKN------------
         +C HC  + HT+D CYK+HGYPP + + K+       +QGS      S  S     T    + +S    QC  ++  L SKL   +N            
Subjt:  PVCKHCGLQGHTIDVCYKIHGYPPSNKQRKNNYKQTNDNQGSVQPENKSCKS----ATVAASNIESDPFQQCHDILTLLQSKLAGIKN------------

Query:  --DNGANLTQHMAGMTQTYDYFKDRWILDSGAAAHICHNKDIFMNLKRIDTSVILPNKDRIIITHAGSILLCGSIILDRVLYVPSFKYNLLSISALTLND
              + T H+  +T+     KD WI+D+GA  HIC +  +F + + I + V+LPN   I +T AG++ +  +++L  VLYVP F++NLLS+S+LT N 
Subjt:  --DNGANLTQHMAGMTQTYDYFKDRWILDSGAAAHICHNKDIFMNLKRIDTSVILPNKDRIIITHAGSILLCGSIILDRVLYVPSFKYNLLSISALTLND

Query:  AVLVNFTTNACIIQDKRTLKMIGKGNLEQGLYVLEEVPLSAALNIVCSVRSASPSLWHRRLGHPADLPLVALKNVLSFDANCKGAENCTICPLAKQNRLR
           V+F +++C IQD   ++MIG G     LYVL++ P     + +C+   ++  LWHRR+GHP+   L +LKNVL+ + N      C  C L+KQ RL 
Subjt:  AVLVNFTTNACIIQDKRTLKMIGKGNLEQGLYVLEEVPLSAALNIVCSVRSASPSLWHRRLGHPADLPLVALKNVLSFDANCKGAENCTICPLAKQNRLR

Query:  FISNNNKSDAIFDLIHVDIWGPFAHTSHLGYRYFLTIVDDFSRYTWIFLLKNKSDVLTVIPHFFKLVHTQYSKVIKCFRSDNAPELKFTEFFKKNGVEHQ
          S NN S  IF+L+H+D WGPF+ TS  G+R+F TIVDD SRYTW+++LK+KSDVL++ P F ++V TQ+   +K  RSDNAPEL F +FF K G+ H 
Subjt:  FISNNNKSDAIFDLIHVDIWGPFAHTSHLGYRYFLTIVDDFSRYTWIFLLKNKSDVLTVIPHFFKLVHTQYSKVIKCFRSDNAPELKFTEFFKKNGVEHQ

Query:  YSCVARPEQNSVVERKHQHILNVGRAIYFQSKIPLDYWGECILTAVHLINRTPSKNLDWKTPYELLKKEMPNYQTLKVFGCLAYASTIREHRNKFSSRAI
        +SCV RP+QNSVVERKHQHILNV RA+ FQS IPLDYW +CI T+V+LINRTPS  L  KTP+ELL  ++P+Y  LKVFGCL YAST+   R+KFS RAI
Subjt:  YSCVARPEQNSVVERKHQHILNVGRAIYFQSKIPLDYWGECILTAVHLINRTPSKNLDWKTPYELLKKEMPNYQTLKVFGCLAYASTIREHRNKFSSRAI

Query:  PSVFVGYPQGMKGFKLLDLENNNIFVSRDVVFHEEVFPFQS
          VF+GYP G KG+KLL+LE N IF+SRDV+FHE  FP+Q+
Subjt:  PSVFVGYPQGMKGFKLLDLENNNIFVSRDVVFHEEVFPFQS

KZV39348.1 hypothetical protein F511_17540 [Dorcoceras hygrometricum]; e-value: 1.5e-142; identity: 43.92%
Query:  MGLNDEFSTTRSQILLMDPLPPVNKAFSLIVQEEEHKGDTNIKSNITLAATQSKTTYKGKDSK----PVCKHCGLIGHTIDVCYRIHGYPDNRPVCKHCG
        MGLN+ ++  R+QILLMDPLP ++K FSL+VQEE                 + ++ ++G   K    P+  + G     +   Y   G   ++  C HC 
Subjt:  MGLNDEFSTTRSQILLMDPLPPVNKAFSLIVQEEEHKGDTNIKSNITLAATQSKTTYKGKDSK----PVCKHCGLIGHTIDVCYRIHGYPDNRPVCKHCG

Query:  LQGHTIDVCYKIHGYPPSNKQRKNNYKQTNDNQGSVQPENKSCKSATVAASNIESDPFQQCHDILTLLQSKLAGIKNDNGANLTQHMAGMTQTYDYFKD-
        L  HT+D CYK+HGYPP + + K   KQ++     +Q + ++  +A+V    ++    + C  ++  L S+L  + N     L Q       +   F D 
Subjt:  LQGHTIDVCYKIHGYPPSNKQRKNNYKQTNDNQGSVQPENKSCKSATVAASNIESDPFQQCHDILTLLQSKLAGIKNDNGANLTQHMAGMTQTYDYFKD-

Query:  -------------RWILDSGAAAHICHNKDIFMNLKRIDTSVILPNKDRIIITHAGSILLCGSIILDRVLYVPSFKYNLLSISALTLNDAVLVNFTTNAC
                      WI+D+GA  HIC +   F++ K  +++V LPN   I +TH GS++L   IIL  VL+VP FK+NLLSIS+LT      V+F++  C
Subjt:  -------------RWILDSGAAAHICHNKDIFMNLKRIDTSVILPNKDRIIITHAGSILLCGSIILDRVLYVPSFKYNLLSISALTLNDAVLVNFTTNAC

Query:  IIQDKRTLKMIGKGNLEQGLYVLEEVPLSAALNIVCSVRSASPSLWHRRLGHPADLPLVALKNVL--SFDANCKGAENCTICPLAKQNRLRFISNNNKSD
         IQ     + IG G     LY+L   P S     VC+   +   LWH RLGH +   L  L +V+  SF +N +    C IC L+KQ RL FISNN   D
Subjt:  IIQDKRTLKMIGKGNLEQGLYVLEEVPLSAALNIVCSVRSASPSLWHRRLGHPADLPLVALKNVL--SFDANCKGAENCTICPLAKQNRLRFISNNNKSD

Query:  AIFDLIHVDIWGPFAHTSHLGYRYFLTIVDDFSRYTWIFLLKNKSDVLTVIPHFFKLVHTQYSKVIKCFRSDNAPELKFTEFFKKNGVEHQYSCVARPEQ
        + FDL+H+DIWGPF   +  G++YFLTIVDD SRYTW+ LLK+KSDV  + P F +++ TQ+ K IK  RSDNAPEL+F+EFFK  G+   +SCV RP+Q
Subjt:  AIFDLIHVDIWGPFAHTSHLGYRYFLTIVDDFSRYTWIFLLKNKSDVLTVIPHFFKLVHTQYSKVIKCFRSDNAPELKFTEFFKKNGVEHQYSCVARPEQ

Query:  NSVVERKHQHILNVGRAIYFQSKIPLDYWGECILTAVHLINRTPSKNLDWKTPYELLKKEMPNYQTLKVFGCLAYASTIREHRNKFSSRAIPSVFVGYPQ
        NS+VERKHQHILNV RA+ FQS IPL YW +CILT+V+LINR P+  L  KTP+E++  ++PN+  L+VFGCL Y ST+  HR KFS RAI S+F+GYP 
Subjt:  NSVVERKHQHILNVGRAIYFQSKIPLDYWGECILTAVHLINRTPSKNLDWKTPYELLKKEMPNYQTLKVFGCLAYASTIREHRNKFSSRAIPSVFVGYPQ

Query:  GMKGFKLLDLENNNIFVSRDVVFHEEVFPFQSK
        G KG+KLL+L+ N I++SRDV FHE VFPF++K
Subjt:  GMKGFKLLDLENNNIFVSRDVVFHEEVFPFQSK

RVW82526.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]; e-value: 1.8e-140; identity: 42.84%
Query:  MGLNDEFSTTRSQILLMDPLPPVNKAFSLIVQEEEHKGDTNIKSNITLAATQSKTTYKGKDSKPVCKHCGLIGHTIDVCYRIHGYPDNRPVCKHCGLQGH
        +GLN+ F+  ++QILLM+P PP+NK FSL+VQEE  +  T   S        S+     + S P                       +RP+C HC + GH
Subjt:  MGLNDEFSTTRSQILLMDPLPPVNKAFSLIVQEEEHKGDTNIKSNITLAATQSKTTYKGKDSKPVCKHCGLIGHTIDVCYRIHGYPDNRPVCKHCGLQGH

Query:  TIDVCYKIHGYPPSNKQRKNNYKQTNDNQGSVQPENKSCKSATVAASNIES--------DPFQQCHDILTLLQSKLAGIKNDNGANLTQHMAGMTQTYDY
        T+D CYKIHGY P  + R  N++        + P +      T+   +I S        D   Q   +L+L  S  +     +   L Q ++  T     
Subjt:  TIDVCYKIHGYPPSNKQRKNNYKQTNDNQGSVQPENKSCKSATVAASNIES--------DPFQQCHDILTLLQSKLAGIKNDNGANLTQHMAGMTQTYDY

Query:  FKDR-------WILDSGAAAHICHNKDIFMNLKRIDT-SVILPNKDRIIITHAGSILLCGSIILDRVLYVPSFKYNLLSISALTLNDAVLVNFTTNACII
                   WILDSGA  H+C N  +F ++    + +V LP   +I IT  G+I L   ++L+ VLY+P+F++NL+SISALT  +    +FT + C I
Subjt:  FKDR-------WILDSGAAAHICHNKDIFMNLKRIDT-SVILPNKDRIIITHAGSILLCGSIILDRVLYVPSFKYNLLSISALTLNDAVLVNFTTNACII

Query:  QDKRTLKMIGKGNLEQGLYVLEEVPLSAALNIVCSVRSASP---SLWHRRLGHPADLPLVALKNVLSFDANCKGAENCTICPLAKQNRLRFISNNNKSDA
        QD    K+IG G  +  LY+L+     +  ++     + S     LWH RL HP+++ L  LK  L   +N     +C+ICPLAKQ RL F  +NN S +
Subjt:  QDKRTLKMIGKGNLEQGLYVLEEVPLSAALNIVCSVRSASP---SLWHRRLGHPADLPLVALKNVLSFDANCKGAENCTICPLAKQNRLRFISNNNKSDA

Query:  IFDLIHVDIWGPFAHTSHLGYRYFLTIVDDFSRYTWIFLLKNKSDVLTVIPHFFKLVHTQYSKVIKCFRSDNAPELKFTEFFKKNGVEHQYSCVARPEQN
         FDLIH DIWGPF   +H G+RYFLTIVDD +R TW+ LL+ KSDV T+ P FF +V T++   IK  RSDNAPEL  +  F +  V H +SCV  P+QN
Subjt:  IFDLIHVDIWGPFAHTSHLGYRYFLTIVDDFSRYTWIFLLKNKSDVLTVIPHFFKLVHTQYSKVIKCFRSDNAPELKFTEFFKKNGVEHQYSCVARPEQN

Query:  SVVERKHQHILNVGRAIYFQSKIPLDYWGECILTAVHLINRTPSKNLDWKTPYELLKKEMPNYQTLKVFGCLAYASTIREHRNKFSSRAIPSVFVGYPQG
        SVVERKHQHILNV RA+YFQS IP+ YWG+C+LT+V+LINR PS  L+ KTP+ELL  + P+Y  LK FGCL Y+ST+   R+KFS RA+P VF+GYP G
Subjt:  SVVERKHQHILNVGRAIYFQSKIPLDYWGECILTAVHLINRTPSKNLDWKTPYELLKKEMPNYQTLKVFGCLAYASTIREHRNKFSSRAIPSVFVGYPQG

Query:  MKGFKLLDLENNNIFVSRDVVFHEEVFPFQ-SKNTIENMPDFIMNQVLP
         KG+K+LDLE N I VSR+V F E VFPF+ S+N      DF   +VLP
Subjt:  MKGFKLLDLENNNIFVSRDVVFHEEVFPFQ-SKNTIENMPDFIMNQVLP

XP_012857659.1 PREDICTED: uncharacterized protein LOC105976934 [Erythranthe guttata]; e-value: 7.2e-142; identity: 41.22%
Query:  MGLNDEFSTTRSQILLMDPLPPVNKAFSLIVQEEEHK-----GDTNIKSNITLAATQSKT--------------TYKGKDSKPVCKHCGLIGHTIDVCYR
        MGLND  ++TR QILLMDPLPP+NK F+L+ QEE H+       ++++ ++  AA   +T              T   +  K  C HC   GHT++ CYR
Subjt:  MGLNDEFSTTRSQILLMDPLPPVNKAFSLIVQEEEHK-----GDTNIKSNITLAATQSKT--------------TYKGKDSKPVCKHCGLIGHTIDVCYR

Query:  IHGYPDNRPVCKHCGLQGHTIDVCYKIHGYPPSNKQRKNNYKQTNDNQGSVQPENKSCKSATVAASNIESDPF------QQCHDILTLLQSKLAGIKN--
        +HG+P                   Y+    P     + +  K   +    +   + S  S +++ S   SD F       QC  +L+ + S LA   N  
Subjt:  IHGYPDNRPVCKHCGLQGHTIDVCYKIHGYPPSNKQRKNNYKQTNDNQGSVQPENKSCKSATVAASNIESDPF------QQCHDILTLLQSKLAGIKN--

Query:  --DNGANL--TQHMAGMT--------QTYDYFKDRWILDSGAAAHICHNKDIFMNLKRIDTS-VILPNKDRIIITHAGSILLCGSIILDRVLYVPSFKYN
          D  + +  T H++ +T         T  +    WILDSGA+ HICHNK +F+N+K +  + V+LP+   +++   G + L   ++L  V YVP FK+N
Subjt:  --DNGANL--TQHMAGMT--------QTYDYFKDRWILDSGAAAHICHNKDIFMNLKRIDTS-VILPNKDRIIITHAGSILLCGSIILDRVLYVPSFKYN

Query:  LLSISALTLNDAVLVNFTTNACIIQDKRTLKMIGKGNLEQGLYVLEEVPLSAALNIVCSVRSASPSLWHRRLGH--PADLPLVALKNVLSFDANCKGAEN
        L+S+SAL    + +V F   +  IQD R +  IGKGN  QGLYVL+ V  S   +  C+  SA  ++WH RLGH     L  +A K  LS D     +  
Subjt:  LLSISALTLNDAVLVNFTTNACIIQDKRTLKMIGKGNLEQGLYVLEEVPLSAALNIVCSVRSASPSLWHRRLGH--PADLPLVALKNVLSFDANCKGAEN

Query:  CTICPLAKQNRLRFISNNNKSDAIFDLIHVDIWGPFAHTSHLGYRYFLTIVDDFSRYTWIFLLKNKSDVLTVIPHFFKLVHTQYSKVIKCFRSDNAPELK
        C +CPLAKQ RL F ++++ S A+FDLIH DIWGPF   S+ G+ YF+T+VDD+SR+TW+ LLK KS+V+TV+P F K+V  Q+ K IK FRSDNA EL+
Subjt:  CTICPLAKQNRLRFISNNNKSDAIFDLIHVDIWGPFAHTSHLGYRYFLTIVDDFSRYTWIFLLKNKSDVLTVIPHFFKLVHTQYSKVIKCFRSDNAPELK

Query:  FTEFFKKNGVEHQYSCVARPEQNSVVERKHQHILNVGRAIYFQSKIPLDYWGECILTAVHLINRTPSKNLDWKTPYELLKKEMP-NYQTLKVFGCLAYAS
        F   F + GV HQ+SCV  P+QN++VERKHQHILNV R+++FQS IP+ YW ECILTAV LINR P+ NL+  +PYELL    P +Y +LK FGCL +A+
Subjt:  FTEFFKKNGVEHQYSCVARPEQNSVVERKHQHILNVGRAIYFQSKIPLDYWGECILTAVHLINRTPSKNLDWKTPYELLKKEMP-NYQTLKVFGCLAYAS

Query:  TIREHRNKFSSRAIPSVFVGYPQGMKGFKLLDLENNNIFVSRDVVFHEEVFPFQSKNTIENMPDFIMNQVLPKACDISLEYKDSIHDNY
         +  H++KF  RA   VF+GYP G+KG+KLLDL ++ +F+SRDV+FHE ++PF +K++         + + P    I   + DSI + Y
Subjt:  TIREHRNKFSSRAIPSVFVGYPQGMKGFKLLDLENNNIFVSRDVVFHEEVFPFQSKNTIENMPDFIMNQVLPKACDISLEYKDSIHDNY

TrEMBL top hits (e-value, % identity, alignment)
A0A2Z7AFV2 Integrase catalytic domain-containing protein; e-value: 6.0e-142; identity: 43.48%
Query:  MGLNDEFSTTRSQILLMDPLPPVNKAFSLIVQEEEHKG-DTNIKSNITLAATQSKTTYKGKDSKPVCKHCGLIGHTIDVCYRIHGYPDNRPVCKHCGLQG
        MGLN+ ++  R+QILLMDPLP ++K FSL+VQEE  +  +  ++  I                +P+    G     +   Y   G   ++  C HC L  
Subjt:  MGLNDEFSTTRSQILLMDPLPPVNKAFSLIVQEEEHKG-DTNIKSNITLAATQSKTTYKGKDSKPVCKHCGLIGHTIDVCYRIHGYPDNRPVCKHCGLQG

Query:  HTIDVCYKIHGYPPSNKQRKNNYKQTNDNQGSVQPENKSCKSATVAASNIESDPFQQCHDILTLLQSKLAGIKNDNGANLTQHMAGMTQ------TYDYF
        HT+D CYK+HGYPP + +    YK    ++ S   ++ S      +  N    P + C  ++  L S+L   +  NG  +T      +       TY   
Subjt:  HTIDVCYKIHGYPPSNKQRKNNYKQTNDNQGSVQPENKSCKSATVAASNIESDPFQQCHDILTLLQSKLAGIKNDNGANLTQHMAGMTQ------TYDYF

Query:  KDR-------WILDSGAAAHICHNKDIFMNLKRIDTSVILPNKDRIIITHAGSILLCGSIILDRVLYVPSFKYNLLSISALTLNDAVLVNFTTNACIIQD
                  WI+D+GA  HIC +   F++ +  +++V LPN   I +TH GS++L   I L  VL+VP FK+NLLSIS+LT     LV+F++ +C IQ 
Subjt:  KDR-------WILDSGAAAHICHNKDIFMNLKRIDTSVILPNKDRIIITHAGSILLCGSIILDRVLYVPSFKYNLLSISALTLNDAVLVNFTTNACIIQD

Query:  KRTLKMIGKGNLEQGLYVLEEVPLSAALNIVCSVRSASPSLWHRRLGHPADLPLVALKNVL--SFDANCKGAENCTICPLAKQNRLRFISNNNKSDAIFD
            K IG G     LY+L     S+    VC+   +   LWH RLGH     L  L + L  SF  N      C IC L+KQ RL FISNN+  D  FD
Subjt:  KRTLKMIGKGNLEQGLYVLEEVPLSAALNIVCSVRSASPSLWHRRLGHPADLPLVALKNVL--SFDANCKGAENCTICPLAKQNRLRFISNNNKSDAIFD

Query:  LIHVDIWGPFAHTSHLGYRYFLTIVDDFSRYTWIFLLKNKSDVLTVIPHFFKLVHTQYSKVIKCFRSDNAPELKFTEFFKKNGVEHQYSCVARPEQNSVV
        L+H+DIWGPF   +  G++YFLTIVDD SRYTW+ LLK+KS+V+ + P F +++H Q+ K IK  RSDNAPELKF+EFFK  G+   +SCV RP+QNSVV
Subjt:  LIHVDIWGPFAHTSHLGYRYFLTIVDDFSRYTWIFLLKNKSDVLTVIPHFFKLVHTQYSKVIKCFRSDNAPELKFTEFFKKNGVEHQYSCVARPEQNSVV

Query:  ERKHQHILNVGRAIYFQSKIPLDYWGECILTAVHLINRTPSKNLDWKTPYELLKKEMPNYQTLKVFGCLAYASTIREHRNKFSSRAIPSVFVGYPQGMKG
        ERKHQHILNV RA+ FQS IPL YW ECILTAV+LINRTP+  L  KTP+EL+  + P Y  L+VFGCL Y ST+   R KFS RA  S+F+GYP G KG
Subjt:  ERKHQHILNVGRAIYFQSKIPLDYWGECILTAVHLINRTPSKNLDWKTPYELLKKEMPNYQTLKVFGCLAYASTIREHRNKFSSRAIPSVFVGYPQGMKG

Query:  FKLLDLENNNIFVSRDVVFHEEVFPFQSKNTIENMPDFIMNQVL
        +KLL+L+ N +++SRDV+FHE VFPF++K+T  + P+  ++ ++
Subjt:  FKLLDLENNNIFVSRDVVFHEEVFPFQSKNTIENMPDFIMNQVL

A0A2Z7AT15 Cysteine-rich RLK (Receptor-like protein kinase) 8; e-value: 1.8e-146; identity: 43.68%
Query:  MGLNDEFSTTRSQILLMDPLPPVNKAFSLIVQEEEHKG-----------DTNIKSNITLAATQSKTTYKGKDSKPVCKHCGLIGHTIDVCYRIHGYPDNR
        MGLND ++  R+Q+L+++PLP + K F+L++QEE  +             + I SN+  +A  + +    ++SK                    G   +R
Subjt:  MGLNDEFSTTRSQILLMDPLPPVNKAFSLIVQEEEHKG-----------DTNIKSNITLAATQSKTTYKGKDSKPVCKHCGLIGHTIDVCYRIHGYPDNR

Query:  PVCKHCGLQGHTIDVCYKIHGYPPSNKQRKNNYKQTNDNQGSVQPENKSCKS----ATVAASNIESDPFQQCHDILTLLQSKLAGIKN------------
         +C HC  + HT+D CYK+HGYPP + + K+       +QGS      S  S     T    + +S    QC  ++  L SKL   +N            
Subjt:  PVCKHCGLQGHTIDVCYKIHGYPPSNKQRKNNYKQTNDNQGSVQPENKSCKS----ATVAASNIESDPFQQCHDILTLLQSKLAGIKN------------

Query:  --DNGANLTQHMAGMTQTYDYFKDRWILDSGAAAHICHNKDIFMNLKRIDTSVILPNKDRIIITHAGSILLCGSIILDRVLYVPSFKYNLLSISALTLND
              + T H+  +T+     KD WI+D+GA  HIC +  +F + + I + V+LPN   I +T AG++ +  +++L  VLYVP F++NLLS+S+LT N 
Subjt:  --DNGANLTQHMAGMTQTYDYFKDRWILDSGAAAHICHNKDIFMNLKRIDTSVILPNKDRIIITHAGSILLCGSIILDRVLYVPSFKYNLLSISALTLND

Query:  AVLVNFTTNACIIQDKRTLKMIGKGNLEQGLYVLEEVPLSAALNIVCSVRSASPSLWHRRLGHPADLPLVALKNVLSFDANCKGAENCTICPLAKQNRLR
           V+F +++C IQD   ++MIG G     LYVL++ P     + +C+   ++  LWHRR+GHP+   L +LKNVL+ + N      C  C L+KQ RL 
Subjt:  AVLVNFTTNACIIQDKRTLKMIGKGNLEQGLYVLEEVPLSAALNIVCSVRSASPSLWHRRLGHPADLPLVALKNVLSFDANCKGAENCTICPLAKQNRLR

Query:  FISNNNKSDAIFDLIHVDIWGPFAHTSHLGYRYFLTIVDDFSRYTWIFLLKNKSDVLTVIPHFFKLVHTQYSKVIKCFRSDNAPELKFTEFFKKNGVEHQ
          S NN S  IF+L+H+D WGPF+ TS  G+R+F TIVDD SRYTW+++LK+KSDVL++ P F ++V TQ+   +K  RSDNAPEL F +FF K G+ H 
Subjt:  FISNNNKSDAIFDLIHVDIWGPFAHTSHLGYRYFLTIVDDFSRYTWIFLLKNKSDVLTVIPHFFKLVHTQYSKVIKCFRSDNAPELKFTEFFKKNGVEHQ

Query:  YSCVARPEQNSVVERKHQHILNVGRAIYFQSKIPLDYWGECILTAVHLINRTPSKNLDWKTPYELLKKEMPNYQTLKVFGCLAYASTIREHRNKFSSRAI
        +SCV RP+QNSVVERKHQHILNV RA+ FQS IPLDYW +CI T+V+LINRTPS  L  KTP+ELL  ++P+Y  LKVFGCL YAST+   R+KFS RAI
Subjt:  YSCVARPEQNSVVERKHQHILNVGRAIYFQSKIPLDYWGECILTAVHLINRTPSKNLDWKTPYELLKKEMPNYQTLKVFGCLAYASTIREHRNKFSSRAI

Query:  PSVFVGYPQGMKGFKLLDLENNNIFVSRDVVFHEEVFPFQS
          VF+GYP G KG+KLL+LE N IF+SRDV+FHE  FP+Q+
Subjt:  PSVFVGYPQGMKGFKLLDLENNNIFVSRDVVFHEEVFPFQS

A0A2Z7C0E8 Integrase catalytic domain-containing protein; e-value: 7.1e-143; identity: 43.92%
Query:  MGLNDEFSTTRSQILLMDPLPPVNKAFSLIVQEEEHKGDTNIKSNITLAATQSKTTYKGKDSK----PVCKHCGLIGHTIDVCYRIHGYPDNRPVCKHCG
        MGLN+ ++  R+QILLMDPLP ++K FSL+VQEE                 + ++ ++G   K    P+  + G     +   Y   G   ++  C HC 
Subjt:  MGLNDEFSTTRSQILLMDPLPPVNKAFSLIVQEEEHKGDTNIKSNITLAATQSKTTYKGKDSK----PVCKHCGLIGHTIDVCYRIHGYPDNRPVCKHCG

Query:  LQGHTIDVCYKIHGYPPSNKQRKNNYKQTNDNQGSVQPENKSCKSATVAASNIESDPFQQCHDILTLLQSKLAGIKNDNGANLTQHMAGMTQTYDYFKD-
        L  HT+D CYK+HGYPP + + K   KQ++     +Q + ++  +A+V    ++    + C  ++  L S+L  + N     L Q       +   F D 
Subjt:  LQGHTIDVCYKIHGYPPSNKQRKNNYKQTNDNQGSVQPENKSCKSATVAASNIESDPFQQCHDILTLLQSKLAGIKNDNGANLTQHMAGMTQTYDYFKD-

Query:  -------------RWILDSGAAAHICHNKDIFMNLKRIDTSVILPNKDRIIITHAGSILLCGSIILDRVLYVPSFKYNLLSISALTLNDAVLVNFTTNAC
                      WI+D+GA  HIC +   F++ K  +++V LPN   I +TH GS++L   IIL  VL+VP FK+NLLSIS+LT      V+F++  C
Subjt:  -------------RWILDSGAAAHICHNKDIFMNLKRIDTSVILPNKDRIIITHAGSILLCGSIILDRVLYVPSFKYNLLSISALTLNDAVLVNFTTNAC

Query:  IIQDKRTLKMIGKGNLEQGLYVLEEVPLSAALNIVCSVRSASPSLWHRRLGHPADLPLVALKNVL--SFDANCKGAENCTICPLAKQNRLRFISNNNKSD
         IQ     + IG G     LY+L   P S     VC+   +   LWH RLGH +   L  L +V+  SF +N +    C IC L+KQ RL FISNN   D
Subjt:  IIQDKRTLKMIGKGNLEQGLYVLEEVPLSAALNIVCSVRSASPSLWHRRLGHPADLPLVALKNVL--SFDANCKGAENCTICPLAKQNRLRFISNNNKSD

Query:  AIFDLIHVDIWGPFAHTSHLGYRYFLTIVDDFSRYTWIFLLKNKSDVLTVIPHFFKLVHTQYSKVIKCFRSDNAPELKFTEFFKKNGVEHQYSCVARPEQ
        + FDL+H+DIWGPF   +  G++YFLTIVDD SRYTW+ LLK+KSDV  + P F +++ TQ+ K IK  RSDNAPEL+F+EFFK  G+   +SCV RP+Q
Subjt:  AIFDLIHVDIWGPFAHTSHLGYRYFLTIVDDFSRYTWIFLLKNKSDVLTVIPHFFKLVHTQYSKVIKCFRSDNAPELKFTEFFKKNGVEHQYSCVARPEQ

Query:  NSVVERKHQHILNVGRAIYFQSKIPLDYWGECILTAVHLINRTPSKNLDWKTPYELLKKEMPNYQTLKVFGCLAYASTIREHRNKFSSRAIPSVFVGYPQ
        NS+VERKHQHILNV RA+ FQS IPL YW +CILT+V+LINR P+  L  KTP+E++  ++PN+  L+VFGCL Y ST+  HR KFS RAI S+F+GYP 
Subjt:  NSVVERKHQHILNVGRAIYFQSKIPLDYWGECILTAVHLINRTPSKNLDWKTPYELLKKEMPNYQTLKVFGCLAYASTIREHRNKFSSRAIPSVFVGYPQ

Query:  GMKGFKLLDLENNNIFVSRDVVFHEEVFPFQSK
        G KG+KLL+L+ N I++SRDV FHE VFPF++K
Subjt:  GMKGFKLLDLENNNIFVSRDVVFHEEVFPFQSK

A0A2Z7D0U1 Integrase catalytic domain-containing protein; e-value: 7.3e-140; identity: 41.98%
Query:  GLNDEFSTTRSQILLMDPLPPVNKAFSLIVQEE---------------EHKGDTNIKSNITLAATQSKTTYKGKDSKPVCKHCGLIGHTIDVCYRIHGYP
        GLN+ ++  R+Q+L+M+P P +   F+L+VQEE                H    N+ SNI  + T  +    GK  K VC HC    HT+D CY++HGYP
Subjt:  GLNDEFSTTRSQILLMDPLPPVNKAFSLIVQEE---------------EHKGDTNIKSNITLAATQSKTTYKGKDSKPVCKHCGLIGHTIDVCYRIHGYP

Query:  DNRPVCK----HCGLQGHTIDVCYKIHGYPPSNKQRKNNYKQTNDNQGSVQPENKSCKSATVAASNIESDPFQQCHDILTLLQSKLAGIKNDNGANLTQH
           P  K        Q H I    + +   P +   +N  KQ          E  S K     +S +E     Q H+  T   S   GI      +   H
Subjt:  DNRPVCK----HCGLQGHTIDVCYKIHGYPPSNKQRKNNYKQTNDNQGSVQPENKSCKSATVAASNIESDPFQQCHDILTLLQSKLAGIKNDNGANLTQH

Query:  MAGMTQTYDYFKDRWILDSGAAAHICHNKDIFMNLKRIDTSVILPNKDRIIITHAGSILLCGSIILDRVLYVPSFKYNLLSISALTLNDAVLVNFTTNAC
         + +T T       W+LD+GA  HIC +  +F + K +++ ++LPN   I +T   S+ L   +IL  VLYVP F++NLLSIS+LT N A  V+F +++C
Subjt:  MAGMTQTYDYFKDRWILDSGAAAHICHNKDIFMNLKRIDTSVILPNKDRIIITHAGSILLCGSIILDRVLYVPSFKYNLLSISALTLNDAVLVNFTTNAC

Query:  IIQDKRTLKMIGKGNLEQGLYVLEEVPLSAALNIVCSVRSASPSLWHRRLGHPADLPLVALKNVLSFDANCKGAENCTICPLAKQNRLRFISNNNKSDAI
         IQD +  K IG G     LYVL +  +++  + VC+V    P L H R+GHP+   L +L N+L FD+       C +C ++KQ RL F S+N  +   
Subjt:  IIQDKRTLKMIGKGNLEQGLYVLEEVPLSAALNIVCSVRSASPSLWHRRLGHPADLPLVALKNVLSFDANCKGAENCTICPLAKQNRLRFISNNNKSDAI

Query:  FDLIHVDIWGPFAHTSHLGYRYFLTIVDDFSRYTWIFLLKNKSDVLTVIPHFFKLVHTQYSKVIKCFRSDNAPELKFTEFFKKNGVEHQYSCVARPEQNS
        F+L+H+D+WGPF+  S  GYR+FLTIVDD + +TW+++L++KS+V +++P F ++V TQ+   IK FRSDNAPEL F   F + G+ H YSCV RP+QNS
Subjt:  FDLIHVDIWGPFAHTSHLGYRYFLTIVDDFSRYTWIFLLKNKSDVLTVIPHFFKLVHTQYSKVIKCFRSDNAPELKFTEFFKKNGVEHQYSCVARPEQNS

Query:  VVERKHQHILNVGRAIYFQSKIPLDYWGECILTAVHLINRTPSKNLDWKTPYELLKKEMPNYQTLKVFGCLAYASTIREHRNKFSSRAIPSVFVGYPQGM
        +VERKHQHILNV RA+ FQS +P+DYW +CI+T+V+LINRTPS +L  KTP+ELL  + P Y  LK+FGCL YAST+   R+K S RAI  VF GYP G 
Subjt:  VVERKHQHILNVGRAIYFQSKIPLDYWGECILTAVHLINRTPSKNLDWKTPYELLKKEMPNYQTLKVFGCLAYASTIREHRNKFSSRAIPSVFVGYPQGM

Query:  KGFKLLDLENNNIFVSRDVVFHEEVFPFQSKNTIENMP-DFIMNQVLP
        +G+KLL+L+ N I +SRDV+FHE  FPFQ+ +  ++ P D   + +LP
Subjt:  KGFKLLDLENNNIFVSRDVVFHEEVFPFQSKNTIENMP-DFIMNQVLP

A0A438HDI8 Retrovirus-related Pol polyprotein from transposon TNT 1-94; e-value: 8.6e-141; identity: 42.84%
Query:  MGLNDEFSTTRSQILLMDPLPPVNKAFSLIVQEEEHKGDTNIKSNITLAATQSKTTYKGKDSKPVCKHCGLIGHTIDVCYRIHGYPDNRPVCKHCGLQGH
        +GLN+ F+  ++QILLM+P PP+NK FSL+VQEE  +  T   S        S+     + S P                       +RP+C HC + GH
Subjt:  MGLNDEFSTTRSQILLMDPLPPVNKAFSLIVQEEEHKGDTNIKSNITLAATQSKTTYKGKDSKPVCKHCGLIGHTIDVCYRIHGYPDNRPVCKHCGLQGH

Query:  TIDVCYKIHGYPPSNKQRKNNYKQTNDNQGSVQPENKSCKSATVAASNIES--------DPFQQCHDILTLLQSKLAGIKNDNGANLTQHMAGMTQTYDY
        T+D CYKIHGY P  + R  N++        + P +      T+   +I S        D   Q   +L+L  S  +     +   L Q ++  T     
Subjt:  TIDVCYKIHGYPPSNKQRKNNYKQTNDNQGSVQPENKSCKSATVAASNIES--------DPFQQCHDILTLLQSKLAGIKNDNGANLTQHMAGMTQTYDY

Query:  FKDR-------WILDSGAAAHICHNKDIFMNLKRIDT-SVILPNKDRIIITHAGSILLCGSIILDRVLYVPSFKYNLLSISALTLNDAVLVNFTTNACII
                   WILDSGA  H+C N  +F ++    + +V LP   +I IT  G+I L   ++L+ VLY+P+F++NL+SISALT  +    +FT + C I
Subjt:  FKDR-------WILDSGAAAHICHNKDIFMNLKRIDT-SVILPNKDRIIITHAGSILLCGSIILDRVLYVPSFKYNLLSISALTLNDAVLVNFTTNACII

Query:  QDKRTLKMIGKGNLEQGLYVLEEVPLSAALNIVCSVRSASP---SLWHRRLGHPADLPLVALKNVLSFDANCKGAENCTICPLAKQNRLRFISNNNKSDA
        QD    K+IG G  +  LY+L+     +  ++     + S     LWH RL HP+++ L  LK  L   +N     +C+ICPLAKQ RL F  +NN S +
Subjt:  QDKRTLKMIGKGNLEQGLYVLEEVPLSAALNIVCSVRSASP---SLWHRRLGHPADLPLVALKNVLSFDANCKGAENCTICPLAKQNRLRFISNNNKSDA

Query:  IFDLIHVDIWGPFAHTSHLGYRYFLTIVDDFSRYTWIFLLKNKSDVLTVIPHFFKLVHTQYSKVIKCFRSDNAPELKFTEFFKKNGVEHQYSCVARPEQN
         FDLIH DIWGPF   +H G+RYFLTIVDD +R TW+ LL+ KSDV T+ P FF +V T++   IK  RSDNAPEL  +  F +  V H +SCV  P+QN
Subjt:  IFDLIHVDIWGPFAHTSHLGYRYFLTIVDDFSRYTWIFLLKNKSDVLTVIPHFFKLVHTQYSKVIKCFRSDNAPELKFTEFFKKNGVEHQYSCVARPEQN

Query:  SVVERKHQHILNVGRAIYFQSKIPLDYWGECILTAVHLINRTPSKNLDWKTPYELLKKEMPNYQTLKVFGCLAYASTIREHRNKFSSRAIPSVFVGYPQG
        SVVERKHQHILNV RA+YFQS IP+ YWG+C+LT+V+LINR PS  L+ KTP+ELL  + P+Y  LK FGCL Y+ST+   R+KFS RA+P VF+GYP G
Subjt:  SVVERKHQHILNVGRAIYFQSKIPLDYWGECILTAVHLINRTPSKNLDWKTPYELLKKEMPNYQTLKVFGCLAYASTIREHRNKFSSRAIPSVFVGYPQG

Query:  MKGFKLLDLENNNIFVSRDVVFHEEVFPFQ-SKNTIENMPDFIMNQVLP
         KG+K+LDLE N I VSR+V F E VFPF+ S+N      DF   +VLP
Subjt:  MKGFKLLDLENNNIFVSRDVVFHEEVFPFQ-SKNTIENMPDFIMNQVLP

SwissProt top hits (e-value, % identity, alignment)
P04146 Copia protein; e-value: 8.6e-37; identity: 27.8%
Query:  WILDSGAAAHICHNKDIFMNLKRIDTSV---ILPNKDRIIITHAGSILLCG--SIILDRVLYVPSFKYNLLSISALTLNDAVLVNFTTNACIIQDKRTLK
        ++LDSGA+ H+ +++ ++ +   +   +   +    + I  T  G + L     I L+ VL+      NL+S+  L     + + F  +   I     + 
Subjt:  WILDSGAAAHICHNKDIFMNLKRIDTSV---ILPNKDRIIITHAGSILLCG--SIILDRVLYVPSFKYNLLSISALTLNDAVLVNFTTNACIIQDKRTLK

Query:  MIGKGNLEQGLYVLEEVP-LSAALNIVCSVRSASPSLWHRRLGHPADLPLVALKNVLSFDANC------KGAENCTICPLAKQNRLRFISNNNKSDAIFD
        +   G       +L  VP ++     + +    +  LWH R GH +D  L+ +K    F             E C  C   KQ RL F    +K+     
Subjt:  MIGKGNLEQGLYVLEEVP-LSAALNIVCSVRSASPSLWHRRLGHPADLPLVALKNVLSFDANC------KGAENCTICPLAKQNRLRFISNNNKSDAIFD

Query:  L--IHVDIWGPFAHTSHLGYRYFLTIVDDFSRYTWIFLLKNKSDVLTVIPHFFKLVHTQYSKVIKCFRSDNAPEL---KFTEFFKKNGVEHQYSCVARPE
        L  +H D+ GP    +     YF+  VD F+ Y   +L+K KSDV ++   F       ++  +     DN  E    +  +F  K G+ +  +    P+
Subjt:  L--IHVDIWGPFAHTSHLGYRYFLTIVDDFSRYTWIFLLKNKSDVLTVIPHFFKLVHTQYSKVIKCFRSDNAPEL---KFTEFFKKNGVEHQYSCVARPE

Query:  QNSVVERKHQHILNVGRAIYFQSKIPLDYWGECILTAVHLINRTPSKNL--DWKTPYELLKKEMPNYQTLKVFGCLAYASTIREHRNKFSSRAIPSVFVG
         N V ER  + I    R +   +K+   +WGE +LTA +LINR PS+ L    KTPYE+   + P  + L+VFG   Y   I+  + KF  ++  S+FVG
Subjt:  QNSVVERKHQHILNVGRAIYFQSKIPLDYWGECILTAVHLINRTPSKNL--DWKTPYELLKKEMPNYQTLKVFGCLAYASTIREHRNKFSSRAIPSVFVG

Query:  YPQGMKGFKLLDLENNNIFVSRDVVFHE
        Y     GFKL D  N    V+RDVV  E
Subjt:  YPQGMKGFKLLDLENNNIFVSRDVVFHE

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94; e-value: 7.0e-55; identity: 30.43%
Query:  KDRWILDSGAAAHICHNKDIFMNLKRIDTSVI-LPNKDRIIITHAGSILL-----CGSIILDRVLYVPSFKYNLLSISALTLNDAVLVNFTTNACIIQDK
        +  W++D+ A+ H    +D+F      D   + + N     I   G I +     C +++L  V +VP  + NL  IS + L+     ++  N      K
Subjt:  KDRWILDSGAAAHICHNKDIFMNLKRIDTSVI-LPNKDRIIITHAGSILL-----CGSIILDRVLYVPSFKYNLLSISALTLNDAVLVNFTTNACIIQDK

Query:  RTLKMIGKGNLEQGLYVLEEVPLSAALNIVCSVRSASPSLWHRRLGHPAD--LPLVALKNVLSFDANCKGAENCTICPLAKQNRLRFISNNNKSDAIFDL
         +L +I KG     LY          LN   +    S  LWH+R+GH ++  L ++A K+++S+ A     + C  C   KQ+R+ F +++ +   I DL
Subjt:  RTLKMIGKGNLEQGLYVLEEVPLSAALNIVCSVRSASPSLWHRRLGHPAD--LPLVALKNVLSFDANCKGAENCTICPLAKQNRLRFISNNNKSDAIFDL

Query:  IHVDIWGPFAHTSHLGYRYFLTIVDDFSRYTWIFLLKNKSDVLTVIPHFFKLVHTQYSKVIKCFRSDNAPEL---KFTEFFKKNGVEHQYSCVARPEQNS
        ++ D+ GP    S  G +YF+T +DD SR  W+++LK K  V  V   F  LV  +  + +K  RSDN  E    +F E+   +G+ H+ +    P+ N 
Subjt:  IHVDIWGPFAHTSHLGYRYFLTIVDDFSRYTWIFLLKNKSDVLTVIPHFFKLVHTQYSKVIKCFRSDNAPEL---KFTEFFKKNGVEHQYSCVARPEQNS

Query:  VVERKHQHILNVGRAIYFQSKIPLDYWGECILTAVHLINRTPSKNLDWKTPYELLKKEMPNYQTLKVFGCLAYASTIREHRNKFSSRAIPSVFVGYPQGM
        V ER ++ I+   R++   +K+P  +WGE + TA +LINR+PS  L ++ P  +   +  +Y  LKVFGC A+A   +E R K   ++IP +F+GY    
Subjt:  VVERKHQHILNVGRAIYFQSKIPLDYWGECILTAVHLINRTPSKNLDWKTPYELLKKEMPNYQTLKVFGCLAYASTIREHRNKFSSRAIPSVFVGYPQGM

Query:  KGFKLLDLENNNIFVSRDVVFHEEVFPFQSKNTIENMPDFIMNQVLP
         G++L D     +  SRDVVF E         T  +M + + N ++P
Subjt:  KGFKLLDLENNNIFVSRDVVFHEEVFPFQSKNTIENMPDFIMNQVLP

P25384 Transposon Ty2-C Gag-Pol polyprotein; e-value: 4.6e-14; identity: 22.85%
Query:  YKIHGYPPSNKQRKNNYKQTNDNQGSVQPENKSCKSATVAASNIESDPFQQC-HDILTLLQSKLAGIKNDNGANLTQHMAGMTQTY-----DYFKDRWIL
        YK H    +  +   N   T     + Q  N S   A  A +   S  F +  +D +         + +DN  +L Q       T+     D   D  ++
Subjt:  YKIHGYPPSNKQRKNNYKQTNDNQGSVQPENKSCKSATVAASNIESDPFQQC-HDILTLLQSKLAGIKNDNGANLTQHMAGMTQTY-----DYFKDRWIL

Query:  DSGA------AAHICHNKDIFMNLKRIDTSVILPNKDRIIITHAGSI---LLCGSIILDRVLYVPSFKYNLLSISALTLNDAVLVNFTTNACIIQDKRTL
        DSGA      +AH  H+      +  +D       K  I I   G++      G+    + L+ P+  Y+LLS+S L  N  +   FT N     D   L
Subjt:  DSGA------AAHICHNKDIFMNLKRIDTSVILPNKDRIIITHAGSI---LLCGSIILDRVLYVPSFKYNLLSISALTLNDAVLVNFTTNACIIQDKRTL

Query:  KMIGKGNLEQGL---YVLEEVPLSAALNIVCSVRSASP---SLWHRRLGHP--ADLPLVALKNVLSF----DANCKGAE--NCTICPLAKQNRLRFISNN
          I K      L   Y++        +N V   +S +     L HR LGH     +     KN +++    D     A    C  C + K  + R +  +
Subjt:  KMIGKGNLEQGL---YVLEEVPLSAALNIVCSVRSASP---SLWHRRLGHP--ADLPLVALKNVLSF----DANCKGAE--NCTICPLAKQNRLRFISNN

Query:  ----NKSDAIFDLIHVDIWGPFAHTSHLGYRYFLTIVDDFSRYTWIFLL--KNKSDVLTVIPHFFKLVHTQYSKVIKCFRSDNAPEL---KFTEFFKKNG
             +S   F  +H DI+GP  H       YF++  D+ +R+ W++ L  + +  +L V       +  Q++  +   + D   E       +FF   G
Subjt:  ----NKSDAIFDLIHVDIWGPFAHTSHLGYRYFLTIVDDFSRYTWIFLL--KNKSDVLTVIPHFFKLVHTQYSKVIKCFRSDNAPEL---KFTEFFKKNG

Query:  VEHQYSCVARPEQNSVVERKHQHILNVGRAIYFQSKIPLDYW
        +   Y+  A    + V ER ++ +LN  R +   S +P   W
Subjt:  VEHQYSCVARPEQNSVVERKHQHILNVGRAIYFQSKIPLDYW

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1; e-value: 9.1e-55; identity: 29.38%
Query:  NKQRKNNYKQTNDNQGSVQPENKSCKS--ATVAASNIESDPFQQCHDILTLLQSKLAGIKNDNGANLTQHMAGMTQTYDYFKDRWILDSGAAAHICHNKD
        N+   NN K    +  +  P N   K          ++    ++C  +   L S +   +  +     Q  A +     Y  + W+LDSGA  HI  +  
Subjt:  NKQRKNNYKQTNDNQGSVQPENKSCKS--ATVAASNIESDPFQQCHDILTLLQSKLAGIKNDNGANLTQHMAGMTQTYDYFKDRWILDSGAAAHICHNKD

Query:  IFMNLKRID-----TSVILPNKDRIIITHAGSILLCGS---IILDRVLYVPSFKYNLLSISALTLNDAVLVNFTTNACIIQDKRTLKMIGKGNLEQGLYV
         F NL           V++ +   I I+H GS  L      + L  +LYVP+   NL+S+  L   + V V F   +  ++D  T   + +G  +  LY 
Subjt:  IFMNLKRID-----TSVILPNKDRIIITHAGSILLCGS---IILDRVLYVPSFKYNLLSISALTLNDAVLVNFTTNACIIQDKRTLKMIGKGNLEQGLYV

Query:  LEEVPLSAA--LNIVCSVRS-ASPSLWHRRLGHPADLPLVALKNVLSFDANCKGAE--NCTICPLAKQNRLRFISNNNKSDAIFDLIHVDIWGPFAHTSH
          E P++++  +++  S  S A+ S WH RLGHPA   L ++ +  S        +  +C+ C + K N++ F  +   S    + I+ D+W      SH
Subjt:  LEEVPLSAA--LNIVCSVRS-ASPSLWHRRLGHPADLPLVALKNVLSFDANCKGAE--NCTICPLAKQNRLRFISNNNKSDAIFDLIHVDIWGPFAHTSH

Query:  LGYRYFLTIVDDFSRYTWIFLLKNKSDVLTVIPHFFKLVHTQYSKVIKCFRSDNAPE-LKFTEFFKKNGVEHQYSCVARPEQNSVVERKHQHILNVGRAI
          YRY++  VD F+RYTW++ LK KS V      F  L+  ++   I  F SDN  E +   E+F ++G+ H  S    PE N + ERKH+HI+  G  +
Subjt:  LGYRYFLTIVDDFSRYTWIFLLKNKSDVLTVIPHFFKLVHTQYSKVIKCFRSDNAPE-LKFTEFFKKNGVEHQYSCVARPEQNSVVERKHQHILNVGRAI

Query:  YFQSKIPLDYWGECILTAVHLINRTPSKNLDWKTPYELLKKEMPNYQTLKVFGCLAYASTIREHRNKFSSRAIPSVFVGYPQGMKGFKLLDLENNNIFVS
           + IP  YW      AV+LINR P+  L  ++P++ L    PNY  L+VFGC  Y      +++K   ++   VF+GY      +  L L+ + +++S
Subjt:  YFQSKIPLDYWGECILTAVHLINRTPSKNLDWKTPYELLKKEMPNYQTLKVFGCLAYASTIREHRNKFSSRAIPSVFVGYPQGMKGFKLLDLENNNIFVS

Query:  RDVVFHEEVFPFQS
        R V F E  FPF +
Subjt:  RDVVFHEEVFPFQS

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2; e-value: 1.3e-53; identity: 28.54%
Query:  SNKQRKNNYKQTNDNQGSVQPENKSCKS---------ATVAASNIESDPFQQCHDILTLLQSKLAGIKNDNGANLTQHMAGMTQTYDYFKDRWILDSGAA
        +N+    NY   N+   S QP +   +S               +++    ++C   L   QS     ++ +     Q  A +     Y  + W+LDSGA 
Subjt:  SNKQRKNNYKQTNDNQGSVQPENKSCKS---------ATVAASNIESDPFQQCHDILTLLQSKLAGIKNDNGANLTQHMAGMTQTYDYFKDRWILDSGAA

Query:  AHIC--HNKDIFMNLKRIDTSVILPNKDRIIITHAGSILL---CGSIILDRVLYVPSFKYNLLSISALTLNDAVLVNFTTNACIIQDKRTLKMIGKGNLE
         HI    N   F         V++ +   I ITH GS  L     S+ L++VLYVP+   NL+S+  L   + V V F   +  ++D  T   + +G  +
Subjt:  AHIC--HNKDIFMNLKRIDTSVILPNKDRIIITHAGSILL---CGSIILDRVLYVPSFKYNLLSISALTLNDAVLVNFTTNACIIQDKRTLKMIGKGNLE

Query:  QGLYVLEEVPLSAALNIVCSVRSASPSLWHRRLGHPADLPLVALKNVLSFDA-----NCKGAENCTICPLAKQNRLRFISNNNKSDAIFDLIHVDIW-GP
          LY        A          A+ S WH RLGHP+   L  L +V+S  +           +C+ C + K +++ F ++   S    + I+ D+W  P
Subjt:  QGLYVLEEVPLSAALNIVCSVRSASPSLWHRRLGHPADLPLVALKNVLSFDA-----NCKGAENCTICPLAKQNRLRFISNNNKSDAIFDLIHVDIW-GP

Query:  FAHTSHLGYRYFLTIVDDFSRYTWIFLLKNKSDVLTVIPHFFKLVHTQYSKVIKCFRSDNAPE-LKFTEFFKKNGVEHQYSCVARPEQNSVVERKHQHIL
             +  YRY++  VD F+RYTW++ LK KS V      F  LV  ++   I    SDN  E +   ++  ++G+ H  S    PE N + ERKH+HI+
Subjt:  FAHTSHLGYRYFLTIVDDFSRYTWIFLLKNKSDVLTVIPHFFKLVHTQYSKVIKCFRSDNAPE-LKFTEFFKKNGVEHQYSCVARPEQNSVVERKHQHIL

Query:  NVGRAIYFQSKIPLDYWGECILTAVHLINRTPSKNLDWKTPYELLKKEMPNYQTLKVFGCLAYASTIREHRNKFSSRAIPSVFVGYPQGMKGFKLLDLEN
         +G  +   + +P  YW      AV+LINR P+  L  ++P++ L  + PNY+ LKVFGC  Y      +R+K   ++    F+GY      +  L +  
Subjt:  NVGRAIYFQSKIPLDYWGECILTAVHLINRTPSKNLDWKTPYELLKKEMPNYQTLKVFGCLAYASTIREHRNKFSSRAIPSVFVGYPQGMKGFKLLDLEN

Query:  NNIFVSRDVVFHEEVFPFQSKN
          ++ SR V F E  FPF + N
Subjt:  NNIFVSRDVVFHEEVFPFQSKN

Arabidopsis top hits (e-value, % identity, alignment)
AT4G05360.1 Zinc knuckle (CCHC-type) family protein; e-value: 7.5e-04; identity: 32.73%
Query:  KPVCKHCGLIGHTIDVCYRI----------HGYPDNRPVCKHCGLQGHTIDVCYK
        +PVC HCG++GH    C+R+          +    + P C H G+QGH    C++
Subjt:  KPVCKHCGLIGHTIDVCYRI----------HGYPDNRPVCKHCGLQGHTIDVCYK

ATMG00710.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein; e-value: 1.8e-05; identity: 32.31%
Query:  ILNVGRAIYFQSKIPLDYWGECILTAVHLINRTPSKNLDWKTPYELLKKEMPNYQTLKVFGCLAY
        I+   R++  +  +P  +  +   TAVH+IN+ PS  +++  P E+  + +P Y  L+ FGC+AY
Subjt:  ILNVGRAIYFQSKIPLDYWGECILTAVHLINRTPSKNLDWKTPYELLKKEMPNYQTLKVFGCLAY
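
The % identity reported for each hit can be cross-checked against the middle line of its alignment block. In that line a letter marks an identical residue, "+" marks a conservative substitution, and a space marks a mismatch or gap. The sketch below is a minimal illustration, not the database's own calculation: the function name `midline_stats` is hypothetical, and it assumes the denominator is the number of aligned columns in the Query line (gaps included).

```python
def midline_stats(query: str, midline: str) -> tuple[int, int, int]:
    """Return (identities, positives, aligned_columns) for one alignment block.

    Assumption: the Query line spans every aligned column, including gap
    characters, so its length is used as the denominator.
    """
    identities = sum(1 for ch in midline if ch.isalpha())
    positives = identities + midline.count("+")
    return identities, positives, len(query)


# Query and middle line copied from the ATMG00710.1 block above.
query = "ILNVGRAIYFQSKIPLDYWGECILTAVHLINRTPSKNLDWKTPYELLKKEMPNYQTLKVFGCLAY"
midline = "I+   R++  +  +P  +  +   TAVH+IN+ PS  +++  P E+  + +P Y  L+ FGC+AY"

identities, positives, columns = midline_stats(query, midline)
percent_identity = round(100 * identities / columns, 2)
print(identities, columns, percent_identity)
```

For this block the count is 21 identities over 65 columns, which rounds to the 32.31 listed for the hit. For alignments split across several blocks, the counts would need to be summed over all blocks before dividing.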


Sequences
CDS sequence
ATGGGTCTAAATGATGAGTTTTCTACTACTAGATCACAAATACTTCTCATGGATCCATTGCCACCAGTCAATAAGGCTTTTTCTTTAATCGTACAAGAAGAGGAACATAA
GGGAGATACAAATATTAAGAGTAATATTACCTTAGCTGCCACTCAGTCTAAAACCACATACAAAGGGAAGGATTCTAAGCCAGTATGCAAGCATTGTGGTCTCATAGGAC
ACACAATTGATGTTTGCTATAGAATACATGGATATCCGGATAATAGACCTGTGTGCAAGCATTGTGGGTTACAAGGACACACCATCGATGTATGTTATAAAATACATGGG
TATCCACCTAGTAACAAGCAAAGGAAAAATAACTACAAGCAAACCAATGATAACCAAGGTTCTGTACAACCTGAAAACAAATCTTGCAAATCAGCAACAGTTGCAGCTAG
CAATATTGAAAGTGATCCTTTTCAACAATGTCATGATATATTGACTCTTCTTCAATCCAAGTTAGCTGGCATCAAGAATGACAATGGAGCGAACCTAACGCAACATATGG
CAGGTATGACACAAACATATGATTATTTTAAAGATAGATGGATACTAGATTCAGGTGCAGCAGCCCATATATGTCATAACAAAGATATATTCATGAATTTAAAAAGGATT
GATACCTCTGTGATATTACCTAATAAGGATAGGATTATAATCACTCATGCTGGATCTATATTGTTGTGTGGATCTATCATTTTAGATAGAGTCTTGTATGTTCCAAGTTT
TAAATACAATTTATTGTCTATTAGTGCACTAACCTTGAATGATGCAGTGTTAGTAAATTTCACAACTAATGCTTGTATTATCCAGGACAAGCGCACTTTGAAGATGATTG
GGAAGGGTAATCTTGAGCAAGGATTATATGTGTTAGAGGAGGTACCTTTATCTGCAGCATTGAATATTGTTTGTAGTGTAAGGAGTGCCTCACCATCCCTATGGCATAGG
AGATTAGGTCATCCAGCTGATTTACCTTTAGTTGCTTTAAAAAATGTACTTTCTTTTGATGCAAATTGTAAAGGGGCTGAAAATTGTACTATATGCCCTTTGGCTAAACA
AAACAGATTGAGATTCATTTCAAATAATAATAAATCAGATGCTATTTTTGATCTCATACATGTTGATATATGGGGGCCTTTTGCTCATACCTCACATTTAGGATACAGAT
ATTTTCTAACAATAGTAGATGATTTTAGTAGATACACTTGGATATTTCTTTTGAAAAATAAATCAGATGTTTTAACTGTTATTCCACACTTCTTTAAACTAGTGCACACA
CAATATTCAAAAGTGATAAAGTGTTTTAGATCTGATAATGCGCCAGAACTTAAATTTACTGAATTCTTTAAGAAAAACGGAGTAGAACATCAATACTCTTGTGTAGCTCG
TCCGGAACAAAACTCAGTAGTTGAGCGAAAACACCAACACATCCTCAATGTAGGAAGAGCCATCTATTTCCAATCCAAAATACCTTTAGATTATTGGGGAGAGTGTATAC
TAACTGCTGTTCATTTAATAAATAGGACACCTTCTAAGAACCTAGATTGGAAAACTCCCTATGAATTACTAAAGAAAGAAATGCCAAACTACCAAACCTTAAAAGTTTTT
GGGTGTTTGGCATATGCATCTACCATTCGTGAACATAGAAATAAATTTTCATCTAGAGCAATTCCAAGTGTTTTTGTAGGCTATCCACAAGGCATGAAAGGTTTTAAACT
TCTGGATTTAGAAAATAACAATATTTTTGTTTCAAGGGATGTAGTTTTCCATGAGGAAGTTTTCCCTTTTCAAAGTAAAAATACCATAGAAAACATGCCAGATTTTATTA
TGAACCAAGTATTGCCGAAAGCTTGTGATATATCTCTAGAATATAAAGATAGCATACATGACAATTATGGTGATGAAGTACAAAAGTAA
mRNA sequence
ATGGGTCTAAATGATGAGTTTTCTACTACTAGATCACAAATACTTCTCATGGATCCATTGCCACCAGTCAATAAGGCTTTTTCTTTAATCGTACAAGAAGAGGAACATAA
GGGAGATACAAATATTAAGAGTAATATTACCTTAGCTGCCACTCAGTCTAAAACCACATACAAAGGGAAGGATTCTAAGCCAGTATGCAAGCATTGTGGTCTCATAGGAC
ACACAATTGATGTTTGCTATAGAATACATGGATATCCGGATAATAGACCTGTGTGCAAGCATTGTGGGTTACAAGGACACACCATCGATGTATGTTATAAAATACATGGG
TATCCACCTAGTAACAAGCAAAGGAAAAATAACTACAAGCAAACCAATGATAACCAAGGTTCTGTACAACCTGAAAACAAATCTTGCAAATCAGCAACAGTTGCAGCTAG
CAATATTGAAAGTGATCCTTTTCAACAATGTCATGATATATTGACTCTTCTTCAATCCAAGTTAGCTGGCATCAAGAATGACAATGGAGCGAACCTAACGCAACATATGG
CAGGTATGACACAAACATATGATTATTTTAAAGATAGATGGATACTAGATTCAGGTGCAGCAGCCCATATATGTCATAACAAAGATATATTCATGAATTTAAAAAGGATT
GATACCTCTGTGATATTACCTAATAAGGATAGGATTATAATCACTCATGCTGGATCTATATTGTTGTGTGGATCTATCATTTTAGATAGAGTCTTGTATGTTCCAAGTTT
TAAATACAATTTATTGTCTATTAGTGCACTAACCTTGAATGATGCAGTGTTAGTAAATTTCACAACTAATGCTTGTATTATCCAGGACAAGCGCACTTTGAAGATGATTG
GGAAGGGTAATCTTGAGCAAGGATTATATGTGTTAGAGGAGGTACCTTTATCTGCAGCATTGAATATTGTTTGTAGTGTAAGGAGTGCCTCACCATCCCTATGGCATAGG
AGATTAGGTCATCCAGCTGATTTACCTTTAGTTGCTTTAAAAAATGTACTTTCTTTTGATGCAAATTGTAAAGGGGCTGAAAATTGTACTATATGCCCTTTGGCTAAACA
AAACAGATTGAGATTCATTTCAAATAATAATAAATCAGATGCTATTTTTGATCTCATACATGTTGATATATGGGGGCCTTTTGCTCATACCTCACATTTAGGATACAGAT
ATTTTCTAACAATAGTAGATGATTTTAGTAGATACACTTGGATATTTCTTTTGAAAAATAAATCAGATGTTTTAACTGTTATTCCACACTTCTTTAAACTAGTGCACACA
CAATATTCAAAAGTGATAAAGTGTTTTAGATCTGATAATGCGCCAGAACTTAAATTTACTGAATTCTTTAAGAAAAACGGAGTAGAACATCAATACTCTTGTGTAGCTCG
TCCGGAACAAAACTCAGTAGTTGAGCGAAAACACCAACACATCCTCAATGTAGGAAGAGCCATCTATTTCCAATCCAAAATACCTTTAGATTATTGGGGAGAGTGTATAC
TAACTGCTGTTCATTTAATAAATAGGACACCTTCTAAGAACCTAGATTGGAAAACTCCCTATGAATTACTAAAGAAAGAAATGCCAAACTACCAAACCTTAAAAGTTTTT
GGGTGTTTGGCATATGCATCTACCATTCGTGAACATAGAAATAAATTTTCATCTAGAGCAATTCCAAGTGTTTTTGTAGGCTATCCACAAGGCATGAAAGGTTTTAAACT
TCTGGATTTAGAAAATAACAATATTTTTGTTTCAAGGGATGTAGTTTTCCATGAGGAAGTTTTCCCTTTTCAAAGTAAAAATACCATAGAAAACATGCCAGATTTTATTA
TGAACCAAGTATTGCCGAAAGCTTGTGATATATCTCTAGAATATAAAGATAGCATACATGACAATTATGGTGATGAAGTACAAAAGTAA
Protein sequence
MGLNDEFSTTRSQILLMDPLPPVNKAFSLIVQEEEHKGDTNIKSNITLAATQSKTTYKGKDSKPVCKHCGLIGHTIDVCYRIHGYPDNRPVCKHCGLQGHTIDVCYKIHG
YPPSNKQRKNNYKQTNDNQGSVQPENKSCKSATVAASNIESDPFQQCHDILTLLQSKLAGIKNDNGANLTQHMAGMTQTYDYFKDRWILDSGAAAHICHNKDIFMNLKRI
DTSVILPNKDRIIITHAGSILLCGSIILDRVLYVPSFKYNLLSISALTLNDAVLVNFTTNACIIQDKRTLKMIGKGNLEQGLYVLEEVPLSAALNIVCSVRSASPSLWHR
RLGHPADLPLVALKNVLSFDANCKGAENCTICPLAKQNRLRFISNNNKSDAIFDLIHVDIWGPFAHTSHLGYRYFLTIVDDFSRYTWIFLLKNKSDVLTVIPHFFKLVHT
QYSKVIKCFRSDNAPELKFTEFFKKNGVEHQYSCVARPEQNSVVERKHQHILNVGRAIYFQSKIPLDYWGECILTAVHLINRTPSKNLDWKTPYELLKKEMPNYQTLKVF
GCLAYASTIREHRNKFSSRAIPSVFVGYPQGMKGFKLLDLENNNIFVSRDVVFHEEVFPFQSKNTIENMPDFIMNQVLPKACDISLEYKDSIHDNYGDEVQK