; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0005056 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0005056
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionIntegrase catalytic domain-containing protein
Genome locationchr6:10089771..10094576
RNA-Seq ExpressionLag0005056
SyntenyLag0005056
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR005162 - Retrotransposon gag domain
IPR012337 - Ribonuclease H-like superfamily
IPR029472 - Retrotransposon Copia-like, N-terminal
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0065480.1 Cysteine-rich RLK (receptor-like protein kinase) 8 [Cucumis melo var. makuwa]1.6e-12839.25Show/hide
Query:  NGWDISSSSAARPTNPSVIDQYVNPYFLHHSDGTNLVLVSELLTES-NYNSWSQAMLLGLMVKNKEGFVNGIITKPTN-ELLHSWKICNGVVKAWILNAL
        NG      +AA   N    D  +NPYF+HHS G    +V++ LT + NY SWS+AML+ +  +NK GF+ G I KP++  LL +W   N ++ +WILN++
Subjt:  NGWDISSSSAARPTNPSVIDQYVNPYFLHHSDGTNLVLVSELLTES-NYNSWSQAMLLGLMVKNKEGFVNGIITKPTN-ELLHSWKICNGVVKAWILNAL

Query:  TKEIAASLNFSDTAREMWVDLQQRYQRRNRPRIFQLRRKISNLAQDQLSVFAYFAKLKALWNELSSYRPSCSCGNCSCGGVKELAKYFQTEHVMAFLMGL
        +KEIAAS+ +  + +E+W +L+QR+++ N P I+QLR++   L Q  L++  Y+ KLK +W  L+ YR      +C+CGG+K    + ++E++MAFLMGL
Subjt:  TKEIAASLNFSDTAREMWVDLQQRYQRRNRPRIFQLRRKISNLAQDQLSVFAYFAKLKALWNELSSYRPSCSCGNCSCGGVKELAKYFQTEHVMAFLMGL

Query:  NESFGQIRTQLLLMEPEPTITRAFSLIAQEVEQRAS---VPPVSNVATPSTIEATTLLAKNQSSSRSQVIKKKERPHCTHCNILGHTVDKCCKIHGYPPR
        N+S+  +R Q+LLM+P P+I   FSL+ QE +QR++    PP+  VA    I ++  ++ +++       +KKERP C++C I GH  DKC K HGYPP 
Subjt:  NESFGQIRTQLLLMEPEPTITRAFSLIAQEVEQRAS---VPPVSNVATPSTIEATTLLAKNQSSSRSQVIKKKERPHCTHCNILGHTVDKCCKIHGYPPR

Query:  Y--RNQKGST--------------------------------------SVLNQRL------PVFPVSVLSNVKGFLPFSKH------IWRLRPNVVR-IL
        Y  RN    T                                      ++LN  L      P+   + +++  G    + H       W +     R I 
Subjt:  Y--RNQKGST--------------------------------------SVLNQRL------PVFPVSVLSNVKGFLPFSKH------IWRLRPNVVR-IL

Query:  PRLPMEMHMLQVSGFSVTLPNQNRLIVDHVGIVQLSPQLVLKNVLFVPEFRFNLVSISALINSSPILLNFVGGYCLIQDRSSLKMIGKAKTWQGLYLLNH
            +  +    +   V LPN +R+ VD +G +Q++  L LK+VLFV +F +NL+S+S L+ +  I L+F    C+IQD S   MIGKA    GLY+LN 
Subjt:  PRLPMEMHMLQVSGFSVTLPNQNRLIVDHVGIVQLSPQLVLKNVLFVPEFRFNLVSISALINSSPILLNFVGGYCLIQDRSSLKMIGKAKTWQGLYLLNH

Query:  EDLLSENDSLTRASAFNVSC--QKVS----KCNSSLSSP-----------------------LSFHASGHIAANPFDLVHCDTWGPFNTPTHSGHQYFLS
        E   +   +  + +A +V    Q++     KC SSLSS                        LSFH++ ++A++PFDLVH D WGPF  P++ G++YFL+
Subjt:  EDLLSENDSLTRASAFNVSC--QKVS----KCNSSLSSP-----------------------LSFHASGHIAANPFDLVHCDTWGPFNTPTHSGHQYFLS

Query:  IVDDFSRYTWIFLMKKKSDVSHIIPHFFQYIETQYQRTIKCFRSDNAPELSFVEFFRTKGVVHHLSCVERPQQNSVVERKHQHLLNVARALLF
        +VDD  R+TW++++++KSDV HI+P FFQ IETQ+ + IK FRSDNAPEL   EFF  KG VH  SCVE+PQQNSVVERKHQHLLNVARAL F
Subjt:  IVDDFSRYTWIFLMKKKSDVSHIIPHFFQYIETQYQRTIKCFRSDNAPELSFVEFFRTKGVVHHLSCVERPQQNSVVERKHQHLLNVARALLF

RVW82526.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]1.5e-11837.82Show/hide
Query:  PTNP--SVIDQYVNPYFLHHSDGTNLVLVSELL--TESNYNSWSQAMLLGLMVKNKEGFVNGIITKP--TNELLHSWKICNGVVKAWILNALTKEIAASL
        P +P  S ++ + +PYFLH+ D  +L LVS  L  + SNY+SW ++M+  L  KNK GF++G I++P  T+ L   W  CN +V +W+ N++ KEIA S+
Subjt:  PTNP--SVIDQYVNPYFLHHSDGTNLVLVSELL--TESNYNSWSQAMLLGLMVKNKEGFVNGIITKP--TNELLHSWKICNGVVKAWILNALTKEIAASL

Query:  NFSDTAREMWVDLQQRYQRRNRPRIFQLRRKISNLAQDQLSVFAYFAKLKALWNELSSYRPSCSCGNCSCGGVKELAKYFQTEHVMAFLMGLNESFGQIR
         + +TA E+W DL +R+ + + PRIF+L++KI    Q    V  Y+ +LK+LW+EL  ++   +   C+CGG++   +  Q E VM FL+GLNESF  I+
Subjt:  NFSDTAREMWVDLQQRYQRRNRPRIFQLRRKISNLAQDQLSVFAYFAKLKALWNELSSYRPSCSCGNCSCGGVKELAKYFQTEHVMAFLMGLNESFGQIR

Query:  TQLLLMEPEPTITRAFSLIAQEVEQRA---SVPPVSNVATPSTIEATTLLAKNQSSSRSQVIKKKERPHCTHCNILGHTVDKCCKIHGYPPRYRNQ---K
         Q+LLMEP P + + FSL+ QE  QR+   S  P       S  +A +  +   +SSRS    +K+RP CTHCNILGHTVD+C KIHGY P +RN+   +
Subjt:  TQLLLMEPEPTITRAFSLIAQEVEQRA---SVPPVSNVATPSTIEATTLLAKNQSSSRSQVIKKKERPHCTHCNILGHTVDKCCKIHGYPPRYRNQ---K

Query:  GSTSVLNQRLP-----------------VFPVSV---------------------------------LSNVKGFLPFSKHIWRLRPNV--------VRIL
         + S  NQ LP                   P  +                                 +SN  G L  S     L P++          + 
Subjt:  GSTSVLNQRLP-----------------VFPVSV---------------------------------LSNVKGFLPFSKHIWRLRPNV--------VRIL

Query:  PRLPMEMHMLQVSGFSVTLPNQNRLIVDHVGIVQLSPQLVLKNVLFVPEFRFNLVSISALINSSPILLNFVGGYCLIQDRSSLKMIGKAKTWQGLYLLNH
            M   +   S  +VTLP   ++ +  +G + LSP LVL++VL++P F+FNL+SISAL  ++    +F   +C IQD S  K+IG  +    LYLL+ 
Subjt:  PRLPMEMHMLQVSGFSVTLPNQNRLIVDHVGIVQLSPQLVLKNVLFVPEFRFNLVSISALINSSPILLNFVGGYCLIQDRSSLKMIGKAKTWQGLYLLNH

Query:  EDLLS-----------------------ENDSLTRASAFNVSCQKVSKCNSSLS---------SPLSFHASGHIAANPFDLVHCDTWGPFNTPTHSGHQY
            S                        + S  + S      Q  S  N++LS           L F    +++++PFDL+HCD WGPF+ PTH G +Y
Subjt:  EDLLS-----------------------ENDSLTRASAFNVSCQKVSKCNSSLS---------SPLSFHASGHIAANPFDLVHCDTWGPFNTPTHSGHQY

Query:  FLSIVDDFSRYTWIFLMKKKSDVSHIIPHFFQYIETQYQRTIKCFRSDNAPELSFVEFFRTKGVVHHLSCVERPQQNSVVERKHQHLLNVARALLFQSRV
        FL+IVDD +R TW+ L++ KSDV  I P FF  ++T++  TIK  RSDNAPEL+    F    V+H  SCVE PQQNSVVERKHQH+LNVARAL FQS +
Subjt:  FLSIVDDFSRYTWIFLMKKKSDVSHIIPHFFQYIETQYQRTIKCFRSDNAPELSFVEFFRTKGVVHHLSCVERPQQNSVVERKHQHLLNVARALLFQSRV

Query:  PFNILG
        P    G
Subjt:  PFNILG

XP_012856897.1 PREDICTED: uncharacterized protein LOC105976150 [Erythranthe guttata]2.6e-13442.11Show/hide
Query:  IDQYVNPYFLHHSDGTNLVLVSELLTESNYNSWSQAMLLGLMVKNKEGFVNGIITKPTNE---LLHSWKICNGVVKAWILNALTKEIAASLNFSDTAREM
        +D   +P++LH SDG  LVLVS+LL+E N+ SWS+AM + L VKNK GF+NG IT+P+ +   L ++W   N +V +WILNA++K+I AS+ +SD+A EM
Subjt:  IDQYVNPYFLHHSDGTNLVLVSELLTESNYNSWSQAMLLGLMVKNKEGFVNGIITKPTNE---LLHSWKICNGVVKAWILNALTKEIAASLNFSDTAREM

Query:  WVDLQQRYQRRNRPRIFQLRRKISNLAQDQLSVFAYFAKLKALWNELSSYRPSCSCGNCSCGGVKELAKYFQTEHVMAFLMGLNESFGQIRTQLLLMEPE
        W DL  R+ + N PRIFQLRR++SNL QD  SV  YF KLKA+W+ELS++RPSC+CG C+CGGV++L +++  EHVMAFLMGLNES    R Q+LLM+P 
Subjt:  WVDLQQRYQRRNRPRIFQLRRKISNLAQDQLSVFAYFAKLKALWNELSSYRPSCSCGNCSCGGVKELAKYFQTEHVMAFLMGLNESFGQIRTQLLLMEPE

Query:  PTITRAFSLIAQEVEQRASVPPVSNVATPSTIEATTLLAKNQSSSRSQVI----KKKERPHCTHCNILGHTVDKCCKIHGYPPRYRNQKGSTSVLNQRLP
        P I + F+L++QE  QR S+    N    S   +       Q S  +QV     K+KER  CTHCNI GHT+DKC K+HGYPP Y+ +   +S+   R  
Subjt:  PTITRAFSLIAQEVEQRASVPPVSNVATPSTIEATTLLAKNQSSSRSQVI----KKKERPHCTHCNILGHTVDKCCKIHGYPPRYRNQKGSTSVLNQRLP

Query:  VFPVS------------------------VLSN----------------------------------------VKGFL-------PFSKHIWRLRPNVVR
        V  V+                        VL+N                                        V G          F  H W +     R
Subjt:  VFPVS------------------------VLSN----------------------------------------VKGFL-------PFSKHIWRLRPNVVR

Query:  -ILPRLPMEMHMLQVSGFSVTLPNQNRLIVDHVGIVQLSPQLVLKNVLFVPEFRFNLVSISALINSSPILLNFVGGYCLIQDRSSLKMIGK-------AK
         I     +   + +V+   V LP+ + ++V+++G V LS  L+LKNV +VP F+FNL+S+SAL++  P  + F     LIQD+  LK IGK       A 
Subjt:  -ILPRLPMEMHMLQVSGFSVTLPNQNRLIVDHVGIVQLSPQLVLKNVLFVPEFRFNLVSISALINSSPILLNFVGGYCLIQDRSSLKMIGK-------AK

Query:  TWQGLYLLNH-----EDLLSENDSLTRASAFNVSCQKVSKCNSSLSSPLSFHASGHIAANPFDLVHCDTWGPFNTPTHSGHQYFLSIVDDFSRYTWIFLM
         W     L H      D+LS   SL      N SC  +  C  +    L F  S  ++++ FDL+HCD WGP+   +H+G++YF+++VDD+SR+TW+ L+
Subjt:  TWQGLYLLNH-----EDLLSENDSLTRASAFNVSCQKVSKCNSSLSSPLSFHASGHIAANPFDLVHCDTWGPFNTPTHSGHQYFLSIVDDFSRYTWIFLM

Query:  KKKSDVSHIIPHFFQYIETQYQRTIKCFRSDNAPELSFVEFFRTKGVVHHLSCVERPQQNSVVERKHQHLLNVARALLFQSRVP
        K KSDV   IP FF  ++TQ+   IK FRSDNA EL F + F   GV+H  SCV  PQQN+VVERKHQH+LNVAR+LLFQS +P
Subjt:  KKKSDVSHIIPHFFQYIETQYQRTIKCFRSDNAPELSFVEFFRTKGVVHHLSCVERPQQNSVVERKHQHLLNVARALLFQSRVP

XP_012857659.1 PREDICTED: uncharacterized protein LOC105976934 [Erythranthe guttata]6.7e-13039.35Show/hide
Query:  SVIDQYVNPYFLHHSDGTNLVLVSELLTESNYNSWSQAMLLGLMVKNKEGFVNGIITKPTNE---LLHSWKICNGVVKAWILNALTKEIAASLNFSDTAR
        S ID   +PY+LH SDG  LVLVS LL E NY +W++AM++ L VKNK GF++G ITKP  +   LL++W   N +V +WILNA++ +I AS+ +S++A 
Subjt:  SVIDQYVNPYFLHHSDGTNLVLVSELLTESNYNSWSQAMLLGLMVKNKEGFVNGIITKPTNE---LLHSWKICNGVVKAWILNALTKEIAASLNFSDTAR

Query:  EMWVDLQQRYQRRNRPRIFQLRRKISNLAQDQLSVFAYFAKLKALWNELSSYRPSCSCGNCSCGGVKELAKYFQTEHVMAFLMGLNESFGQIRTQLLLME
        ++W DL+ R+ + N PRIFQLRR+++NL QDQ SV  YF KLKA+W+EL ++RP+C+CG CSCGGV +L  +   EHVM+FLMGLN+S    R Q+LLM+
Subjt:  EMWVDLQQRYQRRNRPRIFQLRRKISNLAQDQLSVFAYFAKLKALWNELSSYRPSCSCGNCSCGGVKELAKYFQTEHVMAFLMGLNESFGQIRTQLLLME

Query:  PEPTITRAFSLIAQEVEQRASVPPVSNVATPSTIEATTLLAKNQSSSRSQ-------VIKKKERPHCTHCNILGHTVDKCCKIHGYPPRY--RNQKGSTS
        P P I + F+L++QE   R+     S+    S   A   +  NQ   R Q         ++K++ +CTHC+  GHTV+KC ++HG+PP Y  R + G TS
Subjt:  PEPTITRAFSLIAQEVEQRASVPPVSNVATPSTIEATTLLAKNQSSSRSQ-------VIKKKERPHCTHCNILGHTVDKCCKIHGYPPRY--RNQKGSTS

Query:  --------------------------VLNQRLP---------------------------------------VFPVSVLSNVKGFL--------PFSKHI
                                   L+Q LP                                       +F  S +S V G           F  H 
Subjt:  --------------------------VLNQRLP---------------------------------------VFPVSVLSNVKGFL--------PFSKHI

Query:  WRLRPNVVR-ILPRLPMEMHMLQVSGFSVTLPNQNRLIVDHVGIVQLSPQLVLKNVLFVPEFRFNLVSISALINSSPILLNFVGGYCLIQDRSSLKMIGK
        W L     R I     + ++M  VS   V LP+ + ++V+ +G VQL+  LVL NV +VPEF+FNLVS+SAL++ S  ++ F      IQDR  +  IGK
Subjt:  WRLRPNVVR-ILPRLPMEMHMLQVSGFSVTLPNQNRLIVDHVGIVQLSPQLVLKNVLFVPEFRFNLVSISALINSSPILLNFVGGYCLIQDRSSLKMIGK

Query:  AKTWQGLYLLN--------------------HEDL--LSENDSLTRASAFNVSCQKVSKCNSSLSSPLS------FHASGHIAANPFDLVHCDTWGPFNT
            QGLY+L+                    H  L  + +      A  F++S  K+S+ +     PL+      F  S  ++   FDL+HCD WGPF  
Subjt:  AKTWQGLYLLN--------------------HEDL--LSENDSLTRASAFNVSCQKVSKCNSSLSSPLS------FHASGHIAANPFDLVHCDTWGPFNT

Query:  PTHSGHQYFLSIVDDFSRYTWIFLMKKKSDVSHIIPHFFQYIETQYQRTIKCFRSDNAPELSFVEFFRTKGVVHHLSCVERPQQNSVVERKHQHLLNVAR
        P++SG  YF+++VDD+SR+TW+ L+K KS+V  ++P F + +  Q+ ++IK FRSDNA EL F   F   GV+H  SCV  PQQN++VERKHQH+LNVAR
Subjt:  PTHSGHQYFLSIVDDFSRYTWIFLMKKKSDVSHIIPHFFQYIETQYQRTIKCFRSDNAPELSFVEFFRTKGVVHHLSCVERPQQNSVVERKHQHLLNVAR

Query:  ALLFQSRVP
        +L FQS +P
Subjt:  ALLFQSRVP

XP_038895765.1 uncharacterized protein LOC120083929 [Benincasa hispida]2.8e-12042.06Show/hide
Query:  SSSAARPTNP---SVIDQYVNPYFLHHSDGTNLVLVSELLTESNYNSWSQAMLLGLMVKNKEGFVNGIITKPTNELLHSWKICNGVVKAWILNALTKEIA
        SSS   PT     S +DQY  PY LHHSD +NLVLVSELLT+ NY SWS++M+L L ++NK GF++G + +PT +LLH W   N VV +WIL +++K I+
Subjt:  SSSAARPTNP---SVIDQYVNPYFLHHSDGTNLVLVSELLTESNYNSWSQAMLLGLMVKNKEGFVNGIITKPTNELLHSWKICNGVVKAWILNALTKEIA

Query:  ASLNFSDTAREMWVDLQQRYQRRNRPRIFQLRRKISNLAQDQLSVFAYFAKLKALWNELSSYRPSCSCGNCSCGGVKELAKYFQTEHVMAFLMGLNESFG
        +S+ F+++A+ +W+DLQ  +QRRN PRIF L+R++S+L QDQ SV  YF K+K+  +E  SYRP C+CG C+CGG+K +  + Q E+++ F MGLN+SF 
Subjt:  ASLNFSDTAREMWVDLQQRYQRRNRPRIFQLRRKISNLAQDQLSVFAYFAKLKALWNELSSYRPSCSCGNCSCGGVKELAKYFQTEHVMAFLMGLNESFG

Query:  QIRTQLLLMEPEPTITRAFSLIAQEVEQRASVPPVSNVATPSTIEATTLLAKNQSSSRSQVIKKKERPHCTHCNILGHTVDKCCKIHGYPPRYRNQKGST
          R+QLLLM+P P + +AFS + Q+ + ++   P S+V         TL  +   +   Q  K K R +  H                     +    S 
Subjt:  QIRTQLLLMEPEPTITRAFSLIAQEVEQRASVPPVSNVATPSTIEATTLLAKNQSSSRSQVIKKKERPHCTHCNILGHTVDKCCKIHGYPPRYRNQKGST

Query:  SVLNQRLPVFPVSVLSNVKGFLPFSKHIWRLRPNVVRILPRLPMEMHMLQVSGFSVTLPNQNRLIVDHVGIVQLSPQLVLKNVLFVPEFRFNLVSISALI
        S +N      P+   S+     P           + ++L  L  +++M +    SVT                            V EF++NL+SISAL 
Subjt:  SVLNQRLPVFPVSVLSNVKGFLPFSKHIWRLRPNVVRILPRLPMEMHMLQVSGFSVTLPNQNRLIVDHVGIVQLSPQLVLKNVLFVPEFRFNLVSISALI

Query:  NSSPILLNFVGGYCLIQDRSSLKMIGKAKTWQGLYLLNHEDL--LSENDSLTRASAFNV-SCQKVSKCNSSLSSPLSFHASGHIAANPFDLVHCDTWGPF
            + ++F    C IQ +S+LK IGK +   G  LL H  L  LS    + R    ++ S      C       LSF    + ++  FDL+H + WGPF
Subjt:  NSSPILLNFVGGYCLIQDRSSLKMIGKAKTWQGLYLLNHEDL--LSENDSLTRASAFNV-SCQKVSKCNSSLSSPLSFHASGHIAANPFDLVHCDTWGPF

Query:  NTPTHSGHQYFLSIVDDFSRYTWIFLMKKKSDVSHIIPHFFQYIETQYQRTIKCFRSDNAPELSFVEFFRTKGVVHHLSCVERPQQNSVVERKHQHLLNV
        +TPTH GH++FL+IVDD SR+TW+F++K KS V  IIP FF Y+ETQY + IK FRSDNAPELSFVEFF  +GV+H  SCV RP+QNSVVERKHQHLLNV
Subjt:  NTPTHSGHQYFLSIVDDFSRYTWIFLMKKKSDVSHIIPHFFQYIETQYQRTIKCFRSDNAPELSFVEFFRTKGVVHHLSCVERPQQNSVVERKHQHLLNV

Query:  ARALLFQSRVP
        +RAL FQSR P
Subjt:  ARALLFQSRVP

TrEMBL top hitse value%identityAlignment
A0A2N9G1L6 Integrase catalytic domain-containing protein1.4e-11736.21Show/hide
Query:  SSSSAARPTNPSVIDQYV--NPYFLHHSDGTNLVLVSELLTESNYNSWSQAMLLGLMVKNKEGFVNGIITKP----TNELLHSWKICNGVVKAWILNALT
        S+ S +  +  + ID  +  +PY+LH SD ++L+LV+E LT  N++SW ++M + L +KNK GFV+G I +P     + L   W  CN VV  WILN ++
Subjt:  SSSSAARPTNPSVIDQYV--NPYFLHHSDGTNLVLVSELLTESNYNSWSQAMLLGLMVKNKEGFVNGIITKP----TNELLHSWKICNGVVKAWILNALT

Query:  KEIAASLNFSDTAREMWVDLQQRYQRRNRPRIFQLRRKISNLAQDQLSVFAYFAKLKALWNELSSYRPS--CSC-GNCSCGGVKELAKYFQTEHVMAFLM
        K+I AS+ +  TA  +W +LQ ++ + N P+IFQL + I +L Q+Q SV  Y+  L+ LW EL +Y P+  C+C  +CSCG + +  + ++   VM FLM
Subjt:  KEIAASLNFSDTAREMWVDLQQRYQRRNRPRIFQLRRKISNLAQDQLSVFAYFAKLKALWNELSSYRPS--CSC-GNCSCGGVKELAKYFQTEHVMAFLM

Query:  GLNESFGQIRTQLLLMEPEPTITRAFSLIAQEVEQRASVPPVSNVATPSTIEATTLLAKNQ----SSSRSQVIKKKERPHCTHCNILGHTVDKCCKIHGY
        GLNESF  +R Q+LLM+P P I + FSLI QE  QR S+  ++       +E+T L+ K        +R  +  KKERP CTHC +LGHTVDKC K+HG+
Subjt:  GLNESFGQIRTQLLLMEPEPTITRAFSLIAQEVEQRASVPPVSNVATPSTIEATTLLAKNQ----SSSRSQVIKKKERPHCTHCNILGHTVDKCCKIHGY

Query:  PPRYRNQKGSTSVLNQ------------------------------------------------------------RLPVFPVSVLSNV---------KG
        PP Y+ +  + +V NQ                                                                FP+S ++ +           
Subjt:  PPRYRNQKGSTSVLNQ------------------------------------------------------------RLPVFPVSVLSNV---------KG

Query:  FLP------FSKHI----------WRLRPNVV-RILPRLPMEMHMLQVSGFSVTLPNQNRLIVDHVGIVQLSPQLVLKNVLFVPEFRFNLVSISALINSS
        F P      FS H           W L       ++  L     +  +   +V LPN N + V H+G V+LS  L+L +VL VP F FNL+S+S L++SS
Subjt:  FLP------FSKHI----------WRLRPNVV-RILPRLPMEMHMLQVSGFSVTLPNQNRLIVDHVGIVQLSPQLVLKNVLFVPEFRFNLVSISALINSS

Query:  PILLNFVGGYCLIQDRSSLKMIGKAKTWQGLYLLNHEDL-LSENDSLTRASAFNVSCQKVSKCNSS--------LSSP----------------------
           L F+  YC IQ  +  +MIG  K   GLY+L+  +L LS + S +  S  +V+    S  N +        L  P                      
Subjt:  PILLNFVGGYCLIQDRSSLKMIGKAKTWQGLYLLNHEDL-LSENDSLTRASAFNVSCQKVSKCNSS--------LSSP----------------------

Query:  -------------LSFHASGHIAANPFDLVHCDTWGPFNTPTHSGHQYFLSIVDDFSRYTWIFLMKKKSDVSHIIPHFFQYIETQYQRTIKCFRSDNAPE
                     L F  +GH  ++ FDL+HCD WGP+  PTH G +YFL+IVDD SR TW++LM  K     ++  FF  IETQ+   IK  RSDN  E
Subjt:  -------------LSFHASGHIAANPFDLVHCDTWGPFNTPTHSGHQYFLSIVDDFSRYTWIFLMKKKSDVSHIIPHFFQYIETQYQRTIKCFRSDNAPE

Query:  LSFVEFFRTKGVVHHLSCVERPQQNSVVERKHQHLLNVARALLFQSRVPFNILG
            +FF +KGV+H  SCV+ PQQNSVVERKHQHLLNVARA+ FQS +P +  G
Subjt:  LSFVEFFRTKGVVHHLSCVERPQQNSVVERKHQHLLNVARALLFQSRVPFNILG

A0A2N9GZW3 Integrase catalytic domain-containing protein1.7e-11835.59Show/hide
Query:  LRFSLCKGFFCLPNGWDISSSSAARPTNPSVIDQYV-----NPYFLHHSDGTNLVLVSELLTESNYNSWSQAMLLGLMVKNKEGFVNGIITKPTNEL---
        + F   +G  C        ++S + P+ P+   +Y+     + Y+LHH D    +LVS+ L   NY++WS++M++ L  KNK GFVNG+I +P +E    
Subjt:  LRFSLCKGFFCLPNGWDISSSSAARPTNPSVIDQYV-----NPYFLHHSDGTNLVLVSELLTESNYNSWSQAMLLGLMVKNKEGFVNGIITKPTNEL---

Query:  LHSWKICNGVVKAWILNALTKEIAASLNFSDTAREMWVDLQQRYQRRNRPRIFQLRRKISNLAQDQLSVFAYFAKLKALWNELSSYRPSCSCGNCSCGGV
         ++W  CN +V +W+LN+L+KEIA+S+ +++TA+E+W DL++R+ + N PRIF++++ IS L+QD  SV +Y+ +LK+LW+ELS++RP     +CSCG +
Subjt:  LHSWKICNGVVKAWILNALTKEIAASLNFSDTAREMWVDLQQRYQRRNRPRIFQLRRKISNLAQDQLSVFAYFAKLKALWNELSSYRPSCSCGNCSCGGV

Query:  KELAKYFQTEHVMAFLMGLNESFGQIRTQLLLMEPEPTITRAFSLIAQEVEQR-----ASVPPVSNVATPSTIEATT-LLAKNQSSSRSQVIKKKERPHC
        K L    Q E+VM FLMGLN+SF  +R Q+L+ +P P+IT+AF+L+ QE  QR     +  P   +VA  +  EAT     KNQS        KK+RP C
Subjt:  KELAKYFQTEHVMAFLMGLNESFGQIRTQLLLMEPEPTITRAFSLIAQEVEQR-----ASVPPVSNVATPSTIEATT-LLAKNQSSSRSQVIKKKERPHC

Query:  THCNILGHTVDKCCKIHGYPPRYR-----------------------------------NQKGSTSVLNQRLPV--------------FPVSVLSNVKGF
        +HC I GHTVDKC K+HGYPP Y+                                   +Q    S+ + + PV               P    S +  F
Subjt:  THCNILGHTVDKCCKIHGYPPRYR-----------------------------------NQKGSTSVLNQRLPV--------------FPVSVLSNVKGF

Query:  LP------------------------FSKHIWRLRPNVV-RILPRLPMEMHMLQVSGFSVTLPNQNRLIVDHVGIVQLSPQLVLKNVLFVPEFRFNLVSI
        +                         FS   W L       ++  L     +       + LPN  +++  H+G VQ++  L+L +VL VP F FNL+SI
Subjt:  LP------------------------FSKHIWRLRPNVV-RILPRLPMEMHMLQVSGFSVTLPNQNRLIVDHVGIVQLSPQLVLKNVLFVPEFRFNLVSI

Query:  SALINSSPILLNFVGGYCLIQDRSSLKMIGKAKTWQGLYLLNHE---------DLLSENDSLTRASAFNV--------SCQKVS----------------
        S L N+    + F+  +C IQD  + K IG  +   GLY L             L++ + ++     F+V        S  ++S                
Subjt:  SALINSSPILLNFVGGYCLIQDRSSLKMIGKAKTWQGLYLLNHE---------DLLSENDSLTRASAFNV--------SCQKVS----------------

Query:  ---KCNSSLSSPLSFHASGHIAANPFDLVHCDTWGPFNTPTHSGHQYFLSIVDDFSRYTWIFLMKKKSDVSHIIPHFFQYIETQYQRTIKCFRSDNAPEL
            C+ S    L FH + H A  PFDL+HCD WGP++ PT    +YFL+IVDD +R TW+FLMK+KS+ S +I  FF  I+TQ+  +IK  RSDN PE 
Subjt:  ---KCNSSLSSPLSFHASGHIAANPFDLVHCDTWGPFNTPTHSGHQYFLSIVDDFSRYTWIFLMKKKSDVSHIIPHFFQYIETQYQRTIKCFRSDNAPEL

Query:  SFVEFFRTKGVVHHLSCVERPQQNSVVERKHQHLLNVARALLFQSRVPFNILG
            F+   G +H  SCV  PQQN+ VERKHQHLL VARAL FQ+ +P    G
Subjt:  SFVEFFRTKGVVHHLSCVERPQQNSVVERKHQHLLNVARALLFQSRVPFNILG

A0A2N9HYD2 Integrase catalytic domain-containing protein4.7e-12136.91Show/hide
Query:  ISSSSAARPTNPSVI--DQYVNPYFLHHSDGTNLVLVSELLTESNYNSWSQAMLLGLMVKNKEGFVNGIITKPTNELLHS---WKICNGVVKAWILNALT
        +++ ++A   +PS +  D+  N +FLHH D    +LVS+ L+  NY++WS++M++ L  KNK GF+NG I+ P ++ L S   W  CN +V +W+LN+++
Subjt:  ISSSSAARPTNPSVI--DQYVNPYFLHHSDGTNLVLVSELLTESNYNSWSQAMLLGLMVKNKEGFVNGIITKPTNELLHS---WKICNGVVKAWILNALT

Query:  KEIAASLNFSDTAREMWVDLQQRYQRRNRPRIFQLRRKISNLAQDQLSVFAYFAKLKALWNELSSYRPSCSCGNCSCGGVKELAKYFQTEHVMAFLMGLN
        KEIA+S+ +++TA+EMW DL++R+ + N PRIF++++ IS+L QDQ +V AYF KLK+LW+EL++YR   S   CSCG +K L    Q E+VM FLMGLN
Subjt:  KEIAASLNFSDTAREMWVDLQQRYQRRNRPRIFQLRRKISNLAQDQLSVFAYFAKLKALWNELSSYRPSCSCGNCSCGGVKELAKYFQTEHVMAFLMGLN

Query:  ESFGQIRTQLLLMEPEPTITRAFSLIAQEVEQRA-SVPPVSNVATPSTIEATTLLAKNQSSSRSQVIKKKERPHCTHCNILGHTVDKCCKIHGYPPRYRN
        +SF  +R Q+L+MEP P I +AFSL+ QE  QR+  V  + N      +   + + +N    RS    KKERP C+HC I GH VDKC K+HG+PP ++ 
Subjt:  ESFGQIRTQLLLMEPEPTITRAFSLIAQEVEQRA-SVPPVSNVATPSTIEATTLLAKNQSSSRSQVIKKKERPHCTHCNILGHTVDKCCKIHGYPPRYRN

Query:  QKGS-----TSVLNQRLPVFPVS------------------------VLSNVKGFLP-----------FSKHIWRLRPNVVRIL--------------PR
        +  S      SV+ +  P  P++                         L++ +  LP            S+      P+ V  +              P 
Subjt:  QKGS-----TSVLNQRLPVFPVS------------------------VLSNVKGFLP-----------FSKHIWRLRPNVVRIL--------------PR

Query:  LPMEM-------------------------HML-QVSGFS---------VTLPNQNRLIVDHVGIVQLSPQLVLKNVLFVPEFRFNLVSISALINSSPIL
        +P                            HM+  +S F+         + LPN  +++  H+G VQ+S  L L NVL VP F FNL+SI+ L +S P  
Subjt:  LPMEM-------------------------HML-QVSGFS---------VTLPNQNRLIVDHVGIVQLSPQLVLKNVLFVPEFRFNLVSISALINSSPIL

Query:  LNFVGGYCLIQDRSSLKMIGKAKTWQGLYLL-----------NHEDLLSENDSLTRASAFNV---------------------------SCQKVSKCNSS
        + F   +C IQD  S K IG AK   GLY+L           N    L    S    + FNV                           +    + CN S
Subjt:  LNFVGGYCLIQDRSSLKMIGKAKTWQGLYLL-----------NHEDLLSENDSLTRASAFNV---------------------------SCQKVSKCNSS

Query:  LSSPLSFHASGHIAANPFDLVHCDTWGPFNTPTHSGHQYFLSIVDDFSRYTWIFLMKKKSDVSHIIPHFFQYIETQYQRTIKCFRSDNAPELSFVEFFRT
            L F  S HI+  PFDL+HCD WGPF+ PT +  +YFL+IVDD +R TWIFLMK KS+   ++  FF  ++TQ+  +IK  RSDN  E S  EF+  
Subjt:  LSSPLSFHASGHIAANPFDLVHCDTWGPFNTPTHSGHQYFLSIVDDFSRYTWIFLMKKKSDVSHIIPHFFQYIETQYQRTIKCFRSDNAPELSFVEFFRT

Query:  KGVVHHLSCVERPQQNSVVERKHQHLLNVARALLFQSRVPFNILG
         G +H  SC+  PQQNS VERKHQHLL VAR+L FQ+ +P    G
Subjt:  KGVVHHLSCVERPQQNSVVERKHQHLLNVARALLFQSRVPFNILG

A0A438HDI8 Retrovirus-related Pol polyprotein from transposon TNT 1-947.5e-11937.82Show/hide
Query:  PTNP--SVIDQYVNPYFLHHSDGTNLVLVSELL--TESNYNSWSQAMLLGLMVKNKEGFVNGIITKP--TNELLHSWKICNGVVKAWILNALTKEIAASL
        P +P  S ++ + +PYFLH+ D  +L LVS  L  + SNY+SW ++M+  L  KNK GF++G I++P  T+ L   W  CN +V +W+ N++ KEIA S+
Subjt:  PTNP--SVIDQYVNPYFLHHSDGTNLVLVSELL--TESNYNSWSQAMLLGLMVKNKEGFVNGIITKP--TNELLHSWKICNGVVKAWILNALTKEIAASL

Query:  NFSDTAREMWVDLQQRYQRRNRPRIFQLRRKISNLAQDQLSVFAYFAKLKALWNELSSYRPSCSCGNCSCGGVKELAKYFQTEHVMAFLMGLNESFGQIR
         + +TA E+W DL +R+ + + PRIF+L++KI    Q    V  Y+ +LK+LW+EL  ++   +   C+CGG++   +  Q E VM FL+GLNESF  I+
Subjt:  NFSDTAREMWVDLQQRYQRRNRPRIFQLRRKISNLAQDQLSVFAYFAKLKALWNELSSYRPSCSCGNCSCGGVKELAKYFQTEHVMAFLMGLNESFGQIR

Query:  TQLLLMEPEPTITRAFSLIAQEVEQRA---SVPPVSNVATPSTIEATTLLAKNQSSSRSQVIKKKERPHCTHCNILGHTVDKCCKIHGYPPRYRNQ---K
         Q+LLMEP P + + FSL+ QE  QR+   S  P       S  +A +  +   +SSRS    +K+RP CTHCNILGHTVD+C KIHGY P +RN+   +
Subjt:  TQLLLMEPEPTITRAFSLIAQEVEQRA---SVPPVSNVATPSTIEATTLLAKNQSSSRSQVIKKKERPHCTHCNILGHTVDKCCKIHGYPPRYRNQ---K

Query:  GSTSVLNQRLP-----------------VFPVSV---------------------------------LSNVKGFLPFSKHIWRLRPNV--------VRIL
         + S  NQ LP                   P  +                                 +SN  G L  S     L P++          + 
Subjt:  GSTSVLNQRLP-----------------VFPVSV---------------------------------LSNVKGFLPFSKHIWRLRPNV--------VRIL

Query:  PRLPMEMHMLQVSGFSVTLPNQNRLIVDHVGIVQLSPQLVLKNVLFVPEFRFNLVSISALINSSPILLNFVGGYCLIQDRSSLKMIGKAKTWQGLYLLNH
            M   +   S  +VTLP   ++ +  +G + LSP LVL++VL++P F+FNL+SISAL  ++    +F   +C IQD S  K+IG  +    LYLL+ 
Subjt:  PRLPMEMHMLQVSGFSVTLPNQNRLIVDHVGIVQLSPQLVLKNVLFVPEFRFNLVSISALINSSPILLNFVGGYCLIQDRSSLKMIGKAKTWQGLYLLNH

Query:  EDLLS-----------------------ENDSLTRASAFNVSCQKVSKCNSSLS---------SPLSFHASGHIAANPFDLVHCDTWGPFNTPTHSGHQY
            S                        + S  + S      Q  S  N++LS           L F    +++++PFDL+HCD WGPF+ PTH G +Y
Subjt:  EDLLS-----------------------ENDSLTRASAFNVSCQKVSKCNSSLS---------SPLSFHASGHIAANPFDLVHCDTWGPFNTPTHSGHQY

Query:  FLSIVDDFSRYTWIFLMKKKSDVSHIIPHFFQYIETQYQRTIKCFRSDNAPELSFVEFFRTKGVVHHLSCVERPQQNSVVERKHQHLLNVARALLFQSRV
        FL+IVDD +R TW+ L++ KSDV  I P FF  ++T++  TIK  RSDNAPEL+    F    V+H  SCVE PQQNSVVERKHQH+LNVARAL FQS +
Subjt:  FLSIVDDFSRYTWIFLMKKKSDVSHIIPHFFQYIETQYQRTIKCFRSDNAPELSFVEFFRTKGVVHHLSCVERPQQNSVVERKHQHLLNVARALLFQSRV

Query:  PFNILG
        P    G
Subjt:  PFNILG

A0A5A7VE66 Cysteine-rich RLK (Receptor-like protein kinase) 88.0e-12939.25Show/hide
Query:  NGWDISSSSAARPTNPSVIDQYVNPYFLHHSDGTNLVLVSELLTES-NYNSWSQAMLLGLMVKNKEGFVNGIITKPTN-ELLHSWKICNGVVKAWILNAL
        NG      +AA   N    D  +NPYF+HHS G    +V++ LT + NY SWS+AML+ +  +NK GF+ G I KP++  LL +W   N ++ +WILN++
Subjt:  NGWDISSSSAARPTNPSVIDQYVNPYFLHHSDGTNLVLVSELLTES-NYNSWSQAMLLGLMVKNKEGFVNGIITKPTN-ELLHSWKICNGVVKAWILNAL

Query:  TKEIAASLNFSDTAREMWVDLQQRYQRRNRPRIFQLRRKISNLAQDQLSVFAYFAKLKALWNELSSYRPSCSCGNCSCGGVKELAKYFQTEHVMAFLMGL
        +KEIAAS+ +  + +E+W +L+QR+++ N P I+QLR++   L Q  L++  Y+ KLK +W  L+ YR      +C+CGG+K    + ++E++MAFLMGL
Subjt:  TKEIAASLNFSDTAREMWVDLQQRYQRRNRPRIFQLRRKISNLAQDQLSVFAYFAKLKALWNELSSYRPSCSCGNCSCGGVKELAKYFQTEHVMAFLMGL

Query:  NESFGQIRTQLLLMEPEPTITRAFSLIAQEVEQRAS---VPPVSNVATPSTIEATTLLAKNQSSSRSQVIKKKERPHCTHCNILGHTVDKCCKIHGYPPR
        N+S+  +R Q+LLM+P P+I   FSL+ QE +QR++    PP+  VA    I ++  ++ +++       +KKERP C++C I GH  DKC K HGYPP 
Subjt:  NESFGQIRTQLLLMEPEPTITRAFSLIAQEVEQRAS---VPPVSNVATPSTIEATTLLAKNQSSSRSQVIKKKERPHCTHCNILGHTVDKCCKIHGYPPR

Query:  Y--RNQKGST--------------------------------------SVLNQRL------PVFPVSVLSNVKGFLPFSKH------IWRLRPNVVR-IL
        Y  RN    T                                      ++LN  L      P+   + +++  G    + H       W +     R I 
Subjt:  Y--RNQKGST--------------------------------------SVLNQRL------PVFPVSVLSNVKGFLPFSKH------IWRLRPNVVR-IL

Query:  PRLPMEMHMLQVSGFSVTLPNQNRLIVDHVGIVQLSPQLVLKNVLFVPEFRFNLVSISALINSSPILLNFVGGYCLIQDRSSLKMIGKAKTWQGLYLLNH
            +  +    +   V LPN +R+ VD +G +Q++  L LK+VLFV +F +NL+S+S L+ +  I L+F    C+IQD S   MIGKA    GLY+LN 
Subjt:  PRLPMEMHMLQVSGFSVTLPNQNRLIVDHVGIVQLSPQLVLKNVLFVPEFRFNLVSISALINSSPILLNFVGGYCLIQDRSSLKMIGKAKTWQGLYLLNH

Query:  EDLLSENDSLTRASAFNVSC--QKVS----KCNSSLSSP-----------------------LSFHASGHIAANPFDLVHCDTWGPFNTPTHSGHQYFLS
        E   +   +  + +A +V    Q++     KC SSLSS                        LSFH++ ++A++PFDLVH D WGPF  P++ G++YFL+
Subjt:  EDLLSENDSLTRASAFNVSC--QKVS----KCNSSLSSP-----------------------LSFHASGHIAANPFDLVHCDTWGPFNTPTHSGHQYFLS

Query:  IVDDFSRYTWIFLMKKKSDVSHIIPHFFQYIETQYQRTIKCFRSDNAPELSFVEFFRTKGVVHHLSCVERPQQNSVVERKHQHLLNVARALLF
        +VDD  R+TW++++++KSDV HI+P FFQ IETQ+ + IK FRSDNAPEL   EFF  KG VH  SCVE+PQQNSVVERKHQHLLNVARAL F
Subjt:  IVDDFSRYTWIFLMKKKSDVSHIIPHFFQYIETQYQRTIKCFRSDNAPELSFVEFFRTKGVVHHLSCVERPQQNSVVERKHQHLLNVARALLF

SwissProt top hitse value%identityAlignment
P04146 Copia protein1.4e-1021.16Show/hide
Query:  NELLHSWKICNGVVKAWILNALTKEIAASLNFSDTAREMWVDLQQRYQRRNRPRIFQLRRKISNL-AQDQLSVFAYFAKLKALWNELSSYRPSCSCGNCS
        NE+  SWK      K+ I+  L+           TAR++  +L   Y+R++      LR+++ +L    ++S+ ++F     L +EL             
Subjt:  NELLHSWKICNGVVKAWILNALTKEIAASLNFSDTAREMWVDLQQRYQRRNRPRIFQLRRKISNL-AQDQLSVFAYFAKLKALWNELSSYRPSCSCGNCS

Query:  CGGVKELAKYFQTEHVMAFLMGLNESFGQIRTQLLLMEPEPTITRAF---SLIAQEVEQR-----ASVPPVSNVATPSTIEATTLLAKNQSSSRSQVIK-
           +   AK  + + +   L+ L   +  I T +  +  E  +T AF    L+ QE++ +      S   ++ +   +       L KN+ +   ++ K 
Subjt:  CGGVKELAKYFQTEHVMAFLMGLNESFGQIRTQLLLMEPEPTITRAF---SLIAQEVEQR-----ASVPPVSNVATPSTIEATTLLAKNQSSSRSQVIK-

Query:  -KKERPHCTHCNILGHTVDKCCKIHGYPPRYRNQKGSTSV--LNQRLPVFPVSVLSNVK-----GFL---PFSKHIWR---LRPNVVRILPRLPMEMHML
          K +  C HC   GH + K C  +      +N++    V         F V  ++N       GF+     S H+     L  + V ++P  P+++ + 
Subjt:  -KKERPHCTHCNILGHTVDKCCKIHGYPPRYRNQKGSTSV--LNQRLPVFPVSVLSNVK-----GFL---PFSKHIWR---LRPNVVRILPRLPMEMHML

Query:  QVSGFSVTLPNQNRLIVDHVGIVQL--SPQLVLKNVLFVPEFRFNLVSI-----------------------------SALINSSPILLNFVGGYCLIQD
        +   F         +     GIV+L    ++ L++VLF  E   NL+S+                             S ++N+ P+ +NF       + 
Subjt:  QVSGFSVTLPNQNRLIVDHVGIVQL--SPQLVLKNVLFVPEFRFNLVSI-----------------------------SALINSSPILLNFVGGYCLIQD

Query:  RSSLKM----IGKAKTWQGLYLLNHEDLLSENDSLTRASAFNVSCQKVSKCNSSLSSPLSFHA---SGHIAANPFDLVHCDTWGPFNTPTHSGHQYFLSI
        +++ ++     G     + L  +  +++ S+   L       +SC+    C +   + L F       HI   P  +VH D  GP    T     YF+  
Subjt:  RSSLKM----IGKAKTWQGLYLLNHEDLLSENDSLTRASAFNVSCQKVSKCNSSLSSPLSFHA---SGHIAANPFDLVHCDTWGPFNTPTHSGHQYFLSI

Query:  VDDFSRYTWIFLMKKKSDVSHIIPHFFQYIETQYQRTIKCFRSDNAPEL---SFVEFFRTKGVVHHLSCVERPQQNSVVERKHQHLLNVARALLFQSRVP
        VD F+ Y   +L+K KSDV  +   F    E  +   +     DN  E       +F   KG+ +HL+    PQ N V ER  + +   AR ++  +++ 
Subjt:  VDDFSRYTWIFLMKKKSDVSHIIPHFFQYIETQYQRTIKCFRSDNAPEL---SFVEFFRTKGVVHHLSCVERPQQNSVVERKHQHLLNVARALLFQSRVP

Query:  FNILG
         +  G
Subjt:  FNILG

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-943.9e-1630.56Show/hide
Query:  LSFHASGHIAANPFDLVHCDTWGPFNTPTHSGHQYFLSIVDDFSRYTWIFLMKKKSDVSHIIPHFFQYIETQYQRTIKCFRSDNAPELS---FVEFFRTK
        +SF  S     N  DLV+ D  GP    +  G++YF++ +DD SR  W++++K K  V  +   F   +E +  R +K  RSDN  E +   F E+  + 
Subjt:  LSFHASGHIAANPFDLVHCDTWGPFNTPTHSGHQYFLSIVDDFSRYTWIFLMKKKSDVSHIIPHFFQYIETQYQRTIKCFRSDNAPELS---FVEFFRTK

Query:  GVVHHLSCVERPQQNSVVERKHQHLLNVARALLFQSRVPFNILG
        G+ H  +    PQ N V ER ++ ++   R++L  +++P +  G
Subjt:  GVVHHLSCVERPQQNSVVERKHQHLLNVARALLFQSRVPFNILG

Q12491 Transposon Ty2-B Gag-Pol polyprotein1.5e-0724.03Show/hide
Query:  PFDLVHCDTWGPFNTPTHSGHQYFLSIVDDFSRYTWIFLM--KKKSDVSHIIPHFFQYIETQYQRTIKCFRSDNAPEL---SFVEFFRTKGVVHHLSCVE
        PF  +H D +GP +    S   YF+S  D+ +R+ W++ +  +++  + ++      +I+ Q+   +   + D   E    +  +FF  +G+    +   
Subjt:  PFDLVHCDTWGPFNTPTHSGHQYFLSIVDDFSRYTWIFLM--KKKSDVSHIIPHFFQYIETQYQRTIKCFRSDNAPEL---SFVEFFRTKGVVHHLSCVE

Query:  RPQQNSVVERKHQHLLNVARALLFQSRVP
          + + V ER ++ LLN  R LL  S +P
Subjt:  RPQQNSVVERKHQHLLNVARALLFQSRVP

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE16.9e-2921.17Show/hide
Query:  LTESNYNSWSQAMLLGLMVKNKEGFVNGIITKP-----------TNELLHSWKICNGVVKAWILNALTKEIAASLNFSDTAREMWVDLQQRYQRRNRPRI
        LT +NY  WS+ +          GF++G  T P            N     WK  + ++ + +L A++  +  +++ + TA ++W  L++ Y   +   +
Subjt:  LTESNYNSWSQAMLLGLMVKNKEGFVNGIITKP-----------TNELLHSWKICNGVVKAWILNALTKEIAASLNFSDTAREMWVDLQQRYQRRNRPRI

Query:  FQLRRKISNLAQDQLSVFAYFAKLKALWNELSSYRPSCSCGNCSCGGVKELAKYFQTEHVMAFLMGLNESFGQIRTQLLLMEPEPTITRAFSLIAQEVEQ
         QLR ++    +   ++  Y   L   +++L+                         E V   L  L E +  +  Q+   +  PT+T     +     +
Subjt:  FQLRRKISNLAQDQLSVFAYFAKLKALWNELSSYRPSCSCGNCSCGGVKELAKYFQTEHVMAFLMGLNESFGQIRTQLLLMEPEPTITRAFSLIAQEVEQ

Query:  -----RASVPPVSNVATPSTIEATTLLAKN--------------------QSSSRSQVIKKKERPH---CTHCNILGHTVDKCCKIHGYPPRYRNQKGST
              A+V P++  A       TT    N                    QSS+       + +P+   C  C + GH+  +C ++  +     +Q+  +
Subjt:  -----RASVPPVSNVATPSTIEATTLLAKN--------------------QSSSRSQVIKKKERPH---CTHCNILGHTVDKCCKIHGYPPRYRNQKGST

Query:  SVLNQRLPVFPVSVLSNVKGFLPFSKHIWRLRPNVVRILPR--LPMEMHMLQVSGFSVTLPNQNRLIVDHVGIVQLSPQ---LVLKNVLFVPEFRFNLVS
               P  P    +N+    P+S + W L       +      + +H     G  V + + + + + H G   LS +   L L N+L+VP    NL+S
Subjt:  SVLNQRLPVFPVSVLSNVKGFLPFSKHIWRLRPNVVRILPR--LPMEMHMLQVSGFSVTLPNQNRLIVDHVGIVQLSPQ---LVLKNVLFVPEFRFNLVS

Query:  ISALINSSPILLNFVGGYCLIQDRSSLKMIGKAKTWQGLY---------------------------LLNH--EDLLSENDSLTRASAFNVSCQ--KVSK
        +  L N++ + + F      ++D ++   + + KT   LY                            L H    +L+   S    S  N S +    S 
Subjt:  ISALINSSPILLNFVGGYCLIQDRSSLKMIGKAKTWQGLY---------------------------LLNH--EDLLSENDSLTRASAFNVSCQ--KVSK

Query:  CNSSLSSPLSFHASGHIAANPFDLVHCDTWGPFNTP--THSGHQYFLSIVDDFSRYTWIFLMKKKSDVSHIIPHFFQYIETQYQRTIKCFRSDNAPE-LS
        C  + S+ + F  S   +  P + ++ D W   ++P  +H  ++Y++  VD F+RYTW++ +K+KS V      F   +E ++Q  I  F SDN  E ++
Subjt:  CNSSLSSPLSFHASGHIAANPFDLVHCDTWGPFNTP--THSGHQYFLSIVDDFSRYTWIFLMKKKSDVSHIIPHFFQYIETQYQRTIKCFRSDNAPE-LS

Query:  FVEFFRTKGVVHHLSCVERPQQNSVVERKHQHLLNVARALLFQSRVP
          E+F   G+ H  S    P+ N + ERKH+H++     LL  + +P
Subjt:  FVEFFRTKGVVHHLSCVERPQQNSVVERKHQHLLNVARALLFQSRVP

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE26.7e-2421.73Show/hide
Query:  LTESNYNSWSQAMLLGLMVKNKEGFVNGIITKP-----------TNELLHSWKICNGVVKAWILNALTKEIAASLNFSDTAREMWVDLQQRYQRRNRPRI
        LT +NY  WS+ +          GF++G    P            N     W+  + ++ + IL A++  +  +++ + TA ++W  L++ Y   +   +
Subjt:  LTESNYNSWSQAMLLGLMVKNKEGFVNGIITKP-----------TNELLHSWKICNGVVKAWILNALTKEIAASLNFSDTAREMWVDLQQRYQRRNRPRI

Query:  FQLRRKISNLAQDQLSVFAYFAKLKALWNELSSYRPSCSCGNCSCGGVKELAKYFQTEHVMAFLMGLNESFGQIRTQLLLMEPEPTITRAFS-LIAQE--
         QLR                F +L  L   +                          E V   L  L + +  +  Q+   +  P++T     LI +E  
Subjt:  FQLRRKISNLAQDQLSVFAYFAKLKALWNELSSYRPSCSCGNCSCGGVKELAKYFQTEHVMAFLMGLNESFGQIRTQLLLMEPEPTITRAFS-LIAQE--

Query:  ---VEQRASVPPVSNVATPSTIEATTLLAKNQ---------------------SSSRSQVIKKKERPH---CTHCNILGHTVDKCCKIHGYPPRYRNQKG
           +     VP  +NV T       T   +NQ                     SSS S+   ++ +P+   C  C++ GH+  +C ++H +     NQ+ 
Subjt:  ---VEQRASVPPVSNVATPSTIEATTLLAKNQ---------------------SSSRSQVIKKKERPH---CTHCNILGHTVDKCCKIHGYPPRYRNQKG

Query:  STSVLNQRLPVFPVSVLSNVKGFLPFSKHIWRLRPNVVRILPR--LPMEMHMLQVSGFSVTLPNQNRLIVDHVGIVQL---SPQLVLKNVLFVPEFRFNL
        STS      P  P    +N+    P++ + W L       +      +  H     G  V + + + + + H G   L   S  L L  VL+VP    NL
Subjt:  STSVLNQRLPVFPVSVLSNVKGFLPFSKHIWRLRPNVVRILPR--LPMEMHMLQVSGFSVTLPNQNRLIVDHVGIVQL---SPQLVLKNVLFVPEFRFNL

Query:  VSISALINSSPILLNFVGGYCLIQDRSSLKMIGKAKTWQGLY---------------------------LLNHEDLLSENDSLTRAS--AFNVSCQ--KV
        +S+  L N++ + + F      ++D ++   + + KT   LY                            L H  L   N  ++  S    N S +    
Subjt:  VSISALINSSPILLNFVGGYCLIQDRSSLKMIGKAKTWQGLY---------------------------LLNHEDLLSENDSLTRAS--AFNVSCQ--KV

Query:  SKCNSSLSSPLSFHASGHIAANPFDLVHCDTWGPFNTPTHS--GHQYFLSIVDDFSRYTWIFLMKKKSDVSHIIPHFFQYIETQYQRTIKCFRSDNAPEL
        S C  + S  + F  S   ++ P + ++ D W   ++P  S   ++Y++  VD F+RYTW++ +K+KS V      F   +E ++Q  I    SDN  E 
Subjt:  SKCNSSLSSPLSFHASGHIAANPFDLVHCDTWGPFNTPTHS--GHQYFLSIVDDFSRYTWIFLMKKKSDVSHIIPHFFQYIETQYQRTIKCFRSDNAPEL

Query:  SFV-EFFRTKGVVHHLSCVERPQQNSVVERKHQHLLNVARALLFQSRVP
          + ++    G+ H  S    P+ N + ERKH+H++ +   LL  + VP
Subjt:  SFV-EFFRTKGVVHHLSCVERPQQNSVVERKHQHLLNVARALLFQSRVP

Arabidopsis top hitse value%identityAlignment
AT1G21280.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Has 707 Blast hits to 705 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 703; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).8.4e-3032.74Show/hide
Query:  SSSSAARPTNPSVIDQYVNPYFLHHSDGTNLVLVSELLTESNYNSWSQAMLLGLMVKNKEGFVNGIITKPT--NELLHSWKICNGVVKAWILNALTKEIA
        S S  + P +P     Y  P  +HH    ++  +S+   E NY +W       L V  K GF++G + KP   + L   W+ CN +V  W++N++T ++ 
Subjt:  SSSSAARPTNPSVIDQYVNPYFLHHSDGTNLVLVSELLTESNYNSWSQAMLLGLMVKNKEGFVNGIITKPT--NELLHSWKICNGVVKAWILNALTKEIA

Query:  ASLNFSDTAREMWVDLQQRYQRRNRPRIFQLRRKISNLAQDQLSVFAYFAKLKALWNELSSYR--PSCSCGNCSCGGVKELAKYFQTEHVMAFLMG--LN
         S+ +++TA +MW DL++ +      +I+QLRR+++ L Q   SV  YF KL  +W ELS Y   P C CG C+C   K   +  + E    FLMG  LN
Subjt:  ASLNFSDTAREMWVDLQQRYQRRNRPRIFQLRRKISNLAQDQLSVFAYFAKLKALWNELSSYR--PSCSCGNCSCGGVKELAKYFQTEHVMAFLMG--LN

Query:  ESFGQIRTQLLLMEPEPTITRAFSLI
        + F  + T+++  +P P++  AF+++
Subjt:  ESFGQIRTQLLLMEPEPTITRAFSLI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGACTCCGAAGGCGAAACAGGCCAAAGGGTCGGGCCAAGGCTGGAGGGGTCGGGCCTTGGCCCAACCCCTCCTCGGCCTCGGCCTGCTTGTGCGAGCCGAGCC
ACGTCTTCCCTCTCTCAAAAAAATTTACCGTTGGTGGCACGTGAAGGTCATTATTCCTTGCTCCGTTTCTCACTCTGCAAAGGTTTCTTCTGCTTGCCAAATGGC
TGGGACATTTCTTCTTCTTCCGCCGCGCGACCCACAAATCCTTCAGTAATCGATCAGTATGTTAACCCGTATTTTCTTCATCACTCTGATGGTACCAATCTTGTA
CTCGTCTCTGAATTACTTACAGAGTCGAATTACAATTCTTGGAGCCAAGCCATGCTTCTTGGCTTAATGGTGAAGAACAAAGAAGGATTCGTCAATGGAATCATT
ACGAAACCTACAAATGAACTGCTCCACTCTTGGAAGATTTGCAACGGCGTGGTCAAAGCTTGGATTCTCAATGCTTTAACAAAAGAAATTGCCGCGAGCCTGAAT
TTTTCCGATACCGCCAGAGAGATGTGGGTAGACCTTCAACAGCGGTATCAACGTAGGAACAGGCCTCGAATTTTCCAATTACGGCGGAAAATTTCCAATCTCGCT
CAAGATCAGCTTTCTGTTTTTGCGTACTTTGCGAAATTAAAGGCCCTGTGGAATGAATTGTCGTCTTATCGACCGTCTTGTTCCTGCGGTAACTGTTCTTGCGGA
GGTGTCAAAGAGCTGGCCAAATATTTCCAGACAGAGCATGTTATGGCCTTCTTGATGGGGTTAAACGAGTCTTTTGGACAAATTCGCACCCAGCTTCTTCTGATG
GAGCCTGAACCAACCATCACCCGAGCTTTTTCGCTCATTGCCCAGGAGGTTGAGCAACGAGCGTCTGTTCCACCGGTCTCGAATGTCGCCACTCCAAGCACAATT
GAGGCAACAACCCTTCTTGCAAAGAATCAAAGTTCTTCTCGCTCCCAAGTGATCAAGAAAAAGGAACGTCCCCATTGCACCCACTGCAATATCCTCGGACACACC
GTGGATAAGTGCTGTAAAATTCATGGTTACCCTCCAAGGTATCGCAACCAAAAGGGCAGTACTTCCGTCCTAAACCAGAGACTCCCAGTCTTTCCAGTCTCAGTA
CTGAGCAATGTCAAGGGCTTCTTGCCATTCTCCAAACACATTTGGCGTCTGCGGCCAAATGTAGTACGAATACTCCCGAGGCTTCCAATGGAAATGCACATGTTG
CAGGTCTCTGGATTTTCAGTCACTCTTCCCAATCAGAACAGATTGATAGTGGATCATGTTGGTATTGTACAATTATCCCCCCAACTTGTTTTGAAGAATGTTCTT
TTTGTGCCTGAGTTTCGGTTCAATCTTGTATCAATAAGCGCTCTTATTAATAGCTCACCAATCTTGCTGAATTTTGTTGGTGGTTATTGCCTCATTCAGGACAGA
TCCTCCTTGAAGATGATTGGCAAGGCTAAGACATGGCAAGGTTTATACTTACTCAACCATGAAGACTTATTGTCTGAAAATGATAGTCTTACTCGGGCTTCTGCC
TTCAATGTTTCTTGTCAAAAAGTTTCAAAGTGTAACTCTTCACTAAGCAGCCCTCTTTCTTTCCATGCTAGTGGTCATATTGCTGCTAATCCTTTTGATCTAGTA
CATTGTGACACATGGGGTCCGTTTAACACTCCTACACATTCTGGCCATCAGTACTTTCTTTCCATAGTAGATGATTTCTCCAGATATACTTGGATATTCTTGATG
AAGAAAAAGTCAGATGTTTCCCACATTATACCTCATTTTTTTCAGTACATAGAAACTCAATATCAAAGAACAATTAAGTGCTTCCGCTCAGACAATGCGCCTGAA
TTGTCCTTCGTTGAGTTTTTTCGGACTAAAGGGGTGGTGCATCATCTTTCTTGTGTTGAACGTCCCCAGCAGAACTCGGTTGTGGAAAGAAAGCATCAGCATTTG
TTAAATGTTGCTAGAGCCTTGTTGTTTCAATCTCGAGTCCCTTTTAACATTCTGGGGTGA
mRNA sequenceShow/hide mRNA sequence
ATGGACTCCGAAGGCGAAACAGGCCAAAGGGTCGGGCCAAGGCTGGAGGGGTCGGGCCTTGGCCCAACCCCTCCTCGGCCTCGGCCTGCTTGTGCGAGCCGAGCC
ACGTCTTCCCTCTCTCAAAAAAATTTACCGTTGGTGGCACGTGAAGGTCATTATTCCTTGCTCCGTTTCTCACTCTGCAAAGGTTTCTTCTGCTTGCCAAATGGC
TGGGACATTTCTTCTTCTTCCGCCGCGCGACCCACAAATCCTTCAGTAATCGATCAGTATGTTAACCCGTATTTTCTTCATCACTCTGATGGTACCAATCTTGTA
CTCGTCTCTGAATTACTTACAGAGTCGAATTACAATTCTTGGAGCCAAGCCATGCTTCTTGGCTTAATGGTGAAGAACAAAGAAGGATTCGTCAATGGAATCATT
ACGAAACCTACAAATGAACTGCTCCACTCTTGGAAGATTTGCAACGGCGTGGTCAAAGCTTGGATTCTCAATGCTTTAACAAAAGAAATTGCCGCGAGCCTGAAT
TTTTCCGATACCGCCAGAGAGATGTGGGTAGACCTTCAACAGCGGTATCAACGTAGGAACAGGCCTCGAATTTTCCAATTACGGCGGAAAATTTCCAATCTCGCT
CAAGATCAGCTTTCTGTTTTTGCGTACTTTGCGAAATTAAAGGCCCTGTGGAATGAATTGTCGTCTTATCGACCGTCTTGTTCCTGCGGTAACTGTTCTTGCGGA
GGTGTCAAAGAGCTGGCCAAATATTTCCAGACAGAGCATGTTATGGCCTTCTTGATGGGGTTAAACGAGTCTTTTGGACAAATTCGCACCCAGCTTCTTCTGATG
GAGCCTGAACCAACCATCACCCGAGCTTTTTCGCTCATTGCCCAGGAGGTTGAGCAACGAGCGTCTGTTCCACCGGTCTCGAATGTCGCCACTCCAAGCACAATT
GAGGCAACAACCCTTCTTGCAAAGAATCAAAGTTCTTCTCGCTCCCAAGTGATCAAGAAAAAGGAACGTCCCCATTGCACCCACTGCAATATCCTCGGACACACC
GTGGATAAGTGCTGTAAAATTCATGGTTACCCTCCAAGGTATCGCAACCAAAAGGGCAGTACTTCCGTCCTAAACCAGAGACTCCCAGTCTTTCCAGTCTCAGTA
CTGAGCAATGTCAAGGGCTTCTTGCCATTCTCCAAACACATTTGGCGTCTGCGGCCAAATGTAGTACGAATACTCCCGAGGCTTCCAATGGAAATGCACATGTTG
CAGGTCTCTGGATTTTCAGTCACTCTTCCCAATCAGAACAGATTGATAGTGGATCATGTTGGTATTGTACAATTATCCCCCCAACTTGTTTTGAAGAATGTTCTT
TTTGTGCCTGAGTTTCGGTTCAATCTTGTATCAATAAGCGCTCTTATTAATAGCTCACCAATCTTGCTGAATTTTGTTGGTGGTTATTGCCTCATTCAGGACAGA
TCCTCCTTGAAGATGATTGGCAAGGCTAAGACATGGCAAGGTTTATACTTACTCAACCATGAAGACTTATTGTCTGAAAATGATAGTCTTACTCGGGCTTCTGCC
TTCAATGTTTCTTGTCAAAAAGTTTCAAAGTGTAACTCTTCACTAAGCAGCCCTCTTTCTTTCCATGCTAGTGGTCATATTGCTGCTAATCCTTTTGATCTAGTA
CATTGTGACACATGGGGTCCGTTTAACACTCCTACACATTCTGGCCATCAGTACTTTCTTTCCATAGTAGATGATTTCTCCAGATATACTTGGATATTCTTGATG
AAGAAAAAGTCAGATGTTTCCCACATTATACCTCATTTTTTTCAGTACATAGAAACTCAATATCAAAGAACAATTAAGTGCTTCCGCTCAGACAATGCGCCTGAA
TTGTCCTTCGTTGAGTTTTTTCGGACTAAAGGGGTGGTGCATCATCTTTCTTGTGTTGAACGTCCCCAGCAGAACTCGGTTGTGGAAAGAAAGCATCAGCATTTG
TTAAATGTTGCTAGAGCCTTGTTGTTTCAATCTCGAGTCCCTTTTAACATTCTGGGGTGA
Protein sequenceShow/hide protein sequence
MDSEGETGQRVGPRLEGSGLGPTPPRPRPACASRATSSLSQKNLPLVAREGHYSLLRFSLCKGFFCLPNGWDISSSSAARPTNPSVIDQYVNPYFLHHSDGTNLV
LVSELLTESNYNSWSQAMLLGLMVKNKEGFVNGIITKPTNELLHSWKICNGVVKAWILNALTKEIAASLNFSDTAREMWVDLQQRYQRRNRPRIFQLRRKISNLA
QDQLSVFAYFAKLKALWNELSSYRPSCSCGNCSCGGVKELAKYFQTEHVMAFLMGLNESFGQIRTQLLLMEPEPTITRAFSLIAQEVEQRASVPPVSNVATPSTI
EATTLLAKNQSSSRSQVIKKKERPHCTHCNILGHTVDKCCKIHGYPPRYRNQKGSTSVLNQRLPVFPVSVLSNVKGFLPFSKHIWRLRPNVVRILPRLPMEMHML
QVSGFSVTLPNQNRLIVDHVGIVQLSPQLVLKNVLFVPEFRFNLVSISALINSSPILLNFVGGYCLIQDRSSLKMIGKAKTWQGLYLLNHEDLLSENDSLTRASA
FNVSCQKVSKCNSSLSSPLSFHASGHIAANPFDLVHCDTWGPFNTPTHSGHQYFLSIVDDFSRYTWIFLMKKKSDVSHIIPHFFQYIETQYQRTIKCFRSDNAPE
LSFVEFFRTKGVVHHLSCVERPQQNSVVERKHQHLLNVARALLFQSRVPFNILG