; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0024204 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0024204
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionTy3/gypsy retrotransposon protein
Genome locationchr10:1200901..1206804
RNA-Seq ExpressionLag0024204
SyntenyLag0024204
Gene Ontology termsNA
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR005162 - Retrotransposon gag domain
IPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0032774.1 putative retroelement pol polyprotein [Cucumis melo var. makuwa]7.3e-4429.25Show/hide
Query:  IFERFRPMQEGSICARFLVARQDNTVQEFRERFEELSAPLPHLTDEIMESTFFNGLDPEVKVEVLSFEPVDLAAFIKAAQRVEDKNLARNLKANP-NKGK
        ++ RFR  + G+ CARFL  +Q+ +V+E+ ++F+ELSAPLP + ++++E TF NGLDP ++ EV S   V L   ++AAQ VE+K      K  P  + K
Subjt:  IFERFRPMQEGSICARFLVARQDNTVQEFRERFEELSAPLPHLTDEIMESTFFNGLDPEVKVEVLSFEPVDLAAFIKAAQRVEDKNLARNLKANP-NKGK

Query:  STNSTSPNYELAKPNPKVIDATPIRSITLPTSSSPQH-RETPYKLTDT-KLKSKKEMGLFIDAIKKIMRP-------LKIISAAKKARVIFKIDFEKVYD
            T+P     KP+PK  +   ++ +TL            PY+     K + +K +   +D+   I+RP       L I+   K     F +D+     
Subjt:  STNSTSPNYELAKPNPKVIDATPIRSITLPTSSSPQH-RETPYKLTDT-KLKSKKEMGLFIDAIKKIMRP-------LKIISAAKKARVIFKIDFEKVYD

Query:  HVDWDYIDMVLAYKAPTLQKWRSVSVGHVSDTDTPRHRKIFVSGHSGVFKIKLGEFRSFPNNQRSTLGKVNLWPTQESRQNPESNAMLNTTR-HCASKVV
                             R+++   V D                 F I + +      N  S   K++L       +  E +      R H      
Subjt:  HVDWDYIDMVLAYKAPTLQKWRSVSVGHVSDTDTPRHRKIFVSGHSGVFKIKLGEFRSFPNNQRSTLGKVNLWPTQESRQNPESNAMLNTTR-HCASKVV

Query:  NVEPLSLASRCSKTASAGIPASQQWPNNSTRSTNGLTTFQSLMNRVFKPFLRRCLLVFFDDILIYSPDMQTNLTHLGMVFNVLRDNALFANQKKCVFAQE
         ++P  L                         TN   TFQ+LMN+VF P+LR+ +LVFFD IL+Y+ D +T+L HL +VF +LR+++LFAN+KKC F +E
Subjt:  NVEPLSLASRCSKTASAGIPASQQWPNNSTRSTNGLTTFQSLMNRVFKPFLRRCLLVFFDDILIYSPDMQTNLTHLGMVFNVLRDNALFANQKKCVFAQE

Query:  RIAYLGHWIFANGVSGVAILPLFITIVYNTRPDMQTNLTHLGMVFNVLRDNALFANQKKCVFAQERIAYLGHWIFTNGVEADGENIQAMT-----IVMNN
        RI YLGHW    GV                                                 QE+I  +  W     V  +      +T      V N 
Subjt:  RIAYLGHWIFANGVSGVAILPLFITIVYNTRPDMQTNLTHLGMVFNVLRDNALFANQKKCVFAQERIAYLGHWIFTNGVEADGENIQAMT-----IVMNN

Query:  GKIATPDTIVEE-----------------RRFPIQLPVLALPDFDQPFTIETDASGTGLGVVLTQKQRPVAFFSQSQVVHGKNQSTKE
        G IATP T + +                 ++  + LPVLALPDF QPF IETDASG GLG VL+Q +RP+A+FSQ      + +S  E
Subjt:  GKIATPDTIVEE-----------------RRFPIQLPVLALPDFDQPFTIETDASGTGLGVVLTQKQRPVAFFSQSQVVHGKNQSTKE

KAA0033837.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]1.3e-4524.76Show/hide
Query:  VMLDEAEEEEEEEKSQTKKPDEDEDEARRRRSRSKLKKTKPVGSWNKLKKTET-----VPPDDWLFRAAQYFTIKRLSDEEKVQVAAISFTEAVLQWLQW
        V+ +  +   +E ++ + K  E +   R  ++  + +  +  G  NK KK E        PD WLFRA +YF I +LS+ EK+ V+A+SF    L W + 
Subjt:  VMLDEAEEEEEEEKSQTKKPDEDEDEARRRRSRSKLKKTKPVGSWNKLKKTET-----VPPDDWLFRAAQYFTIKRLSDEEKVQVAAISFTEAVLQWLQW

Query:  AKGKEPFQGWKDLMNRIFERFRPMQEGSICARFLVARQDNTVQEFRERFEELSAPLPHLTDEIMESTFFNGLDPEVKVEVLSFEPVDLAAFIKAAQRVED
         + K+ F    +L  R+  RFR  ++G+IC +FL  RQ++TV+E+R  F++L APL  L + ++E TF NGL P ++ EV    P  LA  ++  Q VE+
Subjt:  AKGKEPFQGWKDLMNRIFERFRPMQEGSICARFLVARQDNTVQEFRERFEELSAPLPHLTDEIMESTFFNGLDPEVKVEVLSFEPVDLAAFIKAAQRVED

Query:  KNLARNLKANPN---KGK----STNSTSPNYELAKPNPKVIDATPIRSITLPTSSSPQHRE--TPYKLTDTKLKSKKEMGLFIDAIKKIMRPLKI-ISAA
        + + RN +AN N    GK    +T +T  + +    + K     PI+++T  +SS  ++R+  T  +L D + +++KE GL     +K     K  +   
Subjt:  KNLARNLKANPN---KGK----STNSTSPNYELAKPNPKVIDATPIRSITLPTSSSPQHRE--TPYKLTDTKLKSKKEMGLFIDAIKKIMRPLKI-ISAA

Query:  KKARVIFKIDFEKVYDHVDWDYIDM--VLAY----------KAPTLQKWRSVSVGHVSDTDTPRHRKIFVSG----------------------------
        ++ R+   ++ ++ ++ ++ + ++   +  Y          ++P   + + V   H+   +T  +  I  SG                            
Subjt:  KKARVIFKIDFEKVYDHVDWDYIDM--VLAY----------KAPTLQKWRSVSVGHVSDTDTPRHRKIFVSG----------------------------

Query:  -------------------------HSGVFKIKLGEFRSFPNNQRSTLGKVNL-------WPTQ-------------ESRQNP-----------------
                                  +G+    +    +    + S++  V         WP Q             +   NP                 
Subjt:  -------------------------HSGVFKIKLGEFRSFPNNQRSTLGKVNL-------WPTQ-------------ESRQNP-----------------

Query:  -------ESNAMLNTTRHCASKVVNVE------------------------PLSLASR-----CSKTASAGIP----------ASQQWPNNSTRS-----
               ES  +  +T   +S V+ V+                        P+ +        C  +  + I           A +     + R+     
Subjt:  -------ESNAMLNTTRHCASKVVNVE------------------------PLSLASR-----CSKTASAGIP----------ASQQWPNNSTRS-----

Query:  ---------TNGLTTFQSLMNRVFKPFLRRCLLVFFDDILIYSPDMQTNLTHLGMVFNVLRDNALFANQKKCVFAQERIAYLGHWIFANGVSGVAILPLF
                 TN   TFQ+LMN +FKPFLR+ +LVFFDDIL+YS +   +L HLG V ++LR + L+AN+KKC FAQ +I YLGH I   GV         
Subjt:  ---------TNGLTTFQSLMNRVFKPFLRRCLLVFFDDILIYSPDMQTNLTHLGMVFNVLRDNALFANQKKCVFAQERIAYLGHWIFANGVSGVAILPLF

Query:  ITIVYNTRPDMQTNLTHLGMVFNVLRDNALFANQKKCVFAQERIAYLG-HWIFTNGVEADGENIQAMTIVMNNGKIATPDTIV----EERRFPIQLPVLA
                P+   ++    +  N+                +E   +LG    +   V+  G     +T ++  G     D  +    + ++  + LPVLA
Subjt:  ITIVYNTRPDMQTNLTHLGMVFNVLRDNALFANQKKCVFAQERIAYLG-HWIFTNGVEADGENIQAMTIVMNNGKIATPDTIV----EERRFPIQLPVLA

Query:  LPDFDQPFTIETDASGTGLGVVLTQKQRPVAFFSQS
        LP+F QPF IETDASG G G +L Q  RP A++S +
Subjt:  LPDFDQPFTIETDASGTGLGVVLTQKQRPVAFFSQS

KAA0043539.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]3.3e-4425.48Show/hide
Query:  EEEKSQTKKPDEDEDEARRRRSRSKLKKTKPVGSWNKLKKTETVPPDDWLFRAAQYFTIKRLSDEEKVQVAAISFTEAVLQWLQWAKGKEPFQGWKDLMN
        EE K +TK  +E      R   RSK KK        ++   +   PD WLFRA +YF I  LSD EK+ VA ISF    L W +  + +E   GW DL  
Subjt:  EEEKSQTKKPDEDEDEARRRRSRSKLKKTKPVGSWNKLKKTETVPPDDWLFRAAQYFTIKRLSDEEKVQVAAISFTEAVLQWLQWAKGKEPFQGWKDLMN

Query:  RIFERFRPMQEGSICARFLVARQDNTVQEFRERFEELSAPLPHLTDEIMESTFFNGLDPEVKVEVLSFEPVDLAAFIKAAQRVEDKNLAR---------N
        ++  RFR  +EG++  RFL  +Q+ TV+E+R  F++L A +  L   ++E TF NGL+P +K EV + EP  LA  +K A ++E++ L R         +
Subjt:  RIFERFRPMQEGSICARFLVARQDNTVQEFRERFEELSAPLPHLTDEIMESTFFNGLDPEVKVEVLSFEPVDLAAFIKAAQRVEDKNLAR---------N

Query:  LKANPNKGKSTNSTSPNYELAKPNPKVIDATPIRSITL-PTSSSPQHRETPYK-LTDTKLKSKKEMGL--------------------------------
        +K   +K + T +T P    A        + P+R+ITL   ++    RE P K L+D + ++++E GL                                
Subjt:  LKANPNKGKSTNSTSPNYELAKPNPKVIDATPIRSITL-PTSSSPQHRETPYK-LTDTKLKSKKEMGL--------------------------------

Query:  -------FIDAIKKIMRPLKIISAAKKARVIFKIDFEKVYDHVDWDYIDMV-----------LAYKAPTLQKWRSVSVGHVSDTDTPRHRKIFVSGHSGV
               F DA +  M+ +++     +  V+  ID    ++ +  + +  +           +   + T  K + V        +  + RK+ + G   +
Subjt:  -------FIDAIKKIMRPLKIISAAKKARVIFKIDFEKVYDHVDWDYIDMV-----------LAYKAPTLQKWRSVSVGHVSDTDTPRHRKIFVSGHSGV

Query:  FKIK-----------------LGEFRSF------PNNQRSTLGKVNL---------------WPT-------------------------------QESR
         K +                 L E R+        + Q    G+V++               WPT                               Q+  
Subjt:  FKIK-----------------LGEFRSF------PNNQRSTLGKVNL---------------WPT-------------------------------QESR

Query:  QNPESNAMLNTTRHCASKVVNVEPLSL----------------------------------------ASRCSKT-ASAGIPASQQWPNNSTRS-------
             + ML++   C SK     P+ L                                        AS  SK    AG    +  P +  ++       
Subjt:  QNPESNAMLNTTRHCASKVVNVEPLSL----------------------------------------ASRCSKT-ASAGIPASQQWPNNSTRS-------

Query:  -----------TNGLTTFQSLMNRVFKPFLRRCLLVFFDDILIYSPDMQTNLTHLGMVFNVLRDNALFANQKKCVFAQERIAYLGHWIFANGVSGVAILP
                   TN  +TFQ+LMN+VFKP+LRR +LVFFDDIL+YS  M+ ++ HL +V  +L++  L+AN +KC FA+ RI+YLGH+I   G+       
Subjt:  -----------TNGLTTFQSLMNRVFKPFLRRCLLVFFDDILIYSPDMQTNLTHLGMVFNVLRDNALFANQKKCVFAQERIAYLGHWIFANGVSGVAILP

Query:  LFITIVYNTRPDMQTNLTHLGMVFNVLRDNALFANQKKCVFAQERIAYLGHWIFTNGVEADGENIQAMTIVMNNGKI---ATPDTIVEERRFPIQLPVLA
                  P+    ++      NV +                 +   G+  +   V+  G     +T ++  G     A  +T   + +  +  PVL 
Subjt:  LFITIVYNTRPDMQTNLTHLGMVFNVLRDNALFANQKKCVFAQERIAYLGHWIFTNGVEADGENIQAMTIVMNNGKI---ATPDTIVEERRFPIQLPVLA

Query:  LPDFDQPFTIETDASGTGLGVVLTQKQRPVAFFSQS
        +PDF+ PF IE+DASG GLG VLTQ ++ VA+FS++
Subjt:  LPDFDQPFTIETDASGTGLGVVLTQKQRPVAFFSQS

KAA0063300.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]7.8e-4625.36Show/hide
Query:  PDDWLFRAAQYFTIKRLSDEEKVQVAAISFTEAVLQWLQWAKGKEPFQGWKDLMNRIFERFRPMQEGSICARFLVARQDNTVQEFRERFEELSAPLPHLT
        PD WLFRA +YF I  LSD EK+ VA ISF    L W +  + +E F+GW+DL  ++  RFR + +G++  RFL  +Q+ TV+E+R RF++  AP+  L 
Subjt:  PDDWLFRAAQYFTIKRLSDEEKVQVAAISFTEAVLQWLQWAKGKEPFQGWKDLMNRIFERFRPMQEGSICARFLVARQDNTVQEFRERFEELSAPLPHLT

Query:  DEIMESTFFNGLDPEVKVEVLSFEPVDLAAFIKAAQRVEDKNLARN----LKANPNKG--KSTNSTSPNYELAKPNPKVIDATPIRSITL-PTSSSPQHR
          ++E TF NGL P +K EV ++EPV LA  +K A ++E++ + R       A  +K   ++TN+ + N      N       P+R+ITL   ++    R
Subjt:  DEIMESTFFNGLDPEVKVEVLSFEPVDLAAFIKAAQRVEDKNLARN----LKANPNKG--KSTNSTSPNYELAKPNPKVIDATPIRSITL-PTSSSPQHR

Query:  ETPYK-LTDTKLKSKKEMGLFIDAIKKIMRPLKI-ISAAKKARVIFKIDFEKVYDHVDWDYIDMVLAYKAPTLQKWRS----VSVGHVSDTDTPRHRKIF
        E P K LTD + ++++E GL     +K     +     +K+ R++   +  +  + ++ +Y D     K P +Q   +    +S+  V     PR  K  
Subjt:  ETPYK-LTDTKLKSKKEMGLFIDAIKKIMRPLKI-ISAAKKARVIFKIDFEKVYDHVDWDYIDMVLAYKAPTLQKWRS----VSVGHVSDTDTPRHRKIF

Query:  VSGHSGVFKI---------------KLGEFRSFPNNQRSTLG----------------KVNLW-------------------------------------
        V G  G  ++               KL E    P  +    G                KV LW                                     
Subjt:  VSGHSGVFKI---------------KLGEFRSFPNNQRSTLG----------------KVNLW-------------------------------------

Query:  ---------------------PTQESRQNPESNAMLNTTRH---------------------------------CASKVVNVEPLSLASR--------CS
                               +E  ++ E++     T                                     +  VNV P   A            
Subjt:  ---------------------PTQESRQNPESNAMLNTTRH---------------------------------CASKVVNVEPLSLASR--------CS

Query:  KTASAGI--PASQQWP----------------------NNST----------------------------------------------------------
        +  ++GI  P++  +                       NN T                                                          
Subjt:  KTASAGI--PASQQWP----------------------NNST----------------------------------------------------------

Query:  ---RSTNGLTTFQSLMNRVFKPFLRRCLLVFFDDILIYSPDMQTNLTHLGMVFNVLRDNALFANQKKCVFAQERIAYLGHWIFANGVSGVAILPLFITIV
             TN  +TFQ+LMN+VFKPFLRR +LVFFDDIL+YS  M+ +  HL +V  +LR++ L+AN  KC FA+ RI+YLGH+I    + G+ + P  I  V
Subjt:  ---RSTNGLTTFQSLMNRVFKPFLRRCLLVFFDDILIYSPDMQTNLTHLGMVFNVLRDNALFANQKKCVFAQERIAYLGHWIFANGVSGVAILPLFITIV

Query:  --YNTRPDMQTNLTHLGMVFNVLRDNALFANQKKCVFAQERIAYLGHWIFTNGVEADGENIQAMTIVMNNGKIATPDTIVEERRFPIQLPVLALPDFDQP
          +    +++     LG+     R      N         ++   G + +T   EA  E +                     ++  + LPVLA+PDF+ P
Subjt:  --YNTRPDMQTNLTHLGMVFNVLRDNALFANQKKCVFAQERIAYLGHWIFTNGVEADGENIQAMTIVMNNGKIATPDTIVEERRFPIQLPVLALPDFDQP

Query:  FTIETDASGTGLGVVLTQKQRPVAFFSQ
        F IE+DASG G+G VL Q +RPVA+FS+
Subjt:  FTIETDASGTGLGVVLTQKQRPVAFFSQ

XP_031745419.1 uncharacterized protein LOC116405630 [Cucumis sativus]2.4e-4726.51Show/hide
Query:  NKLKKTET-----VPPDDWLFRAAQYFTIKRLSDEEKVQVAAISFTEAVLQWLQWAKGKEPFQGWKDLMNRIFERFRPMQEGSICARFLVARQDNTVQEF
        +K KK E      V P+ W +RA  +F I  L + EKV+VA +SF +  + W +W+  ++  + W+DL  R+F+ F    + S+ AR +  +Q+ +  E+
Subjt:  NKLKKTET-----VPPDDWLFRAAQYFTIKRLSDEEKVQVAAISFTEAVLQWLQWAKGKEPFQGWKDLMNRIFERFRPMQEGSICARFLVARQDNTVQEF

Query:  RERFEELSAPLPHLTDEIMESTFFNGLDPEVKVEVLSFEPVDLAAFIKAAQRVEDKNLARNLKANP----NKGKSTNSTSPNYELAKPNPKVIDATPIRS
         ++F   SAPLP + + ++   F  GL+P ++ EV+S  P  L   +K AQ   D+N+A  L           ++ NS+    +  K   K  +   ++ 
Subjt:  RERFEELSAPLPHLTDEIMESTFFNGLDPEVKVEVLSFEPVDLAAFIKAAQRVEDKNLARNLKANP----NKGKSTNSTSPNYELAKPNPKVIDATPIRS

Query:  ITLPTSSSPQHRETPYK-LTDTKLKSKKEMGLFIDAIKKIMRPLKIISAAKKARVIFKIDFEKVYDH-------------VDWDYIDM------------
        +T+P  +  Q ++ P K L+DT+ +++ + GL     +K     +     K+  ++  ++ E+  DH             ++ +++++            
Subjt:  ITLPTSSSPQHRETPYK-LTDTKLKSKKEMGLFIDAIKKIMRPLKIISAAKKARVIFKIDFEKVYDH-------------VDWDYIDM------------

Query:  -VLAYKAPTLQKWRSVSVGHVSDTDTPRHRKIFVSGHSGVFKIKLGEFRSFPNNQRSTLGKVNL-----WPTQESRQN-----PESNAMLNTTRHCA-SK
         V +     L+   S+  G           +   SG     K+KL E     +     LG V+L     W             P   A+ +     A  K
Subjt:  -VLAYKAPTLQKWRSVSVGHVSDTDTPRHRKIFVSGHSGVFKIKLGEFRSFPNNQRSTLGKVNL-----WPTQESRQN-----PESNAMLNTTRHCA-SK

Query:  VVNVEPL--------SLASRCSKTASAGIPASQQWPNNST------------------------------------------------------------
         +NV P          +     +   AG+    + P +S                                                             
Subjt:  VVNVEPL--------SLASRCSKTASAGIPASQQWPNNST------------------------------------------------------------

Query:  -------------------------RSTNGLTTFQSLMNRVFKPFLRRCLLVFFDDILIYSPDMQTNLTHLGMVFNVLRDNALFANQKKCVFAQERIAYL
                                   TN   TFQSLMN VFKPFLRRC+LVFF DIL+YS D+  ++ HLGMVF +LRD+ LFAN+ KCV A  ++ YL
Subjt:  -------------------------RSTNGLTTFQSLMNRVFKPFLRRCLLVFFDDILIYSPDMQTNLTHLGMVFNVLRDNALFANQKKCVFAQERIAYL

Query:  GHWIFANGVSGVAILPLFITIVYNTRPDMQTNLT-HLGMVFNVLRDNALFANQKKCVFAQERIAYLGHWIFTNGVEADGENIQAMTIVMNNGKIATPDTI
        GH I + GV   A      ++V   RP   T L   LG+     R    +          E  A L   +  N    + E     TI  +  K+A     
Subjt:  GHWIFANGVSGVAILPLFITIVYNTRPDMQTNLT-HLGMVFNVLRDNALFANQKKCVFAQERIAYLGHWIFTNGVEADGENIQAMTIVMNNGKIATPDTI

Query:  VEERRFPIQLPVLALPDFDQPFTIETDASGTGLGVVLTQKQRPVAFFSQSQVVHGKNQSTKE
                 LPVLALPD+ QPFTIETDASG GLG VL+Q   P+AFFSQ      + +S  E
Subjt:  VEERRFPIQLPVLALPDFDQPFTIETDASGTGLGVVLTQKQRPVAFFSQSQVVHGKNQSTKE

TrEMBL top hitse value%identityAlignment
A0A5A7SNK3 Putative retroelement pol polyprotein3.5e-4429.25Show/hide
Query:  IFERFRPMQEGSICARFLVARQDNTVQEFRERFEELSAPLPHLTDEIMESTFFNGLDPEVKVEVLSFEPVDLAAFIKAAQRVEDKNLARNLKANP-NKGK
        ++ RFR  + G+ CARFL  +Q+ +V+E+ ++F+ELSAPLP + ++++E TF NGLDP ++ EV S   V L   ++AAQ VE+K      K  P  + K
Subjt:  IFERFRPMQEGSICARFLVARQDNTVQEFRERFEELSAPLPHLTDEIMESTFFNGLDPEVKVEVLSFEPVDLAAFIKAAQRVEDKNLARNLKANP-NKGK

Query:  STNSTSPNYELAKPNPKVIDATPIRSITLPTSSSPQH-RETPYKLTDT-KLKSKKEMGLFIDAIKKIMRP-------LKIISAAKKARVIFKIDFEKVYD
            T+P     KP+PK  +   ++ +TL            PY+     K + +K +   +D+   I+RP       L I+   K     F +D+     
Subjt:  STNSTSPNYELAKPNPKVIDATPIRSITLPTSSSPQH-RETPYKLTDT-KLKSKKEMGLFIDAIKKIMRP-------LKIISAAKKARVIFKIDFEKVYD

Query:  HVDWDYIDMVLAYKAPTLQKWRSVSVGHVSDTDTPRHRKIFVSGHSGVFKIKLGEFRSFPNNQRSTLGKVNLWPTQESRQNPESNAMLNTTR-HCASKVV
                             R+++   V D                 F I + +      N  S   K++L       +  E +      R H      
Subjt:  HVDWDYIDMVLAYKAPTLQKWRSVSVGHVSDTDTPRHRKIFVSGHSGVFKIKLGEFRSFPNNQRSTLGKVNLWPTQESRQNPESNAMLNTTR-HCASKVV

Query:  NVEPLSLASRCSKTASAGIPASQQWPNNSTRSTNGLTTFQSLMNRVFKPFLRRCLLVFFDDILIYSPDMQTNLTHLGMVFNVLRDNALFANQKKCVFAQE
         ++P  L                         TN   TFQ+LMN+VF P+LR+ +LVFFD IL+Y+ D +T+L HL +VF +LR+++LFAN+KKC F +E
Subjt:  NVEPLSLASRCSKTASAGIPASQQWPNNSTRSTNGLTTFQSLMNRVFKPFLRRCLLVFFDDILIYSPDMQTNLTHLGMVFNVLRDNALFANQKKCVFAQE

Query:  RIAYLGHWIFANGVSGVAILPLFITIVYNTRPDMQTNLTHLGMVFNVLRDNALFANQKKCVFAQERIAYLGHWIFTNGVEADGENIQAMT-----IVMNN
        RI YLGHW    GV                                                 QE+I  +  W     V  +      +T      V N 
Subjt:  RIAYLGHWIFANGVSGVAILPLFITIVYNTRPDMQTNLTHLGMVFNVLRDNALFANQKKCVFAQERIAYLGHWIFTNGVEADGENIQAMT-----IVMNN

Query:  GKIATPDTIVEE-----------------RRFPIQLPVLALPDFDQPFTIETDASGTGLGVVLTQKQRPVAFFSQSQVVHGKNQSTKE
        G IATP T + +                 ++  + LPVLALPDF QPF IETDASG GLG VL+Q +RP+A+FSQ      + +S  E
Subjt:  GKIATPDTIVEE-----------------RRFPIQLPVLALPDFDQPFTIETDASGTGLGVVLTQKQRPVAFFSQSQVVHGKNQSTKE

A0A5A7SX47 Ty3/gypsy retrotransposon protein6.4e-4624.76Show/hide
Query:  VMLDEAEEEEEEEKSQTKKPDEDEDEARRRRSRSKLKKTKPVGSWNKLKKTET-----VPPDDWLFRAAQYFTIKRLSDEEKVQVAAISFTEAVLQWLQW
        V+ +  +   +E ++ + K  E +   R  ++  + +  +  G  NK KK E        PD WLFRA +YF I +LS+ EK+ V+A+SF    L W + 
Subjt:  VMLDEAEEEEEEEKSQTKKPDEDEDEARRRRSRSKLKKTKPVGSWNKLKKTET-----VPPDDWLFRAAQYFTIKRLSDEEKVQVAAISFTEAVLQWLQW

Query:  AKGKEPFQGWKDLMNRIFERFRPMQEGSICARFLVARQDNTVQEFRERFEELSAPLPHLTDEIMESTFFNGLDPEVKVEVLSFEPVDLAAFIKAAQRVED
         + K+ F    +L  R+  RFR  ++G+IC +FL  RQ++TV+E+R  F++L APL  L + ++E TF NGL P ++ EV    P  LA  ++  Q VE+
Subjt:  AKGKEPFQGWKDLMNRIFERFRPMQEGSICARFLVARQDNTVQEFRERFEELSAPLPHLTDEIMESTFFNGLDPEVKVEVLSFEPVDLAAFIKAAQRVED

Query:  KNLARNLKANPN---KGK----STNSTSPNYELAKPNPKVIDATPIRSITLPTSSSPQHRE--TPYKLTDTKLKSKKEMGLFIDAIKKIMRPLKI-ISAA
        + + RN +AN N    GK    +T +T  + +    + K     PI+++T  +SS  ++R+  T  +L D + +++KE GL     +K     K  +   
Subjt:  KNLARNLKANPN---KGK----STNSTSPNYELAKPNPKVIDATPIRSITLPTSSSPQHRE--TPYKLTDTKLKSKKEMGLFIDAIKKIMRPLKI-ISAA

Query:  KKARVIFKIDFEKVYDHVDWDYIDM--VLAY----------KAPTLQKWRSVSVGHVSDTDTPRHRKIFVSG----------------------------
        ++ R+   ++ ++ ++ ++ + ++   +  Y          ++P   + + V   H+   +T  +  I  SG                            
Subjt:  KKARVIFKIDFEKVYDHVDWDYIDM--VLAY----------KAPTLQKWRSVSVGHVSDTDTPRHRKIFVSG----------------------------

Query:  -------------------------HSGVFKIKLGEFRSFPNNQRSTLGKVNL-------WPTQ-------------ESRQNP-----------------
                                  +G+    +    +    + S++  V         WP Q             +   NP                 
Subjt:  -------------------------HSGVFKIKLGEFRSFPNNQRSTLGKVNL-------WPTQ-------------ESRQNP-----------------

Query:  -------ESNAMLNTTRHCASKVVNVE------------------------PLSLASR-----CSKTASAGIP----------ASQQWPNNSTRS-----
               ES  +  +T   +S V+ V+                        P+ +        C  +  + I           A +     + R+     
Subjt:  -------ESNAMLNTTRHCASKVVNVE------------------------PLSLASR-----CSKTASAGIP----------ASQQWPNNSTRS-----

Query:  ---------TNGLTTFQSLMNRVFKPFLRRCLLVFFDDILIYSPDMQTNLTHLGMVFNVLRDNALFANQKKCVFAQERIAYLGHWIFANGVSGVAILPLF
                 TN   TFQ+LMN +FKPFLR+ +LVFFDDIL+YS +   +L HLG V ++LR + L+AN+KKC FAQ +I YLGH I   GV         
Subjt:  ---------TNGLTTFQSLMNRVFKPFLRRCLLVFFDDILIYSPDMQTNLTHLGMVFNVLRDNALFANQKKCVFAQERIAYLGHWIFANGVSGVAILPLF

Query:  ITIVYNTRPDMQTNLTHLGMVFNVLRDNALFANQKKCVFAQERIAYLG-HWIFTNGVEADGENIQAMTIVMNNGKIATPDTIV----EERRFPIQLPVLA
                P+   ++    +  N+                +E   +LG    +   V+  G     +T ++  G     D  +    + ++  + LPVLA
Subjt:  ITIVYNTRPDMQTNLTHLGMVFNVLRDNALFANQKKCVFAQERIAYLG-HWIFTNGVEADGENIQAMTIVMNNGKIATPDTIV----EERRFPIQLPVLA

Query:  LPDFDQPFTIETDASGTGLGVVLTQKQRPVAFFSQS
        LP+F QPF IETDASG G G +L Q  RP A++S +
Subjt:  LPDFDQPFTIETDASGTGLGVVLTQKQRPVAFFSQS

A0A5A7TQU2 Ty3/gypsy retrotransposon protein1.6e-4425.48Show/hide
Query:  EEEKSQTKKPDEDEDEARRRRSRSKLKKTKPVGSWNKLKKTETVPPDDWLFRAAQYFTIKRLSDEEKVQVAAISFTEAVLQWLQWAKGKEPFQGWKDLMN
        EE K +TK  +E      R   RSK KK        ++   +   PD WLFRA +YF I  LSD EK+ VA ISF    L W +  + +E   GW DL  
Subjt:  EEEKSQTKKPDEDEDEARRRRSRSKLKKTKPVGSWNKLKKTETVPPDDWLFRAAQYFTIKRLSDEEKVQVAAISFTEAVLQWLQWAKGKEPFQGWKDLMN

Query:  RIFERFRPMQEGSICARFLVARQDNTVQEFRERFEELSAPLPHLTDEIMESTFFNGLDPEVKVEVLSFEPVDLAAFIKAAQRVEDKNLAR---------N
        ++  RFR  +EG++  RFL  +Q+ TV+E+R  F++L A +  L   ++E TF NGL+P +K EV + EP  LA  +K A ++E++ L R         +
Subjt:  RIFERFRPMQEGSICARFLVARQDNTVQEFRERFEELSAPLPHLTDEIMESTFFNGLDPEVKVEVLSFEPVDLAAFIKAAQRVEDKNLAR---------N

Query:  LKANPNKGKSTNSTSPNYELAKPNPKVIDATPIRSITL-PTSSSPQHRETPYK-LTDTKLKSKKEMGL--------------------------------
        +K   +K + T +T P    A        + P+R+ITL   ++    RE P K L+D + ++++E GL                                
Subjt:  LKANPNKGKSTNSTSPNYELAKPNPKVIDATPIRSITL-PTSSSPQHRETPYK-LTDTKLKSKKEMGL--------------------------------

Query:  -------FIDAIKKIMRPLKIISAAKKARVIFKIDFEKVYDHVDWDYIDMV-----------LAYKAPTLQKWRSVSVGHVSDTDTPRHRKIFVSGHSGV
               F DA +  M+ +++     +  V+  ID    ++ +  + +  +           +   + T  K + V        +  + RK+ + G   +
Subjt:  -------FIDAIKKIMRPLKIISAAKKARVIFKIDFEKVYDHVDWDYIDMV-----------LAYKAPTLQKWRSVSVGHVSDTDTPRHRKIFVSGHSGV

Query:  FKIK-----------------LGEFRSF------PNNQRSTLGKVNL---------------WPT-------------------------------QESR
         K +                 L E R+        + Q    G+V++               WPT                               Q+  
Subjt:  FKIK-----------------LGEFRSF------PNNQRSTLGKVNL---------------WPT-------------------------------QESR

Query:  QNPESNAMLNTTRHCASKVVNVEPLSL----------------------------------------ASRCSKT-ASAGIPASQQWPNNSTRS-------
             + ML++   C SK     P+ L                                        AS  SK    AG    +  P +  ++       
Subjt:  QNPESNAMLNTTRHCASKVVNVEPLSL----------------------------------------ASRCSKT-ASAGIPASQQWPNNSTRS-------

Query:  -----------TNGLTTFQSLMNRVFKPFLRRCLLVFFDDILIYSPDMQTNLTHLGMVFNVLRDNALFANQKKCVFAQERIAYLGHWIFANGVSGVAILP
                   TN  +TFQ+LMN+VFKP+LRR +LVFFDDIL+YS  M+ ++ HL +V  +L++  L+AN +KC FA+ RI+YLGH+I   G+       
Subjt:  -----------TNGLTTFQSLMNRVFKPFLRRCLLVFFDDILIYSPDMQTNLTHLGMVFNVLRDNALFANQKKCVFAQERIAYLGHWIFANGVSGVAILP

Query:  LFITIVYNTRPDMQTNLTHLGMVFNVLRDNALFANQKKCVFAQERIAYLGHWIFTNGVEADGENIQAMTIVMNNGKI---ATPDTIVEERRFPIQLPVLA
                  P+    ++      NV +                 +   G+  +   V+  G     +T ++  G     A  +T   + +  +  PVL 
Subjt:  LFITIVYNTRPDMQTNLTHLGMVFNVLRDNALFANQKKCVFAQERIAYLGHWIFTNGVEADGENIQAMTIVMNNGKI---ATPDTIVEERRFPIQLPVLA

Query:  LPDFDQPFTIETDASGTGLGVVLTQKQRPVAFFSQS
        +PDF+ PF IE+DASG GLG VLTQ ++ VA+FS++
Subjt:  LPDFDQPFTIETDASGTGLGVVLTQKQRPVAFFSQS

A0A5A7V7S9 Ty3/gypsy retrotransposon protein3.8e-4625.36Show/hide
Query:  PDDWLFRAAQYFTIKRLSDEEKVQVAAISFTEAVLQWLQWAKGKEPFQGWKDLMNRIFERFRPMQEGSICARFLVARQDNTVQEFRERFEELSAPLPHLT
        PD WLFRA +YF I  LSD EK+ VA ISF    L W +  + +E F+GW+DL  ++  RFR + +G++  RFL  +Q+ TV+E+R RF++  AP+  L 
Subjt:  PDDWLFRAAQYFTIKRLSDEEKVQVAAISFTEAVLQWLQWAKGKEPFQGWKDLMNRIFERFRPMQEGSICARFLVARQDNTVQEFRERFEELSAPLPHLT

Query:  DEIMESTFFNGLDPEVKVEVLSFEPVDLAAFIKAAQRVEDKNLARN----LKANPNKG--KSTNSTSPNYELAKPNPKVIDATPIRSITL-PTSSSPQHR
          ++E TF NGL P +K EV ++EPV LA  +K A ++E++ + R       A  +K   ++TN+ + N      N       P+R+ITL   ++    R
Subjt:  DEIMESTFFNGLDPEVKVEVLSFEPVDLAAFIKAAQRVEDKNLARN----LKANPNKG--KSTNSTSPNYELAKPNPKVIDATPIRSITL-PTSSSPQHR

Query:  ETPYK-LTDTKLKSKKEMGLFIDAIKKIMRPLKI-ISAAKKARVIFKIDFEKVYDHVDWDYIDMVLAYKAPTLQKWRS----VSVGHVSDTDTPRHRKIF
        E P K LTD + ++++E GL     +K     +     +K+ R++   +  +  + ++ +Y D     K P +Q   +    +S+  V     PR  K  
Subjt:  ETPYK-LTDTKLKSKKEMGLFIDAIKKIMRPLKI-ISAAKKARVIFKIDFEKVYDHVDWDYIDMVLAYKAPTLQKWRS----VSVGHVSDTDTPRHRKIF

Query:  VSGHSGVFKI---------------KLGEFRSFPNNQRSTLG----------------KVNLW-------------------------------------
        V G  G  ++               KL E    P  +    G                KV LW                                     
Subjt:  VSGHSGVFKI---------------KLGEFRSFPNNQRSTLG----------------KVNLW-------------------------------------

Query:  ---------------------PTQESRQNPESNAMLNTTRH---------------------------------CASKVVNVEPLSLASR--------CS
                               +E  ++ E++     T                                     +  VNV P   A            
Subjt:  ---------------------PTQESRQNPESNAMLNTTRH---------------------------------CASKVVNVEPLSLASR--------CS

Query:  KTASAGI--PASQQWP----------------------NNST----------------------------------------------------------
        +  ++GI  P++  +                       NN T                                                          
Subjt:  KTASAGI--PASQQWP----------------------NNST----------------------------------------------------------

Query:  ---RSTNGLTTFQSLMNRVFKPFLRRCLLVFFDDILIYSPDMQTNLTHLGMVFNVLRDNALFANQKKCVFAQERIAYLGHWIFANGVSGVAILPLFITIV
             TN  +TFQ+LMN+VFKPFLRR +LVFFDDIL+YS  M+ +  HL +V  +LR++ L+AN  KC FA+ RI+YLGH+I    + G+ + P  I  V
Subjt:  ---RSTNGLTTFQSLMNRVFKPFLRRCLLVFFDDILIYSPDMQTNLTHLGMVFNVLRDNALFANQKKCVFAQERIAYLGHWIFANGVSGVAILPLFITIV

Query:  --YNTRPDMQTNLTHLGMVFNVLRDNALFANQKKCVFAQERIAYLGHWIFTNGVEADGENIQAMTIVMNNGKIATPDTIVEERRFPIQLPVLALPDFDQP
          +    +++     LG+     R      N         ++   G + +T   EA  E +                     ++  + LPVLA+PDF+ P
Subjt:  --YNTRPDMQTNLTHLGMVFNVLRDNALFANQKKCVFAQERIAYLGHWIFTNGVEADGENIQAMTIVMNNGKIATPDTIVEERRFPIQLPVLALPDFDQP

Query:  FTIETDASGTGLGVVLTQKQRPVAFFSQ
        F IE+DASG G+G VL Q +RPVA+FS+
Subjt:  FTIETDASGTGLGVVLTQKQRPVAFFSQ

A0A5D3BC89 Ty3-gypsy retrotransposon protein6.0e-4425.95Show/hide
Query:  DGGESCSALSLLDVML------DEAEEEEEEEKSQTKKPDEDEDEARRRRSRSKLKKTKPVGSWN-------KLKKTET-----VPPDDWLFRAAQYFTI
        DG E    LSL ++ML      +   +E +E +S  K+ + +  +    + + K+++T+ +   N       K KK E        P+ W++R   +F I
Subjt:  DGGESCSALSLLDVML------DEAEEEEEEEKSQTKKPDEDEDEARRRRSRSKLKKTKPVGSWN-------KLKKTET-----VPPDDWLFRAAQYFTI

Query:  KRLSDEEKVQVAAISFTEAVLQWLQWAKGKEPFQGWKDLMNRIFERFRPMQEGSICARFLVARQDNTVQEFRERFEELSAPLPHLTDEIMESTFFNGLDP
          L   EKV+VA ++F +  + W  W+  ++  + W+DL  R+FE FR   + S+ AR +  +QD +  ++ ++F   SAPLP++ + ++   F  GL+P
Subjt:  KRLSDEEKVQVAAISFTEAVLQWLQWAKGKEPFQGWKDLMNRIFERFRPMQEGSICARFLVARQDNTVQEFRERFEELSAPLPHLTDEIMESTFFNGLDP

Query:  EVKVEVLSFEPVDLAAFIKAAQRVEDKNLARNL------KANPNKGKSTNSTSPNYELAKPNPKVIDATPIRSITLPTSSSPQHRETPYK-LTDTKLKSK
         ++ EV+S  P  L   +K AQ V DKNLA          A  ++G S+       E  K  P+  D   ++ IT+P     +  E P K L+D + ++K
Subjt:  EVKVEVLSFEPVDLAAFIKAAQRVEDKNLARNL------KANPNKGKSTNSTSPNYELAKPNPKVIDATPIRSITLPTSSSPQHRETPYK-LTDTKLKSK

Query:  KEMGLFIDAIKKIMRPLKIISAAKKARVIFKIDFEKVYDH-----------VDWDYIDMVLAYKAPTLQKWRSVSVGHVSDTDTPRHRKIFV--------
         + GL+     K     +  +  K+  + F I+ E+  +            V+   +++    K          S G +       H++I V        
Subjt:  KEMGLFIDAIKKIMRPLKIISAAKKARVIFKIDFEKVYDH-----------VDWDYIDMVLAYKAPTLQKWRSVSVGHVSDTDTPRHRKIFV--------

Query:  ---------------------------------SGHSGVFKIKLGEFRSFPNNQRSTLGKVNL----------------WPT------QESRQNPESNAM
                                          G     ++KL E     +     LG V+                 WP+       E RQ       
Subjt:  ---------------------------------SGHSGVFKIKLGEFRSFPNNQRSTLGKVNL----------------WPT------QESRQNPESNAM

Query:  LNTTRHCASKVV----------------NVE------------------------------------------------------PLSLAS---------
          T   C+ K +                N+E                                                       L L S         
Subjt:  LNTTRHCASKVV----------------NVE------------------------------------------------------PLSLAS---------

Query:  -RCSKTASAGIPASQQWPNNSTRSTNGLTTFQSLMNRVFKPFLRRCLLVFFDDILIYSPDMQTNLTHLGMVFNVLRDNALFANQKKCVFAQERIAYLGHW
            KTA        ++   S    N   TFQSLMN+VFKPFLRRC+LVFFDDIL+YS D+  +  HLGMVF VLRDN L+AN+KKCVFA  +I YLGH 
Subjt:  -RCSKTASAGIPASQQWPNNSTRSTNGLTTFQSLMNRVFKPFLRRCLLVFFDDILIYSPDMQTNLTHLGMVFNVLRDNALFANQKKCVFAQERIAYLGHW

Query:  IFANGVSGVAILPLFITIVYNTRPDMQTNLTHLGMVFNVLRDNALFANQKKCVFAQERIAYLGHWIFTNGVEADGENIQAMTIVMNNGKIATPDTIVEER
        I  +GV         +  V    P +           N  R NA                      F  G EA+   ++ + + M               
Subjt:  IFANGVSGVAILPLFITIVYNTRPDMQTNLTHLGMVFNVLRDNALFANQKKCVFAQERIAYLGHWIFTNGVEADGENIQAMTIVMNNGKIATPDTIVEER

Query:  RFPIQLPVLALPDFDQPFTIETDASGTGLGVVLTQKQRPVAFFS
             +PVLALPD++ PFTIETDASG+GLG VL+Q+  P+AFFS
Subjt:  RFPIQLPVLALPDFDQPFTIETDASGTGLGVVLTQKQRPVAFFS

SwissProt top hitse value%identityAlignment
O92815 Gag-Pol polyprotein1.2e-0422.87Show/hide
Query:  FQSLMNRVFKPFLRRCLLVFFDDILIYSPDMQTNLTHLGMVFNVLRDNALFANQKKCVFAQERIAYLGHWIFANGVSGVAILPLFITIVYNTRPDMQTNL
        +QSL    FK     C  ++ DD+LI S D  TNL    ++   L       ++KK    Q+ + YLG  +   G                  PD +  +
Subjt:  FQSLMNRVFKPFLRRCLLVFFDDILIYSPDMQTNLTHLGMVFNVLRDNALFANQKKCVFAQERIAYLGHWIFANGVSGVAILPLFITIVYNTRPDMQTNL

Query:  THLGMVFNVLRDNALFANQKKCVFAQERIAYLGHW-----IFTNGVEADGENIQAMTIVMNNGKIATPDTIVEERRFPIQLPVLALPDFDQPFTIETDAS
        +       + +  A              + Y  HW     I +  +E   +   A    +++ ++   +   + +      PVL +PD  +PF + T  S
Subjt:  THLGMVFNVLRDNALFANQKKCVFAQERIAYLGHW-----IFTNGVEADGENIQAMTIVMNNGKIATPDTIVEERRFPIQLPVLALPDFDQPFTIETDAS

Query:  GTGLGVVLTQKQ----RPVAFFS
              VLTQK     RP+AF S
Subjt:  GTGLGVVLTQKQ----RPVAFFS

P04323 Retrovirus-related Pol polyprotein from transposon 17.62.4e-1326.69Show/hide
Query:  NGLTTFQSLMNRVFKPFLRRCLLVFFDDILIYSPDMQTNLTHLGMVFNVLRDNALFANQKKCVFAQERIAYLGHWIFANGVS-GVAILPLFITIVYNTRP
        N   TFQ  MN + +P L +  LV+ DDI+++S  +  +L  LG+VF  L    L     KC F ++   +LGH +  +G+      +         T+P
Subjt:  NGLTTFQSLMNRVFKPFLRRCLLVFFDDILIYSPDMQTNLTHLGMVFNVLRDNALFANQKKCVFAQERIAYLGHWIFANGVS-GVAILPLFITIVYNTRP

Query:  -DMQTNLTHLGMVFNVLRDNALFAN-QKKCVFAQERIAYLGHWIFTNGVEADGENIQAMTIVMNNGKIATPDTIVEERRFPIQLPVLALPDFDQPFTIET
         +++  L   G     + + A  A    KC+    +I        T   E D    +   ++  +                   P+L +PDF + FT+ T
Subjt:  -DMQTNLTHLGMVFNVLRDNALFAN-QKKCVFAQERIAYLGHWIFTNGVEADGENIQAMTIVMNNGKIATPDTIVEERRFPIQLPVLALPDFDQPFTIET

Query:  DASGTGLGVVLTQKQRPVAFFSQSQVVHGKNQSTKE
        DAS   LG VL+Q   P+++ S++   H  N ST E
Subjt:  DASGTGLGVVLTQKQRPVAFFSQSQVVHGKNQSTKE

P10401 Retrovirus-related Pol polyprotein from transposon gypsy7.2e-1025.83Show/hide
Query:  NGLTTFQSLMNRVFKPFLRRCLLVFFDDILIYSPDMQTNLTHLGMVFNVLRDNALFANQKKCVFAQERIAYLGHWIFANG-------VSGVAILPLFITI
        N  + FQ  ++ V +  + +   V+ DD++I+S +   ++ H+  V   L D  +  +Q+K  F +E + YLG  +  +G       V  +   P     
Subjt:  NGLTTFQSLMNRVFKPFLRRCLLVFFDDILIYSPDMQTNLTHLGMVFNVLRDNALFANQKKCVFAQERIAYLGHWIFANG-------VSGVAILPLFITI

Query:  VYNTRPDMQTNLTHLGMVFNVLRDNALFANQKKCVFAQERIAYLGHWIFTNGVEADGENIQAMTIVMNNGKIATPDTIVEERRFPIQLPVLALPDFDQPF
        VY  R  +   L     VF  ++D A  A     +   E  +   H      VE +     A   + N   +A+ D I            L  PDF +PF
Subjt:  VYNTRPDMQTNLTHLGMVFNVLRDNALFANQKKCVFAQERIAYLGHWIFTNGVEADGENIQAMTIVMNNGKIATPDTIVEERRFPIQLPVLALPDFDQPF

Query:  TIETDASGTGLGVVLTQKQRPVAFFSQSQVVHGKNQSTKE
         + TDAS +G+G VL+Q+ RP+   S++     +N +T E
Subjt:  TIETDASGTGLGVVLTQKQRPVAFFSQSQVVHGKNQSTKE

P20825 Retrovirus-related Pol polyprotein from transposon 2976.3e-1425Show/hide
Query:  NGLTTFQSLMNRVFKPFLRRCLLVFFDDILIYSPDMQTNLTHLGMVFNVLRDNALFANQKKCVFAQERIAYLGHWIFANGV--SGVAILPLFITIVYNTR
        N   TFQ  MN + +P L +  LV+ DDI+I+S  +  +L  + +VF  L D  L     KC F ++   +LGH +  +G+  + + +  +    +    
Subjt:  NGLTTFQSLMNRVFKPFLRRCLLVFFDDILIYSPDMQTNLTHLGMVFNVLRDNALFANQKKCVFAQERIAYLGHWIFANGV--SGVAILPLFITIVYNTR

Query:  PDMQTNLTHLGMVFNVLRDNALFAN-QKKCVFAQERIAYLGHWIFTNGVEADGENIQAMTIVMNNGKIATPDTIVEERRFPIQLPVLALPDFDQPFTIET
         +++  L   G     + + A  A     C+  + +I                          +  K+   +   + +   I+ P+L LPDF++ F + T
Subjt:  PDMQTNLTHLGMVFNVLRDNALFAN-QKKCVFAQERIAYLGHWIFTNGVEADGENIQAMTIVMNNGKIATPDTIVEERRFPIQLPVLALPDFDQPFTIET

Query:  DASGTGLGVVLTQKQRPVAFFSQSQVVHGKNQSTKE
        DAS   LG VL+Q   P++F S++   H  N S  E
Subjt:  DASGTGLGVVLTQKQRPVAFFSQSQVVHGKNQSTKE

Q8I7P9 Retrovirus-related Pol polyprotein from transposon opus3.0e-0822.78Show/hide
Query:  NGLTTFQSLMNRVFKPFLRRCLLVFFDDILIYSPDMQTNLTHLGMVFNVLRDNALFANQKKCVFAQERIAYLGHWIFANGVSGVAILPLFITIVYNTRPD
        N    FQ +++ + +  + +   V+ DDI+++S D  T+  +L +V   L    L  N +K  F   ++ +LG+ + A+G+      P  +  +    P 
Subjt:  NGLTTFQSLMNRVFKPFLRRCLLVFFDDILIYSPDMQTNLTHLGMVFNVLRDNALFANQKKCVFAQERIAYLGHWIFANGVSGVAILPLFITIVYNTRPD

Query:  MQTNLTHLGMVFNVLRDNALFANQKKCVFAQERIAYLGHWIFTNGVEADGENIQAMTIVMNNGKIATPDTIVEERRFPIQLPVLALPDFDQPFTIETDAS
                  V  + R   + +  +K +    ++A       T G+ A+ ++ Q+  + +   + A   +  + +       +LA P F +PF + TDAS
Subjt:  MQTNLTHLGMVFNVLRDNALFANQKKCVFAQERIAYLGHWIFTNGVEADGENIQAMTIVMNNGKIATPDTIVEERRFPIQLPVLALPDFDQPFTIETDAS

Query:  GTGLGVVLTQ----KQRPVAFFSQSQVVHGKNQSTKE
           +G VL+Q    + RP+A+ S+S     +N +T E
Subjt:  GTGLGVVLTQ----KQRPVAFFSQSQVVHGKNQSTKE

Arabidopsis top hitse value%identityAlignment
AT3G42723.1 aminoacyl-tRNA ligases;ATP binding;nucleotide binding4.2e-0519.79Show/hide
Query:  YFTIKRLSDEEKVQVAAISFTEAVLQWLQWAKGKEPFQGWKDLMNRIFERFRPMQEGSICARFLVARQDNTVQEFRERFEELSAPLPHLTDEIMESTFFN
        YF    + ++E++Q+   +    + QW++    K     WK+    +    +   + +    +   +Q+ +V+E+RERFE L      L  + +E+ F  
Subjt:  YFTIKRLSDEEKVQVAAISFTEAVLQWLQWAKGKEPFQGWKDLMNRIFERFRPMQEGSICARFLVARQDNTVQEFRERFEELSAPLPHLTDEIMESTFFN

Query:  GLDPEVKVEVLSFEPVDLAAFIKAAQRVEDKN------LARNLKANPNKGKSTNSTSPNYELAKPNPKVIDATPIRSITLPTSSSPQHRETPYKLTDTKL
        GL P ++  V   +P  +   +  AQ +E+ N         +++  P    +T +   +  L     + +  TP R     T +  Q  E P     T++
Subjt:  GLDPEVKVEVLSFEPVDLAAFIKAAQRVEDKN------LARNLKANPNKGKSTNSTSPNYELAKPNPKVIDATPIRSITLPTSSSPQHRETPYKLTDTKL

Query:  KSKKEMGLFIDAI-KKIMRPLKIISAAKKA--RVIFKI-DFEKVYDHVDW----DYIDMVLAYK-----APTLQKWRSVSVGHVSDTD
         + + M  +   + +++    ++ S  K++   +  +I D + V D+  W    D +D++L Y+       T   W++ S   + + D
Subjt:  KSKKEMGLFIDAI-KKIMRPLKIISAAKKA--RVIFKI-DFEKVYDHVDW----DYIDMVLAYK-----APTLQKWRSVSVGHVSDTD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAGATGGTGGTGAATCCTGTTCAGCTTTAAGTTTGTTAGATGTTATGTTGGACGAAGCAGAAGAAGAAGAAGAAGAAGAAAAGAGTCAGACGAAGAAGCCAGACGA
AGACGAAGACGAAGCCAGACGCAGACGAAGCCGGAGCAAGTTGAAGAAGACGAAGCCGGTTGGAAGTTGGAACAAGTTGAAGAAGACGGAGACGGTTCCCCCGGACGATT
GGTTGTTTCGGGCGGCACAATATTTCACTATTAAAAGGTTGTCCGATGAAGAAAAAGTGCAGGTGGCGGCTATTTCCTTCACTGAGGCGGTATTGCAGTGGCTTCAATGG
GCAAAAGGCAAGGAACCGTTCCAGGGATGGAAGGATCTCATGAATCGGATATTCGAAAGATTCAGACCTATGCAAGAAGGCTCAATATGTGCGAGATTCCTAGTTGCGCG
ACAAGACAACACTGTACAAGAGTTCAGAGAACGTTTTGAAGAATTATCAGCACCCCTGCCACACCTAACAGACGAGATTATGGAGAGCACATTTTTTAATGGACTGGACC
CGGAGGTAAAGGTTGAGGTGTTATCTTTTGAGCCAGTGGACCTAGCAGCCTTCATAAAGGCAGCCCAACGCGTAGAGGATAAGAACTTGGCCCGCAACCTTAAAGCAAAC
CCAAACAAGGGAAAATCTACGAATTCAACCAGTCCCAATTATGAATTAGCAAAACCAAATCCAAAGGTAATTGATGCCACACCCATCAGGTCCATTACCCTGCCAACTAG
CTCGTCACCTCAACATAGGGAAACACCCTACAAGTTAACTGACACGAAACTCAAGTCGAAAAAGGAGATGGGCCTCTTTATAGATGCGATAAAAAAAATTATGAGACCAT
TGAAGATTATTAGTGCAGCAAAAAAGGCGAGGGTCATATTCAAGATTGATTTTGAAAAAGTTTATGACCATGTGGATTGGGATTATATTGACATGGTTCTCGCCTATAAA
GCACCGACACTTCAAAAATGGCGAAGTGTCAGTGTCGGACACGTGTCGGACACCGACACGCCCCGACACAGGAAAATTTTTGTTTCAGGCCATTCCGGAGTGTTCAAGAT
CAAATTGGGGGAATTTCGATCATTTCCCAACAATCAAAGGTCAACACTAGGAAAAGTCAACTTGTGGCCAACTCAAGAGTCAAGGCAAAATCCCGAGAGCAACGCTATGC
TCAACACCACAAGGCATTGTGCATCCAAAGTGGTCAACGTCGAGCCGCTGTCCTTAGCATCTCGATGCTCTAAGACCGCTAGCGCTGGAATTCCAGCATCTCAACAATGG
CCCAACAACTCAACGAGGTCTACCAATGGGCTAACCACCTTCCAATCCCTCATGAATCGAGTGTTCAAACCATTTTTACGCCGCTGCTTATTAGTGTTCTTTGATGACAT
CCTCATATACAGTCCTGATATGCAGACTAATCTAACTCATCTCGGTATGGTCTTCAATGTTTTAAGAGACAATGCCTTATTTGCCAATCAAAAGAAATGTGTTTTTGCCC
AAGAGAGGATAGCCTATCTGGGCCATTGGATTTTCGCGAATGGAGTATCAGGGGTGGCTATTTTACCGTTGTTCATCACAATCGTCTATAATACCCGTCCTGATATGCAG
ACTAATCTGACTCATCTCGGTATGGTCTTCAATGTTTTAAGAGACAATGCCTTATTTGCCAATCAAAAGAAATGTGTTTTTGCCCAAGAGAGGATAGCCTATCTGGGCCA
TTGGATTTTCACGAATGGAGTAGAAGCAGATGGAGAAAATATTCAAGCCATGACGATTGTGATGAACAACGGTAAAATAGCCACCCCTGATACAATTGTTGAAGAAAGAC
GCTTTCCAATACAATTGCCTGTTTTGGCTCTACCAGATTTTGACCAACCATTCACCATTGAGACCGATGCTTCAGGCACAGGGTTAGGTGTCGTGCTCACTCAGAAACAG
AGACCAGTGGCCTTCTTCAGCCAAAGTCAAGTCGTGCACGGGAAAAATCAGTCTACGAAAGAGAATTGA
mRNA sequenceShow/hide mRNA sequence
ATGGGAGATGGTGGTGAATCCTGTTCAGCTTTAAGTTTGTTAGATGTTATGTTGGACGAAGCAGAAGAAGAAGAAGAAGAAGAAAAGAGTCAGACGAAGAAGCCAGACGA
AGACGAAGACGAAGCCAGACGCAGACGAAGCCGGAGCAAGTTGAAGAAGACGAAGCCGGTTGGAAGTTGGAACAAGTTGAAGAAGACGGAGACGGTTCCCCCGGACGATT
GGTTGTTTCGGGCGGCACAATATTTCACTATTAAAAGGTTGTCCGATGAAGAAAAAGTGCAGGTGGCGGCTATTTCCTTCACTGAGGCGGTATTGCAGTGGCTTCAATGG
GCAAAAGGCAAGGAACCGTTCCAGGGATGGAAGGATCTCATGAATCGGATATTCGAAAGATTCAGACCTATGCAAGAAGGCTCAATATGTGCGAGATTCCTAGTTGCGCG
ACAAGACAACACTGTACAAGAGTTCAGAGAACGTTTTGAAGAATTATCAGCACCCCTGCCACACCTAACAGACGAGATTATGGAGAGCACATTTTTTAATGGACTGGACC
CGGAGGTAAAGGTTGAGGTGTTATCTTTTGAGCCAGTGGACCTAGCAGCCTTCATAAAGGCAGCCCAACGCGTAGAGGATAAGAACTTGGCCCGCAACCTTAAAGCAAAC
CCAAACAAGGGAAAATCTACGAATTCAACCAGTCCCAATTATGAATTAGCAAAACCAAATCCAAAGGTAATTGATGCCACACCCATCAGGTCCATTACCCTGCCAACTAG
CTCGTCACCTCAACATAGGGAAACACCCTACAAGTTAACTGACACGAAACTCAAGTCGAAAAAGGAGATGGGCCTCTTTATAGATGCGATAAAAAAAATTATGAGACCAT
TGAAGATTATTAGTGCAGCAAAAAAGGCGAGGGTCATATTCAAGATTGATTTTGAAAAAGTTTATGACCATGTGGATTGGGATTATATTGACATGGTTCTCGCCTATAAA
GCACCGACACTTCAAAAATGGCGAAGTGTCAGTGTCGGACACGTGTCGGACACCGACACGCCCCGACACAGGAAAATTTTTGTTTCAGGCCATTCCGGAGTGTTCAAGAT
CAAATTGGGGGAATTTCGATCATTTCCCAACAATCAAAGGTCAACACTAGGAAAAGTCAACTTGTGGCCAACTCAAGAGTCAAGGCAAAATCCCGAGAGCAACGCTATGC
TCAACACCACAAGGCATTGTGCATCCAAAGTGGTCAACGTCGAGCCGCTGTCCTTAGCATCTCGATGCTCTAAGACCGCTAGCGCTGGAATTCCAGCATCTCAACAATGG
CCCAACAACTCAACGAGGTCTACCAATGGGCTAACCACCTTCCAATCCCTCATGAATCGAGTGTTCAAACCATTTTTACGCCGCTGCTTATTAGTGTTCTTTGATGACAT
CCTCATATACAGTCCTGATATGCAGACTAATCTAACTCATCTCGGTATGGTCTTCAATGTTTTAAGAGACAATGCCTTATTTGCCAATCAAAAGAAATGTGTTTTTGCCC
AAGAGAGGATAGCCTATCTGGGCCATTGGATTTTCGCGAATGGAGTATCAGGGGTGGCTATTTTACCGTTGTTCATCACAATCGTCTATAATACCCGTCCTGATATGCAG
ACTAATCTGACTCATCTCGGTATGGTCTTCAATGTTTTAAGAGACAATGCCTTATTTGCCAATCAAAAGAAATGTGTTTTTGCCCAAGAGAGGATAGCCTATCTGGGCCA
TTGGATTTTCACGAATGGAGTAGAAGCAGATGGAGAAAATATTCAAGCCATGACGATTGTGATGAACAACGGTAAAATAGCCACCCCTGATACAATTGTTGAAGAAAGAC
GCTTTCCAATACAATTGCCTGTTTTGGCTCTACCAGATTTTGACCAACCATTCACCATTGAGACCGATGCTTCAGGCACAGGGTTAGGTGTCGTGCTCACTCAGAAACAG
AGACCAGTGGCCTTCTTCAGCCAAAGTCAAGTCGTGCACGGGAAAAATCAGTCTACGAAAGAGAATTGA
Protein sequenceShow/hide protein sequence
MGDGGESCSALSLLDVMLDEAEEEEEEEKSQTKKPDEDEDEARRRRSRSKLKKTKPVGSWNKLKKTETVPPDDWLFRAAQYFTIKRLSDEEKVQVAAISFTEAVLQWLQW
AKGKEPFQGWKDLMNRIFERFRPMQEGSICARFLVARQDNTVQEFRERFEELSAPLPHLTDEIMESTFFNGLDPEVKVEVLSFEPVDLAAFIKAAQRVEDKNLARNLKAN
PNKGKSTNSTSPNYELAKPNPKVIDATPIRSITLPTSSSPQHRETPYKLTDTKLKSKKEMGLFIDAIKKIMRPLKIISAAKKARVIFKIDFEKVYDHVDWDYIDMVLAYK
APTLQKWRSVSVGHVSDTDTPRHRKIFVSGHSGVFKIKLGEFRSFPNNQRSTLGKVNLWPTQESRQNPESNAMLNTTRHCASKVVNVEPLSLASRCSKTASAGIPASQQW
PNNSTRSTNGLTTFQSLMNRVFKPFLRRCLLVFFDDILIYSPDMQTNLTHLGMVFNVLRDNALFANQKKCVFAQERIAYLGHWIFANGVSGVAILPLFITIVYNTRPDMQ
TNLTHLGMVFNVLRDNALFANQKKCVFAQERIAYLGHWIFTNGVEADGENIQAMTIVMNNGKIATPDTIVEERRFPIQLPVLALPDFDQPFTIETDASGTGLGVVLTQKQ
RPVAFFSQSQVVHGKNQSTKEN