; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0039278 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0039278
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase Ty1/copia-type domain-containing protein
Genome locationchr2:40415204..40420914
RNA-Seq ExpressionLag0039278
SyntenyLag0039278
Gene Ontology termsNA
InterPro domainsIPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR029472 - Retrotransposon Copia-like, N-terminal
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0061282.1 putative mitochondrial protein [Cucumis melo var. makuwa]1.6e-10642.1Show/hide
Query:  SMASIVASSSSTISESSS--SATSSPMFLLSNICNFVPLRLDSTNYVLWKFQVSSILKAHSLFGHIDDSLPKP----------------TQYILTLAGQQ
        S  S +  +S+ I ESSS  S  +S +FLLSNICN VP+RLDSTNYVLWK+QVSSILKAHSLFGHIDD+LP P                 +Y+  L+  Q
Subjt:  SMASIVASSSSTISESSS--SATSSPMFLLSNICNFVPLRLDSTNYVLWKFQVSSILKAHSLFGHIDDSLPKP----------------TQYILTLAGQQ

Query:  ----------------------RQRSVSITFEKRYASNTRSSILDLRSALYKIKKLSSKSIEKYTYRIKEIVDKLVAALVKIEDEEILVHTLNGLPAEFN
                                +++ ++ EKRY+SNTRS+ILDLRSALY I K  S+SI++YT RIK +VDKL AA V +EDEEILVHTLNGLPA FN
Subjt:  ----------------------RQRSVSITFEKRYASNTRSSILDLRSALYKIKKLSSKSIEKYTYRIKEIVDKLVAALVKIEDEEILVHTLNGLPAEFN

Query:  AFRTSIRTRSGTLSLEELHVLLAAEE----QTLQLHAIPYTYDNGDCSKPYLSGKRTM-RSDNILGSPSSNLNSGFGSGSIFCQICSKSGHGALDCYNRM
        AFRTSIRTRSG +SLEELH LL +EE    +T  + AIP          P   G R   R  +  G+P+ N +S            + S  G        
Subjt:  AFRTSIRTRSGTLSLEELHVLLAAEE----QTLQLHAIPYTYDNGDCSKPYLSGKRTM-RSDNILGSPSSNLNSGFGSGSIFCQICSKSGHGALDCYNRM

Query:  NFSYQGRYPPAQLAAMADKVSDKILYTGENINGLYPIPSPSMLSSDMHPKNFNFMAKQECSFWHHRFGHPAPKILRSSLSRLDSPKPATPSHPDITPLLT
        NF+   R                         GL                N NF+                                             
Subjt:  NFSYQGRYPPAQLAAMADKVSDKILYTGENINGLYPIPSPSMLSSDMHPKNFNFMAKQECSFWHHRFGHPAPKILRSSLSRLDSPKPATPSHPDITPLLT

Query:  LLHTSPNPSSPSPPTQLSTVDTNSSPQEILHVSFSNIADLPTDESSDEHSVSQNVHIPASTVNNHLMQTRAKLEIFKPKVFVSTITTSVPIDPPSYSVAS
                                                  D  ++ ++ + N+  P S  N H MQTRAK  IFKPK F   ITT++P  P SY+ AS
Subjt:  LLHTSPNPSSPSPPTQLSTVDTNSSPQEILHVSFSNIADLPTDESSDEHSVSQNVHIPASTVNNHLMQTRAKLEIFKPKVFVSTITTSVPIDPPSYSVAS

Query:  KYSEWRSAMCEEFNALQEQVTWSLVPCLPSMNVVGCKWVFRTKYNTDGTIARYKARLVAKGYHQVEGFDFTETFSPVVKEPTIRVIIALAANYQWSLTQL
        KY EWR+ M EEFNALQ Q TWSLVP LPSMNVVGCKWVFR KYN DGTIAR+KARLVAKGYHQ                                    
Subjt:  KYSEWRSAMCEEFNALQEQVTWSLVPCLPSMNVVGCKWVFRTKYNTDGTIARYKARLVAKGYHQVEGFDFTETFSPVVKEPTIRVIIALAANYQWSLTQL

Query:  DVKNVFLHGHLKEEVYMSQPPGFLDKSSPHHVYRLHKSLYG--QAPRAWFNHFTSYLFTLGFTAFAADSSLFVRSVGSSLTYLLLYVDDIVITGPD
                    E VYM+QP GF +K+ P+HV  LHKSLYG  QAPRAWF  FTSYLFTLGF A  AD SLF+RSVGSSLTYLLLYVDDI++TGPD
Subjt:  DVKNVFLHGHLKEEVYMSQPPGFLDKSSPHHVYRLHKSLYG--QAPRAWFNHFTSYLFTLGFTAFAADSSLFVRSVGSSLTYLLLYVDDIVITGPD

KAB2607536.1 hypothetical protein D8674_007253 [Pyrus ussuriensis x Pyrus communis]2.0e-8029.99Show/hide
Query:  VPLRLDSTNYVLWKFQVSSILKAHSLFGHID--DSLPKP------------------------------TQYILTLAGQQRQRSVSITFEKRYASNTRSS
        VP++L S+NY+ W+   + IL+ + L G ID  +  P P                               + I    G    R + +  E+R+   + ++
Subjt:  VPLRLDSTNYVLWKFQVSSILKAHSLFGHID--DSLPKP------------------------------TQYILTLAGQQRQRSVSITFEKRYASNTRSS

Query:  ILDLRSALYKIKKLSSKSIEKYTYRIKEIVDKLVAALVKIEDEEILVHTLNGLPAEFNAFRTSIRTRSGTLSLEELHVLLAAEE----------------
        I  LRS L  ++K  S SI  Y  +IKEI D L AA   + D +++  TL+GLP EF +F  SI  R  + SL+ELH L   +E                
Subjt:  ILDLRSALYKIKKLSSKSIEKYTYRIKEIVDKLVAALVKIEDEEILVHTLNGLPAEFNAFRTSIRTRSGTLSLEELHVLLAAEE----------------

Query:  QTLQLHAIP-------------------YTYDNGDCSKPYLSGKRTMRSDNILGSPS--------SNLNSGFGSGSIFCQICSKSGHGALDCYNRMNFSY
        Q   + + P                   Y  + G  S+      R  R+     +P+        S  +SG       CQIC  + H A+DC++RMN   
Subjt:  QTLQLHAIP-------------------YTYDNGDCSKPYLSGKRTMRSDNILGSPS--------SNLNSGFGSGSIFCQICSKSGHGALDCYNRMNFSY

Query:  QGRYPPAQLAAMA---------------------------------------------------------------------------------------
         GR PPA+LAA+                                                                                        
Subjt:  QGRYPPAQLAAMA---------------------------------------------------------------------------------------

Query:  -DKVSDKILYTGENINGLYPIPSPSMLSSDMHPKNFNFMAKQECSFWHHRFGHPAPKILRSSLSRL---------------------DSPKPATPSHPDI
         D++S + L  G   +G YP+PS S  S  +H    +   K     WH R GHP+  I R  +S                       +   P   +    
Subjt:  -DKVSDKILYTGENINGLYPIPSPSMLSSDMHPKNFNFMAKQECSFWHHRFGHPAPKILRSSLSRL---------------------DSPKPATPSHPDI

Query:  TPLLTLLH----------------------------------TSPNPSSPSPPTQLSTVDTNSSPQEILHVSFSNIADLPTDESSDEHSVSQNVHIPAST
        T  L LLH                                   + + S P PP+  ++V   SS    L+ +      LP   +S   S S +  IP + 
Subjt:  TPLLTLLH----------------------------------TSPNPSSPSPPTQLSTVDTNSSPQEILHVSFSNIADLPTDESSDEHSVSQNVHIPAST

Query:  V----------------------------NNHLMQTRAKLEIFKPKVFVST---ITTSVPIDPPSYSVASKYSEWRSAMCEEFNALQEQVTWSLVPCLPS
        V                            N H M TR+K  I+KPK + +T   +   +   P +Y  ASK++ WR+AM +E+NAL    TWSLVP   +
Subjt:  V----------------------------NNHLMQTRAKLEIFKPKVFVST---ITTSVPIDPPSYSVASKYSEWRSAMCEEFNALQEQVTWSLVPCLPS

Query:  MNVVGCKWVFRTKYNTDGTIARYKARLVAKGYHQVEGFDFTETFSPVVKEPTIRVIIALAANYQWSLTQLDVKNVFLHGHLKEEVYMSQPPGFLDKSSPH
         NVVGCKWVFR K   DGT+ RYKARLVAKG+HQ EG DF +TFSPV K  TIR+++ LA  + W L QLD+ N FLHG L E+VYM QPPGF D   P 
Subjt:  MNVVGCKWVFRTKYNTDGTIARYKARLVAKGYHQVEGFDFTETFSPVVKEPTIRVIIALAANYQWSLTQLDVKNVFLHGHLKEEVYMSQPPGFLDKSSPH

Query:  HVYRLHKSLYG--QAPRAWFNHFTSYLFTLGFTAFAADSSLFVRSVGSSLTYLLLYVDDIVITGPDS
        HV +L KSLYG  QAPRAWF+     L  LGFT  ++D+SLFV + G  L  +L+YVDDI++T P+S
Subjt:  HVYRLHKSLYG--QAPRAWFNHFTSYLFTLGFTAFAADSSLFVRSVGSSLTYLLLYVDDIVITGPDS

KAB2613850.1 hypothetical protein D8674_036166 [Pyrus ussuriensis x Pyrus communis]1.7e-7929.73Show/hide
Query:  VPLRLDSTNYVLWKFQVSSILKAHSLFGHIDDSLPKPT--------------------------------QYILTLAGQQRQRSVSITFEKRYASNTRSS
        VP++L  TNY+ W    + IL+ + L G +D + P P                                 + I    G    R + +  E+R+   + + 
Subjt:  VPLRLDSTNYVLWKFQVSSILKAHSLFGHIDDSLPKPT--------------------------------QYILTLAGQQRQRSVSITFEKRYASNTRSS

Query:  ILDLRSALYKIKKLSSKSIEKYTYRIKEIVDKLVAALVKIEDEEILVHTLNGLPAEFNAFRTSIRTRSGTLSLEELHVLLAAEEQTLQ------------
        +  LRS L  I+K  ++S+  Y   +KEI D L AA   I D +++  TL GLP EF +F  SI  R  + SL+ELH LL  +E +L             
Subjt:  ILDLRSALYKIKKLSSKSIEKYTYRIKEIVDKLVAALVKIEDEEILVHTLNGLPAEFNAFRTSIRTRSGTLSLEELHVLLAAEEQTLQ------------

Query:  ----------------------------LHAIPYTYDNGDCSKPYLSGKR-------TMRSDNILGSP--SSNLNSGFGSG-SIFCQICSKSGHGALDCY
                                     H+  Y    G  ++ +    R         R ++  G     ++ N G  SG    CQIC  S H A+DC+
Subjt:  ----------------------------LHAIPYTYDNGDCSKPYLSGKR-------TMRSDNILGSP--SSNLNSGFGSG-SIFCQICSKSGHGALDCY

Query:  NRMNFSYQGRYPPAQLAAMA--------------------------------------------------------------------------------
        +RMN    GR PPA+L AM                                                                                 
Subjt:  NRMNFSYQGRYPPAQLAAMA--------------------------------------------------------------------------------

Query:  -------------DKVSDKILYTGENINGLYPIPSPSMLSSDMHPKNFNFMA----KQECSFWHHRFGHPAPKILRSSLSRL------------------
                     D+ + K+L  G   +G YP+ S +  SS     +    A    K     WH R GHP+  I R  +S                    
Subjt:  -------------DKVSDKILYTGENINGLYPIPSPSMLSSDMHPKNFNFMA----KQECSFWHHRFGHPAPKILRSSLSRL------------------

Query:  ---DSPKPATPSHPDITPLLTLLH------TSPNPSS-------------------------------PSPPTQLSTVDTNSSPQEILHVSFSNIAD---
           +  +P   S    T  L LLH       S   SS                               PS    +S V  NS   ++   ++++ A    
Subjt:  ---DSPKPATPSHPDITPLLTLLH------TSPNPSS-------------------------------PSPPTQLSTVDTNSSPQEILHVSFSNIAD---

Query:  -----------LPTDESSDEHSVSQNVHIPAS----TVNNHLMQTRAKLEIFKPKVFVST---ITTSVPIDPPSYSVASKYSEWRSAMCEEFNALQEQVT
                   +P+  +S   + S    +  S     VN H M TR+K  I+KPK + +T   ++ ++   P +Y  ASK+  WR+AM +E+NAL    T
Subjt:  -----------LPTDESSDEHSVSQNVHIPAS----TVNNHLMQTRAKLEIFKPKVFVST---ITTSVPIDPPSYSVASKYSEWRSAMCEEFNALQEQVT

Query:  WSLVPCLPSMNVVGCKWVFRTKYNTDGTIARYKARLVAKGYHQVEGFDFTETFSPVVKEPTIRVIIALAANYQWSLTQLDVKNVFLHGHLKEEVYMSQPP
        WSLVP   S N+VGCKWVFR K N DGT+ RYKARLVAKG+HQ +G DF ETFSPV K  TIR++++LA  + W L QLD+ N FLHG LKE+VYM QPP
Subjt:  WSLVPCLPSMNVVGCKWVFRTKYNTDGTIARYKARLVAKGYHQVEGFDFTETFSPVVKEPTIRVIIALAANYQWSLTQLDVKNVFLHGHLKEEVYMSQPP

Query:  GFLDKSSPHHVYRLHKSLYG--QAPRAWFNHFTSYLFTLGFTAFAADSSLFVRSVGSSLTYLLLYVDDIVITGPDSFI
        GF+D   P HV +L KSLYG  QAPRAWF+     L  LGF   ++D+SLFV + G  L  +L+YVDDI++TGPDS +
Subjt:  GFLDKSSPHHVYRLHKSLYG--QAPRAWFNHFTSYLFTLGFTAFAADSSLFVRSVGSSLTYLLLYVDDIVITGPDSFI

TQE03277.1 hypothetical protein C1H46_011089 [Malus baccata]4.7e-8232.95Show/hide
Query:  ISESSSSATSSPM-FLLSNICNFVPLRLDSTNYVLWKFQVSSILKAHSLFGHIDDSLPKPTQYILT------------------------LAGQQRQRSV
        ++ S      SP+  L+S I   V ++ D TNY+ W+FQ+  +L+ H +   +D S   P ++++                         L  +   + +
Subjt:  ISESSSSATSSPM-FLLSNICNFVPLRLDSTNYVLWKFQVSSILKAHSLFGHIDDSLPKPTQYILT------------------------LAGQQRQRSV

Query:  SIT---------------------FEKRYASNTRSSILDLRSALYKIKKLSSKSIEKYTYRIKEIVDKLVAALVKIEDEEILVHTLNGLPAEFNAFRTSI
        + T                      ++ +++ +R+S+  ++S L  IKK  S S+ KY  RIKE  D L AA V   DE+I++  LNGLP E+N FR  I
Subjt:  SIT---------------------FEKRYASNTRSSILDLRSALYKIKKLSSKSIEKYTYRIKEIVDKLVAALVKIEDEEILVHTLNGLPAEFNAFRTSI

Query:  RTRSGTLSLEELHVLLAAEEQTLQLHA-------------------------------IPYTYDNGDCSKPYLSGKRTMRSDNILGSPSSNLNSGF----
        R R   +SL++    L AEE  ++ +                                +PY+  NG  S  +  G R   ++   G      N G+    
Subjt:  RTRSGTLSLEELHVLLAAEEQTLQLHA-------------------------------IPYTYDNGDCSKPYLSGKRTMRSDNILGSPSSNLNSGF----

Query:  ------------GSG------------SIFCQICSKSGHGALDCYNRM-------------NFSYQGRYPPAQLAAMADKVSDKILYTGENINGLYPIPS
                     SG            S+FCQ+C+  GH A  C+++              + ++   Y    L+ +  +      Y+  +    +P   
Subjt:  ------------GSG------------SIFCQICSKSGHGALDCYNRM-------------NFSYQGRYPPAQLAAMADKVSDKILYTGENINGLYPIPS

Query:  PSMLSSDMHPKNFNFMAKQEC-SFWHHRFGHPAPKILRSSLSRLD--SPKP--------------ATPSHPDITPLLTLLHTS------PNPSSPSPPTQ
               M P +    + Q     W    G  A   + + LS L   SP P              ++ +  DI P L L   S      P P S    + 
Subjt:  PSMLSSDMHPKNFNFMAKQEC-SFWHHRFGHPAPKILRSSLSRLD--SPKP--------------ATPSHPDITPLLTLLHTS------PNPSSPSPPTQ

Query:  LSTVDTNSSPQEILHVSFSNIADLPTD-ESSDEHSVSQNVHIPASTVNNHLMQTRAKLEIFKPKVFVSTITTSVPID-----PPSYSVASKYSEWRSAMC
        L+  D +S P   + VS S+ + LP D E    H   Q V +P +++N H MQTR+K  I K K F+S+++ S  +D     P +Y  A K   W  AM 
Subjt:  LSTVDTNSSPQEILHVSFSNIADLPTD-ESSDEHSVSQNVHIPASTVNNHLMQTRAKLEIFKPKVFVSTITTSVPID-----PPSYSVASKYSEWRSAMC

Query:  EEFNALQEQVTWSLVPCLPSMNVVGCKWVFRTKYNTDGTIARYKARLVAKGYHQVEGFDFTETFSPVVKEPTIRVIIALAANYQWSLTQLDVKNVFLHGH
        EE  AL  Q TWSLVP L   N+VGCKW+F+ K N+DG+I+R+KARLVAKG+ Q  G D+ ETFSPV+K  TIR+I+ALAA++ WSL QLDVKN FLHG 
Subjt:  EEFNALQEQVTWSLVPCLPSMNVVGCKWVFRTKYNTDGTIARYKARLVAKGYHQVEGFDFTETFSPVVKEPTIRVIIALAANYQWSLTQLDVKNVFLHGH

Query:  LKEEVYMSQPPGFLDKSSPHHVYRLHKSLYG--QAPRAWFNHFTSYLFTLGFTAFAADSSLFVRSVGSSLTYLLLYVDDIVITGPDSFI
        L EEVYM+QPPGF D + P  V +LHKSLYG  QAPRAW   FT++L +LGF +  ADSSLFV+ +G+ +  LLLYVDDI++TG  S +
Subjt:  LKEEVYMSQPPGFLDKSSPHHVYRLHKSLYG--QAPRAWFNHFTSYLFTLGFTAFAADSSLFVRSVGSSLTYLLLYVDDIVITGPDSFI

XP_022158189.1 uncharacterized protein LOC111024722 [Momordica charantia]1.4e-8640.04Show/hide
Query:  DKVSDKILYTGENINGLYPIPSPSMLSS---DMHPKNFNFMAKQECSFWHHRFGHPAPKILRSSL------------------------SRLDSPKPATP
        DKV+D  LY G+++NGLYPIPS S LSS   ++HPKN   +AK     WHHR GH +PKILR +L                        S+L  P   + 
Subjt:  DKVSDKILYTGENINGLYPIPSPSMLSS---DMHPKNFNFMAKQECSFWHHRFGHPAPKILRSSL------------------------SRLDSPKPATP

Query:  S-------HPDI-------------------------------------------------------------------------------------TPL
        S       H D+                                                                                       L
Subjt:  S-------HPDI-------------------------------------------------------------------------------------TPL

Query:  LTLLHTSP----NPSSPSP--------------------------------PTQLSTVDTNSSPQEILHVSFS-----NIADLPTDESSDE--HSVSQNV
        LTL H  P     P SPS                                 PT ++T     +P  +L  +FS      ++  P    SD    S S + 
Subjt:  LTLLHTSP----NPSSPSP--------------------------------PTQLSTVDTNSSPQEILHVSFS-----NIADLPTDESSDE--HSVSQNV

Query:  HIPASTVNNHLMQTRAKLEIFKPKVF-VSTITTSVPIDPPSYSVASKYSEWRSAMCEEFNALQEQVTWSLVPCLPSMNVVGCKWVFRTKYNTDGTIARYK
              +N HLMQT AK  IFKP+ + V + + ++   P   + A+++SEWR+AM ++F ALQEQ TWSLVP  P MNVVGCKWVF TK+N+DG+ ARYK
Subjt:  HIPASTVNNHLMQTRAKLEIFKPKVF-VSTITTSVPIDPPSYSVASKYSEWRSAMCEEFNALQEQVTWSLVPCLPSMNVVGCKWVFRTKYNTDGTIARYK

Query:  ARLVAKGYHQVEGFDFTETFSPVVKEPTIRVIIALAANYQWSLTQLDVKNVFLHGHLKEEVYMSQPPGFLDKSSPHHVYRLHKSLYG--QAPRAWFNHFT
        ARL+AKGYH++EGFDF ETFSPVVK+PTIRV+++LAA++ WSLTQLDVKNVFLHG+L+++V+M Q   F+D S P +V  LHKSLYG  QAPRAWF+ FT
Subjt:  ARLVAKGYHQVEGFDFTETFSPVVKEPTIRVIIALAANYQWSLTQLDVKNVFLHGHLKEEVYMSQPPGFLDKSSPHHVYRLHKSLYG--QAPRAWFNHFT

Query:  SYLFTLGFTAFAADSSLFVRSVGSSLTYLLLYVDDIVITGPD-SFISISPKE-AVGNRVSDI
        +YLFTLGF A   D+SLFVRSV  SLT+LLLYVDDI+ITGPD S+I++  K  A   ++SD+
Subjt:  SYLFTLGFTAFAADSSLFVRSVGSSLTYLLLYVDDIVITGPD-SFISISPKE-AVGNRVSDI

TrEMBL top hitse value%identityAlignment
A0A2N9EJB2 CCHC-type domain-containing protein3.3e-11838.09Show/hide
Query:  SMASIVASSSSTISESSSSATSSPMFLLSNICNFVPLRLDSTNYVLWKFQVSSILKAHSLFGHIDDSLPKPTQYILT-----------------------
        SMAS  ++S +T          +P+ LLSN+ N + ++LDSTN+++WK Q+SSILKA+S+   +D ++P P+++++                        
Subjt:  SMASIVASSSSTISESSSSATSSPMFLLSNICNFVPLRLDSTNYVLWKFQVSSILKAHSLFGHIDDSLPKPTQYILT-----------------------

Query:  ---------------LAGQQRQRSVSITFEKRYASNTRSSILDLRSALYKIKKLSSKSIEKYTYRIKEIVDKLVAALVKIEDEEILVHTLNGLPAEFNAF
                       + GQ   +SV  T E R+ S +R+++L+L+  L+ +KK  S+++  Y  ++K   DKL+A    I++EE+L   L GLP E+  F
Subjt:  ---------------LAGQQRQRSVSITFEKRYASNTRSSILDLRSALYKIKKLSSKSIEKYTYRIKEIVDKLVAALVKIEDEEILVHTLNGLPAEFNAF

Query:  RTSIRTRSGTLSLEELHVLLAAEEQTL--------QLHAIPYT---------YDNGDCSKPYLSGKRTMRSDNILGSPSSNLNSGFGSGSIFCQICSKSG
         ++IRTR+  +S EE+ VLL  EEQ++         +H++            Y++ + +  Y S  +   S++   + SS            CQIC KSG
Subjt:  RTSIRTRSGTLSLEELHVLLAAEEQTL--------QLHAIPYT---------YDNGDCSKPYLSGKRTMRSDNILGSPSSNLNSGFGSGSIFCQICSKSG

Query:  HGALDCYNRMNFSYQGRYPPA-----QLAAMA-DKVSD-----------------------KILYTGENINGLYPIPSPSMLSSDMHPKNFN--------
        H ALDCY+RM+F+YQG           L   A  K SD                       ++LY G + NG YPI +  +  S + P   +        
Subjt:  HGALDCYNRMNFSYQGRYPPA-----QLAAMA-DKVSD-----------------------KILYTGENINGLYPIPSPSMLSSDMHPKNFN--------

Query:  --------FMAKQECSFWHHRFGHPAPKILRSSLSRLDSPKPATPSHPDI------TPLLTLLHTSPNPSSPSPPTQLSTVDTNSSP--QEIL-------
                  +K +   WHHR GHP+ ++L +S S + S     P    +        LL  +++  + +SP   T + T+   SSP  QE+        
Subjt:  --------FMAKQECSFWHHRFGHPAPKILRSSLSRLDSPKPATPSHPDI------TPLLTLLHTSPNPSSPSPPTQLSTVDTNSSP--QEIL-------

Query:  HVSFSNIADLPTDESSDEHSVSQNVHIPASTVNNHLMQTRAKLEIFKPKVFVSTITTSVPIDPPSYSVASKYSEWRSAMCEEFNALQEQVTWSLVPCLPS
        H   S  A LP  ES+   S S  +  P++ VN H M TR+K  IFKPK F +T T     +PP+Y +ASK+ +W SAM EEF+ALQ Q TWSLVP    
Subjt:  HVSFSNIADLPTDESSDEHSVSQNVHIPASTVNNHLMQTRAKLEIFKPKVFVSTITTSVPIDPPSYSVASKYSEWRSAMCEEFNALQEQVTWSLVPCLPS

Query:  MNVVGCKWVFRTKYNTDGTIARYKARLVAKGYHQVEGFDFTETFSPVVKEPTIRVIIALAANYQWSLTQLDVKNVFLHGHLKEEVYMSQPPGFLDKSSPH
         N+VGCKWVF+ K ++DG+I+RYKARLVAKG+HQ  G DF ETFS VVK PTIR+I+ALA  Y W L QLD++N FLHG LKEEVYM QPPG++D   P 
Subjt:  MNVVGCKWVFRTKYNTDGTIARYKARLVAKGYHQVEGFDFTETFSPVVKEPTIRVIIALAANYQWSLTQLDVKNVFLHGHLKEEVYMSQPPGFLDKSSPH

Query:  HVYRLHKSLYG--QAPRAWFNHFTSYLFTLGFTAFAADSSLFVRSVGSSLTYLLLYVDDIVITG
        HV RL KS+YG  QAPRAWF  FT+ L  L F +  A+SSLF+   GS + +LLLYVDDIV+TG
Subjt:  HVYRLHKSLYG--QAPRAWFNHFTSYLFTLGFTAFAADSSLFVRSVGSSLTYLLLYVDDIVITG

A0A2N9EN11 Reverse transcriptase Ty1/copia-type domain-containing protein2.2e-11436.65Show/hide
Query:  SESSSSATSSPMFLLSNICNFVPLRLDSTNYVLWKFQVSSILKAHSLFGHIDDSLPKPTQYILTLAGQQRQRSVSITFE---------------------
        S + +  T +P FLLSNI N+V ++LD TNY++WKFQ++ IL A+SL  HI+D +P P +++L+  G   Q    I  +                     
Subjt:  SESSSSATSSPMFLLSNICNFVPLRLDSTNYVLWKFQVSSILKAHSLFGHIDDSLPKPTQYILTLAGQQRQRSVSITFE---------------------

Query:  -----------------KRYASNTRSSILDLRSALYKIKKLSSKSIEKYTYRIKEIVDKLVAALVKIEDEEILVHTLNGLPAEFNAFRTSIRTRSGTLSL
                          RY S +RSSI++L+  L  IKK +S S+  Y  +IKE  DKLV+  V I+DEEIL   L GL ++F++F +++ T++  +S 
Subjt:  -----------------KRYASNTRSSILDLRSALYKIKKLSSKSIEKYTYRIKEIVDKLVAALVKIEDEEILVHTLNGLPAEFNAFRTSIRTRSGTLSL

Query:  EELHVLLAAEEQTLQLHA---------------IPYTYDNGDCSKPY---------LSGKRTMRSD--NILGSPSSNLNSGFGSGS--------IFCQIC
        EELH L+  EE  L+  A                  +  N   + P+           G R  R    +       N NS     S          CQIC
Subjt:  EELHVLLAAEEQTLQLHA---------------IPYTYDNGDCSKPY---------LSGKRTMRSD--NILGSPSSNLNSGFGSGS--------IFCQIC

Query:  SKSGHGALDCYNRMNFSYQGRYPPAQLAAMA---------------------------------------------------------------------
         K GH ALDCY RMN+++QGR+PPA+LAAMA                                                                     
Subjt:  SKSGHGALDCYNRMNFSYQGRYPPAQLAAMA---------------------------------------------------------------------

Query:  ------------------------------------DKVSDKILYTGENINGLYPI------PSPSMLSS-DMHPKNF------NFM-----AKQECS--
                                            D ++ K LY G + +GLYPI      P  S LSS    P  F      ++M     A+Q  S  
Subjt:  ------------------------------------DKVSDKILYTGENINGLYPI------PSPSMLSS-DMHPKNF------NFM-----AKQECS--

Query:  -FWHHRFGHPAPKILRSSLSRLDSPKPATPSHPDITPLLTLLHTSPNPSSPSPPTQLS-TVDTNSSPQEILHVSFSNIADLPTDESSD--EHSVSQNVHI
          WH R GHP  +           P   TP+ P + P     + SP P  P P T  +  +  N SPQ  LH+        PT  SS    ++V     I
Subjt:  -FWHHRFGHPAPKILRSSLSRLDSPKPATPSHPDITPLLTLLHTSPNPSSPSPPTQLS-TVDTNSSPQEILHVSFSNIADLPTDESSD--EHSVSQNVHI

Query:  PASTVNNHLMQTRAKLEIFKPKVFVSTITTS-VPIDPPSYSVASKYSEWRSAMCEEFNALQEQVTWSLVPCLPSMNVVGCKWVFRTKYNTDGTIARYKAR
        P   +++H M TR K  I K K+F +T +   +  +PP+Y++ASK  EWR+ M  EF ALQ Q TWSLVP  PS N+VGC+WV++ K  TDG+++RYKAR
Subjt:  PASTVNNHLMQTRAKLEIFKPKVFVSTITTS-VPIDPPSYSVASKYSEWRSAMCEEFNALQEQVTWSLVPCLPSMNVVGCKWVFRTKYNTDGTIARYKAR

Query:  LVAKGYHQVEGFDFTETFSPVVKEPTIRVIIALAANYQWSLTQLDVKNVFLHGHLKEEVYMSQPPGFLDKSSPHHVYRLHKSLYG--QAPRAWFNHFTSY
        LVAKG+HQ  G D+ ETFSPVVK PT+R+I++LAA  +W+L QLDV N FLHG LKE VYM+QP GF+D+  P HV  LHKSLYG  QAPRAWF  FTS+
Subjt:  LVAKGYHQVEGFDFTETFSPVVKEPTIRVIIALAANYQWSLTQLDVKNVFLHGHLKEEVYMSQPPGFLDKSSPHHVYRLHKSLYG--QAPRAWFNHFTSY

Query:  LFTLGFTAFAADSSLFVRSVGSSLTYLLLYVDDIVITGPDSFI
        L TLGFTA  AD+SLFV   GS + YLLLYVDDI+ITG DS +
Subjt:  LFTLGFTAFAADSSLFVRSVGSSLTYLLLYVDDIVITGPDSFI

A0A2N9FBP7 Reverse transcriptase Ty1/copia-type domain-containing protein6.1e-11238.78Show/hide
Query:  ASIVASSSSTISESSSSATS-SPMFLLSNICNFVPLRLDSTNYVLWKFQVSSILKAHSLFGHIDDSLPKPTQYILTLAGQQ-------------RQRSVS
        +S + S+++    + ++ATS SP+ LL+N+ N +  +LDS+NY++WK Q+S++L A+S+  H+D S  +P+Q++ +  G Q               + V 
Subjt:  ASIVASSSSTISESSSSATS-SPMFLLSNICNFVPLRLDSTNYVLWKFQVSSILKAHSLFGHIDDSLPKPTQYILTLAGQQ-------------RQRSVS

Query:  ITFEKRYASNTRSSILDLRSALYKIKKLSSKSIEKYTYRIKEIVDKLVAALVKIEDEEILVHTLNGLPAEFNAFRTSIRTRSGTLSLEELHVLLAAEEQT
           E+R+    R+++L+L+  L  IKK  ++S+  Y  RIK + DKL A  V+ + EE+L   L GLP E+  F ++IRTR G LSLE+L VLL  EEQ+
Subjt:  ITFEKRYASNTRSSILDLRSALYKIKKLSSKSIEKYTYRIKEIVDKLVAALVKIEDEEILVHTLNGLPAEFNAFRTSIRTRSGTLSLEELHVLLAAEEQT

Query:  LQLHAIPYTYD------------NGDCSKPYLSGKRTMRSDNIL----GSPSSNLNSGFGSGSI---------------------FCQICSKSGHGALDC
        +Q    P++              NG       +  R     N      G  SSN  S F + S                       CQIC K GH A+DC
Subjt:  LQLHAIPYTYD------------NGDCSKPYLSGKRTMRSDNIL----GSPSSNLNSGFGSGSI---------------------FCQICSKSGHGALDC

Query:  YNRMNFSYQGRYPPAQLAAMADK-----------------VSDKILYTGENINGLYPIPSPSMLSSD------------MHPKNFN---FMAKQECSFW-
        Y+RM+F+YQG+ P  +LAAMA                    SD I  +  N++   P      +S              +H  + +     A      W 
Subjt:  YNRMNFSYQGRYPPAQLAAMADK-----------------VSDKILYTGENINGLYPIPSPSMLSSD------------MHPKNFN---FMAKQECSFW-

Query:  -----HHRFGHPAPKILRSSLSRLDSPKPATPSHPDITPLLTLLHTSPNPSSPSPPTQLSTVDTNSSPQEILHVSFSNIADLPTDESSDEHSVSQNVHIP
              H   H     +  SLS    P P   SHP     +     +PNP+  S PT  S +   +SP             LPTD +S            
Subjt:  -----HHRFGHPAPKILRSSLSRLDSPKPATPSHPDITPLLTLLHTSPNPSSPSPPTQLSTVDTNSSPQEILHVSFSNIADLPTDESSDEHSVSQNVHIP

Query:  ASTVNNHLMQTRAKLEIFKPKVFVSTITTSVPIDPPSYSVASKYSEWRSAMCEEFNALQEQVTWSLVPCLPSMNVVGCKWVFRTKYNTDGTIARYKARLV
        +    +H MQTR+K  IFKPKV  +    S   +P SY+ ASK+ +W +AM EEF AL +Q TW+LVP  PS N+VGCKWV++ KY++DGT+AR+KA+LV
Subjt:  ASTVNNHLMQTRAKLEIFKPKVFVSTITTSVPIDPPSYSVASKYSEWRSAMCEEFNALQEQVTWSLVPCLPSMNVVGCKWVFRTKYNTDGTIARYKARLV

Query:  AKGYHQVEGFDFTETFSPVVKEPTIRVIIALAANYQWSLTQLDVKNVFLHGHLKEEVYMSQPPGFLDKSSPHHVYRLHKSLYG--QAPRAWFNHFTSYLF
        AKG+HQ  G DF ETFSPVVK PT+R+I++LA +  WSL QLDVKN FLHG LKEEVYM+QP G++D   P HV +L KS+YG  QAPRAWF  FTS L 
Subjt:  AKGYHQVEGFDFTETFSPVVKEPTIRVIIALAANYQWSLTQLDVKNVFLHGHLKEEVYMSQPPGFLDKSSPHHVYRLHKSLYG--QAPRAWFNHFTSYLF

Query:  TLGFTAFAADSSLFVRSVGSSLTYLLLYVDDIVIT
         LGFTA  ADSSLF+    S + YLLLYVDDIV+T
Subjt:  TLGFTAFAADSSLFVRSVGSSLTYLLLYVDDIVIT

A0A2N9FVX0 Reverse transcriptase Ty1/copia-type domain-containing protein7.2e-11334.02Show/hide
Query:  SSSSTISESSSSATSSPMFLLSNICNFVPLRLDSTNYVLWKFQVSSILKAHSLFGHIDDSLPKPTQYILTLAG-------------QQRQRS----VSIT
        +S  T + SSSS   +P+FLLSNI  +V ++LD +N++ WKFQ++ IL+A+SL  +++     P ++++   G             Q R ++    +S T
Subjt:  SSSSTISESSSSATSSPMFLLSNICNFVPLRLDSTNYVLWKFQVSSILKAHSLFGHIDDSLPKPTQYILTLAG-------------QQRQRS----VSIT

Query:  FE---------------------KRYASNTRSSILDLRSALYKIKKLSSKSIEKYTYRIKEIVDKLVAALVKIEDEEILVHTLNGLPAEFNAFRTSIRTR
                               KRY S +RS+IL+L+  L+ +KK ++ ++ +Y  RIKE  DKL A    ++DE++L   L GLP+E+ +F T++ T+
Subjt:  FE---------------------KRYASNTRSSILDLRSALYKIKKLSSKSIEKYTYRIKEIVDKLVAALVKIEDEEILVHTLNGLPAEFNAFRTSIRTR

Query:  SGTLSLEELHVLLAAEEQTLQLH--------------------AIPYTYDN--------------------------------GDCSKPYLSGKR-----
        S  +S EELHVL+  +E+ L+                      + P  ++N                                   ++ Y+SG       
Subjt:  SGTLSLEELHVLLAAEEQTLQLH--------------------AIPYTYDN--------------------------------GDCSKPYLSGKR-----

Query:  -----TMRSDNILGSPSSNLNSGFGSGSIFCQICSKSGHGALDCYNRMNFSYQGRYPPAQLAAMA-----------------------------------
             T  + N   S ++  N+        CQIC K GH ALDCYNRMNFSYQGR+PPA+LAA+A                                   
Subjt:  -----TMRSDNILGSPSSNLNSGFGSGSIFCQICSKSGHGALDCYNRMNFSYQGRYPPAQLAAMA-----------------------------------

Query:  ------------------------------------------------DKVSDKILYTGENINGLYPIPSPSM-LSSDMHPKNFNFMA---------KQE
                                                        D+++ K LY G + +GLYP+   S+ L S   P + +  A            
Subjt:  ------------------------------------------------DKVSDKILYTGENINGLYPIPSPSM-LSSDMHPKNFNFMA---------KQE

Query:  CSFWHHRFGHPAPKILRSSLSRLDSPKPATPSH-----PDITPLLTLLHTSPNPS-----SPSPPTQLSTVDTNSS--------PQEILHVSFSNIADLP
         + WH RFGHP  ++LR  L +  +P  +  SH       + P  ++L   P+PS     SPSP   +S+++ +S+        P  +   +  N  + P
Subjt:  CSFWHHRFGHPAPKILRSSLSRLDSPKPATPSH-----PDITPLLTLLHTSPNPS-----SPSPPTQLSTVDTNSS--------PQEILHVSFSNIADLP

Query:  TDES----------------SDEHSVSQNVHIPASTVNNHLMQTRAKLEIFKPKVFVSTITTSVP----IDPPSYSVASKYSEWRSAMCEEFNALQEQVT
        T+ +                +  +  S ++  P S  N H M TR+K  I K K+  +T T   P     +PP+ ++ SK  EW +AM +EF+ALQ Q T
Subjt:  TDES----------------SDEHSVSQNVHIPASTVNNHLMQTRAKLEIFKPKVFVSTITTSVP----IDPPSYSVASKYSEWRSAMCEEFNALQEQVT

Query:  WSLVPCLPSMNVVGCKWVFRTKYNTDGTIARYKARLVAKGYHQVEGFDFTETFSPVVKEPTIRVIIALAANYQWSLTQLDVKNVFLHGHLKEEVYMSQPP
        WSLVP     NV+GC+WVF+ K N+DG+I+RYKARLVAKG+HQ  G DF ETFSPVVK PT+R++++LAA +QW L QLD+ N FLHG LKE+V+M QPP
Subjt:  WSLVPCLPSMNVVGCKWVFRTKYNTDGTIARYKARLVAKGYHQVEGFDFTETFSPVVKEPTIRVIIALAANYQWSLTQLDVKNVFLHGHLKEEVYMSQPP

Query:  GFLDKSSPHHVYRLHKSLYG--QAPRAWFNHFTSYLFTLGFTAFAADSSLFVRSVGSSLTYLLLYVDDIVITG
        GF+    P+HV +LHKSLYG  QAPRAWF  FTS+L T+GFTA  AD SLFV   GS+L YLLLYVDDI++TG
Subjt:  GFLDKSSPHHVYRLHKSLYG--QAPRAWFNHFTSYLFTLGFTAFAADSSLFVRSVGSSLTYLLLYVDDIVITG

A0A2N9J0P4 Uncharacterized protein1.5e-11035.53Show/hide
Query:  TSSPMFLLSNICNFVPLRLDSTNYVLWKFQVSSILKAHSLFGHIDDSLPKPTQYIL--------------------------------------TLAGQQ
        T +P FLLSN  N+V ++LD TNY++WKFQ++ IL A+SL  H++D +P P++++L                                       + GQ 
Subjt:  TSSPMFLLSNICNFVPLRLDSTNYVLWKFQVSSILKAHSLFGHIDDSLPKPTQYIL--------------------------------------TLAGQQ

Query:  RQRSVSITFEKRYASNTRSSILDLRSALYKIKKLSSKSIEKYTYRIKEIVDKLVAALVKIEDEEILVHTLNGLPAEFNAFRTSIRTRSGTLSLEELHVLL
            +      RY S +RSSI++L+  L+ IKK  S S+ +Y  +IKE  DKLV+  V I+DEEIL   L GLP E+++F +++ T++  +  EELH L+
Subjt:  RQRSVSITFEKRYASNTRSSILDLRSALYKIKKLSSKSIEKYTYRIKEIVDKLVAALVKIEDEEILVHTLNGLPAEFNAFRTSIRTRSGTLSLEELHVLL

Query:  AAEEQTLQL---HAIPYTYDNGDCSKPYLSGKRTM-------------------------RSDNILGSPSSNLNSGFGSGS-------------IFCQIC
          EE  L+    ++    +     SK + S                              RS    G  + N NS F   S               CQIC
Subjt:  AAEEQTLQL---HAIPYTYDNGDCSKPYLSGKRTM-------------------------RSDNILGSPSSNLNSGFGSGS-------------IFCQIC

Query:  SKSGHGALDCYNRMNFSYQGRYPPAQLAAMA----------------------------DKVSDKILYTGENI----NGLYPIPSPSMLSSDMHPKNFNF
         K GH A+DCY RMN+++QGR+P A+LAAMA                              + D   YT   +    NG + +P   + +S +   N+ F
Subjt:  SKSGHGALDCYNRMNFSYQGRYPPAQLAAMA----------------------------DKVSDKILYTGENI----NGLYPIPSPSMLSSDMHPKNFNF

Query:  MAKQ-----------------------ECSFWHHRF-------GHPAPKILRS----------------------SLSRLDSPKPATPSHPDITPLLTLL
          ++                         SF  H F       G P  K +                        S SR  SP  A+P      P     
Subjt:  MAKQ-----------------------ECSFWHHRF-------GHPAPKILRS----------------------SLSRLDSPKPATPSHPDITPLLTLL

Query:  HTSPNPSSPSPPTQLSTVDTNSSPQEILH-VSFSNI--------------ADLPTDESSDEHSVSQNVH----------IPASTV-NNHLMQTRAKLEIF
        + S   S+  PP   S +  +  P    H +S +N+                    +S   ++ SQ +           IP+  + ++H MQTR+K  I 
Subjt:  HTSPNPSSPSPPTQLSTVDTNSSPQEILH-VSFSNI--------------ADLPTDESSDEHSVSQNVH----------IPASTV-NNHLMQTRAKLEIF

Query:  KPKVFVSTITTS-VPIDPPSYSVASKYSEWRSAMCEEFNALQEQVTWSLVPCLPSMNVVGCKWVFRTKYNTDGTIARYKARLVAKGYHQVEGFDFTETFS
        K K F S+ T + +  +PP+Y++ASK  EWR AM  EF AL  Q TWSLVP  P  N++GC+WV++ K NTDG+++RYKARLVAKG+HQ  G DF ETFS
Subjt:  KPKVFVSTITTS-VPIDPPSYSVASKYSEWRSAMCEEFNALQEQVTWSLVPCLPSMNVVGCKWVFRTKYNTDGTIARYKARLVAKGYHQVEGFDFTETFS

Query:  PVVKEPTIRVIIALAANYQWSLTQLDVKNVFLHGHLKEEVYMSQPPGFLDKSSPHHVYRLHKSLYG--QAPRAWFNHFTSYLFTLGFTAFAADSSLFVRS
        PVVK PT+R+I++LAA  QWSL QLDV N FLHG LKE VYM+QP GF+D + P HV +LHKSLYG  QAPRAWF  FTS+L TLGF+A  AD+SLF+  
Subjt:  PVVKEPTIRVIIALAANYQWSLTQLDVKNVFLHGHLKEEVYMSQPPGFLDKSSPHHVYRLHKSLYG--QAPRAWFNHFTSYLFTLGFTAFAADSSLFVRS

Query:  VGSSLTYLLLYVDDIVITG
         GS+  YLLLYVDDI+ITG
Subjt:  VGSSLTYLLLYVDDIVITG

SwissProt top hitse value%identityAlignment
P04146 Copia protein3.9e-3138.14Show/hide
Query:  TITTSVPIDPPSYSVASKYSEWRSAMCEEFNALQEQVTWSLVPCLPSMNVVGCKWVFRTKYNTDGTIARYKARLVAKGYHQVEGFDFTETFSPVVKEPTI
        TI   VP            S W  A+  E NA +   TW++     + N+V  +WVF  KYN  G   RYKARLVA+G+ Q    D+ ETF+PV +  + 
Subjt:  TITTSVPIDPPSYSVASKYSEWRSAMCEEFNALQEQVTWSLVPCLPSMNVVGCKWVFRTKYNTDGTIARYKARLVAKGYHQVEGFDFTETFSPVVKEPTI

Query:  RVIIALAANYQWSLTQLDVKNVFLHGHLKEEVYMSQPPGFLDKSSPHHVYRLHKSLYG--QAPRAWFNHFTSYLFTLGFTAFAADSSLFVRSVG--SSLT
        R I++L   Y   + Q+DVK  FL+G LKEE+YM  P G    S   +V +L+K++YG  QA R WF  F   L    F   + D  +++   G  +   
Subjt:  RVIIALAANYQWSLTQLDVKNVFLHGHLKEEVYMSQPPGFLDKSSPHHVYRLHKSLYG--QAPRAWFNHFTSYLFTLGFTAFAADSSLFVRSVG--SSLT

Query:  YLLLYVDDIVITGPD
        Y+LLYVDD+VI   D
Subjt:  YLLLYVDDIVITGPD

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-944.4e-3536.04Show/hide
Query:  ITPLLTLLHTSPNPSSPSPPTQLSTVDTNSSPQEILHVSFSNIADLPTDESSDEHSVSQNVHIPASTVNNHLMQTRAKLEIFKPKVFVST--ITTSVPID
        I   +T+  TS NP+S    T     +    P E++             E  DE    + V  P      H    R++    + + + ST  +  S   +
Subjt:  ITPLLTLLHTSPNPSSPSPPTQLSTVDTNSSPQEILHVSFSNIADLPTDESSDEHSVSQNVHIPASTVNNHLMQTRAKLEIFKPKVFVST--ITTSVPID

Query:  PPSYSVASKYSE---WRSAMCEEFNALQEQVTWSLVPCLPSMNVVGCKWVFRTKYNTDGTIARYKARLVAKGYHQVEGFDFTETFSPVVKEPTIRVIIAL
        P S      + E      AM EE  +LQ+  T+ LV        + CKWVF+ K + D  + RYKARLV KG+ Q +G DF E FSPVVK  +IR I++L
Subjt:  PPSYSVASKYSE---WRSAMCEEFNALQEQVTWSLVPCLPSMNVVGCKWVFRTKYNTDGTIARYKARLVAKGYHQVEGFDFTETFSPVVKEPTIRVIIAL

Query:  AANYQWSLTQLDVKNVFLHGHLKEEVYMSQPPGFLDKSSPHHVYRLHKSLYG--QAPRAWFNHFTSYLFTLGFTAFAADSSL-FVRSVGSSLTYLLLYVD
        AA+    + QLDVK  FLHG L+EE+YM QP GF      H V +L+KSLYG  QAPR W+  F S++ +  +    +D  + F R   ++   LLLYVD
Subjt:  AANYQWSLTQLDVKNVFLHGHLKEEVYMSQPPGFLDKSSPHHVYRLHKSLYG--QAPRAWFNHFTSYLFTLGFTAFAADSSL-FVRSVGSSLTYLLLYVD

Query:  DIVITGPD
        D++I G D
Subjt:  DIVITGPD

P92520 Uncharacterized mitochondrial protein AtMg008204.2e-2551.59Show/hide
Query:  MQTRAKLEIFK--PKVFVSTITTSVPIDPPSYSVASKYSEWRSAMCEEFNALQEQVTWSLVPCLPSMNVVGCKWVFRTKYNTDGTIARYKARLVAKGYHQ
        M TR+K  I K  PK +  TITT++  +P S   A K   W  AM EE +AL    TW LVP   + N++GCKWVF+TK ++DGT+ R KARLVAKG+HQ
Subjt:  MQTRAKLEIFK--PKVFVSTITTSVPIDPPSYSVASKYSEWRSAMCEEFNALQEQVTWSLVPCLPSMNVVGCKWVFRTKYNTDGTIARYKARLVAKGYHQ

Query:  VEGFDFTETFSPVVKEPTIRVIIALA
         EG  F ET+SPVV+  TIR I+ +A
Subjt:  VEGFDFTETFSPVVKEPTIRVIIALA

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE18.0e-6140.95Show/hide
Query:  FGHPAPKILRSSLSRLDSPKPATPSHPDITPLLTLLHTSPNPSSPSPPTQ----LSTVDTNSSPQEILHVSFSNIADLPTDESSDEHS-------VSQNV
        F    P     +  R + P+P T      T   +  +TS N  +   P+Q    LST   +SS       S S+ +  PT  S   H        V+ N 
Subjt:  FGHPAPKILRSSLSRLDSPKPATPSHPDITPLLTLLHTSPNPSSPSPPTQ----LSTVDTNSSPQEILHVSFSNIADLPTDESSDEHS-------VSQNV

Query:  HIPASTVNNHLMQTRAKLEIFKPKVFVS-TITTSVPIDPPSYSVASKYSEWRSAMCEEFNALQEQVTWSLVPCLPS-MNVVGCKWVFRTKYNTDGTIARY
          P   +N H M TRAK  I KP    S  ++ +   +P +   A K   WR+AM  E NA     TW LVP  PS + +VGC+W+F  KYN+DG++ RY
Subjt:  HIPASTVNNHLMQTRAKLEIFKPKVFVS-TITTSVPIDPPSYSVASKYSEWRSAMCEEFNALQEQVTWSLVPCLPS-MNVVGCKWVFRTKYNTDGTIARY

Query:  KARLVAKGYHQVEGFDFTETFSPVVKEPTIRVIIALAANYQWSLTQLDVKNVFLHGHLKEEVYMSQPPGFLDKSSPHHVYRLHKSLYG--QAPRAWFNHF
        KARLVAKGY+Q  G D+ ETFSPV+K  +IR+++ +A +  W + QLDV N FL G L ++VYMSQPPGF+DK  P++V +L K+LYG  QAPRAW+   
Subjt:  KARLVAKGYHQVEGFDFTETFSPVVKEPTIRVIIALAANYQWSLTQLDVKNVFLHGHLKEEVYMSQPPGFLDKSSPHHVYRLHKSLYG--QAPRAWFNHF

Query:  TSYLFTLGFTAFAADSSLFVRSVGSSLTYLLLYVDDIVITGPDSFISISPKEAVGNRVS
         +YL T+GF    +D+SLFV   G S+ Y+L+YVDDI+ITG D  +  +  + +  R S
Subjt:  TSYLFTLGFTAFAADSSLFVRSVGSSLTYLLLYVDDIVITGPDSFISISPKEAVGNRVS

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE23.1e-6041.38Show/hide
Query:  DSPKPATPSHP-----DITPLLTLLHTSPNPSSPSP--PTQLSTVDTNSSPQEILHVSFSNIADLPTDESSDEHSVSQNVHIPA---------STVNNHL
        + P+P    H        +P+L     +PNP+SPSP  P Q S +  +      +    ++I++  +  SS   +      +PA         + VN H 
Subjt:  DSPKPATPSHP-----DITPLLTLLHTSPNPSSPSP--PTQLSTVDTNSSPQEILHVSFSNIADLPTDESSDEHSVSQNVHIPA---------STVNNHL

Query:  MQTRAKLEIFKPKVFVSTITTSVPIDPPSYSV-ASKYSEWRSAMCEEFNALQEQVTWSLV-PCLPSMNVVGCKWVFRTKYNTDGTIARYKARLVAKGYHQ
        M TRAK  I KP    S  T+      P  ++ A K   WR AM  E NA     TW LV P  PS+ +VGC+W+F  K+N+DG++ RYKARLVAKGY+Q
Subjt:  MQTRAKLEIFKPKVFVSTITTSVPIDPPSYSV-ASKYSEWRSAMCEEFNALQEQVTWSLV-PCLPSMNVVGCKWVFRTKYNTDGTIARYKARLVAKGYHQ

Query:  VEGFDFTETFSPVVKEPTIRVIIALAANYQWSLTQLDVKNVFLHGHLKEEVYMSQPPGFLDKSSPHHVYRLHKSLYG--QAPRAWFNHFTSYLFTLGFTA
          G D+ ETFSPV+K  +IR+++ +A +  W + QLDV N FL G L +EVYMSQPPGF+DK  P +V RL K++YG  QAPRAW+    +YL T+GF  
Subjt:  VEGFDFTETFSPVVKEPTIRVIIALAANYQWSLTQLDVKNVFLHGHLKEEVYMSQPPGFLDKSSPHHVYRLHKSLYG--QAPRAWFNHFTSYLFTLGFTA

Query:  FAADSSLFVRSVGSSLTYLLLYVDDIVITGPDSFISISPKEAVGNRVS
          +D+SLFV   G S+ Y+L+YVDDI+ITG D+ +     +A+  R S
Subjt:  FAADSSLFVRSVGSSLTYLLLYVDDIVITGPDSFISISPKEAVGNRVS

Arabidopsis top hitse value%identityAlignment
AT1G34070.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)2.4e-0421.69Show/hide
Query:  LSNICNFVPLRLD--STNYVLWKFQVSSILKAHSLFGHIDDSL------------------------PKPTQYILTLAGQQRQRSVSITFEKRYASNTRS
        +SNI + +P+ LD   +NY  W+    +   +  + GHID +L                          P Q+  +       R + +  + ++ +N  +
Subjt:  LSNICNFVPLRLD--STNYVLWKFQVSSILKAHSLFGHIDDSL------------------------PKPTQYILTLAGQQRQRSVSITFEKRYASNTRS

Query:  SILDLRSALYKIKKLSSKSIEKYTYRIKEIVDKLVAALVKIEDEEILVHTLNGLPAEFNAFRTSIRTRSGTLSLEELHVLLAAEEQTLQ
          L L S L + K +    +  Y  ++K++ D L    V + D  ++++ LNGL  +F+     I+ R    S ++   +L  EE  L+
Subjt:  SILDLRSALYKIKKLSSKSIEKYTYRIKEIVDKLVAALVKIEDEEILVHTLNGLPAEFNAFRTSIRTRSGTLSLEELHVLLAAEEQTLQ

AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 89.8e-4644.88Show/hide
Query:  DPPSYSVASKYSEWRSAMCEEFNALQEQVTWSLVPCLPSMNVVGCKWVFRTKYNTDGTIARYKARLVAKGYHQVEGFDFTETFSPVVKEPTIRVIIALAA
        +P +Y+ A ++  W  AM +E  A++   TW +    P+   +GCKWV++ KYN+DGTI RYKARLVAKGY Q EG DF ETFSPV K  ++++I+A++A
Subjt:  DPPSYSVASKYSEWRSAMCEEFNALQEQVTWSLVPCLPSMNVVGCKWVFRTKYNTDGTIARYKARLVAKGYHQVEGFDFTETFSPVVKEPTIRVIIALAA

Query:  NYQWSLTQLDVKNVFLHGHLKEEVYMSQPPGFL----DKSSPHHVYRLHKSLYG--QAPRAWFNHFTSYLFTLGFTAFAADSSLFVRSVGSSLTYLLLYV
         Y ++L QLD+ N FL+G L EE+YM  PPG+     D   P+ V  L KS+YG  QA R WF  F+  L   GF    +D + F++   +    +L+YV
Subjt:  NYQWSLTQLDVKNVFLHGHLKEEVYMSQPPGFL----DKSSPHHVYRLHKSLYG--QAPRAWFNHFTSYLFTLGFTAFAADSSLFVRSVGSSLTYLLLYV

Query:  DDIVI
        DDI+I
Subjt:  DDIVI

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)3.0e-2651.59Show/hide
Query:  MQTRAKLEIFK--PKVFVSTITTSVPIDPPSYSVASKYSEWRSAMCEEFNALQEQVTWSLVPCLPSMNVVGCKWVFRTKYNTDGTIARYKARLVAKGYHQ
        M TR+K  I K  PK +  TITT++  +P S   A K   W  AM EE +AL    TW LVP   + N++GCKWVF+TK ++DGT+ R KARLVAKG+HQ
Subjt:  MQTRAKLEIFK--PKVFVSTITTSVPIDPPSYSVASKYSEWRSAMCEEFNALQEQVTWSLVPCLPSMNVVGCKWVFRTKYNTDGTIARYKARLVAKGYHQ

Query:  VEGFDFTETFSPVVKEPTIRVIIALA
         EG  F ET+SPVV+  TIR I+ +A
Subjt:  VEGFDFTETFSPVVKEPTIRVIIALA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGGGTGGATTTGGGGTTTTTCAGTGTGGGTTTGGTGTGTGGTTTAAGAAGAAGAAGATGGGTATTTAGGGTGGGTTTGAAGTTTTGTTTAAGAAGAAGAAGAGGTGG
GGGTGGGTTTGGTTTTGAAGAAGAAGAAGAGGAGGGAGGTGGGGCGGGGTGGGGTTGGGTATATAGTTTTAAAGAAGAAGAAAAGGAGAAGGGGAAGAAGAAGTATGCGG
CGGTTGCGAACTATCGACGGCGGCGGCGGTGGGCTATTGCCGGCGAGAAGGGGAAGGGAGTTGATGGAATTTTCGTGCTTCTTCATCATTACAATATTGAAGGAGTGTAC
AACACTCGTGCGGAAGCGATTTCGAATCTTAGGCACTTAGTTTACATGAATTTCTCAATCAAAATACTCTATTTCGGTTCAATGGCTAGCATAGTGGCTTCATCTTCATC
TACCATCTCTGAATCATCTTCGTCCGCTACCTCCTCACCGATGTTTCTCTTATCAAATATTTGCAATTTTGTGCCGCTTCGTTTGGATTCAACGAATTATGTGCTGTGGA
AATTTCAAGTCTCCTCAATCTTGAAAGCTCACTCTCTCTTCGGGCATATCGACGACTCACTACCCAAACCTACCCAATACATTCTTACTCTTGCAGGACAGCAACGACAG
AGGTCAGTCTCGATTACATTCGAAAAACGATATGCATCCAACACTAGATCTAGCATTCTTGATTTACGATCTGCCTTGTATAAAATCAAGAAACTCTCTTCAAAATCCAT
TGAAAAATATACCTATCGCATTAAGGAAATTGTTGATAAACTAGTTGCTGCTTTGGTTAAAATTGAGGATGAAGAAATTCTTGTGCATACCTTGAATGGTCTTCCAGCTG
AGTTCAACGCCTTCCGCACATCAATCCGAACACGAAGTGGAACCCTATCTTTAGAAGAACTTCACGTTCTGTTGGCGGCTGAAGAACAAACACTACAACTCCATGCAATA
CCCTATACCTACGACAATGGTGACTGTTCAAAACCCTACCTTTCGGGGAAGAGGACGATGCGGTCTGATAATATTTTGGGCTCTCCCTCCTCAAATTTAAATTCTGGTTT
TGGATCAGGCAGTATTTTCTGTCAAATATGTTCAAAATCTGGACATGGGGCTCTTGATTGCTACAATCGGATGAATTTTTCCTATCAGGGTCGTTATCCACCAGCTCAAT
TGGCTGCTATGGCGGATAAAGTCTCTGACAAAATCCTATACACTGGTGAAAACATCAATGGCCTTTATCCCATCCCAAGTCCATCCATGCTATCTTCTGATATGCACCCC
AAAAACTTTAATTTTATGGCAAAACAGGAGTGTTCTTTTTGGCATCATCGGTTTGGTCATCCCGCTCCCAAAATATTACGCTCTAGTTTATCTCGTCTTGATTCTCCAAA
ACCTGCCACACCTTCACATCCTGACATAACACCCTTACTTACCCTTTTACATACATCTCCCAACCCATCTTCACCTTCCCCACCTACCCAATTATCCACTGTTGACACTA
ATTCTTCTCCTCAAGAGATTTTGCATGTTTCATTTTCTAATATTGCAGATTTACCAACAGATGAGTCATCTGATGAGCATTCAGTATCCCAAAATGTTCACATTCCTGCA
AGCACTGTCAATAATCATCTGATGCAAACACGAGCTAAGTTAGAAATTTTCAAGCCAAAGGTGTTTGTTTCCACCATAACCACTTCAGTTCCAATAGATCCTCCTTCTTA
TTCTGTTGCTTCGAAGTATTCAGAATGGAGATCCGCTATGTGTGAGGAATTTAATGCCCTTCAGGAACAAGTTACGTGGTCCTTAGTACCTTGTTTACCTTCCATGAATG
TTGTGGGTTGCAAATGGGTCTTTAGGACTAAATATAACACTGATGGCACTATTGCTCGATACAAAGCCAGATTAGTTGCTAAAGGATATCATCAGGTCGAAGGGTTTGAC
TTTACTGAAACCTTCAGTCCAGTTGTTAAAGAGCCCACAATCAGAGTTATCATTGCTCTTGCTGCTAATTATCAATGGTCTTTAACCCAACTGGATGTAAAGAATGTCTT
TCTACATGGTCACTTAAAAGAGGAAGTTTATATGTCCCAACCTCCTGGCTTTCTTGACAAATCCAGTCCACATCATGTTTATCGCCTTCATAAAAGTCTATATGGGCAAG
CTCCTCGAGCTTGGTTCAATCACTTTACATCATATTTGTTCACCTTGGGATTTACTGCTTTTGCTGCTGATTCATCCTTATTTGTACGCTCAGTTGGATCTTCTTTGACA
TATCTGTTACTTTATGTTGATGATATCGTTATCACTGGACCAGATTCTTTCATATCTATCAGTCCTAAAGAAGCAGTTGGCAACAGAGTTTCAGATATCTGA
mRNA sequenceShow/hide mRNA sequence
ATGAGGGTGGATTTGGGGTTTTTCAGTGTGGGTTTGGTGTGTGGTTTAAGAAGAAGAAGATGGGTATTTAGGGTGGGTTTGAAGTTTTGTTTAAGAAGAAGAAGAGGTGG
GGGTGGGTTTGGTTTTGAAGAAGAAGAAGAGGAGGGAGGTGGGGCGGGGTGGGGTTGGGTATATAGTTTTAAAGAAGAAGAAAAGGAGAAGGGGAAGAAGAAGTATGCGG
CGGTTGCGAACTATCGACGGCGGCGGCGGTGGGCTATTGCCGGCGAGAAGGGGAAGGGAGTTGATGGAATTTTCGTGCTTCTTCATCATTACAATATTGAAGGAGTGTAC
AACACTCGTGCGGAAGCGATTTCGAATCTTAGGCACTTAGTTTACATGAATTTCTCAATCAAAATACTCTATTTCGGTTCAATGGCTAGCATAGTGGCTTCATCTTCATC
TACCATCTCTGAATCATCTTCGTCCGCTACCTCCTCACCGATGTTTCTCTTATCAAATATTTGCAATTTTGTGCCGCTTCGTTTGGATTCAACGAATTATGTGCTGTGGA
AATTTCAAGTCTCCTCAATCTTGAAAGCTCACTCTCTCTTCGGGCATATCGACGACTCACTACCCAAACCTACCCAATACATTCTTACTCTTGCAGGACAGCAACGACAG
AGGTCAGTCTCGATTACATTCGAAAAACGATATGCATCCAACACTAGATCTAGCATTCTTGATTTACGATCTGCCTTGTATAAAATCAAGAAACTCTCTTCAAAATCCAT
TGAAAAATATACCTATCGCATTAAGGAAATTGTTGATAAACTAGTTGCTGCTTTGGTTAAAATTGAGGATGAAGAAATTCTTGTGCATACCTTGAATGGTCTTCCAGCTG
AGTTCAACGCCTTCCGCACATCAATCCGAACACGAAGTGGAACCCTATCTTTAGAAGAACTTCACGTTCTGTTGGCGGCTGAAGAACAAACACTACAACTCCATGCAATA
CCCTATACCTACGACAATGGTGACTGTTCAAAACCCTACCTTTCGGGGAAGAGGACGATGCGGTCTGATAATATTTTGGGCTCTCCCTCCTCAAATTTAAATTCTGGTTT
TGGATCAGGCAGTATTTTCTGTCAAATATGTTCAAAATCTGGACATGGGGCTCTTGATTGCTACAATCGGATGAATTTTTCCTATCAGGGTCGTTATCCACCAGCTCAAT
TGGCTGCTATGGCGGATAAAGTCTCTGACAAAATCCTATACACTGGTGAAAACATCAATGGCCTTTATCCCATCCCAAGTCCATCCATGCTATCTTCTGATATGCACCCC
AAAAACTTTAATTTTATGGCAAAACAGGAGTGTTCTTTTTGGCATCATCGGTTTGGTCATCCCGCTCCCAAAATATTACGCTCTAGTTTATCTCGTCTTGATTCTCCAAA
ACCTGCCACACCTTCACATCCTGACATAACACCCTTACTTACCCTTTTACATACATCTCCCAACCCATCTTCACCTTCCCCACCTACCCAATTATCCACTGTTGACACTA
ATTCTTCTCCTCAAGAGATTTTGCATGTTTCATTTTCTAATATTGCAGATTTACCAACAGATGAGTCATCTGATGAGCATTCAGTATCCCAAAATGTTCACATTCCTGCA
AGCACTGTCAATAATCATCTGATGCAAACACGAGCTAAGTTAGAAATTTTCAAGCCAAAGGTGTTTGTTTCCACCATAACCACTTCAGTTCCAATAGATCCTCCTTCTTA
TTCTGTTGCTTCGAAGTATTCAGAATGGAGATCCGCTATGTGTGAGGAATTTAATGCCCTTCAGGAACAAGTTACGTGGTCCTTAGTACCTTGTTTACCTTCCATGAATG
TTGTGGGTTGCAAATGGGTCTTTAGGACTAAATATAACACTGATGGCACTATTGCTCGATACAAAGCCAGATTAGTTGCTAAAGGATATCATCAGGTCGAAGGGTTTGAC
TTTACTGAAACCTTCAGTCCAGTTGTTAAAGAGCCCACAATCAGAGTTATCATTGCTCTTGCTGCTAATTATCAATGGTCTTTAACCCAACTGGATGTAAAGAATGTCTT
TCTACATGGTCACTTAAAAGAGGAAGTTTATATGTCCCAACCTCCTGGCTTTCTTGACAAATCCAGTCCACATCATGTTTATCGCCTTCATAAAAGTCTATATGGGCAAG
CTCCTCGAGCTTGGTTCAATCACTTTACATCATATTTGTTCACCTTGGGATTTACTGCTTTTGCTGCTGATTCATCCTTATTTGTACGCTCAGTTGGATCTTCTTTGACA
TATCTGTTACTTTATGTTGATGATATCGTTATCACTGGACCAGATTCTTTCATATCTATCAGTCCTAAAGAAGCAGTTGGCAACAGAGTTTCAGATATCTGA
Protein sequenceShow/hide protein sequence
MRVDLGFFSVGLVCGLRRRRWVFRVGLKFCLRRRRGGGGFGFEEEEEEGGGAGWGWVYSFKEEEKEKGKKKYAAVANYRRRRRWAIAGEKGKGVDGIFVLLHHYNIEGVY
NTRAEAISNLRHLVYMNFSIKILYFGSMASIVASSSSTISESSSSATSSPMFLLSNICNFVPLRLDSTNYVLWKFQVSSILKAHSLFGHIDDSLPKPTQYILTLAGQQRQ
RSVSITFEKRYASNTRSSILDLRSALYKIKKLSSKSIEKYTYRIKEIVDKLVAALVKIEDEEILVHTLNGLPAEFNAFRTSIRTRSGTLSLEELHVLLAAEEQTLQLHAI
PYTYDNGDCSKPYLSGKRTMRSDNILGSPSSNLNSGFGSGSIFCQICSKSGHGALDCYNRMNFSYQGRYPPAQLAAMADKVSDKILYTGENINGLYPIPSPSMLSSDMHP
KNFNFMAKQECSFWHHRFGHPAPKILRSSLSRLDSPKPATPSHPDITPLLTLLHTSPNPSSPSPPTQLSTVDTNSSPQEILHVSFSNIADLPTDESSDEHSVSQNVHIPA
STVNNHLMQTRAKLEIFKPKVFVSTITTSVPIDPPSYSVASKYSEWRSAMCEEFNALQEQVTWSLVPCLPSMNVVGCKWVFRTKYNTDGTIARYKARLVAKGYHQVEGFD
FTETFSPVVKEPTIRVIIALAANYQWSLTQLDVKNVFLHGHLKEEVYMSQPPGFLDKSSPHHVYRLHKSLYGQAPRAWFNHFTSYLFTLGFTAFAADSSLFVRSVGSSLT
YLLLYVDDIVITGPDSFISISPKEAVGNRVSDI