; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0005750 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0005750
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrotrans_gag domain-containing protein
Genome locationchr6:28163464..28177726
RNA-Seq ExpressionLag0005750
SyntenyLag0005750
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain
IPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7947748.1 hypothetical protein I3843_14G109500 [Carya illinoinensis]2.9e-5929.52Show/hide
Query:  IVYASINANNFELKTGLIQMARDCAYRGSPTEDPNSHLKSFWIFVGRIKHVIGC-----------------------SLLPWEHHHWDALVQAFLKKFFP
        I+   INANNFELK  LI M +   + GSP +DPN HL  F      +K + G                        SL P     W  + + FL KFFP
Subjt:  IVYASINANNFELKTGLIQMARDCAYRGSPTEDPNSHLKSFWIFVGRIKHVIGC-----------------------SLLPWEHHHWDALVQAFLKKFFP

Query:  PAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKTVENARTLLEDMATNSYQWPSERSTPK
        PAKT +LR+EIG F+Q   E L+EAWER+K+L+R+CPQHG PDWLQVQ+FYNGL   T+TIVDAA+GGTL+SKT E A  LLE+MA+N+YQWP+ER+  K
Subjt:  PAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKTVENARTLLEDMATNSYQWPSERSTPK

Query:  KIAAGVFEVDKVSALQAQMTSLANAFMKFSGTGSAQSIESAAALA-------------------------------------------------------
        K+ AG+ E++ ++AL AQ+ +L++     +     QS E  A+ +                                                       
Subjt:  KIAAGVFEVDKVSALQAQMTSLANAFMKFSGTGSAQSIESAAALA-------------------------------------------------------

Query:  -----SRPQEETLNM--------------FSKA----------------------------------------PAEQEKPQMEYCKAITVHQ-EESEEEP
             S+P E+ +++              F K                                         P+  E    E CKAIT+   +E E  P
Subjt:  -----SRPQEETLNM--------------FSKA----------------------------------------PAEQEKPQMEYCKAITVHQ-EESEEEP

Query:  ESEDYETPT----GEAEEDTSSDEAEKPNLE-------------PPIPSPTLMVPKEKKKKK--------------------------------------
          E   TPT    G++++    +E     LE             PPI +P L  P+  +K+K                                      
Subjt:  ESEDYETPT----GEAEEDTSSDEAEKPNLE-------------PPIPSPTLMVPKEKKKKK--------------------------------------

Query:  --KKKNNQHQSTT------------KVPEKVADPGSFSVPCSFA--------------------------------------PIGDQSVVRPVGIVENVL
          KK+  +   T             K+P+K+ DPGSF++PC+                                         + D+S+  P GI+E+VL
Subjt:  --KKKNNQHQSTT------------KVPEKVADPGSFSVPCSFA--------------------------------------PIGDQSVVRPVGIVENVL

Query:  IRVGRFFLPIDLYVMDMIENPSMSVILGRPFLATGRVIIDIERRELTIRVKNEKEIFKAVEDSK
        ++V +F  P D  V+DM E+  + +ILGRPFLATGR +ID+++ ELT+RV  E+ +F   +  K
Subjt:  IRVGRFFLPIDLYVMDMIENPSMSVILGRPFLATGRVIIDIERRELTIRVKNEKEIFKAVEDSK

KAG7990634.1 hypothetical protein I3843_02G035100 [Carya illinoinensis]5.9e-6029.41Show/hide
Query:  RYDRKRPDATVFPYSERAFEGVRFGTLESRKEDRTEHLEAENKGRDLEINPLDSIGIVYASINANNFELKTGLIQMARDCAYRGSPTEDPNSHLKSFWIF
        R  R R    V P  ER    +R   + +  E+  E L    K     +   +   I+   INANNFELK  LI M +   + GSP +DPN HL  F   
Subjt:  RYDRKRPDATVFPYSERAFEGVRFGTLESRKEDRTEHLEAENKGRDLEINPLDSIGIVYASINANNFELKTGLIQMARDCAYRGSPTEDPNSHLKSFWIF

Query:  VGRIKHVIGC-----------------------SLLPWEHHHWDALVQAFLKKFFPPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDW
           +K + G                        SL P     W  + + FL KFFPPAKT +LR+EIG F+Q   E L+EAWER+K+L+R+CPQHG PDW
Subjt:  VGRIKHVIGC-----------------------SLLPWEHHHWDALVQAFLKKFFPPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDW

Query:  LQVQLFYNGLTPSTKTIVDAAAGGTLLSKTVENARTLLEDMATNSYQWPSERSTPKKIAAGVFEVDKVSALQAQMTSLANAFMKFSGTGSAQSIESAAAL
        LQVQ+FYNGL   T+TIVDAA+GGTL+SKT E A  LLE+MA+N+YQWP+ER+  KK+ AG+ +++ ++AL AQ+ +L++     +     QS E  A+ 
Subjt:  LQVQLFYNGLTPSTKTIVDAAAGGTLLSKTVENARTLLEDMATNSYQWPSERSTPKKIAAGVFEVDKVSALQAQMTSLANAFMKFSGTGSAQSIESAAAL

Query:  A------------------------------------------------------------SRPQE--------------ETLNMFSKA-----------
        +                                                            S+P E              ET   F K            
Subjt:  A------------------------------------------------------------SRPQE--------------ETLNMFSKA-----------

Query:  -----------------------------PAEQEKPQMEYCKAITVHQ-EESEEEPESEDYETPT----GEAEEDTSSDEAEKPNLE-------------
                                     P+  E    E CKAIT+   +E E  P  E   TPT    G+++     DE     LE             
Subjt:  -----------------------------PAEQEKPQMEYCKAITVHQ-EESEEEPESEDYETPT----GEAEEDTSSDEAEKPNLE-------------

Query:  PPIPSPTLMVPKEKKKKK----------------------------------------KKKNNQHQSTT------------KVPEKVADPGSFSVPCSFA
        PPI +P L  P+  +K+K                                        KK+  +   T             K+P+K+ DPGSF++PC+  
Subjt:  PPIPSPTLMVPKEKKKKK----------------------------------------KKKNNQHQSTT------------KVPEKVADPGSFSVPCSFA

Query:  --------------------------------------PIGDQSVVRPVGIVENVLIRVGRFFLPIDLYVMDMIENPSMSVILGRPFLATGRVIIDIERR
                                               + D+S+  P GI+E+VL++V +F  P D  V+DM E+  + +ILGRPFLATGR +ID+++ 
Subjt:  --------------------------------------PIGDQSVVRPVGIVENVLIRVGRFFLPIDLYVMDMIENPSMSVILGRPFLATGRVIIDIERR

Query:  ELTIRVKNEKEIFK
        ELT+RV  E+ +FK
Subjt:  ELTIRVKNEKEIFK

WP_217833153.1 retrotransposon gag domain-containing protein, partial [Synechococcus sp. PCC 7002]1.8e-6453.52Show/hide
Query:  PLDSIGIVYASINANNFELKTGLIQMARDCAYRGSPTEDPNSHLKSFWIFVGRIK------HVIGCSLLPWEHHH----------------WDALVQAFL
        P    GI+   IN NNFELK GLIQMAR+ A+RG   EDP+ HL+SF    G +K        I   L P+                    W+ L QAFL
Subjt:  PLDSIGIVYASINANNFELKTGLIQMARDCAYRGSPTEDPNSHLKSFWIFVGRIK------HVIGCSLLPWEHHH----------------WDALVQAFL

Query:  KKFFPPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKTVENARTLLEDMATNSYQWPSE
         K+FPPAK+ +LRTEIGTF+Q  DEQL+EAWER+K+LLR+CPQHGYPDWLQ+QLFYNGL  STK+I+DA AGG++ SK  + A T+LED+AT SY WP E
Subjt:  KKFFPPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKTVENARTLLEDMATNSYQWPSE

Query:  RSTPK-KIAAGVFEVDKVSALQAQMTSLANAFMKFSGTGSAQ----SIESAAALAS
        R++P    AAG++EVD+V++L+AQM SL NA  K +  G AQ    SI S AALAS
Subjt:  RSTPK-KIAAGVFEVDKVSALQAQMTSLANAFMKFSGTGSAQ----SIESAAALAS

XP_022157708.1 uncharacterized protein LOC111024361 [Momordica charantia]6.0e-7337.52Show/hide
Query:  IGIVYASINANNFELKTGLIQMARDCAYRGSPTEDPNSHLKSFWIFVGRIK------HVIGCSLLPWEHHHWDALVQAFLKKFFPPAKTVKLRTEIGTFQ
        +GI+   INANN ELK GLIQM R+  +RG+ TEDPN+HL  F    G +K        I   L P      + +VQAFL  FFPPAKT +LRTEI +F+
Subjt:  IGIVYASINANNFELKTGLIQMARDCAYRGSPTEDPNSHLKSFWIFVGRIK------HVIGCSLLPWEHHHWDALVQAFLKKFFPPAKTVKLRTEIGTFQ

Query:  QQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKTVENARTLLEDMATNSYQWPSERSTPKKIAAGVFEVDKVSAL
        +   EQLFE WER+KELLRKCPQHG  +WLQ+Q+FYNGL   T+TI+DAAAGGTLLS+T ENA  LL+DMA NS+QWPSERS  KK+ AG++E+D++S+L
Subjt:  QQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKTVENARTLLEDMATNSYQWPSERSTPKKIAAGVFEVDKVSAL

Query:  QAQMTSLANAFMKFSGTGSAQSIESAAA---------------LASRPQEE-------------------------------------------------
        +AQ+ +L NA  K SG G++ S E  AA                 S P E+                                                 
Subjt:  QAQMTSLANAFMKFSGTGSAQSIESAAA---------------LASRPQEE-------------------------------------------------

Query:  ---TLNMFSKA--PAEQEKPQMEYCKAITVHQEESEEEPESEDYETPTGEAEEDTSSDE----------AEKPN----LEPP--IPSP------------
           TLN   K   P++ E    E+CKA+T+   +  +EPE +  E P    EE  + +E          A+KP       PP  +P P            
Subjt:  ---TLNMFSKA--PAEQEKPQMEYCKAITVHQEESEEEPESEDYETPTGEAEEDTSSDE----------AEKPN----LEPP--IPSP------------

Query:  ---TLMVPKEKKKKKKKKNNQHQSTT----KVPEKVADPGSFSVPCSFAPIGDQSV---------VRPVGIVENVLIRVGRFFLPIDLYVMDMIENPSMS
            +M  K K +  +  N   + +     K+P+K+ DPGSF++PC+ +                + P+G++E+VL++V R   P D  V+   E+  + 
Subjt:  ---TLMVPKEKKKKKKKKNNQHQSTT----KVPEKVADPGSFSVPCSFAPIGDQSV---------VRPVGIVENVLIRVGRFFLPIDLYVMDMIENPSMS

Query:  VILGRPFLATGRVIIDIERRELTIRVKNEKEIF
        +ILGR FLATG  +ID++   LT+RV  E  +F
Subjt:  VILGRPFLATGRVIIDIERRELTIRVKNEKEIF

XP_023929660.1 uncharacterized protein LOC112040975 [Quercus suber]6.5e-5930.6Show/hide
Query:  GIVYASINANNFELKTGLIQMARDCAYRGSPTEDPNSHLKSFWIFVGRIK------HVIGCSLLPWEHH----------------HWDALVQAFLKKFFP
        GI + +INANNFEL   LI M +   + GSP +DPN HL  F      +K        I   L P+                    W  + + FL KFFP
Subjt:  GIVYASINANNFELKTGLIQMARDCAYRGSPTEDPNSHLKSFWIFVGRIK------HVIGCSLLPWEHH----------------HWDALVQAFLKKFFP

Query:  PAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKTVENARTLLEDMATNSYQWPSERSTPK
        PAKT +LR+EIG F+Q   E L+EAWER+K+L+R CPQHG  DWLQVQ+FYNGL   T+TIVDAA+GGTL+SKT E A +LLE+MA+N YQWP+ER+  K
Subjt:  PAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKTVENARTLLEDMATNSYQWPSERSTPK

Query:  KIAAGVFEVDKVSALQAQMTSLANAFMKFSGTGSAQSIESAAALA-------------------------------------------------------
        K+ AG+ E++  +AL AQ+ SL++     +     QS+E  AA +                                                       
Subjt:  KIAAGVFEVDKVSALQAQMTSLANAFMKFSGTGSAQSIESAAALA-------------------------------------------------------

Query:  ---SRPQ--------------EETLNMFSKA----------------------------------------PAEQEKPQMEYCKAITVHQ-EESEEEPES
           S+P               EET   F K+                                        P+  E    E CKAIT+    E E  P  
Subjt:  ---SRPQ--------------EETLNMFSKA----------------------------------------PAEQEKPQMEYCKAITVHQ-EESEEEPES

Query:  EDYETPT----GEAEEDTSSDEAEKPNL-------------EPPIPSPTLMVPKEKKKK-------------KKKKNNQHQST-----------TKVPEK
        E   TPT    G+++     +E     L              PPI S  L  P+   ++             KK++  + ++             K+P+K
Subjt:  EDYETPT----GEAEEDTSSDEAEKPNL-------------EPPIPSPTLMVPKEKKKK-------------KKKKNNQHQST-----------TKVPEK

Query:  VADPGSFSVPCSFA--------------------------------------PIGDQSVVRPVGIVENVLIRVGRFFLPIDLYVMDMIENPSMSVILGRP
        + DPGSF++PC+                                         + ++S+  P GI+E+VL++V +F  P D  V+DM E+  + +ILGRP
Subjt:  VADPGSFSVPCSFA--------------------------------------PIGDQSVVRPVGIVENVLIRVGRFFLPIDLYVMDMIENPSMSVILGRP

Query:  FLATGRVIIDIERRELTIRVKNEKEIFKAVEDSK
        FLATGR ++D+++ ELT+RVK E+ +F   E  K
Subjt:  FLATGRVIIDIERRELTIRVKNEKEIFKAVEDSK

TrEMBL top hitse value%identityAlignment
A0A1U7Z951 uncharacterized protein LOC1045905681.2e-4729.18Show/hide
Query:  IVYASINANNFELKTGLIQMARDCA-YRGSPTEDPNSHLKSFWIFVGRIKH------VIGCSLLPWEHHH----------------WDALVQAFLKKFFP
        IV  +I ANNF++K  +IQM ++   + G   EDPN+H+ +F       KH      V+   L P+                    WD +   FL K+FP
Subjt:  IVYASINANNFELKTGLIQMARDCA-YRGSPTEDPNSHLKSFWIFVGRIKH------VIGCSLLPWEHHH----------------WDALVQAFLKKFFP

Query:  PAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKTVENARTLLEDMATNSYQWPSERSTPK
        P+K  K+R +I TF QQ  E L+E+WER+KELLRK P HG P WLQVQ FYN LT + KTI+DA AGG++ +KT E A  L+E+M  N+YQW SER+  +
Subjt:  PAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKTVENARTLLEDMATNSYQWPSERSTPK

Query:  KIAAGVFEVDKVSALQAQMTSLANAF--MKFSGTG-SAQSIESAAALASRPQEETLNMFSKAPAE-------QEKPQMEYCKAIT---------VHQEES
        +    +  VD  + L AQ+ +L+     M+ +GT     + E         + +  N F ++P +       Q +   +Y +A             Q ++
Subjt:  KIAAGVFEVDKVSALQAQMTSLANAF--MKFSGTG-SAQSIESAAALASRPQEETLNMFSKAPAE-------QEKPQMEYCKAIT---------VHQEES

Query:  EEEPESEDYETPTGE-------------AEEDTSSDEAEKPNLEPPI---------------PSPTLMVPKEK---------------KKKKKKKN----
          +P  +++  P  +             AE    + EA   NLE  +               PS T   P+E+               +KK K+++    
Subjt:  EEEPESEDYETPTGE-------------AEEDTSSDEAEKPNLEPPI---------------PSPTLMVPKEK---------------KKKKKKKN----

Query:  --------------------------NQHQS---TTKVPEKVADPGSFSVPCS------------------------FAPIG--------------DQSV
                                  N+  S     K+P+K+ DPGSF++PC+                        F  +G              D+S+
Subjt:  --------------------------NQHQS---TTKVPEKVADPGSFSVPCS------------------------FAPIG--------------DQSV

Query:  VRPVGIVENVLIRVGRFFLPIDLYVMDMIENPSMSVILGRPFLATGRVIIDIERRELTIRVKNEKEIFKAVE-----DSKDEVLYM
          P GI+E+VL++V +F  P+D  V+DM E+  + +ILGRPFLATG+  +D+++ +L++++++E+ IFK  +      S D+  YM
Subjt:  VRPVGIVENVLIRVGRFFLPIDLYVMDMIENPSMSVILGRPFLATGRVIIDIERRELTIRVKNEKEIFKAVE-----DSKDEVLYM

A0A6J0ZX64 LOW QUALITY PROTEIN: uncharacterized protein LOC1104129453.3e-4828.59Show/hide
Query:  SINANNFELKTGLIQMARDCA-YRGSPTEDPNSHLKSFWIFVGRIKH------VIGCSLLPWEHHH----------------WDALVQAFLKKFFPPAKT
        SINANNFE+K   IQM +    + G P++DPNSHL +F       K+       I   L P+                    W+ L Q FL KFFPPAKT
Subjt:  SINANNFELKTGLIQMARDCA-YRGSPTEDPNSHLKSFWIFVGRIKH------VIGCSLLPWEHHH----------------WDALVQAFLKKFFPPAKT

Query:  VKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKTVENARTLLEDMATNSYQWPSERSTPKKIAA
         K+R +I +F Q   E L+EAWERFKELLR+CP HG PDWLQVQ FYNGL  S KTI+DAAAGG L+SK   +A  LLE+MA+N+YQWPSERS  +K A 
Subjt:  VKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKTVENARTLLEDMATNSYQWPSERSTPKKIAA

Query:  GVFEVDKVSALQAQMTSLA------------NAFMKFSGTGSAQSIESAA--------------------------------------------------
        G +E+D +  L  Q+ +L+            N+ +     G + S +                                                     
Subjt:  GVFEVDKVSALQAQMTSLA------------NAFMKFSGTGSAQSIESAA--------------------------------------------------

Query:  ---ALASRPQ---------EETLNMFSK----------------------APAEQEKPQ-------------MEYCKAIT---------VHQEESEEEPE
              +RPQ         E  L   SK                      A +   +PQ              E C+AIT         V+Q+  E E E
Subjt:  ---ALASRPQ---------EETLNMFSK----------------------APAEQEKPQ-------------MEYCKAIT---------VHQEESEEEPE

Query:  SED------YETPTGEAEEDTSSDEAEKPNLEPPIPSPTLMVPKEKKKK------------------------------------KKKKNNQHQST----
          D       E    + ++D + ++     + PP P P  +  ++ +K+                                    KK+K  + ++     
Subjt:  SED------YETPTGEAEEDTSSDEAEKPNLEPPIPSPTLMVPKEKKKK------------------------------------KKKKNNQHQST----

Query:  -------TKVPEKVADPGSFSVPCSFA--------------------------------------PIGDQSVVRPVGIVENVLIRVGRFFLPIDLYVMDM
                K+P K+ DPGSF++PC+                                         + D+S V P GI+E+VL++V +F  P+D  ++DM
Subjt:  -------TKVPEKVADPGSFSVPCSFA--------------------------------------PIGDQSVVRPVGIVENVLIRVGRFFLPIDLYVMDM

Query:  IENPSMSVILGRPFLATGRVIIDIERRELTIRVKNEKEIFKAVEDSK
         E+  + +ILGRPFLAT   IID+   +++ +V  E   F     SK
Subjt:  IENPSMSVILGRPFLATGRVIIDIERRELTIRVKNEKEIFKAVEDSK

A0A6J0ZYV0 uncharacterized protein LOC1104134137.2e-4849.32Show/hide
Query:  SINANNFELKTGLIQMARDCA-YRGSPTEDPNSHLKSFWIFVGRIKH------VIGCSLLPWEHHH----------------WDALVQAFLKKFFPPAKT
        SINANNFE+K   IQM +    + G P++DPNSHL +F       K+       I   L P+                    W+ L Q FL KFFPPAKT
Subjt:  SINANNFELKTGLIQMARDCA-YRGSPTEDPNSHLKSFWIFVGRIKH------VIGCSLLPWEHHH----------------WDALVQAFLKKFFPPAKT

Query:  VKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKTVENARTLLEDMATNSYQWPSERSTPKKIAA
         K+R +I +F Q   E L+EAWERFKELLR+CP HG PDWLQVQ FYNGL  S KTI+DAAAGG L+SK   +A  LLE+MA+N+YQWPSERS  +K A 
Subjt:  VKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKTVENARTLLEDMATNSYQWPSERSTPKKIAA

Query:  GVFEVDKVSALQAQMTSLA
        G +E+D +  L  Q+ +L+
Subjt:  GVFEVDKVSALQAQMTSLA

A0A6J1DU19 uncharacterized protein LOC1110243612.9e-7337.52Show/hide
Query:  IGIVYASINANNFELKTGLIQMARDCAYRGSPTEDPNSHLKSFWIFVGRIK------HVIGCSLLPWEHHHWDALVQAFLKKFFPPAKTVKLRTEIGTFQ
        +GI+   INANN ELK GLIQM R+  +RG+ TEDPN+HL  F    G +K        I   L P      + +VQAFL  FFPPAKT +LRTEI +F+
Subjt:  IGIVYASINANNFELKTGLIQMARDCAYRGSPTEDPNSHLKSFWIFVGRIK------HVIGCSLLPWEHHHWDALVQAFLKKFFPPAKTVKLRTEIGTFQ

Query:  QQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKTVENARTLLEDMATNSYQWPSERSTPKKIAAGVFEVDKVSAL
        +   EQLFE WER+KELLRKCPQHG  +WLQ+Q+FYNGL   T+TI+DAAAGGTLLS+T ENA  LL+DMA NS+QWPSERS  KK+ AG++E+D++S+L
Subjt:  QQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKTVENARTLLEDMATNSYQWPSERSTPKKIAAGVFEVDKVSAL

Query:  QAQMTSLANAFMKFSGTGSAQSIESAAA---------------LASRPQEE-------------------------------------------------
        +AQ+ +L NA  K SG G++ S E  AA                 S P E+                                                 
Subjt:  QAQMTSLANAFMKFSGTGSAQSIESAAA---------------LASRPQEE-------------------------------------------------

Query:  ---TLNMFSKA--PAEQEKPQMEYCKAITVHQEESEEEPESEDYETPTGEAEEDTSSDE----------AEKPN----LEPP--IPSP------------
           TLN   K   P++ E    E+CKA+T+   +  +EPE +  E P    EE  + +E          A+KP       PP  +P P            
Subjt:  ---TLNMFSKA--PAEQEKPQMEYCKAITVHQEESEEEPESEDYETPTGEAEEDTSSDE----------AEKPN----LEPP--IPSP------------

Query:  ---TLMVPKEKKKKKKKKNNQHQSTT----KVPEKVADPGSFSVPCSFAPIGDQSV---------VRPVGIVENVLIRVGRFFLPIDLYVMDMIENPSMS
            +M  K K +  +  N   + +     K+P+K+ DPGSF++PC+ +                + P+G++E+VL++V R   P D  V+   E+  + 
Subjt:  ---TLMVPKEKKKKKKKKNNQHQSTT----KVPEKVADPGSFSVPCSFAPIGDQSV---------VRPVGIVENVLIRVGRFFLPIDLYVMDMIENPSMS

Query:  VILGRPFLATGRVIIDIERRELTIRVKNEKEIF
        +ILGR FLATG  +ID++   LT+RV  E  +F
Subjt:  VILGRPFLATGRVIIDIERRELTIRVKNEKEIF

A0A6P6XAQ1 Reverse transcriptase1.6e-5530.13Show/hide
Query:  IVYASINANNFELKTGLIQMARDCAYRGSPTEDPNSHLKSFWIFVGRIK------HVIGCSLLP---------WEHHH-------WDALVQAFLKKFFPP
        IV  ++NANNFE+K  LIQM +   Y G+ TEDPNSHL +F      IK        I   L P         W   H       WD L +AFL KFFPP
Subjt:  IVYASINANNFELKTGLIQMARDCAYRGSPTEDPNSHLKSFWIFVGRIK------HVIGCSLLP---------WEHHH-------WDALVQAFLKKFFPP

Query:  AKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKTVENARTLLEDMATNSYQWPSERSTPKK
         KT KLR +I +F QQ  E L+EAWER++EL R+CP HG PDWL VQ FYNGLT  TKT VDAAAGG L+ KT E A+ L+E+MA N+YQW +ER   ++
Subjt:  AKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKTVENARTLLEDMATNSYQWPSERSTPKK

Query:  IAAGVFEVDKVSALQAQMTSLANAFMKFSGTGSAQSIESAAA-----------LASRPQEETLNMFSKAP------------------------AEQEKP
          AG+ EVD ++ L A+M ++     +  G+ S Q +  A+             +S  Q + LN +++ P                          Q++P
Subjt:  IAAGVFEVDKVSALQAQMTSLANAFMKFSGTGSAQSIESAAA-----------LASRPQEETLNMFSKAP------------------------AEQEKP

Query:  ----------------------------------------------------------------------------------------QMEYCKAITVHQ
                                                                                                  E+ KAIT+  
Subjt:  ----------------------------------------------------------------------------------------QMEYCKAITVHQ

Query:  EESEEEP-------ESEDYETPTGEAEEDTSSDEAEKPNLE-------------PPIPSPTLMVPKEKKKKKKKKNNQHQSTT---------KVPEKVAD
         +   EP       E E  E       ++ S +E  K  +E             PPIPS    + +   KK+K  +++  + T         K+P K+ D
Subjt:  EESEEEP-------ESEDYETPTGEAEEDTSSDEAEKPNLE-------------PPIPSPTLMVPKEKKKKKKKKNNQHQSTT---------KVPEKVAD

Query:  PGSFSVPCSFA--------------------------------------PIGDQSVVRPVGIVENVLIRVGRFFLPIDLYVMDMIENPSMSVILGRPFLA
        PGSF+VPC+                                         + D+S+  P+GI+ENVLI+V +F +P+D  V+DM E+ ++ +ILGRPFLA
Subjt:  PGSFSVPCSFA--------------------------------------PIGDQSVVRPVGIVENVLIRVGRFFLPIDLYVMDMIENPSMSVILGRPFLA

Query:  TGRVIIDIERRELTIRVKNEKEIF
        T   IID++R +   ++  E+  F
Subjt:  TGRVIIDIERRELTIRVKNEKEIF

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTCTGTCCATCATCTTTTTCCAGCCTTCTCCATCTCTTTTATTATTGAATGTCATGCATTTAATGCTCAAGAAGAAGACAACACATTGCCTTTTGGTGAGGTGTA
CTTAGAGGAAATGGTTGATTGGAGCTATAATGCCATATTAGAGTTAATCGGGTGCTCGGGGCGTGAAAAGATGCATAGGAATGAAAAGAGTAAAAGAGGAAAAAAGTCAA
ATCTCGGTCAACAGCAGGCTAGCGTCGAGACGCTAGCTCTTGAGCGTCTCGACGCTCACATTCCATATCAGATTAGGCGCGTAAAGCTCACAGCGTCGAGACGCTATGAT
AGGAAGCGTCCCGACGCTACCGTTTTTCCTTATTCAGAACGCGCGTTTGAGGGAGTTCGATTTGGGACCTTGGAGAGCCGTAAAGAGGACAGAACAGAGCACTTGGAGGC
TGAAAACAAAGGGAGAGACTTGGAAATCAACCCATTGGACAGCATCGGGATTGTCTATGCATCGATTAATGCCAACAACTTTGAGCTGAAGACCGGCCTCATTCAGATGG
CTCGAGACTGTGCATATAGAGGATCGCCCACTGAGGATCCAAATTCTCATCTTAAATCATTTTGGATATTTGTGGGACGGATAAAGCACGTGATTGGTTGCAGTCTATTA
CCCTGGGAGCATCACCACTGGGATGCTTTGGTCCAGGCCTTTTTAAAGAAATTTTTCCCTCCTGCAAAGACGGTCAAGCTGAGGACCGAGATTGGGACATTCCAACAACA
ATATGATGAGCAGCTGTTCGAAGCTTGGGAGCGATTCAAAGAGCTACTGAGGAAGTGTCCTCAGCATGGTTACCCCGATTGGCTTCAGGTACAGTTGTTTTATAATGGTT
TAACTCCTAGTACAAAAACGATTGTTGATGCAGCTGCAGGTGGGACTCTGTTGTCCAAGACCGTGGAAAACGCTCGCACACTTTTAGAGGATATGGCCACCAACAGCTAT
CAGTGGCCATCTGAGCGGTCTACACCTAAAAAGATTGCTGCTGGAGTGTTTGAGGTTGATAAAGTAAGTGCACTCCAGGCCCAGATGACCTCCCTTGCTAATGCTTTTAT
GAAATTTTCAGGTACAGGGAGTGCACAGTCAATTGAATCAGCTGCTGCTTTAGCATCTAGACCTCAGGAGGAGACATTGAACATGTTCAGTAAGGCCCCAGCTGAACAAG
AGAAACCTCAGATGGAGTACTGTAAGGCAATCACTGTGCACCAGGAGGAATCTGAAGAGGAACCTGAATCTGAGGACTATGAAACGCCTACAGGGGAAGCTGAGGAGGAC
ACATCATCTGATGAGGCTGAAAAGCCTAACCTTGAGCCTCCTATTCCTTCTCCCACACTGATGGTTCCCAAGGAAAAGAAAAAGAAAAAGAAGAAAAAGAATAATCAGCA
CCAGAGTACAACAAAAGTACCTGAAAAAGTAGCAGATCCAGGGAGTTTTTCTGTTCCTTGTAGTTTTGCTCCAATTGGTGATCAATCTGTGGTGAGACCAGTTGGTATTG
TAGAAAATGTGTTAATCAGAGTAGGTAGATTTTTCCTCCCTATTGACTTGTATGTTATGGACATGATAGAAAATCCTTCAATGTCTGTCATATTAGGACGACCATTCCTC
GCTACTGGGCGAGTGATTATAGATATTGAGCGCAGGGAGCTCACTATTAGAGTCAAGAACGAAAAAGAAATCTTTAAAGCAGTTGAAGACTCTAAAGATGAAGTGCTTTA
CATGGGCTACAGGAAAGGTCTTGTTTTCTTTCTCTCCTTTGCTTTCAACTTTCAAGATTCCAAGCTCTCAAGCAAGAAGTCATCATCATTCCAGGCCAGAGCGAATTATT
GGGTTCAGATGCTTGTTGTCACCTCGATAGGTGTAGGCTATCTACCATTCCAATCCCTGAAATGGGGGTTTATTGAGTCGACGATGTTGAGTCACTCTCACCATGCAGAC
CTAAGGATAAGCCCGAATAAACAGGATTTCATAGTTAGCTCAGGATTAAGATCGAGTTACCTAGGTCATCACATTGAAACTGTCAGTCTTGAAGTTAGTGAGGTGGGATC
CACTTCCATAAGTTGTCTAGTGAGTCAAGTAGTCGATGACTCATTGGCACTCATTCATGAGCGTAGTTCTTTCGATCTTGATGAGGATGAGCAGTTGGGGTTTTTGGGGA
CTTCGAATTGGGGACTTTTGGAGCCGACATTAGAGCAGAAAGAGGACAGAATTGAGCTGAAGCAAAGGGGATTAGTTGGAAATCAACCCATTGTTCGTGGGGATCGTGAC
GGGGACATAGGGATATCGGGTTCAATTGCATATGGGAGAATCATACCTTGGAAGGTGAGGTTGACTGCCGAGATCAGTGTGAGAAGACCTTGGAAGGTAAGAATCCATGA
TCTTCAGAGATTTGATACAAGATTTGGACTTTTCGAACGATTCTGTGAAGATCTCGTGAATCTCGATATTCTGTCAGAAAGCCCAGTTGCGGCTGCATATCGGGCGCATT
TTCTTCTATGTGCTGCTTTGACAACTCTGCCTTCAACTGGGACTACTTTCCTTAGAAGGTTGCAGGTGCATATTGGTCGCATATCATACATCCTCAGTTGCCTCAAATTA
CTTTATGAGCTACCAGTGGGACCTAATGGACCTGCAGATCAGAAGCTCCAACGATTATTATTAGCGGATTCCTCCGGAACTCCGGATCTTTTGGGATATGTTTCAAACCC
AATGAAGTACGACCGATCAAGCACTAGGCAACTGGCTATCCCTAGCTCAGTCACTGGCGTTCACTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCTTCTGTCCATCATCTTTTTCCAGCCTTCTCCATCTCTTTTATTATTGAATGTCATGCATTTAATGCTCAAGAAGAAGACAACACATTGCCTTTTGGTGAGGTGTA
CTTAGAGGAAATGGTTGATTGGAGCTATAATGCCATATTAGAGTTAATCGGGTGCTCGGGGCGTGAAAAGATGCATAGGAATGAAAAGAGTAAAAGAGGAAAAAAGTCAA
ATCTCGGTCAACAGCAGGCTAGCGTCGAGACGCTAGCTCTTGAGCGTCTCGACGCTCACATTCCATATCAGATTAGGCGCGTAAAGCTCACAGCGTCGAGACGCTATGAT
AGGAAGCGTCCCGACGCTACCGTTTTTCCTTATTCAGAACGCGCGTTTGAGGGAGTTCGATTTGGGACCTTGGAGAGCCGTAAAGAGGACAGAACAGAGCACTTGGAGGC
TGAAAACAAAGGGAGAGACTTGGAAATCAACCCATTGGACAGCATCGGGATTGTCTATGCATCGATTAATGCCAACAACTTTGAGCTGAAGACCGGCCTCATTCAGATGG
CTCGAGACTGTGCATATAGAGGATCGCCCACTGAGGATCCAAATTCTCATCTTAAATCATTTTGGATATTTGTGGGACGGATAAAGCACGTGATTGGTTGCAGTCTATTA
CCCTGGGAGCATCACCACTGGGATGCTTTGGTCCAGGCCTTTTTAAAGAAATTTTTCCCTCCTGCAAAGACGGTCAAGCTGAGGACCGAGATTGGGACATTCCAACAACA
ATATGATGAGCAGCTGTTCGAAGCTTGGGAGCGATTCAAAGAGCTACTGAGGAAGTGTCCTCAGCATGGTTACCCCGATTGGCTTCAGGTACAGTTGTTTTATAATGGTT
TAACTCCTAGTACAAAAACGATTGTTGATGCAGCTGCAGGTGGGACTCTGTTGTCCAAGACCGTGGAAAACGCTCGCACACTTTTAGAGGATATGGCCACCAACAGCTAT
CAGTGGCCATCTGAGCGGTCTACACCTAAAAAGATTGCTGCTGGAGTGTTTGAGGTTGATAAAGTAAGTGCACTCCAGGCCCAGATGACCTCCCTTGCTAATGCTTTTAT
GAAATTTTCAGGTACAGGGAGTGCACAGTCAATTGAATCAGCTGCTGCTTTAGCATCTAGACCTCAGGAGGAGACATTGAACATGTTCAGTAAGGCCCCAGCTGAACAAG
AGAAACCTCAGATGGAGTACTGTAAGGCAATCACTGTGCACCAGGAGGAATCTGAAGAGGAACCTGAATCTGAGGACTATGAAACGCCTACAGGGGAAGCTGAGGAGGAC
ACATCATCTGATGAGGCTGAAAAGCCTAACCTTGAGCCTCCTATTCCTTCTCCCACACTGATGGTTCCCAAGGAAAAGAAAAAGAAAAAGAAGAAAAAGAATAATCAGCA
CCAGAGTACAACAAAAGTACCTGAAAAAGTAGCAGATCCAGGGAGTTTTTCTGTTCCTTGTAGTTTTGCTCCAATTGGTGATCAATCTGTGGTGAGACCAGTTGGTATTG
TAGAAAATGTGTTAATCAGAGTAGGTAGATTTTTCCTCCCTATTGACTTGTATGTTATGGACATGATAGAAAATCCTTCAATGTCTGTCATATTAGGACGACCATTCCTC
GCTACTGGGCGAGTGATTATAGATATTGAGCGCAGGGAGCTCACTATTAGAGTCAAGAACGAAAAAGAAATCTTTAAAGCAGTTGAAGACTCTAAAGATGAAGTGCTTTA
CATGGGCTACAGGAAAGGTCTTGTTTTCTTTCTCTCCTTTGCTTTCAACTTTCAAGATTCCAAGCTCTCAAGCAAGAAGTCATCATCATTCCAGGCCAGAGCGAATTATT
GGGTTCAGATGCTTGTTGTCACCTCGATAGGTGTAGGCTATCTACCATTCCAATCCCTGAAATGGGGGTTTATTGAGTCGACGATGTTGAGTCACTCTCACCATGCAGAC
CTAAGGATAAGCCCGAATAAACAGGATTTCATAGTTAGCTCAGGATTAAGATCGAGTTACCTAGGTCATCACATTGAAACTGTCAGTCTTGAAGTTAGTGAGGTGGGATC
CACTTCCATAAGTTGTCTAGTGAGTCAAGTAGTCGATGACTCATTGGCACTCATTCATGAGCGTAGTTCTTTCGATCTTGATGAGGATGAGCAGTTGGGGTTTTTGGGGA
CTTCGAATTGGGGACTTTTGGAGCCGACATTAGAGCAGAAAGAGGACAGAATTGAGCTGAAGCAAAGGGGATTAGTTGGAAATCAACCCATTGTTCGTGGGGATCGTGAC
GGGGACATAGGGATATCGGGTTCAATTGCATATGGGAGAATCATACCTTGGAAGGTGAGGTTGACTGCCGAGATCAGTGTGAGAAGACCTTGGAAGGTAAGAATCCATGA
TCTTCAGAGATTTGATACAAGATTTGGACTTTTCGAACGATTCTGTGAAGATCTCGTGAATCTCGATATTCTGTCAGAAAGCCCAGTTGCGGCTGCATATCGGGCGCATT
TTCTTCTATGTGCTGCTTTGACAACTCTGCCTTCAACTGGGACTACTTTCCTTAGAAGGTTGCAGGTGCATATTGGTCGCATATCATACATCCTCAGTTGCCTCAAATTA
CTTTATGAGCTACCAGTGGGACCTAATGGACCTGCAGATCAGAAGCTCCAACGATTATTATTAGCGGATTCCTCCGGAACTCCGGATCTTTTGGGATATGTTTCAAACCC
AATGAAGTACGACCGATCAAGCACTAGGCAACTGGCTATCCCTAGCTCAGTCACTGGCGTTCACTAA
Protein sequenceShow/hide protein sequence
MASVHHLFPAFSISFIIECHAFNAQEEDNTLPFGEVYLEEMVDWSYNAILELIGCSGREKMHRNEKSKRGKKSNLGQQQASVETLALERLDAHIPYQIRRVKLTASRRYD
RKRPDATVFPYSERAFEGVRFGTLESRKEDRTEHLEAENKGRDLEINPLDSIGIVYASINANNFELKTGLIQMARDCAYRGSPTEDPNSHLKSFWIFVGRIKHVIGCSLL
PWEHHHWDALVQAFLKKFFPPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKTVENARTLLEDMATNSY
QWPSERSTPKKIAAGVFEVDKVSALQAQMTSLANAFMKFSGTGSAQSIESAAALASRPQEETLNMFSKAPAEQEKPQMEYCKAITVHQEESEEEPESEDYETPTGEAEED
TSSDEAEKPNLEPPIPSPTLMVPKEKKKKKKKKNNQHQSTTKVPEKVADPGSFSVPCSFAPIGDQSVVRPVGIVENVLIRVGRFFLPIDLYVMDMIENPSMSVILGRPFL
ATGRVIIDIERRELTIRVKNEKEIFKAVEDSKDEVLYMGYRKGLVFFLSFAFNFQDSKLSSKKSSSFQARANYWVQMLVVTSIGVGYLPFQSLKWGFIESTMLSHSHHAD
LRISPNKQDFIVSSGLRSSYLGHHIETVSLEVSEVGSTSISCLVSQVVDDSLALIHERSSFDLDEDEQLGFLGTSNWGLLEPTLEQKEDRIELKQRGLVGNQPIVRGDRD
GDIGISGSIAYGRIIPWKVRLTAEISVRRPWKVRIHDLQRFDTRFGLFERFCEDLVNLDILSESPVAAAYRAHFLLCAALTTLPSTGTTFLRRLQVHIGRISYILSCLKL
LYELPVGPNGPADQKLQRLLLADSSGTPDLLGYVSNPMKYDRSSTRQLAIPSSVTGVH