; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0028154 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0028154
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrotrans_gag domain-containing protein
Genome locationchr8:14648541..14650562
RNA-Seq ExpressionLag0028154
SyntenyLag0028154
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain
IPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6734747.1 hypothetical protein I3842_01G285500 [Carya illinoinensis]7.7e-8033.57Show/hide
Query:  MADQNPLEEPRPIRDYFQPVFQGQQSGIVYAPINANNFELKTDLIQMARDCAYRGSPTEDPNSHLKSFLDICGTVRINGVSEDAIRLRLFPFSLQDKARD
        MA+++    PR ++DY +PV  G  S I+  PINANNFELK  LI M +   + GSP +DPN HL  FL+IC TV+INGV+ED IRLRLFPFSL+DKAR 
Subjt:  MADQNPLEEPRPIRDYFQPVFQGQQSGIVYAPINANNFELKTDLIQMARDCAYRGSPTEDPNSHLKSFLDICGTVRINGVSEDAIRLRLFPFSLQDKARD

Query:  WLQSITLGSITTWDALVQAFLKKFFPPAKTVKLRTEIGTFQQQYDEQLFET--------------------------------TKTIVDAAA-GTGSAQS
        WLQS+  GSI +W  + + FL KFFPPAKT +LR+EIG F+Q   E L+E                                 T+TIVDAA+ GT  +++
Subjt:  WLQSITLGSITTWDALVQAFLKKFFPPAKTVKLRTEIGTFQQQYDEQLFET--------------------------------TKTIVDAAA-GTGSAQS

Query:  IESAAALASR--------PQEETI----------------------------------------------------------------------------
         E A AL           P E T+                                                                            
Subjt:  IESAAALASR--------PQEETI----------------------------------------------------------------------------

Query:  --------EQSLSNRTTK---------------------LDEAVIA------------------INTTVNGHSAAIKNIETQLGQLVNVVSTMNKGKAPA
                 ++LS   TK                     L++A+++                  I T  +   A +KN+E Q+GQL   ++   +G  P+
Subjt:  --------EQSLSNRTTK---------------------LDEAVIA------------------INTTVNGHSAAIKNIETQLGQLVNVVSTMNKGKAPA

Query:  EQEKMQMEYCKTITVHQ-EESEDELESEDYETPTGEAEEDTSSDETEKPE------------------PEPPIHSPTLMVPKEKKKKKKKKNNQVQFDKF
          E    E CK IT+   +E E     E   TPT  A    S D+ E+ E                    PPI +P L  P+  +K+K  K    QF KF
Subjt:  EQEKMQMEYCKTITVHQ-EESEDELESEDYETPTGEAEEDTSSDETEKPE------------------PEPPIHSPTLMVPKEKKKKKKKKNNQVQFDKF

Query:  MNAFMNLNINIPFAEALE-MPQY-------------------------NSTRVQQKVPEKVADPGSFSVPYSFGTYSF-RALCDLGASINIIPLSLCKKL
        ++ F  ++INIPFA+ALE MP Y                          S  +Q+K+P+K+ DP SF++P + G   F R LCDLGASIN++P  +C+KL
Subjt:  MNAFMNLNINIPFAEALE-MPQY-------------------------NSTRVQQKVPEKVADPGSFSVPYSFGTYSF-RALCDLGASINIIPLSLCKKL

Query:  DIGEIKSTPVKLQLADQSMVRPVGIVENVLIRVGRFFLPIDLYVMDMIENPSMPVILGRPFLATGRVIIDIERKELTIRVKNKKKSLKQLKTLK
         +GE+K T + LQLAD+S+  P GI+E+VL++V +F  P D  V+DM E+  +P+ILGRPFLATGR +ID+++ ELT+RV  ++      + +K
Subjt:  DIGEIKSTPVKLQLADQSMVRPVGIVENVLIRVGRFFLPIDLYVMDMIENPSMPVILGRPFLATGRVIIDIERKELTIRVKNKKKSLKQLKTLK

KAG7947748.1 hypothetical protein I3843_14G109500 [Carya illinoinensis]1.8e-8133.86Show/hide
Query:  MADQNPLEEPRPIRDYFQPVFQGQQSGIVYAPINANNFELKTDLIQMARDCAYRGSPTEDPNSHLKSFLDICGTVRINGVSEDAIRLRLFPFSLQDKARD
        MA+++    PR ++DY +PV  G  S I+  PINANNFELK  LI M +   + GSP +DPN HL  FL+IC TV+INGV+ED IRLRLFPFSL+DKAR 
Subjt:  MADQNPLEEPRPIRDYFQPVFQGQQSGIVYAPINANNFELKTDLIQMARDCAYRGSPTEDPNSHLKSFLDICGTVRINGVSEDAIRLRLFPFSLQDKARD

Query:  WLQSITLGSITTWDALVQAFLKKFFPPAKTVKLRTEIGTFQQQYDEQLFET--------------------------------TKTIVDAAA-GTGSAQS
        WLQS+  GSI +W  + + FL KFFPPAKT +LR+EIG F+Q   E L+E                                 T+TIVDAA+ GT  +++
Subjt:  WLQSITLGSITTWDALVQAFLKKFFPPAKTVKLRTEIGTFQQQYDEQLFET--------------------------------TKTIVDAAA-GTGSAQS

Query:  IESAAALASR--------PQEETI----------------------------------------------------------------------------
         E A AL           P E T+                                                                            
Subjt:  IESAAALASR--------PQEETI----------------------------------------------------------------------------

Query:  --------EQSLSNRTTK---------------------LDEAVIA------------------INTTVNGHSAAIKNIETQLGQLVNVVSTMNKGKAPA
                 ++LS   TK                     L++A+++                  I T  +   A +KN+E Q+GQL   ++   +G  P+
Subjt:  --------EQSLSNRTTK---------------------LDEAVIA------------------INTTVNGHSAAIKNIETQLGQLVNVVSTMNKGKAPA

Query:  EQEKMQMEYCKTITVHQ-EESEDELESEDYETPTGEAEEDTSSDETEKPE------------------PEPPIHSPTLMVPKEKKKKKKKKNNQVQFDKF
          E    E CK IT+   +E E     E   TPT  A    S D+ E+ E                    PPI +P L  P+  +K+K  K    QF KF
Subjt:  EQEKMQMEYCKTITVHQ-EESEDELESEDYETPTGEAEEDTSSDETEKPE------------------PEPPIHSPTLMVPKEKKKKKKKKNNQVQFDKF

Query:  MNAFMNLNINIPFAEALE-MPQY-------------------------NSTRVQQKVPEKVADPGSFSVPYSFGTYSF-RALCDLGASINIIPLSLCKKL
        ++ F  ++INIPFA+ALE MP Y                          S  +Q+K+P+K+ DPGSF++P + G   F R LCDLGASIN++P S+C+KL
Subjt:  MNAFMNLNINIPFAEALE-MPQY-------------------------NSTRVQQKVPEKVADPGSFSVPYSFGTYSF-RALCDLGASINIIPLSLCKKL

Query:  DIGEIKSTPVKLQLADQSMVRPVGIVENVLIRVGRFFLPIDLYVMDMIENPSMPVILGRPFLATGRVIIDIERKELTIRVKNKKKSLKQLKTLK
         +GE+K T + LQLAD+S+  P GI+E+VL++V +F  P D  V+DM E+  +P+ILGRPFLATGR +ID+++ ELT+RV  ++      + +K
Subjt:  DIGEIKSTPVKLQLADQSMVRPVGIVENVLIRVGRFFLPIDLYVMDMIENPSMPVILGRPFLATGRVIIDIERKELTIRVKNKKKSLKQLKTLK

KAG7990634.1 hypothetical protein I3843_02G035100 [Carya illinoinensis]1.4e-8133.72Show/hide
Query:  MADQNPLEEPRPIRDYFQPVFQGQQSGIVYAPINANNFELKTDLIQMARDCAYRGSPTEDPNSHLKSFLDICGTVRINGVSEDAIRLRLFPFSLQDKARD
        MA+++    PR ++DY +PV  G  S I+  PINANNFELK  LI M +   + GSP +DPN HL  FL+IC TV+INGV+ED IRLRLFPFSL+DKAR 
Subjt:  MADQNPLEEPRPIRDYFQPVFQGQQSGIVYAPINANNFELKTDLIQMARDCAYRGSPTEDPNSHLKSFLDICGTVRINGVSEDAIRLRLFPFSLQDKARD

Query:  WLQSITLGSITTWDALVQAFLKKFFPPAKTVKLRTEIGTFQQQYDEQLFET--------------------------------TKTIVDAAA-GTGSAQS
        WLQS+  GSI +W  + + FL KFFPPAKT +LR+EIG F+Q   E L+E                                 T+TIVDAA+ GT  +++
Subjt:  WLQSITLGSITTWDALVQAFLKKFFPPAKTVKLRTEIGTFQQQYDEQLFET--------------------------------TKTIVDAAA-GTGSAQS

Query:  IESAAALASR--------PQEETIEQSL------------------------------------------------------------------------
         E A AL           P E T+ + +                                                                        
Subjt:  IESAAALASR--------PQEETIEQSL------------------------------------------------------------------------

Query:  ---------------------------------SNRTTKLDEAVIA------------------INTTVNGHSAAIKNIETQLGQLVNVVSTMNKGKAPA
                                         S R   L++A+++                  I T  +   AAIKNIE Q+GQL   ++   +G  P+
Subjt:  ---------------------------------SNRTTKLDEAVIA------------------INTTVNGHSAAIKNIETQLGQLVNVVSTMNKGKAPA

Query:  EQEKMQMEYCKTITVHQ-EESEDELESEDYETPT----GEAEEDTSSDETEKPEPE-------------PPIHSPTLMVPKEKKKKKKKKNNQVQFDKFM
          E    E CK IT+   +E E     E   TPT    G+++     DE      E             PPI +P L  P+  +K+K  K    QF KF+
Subjt:  EQEKMQMEYCKTITVHQ-EESEDELESEDYETPT----GEAEEDTSSDETEKPEPE-------------PPIHSPTLMVPKEKKKKKKKKNNQVQFDKFM

Query:  NAFMNLNINIPFAEALE-MPQY-------------------------NSTRVQQKVPEKVADPGSFSVPYSFGTYSF-RALCDLGASINIIPLSLCKKLD
        + F  ++INIPFA+ALE MP Y                          S  +Q+K+P+K+ DPGSF++P + G   F + LCDLGASIN++PLS+C+KL 
Subjt:  NAFMNLNINIPFAEALE-MPQY-------------------------NSTRVQQKVPEKVADPGSFSVPYSFGTYSF-RALCDLGASINIIPLSLCKKLD

Query:  IGEIKSTPVKLQLADQSMVRPVGIVENVLIRVGRFFLPIDLYVMDMIENPSMPVILGRPFLATGRVIIDIERKELTIRVKNKKKSLKQLKTLKM
        + E+K T + LQLAD+S+  P GI+E+VL++V +F  P D  V+DM E+  +P+ILGRPFLATGR +ID+++ ELT+RV  ++   K  + +++
Subjt:  IGEIKSTPVKLQLADQSMVRPVGIVENVLIRVGRFFLPIDLYVMDMIENPSMPVILGRPFLATGRVIIDIERKELTIRVKNKKKSLKQLKTLKM

XP_023874613.1 uncharacterized protein LOC111987139 [Quercus suber]2.5e-7832.17Show/hide
Query:  MADQNPLEEPRPIRDYFQPVFQGQQSGIVYAPINANNFELKTDLIQMARDCAYRGSPTEDPNSHLKSFLDICGTVRINGVSEDAIRLRLFPFSLQDKARD
        MA+     +PR ++DY +P+     SGI    INANNFELK  LI M +   + GSP +DPN HL  FL+IC T+++NGV+ED IRLRLFPFSL+DKAR 
Subjt:  MADQNPLEEPRPIRDYFQPVFQGQQSGIVYAPINANNFELKTDLIQMARDCAYRGSPTEDPNSHLKSFLDICGTVRINGVSEDAIRLRLFPFSLQDKARD

Query:  WLQSITLGSITTWDALVQAFLKKFFPPAKTVKLRTEIGTFQQQYDEQLFET--------------------------------TKTIVDAAAG-------
        WLQS+  GSIT+W  + + FL KFFPPAKT +LR+EIG F+Q   E L+E                                 T+TIVDAA+G       
Subjt:  WLQSITLGSITTWDALVQAFLKKFFPPAKTVKLRTEIGTFQQQYDEQLFET--------------------------------TKTIVDAAAG-------

Query:  ---------------------------TGSAQSIESAAALASR------------------------------PQEETIEQSL-----------------
                                         +E  AAL+++                              P  E  ++ +                 
Subjt:  ---------------------------TGSAQSIESAAALASR------------------------------PQEETIEQSL-----------------

Query:  -------------------------------SNRTTKLDEAVIA------------------INTTVNGHSAAIKNIETQLGQLVNVVSTMNKGKAPAEQ
                                       S +   L++A+++                  I T  +   A +KN+E Q+GQL   ++   +G  P+  
Subjt:  -------------------------------SNRTTKLDEAVIA------------------INTTVNGHSAAIKNIETQLGQLVNVVSTMNKGKAPAEQ

Query:  EKMQMEYCKTITVHQ-EESEDELESEDYETPTG-------------EAEEDTSSDETEKPEPEPPIHSPTLMVPKEKKKKKKKKNNQVQFDKFMNAFMNL
        E    E CK IT+    E E     E   TPT              E  EDT  +    P    P + P L  P    ++ +K+    QF KF++ F  +
Subjt:  EKMQMEYCKTITVHQ-EESEDELESEDYETPTG-------------EAEEDTSSDETEKPEPEPPIHSPTLMVPKEKKKKKKKKNNQVQFDKFMNAFMNL

Query:  NINIPFAEALE-MPQYN-------------------------STRVQQKVPEKVADPGSFSVPYSFGTYSF-RALCDLGASINIIPLSLCKKLDIGEIKS
        +INIPFA+ALE MP Y                          S  +Q+K+P+K+ DPGSF++P + G   F + LCDLGASIN++PLS+ +KL +GE+K 
Subjt:  NINIPFAEALE-MPQYN-------------------------STRVQQKVPEKVADPGSFSVPYSFGTYSF-RALCDLGASINIIPLSLCKKLDIGEIKS

Query:  TPVKLQLADQSMVRPVGIVENVLIRVGRFFLPIDLYVMDMIENPSMPVILGRPFLATGRVIIDIERKELTIRVKNKKKSLKQLKTLK
        T + LQLAD+S+  P GI+E+VL++V +F  P D  V+DM E+  +P+ILGRPFLATGR ++D+++ ELT+RV  ++ +    + +K
Subjt:  TPVKLQLADQSMVRPVGIVENVLIRVGRFFLPIDLYVMDMIENPSMPVILGRPFLATGRVIIDIERKELTIRVKNKKKSLKQLKTLK

XP_030497486.1 uncharacterized protein LOC115713139 [Cannabis sativa]4.2e-7836.41Show/hide
Query:  RPIRDYFQPVFQGQQSGIVYAPINANNFELKTDLIQMARDCA-YRGSPTEDPNSHLKSFLDICGTVRINGVSEDAIRLRLFPFSLQDKARDWLQSITLGS
        R +RDY  P   G +S I    ++ANNFE+K  ++QM +    + G P+ED N HL +F ++C T ++NGVS+DAIRLRLFPFSL+++A+ W  S+ + S
Subjt:  RPIRDYFQPVFQGQQSGIVYAPINANNFELKTDLIQMARDCA-YRGSPTEDPNSHLKSFLDICGTVRINGVSEDAIRLRLFPFSLQDKARDWLQSITLGS

Query:  ITTWDALVQAFLKKFFPPAKTVKLRTEIGTFQQQYDEQLFET--------------------------------TKTIVDAAAG-----TGSAQSIESAA
        I TW+ L   FL KFFPPAK  KLR +I  F QQ  E L+E                                 T+T++D A G       + ++ E   
Subjt:  ITTWDALVQAFLKKFFPPAKTVKLRTEIGTFQQQYDEQLFET--------------------------------TKTIVDAAAG-----TGSAQSIESAA

Query:  ALASRPQEETIEQSLSNRTTKLDEAVIAINTTVNGHSAAIKNIETQLGQLVNVVSTMNKGKAPAEQEKMQMEYCKTITVHQEESED-----ELESEDYET
         +A   Q+ + E+  S +     E              AI  +  Q+ QL   V+ + +  +P+  E    E CK I++   +S D     + E+ + E 
Subjt:  ALASRPQEETIEQSLSNRTTKLDEAVIAINTTVNGHSAAIKNIETQLGQLVNVVSTMNKGKAPAEQEKMQMEYCKTITVHQEESED-----ELESEDYET

Query:  PTGEAEEDTSSDETEKPEPEPPI---HSPTLMVPKEKKKKKKKKNNQVQFDKFMNAFMNLNINIPFAEALE-MPQY------------------------
        P    E+  + D  ++ E  PPI   H   +  P    ++ +K N   QF KF+  F  L+INIPFAEALE MP Y                        
Subjt:  PTGEAEEDTSSDETEKPEPEPPI---HSPTLMVPKEKKKKKKKKNNQVQFDKFMNAFMNLNINIPFAEALE-MPQY------------------------

Query:  -NSTRVQQKVPEKVADPGSFSVPYSF-GTYSFRALCDLGASINIIPLSLCKKLDIGEIKSTPVKLQLADQSMVRPVGIVENVLIRVGRFFLPIDLYVMDM
          S  +Q+K+P K+ DPGSF++P S  G+   +ALCDLGASIN++PLS+ K+L +GE K T V LQ+AD+S+  P GI+E+VL++VG+F  P D  ++DM
Subjt:  -NSTRVQQKVPEKVADPGSFSVPYSF-GTYSFRALCDLGASINIIPLSLCKKLDIGEIKSTPVKLQLADQSMVRPVGIVENVLIRVGRFFLPIDLYVMDM

Query:  IENPSMPVILGRPFLATGRVIIDIERKELTIRVKNKKKSLK
         E+ ++P+ILGRPFLATGR +ID+++ EL +RV+ ++++ K
Subjt:  IENPSMPVILGRPFLATGRVIIDIERKELTIRVKNKKKSLK

TrEMBL top hitse value%identityAlignment
A0A1S3UJL2 uncharacterized protein LOC1067659805.6e-6033.22Show/hide
Query:  QSGIVYAPINANNFELKTDLIQMARDCAYRGSPTEDPNSHLKSFLDICGTVRINGVSEDAIRLRLFPFSLQDKARDWLQSITLGSITTWDALVQAFLKKF
        Q  IV  PI A+NFE+K  L+Q+ +   + G+ +EDPNSHL++FL+IC T++I GVS+DAIRLRLFPFSL+DKA++WLQS   GSI+TW+ +   F+ K+
Subjt:  QSGIVYAPINANNFELKTDLIQMARDCAYRGSPTEDPNSHLKSFLDICGTVRINGVSEDAIRLRLFPFSLQDKARDWLQSITLGSITTWDALVQAFLKKF

Query:  FPPAKTVKLRTEIGTFQQQYDEQLFE--------------------------------TTKTIVDAAAG-----TGSAQSIESAAALASR----------
        FPP+K+VK+R EI T  QQ  E L+E                                + K I+D A+G         +++E+   +A+           
Subjt:  FPPAKTVKLRTEIGTFQQQYDEQLFE--------------------------------TTKTIVDAAAG-----TGSAQSIESAAALASR----------

Query:  ---------------PQEETIEQSLSNRTTKL-DEAVIAINTTVNG----------------HSAAIKNIETQLGQLVNVVST----------MNKGKAP
                        Q + + Q +++ T KL +     +NT   G                H  A  N    +GQ  N  ST          M     P
Subjt:  ---------------PQEETIEQSLSNRTTKL-DEAVIAINTTVNG----------------HSAAIKNIETQLGQLVNVVST----------MNKGKAP

Query:  AEQEKMQMEY-------------------CKTITVHQEESEDELESEDYETPTGEAEEDTS-----------SDETEKPEPEPPIHSPTLMVPKEKKKK-
         +    +  Y                    + +T+      D+  +   ET T    ++ S           S +  K  P     S T+  P+E+ K  
Subjt:  AEQEKMQMEY-------------------CKTITVHQEESEDELESEDYETPTGEAEEDTS-----------SDETEKPEPEPPIHSPTLMVPKEKKKK-

Query:  -------KKKKNNQVQFDKFMNAFMNLNINIPFAEALE-MPQYN-------------------------STRVQQKVPEKVADPGSFSVPYSFGTYSF-R
                KK+     F +F++ F  L+INIPFAEALE MP Y                          S  +Q K+P K+ DPGSF +P   G  +  +
Subjt:  -------KKKKNNQVQFDKFMNAFMNLNINIPFAEALE-MPQYN-------------------------STRVQQKVPEKVADPGSFSVPYSFGTYSF-R

Query:  ALCDLGASINIIPLSLCKKLDIGEIKSTPVKLQLADQSMVRPVGIVENVLIRVGRFFLPIDLYVMDMIENPSMPVILGRPFLATGRVIIDIER
        ALCDLGASIN++PLS+ K+L IGE+K T + LQLAD+SM  P GIVE+VL++V +F    D  V+DM E+  +P+ILGRPFLATGR  ID+E+
Subjt:  ALCDLGASINIIPLSLCKKLDIGEIKSTPVKLQLADQSMVRPVGIVENVLIRVGRFFLPIDLYVMDMIENPSMPVILGRPFLATGRVIIDIER

A0A1S3UKD4 uncharacterized protein LOC1067662673.8e-6431.52Show/hide
Query:  DQNPLEEPRPIRDYFQPVFQGQQSGIVYAPINANNFELKTDLIQMARDCAYRGSPTEDPNSHLKSFLDICGTVRINGVSEDAIRLRLFPFSLQDKARDWL
        +QNP    + IRDY  P   G Q  IV  PI ANNFE+K  L+Q+ +   + G+ +EDPNSHL++FL IC T++ NGVS+DAI LRLFPFSL+DKA++WL
Subjt:  DQNPLEEPRPIRDYFQPVFQGQQSGIVYAPINANNFELKTDLIQMARDCAYRGSPTEDPNSHLKSFLDICGTVRINGVSEDAIRLRLFPFSLQDKARDWL

Query:  QSITLGSITTWDALVQAFLKKFFPPAKTVKLRTEIGTFQQQYDEQLFE--------------------------------TTKTIVDAAAG---------
        QS+  GSI+TW+ +   F+ K+FPP+K+ K+R EI +F QQ +E L+E                                T K ++DAA+G         
Subjt:  QSITLGSITTWDALVQAFLKKFFPPAKTVKLRTEIGTFQQQYDEQLFE--------------------------------TTKTIVDAAAG---------

Query:  -----------------------------------------------------------------------------TGSAQSIESAAALAS--------
                                                                                      G  Q+    A + +        
Subjt:  -----------------------------------------------------------------------------TGSAQSIESAAALAS--------

Query:  --------RPQ----------------------------------EETIEQSLSNRTTKLDEAVIAIN---TTVNGHSAAIKNIETQLGQLVNVVSTMNK
                RPQ                                  E  +E+     +T +D+    +N   T      A+I+N+E Q+GQL   +S  + 
Subjt:  --------RPQ----------------------------------EETIEQSLSNRTTKLDEAVIAIN---TTVNGHSAAIKNIETQLGQLVNVVSTMNK

Query:  GKAPAEQEKMQMEYCKTITVHQEESEDELESEDYETPTGEAEEDTSSDETEKPEPEPPIHSPTLMVPKEKKKKKKKKNNQVQFDKFMNAFMNLNINIPFA
        G  P++      E CK I   Q  SE  LES           E  S  E EK +    I   +     EK+ ++K       + KFM   ++    +   
Subjt:  GKAPAEQEKMQMEYCKTITVHQEESEDELESEDYETPTGEAEEDTSSDETEKPEPEPPIHSPTLMVPKEKKKKKKKKNNQVQFDKFMNAFMNLNINIPFA

Query:  EALEMPQYNSTRVQQKVPEKVADPGSFSVPYSFGTYSF-RALCDLGASINIIPLSLCKKLDIGEIKSTPVKLQLADQSMVRPVGIVENVLIRVGRFFLPI
        E + + +  S  +QQK+P K+ DP SF +P   G  S  +ALCDLGASIN++PLS+ K+L IG++K T + LQLAD+SM  P GIVE+VL++V +F  P 
Subjt:  EALEMPQYNSTRVQQKVPEKVADPGSFSVPYSFGTYSF-RALCDLGASINIIPLSLCKKLDIGEIKSTPVKLQLADQSMVRPVGIVENVLIRVGRFFLPI

Query:  DLYVMDMIENPSMPVILGRPFLATGRVIIDIERKELTIRVKNKK
        D  V+DM E+  +P+ILGRPFLATGR +ID+E+  L +RV ++K
Subjt:  DLYVMDMIENPSMPVILGRPFLATGRVIIDIERKELTIRVKNKK

A0A6J0ZX64 LOW QUALITY PROTEIN: uncharacterized protein LOC1104129458.6e-6931.87Show/hide
Query:  EEPRPIRDYFQPVFQGQQSGIVYAPINANNFELKTDLIQMARDCA-YRGSPTEDPNSHLKSFLDICGTVRINGVSEDAIRLRLFPFSLQDKARDWLQSIT
        E  R +RDY  P+ QG    I    INANNFE+K   IQM +    + G P++DPNSHL +FL+IC T + NGV++DAIRLRLFPFSL+DKA+ WL S+ 
Subjt:  EEPRPIRDYFQPVFQGQQSGIVYAPINANNFELKTDLIQMARDCA-YRGSPTEDPNSHLKSFLDICGTVRINGVSEDAIRLRLFPFSLQDKARDWLQSIT

Query:  LGSITTWDALVQAFLKKFFPPAKTVKLRTEIGTFQQQYDEQLFE--------------------------------TTKTIVDAAAG-------------
         GSITTW+ L Q FL KFFPPAKT K+R +I +F Q   E L+E                                + KTI+DAAAG             
Subjt:  LGSITTWDALVQAFLKKFFPPAKTVKLRTEIGTFQQQYDEQLFE--------------------------------TTKTIVDAAAG-------------

Query:  ----------------TGSAQSI------------ESAAALA----------------------------------------------------------
                        +GS +++               AAL+                                                          
Subjt:  ----------------TGSAQSI------------ESAAALA----------------------------------------------------------

Query:  ---------------SRPQ-------EETIEQSLSNRTTKLDEAVI----AINTTVNGHSAAIKNIETQLGQLVNVVSTMNKGKAPAEQE--KMQMEYCK
                       S P+       ++     +  + ++L+E ++      +  +    A+++N+ETQ+GQL N ++   +G  P++ +      E C+
Subjt:  ---------------SRPQ-------EETIEQSLSNRTTKLDEAVI----AINTTVNGHSAAIKNIETQLGQLVNVVSTMNKGKAPAEQE--KMQMEYCK

Query:  TIT---------VHQEESEDELESEDYETPTGEAEEDTSSDETEKPEPEPPIHSPTLMVPKEKKKKKKKKNNQVQFDKFMNAFMNLNINIPFAEALE-MP
         IT         V+Q+  E E+E  D E   G  E +    + +  + E    S  +  P    ++ +K+  + QF KF+N F  L+INIPFAEALE MP
Subjt:  TIT---------VHQEESEDELESEDYETPTGEAEEDTSSDETEKPEPEPPIHSPTLMVPKEKKKKKKKKNNQVQFDKFMNAFMNLNINIPFAEALE-MP

Query:  QY-------------------------NSTRVQQKVPEKVADPGSFSVPYSFGTYSF-RALCDLGASINIIPLSLCKKLDIGEIKSTPVKLQLADQSMVR
         Y                          S  +Q K+P K+ DPGSF++P + G   F +AL DLGASIN++P S+ +KL +GE K T V LQLAD+S V 
Subjt:  QY-------------------------NSTRVQQKVPEKVADPGSFSVPYSFGTYSF-RALCDLGASINIIPLSLCKKLDIGEIKSTPVKLQLADQSMVR

Query:  PVGIVENVLIRVGRFFLPIDLYVMDMIENPSMPVILGRPFLATGRVIIDIERKELTIRV
        P GI+E+VL++V +F  P+D  ++DM E+  +P+ILGRPFLAT   IID+   +++ +V
Subjt:  PVGIVENVLIRVGRFFLPIDLYVMDMIENPSMPVILGRPFLATGRVIIDIERKELTIRV

A0A6P4BWP3 uncharacterized protein LOC1074724919.9e-6535.09Show/hide
Query:  MADQNPLEEPRPIRDYFQPVFQGQQSGIVYAPINANNFELKTDLIQMAR-DCAYRGSPTEDPNSHLKSFLDICGTVRINGVSEDAIRLRLFPFSLQDKAR
        MA+ N   + R +  Y     +   S I+   ++ANNFELK  LI + + +C+Y G P EDPN HL +FL IC  V+ NGV  D+ +L LFPFSL+DKA 
Subjt:  MADQNPLEEPRPIRDYFQPVFQGQQSGIVYAPINANNFELKTDLIQMAR-DCAYRGSPTEDPNSHLKSFLDICGTVRINGVSEDAIRLRLFPFSLQDKAR

Query:  DWLQSITLGSITTWDALVQAFLKKFFPPAKTVKLRTEIGTFQQQYDEQLFETTKTIVDAAAGTGSAQSIESAAALASRPQEETIEQSLSNRTTKLDEAVI
         WL+S    SI  WD LV  FL KF+PP + ++L+ E+ TF Q   E ++E  +            +++      A   + E ++      T K  EA+ 
Subjt:  DWLQSITLGSITTWDALVQAFLKKFFPPAKTVKLRTEIGTFQQQYDEQLFETTKTIVDAAAGTGSAQSIESAAALASRPQEETIEQSLSNRTTKLDEAVI

Query:  AINTTVNGHSAAIKNIETQLGQLVNVVSTMNKGKA-PAEQEKMQMEYCKTITVHQEESEDELESEDYET--PTGEAEEDTSSDETEKPEPEPPIHSPTLM
              +   A++K++E Q+GQL   +S      + P++      E CK I   Q  S   L S +  T   TG  +E T  +E  K +      +  ++
Subjt:  AINTTVNGHSAAIKNIETQLGQLVNVVSTMNKGKA-PAEQEKMQMEYCKTITVHQEESEDELESEDYET--PTGEAEEDTSSDETEKPEPEPPIHSPTLM

Query:  VPKEKKKKKKKKNNQ----------------------------------VQFDKFMNAFMNLNINIPFAEALE-MPQY--------NSTR----------
        VP +  +K K+KNNQ                                    F KF+  F  L INIP AEALE MP Y        N  R          
Subjt:  VPKEKKKKKKKKNNQ----------------------------------VQFDKFMNAFMNLNINIPFAEALE-MPQY--------NSTR----------

Query:  -------VQQKVPEKVADPGSFSVPYSFGTYSF-RALCDLGASINIIPLSLCKKLDIGEIKSTPVKLQLADQSMVRPVGIVENVLIRVGRFFLPIDLYVM
               +Q+ +P K+ DPGSF +P + G  +  +A+CDLGASIN+IP S+ KKL I E+K   + L+L D+SMV P G++EN+L++V  F  P D  V+
Subjt:  -------VQQKVPEKVADPGSFSVPYSFGTYSF-RALCDLGASINIIPLSLCKKLDIGEIKSTPVKLQLADQSMVRPVGIVENVLIRVGRFFLPIDLYVM

Query:  DMIENPSMPVILGRPFLATGRVIIDIERKELTIRVKNKKKSLKQLKTLKM
        D  E+    VILGRPFLAT R IIDIE  ELT+R++ + K+ +  K +K+
Subjt:  DMIENPSMPVILGRPFLATGRVIIDIERKELTIRVKNKKKSLKQLKTLKM

A0A6P6XAQ1 Reverse transcriptase8.9e-6631.38Show/hide
Query:  RPIRDYFQPVFQGQQSGIVYAPINANNFELKTDLIQMARDCAYRGSPTEDPNSHLKSFLDICGTVRINGVSEDAIRLRLFPFSLQDKARDWLQSITLGSI
        R +RD+  P  QG Q+ IV   +NANNFE+K  LIQM +   Y G+ TEDPNSHL +FL+IC T++ NGVSEDAI+LRLFPFSL+DKA+ WLQS    + 
Subjt:  RPIRDYFQPVFQGQQSGIVYAPINANNFELKTDLIQMARDCAYRGSPTEDPNSHLKSFLDICGTVRINGVSEDAIRLRLFPFSLQDKARDWLQSITLGSI

Query:  TTWDALVQAFLKKFFPPAKTVKLRTEIGTFQQQYDEQLFET--------------------------------TKTIVDAAAGTG--------SAQSIES
        TTWD L +AFL KFFPP KT KLR +I +F QQ  E L+E                                 TKT VDAAAG          + Q IE 
Subjt:  TTWDALVQAFLKKFFPPAKTVKLRTEIGTFQQQYDEQLFET--------------------------------TKTIVDAAAGTG--------SAQSIES

Query:  AAA---------------------------------------------------LAS-------------------------------------------
         AA                                                   +AS                                           
Subjt:  AAA---------------------------------------------------LAS-------------------------------------------

Query:  --------------RP-------QEETIEQSLSNRTTKLDEAVIAINTTVNGHSAA------------------IKNIETQLGQLVNVVSTMNKGKAPAE
                      RP       Q++T+ +S       +++   A N  +   ++A                   +N+E QLGQ+ N V+  N+G  P++
Subjt:  --------------RP-------QEETIEQSLSNRTTKLDEAVIAINTTVNGHSAA------------------IKNIETQLGQLVNVVSTMNKGKAPAE

Query:  QEKMQMEYCKTITVHQEESEDELESEDYETPTGEAEEDTSSDETEKPEPEPPIHSPTLMVPKEKKKKKKKKNNQVQFD-------------KFMNAFMNL
         E    E+ K IT+    S  EL          E E+  +   +E  E             KE+K K+K + N++Q +             KF+   M  
Subjt:  QEKMQMEYCKTITVHQEESEDELESEDYETPTGEAEEDTSSDETEKPEPEPPIHSPTLMVPKEKKKKKKKKNNQVQFD-------------KFMNAFMNL

Query:  NINIPFAEALEMPQYNSTRVQQKVPEKVADPGSFSVPYSFGTYSF-RALCDLGASINIIPLSLCKKLDIGEIKSTPVKLQLADQSMVRPVGIVENVLIRV
           +  +E + + +  S  +Q K+P K+ DPGSF+VP + G   F +ALCDLGAS+++IPL++ ++L + E+K T + LQLAD+S+  P+GI+ENVLI+V
Subjt:  NINIPFAEALEMPQYNSTRVQQKVPEKVADPGSFSVPYSFGTYSF-RALCDLGASINIIPLSLCKKLDIGEIKSTPVKLQLADQSMVRPVGIVENVLIRV

Query:  GRFFLPIDLYVMDMIENPSMPVILGRPFLATGRVIIDIERKELTIRVKNKKKSLKQLKTLKMKCFS
         +F +P+D  V+DM E+ ++P+ILGRPFLAT   IID++R +   ++  ++      K  K   F+
Subjt:  GRFFLPIDLYVMDMIENPSMPVILGRPFLATGRVIIDIERKELTIRVKNKKKSLKQLKTLKMKCFS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTGACCAAAATCCACTTGAGGAGCCTAGGCCTATTAGAGACTACTTTCAGCCCGTGTTTCAGGGGCAACAGTCGGGGATTGTCTATGCACCGATTAATGCCAACAA
CTTTGAGCTGAAGACCGACCTCATTCAGATGGCTCGAGACTGTGCATACAGAGGATCGCCCACCGAGGATCCAAATTCTCATCTTAAATCATTCTTGGACATTTGTGGGA
CAGTAAGGATTAATGGAGTCTCTGAGGATGCTATTCGCTTACGTTTATTTCCATTTTCTTTGCAGGATAAAGCACGAGATTGGTTGCAGTCTATTACCCTTGGGAGCATC
ACCACCTGGGATGCTTTGGTCCAGGCCTTTTTAAAGAAATTTTTCCCTCCTGCAAAGACGGTCAAGTTGAGGACCGAGATTGGGACATTTCAACAACAATATGATGAGCA
GTTGTTCGAGACTACAAAAACGATTGTTGATGCAGCTGCAGGTACAGGGAGTGCACAGTCAATTGAATCAGCTGCTGCTTTGGCATCTAGACCTCAGGAGGAGACCATTG
AACAGAGTCTTAGTAACAGGACAACCAAACTAGATGAGGCAGTCATTGCTATCAACACCACGGTGAACGGCCACAGTGCAGCCATAAAGAACATTGAGACTCAGCTGGGA
CAGTTGGTAAATGTTGTAAGTACCATGAATAAAGGTAAGGCCCCAGCTGAACAGGAGAAAATGCAGATGGAGTACTGTAAGACAATCACTGTGCACCAGGAGGAATCTGA
AGATGAACTGGAATCTGAGGACTATGAAACGCCTACAGGGGAAGCTGAGGAGGACACATCATCTGATGAGACTGAAAAGCCTGAACCTGAGCCTCCTATTCATTCTCCCA
CACTGATGGTTCCCAAGGAAAAGAAAAAGAAAAAGAAGAAAAAGAACAATCAGGTTCAGTTTGATAAATTTATGAATGCTTTTATGAATCTGAATATTAATATTCCTTTT
GCAGAGGCATTAGAGATGCCCCAATACAACAGCACCAGAGTACAACAGAAAGTACCTGAAAAAGTAGCAGATCCAGGGAGTTTTTCTGTTCCTTATAGTTTTGGTACTTA
TTCTTTCAGAGCATTATGTGATTTAGGTGCTAGCATTAATATTATCCCATTGTCCTTGTGTAAGAAGTTAGACATAGGTGAGATTAAATCTACTCCTGTAAAGCTCCAAT
TGGCTGATCAATCTATGGTTAGACCAGTTGGCATTGTAGAAAATGTTTTAATCAGAGTAGGCAGATTTTTTCTCCCTATTGACTTGTATGTTATGGACATGATAGAAAAT
CCTTCGATGCCTGTCATATTAGGACGACCATTCCTCGCTACTGGGCGAGTGATTATAGATATTGAGCGCAAGGAGCTCACTATTAGAGTCAAGAACAAAAAGAAATCTTT
AAAGCAGCTGAAGACTCTAAAGATGAAGTGTTTTTCATGA
mRNA sequenceShow/hide mRNA sequence
ATGGCTGACCAAAATCCACTTGAGGAGCCTAGGCCTATTAGAGACTACTTTCAGCCCGTGTTTCAGGGGCAACAGTCGGGGATTGTCTATGCACCGATTAATGCCAACAA
CTTTGAGCTGAAGACCGACCTCATTCAGATGGCTCGAGACTGTGCATACAGAGGATCGCCCACCGAGGATCCAAATTCTCATCTTAAATCATTCTTGGACATTTGTGGGA
CAGTAAGGATTAATGGAGTCTCTGAGGATGCTATTCGCTTACGTTTATTTCCATTTTCTTTGCAGGATAAAGCACGAGATTGGTTGCAGTCTATTACCCTTGGGAGCATC
ACCACCTGGGATGCTTTGGTCCAGGCCTTTTTAAAGAAATTTTTCCCTCCTGCAAAGACGGTCAAGTTGAGGACCGAGATTGGGACATTTCAACAACAATATGATGAGCA
GTTGTTCGAGACTACAAAAACGATTGTTGATGCAGCTGCAGGTACAGGGAGTGCACAGTCAATTGAATCAGCTGCTGCTTTGGCATCTAGACCTCAGGAGGAGACCATTG
AACAGAGTCTTAGTAACAGGACAACCAAACTAGATGAGGCAGTCATTGCTATCAACACCACGGTGAACGGCCACAGTGCAGCCATAAAGAACATTGAGACTCAGCTGGGA
CAGTTGGTAAATGTTGTAAGTACCATGAATAAAGGTAAGGCCCCAGCTGAACAGGAGAAAATGCAGATGGAGTACTGTAAGACAATCACTGTGCACCAGGAGGAATCTGA
AGATGAACTGGAATCTGAGGACTATGAAACGCCTACAGGGGAAGCTGAGGAGGACACATCATCTGATGAGACTGAAAAGCCTGAACCTGAGCCTCCTATTCATTCTCCCA
CACTGATGGTTCCCAAGGAAAAGAAAAAGAAAAAGAAGAAAAAGAACAATCAGGTTCAGTTTGATAAATTTATGAATGCTTTTATGAATCTGAATATTAATATTCCTTTT
GCAGAGGCATTAGAGATGCCCCAATACAACAGCACCAGAGTACAACAGAAAGTACCTGAAAAAGTAGCAGATCCAGGGAGTTTTTCTGTTCCTTATAGTTTTGGTACTTA
TTCTTTCAGAGCATTATGTGATTTAGGTGCTAGCATTAATATTATCCCATTGTCCTTGTGTAAGAAGTTAGACATAGGTGAGATTAAATCTACTCCTGTAAAGCTCCAAT
TGGCTGATCAATCTATGGTTAGACCAGTTGGCATTGTAGAAAATGTTTTAATCAGAGTAGGCAGATTTTTTCTCCCTATTGACTTGTATGTTATGGACATGATAGAAAAT
CCTTCGATGCCTGTCATATTAGGACGACCATTCCTCGCTACTGGGCGAGTGATTATAGATATTGAGCGCAAGGAGCTCACTATTAGAGTCAAGAACAAAAAGAAATCTTT
AAAGCAGCTGAAGACTCTAAAGATGAAGTGTTTTTCATGA
Protein sequenceShow/hide protein sequence
MADQNPLEEPRPIRDYFQPVFQGQQSGIVYAPINANNFELKTDLIQMARDCAYRGSPTEDPNSHLKSFLDICGTVRINGVSEDAIRLRLFPFSLQDKARDWLQSITLGSI
TTWDALVQAFLKKFFPPAKTVKLRTEIGTFQQQYDEQLFETTKTIVDAAAGTGSAQSIESAAALASRPQEETIEQSLSNRTTKLDEAVIAINTTVNGHSAAIKNIETQLG
QLVNVVSTMNKGKAPAEQEKMQMEYCKTITVHQEESEDELESEDYETPTGEAEEDTSSDETEKPEPEPPIHSPTLMVPKEKKKKKKKKNNQVQFDKFMNAFMNLNINIPF
AEALEMPQYNSTRVQQKVPEKVADPGSFSVPYSFGTYSFRALCDLGASINIIPLSLCKKLDIGEIKSTPVKLQLADQSMVRPVGIVENVLIRVGRFFLPIDLYVMDMIEN
PSMPVILGRPFLATGRVIIDIERKELTIRVKNKKKSLKQLKTLKMKCFS