; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0018097 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0018097
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrotrans_gag domain-containing protein
Genome locationchr5:16041981..16044156
RNA-Seq ExpressionLag0018097
SyntenyLag0018097
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain
IPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6734747.1 hypothetical protein I3842_01G285500 [Carya illinoinensis]1.1e-12138.9Show/hide
Query:  MADQNPPEEPRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGLPTEDPNSHLKSFLDICGT---------------------DKARD
        MA+++    PR ++DY +PV  G  S I+  PINANNFELK  LI M +   + G P +DPN HL  FL+IC T                     DKAR 
Subjt:  MADQNPPEEPRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGLPTEDPNSHLKSFLDICGT---------------------DKARD

Query:  WLQSITPRSITTWDALVQAFLKKSFPPTKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKT
        WLQS+ P SI +W  + + FL K FPP KT +LR+EIG F+Q   E L+EAWER+K+L+R+CPQHG PDWLQVQ+FYNGL   T+TIVDAA+GGTL+SKT
Subjt:  WLQSITPRSITTWDALVQAFLKKSFPPTKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKT

Query:  VENARTLLEDMATNSYQWPSEWCTPKKIVVGVFEVDKVSALQAQMTSLANAFMKFSGTGSAQSIESAAALAS----------------------------
         E A  LLE+MA+N+YQWP+E    KK V G+ E++ ++AL AQ+ +L++     +     QS E  A+ +                             
Subjt:  VENARTLLEDMATNSYQWPSEWCTPKKIVVGVFEVDKVSALQAQMTSLANAFMKFSGTGSAQSIESAAALAS----------------------------

Query:  -------RPHEE----------------------TIETTKLEEVVIA------------------INTTVNGHSAAIKNIETQLGQLVNVVSTMNKGKAP
               R HE                       + +   LE+ +++                  I T  +   A +KN+E Q+GQL   ++   +G  P
Subjt:  -------RPHEE----------------------TIETTKLEEVVIA------------------INTTVNGHSAAIKNIETQLGQLVNVVSTMNKGKAP

Query:  AEQEKPQMEYCKEITVHQ-EESKEEPESEDYETPT----GEAEEDTSSDEAEKPNLE-------------PPIPSPTLLVPKERKKKKKKKNNQVQFDKF
        +  E    E CK IT+   +E +  P  E   TPT    G++++    +E     LE             PPI +P L  P+  +K+K  K    QF KF
Subjt:  AEQEKPQMEYCKEITVHQ-EESKEEPESEDYETPT----GEAEEDTSSDEAEKPNLE-------------PPIPSPTLLVPKERKKKKKKKNNQVQFDKF

Query:  MNAFMNLNINIPFAEALE-MPQYNRFMKEWLAKKRKEKKVDTVYLASTCSIRVQQKVPEKVADPGSFSVPCSCGTYSF-KALCDLGASINTIPLSLCKKL
        ++ F  ++INIPFA+ALE MP Y +F+K+ ++KKR+ ++ +TV L+  CS  +Q+K+P+K+ DP SF++PC+ G   F + LCDLGASIN +P  +C+KL
Subjt:  MNAFMNLNINIPFAEALE-MPQYNRFMKEWLAKKRKEKKVDTVYLASTCSIRVQQKVPEKVADPGSFSVPCSCGTYSF-KALCDLGASINTIPLSLCKKL

Query:  DIGEIKSTLVKLQLADQSVVRPVGIVENVLIRVRKFFLPIDLYGMNMIENPSMPVILGRPFLATGRVIIDIERRELTIRVKNEKEIFKAVEDSK
         +GE+K T + LQLAD+S+  P GI+E+VL++V KF  P D   ++M E+  +P+ILGRPFLATGR +ID+++ ELT+RV  E+ +F   +  K
Subjt:  DIGEIKSTLVKLQLADQSVVRPVGIVENVLIRVRKFFLPIDLYGMNMIENPSMPVILGRPFLATGRVIIDIERRELTIRVKNEKEIFKAVEDSK

KAG7947748.1 hypothetical protein I3843_14G109500 [Carya illinoinensis]2.5e-12339.19Show/hide
Query:  MADQNPPEEPRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGLPTEDPNSHLKSFLDICGT---------------------DKARD
        MA+++    PR ++DY +PV  G  S I+  PINANNFELK  LI M +   + G P +DPN HL  FL+IC T                     DKAR 
Subjt:  MADQNPPEEPRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGLPTEDPNSHLKSFLDICGT---------------------DKARD

Query:  WLQSITPRSITTWDALVQAFLKKSFPPTKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKT
        WLQS+ P SI +W  + + FL K FPP KT +LR+EIG F+Q   E L+EAWER+K+L+R+CPQHG PDWLQVQ+FYNGL   T+TIVDAA+GGTL+SKT
Subjt:  WLQSITPRSITTWDALVQAFLKKSFPPTKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKT

Query:  VENARTLLEDMATNSYQWPSEWCTPKKIVVGVFEVDKVSALQAQMTSLANAFMKFSGTGSAQSIESAAALAS----------------------------
         E A  LLE+MA+N+YQWP+E    KK V G+ E++ ++AL AQ+ +L++     +     QS E  A+ +                             
Subjt:  VENARTLLEDMATNSYQWPSEWCTPKKIVVGVFEVDKVSALQAQMTSLANAFMKFSGTGSAQSIESAAALAS----------------------------

Query:  -------RPHEE----------------------TIETTKLEEVVIA------------------INTTVNGHSAAIKNIETQLGQLVNVVSTMNKGKAP
               R HE                       + +   LE+ +++                  I T  +   A +KN+E Q+GQL   ++   +G  P
Subjt:  -------RPHEE----------------------TIETTKLEEVVIA------------------INTTVNGHSAAIKNIETQLGQLVNVVSTMNKGKAP

Query:  AEQEKPQMEYCKEITVHQ-EESKEEPESEDYETPT----GEAEEDTSSDEAEKPNLE-------------PPIPSPTLLVPKERKKKKKKKNNQVQFDKF
        +  E    E CK IT+   +E +  P  E   TPT    G++++    +E     LE             PPI +P L  P+  +K+K  K    QF KF
Subjt:  AEQEKPQMEYCKEITVHQ-EESKEEPESEDYETPT----GEAEEDTSSDEAEKPNLE-------------PPIPSPTLLVPKERKKKKKKKNNQVQFDKF

Query:  MNAFMNLNINIPFAEALE-MPQYNRFMKEWLAKKRKEKKVDTVYLASTCSIRVQQKVPEKVADPGSFSVPCSCGTYSF-KALCDLGASINTIPLSLCKKL
        ++ F  ++INIPFA+ALE MP Y +F+K+ ++KKR+ ++ +TV L+  CS  +Q+K+P+K+ DPGSF++PC+ G   F + LCDLGASIN +P S+C+KL
Subjt:  MNAFMNLNINIPFAEALE-MPQYNRFMKEWLAKKRKEKKVDTVYLASTCSIRVQQKVPEKVADPGSFSVPCSCGTYSF-KALCDLGASINTIPLSLCKKL

Query:  DIGEIKSTLVKLQLADQSVVRPVGIVENVLIRVRKFFLPIDLYGMNMIENPSMPVILGRPFLATGRVIIDIERRELTIRVKNEKEIFKAVEDSK
         +GE+K T + LQLAD+S+  P GI+E+VL++V KF  P D   ++M E+  +P+ILGRPFLATGR +ID+++ ELT+RV  E+ +F   +  K
Subjt:  DIGEIKSTLVKLQLADQSVVRPVGIVENVLIRVRKFFLPIDLYGMNMIENPSMPVILGRPFLATGRVIIDIERRELTIRVKNEKEIFKAVEDSK

KAG7990634.1 hypothetical protein I3843_02G035100 [Carya illinoinensis]1.1e-13140.84Show/hide
Query:  SREVAPFDPEIERTIHRLRRENREIIQMADQNPPEEPRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGLPTEDPNSHLKSFLDICG
        SR++ P DPEIERT+  LRR   +I+ MA+++    PR ++DY +PV  G  S I+  PINANNFELK  LI M +   + G P +DPN HL  FL+IC 
Subjt:  SREVAPFDPEIERTIHRLRRENREIIQMADQNPPEEPRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGLPTEDPNSHLKSFLDICG

Query:  T---------------------DKARDWLQSITPRSITTWDALVQAFLKKSFPPTKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQV
        T                     DKAR WLQS+ P SI +W  + + FL K FPP KT +LR+EIG F+Q   E L+EAWER+K+L+R+CPQHG PDWLQV
Subjt:  T---------------------DKARDWLQSITPRSITTWDALVQAFLKKSFPPTKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQV

Query:  QLFYNGLTPSTKTIVDAAAGGTLLSKTVENARTLLEDMATNSYQWPSEWCTPKKIVVGVFEVDKVSALQAQMTSLANAFMKFSGTGSAQSIESAAALAS-
        Q+FYNGL   T+TIVDAA+GGTL+SKT E A  LLE+MA+N+YQWP+E    KK V G+ +++ ++AL AQ+ +L++     +     QS E  A+ +  
Subjt:  QLFYNGLTPSTKTIVDAAAGGTLLSKTVENARTLLEDMATNSYQWPSEWCTPKKIVVGVFEVDKVSALQAQMTSLANAFMKFSGTGSAQSIESAAALAS-

Query:  ----------------------------------RPHEE-TIETTK---------------------LEEVVIA------------------INTTVNGH
                                          R HE  +   TK                     LE+ +++                  I T  +  
Subjt:  ----------------------------------RPHEE-TIETTK---------------------LEEVVIA------------------INTTVNGH

Query:  SAAIKNIETQLGQLVNVVSTMNKGKAPAEQEKPQMEYCKEITVHQ-EESKEEPESEDYETPT----GEAEEDTSSDEAEKPNLE-------------PPI
         AAIKNIE Q+GQL   ++   +G  P+  E    E CK IT+   +E +  P  E   TPT    G+++     DE     LE             PPI
Subjt:  SAAIKNIETQLGQLVNVVSTMNKGKAPAEQEKPQMEYCKEITVHQ-EESKEEPESEDYETPT----GEAEEDTSSDEAEKPNLE-------------PPI

Query:  PSPTLLVPKERKKKKKKKNNQVQFDKFMNAFMNLNINIPFAEALE-MPQYNRFMKEWLAKKRKEKKVDTVYLASTCSIRVQQKVPEKVADPGSFSVPCSC
         +P L  P+  +K+K  K    QF KF++ F  ++INIPFA+ALE MP Y +F+K+ ++KKR+ ++ +TV L+  CS  +Q+K+P+K+ DPGSF++PC+ 
Subjt:  PSPTLLVPKERKKKKKKKNNQVQFDKFMNAFMNLNINIPFAEALE-MPQYNRFMKEWLAKKRKEKKVDTVYLASTCSIRVQQKVPEKVADPGSFSVPCSC

Query:  GTYSF-KALCDLGASINTIPLSLCKKLDIGEIKSTLVKLQLADQSVVRPVGIVENVLIRVRKFFLPIDLYGMNMIENPSMPVILGRPFLATGRVIIDIER
        G   F K LCDLGASIN +PLS+C+KL + E+K T + LQLAD+S+  P GI+E+VL++V KF  P D   ++M E+  +P+ILGRPFLATGR +ID+++
Subjt:  GTYSF-KALCDLGASINTIPLSLCKKLDIGEIKSTLVKLQLADQSVVRPVGIVENVLIRVRKFFLPIDLYGMNMIENPSMPVILGRPFLATGRVIIDIER

Query:  RELTIRVKNEKEIFK
         ELT+RV  E+ +FK
Subjt:  RELTIRVKNEKEIFK

XP_023874613.1 uncharacterized protein LOC111987139 [Quercus suber]6.9e-12139.39Show/hide
Query:  MADQNPPEEPRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGLPTEDPNSHLKSFLDICGT---------------------DKARD
        MA+     +PR ++DY +P+     SGI    INANNFELK  LI M +   + G P +DPN HL  FL+IC T                     DKAR 
Subjt:  MADQNPPEEPRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGLPTEDPNSHLKSFLDICGT---------------------DKARD

Query:  WLQSITPRSITTWDALVQAFLKKSFPPTKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKT
        WLQS+ P SIT+W  + + FL K FPP KT +LR+EIG F+Q   E L+EAWER+K+L+R CPQHG PDWLQVQ+FYNGL   T+TIVDAA+GGTL+SKT
Subjt:  WLQSITPRSITTWDALVQAFLKKSFPPTKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKT

Query:  VENARTLLEDMATNSYQWPSEWCTPKKIVVGVFEVDKVSALQAQMTSLANAFMKFSGTGSAQSIESAAALA-----------------------------
         E A +LLE+MA+N+YQWP+E    KK V G+ E++  +AL AQ+ SL++     +     Q  E  AA +                             
Subjt:  VENARTLLEDMATNSYQWPSEWCTPKKIVVGVFEVDKVSALQAQMTSLANAFMKFSGTGSAQSIESAAALA-----------------------------

Query:  -----------------------------SRPHEET----------IETTKL-----EEVVIAINTTVNGHSAAIKNIETQLGQLVNVVSTMNKGKAPAE
                                     S+P E+           +E TK      +  +  I T  +   A +KN+E Q+GQL   ++   +G  P+ 
Subjt:  -----------------------------SRPHEET----------IETTKL-----EEVVIAINTTVNGHSAAIKNIETQLGQLVNVVSTMNKGKAPAE

Query:  QEKPQMEYCKEITVHQ-EESKEEPESEDYETPTG-------------EAEEDTSSDEAEKPNLEPPIPSPTLLVPKERKKKKKKKNNQVQFDKFMNAFMN
         E    E CK IT+    E +  P  E   TPT              E  EDT  +    P++  P   P L  P    ++ +K+    QF KF++ F  
Subjt:  QEKPQMEYCKEITVHQ-EESKEEPESEDYETPTG-------------EAEEDTSSDEAEKPNLEPPIPSPTLLVPKERKKKKKKKNNQVQFDKFMNAFMN

Query:  LNINIPFAEALE-MPQYNRFMKEWLAKKRKEKKVDTVYLASTCSIRVQQKVPEKVADPGSFSVPCSCGTYSF-KALCDLGASINTIPLSLCKKLDIGEIK
        ++INIPFA+ALE MP Y +F+K+ ++KKR+ ++ +TV L+  CS  +Q+K+P+K+ DPGSF++PC+ G   F K LCDLGASIN +PLS+ +KL +GE+K
Subjt:  LNINIPFAEALE-MPQYNRFMKEWLAKKRKEKKVDTVYLASTCSIRVQQKVPEKVADPGSFSVPCSCGTYSF-KALCDLGASINTIPLSLCKKLDIGEIK

Query:  STLVKLQLADQSVVRPVGIVENVLIRVRKFFLPIDLYGMNMIENPSMPVILGRPFLATGRVIIDIERRELTIRVKNEKEIFKAVEDSK
         T + LQLAD+S+  P GI+E+VL++V KF  P D   ++M E+  +P+ILGRPFLATGR ++D+++ ELT+RV  E+  F   E  K
Subjt:  STLVKLQLADQSVVRPVGIVENVLIRVRKFFLPIDLYGMNMIENPSMPVILGRPFLATGRVIIDIERRELTIRVKNEKEIFKAVEDSK

XP_038976409.1 uncharacterized protein LOC113461320 [Phoenix dactylifera]3.0e-11639.5Show/hide
Query:  MADQNPPEEPRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGLPTEDPNSHLKSFLDICGT---------------------DKARD
        MAD    +  R + DY  P   G Q  IV   +NANNFE+K GLIQM +   + G P+EDP++HL +FL+IC T                     DKA+ 
Subjt:  MADQNPPEEPRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGLPTEDPNSHLKSFLDICGT---------------------DKARD

Query:  WLQSITPRSITTWDALVQAFLKKSFPPTKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKT
        WL S  P S TTW+AL QAFL K FPP KT KLR +I +F Q   E L+EAWERFK+L RKCP HG PDWL VQ FYNGLT S +  +DAAAGGTL+SK+
Subjt:  WLQSITPRSITTWDALVQAFLKKSFPPTKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKT

Query:  VENARTLLEDMATNSYQWPSEWCTPKKIVVGVFEVDKVSALQAQMTSLANAFMKFSGTGSAQS-------------------------------------
         E A  LLE+MA+N+YQW +E C PKK V G+++VD ++ L A++ SL   F K     S  S                                     
Subjt:  VENARTLLEDMATNSYQWPSEWCTPKKIVVGVFEVDKVSALQAQMTSLANAFMKFSGTGSAQS-------------------------------------

Query:  ----------------IESAAALASRPHEETIETTKLEEV------VIAINTTVNGHS--------------AAIKNIETQLGQLVNVVSTMNKGKAPAE
                         +     +SRP        KL +        IAI    N  S              ++ +N+E QLGQL N +++  +G  P++
Subjt:  ----------------IESAAALASRPHEETIETTKLEEV------VIAINTTVNGHS--------------AAIKNIETQLGQLVNVVSTMNKGKAPAE

Query:  QEKPQMEYCKEITVHQEESKEEPESE-------DYETPTGEAEEDTSSDEAEKPNLEPPIPSPTLLVPKERKKKKKKKNNQVQFDKFMNAFMNLNINIPF
         E    E+CK +T+   +   +  SE       DYE    +  E+   D A+ P+  PP+      +P  ++ K+ K +   QF+KF+  F  L+INIPF
Subjt:  QEKPQMEYCKEITVHQEESKEEPESE-------DYETPTGEAEEDTSSDEAEKPNLEPPIPSPTLLVPKERKKKKKKKNNQVQFDKFMNAFMNLNINIPF

Query:  AEAL-EMPQYNRFMKEWLAKKRKEKKVDTVYLASTCSIRVQQKVPEKVADPGSFSVPCSCGTYSF-KALCDLGASINTIPLSLCKKLDIGEIKSTLVKLQ
        A+AL ++P Y +F+KE ++KKRK +  +T+ L   CS  +Q K+P K+ DPGSFS+PC+ G   F +ALCDLGAS++ +PLS+ +KL + E+K T + LQ
Subjt:  AEAL-EMPQYNRFMKEWLAKKRKEKKVDTVYLASTCSIRVQQKVPEKVADPGSFSVPCSCGTYSF-KALCDLGASINTIPLSLCKKLDIGEIKSTLVKLQ

Query:  LADQSVVRPVGIVENVLIRVRKFFLPIDLYGMNMIENPSMPVILGRPFLATGRVIIDIERRELTIRVKNEKEIFKAVEDSK
        LAD+SV  P+GI+ENVLI+V+KF +P+D   + M E+  +P+ILGRPFLAT   IID++   LT++V  E+  F   E +K
Subjt:  LADQSVVRPVGIVENVLIRVRKFFLPIDLYGMNMIENPSMPVILGRPFLATGRVIIDIERRELTIRVKNEKEIFKAVEDSK

TrEMBL top hitse value%identityAlignment
A0A6J0ZX64 LOW QUALITY PROTEIN: uncharacterized protein LOC1104129451.2e-11538.9Show/hide
Query:  RSSREVAPFDPEIERTIHRLRRENREII----QMADQN----------PPEEPRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCA-YRGL
        R++  + PFDP+IERT  R RREN ++      MA+ N           PE  R +RDY  P+ QG    I    INANNFE+K   IQM +    + GL
Subjt:  RSSREVAPFDPEIERTIHRLRRENREII----QMADQN----------PPEEPRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCA-YRGL

Query:  PTEDPNSHLKSFLDICGT---------------------DKARDWLQSITPRSITTWDALVQAFLKKSFPPTKTVKLRTEIGTFQQQYDEQLFEAWERFK
        P++DPNSHL +FL+IC T                     DKA+ WL S+   SITTW+ L Q FL K FPP KT K+R +I +F Q   E L+EAWERFK
Subjt:  PTEDPNSHLKSFLDICGT---------------------DKARDWLQSITPRSITTWDALVQAFLKKSFPPTKTVKLRTEIGTFQQQYDEQLFEAWERFK

Query:  ELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKTVENARTLLEDMATNSYQWPSEWCTPKKIVVGVFEVDKVSALQAQMTSLA-------
        ELLR+CP HG PDWLQVQ FYNGL  S KTI+DAAAGG L+SK   +A  LLE+MA+N+YQWPSE    +K  VG +E+D +  L  Q+ +L+       
Subjt:  ELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKTVENARTLLEDMATNSYQWPSEWCTPKKIVVGVFEVDKVSALQAQMTSLA-------

Query:  -----NAFMKFSGTG-------------SAQSIESAAALASRPHEETI---------------------------------------ETTKLEEVVI---
             N+ +     G             S Q + +     + P+  T                                        + ++LEE+++   
Subjt:  -----NAFMKFSGTG-------------SAQSIESAAALASRPHEETI---------------------------------------ETTKLEEVVI---

Query:  -AINTTVNGHSAAIKNIETQLGQLVNVVSTMNKGKAPAE-QEKPQ-MEYCKEIT---------VHQEESKEEPESED------YETPTGEAEEDTSSDEA
           +  +    A+++N+ETQ+GQL N ++   +G  P++ Q  P+  E C+ IT         V+Q+  + E E  D       E    + ++D + ++ 
Subjt:  -AINTTVNGHSAAIKNIETQLGQLVNVVSTMNKGKAPAE-QEKPQ-MEYCKEIT---------VHQEESKEEPESED------YETPTGEAEEDTSSDEA

Query:  EKPNLEPPIPSPTLLVPKERKKKKKKKNNQVQFDKFMNAFMNLNINIPFAEALE-MPQYNRFMKEWLAKKRKEKKVDTVYLASTCSIRVQQKVPEKVADP
            + PP P P  L     +K+K +K    QF KF+N F  L+INIPFAEALE MP Y +F+K+ L+KKRK  + +TV+L   CS  +Q K+P K+ DP
Subjt:  EKPNLEPPIPSPTLLVPKERKKKKKKKNNQVQFDKFMNAFMNLNINIPFAEALE-MPQYNRFMKEWLAKKRKEKKVDTVYLASTCSIRVQQKVPEKVADP

Query:  GSFSVPCSCGTYSF-KALCDLGASINTIPLSLCKKLDIGEIKSTLVKLQLADQSVVRPVGIVENVLIRVRKFFLPIDLYGMNMIENPSMPVILGRPFLAT
        GSF++PC+ G   F KAL DLGASIN +P S+ +KL +GE K T V LQLAD+S V P GI+E+VL++V KF  P+D   ++M E+  +P+ILGRPFLAT
Subjt:  GSFSVPCSCGTYSF-KALCDLGASINTIPLSLCKKLDIGEIKSTLVKLQLADQSVVRPVGIVENVLIRVRKFFLPIDLYGMNMIENPSMPVILGRPFLAT

Query:  GRVIIDIERRELTIRVKNEKEIFKAVEDSK
           IID+   +++ +V  E   F     SK
Subjt:  GRVIIDIERRELTIRVKNEKEIFKAVEDSK

A0A6J1DU19 uncharacterized protein LOC1110243612.6e-10240.6Show/hide
Query:  IRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGLPTEDPNSHLKSFLDICGTDKARDWLQS-----ITPRSITTWDALVQAFLKKSFPPT
        IRDY QP F     GI+  PINANN ELK GLIQM R+  +RG  TEDPN+HL  FLD+CGT K    +       + P S+   + +VQAFL   FPP 
Subjt:  IRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGLPTEDPNSHLKSFLDICGTDKARDWLQS-----ITPRSITTWDALVQAFLKKSFPPT

Query:  KTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKTVENARTLLEDMATNSYQWPSEWCTPKKI
        KT +LRTEI +F++   EQLFE WER+KELLRKCPQHG  +WLQ+Q+FYNGL   T+TI+DAAAGGTLLS+T ENA  LL+DMA NS+QWPSE    KK 
Subjt:  KTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKTVENARTLLEDMATNSYQWPSEWCTPKKI

Query:  VVGVFEVDKVSALQAQMTSLANAFMKFSGTGSAQSIESAAALASRP-HEETIE-----------------------------TTKLEEVVIAINTTVNGH
        V G++E+D++S+L+AQ+ +L NA  K SG G++ S E  AA  +   +E TIE                              +++E  V  +   + G+
Subjt:  VVGVFEVDKVSALQAQMTSLANAFMKFSGTGSAQSIESAAALASRP-HEETIE-----------------------------TTKLEEVVIAINTTVNGH

Query:  SAAIKNIETQLGQLVNVVSTMNKGKAPAEQEKPQMEYCKEITVHQEESKEEPESEDYETPTGEAEEDTSSDEAEKPNLEPPIPSPTLLVPKERKKKKKKK
        + +IKN+E Q+GQ+   ++TM KGK P++ E    E+CK +T+   +  +EPE +  E P    EE  + +E  K        +P L   K         
Subjt:  SAAIKNIETQLGQLVNVVSTMNKGKAPAEQEKPQMEYCKEITVHQEESKEEPESEDYETPTGEAEEDTSSDEAEKPNLEPPIPSPTLLVPKERKKKKKKK

Query:  NNQVQFDKFMNAFMNLNINIPFAEALE-MPQYNRFMKEWLAKKRKEKKVDTVYLASTCSIRVQQKVPEKVADPGSFSVPCSCGTYSF-KALCDLGASINT
         N + + +                ALE MP Y RFMK+ +  KRK +  +TV L   CS  +Q+K+P+K+ DPGSF++PC+  + SF KALCD+ ASIN 
Subjt:  NNQVQFDKFMNAFMNLNINIPFAEALE-MPQYNRFMKEWLAKKRKEKKVDTVYLASTCSIRVQQKVPEKVADPGSFSVPCSCGTYSF-KALCDLGASINT

Query:  IPLSLCKKLDIGEIKSTLVKLQLADQSVVRPVGIVENVLIRVRKFFLPIDLYGMNMIENPSMPVILGRPFLATGRVIIDIERRELTIRVKNEKEIF
        +PL                             G++E+VL++V +   P D   +   E+  +P+ILGR FLATG  +ID++   LT+RV  E  +F
Subjt:  IPLSLCKKLDIGEIKSTLVKLQLADQSVVRPVGIVENVLIRVRKFFLPIDLYGMNMIENPSMPVILGRPFLATGRVIIDIERRELTIRVKNEKEIF

A0A6P6XAQ1 Reverse transcriptase2.7e-9936.58Show/hide
Query:  RPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGLPTEDPNSHLKSFLDICGT---------------------DKARDWLQSITPRSI
        R +RD+  P  QG Q+ IV   +NANNFE+K  LIQM +   Y G  TEDPNSHL +FL+IC T                     DKA+ WLQS  P + 
Subjt:  RPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGLPTEDPNSHLKSFLDICGT---------------------DKARDWLQSITPRSI

Query:  TTWDALVQAFLKKSFPPTKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKTVENARTLLED
        TTWD L +AFL K FPP KT KLR +I +F QQ  E L+EAWER++EL R+CP HG PDWL VQ FYNGLT  TKT VDAAAGG L+ KT E A+ L+E+
Subjt:  TTWDALVQAFLKKSFPPTKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKTVENARTLLED

Query:  MATNSYQWPSEWCTPKKIVVGVFEVDKVSALQAQMTSLANAFMKFSGTGSAQSIESAAALA----------------------SRP--------------
        MA N+YQW +E    ++   G+ EVD ++ L A+M ++     +  G+ S Q +  A+                         +RP              
Subjt:  MATNSYQWPSEWCTPKKIVVGVFEVDKVSALQAQMTSLANAFMKFSGTGSAQSIESAAALA----------------------SRP--------------

Query:  ------------------------HEETIETTK------LEEVVIAINTTVNGHSAA------------------IKNIETQLGQLVNVVSTMNKGKAPA
                                 ++T+  +K      +E++  A N  +   ++A                   +N+E QLGQ+ N V+  N+G  P+
Subjt:  ------------------------HEETIETTK------LEEVVIAINTTVNGHSAA------------------IKNIETQLGQLVNVVSTMNKGKAPA

Query:  EQEKPQMEYCKEITVHQEESKEEPESEDYETPTGEAEEDTSSDEAEKPNLEPPIPSPTLLVPKERKKKKKKKNNQVQFDKFMNAFMNLNINIPFAEALEM
        + E    E+ K IT+   +   EP         G   E    +  +   L+           KE K K+K + N++Q        M     IP      +
Subjt:  EQEKPQMEYCKEITVHQEESKEEPESEDYETPTGEAEEDTSSDEAEKPNLEPPIPSPTLLVPKERKKKKKKKNNQVQFDKFMNAFMNLNINIPFAEALEM

Query:  PQYNRFMKEWLAKKRKEKKVDTVYLASTCSIRVQQKVPEKVADPGSFSVPCSCGTYSF-KALCDLGASINTIPLSLCKKLDIGEIKSTLVKLQLADQSVV
        P Y +F+KE + KKRK    +T+ L   CS  +Q K+P K+ DPGSF+VPC+ G   F KALCDLGAS++ IPL++ ++L + E+K T + LQLAD+S+ 
Subjt:  PQYNRFMKEWLAKKRKEKKVDTVYLASTCSIRVQQKVPEKVADPGSFSVPCSCGTYSF-KALCDLGASINTIPLSLCKKLDIGEIKSTLVKLQLADQSVV

Query:  RPVGIVENVLIRVRKFFLPIDLYGMNMIENPSMPVILGRPFLATGRVIIDIERRELTIRVKNEKEIF
         P+GI+ENVLI+V+KF +P+D   ++M E+ ++P+ILGRPFLAT   IID++R +   ++  E+  F
Subjt:  RPVGIVENVLIRVRKFFLPIDLYGMNMIENPSMPVILGRPFLATGRVIIDIERRELTIRVKNEKEIF

A0A6P8DD93 uncharacterized protein LOC1162064533.8e-10135.67Show/hide
Query:  MRSSR--EVAPFDPEIERTIHRLRRENR-----EIIQMADQNPPEE----PRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGLPTE
        MR SR  E+ P DPEIERT+HRLRRENR     ++++MAD +   +     R +RDY  P   G  S I    I ANNFELK  LIQM +   + G P E
Subjt:  MRSSR--EVAPFDPEIERTIHRLRRENR-----EIIQMADQNPPEE----PRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGLPTE

Query:  DPNSHLKSFLDICGT---------------------DKARDWLQSITPRSITTWDALVQAFLKKSFPPTKTVKLRTEIGTFQQQYDEQLFEAWERFKELL
         P+ H+  FL  C T                     DKAR W  S+   SITTW  L   FL++ FPP +T +LR EI  F +   E L+EAWERFKE +
Subjt:  DPNSHLKSFLDICGT---------------------DKARDWLQSITPRSITTWDALVQAFLKKSFPPTKTVKLRTEIGTFQQQYDEQLFEAWERFKELL

Query:  RKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKTVENARTLLEDMATNSYQWPSEWCTPKKIVVGVFEVDKVSALQAQMTSLANAFMKFSGTG
        RKCP HG PD L +++FY  L  + +++VDAAAGG L+ K  + A  L+E+MA++++ W +E    K  V  V ++D ++ L  Q+++L     K +   
Subjt:  RKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKTVENARTLLEDMATNSYQWPSEWCTPKKIVVGVFEVDKVSALQAQMTSLANAFMKFSGTG

Query:  SAQSIESA-AALASRPHEETIE------------------------------------------------------------------------TTKLEE
        S  + + A   L S PH  T+E                                                                         +++EE
Subjt:  SAQSIESA-AALASRPHEETIE------------------------------------------------------------------------TTKLEE

Query:  VVIA----INTTVNGHSAAIKNIETQLGQLVNVVSTMNKGKAPAE-QEKPQ------MEYCKEITVHQEESKEEPESEDYETPTGEAEEDTSSDEAEKPN
        ++++     +T +    A I+N+E Q+ Q+   +S    G  P+  +E P+      +   KE+ +   +++ + ES + +    + EE        KP 
Subjt:  VVIA----INTTVNGHSAAIKNIETQLGQLVNVVSTMNKGKAPAE-QEKPQ------MEYCKEITVHQEESKEEPESEDYETPTGEAEEDTSSDEAEKPN

Query:  LEPPIPSPTLLVPKERKKKKKKKNNQVQFDKFMNAFMNLNINIPFAEAL-EMPQYNRFMKEWLAKKRKEKKVDTVYLASTCSIRVQQ---KVPEKVADPG
        + PP+P P         ++ K++    QF KF++ F  L INIPFAEAL +MP Y RFMK+ L KKRK    + V L   CS+ +Q+    +P K  D G
Subjt:  LEPPIPSPTLLVPKERKKKKKKKNNQVQFDKFMNAFMNLNINIPFAEAL-EMPQYNRFMKEWLAKKRKEKKVDTVYLASTCSIRVQQ---KVPEKVADPG

Query:  SFSVPCSCGTYSFK-ALCDLGASINTIPLSLCKKLDIGEIKSTLVKLQLADQSVVRPVGIVENVLIRVRKFFLPIDLYGMNMIENPSMPVILGRPFLATG
        SF+VPC+ G + F+  L D GASIN +PLS+ +KL +GE K T V LQLAD+S+  P GIVENVL++V KF  P+D   + M E+  +P+ILGRPFLATG
Subjt:  SFSVPCSCGTYSFK-ALCDLGASINTIPLSLCKKLDIGEIKSTLVKLQLADQSVVRPVGIVENVLIRVRKFFLPIDLYGMNMIENPSMPVILGRPFLATG

Query:  RVIIDIERRELTIRVKNEKEIFKAVEDSK
        + +ID+E+ +LT+RV NE+  F   +  K
Subjt:  RVIIDIERRELTIRVKNEKEIFKAVEDSK

A0A6P8DKJ2 uncharacterized protein LOC1162042313.8e-10135.67Show/hide
Query:  MRSSR--EVAPFDPEIERTIHRLRRENR-----EIIQMADQNPPEE----PRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGLPTE
        MR SR  E+ P DPEIERT+HRLRRENR     ++++MAD +   +     R +RDY  P   G  S I    I ANNFELK  LIQM +   + G P E
Subjt:  MRSSR--EVAPFDPEIERTIHRLRRENR-----EIIQMADQNPPEE----PRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGLPTE

Query:  DPNSHLKSFLDICGT---------------------DKARDWLQSITPRSITTWDALVQAFLKKSFPPTKTVKLRTEIGTFQQQYDEQLFEAWERFKELL
         P+ H+  FL  C T                     DKAR W  S+   SITTW  L   FL++ FPP +T +LR EI  F +   E L+EAWERFKE +
Subjt:  DPNSHLKSFLDICGT---------------------DKARDWLQSITPRSITTWDALVQAFLKKSFPPTKTVKLRTEIGTFQQQYDEQLFEAWERFKELL

Query:  RKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKTVENARTLLEDMATNSYQWPSEWCTPKKIVVGVFEVDKVSALQAQMTSLANAFMKFSGTG
        RKCP HG PD L +++FY  L  + +++VDAAAGG L+ K  + A  L+E+MA++++ W +E    K  V  V ++D ++ L  Q+++L     K +   
Subjt:  RKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKTVENARTLLEDMATNSYQWPSEWCTPKKIVVGVFEVDKVSALQAQMTSLANAFMKFSGTG

Query:  SAQSIESA-AALASRPHEETIE------------------------------------------------------------------------TTKLEE
        S  + + A   L S PH  T+E                                                                         +++EE
Subjt:  SAQSIESA-AALASRPHEETIE------------------------------------------------------------------------TTKLEE

Query:  VVIA----INTTVNGHSAAIKNIETQLGQLVNVVSTMNKGKAPAE-QEKPQ------MEYCKEITVHQEESKEEPESEDYETPTGEAEEDTSSDEAEKPN
        ++++     +T +    A I+N+E Q+ Q+   +S    G  P+  +E P+      +   KE+ +   +++ + ES + +    + EE        KP 
Subjt:  VVIA----INTTVNGHSAAIKNIETQLGQLVNVVSTMNKGKAPAE-QEKPQ------MEYCKEITVHQEESKEEPESEDYETPTGEAEEDTSSDEAEKPN

Query:  LEPPIPSPTLLVPKERKKKKKKKNNQVQFDKFMNAFMNLNINIPFAEAL-EMPQYNRFMKEWLAKKRKEKKVDTVYLASTCSIRVQQ---KVPEKVADPG
        + PP+P P  L         K++    QF KF++ F  L INIPFAEAL +MP Y RFMK+ L KKRK    + V L   CS+ +Q+    +P K  D G
Subjt:  LEPPIPSPTLLVPKERKKKKKKKNNQVQFDKFMNAFMNLNINIPFAEAL-EMPQYNRFMKEWLAKKRKEKKVDTVYLASTCSIRVQQ---KVPEKVADPG

Query:  SFSVPCSCGTYSFK-ALCDLGASINTIPLSLCKKLDIGEIKSTLVKLQLADQSVVRPVGIVENVLIRVRKFFLPIDLYGMNMIENPSMPVILGRPFLATG
        SF+VPC+ G + F+  L D GASIN +PLS+ +KL +GE K T + LQLAD+S+  P GIVENVL++V KF  P+D   + M E+  +P+ILGRPFLATG
Subjt:  SFSVPCSCGTYSFK-ALCDLGASINTIPLSLCKKLDIGEIKSTLVKLQLADQSVVRPVGIVENVLIRVRKFFLPIDLYGMNMIENPSMPVILGRPFLATG

Query:  RVIIDIERRELTIRVKNEKEIFKAVEDSK
        + +ID+E+ +LT+RV NE+  F   +  K
Subjt:  RVIIDIERRELTIRVKNEKEIFKAVEDSK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCGTAGTAGTAGAGAAGTAGCACCATTTGATCCTGAGATAGAAAGAACAATTCATAGGCTTCGAAGGGAAAATAGAGAAATTATTCAAATGGCAGACCAAAATCCACC
TGAGGAGCCTAGGCCTATTAGAGATTATTTTCAGCCTGTGTTTCAGGGGCAACAGTCGGGGATTGTCTATGCACCGATTAATGCCAACAACTTTGAGCTGAAAACCGGTC
TTATTCAGATGGCTCGAGACTGTGCATACAGAGGATTGCCCACTGAGGATCCAAATTCTCATCTTAAATCATTCTTGGACATTTGTGGGACGGATAAAGCACGAGATTGG
TTGCAGTCTATTACCCCTAGGAGCATCACCACCTGGGATGCTTTGGTCCAGGCCTTTTTGAAGAAATCCTTTCCTCCTACGAAGACGGTCAAGCTGAGGACCGAGATTGG
GACATTCCAGCAACAGTATGATGAGCAGCTGTTTGAAGCCTGGGAGCGATTCAAAGAGTTGTTGAGGAAGTGTCCTCAGCATGGTTACCCAGATTGGCTTCAGGTACAGT
TGTTTTATAATGGTTTAACTCCTAGTACAAAAACGATTGTTGATGCAGCTGCAGGTGGGACTCTGTTGTCCAAGACCGTGGAAAACGCTCGCACACTTCTAGAGGATATG
GCCACCAACAGCTATCAGTGGCCATCTGAGTGGTGTACACCTAAAAAGATTGTTGTTGGAGTGTTTGAGGTTGATAAAGTAAGTGCACTCCAGGCCCAGATGACTTCCCT
CGCTAATGCTTTTATGAAATTTTCAGGTACAGGGAGTGCACAGTCAATTGAATCAGCTGCTGCTTTAGCATCTAGACCTCATGAGGAGACCATCGAGACAACCAAGTTAG
AGGAGGTAGTCATTGCCATCAACACCACAGTGAATGGCCACAGTGCTGCCATCAAGAACATTGAGACTCAGCTGGGACAGTTGGTGAATGTTGTAAGCACCATGAATAAA
GGTAAGGCCCCAGCTGAACAAGAGAAACCCCAGATGGAGTATTGTAAGGAAATCACTGTGCACCAGGAGGAATCTAAAGAGGAACCTGAATCTGAGGACTATGAAACGCC
TACAGGGGAAGCTGAGGAGGACACATCATCTGATGAGGCTGAAAAGCCTAACCTTGAGCCTCCTATTCCTTCTCCCACACTGTTGGTTCCCAAGGAAAGGAAAAAGAAAA
AGAAGAAAAAGAACAATCAGGTTCAGTTTGATAAATTTATGAATGCTTTTATGAATCTGAATATTAATATTCCTTTTGCAGAGGCATTAGAGATGCCACAATACAACAGG
TTCATGAAGGAGTGGTTAGCAAAGAAGCGAAAAGAAAAGAAGGTTGACACTGTTTATCTTGCTTCCACATGCAGCATCAGAGTACAACAGAAAGTACCTGAAAAAGTAGC
AGATCCAGGGAGTTTTTCTGTTCCTTGTAGTTGTGGTACTTATTCTTTCAAAGCATTATGTGATTTAGGTGCTAGCATTAATACCATTCCTCTATCGTTGTGTAAAAAGT
TAGATATAGGTGAGATTAAATCTACTCTTGTAAAGCTCCAATTGGCTGATCAATCTGTGGTGAGACCAGTTGGTATTGTAGAAAATGTTTTAATCAGAGTACGTAAATTT
TTCCTCCCTATTGACTTGTATGGTATGAACATGATAGAAAATCCTTCGATGCCTGTTATATTAGGAAGACCATTCCTCGCTACTGGGCGAGTGATTATTGATATTGAGCG
CAGGGAGCTCACTATTAGAGTCAAGAACGAAAAAGAAATCTTTAAAGCAGTTGAAGACTCTAAAGATGAAGTGCTTTTCATGGGCTACAGGAAAGGTGCAAGAAAGAGCA
CCTCTGTTGGATTCACAAAACAAAAGCCTCCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGCGTAGTAGTAGAGAAGTAGCACCATTTGATCCTGAGATAGAAAGAACAATTCATAGGCTTCGAAGGGAAAATAGAGAAATTATTCAAATGGCAGACCAAAATCCACC
TGAGGAGCCTAGGCCTATTAGAGATTATTTTCAGCCTGTGTTTCAGGGGCAACAGTCGGGGATTGTCTATGCACCGATTAATGCCAACAACTTTGAGCTGAAAACCGGTC
TTATTCAGATGGCTCGAGACTGTGCATACAGAGGATTGCCCACTGAGGATCCAAATTCTCATCTTAAATCATTCTTGGACATTTGTGGGACGGATAAAGCACGAGATTGG
TTGCAGTCTATTACCCCTAGGAGCATCACCACCTGGGATGCTTTGGTCCAGGCCTTTTTGAAGAAATCCTTTCCTCCTACGAAGACGGTCAAGCTGAGGACCGAGATTGG
GACATTCCAGCAACAGTATGATGAGCAGCTGTTTGAAGCCTGGGAGCGATTCAAAGAGTTGTTGAGGAAGTGTCCTCAGCATGGTTACCCAGATTGGCTTCAGGTACAGT
TGTTTTATAATGGTTTAACTCCTAGTACAAAAACGATTGTTGATGCAGCTGCAGGTGGGACTCTGTTGTCCAAGACCGTGGAAAACGCTCGCACACTTCTAGAGGATATG
GCCACCAACAGCTATCAGTGGCCATCTGAGTGGTGTACACCTAAAAAGATTGTTGTTGGAGTGTTTGAGGTTGATAAAGTAAGTGCACTCCAGGCCCAGATGACTTCCCT
CGCTAATGCTTTTATGAAATTTTCAGGTACAGGGAGTGCACAGTCAATTGAATCAGCTGCTGCTTTAGCATCTAGACCTCATGAGGAGACCATCGAGACAACCAAGTTAG
AGGAGGTAGTCATTGCCATCAACACCACAGTGAATGGCCACAGTGCTGCCATCAAGAACATTGAGACTCAGCTGGGACAGTTGGTGAATGTTGTAAGCACCATGAATAAA
GGTAAGGCCCCAGCTGAACAAGAGAAACCCCAGATGGAGTATTGTAAGGAAATCACTGTGCACCAGGAGGAATCTAAAGAGGAACCTGAATCTGAGGACTATGAAACGCC
TACAGGGGAAGCTGAGGAGGACACATCATCTGATGAGGCTGAAAAGCCTAACCTTGAGCCTCCTATTCCTTCTCCCACACTGTTGGTTCCCAAGGAAAGGAAAAAGAAAA
AGAAGAAAAAGAACAATCAGGTTCAGTTTGATAAATTTATGAATGCTTTTATGAATCTGAATATTAATATTCCTTTTGCAGAGGCATTAGAGATGCCACAATACAACAGG
TTCATGAAGGAGTGGTTAGCAAAGAAGCGAAAAGAAAAGAAGGTTGACACTGTTTATCTTGCTTCCACATGCAGCATCAGAGTACAACAGAAAGTACCTGAAAAAGTAGC
AGATCCAGGGAGTTTTTCTGTTCCTTGTAGTTGTGGTACTTATTCTTTCAAAGCATTATGTGATTTAGGTGCTAGCATTAATACCATTCCTCTATCGTTGTGTAAAAAGT
TAGATATAGGTGAGATTAAATCTACTCTTGTAAAGCTCCAATTGGCTGATCAATCTGTGGTGAGACCAGTTGGTATTGTAGAAAATGTTTTAATCAGAGTACGTAAATTT
TTCCTCCCTATTGACTTGTATGGTATGAACATGATAGAAAATCCTTCGATGCCTGTTATATTAGGAAGACCATTCCTCGCTACTGGGCGAGTGATTATTGATATTGAGCG
CAGGGAGCTCACTATTAGAGTCAAGAACGAAAAAGAAATCTTTAAAGCAGTTGAAGACTCTAAAGATGAAGTGCTTTTCATGGGCTACAGGAAAGGTGCAAGAAAGAGCA
CCTCTGTTGGATTCACAAAACAAAAGCCTCCTTGA
Protein sequenceShow/hide protein sequence
MRSSREVAPFDPEIERTIHRLRRENREIIQMADQNPPEEPRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGLPTEDPNSHLKSFLDICGTDKARDW
LQSITPRSITTWDALVQAFLKKSFPPTKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKTVENARTLLEDM
ATNSYQWPSEWCTPKKIVVGVFEVDKVSALQAQMTSLANAFMKFSGTGSAQSIESAAALASRPHEETIETTKLEEVVIAINTTVNGHSAAIKNIETQLGQLVNVVSTMNK
GKAPAEQEKPQMEYCKEITVHQEESKEEPESEDYETPTGEAEEDTSSDEAEKPNLEPPIPSPTLLVPKERKKKKKKKNNQVQFDKFMNAFMNLNINIPFAEALEMPQYNR
FMKEWLAKKRKEKKVDTVYLASTCSIRVQQKVPEKVADPGSFSVPCSCGTYSFKALCDLGASINTIPLSLCKKLDIGEIKSTLVKLQLADQSVVRPVGIVENVLIRVRKF
FLPIDLYGMNMIENPSMPVILGRPFLATGRVIIDIERRELTIRVKNEKEIFKAVEDSKDEVLFMGYRKGARKSTSVGFTKQKPP