; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0028059 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0028059
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase
Genome locationchr8:12031423..12038737
RNA-Seq ExpressionLag0028059
SyntenyLag0028059
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain
IPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6734747.1 hypothetical protein I3842_01G285500 [Carya illinoinensis]1.1e-14243.17Show/hide
Query:  MADQNLPEEPRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTEDPNSHLKSFW------------------------IFDKARD
        MA+++    PR ++DY +PV  G  S I+  PINANNFELK  LI M +   + GSP +DPN HL  F                         + DKAR 
Subjt:  MADQNLPEEPRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTEDPNSHLKSFW------------------------IFDKARD

Query:  WLQSITPGSITTWDALVQAFLKKFFPPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKT
        WLQS+ PGSI +W  + + FL KFFPPAKT +LR+EIG F+Q   E L+EAWER+K+L+R+CPQHG PDWLQVQ+FYNGL   T+TIVDAA+GGTL+SKT
Subjt:  WLQSITPGSITTWDALVQAFLKKFFPPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKT

Query:  VENARTLLEDMATNSYQWPSERSTPKKIAAGVFEVDKVSALQAQMTSLANAFMKFSGTGSAQSIE--SAAALASRPQEETIEQVQYVSNFNSRGYNNSST
         E A  LLE+MA+N+YQWP+ER+  KK+ AG+ E++ ++AL AQ+ +L++     +     QS E  ++ ++     E + EQVQYV+N N   Y  +  
Subjt:  VENARTLLEDMATNSYQWPSERSTPKKIAAGVFEVDKVSALQAQMTSLANAFMKFSGTGSAQSIE--SAAALASRPQEETIEQVQYVSNFNSRGYNNSST

Query:  PTHYHPNNRNHENFSYANTKNAL---NPPGFAPQTQDNK-KLEDLVGAFIAESSNRTTKLEEAVITINTTVNGHSAAIKNIETQLGQLVNV---------
        P +YHP  RNHEN SY NTKN L   +PPGF  Q  + K  LED + +F+ E++ R  K +  +  I T  +   A +KN+E Q+GQL            
Subjt:  PTHYHPNNRNHENFSYANTKNAL---NPPGFAPQTQDNK-KLEDLVGAFIAESSNRTTKLEEAVITINTTVNGHSAAIKNIETQLGQLVNV---------

Query:  --------------------EESEEEPESEDYETPT----GEAEEDTSSDEAEQPNLE-------------PPIPSPTLMVPKEKKKKKKKKNNQVQFDK
                            +E E  P  E   TPT    G++++    +E     LE             PPI +P L  P+  +K+K  K    QF K
Subjt:  --------------------EESEEEPESEDYETPT----GEAEEDTSSDEAEQPNLE-------------PPIPSPTLMVPKEKKKKKKKKNNQVQFDK

Query:  FMNAFMNLNINIPFAEALE-MPQYNMFMKEWLAKKRKEKKVDTVYLASTCSTRVQQKVPEKVADPGSFSVPCSFGTYSF-RALCDLGASINIIPLSLCKK
        F++ F  ++INIPFA+ALE MP Y  F+K+ ++KKR+ ++ +TV L+  CS  +Q+K+P+K+ DP SF++PC+ G   F R LCDLGASIN++P  +C+K
Subjt:  FMNAFMNLNINIPFAEALE-MPQYNMFMKEWLAKKRKEKKVDTVYLASTCSTRVQQKVPEKVADPGSFSVPCSFGTYSF-RALCDLGASINIIPLSLCKK

Query:  LDIGEIKSTPVKLQLADQSVVRPVGIVENVLIRVGRFFLPIDLYVMDMIENPSMPVILGRPFLATGRAIIDIERRELTIRVKNEKEIFKAVEDSK
        L +GE+K T + LQLAD+S+  P GI+E+VL++V +F  P D  V+DM E+  +P+ILGRPFLATGRA+ID+++ ELT+RV  E+ +F   +  K
Subjt:  LDIGEIKSTPVKLQLADQSVVRPVGIVENVLIRVGRFFLPIDLYVMDMIENPSMPVILGRPFLATGRAIIDIERRELTIRVKNEKEIFKAVEDSK

KAG7947748.1 hypothetical protein I3843_14G109500 [Carya illinoinensis]2.5e-14443.45Show/hide
Query:  MADQNLPEEPRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTEDPNSHLKSFW------------------------IFDKARD
        MA+++    PR ++DY +PV  G  S I+  PINANNFELK  LI M +   + GSP +DPN HL  F                         + DKAR 
Subjt:  MADQNLPEEPRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTEDPNSHLKSFW------------------------IFDKARD

Query:  WLQSITPGSITTWDALVQAFLKKFFPPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKT
        WLQS+ PGSI +W  + + FL KFFPPAKT +LR+EIG F+Q   E L+EAWER+K+L+R+CPQHG PDWLQVQ+FYNGL   T+TIVDAA+GGTL+SKT
Subjt:  WLQSITPGSITTWDALVQAFLKKFFPPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKT

Query:  VENARTLLEDMATNSYQWPSERSTPKKIAAGVFEVDKVSALQAQMTSLANAFMKFSGTGSAQSIE--SAAALASRPQEETIEQVQYVSNFNSRGYNNSST
         E A  LLE+MA+N+YQWP+ER+  KK+ AG+ E++ ++AL AQ+ +L++     +     QS E  ++ ++     E + EQVQYV+N N   Y  +  
Subjt:  VENARTLLEDMATNSYQWPSERSTPKKIAAGVFEVDKVSALQAQMTSLANAFMKFSGTGSAQSIE--SAAALASRPQEETIEQVQYVSNFNSRGYNNSST

Query:  PTHYHPNNRNHENFSYANTKNAL---NPPGFAPQTQDNK-KLEDLVGAFIAESSNRTTKLEEAVITINTTVNGHSAAIKNIETQLGQLVNV---------
        P +YHP  RNHEN SY NTKN L   +PPGF  Q  + K  LED + +F+ E++ R  K +  +  I T  +   A +KN+E Q+GQL            
Subjt:  PTHYHPNNRNHENFSYANTKNAL---NPPGFAPQTQDNK-KLEDLVGAFIAESSNRTTKLEEAVITINTTVNGHSAAIKNIETQLGQLVNV---------

Query:  --------------------EESEEEPESEDYETPT----GEAEEDTSSDEAEQPNLE-------------PPIPSPTLMVPKEKKKKKKKKNNQVQFDK
                            +E E  P  E   TPT    G++++    +E     LE             PPI +P L  P+  +K+K  K    QF K
Subjt:  --------------------EESEEEPESEDYETPT----GEAEEDTSSDEAEQPNLE-------------PPIPSPTLMVPKEKKKKKKKKNNQVQFDK

Query:  FMNAFMNLNINIPFAEALE-MPQYNMFMKEWLAKKRKEKKVDTVYLASTCSTRVQQKVPEKVADPGSFSVPCSFGTYSF-RALCDLGASINIIPLSLCKK
        F++ F  ++INIPFA+ALE MP Y  F+K+ ++KKR+ ++ +TV L+  CS  +Q+K+P+K+ DPGSF++PC+ G   F R LCDLGASIN++P S+C+K
Subjt:  FMNAFMNLNINIPFAEALE-MPQYNMFMKEWLAKKRKEKKVDTVYLASTCSTRVQQKVPEKVADPGSFSVPCSFGTYSF-RALCDLGASINIIPLSLCKK

Query:  LDIGEIKSTPVKLQLADQSVVRPVGIVENVLIRVGRFFLPIDLYVMDMIENPSMPVILGRPFLATGRAIIDIERRELTIRVKNEKEIFKAVEDSK
        L +GE+K T + LQLAD+S+  P GI+E+VL++V +F  P D  V+DM E+  +P+ILGRPFLATGRA+ID+++ ELT+RV  E+ +F   +  K
Subjt:  LDIGEIKSTPVKLQLADQSVVRPVGIVENVLIRVGRFFLPIDLYVMDMIENPSMPVILGRPFLATGRAIIDIERRELTIRVKNEKEIFKAVEDSK

KAG7990634.1 hypothetical protein I3843_02G035100 [Carya illinoinensis]2.5e-15244.55Show/hide
Query:  SREVIPFDPEIERTILRLRRENREIIQMADQNLPEEPRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTEDPNSHLKSFW----
        SR++IP DPEIERT+  LRR   +I+ MA+++    PR ++DY +PV  G  S I+  PINANNFELK  LI M +   + GSP +DPN HL  F     
Subjt:  SREVIPFDPEIERTILRLRRENREIIQMADQNLPEEPRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTEDPNSHLKSFW----

Query:  --------------------IFDKARDWLQSITPGSITTWDALVQAFLKKFFPPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQV
                            + DKAR WLQS+ PGSI +W  + + FL KFFPPAKT +LR+EIG F+Q   E L+EAWER+K+L+R+CPQHG PDWLQV
Subjt:  --------------------IFDKARDWLQSITPGSITTWDALVQAFLKKFFPPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQV

Query:  QLFYNGLTPSTKTIVDAAAGGTLLSKTVENARTLLEDMATNSYQWPSERSTPKKIAAGVFEVDKVSALQAQMTSLANAFMKFSGTGSAQSIE--SAAALA
        Q+FYNGL   T+TIVDAA+GGTL+SKT E A  LLE+MA+N+YQWP+ER+  KK+ AG+ +++ ++AL AQ+ +L++     +     QS E  ++ ++ 
Subjt:  QLFYNGLTPSTKTIVDAAAGGTLLSKTVENARTLLEDMATNSYQWPSERSTPKKIAAGVFEVDKVSALQAQMTSLANAFMKFSGTGSAQSIE--SAAALA

Query:  SRPQEETIEQVQYVSNFNSRGYNNSSTPTHYHPNNRNHENFSYANTKNAL---NPPGFAPQTQDNK-KLEDLVGAFIAESSNRTTKLEEAVITINTTVNG
            E + EQVQYV+N N   Y  +  P +YHP  RNHEN SY NTKN L   +PPGF  Q  + K  LED + +F+ E++ R  K +  +  I T  + 
Subjt:  SRPQEETIEQVQYVSNFNSRGYNNSSTPTHYHPNNRNHENFSYANTKNAL---NPPGFAPQTQDNK-KLEDLVGAFIAESSNRTTKLEEAVITINTTVNG

Query:  HSAAIKNIETQLGQLVNV-----------------------------EESEEEPESEDYETPT----GEAEEDTSSDEAEQPNLE-------------PP
          AAIKNIE Q+GQL                                +E E  P  E   TPT    G+++     DE     LE             PP
Subjt:  HSAAIKNIETQLGQLVNV-----------------------------EESEEEPESEDYETPT----GEAEEDTSSDEAEQPNLE-------------PP

Query:  IPSPTLMVPKEKKKKKKKKNNQVQFDKFMNAFMNLNINIPFAEALE-MPQYNMFMKEWLAKKRKEKKVDTVYLASTCSTRVQQKVPEKVADPGSFSVPCS
        I +P L  P+  +K+K  K    QF KF++ F  ++INIPFA+ALE MP Y  F+K+ ++KKR+ ++ +TV L+  CS  +Q+K+P+K+ DPGSF++PC+
Subjt:  IPSPTLMVPKEKKKKKKKKNNQVQFDKFMNAFMNLNINIPFAEALE-MPQYNMFMKEWLAKKRKEKKVDTVYLASTCSTRVQQKVPEKVADPGSFSVPCS

Query:  FGTYSF-RALCDLGASINIIPLSLCKKLDIGEIKSTPVKLQLADQSVVRPVGIVENVLIRVGRFFLPIDLYVMDMIENPSMPVILGRPFLATGRAIIDIE
         G   F + LCDLGASIN++PLS+C+KL + E+K T + LQLAD+S+  P GI+E+VL++V +F  P D  V+DM E+  +P+ILGRPFLATGRA+ID++
Subjt:  FGTYSF-RALCDLGASINIIPLSLCKKLDIGEIKSTPVKLQLADQSVVRPVGIVENVLIRVGRFFLPIDLYVMDMIENPSMPVILGRPFLATGRAIIDIE

Query:  RRELTIRVKNEKEIFK
        + ELT+RV  E+ +FK
Subjt:  RRELTIRVKNEKEIFK

XP_022843226.1 uncharacterized protein LOC111366761 [Olea europaea var. sylvestris]8.6e-13743.93Show/hide
Query:  MRSSR--EVIPFDPEIERT--ILR-LRRENREIIQMAD---QNLPEEPRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTEDPN
        MR +R  +++  DPE ERT  ILR ++R  RE +   D    N   + R IRDY +PV     SGI    I A NFELK GLI M +   + G+  EDPN
Subjt:  MRSSR--EVIPFDPEIERT--ILR-LRRENREIIQMAD---QNLPEEPRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTEDPN

Query:  SHLKSFW------------------------IFDKARDWLQSITPGSITTWDALVQAFLKKFFPPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKC
        +HL SF                         + DKA+ W QS+  GSITTWD L Q FL K+FPP+K+ +LR EI  F+Q   E  +EAWERFK+LLR+C
Subjt:  SHLKSFW------------------------IFDKARDWLQSITPGSITTWDALVQAFLKKFFPPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKC

Query:  PQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKTVENARTLLEDMATNSYQWPSERSTPKKIAAGVFEVDKVSALQAQMTSLANAFMKFSGTGSAQ
        PQHG+  W+Q+++FYNGL   T+T+VDAAAGG L++KT E A  LL+D+ATNSYQWPSERS  KK+ AG+ EVD ++AL AQ+ SL N  +  +  G+ Q
Subjt:  PQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKTVENARTLLEDMATNSYQWPSERSTPKKIAAGVFEVDKVSALQAQMTSLANAFMKFSGTGSAQ

Query:  SIESAAALASRPQEETI--EQVQYVS--NFNSRGYNNSSTPTHYHPNNRNHENFSYANTKNALN-PPGFAPQTQDNK-KLEDLVGAFIAESSNRTTKLEE
        +++S  + +S  QE  +  EQVQY+   N+N RG   ++   HYHP  RNHEN SY N +N L  PPGF  Q  D K  LED++G FI+E+ +R  K E 
Subjt:  SIESAAALASRPQEETI--EQVQYVS--NFNSRGYNNSSTPTHYHPNNRNHENFSYANTKNALN-PPGFAPQTQDNK-KLEDLVGAFIAESSNRTTKLEE

Query:  AVITINTTVNGHSAAIKNIETQLGQLVNVEES------------------------------EEEPESEDYETP----TGEAEEDTSSDEAEQPNL----
         +  I T V+   A +KN+E Q+GQL  + +S                              E +P+     TP    T E + +    EAE   +    
Subjt:  AVITINTTVNGHSAAIKNIETQLGQLVNVEES------------------------------EEEPESEDYETP----TGEAEEDTSSDEAEQPNL----

Query:  ------EPPIPSPTLMVPKEKKKKKKKKNNQVQFDKFMNAFMNLNINIPFAEAL-EMPQYNMFMKEWLAKKRKEKKVDTVYLASTCSTRVQQKVPEKVAD
               PPI  P L  P+   KKK       QF KF+  F  ++INIPFAE L +MP Y  F+KE ++ K+K ++ +T+ L   CS  + QK+P K+ D
Subjt:  ------EPPIPSPTLMVPKEKKKKKKKKNNQVQFDKFMNAFMNLNINIPFAEAL-EMPQYNMFMKEWLAKKRKEKKVDTVYLASTCSTRVQQKVPEKVAD

Query:  PGSFSVPCSFGTYSF-RALCDLGASINIIPLSLCKKLDIGEIKSTPVKLQLADQSVVRPVGIVENVLIRVGRFFLPIDLYVMDMIENPSMPVILGRPFLA
        PGSF++PC+ G  +F RALCD GASIN++PLS+ KKL +GE+K T + LQLAD+S+  P G++E+VL++V +F LP+D  V+DM EN  +P+ILGRPFLA
Subjt:  PGSFSVPCSFGTYSF-RALCDLGASINIIPLSLCKKLDIGEIKSTPVKLQLADQSVVRPVGIVENVLIRVGRFFLPIDLYVMDMIENPSMPVILGRPFLA

Query:  TGRAIIDI
        TGRA+ID+
Subjt:  TGRAIIDI

XP_023874613.1 uncharacterized protein LOC111987139 [Quercus suber]5.8e-14143.11Show/hide
Query:  MADQNLPEEPRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTEDPNSHLKSFW------------------------IFDKARD
        MA+     +PR ++DY +P+     SGI    INANNFELK  LI M +   + GSP +DPN HL  F                         + DKAR 
Subjt:  MADQNLPEEPRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTEDPNSHLKSFW------------------------IFDKARD

Query:  WLQSITPGSITTWDALVQAFLKKFFPPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKT
        WLQS+ PGSIT+W  + + FL KFFPPAKT +LR+EIG F+Q   E L+EAWER+K+L+R CPQHG PDWLQVQ+FYNGL   T+TIVDAA+GGTL+SKT
Subjt:  WLQSITPGSITTWDALVQAFLKKFFPPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKT

Query:  VENARTLLEDMATNSYQWPSERSTPKKIAAGVFEVDKVSALQAQMTSLANAFMKFSGTGSAQSIE--SAAALASRPQEETIEQVQYVSNFNSRGYNNSST
         E A +LLE+MA+N+YQWP+ER+  KK+ AG+ E++  +AL AQ+ SL++     +     Q  E  +A+++     E + EQVQY++N N   Y  +  
Subjt:  VENARTLLEDMATNSYQWPSERSTPKKIAAGVFEVDKVSALQAQMTSLANAFMKFSGTGSAQSIE--SAAALASRPQEETIEQVQYVSNFNSRGYNNSST

Query:  PTHYHPNNRNHENFSYANTKNALN-PPGFAPQTQDNK-KLEDLVGAFIAESSNRTTKLEEAVITINTTVNGHSAAIKNIETQLGQLVNV-----------
        P +YHP  RNHENFSY NTKN L  PPGF  Q  + K  LED + +F+ E+     K +  +  I T  +   A +KN+E Q+GQL              
Subjt:  PTHYHPNNRNHENFSYANTKNALN-PPGFAPQTQDNK-KLEDLVGAFIAESSNRTTKLEEAVITINTTVNGHSAAIKNIETQLGQLVNV-----------

Query:  ------------------EESEEEPESEDYETPTG-------------EAEEDTSSDEAEQPNLEPPIPSPTLMVPKEKKKKKKKKNNQVQFDKFMNAFM
                           E E  P  E   TPT              E  EDT  +    P++  P   P L  P    ++ +K+    QF KF++ F 
Subjt:  ------------------EESEEEPESEDYETPTG-------------EAEEDTSSDEAEQPNLEPPIPSPTLMVPKEKKKKKKKKNNQVQFDKFMNAFM

Query:  NLNINIPFAEALE-MPQYNMFMKEWLAKKRKEKKVDTVYLASTCSTRVQQKVPEKVADPGSFSVPCSFGTYSF-RALCDLGASINIIPLSLCKKLDIGEI
         ++INIPFA+ALE MP Y  F+K+ ++KKR+ ++ +TV L+  CS  +Q+K+P+K+ DPGSF++PC+ G   F + LCDLGASIN++PLS+ +KL +GE+
Subjt:  NLNINIPFAEALE-MPQYNMFMKEWLAKKRKEKKVDTVYLASTCSTRVQQKVPEKVADPGSFSVPCSFGTYSF-RALCDLGASINIIPLSLCKKLDIGEI

Query:  KSTPVKLQLADQSVVRPVGIVENVLIRVGRFFLPIDLYVMDMIENPSMPVILGRPFLATGRAIIDIERRELTIRVKNEKEIFKAVEDSK
        K T + LQLAD+S+  P GI+E+VL++V +F  P D  V+DM E+  +P+ILGRPFLATGRA++D+++ ELT+RV  E+  F   E  K
Subjt:  KSTPVKLQLADQSVVRPVGIVENVLIRVGRFFLPIDLYVMDMIENPSMPVILGRPFLATGRAIIDIERRELTIRVKNEKEIFKAVEDSK

TrEMBL top hitse value%identityAlignment
A0A1U7Z951 uncharacterized protein LOC1045905681.2e-9936.3Show/hide
Query:  MRSSREVIPFDPEIERTILRLRRENREIIQMADQNLPEEPRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCA-YRGSPTEDPNSHLKSFW
        ++S +E I +DPEIERT+ R  R +++    A+     E R + DY +P   G  S IV   I ANNF++K  +IQM ++   + G   EDPN+H+ +F 
Subjt:  MRSSREVIPFDPEIERTILRLRRENREIIQMADQNLPEEPRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCA-YRGSPTEDPNSHLKSFW

Query:  ------------------------IFDKARDWLQSITPGSITTWDALVQAFLKKFFPPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPD
                                + DK + WL S+   SI+TWD +   FL K+FPP+K  K+R +I TF QQ  E L+E+WER+KELLRK P HG P 
Subjt:  ------------------------IFDKARDWLQSITPGSITTWDALVQAFLKKFFPPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPD

Query:  WLQVQLFYNGLTPSTKTIVDAAAGGTLLSKTVENARTLLEDMATNSYQWPSERSTPKKIAAGVFEVDKVSALQAQMTSLANAF--MKFSGTGSAQSIE--
        WLQVQ FYN LT + KTI+DA AGG++ +KT E A  L+E+M  N+YQW SER+  ++    +  VD  + L AQ+ +L+     M+ +GT   Q++   
Subjt:  WLQVQLFYNGLTPSTKTIVDAAAGGTLLSKTVENARTLLEDMATNSYQWPSERSTPKKIAAGVFEVDKVSALQAQMTSLANAF--MKFSGTGSAQSIE--

Query:  -------SAAALASRPQEETIEQVQYVSNFNSRGYNNSSTPTHYHPNNRNHENFSYANTKNALNP-PGFAPQTQDNKKLEDLVGAFIAESSNRTTKLEEA
               S          ++ +QV +V N   +    +  P  ++P  RNH N S+ N +NA+ P P    Q ++   LE+L+  FIA +  R    E  
Subjt:  -------SAAALASRPQEETIEQVQYVSNFNSRGYNNSSTPTHYHPNNRNHENFSYANTKNALNP-PGFAPQTQDNKKLEDLVGAFIAESSNRTTKLEEA

Query:  VITINTTVNGHSAAIKNIETQLGQLVNVEESEEEPESEDYETPTGEAEEDTSSDEAEQPNLEPPIPSPTLMVPKEKKKKKKKKNNQVQFDKFMNAFMNLN
                    A+IKN+ETQ+GQL  +  S  +          G    +T ++  EQ      + + TL   KE ++ +KK   +              
Subjt:  VITINTTVNGHSAAIKNIETQLGQLVNVEESEEEPESEDYETPTGEAEEDTSSDEAEQPNLEPPIPSPTLMVPKEKKKKKKKKNNQVQFDKFMNAFMNLN

Query:  INIPFAEAL-EMPQYNMFMKEWLAKKRKEKKVDTVYLASTCSTRVQQKVPEKVADPGSFSVPCSFGTYSF-RALCDLGASINIIPLSLCKKLDIGEIKST
              +AL +MP Y  F+KE LA K K   V TV +   CS  +  K+P+K+ DPGSF++PC+ G+    +ALCDLGA+IN++  S+ KKL +GE + T
Subjt:  INIPFAEAL-EMPQYNMFMKEWLAKKRKEKKVDTVYLASTCSTRVQQKVPEKVADPGSFSVPCSFGTYSF-RALCDLGASINIIPLSLCKKLDIGEIKST

Query:  PVKLQLADQSVVRPVGIVENVLIRVGRFFLPIDLYVMDMIENPSMPVILGRPFLATGRAIIDIERRELTIRVKNEKEIFKAVE-----DSKDEVLYM
         V LQL D+S+  P GI+E+VL++V +F  P+D  V+DM E+  +P+ILGRPFLATG+A +D+++ +L++++++E+ IFK  +      S D+  YM
Subjt:  PVKLQLADQSVVRPVGIVENVLIRVGRFFLPIDLYVMDMIENPSMPVILGRPFLATGRAIIDIERRELTIRVKNEKEIFKAVE-----DSKDEVLYM

A0A6J0ZX64 LOW QUALITY PROTEIN: uncharacterized protein LOC1104129451.8e-12439.6Show/hide
Query:  RSSREVIPFDPEIERTILRLRRENREII----QMADQN----------LPEEPRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCA-YRGS
        R++  ++PFDP+IERT  R RREN ++      MA+ N          +PE  R +RDY  P+ QG    I    INANNFE+K   IQM +    + G 
Subjt:  RSSREVIPFDPEIERTILRLRRENREII----QMADQN----------LPEEPRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCA-YRGS

Query:  PTEDPNSHLKSFW------------------------IFDKARDWLQSITPGSITTWDALVQAFLKKFFPPAKTVKLRTEIGTFQQQYDEQLFEAWERFK
        P++DPNSHL +F                         + DKA+ WL S+  GSITTW+ L Q FL KFFPPAKT K+R +I +F Q   E L+EAWERFK
Subjt:  PTEDPNSHLKSFW------------------------IFDKARDWLQSITPGSITTWDALVQAFLKKFFPPAKTVKLRTEIGTFQQQYDEQLFEAWERFK

Query:  ELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKTVENARTLLEDMATNSYQWPSERSTPKKIAAGVFEVDKVSALQAQMTSLANAFMKFS
        ELLR+CP HG PDWLQVQ FYNGL  S KTI+DAAAGG L+SK   +A  LLE+MA+N+YQWPSERS  +K A G +E+D +  L  Q+ +L+    K  
Subjt:  ELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKTVENARTLLEDMATNSYQWPSERSTPKKIAAGVFEVDKVSALQAQMTSLANAFMKFS

Query:  GTGSAQSIESAAALASRPQEE--------TIEQVQYVSNFNSRGYNNSSTPTHYHPNNRNHENFSYANTKNALN-----PPGF----APQTQDNK-KLED
         T    +++++  +     +           E VQ+V NFN R  NN  + T Y+P  RNH NFS++N     N     PPGF     PQ  + K +LE+
Subjt:  GTGSAQSIESAAALASRPQEE--------TIEQVQYVSNFNSRGYNNSSTPTHYHPNNRNHENFSYANTKNALN-----PPGF----APQTQDNK-KLED

Query:  LVGAFIAESSNRTTKLEEAVITINTTVNGHSAAIKNIETQLGQLVN------------------------------------------VEESEEEPESE-
        L+  +I+++              +  +    A+++N+ETQ+GQL N                                          VE   E  + E 
Subjt:  LVGAFIAESSNRTTKLEEAVITINTTVNGHSAAIKNIETQLGQLVN------------------------------------------VEESEEEPESE-

Query:  --DYETPTGEAEEDTSSDEAEQPNLEPPIPSPTLMVPKEKKKKKKKKNNQVQFDKFMNAFMNLNINIPFAEALE-MPQYNMFMKEWLAKKRKEKKVDTVY
          + E    + ++D + ++     + PP P P         ++ +K+  + QF KF+N F  L+INIPFAEALE MP Y  F+K+ L+KKRK  + +TV+
Subjt:  --DYETPTGEAEEDTSSDEAEQPNLEPPIPSPTLMVPKEKKKKKKKKNNQVQFDKFMNAFMNLNINIPFAEALE-MPQYNMFMKEWLAKKRKEKKVDTVY

Query:  LASTCSTRVQQKVPEKVADPGSFSVPCSFGTYSF-RALCDLGASINIIPLSLCKKLDIGEIKSTPVKLQLADQSVVRPVGIVENVLIRVGRFFLPIDLYV
        L   CS  +Q K+P K+ DPGSF++PC+ G   F +AL DLGASIN++P S+ +KL +GE K T V LQLAD+S V P GI+E+VL++V +F  P+D  +
Subjt:  LASTCSTRVQQKVPEKVADPGSFSVPCSFGTYSF-RALCDLGASINIIPLSLCKKLDIGEIKSTPVKLQLADQSVVRPVGIVENVLIRVGRFFLPIDLYV

Query:  MDMIENPSMPVILGRPFLATGRAIIDIERRELTIRVKNEKEIFKAVEDSK
        +DM E+  +P+ILGRPFLAT  AIID+   +++ +V  E   F     SK
Subjt:  MDMIENPSMPVILGRPFLATGRAIIDIERRELTIRVKNEKEIFKAVEDSK

A0A6P6XAQ1 Reverse transcriptase5.1e-11139.6Show/hide
Query:  RPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTEDPNSHLKSFW------------------------IFDKARDWLQSITPGSI
        R +RD+  P  QG Q+ IV   +NANNFE+K  LIQM +   Y G+ TEDPNSHL +F                         + DKA+ WLQS  P + 
Subjt:  RPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTEDPNSHLKSFW------------------------IFDKARDWLQSITPGSI

Query:  TTWDALVQAFLKKFFPPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKTVENARTLLED
        TTWD L +AFL KFFPP KT KLR +I +F QQ  E L+EAWER++EL R+CP HG PDWL VQ FYNGLT  TKT VDAAAGG L+ KT E A+ L+E+
Subjt:  TTWDALVQAFLKKFFPPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKTVENARTLLED

Query:  MATNSYQWPSERSTPKKIAAGVFEVDKVSALQAQMTSLANAFMKFSGTGSAQSIESAAALASRPQEE-----TIEQVQYVSNFNSRGYNNSSTPTHYHPN
        MA N+YQW +ER   ++  AG+ EVD ++ L A+M ++     +  G+ S Q +  A+        +     + EQVQY++N+N    NN  + T Y+P 
Subjt:  MATNSYQWPSERSTPKKIAAGVFEVDKVSALQAQMTSLANAFMKFSGTGSAQSIESAAALASRPQEE-----TIEQVQYVSNFNSRGYNNSSTPTHYHPN

Query:  NRNHENFSY---ANTKNALNPPGFAPQ--TQDNKKLEDLVGAFIAESSN-RTTKLEEAVITINTTVNGHSAAI----KNIETQLGQLVNVEESEEEPE--
         RNH NF +    N +  +NPPGF  +    ++K   +L    +A +SN +  KL  A       + G    +    +N+E QLGQ+ N   +  + +  
Subjt:  NRNHENFSY---ANTKNALNPPGFAPQ--TQDNKKLEDLVGAFIAESSN-RTTKLEEAVITINTTVNGHSAAI----KNIETQLGQLVNVEESEEEPE--

Query:  SEDYETPTGEAEEDTSSDEAEQPNLEPPIPSPTLMVPKEKKKKKKKKNNQVQFDKFMNAFMNLNINIPFAEAL--EMPQYNMFMKEWLAKKRKEKKVDTV
        S+    P    +  T     E   +EPP+        K + KK  +     + +K         + +  A  +   +P Y  F+KE + KKRK    +T+
Subjt:  SEDYETPTGEAEEDTSSDEAEQPNLEPPIPSPTLMVPKEKKKKKKKKNNQVQFDKFMNAFMNLNINIPFAEAL--EMPQYNMFMKEWLAKKRKEKKVDTV

Query:  YLASTCSTRVQQKVPEKVADPGSFSVPCSFGTYSF-RALCDLGASINIIPLSLCKKLDIGEIKSTPVKLQLADQSVVRPVGIVENVLIRVGRFFLPIDLY
         L   CS  +Q K+P K+ DPGSF+VPC+ G   F +ALCDLGAS+++IPL++ ++L + E+K T + LQLAD+S+  P+GI+ENVLI+V +F +P+D  
Subjt:  YLASTCSTRVQQKVPEKVADPGSFSVPCSFGTYSF-RALCDLGASINIIPLSLCKKLDIGEIKSTPVKLQLADQSVVRPVGIVENVLIRVGRFFLPIDLY

Query:  VMDMIENPSMPVILGRPFLATGRAIIDIERRELTIRVKNEKEIF
        V+DM E+ ++P+ILGRPFLAT   IID++R +   ++  E+  F
Subjt:  VMDMIENPSMPVILGRPFLATGRAIIDIERRELTIRVKNEKEIF

A0A6P8DD93 uncharacterized protein LOC1162064538.8e-11137.8Show/hide
Query:  MRSSR--EVIPFDPEIERTILRLRRENR-----EIIQMADQNLPEE----PRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTE
        MR SR  E++P DPEIERT+ RLRRENR     ++++MAD ++  +     R +RDY  P   G  S I    I ANNFELK  LIQM +   + G P E
Subjt:  MRSSR--EVIPFDPEIERTILRLRRENR-----EIIQMADQNLPEE----PRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTE

Query:  DPNSHLKSFWIF------------------------DKARDWLQSITPGSITTWDALVQAFLKKFFPPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELL
         P+ H+  F  +                        DKAR W  S+   SITTW  L   FL++FFPPA+T +LR EI  F +   E L+EAWERFKE +
Subjt:  DPNSHLKSFWIF------------------------DKARDWLQSITPGSITTWDALVQAFLKKFFPPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELL

Query:  RKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKTVENARTLLEDMATNSYQWPSERSTPKKIAAGVFEVDKVSALQAQMTSLANAFMKFSGTG
        RKCP HG PD L +++FY  L  + +++VDAAAGG L+ K  + A  L+E+MA++++ W +ERS  K   A V ++D ++ L  Q+++L     K +   
Subjt:  RKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKTVENARTLLEDMATNSYQWPSERSTPKKIAAGVFEVDKVSALQAQMTSLANAFMKFSGTG

Query:  SAQSIESA-AALASRPQ------------EETIEQVQYVSNF---NSRGYNNSSTPTHYHPNNRNHENFSYANTKNALN-PPGF--------APQTQDNK
        S  + + A   L S P                 EQV +V+NF   N   Y+N+     Y+P  RNH NFS+ N  NAL  PPGF        AP  Q   
Subjt:  SAQSIESA-AALASRPQ------------EETIEQVQYVSNF---NSRGYNNSSTPTHYHPNNRNHENFSYANTKNALN-PPGF--------APQTQDNK

Query:  KLEDLVGAFIAESSNRTTKLEEAVITINTTVNGHSAAIKNIETQLGQL-------------VNVEES-------------EEEPESEDYETPTGEAEEDT
        ++E+L+ +++ ++              +T +    A I+N+E Q+ Q+              N EE+             E E  +   +T     E+D 
Subjt:  KLEDLVGAFIAESSNRTTKLEEAVITINTTVNGHSAAIKNIETQLGQL-------------VNVEES-------------EEEPESEDYETPTGEAEEDT

Query:  SSDEAEQPNLE--------PPIPSPTLMVPKEKKKKKKKKNNQVQFDKFMNAFMNLNINIPFAEAL-EMPQYNMFMKEWLAKKRKEKKVDTVYLASTCST
           + E+P  +        PP+P P         ++ K++    QF KF++ F  L INIPFAEAL +MP Y  FMK+ L KKRK    + V L   CS 
Subjt:  SSDEAEQPNLE--------PPIPSPTLMVPKEKKKKKKKKNNQVQFDKFMNAFMNLNINIPFAEAL-EMPQYNMFMKEWLAKKRKEKKVDTVYLASTCST

Query:  RVQQ---KVPEKVADPGSFSVPCSFGTYSF-RALCDLGASINIIPLSLCKKLDIGEIKSTPVKLQLADQSVVRPVGIVENVLIRVGRFFLPIDLYVMDMI
         +Q+    +P K  D GSF+VPC+ G + F   L D GASIN++PLS+ +KL +GE K T V LQLAD+S+  P GIVENVL++V +F  P+D  V++M 
Subjt:  RVQQ---KVPEKVADPGSFSVPCSFGTYSF-RALCDLGASINIIPLSLCKKLDIGEIKSTPVKLQLADQSVVRPVGIVENVLIRVGRFFLPIDLYVMDMI

Query:  ENPSMPVILGRPFLATGRAIIDIERRELTIRVKNEKEIFKAVEDSK
        E+  +P+ILGRPFLATG+A+ID+E+ +LT+RV NE+  F   +  K
Subjt:  ENPSMPVILGRPFLATGRAIIDIERRELTIRVKNEKEIFKAVEDSK

A0A6P8DKJ2 uncharacterized protein LOC1162042313.3e-11037.67Show/hide
Query:  MRSSR--EVIPFDPEIERTILRLRRENR-----EIIQMADQNLPEE----PRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTE
        MR SR  E++P DPEIERT+ RLRRENR     ++++MAD ++  +     R +RDY  P   G  S I    I ANNFELK  LIQM +   + G P E
Subjt:  MRSSR--EVIPFDPEIERTILRLRRENR-----EIIQMADQNLPEE----PRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTE

Query:  DPNSHLKSFWIF------------------------DKARDWLQSITPGSITTWDALVQAFLKKFFPPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELL
         P+ H+  F  +                        DKAR W  S+   SITTW  L   FL++FFPPA+T +LR EI  F +   E L+EAWERFKE +
Subjt:  DPNSHLKSFWIF------------------------DKARDWLQSITPGSITTWDALVQAFLKKFFPPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELL

Query:  RKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKTVENARTLLEDMATNSYQWPSERSTPKKIAAGVFEVDKVSALQAQMTSLANAFMKFSGTG
        RKCP HG PD L +++FY  L  + +++VDAAAGG L+ K  + A  L+E+MA++++ W +ERS  K   A V ++D ++ L  Q+++L     K +   
Subjt:  RKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKTVENARTLLEDMATNSYQWPSERSTPKKIAAGVFEVDKVSALQAQMTSLANAFMKFSGTG

Query:  SAQSIESA-AALASRPQ------------EETIEQVQYVSNF---NSRGYNNSSTPTHYHPNNRNHENFSYANTKNALN-PPGF--------APQTQDNK
        S  + + A   L S P                 EQV +V+NF   N   Y+N+     Y+P  RNH NFS+ N  NAL  PPGF        AP  Q   
Subjt:  SAQSIESA-AALASRPQ------------EETIEQVQYVSNF---NSRGYNNSSTPTHYHPNNRNHENFSYANTKNALN-PPGF--------APQTQDNK

Query:  KLEDLVGAFIAESSNRTTKLEEAVITINTTVNGHSAAIKNIETQLGQL-------------VNVEES-------------EEEPESEDYETPTGEAEEDT
        ++E+L+ +++ ++              +T +    A I+N+E Q+ Q+              N EE+             E E  +   +T     E+D 
Subjt:  KLEDLVGAFIAESSNRTTKLEEAVITINTTVNGHSAAIKNIETQLGQL-------------VNVEES-------------EEEPESEDYETPTGEAEEDT

Query:  SSDEAEQPNLE--------PPIPSPTLMVPKEKKKKKKKKNNQVQFDKFMNAFMNLNINIPFAEAL-EMPQYNMFMKEWLAKKRKEKKVDTVYLASTCST
           + E+P  +        PP+P P          + K++    QF KF++ F  L INIPFAEAL +MP Y  FMK+ L KKRK    + V L   CS 
Subjt:  SSDEAEQPNLE--------PPIPSPTLMVPKEKKKKKKKKNNQVQFDKFMNAFMNLNINIPFAEAL-EMPQYNMFMKEWLAKKRKEKKVDTVYLASTCST

Query:  RVQQ---KVPEKVADPGSFSVPCSFGTYSF-RALCDLGASINIIPLSLCKKLDIGEIKSTPVKLQLADQSVVRPVGIVENVLIRVGRFFLPIDLYVMDMI
         +Q+    +P K  D GSF+VPC+ G + F   L D GASIN++PLS+ +KL +GE K T + LQLAD+S+  P GIVENVL++V +F  P+D  V++M 
Subjt:  RVQQ---KVPEKVADPGSFSVPCSFGTYSF-RALCDLGASINIIPLSLCKKLDIGEIKSTPVKLQLADQSVVRPVGIVENVLIRVGRFFLPIDLYVMDMI

Query:  ENPSMPVILGRPFLATGRAIIDIERRELTIRVKNEKEIFKAVEDSK
        E+  +P+ILGRPFLATG+A+ID+E+ +LT+RV NE+  F   +  K
Subjt:  ENPSMPVILGRPFLATGRAIIDIERRELTIRVKNEKEIFKAVEDSK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCGTAGTAGTAGAGAAGTGATACCATTTGATCCCGAGATAGAAAGAACAATTCTTAGGCTTCGAAGGGAAAATAGAGAAATTATTCAAATGGCAGACCAAAATCTACC
TGAGGAGCCTAGGCCTATTAGAGATTATTTTCAGCCTGTGTTTCAGGGGCAACAGTCGGGAATTGTCTATGCACCGATTAATGCCAACAACTTTGAGCTGAAAACCGGTC
TTATTCAAATGGCTCGAGACTGTGCATACAGAGGATCGCCCACTGAGGATCCAAATTCTCATCTTAAATCATTTTGGATATTTGATAAAGCACGAGATTGGTTGCAGTCT
ATTACTCCTGGGAGCATCACCACCTGGGATGCTTTGGTCCAGGCCTTTTTAAAGAAATTTTTCCCTCCTGCAAAGACGGTCAAGCTGAGGACCGAGATTGGGACATTCCA
ACAACAATATGATGAGCAGCTGTTCGAAGCTTGGGAACGATTCAAAGAGCTACTGAGGAAGTGTCCTCAACATGGTTACCCCGATTGGCTTCAGGTACAGTTGTTTTATA
ATGGTTTAACTCCTAGTACAAAAACGATTGTTGATGCAGCTGCAGGTGGGACTCTGTTGTCCAAGACCGTGGAAAACGCTCGCACACTTTTAGAGGATATGGCCACCAAC
AGCTATCAGTGGCCATCCGAGCGGTCTACACCTAAAAAGATTGCTGCTGGAGTGTTTGAGGTTGATAAAGTAAGTGCACTCCAGGCCCAGATGACTTCCCTTGCCAATGC
TTTTATGAAATTTTCAGGTACAGGGAGTGCACAGTCAATTGAATCAGCTGCTGCTTTAGCATCTAGACCTCAGGAGGAGACCATTGAACAGGTTCAGTATGTATCAAATT
TTAATTCTAGGGGATATAATAATAGTTCTACACCTACACATTATCACCCTAACAATAGGAACCATGAAAATTTCTCGTATGCTAATACTAAGAATGCTCTTAATCCTCCT
GGTTTTGCCCCTCAAACTCAAGATAATAAAAAGTTAGAAGATCTTGTTGGAGCTTTCATTGCAGAGTCTAGTAACAGGACAACCAAGTTAGAGGAGGCAGTCATTACTAT
CAACACCACGGTGAATGGCCACAGTGCAGCTATAAAGAACATTGAGACTCAGCTGGGACAGTTGGTGAATGTTGAAGAATCTGAAGAGGAACCTGAATCTGAGGACTATG
AAACGCCTACAGGGGAAGCTGAGGAGGACACATCATCTGATGAGGCTGAACAGCCTAACCTTGAGCCTCCTATTCCTTCTCCCACACTGATGGTTCCCAAGGAAAAGAAA
AAGAAAAAGAAGAAAAAGAACAATCAGGTTCAGTTTGATAAATTTATGAATGCTTTTATGAATCTGAATATTAATATTCCATTTGCAGAGGCATTAGAGATGCCCCAATA
CAACATGTTTATGAAGGAGTGGTTAGCAAAGAAGCGAAAAGAAAAGAAGGTTGACACTGTTTATCTTGCTTCCACATGCAGCACCAGAGTACAACAAAAAGTACCTGAAA
AAGTAGCAGATCCAGGGAGTTTTTCTGTTCCTTGTAGTTTTGGTACTTATTCTTTCAGAGCATTATGTGATTTAGGTGCTAGCATTAATATCATTCCTCTATCGTTGTGT
AAAAAGTTAGATATAGGTGAGATTAAGTCTACTCCTGTAAAGCTCCAATTGGCTGATCAATCTGTGGTGAGACCAGTTGGTATTGTAGAAAATGTGTTAATCAGAGTAGG
TAGATTTTTCCTCCCTATTGACTTGTATGTTATGGACATGATAGAAAATCCTTCAATGCCTGTCATATTAGGACGACCATTCCTCGCTACTGGGCGAGCGATCATAGATA
TTGAGCGCAGGGAGCTCACTATTAGAGTCAAGAACGAAAAAGAAATCTTTAAAGCAGTTGAAGACTCTAAAGATGAAGTGCTTTACATGGGCTACAGGAAAGGTCTTGTT
TTCTTTCTCTCCTTTGCTTTCAAGCTTTCAAGATTCCAAGCTCTCAAGCAAGAAGTCATCATCATTCCAGGCCAGTATTGTTCACTCCTGTCATTCTTTTATTGTTCTCA
TGCTTGTGAAATGGTTGATTGGAGCTATAATGCCATATTAGAGTTAATCGGGTGCTCGGGGCGTGAAAAGATGCAAAGGAATGAAAAGAGTAAAAGAGGAAAAAAGACAA
ATCTCGGTCAACAGCAGGCTAGCGTCGAGACGCTAGCTCTTGAGCGTCGAGACGCTCACATTCCATATCAGATTAGGCGCGTAAAGCTCACAACGTCGAGACGCTATGAT
AGGAAGCGTCCCGACGCTACCGTTTTTCCTTATTCAGAATGCGCGTATAAGAGGCAGCGTCGCGACACTGTCTTGACAGCGTCTCGACGCTAA
mRNA sequenceShow/hide mRNA sequence
ATGCGTAGTAGTAGAGAAGTGATACCATTTGATCCCGAGATAGAAAGAACAATTCTTAGGCTTCGAAGGGAAAATAGAGAAATTATTCAAATGGCAGACCAAAATCTACC
TGAGGAGCCTAGGCCTATTAGAGATTATTTTCAGCCTGTGTTTCAGGGGCAACAGTCGGGAATTGTCTATGCACCGATTAATGCCAACAACTTTGAGCTGAAAACCGGTC
TTATTCAAATGGCTCGAGACTGTGCATACAGAGGATCGCCCACTGAGGATCCAAATTCTCATCTTAAATCATTTTGGATATTTGATAAAGCACGAGATTGGTTGCAGTCT
ATTACTCCTGGGAGCATCACCACCTGGGATGCTTTGGTCCAGGCCTTTTTAAAGAAATTTTTCCCTCCTGCAAAGACGGTCAAGCTGAGGACCGAGATTGGGACATTCCA
ACAACAATATGATGAGCAGCTGTTCGAAGCTTGGGAACGATTCAAAGAGCTACTGAGGAAGTGTCCTCAACATGGTTACCCCGATTGGCTTCAGGTACAGTTGTTTTATA
ATGGTTTAACTCCTAGTACAAAAACGATTGTTGATGCAGCTGCAGGTGGGACTCTGTTGTCCAAGACCGTGGAAAACGCTCGCACACTTTTAGAGGATATGGCCACCAAC
AGCTATCAGTGGCCATCCGAGCGGTCTACACCTAAAAAGATTGCTGCTGGAGTGTTTGAGGTTGATAAAGTAAGTGCACTCCAGGCCCAGATGACTTCCCTTGCCAATGC
TTTTATGAAATTTTCAGGTACAGGGAGTGCACAGTCAATTGAATCAGCTGCTGCTTTAGCATCTAGACCTCAGGAGGAGACCATTGAACAGGTTCAGTATGTATCAAATT
TTAATTCTAGGGGATATAATAATAGTTCTACACCTACACATTATCACCCTAACAATAGGAACCATGAAAATTTCTCGTATGCTAATACTAAGAATGCTCTTAATCCTCCT
GGTTTTGCCCCTCAAACTCAAGATAATAAAAAGTTAGAAGATCTTGTTGGAGCTTTCATTGCAGAGTCTAGTAACAGGACAACCAAGTTAGAGGAGGCAGTCATTACTAT
CAACACCACGGTGAATGGCCACAGTGCAGCTATAAAGAACATTGAGACTCAGCTGGGACAGTTGGTGAATGTTGAAGAATCTGAAGAGGAACCTGAATCTGAGGACTATG
AAACGCCTACAGGGGAAGCTGAGGAGGACACATCATCTGATGAGGCTGAACAGCCTAACCTTGAGCCTCCTATTCCTTCTCCCACACTGATGGTTCCCAAGGAAAAGAAA
AAGAAAAAGAAGAAAAAGAACAATCAGGTTCAGTTTGATAAATTTATGAATGCTTTTATGAATCTGAATATTAATATTCCATTTGCAGAGGCATTAGAGATGCCCCAATA
CAACATGTTTATGAAGGAGTGGTTAGCAAAGAAGCGAAAAGAAAAGAAGGTTGACACTGTTTATCTTGCTTCCACATGCAGCACCAGAGTACAACAAAAAGTACCTGAAA
AAGTAGCAGATCCAGGGAGTTTTTCTGTTCCTTGTAGTTTTGGTACTTATTCTTTCAGAGCATTATGTGATTTAGGTGCTAGCATTAATATCATTCCTCTATCGTTGTGT
AAAAAGTTAGATATAGGTGAGATTAAGTCTACTCCTGTAAAGCTCCAATTGGCTGATCAATCTGTGGTGAGACCAGTTGGTATTGTAGAAAATGTGTTAATCAGAGTAGG
TAGATTTTTCCTCCCTATTGACTTGTATGTTATGGACATGATAGAAAATCCTTCAATGCCTGTCATATTAGGACGACCATTCCTCGCTACTGGGCGAGCGATCATAGATA
TTGAGCGCAGGGAGCTCACTATTAGAGTCAAGAACGAAAAAGAAATCTTTAAAGCAGTTGAAGACTCTAAAGATGAAGTGCTTTACATGGGCTACAGGAAAGGTCTTGTT
TTCTTTCTCTCCTTTGCTTTCAAGCTTTCAAGATTCCAAGCTCTCAAGCAAGAAGTCATCATCATTCCAGGCCAGTATTGTTCACTCCTGTCATTCTTTTATTGTTCTCA
TGCTTGTGAAATGGTTGATTGGAGCTATAATGCCATATTAGAGTTAATCGGGTGCTCGGGGCGTGAAAAGATGCAAAGGAATGAAAAGAGTAAAAGAGGAAAAAAGACAA
ATCTCGGTCAACAGCAGGCTAGCGTCGAGACGCTAGCTCTTGAGCGTCGAGACGCTCACATTCCATATCAGATTAGGCGCGTAAAGCTCACAACGTCGAGACGCTATGAT
AGGAAGCGTCCCGACGCTACCGTTTTTCCTTATTCAGAATGCGCGTATAAGAGGCAGCGTCGCGACACTGTCTTGACAGCGTCTCGACGCTAA
Protein sequenceShow/hide protein sequence
MRSSREVIPFDPEIERTILRLRRENREIIQMADQNLPEEPRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTEDPNSHLKSFWIFDKARDWLQS
ITPGSITTWDALVQAFLKKFFPPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKTVENARTLLEDMATN
SYQWPSERSTPKKIAAGVFEVDKVSALQAQMTSLANAFMKFSGTGSAQSIESAAALASRPQEETIEQVQYVSNFNSRGYNNSSTPTHYHPNNRNHENFSYANTKNALNPP
GFAPQTQDNKKLEDLVGAFIAESSNRTTKLEEAVITINTTVNGHSAAIKNIETQLGQLVNVEESEEEPESEDYETPTGEAEEDTSSDEAEQPNLEPPIPSPTLMVPKEKK
KKKKKKNNQVQFDKFMNAFMNLNINIPFAEALEMPQYNMFMKEWLAKKRKEKKVDTVYLASTCSTRVQQKVPEKVADPGSFSVPCSFGTYSFRALCDLGASINIIPLSLC
KKLDIGEIKSTPVKLQLADQSVVRPVGIVENVLIRVGRFFLPIDLYVMDMIENPSMPVILGRPFLATGRAIIDIERRELTIRVKNEKEIFKAVEDSKDEVLYMGYRKGLV
FFLSFAFKLSRFQALKQEVIIIPGQYCSLLSFFYCSHACEMVDWSYNAILELIGCSGREKMQRNEKSKRGKKTNLGQQQASVETLALERRDAHIPYQIRRVKLTTSRRYD
RKRPDATVFPYSECAYKRQRRDTVLTASRR