; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0035150 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0035150
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase
Genome locationchr3:15767313..15769398
RNA-Seq ExpressionLag0035150
SyntenyLag0035150
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain
IPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6734747.1 hypothetical protein I3842_01G285500 [Carya illinoinensis]1.5e-13942.94Show/hide
Query:  MADQNPPEEPRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYKESPTEDPNSHLKSFLDICGTVKINGVSEDAIRLRLF----------
        MA+++    PR ++DY +PV  G  S I+  PINANNFELK  LI M +   +  SP +DPN HL  FL+IC TVKINGV+ED IRLRLF          
Subjt:  MADQNPPEEPRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYKESPTEDPNSHLKSFLDICGTVKINGVSEDAIRLRLF----------

Query:  ---------------------KEFFPPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKT
                              +FFPPAKT +LR+EIG F+Q   E L+EAWER+K+L+R+CPQHG PDWLQVQ+FYNGL   T+TIVDAA+GGTL+SKT
Subjt:  ---------------------KEFFPPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKT

Query:  VENARTLLEDMATNSYQWPSERSTHKKIAAGVFEVDKVSALQAQMTSLASAFMKFSGTGSAQSIE--SAAALASRPQEETIEQVQ---------------
         E A  LLE+MA+N+YQWP+ER+  KK+ AG+ E++ ++AL AQ+ +L+      +     QS E  ++ ++     E + EQVQ               
Subjt:  VENARTLLEDMATNSYQWPSERSTHKKIAAGVFEVDKVSALQAQMTSLASAFMKFSGTGSAQSIE--SAAALASRPQEETIEQVQ---------------

Query:  --------NHENFSYANTKNVL---NPPGFAPQTQDNK-KLEDLVGAFIAESSNRTTKLEEAVIAINTTVNGHSAAIKNIETQLGQLVNVVSTMNKGKAP
                NHEN SY NTKNVL   +PPGF  Q  + K  LED + +F+ E++ R  K +  +  I T  +   A +KN+E Q+GQL   ++   +G  P
Subjt:  --------NHENFSYANTKNVL---NPPGFAPQTQDNK-KLEDLVGAFIAESSNRTTKLEEAVIAINTTVNGHSAAIKNIETQLGQLVNVVSTMNKGKAP

Query:  AEQEKPQMEYCKAITVHQ-EESEEEPESEDYETPT----GEAEEDTSADEAEKPNLQ-------------PPIPSPTLLVPKEKKKKKKKKNNQVQFDKF
        +  E    E CKAIT+   +E E  P  E   TPT    G++++    +E     L+             PPI +P L  P+  +K+K  K    QF KF
Subjt:  AEQEKPQMEYCKAITVHQ-EESEEEPESEDYETPT----GEAEEDTSADEAEKPNLQ-------------PPIPSPTLLVPKEKKKKKKKKNNQVQFDKF

Query:  MNAFMNLNINIPFAEALE-MPQYNRFMKEWLAKKRKEKKVDTVYLASTCSTRVQQKVPEKVADPGSFSVPCSFGTYSF-RALCDLGASINIIPLSLCKKL
        ++ F  ++INIPFA+ALE MP Y +F+K+ ++KKR+ ++ +TV L+  CS  +Q+K+P+K+ DP SF++PC+ G   F R LCDLGASIN++P  +C+KL
Subjt:  MNAFMNLNINIPFAEALE-MPQYNRFMKEWLAKKRKEKKVDTVYLASTCSTRVQQKVPEKVADPGSFSVPCSFGTYSF-RALCDLGASINIIPLSLCKKL

Query:  DIGEIKSTPVKLQLADQSVVRPVGIVENVLIRVGRFFLPIDLYVMDMIENPSMPVILGRSFLATGRVIIDIECRELTIRVKNEKEIFKAVEDSK
         +GE+K T + LQLAD+S+  P GI+E+VL++V +F  P D  V+DM E+  +P+ILGR FLATGR +ID++  ELT+RV  E+ +F   +  K
Subjt:  DIGEIKSTPVKLQLADQSVVRPVGIVENVLIRVGRFFLPIDLYVMDMIENPSMPVILGRSFLATGRVIIDIECRELTIRVKNEKEIFKAVEDSK

KAG7947748.1 hypothetical protein I3843_14G109500 [Carya illinoinensis]3.5e-14143.23Show/hide
Query:  MADQNPPEEPRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYKESPTEDPNSHLKSFLDICGTVKINGVSEDAIRLRLF----------
        MA+++    PR ++DY +PV  G  S I+  PINANNFELK  LI M +   +  SP +DPN HL  FL+IC TVKINGV+ED IRLRLF          
Subjt:  MADQNPPEEPRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYKESPTEDPNSHLKSFLDICGTVKINGVSEDAIRLRLF----------

Query:  ---------------------KEFFPPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKT
                              +FFPPAKT +LR+EIG F+Q   E L+EAWER+K+L+R+CPQHG PDWLQVQ+FYNGL   T+TIVDAA+GGTL+SKT
Subjt:  ---------------------KEFFPPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKT

Query:  VENARTLLEDMATNSYQWPSERSTHKKIAAGVFEVDKVSALQAQMTSLASAFMKFSGTGSAQSIE--SAAALASRPQEETIEQVQ---------------
         E A  LLE+MA+N+YQWP+ER+  KK+ AG+ E++ ++AL AQ+ +L+      +     QS E  ++ ++     E + EQVQ               
Subjt:  VENARTLLEDMATNSYQWPSERSTHKKIAAGVFEVDKVSALQAQMTSLASAFMKFSGTGSAQSIE--SAAALASRPQEETIEQVQ---------------

Query:  --------NHENFSYANTKNVL---NPPGFAPQTQDNK-KLEDLVGAFIAESSNRTTKLEEAVIAINTTVNGHSAAIKNIETQLGQLVNVVSTMNKGKAP
                NHEN SY NTKNVL   +PPGF  Q  + K  LED + +F+ E++ R  K +  +  I T  +   A +KN+E Q+GQL   ++   +G  P
Subjt:  --------NHENFSYANTKNVL---NPPGFAPQTQDNK-KLEDLVGAFIAESSNRTTKLEEAVIAINTTVNGHSAAIKNIETQLGQLVNVVSTMNKGKAP

Query:  AEQEKPQMEYCKAITVHQ-EESEEEPESEDYETPT----GEAEEDTSADEAEKPNLQ-------------PPIPSPTLLVPKEKKKKKKKKNNQVQFDKF
        +  E    E CKAIT+   +E E  P  E   TPT    G++++    +E     L+             PPI +P L  P+  +K+K  K    QF KF
Subjt:  AEQEKPQMEYCKAITVHQ-EESEEEPESEDYETPT----GEAEEDTSADEAEKPNLQ-------------PPIPSPTLLVPKEKKKKKKKKNNQVQFDKF

Query:  MNAFMNLNINIPFAEALE-MPQYNRFMKEWLAKKRKEKKVDTVYLASTCSTRVQQKVPEKVADPGSFSVPCSFGTYSF-RALCDLGASINIIPLSLCKKL
        ++ F  ++INIPFA+ALE MP Y +F+K+ ++KKR+ ++ +TV L+  CS  +Q+K+P+K+ DPGSF++PC+ G   F R LCDLGASIN++P S+C+KL
Subjt:  MNAFMNLNINIPFAEALE-MPQYNRFMKEWLAKKRKEKKVDTVYLASTCSTRVQQKVPEKVADPGSFSVPCSFGTYSF-RALCDLGASINIIPLSLCKKL

Query:  DIGEIKSTPVKLQLADQSVVRPVGIVENVLIRVGRFFLPIDLYVMDMIENPSMPVILGRSFLATGRVIIDIECRELTIRVKNEKEIFKAVEDSK
         +GE+K T + LQLAD+S+  P GI+E+VL++V +F  P D  V+DM E+  +P+ILGR FLATGR +ID++  ELT+RV  E+ +F   +  K
Subjt:  DIGEIKSTPVKLQLADQSVVRPVGIVENVLIRVGRFFLPIDLYVMDMIENPSMPVILGRSFLATGRVIIDIECRELTIRVKNEKEIFKAVEDSK

KAG7990634.1 hypothetical protein I3843_02G035100 [Carya illinoinensis]7.1e-14243.9Show/hide
Query:  MADQNPPEEPRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYKESPTEDPNSHLKSFLDICGTVKINGVSEDAIRLRLF----------
        MA+++    PR ++DY +PV  G  S I+  PINANNFELK  LI M +   +  SP +DPN HL  FL+IC TVKINGV+ED IRLRLF          
Subjt:  MADQNPPEEPRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYKESPTEDPNSHLKSFLDICGTVKINGVSEDAIRLRLF----------

Query:  ---------------------KEFFPPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKT
                              +FFPPAKT +LR+EIG F+Q   E L+EAWER+K+L+R+CPQHG PDWLQVQ+FYNGL   T+TIVDAA+GGTL+SKT
Subjt:  ---------------------KEFFPPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKT

Query:  VENARTLLEDMATNSYQWPSERSTHKKIAAGVFEVDKVSALQAQMTSLASAFMKFSGTGSAQSIE--SAAALASRPQEETIEQVQ---------------
         E A  LLE+MA+N+YQWP+ER+  KK+ AG+ +++ ++AL AQ+ +L+      +     QS E  ++ ++     E + EQVQ               
Subjt:  VENARTLLEDMATNSYQWPSERSTHKKIAAGVFEVDKVSALQAQMTSLASAFMKFSGTGSAQSIE--SAAALASRPQEETIEQVQ---------------

Query:  --------NHENFSYANTKNVL---NPPGFAPQTQDNK-KLEDLVGAFIAESSNRTTKLEEAVIAINTTVNGHSAAIKNIETQLGQLVNVVSTMNKGKAP
                NHEN SY NTKNVL   +PPGF  Q  + K  LED + +F+ E++ R  K +  +  I T  +   AAIKNIE Q+GQL   ++   +G  P
Subjt:  --------NHENFSYANTKNVL---NPPGFAPQTQDNK-KLEDLVGAFIAESSNRTTKLEEAVIAINTTVNGHSAAIKNIETQLGQLVNVVSTMNKGKAP

Query:  AEQEKPQMEYCKAITVHQ-EESEEEPESEDYETPT----GEAEEDTSADEAEKPNLQ-------------PPIPSPTLLVPKEKKKKKKKKNNQVQFDKF
        +  E    E CKAIT+   +E E  P  E   TPT    G+++     DE     L+             PPI +P L  P+  +K+K  K    QF KF
Subjt:  AEQEKPQMEYCKAITVHQ-EESEEEPESEDYETPT----GEAEEDTSADEAEKPNLQ-------------PPIPSPTLLVPKEKKKKKKKKNNQVQFDKF

Query:  MNAFMNLNINIPFAEALE-MPQYNRFMKEWLAKKRKEKKVDTVYLASTCSTRVQQKVPEKVADPGSFSVPCSFGTYSF-RALCDLGASINIIPLSLCKKL
        ++ F  ++INIPFA+ALE MP Y +F+K+ ++KKR+ ++ +TV L+  CS  +Q+K+P+K+ DPGSF++PC+ G   F + LCDLGASIN++PLS+C+KL
Subjt:  MNAFMNLNINIPFAEALE-MPQYNRFMKEWLAKKRKEKKVDTVYLASTCSTRVQQKVPEKVADPGSFSVPCSFGTYSF-RALCDLGASINIIPLSLCKKL

Query:  DIGEIKSTPVKLQLADQSVVRPVGIVENVLIRVGRFFLPIDLYVMDMIENPSMPVILGRSFLATGRVIIDIECRELTIRVKNEKEIFK
         + E+K T + LQLAD+S+  P GI+E+VL++V +F  P D  V+DM E+  +P+ILGR FLATGR +ID++  ELT+RV  E+ +FK
Subjt:  DIGEIKSTPVKLQLADQSVVRPVGIVENVLIRVGRFFLPIDLYVMDMIENPSMPVILGRSFLATGRVIIDIECRELTIRVKNEKEIFK

XP_022843226.1 uncharacterized protein LOC111366761 [Olea europaea var. sylvestris]1.0e-13243.59Show/hide
Query:  NPPEEPRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYKESPTEDPNSHLKSFLDICGTVKINGVSEDAIRLRLFK-------------
        N   + R IRDY +PV     SGI    I A NFELK GLI M +   +  +  EDPN+HL SFL+IC TVK+NGV+EDAIRLRLF              
Subjt:  NPPEEPRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYKESPTEDPNSHLKSFLDICGTVKINGVSEDAIRLRLFK-------------

Query:  ------------------EFFPPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKTVENA
                          ++FPP+K+ +LR EI  F+Q   E  +EAWERFK+LLR+CPQHG+  W+Q+++FYNGL   T+T+VDAAAGG L++KT E A
Subjt:  ------------------EFFPPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKTVENA

Query:  RTLLEDMATNSYQWPSERSTHKKIAAGVFEVDKVSALQAQMTSLASAFMKFSGTGSAQSIESAAALASRPQEETI--EQVQ-------------------
          LL+D+ATNSYQWPSERS  KK+ AG+ EVD ++AL AQ+ SL +  +  +  G+ Q+++S  + +S  QE  +  EQVQ                   
Subjt:  RTLLEDMATNSYQWPSERSTHKKIAAGVFEVDKVSALQAQMTSLASAFMKFSGTGSAQSIESAAALASRPQEETI--EQVQ-------------------

Query:  ----NHENFSYANTKNVLN-PPGFAPQTQDNK-KLEDLVGAFIAESSNRTTKLEEAVIAINTTVNGHSAAIKNIETQLGQLVNVVSTMNKGKAPAEQEKP
            NHEN SY N +N L  PPGF  Q  D K  LED++G FI+E+ +R  K E  +  I T V+   A +KN+E Q+GQL  ++ +  KGK P++ E  
Subjt:  ----NHENFSYANTKNVLN-PPGFAPQTQDNK-KLEDLVGAFIAESSNRTTKLEEAVIAINTTVNGHSAAIKNIETQLGQLVNVVSTMNKGKAPAEQEKP

Query:  QMEYCKAITVHQEESEEEPESEDYETP------TGEAEEDTSADEAEKPNL----------QPPIPSPTLLVPKEKKKKKKKKNNQVQFDKFMNAFMNLN
          E+C AIT+   +  EE + +    P      T E + +    EAE   +           PPI  P L  P+   KKK       QF KF+  F  ++
Subjt:  QMEYCKAITVHQEESEEEPESEDYETP------TGEAEEDTSADEAEKPNL----------QPPIPSPTLLVPKEKKKKKKKKNNQVQFDKFMNAFMNLN

Query:  INIPFAEAL-EMPQYNRFMKEWLAKKRKEKKVDTVYLASTCSTRVQQKVPEKVADPGSFSVPCSFGTYSF-RALCDLGASINIIPLSLCKKLDIGEIKST
        INIPFAE L +MP Y +F+KE ++ K+K ++ +T+ L   CS  + QK+P K+ DPGSF++PC+ G  +F RALCD GASIN++PLS+ KKL +GE+K T
Subjt:  INIPFAEAL-EMPQYNRFMKEWLAKKRKEKKVDTVYLASTCSTRVQQKVPEKVADPGSFSVPCSFGTYSF-RALCDLGASINIIPLSLCKKLDIGEIKST

Query:  PVKLQLADQSVVRPVGIVENVLIRVGRFFLPIDLYVMDMIENPSMPVILGRSFLATGRVIIDI
         + LQLAD+S+  P G++E+VL++V +F LP+D  V+DM EN  +P+ILGR FLATGR +ID+
Subjt:  PVKLQLADQSVVRPVGIVENVLIRVGRFFLPIDLYVMDMIENPSMPVILGRSFLATGRVIIDI

XP_023874613.1 uncharacterized protein LOC111987139 [Quercus suber]2.4e-13742.73Show/hide
Query:  MADQNPPEEPRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYKESPTEDPNSHLKSFLDICGTVKINGVSEDAIRLRLF----------
        MA+     +PR ++DY +P+     SGI    INANNFELK  LI M +   +  SP +DPN HL  FL+IC T+K+NGV+ED IRLRLF          
Subjt:  MADQNPPEEPRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYKESPTEDPNSHLKSFLDICGTVKINGVSEDAIRLRLF----------

Query:  ---------------------KEFFPPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKT
                              +FFPPAKT +LR+EIG F+Q   E L+EAWER+K+L+R CPQHG PDWLQVQ+FYNGL   T+TIVDAA+GGTL+SKT
Subjt:  ---------------------KEFFPPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKT

Query:  VENARTLLEDMATNSYQWPSERSTHKKIAAGVFEVDKVSALQAQMTSLASAFMKFSGTGSAQSIE--SAAALASRPQEETIEQVQ---------------
         E A +LLE+MA+N+YQWP+ER+  KK+ AG+ E++  +AL AQ+ SL+      +     Q  E  +A+++     E + EQVQ               
Subjt:  VENARTLLEDMATNSYQWPSERSTHKKIAAGVFEVDKVSALQAQMTSLASAFMKFSGTGSAQSIE--SAAALASRPQEETIEQVQ---------------

Query:  --------NHENFSYANTKNVLN-PPGFAPQTQDNK-KLEDLVGAFIAESSNRTTKLEEAVIAINTTVNGHSAAIKNIETQLGQLVNVVSTMNKGKAPAE
                NHENFSY NTKNVL  PPGF  Q  + K  LED + +F+ E+     K +  +  I T  +   A +KN+E Q+GQL   ++   +G  P+ 
Subjt:  --------NHENFSYANTKNVLN-PPGFAPQTQDNK-KLEDLVGAFIAESSNRTTKLEEAVIAINTTVNGHSAAIKNIETQLGQLVNVVSTMNKGKAPAE

Query:  QEKPQMEYCKAITVHQ-EESEEEPESEDYETPTG-------------EAEEDTSADEAEKPNLQPPIPSPTLLVPKEKKKKKKKKNNQVQFDKFMNAFMN
         E    E CKAIT+    E E  P  E   TPT              E  EDT  +    P++  P   P L  P    ++ +K+    QF KF++ F  
Subjt:  QEKPQMEYCKAITVHQ-EESEEEPESEDYETPTG-------------EAEEDTSADEAEKPNLQPPIPSPTLLVPKEKKKKKKKKNNQVQFDKFMNAFMN

Query:  LNINIPFAEALE-MPQYNRFMKEWLAKKRKEKKVDTVYLASTCSTRVQQKVPEKVADPGSFSVPCSFGTYSF-RALCDLGASINIIPLSLCKKLDIGEIK
        ++INIPFA+ALE MP Y +F+K+ ++KKR+ ++ +TV L+  CS  +Q+K+P+K+ DPGSF++PC+ G   F + LCDLGASIN++PLS+ +KL +GE+K
Subjt:  LNINIPFAEALE-MPQYNRFMKEWLAKKRKEKKVDTVYLASTCSTRVQQKVPEKVADPGSFSVPCSFGTYSF-RALCDLGASINIIPLSLCKKLDIGEIK

Query:  STPVKLQLADQSVVRPVGIVENVLIRVGRFFLPIDLYVMDMIENPSMPVILGRSFLATGRVIIDIECRELTIRVKNEKEIFKAVEDSK
         T + LQLAD+S+  P GI+E+VL++V +F  P D  V+DM E+  +P+ILGR FLATGR ++D++  ELT+RV  E+  F   E  K
Subjt:  STPVKLQLADQSVVRPVGIVENVLIRVGRFFLPIDLYVMDMIENPSMPVILGRSFLATGRVIIDIECRELTIRVKNEKEIFKAVEDSK

TrEMBL top hitse value%identityAlignment
A0A6J0ZX64 LOW QUALITY PROTEIN: uncharacterized protein LOC1104129453.6e-11540Show/hide
Query:  PEEPRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCA-YKESPTEDPNSHLKSFLDICGTVKINGVSEDAIRLRLF---------------
        PE  R +RDY  P+ QG    I    INANNFE+K   IQM +    +   P++DPNSHL +FL+IC T K NGV++DAIRLRLF               
Subjt:  PEEPRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCA-YKESPTEDPNSHLKSFLDICGTVKINGVSEDAIRLRLF---------------

Query:  ----------------KEFFPPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKTVENAR
                         +FFPPAKT K+R +I +F Q   E L+EAWERFKELLR+CP HG PDWLQVQ FYNGL  S KTI+DAAAGG L+SK   +A 
Subjt:  ----------------KEFFPPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKTVENAR

Query:  TLLEDMATNSYQWPSERSTHKKIAAGVFEVDKVSALQAQMTSLASAFMKFS-------------------------GTGSAQSIESAAALASRPQEETIE
         LLE+MA+N+YQWPSERS  +K A G +E+D +  L  Q+ +L+                                 + S Q + +     + P   T  
Subjt:  TLLEDMATNSYQWPSERSTHKKIAAGVFEVDKVSALQAQMTSLASAFMKFS-------------------------GTGSAQSIESAAALASRPQEETIE

Query:  Q-VQNHENFSYANTKNVLN-----PPGF----APQTQDNK-KLEDLVGAFIAESSNRTTKLEEAVIAINTTVNGHSAAIKNIETQLGQLVNVVSTMNKGK
           +NH NFS++N     N     PPGF     PQ  + K +LE+L+  +I+++              +  +    A+++N+ETQ+GQL N ++   +G 
Subjt:  Q-VQNHENFSYANTKNVLN-----PPGF----APQTQDNK-KLEDLVGAFIAESSNRTTKLEEAVIAINTTVNGHSAAIKNIETQLGQLVNVVSTMNKGK

Query:  APAE-QEKPQ-MEYCKAIT---------VHQEESEEEPESED------YETPTGEAEEDTSADEAEKPNLQPPIPSPTLLVPKEKKKKKKKKNNQVQFDK
         P++ Q  P+  E C+AIT         V+Q+  E E E  D       E    + ++D + ++     + PP P P  L   +K+K +K      QF K
Subjt:  APAE-QEKPQ-MEYCKAIT---------VHQEESEEEPESED------YETPTGEAEEDTSADEAEKPNLQPPIPSPTLLVPKEKKKKKKKKNNQVQFDK

Query:  FMNAFMNLNINIPFAEALE-MPQYNRFMKEWLAKKRKEKKVDTVYLASTCSTRVQQKVPEKVADPGSFSVPCSFGTYSF-RALCDLGASINIIPLSLCKK
        F+N F  L+INIPFAEALE MP Y +F+K+ L+KKRK  + +TV+L   CS  +Q K+P K+ DPGSF++PC+ G   F +AL DLGASIN++P S+ +K
Subjt:  FMNAFMNLNINIPFAEALE-MPQYNRFMKEWLAKKRKEKKVDTVYLASTCSTRVQQKVPEKVADPGSFSVPCSFGTYSF-RALCDLGASINIIPLSLCKK

Query:  LDIGEIKSTPVKLQLADQSVVRPVGIVENVLIRVGRFFLPIDLYVMDMIENPSMPVILGRSFLATGRVIIDIECRELTIRVKNEKEIFKAVEDSK
        L +GE K T V LQLAD+S V P GI+E+VL++V +F  P+D  ++DM E+  +P+ILGR FLAT   IID+   +++ +V  E   F     SK
Subjt:  LDIGEIKSTPVKLQLADQSVVRPVGIVENVLIRVGRFFLPIDLYVMDMIENPSMPVILGRSFLATGRVIIDIECRELTIRVKNEKEIFKAVEDSK

A0A6J1DU19 uncharacterized protein LOC1110243616.5e-11743.07Show/hide
Query:  IRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYKESPTEDPNSHLKSFLDICGTVKINGVSEDAIRLRLF--------------KEFFPPAK
        IRDY QP F     GI+  PINANN ELK GLIQM R+  ++ + TEDPN+HL  FLD+CGTVK+NGV +DAIRLRLF                FFPPAK
Subjt:  IRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYKESPTEDPNSHLKSFLDICGTVKINGVSEDAIRLRLF--------------KEFFPPAK

Query:  TVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKTVENARTLLEDMATNSYQWPSERSTHKKIA
        T +LRTEI +F++   EQLFE WER+KELLRKCPQHG  +WLQ+Q+FYNGL   T+TI+DAAAGGTLLS+T ENA  LL+DMA NS+QWPSERS  KK+ 
Subjt:  TVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKTVENARTLLEDMATNSYQWPSERSTHKKIA

Query:  AGVFEVDKVSALQAQMTSLASAFMKFSGTGSAQSIESAAALASRP-QEETIEQVQNHENFSYANTKNVLNPPGFAPQTQDNK-KLEDLVGAFIAESSNRT
        AG++E+D++S+L+AQ+ +L +A  K SG G++ S E  AA  +    E TIEQ Q                  F     + K  LEDL+GAFI E  +R 
Subjt:  AGVFEVDKVSALQAQMTSLASAFMKFSGTGSAQSIESAAALASRP-QEETIEQVQNHENFSYANTKNVLNPPGFAPQTQDNK-KLEDLVGAFIAESSNRT

Query:  TKLEEAVIAINTTVNGHSAAIKNIETQLGQLVNVVSTMNKGKAPAEQEKPQMEYCKAITVHQEESEEEPESEDYETPTGEAEEDTSADEAEKPNLQPPIP
        +++E  V  +   + G++ +IKN+E Q+GQ+   ++TM KGK P++ E    E+CKA+T+   +  +EPE +  E P    EE  + +E  K        
Subjt:  TKLEEAVIAINTTVNGHSAAIKNIETQLGQLVNVVSTMNKGKAPAEQEKPQMEYCKAITVHQEESEEEPESEDYETPTGEAEEDTSADEAEKPNLQPPIP

Query:  SPTLLVPKEKKKKKKKKNNQVQFDKFMNAFMNLNINIPFAEALE-MPQYNRFMKEWLAKKRKEKKVDTVYLASTCSTRVQQKVPEKVADPGSFSVPCSFG
        +P L   K          N + + +                ALE MP Y RFMK+ +  KRK +  +TV L   CS  +Q+K+P+K+ DPGSF++PC+  
Subjt:  SPTLLVPKEKKKKKKKKNNQVQFDKFMNAFMNLNINIPFAEALE-MPQYNRFMKEWLAKKRKEKKVDTVYLASTCSTRVQQKVPEKVADPGSFSVPCSFG

Query:  TYSF-RALCDLGASINIIPLSLCKKLDIGEIKSTPVKLQLADQSVVRPVGIVENVLIRVGRFFLPIDLYVMDMIENPSMPVILGRSFLATGRVIIDIECR
        + SF +ALCD+ ASIN++PL                             G++E+VL++V R   P D  V+   E+  +P+ILGR FLATG  +ID++  
Subjt:  TYSF-RALCDLGASINIIPLSLCKKLDIGEIKSTPVKLQLADQSVVRPVGIVENVLIRVGRFFLPIDLYVMDMIENPSMPVILGRSFLATGRVIIDIECR

Query:  ELTIRVKNEKEIF
         LT+RV  E  +F
Subjt:  ELTIRVKNEKEIF

A0A6P6XAQ1 Reverse transcriptase6.1e-10738.38Show/hide
Query:  RPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYKESPTEDPNSHLKSFLDICGTVKINGVSEDAIRLRLF--------------------
        R +RD+  P  QG Q+ IV   +NANNFE+K  LIQM +   Y  + TEDPNSHL +FL+IC T+K NGVSEDAI+LRLF                    
Subjt:  RPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYKESPTEDPNSHLKSFLDICGTVKINGVSEDAIRLRLF--------------------

Query:  -----------KEFFPPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKTVENARTLLED
                    +FFPP KT KLR +I +F QQ  E L+EAWER++EL R+CP HG PDWL VQ FYNGLT  TKT VDAAAGG L+ KT E A+ L+E+
Subjt:  -----------KEFFPPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKTVENARTLLED

Query:  MATNSYQWPSERSTHKKIAAGVFEVDKVSALQAQMTSLASAFMKFSGTGSAQSIESAAALASRPQEE-----TIEQVQ----------------------
        MA N+YQW +ER   ++  AG+ EVD ++ L A+M ++     +  G+ S Q +  A+        +     + EQVQ                      
Subjt:  MATNSYQWPSERSTHKKIAAGVFEVDKVSALQAQMTSLASAFMKFSGTGSAQSIESAAALASRPQEE-----TIEQVQ----------------------

Query:  -NHENFSY---ANTKNVLNPPGFAPQ--TQDNKKLEDLVGAFIAESSN-RTTKLEEAVIAINTTVNGHSAAI----KNIETQLGQLVNVVSTMNKGKAPA
         NH NF +    N +  +NPPGF  +    ++K   +L    +A +SN +  KL  A       + G    +    +N+E QLGQ+ N V+  N+G  P+
Subjt:  -NHENFSY---ANTKNVLNPPGFAPQ--TQDNKKLEDLVGAFIAESSN-RTTKLEEAVIAINTTVNGHSAAI----KNIETQLGQLVNVVSTMNKGKAPA

Query:  EQEKPQMEYCKAITVHQEESEEEPESEDYETPTGEAEEDTSADEAEKPNLQPPIPSPTLLVPKEKKKKKKKKNNQVQFDKFMNAFMNLNINIPFAEALEM
        + E    E+ KAIT+   +   EP         G   E    +  +   L+           KE+K K+K + N++Q        M     IP      +
Subjt:  EQEKPQMEYCKAITVHQEESEEEPESEDYETPTGEAEEDTSADEAEKPNLQPPIPSPTLLVPKEKKKKKKKKNNQVQFDKFMNAFMNLNINIPFAEALEM

Query:  PQYNRFMKEWLAKKRKEKKVDTVYLASTCSTRVQQKVPEKVADPGSFSVPCSFGTYSF-RALCDLGASINIIPLSLCKKLDIGEIKSTPVKLQLADQSVV
        P Y +F+KE + KKRK    +T+ L   CS  +Q K+P K+ DPGSF+VPC+ G   F +ALCDLGAS+++IPL++ ++L + E+K T + LQLAD+S+ 
Subjt:  PQYNRFMKEWLAKKRKEKKVDTVYLASTCSTRVQQKVPEKVADPGSFSVPCSFGTYSF-RALCDLGASINIIPLSLCKKLDIGEIKSTPVKLQLADQSVV

Query:  RPVGIVENVLIRVGRFFLPIDLYVMDMIENPSMPVILGRSFLATGRVIIDIECRELTIRVKNEKEIF
         P+GI+ENVLI+V +F +P+D  V+DM E+ ++P+ILGR FLAT   IID++  +   ++  E+  F
Subjt:  RPVGIVENVLIRVGRFFLPIDLYVMDMIENPSMPVILGRSFLATGRVIIDIECRELTIRVKNEKEIF

A0A6P8DD93 uncharacterized protein LOC1162064535.8e-9735.6Show/hide
Query:  RPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYKESPTEDPNSHLKSFLDICGTVKINGVSEDAIRLRLF--------------------
        R +RDY  P   G  S I    I ANNFELK  LIQM +   +   P E P+ H+  FL  C TVK+N V++D IRL+LF                    
Subjt:  RPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYKESPTEDPNSHLKSFLDICGTVKINGVSEDAIRLRLF--------------------

Query:  -----------KEFFPPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKTVENARTLLED
                   + FFPPA+T +LR EI  F +   E L+EAWERFKE +RKCP HG PD L +++FY  L  + +++VDAAAGG L+ K  + A  L+E+
Subjt:  -----------KEFFPPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKTVENARTLLED

Query:  MATNSYQWPSERSTHKKIAAGVFEVDKVSALQAQMTSLASAFMKFSG---------------TGSAQSIESAAALAS-RPQEETIEQV------------
        MA++++ W +ERS  K   A V ++D ++ L  Q+++L +   K +                +G   ++E  +   S  P  E +  V            
Subjt:  MATNSYQWPSERSTHKKIAAGVFEVDKVSALQAQMTSLASAFMKFSG---------------TGSAQSIESAAALAS-RPQEETIEQV------------

Query:  -------QNHENFSYANTKNVLN-PPGF--------APQTQDNKKLEDLVGAFIAESSNRTTKLEEAVIAINTTVNGHSAAIKNIETQLGQLVNVVSTMN
               +NH NFS+ N  N L  PPGF        AP  Q   ++E+L+ +++ ++              +T +    A I+N+E Q+ Q+   +S   
Subjt:  -------QNHENFSYANTKNVLN-PPGF--------APQTQDNKKLEDLVGAFIAESSNRTTKLEEAVIAINTTVNGHSAAIKNIETQLGQLVNVVSTMN

Query:  KGKAPAE-QEKPQ------MEYCKAITVHQEESEEEPESEDYETPTGEAEEDTSADEAEKPNLQPPIPSPTLLVPKEKKKKKKKKNNQVQFDKFMNAFMN
         G  P+  +E P+      +   K + +   +++ + ES + +    + EE        KP + PP+P P         ++ K++    QF KF++ F  
Subjt:  KGKAPAE-QEKPQ------MEYCKAITVHQEESEEEPESEDYETPTGEAEEDTSADEAEKPNLQPPIPSPTLLVPKEKKKKKKKKNNQVQFDKFMNAFMN

Query:  LNINIPFAEAL-EMPQYNRFMKEWLAKKRKEKKVDTVYLASTCSTRVQQ---KVPEKVADPGSFSVPCSFGTYSF-RALCDLGASINIIPLSLCKKLDIG
        L INIPFAEAL +MP Y RFMK+ L KKRK    + V L   CS  +Q+    +P K  D GSF+VPC+ G + F   L D GASIN++PLS+ +KL +G
Subjt:  LNINIPFAEAL-EMPQYNRFMKEWLAKKRKEKKVDTVYLASTCSTRVQQ---KVPEKVADPGSFSVPCSFGTYSF-RALCDLGASINIIPLSLCKKLDIG

Query:  EIKSTPVKLQLADQSVVRPVGIVENVLIRVGRFFLPIDLYVMDMIENPSMPVILGRSFLATGRVIIDIECRELTIRVKNEKEIFKAVEDSK
        E K T V LQLAD+S+  P GIVENVL++V +F  P+D  V++M E+  +P+ILGR FLATG+ +ID+E  +LT+RV NE+  F   +  K
Subjt:  EIKSTPVKLQLADQSVVRPVGIVENVLIRVGRFFLPIDLYVMDMIENPSMPVILGRSFLATGRVIIDIECRELTIRVKNEKEIFKAVEDSK

A0A6P8DKJ2 uncharacterized protein LOC1162042315.8e-9735.6Show/hide
Query:  RPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYKESPTEDPNSHLKSFLDICGTVKINGVSEDAIRLRLF--------------------
        R +RDY  P   G  S I    I ANNFELK  LIQM +   +   P E P+ H+  FL  C TVK+N V++D IRL+LF                    
Subjt:  RPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYKESPTEDPNSHLKSFLDICGTVKINGVSEDAIRLRLF--------------------

Query:  -----------KEFFPPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKTVENARTLLED
                   + FFPPA+T +LR EI  F +   E L+EAWERFKE +RKCP HG PD L +++FY  L  + +++VDAAAGG L+ K  + A  L+E+
Subjt:  -----------KEFFPPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKTVENARTLLED

Query:  MATNSYQWPSERSTHKKIAAGVFEVDKVSALQAQMTSLASAFMKFSG---------------TGSAQSIESAAALAS-RPQEETIEQV------------
        MA++++ W +ERS  K   A V ++D ++ L  Q+++L +   K +                +G   ++E  +   S  P  E +  V            
Subjt:  MATNSYQWPSERSTHKKIAAGVFEVDKVSALQAQMTSLASAFMKFSG---------------TGSAQSIESAAALAS-RPQEETIEQV------------

Query:  -------QNHENFSYANTKNVLN-PPGF--------APQTQDNKKLEDLVGAFIAESSNRTTKLEEAVIAINTTVNGHSAAIKNIETQLGQLVNVVSTMN
               +NH NFS+ N  N L  PPGF        AP  Q   ++E+L+ +++ ++              +T +    A I+N+E Q+ Q+   +S   
Subjt:  -------QNHENFSYANTKNVLN-PPGF--------APQTQDNKKLEDLVGAFIAESSNRTTKLEEAVIAINTTVNGHSAAIKNIETQLGQLVNVVSTMN

Query:  KGKAPAE-QEKPQ------MEYCKAITVHQEESEEEPESEDYETPTGEAEEDTSADEAEKPNLQPPIPSPTLLVPKEKKKKKKKKNNQVQFDKFMNAFMN
         G  P+  +E P+      +   K + +   +++ + ES + +    + EE        KP + PP+P P  L         K++    QF KF++ F  
Subjt:  KGKAPAE-QEKPQ------MEYCKAITVHQEESEEEPESEDYETPTGEAEEDTSADEAEKPNLQPPIPSPTLLVPKEKKKKKKKKNNQVQFDKFMNAFMN

Query:  LNINIPFAEAL-EMPQYNRFMKEWLAKKRKEKKVDTVYLASTCSTRVQQ---KVPEKVADPGSFSVPCSFGTYSF-RALCDLGASINIIPLSLCKKLDIG
        L INIPFAEAL +MP Y RFMK+ L KKRK    + V L   CS  +Q+    +P K  D GSF+VPC+ G + F   L D GASIN++PLS+ +KL +G
Subjt:  LNINIPFAEAL-EMPQYNRFMKEWLAKKRKEKKVDTVYLASTCSTRVQQ---KVPEKVADPGSFSVPCSFGTYSF-RALCDLGASINIIPLSLCKKLDIG

Query:  EIKSTPVKLQLADQSVVRPVGIVENVLIRVGRFFLPIDLYVMDMIENPSMPVILGRSFLATGRVIIDIECRELTIRVKNEKEIFKAVEDSK
        E K T + LQLAD+S+  P GIVENVL++V +F  P+D  V++M E+  +P+ILGR FLATG+ +ID+E  +LT+RV NE+  F   +  K
Subjt:  EIKSTPVKLQLADQSVVRPVGIVENVLIRVGRFFLPIDLYVMDMIENPSMPVILGRSFLATGRVIIDIECRELTIRVKNEKEIFKAVEDSK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTGACCAAAATCCACCTGAGGAGCCTAGGCCTATTAGAGATTATTTTCAGCCTGTGTTTCAGGGGCAACAGTCGGGGATTGTCTATGCACCGATTAATGCCAACAA
CTTTGAGCTGAAAACCGGCCTCATTCAGATGGCTCGAGACTGTGCATATAAAGAATCACCCACCGAGGATCCAAATTCTCATCTTAAATCATTCTTGGATATTTGTGGGA
CGGTAAAAATTAATGGTGTCTCTGAGGATGCTATTCGCTTACGCCTTTTTAAAGAATTTTTCCCTCCTGCAAAGACGGTCAAGCTGAGGACCGAGATTGGGACATTCCAA
CAACAATATGATGAGCAGCTGTTCGAAGCTTGGGAGCGATTCAAAGAGCTACTGAGGAAGTGTCCTCAGCATGGTTACCCCGATTGGCTTCAGGTACAGTTGTTTTATAA
TGGTTTAACTCCTAGTACAAAAACGATTGTTGATGCAGCTGCAGGTGGGACTCTGTTGTCCAAGACCGTGGAAAACGCTCGCACACTTCTAGAGGATATGGCCACCAACA
GCTATCAGTGGCCATCTGAGCGGTCTACACATAAAAAGATTGCTGCTGGAGTGTTTGAGGTTGATAAAGTAAGTGCACTCCAGGCCCAGATGACTTCCCTCGCTAGTGCT
TTTATGAAATTTTCAGGTACAGGGAGTGCACAGTCAATTGAATCAGCTGCTGCTTTGGCATCTAGACCTCAGGAGGAGACTATTGAACAAGTTCAGAACCATGAAAATTT
CTCTTATGCTAATACTAAGAATGTTCTTAATCCTCCTGGTTTTGCCCCTCAAACTCAAGATAATAAAAAGTTAGAAGATCTTGTTGGAGCTTTCATTGCAGAGTCTAGTA
ACAGGACAACCAAATTAGAGGAGGCAGTCATTGCTATCAACACCACGGTGAATGGCCACAGTGCAGCCATAAAGAACATTGAGACTCAGCTGGGACAGTTGGTAAATGTT
GTAAGTACCATGAATAAAGGTAAGGCCCCAGCTGAACAAGAGAAACCTCAGATGGAGTACTGTAAGGCAATCACTGTGCACCAGGAGGAATCTGAAGAGGAACCTGAATC
TGAGGACTATGAAACGCCTACAGGGGAAGCTGAGGAGGACACGTCAGCAGATGAGGCTGAAAAGCCTAACCTTCAGCCTCCTATTCCTTCTCCCACACTGTTGGTTCCCA
AGGAAAAGAAAAAGAAAAAGAAGAAAAAGAACAATCAGGTTCAGTTTGATAAATTTATGAATGCTTTTATGAATCTGAATATTAATATTCCTTTTGCAGAGGCATTAGAG
ATGCCCCAATACAACAGGTTCATGAAGGAGTGGTTAGCAAAGAAGCGAAAAGAAAAGAAGGTTGACACTGTATATCTTGCTTCCACATGCAGCACTCGAGTCCAACAGAA
GGTACCTGAAAAAGTAGCAGATCCAGGGAGTTTTTCTGTTCCTTGTAGTTTTGGTACTTATTCTTTCAGAGCATTATGTGATTTAGGCGCTAGCATTAATATCATTCCTC
TATCGTTGTGTAAAAAGTTAGATATAGGTGAGATTAAGTCTACTCCTGTAAAGCTCCAATTGGCTGATCAATCTGTGGTGAGACCAGTTGGTATTGTAGAAAATGTGTTA
ATCAGAGTAGGTAGATTTTTTCTCCCTATTGATTTATATGTTATGGACATGATAGAAAATCCTTCAATGCCTGTCATATTAGGAAGATCATTCCTCGCTACTGGGCGAGT
GATTATAGATATTGAGTGCAGGGAGCTCACTATTAGAGTCAAGAACGAAAAAGAAATCTTTAAAGCAGTTGAAGACTCTAAAGATGAAGTGCTTTTCATGGGATATAGGA
AAGGTGCAAGAAAGAGCACCTCTTTTGGATTCATAGAACAAAAGCCTCCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCTGACCAAAATCCACCTGAGGAGCCTAGGCCTATTAGAGATTATTTTCAGCCTGTGTTTCAGGGGCAACAGTCGGGGATTGTCTATGCACCGATTAATGCCAACAA
CTTTGAGCTGAAAACCGGCCTCATTCAGATGGCTCGAGACTGTGCATATAAAGAATCACCCACCGAGGATCCAAATTCTCATCTTAAATCATTCTTGGATATTTGTGGGA
CGGTAAAAATTAATGGTGTCTCTGAGGATGCTATTCGCTTACGCCTTTTTAAAGAATTTTTCCCTCCTGCAAAGACGGTCAAGCTGAGGACCGAGATTGGGACATTCCAA
CAACAATATGATGAGCAGCTGTTCGAAGCTTGGGAGCGATTCAAAGAGCTACTGAGGAAGTGTCCTCAGCATGGTTACCCCGATTGGCTTCAGGTACAGTTGTTTTATAA
TGGTTTAACTCCTAGTACAAAAACGATTGTTGATGCAGCTGCAGGTGGGACTCTGTTGTCCAAGACCGTGGAAAACGCTCGCACACTTCTAGAGGATATGGCCACCAACA
GCTATCAGTGGCCATCTGAGCGGTCTACACATAAAAAGATTGCTGCTGGAGTGTTTGAGGTTGATAAAGTAAGTGCACTCCAGGCCCAGATGACTTCCCTCGCTAGTGCT
TTTATGAAATTTTCAGGTACAGGGAGTGCACAGTCAATTGAATCAGCTGCTGCTTTGGCATCTAGACCTCAGGAGGAGACTATTGAACAAGTTCAGAACCATGAAAATTT
CTCTTATGCTAATACTAAGAATGTTCTTAATCCTCCTGGTTTTGCCCCTCAAACTCAAGATAATAAAAAGTTAGAAGATCTTGTTGGAGCTTTCATTGCAGAGTCTAGTA
ACAGGACAACCAAATTAGAGGAGGCAGTCATTGCTATCAACACCACGGTGAATGGCCACAGTGCAGCCATAAAGAACATTGAGACTCAGCTGGGACAGTTGGTAAATGTT
GTAAGTACCATGAATAAAGGTAAGGCCCCAGCTGAACAAGAGAAACCTCAGATGGAGTACTGTAAGGCAATCACTGTGCACCAGGAGGAATCTGAAGAGGAACCTGAATC
TGAGGACTATGAAACGCCTACAGGGGAAGCTGAGGAGGACACGTCAGCAGATGAGGCTGAAAAGCCTAACCTTCAGCCTCCTATTCCTTCTCCCACACTGTTGGTTCCCA
AGGAAAAGAAAAAGAAAAAGAAGAAAAAGAACAATCAGGTTCAGTTTGATAAATTTATGAATGCTTTTATGAATCTGAATATTAATATTCCTTTTGCAGAGGCATTAGAG
ATGCCCCAATACAACAGGTTCATGAAGGAGTGGTTAGCAAAGAAGCGAAAAGAAAAGAAGGTTGACACTGTATATCTTGCTTCCACATGCAGCACTCGAGTCCAACAGAA
GGTACCTGAAAAAGTAGCAGATCCAGGGAGTTTTTCTGTTCCTTGTAGTTTTGGTACTTATTCTTTCAGAGCATTATGTGATTTAGGCGCTAGCATTAATATCATTCCTC
TATCGTTGTGTAAAAAGTTAGATATAGGTGAGATTAAGTCTACTCCTGTAAAGCTCCAATTGGCTGATCAATCTGTGGTGAGACCAGTTGGTATTGTAGAAAATGTGTTA
ATCAGAGTAGGTAGATTTTTTCTCCCTATTGATTTATATGTTATGGACATGATAGAAAATCCTTCAATGCCTGTCATATTAGGAAGATCATTCCTCGCTACTGGGCGAGT
GATTATAGATATTGAGTGCAGGGAGCTCACTATTAGAGTCAAGAACGAAAAAGAAATCTTTAAAGCAGTTGAAGACTCTAAAGATGAAGTGCTTTTCATGGGATATAGGA
AAGGTGCAAGAAAGAGCACCTCTTTTGGATTCATAGAACAAAAGCCTCCTTGA
Protein sequenceShow/hide protein sequence
MADQNPPEEPRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYKESPTEDPNSHLKSFLDICGTVKINGVSEDAIRLRLFKEFFPPAKTVKLRTEIGTFQ
QQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKTVENARTLLEDMATNSYQWPSERSTHKKIAAGVFEVDKVSALQAQMTSLASA
FMKFSGTGSAQSIESAAALASRPQEETIEQVQNHENFSYANTKNVLNPPGFAPQTQDNKKLEDLVGAFIAESSNRTTKLEEAVIAINTTVNGHSAAIKNIETQLGQLVNV
VSTMNKGKAPAEQEKPQMEYCKAITVHQEESEEEPESEDYETPTGEAEEDTSADEAEKPNLQPPIPSPTLLVPKEKKKKKKKKNNQVQFDKFMNAFMNLNINIPFAEALE
MPQYNRFMKEWLAKKRKEKKVDTVYLASTCSTRVQQKVPEKVADPGSFSVPCSFGTYSFRALCDLGASINIIPLSLCKKLDIGEIKSTPVKLQLADQSVVRPVGIVENVL
IRVGRFFLPIDLYVMDMIENPSMPVILGRSFLATGRVIIDIECRELTIRVKNEKEIFKAVEDSKDEVLFMGYRKGARKSTSFGFIEQKPP