CuGenDBv2

Lag0032111 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0032111
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Reverse transcriptase
Genome location: chr11:25041045..25043225
RNA-Seq Expression: Lag0032111
Synteny: Lag0032111
Gene Ontology terms: NA
InterPro domains: IPR005162 - Retrotransposon gag domain
                  IPR021109 - Aspartic peptidase domain superfamily


Homology
GenBank top hits (e value, % identity, alignment)
KAG6734747.1 hypothetical protein I3842_01G285500 [Carya illinoinensis]; e value: 4.4e-115; % identity: 38.71
Query:  MADQNPPEEPRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTEDPNSHLKSFLTF------------------------DKARD
        MA+++    PR ++DY +PV  G  S I+  PINANNFELK  LI M +   + GSP +DPN HL  FL                          DKAR 
Subjt:  MADQNPPEEPRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTEDPNSHLKSFLTF------------------------DKARD

Query:  WLQSITPGSITTWDALVQAFLKKFFPPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPPHGYPDWLQ---------------------------T
        WLQS+ PGSI +W  + + FL KFFPPAKT +LR+EIG F+Q   E L+EAWER+K+L+R+CP HG PDWLQ                           T
Subjt:  WLQSITPGSITTWDALVQAFLKKFFPPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPPHGYPDWLQ---------------------------T

Query:  VENARILLEDMATNSYQWPSERSAPKKISAGVFEVDKVSALQAQITSLANAFMKFSGTGSAQSIE--SAAALASRPQEETIEQVQYVSNFNSRGYNNSST
         E A  LLE+MA+N+YQWP+ER+  KK+ AG+ E++ ++AL AQ+ +L++     +     QS E  ++ ++     E + EQVQYV+N N   Y  +  
Subjt:  VENARILLEDMATNSYQWPSERSAPKKISAGVFEVDKVSALQAQITSLANAFMKFSGTGSAQSIE--SAAALASRPQEETIEQVQYVSNFNSRGYNNSST

Query:  PTHYHPNNRNHEKFSYANTKNVL---NPPGFAPQTQENK-KLEDLVGAFIAESSNRTTKLEEAVIAINSTVNGHIAAIKNIETQLGQLVRVVSTMNKGKA
        P +YHP  RNHE  SY NTKNVL   +PPGF  Q  E K  LED + +F+ E++ R  K +  +  I +  +   A +KN+E Q+GQL   ++   +G  
Subjt:  PTHYHPNNRNHEKFSYANTKNVL---NPPGFAPQTQENK-KLEDLVGAFIAESSNRTTKLEEAVIAINSTVNGHIAAIKNIETQLGQLVRVVSTMNKGKA

Query:  LAEQEKTQMEYCKAIIVHQ-EEADEEPESEDYDTPTGE---------AEEDTSSDEAEKPN--------PEPPIPSPTLMVPKEKKKKKKKKNNQVQFDK
         +  E    E CKAI +   +E +  P  E   TPT            EE+  +D  E+ +          PPI +P L  P+  +K+K  K    QF K
Subjt:  LAEQEKTQMEYCKAIIVHQ-EEADEEPESEDYDTPTGE---------AEEDTSSDEAEKPN--------PEPPIPSPTLMVPKEKKKKKKKKNNQVQFDK

Query:  FMNAFMNLNINIPFAEALE-MPQYNRFMKEWSAKKRKEKKVDTVYLASTCSTRVQQKVPEKVADPGSF--------------------------LFLVVL
        F++ F  ++INIPFA+ALE MP Y +F+K+  +KKR+ ++ +TV L+  CS  +Q+K+P+K+ DP SF                           F+   
Subjt:  FMNAFMNLNINIPFAEALE-MPQYNRFMKEWSAKKRKEKKVDTVYLASTCSTRVQQKVPEKVADPGSF--------------------------LFLVVL

Query:  LDIGEIKSTPVKLQLAHQSVVRPVGIVENVLIRVGKFFLPIDLYVMDMIENPSMPIILGRPFLATGRVIIDIERRELTIRVRNEKEIFKAVEDSK
        L +GE+K T + LQLA +S+  P GI+E+VL++V KF  P D  V+DM E+  +P+ILGRPFLATGR +ID+++ ELT+RV  E+ +F   +  K
Subjt:  LDIGEIKSTPVKLQLAHQSVVRPVGIVENVLIRVGKFFLPIDLYVMDMIENPSMPIILGRPFLATGRVIIDIERRELTIRVRNEKEIFKAVEDSK

KAG7947748.1 hypothetical protein I3843_14G109500 [Carya illinoinensis]; e value: 1.2e-115; % identity: 39.14
Query:  MADQNPPEEPRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTEDPNSHLKSFLTF------------------------DKARD
        MA+++    PR ++DY +PV  G  S I+  PINANNFELK  LI M +   + GSP +DPN HL  FL                          DKAR 
Subjt:  MADQNPPEEPRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTEDPNSHLKSFLTF------------------------DKARD

Query:  WLQSITPGSITTWDALVQAFLKKFFPPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPPHGYPDWLQ---------------------------T
        WLQS+ PGSI +W  + + FL KFFPPAKT +LR+EIG F+Q   E L+EAWER+K+L+R+CP HG PDWLQ                           T
Subjt:  WLQSITPGSITTWDALVQAFLKKFFPPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPPHGYPDWLQ---------------------------T

Query:  VENARILLEDMATNSYQWPSERSAPKKISAGVFEVDKVSALQAQITSLANAFMKFSGTGSAQSIE--SAAALASRPQEETIEQVQYVSNFNSRGYNNSST
         E A  LLE+MA+N+YQWP+ER+  KK+ AG+ E++ ++AL AQ+ +L++     +     QS E  ++ ++     E + EQVQYV+N N   Y  +  
Subjt:  VENARILLEDMATNSYQWPSERSAPKKISAGVFEVDKVSALQAQITSLANAFMKFSGTGSAQSIE--SAAALASRPQEETIEQVQYVSNFNSRGYNNSST

Query:  PTHYHPNNRNHEKFSYANTKNVL---NPPGFAPQTQENK-KLEDLVGAFIAESSNRTTKLEEAVIAINSTVNGHIAAIKNIETQLGQLVRVVSTMNKGKA
        P +YHP  RNHE  SY NTKNVL   +PPGF  Q  E K  LED + +F+ E++ R  K +  +  I +  +   A +KN+E Q+GQL   ++   +G  
Subjt:  PTHYHPNNRNHEKFSYANTKNVL---NPPGFAPQTQENK-KLEDLVGAFIAESSNRTTKLEEAVIAINSTVNGHIAAIKNIETQLGQLVRVVSTMNKGKA

Query:  LAEQEKTQMEYCKAIIVHQ-EEADEEPESEDYDTPTGE---------AEEDTSSDEAEKPN--------PEPPIPSPTLMVPKEKKKKKKKKNNQVQFDK
         +  E    E CKAI +   +E +  P  E   TPT            EE+  +D  E+ +          PPI +P L  P+  +K+K  K    QF K
Subjt:  LAEQEKTQMEYCKAIIVHQ-EEADEEPESEDYDTPTGE---------AEEDTSSDEAEKPN--------PEPPIPSPTLMVPKEKKKKKKKKNNQVQFDK

Query:  FMNAFMNLNINIPFAEALE-MPQYNRFMKEWSAKKRKEKKVDTVYLASTCSTRVQQKVPEKVADPGSF---------LFLVVLLD---------------
        F++ F  ++INIPFA+ALE MP Y +F+K+  +KKR+ ++ +TV L+  CS  +Q+K+P+K+ DPGSF          F  VL D               
Subjt:  FMNAFMNLNINIPFAEALE-MPQYNRFMKEWSAKKRKEKKVDTVYLASTCSTRVQQKVPEKVADPGSF---------LFLVVLLD---------------

Query:  --IGEIKSTPVKLQLAHQSVVRPVGIVENVLIRVGKFFLPIDLYVMDMIENPSMPIILGRPFLATGRVIIDIERRELTIRVRNEKEIFKAVEDSK
          +GE+K T + LQLA +S+  P GI+E+VL++V KF  P D  V+DM E+  +P+ILGRPFLATGR +ID+++ ELT+RV  E+ +F   +  K
Subjt:  --IGEIKSTPVKLQLAHQSVVRPVGIVENVLIRVGKFFLPIDLYVMDMIENPSMPIILGRPFLATGRVIIDIERRELTIRVRNEKEIFKAVEDSK

KAG7990634.1 hypothetical protein I3843_02G035100 [Carya illinoinensis]; e value: 3.4e-123; % identity: 39.81
Query:  MRSSKDLILAPLDPEIERTIHRLRRENRENFQMADQNPPEEPRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTEDPNSHLKSF
        MR ++   + P+DPEIERT+  LRR   +   MA+++    PR ++DY +PV  G  S I+  PINANNFELK  LI M +   + GSP +DPN HL  F
Subjt:  MRSSKDLILAPLDPEIERTIHRLRRENRENFQMADQNPPEEPRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTEDPNSHLKSF

Query:  LTF------------------------DKARDWLQSITPGSITTWDALVQAFLKKFFPPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPPHGYP
        L                          DKAR WLQS+ PGSI +W  + + FL KFFPPAKT +LR+EIG F+Q   E L+EAWER+K+L+R+CP HG P
Subjt:  LTF------------------------DKARDWLQSITPGSITTWDALVQAFLKKFFPPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPPHGYP

Query:  DWLQ---------------------------TVENARILLEDMATNSYQWPSERSAPKKISAGVFEVDKVSALQAQITSLANAFMKFSGTGSAQSIE--S
        DWLQ                           T E A  LLE+MA+N+YQWP+ER+  KK+ AG+ +++ ++AL AQ+ +L++     +     QS E  +
Subjt:  DWLQ---------------------------TVENARILLEDMATNSYQWPSERSAPKKISAGVFEVDKVSALQAQITSLANAFMKFSGTGSAQSIE--S

Query:  AAALASRPQEETIEQVQYVSNFNSRGYNNSSTPTHYHPNNRNHEKFSYANTKNVL---NPPGFAPQTQENK-KLEDLVGAFIAESSNRTTKLEEAVIAIN
        + ++     E + EQVQYV+N N   Y  +  P +YHP  RNHE  SY NTKNVL   +PPGF  Q  E K  LED + +F+ E++ R  K +  +  I 
Subjt:  AAALASRPQEETIEQVQYVSNFNSRGYNNSSTPTHYHPNNRNHEKFSYANTKNVL---NPPGFAPQTQENK-KLEDLVGAFIAESSNRTTKLEEAVIAIN

Query:  STVNGHIAAIKNIETQLGQLVRVVSTMNKGKALAEQEKTQMEYCKAIIVHQ-EEADEEPESEDYDTPT----GEAEEDTSSDEAEKPNPE----------
        +  +   AAIKNIE Q+GQL   ++   +G   +  E    E CKAI +   +E +  P  E   TPT    G+++     DE      E          
Subjt:  STVNGHIAAIKNIETQLGQLVRVVSTMNKGKALAEQEKTQMEYCKAIIVHQ-EEADEEPESEDYDTPT----GEAEEDTSSDEAEKPNPE----------

Query:  ---PPIPSPTLMVPKEKKKKKKKKNNQVQFDKFMNAFMNLNINIPFAEALE-MPQYNRFMKEWSAKKRKEKKVDTVYLASTCSTRVQQKVPEKVADPGSF
           PPI +P L  P+  +K+K  K    QF KF++ F  ++INIPFA+ALE MP Y +F+K+  +KKR+ ++ +TV L+  CS  +Q+K+P+K+ DPGSF
Subjt:  ---PPIPSPTLMVPKEKKKKKKKKNNQVQFDKFMNAFMNLNINIPFAEALE-MPQYNRFMKEWSAKKRKEKKVDTVYLASTCSTRVQQKVPEKVADPGSF

Query:  ---------LFLVVLLDIG-----------------EIKSTPVKLQLAHQSVVRPVGIVENVLIRVGKFFLPIDLYVMDMIENPSMPIILGRPFLATGRV
                  F  VL D+G                 E+K T + LQLA +S+  P GI+E+VL++V KF  P D  V+DM E+  +P+ILGRPFLATGR 
Subjt:  ---------LFLVVLLDIG-----------------EIKSTPVKLQLAHQSVVRPVGIVENVLIRVGKFFLPIDLYVMDMIENPSMPIILGRPFLATGRV

Query:  IIDIERRELTIRVRNEKEIFK
        +ID+++ ELT+RV  E+ +FK
Subjt:  IIDIERRELTIRVRNEKEIFK

XP_022843226.1 uncharacterized protein LOC111366761 [Olea europaea var. sylvestris]; e value: 4.6e-112; % identity: 39.69
Query:  MRSSKDLILAPLDPEIERTIHRLRR-ENRENFQMADQ-----NPPEEPRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTEDPN
        MR +++L L  +DPE ERT   LR  +  E   MA+Q     N   + R IRDY +PV     SGI    I A NFELK GLI M +   + G+  EDPN
Subjt:  MRSSKDLILAPLDPEIERTIHRLRR-ENRENFQMADQ-----NPPEEPRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTEDPN

Query:  SHLKSFLTF------------------------DKARDWLQSITPGSITTWDALVQAFLKKFFPPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKC
        +HL SFL                          DKA+ W QS+  GSITTWD L Q FL K+FPP+K+ +LR EI  F+Q   E  +EAWERFK+LLR+C
Subjt:  SHLKSFLTF------------------------DKARDWLQSITPGSITTWDALVQAFLKKFFPPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKC

Query:  PPHGYPDWLQ---------------------------TVENARILLEDMATNSYQWPSERSAPKKISAGVFEVDKVSALQAQITSLANAFMKFSGTGSAQ
        P HG+  W+Q                           T E A  LL+D+ATNSYQWPSERS  KK+ AG+ EVD ++AL AQ+ SL N  +  +  G+ Q
Subjt:  PPHGYPDWLQ---------------------------TVENARILLEDMATNSYQWPSERSAPKKISAGVFEVDKVSALQAQITSLANAFMKFSGTGSAQ

Query:  SIESAAALASRPQEETI--EQVQYVS--NFNSRGYNNSSTPTHYHPNNRNHEKFSYANTKNVLN-PPGFAPQTQENK-KLEDLVGAFIAESSNRTTKLEE
        +++S  + +S  QE  +  EQVQY+   N+N RG   ++   HYHP  RNHE  SY N +N L  PPGF  Q  + K  LED++G FI+E+ +R  K E 
Subjt:  SIESAAALASRPQEETI--EQVQYVS--NFNSRGYNNSSTPTHYHPNNRNHEKFSYANTKNVLN-PPGFAPQTQENK-KLEDLVGAFIAESSNRTTKLEE

Query:  AVIAINSTVNGHIAAIKNIETQLGQLVRVVSTMNKGKALAEQEKTQMEYCKAIIVHQEEADEEPESEDYDTP------TGEAEEDTSSDEAE-----KP-
         +  I + V+   A +KN+E Q+GQL  ++ +  KGK  ++ E    E+C AI +   +  EE + +    P      T E + +    EAE     KP 
Subjt:  AVIAINSTVNGHIAAIKNIETQLGQLVRVVSTMNKGKALAEQEKTQMEYCKAIIVHQEEADEEPESEDYDTP------TGEAEEDTSSDEAE-----KP-

Query:  ----NPEPPIPSPTLMVPKEKKKKKKKKNNQVQFDKFMNAFMNLNINIPFAEAL-EMPQYNRFMKEWSAKKRKEKKVDTVYLASTCSTRVQQKVPEKVAD
               PPI  P L  P+   KKK       QF KF+  F  ++INIPFAE L +MP Y +F+KE  + K+K ++ +T+ L   CS  + QK+P K+ D
Subjt:  ----NPEPPIPSPTLMVPKEKKKKKKKKNNQVQFDKFMNAFMNLNINIPFAEAL-EMPQYNRFMKEWSAKKRKEKKVDTVYLASTCSTRVQQKVPEKVAD

Query:  PGSF--------------------------LFLVVLLDIGEIKSTPVKLQLAHQSVVRPVGIVENVLIRVGKFFLPIDLYVMDMIENPSMPIILGRPFLA
        PGSF                          L +   L +GE+K T + LQLA +S+  P G++E+VL++V KF LP+D  V+DM EN  +P+ILGRPFLA
Subjt:  PGSF--------------------------LFLVVLLDIGEIKSTPVKLQLAHQSVVRPVGIVENVLIRVGKFFLPIDLYVMDMIENPSMPIILGRPFLA

Query:  TGRVIIDI
        TGR +ID+
Subjt:  TGRVIIDI

XP_023874613.1 uncharacterized protein LOC111987139 [Quercus suber]; e value: 1.4e-113; % identity: 38.9
Query:  MADQNPPEEPRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTEDPNSHLKSFLTF------------------------DKARD
        MA+     +PR ++DY +P+     SGI    INANNFELK  LI M +   + GSP +DPN HL  FL                          DKAR 
Subjt:  MADQNPPEEPRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTEDPNSHLKSFLTF------------------------DKARD

Query:  WLQSITPGSITTWDALVQAFLKKFFPPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPPHGYPDWLQ---------------------------T
        WLQS+ PGSIT+W  + + FL KFFPPAKT +LR+EIG F+Q   E L+EAWER+K+L+R CP HG PDWLQ                           T
Subjt:  WLQSITPGSITTWDALVQAFLKKFFPPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPPHGYPDWLQ---------------------------T

Query:  VENARILLEDMATNSYQWPSERSAPKKISAGVFEVDKVSALQAQITSLANAFMKFSGTGSAQSIE--SAAALASRPQEETIEQVQYVSNFNSRGYNNSST
         E A  LLE+MA+N+YQWP+ER+  KK+ AG+ E++  +AL AQ+ SL++     +     Q  E  +A+++     E + EQVQY++N N   Y  +  
Subjt:  VENARILLEDMATNSYQWPSERSAPKKISAGVFEVDKVSALQAQITSLANAFMKFSGTGSAQSIE--SAAALASRPQEETIEQVQYVSNFNSRGYNNSST

Query:  PTHYHPNNRNHEKFSYANTKNVLN-PPGFAPQTQENK-KLEDLVGAFIAESSNRTTKLEEAVIAINSTVNGHIAAIKNIETQLGQLVRVVSTMNKGKALA
        P +YHP  RNHE FSY NTKNVL  PPGF  Q  E K  LED + +F+ E+     K +  +  I +  +   A +KN+E Q+GQL   ++   +G   +
Subjt:  PTHYHPNNRNHEKFSYANTKNVLN-PPGFAPQTQENK-KLEDLVGAFIAESSNRTTKLEEAVIAINSTVNGHIAAIKNIETQLGQLVRVVSTMNKGKALA

Query:  EQEKTQMEYCKAIIVHQ-EEADEEPESEDYDTPTG-------------EAEEDTSSDEAEKPNPEPPIPSPTLMVPKEKKKKKKKKNNQVQFDKFMNAFM
          E    E CKAI +    E +  P  E   TPT              E  EDT  +    P+   P   P L  P    ++ +K+    QF KF++ F 
Subjt:  EQEKTQMEYCKAIIVHQ-EEADEEPESEDYDTPTG-------------EAEEDTSSDEAEKPNPEPPIPSPTLMVPKEKKKKKKKKNNQVQFDKFMNAFM

Query:  NLNINIPFAEALE-MPQYNRFMKEWSAKKRKEKKVDTVYLASTCSTRVQQKVPEKVADPGSF---------LFLVVLLD-----------------IGEI
         ++INIPFA+ALE MP Y +F+K+  +KKR+ ++ +TV L+  CS  +Q+K+P+K+ DPGSF          F  VL D                 +GE+
Subjt:  NLNINIPFAEALE-MPQYNRFMKEWSAKKRKEKKVDTVYLASTCSTRVQQKVPEKVADPGSF---------LFLVVLLD-----------------IGEI

Query:  KSTPVKLQLAHQSVVRPVGIVENVLIRVGKFFLPIDLYVMDMIENPSMPIILGRPFLATGRVIIDIERRELTIRVRNEKEIFKAVEDSK
        K T + LQLA +S+  P GI+E+VL++V KF  P D  V+DM E+  +P+ILGRPFLATGR ++D+++ ELT+RV  E+  F   E  K
Subjt:  KSTPVKLQLAHQSVVRPVGIVENVLIRVGKFFLPIDLYVMDMIENPSMPIILGRPFLATGRVIIDIERRELTIRVRNEKEIFKAVEDSK

TrEMBL top hits (e value, % identity, alignment)
A0A6J0ZX64 LOW QUALITY PROTEIN: uncharacterized protein LOC110412945; e value: 8.5e-96; % identity: 35.88
Query:  MRSSKDLILAPLDPEIERTIHRLRRENRE----NFQMADQN----------PPEEPRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCA-Y
        M+   +L L P DP+IERT  R RREN +    N  MA+ N           PE  R +RDY  P+ QG    I    INANNFE+K   IQM +    +
Subjt:  MRSSKDLILAPLDPEIERTIHRLRRENRE----NFQMADQN----------PPEEPRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCA-Y

Query:  RGSPTEDPNSHLKSFL----TF--------------------DKARDWLQSITPGSITTWDALVQAFLKKFFPPAKTVKLRTEIGTFQQQYDEQLFEAWE
         G P++DPNSHL +FL    TF                    DKA+ WL S+  GSITTW+ L Q FL KFFPPAKT K+R +I +F Q   E L+EAWE
Subjt:  RGSPTEDPNSHLKSFL----TF--------------------DKARDWLQSITPGSITTWDALVQAFLKKFFPPAKTVKLRTEIGTFQQQYDEQLFEAWE

Query:  RFKELLRKCPPHGYPDWLQ---------------------------TVENARILLEDMATNSYQWPSERSAPKKISAGVFEVDKVSALQAQITSLANAFM
        RFKELLR+CP HG PDWLQ                              +A  LLE+MA+N+YQWPSERS  +K + G +E+D +  L  Q+ +L+    
Subjt:  RFKELLRKCPPHGYPDWLQ---------------------------TVENARILLEDMATNSYQWPSERSAPKKISAGVFEVDKVSALQAQITSLANAFM

Query:  KFSGTGSAQSIESAAALASRPQEE--------TIEQVQYVSNFNSRGYNNSSTPTHYHPNNRNHEKFSYANTKNVLN-----PPGF----APQTQENK-K
        K   T    +++++  +     +           E VQ+V NFN R  NN  + T Y+P  RNH  FS++N     N     PPGF     PQ  E K +
Subjt:  KFSGTGSAQSIESAAALASRPQEE--------TIEQVQYVSNFNSRGYNNSSTPTHYHPNNRNHEKFSYANTKNVLN-----PPGF----APQTQENK-K

Query:  LEDLVGAFIAESSNRTTKLEEAVIAINSTVNGHIAAIKNIETQLGQLVRVVSTMNKGKALAEQE--KTQMEYCKAII---------VHQEEADEEPESED
        LE+L+  +I+++              ++ +    A+++N+ETQ+GQL   ++   +G   ++ +      E C+AI          V+Q+  + E E  D
Subjt:  LEDLVGAFIAESSNRTTKLEEAVIAINSTVNGHIAAIKNIETQLGQLVRVVSTMNKGKALAEQE--KTQMEYCKAII---------VHQEEADEEPESED

Query:  YDTPTGEAEEDTSSDEAEKPNPEPPIPSPTLMVPKEKKKKKKKKNNQVQFDKFMNAFMNLNINIPFAEALE-MPQYNRFMKEWSAKKRKEKKVDTVYLAS
         +   G  E +    + +    E    S  +  P    ++ +K+  + QF KF+N F  L+INIPFAEALE MP Y +F+K+  +KKRK  + +TV+L  
Subjt:  YDTPTGEAEEDTSSDEAEKPNPEPPIPSPTLMVPKEKKKKKKKKNNQVQFDKFMNAFMNLNINIPFAEALE-MPQYNRFMKEWSAKKRKEKKVDTVYLAS

Query:  TCSTRVQQKVPEKVADPGSFLFLVVL--------------------------LDIGEIKSTPVKLQLAHQSVVRPVGIVENVLIRVGKFFLPIDLYVMDM
         CS  +Q K+P K+ DPGSF     +                          L +GE K T V LQLA +S V P GI+E+VL++V KF  P+D  ++DM
Subjt:  TCSTRVQQKVPEKVADPGSFLFLVVL--------------------------LDIGEIKSTPVKLQLAHQSVVRPVGIVENVLIRVGKFFLPIDLYVMDM

Query:  IENPSMPIILGRPFLATGRVIIDIERRELTIRVRNEKEIFKAVEDSK
         E+  +PIILGRPFLAT   IID+   +++ +V  E   F     SK
Subjt:  IENPSMPIILGRPFLATGRVIIDIERRELTIRVRNEKEIFKAVEDSK

A0A6J1DU19 uncharacterized protein LOC111024361; e value: 1.7e-83; % identity: 36.72
Query:  IRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTEDPNSHLKSFL------TFDKARDWLQSITPGSITTWD-ALVQAFLKKFFPPAK
        IRDY QP F     GI+  PINANN ELK GLIQM R+  +RG+ TEDPN+HL  FL        +   D    +    ++  D  +VQAFL  FFPPAK
Subjt:  IRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTEDPNSHLKSFL------TFDKARDWLQSITPGSITTWD-ALVQAFLKKFFPPAK

Query:  TVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPPHGYPDWLQ---------------------------TVENARILLEDMATNSYQWPSERSAPKKIS
        T +LRTEI +F++   EQLFE WER+KELLRKCP HG  +WLQ                           T ENA ILL+DMA NS+QWPSERS  KK+ 
Subjt:  TVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPPHGYPDWLQ---------------------------TVENARILLEDMATNSYQWPSERSAPKKIS

Query:  AGVFEVDKVSALQAQITSLANAFMKFSGTGSAQSIESAAALASRP-QEETIEQVQYVSNFNSRGYNNSSTPTHYHPNNRNHEKFSYANTKNVLNPPGFAP
        AG++E+D++S+L+AQ+ +L NA  K SG G++ S E  AA  +    E TIEQ Q+ S                HP                        
Subjt:  AGVFEVDKVSALQAQITSLANAFMKFSGTGSAQSIESAAALASRP-QEETIEQVQYVSNFNSRGYNNSSTPTHYHPNNRNHEKFSYANTKNVLNPPGFAP

Query:  QTQENKKLEDLVGAFIAESSNRTTKLEEAVIAINSTVNGHIAAIKNIETQLGQLVRVVSTMNKGKALAEQEKTQMEYCKAIIVHQEEADEEPESEDYDTP
          ++   LEDL+GAFI E  +R +++E  V  +   + G+  +IKN+E Q+GQ+   ++TM KGK  ++ E    E+CKA+ +   +  +EPE +  + P
Subjt:  QTQENKKLEDLVGAFIAESSNRTTKLEEAVIAINSTVNGHIAAIKNIETQLGQLVRVVSTMNKGKALAEQEKTQMEYCKAIIVHQEEADEEPESEDYDTP

Query:  TGEAEEDTSSDEAEKPNPEPPIPSPTLMVPKEKKKKKKKKNNQVQFDKFMNAFMNLNINIPFAEALE-MPQYNRFMKEWSAKKRKEKKVDTVYLASTCST
            EE  + +E  K        +P L   K          N + + +                ALE MP Y RFMK+    KRK +  +TV L   CS 
Subjt:  TGEAEEDTSSDEAEKPNPEPPIPSPTLMVPKEKKKKKKKKNNQVQFDKFMNAFMNLNINIPFAEALE-MPQYNRFMKEWSAKKRKEKKVDTVYLASTCST

Query:  RVQQKVPEKVADPGSFLFLVVLLDIGEIKSTPVKLQLAHQSVVRPVGIVENVLIRVGKFFLPIDLYVMDMIENPSMPIILGRPFLATGRVIIDIERRELT
         +Q+K+P+K+ DPGSF    +   I           +     + P+G++E+VL++V +   P D  V+   E+  +PIILGR FLATG  +ID++   LT
Subjt:  RVQQKVPEKVADPGSFLFLVVLLDIGEIKSTPVKLQLAHQSVVRPVGIVENVLIRVGKFFLPIDLYVMDMIENPSMPIILGRPFLATGRVIIDIERRELT

Query:  IRVRNEKEIF
        +RV  E  +F
Subjt:  IRVRNEKEIF

A0A6P6XAQ1 Reverse transcriptase; e value: 7.4e-84; % identity: 34.88
Query:  RPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTEDPNSHLKSFLTF------------------------DKARDWLQSITPGSI
        R +RD+  P  QG Q+ IV   +NANNFE+K  LIQM +   Y G+ TEDPNSHL +FL                          DKA+ WLQS  P + 
Subjt:  RPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTEDPNSHLKSFLTF------------------------DKARDWLQSITPGSI

Query:  TTWDALVQAFLKKFFPPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPPHGYPDWL---------------------------QTVENARILLED
        TTWD L +AFL KFFPP KT KLR +I +F QQ  E L+EAWER++EL R+CP HG PDWL                           +T E A+ L+E+
Subjt:  TTWDALVQAFLKKFFPPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPPHGYPDWL---------------------------QTVENARILLED

Query:  MATNSYQWPSERSAPKKISAGVFEVDKVSALQAQITSLANAFMKFSGTGSAQSIESAAALASRPQEE-----TIEQVQYVSNFNSRGYNNSSTPTHYHPN
        MA N+YQW +ER   ++ +AG+ EVD ++ L A++ ++     +  G+ S Q +  A+        +     + EQVQY++N+N    NN  + T Y+P 
Subjt:  MATNSYQWPSERSAPKKISAGVFEVDKVSALQAQITSLANAFMKFSGTGSAQSIESAAALASRPQEE-----TIEQVQYVSNFNSRGYNNSSTPTHYHPN

Query:  NRNHEKFSY---ANTKNVLNPPGFAPQ--TQENKKLEDLVGAFIAESSN-RTTKLEEAVIAINSTVNGHIAAI----KNIETQLGQLVRVVSTMNKGKAL
         RNH  F +    N +  +NPPGF  +    E+K   +L    +A +SN +  KL  A       + G +  +    +N+E QLGQ+   V+  N+G   
Subjt:  NRNHEKFSY---ANTKNVLNPPGFAPQ--TQENKKLEDLVGAFIAESSN-RTTKLEEAVIAINSTVNGHIAAI----KNIETQLGQLVRVVSTMNKGKAL

Query:  AEQEKTQMEYCKAIIVHQEEADEEPESEDYDTPTGEAEEDTSSDEAEKPNPEPPIPSPTLMVPKEKKKKKKKKNNQVQFDKFMNAFMNLNINIPFAEALE
        ++ E    E+ KAI +   +   EP          + E    S+  E                KE+K K+K + N++Q        M     IP      
Subjt:  AEQEKTQMEYCKAIIVHQEEADEEPESEDYDTPTGEAEEDTSSDEAEKPNPEPPIPSPTLMVPKEKKKKKKKKNNQVQFDKFMNAFMNLNINIPFAEALE

Query:  MPQYNRFMKEWSAKKRKEKKVDTVYLASTCSTRVQQKVPEKVADPGSFL---------FLVVLLDIG-----------------EIKSTPVKLQLAHQSV
        +P Y +F+KE   KKRK    +T+ L   CS  +Q K+P K+ DPGSF          F   L D+G                 E+K T + LQLA +S+
Subjt:  MPQYNRFMKEWSAKKRKEKKVDTVYLASTCSTRVQQKVPEKVADPGSFL---------FLVVLLDIG-----------------EIKSTPVKLQLAHQSV

Query:  VRPVGIVENVLIRVGKFFLPIDLYVMDMIENPSMPIILGRPFLATGRVIIDIERRELTIRVRNEKEIF
          P+GI+ENVLI+V KF +P+D  V+DM E+ ++PIILGRPFLAT   IID++R +   ++  E+  F
Subjt:  VRPVGIVENVLIRVGKFFLPIDLYVMDMIENPSMPIILGRPFLATGRVIIDIERRELTIRVRNEKEIF

A0A6P8DD93 uncharacterized protein LOC116206453; e value: 1.6e-86; % identity: 34.54
Query:  MRSSKDLILAPLDPEIERTIHRLRRENREN-----FQMADQNPPEE----PRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTE
        MR S+   L PLDPEIERT+HRLRRENR        +MAD +   +     R +RDY  P   G  S I    I ANNFELK  LIQM +   + G P E
Subjt:  MRSSKDLILAPLDPEIERTIHRLRRENREN-----FQMADQNPPEE----PRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTE

Query:  DPNSHLKSFLTF------------------------DKARDWLQSITPGSITTWDALVQAFLKKFFPPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELL
         P+ H+  FL +                        DKAR W  S+   SITTW  L   FL++FFPPA+T +LR EI  F +   E L+EAWERFKE +
Subjt:  DPNSHLKSFLTF------------------------DKARDWLQSITPGSITTWDALVQAFLKKFFPPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELL

Query:  RKCPPHGYPDWL---------------------------QTVENARILLEDMATNSYQWPSERSAPKKISAGVFEVDKVSALQAQITSLANAFMKFSGTG
        RKCP HG PD L                           +  + A  L+E+MA++++ W +ERS  K   A V ++D ++ L  QI++L     K +   
Subjt:  RKCPPHGYPDWL---------------------------QTVENARILLEDMATNSYQWPSERSAPKKISAGVFEVDKVSALQAQITSLANAFMKFSGTG

Query:  SAQSIESA-AALASRPQ------------EETIEQVQYVSNF---NSRGYNNSSTPTHYHPNNRNHEKFSYANTKNVLN-PPGF--------APQTQENK
        S  + + A   L S P                 EQV +V+NF   N   Y+N+     Y+P  RNH  FS+ N  N L  PPGF        AP  Q   
Subjt:  SAQSIESA-AALASRPQ------------EETIEQVQYVSNF---NSRGYNNSSTPTHYHPNNRNHEKFSYANTKNVLN-PPGF--------APQTQENK

Query:  KLEDLVGAFIAESSNRTTKLEEAVIAINSTVNGHIAAIKNIETQLGQLVRVVSTMNKGKALAEQEKT-------QMEYCKAIIVHQEEADEEPESEDYDT
        ++E+L+ +++ ++              ++ +    A I+N+E Q+ Q+ + +S    G   +  E+         +   K + +   +A  + ES + D 
Subjt:  KLEDLVGAFIAESSNRTTKLEEAVIAINSTVNGHIAAIKNIETQLGQLVRVVSTMNKGKALAEQEKT-------QMEYCKAIIVHQEEADEEPESEDYDT

Query:  PTGEAEEDTSSDEAEKPNPEPPIPSPTLMVPKEKKKKKKKKNNQVQFDKFMNAFMNLNINIPFAEAL-EMPQYNRFMKEWSAKKRKEKKVDTVYLASTCS
           + EE        KP   PP+P P         ++ K++    QF KF++ F  L INIPFAEAL +MP Y RFMK+   KKRK    + V L   CS
Subjt:  PTGEAEEDTSSDEAEKPNPEPPIPSPTLMVPKEKKKKKKKKNNQVQFDKFMNAFMNLNINIPFAEAL-EMPQYNRFMKEWSAKKRKEKKVDTVYLASTCS

Query:  TRVQQ---KVPEKVADPGSFL---------FLVVLLD-----------------IGEIKSTPVKLQLAHQSVVRPVGIVENVLIRVGKFFLPIDLYVMDM
          +Q+    +P K  D GSF          F  VL+D                 +GE K T V LQLA +S+  P GIVENVL++V KF  P+D  V++M
Subjt:  TRVQQ---KVPEKVADPGSFL---------FLVVLLD-----------------IGEIKSTPVKLQLAHQSVVRPVGIVENVLIRVGKFFLPIDLYVMDM

Query:  IENPSMPIILGRPFLATGRVIIDIERRELTIRVRNEKEIFKAVEDSK
         E+  +P+ILGRPFLATG+ +ID+E+ +LT+RV NE+  F   +  K
Subjt:  IENPSMPIILGRPFLATGRVIIDIERRELTIRVRNEKEIFKAVEDSK

A0A6P8DKJ2 uncharacterized protein LOC116204231; e value: 7.9e-86; % identity: 34.4
Query:  MRSSKDLILAPLDPEIERTIHRLRRENREN-----FQMADQNPPEE----PRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTE
        MR S+   L PLDPEIERT+HRLRRENR        +MAD +   +     R +RDY  P   G  S I    I ANNFELK  LIQM +   + G P E
Subjt:  MRSSKDLILAPLDPEIERTIHRLRRENREN-----FQMADQNPPEE----PRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTE

Query:  DPNSHLKSFLTF------------------------DKARDWLQSITPGSITTWDALVQAFLKKFFPPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELL
         P+ H+  FL +                        DKAR W  S+   SITTW  L   FL++FFPPA+T +LR EI  F +   E L+EAWERFKE +
Subjt:  DPNSHLKSFLTF------------------------DKARDWLQSITPGSITTWDALVQAFLKKFFPPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELL

Query:  RKCPPHGYPDWL---------------------------QTVENARILLEDMATNSYQWPSERSAPKKISAGVFEVDKVSALQAQITSLANAFMKFSGTG
        RKCP HG PD L                           +  + A  L+E+MA++++ W +ERS  K   A V ++D ++ L  QI++L     K +   
Subjt:  RKCPPHGYPDWL---------------------------QTVENARILLEDMATNSYQWPSERSAPKKISAGVFEVDKVSALQAQITSLANAFMKFSGTG

Query:  SAQSIESA-AALASRPQ------------EETIEQVQYVSNF---NSRGYNNSSTPTHYHPNNRNHEKFSYANTKNVLN-PPGF--------APQTQENK
        S  + + A   L S P                 EQV +V+NF   N   Y+N+     Y+P  RNH  FS+ N  N L  PPGF        AP  Q   
Subjt:  SAQSIESA-AALASRPQ------------EETIEQVQYVSNF---NSRGYNNSSTPTHYHPNNRNHEKFSYANTKNVLN-PPGF--------APQTQENK

Query:  KLEDLVGAFIAESSNRTTKLEEAVIAINSTVNGHIAAIKNIETQLGQLVRVVSTMNKGKALAEQEKT-------QMEYCKAIIVHQEEADEEPESEDYDT
        ++E+L+ +++ ++              ++ +    A I+N+E Q+ Q+ + +S    G   +  E+         +   K + +   +A  + ES + D 
Subjt:  KLEDLVGAFIAESSNRTTKLEEAVIAINSTVNGHIAAIKNIETQLGQLVRVVSTMNKGKALAEQEKT-------QMEYCKAIIVHQEEADEEPESEDYDT

Query:  PTGEAEEDTSSDEAEKPNPEPPIPSPTLMVPKEKKKKKKKKNNQVQFDKFMNAFMNLNINIPFAEAL-EMPQYNRFMKEWSAKKRKEKKVDTVYLASTCS
           + EE        KP   PP+P P          + K++    QF KF++ F  L INIPFAEAL +MP Y RFMK+   KKRK    + V L   CS
Subjt:  PTGEAEEDTSSDEAEKPNPEPPIPSPTLMVPKEKKKKKKKKNNQVQFDKFMNAFMNLNINIPFAEAL-EMPQYNRFMKEWSAKKRKEKKVDTVYLASTCS

Query:  TRVQQ---KVPEKVADPGSFL---------FLVVLLD-----------------IGEIKSTPVKLQLAHQSVVRPVGIVENVLIRVGKFFLPIDLYVMDM
          +Q+    +P K  D GSF          F  VL+D                 +GE K T + LQLA +S+  P GIVENVL++V KF  P+D  V++M
Subjt:  TRVQQ---KVPEKVADPGSFL---------FLVVLLD-----------------IGEIKSTPVKLQLAHQSVVRPVGIVENVLIRVGKFFLPIDLYVMDM

Query:  IENPSMPIILGRPFLATGRVIIDIERRELTIRVRNEKEIFKAVEDSK
         E+  +P+ILGRPFLATG+ +ID+E+ +LT+RV NE+  F   +  K
Subjt:  IENPSMPIILGRPFLATGRVIIDIERRELTIRVRNEKEIFKAVEDSK

SwissProt top hits
No hits found

Arabidopsis top hits
No hits found

Sequences
CDS sequence
ATGCGTAGTAGTAAAGATTTAATTTTAGCACCATTGGATCCCGAGATAGAAAGAACCATCCATAGGCTTAGAAGGGAGAATAGAGAAAACTTTCAAATGGCTGACCAAAA
TCCACCTGAGGAGCCTAGGCCTATTAGAGATTACTTTCAGCCCGTGTTTCAGGGGCAACAATCGGGGATTGTCTATGCCCCGATTAATGCCAACAACTTTGAGCTGAAGA
CCGGTCTCATTCAGATGGCTCGAGACTGTGCATATAGAGGATCGCCCACTGAGGATCCAAATTCTCATCTTAAATCATTTTTGACATTTGATAAAGCACGAGATTGGTTG
CAATCTATTACCCCTGGGAGCATCACCACCTGGGATGCTTTGGTCCAGGCCTTTTTAAAGAAATTTTTCCCTCCTGCAAAGACGGTCAAGCTGAGGACCGAGATTGGGAC
ATTCCAACAACAATATGATGAGCAGTTGTTCGAGGCCTGGGAGCGATTTAAAGAGTTGTTGAGGAAGTGCCCTCCGCATGGATATCCCGACTGGCTTCAGACCGTGGAAA
ATGCTCGCATACTTCTAGAGGATATGGCCACCAACAGCTATCAGTGGCCATCTGAGCGGTCTGCACCTAAAAAGATTTCTGCTGGAGTGTTTGAGGTTGACAAGGTAAGT
GCACTCCAGGCCCAGATAACCTCCCTTGCTAATGCTTTTATGAAATTTTCAGGTACAGGGAGTGCTCAATCAATTGAATCAGCTGCTGCTTTAGCATCTAGACCTCAGGA
GGAGACCATCGAGCAGGTTCAGTATGTATCAAATTTTAATTCTAGGGGATATAATAATAGTTCTACACCTACACATTATCACCCTAACAATAGGAACCATGAAAAATTCT
CTTATGCAAATACTAAGAATGTTCTTAACCCCCCTGGTTTTGCCCCGCAAACTCAAGAAAATAAAAAGTTAGAAGATCTTGTTGGAGCTTTCATTGCAGAGTCTAGTAAC
AGGACAACCAAATTAGAGGAGGCAGTCATTGCCATCAACTCAACAGTGAATGGCCACATTGCAGCTATAAAGAACATTGAGACTCAGCTGGGACAGTTGGTGAGGGTTGT
GAGCACTATGAATAAAGGTAAGGCCCTAGCTGAGCAAGAGAAAACCCAGATGGAGTACTGTAAGGCAATCATTGTGCACCAGGAGGAAGCTGACGAGGAGCCTGAATCTG
AGGACTATGACACGCCTACAGGGGAAGCTGAGGAGGACACATCATCTGATGAGGCTGAAAAGCCTAACCCTGAGCCTCCTATTCCTTCTCCCACACTGATGGTTCCCAAG
GAAAAGAAAAAGAAAAAGAAGAAAAAGAACAATCAAGTTCAGTTTGATAAATTTATGAATGCTTTTATGAATCTGAATATTAATATTCCTTTTGCAGAGGCATTAGAGAT
GCCCCAATACAACAGGTTTATGAAGGAGTGGTCAGCAAAGAAGCGAAAGGAAAAGAAGGTTGACACTGTATATCTTGCTTCCACATGCAGCACCAGAGTACAACAGAAGG
TACCTGAAAAAGTAGCAGATCCAGGGAGTTTTTTGTTCCTTGTAGTTTTGTTAGACATAGGTGAGATTAAATCTACTCCTGTAAAGCTCCAATTGGCTCATCAATCTGTG
GTTAGACCAGTTGGCATTGTAGAAAATGTTTTAATCAGAGTAGGTAAATTTTTCCTCCCTATTGACTTGTATGTTATGGACATGATAGAAAATCCTTCAATGCCTATCAT
ATTAGGAAGACCATTCCTCGCTACTGGGCGAGTGATTATTGATATTGAGCGCAGGGAGCTCACTATTAGAGTCAGGAACGAAAAAGAAATTTTTAAAGCAGTGGAAGACT
CTAAAGATGAAGTGCTTTTCATGGGTTACAAGAAAGGTGCAAGAAAAAGCACCTCTGTTGGATTCACAGAACAAAAGCCTCCTTGA
mRNA sequence
ATGCGTAGTAGTAAAGATTTAATTTTAGCACCATTGGATCCCGAGATAGAAAGAACCATCCATAGGCTTAGAAGGGAGAATAGAGAAAACTTTCAAATGGCTGACCAAAA
TCCACCTGAGGAGCCTAGGCCTATTAGAGATTACTTTCAGCCCGTGTTTCAGGGGCAACAATCGGGGATTGTCTATGCCCCGATTAATGCCAACAACTTTGAGCTGAAGA
CCGGTCTCATTCAGATGGCTCGAGACTGTGCATATAGAGGATCGCCCACTGAGGATCCAAATTCTCATCTTAAATCATTTTTGACATTTGATAAAGCACGAGATTGGTTG
CAATCTATTACCCCTGGGAGCATCACCACCTGGGATGCTTTGGTCCAGGCCTTTTTAAAGAAATTTTTCCCTCCTGCAAAGACGGTCAAGCTGAGGACCGAGATTGGGAC
ATTCCAACAACAATATGATGAGCAGTTGTTCGAGGCCTGGGAGCGATTTAAAGAGTTGTTGAGGAAGTGCCCTCCGCATGGATATCCCGACTGGCTTCAGACCGTGGAAA
ATGCTCGCATACTTCTAGAGGATATGGCCACCAACAGCTATCAGTGGCCATCTGAGCGGTCTGCACCTAAAAAGATTTCTGCTGGAGTGTTTGAGGTTGACAAGGTAAGT
GCACTCCAGGCCCAGATAACCTCCCTTGCTAATGCTTTTATGAAATTTTCAGGTACAGGGAGTGCTCAATCAATTGAATCAGCTGCTGCTTTAGCATCTAGACCTCAGGA
GGAGACCATCGAGCAGGTTCAGTATGTATCAAATTTTAATTCTAGGGGATATAATAATAGTTCTACACCTACACATTATCACCCTAACAATAGGAACCATGAAAAATTCT
CTTATGCAAATACTAAGAATGTTCTTAACCCCCCTGGTTTTGCCCCGCAAACTCAAGAAAATAAAAAGTTAGAAGATCTTGTTGGAGCTTTCATTGCAGAGTCTAGTAAC
AGGACAACCAAATTAGAGGAGGCAGTCATTGCCATCAACTCAACAGTGAATGGCCACATTGCAGCTATAAAGAACATTGAGACTCAGCTGGGACAGTTGGTGAGGGTTGT
GAGCACTATGAATAAAGGTAAGGCCCTAGCTGAGCAAGAGAAAACCCAGATGGAGTACTGTAAGGCAATCATTGTGCACCAGGAGGAAGCTGACGAGGAGCCTGAATCTG
AGGACTATGACACGCCTACAGGGGAAGCTGAGGAGGACACATCATCTGATGAGGCTGAAAAGCCTAACCCTGAGCCTCCTATTCCTTCTCCCACACTGATGGTTCCCAAG
GAAAAGAAAAAGAAAAAGAAGAAAAAGAACAATCAAGTTCAGTTTGATAAATTTATGAATGCTTTTATGAATCTGAATATTAATATTCCTTTTGCAGAGGCATTAGAGAT
GCCCCAATACAACAGGTTTATGAAGGAGTGGTCAGCAAAGAAGCGAAAGGAAAAGAAGGTTGACACTGTATATCTTGCTTCCACATGCAGCACCAGAGTACAACAGAAGG
TACCTGAAAAAGTAGCAGATCCAGGGAGTTTTTTGTTCCTTGTAGTTTTGTTAGACATAGGTGAGATTAAATCTACTCCTGTAAAGCTCCAATTGGCTCATCAATCTGTG
GTTAGACCAGTTGGCATTGTAGAAAATGTTTTAATCAGAGTAGGTAAATTTTTCCTCCCTATTGACTTGTATGTTATGGACATGATAGAAAATCCTTCAATGCCTATCAT
ATTAGGAAGACCATTCCTCGCTACTGGGCGAGTGATTATTGATATTGAGCGCAGGGAGCTCACTATTAGAGTCAGGAACGAAAAAGAAATTTTTAAAGCAGTGGAAGACT
CTAAAGATGAAGTGCTTTTCATGGGTTACAAGAAAGGTGCAAGAAAAAGCACCTCTGTTGGATTCACAGAACAAAAGCCTCCTTGA
Protein sequence
MRSSKDLILAPLDPEIERTIHRLRRENRENFQMADQNPPEEPRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTEDPNSHLKSFLTFDKARDWL
QSITPGSITTWDALVQAFLKKFFPPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPPHGYPDWLQTVENARILLEDMATNSYQWPSERSAPKKISAGVFEVDKVS
ALQAQITSLANAFMKFSGTGSAQSIESAAALASRPQEETIEQVQYVSNFNSRGYNNSSTPTHYHPNNRNHEKFSYANTKNVLNPPGFAPQTQENKKLEDLVGAFIAESSN
RTTKLEEAVIAINSTVNGHIAAIKNIETQLGQLVRVVSTMNKGKALAEQEKTQMEYCKAIIVHQEEADEEPESEDYDTPTGEAEEDTSSDEAEKPNPEPPIPSPTLMVPK
EKKKKKKKKNNQVQFDKFMNAFMNLNINIPFAEALEMPQYNRFMKEWSAKKRKEKKVDTVYLASTCSTRVQQKVPEKVADPGSFLFLVVLLDIGEIKSTPVKLQLAHQSV
VRPVGIVENVLIRVGKFFLPIDLYVMDMIENPSMPIILGRPFLATGRVIIDIERRELTIRVRNEKEIFKAVEDSKDEVLFMGYKKGARKSTSVGFTEQKPP