CuGenDBv2

Lag0024189 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0024189
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: X8 domain-containing protein
Genome location: chr10:1103612..1106929
RNA-Seq Expression: Lag0024189
Synteny: Lag0024189
Gene Ontology terms:
GO:0006810 - transport (biological process)
GO:0046872 - metal ion binding (molecular function)
InterPro domains: NA


Homology
GenBank top hits (hit; e-value; % identity; alignment)
KAG6584413.1 hypothetical protein SDJN03_20345, partial [Cucurbita argyrosperma subsp. sororia]; e-value 4.6e-219; identity 73.11%
Query:  MPSKSHDVPKFWLCGNCTSNEAKSP-DSGLQVQPKMLRHSKTSKVKFLPTEEVIKLSSGGIKGPSKLNTTLAPQRTSKPKIGFESSMPRPLFQVSKESQE
        MP+KS DVPK WLCGNCT +EAKSP DSG  VQPK+LRH+KT KVKFLPTEEV KLSSGG+KGPSKLN    PQRT K +  FESS+PRP FQ SKESQE
Subjt:  MPSKSHDVPKFWLCGNCTSNEAKSP-DSGLQVQPKMLRHSKTSKVKFLPTEEVIKLSSGGIKGPSKLNTTLAPQRTSKPKIGFESSMPRPLFQVSKESQE

Query:  RISATMPPKVCGAKKQALATCLPPVPVGPVKTLKKVKV---DLYA-CTSASRDGLPVTNTGKEVPSPSTKLEDM--------------------------
        R S+T+PP +CG KKQAL TCLP +PV PV+TLKK KV   D+ A  +S SR GLPVT TGKEVPSPSTKLEDM                          
Subjt:  RISATMPPKVCGAKKQALATCLPPVPVGPVKTLKKVKV---DLYA-CTSASRDGLPVTNTGKEVPSPSTKLEDM--------------------------

Query:  --------QKKMRDALMTEEIHAYRDYLPSLHASWKGGFQFVDTHMVGDFYDGFLAKPPCLVHGRAYELSRKIPPILQVKLLSRSDIWDDLFHDKCPVLG
                QK+M+ ALMT+EIHAYRD LPSLHASWKGGFQF+DT M G+FYDGFLAKPPC+VHGRAYELSRKIPPILQVKLLSRSDIWD+LFHD+CP L 
Subjt:  --------QKKMRDALMTEEIHAYRDYLPSLHASWKGGFQFVDTHMVGDFYDGFLAKPPCLVHGRAYELSRKIPPILQVKLLSRSDIWDDLFHDKCPVLG

Query:  DIALYFFPSGNIERSRKNNSCLFELMEREDLLIRSLIDGAELVVFTSRQLDLGSQYVVNMLNADCLLFGVFRAIEDSQSPFHNLRESTAAVPMLEYGSAV
        DIALYFFP+ NIERSR+NNS LFELMEREDLLIRSLIDGAELVVFT RQLDL SQ+ +NML+A CLLFGVFRAI+  QS           VPMLEYGSAV
Subjt:  DIALYFFPSGNIERSRKNNSCLFELMEREDLLIRSLIDGAELVVFTSRQLDLGSQYVVNMLNADCLLFGVFRAIEDSQSPFHNLRESTAAVPMLEYGSAV

Query:  SSVECASGVTLMECTPKGHGKHDEGNAVNKEIDIMGGNTTGKSPTASKDVDSTIQRLLLEFGSQQPRESDVNALTKNVQIKNQEPAPSSAIGSYSLSQSK
        SSV   S V L+E TPKGHGKHDE NAV + IDI GGNTTGKSPTA KDVDSTIQRLLLEFGSQ+PRESDVNALT   QIK+QEPAP +A GSYSLSQSK
Subjt:  SSVECASGVTLMECTPKGHGKHDEGNAVNKEIDIMGGNTTGKSPTASKDVDSTIQRLLLEFGSQQPRESDVNALTKNVQIKNQEPAPSSAIGSYSLSQSK

Query:  VKTEPLPDIRVEGSDKRKCLESEHGSRMAPTFSIDGSQNRTGLSDQDVPKRVADKYLQIFNAGIKKERR
        VK EP+P I+ E  D RKCLE+EH SRMAPTFSIDGSQNRTGL+DQDVPKRVADKYLQIFNAGIKKERR
Subjt:  VKTEPLPDIRVEGSDKRKCLESEHGSRMAPTFSIDGSQNRTGLSDQDVPKRVADKYLQIFNAGIKKERR

XP_022923683.1 uncharacterized protein LOC111431323 isoform X1 [Cucurbita moschata]; e-value 5.6e-217; identity 72.55%
Query:  MPSKSHDVPKFWLCGNCTSNEAKSP-DSGLQVQPKMLRHSKTSKVKFLPTEEVIKLSSGGIKGPSKLNTTLAPQRTSKPKIGFESSMPRPLFQVSKESQE
        MP+KS DVPK WLCGNCT +EAKSP DSG  VQPKMLRH+KT KVKFLPTEEV KLSSGG+KGPSKLN    PQRT K +  FESS+PRP FQ SKESQE
Subjt:  MPSKSHDVPKFWLCGNCTSNEAKSP-DSGLQVQPKMLRHSKTSKVKFLPTEEVIKLSSGGIKGPSKLNTTLAPQRTSKPKIGFESSMPRPLFQVSKESQE

Query:  RISATMPPKVCGAKKQALATCLPPVPVGPVKTLKKVKV---DLYA-CTSASRDGLPVTNTGKEVPSPSTKLEDMQKKMRD--------------------
        R SAT+PP +CG KKQAL TCLP +PV PV+TLKK KV   D+ A  +S SR GLPVT TGKEVPSPSTKLEDMQK+ ++                    
Subjt:  RISATMPPKVCGAKKQALATCLPPVPVGPVKTLKKVKV---DLYA-CTSASRDGLPVTNTGKEVPSPSTKLEDMQKKMRD--------------------

Query:  --------------ALMTEEIHAYRDYLPSLHASWKGGFQFVDTHMVGDFYDGFLAKPPCLVHGRAYELSRKIPPILQVKLLSRSDIWDDLFHDKCPVLG
                      ALMT+EIHAYRD LPSLHASWKGGFQF+D  M G+FYDGFLAKPPC+VHGRAYELSRKIPPILQVKLLSRSDIWD+LFHD+CP L 
Subjt:  --------------ALMTEEIHAYRDYLPSLHASWKGGFQFVDTHMVGDFYDGFLAKPPCLVHGRAYELSRKIPPILQVKLLSRSDIWDDLFHDKCPVLG

Query:  DIALYFFPSGNIERSRKNNSCLFELMEREDLLIRSLIDGAELVVFTSRQLDLGSQYV---VNMLNADCLLFGVFRAIEDSQSPFHNLRESTAAVPMLEYG
        DIALYFFP+ NIERSRKN+S LFELMEREDLLIRSLIDGAELVVFT RQLDL SQ +   +NML+A CLLFGVFRAI+  QS           VPMLEYG
Subjt:  DIALYFFPSGNIERSRKNNSCLFELMEREDLLIRSLIDGAELVVFTSRQLDLGSQYV---VNMLNADCLLFGVFRAIEDSQSPFHNLRESTAAVPMLEYG

Query:  SAVSSVECASGVTLMECTPKGHGKHDEGNAVNKEIDIMGGNTTGKSPTASKDVDSTIQRLLLEFGSQQPRESDVNALTKNVQIKNQEPAPSSAIGSYSLS
        SAVSSVE  S V L+E TPKGHGKHDE NAV +  DI GGNTTGKSPTA KDVDSTIQRLLLEFGSQ+ RESDVNALT   QIK+QEPAP +A GSYSLS
Subjt:  SAVSSVECASGVTLMECTPKGHGKHDEGNAVNKEIDIMGGNTTGKSPTASKDVDSTIQRLLLEFGSQQPRESDVNALTKNVQIKNQEPAPSSAIGSYSLS

Query:  QSKVKTEPLPDIRVEGSDKRKCLESEHGSRMAPTFSIDGSQNRTGLSDQDVPKRVADKYLQIFNAGIKKERR
        QSKVK EP+P I+ E SD RKCLE+EH SRMAPTFSIDGSQNRTGL+DQDVP+RVADKYLQIFNAGIKKERR
Subjt:  QSKVKTEPLPDIRVEGSDKRKCLESEHGSRMAPTFSIDGSQNRTGLSDQDVPKRVADKYLQIFNAGIKKERR

XP_022923685.1 uncharacterized protein LOC111431323 isoform X2 [Cucurbita moschata]; e-value 8.6e-218; identity 72.93%
Query:  MPSKSHDVPKFWLCGNCTSNEAKSP-DSGLQVQPKMLRHSKTSKVKFLPTEEVIKLSSGGIKGPSKLNTTLAPQRTSKPKIGFESSMPRPLFQVSKESQE
        MP+KS DVPK WLCGNCT +EAKSP DSG  VQPKMLRH+KT KVKFLPTEEV KLSSGG+KGPSKLN    PQRT K +  FESS+PRP FQ SKESQE
Subjt:  MPSKSHDVPKFWLCGNCTSNEAKSP-DSGLQVQPKMLRHSKTSKVKFLPTEEVIKLSSGGIKGPSKLNTTLAPQRTSKPKIGFESSMPRPLFQVSKESQE

Query:  RISATMPPKVCGAKKQALATCLPPVPVGPVKTLKKVKV---DLYA-CTSASRDGLPVTNTGKEVPSPSTKLEDMQKKMRD--------------------
        R SAT+PP +CG KKQAL TCLP +PV PV+TLKK KV   D+ A  +S SR GLPVT TGKEVPSPSTKLEDMQK+ ++                    
Subjt:  RISATMPPKVCGAKKQALATCLPPVPVGPVKTLKKVKV---DLYA-CTSASRDGLPVTNTGKEVPSPSTKLEDMQKKMRD--------------------

Query:  --------------ALMTEEIHAYRDYLPSLHASWKGGFQFVDTHMVGDFYDGFLAKPPCLVHGRAYELSRKIPPILQVKLLSRSDIWDDLFHDKCPVLG
                      ALMT+EIHAYRD LPSLHASWKGGFQF+D  M G+FYDGFLAKPPC+VHGRAYELSRKIPPILQVKLLSRSDIWD+LFHD+CP L 
Subjt:  --------------ALMTEEIHAYRDYLPSLHASWKGGFQFVDTHMVGDFYDGFLAKPPCLVHGRAYELSRKIPPILQVKLLSRSDIWDDLFHDKCPVLG

Query:  DIALYFFPSGNIERSRKNNSCLFELMEREDLLIRSLIDGAELVVFTSRQLDLGSQYVVNMLNADCLLFGVFRAIEDSQSPFHNLRESTAAVPMLEYGSAV
        DIALYFFP+ NIERSRKN+S LFELMEREDLLIRSLIDGAELVVFT RQLDL SQ+ +NML+A CLLFGVFRAI+  QS           VPMLEYGSAV
Subjt:  DIALYFFPSGNIERSRKNNSCLFELMEREDLLIRSLIDGAELVVFTSRQLDLGSQYVVNMLNADCLLFGVFRAIEDSQSPFHNLRESTAAVPMLEYGSAV

Query:  SSVECASGVTLMECTPKGHGKHDEGNAVNKEIDIMGGNTTGKSPTASKDVDSTIQRLLLEFGSQQPRESDVNALTKNVQIKNQEPAPSSAIGSYSLSQSK
        SSVE  S V L+E TPKGHGKHDE NAV +  DI GGNTTGKSPTA KDVDSTIQRLLLEFGSQ+ RESDVNALT   QIK+QEPAP +A GSYSLSQSK
Subjt:  SSVECASGVTLMECTPKGHGKHDEGNAVNKEIDIMGGNTTGKSPTASKDVDSTIQRLLLEFGSQQPRESDVNALTKNVQIKNQEPAPSSAIGSYSLSQSK

Query:  VKTEPLPDIRVEGSDKRKCLESEHGSRMAPTFSIDGSQNRTGLSDQDVPKRVADKYLQIFNAGIKKERR
        VK EP+P I+ E SD RKCLE+EH SRMAPTFSIDGSQNRTGL+DQDVP+RVADKYLQIFNAGIKKERR
Subjt:  VKTEPLPDIRVEGSDKRKCLESEHGSRMAPTFSIDGSQNRTGLSDQDVPKRVADKYLQIFNAGIKKERR

XP_022923686.1 uncharacterized protein LOC111431323 isoform X3 [Cucurbita moschata]; e-value 5.6e-217; identity 72.55%
Query:  MPSKSHDVPKFWLCGNCTSNEAKSP-DSGLQVQPKMLRHSKTSKVKFLPTEEVIKLSSGGIKGPSKLNTTLAPQRTSKPKIGFESSMPRPLFQVSKESQE
        MP+KS DVPK WLCGNCT +EAKSP DSG  VQPKMLRH+KT KVKFLPTEEV KLSSGG+KGPSKLN    PQRT K +  FESS+PRP FQ SKESQE
Subjt:  MPSKSHDVPKFWLCGNCTSNEAKSP-DSGLQVQPKMLRHSKTSKVKFLPTEEVIKLSSGGIKGPSKLNTTLAPQRTSKPKIGFESSMPRPLFQVSKESQE

Query:  RISATMPPKVCGAKKQALATCLPPVPVGPVKTLKKVKV---DLYA-CTSASRDGLPVTNTGKEVPSPSTKLEDMQKKMRD--------------------
        R SAT+PP +CG KKQAL TCLP +PV PV+TLKK KV   D+ A  +S SR GLPVT TGKEVPSPSTKLEDMQK+ ++                    
Subjt:  RISATMPPKVCGAKKQALATCLPPVPVGPVKTLKKVKV---DLYA-CTSASRDGLPVTNTGKEVPSPSTKLEDMQKKMRD--------------------

Query:  --------------ALMTEEIHAYRDYLPSLHASWKGGFQFVDTHMVGDFYDGFLAKPPCLVHGRAYELSRKIPPILQVKLLSRSDIWDDLFHDKCPVLG
                      ALMT+EIHAYRD LPSLHASWKGGFQF+D  M G+FYDGFLAKPPC+VHGRAYELSRKIPPILQVKLLSRSDIWD+LFHD+CP L 
Subjt:  --------------ALMTEEIHAYRDYLPSLHASWKGGFQFVDTHMVGDFYDGFLAKPPCLVHGRAYELSRKIPPILQVKLLSRSDIWDDLFHDKCPVLG

Query:  DIALYFFPSGNIERSRKNNSCLFELMEREDLLIRSLIDGAELVVFTSRQLDLGSQYV---VNMLNADCLLFGVFRAIEDSQSPFHNLRESTAAVPMLEYG
        DIALYFFP+ NIERSRKN+S LFELMEREDLLIRSLIDGAELVVFT RQLDL SQ +   +NML+A CLLFGVFRAI+  QS           VPMLEYG
Subjt:  DIALYFFPSGNIERSRKNNSCLFELMEREDLLIRSLIDGAELVVFTSRQLDLGSQYV---VNMLNADCLLFGVFRAIEDSQSPFHNLRESTAAVPMLEYG

Query:  SAVSSVECASGVTLMECTPKGHGKHDEGNAVNKEIDIMGGNTTGKSPTASKDVDSTIQRLLLEFGSQQPRESDVNALTKNVQIKNQEPAPSSAIGSYSLS
        SAVSSVE  S V L+E TPKGHGKHDE NAV +  DI GGNTTGKSPTA KDVDSTIQRLLLEFGSQ+ RESDVNALT   QIK+QEPAP +A GSYSLS
Subjt:  SAVSSVECASGVTLMECTPKGHGKHDEGNAVNKEIDIMGGNTTGKSPTASKDVDSTIQRLLLEFGSQQPRESDVNALTKNVQIKNQEPAPSSAIGSYSLS

Query:  QSKVKTEPLPDIRVEGSDKRKCLESEHGSRMAPTFSIDGSQNRTGLSDQDVPKRVADKYLQIFNAGIKKERR
        QSKVK EP+P I+ E SD RKCLE+EH SRMAPTFSIDGSQNRTGL+DQDVP+RVADKYLQIFNAGIKKERR
Subjt:  QSKVKTEPLPDIRVEGSDKRKCLESEHGSRMAPTFSIDGSQNRTGLSDQDVPKRVADKYLQIFNAGIKKERR

XP_023001152.1 uncharacterized protein LOC111495374 isoform X4 [Cucurbita maxima]; e-value 1.7e-218; identity 76.21%
Query:  MPSKSHDVPKFWLCGNCTSNEAKSP-DSGLQVQPKMLRHSKTSKVKFLPTEEVIKLSSGGIKGPSKLNTTLAPQRTSKPKIGFESSMPRPLFQVSKESQE
        M +KS +VPK WLCGNCT +EAKSP DSG  VQPKM RH+KT KVKFLPTEEV KLSSG +KGPSKLN    PQRT K +  FESS+PRP FQ SKESQE
Subjt:  MPSKSHDVPKFWLCGNCTSNEAKSP-DSGLQVQPKMLRHSKTSKVKFLPTEEVIKLSSGGIKGPSKLNTTLAPQRTSKPKIGFESSMPRPLFQVSKESQE

Query:  RISATMPPKVCGAKKQALATCLPPVPVGPVKTLKKVKV---DLYA-CTSASRDGLPVTNTGKEVPSPSTKLEDMQKKMRDALMTEEIHAYRDYLPSLHAS
        R SAT+PP +CG KKQALATCLP +PV PV+TLKK KV   D+ A  +S SR G PVT TGKEVPSP TKL+D QK+M+ ALMT+EIHAYRD LPSLHAS
Subjt:  RISATMPPKVCGAKKQALATCLPPVPVGPVKTLKKVKV---DLYA-CTSASRDGLPVTNTGKEVPSPSTKLEDMQKKMRDALMTEEIHAYRDYLPSLHAS

Query:  WKGGFQFVDTHMVGDFYDGFLAKPPCLVHGRAYELSRKIPPILQVKLLSRSDIWDDLFHDKCPVLGDIALYFFPSGNIERSRKNNSCLFELMEREDLLIR
        WKGGFQF+DT M G+FYDGFLAKPPC+VHGRAYELSRKIPPILQVKLLSRSDIWD+LFHD+CP L DIALYFFP+ N ERSRKNNS LFELMEREDLLIR
Subjt:  WKGGFQFVDTHMVGDFYDGFLAKPPCLVHGRAYELSRKIPPILQVKLLSRSDIWDDLFHDKCPVLGDIALYFFPSGNIERSRKNNSCLFELMEREDLLIR

Query:  SLIDGAELVVFTSRQLDLGSQYV---VNMLNADCLLFGVFRAIEDSQSPFHNLRESTAAVPMLEYGSAVSSVECASGVTLMECTPKGHGKHDEGNAVNKE
        SLIDGAELVVFT RQLDL SQ +   +NML+A CLLFGVFRAI+  +S           VPMLEYGSAVSSVE  S V L+E TPKGHGKHDE NAV + 
Subjt:  SLIDGAELVVFTSRQLDLGSQYV---VNMLNADCLLFGVFRAIEDSQSPFHNLRESTAAVPMLEYGSAVSSVECASGVTLMECTPKGHGKHDEGNAVNKE

Query:  IDIMGGNTTGKSPTASKDVDSTIQRLLLEFGSQQPRESDVNALTKNVQIKNQEPAPSSAIGSYSLSQSKVKTEPLPDIRVEGSDKRKCLESEHGSRMAPT
        IDI GGNTTGKSPTA KDVDSTI+RLLLEFGSQ+PRESDVNALT   QIK+QEPAP +A G YSLSQSKVK EP+P I+ E SD RKCLE+EH SRMAPT
Subjt:  IDIMGGNTTGKSPTASKDVDSTIQRLLLEFGSQQPRESDVNALTKNVQIKNQEPAPSSAIGSYSLSQSKVKTEPLPDIRVEGSDKRKCLESEHGSRMAPT

Query:  FSIDGSQNRTGLSDQDVPKRVADKYLQIFNAGIKKERR
        FSIDGSQNRTGL+DQDVPKRVADKYLQIFNAGIKKERR
Subjt:  FSIDGSQNRTGLSDQDVPKRVADKYLQIFNAGIKKERR

TrEMBL top hits (hit; e-value; % identity; alignment)
A0A6J1E6T7 uncharacterized protein LOC111431323 isoform X3; e-value 2.7e-217; identity 72.55%
Query:  MPSKSHDVPKFWLCGNCTSNEAKSP-DSGLQVQPKMLRHSKTSKVKFLPTEEVIKLSSGGIKGPSKLNTTLAPQRTSKPKIGFESSMPRPLFQVSKESQE
        MP+KS DVPK WLCGNCT +EAKSP DSG  VQPKMLRH+KT KVKFLPTEEV KLSSGG+KGPSKLN    PQRT K +  FESS+PRP FQ SKESQE
Subjt:  MPSKSHDVPKFWLCGNCTSNEAKSP-DSGLQVQPKMLRHSKTSKVKFLPTEEVIKLSSGGIKGPSKLNTTLAPQRTSKPKIGFESSMPRPLFQVSKESQE

Query:  RISATMPPKVCGAKKQALATCLPPVPVGPVKTLKKVKV---DLYA-CTSASRDGLPVTNTGKEVPSPSTKLEDMQKKMRD--------------------
        R SAT+PP +CG KKQAL TCLP +PV PV+TLKK KV   D+ A  +S SR GLPVT TGKEVPSPSTKLEDMQK+ ++                    
Subjt:  RISATMPPKVCGAKKQALATCLPPVPVGPVKTLKKVKV---DLYA-CTSASRDGLPVTNTGKEVPSPSTKLEDMQKKMRD--------------------

Query:  --------------ALMTEEIHAYRDYLPSLHASWKGGFQFVDTHMVGDFYDGFLAKPPCLVHGRAYELSRKIPPILQVKLLSRSDIWDDLFHDKCPVLG
                      ALMT+EIHAYRD LPSLHASWKGGFQF+D  M G+FYDGFLAKPPC+VHGRAYELSRKIPPILQVKLLSRSDIWD+LFHD+CP L 
Subjt:  --------------ALMTEEIHAYRDYLPSLHASWKGGFQFVDTHMVGDFYDGFLAKPPCLVHGRAYELSRKIPPILQVKLLSRSDIWDDLFHDKCPVLG

Query:  DIALYFFPSGNIERSRKNNSCLFELMEREDLLIRSLIDGAELVVFTSRQLDLGSQYV---VNMLNADCLLFGVFRAIEDSQSPFHNLRESTAAVPMLEYG
        DIALYFFP+ NIERSRKN+S LFELMEREDLLIRSLIDGAELVVFT RQLDL SQ +   +NML+A CLLFGVFRAI+  QS           VPMLEYG
Subjt:  DIALYFFPSGNIERSRKNNSCLFELMEREDLLIRSLIDGAELVVFTSRQLDLGSQYV---VNMLNADCLLFGVFRAIEDSQSPFHNLRESTAAVPMLEYG

Query:  SAVSSVECASGVTLMECTPKGHGKHDEGNAVNKEIDIMGGNTTGKSPTASKDVDSTIQRLLLEFGSQQPRESDVNALTKNVQIKNQEPAPSSAIGSYSLS
        SAVSSVE  S V L+E TPKGHGKHDE NAV +  DI GGNTTGKSPTA KDVDSTIQRLLLEFGSQ+ RESDVNALT   QIK+QEPAP +A GSYSLS
Subjt:  SAVSSVECASGVTLMECTPKGHGKHDEGNAVNKEIDIMGGNTTGKSPTASKDVDSTIQRLLLEFGSQQPRESDVNALTKNVQIKNQEPAPSSAIGSYSLS

Query:  QSKVKTEPLPDIRVEGSDKRKCLESEHGSRMAPTFSIDGSQNRTGLSDQDVPKRVADKYLQIFNAGIKKERR
        QSKVK EP+P I+ E SD RKCLE+EH SRMAPTFSIDGSQNRTGL+DQDVP+RVADKYLQIFNAGIKKERR
Subjt:  QSKVKTEPLPDIRVEGSDKRKCLESEHGSRMAPTFSIDGSQNRTGLSDQDVPKRVADKYLQIFNAGIKKERR

A0A6J1EAB4 uncharacterized protein LOC111431323 isoform X1; e-value 2.7e-217; identity 72.55%
Query:  MPSKSHDVPKFWLCGNCTSNEAKSP-DSGLQVQPKMLRHSKTSKVKFLPTEEVIKLSSGGIKGPSKLNTTLAPQRTSKPKIGFESSMPRPLFQVSKESQE
        MP+KS DVPK WLCGNCT +EAKSP DSG  VQPKMLRH+KT KVKFLPTEEV KLSSGG+KGPSKLN    PQRT K +  FESS+PRP FQ SKESQE
Subjt:  MPSKSHDVPKFWLCGNCTSNEAKSP-DSGLQVQPKMLRHSKTSKVKFLPTEEVIKLSSGGIKGPSKLNTTLAPQRTSKPKIGFESSMPRPLFQVSKESQE

Query:  RISATMPPKVCGAKKQALATCLPPVPVGPVKTLKKVKV---DLYA-CTSASRDGLPVTNTGKEVPSPSTKLEDMQKKMRD--------------------
        R SAT+PP +CG KKQAL TCLP +PV PV+TLKK KV   D+ A  +S SR GLPVT TGKEVPSPSTKLEDMQK+ ++                    
Subjt:  RISATMPPKVCGAKKQALATCLPPVPVGPVKTLKKVKV---DLYA-CTSASRDGLPVTNTGKEVPSPSTKLEDMQKKMRD--------------------

Query:  --------------ALMTEEIHAYRDYLPSLHASWKGGFQFVDTHMVGDFYDGFLAKPPCLVHGRAYELSRKIPPILQVKLLSRSDIWDDLFHDKCPVLG
                      ALMT+EIHAYRD LPSLHASWKGGFQF+D  M G+FYDGFLAKPPC+VHGRAYELSRKIPPILQVKLLSRSDIWD+LFHD+CP L 
Subjt:  --------------ALMTEEIHAYRDYLPSLHASWKGGFQFVDTHMVGDFYDGFLAKPPCLVHGRAYELSRKIPPILQVKLLSRSDIWDDLFHDKCPVLG

Query:  DIALYFFPSGNIERSRKNNSCLFELMEREDLLIRSLIDGAELVVFTSRQLDLGSQYV---VNMLNADCLLFGVFRAIEDSQSPFHNLRESTAAVPMLEYG
        DIALYFFP+ NIERSRKN+S LFELMEREDLLIRSLIDGAELVVFT RQLDL SQ +   +NML+A CLLFGVFRAI+  QS           VPMLEYG
Subjt:  DIALYFFPSGNIERSRKNNSCLFELMEREDLLIRSLIDGAELVVFTSRQLDLGSQYV---VNMLNADCLLFGVFRAIEDSQSPFHNLRESTAAVPMLEYG

Query:  SAVSSVECASGVTLMECTPKGHGKHDEGNAVNKEIDIMGGNTTGKSPTASKDVDSTIQRLLLEFGSQQPRESDVNALTKNVQIKNQEPAPSSAIGSYSLS
        SAVSSVE  S V L+E TPKGHGKHDE NAV +  DI GGNTTGKSPTA KDVDSTIQRLLLEFGSQ+ RESDVNALT   QIK+QEPAP +A GSYSLS
Subjt:  SAVSSVECASGVTLMECTPKGHGKHDEGNAVNKEIDIMGGNTTGKSPTASKDVDSTIQRLLLEFGSQQPRESDVNALTKNVQIKNQEPAPSSAIGSYSLS

Query:  QSKVKTEPLPDIRVEGSDKRKCLESEHGSRMAPTFSIDGSQNRTGLSDQDVPKRVADKYLQIFNAGIKKERR
        QSKVK EP+P I+ E SD RKCLE+EH SRMAPTFSIDGSQNRTGL+DQDVP+RVADKYLQIFNAGIKKERR
Subjt:  QSKVKTEPLPDIRVEGSDKRKCLESEHGSRMAPTFSIDGSQNRTGLSDQDVPKRVADKYLQIFNAGIKKERR

A0A6J1ECK6 uncharacterized protein LOC111431323 isoform X2; e-value 4.2e-218; identity 72.93%
Query:  MPSKSHDVPKFWLCGNCTSNEAKSP-DSGLQVQPKMLRHSKTSKVKFLPTEEVIKLSSGGIKGPSKLNTTLAPQRTSKPKIGFESSMPRPLFQVSKESQE
        MP+KS DVPK WLCGNCT +EAKSP DSG  VQPKMLRH+KT KVKFLPTEEV KLSSGG+KGPSKLN    PQRT K +  FESS+PRP FQ SKESQE
Subjt:  MPSKSHDVPKFWLCGNCTSNEAKSP-DSGLQVQPKMLRHSKTSKVKFLPTEEVIKLSSGGIKGPSKLNTTLAPQRTSKPKIGFESSMPRPLFQVSKESQE

Query:  RISATMPPKVCGAKKQALATCLPPVPVGPVKTLKKVKV---DLYA-CTSASRDGLPVTNTGKEVPSPSTKLEDMQKKMRD--------------------
        R SAT+PP +CG KKQAL TCLP +PV PV+TLKK KV   D+ A  +S SR GLPVT TGKEVPSPSTKLEDMQK+ ++                    
Subjt:  RISATMPPKVCGAKKQALATCLPPVPVGPVKTLKKVKV---DLYA-CTSASRDGLPVTNTGKEVPSPSTKLEDMQKKMRD--------------------

Query:  --------------ALMTEEIHAYRDYLPSLHASWKGGFQFVDTHMVGDFYDGFLAKPPCLVHGRAYELSRKIPPILQVKLLSRSDIWDDLFHDKCPVLG
                      ALMT+EIHAYRD LPSLHASWKGGFQF+D  M G+FYDGFLAKPPC+VHGRAYELSRKIPPILQVKLLSRSDIWD+LFHD+CP L 
Subjt:  --------------ALMTEEIHAYRDYLPSLHASWKGGFQFVDTHMVGDFYDGFLAKPPCLVHGRAYELSRKIPPILQVKLLSRSDIWDDLFHDKCPVLG

Query:  DIALYFFPSGNIERSRKNNSCLFELMEREDLLIRSLIDGAELVVFTSRQLDLGSQYVVNMLNADCLLFGVFRAIEDSQSPFHNLRESTAAVPMLEYGSAV
        DIALYFFP+ NIERSRKN+S LFELMEREDLLIRSLIDGAELVVFT RQLDL SQ+ +NML+A CLLFGVFRAI+  QS           VPMLEYGSAV
Subjt:  DIALYFFPSGNIERSRKNNSCLFELMEREDLLIRSLIDGAELVVFTSRQLDLGSQYVVNMLNADCLLFGVFRAIEDSQSPFHNLRESTAAVPMLEYGSAV

Query:  SSVECASGVTLMECTPKGHGKHDEGNAVNKEIDIMGGNTTGKSPTASKDVDSTIQRLLLEFGSQQPRESDVNALTKNVQIKNQEPAPSSAIGSYSLSQSK
        SSVE  S V L+E TPKGHGKHDE NAV +  DI GGNTTGKSPTA KDVDSTIQRLLLEFGSQ+ RESDVNALT   QIK+QEPAP +A GSYSLSQSK
Subjt:  SSVECASGVTLMECTPKGHGKHDEGNAVNKEIDIMGGNTTGKSPTASKDVDSTIQRLLLEFGSQQPRESDVNALTKNVQIKNQEPAPSSAIGSYSLSQSK

Query:  VKTEPLPDIRVEGSDKRKCLESEHGSRMAPTFSIDGSQNRTGLSDQDVPKRVADKYLQIFNAGIKKERR
        VK EP+P I+ E SD RKCLE+EH SRMAPTFSIDGSQNRTGL+DQDVP+RVADKYLQIFNAGIKKERR
Subjt:  VKTEPLPDIRVEGSDKRKCLESEHGSRMAPTFSIDGSQNRTGLSDQDVPKRVADKYLQIFNAGIKKERR

A0A6J1KFP4 uncharacterized protein LOC111495374 isoform X4; e-value 8.4e-219; identity 76.21%
Query:  MPSKSHDVPKFWLCGNCTSNEAKSP-DSGLQVQPKMLRHSKTSKVKFLPTEEVIKLSSGGIKGPSKLNTTLAPQRTSKPKIGFESSMPRPLFQVSKESQE
        M +KS +VPK WLCGNCT +EAKSP DSG  VQPKM RH+KT KVKFLPTEEV KLSSG +KGPSKLN    PQRT K +  FESS+PRP FQ SKESQE
Subjt:  MPSKSHDVPKFWLCGNCTSNEAKSP-DSGLQVQPKMLRHSKTSKVKFLPTEEVIKLSSGGIKGPSKLNTTLAPQRTSKPKIGFESSMPRPLFQVSKESQE

Query:  RISATMPPKVCGAKKQALATCLPPVPVGPVKTLKKVKV---DLYA-CTSASRDGLPVTNTGKEVPSPSTKLEDMQKKMRDALMTEEIHAYRDYLPSLHAS
        R SAT+PP +CG KKQALATCLP +PV PV+TLKK KV   D+ A  +S SR G PVT TGKEVPSP TKL+D QK+M+ ALMT+EIHAYRD LPSLHAS
Subjt:  RISATMPPKVCGAKKQALATCLPPVPVGPVKTLKKVKV---DLYA-CTSASRDGLPVTNTGKEVPSPSTKLEDMQKKMRDALMTEEIHAYRDYLPSLHAS

Query:  WKGGFQFVDTHMVGDFYDGFLAKPPCLVHGRAYELSRKIPPILQVKLLSRSDIWDDLFHDKCPVLGDIALYFFPSGNIERSRKNNSCLFELMEREDLLIR
        WKGGFQF+DT M G+FYDGFLAKPPC+VHGRAYELSRKIPPILQVKLLSRSDIWD+LFHD+CP L DIALYFFP+ N ERSRKNNS LFELMEREDLLIR
Subjt:  WKGGFQFVDTHMVGDFYDGFLAKPPCLVHGRAYELSRKIPPILQVKLLSRSDIWDDLFHDKCPVLGDIALYFFPSGNIERSRKNNSCLFELMEREDLLIR

Query:  SLIDGAELVVFTSRQLDLGSQYV---VNMLNADCLLFGVFRAIEDSQSPFHNLRESTAAVPMLEYGSAVSSVECASGVTLMECTPKGHGKHDEGNAVNKE
        SLIDGAELVVFT RQLDL SQ +   +NML+A CLLFGVFRAI+  +S           VPMLEYGSAVSSVE  S V L+E TPKGHGKHDE NAV + 
Subjt:  SLIDGAELVVFTSRQLDLGSQYV---VNMLNADCLLFGVFRAIEDSQSPFHNLRESTAAVPMLEYGSAVSSVECASGVTLMECTPKGHGKHDEGNAVNKE

Query:  IDIMGGNTTGKSPTASKDVDSTIQRLLLEFGSQQPRESDVNALTKNVQIKNQEPAPSSAIGSYSLSQSKVKTEPLPDIRVEGSDKRKCLESEHGSRMAPT
        IDI GGNTTGKSPTA KDVDSTI+RLLLEFGSQ+PRESDVNALT   QIK+QEPAP +A G YSLSQSKVK EP+P I+ E SD RKCLE+EH SRMAPT
Subjt:  IDIMGGNTTGKSPTASKDVDSTIQRLLLEFGSQQPRESDVNALTKNVQIKNQEPAPSSAIGSYSLSQSKVKTEPLPDIRVEGSDKRKCLESEHGSRMAPT

Query:  FSIDGSQNRTGLSDQDVPKRVADKYLQIFNAGIKKERR
        FSIDGSQNRTGL+DQDVPKRVADKYLQIFNAGIKKERR
Subjt:  FSIDGSQNRTGLSDQDVPKRVADKYLQIFNAGIKKERR

A0A6J1KPN9 uncharacterized protein LOC111495374 isoform X2; e-value 3.9e-216; identity 72.58%
Query:  MPSKSHDVPKFWLCGNCTSNEAKSP-DSGLQVQPKMLRHSKTSKVKFLPTEEVIKLSSGGIKGPSKLNTTLAPQRTSKPKIGFESSMPRPLFQVSKESQE
        M +KS +VPK WLCGNCT +EAKSP DSG  VQPKM RH+KT KVKFLPTEEV KLSSG +KGPSKLN    PQRT K +  FESS+PRP FQ SKESQE
Subjt:  MPSKSHDVPKFWLCGNCTSNEAKSP-DSGLQVQPKMLRHSKTSKVKFLPTEEVIKLSSGGIKGPSKLNTTLAPQRTSKPKIGFESSMPRPLFQVSKESQE

Query:  RISATMPPKVCGAKKQALATCLPPVPVGPVKTLKKVKV---DLYA-CTSASRDGLPVTNTGKEVPSPSTKLEDM--------------------------
        R SAT+PP +CG KKQALATCLP +PV PV+TLKK KV   D+ A  +S SR G PVT TGKEVPSPSTKLEDM                          
Subjt:  RISATMPPKVCGAKKQALATCLPPVPVGPVKTLKKVKV---DLYA-CTSASRDGLPVTNTGKEVPSPSTKLEDM--------------------------

Query:  --------QKKMRDALMTEEIHAYRDYLPSLHASWKGGFQFVDTHMVGDFYDGFLAKPPCLVHGRAYELSRKIPPILQVKLLSRSDIWDDLFHDKCPVLG
                QK+M+ ALMT+EIHAYRD LPSLHASWKGGFQF+DT M G+FYDGFLAKPPC+VHGRAYELSRKIPPILQVKLLSRSDIWD+LFHD+CP L 
Subjt:  --------QKKMRDALMTEEIHAYRDYLPSLHASWKGGFQFVDTHMVGDFYDGFLAKPPCLVHGRAYELSRKIPPILQVKLLSRSDIWDDLFHDKCPVLG

Query:  DIALYFFPSGNIERSRKNNSCLFELMEREDLLIRSLIDGAELVVFTSRQLDLGSQYVVNMLNADCLLFGVFRAIEDSQSPFHNLRESTAAVPMLEYGSAV
        DIALYFFP+ N ERSRKNNS LFELMEREDLLIRSLIDGAELVVFT RQLDL SQ+ +NML+A CLLFGVFRAI+  +S           VPMLEYGSAV
Subjt:  DIALYFFPSGNIERSRKNNSCLFELMEREDLLIRSLIDGAELVVFTSRQLDLGSQYVVNMLNADCLLFGVFRAIEDSQSPFHNLRESTAAVPMLEYGSAV

Query:  SSVECASGVTLMECTPKGHGKHDEGNAVNKEIDIMGGNTTGKSPTASKDVDSTIQRLLLEFGSQQPRESDVNALTKNVQIKNQEPAPSSAIGSYSLSQSK
        SSVE  S V L+E TPKGHGKHDE NAV + IDI GGNTTGKSPTA KDVDSTI+RLLLEFGSQ+PRESDVNALT   QIK+QEPAP +A G YSLSQSK
Subjt:  SSVECASGVTLMECTPKGHGKHDEGNAVNKEIDIMGGNTTGKSPTASKDVDSTIQRLLLEFGSQQPRESDVNALTKNVQIKNQEPAPSSAIGSYSLSQSK

Query:  VKTEPLPDIRVEGSDKRKCLESEHGSRMAPTFSIDGSQNRTGLSDQDVPKRVADKYLQIFNAGIKKERR
        VK EP+P I+ E SD RKCLE+EH SRMAPTFSIDGSQNRTGL+DQDVPKRVADKYLQIFNAGIKKERR
Subjt:  VKTEPLPDIRVEGSDKRKCLESEHGSRMAPTFSIDGSQNRTGLSDQDVPKRVADKYLQIFNAGIKKERR

SwissProt top hits (hit; e-value; % identity; alignment)
No hits found
Arabidopsis top hits (hit; e-value; % identity; alignment)
AT1G43770.2 RING/FYVE/PHD zinc finger superfamily protein; e-value 3.5e-07; identity 28.12%
Query:  DGFLAKPPCLVHGRAYELSRKIPPILQVKLLSRSDIWDDLF-HDKCPVLGDIALYFFPSGNIERSRKNNSCLFELMEREDLLIRSLIDGAELVVFTSRQL
        DG +A    L   + +E +  +   L  ++L R ++W   F  +  P    +AL+FFPS       K    L + M++ D  +R +++ AEL++FTS  L
Subjt:  DGFLAKPPCLVHGRAYELSRKIPPILQVKLLSRSDIWDDLF-HDKCPVLGDIALYFFPSGNIERSRKNNSCLFELMEREDLLIRSLIDGAELVVFTSRQL

Query:  DLGSQYVVNMLNADCLLFGVFRAIEDSQ
           S       N+   L+GVF+  + S+
Subjt:  DLGSQYVVNMLNADCLLFGVFRAIEDSQ

AT3G02890.1 RING/FYVE/PHD zinc finger superfamily protein; e-value 1.7e-09; identity 25.67%
Query:  TGKEVPSPSTKLEDMQKKMRD-ALMTEEIHAYRDYLPSLHASWKGGFQFVDTHMVGDFYDGFLAKPPCLVHGRAYELSRKIPPILQVKLLSRSDIWDDLF
        T  +   P      +Q  MRD  +    + +    +P     W+G  +   +  +   + G  A    L   +  E+ ++ P  + +  + R   W   F
Subjt:  TGKEVPSPSTKLEDMQKKMRD-ALMTEEIHAYRDYLPSLHASWKGGFQFVDTHMVGDFYDGFLAKPPCLVHGRAYELSRKIPPILQVKLLSRSDIWDDLF

Query:  HDKCPVLGDIALYFFPSGNIERSRKNNSCLFELMEREDLLIRSLIDGAELVVFTSRQLDLGSQYVVNMLNADCLLFGVFRAIEDSQS
         D       +AL+FF + +IE   KN   L + M ++DL ++  ++G EL++F S QL    Q   NML     L+GVFR  ++S S
Subjt:  HDKCPVLGDIALYFFPSGNIERSRKNNSCLFELMEREDLLIRSLIDGAELVVFTSRQLDLGSQYVVNMLNADCLLFGVFRAIEDSQS

AT5G61090.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein; e-value 7.7e-15; identity 27.81%
Query:  KMRDALMTEEIHAYRDYLPSLHASWKGGFQFVDTHMVGDFYDGFLAKPPCLVHGRAYELSRKIPPILQVKLLSRSDIWDDLFHDKCPVLGDIALYFFP-S
        ++R ++  EE+     YLP+ + +W G  + +D+    +F   F +KP   +  +A   S+ +P +L+V+LL    I +D+   + P L ++ +Y FP  
Subjt:  KMRDALMTEEIHAYRDYLPSLHASWKGGFQFVDTHMVGDFYDGFLAKPPCLVHGRAYELSRKIPPILQVKLLSRSDIWDDLFHDKCPVLGDIALYFFP-S

Query:  GNIERSRKNNSCLFELMEREDLLIRSLIDGAELVVFTSRQLDLGSQYVVN-MLNADCLLFGVFRAIEDS
           ER    ++ LF+ M    ++ ++ I+G EL++F+S+ LD  SQ+++N     +  L+G F   ++S
Subjt:  GNIERSRKNNSCLFELMEREDLLIRSLIDGAELVVFTSRQLDLGSQYVVN-MLNADCLLFGVFRAIEDS

AT5G61120.1 BEST Arabidopsis thaliana protein match is: Polynucleotidyl transferase, ribonuclease H-like superfamily protein (TAIR:AT5G61090.1); e-value 1.0e-22; identity 27.36%
Query:  PKMLRHSKTSKVKFLPTEEVIKLSSGGIKGPSKLN-----TTLAPQRTSKPKIGFESSMPRPLFQVSKESQERISATMPPKVCGAKKQALATCLPPVPVG
        PK  +  +TS++K +  EEV KL+ GG             +   P   +KP  GF  +       V+++++   S  +PPK    K + L+     +  G
Subjt:  PKMLRHSKTSKVKFLPTEEVIKLSSGGIKGPSKLN-----TTLAPQRTSKPKIGFESSMPRPLFQVSKESQERISATMPPKVCGAKKQALATCLPPVPVG

Query:  PVKTLKKVKVDLYACTSASRDGLPVTNTGKEVPSPSTKLEDMQKKMRDALMTEEIHAYRDYLPSLHASWKGGFQFVDTHMVGDFYDGFLAKPPCLVHGRA
         ++   K +               V    K       K  D  +  R ++++E++     Y P+LH  WKG  + VD+    +F   FLA+P   V G+A
Subjt:  PVKTLKKVKVDLYACTSASRDGLPVTNTGKEVPSPSTKLEDMQKKMRDALMTEEIHAYRDYLPSLHASWKGGFQFVDTHMVGDFYDGFLAKPPCLVHGRA

Query:  YELSRKIPPILQVKLLSRSDIWDDLFHDKCPVLGDIALYFFPSG-NIERSRKNNSCLFELMEREDLLIRSLIDGAELVVFTSRQLDLGSQYVVNM-LNAD
        Y LS+ IP +L+VKL+   ++   LF ++ P L D+ +Y FP   N +R       +FE M   + +++  I+G  L++F+S+ LD  SQ ++ M    +
Subjt:  YELSRKIPPILQVKLLSRSDIWDDLFHDKCPVLGDIALYFFPSG-NIERSRKNNSCLFELMEREDLLIRSLIDGAELVVFTSRQLDLGSQYVVNM-LNAD

Query:  CLLFGVF
          L+G+F
Subjt:  CLLFGVF


Sequences
CDS sequence
ATGCCCAGTAAAAGTCACGATGTCCCCAAATTCTGGCTTTGTGGTAATTGTACATCCAATGAGGCCAAAAGTCCTGATTCAGGGTTGCAGGTTCAACCTAAAATGCTAAG
GCATTCTAAAACTAGTAAAGTGAAGTTCTTACCGACCGAAGAAGTAATAAAGCTATCATCGGGAGGGATTAAGGGACCTTCCAAATTGAACACAACTCTTGCACCACAAA
GAACATCGAAGCCCAAAATAGGCTTTGAAAGTTCTATGCCCCGGCCGCTTTTCCAAGTATCAAAAGAATCTCAAGAACGAATTTCTGCTACGATGCCTCCAAAGGTATGT
GGTGCAAAGAAACAAGCATTAGCTACTTGTTTACCTCCAGTGCCAGTAGGCCCAGTAAAAACTTTGAAAAAGGTGAAGGTAGACTTGTATGCTTGCACTTCTGCGTCAAG
GGATGGTTTACCTGTCACAAATACAGGAAAAGAAGTTCCTTCGCCTTCCACTAAGTTGGAGGATATGCAAAAAAAAATGAGGGATGCTTTGATGACAGAGGAAATACATG
CCTACCGTGACTATCTGCCATCATTACATGCCTCTTGGAAGGGAGGCTTCCAATTTGTTGATACACATATGGTCGGTGATTTCTATGATGGTTTCCTGGCAAAGCCTCCT
TGTCTAGTACATGGTAGAGCTTATGAATTGTCACGGAAGATTCCTCCCATTCTTCAAGTGAAGCTGCTTAGTCGTTCTGATATTTGGGATGACCTATTTCATGATAAATG
TCCTGTTCTTGGTGATATTGCCTTGTACTTCTTTCCCTCCGGCAATATTGAAAGGTCCAGAAAGAACAATTCTTGCCTGTTTGAACTTATGGAGAGAGAAGATTTGTTGA
TAAGAAGTCTTATTGACGGTGCAGAGTTGGTCGTATTTACATCTAGACAGCTGGATCTAGGCTCTCAATATGTTGTAAATATGTTAAATGCTGACTGCCTCCTTTTCGGA
GTCTTCCGTGCTATAGAAGACAGTCAGTCTCCTTTTCATAATCTTCGAGAAAGTACTGCTGCAGTTCCTATGTTGGAATATGGTTCTGCAGTTTCTTCTGTAGAATGTGC
TTCCGGAGTTACCCTGATGGAATGCACGCCCAAAGGACATGGAAAGCACGATGAAGGCAATGCTGTTAACAAAGAAATTGACATTATGGGTGGAAACACTACTGGAAAGT
CTCCAACTGCTTCAAAGGATGTAGACTCTACCATTCAGCGATTACTATTAGAATTTGGATCACAACAACCTAGGGAATCTGATGTCAATGCATTAACTAAGAATGTTCAA
ATAAAAAATCAGGAACCTGCTCCAAGCTCAGCAATTGGCTCCTATTCTCTTTCCCAGTCAAAAGTAAAGACTGAACCTTTACCCGATATTAGAGTGGAAGGAAGCGATAA
AAGAAAATGCTTGGAGTCGGAGCATGGCTCGAGAATGGCACCTACATTTAGTATTGATGGCTCTCAAAACAGGACTGGTTTATCTGACCAAGATGTTCCCAAGAGAGTTG
CGGACAAGTATCTCCAAATCTTTAACGCAGGGATTAAAAAGGAACGGCGCTAG
mRNA sequence
ATGCCCAGTAAAAGTCACGATGTCCCCAAATTCTGGCTTTGTGGTAATTGTACATCCAATGAGGCCAAAAGTCCTGATTCAGGGTTGCAGGTTCAACCTAAAATGCTAAG
GCATTCTAAAACTAGTAAAGTGAAGTTCTTACCGACCGAAGAAGTAATAAAGCTATCATCGGGAGGGATTAAGGGACCTTCCAAATTGAACACAACTCTTGCACCACAAA
GAACATCGAAGCCCAAAATAGGCTTTGAAAGTTCTATGCCCCGGCCGCTTTTCCAAGTATCAAAAGAATCTCAAGAACGAATTTCTGCTACGATGCCTCCAAAGGTATGT
GGTGCAAAGAAACAAGCATTAGCTACTTGTTTACCTCCAGTGCCAGTAGGCCCAGTAAAAACTTTGAAAAAGGTGAAGGTAGACTTGTATGCTTGCACTTCTGCGTCAAG
GGATGGTTTACCTGTCACAAATACAGGAAAAGAAGTTCCTTCGCCTTCCACTAAGTTGGAGGATATGCAAAAAAAAATGAGGGATGCTTTGATGACAGAGGAAATACATG
CCTACCGTGACTATCTGCCATCATTACATGCCTCTTGGAAGGGAGGCTTCCAATTTGTTGATACACATATGGTCGGTGATTTCTATGATGGTTTCCTGGCAAAGCCTCCT
TGTCTAGTACATGGTAGAGCTTATGAATTGTCACGGAAGATTCCTCCCATTCTTCAAGTGAAGCTGCTTAGTCGTTCTGATATTTGGGATGACCTATTTCATGATAAATG
TCCTGTTCTTGGTGATATTGCCTTGTACTTCTTTCCCTCCGGCAATATTGAAAGGTCCAGAAAGAACAATTCTTGCCTGTTTGAACTTATGGAGAGAGAAGATTTGTTGA
TAAGAAGTCTTATTGACGGTGCAGAGTTGGTCGTATTTACATCTAGACAGCTGGATCTAGGCTCTCAATATGTTGTAAATATGTTAAATGCTGACTGCCTCCTTTTCGGA
GTCTTCCGTGCTATAGAAGACAGTCAGTCTCCTTTTCATAATCTTCGAGAAAGTACTGCTGCAGTTCCTATGTTGGAATATGGTTCTGCAGTTTCTTCTGTAGAATGTGC
TTCCGGAGTTACCCTGATGGAATGCACGCCCAAAGGACATGGAAAGCACGATGAAGGCAATGCTGTTAACAAAGAAATTGACATTATGGGTGGAAACACTACTGGAAAGT
CTCCAACTGCTTCAAAGGATGTAGACTCTACCATTCAGCGATTACTATTAGAATTTGGATCACAACAACCTAGGGAATCTGATGTCAATGCATTAACTAAGAATGTTCAA
ATAAAAAATCAGGAACCTGCTCCAAGCTCAGCAATTGGCTCCTATTCTCTTTCCCAGTCAAAAGTAAAGACTGAACCTTTACCCGATATTAGAGTGGAAGGAAGCGATAA
AAGAAAATGCTTGGAGTCGGAGCATGGCTCGAGAATGGCACCTACATTTAGTATTGATGGCTCTCAAAACAGGACTGGTTTATCTGACCAAGATGTTCCCAAGAGAGTTG
CGGACAAGTATCTCCAAATCTTTAACGCAGGGATTAAAAAGGAACGGCGCTAG
Protein sequence
MPSKSHDVPKFWLCGNCTSNEAKSPDSGLQVQPKMLRHSKTSKVKFLPTEEVIKLSSGGIKGPSKLNTTLAPQRTSKPKIGFESSMPRPLFQVSKESQERISATMPPKVC
GAKKQALATCLPPVPVGPVKTLKKVKVDLYACTSASRDGLPVTNTGKEVPSPSTKLEDMQKKMRDALMTEEIHAYRDYLPSLHASWKGGFQFVDTHMVGDFYDGFLAKPP
CLVHGRAYELSRKIPPILQVKLLSRSDIWDDLFHDKCPVLGDIALYFFPSGNIERSRKNNSCLFELMEREDLLIRSLIDGAELVVFTSRQLDLGSQYVVNMLNADCLLFG
VFRAIEDSQSPFHNLRESTAAVPMLEYGSAVSSVECASGVTLMECTPKGHGKHDEGNAVNKEIDIMGGNTTGKSPTASKDVDSTIQRLLLEFGSQQPRESDVNALTKNVQ
IKNQEPAPSSAIGSYSLSQSKVKTEPLPDIRVEGSDKRKCLESEHGSRMAPTFSIDGSQNRTGLSDQDVPKRVADKYLQIFNAGIKKERR
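
The CDS and protein sequences listed above can be cross-checked against each other by translating the CDS with the standard genetic code. The following is a minimal Python sketch, not part of the database record: the `translate` helper and the variable names are hypothetical, and only the first 57 and last 24 nucleotides of the CDS are copied in for brevity. It assumes the CDS is in frame from its first base and uses the standard (nuclear) codon table.

```python
from itertools import product

# Standard genetic code in the conventional TCAG ordering
# (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate an in-frame DNA coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and whitespace
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 57 and last 24 nucleotides of the Lag0024189 CDS listed above.
cds_start = "ATGCCCAGTAAAAGTCACGATGTCCCCAAATTCTGGCTTTGTGGTAATTGTACATCC"
cds_end = "GGGATTAAAAAGGAACGGCGCTAG"

print(translate(cds_start))
print(translate(cds_end))
```

Pasting the full CDS into `translate` in place of the two excerpts should give the complete protein sequence for comparison with the one listed in the record.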