CuGenDBv2

Lsi07G012780 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene ID: Lsi07G012780
Organism: Lagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
Description: LOW QUALITY PROTEIN: uncharacterized protein DDB_G0284459
Genome location: chr07:18624565..18626235
RNA-Seq Expression: Lsi07G012780
Synteny: Lsi07G012780
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: IPR008480 - Protein of unknown function DUF761, plant
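
The genome location string can be split into a chromosome name and two coordinates. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state this); `parse_location` is a hypothetical helper name, not part of any CuGenDB tool.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


chrom, start, end = parse_location("chr07:18624565..18626235")
# Assumption: 1-based, inclusive coordinates, so both endpoints count.
span = end - start + 1
print(chrom, start, end, span)  # chr07 18624565 18626235 1671
```

Under that assumption the gene spans 1671 bp on chr07.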


Homology
GenBank top hits (e value, % identity, alignment)
KAA0041069.1 putative Hydroxyproline-rich glycoprotein family protein [Cucumis melo var. makuwa] (e value: 1.6e-251; % identity: 84.67)
Query:  MAESEVLAPPQNQ--PTPSKFHSHILYKVVTAIFFLLILPLVPSQAPEFVNQTLLTRSWELLHLLFVGIAVSYGLFSRRSDEKEDEISVSKFDNVQSYVS
        MAES+VL PPQN   P+PSKF++H+ YK++TA+FFLLILPLVPSQAPEFVNQTLLTRSWELLHLLFVGIAVSYGLFSRRSDEKEDEISVSKFDNVQSYVS
Subjt:  MAESEVLAPPQNQ--PTPSKFHSHILYKVVTAIFFLLILPLVPSQAPEFVNQTLLTRSWELLHLLFVGIAVSYGLFSRRSDEKEDEISVSKFDNVQSYVS

Query:  GLLHVSSVFDDEPETPSANDESVSSSDENKVHTWNSRYFRNESVVVAEERPVVNEQRVRSEKPLLLPVRSLKSRVVVDDESRSKPRVSSRRLLSNLKRSS
        GLLHVSSVFDDEPETPSAND      DENKV TWN+RYFRNESVVVAEERPVVNEQRVRSEKPLLLPVRSLKSRV+VDDESRSK RVSSRRLLSNLKR+S
Subjt:  GLLHVSSVFDDEPETPSANDESVSSSDENKVHTWNSRYFRNESVVVAEERPVVNEQRVRSEKPLLLPVRSLKSRVVVDDESRSKPRVSSRRLLSNLKRSS

Query:  NGEFGG-GNLEEIEDKLNENVVLPSPVPWRSRSGRMDVQEEADNPSMEDSESNRFDSRSPRPQTSRSSRASAISQKL--SPSPSPSPRKPSPSHNVSPES
        N EFGG  NL+EI+DKLNENVVLPSPVPWRSRSGR++ QEEADNPSMEDSESNR  SRSP+PQTS++SRASAI QKL  SPSPSPSPRKPSPSHNVSPE 
Subjt:  NGEFGG-GNLEEIEDKLNENVVLPSPVPWRSRSGRMDVQEEADNPSMEDSESNRFDSRSPRPQTSRSSRASAISQKL--SPSPSPSPRKPSPSHNVSPES

Query:  QAKSAEDLVRKKSFYRS-PPPPPPPPPPPHVRRISSMKPSSWLNENDVSHQKELRRSFTSKPRSIIRDTGNDIDMMNGGNSSAEALPRNYVDSLSMGKSV
        QAKSAEDLVRKKSFYRS PPPPPPPPPPP VRR SSMKPSSW+NE+D+ HQKELRRSFTSKPR+IIRDTG+D DMM G NSS E  PRNYVDSLSMGKSV
Subjt:  QAKSAEDLVRKKSFYRS-PPPPPPPPPPPHVRRISSMKPSSWLNENDVSHQKELRRSFTSKPRSIIRDTGNDIDMMNGGNSSAEALPRNYVDSLSMGKSV

Query:  RTIRPGEVVNEPPRRGREFGGNDQMKG-TMMDENTHVQDFEDNPIEFPDEDKEELVEKLTMDT---DDDDDMESE-EENNNMVGQFIREDNGEPFNVKRR
        RTIRPGEVVNEPPRRGREF  NDQ+KG  MM+ENTH+QDFE+NPIEFPDEDKEELVEKLT+DT   DDDDDMESE EENN+MVG+FIREDNGEPF+VKRR
Subjt:  RTIRPGEVVNEPPRRGREFGGNDQMKG-TMMDENTHVQDFEDNPIEFPDEDKEELVEKLTMDT---DDDDDMESE-EENNNMVGQFIREDNGEPFNVKRR

Query:  DRDNDERGSSN-------EEEAGGSSNMANDGGPDVDKKADEFIAKFREQIRLQRIESIKRSSGQIRKNTSKQT
        +RD DERGS N       EEEAG +SN+ NDGGPDVDKKADEFIAKFREQIRLQRIESIKRSSGQIRKN +KQT
Subjt:  DRDNDERGSSN-------EEEAGGSSNMANDGGPDVDKKADEFIAKFREQIRLQRIESIKRSSGQIRKNTSKQT

XP_011650387.1 uncharacterized protein DDB_G0284459 [Cucumis sativus] (e value: 3.4e-249; % identity: 84.68)
Query:  MAESEVLAPPQNQ--PTPSKFHSHILYKVVTAIFFLLILPLVPSQAPEFVNQTLLTRSWELLHLLFVGIAVSYGLFSRRSDEKEDEISVSKFDNVQSYVS
        MAES+VL PPQN   P+PSKF++H+LYK++TAIFFLLILPLVPSQAPEFVNQTLLTRSWELLHLLFVGIAVSYGLFSRRSDEKEDEISVSKFDNVQSYVS
Subjt:  MAESEVLAPPQNQ--PTPSKFHSHILYKVVTAIFFLLILPLVPSQAPEFVNQTLLTRSWELLHLLFVGIAVSYGLFSRRSDEKEDEISVSKFDNVQSYVS

Query:  GLLHVSSVFDDEPETPSANDESVSSSDENKVHTWNSRYFRNESVVVAEERPVVNEQRVRSEKPLLLPVRSLKSRVVVDDESRSKPRVSSRRLLSNLKRSS
        GLLHVSSVFDDEPETPSAND      DENKV TWN+RYFRNESVVVAEERPV NEQRVRSEKPLLLPVRSLKSRVVVDDE RSK RVSSRRLLSNLKRSS
Subjt:  GLLHVSSVFDDEPETPSANDESVSSSDENKVHTWNSRYFRNESVVVAEERPVVNEQRVRSEKPLLLPVRSLKSRVVVDDESRSKPRVSSRRLLSNLKRSS

Query:  NGEFGG-GNLEEIEDKLNENVVLPSPVPWRSRSGRMDVQEEADNPSMEDSESNRFDSRSPRPQTSRSSRASAISQKLSPSPSPSPRKPSPSHNVSPESQA
        N EFGG  NL+EI+DKLNEN VLPSPVPWRSRSGRM+ QEEADNPSMEDSESNR  SRSP+PQTS+SSRASAI Q+LSPSPSPSPRKPSPSHNVSPE QA
Subjt:  NGEFGG-GNLEEIEDKLNENVVLPSPVPWRSRSGRMDVQEEADNPSMEDSESNRFDSRSPRPQTSRSSRASAISQKLSPSPSPSPRKPSPSHNVSPESQA

Query:  KSAEDLVRKKSFYRSPPPPPPPPPPPHVRRISSMKPSSWLNENDVSHQKELRRSFTSKPRSIIRDTGNDIDMMNGGNSSAEALPRNYVDSLSMGKSVRTI
        KSAEDLVRKKSFYRS PPPPPPPPPP VRR SSMKPSSW+NE+DV HQKELRRS+TSKPR+I RDTG+D DMM G NSS E  PR+YVD LSMGKSVRTI
Subjt:  KSAEDLVRKKSFYRSPPPPPPPPPPPHVRRISSMKPSSWLNENDVSHQKELRRSFTSKPRSIIRDTGNDIDMMNGGNSSAEALPRNYVDSLSMGKSVRTI

Query:  RPGEVVNEPPRRGREFGGNDQMKG-TMMDENTHVQDFEDNPIEFPDEDKEELVEKLTMDT----DDDDDMESEEENNNMVGQFIREDNGEPFNVKRRDRD
        R GE VNEPPRRGREF  NDQ+KG TMM+ENTHVQDFE+NP+E PDEDKEELVEKLTMDT    DDDDDMESE E N+MVG+FIREDNGEPF+VKRR+R+
Subjt:  RPGEVVNEPPRRGREFGGNDQMKG-TMMDENTHVQDFEDNPIEFPDEDKEELVEKLTMDT----DDDDDMESEEENNNMVGQFIREDNGEPFNVKRRDRD

Query:  NDERGSSN----EEEAGGSSNMANDGGPDVDKKADEFIAKFREQIRLQRIESIKRSSGQIRKNTSKQT
         DERGSSN    EEEAG SSN+ NDGGPDVDKKADEFIAKFREQIRLQRIES KRSSGQIRKNT+KQ+
Subjt:  NDERGSSN----EEEAGGSSNMANDGGPDVDKKADEFIAKFREQIRLQRIESIKRSSGQIRKNTSKQT

XP_016900571.1 PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein DDB_G0284459 [Cucumis melo] (e value: 2.6e-249; % identity: 84.12)
Query:  MAESEVLAPPQNQ--PTPSKFHSHILYKVVTAIFFLLILPLVPSQAPEFVNQTLLTRSWELLHLLFVGIAVSYGLFSRRSDEKEDEISVSKFDNVQSYVS
        MAES+VL PPQN   P+PSKF++H+ YK++TA+FFLLILPLVPSQAPEFVNQTLLTRSWELLHLLFVGIAVSYGLFSRRSDEKEDEISVSKFDNVQSYVS
Subjt:  MAESEVLAPPQNQ--PTPSKFHSHILYKVVTAIFFLLILPLVPSQAPEFVNQTLLTRSWELLHLLFVGIAVSYGLFSRRSDEKEDEISVSKFDNVQSYVS

Query:  GLLHVSSVFDDEPETPSANDESVSSSDENKVHTWNSRYFRNESVVVAEERPVVNEQRVRSEKPLLLPVRSLKSRVVVDDESRSKPRVSSRRLLSNLKRSS
        GLLHVSSVFDDEPETPSAND      DENKV TWN+RYFRNESVVVAEERPVVNEQRVRSEKPLLLPVRSLKSRV+VDDESRSK RVSSRRLLSNLKR+S
Subjt:  GLLHVSSVFDDEPETPSANDESVSSSDENKVHTWNSRYFRNESVVVAEERPVVNEQRVRSEKPLLLPVRSLKSRVVVDDESRSKPRVSSRRLLSNLKRSS

Query:  NGEFGG-GNLEEIEDKLNENVVLPSPVPWRSRSGRMDVQEEADNPSMEDSESNRFDSRSPRPQTSRSSRASAISQKL--SPSPSPSPRKPSPSHNVSPES
        N EFGG  NL+EI+DKLNENVVLPSPVPWRSRSGR++ QEEADNPSMEDSESNR  SRSP+PQTS++SRASAI QKL  SPSPSPSPRKPSPSHNVSPE 
Subjt:  NGEFGG-GNLEEIEDKLNENVVLPSPVPWRSRSGRMDVQEEADNPSMEDSESNRFDSRSPRPQTSRSSRASAISQKL--SPSPSPSPRKPSPSHNVSPES

Query:  QAKSAEDLVRKKSFYRSPPPPPPPPPPPHVRRISSMKPSSWLNENDVSHQKELRRSFTSKPRSIIRDTGNDIDMMNGGNSSAEALPRNYVDSLSMGKSVR
        QAKSAEDLVRKKSFYRSPPPPPPPP P HV    SMKPSSW+NE+D+ HQKELRRSFTSKPR+IIRDTG+D DMM G NSS E  PRNYVDSLSMGKSVR
Subjt:  QAKSAEDLVRKKSFYRSPPPPPPPPPPPHVRRISSMKPSSWLNENDVSHQKELRRSFTSKPRSIIRDTGNDIDMMNGGNSSAEALPRNYVDSLSMGKSVR

Query:  TIRPGEVVNEPPRRGREFGGNDQMKG-TMMDENTHVQDFEDNPIEFPDEDKEELVEKLTMDT---DDDDDMESE-EENNNMVGQFIREDNGEPFNVKRRD
        TIRPGEVVNEPPRRGREF  NDQ+KG  MM+ENTH+QDFE+NPIEFPDEDKEELVEKLT+DT   DDDDDMESE EENN+MVG+FIREDNGEPF+VKRR+
Subjt:  TIRPGEVVNEPPRRGREFGGNDQMKG-TMMDENTHVQDFEDNPIEFPDEDKEELVEKLTMDT---DDDDDMESE-EENNNMVGQFIREDNGEPFNVKRRD

Query:  RDNDERGSSN-------EEEAGGSSNMANDGGPDVDKKADEFIAKFREQIRLQRIESIKRSSGQIRKNTSKQT
        RD DERGS N       EEEAG +SN+ NDGGPDVDKKADEFIAKFREQIRLQRIESIKRSSGQIRKN +KQT
Subjt:  RDNDERGSSN-------EEEAGGSSNMANDGGPDVDKKADEFIAKFREQIRLQRIESIKRSSGQIRKNTSKQT

XP_023007761.1 uncharacterized protein DDB_G0284459-like [Cucurbita maxima] (e value: 9.9e-233; % identity: 80.17)
Query:  MAESEVLAPPQNQP------TPSKFHSHILYKVVTAIFFLLILPLVPSQAPEFVNQTLLTRSWELLHLLFVGIAVSYGLFSRRSDEKEDEISVSKFDNVQ
        MAES+V A P N P      TPSKF+SHILYK++ AIFFL+ILPLVPSQAPEFVNQTLLTR+WELLHLLFVGIAVSYGLFSRR+DEKED ISVS FDNVQ
Subjt:  MAESEVLAPPQNQP------TPSKFHSHILYKVVTAIFFLLILPLVPSQAPEFVNQTLLTRSWELLHLLFVGIAVSYGLFSRRSDEKEDEISVSKFDNVQ

Query:  SYVSGLLHVSSVFDDEPETPSANDESVSSSDENKVHTWNSRYFRNESVVVAEERPVVNEQRVRSEKPLLLPVRSLKSRVVVDDESR----SKPRVSSRRL
        SYVSGLLHVSSVFDDE ETPSANDES+SSSD NKV TW++RYFRNES+VVAEE PVVNEQRVRSEKPLLLPVRSL S+VVVDDESR    S  RVSS RL
Subjt:  SYVSGLLHVSSVFDDEPETPSANDESVSSSDENKVHTWNSRYFRNESVVVAEERPVVNEQRVRSEKPLLLPVRSLKSRVVVDDESR----SKPRVSSRRL

Query:  LSNLKRSSNGEFGGGNLEEIEDKLNENVVLPSPVPWRSRSGRMDVQEEADNP-------SMEDSESNRFDSRSPRPQTSRSSRASAISQKLS-PSPSPSP
        LSN KRSSNGEFGG +LE IED LNENVVLPSPVPWRSRSGR +VQEEADNP        ME+SESN  DSRS RPQTSRS +ASAI  KLS PSPSP P
Subjt:  LSNLKRSSNGEFGGGNLEEIEDKLNENVVLPSPVPWRSRSGRMDVQEEADNP-------SMEDSESNRFDSRSPRPQTSRSSRASAISQKLS-PSPSPSP

Query:  RKPSPSHNVSPESQAKSAEDLVRKKSFYRSPPPPPPPPPPPHVRRISSMKPSSWLNENDVSHQKELRRSF-TSKPRSIIRDTGNDIDMMNGGNSSAEALP
        RKPSPS NVSPE +AKS+ED VRKKSF+ SPPPPPPPPPPPHVRRI+SMKPSS LN+NDV HQK+L+RS  TSKPR  IRDTG+DIDM+ G NSSAEALP
Subjt:  RKPSPSHNVSPESQAKSAEDLVRKKSFYRSPPPPPPPPPPPHVRRISSMKPSSWLNENDVSHQKELRRSF-TSKPRSIIRDTGNDIDMMNGGNSSAEALP

Query:  RNYVDSLSMGKSVRTIRPGEVVNEPPRRGREFGGNDQMKGTMMDENTHVQDFEDNPIEFPDEDKEELVEKLTMDTDDDDDMESEEENNNMVGQFIREDNG
        RNY D LSMGKS+R IRPGEV NEP RRGREFGGNDQ+KG M+D+NTHVQ FE+NPIEFPD+DK+E VEKL M+TDDDDDMESEEE+NNMVG+FIREDNG
Subjt:  RNYVDSLSMGKSVRTIRPGEVVNEPPRRGREFGGNDQMKGTMMDENTHVQDFEDNPIEFPDEDKEELVEKLTMDTDDDDDMESEEENNNMVGQFIREDNG

Query:  EPFNVKRRDRDNDERGSSNEEEAGGSSNMANDGGPDVDKKADEFIAKFREQIRLQRIESIKRSSGQIRKNTSKQT
        EPFNV RRD   +ER SSN EEAGGSSN++NDGGPDVDKKADEFIAKFREQIRLQRIESIKRS+GQIR+NTSKQ+
Subjt:  EPFNVKRRDRDNDERGSSNEEEAGGSSNMANDGGPDVDKKADEFIAKFREQIRLQRIESIKRSSGQIRKNTSKQT

XP_038905604.1 uncharacterized protein DDB_G0284459 [Benincasa hispida] (e value: 1.7e-277; % identity: 92.65)
Query:  MAESEVLAPPQNQPTPSKFHSHILYKVVTAIFFLLILPLVPSQAPEFVNQTLLTRSWELLHLLFVGIAVSYGLFSRRSDEKEDEISVSKFDNVQSYVSGL
        MAESEVLAPP NQPTPSKFH+HILYKVVTAIFFLLILPLVPSQAPEFVNQTLLTRSWELLHLLFVGIAVSYGLFSRRSDEKEDEISVSKFDNVQSYVSGL
Subjt:  MAESEVLAPPQNQPTPSKFHSHILYKVVTAIFFLLILPLVPSQAPEFVNQTLLTRSWELLHLLFVGIAVSYGLFSRRSDEKEDEISVSKFDNVQSYVSGL

Query:  LHVSSVFDDEPETPSANDESVSSSDENKVHTWNSRYFRNESVVVAEERPVVNEQRVRSEKPLLLPVRSLKSRVVVDDESRSKPRVSSRRLLSNLKRSSNG
        LHVSSVFDDEPETPSANDES+SSSDENKVHTWNSRYFRNESVVVAEERP VNEQRVRSEKPLLLPVRSLKSRVVVDDESR KPRVSSRRLLSNLKRSSNG
Subjt:  LHVSSVFDDEPETPSANDESVSSSDENKVHTWNSRYFRNESVVVAEERPVVNEQRVRSEKPLLLPVRSLKSRVVVDDESRSKPRVSSRRLLSNLKRSSNG

Query:  EFGGGNLEEIEDKLNENVVLPSPVPWRSRSGRMDVQEEADNPSMEDSESNRFDSRSPRPQTSRSSRASAISQKLSPSPSPSPRKPSPSHNVSPESQAKSA
        EFGG +LEEIEDKLNENVVLPSPVPWRSRSGRM+VQEEAD PSMEDSESNR DSRSPRPQ SRSSRASAISQK SPSPSPSPRKPSPSHNVSPESQAKSA
Subjt:  EFGGGNLEEIEDKLNENVVLPSPVPWRSRSGRMDVQEEADNPSMEDSESNRFDSRSPRPQTSRSSRASAISQKLSPSPSPSPRKPSPSHNVSPESQAKSA

Query:  EDLVRKKSFYRSPPPPPPPPPPPHVRRISSMKPSSWLNENDVSHQKELRRSFTSKPRSIIRDTGNDIDMMNGGNSSAEALPRNYVDSLSMGKSVRTIRPG
        EDLVRKKSFYRSPPPPPPPPPPPHVRRISSMKPSSW+NEN+V HQKEL+RS TSKPRS+IRDTG+D D+M G NSS EAL RNYVD+LSMGKSVRTIRPG
Subjt:  EDLVRKKSFYRSPPPPPPPPPPPHVRRISSMKPSSWLNENDVSHQKELRRSFTSKPRSIIRDTGNDIDMMNGGNSSAEALPRNYVDSLSMGKSVRTIRPG

Query:  EVVNEPPRRGREFGGNDQMKGTMMDENTHVQDFEDNPIEFPDEDKEELVEKLTMDT--DDDDDMESEEENNNMVGQFIREDNGEPFNVKRRDRDNDERGS
        EVVNEPPRRGREFGGNDQ+KG MMD+NTHVQDFE+NPIEFPDEDKEELVEKLTMDT  DDDDDMESEEENNNMVG+FIREDNGEPF+VK RDRD D R S
Subjt:  EVVNEPPRRGREFGGNDQMKGTMMDENTHVQDFEDNPIEFPDEDKEELVEKLTMDT--DDDDDMESEEENNNMVGQFIREDNGEPFNVKRRDRDNDERGS

Query:  SNEEEAGGSSNMANDGGPDVDKKADEFIAKFREQIRLQRIESIKRSSGQIRKNTSKQT
        SNEEEAG SSNMANDGGPDVDKKADEFIAKFREQIRLQRIESIKRSSGQIR+NT+KQT
Subjt:  SNEEEAGGSSNMANDGGPDVDKKADEFIAKFREQIRLQRIESIKRSSGQIRKNTSKQT

TrEMBL top hits (e value, % identity, alignment)
A0A0A0L1H7 Uncharacterized protein (e value: 1.6e-249; % identity: 84.68)
Query:  MAESEVLAPPQNQ--PTPSKFHSHILYKVVTAIFFLLILPLVPSQAPEFVNQTLLTRSWELLHLLFVGIAVSYGLFSRRSDEKEDEISVSKFDNVQSYVS
        MAES+VL PPQN   P+PSKF++H+LYK++TAIFFLLILPLVPSQAPEFVNQTLLTRSWELLHLLFVGIAVSYGLFSRRSDEKEDEISVSKFDNVQSYVS
Subjt:  MAESEVLAPPQNQ--PTPSKFHSHILYKVVTAIFFLLILPLVPSQAPEFVNQTLLTRSWELLHLLFVGIAVSYGLFSRRSDEKEDEISVSKFDNVQSYVS

Query:  GLLHVSSVFDDEPETPSANDESVSSSDENKVHTWNSRYFRNESVVVAEERPVVNEQRVRSEKPLLLPVRSLKSRVVVDDESRSKPRVSSRRLLSNLKRSS
        GLLHVSSVFDDEPETPSAND      DENKV TWN+RYFRNESVVVAEERPV NEQRVRSEKPLLLPVRSLKSRVVVDDE RSK RVSSRRLLSNLKRSS
Subjt:  GLLHVSSVFDDEPETPSANDESVSSSDENKVHTWNSRYFRNESVVVAEERPVVNEQRVRSEKPLLLPVRSLKSRVVVDDESRSKPRVSSRRLLSNLKRSS

Query:  NGEFGG-GNLEEIEDKLNENVVLPSPVPWRSRSGRMDVQEEADNPSMEDSESNRFDSRSPRPQTSRSSRASAISQKLSPSPSPSPRKPSPSHNVSPESQA
        N EFGG  NL+EI+DKLNEN VLPSPVPWRSRSGRM+ QEEADNPSMEDSESNR  SRSP+PQTS+SSRASAI Q+LSPSPSPSPRKPSPSHNVSPE QA
Subjt:  NGEFGG-GNLEEIEDKLNENVVLPSPVPWRSRSGRMDVQEEADNPSMEDSESNRFDSRSPRPQTSRSSRASAISQKLSPSPSPSPRKPSPSHNVSPESQA

Query:  KSAEDLVRKKSFYRSPPPPPPPPPPPHVRRISSMKPSSWLNENDVSHQKELRRSFTSKPRSIIRDTGNDIDMMNGGNSSAEALPRNYVDSLSMGKSVRTI
        KSAEDLVRKKSFYRS PPPPPPPPPP VRR SSMKPSSW+NE+DV HQKELRRS+TSKPR+I RDTG+D DMM G NSS E  PR+YVD LSMGKSVRTI
Subjt:  KSAEDLVRKKSFYRSPPPPPPPPPPPHVRRISSMKPSSWLNENDVSHQKELRRSFTSKPRSIIRDTGNDIDMMNGGNSSAEALPRNYVDSLSMGKSVRTI

Query:  RPGEVVNEPPRRGREFGGNDQMKG-TMMDENTHVQDFEDNPIEFPDEDKEELVEKLTMDT----DDDDDMESEEENNNMVGQFIREDNGEPFNVKRRDRD
        R GE VNEPPRRGREF  NDQ+KG TMM+ENTHVQDFE+NP+E PDEDKEELVEKLTMDT    DDDDDMESE E N+MVG+FIREDNGEPF+VKRR+R+
Subjt:  RPGEVVNEPPRRGREFGGNDQMKG-TMMDENTHVQDFEDNPIEFPDEDKEELVEKLTMDT----DDDDDMESEEENNNMVGQFIREDNGEPFNVKRRDRD

Query:  NDERGSSN----EEEAGGSSNMANDGGPDVDKKADEFIAKFREQIRLQRIESIKRSSGQIRKNTSKQT
         DERGSSN    EEEAG SSN+ NDGGPDVDKKADEFIAKFREQIRLQRIES KRSSGQIRKNT+KQ+
Subjt:  NDERGSSN----EEEAGGSSNMANDGGPDVDKKADEFIAKFREQIRLQRIESIKRSSGQIRKNTSKQT

A0A1S4DX64 LOW QUALITY PROTEIN: uncharacterized protein DDB_G0284459 (e value: 1.3e-249; % identity: 84.12)
Query:  MAESEVLAPPQNQ--PTPSKFHSHILYKVVTAIFFLLILPLVPSQAPEFVNQTLLTRSWELLHLLFVGIAVSYGLFSRRSDEKEDEISVSKFDNVQSYVS
        MAES+VL PPQN   P+PSKF++H+ YK++TA+FFLLILPLVPSQAPEFVNQTLLTRSWELLHLLFVGIAVSYGLFSRRSDEKEDEISVSKFDNVQSYVS
Subjt:  MAESEVLAPPQNQ--PTPSKFHSHILYKVVTAIFFLLILPLVPSQAPEFVNQTLLTRSWELLHLLFVGIAVSYGLFSRRSDEKEDEISVSKFDNVQSYVS

Query:  GLLHVSSVFDDEPETPSANDESVSSSDENKVHTWNSRYFRNESVVVAEERPVVNEQRVRSEKPLLLPVRSLKSRVVVDDESRSKPRVSSRRLLSNLKRSS
        GLLHVSSVFDDEPETPSAND      DENKV TWN+RYFRNESVVVAEERPVVNEQRVRSEKPLLLPVRSLKSRV+VDDESRSK RVSSRRLLSNLKR+S
Subjt:  GLLHVSSVFDDEPETPSANDESVSSSDENKVHTWNSRYFRNESVVVAEERPVVNEQRVRSEKPLLLPVRSLKSRVVVDDESRSKPRVSSRRLLSNLKRSS

Query:  NGEFGG-GNLEEIEDKLNENVVLPSPVPWRSRSGRMDVQEEADNPSMEDSESNRFDSRSPRPQTSRSSRASAISQKL--SPSPSPSPRKPSPSHNVSPES
        N EFGG  NL+EI+DKLNENVVLPSPVPWRSRSGR++ QEEADNPSMEDSESNR  SRSP+PQTS++SRASAI QKL  SPSPSPSPRKPSPSHNVSPE 
Subjt:  NGEFGG-GNLEEIEDKLNENVVLPSPVPWRSRSGRMDVQEEADNPSMEDSESNRFDSRSPRPQTSRSSRASAISQKL--SPSPSPSPRKPSPSHNVSPES

Query:  QAKSAEDLVRKKSFYRSPPPPPPPPPPPHVRRISSMKPSSWLNENDVSHQKELRRSFTSKPRSIIRDTGNDIDMMNGGNSSAEALPRNYVDSLSMGKSVR
        QAKSAEDLVRKKSFYRSPPPPPPPP P HV    SMKPSSW+NE+D+ HQKELRRSFTSKPR+IIRDTG+D DMM G NSS E  PRNYVDSLSMGKSVR
Subjt:  QAKSAEDLVRKKSFYRSPPPPPPPPPPPHVRRISSMKPSSWLNENDVSHQKELRRSFTSKPRSIIRDTGNDIDMMNGGNSSAEALPRNYVDSLSMGKSVR

Query:  TIRPGEVVNEPPRRGREFGGNDQMKG-TMMDENTHVQDFEDNPIEFPDEDKEELVEKLTMDT---DDDDDMESE-EENNNMVGQFIREDNGEPFNVKRRD
        TIRPGEVVNEPPRRGREF  NDQ+KG  MM+ENTH+QDFE+NPIEFPDEDKEELVEKLT+DT   DDDDDMESE EENN+MVG+FIREDNGEPF+VKRR+
Subjt:  TIRPGEVVNEPPRRGREFGGNDQMKG-TMMDENTHVQDFEDNPIEFPDEDKEELVEKLTMDT---DDDDDMESE-EENNNMVGQFIREDNGEPFNVKRRD

Query:  RDNDERGSSN-------EEEAGGSSNMANDGGPDVDKKADEFIAKFREQIRLQRIESIKRSSGQIRKNTSKQT
        RD DERGS N       EEEAG +SN+ NDGGPDVDKKADEFIAKFREQIRLQRIESIKRSSGQIRKN +KQT
Subjt:  RDNDERGSSN-------EEEAGGSSNMANDGGPDVDKKADEFIAKFREQIRLQRIESIKRSSGQIRKNTSKQT

A0A5A7TIC6 Putative Hydroxyproline-rich glycoprotein family protein (e value: 7.9e-252; % identity: 84.67)
Query:  MAESEVLAPPQNQ--PTPSKFHSHILYKVVTAIFFLLILPLVPSQAPEFVNQTLLTRSWELLHLLFVGIAVSYGLFSRRSDEKEDEISVSKFDNVQSYVS
        MAES+VL PPQN   P+PSKF++H+ YK++TA+FFLLILPLVPSQAPEFVNQTLLTRSWELLHLLFVGIAVSYGLFSRRSDEKEDEISVSKFDNVQSYVS
Subjt:  MAESEVLAPPQNQ--PTPSKFHSHILYKVVTAIFFLLILPLVPSQAPEFVNQTLLTRSWELLHLLFVGIAVSYGLFSRRSDEKEDEISVSKFDNVQSYVS

Query:  GLLHVSSVFDDEPETPSANDESVSSSDENKVHTWNSRYFRNESVVVAEERPVVNEQRVRSEKPLLLPVRSLKSRVVVDDESRSKPRVSSRRLLSNLKRSS
        GLLHVSSVFDDEPETPSAND      DENKV TWN+RYFRNESVVVAEERPVVNEQRVRSEKPLLLPVRSLKSRV+VDDESRSK RVSSRRLLSNLKR+S
Subjt:  GLLHVSSVFDDEPETPSANDESVSSSDENKVHTWNSRYFRNESVVVAEERPVVNEQRVRSEKPLLLPVRSLKSRVVVDDESRSKPRVSSRRLLSNLKRSS

Query:  NGEFGG-GNLEEIEDKLNENVVLPSPVPWRSRSGRMDVQEEADNPSMEDSESNRFDSRSPRPQTSRSSRASAISQKL--SPSPSPSPRKPSPSHNVSPES
        N EFGG  NL+EI+DKLNENVVLPSPVPWRSRSGR++ QEEADNPSMEDSESNR  SRSP+PQTS++SRASAI QKL  SPSPSPSPRKPSPSHNVSPE 
Subjt:  NGEFGG-GNLEEIEDKLNENVVLPSPVPWRSRSGRMDVQEEADNPSMEDSESNRFDSRSPRPQTSRSSRASAISQKL--SPSPSPSPRKPSPSHNVSPES

Query:  QAKSAEDLVRKKSFYRS-PPPPPPPPPPPHVRRISSMKPSSWLNENDVSHQKELRRSFTSKPRSIIRDTGNDIDMMNGGNSSAEALPRNYVDSLSMGKSV
        QAKSAEDLVRKKSFYRS PPPPPPPPPPP VRR SSMKPSSW+NE+D+ HQKELRRSFTSKPR+IIRDTG+D DMM G NSS E  PRNYVDSLSMGKSV
Subjt:  QAKSAEDLVRKKSFYRS-PPPPPPPPPPPHVRRISSMKPSSWLNENDVSHQKELRRSFTSKPRSIIRDTGNDIDMMNGGNSSAEALPRNYVDSLSMGKSV

Query:  RTIRPGEVVNEPPRRGREFGGNDQMKG-TMMDENTHVQDFEDNPIEFPDEDKEELVEKLTMDT---DDDDDMESE-EENNNMVGQFIREDNGEPFNVKRR
        RTIRPGEVVNEPPRRGREF  NDQ+KG  MM+ENTH+QDFE+NPIEFPDEDKEELVEKLT+DT   DDDDDMESE EENN+MVG+FIREDNGEPF+VKRR
Subjt:  RTIRPGEVVNEPPRRGREFGGNDQMKG-TMMDENTHVQDFEDNPIEFPDEDKEELVEKLTMDT---DDDDDMESE-EENNNMVGQFIREDNGEPFNVKRR

Query:  DRDNDERGSSN-------EEEAGGSSNMANDGGPDVDKKADEFIAKFREQIRLQRIESIKRSSGQIRKNTSKQT
        +RD DERGS N       EEEAG +SN+ NDGGPDVDKKADEFIAKFREQIRLQRIESIKRSSGQIRKN +KQT
Subjt:  DRDNDERGSSN-------EEEAGGSSNMANDGGPDVDKKADEFIAKFREQIRLQRIESIKRSSGQIRKNTSKQT

A0A6J1E6G0 uncharacterized protein DDB_G0284459 (e value: 7.7e-231; % identity: 78.68)
Query:  MAESEVLAPPQNQP------TPSKFHSHILYKVVTAIFFLLILPLVPSQAPEFVNQTLLTRSWELLHLLFVGIAVSYGLFSRRSDEKEDEISVSKFDNVQ
        MAES+V A P N P      TPSKF+SHILYK++ AIFFL+ILPLVPSQAPEFVNQTLLTR+WELLHLLFVGIAVSYGLFSRR+DEKEDEISVS FDNVQ
Subjt:  MAESEVLAPPQNQP------TPSKFHSHILYKVVTAIFFLLILPLVPSQAPEFVNQTLLTRSWELLHLLFVGIAVSYGLFSRRSDEKEDEISVSKFDNVQ

Query:  SYVSGLLHVSSVFDDEPETPSANDESVSSSDENKVHTWNSRYFRNESVVVAEERPVVNEQRVRSEKPLLLPVRSLKSRVVVDDESR----SKPRVSSRRL
        SYVSGLLHVSSVFDDE ETPSANDES+S SD NKV TW++RYFRNESV V+EE PVVNEQRVRSEKPLLLPVRSLKSRVVVDDESR    S  RVSSRRL
Subjt:  SYVSGLLHVSSVFDDEPETPSANDESVSSSDENKVHTWNSRYFRNESVVVAEERPVVNEQRVRSEKPLLLPVRSLKSRVVVDDESR----SKPRVSSRRL

Query:  LSNLKRSSNGEFGGGNLEEIEDKLNENVVLPSPVPWRSRSGRMDVQEEADNP-------SMEDSESNRFDSRSPRPQTSRSSRASAIS---QKLSPSPSP
        LS+ KRSSNGE GG NL  +ED  NENV LPSPVPWRSRSGR +VQEEADNP        ME+SESN  DSRS RPQTSRSS+ASAI       SPSPSP
Subjt:  LSNLKRSSNGEFGGGNLEEIEDKLNENVVLPSPVPWRSRSGRMDVQEEADNP-------SMEDSESNRFDSRSPRPQTSRSSRASAIS---QKLSPSPSP

Query:  SPRKPSPSHNVSPESQAKSAEDLVRKKSFYRSPPPPPPPPPPPHVRRISSMKPSSWLNENDVSHQKELRRSF-TSKPRSIIRDTGNDIDMMNGGNSSAEA
        SPRKPSPS NVSPE +AKS+E  VRKKSF+ SPPPPPPPPPPPHVRRI+SMKPSSWLN+NDV HQK+L+RS  TSKPRS IR TG+DIDM+ G NSSAEA
Subjt:  SPRKPSPSHNVSPESQAKSAEDLVRKKSFYRSPPPPPPPPPPPHVRRISSMKPSSWLNENDVSHQKELRRSF-TSKPRSIIRDTGNDIDMMNGGNSSAEA

Query:  LPRNYVDSLSMGKSVRTIRPGEVVNEPPRRGREFGGNDQMKGTMMDENTHVQDFEDNPIEFPDEDKEELVEKLTMDTDDDDDMESEEENNNMVGQFIRED
        LPRNY DSLSMGKS R IRPGEV NEPPRRGREFGG DQ+KG M+D+N HVQ FE+NPIEFP+++K+ELVEKL+M+T  DDDMES+EE+NNMVG+FIRED
Subjt:  LPRNYVDSLSMGKSVRTIRPGEVVNEPPRRGREFGGNDQMKGTMMDENTHVQDFEDNPIEFPDEDKEELVEKLTMDTDDDDDMESEEENNNMVGQFIRED

Query:  NGEPFNVKRRDRDNDERGSSNEEEAGGSSNMANDGGPDVDKKADEFIAKFREQIRLQRIESIKRSSGQIRKNTSKQT
        NGEPFNV RRD   +ER SSNE EAG SSN++NDGGPDVDKKADEFIAKFREQIRLQRIESIKRS+GQIR+NTSKQT
Subjt:  NGEPFNVKRRDRDNDERGSSNEEEAGGSSNMANDGGPDVDKKADEFIAKFREQIRLQRIESIKRSSGQIRKNTSKQT

A0A6J1L1K4 uncharacterized protein DDB_G0284459-like (e value: 4.8e-233; % identity: 80.17)
Query:  MAESEVLAPPQNQP------TPSKFHSHILYKVVTAIFFLLILPLVPSQAPEFVNQTLLTRSWELLHLLFVGIAVSYGLFSRRSDEKEDEISVSKFDNVQ
        MAES+V A P N P      TPSKF+SHILYK++ AIFFL+ILPLVPSQAPEFVNQTLLTR+WELLHLLFVGIAVSYGLFSRR+DEKED ISVS FDNVQ
Subjt:  MAESEVLAPPQNQP------TPSKFHSHILYKVVTAIFFLLILPLVPSQAPEFVNQTLLTRSWELLHLLFVGIAVSYGLFSRRSDEKEDEISVSKFDNVQ

Query:  SYVSGLLHVSSVFDDEPETPSANDESVSSSDENKVHTWNSRYFRNESVVVAEERPVVNEQRVRSEKPLLLPVRSLKSRVVVDDESR----SKPRVSSRRL
        SYVSGLLHVSSVFDDE ETPSANDES+SSSD NKV TW++RYFRNES+VVAEE PVVNEQRVRSEKPLLLPVRSL S+VVVDDESR    S  RVSS RL
Subjt:  SYVSGLLHVSSVFDDEPETPSANDESVSSSDENKVHTWNSRYFRNESVVVAEERPVVNEQRVRSEKPLLLPVRSLKSRVVVDDESR----SKPRVSSRRL

Query:  LSNLKRSSNGEFGGGNLEEIEDKLNENVVLPSPVPWRSRSGRMDVQEEADNP-------SMEDSESNRFDSRSPRPQTSRSSRASAISQKLS-PSPSPSP
        LSN KRSSNGEFGG +LE IED LNENVVLPSPVPWRSRSGR +VQEEADNP        ME+SESN  DSRS RPQTSRS +ASAI  KLS PSPSP P
Subjt:  LSNLKRSSNGEFGGGNLEEIEDKLNENVVLPSPVPWRSRSGRMDVQEEADNP-------SMEDSESNRFDSRSPRPQTSRSSRASAISQKLS-PSPSPSP

Query:  RKPSPSHNVSPESQAKSAEDLVRKKSFYRSPPPPPPPPPPPHVRRISSMKPSSWLNENDVSHQKELRRSF-TSKPRSIIRDTGNDIDMMNGGNSSAEALP
        RKPSPS NVSPE +AKS+ED VRKKSF+ SPPPPPPPPPPPHVRRI+SMKPSS LN+NDV HQK+L+RS  TSKPR  IRDTG+DIDM+ G NSSAEALP
Subjt:  RKPSPSHNVSPESQAKSAEDLVRKKSFYRSPPPPPPPPPPPHVRRISSMKPSSWLNENDVSHQKELRRSF-TSKPRSIIRDTGNDIDMMNGGNSSAEALP

Query:  RNYVDSLSMGKSVRTIRPGEVVNEPPRRGREFGGNDQMKGTMMDENTHVQDFEDNPIEFPDEDKEELVEKLTMDTDDDDDMESEEENNNMVGQFIREDNG
        RNY D LSMGKS+R IRPGEV NEP RRGREFGGNDQ+KG M+D+NTHVQ FE+NPIEFPD+DK+E VEKL M+TDDDDDMESEEE+NNMVG+FIREDNG
Subjt:  RNYVDSLSMGKSVRTIRPGEVVNEPPRRGREFGGNDQMKGTMMDENTHVQDFEDNPIEFPDEDKEELVEKLTMDTDDDDDMESEEENNNMVGQFIREDNG

Query:  EPFNVKRRDRDNDERGSSNEEEAGGSSNMANDGGPDVDKKADEFIAKFREQIRLQRIESIKRSSGQIRKNTSKQT
        EPFNV RRD   +ER SSN EEAGGSSN++NDGGPDVDKKADEFIAKFREQIRLQRIESIKRS+GQIR+NTSKQ+
Subjt:  EPFNVKRRDRDNDERGSSNEEEAGGSSNMANDGGPDVDKKADEFIAKFREQIRLQRIESIKRSSGQIRKNTSKQT

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT3G60380.1 FUNCTIONS IN: molecular_function unknown (e value: 1.9e-27; % identity: 34.76)
Query:  KVVTAIFFLLILPLVPSQAPEFVNQTLLTRSWELLHLLFVGIAVSYGLFSRRSDEKEDEISVSKFDNVQ-SYVSGLLHVSSVFDDEPETPSA------ND
        K V    FLL LPL PSQAP+FV +T+LT+ WEL+HLLFVGIAV+YGLFSRR+ E   ++ +++ D    SYVS +  VSSVFD+E +  S       +D
Subjt:  KVVTAIFFLLILPLVPSQAPEFVNQTLLTRSWELLHLLFVGIAVSYGLFSRRSDEKEDEISVSKFDNVQ-SYVSGLLHVSSVFDDEPETPSA------ND

Query:  ESVSS------------------------SDENKVHTWNSRYFRNESVVVAEERPVVNEQRVRSEKPLLLPVRSLKSRVVVDDESRSKPRVSSRRLLSNL
        ESVS+                         + N+V  WNS+YF+ +S VV   RP          +PL LP+R L+S +      R    +  +    + 
Subjt:  ESVSS------------------------SDENKVHTWNSRYFRNESVVVAEERPVVNEQRVRSEKPLLLPVRSLKSRVVVDDESRSKPRVSSRRLLSNL

Query:  KRSSNGEFGGGNLEEIEDKLNENVVLP-SPVPWRSRSGRMDVQEEADNPSMEDSESNRFDSRSPRPQTSRSSRASAISQKLSPSPSPSPRKPSPSHNVSP
          + N E      +   D   E +  P SPVPW++R   M + +   +     S      S S R   S SS+ S  SQ        +  + SPS +VS 
Subjt:  KRSSNGEFGGGNLEEIEDKLNENVVLP-SPVPWRSRSGRMDVQEEADNPSMEDSESNRFDSRSPRPQTSRSSRASAISQKLSPSPSPSPRKPSPSHNVSP

Query:  ESQAKSAEDLVRKK---SFYRSPPPPPPPPPPPHVRRISSMKPSSWLNEND
        ES   + E+LV++K   S  RS  P  PP P      +S   PS  L  ND
Subjt:  ESQAKSAEDLVRKK---SFYRSPPPPPPPPPPPHVRRISSMKPSSWLNEND

AT3G60380.1 FUNCTIONS IN: molecular_function unknown (e value: 9.3e-03; % identity: 52.17)
Query:  EEEAGGSSNMANDGGPDVDKKADEFIAKFREQIRLQRIESIKRSSG
        EE A  S + A+    +VD+KA EFIAKFREQIRLQ++ S ++  G
Subjt:  EEEAGGSSNMANDGGPDVDKKADEFIAKFREQIRLQRIESIKRSSG

AT4G16790.1 hydroxyproline-rich glycoprotein family protein (e value: 2.2e-36; % identity: 32.34)
Query:  MAESEVLAPP-----QNQPTPSKFHSHILYKVVTAIFFLLILPLVPSQAPEFVNQTLLTRSWELLHLLFVGIAVSYGLFSRRSDEKEDEISVSKFD----
        M E+  L  P     +    P KF+S  ++K +       ++P+  SQ PE  NQ   TR  ELLHL+FVGIAVSYGLFSRR+ +       S  D    
Subjt:  MAESEVLAPP-----QNQPTPSKFHSHILYKVVTAIFFLLILPLVPSQAPEFVNQTLLTRSWELLHLLFVGIAVSYGLFSRRSDEKEDEISVSKFD----

Query:  -----NVQSYVSGLLHVSSVFDDEPETPSANDESVSSSDENKVHTWNSRYFRNESVVVAEERPVVNEQRVRSEKPLLLPVRSLKSRVVVD---DESRSKP
             N  SYV  +L VSSVF+   E+ S   +  SS D+ K  TW ++Y  +  +   E R V        EKPLLLPVRSL    V D   D S    
Subjt:  -----NVQSYVSGLLHVSSVFDDEPETPSANDESVSSSDENKVHTWNSRYFRNESVVVAEERPVVNEQRVRSEKPLLLPVRSLKSRVVVD---DESRSKP

Query:  RVSSRRLLSNLKRSSNGEFGGGNLEEIEDKLNENVVLPSPVPWRSRSGRMDVQEEADNPSMEDSESNRFDSRSPRPQTSRSSRASAISQKLSPSPSPSPR
        +V S+R L       N +                 VLPSP+PWRSRS            S   S S   +S       +       I     PS   SPR
Subjt:  RVSSRRLLSNLKRSSNGEFGGGNLEEIEDKLNENVVLPSPVPWRSRSGRMDVQEEADNPSMEDSESNRFDSRSPRPQTSRSSRASAISQKLSPSPSPSPR

Query:  KPSPSHNVSPESQAKSAEDLVRKKSFYRSPPPPPPPPPPPHVRRISSMKPSSWLNENDVSHQKELRRSFTSKPRSIIRDTGNDIDMMNGGNSSAEALPRN
        K +P  N++ E              F+ SPPPPPPPPPP     + +   SS   ++   ++ E R S   K +              GG       P  
Subjt:  KPSPSHNVSPESQAKSAEDLVRKKSFYRSPPPPPPPPPPPHVRRISSMKPSSWLNENDVSHQKELRRSFTSKPRSIIRDTGNDIDMMNGGNSSAEALPRN

Query:  YVDSLSMGKSVRTIRPGEVVNEPPRRGREFGGNDQMKGTMMDENTHVQDFEDNPIEFPDEDKEELVEKLTMDTDDDDDMESEEENNNMVGQFIRE-DNGE
                       P E    PP + R      +     M  N   + +  +PI    E KE+             D E  ++ +N+  + + E +NGE
Subjt:  YVDSLSMGKSVRTIRPGEVVNEPPRRGREFGGNDQMKGTMMDENTHVQDFEDNPIEFPDEDKEELVEKLTMDTDDDDDMESEEENNNMVGQFIRE-DNGE

Query:  PFNVKRRDRDNDERGSSNEEEAGGSSNMANDGGPDVDKKADEFIAKFREQIRLQRIESIKRSSGQIRKNTSK
            +    D  E+    EE   G S + N  G DVDKKADEFIAKFREQIRLQRIESIKRS+ +I  N+S+
Subjt:  PFNVKRRDRDNDERGSSNEEEAGGSSNMANDGGPDVDKKADEFIAKFREQIRLQRIESIKRSSGQIRKNTSK
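
The % identity values in the hit lists can be checked against the middle line of each alignment. The sketch below uses the short second AT3G60380.1 alignment from the Arabidopsis hits above. It assumes the usual BLAST midline convention (a letter marks an identical residue, `+` a conservative substitution, a blank a mismatch or gap) and that the alignment length equals the length of the gapped query row; `percent_identity` is a hypothetical helper name.

```python
# Query row and midline copied from the second AT3G60380.1 alignment.
query = "EEEAGGSSNMANDGGPDVDKKADEFIAKFREQIRLQRIESIKRSSG"
midline = "EE A  S + A+    +VD+KA EFIAKFREQIRLQ++ S ++  G"


def percent_identity(query_row: str, midline_row: str) -> float:
    """Percent identity = identical columns / alignment columns * 100."""
    # Assumption: letters in the midline mark identical residues.
    identities = sum(ch.isalpha() for ch in midline_row)
    # Assumption: the gapped query row has one character per alignment column.
    return 100.0 * identities / len(query_row)


print(round(percent_identity(query, midline), 2))  # 52.17
```

For alignments that wrap over several rows, the query rows and the midline rows would each be concatenated before the call.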


Sequences
CDS sequence
ATGGCGGAATCAGAGGTTCTCGCGCCACCGCAAAATCAACCTACTCCAAGTAAGTTCCATAGCCATATCTTGTATAAAGTTGTAACCGCCATTTTCTTTCTACTCATTCT
ACCTTTAGTTCCTTCCCAAGCCCCTGAGTTTGTTAATCAAACTTTACTCACCAGAAGCTGGGAGCTTCTGCATCTTCTTTTTGTCGGAATTGCTGTTTCTTACGGCCTTT
TTAGCCGGAGAAGTGATGAAAAAGAAGATGAAATTAGTGTATCTAAGTTTGATAATGTTCAATCTTATGTTTCTGGATTGCTTCATGTTTCGTCTGTTTTTGATGATGAG
CCTGAAACTCCGTCTGCTAATGATGAATCGGTGTCTTCGTCTGATGAAAATAAGGTCCATACATGGAATAGTCGGTATTTTAGGAATGAATCTGTGGTTGTTGCTGAAGA
ACGTCCTGTTGTTAATGAGCAGAGAGTTAGGAGTGAGAAGCCTTTGCTTCTCCCTGTTCGTAGCTTGAAGTCTCGTGTTGTTGTAGACGATGAGTCTAGATCTAAGCCGA
GAGTGAGTTCGAGAAGATTATTGAGCAATTTGAAGAGGAGTTCGAATGGAGAGTTTGGGGGAGGGAATCTTGAAGAAATTGAGGATAAGTTGAATGAAAATGTTGTTCTT
CCATCTCCGGTTCCATGGCGATCGAGATCGGGGAGGATGGATGTGCAAGAAGAAGCTGATAATCCTTCCATGGAGGATTCTGAATCGAATAGGTTCGATTCTAGGTCTCC
TAGGCCTCAAACTTCAAGGTCTTCCCGAGCCAGTGCCATTTCTCAGAAGCTATCTCCTTCTCCATCTCCATCTCCAAGGAAACCATCTCCTTCCCATAATGTGTCACCAG
AATCACAGGCCAAGAGTGCTGAGGATTTAGTGAGGAAAAAGAGCTTCTACCGGTCTCCTCCACCTCCGCCGCCACCTCCGCCCCCGCCACATGTTCGAAGAATTTCCTCA
ATGAAACCAAGTTCATGGTTGAACGAGAATGATGTATCTCATCAAAAGGAATTGAGAAGAAGCTTCACTAGCAAGCCCAGAAGTATAATTCGTGATACAGGAAATGATAT
TGATATGATGAATGGTGGTAATTCAAGTGCTGAAGCTTTGCCTAGAAATTATGTTGATAGTCTATCAATGGGAAAATCTGTTAGAACAATCAGACCTGGGGAAGTTGTGA
ATGAGCCACCAAGGAGAGGGAGAGAATTTGGTGGGAATGATCAAATGAAGGGGACGATGATGGATGAAAATACCCATGTCCAAGATTTTGAAGATAACCCCATTGAGTTT
CCAGATGAAGATAAAGAAGAATTGGTGGAAAAGCTAACCATGGACACCGACGACGACGATGACATGGAAAGCGAGGAAGAAAACAATAACATGGTGGGGCAGTTTATTAG
GGAAGACAATGGAGAACCTTTTAATGTGAAACGAAGAGATAGAGACAACGACGAAAGAGGTTCGAGTAATGAAGAAGAAGCAGGAGGCTCTAGTAATATGGCTAATGATG
GAGGACCAGATGTAGATAAGAAGGCTGATGAGTTCATTGCCAAATTCAGAGAGCAAATCAGGCTTCAAAGGATTGAATCAATCAAAAGATCAAGTGGACAAATTCGTAAA
AACACTTCAAAGCAAACTTGA
mRNA sequence
ATGGCGGAATCAGAGGTTCTCGCGCCACCGCAAAATCAACCTACTCCAAGTAAGTTCCATAGCCATATCTTGTATAAAGTTGTAACCGCCATTTTCTTTCTACTCATTCT
ACCTTTAGTTCCTTCCCAAGCCCCTGAGTTTGTTAATCAAACTTTACTCACCAGAAGCTGGGAGCTTCTGCATCTTCTTTTTGTCGGAATTGCTGTTTCTTACGGCCTTT
TTAGCCGGAGAAGTGATGAAAAAGAAGATGAAATTAGTGTATCTAAGTTTGATAATGTTCAATCTTATGTTTCTGGATTGCTTCATGTTTCGTCTGTTTTTGATGATGAG
CCTGAAACTCCGTCTGCTAATGATGAATCGGTGTCTTCGTCTGATGAAAATAAGGTCCATACATGGAATAGTCGGTATTTTAGGAATGAATCTGTGGTTGTTGCTGAAGA
ACGTCCTGTTGTTAATGAGCAGAGAGTTAGGAGTGAGAAGCCTTTGCTTCTCCCTGTTCGTAGCTTGAAGTCTCGTGTTGTTGTAGACGATGAGTCTAGATCTAAGCCGA
GAGTGAGTTCGAGAAGATTATTGAGCAATTTGAAGAGGAGTTCGAATGGAGAGTTTGGGGGAGGGAATCTTGAAGAAATTGAGGATAAGTTGAATGAAAATGTTGTTCTT
CCATCTCCGGTTCCATGGCGATCGAGATCGGGGAGGATGGATGTGCAAGAAGAAGCTGATAATCCTTCCATGGAGGATTCTGAATCGAATAGGTTCGATTCTAGGTCTCC
TAGGCCTCAAACTTCAAGGTCTTCCCGAGCCAGTGCCATTTCTCAGAAGCTATCTCCTTCTCCATCTCCATCTCCAAGGAAACCATCTCCTTCCCATAATGTGTCACCAG
AATCACAGGCCAAGAGTGCTGAGGATTTAGTGAGGAAAAAGAGCTTCTACCGGTCTCCTCCACCTCCGCCGCCACCTCCGCCCCCGCCACATGTTCGAAGAATTTCCTCA
ATGAAACCAAGTTCATGGTTGAACGAGAATGATGTATCTCATCAAAAGGAATTGAGAAGAAGCTTCACTAGCAAGCCCAGAAGTATAATTCGTGATACAGGAAATGATAT
TGATATGATGAATGGTGGTAATTCAAGTGCTGAAGCTTTGCCTAGAAATTATGTTGATAGTCTATCAATGGGAAAATCTGTTAGAACAATCAGACCTGGGGAAGTTGTGA
ATGAGCCACCAAGGAGAGGGAGAGAATTTGGTGGGAATGATCAAATGAAGGGGACGATGATGGATGAAAATACCCATGTCCAAGATTTTGAAGATAACCCCATTGAGTTT
CCAGATGAAGATAAAGAAGAATTGGTGGAAAAGCTAACCATGGACACCGACGACGACGATGACATGGAAAGCGAGGAAGAAAACAATAACATGGTGGGGCAGTTTATTAG
GGAAGACAATGGAGAACCTTTTAATGTGAAACGAAGAGATAGAGACAACGACGAAAGAGGTTCGAGTAATGAAGAAGAAGCAGGAGGCTCTAGTAATATGGCTAATGATG
GAGGACCAGATGTAGATAAGAAGGCTGATGAGTTCATTGCCAAATTCAGAGAGCAAATCAGGCTTCAAAGGATTGAATCAATCAAAAGATCAAGTGGACAAATTCGTAAA
AACACTTCAAAGCAAACTTGA
Protein sequence
MAESEVLAPPQNQPTPSKFHSHILYKVVTAIFFLLILPLVPSQAPEFVNQTLLTRSWELLHLLFVGIAVSYGLFSRRSDEKEDEISVSKFDNVQSYVSGLLHVSSVFDDE
PETPSANDESVSSSDENKVHTWNSRYFRNESVVVAEERPVVNEQRVRSEKPLLLPVRSLKSRVVVDDESRSKPRVSSRRLLSNLKRSSNGEFGGGNLEEIEDKLNENVVL
PSPVPWRSRSGRMDVQEEADNPSMEDSESNRFDSRSPRPQTSRSSRASAISQKLSPSPSPSPRKPSPSHNVSPESQAKSAEDLVRKKSFYRSPPPPPPPPPPPHVRRISS
MKPSSWLNENDVSHQKELRRSFTSKPRSIIRDTGNDIDMMNGGNSSAEALPRNYVDSLSMGKSVRTIRPGEVVNEPPRRGREFGGNDQMKGTMMDENTHVQDFEDNPIEF
PDEDKEELVEKLTMDTDDDDDMESEEENNNMVGQFIREDNGEPFNVKRRDRDNDERGSSNEEEAGGSSNMANDGGPDVDKKADEFIAKFREQIRLQRIESIKRSSGQIRK
NTSKQT
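
The protein sequence corresponds to a translation of the CDS. The following is a minimal sketch of that step, assuming the standard genetic code; `translate` is a hypothetical helper, and only the first 60 and the last 21 nucleotides of the CDS above are used to keep the example short.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First 60 nt and last 21 nt of the CDS sequence listed above.
cds_start = "ATGGCGGAATCAGAGGTTCTCGCGCCACCGCAAAATCAACCTACTCCAAGTAAGTTCCAT"
cds_end = "AACACTTCAAAGCAAACTTGA"

print(translate(cds_start))  # MAESEVLAPPQNQPTPSKFH
print(translate(cds_end))    # NTSKQT*
```

Both fragments agree with the start and end of the protein sequence, with the final TGA read as the stop codon.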