CuGenDBv2

Moc03g29800 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc03g29800
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Unknown protein
Genome location: chr3:21264655..21268186
RNA-Seq Expression: Moc03g29800
Synteny: Moc03g29800
Gene Ontology terms: NA
InterPro domains: NA
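
The genome location field can be split into its parts for downstream use. The following is a minimal sketch, not part of the database record: the function name `parse_location` is hypothetical, and it assumes the `name:start..end` layout shown above with 1-based, inclusive coordinates (the record itself does not state the coordinate convention).

```python
import re

# Assumed format: "<sequence name>:<start>..<end>", as in the record above.
LOCATION_RE = re.compile(r"([^:\s]+):(\d+)\.\.(\d+)")


def parse_location(location):
    """Split a location string into (sequence, start, end, span_length)."""
    match = LOCATION_RE.fullmatch(location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    seq_name = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    if end < start:
        raise ValueError("end coordinate is smaller than start")
    # Assumption: coordinates are 1-based and inclusive, so the span
    # covers end - start + 1 bases.
    return seq_name, start, end, end - start + 1


print(parse_location("chr3:21264655..21268186"))
```

Under the inclusive-coordinate assumption, the span reported for this gene is 3,532 bp.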


Homology
GenBank top hits | e value | % identity | Alignment
KAG6573113.1 hypothetical protein SDJN03_27000, partial [Cucurbita argyrosperma subsp. sororia] | 1.5e-260 | 83.02
Query:  MASSAFKSTTKRTPIGAPAASGDDSTSTNRSSIHRRSRSLSRFSHPMPSSPVDKAFDEAPARRGRFVNTSRGSGFPEISLDDLAVEFFGSGDRGRSAARN
        MASSAFKSTTKRTPIGA  AS DDS STNRSSIHRRSRSLSRFSHP+PSSPVDK F E  ARRGRFVNTSRGS FPEISLDDLAVEFFGSGDRGRSAAR+
Subjt:  MASSAFKSTTKRTPIGAPAASGDDSTSTNRSSIHRRSRSLSRFSHPMPSSPVDKAFDEAPARRGRFVNTSRGSGFPEISLDDLAVEFFGSGDRGRSAARN

Query:  SESSGATNVAAASHRRGRSVSRHGCSKTSVSGSDGKGRPNYSVGGEKVVPETNSRRRRSVSVVRYQISDSESDLDRSQNSGTRVKEKSFGIGNKQKPIPH
        SESS A +  A+S RRGRSVSRHG +KTS  GSDGKG+ +YSVGG K+VPE+NSRRRRSVSVVRYQISDSESDLD+SQ+SGTR++EKS+G GNKQKP+ H
Subjt:  SESSGATNVAAASHRRGRSVSRHGCSKTSVSGSDGKGRPNYSVGGEKVVPETNSRRRRSVSVVRYQISDSESDLDRSQNSGTRVKEKSFGIGNKQKPIPH

Query:  KADDSNRRPTLRRSLSQNDFKCHDGYSSQSSVLTDDEGKDAYFGNNGVEKTIRTIYARKVKQANGDVVDNGLYEAMRKELRHAVEEIRVELEQEMVNRNS
        KAD+SNRRP LRRSLSQNDFKCHDGYSSQSSVLTDDEGKDAYF NN  EKTIRTI ARK KQ NGDVVDNGLYEAMRKELRHAVEEIRVELEQEMVNRNS
Subjt:  KADDSNRRPTLRRSLSQNDFKCHDGYSSQSSVLTDDEGKDAYFGNNGVEKTIRTIYARKVKQANGDVVDNGLYEAMRKELRHAVEEIRVELEQEMVNRNS

Query:  SVGTFSDDLNSNDSGVLQQTSPFTRNYSTKQEQSEKRRDSLAKMVLEEQRGQQLPKMVKNLPSDLKNVVAENSPRIRKRSNDRNRMSKRLSEEAEKYIED
        SVGT SDDL+S+DSGV Q+TSPF RNYSTKQEQS+KRRDSLAKMVL+EQR Q+LPK VKN   DL NVVAENS RIRKRSNDR+RMSKRL+EEAEKYIED
Subjt:  SVGTFSDDLNSNDSGVLQQTSPFTRNYSTKQEQSEKRRDSLAKMVLEEQRGQQLPKMVKNLPSDLKNVVAENSPRIRKRSNDRNRMSKRLSEEAEKYIED

Query:  FISNVEDTDISSLDGDRSDTSSSLGGKTKPNFKVPAVSKYVPPGMDGVLLPWLQWETSNDASSYPRKNTSEPPMTPQTFPWDVNQESTNGQDQSNHSGSS
        FISNVEDTDISSLDGDRSDTSSSLGGKTKPNF++PA SK VPPGMDGVLLPWLQWETSNDA+ YPRKNT  P +TPQ FPWDVNQE++N QD  NHSGSS
Subjt:  FISNVEDTDISSLDGDRSDTSSSLGGKTKPNFKVPAVSKYVPPGMDGVLLPWLQWETSNDASSYPRKNTSEPPMTPQTFPWDVNQESTNGQDQSNHSGSS

Query:  QGSWSPGVGIGVCGKVVEDLGSRFKKAGNYQNQSYLEPRD-QSRFDIEEYLKRPSSEDFLLERWKQQHRTNCSGLLLCNRVFL
        QGSWSPGV   + GKVVEDLGSRFKK G YQNQSYLE R+ +  FDI+EYLKRPS+E+FLLERWKQQH+ N  GLLLCN VFL
Subjt:  QGSWSPGVGIGVCGKVVEDLGSRFKKAGNYQNQSYLEPRD-QSRFDIEEYLKRPSSEDFLLERWKQQHRTNCSGLLLCNRVFL

XP_008439429.1 PREDICTED: uncharacterized protein LOC103484238 isoform X1 [Cucumis melo] | 3.8e-259 | 82.33
Query:  MASSAFKSTTKRTPIGAPAASGDDSTSTNRSSIHRRSRSLSRFSHPMPSSPVDKAFDEAPAR-RGRFVNTSRGSGFPEISLDDLAVEFFGSGDRGRSAAR
        MASSAFKSTTKRTPIGA   S DDSTSTNR S HRRSRSLSRFSHP+PSSP+DK F EA A  RGRFVNTSRGSGFPEISLDDLAVEFFGS DRGRS  R
Subjt:  MASSAFKSTTKRTPIGAPAASGDDSTSTNRSSIHRRSRSLSRFSHPMPSSPVDKAFDEAPAR-RGRFVNTSRGSGFPEISLDDLAVEFFGSGDRGRSAAR

Query:  NSESSGATNVAAASHRRGRSVSRHGCSKTSVSGSDGKGRPNYSVGGEKVVPETNSRRRRSVSVVRYQISDSESDLDRSQNSGTRVKEKSFGIGNKQKPIP
        +SE SGA N + AS+RRGRSVSRHG  KTS  GS+ KGR   SV G KVVPE+NSRRRRS+SVVRYQISDSESD DRSQ+SGTRV+EKSFGIGNKQKPI 
Subjt:  NSESSGATNVAAASHRRGRSVSRHGCSKTSVSGSDGKGRPNYSVGGEKVVPETNSRRRRSVSVVRYQISDSESDLDRSQNSGTRVKEKSFGIGNKQKPIP

Query:  HKADDSNRRPTLRRSLSQNDFKCHDGYSSQSSVLTDDEGKDAYFGNNGVEKTIRTIYARKVKQANGDVVDNGLYEAMRKELRHAVEEIRVELEQEMVNRN
        HKADDSNRRPTLRRSLSQNDFKCHDGYSS SSVLTDDEGKDA+FGN+ +EKT+R+IYARK KQANG VVD+GLYEAMRKELRHAVEEIRVELEQEMVNRN
Subjt:  HKADDSNRRPTLRRSLSQNDFKCHDGYSSQSSVLTDDEGKDAYFGNNGVEKTIRTIYARKVKQANGDVVDNGLYEAMRKELRHAVEEIRVELEQEMVNRN

Query:  SSVGTFSDDLNSNDSGVLQQTSPFTRNYSTKQEQSEKRRDSLAKMVLEEQRGQQLPKMVKNLPSDLKNVVAENSPRIRKRSNDRNRMSKRLSEEAEKYIE
        SSV TFSDDL+S+DSGV   TSPFTRNYS KQEQSEKRRDSL KMV+E+QRGQ LPKMVKNLP DLKNVVA+NS R RKRS DR+RMSKRLSEEAEKYIE
Subjt:  SSVGTFSDDLNSNDSGVLQQTSPFTRNYSTKQEQSEKRRDSLAKMVLEEQRGQQLPKMVKNLPSDLKNVVAENSPRIRKRSNDRNRMSKRLSEEAEKYIE

Query:  DFISNVEDTDISSLDGDRSDTSSSLGGKTKPNFKVPAVSKYVPPGMDGVLLPWLQWETSNDASSYPRKNTSEPPMTPQTFPWDVNQESTNGQDQSNHSGS
        DFISNVEDTDISSLDGDRSDTSSSLGGK KPNFK+PA  +YVPPGMDGVLLPWLQWETSNDA+ YPRKN  EPP TPQT PWDVNQ+++N  D  NHSGS
Subjt:  DFISNVEDTDISSLDGDRSDTSSSLGGKTKPNFKVPAVSKYVPPGMDGVLLPWLQWETSNDASSYPRKNTSEPPMTPQTFPWDVNQESTNGQDQSNHSGS

Query:  SQGSWSPGVGIGVCGKVVEDLGSRFKKAGNYQNQSYLEPRDQSRFDIEEYLKRPSSEDFLLERWKQQHRTNCSGLLLCNRVFL
        SQGSWSPGV IG+ GKVVED+GSRFK+ GN Q QSY E R +SRFDI+EYLKRPS+EDFLLERWKQQH+  CSG+LLCNRVFL
Subjt:  SQGSWSPGVGIGVCGKVVEDLGSRFKKAGNYQNQSYLEPRDQSRFDIEEYLKRPSSEDFLLERWKQQHRTNCSGLLLCNRVFL

XP_022137536.1 uncharacterized protein LOC111008961 [Momordica charantia] | 0.0e+00 | 100
Query:  MASSAFKSTTKRTPIGAPAASGDDSTSTNRSSIHRRSRSLSRFSHPMPSSPVDKAFDEAPARRGRFVNTSRGSGFPEISLDDLAVEFFGSGDRGRSAARN
        MASSAFKSTTKRTPIGAPAASGDDSTSTNRSSIHRRSRSLSRFSHPMPSSPVDKAFDEAPARRGRFVNTSRGSGFPEISLDDLAVEFFGSGDRGRSAARN
Subjt:  MASSAFKSTTKRTPIGAPAASGDDSTSTNRSSIHRRSRSLSRFSHPMPSSPVDKAFDEAPARRGRFVNTSRGSGFPEISLDDLAVEFFGSGDRGRSAARN

Query:  SESSGATNVAAASHRRGRSVSRHGCSKTSVSGSDGKGRPNYSVGGEKVVPETNSRRRRSVSVVRYQISDSESDLDRSQNSGTRVKEKSFGIGNKQKPIPH
        SESSGATNVAAASHRRGRSVSRHGCSKTSVSGSDGKGRPNYSVGGEKVVPETNSRRRRSVSVVRYQISDSESDLDRSQNSGTRVKEKSFGIGNKQKPIPH
Subjt:  SESSGATNVAAASHRRGRSVSRHGCSKTSVSGSDGKGRPNYSVGGEKVVPETNSRRRRSVSVVRYQISDSESDLDRSQNSGTRVKEKSFGIGNKQKPIPH

Query:  KADDSNRRPTLRRSLSQNDFKCHDGYSSQSSVLTDDEGKDAYFGNNGVEKTIRTIYARKVKQANGDVVDNGLYEAMRKELRHAVEEIRVELEQEMVNRNS
        KADDSNRRPTLRRSLSQNDFKCHDGYSSQSSVLTDDEGKDAYFGNNGVEKTIRTIYARKVKQANGDVVDNGLYEAMRKELRHAVEEIRVELEQEMVNRNS
Subjt:  KADDSNRRPTLRRSLSQNDFKCHDGYSSQSSVLTDDEGKDAYFGNNGVEKTIRTIYARKVKQANGDVVDNGLYEAMRKELRHAVEEIRVELEQEMVNRNS

Query:  SVGTFSDDLNSNDSGVLQQTSPFTRNYSTKQEQSEKRRDSLAKMVLEEQRGQQLPKMVKNLPSDLKNVVAENSPRIRKRSNDRNRMSKRLSEEAEKYIED
        SVGTFSDDLNSNDSGVLQQTSPFTRNYSTKQEQSEKRRDSLAKMVLEEQRGQQLPKMVKNLPSDLKNVVAENSPRIRKRSNDRNRMSKRLSEEAEKYIED
Subjt:  SVGTFSDDLNSNDSGVLQQTSPFTRNYSTKQEQSEKRRDSLAKMVLEEQRGQQLPKMVKNLPSDLKNVVAENSPRIRKRSNDRNRMSKRLSEEAEKYIED

Query:  FISNVEDTDISSLDGDRSDTSSSLGGKTKPNFKVPAVSKYVPPGMDGVLLPWLQWETSNDASSYPRKNTSEPPMTPQTFPWDVNQESTNGQDQSNHSGSS
        FISNVEDTDISSLDGDRSDTSSSLGGKTKPNFKVPAVSKYVPPGMDGVLLPWLQWETSNDASSYPRKNTSEPPMTPQTFPWDVNQESTNGQDQSNHSGSS
Subjt:  FISNVEDTDISSLDGDRSDTSSSLGGKTKPNFKVPAVSKYVPPGMDGVLLPWLQWETSNDASSYPRKNTSEPPMTPQTFPWDVNQESTNGQDQSNHSGSS

Query:  QGSWSPGVGIGVCGKVVEDLGSRFKKAGNYQNQSYLEPRDQSRFDIEEYLKRPSSEDFLLERWKQQHRTNCSGLLLCNRVFL
        QGSWSPGVGIGVCGKVVEDLGSRFKKAGNYQNQSYLEPRDQSRFDIEEYLKRPSSEDFLLERWKQQHRTNCSGLLLCNRVFL
Subjt:  QGSWSPGVGIGVCGKVVEDLGSRFKKAGNYQNQSYLEPRDQSRFDIEEYLKRPSSEDFLLERWKQQHRTNCSGLLLCNRVFL

XP_022994153.1 uncharacterized protein LOC111489977 [Cucurbita maxima] | 1.8e-261 | 82.85
Query:  MASSAFKSTTKRTPIGAPAASGDDSTSTNRSSIHRRSRSLSRFSHPMPSSPVDKAFDEAPARRGRFVNTSRGSGFPEISLDDLAVEFFGSGDRGRSAARN
        MASSAFKSTTKRTPIGA  +S DDS STNRSSIHRRSRSLSRFSHP+PSSPVDK F E  ARRGRFVNTSRGSGFPEISLDDLAVEFFGSGDRGRSAAR+
Subjt:  MASSAFKSTTKRTPIGAPAASGDDSTSTNRSSIHRRSRSLSRFSHPMPSSPVDKAFDEAPARRGRFVNTSRGSGFPEISLDDLAVEFFGSGDRGRSAARN

Query:  SESSGATNVAAASHRRGRSVSRHGCSKTSVSGSDGKGRPNYSVGGEKVVPETNSRRRRSVSVVRYQISDSESDLDRSQNSGTRVKEKSFGIGNKQKPIPH
        SESS A +  A+S RRGRS+SRHG +KTS  GS+G+G+ +YSVGG K+VPE+NSRRRRSVSVVRYQISDSESDLD+SQ+SGT ++EKS+G GNKQKP+ H
Subjt:  SESSGATNVAAASHRRGRSVSRHGCSKTSVSGSDGKGRPNYSVGGEKVVPETNSRRRRSVSVVRYQISDSESDLDRSQNSGTRVKEKSFGIGNKQKPIPH

Query:  KADDSNRRPTLRRSLSQNDFKCHDGYSSQSSVLTDDEGKDAYFGNNGVEKTIRTIYARKVKQANGDVVDNGLYEAMRKELRHAVEEIRVELEQEMVNRNS
        KAD+SNRRP LRRSLSQNDFKCHDGYSSQSSVLTDDEGKDAYFGNN  EKTIRTI ARK KQ NGDVVDNGLYEAMRKELRHAVEEIRVELEQEMVNRNS
Subjt:  KADDSNRRPTLRRSLSQNDFKCHDGYSSQSSVLTDDEGKDAYFGNNGVEKTIRTIYARKVKQANGDVVDNGLYEAMRKELRHAVEEIRVELEQEMVNRNS

Query:  SVGTFSDDLNSNDSGVLQQTSPFTRNYSTKQEQSEKRRDSLAKMVLEEQRGQQLPKMVKNLPSDLKNVVAENSPRIRKRSNDRNRMSKRLSEEAEKYIED
        SVGT SDDL+S+DSGV Q TSPF RNYSTKQEQS+KRRDSLAKMVL+EQR Q+LPK VKN   DL NVVAENS RIRKRSNDR+RMSKRL+EEAEKYIED
Subjt:  SVGTFSDDLNSNDSGVLQQTSPFTRNYSTKQEQSEKRRDSLAKMVLEEQRGQQLPKMVKNLPSDLKNVVAENSPRIRKRSNDRNRMSKRLSEEAEKYIED

Query:  FISNVEDTDISSLDGDRSDTSSSLGGKTKPNFKVPAVSKYVPPGMDGVLLPWLQWETSNDASSYPRKNTSEPPMTPQTFPWDVNQESTNGQDQSNHSGSS
        FISNVEDTDISSLDGDRSDTSSSLGGKTKPNF++PA SK VPPGMDGVLLPWLQWETSNDA+ YPRKNT  P MTPQ FPWDVNQE++N QD  NHSGSS
Subjt:  FISNVEDTDISSLDGDRSDTSSSLGGKTKPNFKVPAVSKYVPPGMDGVLLPWLQWETSNDASSYPRKNTSEPPMTPQTFPWDVNQESTNGQDQSNHSGSS

Query:  QGSWSPGVGIGVCGKVVEDLGSRFKKAGNYQNQSYLEPRD-QSRFDIEEYLKRPSSEDFLLERWKQQHRTNCSGLLLCNRVFL
        QGSWSPGV   + GKVVEDLGSRFKK G YQNQSYLE R+ +  FDI+EYLKRPS+EDFLLERWKQQH+ N  GLLLCN VFL
Subjt:  QGSWSPGVGIGVCGKVVEDLGSRFKKAGNYQNQSYLEPRD-QSRFDIEEYLKRPSSEDFLLERWKQQHRTNCSGLLLCNRVFL

XP_038893835.1 uncharacterized protein LOC120082652 isoform X1 [Benincasa hispida] | 4.2e-266 | 84.88
Query:  MASSAFKSTTKRTPIGAPAASGDDSTSTNRSSIHRRSRSLSRFSHPMPSSPVDKAFDEAPARRGRFVNTSRGSGFPEISLDDLAVEFFGSGDRGRSAARN
        MASSAFKSTTKRTPIGA   S DDS STNRSSIHRRSRSLSRFSHP+PSSP+DK F EA A RGRFVNTSRGSGFPEISLDDLAVEFFGSGDRGRSAAR+
Subjt:  MASSAFKSTTKRTPIGAPAASGDDSTSTNRSSIHRRSRSLSRFSHPMPSSPVDKAFDEAPARRGRFVNTSRGSGFPEISLDDLAVEFFGSGDRGRSAARN

Query:  SESSGATNVAAASHRRGRSVSRHGCSKTSVSGSDGKGRPNYSVGGEKVVPETNSRRRRSVSVVRYQISDSESDLDRSQNSGTRVKEKSFGIGNKQKPIPH
         ESSGATN A AS+RRGRSVSRHG +KT+  GS+GKGR    V   KVVPE+NSRRRRS+SVVRYQISDSESDLDRSQ+SGTRVKE SFGIGNKQKPI H
Subjt:  SESSGATNVAAASHRRGRSVSRHGCSKTSVSGSDGKGRPNYSVGGEKVVPETNSRRRRSVSVVRYQISDSESDLDRSQNSGTRVKEKSFGIGNKQKPIPH

Query:  KADDSNRRPTLRRSLSQNDFKCHDGYSSQSSVLTDDEGKDAYFGNNGVEKTIRTIYARKVKQANGDVVDNGLYEAMRKELRHAVEEIRVELEQEMVNRNS
        KADDSNRRPTLRRSLSQNDFKCHDGYSS SSVLTDDEGKDAYFGN+ +EKTIR+IYARK KQANG VVDNGLYEAMRKELRHAVEEIRVELEQEMVNRNS
Subjt:  KADDSNRRPTLRRSLSQNDFKCHDGYSSQSSVLTDDEGKDAYFGNNGVEKTIRTIYARKVKQANGDVVDNGLYEAMRKELRHAVEEIRVELEQEMVNRNS

Query:  SVGTFSDDLNSNDSGVLQQTSPFTRNYSTKQEQSEKRRDSLAKMVLEEQRGQQLPKMVKNLPSDLKNVVAENSPRIRKRSNDRNRMSKRLSEEAEKYIED
        SV TFSDDL+S+DSGV Q TSPFTRNYSTKQEQSEKRR+SL KMV+EEQRGQ+LPKMVKNLP D+KN VAENS R RKRSNDR+RMSKRLSEEAEKYIED
Subjt:  SVGTFSDDLNSNDSGVLQQTSPFTRNYSTKQEQSEKRRDSLAKMVLEEQRGQQLPKMVKNLPSDLKNVVAENSPRIRKRSNDRNRMSKRLSEEAEKYIED

Query:  FISNVEDTDISSLDGDRSDTSSSLGGKTKPNFKVPAVSKYVPPGMDGVLLPWLQWETSNDASSYPRKNTSEPPMTPQTFPWDVNQESTNGQDQSNHSGSS
        FISNVEDTDISSLDGDRSDTSSSLGGK KPNFK+ A S+ VPPGMDGVLLPWLQWETSNDA+ YPRKN  EPP TPQTFPWDVNQ+++N QD  NHSGSS
Subjt:  FISNVEDTDISSLDGDRSDTSSSLGGKTKPNFKVPAVSKYVPPGMDGVLLPWLQWETSNDASSYPRKNTSEPPMTPQTFPWDVNQESTNGQDQSNHSGSS

Query:  QGSWSPGVGIGVCGKVVEDLGSRFKKAGNYQNQSYLEPRDQSRFDIEEYLKRPSSEDFLLERWKQQHRTNCSGLLLCNRVFL
        QGSWSPGV IG+  KVVED+GSRFKK GNYQNQS LE R +SRFDI+EYLKRPS+E+FLLERWKQQH+ NCSGLLLCNRVFL
Subjt:  QGSWSPGVGIGVCGKVVEDLGSRFKKAGNYQNQSYLEPRDQSRFDIEEYLKRPSSEDFLLERWKQQHRTNCSGLLLCNRVFL

TrEMBL top hits | e value | % identity | Alignment
A0A1S3AZE3 uncharacterized protein LOC103484238 isoform X1 | 1.8e-259 | 82.33
Query:  MASSAFKSTTKRTPIGAPAASGDDSTSTNRSSIHRRSRSLSRFSHPMPSSPVDKAFDEAPAR-RGRFVNTSRGSGFPEISLDDLAVEFFGSGDRGRSAAR
        MASSAFKSTTKRTPIGA   S DDSTSTNR S HRRSRSLSRFSHP+PSSP+DK F EA A  RGRFVNTSRGSGFPEISLDDLAVEFFGS DRGRS  R
Subjt:  MASSAFKSTTKRTPIGAPAASGDDSTSTNRSSIHRRSRSLSRFSHPMPSSPVDKAFDEAPAR-RGRFVNTSRGSGFPEISLDDLAVEFFGSGDRGRSAAR

Query:  NSESSGATNVAAASHRRGRSVSRHGCSKTSVSGSDGKGRPNYSVGGEKVVPETNSRRRRSVSVVRYQISDSESDLDRSQNSGTRVKEKSFGIGNKQKPIP
        +SE SGA N + AS+RRGRSVSRHG  KTS  GS+ KGR   SV G KVVPE+NSRRRRS+SVVRYQISDSESD DRSQ+SGTRV+EKSFGIGNKQKPI 
Subjt:  NSESSGATNVAAASHRRGRSVSRHGCSKTSVSGSDGKGRPNYSVGGEKVVPETNSRRRRSVSVVRYQISDSESDLDRSQNSGTRVKEKSFGIGNKQKPIP

Query:  HKADDSNRRPTLRRSLSQNDFKCHDGYSSQSSVLTDDEGKDAYFGNNGVEKTIRTIYARKVKQANGDVVDNGLYEAMRKELRHAVEEIRVELEQEMVNRN
        HKADDSNRRPTLRRSLSQNDFKCHDGYSS SSVLTDDEGKDA+FGN+ +EKT+R+IYARK KQANG VVD+GLYEAMRKELRHAVEEIRVELEQEMVNRN
Subjt:  HKADDSNRRPTLRRSLSQNDFKCHDGYSSQSSVLTDDEGKDAYFGNNGVEKTIRTIYARKVKQANGDVVDNGLYEAMRKELRHAVEEIRVELEQEMVNRN

Query:  SSVGTFSDDLNSNDSGVLQQTSPFTRNYSTKQEQSEKRRDSLAKMVLEEQRGQQLPKMVKNLPSDLKNVVAENSPRIRKRSNDRNRMSKRLSEEAEKYIE
        SSV TFSDDL+S+DSGV   TSPFTRNYS KQEQSEKRRDSL KMV+E+QRGQ LPKMVKNLP DLKNVVA+NS R RKRS DR+RMSKRLSEEAEKYIE
Subjt:  SSVGTFSDDLNSNDSGVLQQTSPFTRNYSTKQEQSEKRRDSLAKMVLEEQRGQQLPKMVKNLPSDLKNVVAENSPRIRKRSNDRNRMSKRLSEEAEKYIE

Query:  DFISNVEDTDISSLDGDRSDTSSSLGGKTKPNFKVPAVSKYVPPGMDGVLLPWLQWETSNDASSYPRKNTSEPPMTPQTFPWDVNQESTNGQDQSNHSGS
        DFISNVEDTDISSLDGDRSDTSSSLGGK KPNFK+PA  +YVPPGMDGVLLPWLQWETSNDA+ YPRKN  EPP TPQT PWDVNQ+++N  D  NHSGS
Subjt:  DFISNVEDTDISSLDGDRSDTSSSLGGKTKPNFKVPAVSKYVPPGMDGVLLPWLQWETSNDASSYPRKNTSEPPMTPQTFPWDVNQESTNGQDQSNHSGS

Query:  SQGSWSPGVGIGVCGKVVEDLGSRFKKAGNYQNQSYLEPRDQSRFDIEEYLKRPSSEDFLLERWKQQHRTNCSGLLLCNRVFL
        SQGSWSPGV IG+ GKVVED+GSRFK+ GN Q QSY E R +SRFDI+EYLKRPS+EDFLLERWKQQH+  CSG+LLCNRVFL
Subjt:  SQGSWSPGVGIGVCGKVVEDLGSRFKKAGNYQNQSYLEPRDQSRFDIEEYLKRPSSEDFLLERWKQQHRTNCSGLLLCNRVFL

A0A5D3BJY3 Uncharacterized protein | 1.8e-259 | 82.33
Query:  MASSAFKSTTKRTPIGAPAASGDDSTSTNRSSIHRRSRSLSRFSHPMPSSPVDKAFDEAPAR-RGRFVNTSRGSGFPEISLDDLAVEFFGSGDRGRSAAR
        MASSAFKSTTKRTPIGA   S DDSTSTNR S HRRSRSLSRFSHP+PSSP+DK F EA A  RGRFVNTSRGSGFPEISLDDLAVEFFGS DRGRS  R
Subjt:  MASSAFKSTTKRTPIGAPAASGDDSTSTNRSSIHRRSRSLSRFSHPMPSSPVDKAFDEAPAR-RGRFVNTSRGSGFPEISLDDLAVEFFGSGDRGRSAAR

Query:  NSESSGATNVAAASHRRGRSVSRHGCSKTSVSGSDGKGRPNYSVGGEKVVPETNSRRRRSVSVVRYQISDSESDLDRSQNSGTRVKEKSFGIGNKQKPIP
        +SE SGA N + AS+RRGRSVSRHG  KTS  GS+ KGR   SV G KVVPE+NSRRRRS+SVVRYQISDSESD DRSQ+SGTRV+EKSFGIGNKQKPI 
Subjt:  NSESSGATNVAAASHRRGRSVSRHGCSKTSVSGSDGKGRPNYSVGGEKVVPETNSRRRRSVSVVRYQISDSESDLDRSQNSGTRVKEKSFGIGNKQKPIP

Query:  HKADDSNRRPTLRRSLSQNDFKCHDGYSSQSSVLTDDEGKDAYFGNNGVEKTIRTIYARKVKQANGDVVDNGLYEAMRKELRHAVEEIRVELEQEMVNRN
        HKADDSNRRPTLRRSLSQNDFKCHDGYSS SSVLTDDEGKDA+FGN+ +EKT+R+IYARK KQANG VVD+GLYEAMRKELRHAVEEIRVELEQEMVNRN
Subjt:  HKADDSNRRPTLRRSLSQNDFKCHDGYSSQSSVLTDDEGKDAYFGNNGVEKTIRTIYARKVKQANGDVVDNGLYEAMRKELRHAVEEIRVELEQEMVNRN

Query:  SSVGTFSDDLNSNDSGVLQQTSPFTRNYSTKQEQSEKRRDSLAKMVLEEQRGQQLPKMVKNLPSDLKNVVAENSPRIRKRSNDRNRMSKRLSEEAEKYIE
        SSV TFSDDL+S+DSGV   TSPFTRNYS KQEQSEKRRDSL KMV+E+QRGQ LPKMVKNLP DLKNVVA+NS R RKRS DR+RMSKRLSEEAEKYIE
Subjt:  SSVGTFSDDLNSNDSGVLQQTSPFTRNYSTKQEQSEKRRDSLAKMVLEEQRGQQLPKMVKNLPSDLKNVVAENSPRIRKRSNDRNRMSKRLSEEAEKYIE

Query:  DFISNVEDTDISSLDGDRSDTSSSLGGKTKPNFKVPAVSKYVPPGMDGVLLPWLQWETSNDASSYPRKNTSEPPMTPQTFPWDVNQESTNGQDQSNHSGS
        DFISNVEDTDISSLDGDRSDTSSSLGGK KPNFK+PA  +YVPPGMDGVLLPWLQWETSNDA+ YPRKN  EPP TPQT PWDVNQ+++N  D  NHSGS
Subjt:  DFISNVEDTDISSLDGDRSDTSSSLGGKTKPNFKVPAVSKYVPPGMDGVLLPWLQWETSNDASSYPRKNTSEPPMTPQTFPWDVNQESTNGQDQSNHSGS

Query:  SQGSWSPGVGIGVCGKVVEDLGSRFKKAGNYQNQSYLEPRDQSRFDIEEYLKRPSSEDFLLERWKQQHRTNCSGLLLCNRVFL
        SQGSWSPGV IG+ GKVVED+GSRFK+ GN Q QSY E R +SRFDI+EYLKRPS+EDFLLERWKQQH+  CSG+LLCNRVFL
Subjt:  SQGSWSPGVGIGVCGKVVEDLGSRFKKAGNYQNQSYLEPRDQSRFDIEEYLKRPSSEDFLLERWKQQHRTNCSGLLLCNRVFL

A0A6J1C6X2 uncharacterized protein LOC111008961 | 0.0e+00 | 100
Query:  MASSAFKSTTKRTPIGAPAASGDDSTSTNRSSIHRRSRSLSRFSHPMPSSPVDKAFDEAPARRGRFVNTSRGSGFPEISLDDLAVEFFGSGDRGRSAARN
        MASSAFKSTTKRTPIGAPAASGDDSTSTNRSSIHRRSRSLSRFSHPMPSSPVDKAFDEAPARRGRFVNTSRGSGFPEISLDDLAVEFFGSGDRGRSAARN
Subjt:  MASSAFKSTTKRTPIGAPAASGDDSTSTNRSSIHRRSRSLSRFSHPMPSSPVDKAFDEAPARRGRFVNTSRGSGFPEISLDDLAVEFFGSGDRGRSAARN

Query:  SESSGATNVAAASHRRGRSVSRHGCSKTSVSGSDGKGRPNYSVGGEKVVPETNSRRRRSVSVVRYQISDSESDLDRSQNSGTRVKEKSFGIGNKQKPIPH
        SESSGATNVAAASHRRGRSVSRHGCSKTSVSGSDGKGRPNYSVGGEKVVPETNSRRRRSVSVVRYQISDSESDLDRSQNSGTRVKEKSFGIGNKQKPIPH
Subjt:  SESSGATNVAAASHRRGRSVSRHGCSKTSVSGSDGKGRPNYSVGGEKVVPETNSRRRRSVSVVRYQISDSESDLDRSQNSGTRVKEKSFGIGNKQKPIPH

Query:  KADDSNRRPTLRRSLSQNDFKCHDGYSSQSSVLTDDEGKDAYFGNNGVEKTIRTIYARKVKQANGDVVDNGLYEAMRKELRHAVEEIRVELEQEMVNRNS
        KADDSNRRPTLRRSLSQNDFKCHDGYSSQSSVLTDDEGKDAYFGNNGVEKTIRTIYARKVKQANGDVVDNGLYEAMRKELRHAVEEIRVELEQEMVNRNS
Subjt:  KADDSNRRPTLRRSLSQNDFKCHDGYSSQSSVLTDDEGKDAYFGNNGVEKTIRTIYARKVKQANGDVVDNGLYEAMRKELRHAVEEIRVELEQEMVNRNS

Query:  SVGTFSDDLNSNDSGVLQQTSPFTRNYSTKQEQSEKRRDSLAKMVLEEQRGQQLPKMVKNLPSDLKNVVAENSPRIRKRSNDRNRMSKRLSEEAEKYIED
        SVGTFSDDLNSNDSGVLQQTSPFTRNYSTKQEQSEKRRDSLAKMVLEEQRGQQLPKMVKNLPSDLKNVVAENSPRIRKRSNDRNRMSKRLSEEAEKYIED
Subjt:  SVGTFSDDLNSNDSGVLQQTSPFTRNYSTKQEQSEKRRDSLAKMVLEEQRGQQLPKMVKNLPSDLKNVVAENSPRIRKRSNDRNRMSKRLSEEAEKYIED

Query:  FISNVEDTDISSLDGDRSDTSSSLGGKTKPNFKVPAVSKYVPPGMDGVLLPWLQWETSNDASSYPRKNTSEPPMTPQTFPWDVNQESTNGQDQSNHSGSS
        FISNVEDTDISSLDGDRSDTSSSLGGKTKPNFKVPAVSKYVPPGMDGVLLPWLQWETSNDASSYPRKNTSEPPMTPQTFPWDVNQESTNGQDQSNHSGSS
Subjt:  FISNVEDTDISSLDGDRSDTSSSLGGKTKPNFKVPAVSKYVPPGMDGVLLPWLQWETSNDASSYPRKNTSEPPMTPQTFPWDVNQESTNGQDQSNHSGSS

Query:  QGSWSPGVGIGVCGKVVEDLGSRFKKAGNYQNQSYLEPRDQSRFDIEEYLKRPSSEDFLLERWKQQHRTNCSGLLLCNRVFL
        QGSWSPGVGIGVCGKVVEDLGSRFKKAGNYQNQSYLEPRDQSRFDIEEYLKRPSSEDFLLERWKQQHRTNCSGLLLCNRVFL
Subjt:  QGSWSPGVGIGVCGKVVEDLGSRFKKAGNYQNQSYLEPRDQSRFDIEEYLKRPSSEDFLLERWKQQHRTNCSGLLLCNRVFL

A0A6J1K4C9 uncharacterized protein LOC111489977 | 8.8e-262 | 82.85
Query:  MASSAFKSTTKRTPIGAPAASGDDSTSTNRSSIHRRSRSLSRFSHPMPSSPVDKAFDEAPARRGRFVNTSRGSGFPEISLDDLAVEFFGSGDRGRSAARN
        MASSAFKSTTKRTPIGA  +S DDS STNRSSIHRRSRSLSRFSHP+PSSPVDK F E  ARRGRFVNTSRGSGFPEISLDDLAVEFFGSGDRGRSAAR+
Subjt:  MASSAFKSTTKRTPIGAPAASGDDSTSTNRSSIHRRSRSLSRFSHPMPSSPVDKAFDEAPARRGRFVNTSRGSGFPEISLDDLAVEFFGSGDRGRSAARN

Query:  SESSGATNVAAASHRRGRSVSRHGCSKTSVSGSDGKGRPNYSVGGEKVVPETNSRRRRSVSVVRYQISDSESDLDRSQNSGTRVKEKSFGIGNKQKPIPH
        SESS A +  A+S RRGRS+SRHG +KTS  GS+G+G+ +YSVGG K+VPE+NSRRRRSVSVVRYQISDSESDLD+SQ+SGT ++EKS+G GNKQKP+ H
Subjt:  SESSGATNVAAASHRRGRSVSRHGCSKTSVSGSDGKGRPNYSVGGEKVVPETNSRRRRSVSVVRYQISDSESDLDRSQNSGTRVKEKSFGIGNKQKPIPH

Query:  KADDSNRRPTLRRSLSQNDFKCHDGYSSQSSVLTDDEGKDAYFGNNGVEKTIRTIYARKVKQANGDVVDNGLYEAMRKELRHAVEEIRVELEQEMVNRNS
        KAD+SNRRP LRRSLSQNDFKCHDGYSSQSSVLTDDEGKDAYFGNN  EKTIRTI ARK KQ NGDVVDNGLYEAMRKELRHAVEEIRVELEQEMVNRNS
Subjt:  KADDSNRRPTLRRSLSQNDFKCHDGYSSQSSVLTDDEGKDAYFGNNGVEKTIRTIYARKVKQANGDVVDNGLYEAMRKELRHAVEEIRVELEQEMVNRNS

Query:  SVGTFSDDLNSNDSGVLQQTSPFTRNYSTKQEQSEKRRDSLAKMVLEEQRGQQLPKMVKNLPSDLKNVVAENSPRIRKRSNDRNRMSKRLSEEAEKYIED
        SVGT SDDL+S+DSGV Q TSPF RNYSTKQEQS+KRRDSLAKMVL+EQR Q+LPK VKN   DL NVVAENS RIRKRSNDR+RMSKRL+EEAEKYIED
Subjt:  SVGTFSDDLNSNDSGVLQQTSPFTRNYSTKQEQSEKRRDSLAKMVLEEQRGQQLPKMVKNLPSDLKNVVAENSPRIRKRSNDRNRMSKRLSEEAEKYIED

Query:  FISNVEDTDISSLDGDRSDTSSSLGGKTKPNFKVPAVSKYVPPGMDGVLLPWLQWETSNDASSYPRKNTSEPPMTPQTFPWDVNQESTNGQDQSNHSGSS
        FISNVEDTDISSLDGDRSDTSSSLGGKTKPNF++PA SK VPPGMDGVLLPWLQWETSNDA+ YPRKNT  P MTPQ FPWDVNQE++N QD  NHSGSS
Subjt:  FISNVEDTDISSLDGDRSDTSSSLGGKTKPNFKVPAVSKYVPPGMDGVLLPWLQWETSNDASSYPRKNTSEPPMTPQTFPWDVNQESTNGQDQSNHSGSS

Query:  QGSWSPGVGIGVCGKVVEDLGSRFKKAGNYQNQSYLEPRD-QSRFDIEEYLKRPSSEDFLLERWKQQHRTNCSGLLLCNRVFL
        QGSWSPGV   + GKVVEDLGSRFKK G YQNQSYLE R+ +  FDI+EYLKRPS+EDFLLERWKQQH+ N  GLLLCN VFL
Subjt:  QGSWSPGVGIGVCGKVVEDLGSRFKKAGNYQNQSYLEPRD-QSRFDIEEYLKRPSSEDFLLERWKQQHRTNCSGLLLCNRVFL

E5GBV7 Uncharacterized protein | 1.8e-259 | 82.33
Query:  MASSAFKSTTKRTPIGAPAASGDDSTSTNRSSIHRRSRSLSRFSHPMPSSPVDKAFDEAPAR-RGRFVNTSRGSGFPEISLDDLAVEFFGSGDRGRSAAR
        MASSAFKSTTKRTPIGA   S DDSTSTNR S HRRSRSLSRFSHP+PSSP+DK F EA A  RGRFVNTSRGSGFPEISLDDLAVEFFGS DRGRS  R
Subjt:  MASSAFKSTTKRTPIGAPAASGDDSTSTNRSSIHRRSRSLSRFSHPMPSSPVDKAFDEAPAR-RGRFVNTSRGSGFPEISLDDLAVEFFGSGDRGRSAAR

Query:  NSESSGATNVAAASHRRGRSVSRHGCSKTSVSGSDGKGRPNYSVGGEKVVPETNSRRRRSVSVVRYQISDSESDLDRSQNSGTRVKEKSFGIGNKQKPIP
        +SE SGA N + AS+RRGRSVSRHG  KTS  GS+ KGR   SV G KVVPE+NSRRRRS+SVVRYQISDSESD DRSQ+SGTRV+EKSFGIGNKQKPI 
Subjt:  NSESSGATNVAAASHRRGRSVSRHGCSKTSVSGSDGKGRPNYSVGGEKVVPETNSRRRRSVSVVRYQISDSESDLDRSQNSGTRVKEKSFGIGNKQKPIP

Query:  HKADDSNRRPTLRRSLSQNDFKCHDGYSSQSSVLTDDEGKDAYFGNNGVEKTIRTIYARKVKQANGDVVDNGLYEAMRKELRHAVEEIRVELEQEMVNRN
        HKADDSNRRPTLRRSLSQNDFKCHDGYSS SSVLTDDEGKDA+FGN+ +EKT+R+IYARK KQANG VVD+GLYEAMRKELRHAVEEIRVELEQEMVNRN
Subjt:  HKADDSNRRPTLRRSLSQNDFKCHDGYSSQSSVLTDDEGKDAYFGNNGVEKTIRTIYARKVKQANGDVVDNGLYEAMRKELRHAVEEIRVELEQEMVNRN

Query:  SSVGTFSDDLNSNDSGVLQQTSPFTRNYSTKQEQSEKRRDSLAKMVLEEQRGQQLPKMVKNLPSDLKNVVAENSPRIRKRSNDRNRMSKRLSEEAEKYIE
        SSV TFSDDL+S+DSGV   TSPFTRNYS KQEQSEKRRDSL KMV+E+QRGQ LPKMVKNLP DLKNVVA+NS R RKRS DR+RMSKRLSEEAEKYIE
Subjt:  SSVGTFSDDLNSNDSGVLQQTSPFTRNYSTKQEQSEKRRDSLAKMVLEEQRGQQLPKMVKNLPSDLKNVVAENSPRIRKRSNDRNRMSKRLSEEAEKYIE

Query:  DFISNVEDTDISSLDGDRSDTSSSLGGKTKPNFKVPAVSKYVPPGMDGVLLPWLQWETSNDASSYPRKNTSEPPMTPQTFPWDVNQESTNGQDQSNHSGS
        DFISNVEDTDISSLDGDRSDTSSSLGGK KPNFK+PA  +YVPPGMDGVLLPWLQWETSNDA+ YPRKN  EPP TPQT PWDVNQ+++N  D  NHSGS
Subjt:  DFISNVEDTDISSLDGDRSDTSSSLGGKTKPNFKVPAVSKYVPPGMDGVLLPWLQWETSNDASSYPRKNTSEPPMTPQTFPWDVNQESTNGQDQSNHSGS

Query:  SQGSWSPGVGIGVCGKVVEDLGSRFKKAGNYQNQSYLEPRDQSRFDIEEYLKRPSSEDFLLERWKQQHRTNCSGLLLCNRVFL
        SQGSWSPGV IG+ GKVVED+GSRFK+ GN Q QSY E R +SRFDI+EYLKRPS+EDFLLERWKQQH+  CSG+LLCNRVFL
Subjt:  SQGSWSPGVGIGVCGKVVEDLGSRFKKAGNYQNQSYLEPRDQSRFDIEEYLKRPSSEDFLLERWKQQHRTNCSGLLLCNRVFL

SwissProt top hits | e value | % identity | Alignment
No hits found

Arabidopsis top hits | e value | % identity | Alignment
AT5G50350.1 unknown protein | 9.5e-83 | 38.47
Query:  MASSAFKSTTKR-TPIGAPAASGDDSTSTNRSSIHRRSRSLSRFSHPMPSSPVDKAFDEAPARRGRFVNTSRGSGFPEISLDDLAVEFF---------GS
        MA+SAF ST KR T +   + SGDDS+ + R S  RR RSLSRFSH MP    D   +  P R+G+FVNT RGSGF EISLDDLAVEFF          S
Subjt:  MASSAFKSTTKR-TPIGAPAASGDDSTSTNRSSIHRRSRSLSRFSHPMPSSPVDKAFDEAPARRGRFVNTSRGSGFPEISLDDLAVEFF---------GS

Query:  GDRGRSAARNSESSGATNVAAASHRRGRSVS-----------------------------------RHGCSKT-SVSGSDGKGRPNYSVGGEKVVPE---
        G+RGRS  RNS   G       S RRGRSVS                                   R G SK  SVS +  + R       E+V  E   
Subjt:  GDRGRSAARNSESSGATNVAAASHRRGRSVS-----------------------------------RHGCSKT-SVSGSDGKGRPNYSVGGEKVVPE---

Query:  ---------TNSRRRRSVSVVRYQISDSESDLDRSQNSGTRVKEKSFGIGNKQKPIPHKADDSNRRPTLRRSLSQNDFKCHDGYSSQSSVLTDDEGKDAY
                  NSRRRRS+SVVR +I +SESD+D+ Q S +    KSF  G  Q     K+  S+ R  LRRS SQN  K HDGYSSQSS +TDDEGKD+ 
Subjt:  ---------TNSRRRRSVSVVRYQISDSESDLDRSQNSGTRVKEKSFGIGNKQKPIPHKADDSNRRPTLRRSLSQNDFKCHDGYSSQSSVLTDDEGKDAY

Query:  FGNNGVEKTIRTIYAR-KVKQANGDVVDNGLYEAMRKELRHAVEEIRVELEQEMVNRNSSVGTFSDDLNSNDSGVLQQTSPFTRNYSTK-QEQSEKRRDS
           +G E+ IRT+YA+ K      + + N  Y + RK L                               ++ GV    S  T+ Y+TK QE  E++R+ 
Subjt:  FGNNGVEKTIRTIYAR-KVKQANGDVVDNGLYEAMRKELRHAVEEIRVELEQEMVNRNSSVGTFSDDLNSNDSGVLQQTSPFTRNYSTK-QEQSEKRRDS

Query:  LAKMVLEEQRGQQLPKMVKNLPSDLKNVVAENSPRIRKRSNDRN-RMSKRLSEEAEKYIEDFISNVEDTDISSLDGDRSDTSSSLGGKTKPNFKVPAVSK
        LA+++LEEQRG++L   +K + ++  +   E   R RKRS DR+ RMS  L++EAE++I++FISN+EDTD SSL+ +RS++SSS G     + +  +  K
Subjt:  LAKMVLEEQRGQQLPKMVKNLPSDLKNVVAENSPRIRKRSNDRN-RMSKRLSEEAEKYIEDFISNVEDTDISSLDGDRSDTSSSLGGKTKPNFKVPAVSK

Query:  YVPPGMDGVLLPWLQWETSNDASSYPRKNTSEPPMTPQTFPW--DVNQESTNGQDQSNHSGSSQGSWSPGVGIGVCGKVVEDLGSRFKKAGNYQNQSYLE
             MDGV+LPWLQWET + +++      S      ++  W  D  Q++++G+  S  + SS+GSWSP   +                         + 
Subjt:  YVPPGMDGVLLPWLQWETSNDASSYPRKNTSEPPMTPQTFPW--DVNQESTNGQDQSNHSGSSQGSWSPGVGIGVCGKVVEDLGSRFKKAGNYQNQSYLE

Query:  PRDQSRFDIEEYLKRPSSEDFLLERWKQQHRTNCSGLLLCNR
        P    + D+ EYLKRP+S D L E WK +HR +   L+LC+R
Subjt:  PRDQSRFDIEEYLKRPSSEDFLLERWKQQHRTNCSGLLLCNR


Sequences
CDS sequence
ATGGCGTCCTCTGCTTTCAAATCTACGACCAAACGGACGCCGATCGGAGCACCGGCGGCTTCTGGTGATGACTCCACTTCCACCAATCGGAGCTCGATTCACCGCCGCTC
TCGAAGTCTCAGCCGATTTTCCCACCCTATGCCGTCGTCTCCCGTGGACAAGGCCTTCGATGAGGCCCCGGCGCGGCGAGGTAGGTTTGTCAACACGTCGAGAGGCTCGG
GATTCCCTGAGATCAGTCTCGATGACTTAGCCGTCGAATTCTTCGGTTCTGGTGATCGAGGGCGCTCTGCTGCGCGGAACTCCGAGTCAAGTGGTGCTACGAATGTCGCC
GCAGCTTCGCATAGACGAGGGAGGTCGGTATCGAGACACGGTTGCTCTAAGACTAGCGTCAGTGGTAGCGATGGCAAAGGAAGACCCAATTATAGCGTTGGTGGGGAAAA
AGTGGTTCCCGAAACTAATTCGAGAAGAAGGCGCTCTGTCTCAGTGGTTCGCTACCAGATTAGCGACTCGGAGAGTGATCTTGATCGATCTCAGAATTCTGGAACTCGTG
TCAAAGAAAAGAGCTTTGGCATCGGAAATAAGCAGAAGCCGATACCCCATAAGGCTGATGATTCAAACCGTAGACCAACATTGAGAAGGTCTCTTAGCCAGAATGATTTT
AAGTGCCATGATGGTTATTCAAGCCAGTCCTCAGTTCTAACTGATGATGAGGGGAAGGATGCTTACTTCGGTAATAACGGAGTAGAGAAGACTATTCGAACGATTTATGC
AAGAAAGGTAAAGCAGGCAAATGGGGATGTTGTTGACAATGGCTTGTATGAAGCAATGCGGAAAGAACTTAGACATGCAGTGGAAGAGATAAGGGTGGAACTCGAGCAGG
AAATGGTGAACAGAAATTCGTCTGTTGGAACTTTCAGTGATGACTTGAATTCAAATGATTCTGGTGTTCTTCAGCAGACATCTCCGTTTACAAGAAATTATTCGACAAAA
CAAGAACAGTCAGAGAAGCGCAGAGATTCATTGGCTAAGATGGTGCTGGAGGAGCAACGTGGTCAACAACTTCCTAAGATGGTTAAAAATTTGCCTTCTGACCTGAAGAA
TGTTGTTGCAGAGAACTCCCCACGAATTAGAAAGAGGAGCAATGACCGGAACAGGATGTCTAAACGATTGAGTGAGGAGGCGGAAAAGTACATCGAGGACTTCATTTCCA
ATGTCGAAGATACAGATATTTCATCTCTTGATGGTGACAGAAGCGACACCAGTTCATCTTTAGGGGGAAAAACGAAACCAAATTTCAAAGTTCCAGCAGTCAGCAAATAC
GTTCCTCCTGGAATGGATGGTGTCCTACTTCCATGGTTGCAATGGGAAACCAGTAATGATGCTTCTTCTTACCCTCGAAAGAATACGAGCGAACCACCTATGACTCCACA
AACTTTTCCATGGGATGTTAATCAGGAATCGACGAATGGACAGGATCAAAGTAACCATTCTGGTAGCAGCCAAGGGAGTTGGAGCCCTGGAGTTGGCATTGGAGTTTGTG
GGAAAGTTGTTGAAGATTTAGGAAGCAGATTTAAGAAAGCTGGTAATTATCAAAATCAATCCTATTTGGAACCAAGAGATCAATCTCGGTTTGATATAGAGGAATATCTG
AAGCGTCCAAGCAGTGAAGATTTTCTTTTAGAAAGATGGAAGCAGCAACACAGAACCAACTGCAGTGGTCTTTTGCTCTGTAATCGTGTATTTTTATAG
mRNA sequence
ATGGCGTCCTCTGCTTTCAAATCTACGACCAAACGGACGCCGATCGGAGCACCGGCGGCTTCTGGTGATGACTCCACTTCCACCAATCGGAGCTCGATTCACCGCCGCTC
TCGAAGTCTCAGCCGATTTTCCCACCCTATGCCGTCGTCTCCCGTGGACAAGGCCTTCGATGAGGCCCCGGCGCGGCGAGGTAGGTTTGTCAACACGTCGAGAGGCTCGG
GATTCCCTGAGATCAGTCTCGATGACTTAGCCGTCGAATTCTTCGGTTCTGGTGATCGAGGGCGCTCTGCTGCGCGGAACTCCGAGTCAAGTGGTGCTACGAATGTCGCC
GCAGCTTCGCATAGACGAGGGAGGTCGGTATCGAGACACGGTTGCTCTAAGACTAGCGTCAGTGGTAGCGATGGCAAAGGAAGACCCAATTATAGCGTTGGTGGGGAAAA
AGTGGTTCCCGAAACTAATTCGAGAAGAAGGCGCTCTGTCTCAGTGGTTCGCTACCAGATTAGCGACTCGGAGAGTGATCTTGATCGATCTCAGAATTCTGGAACTCGTG
TCAAAGAAAAGAGCTTTGGCATCGGAAATAAGCAGAAGCCGATACCCCATAAGGCTGATGATTCAAACCGTAGACCAACATTGAGAAGGTCTCTTAGCCAGAATGATTTT
AAGTGCCATGATGGTTATTCAAGCCAGTCCTCAGTTCTAACTGATGATGAGGGGAAGGATGCTTACTTCGGTAATAACGGAGTAGAGAAGACTATTCGAACGATTTATGC
AAGAAAGGTAAAGCAGGCAAATGGGGATGTTGTTGACAATGGCTTGTATGAAGCAATGCGGAAAGAACTTAGACATGCAGTGGAAGAGATAAGGGTGGAACTCGAGCAGG
AAATGGTGAACAGAAATTCGTCTGTTGGAACTTTCAGTGATGACTTGAATTCAAATGATTCTGGTGTTCTTCAGCAGACATCTCCGTTTACAAGAAATTATTCGACAAAA
CAAGAACAGTCAGAGAAGCGCAGAGATTCATTGGCTAAGATGGTGCTGGAGGAGCAACGTGGTCAACAACTTCCTAAGATGGTTAAAAATTTGCCTTCTGACCTGAAGAA
TGTTGTTGCAGAGAACTCCCCACGAATTAGAAAGAGGAGCAATGACCGGAACAGGATGTCTAAACGATTGAGTGAGGAGGCGGAAAAGTACATCGAGGACTTCATTTCCA
ATGTCGAAGATACAGATATTTCATCTCTTGATGGTGACAGAAGCGACACCAGTTCATCTTTAGGGGGAAAAACGAAACCAAATTTCAAAGTTCCAGCAGTCAGCAAATAC
GTTCCTCCTGGAATGGATGGTGTCCTACTTCCATGGTTGCAATGGGAAACCAGTAATGATGCTTCTTCTTACCCTCGAAAGAATACGAGCGAACCACCTATGACTCCACA
AACTTTTCCATGGGATGTTAATCAGGAATCGACGAATGGACAGGATCAAAGTAACCATTCTGGTAGCAGCCAAGGGAGTTGGAGCCCTGGAGTTGGCATTGGAGTTTGTG
GGAAAGTTGTTGAAGATTTAGGAAGCAGATTTAAGAAAGCTGGTAATTATCAAAATCAATCCTATTTGGAACCAAGAGATCAATCTCGGTTTGATATAGAGGAATATCTG
AAGCGTCCAAGCAGTGAAGATTTTCTTTTAGAAAGATGGAAGCAGCAACACAGAACCAACTGCAGTGGTCTTTTGCTCTGTAATCGTGTATTTTTATAG
Protein sequence
MASSAFKSTTKRTPIGAPAASGDDSTSTNRSSIHRRSRSLSRFSHPMPSSPVDKAFDEAPARRGRFVNTSRGSGFPEISLDDLAVEFFGSGDRGRSAARNSESSGATNVA
AASHRRGRSVSRHGCSKTSVSGSDGKGRPNYSVGGEKVVPETNSRRRRSVSVVRYQISDSESDLDRSQNSGTRVKEKSFGIGNKQKPIPHKADDSNRRPTLRRSLSQNDF
KCHDGYSSQSSVLTDDEGKDAYFGNNGVEKTIRTIYARKVKQANGDVVDNGLYEAMRKELRHAVEEIRVELEQEMVNRNSSVGTFSDDLNSNDSGVLQQTSPFTRNYSTK
QEQSEKRRDSLAKMVLEEQRGQQLPKMVKNLPSDLKNVVAENSPRIRKRSNDRNRMSKRLSEEAEKYIEDFISNVEDTDISSLDGDRSDTSSSLGGKTKPNFKVPAVSKY
VPPGMDGVLLPWLQWETSNDASSYPRKNTSEPPMTPQTFPWDVNQESTNGQDQSNHSGSSQGSWSPGVGIGVCGKVVEDLGSRFKKAGNYQNQSYLEPRDQSRFDIEEYL
KRPSSEDFLLERWKQQHRTNCSGLLLCNRVFL
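
The protein sequence can be cross-checked against the CDS by translating it. The following is a minimal sketch, assuming the standard genetic code; `translate` and `CODON_TABLE` are illustrative names rather than anything provided by the database.

```python
from itertools import product

# Standard genetic code, codons enumerated with bases in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS (line breaks allowed) up to the first stop codon."""
    cds = "".join(cds.split()).upper()
    protein = []
    # Ignore any trailing bases that do not form a full codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First seven codons of the CDS listed above.
print(translate("ATGGCGTCCTCTGCTTTCAAA"))  # MASSAFK
```

Passing the full CDS text (with its line breaks) to `translate` should yield a string that can be compared with the concatenated protein sequence lines.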