CuGenDBv2

Sgr007109 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr007109
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: Unknown protein
Genome location: tig00005228:26142..33781
RNA-Seq Expression: Sgr007109
Synteny: Sgr007109
Gene Ontology terms: NA
InterPro domains: NA


Homology
GenBank top hits (e-value, % identity, alignment)
KAG6578775.1 Ras-related protein RABF1, partial [Cucurbita argyrosperma subsp. sororia] | e-value: 1.4e-247 | identity: 75.31%
Query:  NKIMKNPDQDQQDPRNVPGGEDTTAMTIEFLRARLLSERSVSRSARQRADELAKRVAELEEQLKIVSLQRKMAEKATADVLAILEDNGASDISETLDSNS
        ++IM+NPDQDQQDPR VPG EDTTAMTIEFLRARLLSERSVS+ ARQRADELAKRVAELEEQL++VS QR+MAEKATADVLAILEDNGA+DISETLDSNS
Subjt:  NKIMKNPDQDQQDPRNVPGGEDTTAMTIEFLRARLLSERSVSRSARQRADELAKRVAELEEQLKIVSLQRKMAEKATADVLAILEDNGASDISETLDSNS

Query:  DHETPCESKVGNDPAREDVNSSKSIHMRNEHEEFSGSDIDTSPVLGGSLSWKGRNDSPHNREKYKKFSTRSRSSFTSIGSSSPKHRLGRSCRQIKRRDTR
        DHET   +KV + P R D NSS SI  RNEHEE+SGS  DTSP+LG SLSWKGRND PH REKYKKFS RS+S+FTSIGSSSPKH+LGRSCRQIKRRDTR
Subjt:  DHETPCESKVGNDPAREDVNSSKSIHMRNEHEEFSGSDIDTSPVLGGSLSWKGRNDSPHNREKYKKFSTRSRSSFTSIGSSSPKHRLGRSCRQIKRRDTR

Query:  QLDGEQELKSEALVGSFQEIAPSACSEDSRNCCVKGPKILRDGYEPHEKTRSGPSQDHNSVENKDQDHDLDECEKGNDMEKALECQAQLIDQYEAMEKAQ
         LDGEQELKS+  V S QEI PS CSEDSRN  V G KI RDGYE HEKTRSG S+ HNSV NKDQDHDLD  EK +DMEK+L+CQAQLIDQYEAMEKAQ
Subjt:  QLDGEQELKSEALVGSFQEIAPSACSEDSRNCCVKGPKILRDGYEPHEKTRSGPSQDHNSVENKDQDHDLDECEKGNDMEKALECQAQLIDQYEAMEKAQ

Query:  REWEEKFRENNNSTPDSCDPGNHSDITEERDEIRAQAPNLSGNAFLANEAKSQVAVNCVTRDSSQAQTN-GLDPSSCADVEDLQDQNTNSISTSRSLEEF
        REWEEKFRENNNSTPDSCDPGNHSDITEERDEIRAQAPNL  ++ LANE KSQV+ +CVTRD SQAQT+ GL PS C+DVEDLQDQN NS+STSRSLEEF
Subjt:  REWEEKFRENNNSTPDSCDPGNHSDITEERDEIRAQAPNLSGNAFLANEAKSQVAVNCVTRDSSQAQTN-GLDPSSCADVEDLQDQNTNSISTSRSLEEF

Query:  TFPMANVKQCQESQESSEQEPSCTSQLNHGLPHRPLSSHGGIDFYDQETPCSKNDLYALVPHEPPALNGVLEALKQAKLSLAKKIKKLPSVEG----ESI
        TFPMANVKQCQES E+ EQEPSCTS LNHGLP R LSSH GI+ YDQETPCS+ DLYALVPHEPPAL+GVLEALKQAKLSL KKI KLP VEG    +SI
Subjt:  TFPMANVKQCQESQESSEQEPSCTSQLNHGLPHRPLSSHGGIDFYDQETPCSKNDLYALVPHEPPALNGVLEALKQAKLSLAKKIKKLPSVEG----ESI

Query:  GTLSVPKVGGRLDIPIGCAGLFRLPTDFAAEA-STQANFLVSSSQLRSSTHY-----------------------PVF---------------TRDGFLT
        G LSVPKV   L+IPIGCAGLFRLPTDFAAEA ST  NFL SSS+LRS+  Y                       P F               TRDG+LT
Subjt:  GTLSVPKVGGRLDIPIGCAGLFRLPTDFAAEA-STQANFLVSSSQLRSSTHY-----------------------PVF---------------TRDGFLT

Query:  DHFPENGWKNPGQKHHFDRYFDAIQPSPHVHNYPSP
        DHFPE+ WKNPGQ HHFD+YFD+IQPSP+VH+YPSP
Subjt:  DHFPENGWKNPGQKHHFDRYFDAIQPSPHVHNYPSP

TYK11749.1 uncharacterized protein E5676_scaffold304G00680 [Cucumis melo var. makuwa] | e-value: 9.8e-254 | identity: 73.22%
Query:  QDQQDPRNVPGGEDTTAMTIEFLRARLLSERSVSRSARQRADELAKRVAELEEQLKIVSLQRKMAEKATADVLAILEDNGASDISETLDSNSDHETPCES
        Q++ + R+VPG EDTTAMTIEFLRARLLSERSVS+SARQRADELAKRVAELEEQLKIVSLQRKMAEKATADVLAILEDNGASDISETLDSNSDHET  E 
Subjt:  QDQQDPRNVPGGEDTTAMTIEFLRARLLSERSVSRSARQRADELAKRVAELEEQLKIVSLQRKMAEKATADVLAILEDNGASDISETLDSNSDHETPCES

Query:  KVGNDPAREDVNSSKSIHMRNEHEEFSGSDIDTSPVLGGSLSWKGRNDSPHNREKYKKFSTRSRSSFTSIGSSSPKHRLGRSCRQIKRRDTRQLDGEQEL
        KV + P REDVNS  ++  RNEHEE+SGS+I+TSPVLGGSLSWKGRNDSPH REKYKK S RSRSSFTSIGSSSPKH+LGRSCRQIKRRDTR LDGEQEL
Subjt:  KVGNDPAREDVNSSKSIHMRNEHEEFSGSDIDTSPVLGGSLSWKGRNDSPHNREKYKKFSTRSRSSFTSIGSSSPKHRLGRSCRQIKRRDTRQLDGEQEL

Query:  KSEALVGSFQEIAPSACSEDSRNCCVKGPKILRDGYEPHEKTRSGPSQDHNSVENKDQDHDLDECEKGNDMEKALECQAQLIDQYEAMEKAQREWEEKFR
        KSEA + S +EI  S   EDSRN  V G  ILRD YE  EKT S  S  HNS+ N DQD+D+D  EK +DMEKAL+CQAQLIDQYEAMEKAQREWEEKFR
Subjt:  KSEALVGSFQEIAPSACSEDSRNCCVKGPKILRDGYEPHEKTRSGPSQDHNSVENKDQDHDLDECEKGNDMEKALECQAQLIDQYEAMEKAQREWEEKFR

Query:  ENNNSTPDSCDPGNHSDITEERDEIRAQAPNLSGNAFLANEAKSQVAVNCVTRDSSQAQTNGLDPSSC-ADVEDLQDQNTNSISTSRSLEEFTFPMANVK
        ENNNSTPDSCDPGNHSDITEERDE+RAQAPNLS N   ANEAK  +AV C  RD SQ QTNGL PS C ADVEDLQDQNTNSISTS+SLEEFTFPMANVK
Subjt:  ENNNSTPDSCDPGNHSDITEERDEIRAQAPNLSGNAFLANEAKSQVAVNCVTRDSSQAQTNGLDPSSC-ADVEDLQDQNTNSISTSRSLEEFTFPMANVK

Query:  QCQESQESSEQEPSCTSQLNHGLPHRPLSSHGGIDFYDQETPCSKNDLYALVPHEPPALNGVLEALKQAKLSLAKKIKKLPSVEGE------SIGTLSVP
        QCQ+SQE+S QEPSCTS LNHGLP RPLSSH GI+ YDQETPCS NDLYALVPHEPPAL+GVLEALKQAKLSL KKI KLPSV+GE      SIG LSV 
Subjt:  QCQESQESSEQEPSCTSQLNHGLPHRPLSSHGGIDFYDQETPCSKNDLYALVPHEPPALNGVLEALKQAKLSLAKKIKKLPSVEGE------SIGTLSVP

Query:  KVGGRLDIPIGCAGLFRLPTDFAAEASTQANFLVSSSQLRSSTHYP---------------------------------------VFTRDGFLTDHFPEN
        K+G RL+IP+GCAGLFRLPTDFAAEAS+QANFL SSSQLRS THYP                                        FTRD FLTDH PEN
Subjt:  KVGGRLDIPIGCAGLFRLPTDFAAEASTQANFLVSSSQLRSSTHYP---------------------------------------VFTRDGFLTDHFPEN

Query:  GWKNPGQKHHFDRYFDAIQPSPHVHNYPSPSVPGELFFFCNNRLVYHGKAISTFSWFLQTCWKEIGVERDLTLYHT
         WKNP QKHH D+YFDA+QPS +V NYPS  VPGELFFFCN RL+YHG+       F +   +++    DLTLYHT
Subjt:  GWKNPGQKHHFDRYFDAIQPSPHVHNYPSPSVPGELFFFCNNRLVYHGKAISTFSWFLQTCWKEIGVERDLTLYHT

XP_004140985.1 uncharacterized protein LOC101207733 [Cucumis sativus] | e-value: 1.2e-256 | identity: 77.22%
Query:  MKNPDQDQQDPRNVPGGEDTTAMTIEFLRARLLSERSVSRSARQRADELAKRVAELEEQLKIVSLQRKMAEKATADVLAILEDNGASDISETLDSNSDHE
        M+NPDQDQQDPR+VPG EDTTAMTIEFLRARLLSERSVS+SARQRADELAKRVAELEEQLKIVSLQRKMAEKATADVLAILEDNGASDISETLDSNSDHE
Subjt:  MKNPDQDQQDPRNVPGGEDTTAMTIEFLRARLLSERSVSRSARQRADELAKRVAELEEQLKIVSLQRKMAEKATADVLAILEDNGASDISETLDSNSDHE

Query:  TPCESKVGNDPAREDVNSSKSIHMRNEHEEFSGSDIDTSPVLGGSLSWKGRNDSPHNREKYKKFSTRSRSSFTSIGSSSPKHRLGRSCRQIKRRDTRQLD
        T  E KV +  AREDV SS ++  RNEHEE+SGS+IDTSPVLGGSLSWKGRNDSPH REKYKK S RSRSSFTSIGSSSPKH+LGRSCRQIKRRDTR LD
Subjt:  TPCESKVGNDPAREDVNSSKSIHMRNEHEEFSGSDIDTSPVLGGSLSWKGRNDSPHNREKYKKFSTRSRSSFTSIGSSSPKHRLGRSCRQIKRRDTRQLD

Query:  GEQELKSEALVGSFQEIAPSACSEDSRNCCVKGPKILRDGYEPHEKTRSGPSQDHNSVENKDQDHDLDECEKGNDMEKALECQAQLIDQYEAMEKAQREW
        GEQELKS+ALV S +EI PS   EDS+N  V G  ILRDGYE  EKTRS  S  HNSV N DQD+D+D  EK +DMEKAL+CQAQLIDQYEAMEKAQREW
Subjt:  GEQELKSEALVGSFQEIAPSACSEDSRNCCVKGPKILRDGYEPHEKTRSGPSQDHNSVENKDQDHDLDECEKGNDMEKALECQAQLIDQYEAMEKAQREW

Query:  EEKFRENNNSTPDSCDPGNHSDITEERDEIRAQAPNLSGNAFLANEAKSQVAVNCVTRDSSQAQTNGLDPSSCA-DVEDLQDQNTNSISTSRSLEEFTFP
        EEKFRENNNSTPDSCDPGNHSDITEERDE+RAQAPNLS N   ANEAK QVA +C TRD SQAQTNGL PS CA DVEDLQDQNTNSISTS+SLEEFTFP
Subjt:  EEKFRENNNSTPDSCDPGNHSDITEERDEIRAQAPNLSGNAFLANEAKSQVAVNCVTRDSSQAQTNGLDPSSCA-DVEDLQDQNTNSISTSRSLEEFTFP

Query:  MANVKQCQESQESSEQEPSCTSQLNHGLPHRPLSSHGGIDFYDQETPCSKNDLYALVPHEPPALNGVLEALKQAKLSLAKKIKKLPSVEGE------SIG
        MANVKQCQESQE+S QEPSCTS LNHGLP RPLSSHGGI+ YDQETPCS NDLYALVPHEPPAL+GVLEALKQAKLSL KKI KLPSV+GE      SIG
Subjt:  MANVKQCQESQESSEQEPSCTSQLNHGLPHRPLSSHGGIDFYDQETPCSKNDLYALVPHEPPALNGVLEALKQAKLSLAKKIKKLPSVEGE------SIG

Query:  TLSVPKVGGRLDIPIGCAGLFRLPTDFAAEASTQANFLVSSSQLRSSTHYP---------------------------------------VFTRDGFLTD
         LS+PK+G RL+IP+GCAGLFRLPTDFAAEAS+QANFL SSSQLRS THYP                                        FTRDGFLTD
Subjt:  TLSVPKVGGRLDIPIGCAGLFRLPTDFAAEASTQANFLVSSSQLRSSTHYP---------------------------------------VFTRDGFLTD

Query:  HFPENGWKNPGQKHHFDRYFDAIQPSPHVHNYPSPSVPGEL
        H PEN WKNPGQKHHFD+YFDA+QPS +VHNYP   V   +
Subjt:  HFPENGWKNPGQKHHFDRYFDAIQPSPHVHNYPSPSVPGEL

XP_022134072.1 uncharacterized protein LOC111006434 [Momordica charantia] | e-value: 6.8e-271 | identity: 80.22%
Query:  MKNPDQDQQDPRNVPGGEDTTAMTIEFLRARLLSERSVSRSARQRADELAKRVAELEEQLKIVSLQRKMAEKATADVLAILEDNGASDISETLDSNSDHE
        M+NPDQDQQDPR+VPG EDTTAMTIEFLRARLLSERSVSRSA+QRADELAKRVAELEEQLK+VSLQRKMAEKATADVL+ILEDNGASDISETLDSNSDHE
Subjt:  MKNPDQDQQDPRNVPGGEDTTAMTIEFLRARLLSERSVSRSARQRADELAKRVAELEEQLKIVSLQRKMAEKATADVLAILEDNGASDISETLDSNSDHE

Query:  TPCESKVGNDPAREDVNSSKSIHMRNEHEEFSGSDIDTSPVLGGSLSWKGRNDSPHNREKYKKFSTRSRSSFTSIGSSSPKHRLGRSCRQIKRRDTRQLD
        T CESKV +DPAR DVNS+ S   RN HEE+SGSDIDTSPVLGGSLSWKGRNDSPH  EKYKK S RSRSSF+SIGSSSPKHRLGRSCRQIKRRD R L+
Subjt:  TPCESKVGNDPAREDVNSSKSIHMRNEHEEFSGSDIDTSPVLGGSLSWKGRNDSPHNREKYKKFSTRSRSSFTSIGSSSPKHRLGRSCRQIKRRDTRQLD

Query:  GEQELKSEALVGSFQEIAPSACSEDSRNCCVKGPKILRDGYEPHEKTRSGPSQDHNSVENKDQDHDLDECEKGNDMEKALECQAQLIDQYEAMEKAQREW
         EQELKSEALV S QEIAPS CSEDSRNCC+ GPKILRDG++ HE+TRSG S D++ V NKD+DHDLDE EK NDMEKALECQAQLIDQYEAMEKAQREW
Subjt:  GEQELKSEALVGSFQEIAPSACSEDSRNCCVKGPKILRDGYEPHEKTRSGPSQDHNSVENKDQDHDLDECEKGNDMEKALECQAQLIDQYEAMEKAQREW

Query:  EEKFRENNNSTPDSCDPGNHSDITEERDEIRAQAPNLSGNAFLANEAKSQVAVNCVTRDSSQAQTNGLDPSSCADVEDLQDQNTNSISTSRSLEEFTFPM
        EEKFRENNNSTPDSCDPGNHSDITEERDEIRAQAPNLSGNAFL NEAK+QVAV+C+ RDS QAQTNGL PS CADVE+LQDQN+NSISTSRSLEEFTFPM
Subjt:  EEKFRENNNSTPDSCDPGNHSDITEERDEIRAQAPNLSGNAFLANEAKSQVAVNCVTRDSSQAQTNGLDPSSCADVEDLQDQNTNSISTSRSLEEFTFPM

Query:  ANVKQCQESQESSEQEPSCTSQLNHGLPHRPLSSHGGIDFYDQETPCSKNDLYALVPHEPPALNGVLEALKQAKLSLAKKIKKLPSVEGE----SIGTLS
        ANVKQCQESQE+ EQEPSCTSQLN+GLP RPLSSHGGI+F+++E PCSKNDLYALVPHEPPALNGVLEALKQAKLSLAKKI KLPSVEGE    SIGTLS
Subjt:  ANVKQCQESQESSEQEPSCTSQLNHGLPHRPLSSHGGIDFYDQETPCSKNDLYALVPHEPPALNGVLEALKQAKLSLAKKIKKLPSVEGE----SIGTLS

Query:  VPKVGGRLDIPIGCAGLFRLPTDFAAEASTQANFLVSSSQLRSSTHYP----------------------VFTRD--------GFLTDHFPENGWKNP--
        VP VG RL++PIGCAGLFRLPTDFAAEAS+QA+FL SSSQ RS+THYP                       F RD        GF TDHFPENGW NP  
Subjt:  VPKVGGRLDIPIGCAGLFRLPTDFAAEASTQANFLVSSSQLRSSTHYP----------------------VFTRD--------GFLTDHFPENGWKNP--

Query:  GQKHHFDRYFDAIQPSPH-VHNYPSPSVPGEL
        GQ++ FDR FDAIQPSPH VH YP P V   +
Subjt:  GQKHHFDRYFDAIQPSPH-VHNYPSPSVPGEL

XP_038885028.1 uncharacterized protein LOC120075573 [Benincasa hispida] | e-value: 1.3e-261 | identity: 78.84%
Query:  MKNPDQDQQDPRNVPGGEDTTAMTIEFLRARLLSERSVSRSARQRADELAKRVAELEEQLKIVSLQRKMAEKATADVLAILEDNGASDISETLDSNSDHE
        M+NPDQDQQDPR+VPG EDTTAMTIEFLRARLLSERSVS+SARQRADELAKRVAELEEQLKIVSLQRKMAEKATADVLAILEDNGASDISET DSNSD E
Subjt:  MKNPDQDQQDPRNVPGGEDTTAMTIEFLRARLLSERSVSRSARQRADELAKRVAELEEQLKIVSLQRKMAEKATADVLAILEDNGASDISETLDSNSDHE

Query:  TPCESKVGNDPAREDVNSSKSIHMRNEHEEFSGSDIDTSPVLGGSLSWKGRNDSPHNREKYKKFSTRSRSSFTSIGSSSPKHRLGRSCRQIKRRDTRQLD
        T  ESKV + PARE VNSS SI  RN HEE+SG DIDTSPVLGGSLSWKGRNDSPH REKYKKFS RSRSSFTSI SSSPKH+LGRSCRQIKR+DTR LD
Subjt:  TPCESKVGNDPAREDVNSSKSIHMRNEHEEFSGSDIDTSPVLGGSLSWKGRNDSPHNREKYKKFSTRSRSSFTSIGSSSPKHRLGRSCRQIKRRDTRQLD

Query:  GEQELKSEALVGSFQEIAPSACSEDSRNCCVKGPKILRDGYEPHEKTRSGPSQDHNSVENKDQDHDLDECEKGNDMEKALECQAQLIDQYEAMEKAQREW
        GEQELKSEA V S QEI PS CSED+RN  V G  ILRDGYE  EKT SG S  HNSV NKDQDHDLD  EK N+MEKAL+CQAQLIDQYEAMEKAQREW
Subjt:  GEQELKSEALVGSFQEIAPSACSEDSRNCCVKGPKILRDGYEPHEKTRSGPSQDHNSVENKDQDHDLDECEKGNDMEKALECQAQLIDQYEAMEKAQREW

Query:  EEKFRENNNSTPDSCDPGNHSDITEERDEIRAQAPNLSGNAFLANEAKSQVAVNCVTRDSSQAQTNGLDPSSCADVEDLQDQNTNSISTSRSLEEFTFPM
        EEKFRENNNSTPDSCDPGNHSDITEERDE+RAQAPNLS NAFLANEAKSQVAV+CV RD SQAQTNGL PS CADVEDLQDQNTNS+STS+SLEEFTFPM
Subjt:  EEKFRENNNSTPDSCDPGNHSDITEERDEIRAQAPNLSGNAFLANEAKSQVAVNCVTRDSSQAQTNGLDPSSCADVEDLQDQNTNSISTSRSLEEFTFPM

Query:  ANVKQCQESQESSEQEPSCTSQLNHGLPHRPLSSHGGIDFYDQETPCSKNDLYALVPHEPPALNGVLEALKQAKLSLAKKIKKLPSVEGE----SIGTLS
        A VKQ QESQE+S QEPSCTS L+HGLP RPLSSH GI+FYDQETP S NDLYALVPHEPPAL+GVLEAL QAKLSL KKI KLPSVEGE    SIG LS
Subjt:  ANVKQCQESQESSEQEPSCTSQLNHGLPHRPLSSHGGIDFYDQETPCSKNDLYALVPHEPPALNGVLEALKQAKLSLAKKIKKLPSVEGE----SIGTLS

Query:  VPKVGGRLDIPIGCAGLFRLPTDFAAEASTQANFLVSSSQLRSSTHYP---------------------------------------VFTRDGFLTDHFP
        VPKVG RL+IPIGCAGLFRLPTDFAAEAS+Q NFL SSSQLRSSTHYP                                        FTRDGFLT +  
Subjt:  VPKVGGRLDIPIGCAGLFRLPTDFAAEASTQANFLVSSSQLRSSTHYP---------------------------------------VFTRDGFLTDHFP

Query:  ENGWKNPGQKHHFDRYFDAIQPSPHVHNYPSPSVPGEL
        EN WKNPGQKHHFD+YFDA+QPSP+VHNYPS  V   +
Subjt:  ENGWKNPGQKHHFDRYFDAIQPSPHVHNYPSPSVPGEL

TrEMBL top hits (e-value, % identity, alignment)
A0A0A0K8I3 Uncharacterized protein | e-value: 6.0e-257 | identity: 77.22%
Query:  MKNPDQDQQDPRNVPGGEDTTAMTIEFLRARLLSERSVSRSARQRADELAKRVAELEEQLKIVSLQRKMAEKATADVLAILEDNGASDISETLDSNSDHE
        M+NPDQDQQDPR+VPG EDTTAMTIEFLRARLLSERSVS+SARQRADELAKRVAELEEQLKIVSLQRKMAEKATADVLAILEDNGASDISETLDSNSDHE
Subjt:  MKNPDQDQQDPRNVPGGEDTTAMTIEFLRARLLSERSVSRSARQRADELAKRVAELEEQLKIVSLQRKMAEKATADVLAILEDNGASDISETLDSNSDHE

Query:  TPCESKVGNDPAREDVNSSKSIHMRNEHEEFSGSDIDTSPVLGGSLSWKGRNDSPHNREKYKKFSTRSRSSFTSIGSSSPKHRLGRSCRQIKRRDTRQLD
        T  E KV +  AREDV SS ++  RNEHEE+SGS+IDTSPVLGGSLSWKGRNDSPH REKYKK S RSRSSFTSIGSSSPKH+LGRSCRQIKRRDTR LD
Subjt:  TPCESKVGNDPAREDVNSSKSIHMRNEHEEFSGSDIDTSPVLGGSLSWKGRNDSPHNREKYKKFSTRSRSSFTSIGSSSPKHRLGRSCRQIKRRDTRQLD

Query:  GEQELKSEALVGSFQEIAPSACSEDSRNCCVKGPKILRDGYEPHEKTRSGPSQDHNSVENKDQDHDLDECEKGNDMEKALECQAQLIDQYEAMEKAQREW
        GEQELKS+ALV S +EI PS   EDS+N  V G  ILRDGYE  EKTRS  S  HNSV N DQD+D+D  EK +DMEKAL+CQAQLIDQYEAMEKAQREW
Subjt:  GEQELKSEALVGSFQEIAPSACSEDSRNCCVKGPKILRDGYEPHEKTRSGPSQDHNSVENKDQDHDLDECEKGNDMEKALECQAQLIDQYEAMEKAQREW

Query:  EEKFRENNNSTPDSCDPGNHSDITEERDEIRAQAPNLSGNAFLANEAKSQVAVNCVTRDSSQAQTNGLDPSSCA-DVEDLQDQNTNSISTSRSLEEFTFP
        EEKFRENNNSTPDSCDPGNHSDITEERDE+RAQAPNLS N   ANEAK QVA +C TRD SQAQTNGL PS CA DVEDLQDQNTNSISTS+SLEEFTFP
Subjt:  EEKFRENNNSTPDSCDPGNHSDITEERDEIRAQAPNLSGNAFLANEAKSQVAVNCVTRDSSQAQTNGLDPSSCA-DVEDLQDQNTNSISTSRSLEEFTFP

Query:  MANVKQCQESQESSEQEPSCTSQLNHGLPHRPLSSHGGIDFYDQETPCSKNDLYALVPHEPPALNGVLEALKQAKLSLAKKIKKLPSVEGE------SIG
        MANVKQCQESQE+S QEPSCTS LNHGLP RPLSSHGGI+ YDQETPCS NDLYALVPHEPPAL+GVLEALKQAKLSL KKI KLPSV+GE      SIG
Subjt:  MANVKQCQESQESSEQEPSCTSQLNHGLPHRPLSSHGGIDFYDQETPCSKNDLYALVPHEPPALNGVLEALKQAKLSLAKKIKKLPSVEGE------SIG

Query:  TLSVPKVGGRLDIPIGCAGLFRLPTDFAAEASTQANFLVSSSQLRSSTHYP---------------------------------------VFTRDGFLTD
         LS+PK+G RL+IP+GCAGLFRLPTDFAAEAS+QANFL SSSQLRS THYP                                        FTRDGFLTD
Subjt:  TLSVPKVGGRLDIPIGCAGLFRLPTDFAAEASTQANFLVSSSQLRSSTHYP---------------------------------------VFTRDGFLTD

Query:  HFPENGWKNPGQKHHFDRYFDAIQPSPHVHNYPSPSVPGEL
        H PEN WKNPGQKHHFD+YFDA+QPS +VHNYP   V   +
Subjt:  HFPENGWKNPGQKHHFDRYFDAIQPSPHVHNYPSPSVPGEL

A0A1S3C3K3 uncharacterized protein LOC103496496 | e-value: 6.6e-248 | identity: 75.2%
Query:  MKNPDQDQQDPRNVPGGEDTTAMTIEFLRARLLSERSVSRSARQRADELAKRVAELEEQLKIVSLQRKMAEKATADVLAILEDNGASDISETLDSNSDHE
        M+NPDQDQQDPR+VPG EDTTAMTIEFLRARLLSERSVS+SARQRADELAKRVAELEEQLKIVSLQRKMAEKATADVLAILEDNGASDISETLDSNSDHE
Subjt:  MKNPDQDQQDPRNVPGGEDTTAMTIEFLRARLLSERSVSRSARQRADELAKRVAELEEQLKIVSLQRKMAEKATADVLAILEDNGASDISETLDSNSDHE

Query:  TPCESKVGNDPAREDVNSSKSIHMRNEHEEFSGSDIDTSPVLGGSLSWKGRNDSPHNREKYKKFSTRSRSSFTSIGSSSPKHRLGRSCRQIKRRDTRQLD
        T  E KV + P REDVNS  ++  RNEHEE+SGS+I+TSPVLGGSLSWKGRNDSPH REKYKK S RSRSSFTSIGSSSPKH+LGRSCRQIKRRDTR LD
Subjt:  TPCESKVGNDPAREDVNSSKSIHMRNEHEEFSGSDIDTSPVLGGSLSWKGRNDSPHNREKYKKFSTRSRSSFTSIGSSSPKHRLGRSCRQIKRRDTRQLD

Query:  GEQELKSEALVGSFQEIAPSACSEDSRNCCVKGPKILRDGYEPHEKTRSGPSQDHNSVENKDQDHDLDECEKGNDMEKALECQAQLIDQYEAMEKAQREW
        GEQELKSEA + S +EI  S   EDSRN  V G  ILRD YE  EKT S  S  HNS+ N DQD+D+D  EK +DMEKAL+CQAQLIDQYEAMEKAQREW
Subjt:  GEQELKSEALVGSFQEIAPSACSEDSRNCCVKGPKILRDGYEPHEKTRSGPSQDHNSVENKDQDHDLDECEKGNDMEKALECQAQLIDQYEAMEKAQREW

Query:  EEKFRENNNSTPDSCDPGNHSDITEERDEIRAQAPNLSGNAFLANEAKSQVAVNCVTRDSSQAQTNGLDPSSC-ADVEDLQDQNTNSISTSRSLEEFTFP
        EEKFRENNNSTPDSCDPGNHSDITEERDE+RAQAPNLS N   ANEAK  +AV C  RD SQ QTNGL PS C ADVEDLQDQNTNSISTS+SLEEFTFP
Subjt:  EEKFRENNNSTPDSCDPGNHSDITEERDEIRAQAPNLSGNAFLANEAKSQVAVNCVTRDSSQAQTNGLDPSSC-ADVEDLQDQNTNSISTSRSLEEFTFP

Query:  MANVKQCQESQESSEQEPSCTSQLNHGLPHRPLSSHGGIDFYDQETPCSKNDLYALVPHEPPALNGVLEALKQAKLSLAKKIKKLPSVEGE------SIG
        MANVKQCQ+SQE+S QEPSCTS LNHGLP RPLSSH GI+ YDQETPCS NDLYALVPHEPPAL+GVLEALKQAKLSL KKI KLPSV+GE      SIG
Subjt:  MANVKQCQESQESSEQEPSCTSQLNHGLPHRPLSSHGGIDFYDQETPCSKNDLYALVPHEPPALNGVLEALKQAKLSLAKKIKKLPSVEGE------SIG

Query:  TLSVPKVGGRLDIPIGCAGLFRLPTDFAAEASTQANFLVSSSQLRSSTHYP---------------------------------------VFTRDGFLTD
         LSV K+G RL+IP+GCAGLFRLPTDFAAEAS+QANFL SSSQLRS THYP                                        FTRD FLTD
Subjt:  TLSVPKVGGRLDIPIGCAGLFRLPTDFAAEASTQANFLVSSSQLRSSTHYP---------------------------------------VFTRDGFLTD

Query:  HFPENGWKNPGQKHHFDRYFDAIQPSPHVHNYPSPSVPGEL
        H PEN WKNP QKHH D+YFDA+QPS +V NYPS  V   +
Subjt:  HFPENGWKNPGQKHHFDRYFDAIQPSPHVHNYPSPSVPGEL

A0A5D3CNC8 Uncharacterized protein | e-value: 4.7e-254 | identity: 73.22%
Query:  QDQQDPRNVPGGEDTTAMTIEFLRARLLSERSVSRSARQRADELAKRVAELEEQLKIVSLQRKMAEKATADVLAILEDNGASDISETLDSNSDHETPCES
        Q++ + R+VPG EDTTAMTIEFLRARLLSERSVS+SARQRADELAKRVAELEEQLKIVSLQRKMAEKATADVLAILEDNGASDISETLDSNSDHET  E 
Subjt:  QDQQDPRNVPGGEDTTAMTIEFLRARLLSERSVSRSARQRADELAKRVAELEEQLKIVSLQRKMAEKATADVLAILEDNGASDISETLDSNSDHETPCES

Query:  KVGNDPAREDVNSSKSIHMRNEHEEFSGSDIDTSPVLGGSLSWKGRNDSPHNREKYKKFSTRSRSSFTSIGSSSPKHRLGRSCRQIKRRDTRQLDGEQEL
        KV + P REDVNS  ++  RNEHEE+SGS+I+TSPVLGGSLSWKGRNDSPH REKYKK S RSRSSFTSIGSSSPKH+LGRSCRQIKRRDTR LDGEQEL
Subjt:  KVGNDPAREDVNSSKSIHMRNEHEEFSGSDIDTSPVLGGSLSWKGRNDSPHNREKYKKFSTRSRSSFTSIGSSSPKHRLGRSCRQIKRRDTRQLDGEQEL

Query:  KSEALVGSFQEIAPSACSEDSRNCCVKGPKILRDGYEPHEKTRSGPSQDHNSVENKDQDHDLDECEKGNDMEKALECQAQLIDQYEAMEKAQREWEEKFR
        KSEA + S +EI  S   EDSRN  V G  ILRD YE  EKT S  S  HNS+ N DQD+D+D  EK +DMEKAL+CQAQLIDQYEAMEKAQREWEEKFR
Subjt:  KSEALVGSFQEIAPSACSEDSRNCCVKGPKILRDGYEPHEKTRSGPSQDHNSVENKDQDHDLDECEKGNDMEKALECQAQLIDQYEAMEKAQREWEEKFR

Query:  ENNNSTPDSCDPGNHSDITEERDEIRAQAPNLSGNAFLANEAKSQVAVNCVTRDSSQAQTNGLDPSSC-ADVEDLQDQNTNSISTSRSLEEFTFPMANVK
        ENNNSTPDSCDPGNHSDITEERDE+RAQAPNLS N   ANEAK  +AV C  RD SQ QTNGL PS C ADVEDLQDQNTNSISTS+SLEEFTFPMANVK
Subjt:  ENNNSTPDSCDPGNHSDITEERDEIRAQAPNLSGNAFLANEAKSQVAVNCVTRDSSQAQTNGLDPSSC-ADVEDLQDQNTNSISTSRSLEEFTFPMANVK

Query:  QCQESQESSEQEPSCTSQLNHGLPHRPLSSHGGIDFYDQETPCSKNDLYALVPHEPPALNGVLEALKQAKLSLAKKIKKLPSVEGE------SIGTLSVP
        QCQ+SQE+S QEPSCTS LNHGLP RPLSSH GI+ YDQETPCS NDLYALVPHEPPAL+GVLEALKQAKLSL KKI KLPSV+GE      SIG LSV 
Subjt:  QCQESQESSEQEPSCTSQLNHGLPHRPLSSHGGIDFYDQETPCSKNDLYALVPHEPPALNGVLEALKQAKLSLAKKIKKLPSVEGE------SIGTLSVP

Query:  KVGGRLDIPIGCAGLFRLPTDFAAEASTQANFLVSSSQLRSSTHYP---------------------------------------VFTRDGFLTDHFPEN
        K+G RL+IP+GCAGLFRLPTDFAAEAS+QANFL SSSQLRS THYP                                        FTRD FLTDH PEN
Subjt:  KVGGRLDIPIGCAGLFRLPTDFAAEASTQANFLVSSSQLRSSTHYP---------------------------------------VFTRDGFLTDHFPEN

Query:  GWKNPGQKHHFDRYFDAIQPSPHVHNYPSPSVPGELFFFCNNRLVYHGKAISTFSWFLQTCWKEIGVERDLTLYHT
         WKNP QKHH D+YFDA+QPS +V NYPS  VPGELFFFCN RL+YHG+       F +   +++    DLTLYHT
Subjt:  GWKNPGQKHHFDRYFDAIQPSPHVHNYPSPSVPGELFFFCNNRLVYHGKAISTFSWFLQTCWKEIGVERDLTLYHT

A0A6J1BXR7 uncharacterized protein LOC111006434 | e-value: 3.3e-271 | identity: 80.22%
Query:  MKNPDQDQQDPRNVPGGEDTTAMTIEFLRARLLSERSVSRSARQRADELAKRVAELEEQLKIVSLQRKMAEKATADVLAILEDNGASDISETLDSNSDHE
        M+NPDQDQQDPR+VPG EDTTAMTIEFLRARLLSERSVSRSA+QRADELAKRVAELEEQLK+VSLQRKMAEKATADVL+ILEDNGASDISETLDSNSDHE
Subjt:  MKNPDQDQQDPRNVPGGEDTTAMTIEFLRARLLSERSVSRSARQRADELAKRVAELEEQLKIVSLQRKMAEKATADVLAILEDNGASDISETLDSNSDHE

Query:  TPCESKVGNDPAREDVNSSKSIHMRNEHEEFSGSDIDTSPVLGGSLSWKGRNDSPHNREKYKKFSTRSRSSFTSIGSSSPKHRLGRSCRQIKRRDTRQLD
        T CESKV +DPAR DVNS+ S   RN HEE+SGSDIDTSPVLGGSLSWKGRNDSPH  EKYKK S RSRSSF+SIGSSSPKHRLGRSCRQIKRRD R L+
Subjt:  TPCESKVGNDPAREDVNSSKSIHMRNEHEEFSGSDIDTSPVLGGSLSWKGRNDSPHNREKYKKFSTRSRSSFTSIGSSSPKHRLGRSCRQIKRRDTRQLD

Query:  GEQELKSEALVGSFQEIAPSACSEDSRNCCVKGPKILRDGYEPHEKTRSGPSQDHNSVENKDQDHDLDECEKGNDMEKALECQAQLIDQYEAMEKAQREW
         EQELKSEALV S QEIAPS CSEDSRNCC+ GPKILRDG++ HE+TRSG S D++ V NKD+DHDLDE EK NDMEKALECQAQLIDQYEAMEKAQREW
Subjt:  GEQELKSEALVGSFQEIAPSACSEDSRNCCVKGPKILRDGYEPHEKTRSGPSQDHNSVENKDQDHDLDECEKGNDMEKALECQAQLIDQYEAMEKAQREW

Query:  EEKFRENNNSTPDSCDPGNHSDITEERDEIRAQAPNLSGNAFLANEAKSQVAVNCVTRDSSQAQTNGLDPSSCADVEDLQDQNTNSISTSRSLEEFTFPM
        EEKFRENNNSTPDSCDPGNHSDITEERDEIRAQAPNLSGNAFL NEAK+QVAV+C+ RDS QAQTNGL PS CADVE+LQDQN+NSISTSRSLEEFTFPM
Subjt:  EEKFRENNNSTPDSCDPGNHSDITEERDEIRAQAPNLSGNAFLANEAKSQVAVNCVTRDSSQAQTNGLDPSSCADVEDLQDQNTNSISTSRSLEEFTFPM

Query:  ANVKQCQESQESSEQEPSCTSQLNHGLPHRPLSSHGGIDFYDQETPCSKNDLYALVPHEPPALNGVLEALKQAKLSLAKKIKKLPSVEGE----SIGTLS
        ANVKQCQESQE+ EQEPSCTSQLN+GLP RPLSSHGGI+F+++E PCSKNDLYALVPHEPPALNGVLEALKQAKLSLAKKI KLPSVEGE    SIGTLS
Subjt:  ANVKQCQESQESSEQEPSCTSQLNHGLPHRPLSSHGGIDFYDQETPCSKNDLYALVPHEPPALNGVLEALKQAKLSLAKKIKKLPSVEGE----SIGTLS

Query:  VPKVGGRLDIPIGCAGLFRLPTDFAAEASTQANFLVSSSQLRSSTHYP----------------------VFTRD--------GFLTDHFPENGWKNP--
        VP VG RL++PIGCAGLFRLPTDFAAEAS+QA+FL SSSQ RS+THYP                       F RD        GF TDHFPENGW NP  
Subjt:  VPKVGGRLDIPIGCAGLFRLPTDFAAEASTQANFLVSSSQLRSSTHYP----------------------VFTRD--------GFLTDHFPENGWKNP--

Query:  GQKHHFDRYFDAIQPSPH-VHNYPSPSVPGEL
        GQ++ FDR FDAIQPSPH VH YP P V   +
Subjt:  GQKHHFDRYFDAIQPSPH-VHNYPSPSVPGEL

A0A6J1FEU7 uncharacterized protein LOC111445070 | e-value: 8.7e-248 | identity: 75.79%
Query:  MKNPDQDQQDPRNVPGGEDTTAMTIEFLRARLLSERSVSRSARQRADELAKRVAELEEQLKIVSLQRKMAEKATADVLAILEDNGASDISETLDSNSDHE
        M+NPDQDQQDPR+VPG EDTTAMTIEFLRARLLSERSVS+ ARQRADELAKRVAELEEQL++VS QR+MAEKATADVLAILEDNGA+DISETLDSNSDHE
Subjt:  MKNPDQDQQDPRNVPGGEDTTAMTIEFLRARLLSERSVSRSARQRADELAKRVAELEEQLKIVSLQRKMAEKATADVLAILEDNGASDISETLDSNSDHE

Query:  TPCESKVGNDPAREDVNSSKSIHMRNEHEEFSGSDIDTSPVLGGSLSWKGRNDSPHNREKYKKFSTRSRSSFTSIGSSSPKHRLGRSCRQIKRRDTRQLD
        T   +KV + P R D NSS SI  RNEHEE+SGS  DTSP+LG SLSWKGRND PH REKYKKFS RS+S+FTSIGSSSPKH+LGRSCRQIKRRDTR LD
Subjt:  TPCESKVGNDPAREDVNSSKSIHMRNEHEEFSGSDIDTSPVLGGSLSWKGRNDSPHNREKYKKFSTRSRSSFTSIGSSSPKHRLGRSCRQIKRRDTRQLD

Query:  GEQELKSEALVGSFQEIAPSACSEDSRNCCVKGPKILRDGYEPHEKTRSGPSQDHNSVENKDQDHDLDECEKGNDMEKALECQAQLIDQYEAMEKAQREW
        GEQELKS+  V S QEI PS CSEDSRN  V G KI RDGYE HEKTRSG S+ HNSV NKDQDHDLD  EK +DMEK+L+CQAQLIDQYEAMEKAQREW
Subjt:  GEQELKSEALVGSFQEIAPSACSEDSRNCCVKGPKILRDGYEPHEKTRSGPSQDHNSVENKDQDHDLDECEKGNDMEKALECQAQLIDQYEAMEKAQREW

Query:  EEKFRENNNSTPDSCDPGNHSDITEERDEIRAQAPNLSGNAFLANEAKSQVAVNCVTRDSSQAQTN-GLDPSSCADVEDLQDQNTNSISTSRSLEEFTFP
        EEKFRENNNSTPDSCDPGNHSDITEERDEIRAQAPNL  ++ LANE KSQV+ +CVTRD SQAQT+ GL PS C+DV DLQDQN NS+STSRSLEEFTFP
Subjt:  EEKFRENNNSTPDSCDPGNHSDITEERDEIRAQAPNLSGNAFLANEAKSQVAVNCVTRDSSQAQTN-GLDPSSCADVEDLQDQNTNSISTSRSLEEFTFP

Query:  MANVKQCQESQESSEQEPSCTSQLNHGLPHRPLSSHGGIDFYDQETPCSKNDLYALVPHEPPALNGVLEALKQAKLSLAKKIKKLPSVEG----ESIGTL
        MANVKQCQES E+ EQEPSCTS LNHGLP R LSSH GI+ YDQETPCS+ DLYALVPHEPPAL+GVLEALKQAKLSL KKI KLP VEG    +SIG L
Subjt:  MANVKQCQESQESSEQEPSCTSQLNHGLPHRPLSSHGGIDFYDQETPCSKNDLYALVPHEPPALNGVLEALKQAKLSLAKKIKKLPSVEG----ESIGTL

Query:  SVPKVGGRLDIPIGCAGLFRLPTDFAAEA-STQANFLVSSSQLRS----------------------------------STHYPV---FTRDGFLTDHFP
        SVPKV   L+IPIGCAGLFRLPTDFAAEA STQ NFL SSS+LRS                                  S+HY      TRDG+LTDHFP
Subjt:  SVPKVGGRLDIPIGCAGLFRLPTDFAAEA-STQANFLVSSSQLRS----------------------------------STHYPV---FTRDGFLTDHFP

Query:  ENGWKNPGQKHHFDRYFDAIQPSPHVHNYPSP
        E+ WKNPGQ HHFD+YFDAIQPSP+VH+YPSP
Subjt:  ENGWKNPGQKHHFDRYFDAIQPSPHVHNYPSP

SwissProt top hits (e-value, % identity, alignment)
No hits found
Arabidopsis top hits (e-value, % identity, alignment)
AT3G52240.1 unknown protein | e-value: 1.0e-62 | identity: 36.83%
Query:  EDTTAMTIEFLRARLLSERSVSRSARQRADELAKRVAELEEQLKIVSLQRKMAEKATADVLAILEDNGASDISETLDSNSDHETPCESKVGNDPAREDVN
        +D T++TIEFLRARLL+ER+VS+SAR + D LA +VAELEEQLKIVSLQRK AE+ATADVLAILE+NG +D+S+  DSNSDHE  C S+           
Subjt:  EDTTAMTIEFLRARLLSERSVSRSARQRADELAKRVAELEEQLKIVSLQRKMAEKATADVLAILEDNGASDISETLDSNSDHETPCESKVGNDPAREDVN

Query:  SSKSIHMRNEHEEFSGSDIDTSPVLGGSLSWKGRNDSPHNREKYKK-FSTRSRSSFTSIGSSSPKHRLGRSCRQIKRRDTRQLDGEQELKSEALVGSFQ-
                            ++ VLG SLSWKGR   P + +K K+  + R    F     SSP+HR GRSCRQI+R + R +   ++ K +     FQ 
Subjt:  SSKSIHMRNEHEEFSGSDIDTSPVLGGSLSWKGRNDSPHNREKYKK-FSTRSRSSFTSIGSSSPKHRLGRSCRQIKRRDTRQLDGEQELKSEALVGSFQ-

Query:  -----EIAPSACSEDSRN----CCVKGPKILRDGYEPHEKTRSGPSQDHNSVENKDQDHDLDECEKGN----DMEKALECQAQLIDQYEAMEKAQREWEE
             E+ P    + SR       VKG   L                  N + N       +  EKGN    ++E+ALE +AQ+I  +E ME+ QREWE+
Subjt:  -----EIAPSACSEDSRN----CCVKGPKILRDGYEPHEKTRSGPSQDHNSVENKDQDHDLDECEKGN----DMEKALECQAQLIDQYEAMEKAQREWEE

Query:  KFRENNNSTPDSCDPGNHSDITEERDEIRAQAPNLSGNAFLANEAKSQVAVNCVT-RDSSQAQTNGLDPSSCADVEDLQDQNTNSISTSRSLEEFTFPMA
         FREN +S  D CD GNHSD+T+E +  +AQ+P L G+  + +   ++   N V  R+S +  ++G   +S        D+  NS   SRS+E+  +   
Subjt:  KFRENNNSTPDSCDPGNHSDITEERDEIRAQAPNLSGNAFLANEAKSQVAVNCVT-RDSSQAQTNGLDPSSCADVEDLQDQNTNSISTSRSLEEFTFPMA

Query:  NV-KQCQESQESSEQEPSCTSQLN-------HGLPHRPLSSHGGIDFYDQETPCSKND--LYALVPHEPPALNGVLEALKQAKLSLAKKIKKL-------
        +  K   ES  S   +P  +  +N          P    +S GG  F    T   + D  L ++   +P     VL ALKQAKLSL +K+  L       
Subjt:  NV-KQCQESQESSEQEPSCTSQLN-------HGLPHRPLSSHGGIDFYDQETPCSKND--LYALVPHEPPALNGVLEALKQAKLSLAKKIKKL-------

Query:  ------PSVEGESIGTLSVP--------------KVGGRLDIPIGC-AGLFRLPTDFAAEASTQANFLVSSSQLRSSTHYP
              PS  G  + T ++P               VG  ++ P+ C AGLFR+PTDF ++AS +  FL SSSQ    TH P
Subjt:  ------PSVEGESIGTLSVP--------------KVGGRLDIPIGC-AGLFRLPTDFAAEASTQANFLVSSSQLRSSTHYP


Sequences
CDS sequence
ATGGTCGTGGAAAAGGAATTTCTCTTTCAGTCCCACCTCCCTCGCTCTCTCTCCTCCCCTCAGAGTGAAAGTAAAAAAACTGAGAAGTTCGAGGTTGAAGTTGTTTTTAT
CAATAAGATAATGAAGAATCCTGATCAGGATCAGCAAGATCCGAGAAATGTACCTGGTGGGGAGGACACAACTGCAATGACTATTGAGTTTCTTCGGGCTCGACTTCTAT
CGGAAAGATCTGTTTCAAGAAGTGCAAGACAAAGAGCTGATGAACTAGCGAAAAGGGTTGCAGAATTGGAGGAGCAGCTTAAGATCGTGTCTCTTCAAAGAAAGATGGCT
GAAAAGGCAACAGCAGATGTACTTGCCATTCTAGAAGATAATGGCGCTAGTGATATTTCTGAGACACTTGATTCAAACTCTGACCACGAAACACCATGTGAATCAAAAGT
TGGGAATGACCCTGCAAGAGAAGATGTGAACTCCTCCAAATCAATACATATGAGAAATGAACATGAAGAATTTTCAGGTTCTGATATTGATACTTCTCCAGTGCTAGGTG
GAAGCCTATCTTGGAAAGGACGCAATGATTCTCCACATAATCGTGAGAAGTACAAAAAATTTTCTACAAGAAGTCGAAGCAGTTTTACATCTATTGGTTCTTCTTCACCA
AAACATCGTCTTGGAAGATCATGCCGCCAGATAAAACGTAGAGATACAAGACAACTGGATGGAGAGCAAGAGCTCAAATCTGAGGCACTCGTGGGTAGTTTTCAAGAGAT
TGCACCATCTGCATGTTCAGAAGACTCTCGAAATTGCTGTGTAAAAGGGCCTAAGATATTAAGAGATGGTTATGAACCTCATGAAAAGACACGCTCAGGTCCTTCACAAG
ATCATAATAGTGTAGAAAATAAAGATCAAGATCATGATTTGGATGAGTGCGAAAAAGGAAATGATATGGAAAAGGCGTTGGAATGTCAAGCACAACTCATTGATCAATAT
GAAGCAATGGAAAAGGCTCAAAGAGAATGGGAAGAGAAGTTCAGAGAAAATAACAACAGTACTCCCGATTCTTGTGACCCTGGAAACCATTCAGATATCACTGAGGAAAG
AGATGAGATAAGGGCACAAGCTCCAAATCTGTCTGGTAATGCTTTCCTTGCAAATGAGGCAAAATCACAGGTTGCAGTCAATTGTGTCACTAGAGATTCGTCCCAAGCTC
AAACCAATGGGCTTGACCCATCTTCATGTGCTGATGTGGAAGACTTGCAGGATCAGAATACAAATAGCATTTCTACTTCACGATCACTTGAAGAATTTACCTTTCCTATG
GCTAATGTGAAGCAATGCCAAGAAAGCCAAGAAAGTAGCGAACAAGAACCTTCTTGTACCTCCCAACTCAATCATGGGCTCCCTCACAGGCCATTGTCATCTCATGGTGG
TATCGATTTCTATGACCAAGAAACTCCGTGCAGTAAGAATGATCTATATGCATTGGTGCCACATGAACCGCCTGCATTAAATGGTGTACTCGAGGCACTTAAACAAGCAA
AGCTATCGCTAGCAAAGAAAATCAAGAAATTACCCTCCGTAGAGGGTGAATCAATTGGAACTCTTTCTGTTCCAAAAGTTGGGGGCAGGTTAGATATCCCTATTGGATGT
GCTGGGCTCTTCAGACTTCCAACCGACTTTGCTGCCGAAGCTTCTACTCAAGCGAACTTCCTAGTTTCAAGTTCTCAGTTAAGATCGTCAACTCATTATCCTGTTTTTAC
CCGAGATGGATTTCTGACTGACCATTTTCCTGAGAATGGATGGAAAAATCCAGGCCAGAAGCATCATTTTGATCGATACTTCGATGCAATTCAACCCTCTCCCCATGTAC
ACAACTATCCATCACCTTCGGTTCCAGGAGAGCTGTTTTTCTTCTGCAACAATAGACTCGTATACCATGGGAAGGCCATTTCTACATTCTCTTGGTTTCTGCAAACTTGC
TGGAAGGAAATTGGCGTTGAAAGAGATTTGACCTTGTATCATACTGATTACTTCACCTCAAAAGTTCAAAGCTTGTGTTTGCTTAAATGTCATCAGAAGTTCAAGAATTC
CTAA
mRNA sequence
ATGGTCGTGGAAAAGGAATTTCTCTTTCAGTCCCACCTCCCTCGCTCTCTCTCCTCCCCTCAGAGTGAAAGTAAAAAAACTGAGAAGTTCGAGGTTGAAGTTGTTTTTAT
CAATAAGATAATGAAGAATCCTGATCAGGATCAGCAAGATCCGAGAAATGTACCTGGTGGGGAGGACACAACTGCAATGACTATTGAGTTTCTTCGGGCTCGACTTCTAT
CGGAAAGATCTGTTTCAAGAAGTGCAAGACAAAGAGCTGATGAACTAGCGAAAAGGGTTGCAGAATTGGAGGAGCAGCTTAAGATCGTGTCTCTTCAAAGAAAGATGGCT
GAAAAGGCAACAGCAGATGTACTTGCCATTCTAGAAGATAATGGCGCTAGTGATATTTCTGAGACACTTGATTCAAACTCTGACCACGAAACACCATGTGAATCAAAAGT
TGGGAATGACCCTGCAAGAGAAGATGTGAACTCCTCCAAATCAATACATATGAGAAATGAACATGAAGAATTTTCAGGTTCTGATATTGATACTTCTCCAGTGCTAGGTG
GAAGCCTATCTTGGAAAGGACGCAATGATTCTCCACATAATCGTGAGAAGTACAAAAAATTTTCTACAAGAAGTCGAAGCAGTTTTACATCTATTGGTTCTTCTTCACCA
AAACATCGTCTTGGAAGATCATGCCGCCAGATAAAACGTAGAGATACAAGACAACTGGATGGAGAGCAAGAGCTCAAATCTGAGGCACTCGTGGGTAGTTTTCAAGAGAT
TGCACCATCTGCATGTTCAGAAGACTCTCGAAATTGCTGTGTAAAAGGGCCTAAGATATTAAGAGATGGTTATGAACCTCATGAAAAGACACGCTCAGGTCCTTCACAAG
ATCATAATAGTGTAGAAAATAAAGATCAAGATCATGATTTGGATGAGTGCGAAAAAGGAAATGATATGGAAAAGGCGTTGGAATGTCAAGCACAACTCATTGATCAATAT
GAAGCAATGGAAAAGGCTCAAAGAGAATGGGAAGAGAAGTTCAGAGAAAATAACAACAGTACTCCCGATTCTTGTGACCCTGGAAACCATTCAGATATCACTGAGGAAAG
AGATGAGATAAGGGCACAAGCTCCAAATCTGTCTGGTAATGCTTTCCTTGCAAATGAGGCAAAATCACAGGTTGCAGTCAATTGTGTCACTAGAGATTCGTCCCAAGCTC
AAACCAATGGGCTTGACCCATCTTCATGTGCTGATGTGGAAGACTTGCAGGATCAGAATACAAATAGCATTTCTACTTCACGATCACTTGAAGAATTTACCTTTCCTATG
GCTAATGTGAAGCAATGCCAAGAAAGCCAAGAAAGTAGCGAACAAGAACCTTCTTGTACCTCCCAACTCAATCATGGGCTCCCTCACAGGCCATTGTCATCTCATGGTGG
TATCGATTTCTATGACCAAGAAACTCCGTGCAGTAAGAATGATCTATATGCATTGGTGCCACATGAACCGCCTGCATTAAATGGTGTACTCGAGGCACTTAAACAAGCAA
AGCTATCGCTAGCAAAGAAAATCAAGAAATTACCCTCCGTAGAGGGTGAATCAATTGGAACTCTTTCTGTTCCAAAAGTTGGGGGCAGGTTAGATATCCCTATTGGATGT
GCTGGGCTCTTCAGACTTCCAACCGACTTTGCTGCCGAAGCTTCTACTCAAGCGAACTTCCTAGTTTCAAGTTCTCAGTTAAGATCGTCAACTCATTATCCTGTTTTTAC
CCGAGATGGATTTCTGACTGACCATTTTCCTGAGAATGGATGGAAAAATCCAGGCCAGAAGCATCATTTTGATCGATACTTCGATGCAATTCAACCCTCTCCCCATGTAC
ACAACTATCCATCACCTTCGGTTCCAGGAGAGCTGTTTTTCTTCTGCAACAATAGACTCGTATACCATGGGAAGGCCATTTCTACATTCTCTTGGTTTCTGCAAACTTGC
TGGAAGGAAATTGGCGTTGAAAGAGATTTGACCTTGTATCATACTGATTACTTCACCTCAAAAGTTCAAAGCTTGTGTTTGCTTAAATGTCATCAGAAGTTCAAGAATTC
CTAA
Protein sequence
MVVEKEFLFQSHLPRSLSSPQSESKKTEKFEVEVVFINKIMKNPDQDQQDPRNVPGGEDTTAMTIEFLRARLLSERSVSRSARQRADELAKRVAELEEQLKIVSLQRKMA
EKATADVLAILEDNGASDISETLDSNSDHETPCESKVGNDPAREDVNSSKSIHMRNEHEEFSGSDIDTSPVLGGSLSWKGRNDSPHNREKYKKFSTRSRSSFTSIGSSSP
KHRLGRSCRQIKRRDTRQLDGEQELKSEALVGSFQEIAPSACSEDSRNCCVKGPKILRDGYEPHEKTRSGPSQDHNSVENKDQDHDLDECEKGNDMEKALECQAQLIDQY
EAMEKAQREWEEKFRENNNSTPDSCDPGNHSDITEERDEIRAQAPNLSGNAFLANEAKSQVAVNCVTRDSSQAQTNGLDPSSCADVEDLQDQNTNSISTSRSLEEFTFPM
ANVKQCQESQESSEQEPSCTSQLNHGLPHRPLSSHGGIDFYDQETPCSKNDLYALVPHEPPALNGVLEALKQAKLSLAKKIKKLPSVEGESIGTLSVPKVGGRLDIPIGC
AGLFRLPTDFAAEASTQANFLVSSSQLRSSTHYPVFTRDGFLTDHFPENGWKNPGQKHHFDRYFDAIQPSPHVHNYPSPSVPGELFFFCNNRLVYHGKAISTFSWFLQTC
WKEIGVERDLTLYHTDYFTSKVQSLCLLKCHQKFKNS