; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc08g26590 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc08g26590
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionProtein of unknown function (DUF616)
Genome locationchr8:19196939..19215866
RNA-Seq ExpressionMoc08g26590
SyntenyMoc08g26590
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR006852 - Protein of unknown function DUF616


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022156300.1 uncharacterized protein LOC111023226 [Momordica charantia]1.5e-22885.99Show/hide
Query:  MMGLFGHNNEQLPERR--------------------------------------------------VFGLKVNLHDKIGGSISVSEVQGETLVSRDQKPS
        MMGLFGHNNEQLPERR                                                  VFGLKVNLHDKIGGSISVSEVQGETLVSRDQKPS
Subjt:  MMGLFGHNNEQLPERR--------------------------------------------------VFGLKVNLHDKIGGSISVSEVQGETLVSRDQKPS

Query:  TSKHRRKQHFPCEVVFTESVDYLLEPEDFMNVTQFSLEYIEREEKPSETDLYKPRFGGHQTLKERERSFYATNQKLHCGFIKGPPESPSTGFDLDEKDNA
        TSKHRRKQHFPCEVVFTESVDYLLEPEDFMNVTQFSLEYIEREEKPSETDLYKPRFGGHQTLKERERSFYATNQKLHCGFIKGPPESPSTGFDLDEKDNA
Subjt:  TSKHRRKQHFPCEVVFTESVDYLLEPEDFMNVTQFSLEYIEREEKPSETDLYKPRFGGHQTLKERERSFYATNQKLHCGFIKGPPESPSTGFDLDEKDNA

Query:  YMKMCKVAVSSCIFGSSDFLRRPTSKQISEYSKKNVCFVMFVDEQTLSKLSAEGSIPDDKGYIGLWKIVVVRNLPYEDMRRTGKVPKFLSHRLFP-----
        YMKMCKVAVSSCIFGSSDFLRRPTSKQISEYSKKNVCFVMFVDEQTLSKLSAEGSIPDDKGYIGLWKIVVVRNLPYEDMRRTGKVPKFLSHRLFP     
Subjt:  YMKMCKVAVSSCIFGSSDFLRRPTSKQISEYSKKNVCFVMFVDEQTLSKLSAEGSIPDDKGYIGLWKIVVVRNLPYEDMRRTGKVPKFLSHRLFP-----

Query:  -------LFKVDPMLIIEHFLWQKKSEYAISNHYDRHCVWEEVLQNKRLNKYNHTAIDEQFAFYQSDGLVKFDPFDVRTPLPSYVPEGSFIVRAHTPMSN
                 +VDPMLIIEHFLWQKKSEYAISNHYDRHCVWEEVLQNKRLNKYNHTAIDEQFAFYQSDGLVKFDPFDVRTPLPSYVPEGSFIVRAHTPMSN
Subjt:  -------LFKVDPMLIIEHFLWQKKSEYAISNHYDRHCVWEEVLQNKRLNKYNHTAIDEQFAFYQSDGLVKFDPFDVRTPLPSYVPEGSFIVRAHTPMSN

Query:  LFSCLWFNEVDRFTSRDQLSFAYTYLKFRRINQDIPFNLNMFKDCERRSLAKLFRHRSWPPPNT
        LFSCLWFNEVDRFTSRDQLSFAYTYLKFRRINQDIPFNLNMFKDCERRSLAKLFRHRSWPPPNT
Subjt:  LFSCLWFNEVDRFTSRDQLSFAYTYLKFRRINQDIPFNLNMFKDCERRSLAKLFRHRSWPPPNT

XP_022951707.1 uncharacterized protein LOC111454452 [Cucurbita moschata]6.7e-20574.79Show/hide
Query:  PPTLLETRCEPRSGGNVMMGLFGHNNEQLPERR--------------------------------------------------VFGLKVNLHDKIGGSIS
        P TL +TR + R GGNVMM LFG+NNEQLPERR                                                  VFGLKVN H KIG SIS
Subjt:  PPTLLETRCEPRSGGNVMMGLFGHNNEQLPERR--------------------------------------------------VFGLKVNLHDKIGGSIS

Query:  VSEVQGETLVSRDQKPSTSKHRRKQHFPCEVVFTESVDYLLEPEDFMNVTQFSLEYIEREEKPSETDLYKPRFGGHQTLKERERSFYATNQKLHCGFIKG
        V+EVQGE L SRDQKP T+KH RKQH PC+V F ESV YL+EPE FMNV QFSLE++E EE+ SETDLYKPRFGGHQTL+ERE+SFYATNQKLHCGF+KG
Subjt:  VSEVQGETLVSRDQKPSTSKHRRKQHFPCEVVFTESVDYLLEPEDFMNVTQFSLEYIEREEKPSETDLYKPRFGGHQTLKERERSFYATNQKLHCGFIKG

Query:  PPESPSTGFDLDEKDNAYMKMCKVAVSSCIFGSSDFLRRPTSKQISEYSKKNVCFVMFVDEQTLSKLSAEGSIPDDKGYIGLWKIVVVRNLPYEDMRRTG
        PP  PSTGFDLDEKDNAYMK+CKVAVSSCIFGSSDFLRRPTSKQISEYSKKNVCFVMFVDEQTLSKLSAEG+  DDKGYIG+WKIVVVRNLPY+DMRRTG
Subjt:  PPESPSTGFDLDEKDNAYMKMCKVAVSSCIFGSSDFLRRPTSKQISEYSKKNVCFVMFVDEQTLSKLSAEGSIPDDKGYIGLWKIVVVRNLPYEDMRRTG

Query:  KVPKFLSHRLFP------------LFKVDPMLIIEHFLWQKKSEYAISNHYDRHCVWEEVLQNKRLNKYNHTAIDEQFAFYQSDGLVKFDPFDVRTPLPS
        KVPKFLSHRLFP              ++DPMLIIE+FLW+KKSEYAISNHYDRHCVWEEV QNKRLNKYNHTAIDEQFAFYQSDGLVKFDP +  TPLPS
Subjt:  KVPKFLSHRLFP------------LFKVDPMLIIEHFLWQKKSEYAISNHYDRHCVWEEVLQNKRLNKYNHTAIDEQFAFYQSDGLVKFDPFDVRTPLPS

Query:  YVPEGSFIVRAHTPMSNLFSCLWFNEVDRFTSRDQLSFAYTYLKFRRINQDIPFNLNMFKDCERRSLAKLFRHRSWPPPN
        YVPEGSFIVRAHTPMSNLFSCLWFNEVDRFTSRDQLSFAYTYLK  R NQD+PFNLNMFKDCERRSLAKLFRHR +PPPN
Subjt:  YVPEGSFIVRAHTPMSNLFSCLWFNEVDRFTSRDQLSFAYTYLKFRRINQDIPFNLNMFKDCERRSLAKLFRHRSWPPPN

XP_022973308.1 uncharacterized protein LOC111471866 [Cucurbita maxima]1.7e-20075.59Show/hide
Query:  MMGLFGHNNEQLPERR--------------------------------------------------VFGLKVNLHDKIGGSISVSEVQGETLVSRDQKPS
        MM LFG+NNEQLPERR                                                  VFGLKVN H KIG SISV+EVQGE L SRDQKP 
Subjt:  MMGLFGHNNEQLPERR--------------------------------------------------VFGLKVNLHDKIGGSISVSEVQGETLVSRDQKPS

Query:  TSKHRRKQHFPCEVVFTESVDYLLEPEDFMNVTQFSLEYIEREEKPSETDLYKPRFGGHQTLKERERSFYATNQKLHCGFIKGPPESPSTGFDLDEKDNA
        T+KH RKQH PC+V F ESV YL+EPE FMNV QFSLE++E EE+ SETDLYKPRFGGHQTL+ERE+SFYATNQKLHCGF+KGPP  PSTGFDLDEKDNA
Subjt:  TSKHRRKQHFPCEVVFTESVDYLLEPEDFMNVTQFSLEYIEREEKPSETDLYKPRFGGHQTLKERERSFYATNQKLHCGFIKGPPESPSTGFDLDEKDNA

Query:  YMKMCKVAVSSCIFGSSDFLRRPTSKQISEYSKKNVCFVMFVDEQTLSKLSAEGSIPDDKGYIGLWKIVVVRNLPYEDMRRTGKVPKFLSHRLFP-----
        YMK+CKVAVSSCIFGSSDFLRRPTSKQISEYSKKNVCFVMFVDEQTLSKLSAEG+ PDDKGYIG+WKIVVVRNLPY+DMRRTGKVPKFLSHRLFP     
Subjt:  YMKMCKVAVSSCIFGSSDFLRRPTSKQISEYSKKNVCFVMFVDEQTLSKLSAEGSIPDDKGYIGLWKIVVVRNLPYEDMRRTGKVPKFLSHRLFP-----

Query:  -------LFKVDPMLIIEHFLWQKKSEYAISNHYDRHCVWEEVLQNKRLNKYNHTAIDEQFAFYQSDGLVKFDPFDVRTPLPSYVPEGSFIVRAHTPMSN
                 ++DPMLIIE+FLW+KKSEYAISNHYDRHCVWEEV QNKRLNKYNHTAIDEQFAFYQSDGLVKFDP +  TPLPSYVPEGSFIVRAHTPMSN
Subjt:  -------LFKVDPMLIIEHFLWQKKSEYAISNHYDRHCVWEEVLQNKRLNKYNHTAIDEQFAFYQSDGLVKFDPFDVRTPLPSYVPEGSFIVRAHTPMSN

Query:  LFSCLWFNEVDRFTSRDQLSFAYTYLKFRRINQDIPFNLNMFKDCERRSLAKLFRHRSWPPPN
        LFSCLWFNEVDRFTSRDQLSFAYTYLK  R NQD+PFNLNMFKDCERRSLAKLFRHR +PPPN
Subjt:  LFSCLWFNEVDRFTSRDQLSFAYTYLKFRRINQDIPFNLNMFKDCERRSLAKLFRHRSWPPPN

XP_023536952.1 uncharacterized protein LOC111798179 [Cucurbita pepo subsp. pepo]4.8e-20373.96Show/hide
Query:  PPTLLETRCEPRSGGNVMMGLFGHNNEQLPERR--------------------------------------------------VFGLKVNLHDKIGGSIS
        P TL +TR + R GGN MM LFG+NNEQLPERR                                                  VFGLKVN H KIG SIS
Subjt:  PPTLLETRCEPRSGGNVMMGLFGHNNEQLPERR--------------------------------------------------VFGLKVNLHDKIGGSIS

Query:  VSEVQGETLVSRDQKPSTSKHRRKQHFPCEVVFTESVDYLLEPEDFMNVTQFSLEYIEREEKPSETDLYKPRFGGHQTLKERERSFYATNQKLHCGFIKG
        V+EVQGE L SRDQKP T+KH RKQH PC+V F ESV YL+EPE FMNV QFSLE++E EE+  ETDLYKPRFGGHQTL+ERE+SFYATNQKLHCGF+KG
Subjt:  VSEVQGETLVSRDQKPSTSKHRRKQHFPCEVVFTESVDYLLEPEDFMNVTQFSLEYIEREEKPSETDLYKPRFGGHQTLKERERSFYATNQKLHCGFIKG

Query:  PPESPSTGFDLDEKDNAYMKMCKVAVSSCIFGSSDFLRRPTSKQISEYSKKNVCFVMFVDEQTLSKLSAEGSIPDDKGYIGLWKIVVVRNLPYEDMRRTG
        PP  PSTGFDLDEKDNAYMK+CKVAVSSCIFGSSDFLRRPTSKQIS+YSKKNVCFVMFVDEQTLSKLSAEG+  DDKGYIG+WKIVVVRNLPY+DMRRTG
Subjt:  PPESPSTGFDLDEKDNAYMKMCKVAVSSCIFGSSDFLRRPTSKQISEYSKKNVCFVMFVDEQTLSKLSAEGSIPDDKGYIGLWKIVVVRNLPYEDMRRTG

Query:  KVPKFLSHRLFP------------LFKVDPMLIIEHFLWQKKSEYAISNHYDRHCVWEEVLQNKRLNKYNHTAIDEQFAFYQSDGLVKFDPFDVRTPLPS
        KVPKFLSHRLFP              ++DPMLIIE+FLW+KKSEYAISNHYDRHCVWEEV QNKRLNKYNHTAIDEQFAFYQSDGLVKFDP +  TPLPS
Subjt:  KVPKFLSHRLFP------------LFKVDPMLIIEHFLWQKKSEYAISNHYDRHCVWEEVLQNKRLNKYNHTAIDEQFAFYQSDGLVKFDPFDVRTPLPS

Query:  YVPEGSFIVRAHTPMSNLFSCLWFNEVDRFTSRDQLSFAYTYLKFRRINQDIPFNLNMFKDCERRSLAKLFRHRSWPPPN
        YVPEGSFIVRAHTPMSNLFSCLWFNEVDRFTSRDQLSFAYTYLK  R NQD+PFNLNMFKDC+RRSLAKLFRHR +PPPN
Subjt:  YVPEGSFIVRAHTPMSNLFSCLWFNEVDRFTSRDQLSFAYTYLKFRRINQDIPFNLNMFKDCERRSLAKLFRHRSWPPPN

XP_038899595.1 uncharacterized protein LOC120086852 isoform X1 [Benincasa hispida]4.8e-20376.22Show/hide
Query:  EPRSGGNVMMGLFGHNNEQLPERR--------------------------------------------------VFGLKVNLHDKIGGSISVSEVQGETL
        EPR GG VMMGLFG+NNEQLPERR                                                  VFGLK+N +DK G S+SVSEVQGE L
Subjt:  EPRSGGNVMMGLFGHNNEQLPERR--------------------------------------------------VFGLKVNLHDKIGGSISVSEVQGETL

Query:  VSRDQKPSTSKHRRKQHFPCEVVFTESVDYLLEPEDFMNVTQFSLEYIEREEKPSETDLYKPRFGGHQTLKERERSFYATNQKLHCGFIKGPPESPSTGF
        VSRDQ+P TSKHRRKQHFPC+V F ESV YL+EPE FMNVT+FSLE+IE EEKPSET LY PRFGGHQTL+ERE+SFYATNQKLHCGFIKGPP  PSTGF
Subjt:  VSRDQKPSTSKHRRKQHFPCEVVFTESVDYLLEPEDFMNVTQFSLEYIEREEKPSETDLYKPRFGGHQTLKERERSFYATNQKLHCGFIKGPPESPSTGF

Query:  DLDEKDNAYMKMCKVAVSSCIFGSSDFLRRPTSKQISEYSKKNVCFVMFVDEQTLSKLSAEGSIPDDKGYIGLWKIVVVRNLPYEDMRRTGKVPKFLSHR
        DLDEKD+AYMK CKVAVSSCIFGSSDFLRRPTSKQISEYSKKNVCFVMFVDEQTLSKL+AEG+IPDDKG IGLWKIVVVRNLPYEDMRRTGKVPKFLSHR
Subjt:  DLDEKDNAYMKMCKVAVSSCIFGSSDFLRRPTSKQISEYSKKNVCFVMFVDEQTLSKLSAEGSIPDDKGYIGLWKIVVVRNLPYEDMRRTGKVPKFLSHR

Query:  LFP------------LFKVDPMLIIEHFLWQKKSEYAISNHYDRHCVWEEVLQNKRLNKYNHTAIDEQFAFYQSDGLVKFDPFDVRTPLPSYVPEGSFIV
        LFP              ++DPMLIIE+FLW+KKSEYAISNHYDRHCVWEEV QNKRLNKYNHTAIDEQFAFYQSDGLVKFDP DV++ LPS+VPEGSFIV
Subjt:  LFP------------LFKVDPMLIIEHFLWQKKSEYAISNHYDRHCVWEEVLQNKRLNKYNHTAIDEQFAFYQSDGLVKFDPFDVRTPLPSYVPEGSFIV

Query:  RAHTPMSNLFSCLWFNEVDRFTSRDQLSFAYTYLKFRRINQDIPFNLNMFKDCERRSLAKLFRHRSWPPPN
        RAHTPMSNLFSCLWFNEVDRFTSRDQLSFAYTYLK RR NQD PFNLNMFKDCERRSLAKLFRHR   PPN
Subjt:  RAHTPMSNLFSCLWFNEVDRFTSRDQLSFAYTYLKFRRINQDIPFNLNMFKDCERRSLAKLFRHRSWPPPN

TrEMBL top hitse value%identityAlignment
A0A0A0K521 Uncharacterized protein1.9e-19775.26Show/hide
Query:  RCEPRSGGNVMMGLFGHNNEQLPERR--------------------------------------------------VFGLKVNLHDKIGGSISVSEVQGE
        R E R GGNVMMGLFG+NNEQLPERR                                                  VFGLKVN +D IG S SVSEV+ E
Subjt:  RCEPRSGGNVMMGLFGHNNEQLPERR--------------------------------------------------VFGLKVNLHDKIGGSISVSEVQGE

Query:  TLVSRDQKPSTSKHRRKQHFPCEVVFTESVDYLLEPEDFMNVTQFSLEYIEREEKPSETDLYKPRFGGHQTLKERERSFYATNQKLHCGFIKGPPESPST
         LVSRDQ+P TSKHRRKQHFPC+V F ESV YL+EPE FMNVTQFSLE+IEREEK  E DL+ PRFGGHQTL+ERE SFYATNQKLHCGFIKGPP  PST
Subjt:  TLVSRDQKPSTSKHRRKQHFPCEVVFTESVDYLLEPEDFMNVTQFSLEYIEREEKPSETDLYKPRFGGHQTLKERERSFYATNQKLHCGFIKGPPESPST

Query:  GFDLDEKDNAYMKMCKVAVSSCIFGSSDFLRRPTSKQISEYSKKNVCFVMFVDEQTLSKLSAEGSIPDDKGYIGLWKIVVVRNLPYEDMRRTGKVPKFLS
        GFDLDEKD+AYMK CKVAVSSCIFGSSDFLRRPTSKQISEYSKKNVCFVMFVD+QTLSKLSAEG+IPDDKG IGLWKIVVV NLPYEDMRRTGKVPKFLS
Subjt:  GFDLDEKDNAYMKMCKVAVSSCIFGSSDFLRRPTSKQISEYSKKNVCFVMFVDEQTLSKLSAEGSIPDDKGYIGLWKIVVVRNLPYEDMRRTGKVPKFLS

Query:  HRLFP------------LFKVDPMLIIEHFLWQKKSEYAISNHYDRHCVWEEVLQNKRLNKYNHTAIDEQFAFYQSDGLVKFDPFDVRTPLPSYVPEGSF
        HRLFP              +VDPMLIIE+FLW+KKSEYAISNHYDRHCVWEEV QNKRLNKYNHTAIDEQFAFYQSDGLVKFDP D+ + LPSYVPEGSF
Subjt:  HRLFP------------LFKVDPMLIIEHFLWQKKSEYAISNHYDRHCVWEEVLQNKRLNKYNHTAIDEQFAFYQSDGLVKFDPFDVRTPLPSYVPEGSF

Query:  IVRAHTPMSNLFSCLWFNEVDRFTSRDQLSFAYTYLKFRRINQDIPFNLNMFKDCERRSLAKLFRHRSWPPPN
        IVRAHTPMSNLFSCLWFNEV+RFTSRDQLSFAYTYLK RR NQ IPFNLNMFKDCERRSLAKLFRHR   P N
Subjt:  IVRAHTPMSNLFSCLWFNEVDRFTSRDQLSFAYTYLKFRRINQDIPFNLNMFKDCERRSLAKLFRHRSWPPPN

A0A1S4E493 uncharacterized protein LOC103500457 isoform X11.4e-20076.16Show/hide
Query:  TRCEPRSGGNVMMGLFGHNNEQLPERR--------------------------------------------------VFGLKVNLHDKIGGSISVSEVQG
        TR EPR GGNVMMGLFG+NNEQLPERR                                                  VFGLKVN +D IG S SVSEVQ 
Subjt:  TRCEPRSGGNVMMGLFGHNNEQLPERR--------------------------------------------------VFGLKVNLHDKIGGSISVSEVQG

Query:  ETLVSRDQKPSTSKHRRKQHFPCEVVFTESVDYLLEPEDFMNVTQFSLEYIEREEKPSETDLYKPRFGGHQTLKERERSFYATNQKLHCGFIKGPPESPS
        E LVSRDQ+P TSKHRRKQHFPC+V F ESV YL+EP  FMNVTQFSLE+IE EEK SETDLY PRFGGHQTL+ERE SFYATNQKLHCGFIKGPP  PS
Subjt:  ETLVSRDQKPSTSKHRRKQHFPCEVVFTESVDYLLEPEDFMNVTQFSLEYIEREEKPSETDLYKPRFGGHQTLKERERSFYATNQKLHCGFIKGPPESPS

Query:  TGFDLDEKDNAYMKMCKVAVSSCIFGSSDFLRRPTSKQISEYSKKNVCFVMFVDEQTLSKLSAEGSIPDDKGYIGLWKIVVVRNLPYEDMRRTGKVPKFL
        TGFDLDEKD AYMK CKVAVSSCIFGSSDFLRRPTSKQISEYSKKNVCFVMFVD+QTLSKLS+EG+IPDDKGYIGLWKIVVVRNLPYEDMRRTGKVPKFL
Subjt:  TGFDLDEKDNAYMKMCKVAVSSCIFGSSDFLRRPTSKQISEYSKKNVCFVMFVDEQTLSKLSAEGSIPDDKGYIGLWKIVVVRNLPYEDMRRTGKVPKFL

Query:  SHRLFP------------LFKVDPMLIIEHFLWQKKSEYAISNHYDRHCVWEEVLQNKRLNKYNHTAIDEQFAFYQSDGLVKFDPFDVRTPLPSYVPEGS
        SHRLFP              +VDPMLIIE+FLW+KKSEYAISNHYDRHCVWEEV QNKRLNKYNHTAIDEQFAFYQSDGLVKFDP D+ + LPSYVPEGS
Subjt:  SHRLFP------------LFKVDPMLIIEHFLWQKKSEYAISNHYDRHCVWEEVLQNKRLNKYNHTAIDEQFAFYQSDGLVKFDPFDVRTPLPSYVPEGS

Query:  FIVRAHTPMSNLFSCLWFNEVDRFTSRDQLSFAYTYLKFRRINQDIPFNLNMFKDCERRSLAKLFRHRSWPPPN
        FIVRAHTPMSNLFSCLWFNEVDRFTSRDQLSFAYTYLK RR NQ  PFNLNMFKDCERRSLAKLFRHR   P N
Subjt:  FIVRAHTPMSNLFSCLWFNEVDRFTSRDQLSFAYTYLKFRRINQDIPFNLNMFKDCERRSLAKLFRHRSWPPPN

A0A6J1DPW9 uncharacterized protein LOC1110232267.2e-22985.99Show/hide
Query:  MMGLFGHNNEQLPERR--------------------------------------------------VFGLKVNLHDKIGGSISVSEVQGETLVSRDQKPS
        MMGLFGHNNEQLPERR                                                  VFGLKVNLHDKIGGSISVSEVQGETLVSRDQKPS
Subjt:  MMGLFGHNNEQLPERR--------------------------------------------------VFGLKVNLHDKIGGSISVSEVQGETLVSRDQKPS

Query:  TSKHRRKQHFPCEVVFTESVDYLLEPEDFMNVTQFSLEYIEREEKPSETDLYKPRFGGHQTLKERERSFYATNQKLHCGFIKGPPESPSTGFDLDEKDNA
        TSKHRRKQHFPCEVVFTESVDYLLEPEDFMNVTQFSLEYIEREEKPSETDLYKPRFGGHQTLKERERSFYATNQKLHCGFIKGPPESPSTGFDLDEKDNA
Subjt:  TSKHRRKQHFPCEVVFTESVDYLLEPEDFMNVTQFSLEYIEREEKPSETDLYKPRFGGHQTLKERERSFYATNQKLHCGFIKGPPESPSTGFDLDEKDNA

Query:  YMKMCKVAVSSCIFGSSDFLRRPTSKQISEYSKKNVCFVMFVDEQTLSKLSAEGSIPDDKGYIGLWKIVVVRNLPYEDMRRTGKVPKFLSHRLFP-----
        YMKMCKVAVSSCIFGSSDFLRRPTSKQISEYSKKNVCFVMFVDEQTLSKLSAEGSIPDDKGYIGLWKIVVVRNLPYEDMRRTGKVPKFLSHRLFP     
Subjt:  YMKMCKVAVSSCIFGSSDFLRRPTSKQISEYSKKNVCFVMFVDEQTLSKLSAEGSIPDDKGYIGLWKIVVVRNLPYEDMRRTGKVPKFLSHRLFP-----

Query:  -------LFKVDPMLIIEHFLWQKKSEYAISNHYDRHCVWEEVLQNKRLNKYNHTAIDEQFAFYQSDGLVKFDPFDVRTPLPSYVPEGSFIVRAHTPMSN
                 +VDPMLIIEHFLWQKKSEYAISNHYDRHCVWEEVLQNKRLNKYNHTAIDEQFAFYQSDGLVKFDPFDVRTPLPSYVPEGSFIVRAHTPMSN
Subjt:  -------LFKVDPMLIIEHFLWQKKSEYAISNHYDRHCVWEEVLQNKRLNKYNHTAIDEQFAFYQSDGLVKFDPFDVRTPLPSYVPEGSFIVRAHTPMSN

Query:  LFSCLWFNEVDRFTSRDQLSFAYTYLKFRRINQDIPFNLNMFKDCERRSLAKLFRHRSWPPPNT
        LFSCLWFNEVDRFTSRDQLSFAYTYLKFRRINQDIPFNLNMFKDCERRSLAKLFRHRSWPPPNT
Subjt:  LFSCLWFNEVDRFTSRDQLSFAYTYLKFRRINQDIPFNLNMFKDCERRSLAKLFRHRSWPPPNT

A0A6J1GIF1 uncharacterized protein LOC1114544523.3e-20574.79Show/hide
Query:  PPTLLETRCEPRSGGNVMMGLFGHNNEQLPERR--------------------------------------------------VFGLKVNLHDKIGGSIS
        P TL +TR + R GGNVMM LFG+NNEQLPERR                                                  VFGLKVN H KIG SIS
Subjt:  PPTLLETRCEPRSGGNVMMGLFGHNNEQLPERR--------------------------------------------------VFGLKVNLHDKIGGSIS

Query:  VSEVQGETLVSRDQKPSTSKHRRKQHFPCEVVFTESVDYLLEPEDFMNVTQFSLEYIEREEKPSETDLYKPRFGGHQTLKERERSFYATNQKLHCGFIKG
        V+EVQGE L SRDQKP T+KH RKQH PC+V F ESV YL+EPE FMNV QFSLE++E EE+ SETDLYKPRFGGHQTL+ERE+SFYATNQKLHCGF+KG
Subjt:  VSEVQGETLVSRDQKPSTSKHRRKQHFPCEVVFTESVDYLLEPEDFMNVTQFSLEYIEREEKPSETDLYKPRFGGHQTLKERERSFYATNQKLHCGFIKG

Query:  PPESPSTGFDLDEKDNAYMKMCKVAVSSCIFGSSDFLRRPTSKQISEYSKKNVCFVMFVDEQTLSKLSAEGSIPDDKGYIGLWKIVVVRNLPYEDMRRTG
        PP  PSTGFDLDEKDNAYMK+CKVAVSSCIFGSSDFLRRPTSKQISEYSKKNVCFVMFVDEQTLSKLSAEG+  DDKGYIG+WKIVVVRNLPY+DMRRTG
Subjt:  PPESPSTGFDLDEKDNAYMKMCKVAVSSCIFGSSDFLRRPTSKQISEYSKKNVCFVMFVDEQTLSKLSAEGSIPDDKGYIGLWKIVVVRNLPYEDMRRTG

Query:  KVPKFLSHRLFP------------LFKVDPMLIIEHFLWQKKSEYAISNHYDRHCVWEEVLQNKRLNKYNHTAIDEQFAFYQSDGLVKFDPFDVRTPLPS
        KVPKFLSHRLFP              ++DPMLIIE+FLW+KKSEYAISNHYDRHCVWEEV QNKRLNKYNHTAIDEQFAFYQSDGLVKFDP +  TPLPS
Subjt:  KVPKFLSHRLFP------------LFKVDPMLIIEHFLWQKKSEYAISNHYDRHCVWEEVLQNKRLNKYNHTAIDEQFAFYQSDGLVKFDPFDVRTPLPS

Query:  YVPEGSFIVRAHTPMSNLFSCLWFNEVDRFTSRDQLSFAYTYLKFRRINQDIPFNLNMFKDCERRSLAKLFRHRSWPPPN
        YVPEGSFIVRAHTPMSNLFSCLWFNEVDRFTSRDQLSFAYTYLK  R NQD+PFNLNMFKDCERRSLAKLFRHR +PPPN
Subjt:  YVPEGSFIVRAHTPMSNLFSCLWFNEVDRFTSRDQLSFAYTYLKFRRINQDIPFNLNMFKDCERRSLAKLFRHRSWPPPN

A0A6J1ICN5 uncharacterized protein LOC1114718668.3e-20175.59Show/hide
Query:  MMGLFGHNNEQLPERR--------------------------------------------------VFGLKVNLHDKIGGSISVSEVQGETLVSRDQKPS
        MM LFG+NNEQLPERR                                                  VFGLKVN H KIG SISV+EVQGE L SRDQKP 
Subjt:  MMGLFGHNNEQLPERR--------------------------------------------------VFGLKVNLHDKIGGSISVSEVQGETLVSRDQKPS

Query:  TSKHRRKQHFPCEVVFTESVDYLLEPEDFMNVTQFSLEYIEREEKPSETDLYKPRFGGHQTLKERERSFYATNQKLHCGFIKGPPESPSTGFDLDEKDNA
        T+KH RKQH PC+V F ESV YL+EPE FMNV QFSLE++E EE+ SETDLYKPRFGGHQTL+ERE+SFYATNQKLHCGF+KGPP  PSTGFDLDEKDNA
Subjt:  TSKHRRKQHFPCEVVFTESVDYLLEPEDFMNVTQFSLEYIEREEKPSETDLYKPRFGGHQTLKERERSFYATNQKLHCGFIKGPPESPSTGFDLDEKDNA

Query:  YMKMCKVAVSSCIFGSSDFLRRPTSKQISEYSKKNVCFVMFVDEQTLSKLSAEGSIPDDKGYIGLWKIVVVRNLPYEDMRRTGKVPKFLSHRLFP-----
        YMK+CKVAVSSCIFGSSDFLRRPTSKQISEYSKKNVCFVMFVDEQTLSKLSAEG+ PDDKGYIG+WKIVVVRNLPY+DMRRTGKVPKFLSHRLFP     
Subjt:  YMKMCKVAVSSCIFGSSDFLRRPTSKQISEYSKKNVCFVMFVDEQTLSKLSAEGSIPDDKGYIGLWKIVVVRNLPYEDMRRTGKVPKFLSHRLFP-----

Query:  -------LFKVDPMLIIEHFLWQKKSEYAISNHYDRHCVWEEVLQNKRLNKYNHTAIDEQFAFYQSDGLVKFDPFDVRTPLPSYVPEGSFIVRAHTPMSN
                 ++DPMLIIE+FLW+KKSEYAISNHYDRHCVWEEV QNKRLNKYNHTAIDEQFAFYQSDGLVKFDP +  TPLPSYVPEGSFIVRAHTPMSN
Subjt:  -------LFKVDPMLIIEHFLWQKKSEYAISNHYDRHCVWEEVLQNKRLNKYNHTAIDEQFAFYQSDGLVKFDPFDVRTPLPSYVPEGSFIVRAHTPMSN

Query:  LFSCLWFNEVDRFTSRDQLSFAYTYLKFRRINQDIPFNLNMFKDCERRSLAKLFRHRSWPPPN
        LFSCLWFNEVDRFTSRDQLSFAYTYLK  R NQD+PFNLNMFKDCERRSLAKLFRHR +PPPN
Subjt:  LFSCLWFNEVDRFTSRDQLSFAYTYLKFRRINQDIPFNLNMFKDCERRSLAKLFRHRSWPPPN

SwissProt top hitse value%identityAlignment
Q9FZ97 Probable hexosyltransferase MUCI706.0e-7142.18Show/hide
Query:  QGETLVSRDQKPSTSKHRRKQHFPCEVVF---TESVDYLLEPEDFMNVTQFSLEYIEREEKPSETDLYKPRFGGHQTLKERERSF-YATNQKLHCGFIKG
        QG    S    P  +  +R    PC V +    E+V  +     F  V + +L YI  E    ET+     FGG+ TLK R  SF       +HCGF+KG
Subjt:  QGETLVSRDQKPSTSKHRRKQHFPCEVVF---TESVDYLLEPEDFMNVTQFSLEYIEREEKPSETDLYKPRFGGHQTLKERERSF-YATNQKLHCGFIKG

Query:  PPESPSTGFDLDEKDNAYMKMCK-VAVSSCIFGSSDFLRRPTSKQISEYSKKNVCFVMFVDEQTLSKLSAEGSIPDDKGYIGLWKIVVVRNLPYEDMRRT
        P    +TGFD+DE D   MK C+ + V+S +F + D ++ P  + IS+Y+++ VCF MFVDE+T S L  E  +  +K  +G+W++VVV NLPY D RR 
Subjt:  PPESPSTGFDLDEKDNAYMKMCK-VAVSSCIFGSSDFLRRPTSKQISEYSKKNVCFVMFVDEQTLSKLSAEGSIPDDKGYIGLWKIVVVRNLPYEDMRRT

Query:  GKVPKFLSHRLFPLFK------------VDPMLIIEHFLWQKKSEYAISNHYDRHCVWEEVLQNKRLNKYNHTAIDEQFAFYQSDGLVKFDPFDV-RTPL
        GKVPK L HR+FP  +            VDP  I+E FLW+K + +AIS HY R  V  E   NK   KY++ +ID Q  FY+++GL    P+ V + P+
Subjt:  GKVPKFLSHRLFPLFK------------VDPMLIIEHFLWQKKSEYAISNHYDRHCVWEEVLQNKRLNKYNHTAIDEQFAFYQSDGLVKFDPFDV-RTPL

Query:  PSYVPEGSFIVRAHTPMSNLFSCLWFNEVDRFTSRDQLSFAYTYLKFRRINQDIPFNLNMFKDCERRSLAKLFRHRS
         S VPEG  I+R H P+SNLF+CLWFNEVDRFTSRDQ+SF+    K   I     + ++MF DCERR+      HR+
Subjt:  PSYVPEGSFIVRAHTPMSNLFSCLWFNEVDRFTSRDQLSFAYTYLKFRRINQDIPFNLNMFKDCERRSLAKLFRHRS

Arabidopsis top hitse value%identityAlignment
AT1G28240.1 Protein of unknown function (DUF616)4.2e-7242.18Show/hide
Query:  QGETLVSRDQKPSTSKHRRKQHFPCEVVF---TESVDYLLEPEDFMNVTQFSLEYIEREEKPSETDLYKPRFGGHQTLKERERSF-YATNQKLHCGFIKG
        QG    S    P  +  +R    PC V +    E+V  +     F  V + +L YI  E    ET+     FGG+ TLK R  SF       +HCGF+KG
Subjt:  QGETLVSRDQKPSTSKHRRKQHFPCEVVF---TESVDYLLEPEDFMNVTQFSLEYIEREEKPSETDLYKPRFGGHQTLKERERSF-YATNQKLHCGFIKG

Query:  PPESPSTGFDLDEKDNAYMKMCK-VAVSSCIFGSSDFLRRPTSKQISEYSKKNVCFVMFVDEQTLSKLSAEGSIPDDKGYIGLWKIVVVRNLPYEDMRRT
        P    +TGFD+DE D   MK C+ + V+S +F + D ++ P  + IS+Y+++ VCF MFVDE+T S L  E  +  +K  +G+W++VVV NLPY D RR 
Subjt:  PPESPSTGFDLDEKDNAYMKMCK-VAVSSCIFGSSDFLRRPTSKQISEYSKKNVCFVMFVDEQTLSKLSAEGSIPDDKGYIGLWKIVVVRNLPYEDMRRT

Query:  GKVPKFLSHRLFPLFK------------VDPMLIIEHFLWQKKSEYAISNHYDRHCVWEEVLQNKRLNKYNHTAIDEQFAFYQSDGLVKFDPFDV-RTPL
        GKVPK L HR+FP  +            VDP  I+E FLW+K + +AIS HY R  V  E   NK   KY++ +ID Q  FY+++GL    P+ V + P+
Subjt:  GKVPKFLSHRLFPLFK------------VDPMLIIEHFLWQKKSEYAISNHYDRHCVWEEVLQNKRLNKYNHTAIDEQFAFYQSDGLVKFDPFDV-RTPL

Query:  PSYVPEGSFIVRAHTPMSNLFSCLWFNEVDRFTSRDQLSFAYTYLKFRRINQDIPFNLNMFKDCERRSLAKLFRHRS
         S VPEG  I+R H P+SNLF+CLWFNEVDRFTSRDQ+SF+    K   I     + ++MF DCERR+      HR+
Subjt:  PSYVPEGSFIVRAHTPMSNLFSCLWFNEVDRFTSRDQLSFAYTYLKFRRINQDIPFNLNMFKDCERRSLAKLFRHRS

AT1G34550.1 Protein of unknown function (DUF616)4.6e-13557.25Show/hide
Query:  GH--NNEQLPERRVFGL--KVNLHDKIGGSISVSEVQGETLVSRDQKPSTS-------KHRRKQHFPCEVVFTESVDYLLEPEDFMNVTQFSLEYIEREE
        GH  +NE++ + +V  L  K    DK  G +S   +   +LVS+  K   +       + RR+    CE+    S   ++EP +     +FSL+YIE+E+
Subjt:  GH--NNEQLPERRVFGL--KVNLHDKIGGSISVSEVQGETLVSRDQKPSTS-------KHRRKQHFPCEVVFTESVDYLLEPEDFMNVTQFSLEYIEREE

Query:  KPSETDLYKPRFGGHQTLKERERSFYATNQKLHCGFIKGPPESPSTGFDLDEKDNAYMKMCKVAVSSCIFGSSDFLRRPTSKQISEYSKKNVCFVMFVDE
        KP E + ++PRF GHQ+L+ERE SF A ++K+HCGF+KGP  S STGFDL E D  Y+  C +AVSSCIFG+SD LR P +K IS  S+KNVCF++FVDE
Subjt:  KPSETDLYKPRFGGHQTLKERERSFYATNQKLHCGFIKGPPESPSTGFDLDEKDNAYMKMCKVAVSSCIFGSSDFLRRPTSKQISEYSKKNVCFVMFVDE

Query:  QTLSKLSAEGSIPDDKGYIGLWKIVVVRNLPYEDMRRTGKVPKFLSHRLFP------------LFKVDPMLIIEHFLWQKKSEYAISNHYDRHCVWEEVL
         T+  LSAEG  PD  G+IGLWK+VVV+NLPY DMRR GK+PK L HRLFP              ++DP+LI+E+FLW+K  EYAISNHYDRHC+WEEV 
Subjt:  QTLSKLSAEGSIPDDKGYIGLWKIVVVRNLPYEDMRRTGKVPKFLSHRLFP------------LFKVDPMLIIEHFLWQKKSEYAISNHYDRHCVWEEVL

Query:  QNKRLNKYNHTAIDEQFAFYQSDGLVKFDPFDVRTPLPSYVPEGSFIVRAHTPMSNLFSCLWFNEVDRFTSRDQLSFAYTYLKFRRINQDIPFNLNMFKD
        QNK+LNKYNHT I++QF FY++DGL +F+  D    LPS VPEGSFIVRAHTPMSNLFSCLWFNEV+RFT RDQLSFAYTY K RR+N D PFNL+MFKD
Subjt:  QNKRLNKYNHTAIDEQFAFYQSDGLVKFDPFDVRTPLPSYVPEGSFIVRAHTPMSNLFSCLWFNEVDRFTSRDQLSFAYTYLKFRRINQDIPFNLNMFKD

Query:  CERRSLAKLFRHRS
        CERR +AKLFRHRS
Subjt:  CERRSLAKLFRHRS

AT1G53040.1 Protein of unknown function (DUF616)5.2e-7042.62Show/hide
Query:  RRKQHFPCEVVFTESVDYL--LEPEDFMNVTQFSLEYIEREE--KPSETDLYKPRFGGHQTLKERERSF-YATNQKLHCGFIKGPPESPSTGFDLDEKD-
        RR    PC V +    + L  +    F +    +L YI  E   KP E++     FGG+ +L+ R  SF    +  +HCGFIKG      TGFD+DE   
Subjt:  RRKQHFPCEVVFTESVDYL--LEPEDFMNVTQFSLEYIEREE--KPSETDLYKPRFGGHQTLKERERSF-YATNQKLHCGFIKGPPESPSTGFDLDEKD-

Query:  NAYMKMCKVAVSSCIFGSSDFLRRPTSKQISEYSKKNVCFVMFVDEQTLSKLSAEGSIPDDKGYIGLWKIVVVRNLPYEDMRRTGKVPKFLSHRLFPLFK
        +   +   V V+S IFG  D ++ P +  ISE ++KN+ F MFVDE+T   L    S  DD   +GLW+I+VV N+PY D RR GKVPK L HRLFP  +
Subjt:  NAYMKMCKVAVSSCIFGSSDFLRRPTSKQISEYSKKNVCFVMFVDEQTLSKLSAEGSIPDDKGYIGLWKIVVVRNLPYEDMRRTGKVPKFLSHRLFPLFK

Query:  ------------VDPMLIIEHFLWQKKSEYAISNHYDRHCVWEEVLQNKRLNKYNHTAIDEQFAFYQSDGLVKFDPFDVRTPLPSYVPEGSFIVRAHTPM
                    VDP  I+E FLW+  S +AIS HY R  V+ E   NK   KY++ +ID Q  FY+ +GL  +   + + P+ S VPEG  I+R H P+
Subjt:  ------------VDPMLIIEHFLWQKKSEYAISNHYDRHCVWEEVLQNKRLNKYNHTAIDEQFAFYQSDGLVKFDPFDVRTPLPSYVPEGSFIVRAHTPM

Query:  SNLFSCLWFNEVDRFTSRDQLSFAYTYLKFRRINQDIPFNLNMFKDCERRSLAKLFRHR
        +NLF+C+WFNEVDRFTSRDQLSFA    K R   + + +++NMF DCERR+  K   HR
Subjt:  SNLFSCLWFNEVDRFTSRDQLSFAYTYLKFRRINQDIPFNLNMFKDCERRSLAKLFRHR

AT2G02910.1 Protein of unknown function (DUF616)5.6e-14969.19Show/hide
Query:  SRDQKPSTSKHRRKQHFPCEVVFTESVDYLLEPEDFMNVTQFSLEYIEREEKPSETDLYKPRFGGHQTLKERERSFYATNQKLHCGFIKGPPESPSTGFD
        S  ++P  SK R K H PCEV   ES D +LEP+D++N T+FSL ++E E   +      PRFGGHQTL ERERS+ A NQ +HCGF+KG      TGFD
Subjt:  SRDQKPSTSKHRRKQHFPCEVVFTESVDYLLEPEDFMNVTQFSLEYIEREEKPSETDLYKPRFGGHQTLKERERSFYATNQKLHCGFIKGPPESPSTGFD

Query:  LDEKDNAYMKMCKVAVSSCIFGSSDFLRRPTSKQISEYSKKNVCFVMFVDEQTLSKLSAEGSIPDDKGYIGLWKIVVVRNLPYEDMRRTGKVPKFLSHRL
        L EKD AYMK C V+VSSCIFGSSDFLRRP +K+ISE+SK+NVCFVMFVDEQTLSKL++EG +PD +G++GLWK VVV NLPY DMR+TGKVPKFLSHRL
Subjt:  LDEKDNAYMKMCKVAVSSCIFGSSDFLRRPTSKQISEYSKKNVCFVMFVDEQTLSKLSAEGSIPDDKGYIGLWKIVVVRNLPYEDMRRTGKVPKFLSHRL

Query:  FPLFK------------VDPMLIIEHFLWQKKSEYAISNHYDRHCVWEEVLQNKRLNKYNHTAIDEQFAFYQSDGLVKFDPFDVRTPLPSYVPEGSFIVR
        FP  +             DPMLII+ FLW+ KSE+AISNHYDRHCVW+EVLQNKRLNKYNH+AIDEQF FY+SDGL KFDP D  +PLPSYVPEGSFIVR
Subjt:  FPLFK------------VDPMLIIEHFLWQKKSEYAISNHYDRHCVWEEVLQNKRLNKYNHTAIDEQFAFYQSDGLVKFDPFDVRTPLPSYVPEGSFIVR

Query:  AHTPMSNLFSCLWFNEVDRFTSRDQLSFAYTYLKFRRINQDIPFNLNMFKDCERRSLAKLFRHR--SWPP
        AHTPMSNLF+CLWFNEVDRFTSRDQLSFAYTYLK +R+N D P  LNMFKDCERR+L KLF HR  S PP
Subjt:  AHTPMSNLFSCLWFNEVDRFTSRDQLSFAYTYLKFRRINQDIPFNLNMFKDCERRSLAKLFRHR--SWPP

AT4G09630.1 Protein of unknown function (DUF616)1.2e-12754.44Show/hide
Query:  HNNEQLPERRVFGLKVNLH--------DKIGGSISV-SEVQGETLVSRDQKPS--TSKHRRKQH----FPCEVVFTESVDYLLEPEDFMNVTQFSLEYIE
        HN +   E       V LH        +K+ G+ S  S  +   L  +  K S   +K R + H      CE+    S   + EP    N    SL+YI+
Subjt:  HNNEQLPERRVFGLKVNLH--------DKIGGSISV-SEVQGETLVSRDQKPS--TSKHRRKQH----FPCEVVFTESVDYLLEPEDFMNVTQFSLEYIE

Query:  REEKPSETDLYKPRFGGHQTLKERERSFYATNQKLHCGFIKGPPESPSTGFDLDEKDNAYMKMCKVAVSSCIFGSSDFLRRPTSKQISEYSKKNVCFVMF
         E+KP   + ++P+F GHQ+L+ERE SF    QK+HCGF+K P   PSTGFDL E D  Y+  C +AV SCIFG+SD LR P +K +S  S+K+VCFV+F
Subjt:  REEKPSETDLYKPRFGGHQTLKERERSFYATNQKLHCGFIKGPPESPSTGFDLDEKDNAYMKMCKVAVSSCIFGSSDFLRRPTSKQISEYSKKNVCFVMF

Query:  VDEQTLSKLSAEGSIPDDKGYIGLWKIVVVRNLPYEDMRRTGKVPKFLSHRLFP------------LFKVDPMLIIEHFLWQKKSEYAISNHYDRHCVWE
        VDE T+  LSAEG +PD  G++GLWK+VVVRNLPY DMRR GK+PK L HRLF               ++DP++I+E+FLW++  EYAISNHYDRHC+WE
Subjt:  VDEQTLSKLSAEGSIPDDKGYIGLWKIVVVRNLPYEDMRRTGKVPKFLSHRLFP------------LFKVDPMLIIEHFLWQKKSEYAISNHYDRHCVWE

Query:  EVLQNKRLNKYNHTAIDEQFAFYQSDGLVKFDPFDVRTPLPSYVPEGSFIVRAHTPMSNLFSCLWFNEVDRFTSRDQLSFAYTYLKFRRINQDIPFNLNM
        EV QNK+LNKYNHT ID+QF FYQSDGL +F+  D    LPS VPEGSFIVR HTPMSNLFSCLWFNEV+RFT RDQLSFAYTY K  R+N D PFNL+M
Subjt:  EVLQNKRLNKYNHTAIDEQFAFYQSDGLVKFDPFDVRTPLPSYVPEGSFIVRAHTPMSNLFSCLWFNEVDRFTSRDQLSFAYTYLKFRRINQDIPFNLNM

Query:  FKDCERRSLAKLFRHRS
        FKDCERR + KLFRHRS
Subjt:  FKDCERRSLAKLFRHRS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTCGGTTCGGGTCCCGGTTCAGGAAAAACCGAGGGAACAAAATCAATGTTACGGAACCGGTTGTGAAGGGGCGGCTGCGTCTCGCGCCATTTCCTGCAAAA
TGCCAACTGCATTTCCCCTCTCACACTTCGCTCCCCCAACTCACTCCACCGGCCATAATTTATCGACAAGAAGATCGATCCAACATAGATGCTCATCTATGGAGT
AATCAGGGGTTTGTTTCTATTCTTTTGCATTTCTTCACATTACTCTCATCATTTTTTGCCCGCCAGATCTCCAACCATCTAATTCTCCGGATCGTATTTTTCATT
TGGTTTCACGGACTTTGGTGGGAGATCTATCTATACAGTAATGCTTTCAATCACAGGAATTGTATTTTATCAGAAACTGAAAAGGTTCTACATCGTGATGTTCGT
CCTCCCACCCTCCTCGAAACCAGATGTGAACCGAGATCTGGAGGCAATGTCATGATGGGTCTGTTTGGGCACAATAATGAACAGTTACCAGAAAGAAGAGTCTTC
GGCTTAAAAGTGAACTTGCATGATAAAATTGGAGGTAGCATTTCGGTGTCAGAAGTACAAGGAGAAACTTTGGTATCGAGGGACCAGAAACCATCAACTAGCAAG
CATCGTCGTAAGCAACATTTTCCTTGTGAAGTCGTGTTTACGGAGTCAGTTGATTATCTTTTGGAACCTGAGGACTTCATGAACGTTACTCAATTCTCTTTAGAG
TACATAGAGCGTGAGGAAAAACCTTCTGAAACTGATTTATATAAGCCCAGATTTGGAGGACACCAAACCCTTAAGGAAAGGGAGAGATCCTTCTATGCAACAAAT
CAAAAACTTCATTGTGGTTTCATTAAAGGACCACCAGAATCCCCAAGTACTGGATTTGATTTAGATGAAAAGGACAATGCGTACATGAAAATGTGCAAGGTTGCG
GTATCATCGTGCATTTTTGGGAGCTCTGATTTTCTGAGGAGGCCTACTAGCAAACAGATCAGTGAATATTCCAAGAAGAATGTTTGTTTTGTCATGTTTGTGGAT
GAGCAAACACTATCAAAACTATCAGCTGAGGGAAGTATTCCTGATGACAAGGGATACATTGGACTGTGGAAAATAGTTGTTGTGAGGAATTTACCATATGAAGAT
ATGCGCAGAACTGGAAAAGTTCCTAAATTTTTGTCACATCGGCTCTTCCCCCTCTTCAAGGTAGATCCAATGCTAATTATTGAACATTTTCTGTGGCAAAAGAAG
TCAGAGTATGCTATTTCAAACCACTATGATCGCCACTGTGTCTGGGAGGAAGTTCTCCAAAACAAACGTCTGAACAAGTACAATCACACTGCCATTGATGAACAG
TTTGCTTTTTATCAGTCTGATGGCCTCGTTAAATTCGACCCTTTTGATGTCAGGACTCCGCTTCCGAGTTATGTTCCTGAAGGTTCCTTCATTGTCCGGGCACAT
ACGCCAATGTCAAATTTATTTTCATGTCTTTGGTTCAATGAAGTTGACCGGTTTACCTCACGTGATCAATTGAGCTTTGCATATACTTACTTGAAATTCAGGAGA
ATAAACCAAGACATACCATTCAACTTAAACATGTTTAAGGACTGTGAACGACGATCACTAGCCAAGCTTTTTCGTCATAGATCATGGCCACCTCCAAATACTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGTTCGGTTCGGGTCCCGGTTCAGGAAAAACCGAGGGAACAAAATCAATGTTACGGAACCGGTTGTGAAGGGGCGGCTGCGTCTCGCGCCATTTCCTGCAAAA
TGCCAACTGCATTTCCCCTCTCACACTTCGCTCCCCCAACTCACTCCACCGGCCATAATTTATCGACAAGAAGATCGATCCAACATAGATGCTCATCTATGGAGT
AATCAGGGGTTTGTTTCTATTCTTTTGCATTTCTTCACATTACTCTCATCATTTTTTGCCCGCCAGATCTCCAACCATCTAATTCTCCGGATCGTATTTTTCATT
TGGTTTCACGGACTTTGGTGGGAGATCTATCTATACAGTAATGCTTTCAATCACAGGAATTGTATTTTATCAGAAACTGAAAAGGTTCTACATCGTGATGTTCGT
CCTCCCACCCTCCTCGAAACCAGATGTGAACCGAGATCTGGAGGCAATGTCATGATGGGTCTGTTTGGGCACAATAATGAACAGTTACCAGAAAGAAGAGTCTTC
GGCTTAAAAGTGAACTTGCATGATAAAATTGGAGGTAGCATTTCGGTGTCAGAAGTACAAGGAGAAACTTTGGTATCGAGGGACCAGAAACCATCAACTAGCAAG
CATCGTCGTAAGCAACATTTTCCTTGTGAAGTCGTGTTTACGGAGTCAGTTGATTATCTTTTGGAACCTGAGGACTTCATGAACGTTACTCAATTCTCTTTAGAG
TACATAGAGCGTGAGGAAAAACCTTCTGAAACTGATTTATATAAGCCCAGATTTGGAGGACACCAAACCCTTAAGGAAAGGGAGAGATCCTTCTATGCAACAAAT
CAAAAACTTCATTGTGGTTTCATTAAAGGACCACCAGAATCCCCAAGTACTGGATTTGATTTAGATGAAAAGGACAATGCGTACATGAAAATGTGCAAGGTTGCG
GTATCATCGTGCATTTTTGGGAGCTCTGATTTTCTGAGGAGGCCTACTAGCAAACAGATCAGTGAATATTCCAAGAAGAATGTTTGTTTTGTCATGTTTGTGGAT
GAGCAAACACTATCAAAACTATCAGCTGAGGGAAGTATTCCTGATGACAAGGGATACATTGGACTGTGGAAAATAGTTGTTGTGAGGAATTTACCATATGAAGAT
ATGCGCAGAACTGGAAAAGTTCCTAAATTTTTGTCACATCGGCTCTTCCCCCTCTTCAAGGTAGATCCAATGCTAATTATTGAACATTTTCTGTGGCAAAAGAAG
TCAGAGTATGCTATTTCAAACCACTATGATCGCCACTGTGTCTGGGAGGAAGTTCTCCAAAACAAACGTCTGAACAAGTACAATCACACTGCCATTGATGAACAG
TTTGCTTTTTATCAGTCTGATGGCCTCGTTAAATTCGACCCTTTTGATGTCAGGACTCCGCTTCCGAGTTATGTTCCTGAAGGTTCCTTCATTGTCCGGGCACAT
ACGCCAATGTCAAATTTATTTTCATGTCTTTGGTTCAATGAAGTTGACCGGTTTACCTCACGTGATCAATTGAGCTTTGCATATACTTACTTGAAATTCAGGAGA
ATAAACCAAGACATACCATTCAACTTAAACATGTTTAAGGACTGTGAACGACGATCACTAGCCAAGCTTTTTCGTCATAGATCATGGCCACCTCCAAATACTTGA
Protein sequenceShow/hide protein sequence
MVRFGSRFRKNRGNKINVTEPVVKGRLRLAPFPAKCQLHFPSHTSLPQLTPPAIIYRQEDRSNIDAHLWSNQGFVSILLHFFTLLSSFFARQISNHLILRIVFFI
WFHGLWWEIYLYSNAFNHRNCILSETEKVLHRDVRPPTLLETRCEPRSGGNVMMGLFGHNNEQLPERRVFGLKVNLHDKIGGSISVSEVQGETLVSRDQKPSTSK
HRRKQHFPCEVVFTESVDYLLEPEDFMNVTQFSLEYIEREEKPSETDLYKPRFGGHQTLKERERSFYATNQKLHCGFIKGPPESPSTGFDLDEKDNAYMKMCKVA
VSSCIFGSSDFLRRPTSKQISEYSKKNVCFVMFVDEQTLSKLSAEGSIPDDKGYIGLWKIVVVRNLPYEDMRRTGKVPKFLSHRLFPLFKVDPMLIIEHFLWQKK
SEYAISNHYDRHCVWEEVLQNKRLNKYNHTAIDEQFAFYQSDGLVKFDPFDVRTPLPSYVPEGSFIVRAHTPMSNLFSCLWFNEVDRFTSRDQLSFAYTYLKFRR
INQDIPFNLNMFKDCERRSLAKLFRHRSWPPPNT