; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr012594 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr012594
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionProtein GAMETE CELL DEFECTIVE 1, mitochondrial
Genome locationtig00153449:102783..105124
RNA-Seq ExpressionSgr012594
SyntenySgr012594
Gene Ontology termsGO:0007006 - mitochondrial membrane organization (biological process)
GO:0007033 - vacuole organization (biological process)
GO:0007154 - cell communication (biological process)
GO:0009555 - pollen development (biological process)
GO:0009793 - embryo development ending in seed dormancy (biological process)
GO:0009846 - pollen germination (biological process)
GO:0009960 - endosperm development (biological process)
GO:0010342 - endosperm cellularization (biological process)
GO:0010468 - regulation of gene expression (biological process)
GO:0010581 - regulation of starch biosynthetic process (biological process)
GO:0043067 - regulation of programmed cell death (biological process)
GO:0048868 - pollen tube development (biological process)
GO:0051647 - nucleus localization (biological process)
GO:0005739 - mitochondrion (cellular component)
GO:0000287 - magnesium ion binding (molecular function)
GO:0010333 - terpene synthase activity (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022154030.1 uncharacterized protein LOC111021386 [Momordica charantia]2.0e-20091.93Show/hide
Query:  MQNLHHLICRLSSTSIGKSTNTSRLLKENVK------SVTTLKQAQGAWLTNLRDFSAKSGGFDEGDPKNEWDKNVRESFSGTTSDDLGWDSVFSWSTGL
        MQNLHHLI RLSSTS+GKSTNTSRLLK+NVK      SVTTLK  QGAWLTNLR+FSAKSGGFDEGD KNEWDK+V +SFSGTTSDDLGWDSV SWSTGL
Subjt:  MQNLHHLICRLSSTSIGKSTNTSRLLKENVK------SVTTLKQAQGAWLTNLRDFSAKSGGFDEGDPKNEWDKNVRESFSGTTSDDLGWDSVFSWSTGL

Query:  TKEHFDGEAVGRRTSGGGDSPKSPQSSLLSGLQEIEDKLRELEAENRKSKGFVDKWGERMREMSTLLKQVREPGARGSYLKDSEKAEMYRLHKENPEVFT
        TKEHFDGEAVGRRTSGGGDSPKSPQSSL+SGLQEIED++RELEAENRKSKGFVD+WGERMRE+S LLKQVREPGARG+YLKDSEKAEMYRLHKENPEV+T
Subjt:  TKEHFDGEAVGRRTSGGGDSPKSPQSSLLSGLQEIEDKLRELEAENRKSKGFVDKWGERMREMSTLLKQVREPGARGSYLKDSEKAEMYRLHKENPEVFT

Query:  VEKLAKDYRIMRQRVHAILWLKELEEEEEKKLGHPLDDSVELLLDTCPEFFKSHDREFHVASLPYKPDFKVMPEGWDGTTRDLDEVHYEISQKEDEMLYK
        V+KLAKDYRIMRQRVHAILWLKELEEEEEKKLGHPLDDSVELLLDTCPEFFKSHDREFHVASLPYKPDFKVMPEGWDGTTRDLDEVHYEISQ ED+MLYK
Subjt:  VEKLAKDYRIMRQRVHAILWLKELEEEEEKKLGHPLDDSVELLLDTCPEFFKSHDREFHVASLPYKPDFKVMPEGWDGTTRDLDEVHYEISQKEDEMLYK

Query:  EFVEKMNFNKKKIAGEVYRHKYSRRRSSDGWKFTIEKMGPRGKRGGGGGWKFVSLPDGSSRPLNEMEKMYVRRETPRHRRKILP
        EFVEKMNFNKKKIAGEV+RHKYSRRR SDGWKFT+EKMGPRGKRGGGGGWKFVSLPDGSSRPLNEMEKMYVRRETPRHRRKILP
Subjt:  EFVEKMNFNKKKIAGEVYRHKYSRRRSSDGWKFTIEKMGPRGKRGGGGGWKFVSLPDGSSRPLNEMEKMYVRRETPRHRRKILP

XP_022923343.1 uncharacterized protein LOC111431066 [Cucurbita moschata]2.1e-18687.83Show/hide
Query:  MQNLHHLICRLSSTSIGKSTNTSRLLKENVKSVTTLKQAQGAWLTNLRDFSAKSGGFDEGDPKNEWDKNVRESFSGTTSDDLGWDSVFSWSTGLTKEHFD
        MQNLHH ICRLSS S+GKSTN    L  +  SV+TLK  QGAWLT LR+FSAKSGGFDE + KNEWDK+V ESFSGTT+DDLGWDSV SWSTGLTKEHFD
Subjt:  MQNLHHLICRLSSTSIGKSTNTSRLLKENVKSVTTLKQAQGAWLTNLRDFSAKSGGFDEGDPKNEWDKNVRESFSGTTSDDLGWDSVFSWSTGLTKEHFD

Query:  GEAVGRRTSGGGDSPKSPQSSLLSGLQEIEDKLRELEAENRKSKGFVDKWGERMREMSTLLKQVREPGARGSYLKDSEKAEMYRLHKENPEVFTVEKLAK
        GEAVGRR   G DSPKSPQ+SL+SGLQE ED++RELEAENRKSK FVDKWGERM+EMS LLKQVREPGARGSYLKDSEKAEMYRLHKENPEV+TVEKLAK
Subjt:  GEAVGRRTSGGGDSPKSPQSSLLSGLQEIEDKLRELEAENRKSKGFVDKWGERMREMSTLLKQVREPGARGSYLKDSEKAEMYRLHKENPEVFTVEKLAK

Query:  DYRIMRQRVHAILWLKELEEEEEKKLGHPLDDSVELLLDTCPEFFKSHDREFHVASLPYKPDFKVMPEGWDGTTRDLDEVHYEISQKEDEMLYKEFVEKM
        DYRIMRQRVHAILWLKELEEEEEKKLG PLDDSVELLLDT PEFFKSHD EFHVASLPYKPDFKVMPEGWDGTTRDLDEVHYEIS+KED+MLYKEFVEKM
Subjt:  DYRIMRQRVHAILWLKELEEEEEKKLGHPLDDSVELLLDTCPEFFKSHDREFHVASLPYKPDFKVMPEGWDGTTRDLDEVHYEISQKEDEMLYKEFVEKM

Query:  NFNKKKIAGEVYRHKYSRRRSSDGWKFTIEKMGPRGKRGGGGGWKFVSLPDGSSRPLNEMEKMYVRRETPRHRRKILP
        NFNKKKIAGEV+RHKYSRRR++DGWKFTIEKMGPRGKRGG GGWKFVSLPDGSSRPLNEMEKMYVRRETPRHRRKILP
Subjt:  NFNKKKIAGEVYRHKYSRRRSSDGWKFTIEKMGPRGKRGGGGGWKFVSLPDGSSRPLNEMEKMYVRRETPRHRRKILP

XP_022965080.1 uncharacterized protein LOC111465050 [Cucurbita maxima]6.4e-19189.42Show/hide
Query:  MQNLHHLICRLSSTSIGKSTNTSRLLKENVKSVTTLKQAQGAWLTNLRDFSAKSGGFDEGDPKNEWDKNVRESFSGTTSDDLGWDSVFSWSTGLTKEHFD
        MQNLHH ICRLSSTS+GKSTN    L  +  SV+TLK  QGAWLT LR+FSAKSGGFDE + KNEWDK+V ESFSGTTSDDLGWDSV SWSTGLTKEHFD
Subjt:  MQNLHHLICRLSSTSIGKSTNTSRLLKENVKSVTTLKQAQGAWLTNLRDFSAKSGGFDEGDPKNEWDKNVRESFSGTTSDDLGWDSVFSWSTGLTKEHFD

Query:  GEAVGRRTSGGGDSPKSPQSSLLSGLQEIEDKLRELEAENRKSKGFVDKWGERMREMSTLLKQVREPGARGSYLKDSEKAEMYRLHKENPEVFTVEKLAK
        GEAVGRR   GGDSPKSPQSSL+SGLQE ED++RELEAENRKSK FVDKWGERM+EMS LLKQVREPGARGSYLKDSEKAEMYRLHKENPEV+TVEKLAK
Subjt:  GEAVGRRTSGGGDSPKSPQSSLLSGLQEIEDKLRELEAENRKSKGFVDKWGERMREMSTLLKQVREPGARGSYLKDSEKAEMYRLHKENPEVFTVEKLAK

Query:  DYRIMRQRVHAILWLKELEEEEEKKLGHPLDDSVELLLDTCPEFFKSHDREFHVASLPYKPDFKVMPEGWDGTTRDLDEVHYEISQKEDEMLYKEFVEKM
        DYRIMRQRVHAILWLKELEEEEEKKLGHPLDDSVELLLDTCPEFFKSHD EFHVASLPYKPDFKVMPEGWDGTTRDLDEVHYEIS+KED+MLYKEFVEKM
Subjt:  DYRIMRQRVHAILWLKELEEEEEKKLGHPLDDSVELLLDTCPEFFKSHDREFHVASLPYKPDFKVMPEGWDGTTRDLDEVHYEISQKEDEMLYKEFVEKM

Query:  NFNKKKIAGEVYRHKYSRRRSSDGWKFTIEKMGPRGKRGGGGGWKFVSLPDGSSRPLNEMEKMYVRRETPRHRRKILP
        NFNKKKIAGEV+RHKYSRRR++DGWKFTIEKMGPRGKRGG GGWKFVSLPDGSSRPLNEMEKMYVRRETPRHRRKILP
Subjt:  NFNKKKIAGEVYRHKYSRRRSSDGWKFTIEKMGPRGKRGGGGGWKFVSLPDGSSRPLNEMEKMYVRRETPRHRRKILP

XP_023553278.1 uncharacterized protein LOC111810742 [Cucurbita pepo subsp. pepo]5.4e-19088.92Show/hide
Query:  MQNLHHLICRLSSTSIGKSTNT-SRLLKENVKSVTTLKQAQGAWLTNLRDFSAKSGGFDEGDPKNEWDKNVRESFSGTTSDDLGWDSVFSWSTGLTKEHF
        MQNLHH ICRLSSTS+GKS N  S L+ +   SV+TLK  QGAWLT LR+FSAKSGGFDE + KNEWDK+V ESFSGTTSDDLGWDSV SWSTGLTKEHF
Subjt:  MQNLHHLICRLSSTSIGKSTNT-SRLLKENVKSVTTLKQAQGAWLTNLRDFSAKSGGFDEGDPKNEWDKNVRESFSGTTSDDLGWDSVFSWSTGLTKEHF

Query:  DGEAVGRRTSGGGDSPKSPQSSLLSGLQEIEDKLRELEAENRKSKGFVDKWGERMREMSTLLKQVREPGARGSYLKDSEKAEMYRLHKENPEVFTVEKLA
        DGEAVGRR   GGDSPKSPQSSL+SGLQE ED++RELEAENRKSK FVDKWGERM+EMS LLKQV+EPGARGSYLKDSEKAEMYRLHKENPEV+TVEKLA
Subjt:  DGEAVGRRTSGGGDSPKSPQSSLLSGLQEIEDKLRELEAENRKSKGFVDKWGERMREMSTLLKQVREPGARGSYLKDSEKAEMYRLHKENPEVFTVEKLA

Query:  KDYRIMRQRVHAILWLKELEEEEEKKLGHPLDDSVELLLDTCPEFFKSHDREFHVASLPYKPDFKVMPEGWDGTTRDLDEVHYEISQKEDEMLYKEFVEK
        KDYRIMRQRVHAILWLKELEEEEEKKLGHPLDDSVELLLDTCPEFFKSHD EFHVASLPYKPDFKVMPEGWDGTTRDLDEVHYEIS+KED+MLYKEFVEK
Subjt:  KDYRIMRQRVHAILWLKELEEEEEKKLGHPLDDSVELLLDTCPEFFKSHDREFHVASLPYKPDFKVMPEGWDGTTRDLDEVHYEISQKEDEMLYKEFVEK

Query:  MNFNKKKIAGEVYRHKYSRRRSSDGWKFTIEKMGPRGKRGGGGGWKFVSLPDGSSRPLNEMEKMYVRRETPRHRRKILP
        MNFNKKKIAGEV+RHKYSRRR++DGWKFTIEKMGPRGKRGG GGWKFVSLPDGSSRPLNEMEKMYVRRETPRHRRKILP
Subjt:  MNFNKKKIAGEVYRHKYSRRRSSDGWKFTIEKMGPRGKRGGGGGWKFVSLPDGSSRPLNEMEKMYVRRETPRHRRKILP

XP_038906006.1 protein GAMETE CELL DEFECTIVE 1, mitochondrial [Benincasa hispida]4.7e-19489.84Show/hide
Query:  MQNLHHLICRLSSTSIGKSTNTSRLLKENV------KSVTTLKQAQGAWLTNLRDFSAKSGGFDEGDPKNEWDKNVRESFSGTTSDDLGWDSVFSWSTGL
        MQNLHHLICRLSSTS+GK+T TS+LLKENV       SV+TLK AQGAWLTNLR+FSAKSGGFD G+ KNE DK+V ESFSGT SDD GWDSV SWSTGL
Subjt:  MQNLHHLICRLSSTSIGKSTNTSRLLKENV------KSVTTLKQAQGAWLTNLRDFSAKSGGFDEGDPKNEWDKNVRESFSGTTSDDLGWDSVFSWSTGL

Query:  TKEHFDGEAVGRRTSGGGDSPKSPQSSLLSGLQEIEDKLRELEAENRKSKGFVDKWGERMREMSTLLKQVREPGARGSYLKDSEKAEMYRLHKENPEVFT
        TKEHFDGE VG RTSGG DSPKSPQSSL+SGLQEIED++RELEAENRKSK FVDKWGERMREMS LLKQV+EPGARGSYLKDSEKAEMYRLHKENPEV+T
Subjt:  TKEHFDGEAVGRRTSGGGDSPKSPQSSLLSGLQEIEDKLRELEAENRKSKGFVDKWGERMREMSTLLKQVREPGARGSYLKDSEKAEMYRLHKENPEVFT

Query:  VEKLAKDYRIMRQRVHAILWLKELEEEEEKKLGHPLDDSVELLLDTCPEFFKSHDREFHVASLPYKPDFKVMPEGWDGTTRDLDEVHYEISQKEDEMLYK
        VEKLAKDYRIMRQRVHAILWLKELEEEEEKKLGHPLDDSVELLLDTCPEFFKSHDREFHVASLPYKPDFKVMPEGWDGTTRDLDEVHYEISQKED+MLYK
Subjt:  VEKLAKDYRIMRQRVHAILWLKELEEEEEKKLGHPLDDSVELLLDTCPEFFKSHDREFHVASLPYKPDFKVMPEGWDGTTRDLDEVHYEISQKEDEMLYK

Query:  EFVEKMNFNKKKIAGEVYRHKYSRRRSSDGWKFTIEKMGPRGKRGGGGGWKFVSLPDGSSRPLNEMEKMYVRRETPRHRRKILP
        EFVEKMNFNKKKIAGEV+ HKYSRRR++DGWKFTIEKMGPRGKRGGGGGWKFVSLP+GSSRPLNEMEKMYVRRETPRHRRKILP
Subjt:  EFVEKMNFNKKKIAGEVYRHKYSRRRSSDGWKFTIEKMGPRGKRGGGGGWKFVSLPDGSSRPLNEMEKMYVRRETPRHRRKILP

TrEMBL top hitse value%identityAlignment
A0A0A0L1K7 Uncharacterized protein9.6e-18586.46Show/hide
Query:  MQNLHHLICRLSSTSIGKSTNTSRLLKEN------VKSVTTLKQAQGAWLTNLRDFSAKSGGFDEGDPKNEWDKNVRESFSGTTSDDLGWDSVFSWSTGL
        MQNLH+LI RLSSTS+GK+TNTSRLLK+N      V SV+TLK  QGAWLT LR+FSAKSGGF  GD KNEWDK+V ESFSG TSDDLGWDS  SWSTGL
Subjt:  MQNLHHLICRLSSTSIGKSTNTSRLLKEN------VKSVTTLKQAQGAWLTNLRDFSAKSGGFDEGDPKNEWDKNVRESFSGTTSDDLGWDSVFSWSTGL

Query:  TKEHFDGEAVGRRTSGGGDSPKSPQSSLLSGLQEIEDKLRELEAENRKSKGFVDKWGERMREMSTLLKQVREPGARGSYLKDSEKAEMYRLHKENPEVFT
        TKEHFDGEAVGRR SGGG+S +SPQSSL+SGLQE EDK+RELEAENRKSK +VDKWGERMRE S LLKQV+EPGARGSYLKDSEKAEMYRLHKENPEV+T
Subjt:  TKEHFDGEAVGRRTSGGGDSPKSPQSSLLSGLQEIEDKLRELEAENRKSKGFVDKWGERMREMSTLLKQVREPGARGSYLKDSEKAEMYRLHKENPEVFT

Query:  VEKLAKDYRIMRQRVHAILWLKELEEEEEKKLGHPLDDSVELLLDTCPEFFKSHDREFHVASLPYKPDFKVMPEGWDGTTRDLDEVHYEISQKEDEMLYK
        +EKLAKDYRIMRQRVHAILWLKELEEEEEKKLGHPLDDSVELLLD  PEFFKSHDREFHVASLPYKPDFKVMPEGWDGTTRDLDEVHYEISQKEDEMLY+
Subjt:  VEKLAKDYRIMRQRVHAILWLKELEEEEEKKLGHPLDDSVELLLDTCPEFFKSHDREFHVASLPYKPDFKVMPEGWDGTTRDLDEVHYEISQKEDEMLYK

Query:  EFVEKMNFNKKKIAGEVYRHKYSRRRSSDGWKFTIEKMGPRGKRGGGGGWKFVSLPDGSSRPLNEMEKMYVRRETPRHRRKILP
        EFVEK+NFNKKK+AGEV+RHKYSRRR +DGWKFTIEK+GPRGK+G GGG KFVSLP+GSSRPLNEMEKMYVRRETPRHRRKILP
Subjt:  EFVEKMNFNKKKIAGEVYRHKYSRRRSSDGWKFTIEKMGPRGKRGGGGGWKFVSLPDGSSRPLNEMEKMYVRRETPRHRRKILP

A0A5D3CJ95 Uncharacterized protein9.6e-18585.94Show/hide
Query:  MQNLHHLICRLSSTSIGKSTNTSRLLKEN------VKSVTTLKQAQGAWLTNLRDFSAKSGGFDEGDPKNEWDKNVRESFSGTTSDDLGWDSVFSWSTGL
        MQNLH+LI RLSSTS+GKSTNTSRLLK+N      V SV+TLK  QGAWLT LR+FSAKSGGF  GD KNEWDK+V E F G TSDDLGWDSV SWSTGL
Subjt:  MQNLHHLICRLSSTSIGKSTNTSRLLKEN------VKSVTTLKQAQGAWLTNLRDFSAKSGGFDEGDPKNEWDKNVRESFSGTTSDDLGWDSVFSWSTGL

Query:  TKEHFDGEAVGRRTSGGGDSPKSPQSSLLSGLQEIEDKLRELEAENRKSKGFVDKWGERMREMSTLLKQVREPGARGSYLKDSEKAEMYRLHKENPEVFT
        TKEHFDGEAVGR+ SGGG S +SPQSS++SGLQE ED +RELEAENRKSK +V+KWGERMREMS LLKQV+EPGARGSYLKDSEKAEMYRLHKENPEV+T
Subjt:  TKEHFDGEAVGRRTSGGGDSPKSPQSSLLSGLQEIEDKLRELEAENRKSKGFVDKWGERMREMSTLLKQVREPGARGSYLKDSEKAEMYRLHKENPEVFT

Query:  VEKLAKDYRIMRQRVHAILWLKELEEEEEKKLGHPLDDSVELLLDTCPEFFKSHDREFHVASLPYKPDFKVMPEGWDGTTRDLDEVHYEISQKEDEMLYK
        +EKLAKDYRIMRQRVHAILWLKELEEEEEKKLGHPLDDS+ELLLD  PEFFKSHDREFHVASLPYKPDFKVMPEGWDGTTRDLDEVHYEISQKEDEMLYK
Subjt:  VEKLAKDYRIMRQRVHAILWLKELEEEEEKKLGHPLDDSVELLLDTCPEFFKSHDREFHVASLPYKPDFKVMPEGWDGTTRDLDEVHYEISQKEDEMLYK

Query:  EFVEKMNFNKKKIAGEVYRHKYSRRRSSDGWKFTIEKMGPRGKRGGGGGWKFVSLPDGSSRPLNEMEKMYVRRETPRHRRKILP
        EFVE++NFNKKK+AGEV+ HKYSRRR++DGWKFT+EKMGPRGKRG GGGWKFVSLPDGSSRPLNEMEKMYVRRETPRHRRKILP
Subjt:  EFVEKMNFNKKKIAGEVYRHKYSRRRSSDGWKFTIEKMGPRGKRGGGGGWKFVSLPDGSSRPLNEMEKMYVRRETPRHRRKILP

A0A6J1DIH3 uncharacterized protein LOC1110213869.6e-20191.93Show/hide
Query:  MQNLHHLICRLSSTSIGKSTNTSRLLKENVK------SVTTLKQAQGAWLTNLRDFSAKSGGFDEGDPKNEWDKNVRESFSGTTSDDLGWDSVFSWSTGL
        MQNLHHLI RLSSTS+GKSTNTSRLLK+NVK      SVTTLK  QGAWLTNLR+FSAKSGGFDEGD KNEWDK+V +SFSGTTSDDLGWDSV SWSTGL
Subjt:  MQNLHHLICRLSSTSIGKSTNTSRLLKENVK------SVTTLKQAQGAWLTNLRDFSAKSGGFDEGDPKNEWDKNVRESFSGTTSDDLGWDSVFSWSTGL

Query:  TKEHFDGEAVGRRTSGGGDSPKSPQSSLLSGLQEIEDKLRELEAENRKSKGFVDKWGERMREMSTLLKQVREPGARGSYLKDSEKAEMYRLHKENPEVFT
        TKEHFDGEAVGRRTSGGGDSPKSPQSSL+SGLQEIED++RELEAENRKSKGFVD+WGERMRE+S LLKQVREPGARG+YLKDSEKAEMYRLHKENPEV+T
Subjt:  TKEHFDGEAVGRRTSGGGDSPKSPQSSLLSGLQEIEDKLRELEAENRKSKGFVDKWGERMREMSTLLKQVREPGARGSYLKDSEKAEMYRLHKENPEVFT

Query:  VEKLAKDYRIMRQRVHAILWLKELEEEEEKKLGHPLDDSVELLLDTCPEFFKSHDREFHVASLPYKPDFKVMPEGWDGTTRDLDEVHYEISQKEDEMLYK
        V+KLAKDYRIMRQRVHAILWLKELEEEEEKKLGHPLDDSVELLLDTCPEFFKSHDREFHVASLPYKPDFKVMPEGWDGTTRDLDEVHYEISQ ED+MLYK
Subjt:  VEKLAKDYRIMRQRVHAILWLKELEEEEEKKLGHPLDDSVELLLDTCPEFFKSHDREFHVASLPYKPDFKVMPEGWDGTTRDLDEVHYEISQKEDEMLYK

Query:  EFVEKMNFNKKKIAGEVYRHKYSRRRSSDGWKFTIEKMGPRGKRGGGGGWKFVSLPDGSSRPLNEMEKMYVRRETPRHRRKILP
        EFVEKMNFNKKKIAGEV+RHKYSRRR SDGWKFT+EKMGPRGKRGGGGGWKFVSLPDGSSRPLNEMEKMYVRRETPRHRRKILP
Subjt:  EFVEKMNFNKKKIAGEVYRHKYSRRRSSDGWKFTIEKMGPRGKRGGGGGWKFVSLPDGSSRPLNEMEKMYVRRETPRHRRKILP

A0A6J1E998 uncharacterized protein LOC1114310661.0e-18687.83Show/hide
Query:  MQNLHHLICRLSSTSIGKSTNTSRLLKENVKSVTTLKQAQGAWLTNLRDFSAKSGGFDEGDPKNEWDKNVRESFSGTTSDDLGWDSVFSWSTGLTKEHFD
        MQNLHH ICRLSS S+GKSTN    L  +  SV+TLK  QGAWLT LR+FSAKSGGFDE + KNEWDK+V ESFSGTT+DDLGWDSV SWSTGLTKEHFD
Subjt:  MQNLHHLICRLSSTSIGKSTNTSRLLKENVKSVTTLKQAQGAWLTNLRDFSAKSGGFDEGDPKNEWDKNVRESFSGTTSDDLGWDSVFSWSTGLTKEHFD

Query:  GEAVGRRTSGGGDSPKSPQSSLLSGLQEIEDKLRELEAENRKSKGFVDKWGERMREMSTLLKQVREPGARGSYLKDSEKAEMYRLHKENPEVFTVEKLAK
        GEAVGRR   G DSPKSPQ+SL+SGLQE ED++RELEAENRKSK FVDKWGERM+EMS LLKQVREPGARGSYLKDSEKAEMYRLHKENPEV+TVEKLAK
Subjt:  GEAVGRRTSGGGDSPKSPQSSLLSGLQEIEDKLRELEAENRKSKGFVDKWGERMREMSTLLKQVREPGARGSYLKDSEKAEMYRLHKENPEVFTVEKLAK

Query:  DYRIMRQRVHAILWLKELEEEEEKKLGHPLDDSVELLLDTCPEFFKSHDREFHVASLPYKPDFKVMPEGWDGTTRDLDEVHYEISQKEDEMLYKEFVEKM
        DYRIMRQRVHAILWLKELEEEEEKKLG PLDDSVELLLDT PEFFKSHD EFHVASLPYKPDFKVMPEGWDGTTRDLDEVHYEIS+KED+MLYKEFVEKM
Subjt:  DYRIMRQRVHAILWLKELEEEEEKKLGHPLDDSVELLLDTCPEFFKSHDREFHVASLPYKPDFKVMPEGWDGTTRDLDEVHYEISQKEDEMLYKEFVEKM

Query:  NFNKKKIAGEVYRHKYSRRRSSDGWKFTIEKMGPRGKRGGGGGWKFVSLPDGSSRPLNEMEKMYVRRETPRHRRKILP
        NFNKKKIAGEV+RHKYSRRR++DGWKFTIEKMGPRGKRGG GGWKFVSLPDGSSRPLNEMEKMYVRRETPRHRRKILP
Subjt:  NFNKKKIAGEVYRHKYSRRRSSDGWKFTIEKMGPRGKRGGGGGWKFVSLPDGSSRPLNEMEKMYVRRETPRHRRKILP

A0A6J1HMP6 uncharacterized protein LOC1114650503.1e-19189.42Show/hide
Query:  MQNLHHLICRLSSTSIGKSTNTSRLLKENVKSVTTLKQAQGAWLTNLRDFSAKSGGFDEGDPKNEWDKNVRESFSGTTSDDLGWDSVFSWSTGLTKEHFD
        MQNLHH ICRLSSTS+GKSTN    L  +  SV+TLK  QGAWLT LR+FSAKSGGFDE + KNEWDK+V ESFSGTTSDDLGWDSV SWSTGLTKEHFD
Subjt:  MQNLHHLICRLSSTSIGKSTNTSRLLKENVKSVTTLKQAQGAWLTNLRDFSAKSGGFDEGDPKNEWDKNVRESFSGTTSDDLGWDSVFSWSTGLTKEHFD

Query:  GEAVGRRTSGGGDSPKSPQSSLLSGLQEIEDKLRELEAENRKSKGFVDKWGERMREMSTLLKQVREPGARGSYLKDSEKAEMYRLHKENPEVFTVEKLAK
        GEAVGRR   GGDSPKSPQSSL+SGLQE ED++RELEAENRKSK FVDKWGERM+EMS LLKQVREPGARGSYLKDSEKAEMYRLHKENPEV+TVEKLAK
Subjt:  GEAVGRRTSGGGDSPKSPQSSLLSGLQEIEDKLRELEAENRKSKGFVDKWGERMREMSTLLKQVREPGARGSYLKDSEKAEMYRLHKENPEVFTVEKLAK

Query:  DYRIMRQRVHAILWLKELEEEEEKKLGHPLDDSVELLLDTCPEFFKSHDREFHVASLPYKPDFKVMPEGWDGTTRDLDEVHYEISQKEDEMLYKEFVEKM
        DYRIMRQRVHAILWLKELEEEEEKKLGHPLDDSVELLLDTCPEFFKSHD EFHVASLPYKPDFKVMPEGWDGTTRDLDEVHYEIS+KED+MLYKEFVEKM
Subjt:  DYRIMRQRVHAILWLKELEEEEEKKLGHPLDDSVELLLDTCPEFFKSHDREFHVASLPYKPDFKVMPEGWDGTTRDLDEVHYEISQKEDEMLYKEFVEKM

Query:  NFNKKKIAGEVYRHKYSRRRSSDGWKFTIEKMGPRGKRGGGGGWKFVSLPDGSSRPLNEMEKMYVRRETPRHRRKILP
        NFNKKKIAGEV+RHKYSRRR++DGWKFTIEKMGPRGKRGG GGWKFVSLPDGSSRPLNEMEKMYVRRETPRHRRKILP
Subjt:  NFNKKKIAGEVYRHKYSRRRSSDGWKFTIEKMGPRGKRGGGGGWKFVSLPDGSSRPLNEMEKMYVRRETPRHRRKILP

SwissProt top hitse value%identityAlignment
A2WW22 Protein GAMETE CELL DEFECTIVE 1, mitochondrial2.0e-11870.61Show/hide
Query:  WDSVFSWSTGLTKEHFDGE--AVGRRTSGGGDSPKSPQSSLLSGLQEIEDKLRELEAENRKSKGFVDKWGERMREMSTLLKQVREPGARGSYLKDSEKAE
        W S  SWSTG+TKEHFDG   AVGR  +     P SP+ + +  + E ++  R +E +NR++K +VD WG+RMRE   LLKQVREPG+RGSYLKDSEK E
Subjt:  WDSVFSWSTGLTKEHFDGE--AVGRRTSGGGDSPKSPQSSLLSGLQEIEDKLRELEAENRKSKGFVDKWGERMREMSTLLKQVREPGARGSYLKDSEKAE

Query:  MYRLHKENPEVFTVEKLAKDYRIMRQRVHAILWLKELEEEEEKKLGHPLDDSVELLLDTCPEFFKSHDREFHVASLPYKPDFKVMPEGWDGTTRDLDEVH
        MYRLHKE+PE +TVE+LAKD+R+MRQRVHAILWLKE+EEEEE+KLG PLDDSVE+LLD+CPEFF SHDREFHVASLPYKPDFKVMPEGWDGTTRD DEV 
Subjt:  MYRLHKENPEVFTVEKLAKDYRIMRQRVHAILWLKELEEEEEKKLGHPLDDSVELLLDTCPEFFKSHDREFHVASLPYKPDFKVMPEGWDGTTRDLDEVH

Query:  YEISQKEDEMLYKEFVEKMNFNKKKIAGEVYRHKYSRRRSSDGWKFTIEKMGPRGKRGGGGGWKFVSLPDGSSRPLNEMEKMYVRRETPRHRRKIL
        YEIS KED+MLY+EFV+++ FNKKK+AGEV  HKYSRRR  DGW + +EK+G + KRG GGGWKF SLPDGSSRPLN+MEKMYV+RETP+ RR+I+
Subjt:  YEISQKEDEMLYKEFVEKMNFNKKKIAGEVYRHKYSRRRSSDGWKFTIEKMGPRGKRGGGGGWKFVSLPDGSSRPLNEMEKMYVRRETPRHRRKIL

Q8S2G4 Protein GAMETE CELL DEFECTIVE 1, mitochondrial1.5e-11870.61Show/hide
Query:  WDSVFSWSTGLTKEHFDGE--AVGRRTSGGGDSPKSPQSSLLSGLQEIEDKLRELEAENRKSKGFVDKWGERMREMSTLLKQVREPGARGSYLKDSEKAE
        W S  SWSTG+TKEHFDG   AVGR  +     P SP+ + +  + E ++  R +E +NR++K +VD WG+RMRE   LLKQVREPG+RGSYLKDSEK E
Subjt:  WDSVFSWSTGLTKEHFDGE--AVGRRTSGGGDSPKSPQSSLLSGLQEIEDKLRELEAENRKSKGFVDKWGERMREMSTLLKQVREPGARGSYLKDSEKAE

Query:  MYRLHKENPEVFTVEKLAKDYRIMRQRVHAILWLKELEEEEEKKLGHPLDDSVELLLDTCPEFFKSHDREFHVASLPYKPDFKVMPEGWDGTTRDLDEVH
        MYRLHKE+PE +TVE+LAKD+R+MRQRVHAILWLKE+EEEEE+KLG PLDDSVE+LLD+CPEFF SHDREFHVASLPYKPDFKVMPEGWDGTTRD DEV 
Subjt:  MYRLHKENPEVFTVEKLAKDYRIMRQRVHAILWLKELEEEEEKKLGHPLDDSVELLLDTCPEFFKSHDREFHVASLPYKPDFKVMPEGWDGTTRDLDEVH

Query:  YEISQKEDEMLYKEFVEKMNFNKKKIAGEVYRHKYSRRRSSDGWKFTIEKMGPRGKRGGGGGWKFVSLPDGSSRPLNEMEKMYVRRETPRHRRKIL
        YEIS KED+MLY+EFV+++ FNKKK+AGEV  HKYSRRR  DGW + +EK+G + KRG GGGWKF SLPDGSSRPLN+MEKMYV+RETP+ RR+I+
Subjt:  YEISQKEDEMLYKEFVEKMNFNKKKIAGEVYRHKYSRRRSSDGWKFTIEKMGPRGKRGGGGGWKFVSLPDGSSRPLNEMEKMYVRRETPRHRRKIL

Q9LVA9 Protein GAMETE CELL DEFECTIVE 1, mitochondrial7.5e-12662.66Show/hide
Query:  MQNLHHLICRLSSTSIGKSTNTSRLLKENVKSVTTLKQAQGAWLTNLRDFSAKSGGFDEGDPKNEWDKNVRESFSGTTSDDLGWDSVFSWSTGLTKEHFD
        M NL  +I R SS S+  ST  S  L EN  S    K  Q A  TN R FSAKSG    G   N W+ +   SF GT S DL WD+   WSTGLTKEHFD
Subjt:  MQNLHHLICRLSSTSIGKSTNTSRLLKENVKSVTTLKQAQGAWLTNLRDFSAKSGGFDEGDPKNEWDKNVRESFSGTTSDDLGWDSVFSWSTGLTKEHFD

Query:  GEAVGRRTSGGGDSPKSPQS--------------SLLSGLQEIEDKLRELEAENRKSKGFVDKWGERMREMSTLLKQVREPGARGSYLKDSEKAEMYRLH
        G +VGR+ +    S  +  S              +L++ + E +D L+E+E +NR+ + FVD   +RM E+S LLKQV+EPGARGSYLKDSEK EMYRLH
Subjt:  GEAVGRRTSGGGDSPKSPQS--------------SLLSGLQEIEDKLRELEAENRKSKGFVDKWGERMREMSTLLKQVREPGARGSYLKDSEKAEMYRLH

Query:  KENPEVFTVEKLAKDYRIMRQRVHAILWLKELEEEEEKKLGHPLDDSVELLLDTCPEFFKSHDREFHVASLPYKPDFKVMPEGWDGTTRDLDEVHYEISQ
        KENPEV+T+E+LAKDYRIMRQRVHAIL+LKE EEEEE+KLG PLDDSV+ LLD  PEFF SHDREFHVASL YKPDFKVMPEGWDGT +D+DEVHYEIS+
Subjt:  KENPEVFTVEKLAKDYRIMRQRVHAILWLKELEEEEEKKLGHPLDDSVELLLDTCPEFFKSHDREFHVASLPYKPDFKVMPEGWDGTTRDLDEVHYEISQ

Query:  KEDEMLYKEFVEKMNFNKKKIAGEVYRHKYSRRRSSDGWKFTIEKMGPRGKRGGGGGWKFVSLPDGSSRPLNEMEKMYVRRETPRHRRKIL
        KED+MLY+EFV +  FNK K  GEV  HKYSRRRSS+GWK T+EK+G +GKRG GGGWKF+SLPDGSSRPLNEMEK+YV+RETP  RR I+
Subjt:  KEDEMLYKEFVEKMNFNKKKIAGEVYRHKYSRRRSSDGWKFTIEKMGPRGKRGGGGGWKFVSLPDGSSRPLNEMEKMYVRRETPRHRRKIL

Arabidopsis top hitse value%identityAlignment
AT2G02880.1 mucin-related8.7e-2933.19Show/hide
Query:  EDKLRELEAENRKSKGFVDKWGERMREMSTLLKQVREPGARGSYLKDSEKAEMYRLHKENPEVFTVEKLAKDYRIMRQRVHAILWLKELEEEEEKKL---
        E     +E  + ++K FV+   E   E      +V +   +   + D E        + +  ++++E + KDYR+ +QRVHA LW+KE+E+ EE KL   
Subjt:  EDKLRELEAENRKSKGFVDKWGERMREMSTLLKQVREPGARGSYLKDSEKAEMYRLHKENPEVFTVEKLAKDYRIMRQRVHAILWLKELEEEEEKKL---

Query:  --GHPLDDSVELLLDTCPEFFKSHDREFHVASLPYKPDFKVMPEGWDGTTRDLDEVHYEISQKEDEMLYKEFVEKMNFNKKKIAGEVYRHKYSRRRSSDG
          G   DD ++ LLD+C E F S D +F    +    + K  P+GW+ T ++ D   +E+SQ+E+++L +EF  +  F K +IA  + +H +SRRR  DG
Subjt:  --GHPLDDSVELLLDTCPEFFKSHDREFHVASLPYKPDFKVMPEGWDGTTRDLDEVHYEISQKEDEMLYKEFVEKMNFNKKKIAGEVYRHKYSRRRSSDG

Query:  WKFTIEKMGPRGKRGGGGGWKFVSLPDGSSRPLNE
        WK+ IE +GP  ++G G   +  +L D S++P  E
Subjt:  WKFTIEKMGPRGKRGGGGGWKFVSLPDGSSRPLNE

AT5G62270.1 BEST Arabidopsis thaliana protein match is: mucin-related (TAIR:AT2G02880.1)2.4e-12763.08Show/hide
Query:  MQNLHHLICRLSSTSIGKSTNTSRLLKENVKSVTTLKQAQGAWLTNLRDFSAKSGGFDEGDPKNEWDKNVRESFSGTTSDDLGWDSVFSWSTGLTKEHFD
        M NL  +I R SS S+  ST  S  L EN  S    K  Q A  TN R FSAKSG    G   N W+ +   SF GT S DL WD+   WSTGLTKEHFD
Subjt:  MQNLHHLICRLSSTSIGKSTNTSRLLKENVKSVTTLKQAQGAWLTNLRDFSAKSGGFDEGDPKNEWDKNVRESFSGTTSDDLGWDSVFSWSTGLTKEHFD

Query:  GEAVGRRTSGGGDSPKSPQS--------------SLLSGLQEIEDKLRELEAENRKSKGFVDKWGERMREMSTLLKQVREPGARGSYLKDSEKAEMYRLH
        G +VGR+ +    S  +  S              +L++ + E +D L+E+E +NR+ + FVD   +RM E+S LLKQV+EPGARGSYLKDSEK EMYRLH
Subjt:  GEAVGRRTSGGGDSPKSPQS--------------SLLSGLQEIEDKLRELEAENRKSKGFVDKWGERMREMSTLLKQVREPGARGSYLKDSEKAEMYRLH

Query:  KENPEVFTVEKLAKDYRIMRQRVHAILWLKELEEEEEKKLGHPLDDSVELLLDTCPEFFKSHDREFHVASLPYKPDFKVMPEGWDGTTRDLDEVHYEISQ
        KENPEV+T+E+LAKDYRIMRQRVHAIL+LKE EEEEE+KLG PLDDSV+ LLD  PEFF SHDREFHVASL YKPDFKVMPEGWDGT +D+DEVHYEIS+
Subjt:  KENPEVFTVEKLAKDYRIMRQRVHAILWLKELEEEEEKKLGHPLDDSVELLLDTCPEFFKSHDREFHVASLPYKPDFKVMPEGWDGTTRDLDEVHYEISQ

Query:  KEDEMLYKEFVEKMNFNKKKIAGEVYRHKYSRRRSSDGWKFTIEKMGPRGKRGGGGGWKFVSLPDGSSRPLNEMEKMYVRRETPRHRRKI
        KED+MLY+EFV +  FNK K  GEV  HKYSRRRSS+GWK T+EK+G +GKRG GGGWKF+SLPDGSSRPLNEMEK+YV+RETP  RRKI
Subjt:  KEDEMLYKEFVEKMNFNKKKIAGEVYRHKYSRRRSSDGWKFTIEKMGPRGKRGGGGGWKFVSLPDGSSRPLNEMEKMYVRRETPRHRRKI

AT5G62270.2 FUNCTIONS IN: molecular_function unknown5.3e-12762.66Show/hide
Query:  MQNLHHLICRLSSTSIGKSTNTSRLLKENVKSVTTLKQAQGAWLTNLRDFSAKSGGFDEGDPKNEWDKNVRESFSGTTSDDLGWDSVFSWSTGLTKEHFD
        M NL  +I R SS S+  ST  S  L EN  S    K  Q A  TN R FSAKSG    G   N W+ +   SF GT S DL WD+   WSTGLTKEHFD
Subjt:  MQNLHHLICRLSSTSIGKSTNTSRLLKENVKSVTTLKQAQGAWLTNLRDFSAKSGGFDEGDPKNEWDKNVRESFSGTTSDDLGWDSVFSWSTGLTKEHFD

Query:  GEAVGRRTSGGGDSPKSPQS--------------SLLSGLQEIEDKLRELEAENRKSKGFVDKWGERMREMSTLLKQVREPGARGSYLKDSEKAEMYRLH
        G +VGR+ +    S  +  S              +L++ + E +D L+E+E +NR+ + FVD   +RM E+S LLKQV+EPGARGSYLKDSEK EMYRLH
Subjt:  GEAVGRRTSGGGDSPKSPQS--------------SLLSGLQEIEDKLRELEAENRKSKGFVDKWGERMREMSTLLKQVREPGARGSYLKDSEKAEMYRLH

Query:  KENPEVFTVEKLAKDYRIMRQRVHAILWLKELEEEEEKKLGHPLDDSVELLLDTCPEFFKSHDREFHVASLPYKPDFKVMPEGWDGTTRDLDEVHYEISQ
        KENPEV+T+E+LAKDYRIMRQRVHAIL+LKE EEEEE+KLG PLDDSV+ LLD  PEFF SHDREFHVASL YKPDFKVMPEGWDGT +D+DEVHYEIS+
Subjt:  KENPEVFTVEKLAKDYRIMRQRVHAILWLKELEEEEEKKLGHPLDDSVELLLDTCPEFFKSHDREFHVASLPYKPDFKVMPEGWDGTTRDLDEVHYEISQ

Query:  KEDEMLYKEFVEKMNFNKKKIAGEVYRHKYSRRRSSDGWKFTIEKMGPRGKRGGGGGWKFVSLPDGSSRPLNEMEKMYVRRETPRHRRKIL
        KED+MLY+EFV +  FNK K  GEV  HKYSRRRSS+GWK T+EK+G +GKRG GGGWKF+SLPDGSSRPLNEMEK+YV+RETP  RR I+
Subjt:  KEDEMLYKEFVEKMNFNKKKIAGEVYRHKYSRRRSSDGWKFTIEKMGPRGKRGGGGGWKFVSLPDGSSRPLNEMEKMYVRRETPRHRRKIL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAAAATCTGCATCATTTAATTTGTCGTCTCAGTTCCACTTCTATTGGAAAAAGCACGAATACTTCGAGACTTCTGAAAGAAAATGTGAAATCAGTAACTACATTGAA
GCAAGCTCAAGGAGCTTGGTTAACCAATTTGAGAGATTTCTCTGCAAAATCTGGTGGATTTGATGAAGGTGATCCTAAGAATGAATGGGATAAGAATGTGAGAGAATCGT
TTTCTGGCACCACGTCAGATGATTTAGGTTGGGATTCAGTTTTCTCTTGGTCTACTGGATTGACCAAAGAGCATTTTGATGGAGAGGCTGTGGGCCGCAGGACTAGTGGA
GGGGGTGATTCACCAAAATCACCCCAGTCTTCATTACTTTCCGGATTGCAAGAGATTGAGGACAAATTAAGGGAATTAGAGGCGGAAAACCGGAAAAGTAAGGGCTTTGT
GGACAAGTGGGGTGAAAGGATGAGGGAGATGAGCACTCTTTTGAAACAAGTAAGAGAGCCTGGTGCTAGAGGGTCTTATCTCAAGGACTCAGAGAAGGCTGAGATGTATC
GCTTGCACAAGGAAAACCCTGAGGTATTCACTGTCGAGAAGCTTGCTAAAGATTACAGAATTATGAGGCAAAGGGTTCATGCCATTCTTTGGCTGAAAGAGCTTGAAGAG
GAAGAGGAGAAAAAACTGGGCCACCCCTTGGATGATTCTGTTGAGCTTTTACTCGATACTTGCCCTGAATTCTTTAAATCCCATGACCGGGAATTCCATGTGGCATCCCT
TCCTTACAAACCTGATTTCAAGGTCATGCCGGAGGGTTGGGATGGTACAACCAGAGATTTGGATGAAGTGCATTACGAGATCTCCCAAAAAGAAGATGAGATGCTATATA
AAGAATTCGTTGAGAAGATGAATTTCAACAAAAAGAAGATTGCAGGAGAGGTTTATCGCCACAAATATAGTAGGCGTCGGTCATCAGATGGGTGGAAATTCACTATAGAG
AAAATGGGACCTCGAGGGAAAAGGGGAGGTGGTGGTGGATGGAAGTTCGTCAGCTTGCCCGATGGTTCTAGCCGACCGTTGAACGAAATGGAGAAGATGTATGTGAGGCG
AGAGACACCTCGCCATCGACGTAAAATCCTTCCATGA
mRNA sequenceShow/hide mRNA sequence
ATGCAAAATCTGCATCATTTAATTTGTCGTCTCAGTTCCACTTCTATTGGAAAAAGCACGAATACTTCGAGACTTCTGAAAGAAAATGTGAAATCAGTAACTACATTGAA
GCAAGCTCAAGGAGCTTGGTTAACCAATTTGAGAGATTTCTCTGCAAAATCTGGTGGATTTGATGAAGGTGATCCTAAGAATGAATGGGATAAGAATGTGAGAGAATCGT
TTTCTGGCACCACGTCAGATGATTTAGGTTGGGATTCAGTTTTCTCTTGGTCTACTGGATTGACCAAAGAGCATTTTGATGGAGAGGCTGTGGGCCGCAGGACTAGTGGA
GGGGGTGATTCACCAAAATCACCCCAGTCTTCATTACTTTCCGGATTGCAAGAGATTGAGGACAAATTAAGGGAATTAGAGGCGGAAAACCGGAAAAGTAAGGGCTTTGT
GGACAAGTGGGGTGAAAGGATGAGGGAGATGAGCACTCTTTTGAAACAAGTAAGAGAGCCTGGTGCTAGAGGGTCTTATCTCAAGGACTCAGAGAAGGCTGAGATGTATC
GCTTGCACAAGGAAAACCCTGAGGTATTCACTGTCGAGAAGCTTGCTAAAGATTACAGAATTATGAGGCAAAGGGTTCATGCCATTCTTTGGCTGAAAGAGCTTGAAGAG
GAAGAGGAGAAAAAACTGGGCCACCCCTTGGATGATTCTGTTGAGCTTTTACTCGATACTTGCCCTGAATTCTTTAAATCCCATGACCGGGAATTCCATGTGGCATCCCT
TCCTTACAAACCTGATTTCAAGGTCATGCCGGAGGGTTGGGATGGTACAACCAGAGATTTGGATGAAGTGCATTACGAGATCTCCCAAAAAGAAGATGAGATGCTATATA
AAGAATTCGTTGAGAAGATGAATTTCAACAAAAAGAAGATTGCAGGAGAGGTTTATCGCCACAAATATAGTAGGCGTCGGTCATCAGATGGGTGGAAATTCACTATAGAG
AAAATGGGACCTCGAGGGAAAAGGGGAGGTGGTGGTGGATGGAAGTTCGTCAGCTTGCCCGATGGTTCTAGCCGACCGTTGAACGAAATGGAGAAGATGTATGTGAGGCG
AGAGACACCTCGCCATCGACGTAAAATCCTTCCATGA
Protein sequenceShow/hide protein sequence
MQNLHHLICRLSSTSIGKSTNTSRLLKENVKSVTTLKQAQGAWLTNLRDFSAKSGGFDEGDPKNEWDKNVRESFSGTTSDDLGWDSVFSWSTGLTKEHFDGEAVGRRTSG
GGDSPKSPQSSLLSGLQEIEDKLRELEAENRKSKGFVDKWGERMREMSTLLKQVREPGARGSYLKDSEKAEMYRLHKENPEVFTVEKLAKDYRIMRQRVHAILWLKELEE
EEEKKLGHPLDDSVELLLDTCPEFFKSHDREFHVASLPYKPDFKVMPEGWDGTTRDLDEVHYEISQKEDEMLYKEFVEKMNFNKKKIAGEVYRHKYSRRRSSDGWKFTIE
KMGPRGKRGGGGGWKFVSLPDGSSRPLNEMEKMYVRRETPRHRRKILP