; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0004422 (gene) of Snake gourd v1 genome

Gene IDTan0004422
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionProtein GAMETE CELL DEFECTIVE 1, mitochondrial
Genome locationLG08:24995126..24999359
RNA-Seq ExpressionTan0004422
SyntenyTan0004422
Gene Ontology termsGO:0007006 - mitochondrial membrane organization (biological process)
GO:0007033 - vacuole organization (biological process)
GO:0007154 - cell communication (biological process)
GO:0009555 - pollen development (biological process)
GO:0009793 - embryo development ending in seed dormancy (biological process)
GO:0009846 - pollen germination (biological process)
GO:0009960 - endosperm development (biological process)
GO:0010342 - endosperm cellularization (biological process)
GO:0010468 - regulation of gene expression (biological process)
GO:0010581 - regulation of starch biosynthetic process (biological process)
GO:0043067 - regulation of programmed cell death (biological process)
GO:0048868 - pollen tube development (biological process)
GO:0051647 - nucleus localization (biological process)
GO:0005739 - mitochondrion (cellular component)
GO:0000287 - magnesium ion binding (molecular function)
GO:0010333 - terpene synthase activity (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TYK12013.1 uncharacterized protein E5676_scaffold1017G00220 [Cucumis melo var. makuwa]5.8e-19288.54Show/hide
Query:  MQNLHHLICRLSSTSIGKSTNTSRLLKENVGSNLIIDSVSTLKHAQGAWLTTLREFSAKSGGFDEGDAKNEWDKSVSD--AGTMSDDLGWDSVSSWSTGL
        MQNLH+LI RLSSTS+GKSTNTSRLLK+NVG NL++DSVSTLKH QGAWLTTLREFSAKSGGF  GD KNEWDKSVS+   G  SDDLGWDSVSSWSTGL
Subjt:  MQNLHHLICRLSSTSIGKSTNTSRLLKENVGSNLIIDSVSTLKHAQGAWLTTLREFSAKSGGFDEGDAKNEWDKSVSD--AGTMSDDLGWDSVSSWSTGL

Query:  TKEHFDGEAVGRRTSGGGDSPKSPQSSLVSGLQEIEDRIRELEAENRKSKDFVDMWGERMREMSVLLKQVREPGARGSYLKDSEKAEMYRLHKENPEIYT
        TKEHFDGEAVGR+ SGGG S +SPQSS+VSGLQE ED +RELEAENRKSK +V+ WGERMREMS+LLKQV+EPGARGSYLKDSEKAEMYRLHKENPE+YT
Subjt:  TKEHFDGEAVGRRTSGGGDSPKSPQSSLVSGLQEIEDRIRELEAENRKSKDFVDMWGERMREMSVLLKQVREPGARGSYLKDSEKAEMYRLHKENPEIYT

Query:  VEKLAEDYRIMRQRVHAILWLKELEEEEEKKLGHPLDDSVELLLDTCPEFFKSHDREFHVASLPYKPDFKVMPEGWDGTTRDLDEVHYEISQKEDDMLYK
        +EKLA+DYRIMRQRVHAILWLKELEEEEEKKLGHPLDDS+ELLLD  PEFFKSHDREFHVASLPYKPDFKVMPEGWDGTTRDLDEVHYEISQKED+MLYK
Subjt:  VEKLAEDYRIMRQRVHAILWLKELEEEEEKKLGHPLDDSVELLLDTCPEFFKSHDREFHVASLPYKPDFKVMPEGWDGTTRDLDEVHYEISQKEDDMLYK

Query:  EFVEKMNFNKKKIAGEVFRHKYSRRRAADGWKFTIEKMGPRGKRGSGGGWKFVSLPDGSSRPLNEMEKMYVRRETPRHRRKILP
        EFVE++NFNKKK+AGEVF HKYSRRRAADGWKFT+EKMGPRGKRGSGGGWKFVSLPDGSSRPLNEMEKMYVRRETPRHRRKILP
Subjt:  EFVEKMNFNKKKIAGEVFRHKYSRRRAADGWKFTIEKMGPRGKRGSGGGWKFVSLPDGSSRPLNEMEKMYVRRETPRHRRKILP

XP_022154030.1 uncharacterized protein LOC111021386 [Momordica charantia]4.8e-20292.97Show/hide
Query:  MQNLHHLICRLSSTSIGKSTNTSRLLKENVGSNLIIDSVSTLKHAQGAWLTTLREFSAKSGGFDEGDAKNEWDKSVSD--AGTMSDDLGWDSVSSWSTGL
        MQNLHHLI RLSSTS+GKSTNTSRLLK+NV S+LI DSV+TLKH QGAWLT LREFSAKSGGFDEGDAKNEWDKSVS   +GT SDDLGWDSVSSWSTGL
Subjt:  MQNLHHLICRLSSTSIGKSTNTSRLLKENVGSNLIIDSVSTLKHAQGAWLTTLREFSAKSGGFDEGDAKNEWDKSVSD--AGTMSDDLGWDSVSSWSTGL

Query:  TKEHFDGEAVGRRTSGGGDSPKSPQSSLVSGLQEIEDRIRELEAENRKSKDFVDMWGERMREMSVLLKQVREPGARGSYLKDSEKAEMYRLHKENPEIYT
        TKEHFDGEAVGRRTSGGGDSPKSPQSSLVSGLQEIEDRIRELEAENRKSK FVD WGERMRE+S+LLKQVREPGARG+YLKDSEKAEMYRLHKENPE+YT
Subjt:  TKEHFDGEAVGRRTSGGGDSPKSPQSSLVSGLQEIEDRIRELEAENRKSKDFVDMWGERMREMSVLLKQVREPGARGSYLKDSEKAEMYRLHKENPEIYT

Query:  VEKLAEDYRIMRQRVHAILWLKELEEEEEKKLGHPLDDSVELLLDTCPEFFKSHDREFHVASLPYKPDFKVMPEGWDGTTRDLDEVHYEISQKEDDMLYK
        V+KLA+DYRIMRQRVHAILWLKELEEEEEKKLGHPLDDSVELLLDTCPEFFKSHDREFHVASLPYKPDFKVMPEGWDGTTRDLDEVHYEISQ EDDMLYK
Subjt:  VEKLAEDYRIMRQRVHAILWLKELEEEEEKKLGHPLDDSVELLLDTCPEFFKSHDREFHVASLPYKPDFKVMPEGWDGTTRDLDEVHYEISQKEDDMLYK

Query:  EFVEKMNFNKKKIAGEVFRHKYSRRRAADGWKFTIEKMGPRGKRGSGGGWKFVSLPDGSSRPLNEMEKMYVRRETPRHRRKILP
        EFVEKMNFNKKKIAGEVFRHKYSRRR +DGWKFT+EKMGPRGKRG GGGWKFVSLPDGSSRPLNEMEKMYVRRETPRHRRKILP
Subjt:  EFVEKMNFNKKKIAGEVFRHKYSRRRAADGWKFTIEKMGPRGKRGSGGGWKFVSLPDGSSRPLNEMEKMYVRRETPRHRRKILP

XP_022965080.1 uncharacterized protein LOC111465050 [Cucurbita maxima]5.1e-19691.15Show/hide
Query:  MQNLHHLICRLSSTSIGKSTNTSRLLKENVGSNLIIDSVSTLKHAQGAWLTTLREFSAKSGGFDEGDAKNEWDKSVSD--AGTMSDDLGWDSVSSWSTGL
        MQNLHH ICRLSSTS+GKST        NVGS+LI DSVSTLKH QGAWLTTLREFSAKSGGFDE ++KNEWDKSVS+  +GT SDDLGWDSVSSWSTGL
Subjt:  MQNLHHLICRLSSTSIGKSTNTSRLLKENVGSNLIIDSVSTLKHAQGAWLTTLREFSAKSGGFDEGDAKNEWDKSVSD--AGTMSDDLGWDSVSSWSTGL

Query:  TKEHFDGEAVGRRTSGGGDSPKSPQSSLVSGLQEIEDRIRELEAENRKSKDFVDMWGERMREMSVLLKQVREPGARGSYLKDSEKAEMYRLHKENPEIYT
        TKEHFDGEAVGRR   GGDSPKSPQSSLVSGLQE EDRIRELEAENRKSKDFVD WGERM+EMS+LLKQVREPGARGSYLKDSEKAEMYRLHKENPE+YT
Subjt:  TKEHFDGEAVGRRTSGGGDSPKSPQSSLVSGLQEIEDRIRELEAENRKSKDFVDMWGERMREMSVLLKQVREPGARGSYLKDSEKAEMYRLHKENPEIYT

Query:  VEKLAEDYRIMRQRVHAILWLKELEEEEEKKLGHPLDDSVELLLDTCPEFFKSHDREFHVASLPYKPDFKVMPEGWDGTTRDLDEVHYEISQKEDDMLYK
        VEKLA+DYRIMRQRVHAILWLKELEEEEEKKLGHPLDDSVELLLDTCPEFFKSHD EFHVASLPYKPDFKVMPEGWDGTTRDLDEVHYEIS+KEDDMLYK
Subjt:  VEKLAEDYRIMRQRVHAILWLKELEEEEEKKLGHPLDDSVELLLDTCPEFFKSHDREFHVASLPYKPDFKVMPEGWDGTTRDLDEVHYEISQKEDDMLYK

Query:  EFVEKMNFNKKKIAGEVFRHKYSRRRAADGWKFTIEKMGPRGKRGSGGGWKFVSLPDGSSRPLNEMEKMYVRRETPRHRRKILP
        EFVEKMNFNKKKIAGEVFRHKYSRRRAADGWKFTIEKMGPRGKRG  GGWKFVSLPDGSSRPLNEMEKMYVRRETPRHRRKILP
Subjt:  EFVEKMNFNKKKIAGEVFRHKYSRRRAADGWKFTIEKMGPRGKRGSGGGWKFVSLPDGSSRPLNEMEKMYVRRETPRHRRKILP

XP_023553278.1 uncharacterized protein LOC111810742 [Cucurbita pepo subsp. pepo]6.7e-19690.89Show/hide
Query:  MQNLHHLICRLSSTSIGKSTNTSRLLKENVGSNLIIDSVSTLKHAQGAWLTTLREFSAKSGGFDEGDAKNEWDKSVSD--AGTMSDDLGWDSVSSWSTGL
        MQNLHH ICRLSSTS+GKS        +NVGSNLI DSVSTLKH QGAWLTTLREFSAKSGGFDE ++KNEWDKSVS+  +GT SDDLGWDSVSSWSTGL
Subjt:  MQNLHHLICRLSSTSIGKSTNTSRLLKENVGSNLIIDSVSTLKHAQGAWLTTLREFSAKSGGFDEGDAKNEWDKSVSD--AGTMSDDLGWDSVSSWSTGL

Query:  TKEHFDGEAVGRRTSGGGDSPKSPQSSLVSGLQEIEDRIRELEAENRKSKDFVDMWGERMREMSVLLKQVREPGARGSYLKDSEKAEMYRLHKENPEIYT
        TKEHFDGEAVGRR   GGDSPKSPQSSLVSGLQE EDRIRELEAENRKSKDFVD WGERM+EMS+LLKQV+EPGARGSYLKDSEKAEMYRLHKENPE+YT
Subjt:  TKEHFDGEAVGRRTSGGGDSPKSPQSSLVSGLQEIEDRIRELEAENRKSKDFVDMWGERMREMSVLLKQVREPGARGSYLKDSEKAEMYRLHKENPEIYT

Query:  VEKLAEDYRIMRQRVHAILWLKELEEEEEKKLGHPLDDSVELLLDTCPEFFKSHDREFHVASLPYKPDFKVMPEGWDGTTRDLDEVHYEISQKEDDMLYK
        VEKLA+DYRIMRQRVHAILWLKELEEEEEKKLGHPLDDSVELLLDTCPEFFKSHD EFHVASLPYKPDFKVMPEGWDGTTRDLDEVHYEIS+KEDDMLYK
Subjt:  VEKLAEDYRIMRQRVHAILWLKELEEEEEKKLGHPLDDSVELLLDTCPEFFKSHDREFHVASLPYKPDFKVMPEGWDGTTRDLDEVHYEISQKEDDMLYK

Query:  EFVEKMNFNKKKIAGEVFRHKYSRRRAADGWKFTIEKMGPRGKRGSGGGWKFVSLPDGSSRPLNEMEKMYVRRETPRHRRKILP
        EFVEKMNFNKKKIAGEVFRHKYSRRRAADGWKFTIEKMGPRGKRG  GGWKFVSLPDGSSRPLNEMEKMYVRRETPRHRRKILP
Subjt:  EFVEKMNFNKKKIAGEVFRHKYSRRRAADGWKFTIEKMGPRGKRGSGGGWKFVSLPDGSSRPLNEMEKMYVRRETPRHRRKILP

XP_038906006.1 protein GAMETE CELL DEFECTIVE 1, mitochondrial [Benincasa hispida]9.0e-20192.71Show/hide
Query:  MQNLHHLICRLSSTSIGKSTNTSRLLKENVGSNLIIDSVSTLKHAQGAWLTTLREFSAKSGGFDEGDAKNEWDKSVSD--AGTMSDDLGWDSVSSWSTGL
        MQNLHHLICRLSSTS+GK+T TS+LLKENVGS+L+IDSVSTLKHAQGAWLT LREFSAKSGGFD G++KNE DKSVS+  +GT SDD GWDSVSSWSTGL
Subjt:  MQNLHHLICRLSSTSIGKSTNTSRLLKENVGSNLIIDSVSTLKHAQGAWLTTLREFSAKSGGFDEGDAKNEWDKSVSD--AGTMSDDLGWDSVSSWSTGL

Query:  TKEHFDGEAVGRRTSGGGDSPKSPQSSLVSGLQEIEDRIRELEAENRKSKDFVDMWGERMREMSVLLKQVREPGARGSYLKDSEKAEMYRLHKENPEIYT
        TKEHFDGE VG RTSGG DSPKSPQSSLVSGLQEIEDRIRELEAENRKSKDFVD WGERMREMS+LLKQV+EPGARGSYLKDSEKAEMYRLHKENPE+YT
Subjt:  TKEHFDGEAVGRRTSGGGDSPKSPQSSLVSGLQEIEDRIRELEAENRKSKDFVDMWGERMREMSVLLKQVREPGARGSYLKDSEKAEMYRLHKENPEIYT

Query:  VEKLAEDYRIMRQRVHAILWLKELEEEEEKKLGHPLDDSVELLLDTCPEFFKSHDREFHVASLPYKPDFKVMPEGWDGTTRDLDEVHYEISQKEDDMLYK
        VEKLA+DYRIMRQRVHAILWLKELEEEEEKKLGHPLDDSVELLLDTCPEFFKSHDREFHVASLPYKPDFKVMPEGWDGTTRDLDEVHYEISQKEDDMLYK
Subjt:  VEKLAEDYRIMRQRVHAILWLKELEEEEEKKLGHPLDDSVELLLDTCPEFFKSHDREFHVASLPYKPDFKVMPEGWDGTTRDLDEVHYEISQKEDDMLYK

Query:  EFVEKMNFNKKKIAGEVFRHKYSRRRAADGWKFTIEKMGPRGKRGSGGGWKFVSLPDGSSRPLNEMEKMYVRRETPRHRRKILP
        EFVEKMNFNKKKIAGEVF HKYSRRRAADGWKFTIEKMGPRGKRG GGGWKFVSLP+GSSRPLNEMEKMYVRRETPRHRRKILP
Subjt:  EFVEKMNFNKKKIAGEVFRHKYSRRRAADGWKFTIEKMGPRGKRGSGGGWKFVSLPDGSSRPLNEMEKMYVRRETPRHRRKILP

TrEMBL top hitse value%identityAlignment
A0A1S4DXD4 uncharacterized protein LOC1034908962.8e-19288.54Show/hide
Query:  MQNLHHLICRLSSTSIGKSTNTSRLLKENVGSNLIIDSVSTLKHAQGAWLTTLREFSAKSGGFDEGDAKNEWDKSVSD--AGTMSDDLGWDSVSSWSTGL
        MQNLH+LI RLSSTS+GKSTNTSRLLK+NVG NL++DSVSTLKH QGAWLTTLREFSAKSGGF  GD KNEWDKSVS+   G  SDDLGWDSVSSWSTGL
Subjt:  MQNLHHLICRLSSTSIGKSTNTSRLLKENVGSNLIIDSVSTLKHAQGAWLTTLREFSAKSGGFDEGDAKNEWDKSVSD--AGTMSDDLGWDSVSSWSTGL

Query:  TKEHFDGEAVGRRTSGGGDSPKSPQSSLVSGLQEIEDRIRELEAENRKSKDFVDMWGERMREMSVLLKQVREPGARGSYLKDSEKAEMYRLHKENPEIYT
        TKEHFDGEAVGR+ SGGG S +SPQSS+VSGLQE ED +RELEAENRKSK +V+ WGERMREMS+LLKQV+EPGARGSYLKDSEKAEMYRLHKENPE+YT
Subjt:  TKEHFDGEAVGRRTSGGGDSPKSPQSSLVSGLQEIEDRIRELEAENRKSKDFVDMWGERMREMSVLLKQVREPGARGSYLKDSEKAEMYRLHKENPEIYT

Query:  VEKLAEDYRIMRQRVHAILWLKELEEEEEKKLGHPLDDSVELLLDTCPEFFKSHDREFHVASLPYKPDFKVMPEGWDGTTRDLDEVHYEISQKEDDMLYK
        +EKLA+DYRIMRQRVHAILWLKELEEEEEKKLGHPLDDS+ELLLD  PEFFKSHDREFHVASLPYKPDFKVMPEGWDGTTRDLDEVHYEISQKED+MLYK
Subjt:  VEKLAEDYRIMRQRVHAILWLKELEEEEEKKLGHPLDDSVELLLDTCPEFFKSHDREFHVASLPYKPDFKVMPEGWDGTTRDLDEVHYEISQKEDDMLYK

Query:  EFVEKMNFNKKKIAGEVFRHKYSRRRAADGWKFTIEKMGPRGKRGSGGGWKFVSLPDGSSRPLNEMEKMYVRRETPRHRRKILP
        EFVE++NFNKKK+AGEVF HKYSRRRAADGWKFT+EKMGPRGKRGSGGGWKFVSLPDGSSRPLNEMEKMYVRRETPRHRRKILP
Subjt:  EFVEKMNFNKKKIAGEVFRHKYSRRRAADGWKFTIEKMGPRGKRGSGGGWKFVSLPDGSSRPLNEMEKMYVRRETPRHRRKILP

A0A5D3CJ95 Uncharacterized protein2.8e-19288.54Show/hide
Query:  MQNLHHLICRLSSTSIGKSTNTSRLLKENVGSNLIIDSVSTLKHAQGAWLTTLREFSAKSGGFDEGDAKNEWDKSVSD--AGTMSDDLGWDSVSSWSTGL
        MQNLH+LI RLSSTS+GKSTNTSRLLK+NVG NL++DSVSTLKH QGAWLTTLREFSAKSGGF  GD KNEWDKSVS+   G  SDDLGWDSVSSWSTGL
Subjt:  MQNLHHLICRLSSTSIGKSTNTSRLLKENVGSNLIIDSVSTLKHAQGAWLTTLREFSAKSGGFDEGDAKNEWDKSVSD--AGTMSDDLGWDSVSSWSTGL

Query:  TKEHFDGEAVGRRTSGGGDSPKSPQSSLVSGLQEIEDRIRELEAENRKSKDFVDMWGERMREMSVLLKQVREPGARGSYLKDSEKAEMYRLHKENPEIYT
        TKEHFDGEAVGR+ SGGG S +SPQSS+VSGLQE ED +RELEAENRKSK +V+ WGERMREMS+LLKQV+EPGARGSYLKDSEKAEMYRLHKENPE+YT
Subjt:  TKEHFDGEAVGRRTSGGGDSPKSPQSSLVSGLQEIEDRIRELEAENRKSKDFVDMWGERMREMSVLLKQVREPGARGSYLKDSEKAEMYRLHKENPEIYT

Query:  VEKLAEDYRIMRQRVHAILWLKELEEEEEKKLGHPLDDSVELLLDTCPEFFKSHDREFHVASLPYKPDFKVMPEGWDGTTRDLDEVHYEISQKEDDMLYK
        +EKLA+DYRIMRQRVHAILWLKELEEEEEKKLGHPLDDS+ELLLD  PEFFKSHDREFHVASLPYKPDFKVMPEGWDGTTRDLDEVHYEISQKED+MLYK
Subjt:  VEKLAEDYRIMRQRVHAILWLKELEEEEEKKLGHPLDDSVELLLDTCPEFFKSHDREFHVASLPYKPDFKVMPEGWDGTTRDLDEVHYEISQKEDDMLYK

Query:  EFVEKMNFNKKKIAGEVFRHKYSRRRAADGWKFTIEKMGPRGKRGSGGGWKFVSLPDGSSRPLNEMEKMYVRRETPRHRRKILP
        EFVE++NFNKKK+AGEVF HKYSRRRAADGWKFT+EKMGPRGKRGSGGGWKFVSLPDGSSRPLNEMEKMYVRRETPRHRRKILP
Subjt:  EFVEKMNFNKKKIAGEVFRHKYSRRRAADGWKFTIEKMGPRGKRGSGGGWKFVSLPDGSSRPLNEMEKMYVRRETPRHRRKILP

A0A6J1DIH3 uncharacterized protein LOC1110213862.3e-20292.97Show/hide
Query:  MQNLHHLICRLSSTSIGKSTNTSRLLKENVGSNLIIDSVSTLKHAQGAWLTTLREFSAKSGGFDEGDAKNEWDKSVSD--AGTMSDDLGWDSVSSWSTGL
        MQNLHHLI RLSSTS+GKSTNTSRLLK+NV S+LI DSV+TLKH QGAWLT LREFSAKSGGFDEGDAKNEWDKSVS   +GT SDDLGWDSVSSWSTGL
Subjt:  MQNLHHLICRLSSTSIGKSTNTSRLLKENVGSNLIIDSVSTLKHAQGAWLTTLREFSAKSGGFDEGDAKNEWDKSVSD--AGTMSDDLGWDSVSSWSTGL

Query:  TKEHFDGEAVGRRTSGGGDSPKSPQSSLVSGLQEIEDRIRELEAENRKSKDFVDMWGERMREMSVLLKQVREPGARGSYLKDSEKAEMYRLHKENPEIYT
        TKEHFDGEAVGRRTSGGGDSPKSPQSSLVSGLQEIEDRIRELEAENRKSK FVD WGERMRE+S+LLKQVREPGARG+YLKDSEKAEMYRLHKENPE+YT
Subjt:  TKEHFDGEAVGRRTSGGGDSPKSPQSSLVSGLQEIEDRIRELEAENRKSKDFVDMWGERMREMSVLLKQVREPGARGSYLKDSEKAEMYRLHKENPEIYT

Query:  VEKLAEDYRIMRQRVHAILWLKELEEEEEKKLGHPLDDSVELLLDTCPEFFKSHDREFHVASLPYKPDFKVMPEGWDGTTRDLDEVHYEISQKEDDMLYK
        V+KLA+DYRIMRQRVHAILWLKELEEEEEKKLGHPLDDSVELLLDTCPEFFKSHDREFHVASLPYKPDFKVMPEGWDGTTRDLDEVHYEISQ EDDMLYK
Subjt:  VEKLAEDYRIMRQRVHAILWLKELEEEEEKKLGHPLDDSVELLLDTCPEFFKSHDREFHVASLPYKPDFKVMPEGWDGTTRDLDEVHYEISQKEDDMLYK

Query:  EFVEKMNFNKKKIAGEVFRHKYSRRRAADGWKFTIEKMGPRGKRGSGGGWKFVSLPDGSSRPLNEMEKMYVRRETPRHRRKILP
        EFVEKMNFNKKKIAGEVFRHKYSRRR +DGWKFT+EKMGPRGKRG GGGWKFVSLPDGSSRPLNEMEKMYVRRETPRHRRKILP
Subjt:  EFVEKMNFNKKKIAGEVFRHKYSRRRAADGWKFTIEKMGPRGKRGSGGGWKFVSLPDGSSRPLNEMEKMYVRRETPRHRRKILP

A0A6J1E998 uncharacterized protein LOC1114310663.7e-19289.84Show/hide
Query:  MQNLHHLICRLSSTSIGKSTNTSRLLKENVGSNLIIDSVSTLKHAQGAWLTTLREFSAKSGGFDEGDAKNEWDKSVSD--AGTMSDDLGWDSVSSWSTGL
        MQNLHH ICRLSS S+GKST        NVGS+LI DSVSTLKH QGAWLTTLREFSAKSGGFDE +AKNEWDKSVS+  +GT +DDLGWDSVSSWSTGL
Subjt:  MQNLHHLICRLSSTSIGKSTNTSRLLKENVGSNLIIDSVSTLKHAQGAWLTTLREFSAKSGGFDEGDAKNEWDKSVSD--AGTMSDDLGWDSVSSWSTGL

Query:  TKEHFDGEAVGRRTSGGGDSPKSPQSSLVSGLQEIEDRIRELEAENRKSKDFVDMWGERMREMSVLLKQVREPGARGSYLKDSEKAEMYRLHKENPEIYT
        TKEHFDGEAVGRR   G DSPKSPQ+SLVSGLQE EDRIRELEAENRKSKDFVD WGERM+EMS+LLKQVREPGARGSYLKDSEKAEMYRLHKENPE+YT
Subjt:  TKEHFDGEAVGRRTSGGGDSPKSPQSSLVSGLQEIEDRIRELEAENRKSKDFVDMWGERMREMSVLLKQVREPGARGSYLKDSEKAEMYRLHKENPEIYT

Query:  VEKLAEDYRIMRQRVHAILWLKELEEEEEKKLGHPLDDSVELLLDTCPEFFKSHDREFHVASLPYKPDFKVMPEGWDGTTRDLDEVHYEISQKEDDMLYK
        VEKLA+DYRIMRQRVHAILWLKELEEEEEKKLG PLDDSVELLLDT PEFFKSHD EFHVASLPYKPDFKVMPEGWDGTTRDLDEVHYEIS+KEDDMLYK
Subjt:  VEKLAEDYRIMRQRVHAILWLKELEEEEEKKLGHPLDDSVELLLDTCPEFFKSHDREFHVASLPYKPDFKVMPEGWDGTTRDLDEVHYEISQKEDDMLYK

Query:  EFVEKMNFNKKKIAGEVFRHKYSRRRAADGWKFTIEKMGPRGKRGSGGGWKFVSLPDGSSRPLNEMEKMYVRRETPRHRRKILP
        EFVEKMNFNKKKIAGEVFRHKYSRRRAADGWKFTIEKMGPRGKRG  GGWKFVSLPDGSSRPLNEMEKMYVRRETPRHRRKILP
Subjt:  EFVEKMNFNKKKIAGEVFRHKYSRRRAADGWKFTIEKMGPRGKRGSGGGWKFVSLPDGSSRPLNEMEKMYVRRETPRHRRKILP

A0A6J1HMP6 uncharacterized protein LOC1114650502.5e-19691.15Show/hide
Query:  MQNLHHLICRLSSTSIGKSTNTSRLLKENVGSNLIIDSVSTLKHAQGAWLTTLREFSAKSGGFDEGDAKNEWDKSVSD--AGTMSDDLGWDSVSSWSTGL
        MQNLHH ICRLSSTS+GKST        NVGS+LI DSVSTLKH QGAWLTTLREFSAKSGGFDE ++KNEWDKSVS+  +GT SDDLGWDSVSSWSTGL
Subjt:  MQNLHHLICRLSSTSIGKSTNTSRLLKENVGSNLIIDSVSTLKHAQGAWLTTLREFSAKSGGFDEGDAKNEWDKSVSD--AGTMSDDLGWDSVSSWSTGL

Query:  TKEHFDGEAVGRRTSGGGDSPKSPQSSLVSGLQEIEDRIRELEAENRKSKDFVDMWGERMREMSVLLKQVREPGARGSYLKDSEKAEMYRLHKENPEIYT
        TKEHFDGEAVGRR   GGDSPKSPQSSLVSGLQE EDRIRELEAENRKSKDFVD WGERM+EMS+LLKQVREPGARGSYLKDSEKAEMYRLHKENPE+YT
Subjt:  TKEHFDGEAVGRRTSGGGDSPKSPQSSLVSGLQEIEDRIRELEAENRKSKDFVDMWGERMREMSVLLKQVREPGARGSYLKDSEKAEMYRLHKENPEIYT

Query:  VEKLAEDYRIMRQRVHAILWLKELEEEEEKKLGHPLDDSVELLLDTCPEFFKSHDREFHVASLPYKPDFKVMPEGWDGTTRDLDEVHYEISQKEDDMLYK
        VEKLA+DYRIMRQRVHAILWLKELEEEEEKKLGHPLDDSVELLLDTCPEFFKSHD EFHVASLPYKPDFKVMPEGWDGTTRDLDEVHYEIS+KEDDMLYK
Subjt:  VEKLAEDYRIMRQRVHAILWLKELEEEEEKKLGHPLDDSVELLLDTCPEFFKSHDREFHVASLPYKPDFKVMPEGWDGTTRDLDEVHYEISQKEDDMLYK

Query:  EFVEKMNFNKKKIAGEVFRHKYSRRRAADGWKFTIEKMGPRGKRGSGGGWKFVSLPDGSSRPLNEMEKMYVRRETPRHRRKILP
        EFVEKMNFNKKKIAGEVFRHKYSRRRAADGWKFTIEKMGPRGKRG  GGWKFVSLPDGSSRPLNEMEKMYVRRETPRHRRKILP
Subjt:  EFVEKMNFNKKKIAGEVFRHKYSRRRAADGWKFTIEKMGPRGKRGSGGGWKFVSLPDGSSRPLNEMEKMYVRRETPRHRRKILP

SwissProt top hitse value%identityAlignment
A2WW22 Protein GAMETE CELL DEFECTIVE 1, mitochondrial5.2e-11971.92Show/hide
Query:  SSWSTGLTKEHFDGE--AVGRRTSGGGDSPKSPQSSLVSGLQEIEDRIRELEAENRKSKDFVDMWGERMREMSVLLKQVREPGARGSYLKDSEKAEMYRL
        SSWSTG+TKEHFDG   AVGR  +     P SP+ + V  + E ++  R +E +NR++K +VD WG+RMRE   LLKQVREPG+RGSYLKDSEK EMYRL
Subjt:  SSWSTGLTKEHFDGE--AVGRRTSGGGDSPKSPQSSLVSGLQEIEDRIRELEAENRKSKDFVDMWGERMREMSVLLKQVREPGARGSYLKDSEKAEMYRL

Query:  HKENPEIYTVEKLAEDYRIMRQRVHAILWLKELEEEEEKKLGHPLDDSVELLLDTCPEFFKSHDREFHVASLPYKPDFKVMPEGWDGTTRDLDEVHYEIS
        HKE+PE YTVE+LA+D+R+MRQRVHAILWLKE+EEEEE+KLG PLDDSVE+LLD+CPEFF SHDREFHVASLPYKPDFKVMPEGWDGTTRD DEV YEIS
Subjt:  HKENPEIYTVEKLAEDYRIMRQRVHAILWLKELEEEEEKKLGHPLDDSVELLLDTCPEFFKSHDREFHVASLPYKPDFKVMPEGWDGTTRDLDEVHYEIS

Query:  QKEDDMLYKEFVEKMNFNKKKIAGEVFRHKYSRRRAADGWKFTIEKMGPRGKRGSGGGWKFVSLPDGSSRPLNEMEKMYVRRETPRHRRKIL
         KED MLY+EFV+++ FNKKK+AGEV  HKYSRRR  DGW + +EK+G + KRGSGGGWKF SLPDGSSRPLN+MEKMYV+RETP+ RR+I+
Subjt:  QKEDDMLYKEFVEKMNFNKKKIAGEVFRHKYSRRRAADGWKFTIEKMGPRGKRGSGGGWKFVSLPDGSSRPLNEMEKMYVRRETPRHRRKIL

Q8S2G4 Protein GAMETE CELL DEFECTIVE 1, mitochondrial4.0e-11971.92Show/hide
Query:  SSWSTGLTKEHFDGE--AVGRRTSGGGDSPKSPQSSLVSGLQEIEDRIRELEAENRKSKDFVDMWGERMREMSVLLKQVREPGARGSYLKDSEKAEMYRL
        SSWSTG+TKEHFDG   AVGR  +     P SP+ + V  + E ++  R +E +NR++K +VD WG+RMRE   LLKQVREPG+RGSYLKDSEK EMYRL
Subjt:  SSWSTGLTKEHFDGE--AVGRRTSGGGDSPKSPQSSLVSGLQEIEDRIRELEAENRKSKDFVDMWGERMREMSVLLKQVREPGARGSYLKDSEKAEMYRL

Query:  HKENPEIYTVEKLAEDYRIMRQRVHAILWLKELEEEEEKKLGHPLDDSVELLLDTCPEFFKSHDREFHVASLPYKPDFKVMPEGWDGTTRDLDEVHYEIS
        HKE+PE YTVE+LA+D+R+MRQRVHAILWLKE+EEEEE+KLG PLDDSVE+LLD+CPEFF SHDREFHVASLPYKPDFKVMPEGWDGTTRD DEV YEIS
Subjt:  HKENPEIYTVEKLAEDYRIMRQRVHAILWLKELEEEEEKKLGHPLDDSVELLLDTCPEFFKSHDREFHVASLPYKPDFKVMPEGWDGTTRDLDEVHYEIS

Query:  QKEDDMLYKEFVEKMNFNKKKIAGEVFRHKYSRRRAADGWKFTIEKMGPRGKRGSGGGWKFVSLPDGSSRPLNEMEKMYVRRETPRHRRKIL
         KED MLY+EFV+++ FNKKK+AGEV  HKYSRRR  DGW + +EK+G + KRGSGGGWKF SLPDGSSRPLN+MEKMYV+RETP+ RR+I+
Subjt:  QKEDDMLYKEFVEKMNFNKKKIAGEVFRHKYSRRRAADGWKFTIEKMGPRGKRGSGGGWKFVSLPDGSSRPLNEMEKMYVRRETPRHRRKIL

Q9LVA9 Protein GAMETE CELL DEFECTIVE 1, mitochondrial4.9e-12561.21Show/hide
Query:  MQNLHHLICRLSSTSIGKSTNTSRLLKENVGSNLIIDSVSTLKHAQGAWLTTLREFSAKSGGFDEGDAKNEWDKSV--SDAGTMSDDLGWDSVSSWSTGL
        M NL  +I R SS S+  ST  S  L EN  S ++          Q A   T R FSAKSG    G   N W+ S   S  GT S DL WD+ S WSTGL
Subjt:  MQNLHHLICRLSSTSIGKSTNTSRLLKENVGSNLIIDSVSTLKHAQGAWLTTLREFSAKSGGFDEGDAKNEWDKSV--SDAGTMSDDLGWDSVSSWSTGL

Query:  TKEHFDGEAVGRRTSGGGDSPKSPQS--------------SLVSGLQEIEDRIRELEAENRKSKDFVDMWGERMREMSVLLKQVREPGARGSYLKDSEKA
        TKEHFDG +VGR+ +    S  +  S              +LV+ + E +D ++E+E +NR+ + FVD   +RM E+SVLLKQV+EPGARGSYLKDSEK 
Subjt:  TKEHFDGEAVGRRTSGGGDSPKSPQS--------------SLVSGLQEIEDRIRELEAENRKSKDFVDMWGERMREMSVLLKQVREPGARGSYLKDSEKA

Query:  EMYRLHKENPEIYTVEKLAEDYRIMRQRVHAILWLKELEEEEEKKLGHPLDDSVELLLDTCPEFFKSHDREFHVASLPYKPDFKVMPEGWDGTTRDLDEV
        EMYRLHKENPE+YT+E+LA+DYRIMRQRVHAIL+LKE EEEEE+KLG PLDDSV+ LLD  PEFF SHDREFHVASL YKPDFKVMPEGWDGT +D+DEV
Subjt:  EMYRLHKENPEIYTVEKLAEDYRIMRQRVHAILWLKELEEEEEKKLGHPLDDSVELLLDTCPEFFKSHDREFHVASLPYKPDFKVMPEGWDGTTRDLDEV

Query:  HYEISQKEDDMLYKEFVEKMNFNKKKIAGEVFRHKYSRRRAADGWKFTIEKMGPRGKRGSGGGWKFVSLPDGSSRPLNEMEKMYVRRETPRHRRKIL
        HYEIS+KEDDMLY+EFV +  FNK K  GEV  HKYSRRR+++GWK T+EK+G +GKRG+GGGWKF+SLPDGSSRPLNEMEK+YV+RETP  RR I+
Subjt:  HYEISQKEDDMLYKEFVEKMNFNKKKIAGEVFRHKYSRRRAADGWKFTIEKMGPRGKRGSGGGWKFVSLPDGSSRPLNEMEKMYVRRETPRHRRKIL

Arabidopsis top hitse value%identityAlignment
AT2G02880.1 mucin-related8.0e-3034.48Show/hide
Query:  IRELEAENRKSKDFVDMWGERMREMSVLLKQVREPGARGSYLKDSEKAEMYRLHKENPEIYTVEKLAEDYRIMRQRVHAILWLKELEEEEEKKL-----G
        I E++ E   +K FV+   E   E      +V +   +   + D E        + +  +Y++E + +DYR+ +QRVHA LW+KE+E+ EE KL     G
Subjt:  IRELEAENRKSKDFVDMWGERMREMSVLLKQVREPGARGSYLKDSEKAEMYRLHKENPEIYTVEKLAEDYRIMRQRVHAILWLKELEEEEEKKL-----G

Query:  HPLDDSVELLLDTCPEFFKSHDREFHVASLPYKPDFKVMPEGWDGTTRDLDEVHYEISQKEDDMLYKEFVEKMNFNKKKIAGEVFRHKYSRRRAADGWKF
           DD ++ LLD+C E F S D +F    +    + K  P+GW+ T ++ D   +E+SQ+E+D+L +EF  +  F K +IA  + +H +SRRR  DGWK+
Subjt:  HPLDDSVELLLDTCPEFFKSHDREFHVASLPYKPDFKVMPEGWDGTTRDLDEVHYEISQKEDDMLYKEFVEKMNFNKKKIAGEVFRHKYSRRRAADGWKF

Query:  TIEKMGPRGKRGSGGGWKFVSLPDGSSRPLNE
         IE +GP  ++G G   +  +L D S++P  E
Subjt:  TIEKMGPRGKRGSGGGWKFVSLPDGSSRPLNE

AT5G62270.1 BEST Arabidopsis thaliana protein match is: mucin-related (TAIR:AT2G02880.1)1.6e-12661.62Show/hide
Query:  MQNLHHLICRLSSTSIGKSTNTSRLLKENVGSNLIIDSVSTLKHAQGAWLTTLREFSAKSGGFDEGDAKNEWDKSV--SDAGTMSDDLGWDSVSSWSTGL
        M NL  +I R SS S+  ST  S  L EN  S ++          Q A   T R FSAKSG    G   N W+ S   S  GT S DL WD+ S WSTGL
Subjt:  MQNLHHLICRLSSTSIGKSTNTSRLLKENVGSNLIIDSVSTLKHAQGAWLTTLREFSAKSGGFDEGDAKNEWDKSV--SDAGTMSDDLGWDSVSSWSTGL

Query:  TKEHFDGEAVGRRTSGGGDSPKSPQS--------------SLVSGLQEIEDRIRELEAENRKSKDFVDMWGERMREMSVLLKQVREPGARGSYLKDSEKA
        TKEHFDG +VGR+ +    S  +  S              +LV+ + E +D ++E+E +NR+ + FVD   +RM E+SVLLKQV+EPGARGSYLKDSEK 
Subjt:  TKEHFDGEAVGRRTSGGGDSPKSPQS--------------SLVSGLQEIEDRIRELEAENRKSKDFVDMWGERMREMSVLLKQVREPGARGSYLKDSEKA

Query:  EMYRLHKENPEIYTVEKLAEDYRIMRQRVHAILWLKELEEEEEKKLGHPLDDSVELLLDTCPEFFKSHDREFHVASLPYKPDFKVMPEGWDGTTRDLDEV
        EMYRLHKENPE+YT+E+LA+DYRIMRQRVHAIL+LKE EEEEE+KLG PLDDSV+ LLD  PEFF SHDREFHVASL YKPDFKVMPEGWDGT +D+DEV
Subjt:  EMYRLHKENPEIYTVEKLAEDYRIMRQRVHAILWLKELEEEEEKKLGHPLDDSVELLLDTCPEFFKSHDREFHVASLPYKPDFKVMPEGWDGTTRDLDEV

Query:  HYEISQKEDDMLYKEFVEKMNFNKKKIAGEVFRHKYSRRRAADGWKFTIEKMGPRGKRGSGGGWKFVSLPDGSSRPLNEMEKMYVRRETPRHRRKI
        HYEIS+KEDDMLY+EFV +  FNK K  GEV  HKYSRRR+++GWK T+EK+G +GKRG+GGGWKF+SLPDGSSRPLNEMEK+YV+RETP  RRKI
Subjt:  HYEISQKEDDMLYKEFVEKMNFNKKKIAGEVFRHKYSRRRAADGWKFTIEKMGPRGKRGSGGGWKFVSLPDGSSRPLNEMEKMYVRRETPRHRRKI

AT5G62270.2 FUNCTIONS IN: molecular_function unknown3.5e-12661.21Show/hide
Query:  MQNLHHLICRLSSTSIGKSTNTSRLLKENVGSNLIIDSVSTLKHAQGAWLTTLREFSAKSGGFDEGDAKNEWDKSV--SDAGTMSDDLGWDSVSSWSTGL
        M NL  +I R SS S+  ST  S  L EN  S ++          Q A   T R FSAKSG    G   N W+ S   S  GT S DL WD+ S WSTGL
Subjt:  MQNLHHLICRLSSTSIGKSTNTSRLLKENVGSNLIIDSVSTLKHAQGAWLTTLREFSAKSGGFDEGDAKNEWDKSV--SDAGTMSDDLGWDSVSSWSTGL

Query:  TKEHFDGEAVGRRTSGGGDSPKSPQS--------------SLVSGLQEIEDRIRELEAENRKSKDFVDMWGERMREMSVLLKQVREPGARGSYLKDSEKA
        TKEHFDG +VGR+ +    S  +  S              +LV+ + E +D ++E+E +NR+ + FVD   +RM E+SVLLKQV+EPGARGSYLKDSEK 
Subjt:  TKEHFDGEAVGRRTSGGGDSPKSPQS--------------SLVSGLQEIEDRIRELEAENRKSKDFVDMWGERMREMSVLLKQVREPGARGSYLKDSEKA

Query:  EMYRLHKENPEIYTVEKLAEDYRIMRQRVHAILWLKELEEEEEKKLGHPLDDSVELLLDTCPEFFKSHDREFHVASLPYKPDFKVMPEGWDGTTRDLDEV
        EMYRLHKENPE+YT+E+LA+DYRIMRQRVHAIL+LKE EEEEE+KLG PLDDSV+ LLD  PEFF SHDREFHVASL YKPDFKVMPEGWDGT +D+DEV
Subjt:  EMYRLHKENPEIYTVEKLAEDYRIMRQRVHAILWLKELEEEEEKKLGHPLDDSVELLLDTCPEFFKSHDREFHVASLPYKPDFKVMPEGWDGTTRDLDEV

Query:  HYEISQKEDDMLYKEFVEKMNFNKKKIAGEVFRHKYSRRRAADGWKFTIEKMGPRGKRGSGGGWKFVSLPDGSSRPLNEMEKMYVRRETPRHRRKIL
        HYEIS+KEDDMLY+EFV +  FNK K  GEV  HKYSRRR+++GWK T+EK+G +GKRG+GGGWKF+SLPDGSSRPLNEMEK+YV+RETP  RR I+
Subjt:  HYEISQKEDDMLYKEFVEKMNFNKKKIAGEVFRHKYSRRRAADGWKFTIEKMGPRGKRGSGGGWKFVSLPDGSSRPLNEMEKMYVRRETPRHRRKIL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAAAACCTGCATCATTTAATTTGTCGTCTCAGTTCCACGTCTATTGGGAAGAGCACAAATACTTCGAGACTTCTAAAAGAAAATGTGGGATCTAATTTAATTATTGA
TTCAGTTTCAACATTGAAGCATGCTCAAGGAGCTTGGTTAACCACTTTGAGAGAGTTCTCTGCAAAATCTGGTGGATTTGATGAAGGTGATGCTAAGAATGAATGGGATA
AGAGTGTAAGTGATGCTGGCACCATGTCAGATGATTTAGGTTGGGATTCTGTTTCCTCCTGGTCGACTGGATTGACCAAAGAGCATTTTGATGGAGAGGCTGTTGGCCGC
AGGACTAGTGGGGGAGGAGATTCCCCAAAATCACCACAGTCTTCATTAGTTTCTGGGTTGCAAGAGATTGAGGACAGAATAAGGGAATTAGAGGCAGAAAACCGAAAAAG
CAAGGACTTTGTGGACATGTGGGGTGAAAGGATGAGGGAGATGAGCGTGCTTTTGAAACAAGTAAGAGAGCCTGGTGCTAGAGGGTCTTATCTCAAGGACTCAGAGAAGG
CAGAGATGTATCGCTTGCACAAGGAAAACCCTGAGATATATACTGTTGAGAAGCTTGCTGAAGATTACAGGATCATGAGGCAAAGGGTTCACGCCATTCTTTGGCTGAAA
GAACTTGAAGAGGAAGAAGAGAAAAAGCTGGGCCACCCCTTGGATGATTCTGTTGAGCTTTTACTCGATACTTGCCCTGAATTCTTCAAGTCCCATGACCGGGAATTCCA
TGTGGCATCCCTTCCGTACAAACCTGATTTCAAGGTTATGCCGGAGGGTTGGGATGGTACAACCAGAGATTTGGATGAAGTCCATTACGAGATCTCCCAAAAAGAAGACG
ATATGCTATATAAAGAATTTGTCGAGAAGATGAATTTCAACAAAAAGAAAATTGCAGGAGAGGTCTTTCGCCACAAATATAGTAGGCGTCGGGCAGCAGATGGGTGGAAA
TTCACAATAGAGAAAATGGGACCCCGAGGGAAACGGGGAAGTGGCGGTGGATGGAAGTTTGTTAGCTTGCCTGATGGTTCTAGTAGGCCATTGAACGAAATGGAGAAGAT
GTATGTGAGGCGAGAGACACCTCGCCATCGACGTAAAATCCTTCCATGA
mRNA sequenceShow/hide mRNA sequence
AAAAAAACGAAAGCCCTAACTGCTCTTCTCCACGCTGCTGCTTCTTCTTCTTCTTCTTCTTCATCACGCAGGTCGGTGTCGCCGCCCCCTCTCTCTAGTTCGCTGCTGCC
GCCGCCGTGGTTTCTTCCGCTCGTACCGCCTCCACCATCGTACGCGCCTCTCCTCTCGCAGTTCTGACCTCCTCTGTCCTCTCTCCCCCATTGCACGACGCACAACGCCT
CCTCTCTCATTTTGAGCTTCAACTCGTTCTTGTCACCCACGCTCGTCGCCGTCACCGTTGGTTGCTTCCTCTGATTATCGTCATCATCGCCCAGATCAACTCCATTTTGC
CTTCAGGTTTTGGTTTTTAAGGGATACATAGTTGTTGTGGCACTTTCAGAGAGGGGTATAGTTTGACCATGCAAAACCTGCATCATTTAATTTGTCGTCTCAGTTCCACG
TCTATTGGGAAGAGCACAAATACTTCGAGACTTCTAAAAGAAAATGTGGGATCTAATTTAATTATTGATTCAGTTTCAACATTGAAGCATGCTCAAGGAGCTTGGTTAAC
CACTTTGAGAGAGTTCTCTGCAAAATCTGGTGGATTTGATGAAGGTGATGCTAAGAATGAATGGGATAAGAGTGTAAGTGATGCTGGCACCATGTCAGATGATTTAGGTT
GGGATTCTGTTTCCTCCTGGTCGACTGGATTGACCAAAGAGCATTTTGATGGAGAGGCTGTTGGCCGCAGGACTAGTGGGGGAGGAGATTCCCCAAAATCACCACAGTCT
TCATTAGTTTCTGGGTTGCAAGAGATTGAGGACAGAATAAGGGAATTAGAGGCAGAAAACCGAAAAAGCAAGGACTTTGTGGACATGTGGGGTGAAAGGATGAGGGAGAT
GAGCGTGCTTTTGAAACAAGTAAGAGAGCCTGGTGCTAGAGGGTCTTATCTCAAGGACTCAGAGAAGGCAGAGATGTATCGCTTGCACAAGGAAAACCCTGAGATATATA
CTGTTGAGAAGCTTGCTGAAGATTACAGGATCATGAGGCAAAGGGTTCACGCCATTCTTTGGCTGAAAGAACTTGAAGAGGAAGAAGAGAAAAAGCTGGGCCACCCCTTG
GATGATTCTGTTGAGCTTTTACTCGATACTTGCCCTGAATTCTTCAAGTCCCATGACCGGGAATTCCATGTGGCATCCCTTCCGTACAAACCTGATTTCAAGGTTATGCC
GGAGGGTTGGGATGGTACAACCAGAGATTTGGATGAAGTCCATTACGAGATCTCCCAAAAAGAAGACGATATGCTATATAAAGAATTTGTCGAGAAGATGAATTTCAACA
AAAAGAAAATTGCAGGAGAGGTCTTTCGCCACAAATATAGTAGGCGTCGGGCAGCAGATGGGTGGAAATTCACAATAGAGAAAATGGGACCCCGAGGGAAACGGGGAAGT
GGCGGTGGATGGAAGTTTGTTAGCTTGCCTGATGGTTCTAGTAGGCCATTGAACGAAATGGAGAAGATGTATGTGAGGCGAGAGACACCTCGCCATCGACGTAAAATCCT
TCCATGATAAGTTTCGAGAACACACATCTCGTTTTGTTTGTGTATTTGATGAAATAGTTAGTTTAAAAGAAACTGGTAGATCCCCCAGGATGATTTCAAAATTTTCCTCT
GTGACATCATCTGAAGTAGGGTTCCCTTTTTTCACTGACCTGTTTTTGGTTGAACTCTTTGCAAATGTTGTAGGTTTCCCTGATAGTTCAATATCTGCAATAACAGGTTT
TTTTTTTGCCCTTCCTCTCAATGATGGGATGATATTTTGGTATCTTATTTCCAATGCCATCTTCCAAATAATGTAGCTCAAGGATGAAAGAGTTGGAGGAACATATTGAT
TATGTAGTTGAAATATTTTATTTTTCTTGTTCATTACTTTATTAGAGGCTTTATTGCATCACTTGTACAAATGGTGCTTTATGTCAAATAATTTTGAAACTCAATGATCA
TTTCTAAGAAGTAAGAACTGATAACATATTGATCTTTCTTTTGAAC
Protein sequenceShow/hide protein sequence
MQNLHHLICRLSSTSIGKSTNTSRLLKENVGSNLIIDSVSTLKHAQGAWLTTLREFSAKSGGFDEGDAKNEWDKSVSDAGTMSDDLGWDSVSSWSTGLTKEHFDGEAVGR
RTSGGGDSPKSPQSSLVSGLQEIEDRIRELEAENRKSKDFVDMWGERMREMSVLLKQVREPGARGSYLKDSEKAEMYRLHKENPEIYTVEKLAEDYRIMRQRVHAILWLK
ELEEEEEKKLGHPLDDSVELLLDTCPEFFKSHDREFHVASLPYKPDFKVMPEGWDGTTRDLDEVHYEISQKEDDMLYKEFVEKMNFNKKKIAGEVFRHKYSRRRAADGWK
FTIEKMGPRGKRGSGGGWKFVSLPDGSSRPLNEMEKMYVRRETPRHRRKILP