CuGenDBv2

Lag0011366 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0011366
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Protein GAMETE CELL DEFECTIVE 1, mitochondrial
Genome location: chr1:22900270..22906793
RNA-Seq Expression: Lag0011366
Synteny: Lag0011366
Gene Ontology terms:
GO:0007006 - mitochondrial membrane organization (biological process)
GO:0007033 - vacuole organization (biological process)
GO:0007154 - cell communication (biological process)
GO:0009555 - pollen development (biological process)
GO:0009793 - embryo development ending in seed dormancy (biological process)
GO:0009846 - pollen germination (biological process)
GO:0009960 - endosperm development (biological process)
GO:0010342 - endosperm cellularization (biological process)
GO:0010468 - regulation of gene expression (biological process)
GO:0010581 - regulation of starch biosynthetic process (biological process)
GO:0043067 - regulation of programmed cell death (biological process)
GO:0048868 - pollen tube development (biological process)
GO:0051647 - nucleus localization (biological process)
GO:0005739 - mitochondrion (cellular component)
GO:0000287 - magnesium ion binding (molecular function)
GO:0010333 - terpene synthase activity (molecular function)
InterPro domains: NA
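The genome location above is given as a `sequence:start..end` string. A minimal sketch of how such a string could be split into its parts follows. It assumes the coordinates are 1-based and inclusive, which this record does not state, and the names `GenomeLocation` and `parse_location` are hypothetical helpers rather than part of any CuGenDB tool.

```python
import re
from typing import NamedTuple


class GenomeLocation(NamedTuple):
    """Hypothetical container for a 'seqid:start..end' location string."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive,
        # so both endpoints count toward the span length.
        return self.end - self.start + 1


# Sequence name, a colon, then two integers separated by '..'
_LOCATION_RE = re.compile(r"^(?P<seqid>[^:\s]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text: str) -> GenomeLocation:
    """Parse a location such as 'chr1:22900270..22906793'."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    start = int(match.group("start"))
    end = int(match.group("end"))
    if start > end:
        raise ValueError(f"start is greater than end in {text!r}")
    return GenomeLocation(match.group("seqid"), start, end)


if __name__ == "__main__":
    loc = parse_location("chr1:22900270..22906793")
    print(loc.seqid, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the gene spans 6,524 bp on chr1.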


Homology
GenBank top hits (e value, % identity, alignment)
TYK12013.1 uncharacterized protein E5676_scaffold1017G00220 [Cucumis melo var. makuwa] (e value: 3.9e-166; identity: 79.02%)
Query:  LTMQNLHHLICRLSSTSLGKSTNTSRLLKENVGSNLILDSVSTLKHAQGAWLTSWKEFSAKSGGFDEGDAKNEWDRSVSESFSGTTSDDLGWDSVSSWST
        LTMQNLH+LI RLSSTSLGKSTNTSRLLK+NVG NL++DSVSTLKH QGAWLT+ +EFSAKSGGF  GD KNEWD+SVSE F G TSDDLGWDSVSSWST
Subjt:  LTMQNLHHLICRLSSTSLGKSTNTSRLLKENVGSNLILDSVSTLKHAQGAWLTSWKEFSAKSGGFDEGDAKNEWDRSVSESFSGTTSDDLGWDSVSSWST

Query:  GLTKEHFDGEAVGRRTSGGGDSPKSPQSSLVSGLQEIEDRIREIDAENRKNEGFMDKWGERMREMSMLLKQVKEPGARGSYLKDSEKAEIYRLHKENPEV
        GLTKEHFDGEAVGR+ SGGG S +SPQSS+VSGLQE ED +RE++AENRK++ +++KWGERMREMS+LLKQVKEPGARGSYLKDSEKAE+YRLHKENPEV
Subjt:  GLTKEHFDGEAVGRRTSGGGDSPKSPQSSLVSGLQEIEDRIREIDAENRKNEGFMDKWGERMREMSMLLKQVKEPGARGSYLKDSEKAEIYRLHKENPEV

Query:  YTVEKLAKDYRIIRQRVHAILWLKELEEEEEKKLGHPLDDSVELLLDTCPEFFKSHDREFHVASLPYKPDFKVMPEGWD---------------------
        YT+EKLAKDYRI+RQRVHAILWLKELEEEEEKKLGHPLDDS+ELLLD  PEFFKSHDREFHVASLPYKPDFKVMPEGWD                     
Subjt:  YTVEKLAKDYRIIRQRVHAILWLKELEEEEEKKLGHPLDDSVELLLDTCPEFFKSHDREFHVASLPYKPDFKVMPEGWD---------------------

Query:  ----------------GEVFRHKYSRRRAADGWKFTIEKMGPRGKRGGGGGWKFVSLPDGSSRPLNEMEKMYVRRETPRHRRKILP
                        GEVF HKYSRRRAADGWKFT+EKMGPRGKRG GGGWKFVSLPDGSSRPLNEMEKMYVRRETPRHRRKILP
Subjt:  ----------------GEVFRHKYSRRRAADGWKFTIEKMGPRGKRGGGGGWKFVSLPDGSSRPLNEMEKMYVRRETPRHRRKILP

XP_022154030.1 uncharacterized protein LOC111021386 [Momordica charantia] (e value: 4.9e-177; identity: 83.07%)
Query:  MQNLHHLICRLSSTSLGKSTNTSRLLKENVGSNLILDSVSTLKHAQGAWLTSWKEFSAKSGGFDEGDAKNEWDRSVSESFSGTTSDDLGWDSVSSWSTGL
        MQNLHHLI RLSSTSLGKSTNTSRLLK+NV S+LI DSV+TLKH QGAWLT+ +EFSAKSGGFDEGDAKNEWD+SVS+SFSGTTSDDLGWDSVSSWSTGL
Subjt:  MQNLHHLICRLSSTSLGKSTNTSRLLKENVGSNLILDSVSTLKHAQGAWLTSWKEFSAKSGGFDEGDAKNEWDRSVSESFSGTTSDDLGWDSVSSWSTGL

Query:  TKEHFDGEAVGRRTSGGGDSPKSPQSSLVSGLQEIEDRIREIDAENRKNEGFMDKWGERMREMSMLLKQVKEPGARGSYLKDSEKAEIYRLHKENPEVYT
        TKEHFDGEAVGRRTSGGGDSPKSPQSSLVSGLQEIEDRIRE++AENRK++GF+D+WGERMRE+S+LLKQV+EPGARG+YLKDSEKAE+YRLHKENPEVYT
Subjt:  TKEHFDGEAVGRRTSGGGDSPKSPQSSLVSGLQEIEDRIREIDAENRKNEGFMDKWGERMREMSMLLKQVKEPGARGSYLKDSEKAEIYRLHKENPEVYT

Query:  VEKLAKDYRIIRQRVHAILWLKELEEEEEKKLGHPLDDSVELLLDTCPEFFKSHDREFHVASLPYKPDFKVMPEGWD-----------------------
        V+KLAKDYRI+RQRVHAILWLKELEEEEEKKLGHPLDDSVELLLDTCPEFFKSHDREFHVASLPYKPDFKVMPEGWD                       
Subjt:  VEKLAKDYRIIRQRVHAILWLKELEEEEEKKLGHPLDDSVELLLDTCPEFFKSHDREFHVASLPYKPDFKVMPEGWD-----------------------

Query:  --------------GEVFRHKYSRRRAADGWKFTIEKMGPRGKRGGGGGWKFVSLPDGSSRPLNEMEKMYVRRETPRHRRKILP
                      GEVFRHKYSRRR +DGWKFT+EKMGPRGKRGGGGGWKFVSLPDGSSRPLNEMEKMYVRRETPRHRRKILP
Subjt:  --------------GEVFRHKYSRRRAADGWKFTIEKMGPRGKRGGGGGWKFVSLPDGSSRPLNEMEKMYVRRETPRHRRKILP

XP_022965080.1 uncharacterized protein LOC111465050 [Cucurbita maxima] (e value: 5.8e-170; identity: 81.25%)
Query:  MQNLHHLICRLSSTSLGKSTNTSRLLKENVGSNLILDSVSTLKHAQGAWLTSWKEFSAKSGGFDEGDAKNEWDRSVSESFSGTTSDDLGWDSVSSWSTGL
        MQNLHH ICRLSSTSLGKST        NVGS+LI DSVSTLKH QGAWLT+ +EFSAKSGGFDE ++KNEWD+SVSESFSGTTSDDLGWDSVSSWSTGL
Subjt:  MQNLHHLICRLSSTSLGKSTNTSRLLKENVGSNLILDSVSTLKHAQGAWLTSWKEFSAKSGGFDEGDAKNEWDRSVSESFSGTTSDDLGWDSVSSWSTGL

Query:  TKEHFDGEAVGRRTSGGGDSPKSPQSSLVSGLQEIEDRIREIDAENRKNEGFMDKWGERMREMSMLLKQVKEPGARGSYLKDSEKAEIYRLHKENPEVYT
        TKEHFDGEAVGRR   GGDSPKSPQSSLVSGLQE EDRIRE++AENRK++ F+DKWGERM+EMSMLLKQV+EPGARGSYLKDSEKAE+YRLHKENPEVYT
Subjt:  TKEHFDGEAVGRRTSGGGDSPKSPQSSLVSGLQEIEDRIREIDAENRKNEGFMDKWGERMREMSMLLKQVKEPGARGSYLKDSEKAEIYRLHKENPEVYT

Query:  VEKLAKDYRIIRQRVHAILWLKELEEEEEKKLGHPLDDSVELLLDTCPEFFKSHDREFHVASLPYKPDFKVMPEGWD-----------------------
        VEKLAKDYRI+RQRVHAILWLKELEEEEEKKLGHPLDDSVELLLDTCPEFFKSHD EFHVASLPYKPDFKVMPEGWD                       
Subjt:  VEKLAKDYRIIRQRVHAILWLKELEEEEEKKLGHPLDDSVELLLDTCPEFFKSHDREFHVASLPYKPDFKVMPEGWD-----------------------

Query:  --------------GEVFRHKYSRRRAADGWKFTIEKMGPRGKRGGGGGWKFVSLPDGSSRPLNEMEKMYVRRETPRHRRKILP
                      GEVFRHKYSRRRAADGWKFTIEKMGPRGKRGG GGWKFVSLPDGSSRPLNEMEKMYVRRETPRHRRKILP
Subjt:  --------------GEVFRHKYSRRRAADGWKFTIEKMGPRGKRGGGGGWKFVSLPDGSSRPLNEMEKMYVRRETPRHRRKILP

XP_023553278.1 uncharacterized protein LOC111810742 [Cucurbita pepo subsp. pepo] (e value: 2.0e-170; identity: 81.51%)
Query:  MQNLHHLICRLSSTSLGKSTNTSRLLKENVGSNLILDSVSTLKHAQGAWLTSWKEFSAKSGGFDEGDAKNEWDRSVSESFSGTTSDDLGWDSVSSWSTGL
        MQNLHH ICRLSSTSLGKS        +NVGSNLI DSVSTLKH QGAWLT+ +EFSAKSGGFDE ++KNEWD+SVSESFSGTTSDDLGWDSVSSWSTGL
Subjt:  MQNLHHLICRLSSTSLGKSTNTSRLLKENVGSNLILDSVSTLKHAQGAWLTSWKEFSAKSGGFDEGDAKNEWDRSVSESFSGTTSDDLGWDSVSSWSTGL

Query:  TKEHFDGEAVGRRTSGGGDSPKSPQSSLVSGLQEIEDRIREIDAENRKNEGFMDKWGERMREMSMLLKQVKEPGARGSYLKDSEKAEIYRLHKENPEVYT
        TKEHFDGEAVGRR   GGDSPKSPQSSLVSGLQE EDRIRE++AENRK++ F+DKWGERM+EMSMLLKQVKEPGARGSYLKDSEKAE+YRLHKENPEVYT
Subjt:  TKEHFDGEAVGRRTSGGGDSPKSPQSSLVSGLQEIEDRIREIDAENRKNEGFMDKWGERMREMSMLLKQVKEPGARGSYLKDSEKAEIYRLHKENPEVYT

Query:  VEKLAKDYRIIRQRVHAILWLKELEEEEEKKLGHPLDDSVELLLDTCPEFFKSHDREFHVASLPYKPDFKVMPEGWD-----------------------
        VEKLAKDYRI+RQRVHAILWLKELEEEEEKKLGHPLDDSVELLLDTCPEFFKSHD EFHVASLPYKPDFKVMPEGWD                       
Subjt:  VEKLAKDYRIIRQRVHAILWLKELEEEEEKKLGHPLDDSVELLLDTCPEFFKSHDREFHVASLPYKPDFKVMPEGWD-----------------------

Query:  --------------GEVFRHKYSRRRAADGWKFTIEKMGPRGKRGGGGGWKFVSLPDGSSRPLNEMEKMYVRRETPRHRRKILP
                      GEVFRHKYSRRRAADGWKFTIEKMGPRGKRGG GGWKFVSLPDGSSRPLNEMEKMYVRRETPRHRRKILP
Subjt:  --------------GEVFRHKYSRRRAADGWKFTIEKMGPRGKRGGGGGWKFVSLPDGSSRPLNEMEKMYVRRETPRHRRKILP

XP_038906006.1 protein GAMETE CELL DEFECTIVE 1, mitochondrial [Benincasa hispida] (e value: 1.3e-174; identity: 82.81%)
Query:  MQNLHHLICRLSSTSLGKSTNTSRLLKENVGSNLILDSVSTLKHAQGAWLTSWKEFSAKSGGFDEGDAKNEWDRSVSESFSGTTSDDLGWDSVSSWSTGL
        MQNLHHLICRLSSTSLGK+T TS+LLKENVGS+L++DSVSTLKHAQGAWLT+ +EFSAKSGGFD G++KNE D+SVSESFSGT SDD GWDSVSSWSTGL
Subjt:  MQNLHHLICRLSSTSLGKSTNTSRLLKENVGSNLILDSVSTLKHAQGAWLTSWKEFSAKSGGFDEGDAKNEWDRSVSESFSGTTSDDLGWDSVSSWSTGL

Query:  TKEHFDGEAVGRRTSGGGDSPKSPQSSLVSGLQEIEDRIREIDAENRKNEGFMDKWGERMREMSMLLKQVKEPGARGSYLKDSEKAEIYRLHKENPEVYT
        TKEHFDGE VG RTSGG DSPKSPQSSLVSGLQEIEDRIRE++AENRK++ F+DKWGERMREMSMLLKQVKEPGARGSYLKDSEKAE+YRLHKENPEVYT
Subjt:  TKEHFDGEAVGRRTSGGGDSPKSPQSSLVSGLQEIEDRIREIDAENRKNEGFMDKWGERMREMSMLLKQVKEPGARGSYLKDSEKAEIYRLHKENPEVYT

Query:  VEKLAKDYRIIRQRVHAILWLKELEEEEEKKLGHPLDDSVELLLDTCPEFFKSHDREFHVASLPYKPDFKVMPEGWD-----------------------
        VEKLAKDYRI+RQRVHAILWLKELEEEEEKKLGHPLDDSVELLLDTCPEFFKSHDREFHVASLPYKPDFKVMPEGWD                       
Subjt:  VEKLAKDYRIIRQRVHAILWLKELEEEEEKKLGHPLDDSVELLLDTCPEFFKSHDREFHVASLPYKPDFKVMPEGWD-----------------------

Query:  --------------GEVFRHKYSRRRAADGWKFTIEKMGPRGKRGGGGGWKFVSLPDGSSRPLNEMEKMYVRRETPRHRRKILP
                      GEVF HKYSRRRAADGWKFTIEKMGPRGKRGGGGGWKFVSLP+GSSRPLNEMEKMYVRRETPRHRRKILP
Subjt:  --------------GEVFRHKYSRRRAADGWKFTIEKMGPRGKRGGGGGWKFVSLPDGSSRPLNEMEKMYVRRETPRHRRKILP

TrEMBL top hits (e value, % identity, alignment)
A0A1S4DXD4 uncharacterized protein LOC103490896 (e value: 2.1e-165; identity: 78.91%)
Query:  MQNLHHLICRLSSTSLGKSTNTSRLLKENVGSNLILDSVSTLKHAQGAWLTSWKEFSAKSGGFDEGDAKNEWDRSVSESFSGTTSDDLGWDSVSSWSTGL
        MQNLH+LI RLSSTSLGKSTNTSRLLK+NVG NL++DSVSTLKH QGAWLT+ +EFSAKSGGF  GD KNEWD+SVSE F G TSDDLGWDSVSSWSTGL
Subjt:  MQNLHHLICRLSSTSLGKSTNTSRLLKENVGSNLILDSVSTLKHAQGAWLTSWKEFSAKSGGFDEGDAKNEWDRSVSESFSGTTSDDLGWDSVSSWSTGL

Query:  TKEHFDGEAVGRRTSGGGDSPKSPQSSLVSGLQEIEDRIREIDAENRKNEGFMDKWGERMREMSMLLKQVKEPGARGSYLKDSEKAEIYRLHKENPEVYT
        TKEHFDGEAVGR+ SGGG S +SPQSS+VSGLQE ED +RE++AENRK++ +++KWGERMREMS+LLKQVKEPGARGSYLKDSEKAE+YRLHKENPEVYT
Subjt:  TKEHFDGEAVGRRTSGGGDSPKSPQSSLVSGLQEIEDRIREIDAENRKNEGFMDKWGERMREMSMLLKQVKEPGARGSYLKDSEKAEIYRLHKENPEVYT

Query:  VEKLAKDYRIIRQRVHAILWLKELEEEEEKKLGHPLDDSVELLLDTCPEFFKSHDREFHVASLPYKPDFKVMPEGWD-----------------------
        +EKLAKDYRI+RQRVHAILWLKELEEEEEKKLGHPLDDS+ELLLD  PEFFKSHDREFHVASLPYKPDFKVMPEGWD                       
Subjt:  VEKLAKDYRIIRQRVHAILWLKELEEEEEKKLGHPLDDSVELLLDTCPEFFKSHDREFHVASLPYKPDFKVMPEGWD-----------------------

Query:  --------------GEVFRHKYSRRRAADGWKFTIEKMGPRGKRGGGGGWKFVSLPDGSSRPLNEMEKMYVRRETPRHRRKILP
                      GEVF HKYSRRRAADGWKFT+EKMGPRGKRG GGGWKFVSLPDGSSRPLNEMEKMYVRRETPRHRRKILP
Subjt:  --------------GEVFRHKYSRRRAADGWKFTIEKMGPRGKRGGGGGWKFVSLPDGSSRPLNEMEKMYVRRETPRHRRKILP

A0A5D3CJ95 Uncharacterized protein (e value: 1.9e-166; identity: 79.02%)
Query:  LTMQNLHHLICRLSSTSLGKSTNTSRLLKENVGSNLILDSVSTLKHAQGAWLTSWKEFSAKSGGFDEGDAKNEWDRSVSESFSGTTSDDLGWDSVSSWST
        LTMQNLH+LI RLSSTSLGKSTNTSRLLK+NVG NL++DSVSTLKH QGAWLT+ +EFSAKSGGF  GD KNEWD+SVSE F G TSDDLGWDSVSSWST
Subjt:  LTMQNLHHLICRLSSTSLGKSTNTSRLLKENVGSNLILDSVSTLKHAQGAWLTSWKEFSAKSGGFDEGDAKNEWDRSVSESFSGTTSDDLGWDSVSSWST

Query:  GLTKEHFDGEAVGRRTSGGGDSPKSPQSSLVSGLQEIEDRIREIDAENRKNEGFMDKWGERMREMSMLLKQVKEPGARGSYLKDSEKAEIYRLHKENPEV
        GLTKEHFDGEAVGR+ SGGG S +SPQSS+VSGLQE ED +RE++AENRK++ +++KWGERMREMS+LLKQVKEPGARGSYLKDSEKAE+YRLHKENPEV
Subjt:  GLTKEHFDGEAVGRRTSGGGDSPKSPQSSLVSGLQEIEDRIREIDAENRKNEGFMDKWGERMREMSMLLKQVKEPGARGSYLKDSEKAEIYRLHKENPEV

Query:  YTVEKLAKDYRIIRQRVHAILWLKELEEEEEKKLGHPLDDSVELLLDTCPEFFKSHDREFHVASLPYKPDFKVMPEGWD---------------------
        YT+EKLAKDYRI+RQRVHAILWLKELEEEEEKKLGHPLDDS+ELLLD  PEFFKSHDREFHVASLPYKPDFKVMPEGWD                     
Subjt:  YTVEKLAKDYRIIRQRVHAILWLKELEEEEEKKLGHPLDDSVELLLDTCPEFFKSHDREFHVASLPYKPDFKVMPEGWD---------------------

Query:  ----------------GEVFRHKYSRRRAADGWKFTIEKMGPRGKRGGGGGWKFVSLPDGSSRPLNEMEKMYVRRETPRHRRKILP
                        GEVF HKYSRRRAADGWKFT+EKMGPRGKRG GGGWKFVSLPDGSSRPLNEMEKMYVRRETPRHRRKILP
Subjt:  ----------------GEVFRHKYSRRRAADGWKFTIEKMGPRGKRGGGGGWKFVSLPDGSSRPLNEMEKMYVRRETPRHRRKILP

A0A6J1DIH3 uncharacterized protein LOC111021386 (e value: 2.4e-177; identity: 83.07%)
Query:  MQNLHHLICRLSSTSLGKSTNTSRLLKENVGSNLILDSVSTLKHAQGAWLTSWKEFSAKSGGFDEGDAKNEWDRSVSESFSGTTSDDLGWDSVSSWSTGL
        MQNLHHLI RLSSTSLGKSTNTSRLLK+NV S+LI DSV+TLKH QGAWLT+ +EFSAKSGGFDEGDAKNEWD+SVS+SFSGTTSDDLGWDSVSSWSTGL
Subjt:  MQNLHHLICRLSSTSLGKSTNTSRLLKENVGSNLILDSVSTLKHAQGAWLTSWKEFSAKSGGFDEGDAKNEWDRSVSESFSGTTSDDLGWDSVSSWSTGL

Query:  TKEHFDGEAVGRRTSGGGDSPKSPQSSLVSGLQEIEDRIREIDAENRKNEGFMDKWGERMREMSMLLKQVKEPGARGSYLKDSEKAEIYRLHKENPEVYT
        TKEHFDGEAVGRRTSGGGDSPKSPQSSLVSGLQEIEDRIRE++AENRK++GF+D+WGERMRE+S+LLKQV+EPGARG+YLKDSEKAE+YRLHKENPEVYT
Subjt:  TKEHFDGEAVGRRTSGGGDSPKSPQSSLVSGLQEIEDRIREIDAENRKNEGFMDKWGERMREMSMLLKQVKEPGARGSYLKDSEKAEIYRLHKENPEVYT

Query:  VEKLAKDYRIIRQRVHAILWLKELEEEEEKKLGHPLDDSVELLLDTCPEFFKSHDREFHVASLPYKPDFKVMPEGWD-----------------------
        V+KLAKDYRI+RQRVHAILWLKELEEEEEKKLGHPLDDSVELLLDTCPEFFKSHDREFHVASLPYKPDFKVMPEGWD                       
Subjt:  VEKLAKDYRIIRQRVHAILWLKELEEEEEKKLGHPLDDSVELLLDTCPEFFKSHDREFHVASLPYKPDFKVMPEGWD-----------------------

Query:  --------------GEVFRHKYSRRRAADGWKFTIEKMGPRGKRGGGGGWKFVSLPDGSSRPLNEMEKMYVRRETPRHRRKILP
                      GEVFRHKYSRRR +DGWKFT+EKMGPRGKRGGGGGWKFVSLPDGSSRPLNEMEKMYVRRETPRHRRKILP
Subjt:  --------------GEVFRHKYSRRRAADGWKFTIEKMGPRGKRGGGGGWKFVSLPDGSSRPLNEMEKMYVRRETPRHRRKILP

A0A6J1E998 uncharacterized protein LOC111431066 (e value: 4.2e-166; identity: 79.95%)
Query:  MQNLHHLICRLSSTSLGKSTNTSRLLKENVGSNLILDSVSTLKHAQGAWLTSWKEFSAKSGGFDEGDAKNEWDRSVSESFSGTTSDDLGWDSVSSWSTGL
        MQNLHH ICRLSS SLGKST        NVGS+LI DSVSTLKH QGAWLT+ +EFSAKSGGFDE +AKNEWD+SVSESFSGTT+DDLGWDSVSSWSTGL
Subjt:  MQNLHHLICRLSSTSLGKSTNTSRLLKENVGSNLILDSVSTLKHAQGAWLTSWKEFSAKSGGFDEGDAKNEWDRSVSESFSGTTSDDLGWDSVSSWSTGL

Query:  TKEHFDGEAVGRRTSGGGDSPKSPQSSLVSGLQEIEDRIREIDAENRKNEGFMDKWGERMREMSMLLKQVKEPGARGSYLKDSEKAEIYRLHKENPEVYT
        TKEHFDGEAVGRR   G DSPKSPQ+SLVSGLQE EDRIRE++AENRK++ F+DKWGERM+EMSMLLKQV+EPGARGSYLKDSEKAE+YRLHKENPEVYT
Subjt:  TKEHFDGEAVGRRTSGGGDSPKSPQSSLVSGLQEIEDRIREIDAENRKNEGFMDKWGERMREMSMLLKQVKEPGARGSYLKDSEKAEIYRLHKENPEVYT

Query:  VEKLAKDYRIIRQRVHAILWLKELEEEEEKKLGHPLDDSVELLLDTCPEFFKSHDREFHVASLPYKPDFKVMPEGWD-----------------------
        VEKLAKDYRI+RQRVHAILWLKELEEEEEKKLG PLDDSVELLLDT PEFFKSHD EFHVASLPYKPDFKVMPEGWD                       
Subjt:  VEKLAKDYRIIRQRVHAILWLKELEEEEEKKLGHPLDDSVELLLDTCPEFFKSHDREFHVASLPYKPDFKVMPEGWD-----------------------

Query:  --------------GEVFRHKYSRRRAADGWKFTIEKMGPRGKRGGGGGWKFVSLPDGSSRPLNEMEKMYVRRETPRHRRKILP
                      GEVFRHKYSRRRAADGWKFTIEKMGPRGKRGG GGWKFVSLPDGSSRPLNEMEKMYVRRETPRHRRKILP
Subjt:  --------------GEVFRHKYSRRRAADGWKFTIEKMGPRGKRGGGGGWKFVSLPDGSSRPLNEMEKMYVRRETPRHRRKILP

A0A6J1HMP6 uncharacterized protein LOC111465050 (e value: 2.8e-170; identity: 81.25%)
Query:  MQNLHHLICRLSSTSLGKSTNTSRLLKENVGSNLILDSVSTLKHAQGAWLTSWKEFSAKSGGFDEGDAKNEWDRSVSESFSGTTSDDLGWDSVSSWSTGL
        MQNLHH ICRLSSTSLGKST        NVGS+LI DSVSTLKH QGAWLT+ +EFSAKSGGFDE ++KNEWD+SVSESFSGTTSDDLGWDSVSSWSTGL
Subjt:  MQNLHHLICRLSSTSLGKSTNTSRLLKENVGSNLILDSVSTLKHAQGAWLTSWKEFSAKSGGFDEGDAKNEWDRSVSESFSGTTSDDLGWDSVSSWSTGL

Query:  TKEHFDGEAVGRRTSGGGDSPKSPQSSLVSGLQEIEDRIREIDAENRKNEGFMDKWGERMREMSMLLKQVKEPGARGSYLKDSEKAEIYRLHKENPEVYT
        TKEHFDGEAVGRR   GGDSPKSPQSSLVSGLQE EDRIRE++AENRK++ F+DKWGERM+EMSMLLKQV+EPGARGSYLKDSEKAE+YRLHKENPEVYT
Subjt:  TKEHFDGEAVGRRTSGGGDSPKSPQSSLVSGLQEIEDRIREIDAENRKNEGFMDKWGERMREMSMLLKQVKEPGARGSYLKDSEKAEIYRLHKENPEVYT

Query:  VEKLAKDYRIIRQRVHAILWLKELEEEEEKKLGHPLDDSVELLLDTCPEFFKSHDREFHVASLPYKPDFKVMPEGWD-----------------------
        VEKLAKDYRI+RQRVHAILWLKELEEEEEKKLGHPLDDSVELLLDTCPEFFKSHD EFHVASLPYKPDFKVMPEGWD                       
Subjt:  VEKLAKDYRIIRQRVHAILWLKELEEEEEKKLGHPLDDSVELLLDTCPEFFKSHDREFHVASLPYKPDFKVMPEGWD-----------------------

Query:  --------------GEVFRHKYSRRRAADGWKFTIEKMGPRGKRGGGGGWKFVSLPDGSSRPLNEMEKMYVRRETPRHRRKILP
                      GEVFRHKYSRRRAADGWKFTIEKMGPRGKRGG GGWKFVSLPDGSSRPLNEMEKMYVRRETPRHRRKILP
Subjt:  --------------GEVFRHKYSRRRAADGWKFTIEKMGPRGKRGGGGGWKFVSLPDGSSRPLNEMEKMYVRRETPRHRRKILP

SwissProt top hits (e value, % identity, alignment)
A2WW22 Protein GAMETE CELL DEFECTIVE 1, mitochondrial (e value: 1.4e-94; identity: 59.15%)
Query:  SGTTSDDLGWDS-VSSWSTGLTKEHFDGE--AVGRRTSGGGDSPKSPQSSLVSGLQEIEDRIREIDAENRKNEGFMDKWGERMREMSMLLKQVKEPGARG
        S +  D  G D   SSWSTG+TKEHFDG   AVGR  +     P SP+ + V  + E ++  R ++ +NR+ + ++D WG+RMRE   LLKQV+EPG+RG
Subjt:  SGTTSDDLGWDS-VSSWSTGLTKEHFDGE--AVGRRTSGGGDSPKSPQSSLVSGLQEIEDRIREIDAENRKNEGFMDKWGERMREMSMLLKQVKEPGARG

Query:  SYLKDSEKAEIYRLHKENPEVYTVEKLAKDYRIIRQRVHAILWLKELEEEEEKKLGHPLDDSVELLLDTCPEFFKSHDREFHVASLPYKPDFKVMPEGWD
        SYLKDSEK E+YRLHKE+PE YTVE+LAKD+R++RQRVHAILWLKE+EEEEE+KLG PLDDSVE+LLD+CPEFF SHDREFHVASLPYKPDFKVMPEGWD
Subjt:  SYLKDSEKAEIYRLHKENPEVYTVEKLAKDYRIIRQRVHAILWLKELEEEEEKKLGHPLDDSVELLLDTCPEFFKSHDREFHVASLPYKPDFKVMPEGWD

Query:  -------------------------------------GEVFRHKYSRRRAADGWKFTIEKMGPRGKRGGGGGWKFVSLPDGSSRPLNEMEKMYVRRETPR
                                             GEV  HKYSRRR  DGW + +EK+G + KRG GGGWKF SLPDGSSRPLN+MEKMYV+RETP+
Subjt:  -------------------------------------GEVFRHKYSRRRAADGWKFTIEKMGPRGKRGGGGGWKFVSLPDGSSRPLNEMEKMYVRRETPR

Query:  HRRKIL
         RR+I+
Subjt:  HRRKIL

Q8S2G4 Protein GAMETE CELL DEFECTIVE 1, mitochondrial (e value: 1.1e-94; identity: 59.15%)
Query:  SGTTSDDLGWDS-VSSWSTGLTKEHFDGE--AVGRRTSGGGDSPKSPQSSLVSGLQEIEDRIREIDAENRKNEGFMDKWGERMREMSMLLKQVKEPGARG
        S +  D  G D   SSWSTG+TKEHFDG   AVGR  +     P SP+ + V  + E ++  R ++ +NR+ + ++D WG+RMRE   LLKQV+EPG+RG
Subjt:  SGTTSDDLGWDS-VSSWSTGLTKEHFDGE--AVGRRTSGGGDSPKSPQSSLVSGLQEIEDRIREIDAENRKNEGFMDKWGERMREMSMLLKQVKEPGARG

Query:  SYLKDSEKAEIYRLHKENPEVYTVEKLAKDYRIIRQRVHAILWLKELEEEEEKKLGHPLDDSVELLLDTCPEFFKSHDREFHVASLPYKPDFKVMPEGWD
        SYLKDSEK E+YRLHKE+PE YTVE+LAKD+R++RQRVHAILWLKE+EEEEE+KLG PLDDSVE+LLD+CPEFF SHDREFHVASLPYKPDFKVMPEGWD
Subjt:  SYLKDSEKAEIYRLHKENPEVYTVEKLAKDYRIIRQRVHAILWLKELEEEEEKKLGHPLDDSVELLLDTCPEFFKSHDREFHVASLPYKPDFKVMPEGWD

Query:  -------------------------------------GEVFRHKYSRRRAADGWKFTIEKMGPRGKRGGGGGWKFVSLPDGSSRPLNEMEKMYVRRETPR
                                             GEV  HKYSRRR  DGW + +EK+G + KRG GGGWKF SLPDGSSRPLN+MEKMYV+RETP+
Subjt:  -------------------------------------GEVFRHKYSRRRAADGWKFTIEKMGPRGKRGGGGGWKFVSLPDGSSRPLNEMEKMYVRRETPR

Query:  HRRKIL
         RR+I+
Subjt:  HRRKIL

Q9LVA9 Protein GAMETE CELL DEFECTIVE 1, mitochondrial (e value: 2.3e-105; identity: 54.16%)
Query:  MQNLHHLICRLSSTSLGKSTNTSRLLKENVGSNLILDSVSTLKHAQGAWLTSWKEFSAKSGGFDEGDAKNEWDRSVSESFSGTTSDDLGWDSVSSWSTGL
        M NL  +I R SS SL  ST  S  L EN  S ++  + +             + FSAKSG    G   N W+ S   SF GT S DL WD+ S WSTGL
Subjt:  MQNLHHLICRLSSTSLGKSTNTSRLLKENVGSNLILDSVSTLKHAQGAWLTSWKEFSAKSGGFDEGDAKNEWDRSVSESFSGTTSDDLGWDSVSSWSTGL

Query:  TKEHFDGEAVGRRTSGGGDSPKSPQS--------------SLVSGLQEIEDRIREIDAENRKNEGFMDKWGERMREMSMLLKQVKEPGARGSYLKDSEKA
        TKEHFDG +VGR+ +    S  +  S              +LV+ + E +D ++EI+ +NR+   F+D   +RM E+S+LLKQVKEPGARGSYLKDSEK 
Subjt:  TKEHFDGEAVGRRTSGGGDSPKSPQS--------------SLVSGLQEIEDRIREIDAENRKNEGFMDKWGERMREMSMLLKQVKEPGARGSYLKDSEKA

Query:  EIYRLHKENPEVYTVEKLAKDYRIIRQRVHAILWLKELEEEEEKKLGHPLDDSVELLLDTCPEFFKSHDREFHVASLPYKPDFKVMPEGWD---------
        E+YRLHKENPEVYT+E+LAKDYRI+RQRVHAIL+LKE EEEEE+KLG PLDDSV+ LLD  PEFF SHDREFHVASL YKPDFKVMPEGWD         
Subjt:  EIYRLHKENPEVYTVEKLAKDYRIIRQRVHAILWLKELEEEEEKKLGHPLDDSVELLLDTCPEFFKSHDREFHVASLPYKPDFKVMPEGWD---------

Query:  ----------------------------GEVFRHKYSRRRAADGWKFTIEKMGPRGKRGGGGGWKFVSLPDGSSRPLNEMEKMYVRRETPRHRRKIL
                                    GEV  HKYSRRR+++GWK T+EK+G +GKRG GGGWKF+SLPDGSSRPLNEMEK+YV+RETP  RR I+
Subjt:  ----------------------------GEVFRHKYSRRRAADGWKFTIEKMGPRGKRGGGGGWKFVSLPDGSSRPLNEMEKMYVRRETPRHRRKIL

Arabidopsis top hits (e value, % identity, alignment)
AT2G02880.1 mucin-related (e value: 2.2e-18; identity: 30.47%)
Query:  IREIDAENRK-NEGFMDKWGERMREMSMLLKQVKEPGARGSYLKDSEKAEIYRLHKENPEVYTVEKLAKDYRIIRQRVHAILWLKELEEEEEKKL-----
        I EID E +   E   + W ER  +   + K+ ++    G    D E        + +  +Y++E + KDYR+ +QRVHA LW+KE+E+ EE KL     
Subjt:  IREIDAENRK-NEGFMDKWGERMREMSMLLKQVKEPGARGSYLKDSEKAEIYRLHKENPEVYTVEKLAKDYRIIRQRVHAILWLKELEEEEEKKL-----

Query:  GHPLDDSVELLLDTCPEFFKSHDREFHVASLPYKPDFKVMPEGW-------DGEVF------------------------------RHKYSRRRAADGWK
        G   DD ++ LLD+C E F S D +F    +    + K  P+GW       DG ++                              +H +SRRR  DGWK
Subjt:  GHPLDDSVELLLDTCPEFFKSHDREFHVASLPYKPDFKVMPEGW-------DGEVF------------------------------RHKYSRRRAADGWK

Query:  FTIEKMGPRGKRGGGGGWKFVSLPDGSSRPLNE
        + IE +GP  ++G G   +  +L D S++P  E
Subjt:  FTIEKMGPRGKRGGGGGWKFVSLPDGSSRPLNE

AT5G62270.1 BEST Arabidopsis thaliana protein match is: mucin-related (TAIR:AT2G02880.1) (e value: 7.5e-107; identity: 54.55%)
Query:  MQNLHHLICRLSSTSLGKSTNTSRLLKENVGSNLILDSVSTLKHAQGAWLTSWKEFSAKSGGFDEGDAKNEWDRSVSESFSGTTSDDLGWDSVSSWSTGL
        M NL  +I R SS SL  ST  S  L EN  S ++  + +             + FSAKSG    G   N W+ S   SF GT S DL WD+ S WSTGL
Subjt:  MQNLHHLICRLSSTSLGKSTNTSRLLKENVGSNLILDSVSTLKHAQGAWLTSWKEFSAKSGGFDEGDAKNEWDRSVSESFSGTTSDDLGWDSVSSWSTGL

Query:  TKEHFDGEAVGRRTSGGGDSPKSPQS--------------SLVSGLQEIEDRIREIDAENRKNEGFMDKWGERMREMSMLLKQVKEPGARGSYLKDSEKA
        TKEHFDG +VGR+ +    S  +  S              +LV+ + E +D ++EI+ +NR+   F+D   +RM E+S+LLKQVKEPGARGSYLKDSEK 
Subjt:  TKEHFDGEAVGRRTSGGGDSPKSPQS--------------SLVSGLQEIEDRIREIDAENRKNEGFMDKWGERMREMSMLLKQVKEPGARGSYLKDSEKA

Query:  EIYRLHKENPEVYTVEKLAKDYRIIRQRVHAILWLKELEEEEEKKLGHPLDDSVELLLDTCPEFFKSHDREFHVASLPYKPDFKVMPEGWD---------
        E+YRLHKENPEVYT+E+LAKDYRI+RQRVHAIL+LKE EEEEE+KLG PLDDSV+ LLD  PEFF SHDREFHVASL YKPDFKVMPEGWD         
Subjt:  EIYRLHKENPEVYTVEKLAKDYRIIRQRVHAILWLKELEEEEEKKLGHPLDDSVELLLDTCPEFFKSHDREFHVASLPYKPDFKVMPEGWD---------

Query:  ----------------------------GEVFRHKYSRRRAADGWKFTIEKMGPRGKRGGGGGWKFVSLPDGSSRPLNEMEKMYVRRETPRHRRKI
                                    GEV  HKYSRRR+++GWK T+EK+G +GKRG GGGWKF+SLPDGSSRPLNEMEK+YV+RETP  RRKI
Subjt:  ----------------------------GEVFRHKYSRRRAADGWKFTIEKMGPRGKRGGGGGWKFVSLPDGSSRPLNEMEKMYVRRETPRHRRKI

AT5G62270.2 FUNCTIONS IN: molecular_function unknown (e value: 1.7e-106; identity: 54.16%)
Query:  MQNLHHLICRLSSTSLGKSTNTSRLLKENVGSNLILDSVSTLKHAQGAWLTSWKEFSAKSGGFDEGDAKNEWDRSVSESFSGTTSDDLGWDSVSSWSTGL
        M NL  +I R SS SL  ST  S  L EN  S ++  + +             + FSAKSG    G   N W+ S   SF GT S DL WD+ S WSTGL
Subjt:  MQNLHHLICRLSSTSLGKSTNTSRLLKENVGSNLILDSVSTLKHAQGAWLTSWKEFSAKSGGFDEGDAKNEWDRSVSESFSGTTSDDLGWDSVSSWSTGL

Query:  TKEHFDGEAVGRRTSGGGDSPKSPQS--------------SLVSGLQEIEDRIREIDAENRKNEGFMDKWGERMREMSMLLKQVKEPGARGSYLKDSEKA
        TKEHFDG +VGR+ +    S  +  S              +LV+ + E +D ++EI+ +NR+   F+D   +RM E+S+LLKQVKEPGARGSYLKDSEK 
Subjt:  TKEHFDGEAVGRRTSGGGDSPKSPQS--------------SLVSGLQEIEDRIREIDAENRKNEGFMDKWGERMREMSMLLKQVKEPGARGSYLKDSEKA

Query:  EIYRLHKENPEVYTVEKLAKDYRIIRQRVHAILWLKELEEEEEKKLGHPLDDSVELLLDTCPEFFKSHDREFHVASLPYKPDFKVMPEGWD---------
        E+YRLHKENPEVYT+E+LAKDYRI+RQRVHAIL+LKE EEEEE+KLG PLDDSV+ LLD  PEFF SHDREFHVASL YKPDFKVMPEGWD         
Subjt:  EIYRLHKENPEVYTVEKLAKDYRIIRQRVHAILWLKELEEEEEKKLGHPLDDSVELLLDTCPEFFKSHDREFHVASLPYKPDFKVMPEGWD---------

Query:  ----------------------------GEVFRHKYSRRRAADGWKFTIEKMGPRGKRGGGGGWKFVSLPDGSSRPLNEMEKMYVRRETPRHRRKIL
                                    GEV  HKYSRRR+++GWK T+EK+G +GKRG GGGWKF+SLPDGSSRPLNEMEK+YV+RETP  RR I+
Subjt:  ----------------------------GEVFRHKYSRRRAADGWKFTIEKMGPRGKRGGGGGWKFVSLPDGSSRPLNEMEKMYVRRETPRHRRKIL


Sequences
CDS sequence
ATGGTAGGCTTATTGAGTCGGCGAATAAGCCACTCTCACCCGTACAAATCAAAAGACAATGCCTCATGGACAGGAGTCCATAATCCACTCAGGATTAAGGCCAAGTTGCC
TAGGATACCCCCACTCGCATGTCAACTACACGAACGCGTTGGATCATTACGTTTGTATCAAAATACAAAGCGGGTCGTATCTGTAGTGTCACCAGGAAAAGGTTACTGTT
CATTGTCGTTACCTAGCACGGCTGGGTTGGGAATTCCCAGCTGCGGGATTGTCAGAAACATGTGGTGGTGGGTCGCTAGTGGATGGCTGGAGCTTGGCGGGGCGGGTTGG
CGTCTATCGAGGTTGGGCAGCCGCCGCCTACCCCTCAACAGAGTCTCCTTCTTCCTCGCGTTCACGGCCACCGACCTCTTCTCCCTCTCTTCCGTCTTGCTTCGCACGCG
TCGAATCTCTCTCTCCCTCTCGTTTTCTTCAACAGCAACCCGCGGAGTATCAGCGCCGCACCCTCGACAGCTCTGCATCTCGGCGTGTGACGTTCGAGCGGCCGCAGCAG
CGACTATGCGGCTCCTCCGGCCATTTCTTTTCACCCTCGACGATCCAACATCATTCATTCTTTCCCTCGCGTTCTTTCCAGTGCAACTTCAGCCCACGCGATTTCGGCAA
GGAAACACATCGATCTCGAATTTTCCAGCGTCTGAATTGGACCTCGAATTCGTTCCGGGGTACCAATTTGTTGTGGGTGGCACTTGCAGAGAGGGTTATAAATTGACCAT
GCAAAACCTGCATCATTTAATTTGTCGTCTCAGTTCCACTTCTCTCGGCAAAAGCACAAATACTTCGAGACTTCTAAAAGAAAATGTGGGATCTAATTTAATTCTTGATT
CAGTTTCAACATTGAAGCATGCTCAAGGAGCTTGGTTAACCTCTTGGAAAGAGTTCTCTGCAAAATCTGGTGGATTTGATGAAGGTGATGCTAAGAATGAATGGGATAGG
AGTGTTAGTGAATCATTTTCTGGCACCACGTCAGATGATTTAGGTTGGGATTCTGTTTCCTCCTGGTCAACTGGATTGACCAAAGAACATTTTGATGGAGAGGCTGTGGG
CCGCAGGACTAGTGGGGGCGGGGATTCACCAAAATCACCACAGTCTTCATTAGTTTCTGGGTTGCAAGAGATTGAGGACAGAATAAGGGAAATAGATGCGGAAAACCGAA
AAAACGAGGGCTTTATGGACAAGTGGGGTGAAAGGATGAGGGAGATGAGCATGCTTTTGAAACAAGTAAAAGAACCTGGTGCTAGAGGGTCTTATCTCAAGGACTCAGAG
AAAGCTGAGATATATCGCTTGCACAAGGAAAACCCTGAGGTATATACGGTTGAGAAGCTTGCTAAAGATTACAGGATTATAAGGCAAAGGGTTCACGCCATTCTTTGGCT
GAAAGAGCTTGAAGAGGAAGAGGAGAAAAAACTGGGCCACCCCTTGGATGATTCTGTTGAGCTTTTACTCGATACTTGCCCTGAATTCTTTAAGTCCCATGACCGCGAAT
TCCATGTGGCATCCCTTCCGTACAAACCTGATTTCAAGGTTATGCCGGAGGGTTGGGATGGAGAGGTCTTCCGCCATAAATATAGTAGGCGTCGGGCTGCAGATGGGTGG
AAATTCACAATAGAGAAAATGGGACCCCGAGGGAAACGGGGAGGTGGTGGTGGATGGAAGTTCGTCAGCTTGCCTGATGGTTCTAGCAGGCCATTGAACGAAATGGAGAA
GATGTATGTGAGGCGAGAGACACCTCGCCATCGACGTAAAATCCTTCCATGA
mRNA sequence
ATGGTAGGCTTATTGAGTCGGCGAATAAGCCACTCTCACCCGTACAAATCAAAAGACAATGCCTCATGGACAGGAGTCCATAATCCACTCAGGATTAAGGCCAAGTTGCC
TAGGATACCCCCACTCGCATGTCAACTACACGAACGCGTTGGATCATTACGTTTGTATCAAAATACAAAGCGGGTCGTATCTGTAGTGTCACCAGGAAAAGGTTACTGTT
CATTGTCGTTACCTAGCACGGCTGGGTTGGGAATTCCCAGCTGCGGGATTGTCAGAAACATGTGGTGGTGGGTCGCTAGTGGATGGCTGGAGCTTGGCGGGGCGGGTTGG
CGTCTATCGAGGTTGGGCAGCCGCCGCCTACCCCTCAACAGAGTCTCCTTCTTCCTCGCGTTCACGGCCACCGACCTCTTCTCCCTCTCTTCCGTCTTGCTTCGCACGCG
TCGAATCTCTCTCTCCCTCTCGTTTTCTTCAACAGCAACCCGCGGAGTATCAGCGCCGCACCCTCGACAGCTCTGCATCTCGGCGTGTGACGTTCGAGCGGCCGCAGCAG
CGACTATGCGGCTCCTCCGGCCATTTCTTTTCACCCTCGACGATCCAACATCATTCATTCTTTCCCTCGCGTTCTTTCCAGTGCAACTTCAGCCCACGCGATTTCGGCAA
GGAAACACATCGATCTCGAATTTTCCAGCGTCTGAATTGGACCTCGAATTCGTTCCGGGGTACCAATTTGTTGTGGGTGGCACTTGCAGAGAGGGTTATAAATTGACCAT
GCAAAACCTGCATCATTTAATTTGTCGTCTCAGTTCCACTTCTCTCGGCAAAAGCACAAATACTTCGAGACTTCTAAAAGAAAATGTGGGATCTAATTTAATTCTTGATT
CAGTTTCAACATTGAAGCATGCTCAAGGAGCTTGGTTAACCTCTTGGAAAGAGTTCTCTGCAAAATCTGGTGGATTTGATGAAGGTGATGCTAAGAATGAATGGGATAGG
AGTGTTAGTGAATCATTTTCTGGCACCACGTCAGATGATTTAGGTTGGGATTCTGTTTCCTCCTGGTCAACTGGATTGACCAAAGAACATTTTGATGGAGAGGCTGTGGG
CCGCAGGACTAGTGGGGGCGGGGATTCACCAAAATCACCACAGTCTTCATTAGTTTCTGGGTTGCAAGAGATTGAGGACAGAATAAGGGAAATAGATGCGGAAAACCGAA
AAAACGAGGGCTTTATGGACAAGTGGGGTGAAAGGATGAGGGAGATGAGCATGCTTTTGAAACAAGTAAAAGAACCTGGTGCTAGAGGGTCTTATCTCAAGGACTCAGAG
AAAGCTGAGATATATCGCTTGCACAAGGAAAACCCTGAGGTATATACGGTTGAGAAGCTTGCTAAAGATTACAGGATTATAAGGCAAAGGGTTCACGCCATTCTTTGGCT
GAAAGAGCTTGAAGAGGAAGAGGAGAAAAAACTGGGCCACCCCTTGGATGATTCTGTTGAGCTTTTACTCGATACTTGCCCTGAATTCTTTAAGTCCCATGACCGCGAAT
TCCATGTGGCATCCCTTCCGTACAAACCTGATTTCAAGGTTATGCCGGAGGGTTGGGATGGAGAGGTCTTCCGCCATAAATATAGTAGGCGTCGGGCTGCAGATGGGTGG
AAATTCACAATAGAGAAAATGGGACCCCGAGGGAAACGGGGAGGTGGTGGTGGATGGAAGTTCGTCAGCTTGCCTGATGGTTCTAGCAGGCCATTGAACGAAATGGAGAA
GATGTATGTGAGGCGAGAGACACCTCGCCATCGACGTAAAATCCTTCCATGA
Protein sequence
MVGLLSRRISHSHPYKSKDNASWTGVHNPLRIKAKLPRIPPLACQLHERVGSLRLYQNTKRVVSVVSPGKGYCSLSLPSTAGLGIPSCGIVRNMWWWVASGWLELGGAGW
RLSRLGSRRLPLNRVSFFLAFTATDLFSLSSVLLRTRRISLSLSFSSTATRGVSAPHPRQLCISACDVRAAAAATMRLLRPFLFTLDDPTSFILSLAFFPVQLQPTRFRQ
GNTSISNFPASELDLEFVPGYQFVVGGTCREGYKLTMQNLHHLICRLSSTSLGKSTNTSRLLKENVGSNLILDSVSTLKHAQGAWLTSWKEFSAKSGGFDEGDAKNEWDR
SVSESFSGTTSDDLGWDSVSSWSTGLTKEHFDGEAVGRRTSGGGDSPKSPQSSLVSGLQEIEDRIREIDAENRKNEGFMDKWGERMREMSMLLKQVKEPGARGSYLKDSE
KAEIYRLHKENPEVYTVEKLAKDYRIIRQRVHAILWLKELEEEEEKKLGHPLDDSVELLLDTCPEFFKSHDREFHVASLPYKPDFKVMPEGWDGEVFRHKYSRRRAADGW
KFTIEKMGPRGKRGGGGGWKFVSLPDGSSRPLNEMEKMYVRRETPRHRRKILP