; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr020071 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr020071
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionC2H2-type domain-containing protein
Genome locationtig00153446:1886669..1888480
RNA-Seq ExpressionSgr020071
SyntenySgr020071
Gene Ontology termsNA
InterPro domainsIPR013087 - Zinc finger C2H2-type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0046912.1 Transcription factor [Cucumis melo var. makuwa]1.1e-21387.67Show/hide
Query:  MPTVWLSLKKSLHCKSEPSEVHDPKTRKNLGAILTRKT--GGRSGCSRSIANLKDVIH-GSKRHLEKPPSCSPRSIGSSEFLNPITHEVILSNSRCELKI
        MPTVWLSLKKSLHCKSEPSEVHDPK++KNLGAILTRKT  GGRSGCSRSIANLKDVI+ GS+RHL+KPPSCSPRSIGSSEFLNPITHEVILSNS+CELKI
Subjt:  MPTVWLSLKKSLHCKSEPSEVHDPKTRKNLGAILTRKT--GGRSGCSRSIANLKDVIH-GSKRHLEKPPSCSPRSIGSSEFLNPITHEVILSNSRCELKI

Query:  TGFHGGFHDVAAGAATASAADAHGGGSTFVGTLTPGTPGPGGHPTMHYF----RTPSRKFPFKDGSGFGSSNKSGGGGGGGGRGGGCGGVHLSNRISLET
        TGFHGGF            AD HGG STFVGTLTPGTPGPGGHP+MHYF    R+ S+KF F++  GFGSSNKSGGG GGG  G    GVHLSNRISLET
Subjt:  TGFHGGFHDVAAGAATASAADAHGGGSTFVGTLTPGTPGPGGHPTMHYF----RTPSRKFPFKDGSGFGSSNKSGGGGGGGGRGGGCGGVHLSNRISLET

Query:  EVGGNGSSSAVTCHKCGEQFNKWDAAEAHHLSKHAVTELVEGDSSRKIVEIICRTSWLKSENQSGRIERVLKVHNMQRTLARFEEYRETVKMKASKLPKK
        E   NG SSAVTCHKCGEQFNKWDAAEAHHLSKHAVTELVEGDSSRKIVEIICRTSWLKSENQSGRIERVLKVHNMQRTLARFEEYRETVK+KASKLPKK
Subjt:  EVGGNGSSSAVTCHKCGEQFNKWDAAEAHHLSKHAVTELVEGDSSRKIVEIICRTSWLKSENQSGRIERVLKVHNMQRTLARFEEYRETVKMKASKLPKK

Query:  HPRCLADGNELLRFYGTTVACSLGLNGSSSLCISEKCCVCRIIRNGFSAKTEMKEGIGVFTTSTSARAFNSIEESGGEEGLTRKAMIVCRVIAGRVHRPL
        HPRCLADGNELLRFYGTTVAC LGLNGSSSLCISEKCCVCRIIRNGFSAKTEM+EGIGVFTTSTSARAF+SI+ SGG+EG+TRKAMIVCRVIAGRVHRPL
Subjt:  HPRCLADGNELLRFYGTTVACSLGLNGSSSLCISEKCCVCRIIRNGFSAKTEMKEGIGVFTTSTSARAFNSIEESGGEEGLTRKAMIVCRVIAGRVHRPL

Query:  ENIQEMAGQTGFDSLAGKVGLHSNIEELYLLNPRALLPCFVVICKP
        ENIQEMAGQTGFDSLAGKVGLHSNIEELYLLNPRALLPCFVVICKP
Subjt:  ENIQEMAGQTGFDSLAGKVGLHSNIEELYLLNPRALLPCFVVICKP

XP_004149303.1 uncharacterized protein LOC101208986 [Cucumis sativus]1.8e-21387.44Show/hide
Query:  MPTVWLSLKKSLHCKSEPSEVHDPKTRKNLGAILTRKT--GGRSGCSRSIANLKDVIH-GSKRHLEKPPSCSPRSIGSSEFLNPITHEVILSNSRCELKI
        MPTVWLSLKKSLHCKSEPS+VHDPK++KNLGAILTRKT  GGRSGCSRSIANLKDVI+ GSKRHL+KPPSCSPRSIGSSEFLNPITHEVILSNS+CELKI
Subjt:  MPTVWLSLKKSLHCKSEPSEVHDPKTRKNLGAILTRKT--GGRSGCSRSIANLKDVIH-GSKRHLEKPPSCSPRSIGSSEFLNPITHEVILSNSRCELKI

Query:  TGFHGGFHDVAAGAATASAADAHGGGSTFVGTLTPGTPGPGGHPTMHYF----RTPSRKFPFKDGSGFGSSNKSGGGGGGGGRGGGCGGVHLSNRISLET
        TGFHGGF            AD HGG STFVGTLTPGTPGPGGHP+MHYF    R+ S+KF F++  GFGSSNKSGGG GGG RG    G+HLSNRISLET
Subjt:  TGFHGGFHDVAAGAATASAADAHGGGSTFVGTLTPGTPGPGGHPTMHYF----RTPSRKFPFKDGSGFGSSNKSGGGGGGGGRGGGCGGVHLSNRISLET

Query:  EVGGNGSSSAVTCHKCGEQFNKWDAAEAHHLSKHAVTELVEGDSSRKIVEIICRTSWLKSENQSGRIERVLKVHNMQRTLARFEEYRETVKMKASKLPKK
        E   NG SSAVTCHKCGEQFNKWDAAEAHHLSKHAVTELVEGDSSRKIVEIICRTSWLKSENQSGRIERVLKVHNMQRTLARFEEYRETVK+KASKLPKK
Subjt:  EVGGNGSSSAVTCHKCGEQFNKWDAAEAHHLSKHAVTELVEGDSSRKIVEIICRTSWLKSENQSGRIERVLKVHNMQRTLARFEEYRETVKMKASKLPKK

Query:  HPRCLADGNELLRFYGTTVACSLGLNGSSSLCISEKCCVCRIIRNGFSAKTEMKEGIGVFTTSTSARAFNSIEESGGEEGLTRKAMIVCRVIAGRVHRPL
        HPRCLADGNELLRFYGTTV C LGLNGSSSLCISEKCCVCRIIRNGFSAKTEM+EGIGVFTTSTSARAF+SI+ SGG+EG+TRKAMIVCRVIAGRVHRPL
Subjt:  HPRCLADGNELLRFYGTTVACSLGLNGSSSLCISEKCCVCRIIRNGFSAKTEMKEGIGVFTTSTSARAFNSIEESGGEEGLTRKAMIVCRVIAGRVHRPL

Query:  ENIQEMAGQTGFDSLAGKVGLHSNIEELYLLNPRALLPCFVVICKP
        ENIQEMAGQTGFDSLAGKVGLHSNIEELYLLNPRALLPCFVVICKP
Subjt:  ENIQEMAGQTGFDSLAGKVGLHSNIEELYLLNPRALLPCFVVICKP

XP_008452316.1 PREDICTED: uncharacterized protein LOC103493380 [Cucumis melo]1.6e-21487.89Show/hide
Query:  MPTVWLSLKKSLHCKSEPSEVHDPKTRKNLGAILTRKT--GGRSGCSRSIANLKDVIH-GSKRHLEKPPSCSPRSIGSSEFLNPITHEVILSNSRCELKI
        MPTVWLSLKKSLHCKSEPSEVHDPK++KNLGAILTRKT  GGRSGCSRSIANLKDVI+ GS+RHL+KPPSCSPRSIGSSEFLNPITHEVILSNS+CELKI
Subjt:  MPTVWLSLKKSLHCKSEPSEVHDPKTRKNLGAILTRKT--GGRSGCSRSIANLKDVIH-GSKRHLEKPPSCSPRSIGSSEFLNPITHEVILSNSRCELKI

Query:  TGFHGGFHDVAAGAATASAADAHGGGSTFVGTLTPGTPGPGGHPTMHYF----RTPSRKFPFKDGSGFGSSNKSGGGGGGGGRGGGCGGVHLSNRISLET
        TGFHGGF            AD HGG STFVGTLTPGTPGPGGHP+MHYF    R+ S+KF F++  GFGSSNKSGGG GGG RG    GVHLSNRISLET
Subjt:  TGFHGGFHDVAAGAATASAADAHGGGSTFVGTLTPGTPGPGGHPTMHYF----RTPSRKFPFKDGSGFGSSNKSGGGGGGGGRGGGCGGVHLSNRISLET

Query:  EVGGNGSSSAVTCHKCGEQFNKWDAAEAHHLSKHAVTELVEGDSSRKIVEIICRTSWLKSENQSGRIERVLKVHNMQRTLARFEEYRETVKMKASKLPKK
        E   NG SSAVTCHKCGEQFNKWDAAEAHHLSKHAVTELVEGDSSRKIVEIICRTSWLKSENQSGRIERVLKVHNMQRTLARFEEYRETVK+KASKLPKK
Subjt:  EVGGNGSSSAVTCHKCGEQFNKWDAAEAHHLSKHAVTELVEGDSSRKIVEIICRTSWLKSENQSGRIERVLKVHNMQRTLARFEEYRETVKMKASKLPKK

Query:  HPRCLADGNELLRFYGTTVACSLGLNGSSSLCISEKCCVCRIIRNGFSAKTEMKEGIGVFTTSTSARAFNSIEESGGEEGLTRKAMIVCRVIAGRVHRPL
        HPRCLADGNELLRFYGTTVAC LGLNGSSSLCISEKCCVCRIIRNGFSAKTEM+EGIGVFTTSTSARAF+SI+ SGG+EG+TRKAMIVCRVIAGRVHRPL
Subjt:  HPRCLADGNELLRFYGTTVACSLGLNGSSSLCISEKCCVCRIIRNGFSAKTEMKEGIGVFTTSTSARAFNSIEESGGEEGLTRKAMIVCRVIAGRVHRPL

Query:  ENIQEMAGQTGFDSLAGKVGLHSNIEELYLLNPRALLPCFVVICKP
        ENIQEMAGQTGFDSLAGKVGLHSNIEELYLLNPRALLPCFVVICKP
Subjt:  ENIQEMAGQTGFDSLAGKVGLHSNIEELYLLNPRALLPCFVVICKP

XP_022142725.1 uncharacterized protein LOC111012772 [Momordica charantia]4.6e-22591.91Show/hide
Query:  MPTVWLSLKKSLHCKSEPSEVHDPKTRKNLGAILTRKT--GGRSGCSRSIANLKDVIH-GSKRHLEKPPSCSPRSIGSSEFLNPITHEVILSNSRCELKI
        MPTVWLSLKKSLHCKSEPSEVHDPK+RKNLGAILTRKT  GGRSGCSRSIANLKDVIH GS+RHLEKPPSCSPRSIGSSEFLNPITHEVILSNSRCELKI
Subjt:  MPTVWLSLKKSLHCKSEPSEVHDPKTRKNLGAILTRKT--GGRSGCSRSIANLKDVIH-GSKRHLEKPPSCSPRSIGSSEFLNPITHEVILSNSRCELKI

Query:  TGFHGGFHDVAAGAATASAADAHG---GGSTFVGTLTPGTPGPGGHPTMHYFRTPSRKFPFKDGSGFGSSNKSGGGGGGGGRGGGCGGVHLSNRISLETE
        TGFHGG  DVAAG A   AAD HG   GGSTFVGTLTPGTPGPGGHPTMHYFRTPSRKFPF++  GFGSSNKSG GGGGGG GG  GGV+ SNRISLETE
Subjt:  TGFHGGFHDVAAGAATASAADAHG---GGSTFVGTLTPGTPGPGGHPTMHYFRTPSRKFPFKDGSGFGSSNKSGGGGGGGGRGGGCGGVHLSNRISLETE

Query:  VGGNGSSSAVTCHKCGEQFNKWDAAEAHHLSKHAVTELVEGDSSRKIVEIICRTSWLKSENQSGRIERVLKVHNMQRTLARFEEYRETVKMKASKLPKKH
        +GGNGSSS VTCHKCGEQFNKW+AAEAHHLSKHAVTELVEGDSSRKIVEIICRTSWLKSENQ+GRIERVLKVHNMQRTLARFEEYRETVK KASKLPKKH
Subjt:  VGGNGSSSAVTCHKCGEQFNKWDAAEAHHLSKHAVTELVEGDSSRKIVEIICRTSWLKSENQSGRIERVLKVHNMQRTLARFEEYRETVKMKASKLPKKH

Query:  PRCLADGNELLRFYGTTVACSLGLNGSSSLCISEKCCVCRIIRNGFSAKTEMKEGIGVFTTSTSARAFNSIEESGGEEGLTRKAMIVCRVIAGRVHRPLE
        PRCLADGNELLRFYGTTVACSLGLNGSSSLCISEKCCVCRIIRNGFSAKTEMKEGIGVFTTSTSARAF+SIE SGG+E + RKAMIVCRVIAGRVHRPLE
Subjt:  PRCLADGNELLRFYGTTVACSLGLNGSSSLCISEKCCVCRIIRNGFSAKTEMKEGIGVFTTSTSARAFNSIEESGGEEGLTRKAMIVCRVIAGRVHRPLE

Query:  NIQEMAGQTGFDSLAGKVGLHSNIEELYLLNPRALLPCFVVICKP
        NIQEMAGQTGFDSLAGKVGLHSNIEELYLLNPRALLPCFVVICKP
Subjt:  NIQEMAGQTGFDSLAGKVGLHSNIEELYLLNPRALLPCFVVICKP

XP_038906351.1 uncharacterized protein LOC120092187 [Benincasa hispida]2.6e-22090.18Show/hide
Query:  MPTVWLSLKKSLHCKSEPSEVHDPKTRKNLGAILTRKT-GGRSGCSRSIANLKDVIH-GSKRHLEKPPSCSPRSIGSSEFLNPITHEVILSNSRCELKIT
        MPTVWLSLKKSLHCKSEPSEVHDPK RKNLGAILTRKT GGRSGCSRSIANLKDVIH GSKRHL+KPPSCSPRSIGSSEFLNPITHEVILSNS+CELKIT
Subjt:  MPTVWLSLKKSLHCKSEPSEVHDPKTRKNLGAILTRKT-GGRSGCSRSIANLKDVIH-GSKRHLEKPPSCSPRSIGSSEFLNPITHEVILSNSRCELKIT

Query:  GFHGGFHDVAAGAATASAADAHG---GGSTFVGTLTPGTPGPGGHPTMHYF----RTPSRKFPFKDGSGFGSSNKSGGGGGGGGRGGGCGGVHLSNRISL
        GFHGGF           AAD HG   GGSTFVGTLTPGTPGPGGHP+MHYF    R+ S+KFPF++  GFGSSNKSGGG GGG RGGG GGVHLSNRI L
Subjt:  GFHGGFHDVAAGAATASAADAHG---GGSTFVGTLTPGTPGPGGHPTMHYF----RTPSRKFPFKDGSGFGSSNKSGGGGGGGGRGGGCGGVHLSNRISL

Query:  ETEVGGNGSSSAVTCHKCGEQFNKWDAAEAHHLSKHAVTELVEGDSSRKIVEIICRTSWLKSENQSGRIERVLKVHNMQRTLARFEEYRETVKMKASKLP
        ETE   NGSSSAVTCHKCGEQFNKWDAAEAHHLSKHAVTELVEGDSSRKIVEIICRTSWLKSENQSGRIERVLKVHNMQRTLARFEEYRETVK+KASKLP
Subjt:  ETEVGGNGSSSAVTCHKCGEQFNKWDAAEAHHLSKHAVTELVEGDSSRKIVEIICRTSWLKSENQSGRIERVLKVHNMQRTLARFEEYRETVKMKASKLP

Query:  KKHPRCLADGNELLRFYGTTVACSLGLNGSSSLCISEKCCVCRIIRNGFSAKTEMKEGIGVFTTSTSARAFNSIEESGGEEGLTRKAMIVCRVIAGRVHR
        KKHPRCLADGNELLRFYGTTVACSLGLNGSSSLCISEKCCVCRIIRNGFSAKTEMKEGIGVFTTSTSARAFNSIE SGG+E  TRKAMIVCRVIAGRVHR
Subjt:  KKHPRCLADGNELLRFYGTTVACSLGLNGSSSLCISEKCCVCRIIRNGFSAKTEMKEGIGVFTTSTSARAFNSIEESGGEEGLTRKAMIVCRVIAGRVHR

Query:  PLENIQEMAGQTGFDSLAGKVGLHSNIEELYLLNPRALLPCFVVICKP
        PLENIQEMAGQTGFDSLAGKVGLHSNIEELYLLNPRALLPCFVVICKP
Subjt:  PLENIQEMAGQTGFDSLAGKVGLHSNIEELYLLNPRALLPCFVVICKP

TrEMBL top hitse value%identityAlignment
A0A0A0L585 C2H2-type domain-containing protein8.8e-21487.44Show/hide
Query:  MPTVWLSLKKSLHCKSEPSEVHDPKTRKNLGAILTRKT--GGRSGCSRSIANLKDVIH-GSKRHLEKPPSCSPRSIGSSEFLNPITHEVILSNSRCELKI
        MPTVWLSLKKSLHCKSEPS+VHDPK++KNLGAILTRKT  GGRSGCSRSIANLKDVI+ GSKRHL+KPPSCSPRSIGSSEFLNPITHEVILSNS+CELKI
Subjt:  MPTVWLSLKKSLHCKSEPSEVHDPKTRKNLGAILTRKT--GGRSGCSRSIANLKDVIH-GSKRHLEKPPSCSPRSIGSSEFLNPITHEVILSNSRCELKI

Query:  TGFHGGFHDVAAGAATASAADAHGGGSTFVGTLTPGTPGPGGHPTMHYF----RTPSRKFPFKDGSGFGSSNKSGGGGGGGGRGGGCGGVHLSNRISLET
        TGFHGGF            AD HGG STFVGTLTPGTPGPGGHP+MHYF    R+ S+KF F++  GFGSSNKSGGG GGG RG    G+HLSNRISLET
Subjt:  TGFHGGFHDVAAGAATASAADAHGGGSTFVGTLTPGTPGPGGHPTMHYF----RTPSRKFPFKDGSGFGSSNKSGGGGGGGGRGGGCGGVHLSNRISLET

Query:  EVGGNGSSSAVTCHKCGEQFNKWDAAEAHHLSKHAVTELVEGDSSRKIVEIICRTSWLKSENQSGRIERVLKVHNMQRTLARFEEYRETVKMKASKLPKK
        E   NG SSAVTCHKCGEQFNKWDAAEAHHLSKHAVTELVEGDSSRKIVEIICRTSWLKSENQSGRIERVLKVHNMQRTLARFEEYRETVK+KASKLPKK
Subjt:  EVGGNGSSSAVTCHKCGEQFNKWDAAEAHHLSKHAVTELVEGDSSRKIVEIICRTSWLKSENQSGRIERVLKVHNMQRTLARFEEYRETVKMKASKLPKK

Query:  HPRCLADGNELLRFYGTTVACSLGLNGSSSLCISEKCCVCRIIRNGFSAKTEMKEGIGVFTTSTSARAFNSIEESGGEEGLTRKAMIVCRVIAGRVHRPL
        HPRCLADGNELLRFYGTTV C LGLNGSSSLCISEKCCVCRIIRNGFSAKTEM+EGIGVFTTSTSARAF+SI+ SGG+EG+TRKAMIVCRVIAGRVHRPL
Subjt:  HPRCLADGNELLRFYGTTVACSLGLNGSSSLCISEKCCVCRIIRNGFSAKTEMKEGIGVFTTSTSARAFNSIEESGGEEGLTRKAMIVCRVIAGRVHRPL

Query:  ENIQEMAGQTGFDSLAGKVGLHSNIEELYLLNPRALLPCFVVICKP
        ENIQEMAGQTGFDSLAGKVGLHSNIEELYLLNPRALLPCFVVICKP
Subjt:  ENIQEMAGQTGFDSLAGKVGLHSNIEELYLLNPRALLPCFVVICKP

A0A1S3BUD2 uncharacterized protein LOC1034933808.0e-21587.89Show/hide
Query:  MPTVWLSLKKSLHCKSEPSEVHDPKTRKNLGAILTRKT--GGRSGCSRSIANLKDVIH-GSKRHLEKPPSCSPRSIGSSEFLNPITHEVILSNSRCELKI
        MPTVWLSLKKSLHCKSEPSEVHDPK++KNLGAILTRKT  GGRSGCSRSIANLKDVI+ GS+RHL+KPPSCSPRSIGSSEFLNPITHEVILSNS+CELKI
Subjt:  MPTVWLSLKKSLHCKSEPSEVHDPKTRKNLGAILTRKT--GGRSGCSRSIANLKDVIH-GSKRHLEKPPSCSPRSIGSSEFLNPITHEVILSNSRCELKI

Query:  TGFHGGFHDVAAGAATASAADAHGGGSTFVGTLTPGTPGPGGHPTMHYF----RTPSRKFPFKDGSGFGSSNKSGGGGGGGGRGGGCGGVHLSNRISLET
        TGFHGGF            AD HGG STFVGTLTPGTPGPGGHP+MHYF    R+ S+KF F++  GFGSSNKSGGG GGG RG    GVHLSNRISLET
Subjt:  TGFHGGFHDVAAGAATASAADAHGGGSTFVGTLTPGTPGPGGHPTMHYF----RTPSRKFPFKDGSGFGSSNKSGGGGGGGGRGGGCGGVHLSNRISLET

Query:  EVGGNGSSSAVTCHKCGEQFNKWDAAEAHHLSKHAVTELVEGDSSRKIVEIICRTSWLKSENQSGRIERVLKVHNMQRTLARFEEYRETVKMKASKLPKK
        E   NG SSAVTCHKCGEQFNKWDAAEAHHLSKHAVTELVEGDSSRKIVEIICRTSWLKSENQSGRIERVLKVHNMQRTLARFEEYRETVK+KASKLPKK
Subjt:  EVGGNGSSSAVTCHKCGEQFNKWDAAEAHHLSKHAVTELVEGDSSRKIVEIICRTSWLKSENQSGRIERVLKVHNMQRTLARFEEYRETVKMKASKLPKK

Query:  HPRCLADGNELLRFYGTTVACSLGLNGSSSLCISEKCCVCRIIRNGFSAKTEMKEGIGVFTTSTSARAFNSIEESGGEEGLTRKAMIVCRVIAGRVHRPL
        HPRCLADGNELLRFYGTTVAC LGLNGSSSLCISEKCCVCRIIRNGFSAKTEM+EGIGVFTTSTSARAF+SI+ SGG+EG+TRKAMIVCRVIAGRVHRPL
Subjt:  HPRCLADGNELLRFYGTTVACSLGLNGSSSLCISEKCCVCRIIRNGFSAKTEMKEGIGVFTTSTSARAFNSIEESGGEEGLTRKAMIVCRVIAGRVHRPL

Query:  ENIQEMAGQTGFDSLAGKVGLHSNIEELYLLNPRALLPCFVVICKP
        ENIQEMAGQTGFDSLAGKVGLHSNIEELYLLNPRALLPCFVVICKP
Subjt:  ENIQEMAGQTGFDSLAGKVGLHSNIEELYLLNPRALLPCFVVICKP

A0A5A7TTX7 Transcription factor5.2e-21487.67Show/hide
Query:  MPTVWLSLKKSLHCKSEPSEVHDPKTRKNLGAILTRKT--GGRSGCSRSIANLKDVIH-GSKRHLEKPPSCSPRSIGSSEFLNPITHEVILSNSRCELKI
        MPTVWLSLKKSLHCKSEPSEVHDPK++KNLGAILTRKT  GGRSGCSRSIANLKDVI+ GS+RHL+KPPSCSPRSIGSSEFLNPITHEVILSNS+CELKI
Subjt:  MPTVWLSLKKSLHCKSEPSEVHDPKTRKNLGAILTRKT--GGRSGCSRSIANLKDVIH-GSKRHLEKPPSCSPRSIGSSEFLNPITHEVILSNSRCELKI

Query:  TGFHGGFHDVAAGAATASAADAHGGGSTFVGTLTPGTPGPGGHPTMHYF----RTPSRKFPFKDGSGFGSSNKSGGGGGGGGRGGGCGGVHLSNRISLET
        TGFHGGF            AD HGG STFVGTLTPGTPGPGGHP+MHYF    R+ S+KF F++  GFGSSNKSGGG GGG  G    GVHLSNRISLET
Subjt:  TGFHGGFHDVAAGAATASAADAHGGGSTFVGTLTPGTPGPGGHPTMHYF----RTPSRKFPFKDGSGFGSSNKSGGGGGGGGRGGGCGGVHLSNRISLET

Query:  EVGGNGSSSAVTCHKCGEQFNKWDAAEAHHLSKHAVTELVEGDSSRKIVEIICRTSWLKSENQSGRIERVLKVHNMQRTLARFEEYRETVKMKASKLPKK
        E   NG SSAVTCHKCGEQFNKWDAAEAHHLSKHAVTELVEGDSSRKIVEIICRTSWLKSENQSGRIERVLKVHNMQRTLARFEEYRETVK+KASKLPKK
Subjt:  EVGGNGSSSAVTCHKCGEQFNKWDAAEAHHLSKHAVTELVEGDSSRKIVEIICRTSWLKSENQSGRIERVLKVHNMQRTLARFEEYRETVKMKASKLPKK

Query:  HPRCLADGNELLRFYGTTVACSLGLNGSSSLCISEKCCVCRIIRNGFSAKTEMKEGIGVFTTSTSARAFNSIEESGGEEGLTRKAMIVCRVIAGRVHRPL
        HPRCLADGNELLRFYGTTVAC LGLNGSSSLCISEKCCVCRIIRNGFSAKTEM+EGIGVFTTSTSARAF+SI+ SGG+EG+TRKAMIVCRVIAGRVHRPL
Subjt:  HPRCLADGNELLRFYGTTVACSLGLNGSSSLCISEKCCVCRIIRNGFSAKTEMKEGIGVFTTSTSARAFNSIEESGGEEGLTRKAMIVCRVIAGRVHRPL

Query:  ENIQEMAGQTGFDSLAGKVGLHSNIEELYLLNPRALLPCFVVICKP
        ENIQEMAGQTGFDSLAGKVGLHSNIEELYLLNPRALLPCFVVICKP
Subjt:  ENIQEMAGQTGFDSLAGKVGLHSNIEELYLLNPRALLPCFVVICKP

A0A5D3BUU3 Transcription factor8.0e-21587.89Show/hide
Query:  MPTVWLSLKKSLHCKSEPSEVHDPKTRKNLGAILTRKT--GGRSGCSRSIANLKDVIH-GSKRHLEKPPSCSPRSIGSSEFLNPITHEVILSNSRCELKI
        MPTVWLSLKKSLHCKSEPSEVHDPK++KNLGAILTRKT  GGRSGCSRSIANLKDVI+ GS+RHL+KPPSCSPRSIGSSEFLNPITHEVILSNS+CELKI
Subjt:  MPTVWLSLKKSLHCKSEPSEVHDPKTRKNLGAILTRKT--GGRSGCSRSIANLKDVIH-GSKRHLEKPPSCSPRSIGSSEFLNPITHEVILSNSRCELKI

Query:  TGFHGGFHDVAAGAATASAADAHGGGSTFVGTLTPGTPGPGGHPTMHYF----RTPSRKFPFKDGSGFGSSNKSGGGGGGGGRGGGCGGVHLSNRISLET
        TGFHGGF            AD HGG STFVGTLTPGTPGPGGHP+MHYF    R+ S+KF F++  GFGSSNKSGGG GGG RG    GVHLSNRISLET
Subjt:  TGFHGGFHDVAAGAATASAADAHGGGSTFVGTLTPGTPGPGGHPTMHYF----RTPSRKFPFKDGSGFGSSNKSGGGGGGGGRGGGCGGVHLSNRISLET

Query:  EVGGNGSSSAVTCHKCGEQFNKWDAAEAHHLSKHAVTELVEGDSSRKIVEIICRTSWLKSENQSGRIERVLKVHNMQRTLARFEEYRETVKMKASKLPKK
        E   NG SSAVTCHKCGEQFNKWDAAEAHHLSKHAVTELVEGDSSRKIVEIICRTSWLKSENQSGRIERVLKVHNMQRTLARFEEYRETVK+KASKLPKK
Subjt:  EVGGNGSSSAVTCHKCGEQFNKWDAAEAHHLSKHAVTELVEGDSSRKIVEIICRTSWLKSENQSGRIERVLKVHNMQRTLARFEEYRETVKMKASKLPKK

Query:  HPRCLADGNELLRFYGTTVACSLGLNGSSSLCISEKCCVCRIIRNGFSAKTEMKEGIGVFTTSTSARAFNSIEESGGEEGLTRKAMIVCRVIAGRVHRPL
        HPRCLADGNELLRFYGTTVAC LGLNGSSSLCISEKCCVCRIIRNGFSAKTEM+EGIGVFTTSTSARAF+SI+ SGG+EG+TRKAMIVCRVIAGRVHRPL
Subjt:  HPRCLADGNELLRFYGTTVACSLGLNGSSSLCISEKCCVCRIIRNGFSAKTEMKEGIGVFTTSTSARAFNSIEESGGEEGLTRKAMIVCRVIAGRVHRPL

Query:  ENIQEMAGQTGFDSLAGKVGLHSNIEELYLLNPRALLPCFVVICKP
        ENIQEMAGQTGFDSLAGKVGLHSNIEELYLLNPRALLPCFVVICKP
Subjt:  ENIQEMAGQTGFDSLAGKVGLHSNIEELYLLNPRALLPCFVVICKP

A0A6J1CLR5 uncharacterized protein LOC1110127722.2e-22591.91Show/hide
Query:  MPTVWLSLKKSLHCKSEPSEVHDPKTRKNLGAILTRKT--GGRSGCSRSIANLKDVIH-GSKRHLEKPPSCSPRSIGSSEFLNPITHEVILSNSRCELKI
        MPTVWLSLKKSLHCKSEPSEVHDPK+RKNLGAILTRKT  GGRSGCSRSIANLKDVIH GS+RHLEKPPSCSPRSIGSSEFLNPITHEVILSNSRCELKI
Subjt:  MPTVWLSLKKSLHCKSEPSEVHDPKTRKNLGAILTRKT--GGRSGCSRSIANLKDVIH-GSKRHLEKPPSCSPRSIGSSEFLNPITHEVILSNSRCELKI

Query:  TGFHGGFHDVAAGAATASAADAHG---GGSTFVGTLTPGTPGPGGHPTMHYFRTPSRKFPFKDGSGFGSSNKSGGGGGGGGRGGGCGGVHLSNRISLETE
        TGFHGG  DVAAG A   AAD HG   GGSTFVGTLTPGTPGPGGHPTMHYFRTPSRKFPF++  GFGSSNKSG GGGGGG GG  GGV+ SNRISLETE
Subjt:  TGFHGGFHDVAAGAATASAADAHG---GGSTFVGTLTPGTPGPGGHPTMHYFRTPSRKFPFKDGSGFGSSNKSGGGGGGGGRGGGCGGVHLSNRISLETE

Query:  VGGNGSSSAVTCHKCGEQFNKWDAAEAHHLSKHAVTELVEGDSSRKIVEIICRTSWLKSENQSGRIERVLKVHNMQRTLARFEEYRETVKMKASKLPKKH
        +GGNGSSS VTCHKCGEQFNKW+AAEAHHLSKHAVTELVEGDSSRKIVEIICRTSWLKSENQ+GRIERVLKVHNMQRTLARFEEYRETVK KASKLPKKH
Subjt:  VGGNGSSSAVTCHKCGEQFNKWDAAEAHHLSKHAVTELVEGDSSRKIVEIICRTSWLKSENQSGRIERVLKVHNMQRTLARFEEYRETVKMKASKLPKKH

Query:  PRCLADGNELLRFYGTTVACSLGLNGSSSLCISEKCCVCRIIRNGFSAKTEMKEGIGVFTTSTSARAFNSIEESGGEEGLTRKAMIVCRVIAGRVHRPLE
        PRCLADGNELLRFYGTTVACSLGLNGSSSLCISEKCCVCRIIRNGFSAKTEMKEGIGVFTTSTSARAF+SIE SGG+E + RKAMIVCRVIAGRVHRPLE
Subjt:  PRCLADGNELLRFYGTTVACSLGLNGSSSLCISEKCCVCRIIRNGFSAKTEMKEGIGVFTTSTSARAFNSIEESGGEEGLTRKAMIVCRVIAGRVHRPLE

Query:  NIQEMAGQTGFDSLAGKVGLHSNIEELYLLNPRALLPCFVVICKP
        NIQEMAGQTGFDSLAGKVGLHSNIEELYLLNPRALLPCFVVICKP
Subjt:  NIQEMAGQTGFDSLAGKVGLHSNIEELYLLNPRALLPCFVVICKP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G11490.1 zinc finger (C2H2 type) family protein2.5e-5945.52Show/hide
Query:  FKDGSGFGSSNKSGGGGGGGGRGGGCGGVHLSNRISLETEVGGNG--SSSAVTCHKCGEQFNKWDAAEAHHLSKHAVTELVEGDSSRKIVEIICRTSWLK
        F++ +G+    K       G   G      L  R S   +V G+       + C KC E+    DA EAH+LS H+V  L+ GD SR  VE+IC T +  
Subjt:  FKDGSGFGSSNKSGGGGGGGGRGGGCGGVHLSNRISLETEVGGNG--SSSAVTCHKCGEQFNKWDAAEAHHLSKHAVTELVEGDSSRKIVEIICRTSWLK

Query:  --SENQSGRIERVLKVHNMQRTLARFEEYRETVKMKASKLPKKHPRCLADGNELLRFYGTTVACSLGL-NGSSSLCISEKCCVCRIIRNGFSAKTEMKEG
           + +   I  + K+ N+QR +A FE+YRE VK++A+KL KKH RC+ADGNE L F+GTT++C+LG  N SS+LC S+ C VC I+R+GFS KT     
Subjt:  --SENQSGRIERVLKVHNMQRTLARFEEYRETVKMKASKLPKKHPRCLADGNELLRFYGTTVACSLGL-NGSSSLCISEKCCVCRIIRNGFSAKTEMKEG

Query:  IGVFTTSTSARAFNSIEESGGEEGLTRKAMIVCRVIAGRVHRPLENIQEMAGQTGFDSLAGKVGLHSNIEELYLLNPRALLPCFVVICKP
         GV T STS+ A  SIE   G    +  A+++CRVIAGRVH+P++  +   G + FDSLA KVG +S IEELYLL+ +ALLPCFV+I KP
Subjt:  IGVFTTSTSARAFNSIEESGGEEGLTRKAMIVCRVIAGRVHRPLENIQEMAGQTGFDSLAGKVGLHSNIEELYLLNPRALLPCFVVICKP

AT1G75710.1 C2H2-like zinc finger protein1.6e-5335.32Show/hide
Query:  PTVWLSLKKSLHCKS-EPSEVHDPKTRKNLGAILTRKTGGR---SGCSRSIANLKDVIHGSKRHLEKPPSCSPRSIGSSEFLNPITHEVILSNSRCELKI
        P+ W  +K  L CK  E S VHDP      G  +T         S CS SI + +DV HG+ R + +    SP  +G+S   N        S +R   + 
Subjt:  PTVWLSLKKSLHCKS-EPSEVHDPKTRKNLGAILTRKTGGR---SGCSRSIANLKDVIHGSKRHLEKPPSCSPRSIGSSEFLNPITHEVILSNSRCELKI

Query:  TGFHG--GFHDVAAGAATASAADAHGGGSTFVGTLTPGTPGPGGHPTMHYFRTPSRKFPFKDGSGFGSSNKSGGGGGGGGRGGGCGGVHL---SNRISLE
         G HG      + +G+  ++A+ ++   ST                      T  R   F+  S                   GC   H+    +R  + 
Subjt:  TGFHG--GFHDVAAGAATASAADAHGGGSTFVGTLTPGTPGPGGHPTMHYFRTPSRKFPFKDGSGFGSSNKSGGGGGGGGRGGGCGGVHL---SNRISLE

Query:  TEVGGNGSSSAVTCHKCGEQFNKWDAAEAHHLSKHAVTELVEGDSSRKIVEIICRTSWLKSENQSGRIERVLKVHNMQRTLARFEEYRETVKMKASKLPK
          V          C +CGE F K ++ E H   +HAV+EL   DS R IVEII ++SWLK ++   +IER+LKVHN QRT+ RFE+ R+ VK +A +  +
Subjt:  TEVGGNGSSSAVTCHKCGEQFNKWDAAEAHHLSKHAVTELVEGDSSRKIVEIICRTSWLKSENQSGRIERVLKVHNMQRTLARFEEYRETVKMKASKLPK

Query:  KHPRCLADGNELLRFYGTTVACSLGLNGSSSLCISEKCC-VCRIIRNGFSAKT----EMKEGIGVFTTSTSARAFNSIEESGGEEGLTRKAMIVCRVIAG
        K  RC ADGNELLRF+ TT+ CSLG  GSSSLC +   C VC +IR+GF  K+          GV TT++S RA + +  S       R+ M+VCRVIAG
Subjt:  KHPRCLADGNELLRFYGTTVACSLGLNGSSSLCISEKCC-VCRIIRNGFSAKT----EMKEGIGVFTTSTSARAFNSIEESGGEEGLTRKAMIVCRVIAG

Query:  RVHR---PLENIQEMAGQTG----------------FDSLAGKVGLHSNIEELYLLNPRALLPCFVVICK
        RV R   P  +    A +                  FDS+A   G++SN+EEL + NPRA+LPCFVVI K
Subjt:  RVHR---PLENIQEMAGQTG----------------FDSLAGKVGLHSNIEELYLLNPRALLPCFVVICK

AT2G29660.1 zinc finger (C2H2 type) family protein1.6e-4542.48Show/hide
Query:  RISLETEVGGNGSSSAVTCHKCGEQFNKWDAAEAHHLSKHAVTELVEGDSSRKIVEIICRTSWLKSEN-QSGRIERVLKVHNMQRTLARFEEYRETVKMK
        RI  +TE   + S     C+ CGE F K +  E H   KHAV+EL+ G+SS  IV+II ++ W +  N +S  I R+LK+HN  + L RFEEYRE VK K
Subjt:  RISLETEVGGNGSSSAVTCHKCGEQFNKWDAAEAHHLSKHAVTELVEGDSSRKIVEIICRTSWLKSEN-QSGRIERVLKVHNMQRTLARFEEYRETVKMK

Query:  ASKLPK-----KHPRCLADGNELLRFYGTTVACSLGLNGSSSLCISEKCCVCRIIRNGFSAKTEMKEGIGVFTTSTSARAFNSIEESGGEE-GL--TRKA
        A++           RC+ADGNELLRFY +T  C LG NG S+LC  + C +C II +GFS K +     G+ T +T  R   ++ E   EE G    ++A
Subjt:  ASKLPK-----KHPRCLADGNELLRFYGTTVACSLGLNGSSSLCISEKCCVCRIIRNGFSAKTEMKEGIGVFTTSTSARAFNSIEESGGEE-GL--TRKA

Query:  MIVCRVIAGRVHRPL--ENIQEMAGQTGFDSLAGKVG------LHSNIEELYLLNPRALLPCFVVI
        M+VCRV+AGRV   L  ++  + +   G+DSL G+ G      L  + +EL + NPRA+LPCFV++
Subjt:  MIVCRVIAGRVHRPL--ENIQEMAGQTGFDSLAGKVG------LHSNIEELYLLNPRALLPCFVVI

AT4G27240.1 zinc finger (C2H2 type) family protein9.8e-14963.84Show/hide
Query:  MPTVWLSLKKSLHCKSEPSEVHDPKTRKNLGAILTRKT--------GGRSGCSRSIANLKDVIHGSKRHLEKPPSCSPRSIGSSEFLNPITHEVILSNSR
        +P+VW SLKKSL CKS+ S+VH P+++K L  I T++T        GGRSGCSRSIANLKDVIHG++RHLEKP   SPRSIGSSEFLNPITH+VI SNS 
Subjt:  MPTVWLSLKKSLHCKSEPSEVHDPKTRKNLGAILTRKT--------GGRSGCSRSIANLKDVIHGSKRHLEKPPSCSPRSIGSSEFLNPITHEVILSNSR

Query:  CELKITGFHGGFHDVAAGAATASAADAHGGGSTFVGTLTPGTPGPGGHPTMHYFRTPSRKFPFKDGSGFGSSNKSGGGGGGGGRGGGCGGVHLSNRISLE
        CELKIT         AAGA            + FVG L PGT  P  + +    +T SRK    D  G                     G H S R + +
Subjt:  CELKITGFHGGFHDVAAGAATASAADAHGGGSTFVGTLTPGTPGPGGHPTMHYFRTPSRKFPFKDGSGFGSSNKSGGGGGGGGRGGGCGGVHLSNRISLE

Query:  TEVGGNGSSSAVTCHKCGEQFNKWDAAEAHHLSKHAVTELVEGDSSRKIVEIICRTSWLKSENQSGRIERVLKVHNMQRTLARFEEYRETVKMKASKLPK
         E   NG +S+V+CHKCGE+F+K +AAEAHHL+KHAVTEL+EGDSSR+IVEIICRTSWLK+ENQ GRI+R+LKVHNMQ+TLARFEEYR+TVK++ASKL K
Subjt:  TEVGGNGSSSAVTCHKCGEQFNKWDAAEAHHLSKHAVTELVEGDSSRKIVEIICRTSWLKSENQSGRIERVLKVHNMQRTLARFEEYRETVKMKASKLPK

Query:  KHPRCLADGNELLRFYGTTVACSLGLNGSSSLCISEKCCVCRIIRNGFSAKTEMKEGIGVFTTSTSARAFNSIEESGGEEGLTRKAMIVCRVIAGRVHRP
        KHPRC+ADGNELLRF+GTTVAC+LG+NGS+SLC SEKCCVCRIIRNGFSAK EM  GIGVFT STS RAF SI    G  G  RKA+IVCRVIAGRVHRP
Subjt:  KHPRCLADGNELLRFYGTTVACSLGLNGSSSLCISEKCCVCRIIRNGFSAKTEMKEGIGVFTTSTSARAFNSIEESGGEEGLTRKAMIVCRVIAGRVHRP

Query:  LENIQEMAG-QTGFDSLAGKVGLHSNIEELYLLNPRALLPCFVVICKP
        +EN++EM G  +GFDSLAGKVGL++N+EELYLLN RALLPCFV+ICKP
Subjt:  LENIQEMAG-QTGFDSLAGKVGLHSNIEELYLLNPRALLPCFVVICKP

AT5G54630.1 zinc finger protein-related1.5e-16066.38Show/hide
Query:  MPTVWLSLKKSLHCKSEPSEVHDP----KTRKNLGAILTRKT------------GGRSGCSRSIANLKDVIHGSKRHLEKPPSCSPRSIGSSEFLNPITH
        +PTVW SLKKSLHCKSEPS+VHDP    K +++L  I T+K             GG SGCSRSIANLKDVIHGSKRH EKPP  SPRSIGS+EFLNPITH
Subjt:  MPTVWLSLKKSLHCKSEPSEVHDP----KTRKNLGAILTRKT------------GGRSGCSRSIANLKDVIHGSKRHLEKPPSCSPRSIGSSEFLNPITH

Query:  EVILSNSRCELKITGFHGGFHDVAAGAATASAADAHGGGSTFVGTLTPGTPGPGGHPTMHYF-RTPSRKFPFKDGSGFGSSNKSGGGGGGGGRGGGCGGV
        EVILSNS CELKITG       V A  +       +G  +T+VG L PGTP       MHY   + S +   + GS   S    GGGGGG G      G 
Subjt:  EVILSNSRCELKITGFHGGFHDVAAGAATASAADAHGGGSTFVGTLTPGTPGPGGHPTMHYF-RTPSRKFPFKDGSGFGSSNKSGGGGGGGGRGGGCGGV

Query:  HLSNRISLE-----TEVGGNGSSSAVTCHKCGEQFNKWDAAEAHHLSKHAVTELVEGDSSRKIVEIICRTSWLKSENQSGRIERVLKVHNMQRTLARFEE
        H + R+SLE     T  GGN SS  V+CHKCGEQFNK +AAEAHHLSKHAVTELVEGDSSRKIVEIICRTSWLKSENQ GRI+RVLKVHNMQ+TLARFEE
Subjt:  HLSNRISLE-----TEVGGNGSSSAVTCHKCGEQFNKWDAAEAHHLSKHAVTELVEGDSSRKIVEIICRTSWLKSENQSGRIERVLKVHNMQRTLARFEE

Query:  YRETVKMKASKLPKKHPRCLADGNELLRFYGTTVACSLGLNGSSSLCISEKCCVCRIIRNGFSAKTEMKEGIGVFTTSTSARAFNSIEESGGEEG-----
        YRETVK++ASKL KKHPRCLADGNELLRF+GTTVAC LG+NGS+S+C +EKCCVCRIIRNGFS+K E   G+GVFT STS RAF SI  +GG+E      
Subjt:  YRETVKMKASKLPKKHPRCLADGNELLRFYGTTVACSLGLNGSSSLCISEKCCVCRIIRNGFSAKTEMKEGIGVFTTSTSARAFNSIEESGGEEG-----

Query:  LTRKAMIVCRVIAGRVHRPLENIQEMAG-QTGFDSLAGKVGLHSNIEELYLLNPRALLPCFVVICKP
          RK +IVCRVIAGRVHRP+EN++EM G  +GFDSLAGKVGL++N+EELYLLNP+ALLPCFVVICKP
Subjt:  LTRKAMIVCRVIAGRVHRPLENIQEMAG-QTGFDSLAGKVGLHSNIEELYLLNPRALLPCFVVICKP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCAACAGTTTGGTTGTCTTTGAAGAAATCTTTGCACTGCAAATCGGAGCCATCAGAAGTTCATGACCCAAAAACCAGAAAGAATTTGGGAGCAATTTTGACAAGAAA
AACAGGAGGAAGATCAGGGTGTTCAAGGTCCATAGCAAATCTCAAAGATGTCATCCATGGCAGCAAGAGGCATTTGGAGAAGCCCCCAAGCTGCAGCCCAAGATCGATTG
GCAGCAGTGAGTTTCTCAACCCAATAACCCATGAAGTCATTCTCAGTAACTCAAGGTGTGAGCTCAAAATCACCGGTTTTCATGGAGGTTTCCACGATGTGGCTGCCGGC
GCCGCCACCGCTTCTGCCGCCGATGCTCACGGTGGCGGTTCCACTTTTGTGGGTACTCTGACGCCGGGCACGCCAGGCCCTGGCGGCCACCCCACAATGCATTACTTTAG
GACTCCTTCGAGAAAGTTCCCTTTTAAAGACGGCTCGGGGTTTGGCAGCTCTAATAAATCTGGCGGCGGCGGTGGGGGAGGAGGTCGCGGCGGCGGCTGTGGTGGAGTTC
ATTTAAGCAACAGAATCTCTCTGGAGACAGAGGTCGGTGGGAATGGGTCTTCTTCTGCCGTTACTTGCCATAAATGTGGAGAGCAGTTCAACAAATGGGATGCTGCTGAA
GCACATCATCTCTCTAAACATGCTGTGACTGAGCTAGTGGAAGGAGATTCATCAAGGAAAATTGTGGAGATCATATGCAGGACAAGCTGGTTAAAGTCTGAGAATCAAAG
TGGTAGAATAGAAAGAGTCCTGAAAGTTCACAACATGCAACGAACGCTTGCCCGGTTCGAGGAGTATCGCGAGACGGTAAAGATGAAAGCCAGCAAGCTCCCGAAGAAGC
ACCCTCGATGCCTCGCCGATGGGAACGAGCTCCTGAGATTCTATGGCACGACAGTCGCGTGCTCGCTCGGCCTGAATGGCTCCTCCAGCCTCTGCATATCAGAGAAATGC
TGTGTTTGCAGGATCATCCGAAACGGGTTCTCAGCAAAGACAGAGATGAAAGAAGGGATAGGCGTTTTCACGACTTCCACGAGCGCGAGAGCGTTCAACTCGATCGAAGA
GTCGGGAGGGGAGGAGGGGTTGACGAGGAAGGCGATGATAGTTTGCAGGGTGATTGCTGGGAGAGTTCACAGGCCATTGGAGAACATACAAGAAATGGCTGGGCAGACAG
GGTTTGATTCATTGGCTGGGAAAGTTGGGCTGCATTCAAATATAGAAGAGCTTTATTTGCTGAATCCTAGAGCTTTGCTTCCATGTTTTGTGGTAATTTGCAAACCATGA
mRNA sequenceShow/hide mRNA sequence
ATGCCAACAGTTTGGTTGTCTTTGAAGAAATCTTTGCACTGCAAATCGGAGCCATCAGAAGTTCATGACCCAAAAACCAGAAAGAATTTGGGAGCAATTTTGACAAGAAA
AACAGGAGGAAGATCAGGGTGTTCAAGGTCCATAGCAAATCTCAAAGATGTCATCCATGGCAGCAAGAGGCATTTGGAGAAGCCCCCAAGCTGCAGCCCAAGATCGATTG
GCAGCAGTGAGTTTCTCAACCCAATAACCCATGAAGTCATTCTCAGTAACTCAAGGTGTGAGCTCAAAATCACCGGTTTTCATGGAGGTTTCCACGATGTGGCTGCCGGC
GCCGCCACCGCTTCTGCCGCCGATGCTCACGGTGGCGGTTCCACTTTTGTGGGTACTCTGACGCCGGGCACGCCAGGCCCTGGCGGCCACCCCACAATGCATTACTTTAG
GACTCCTTCGAGAAAGTTCCCTTTTAAAGACGGCTCGGGGTTTGGCAGCTCTAATAAATCTGGCGGCGGCGGTGGGGGAGGAGGTCGCGGCGGCGGCTGTGGTGGAGTTC
ATTTAAGCAACAGAATCTCTCTGGAGACAGAGGTCGGTGGGAATGGGTCTTCTTCTGCCGTTACTTGCCATAAATGTGGAGAGCAGTTCAACAAATGGGATGCTGCTGAA
GCACATCATCTCTCTAAACATGCTGTGACTGAGCTAGTGGAAGGAGATTCATCAAGGAAAATTGTGGAGATCATATGCAGGACAAGCTGGTTAAAGTCTGAGAATCAAAG
TGGTAGAATAGAAAGAGTCCTGAAAGTTCACAACATGCAACGAACGCTTGCCCGGTTCGAGGAGTATCGCGAGACGGTAAAGATGAAAGCCAGCAAGCTCCCGAAGAAGC
ACCCTCGATGCCTCGCCGATGGGAACGAGCTCCTGAGATTCTATGGCACGACAGTCGCGTGCTCGCTCGGCCTGAATGGCTCCTCCAGCCTCTGCATATCAGAGAAATGC
TGTGTTTGCAGGATCATCCGAAACGGGTTCTCAGCAAAGACAGAGATGAAAGAAGGGATAGGCGTTTTCACGACTTCCACGAGCGCGAGAGCGTTCAACTCGATCGAAGA
GTCGGGAGGGGAGGAGGGGTTGACGAGGAAGGCGATGATAGTTTGCAGGGTGATTGCTGGGAGAGTTCACAGGCCATTGGAGAACATACAAGAAATGGCTGGGCAGACAG
GGTTTGATTCATTGGCTGGGAAAGTTGGGCTGCATTCAAATATAGAAGAGCTTTATTTGCTGAATCCTAGAGCTTTGCTTCCATGTTTTGTGGTAATTTGCAAACCATGA
Protein sequenceShow/hide protein sequence
MPTVWLSLKKSLHCKSEPSEVHDPKTRKNLGAILTRKTGGRSGCSRSIANLKDVIHGSKRHLEKPPSCSPRSIGSSEFLNPITHEVILSNSRCELKITGFHGGFHDVAAG
AATASAADAHGGGSTFVGTLTPGTPGPGGHPTMHYFRTPSRKFPFKDGSGFGSSNKSGGGGGGGGRGGGCGGVHLSNRISLETEVGGNGSSSAVTCHKCGEQFNKWDAAE
AHHLSKHAVTELVEGDSSRKIVEIICRTSWLKSENQSGRIERVLKVHNMQRTLARFEEYRETVKMKASKLPKKHPRCLADGNELLRFYGTTVACSLGLNGSSSLCISEKC
CVCRIIRNGFSAKTEMKEGIGVFTTSTSARAFNSIEESGGEEGLTRKAMIVCRVIAGRVHRPLENIQEMAGQTGFDSLAGKVGLHSNIEELYLLNPRALLPCFVVICKP