; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0017155 (gene) of Chayote v1 genome

Gene IDSed0017155
OrganismSechium edule (Chayote v1)
DescriptionC2H2-type domain-containing protein
Genome locationLG07:32919466..32922317
RNA-Seq ExpressionSed0017155
SyntenySed0017155
Gene Ontology termsNA
InterPro domainsIPR013087 - Zinc finger C2H2-type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6600745.1 hypothetical protein SDJN03_05978, partial [Cucurbita argyrosperma subsp. sororia]2.3e-20282.22Show/hide
Query:  MPTVWISLKKSLHCKSQPSEVHEPKTRRQIGEILTRK-AGGRSGCSRSIANLKDVIHGGSKRHLDKPPSCSPRSIGSSEFLNPITHEVILSNSRCELKIT
        MPTVW+SLKKSLHCKS PSEVH+PK+RR +G ILTRK  GGRSGCSRSIANLKDVIHGGSKRHLDKPPSCSPRSIGSSEFLNPITHEVIL+NSRCELKIT
Subjt:  MPTVWISLKKSLHCKSQPSEVHEPKTRRQIGEILTRK-AGGRSGCSRSIANLKDVIHGGSKRHLDKPPSCSPRSIGSSEFLNPITHEVILSNSRCELKIT

Query:  GFRSG---NAAAAADPP---------FVGTLTPGTPGPGGHPAMHYFSPSLRSSSRK---RDGF--------------------GGGGGGGGVHMSNRIS
        GF      +  ++AD P         FVGTLTPGTPGPGGHP MHYF+PS RSSSRK   R+GF                    GGGGGGGGVH SNRIS
Subjt:  GFRSG---NAAAAADPP---------FVGTLTPGTPGPGGHPAMHYFSPSLRSSSRK---RDGF--------------------GGGGGGGGVHMSNRIS

Query:  LETEFGGNGSSSAVTCHKCGDHFNKWNDAEAHHLSKHAVTELVEGDSSRKIVEIICRTNWLKSENQSGRIERVLKVHNMQRTLARFEEHRETVKTKASKL
        LETE  GN SSSAVTCHKCG+ FNKW+ AEAHHLSKHAVTELVEGDSSRKIVEIICRT+WLKSENQSGRIERVLKVHNMQRTLARFEEHRETVK KASKL
Subjt:  LETEFGGNGSSSAVTCHKCGDHFNKWNDAEAHHLSKHAVTELVEGDSSRKIVEIICRTNWLKSENQSGRIERVLKVHNMQRTLARFEEHRETVKTKASKL

Query:  PKKHPRCLADGNELLRFFGTTVECPLGLNGSSSLCPSEKCCVCRIIRNGFSAKTEMKEGIGVFTTSTSVKAFNSIEASGG--DGAMMRKAMIVCRVIAGR
        PKKHPRCLADGNELLRF+GTTV C LGLNGSSSLC SEKCCVCRIIRNGFSAKTEMKEGIGVFTTSTS +AF SIEASGG  +G M+RKAMIVCRVIAGR
Subjt:  PKKHPRCLADGNELLRFFGTTVECPLGLNGSSSLCPSEKCCVCRIIRNGFSAKTEMKEGIGVFTTSTSVKAFNSIEASGG--DGAMMRKAMIVCRVIAGR

Query:  VHRPLENIQEMAGQTGFDSLAGKVGLHSNIEELYLLNPRALLPCFVVICK
        VHRPLENIQEMAGQTGFDSLAGKVGLHSNIEELYLLNPRALLPCFVVICK
Subjt:  VHRPLENIQEMAGQTGFDSLAGKVGLHSNIEELYLLNPRALLPCFVVICK

KAG7031385.1 hypothetical protein SDJN02_05425, partial [Cucurbita argyrosperma subsp. argyrosperma]8.0e-20382.96Show/hide
Query:  MPTVWISLKKSLHCKSQPSEVHEPKTRRQIGEILTRK-AGGRSGCSRSIANLKDVIHGGSKRHLDKPPSCSPRSIGSSEFLNPITHEVILSNSRCELKIT
        MPTVW+SLKKSLHCKS PSEVH+PK+RR +G ILTRK  GGRSGCSRSIANLKDVIHGGSKRHLDKPPSCSPRSIGSSEFLNPITHEVIL+NSRCELKIT
Subjt:  MPTVWISLKKSLHCKSQPSEVHEPKTRRQIGEILTRK-AGGRSGCSRSIANLKDVIHGGSKRHLDKPPSCSPRSIGSSEFLNPITHEVILSNSRCELKIT

Query:  GFRSG---NAAAAADPP---------FVGTLTPGTPGPGGHPAMHYFSPSLRSSSRK---RDGF----------------GGGGGGGGVHMSNRISLETE
        GF      +  ++AD P         FVGTLTPGTPGPGGHP MHYF+PS RSSSRK   R+GF                GGGGGGGGVH SNRISLETE
Subjt:  GFRSG---NAAAAADPP---------FVGTLTPGTPGPGGHPAMHYFSPSLRSSSRK---RDGF----------------GGGGGGGGVHMSNRISLETE

Query:  FGGNGSSSAVTCHKCGDHFNKWNDAEAHHLSKHAVTELVEGDSSRKIVEIICRTNWLKSENQSGRIERVLKVHNMQRTLARFEEHRETVKTKASKLPKKH
          GN SSSAVTCHKCG+ FNKW+ AEAHHLSKHAVTELVEGDSSRKIVEIICRT+WLKSENQSGRIERVLKVHNMQRTLARFEEHRETVK KASKLPKKH
Subjt:  FGGNGSSSAVTCHKCGDHFNKWNDAEAHHLSKHAVTELVEGDSSRKIVEIICRTNWLKSENQSGRIERVLKVHNMQRTLARFEEHRETVKTKASKLPKKH

Query:  PRCLADGNELLRFFGTTVECPLGLNGSSSLCPSEKCCVCRIIRNGFSAKTEMKEGIGVFTTSTSVKAFNSIEASGG--DGAMMRKAMIVCRVIAGRVHRP
        PRCLADGNELLRF+GTTV C LGLNGSSSLC SEKCCVCRIIRNGFSAKTEMKEGIGVFTTSTS +AF SIEASGG  +G M+RKAMIVCRVIAGRVHRP
Subjt:  PRCLADGNELLRFFGTTVECPLGLNGSSSLCPSEKCCVCRIIRNGFSAKTEMKEGIGVFTTSTSVKAFNSIEASGG--DGAMMRKAMIVCRVIAGRVHRP

Query:  LENIQEMAGQTGFDSLAGKVGLHSNIEELYLLNPRALLPCFVVICK
        LENIQEMAGQTGFDSLAGKVGLHSNIEELYLLNPRALLPCFVVICK
Subjt:  LENIQEMAGQTGFDSLAGKVGLHSNIEELYLLNPRALLPCFVVICK

XP_022942714.1 uncharacterized protein LOC111447669 isoform X2 [Cucurbita moschata]2.3e-20283.71Show/hide
Query:  MPTVWISLKKSLHCKSQPSEVHEPKTRRQIGEILTRK-AGGRSGCSRSIANLKDVIHGGSKRHLDKPPSCSPRSIGSSEFLNPITHEVILSNSRCELKIT
        MPTVW+SLKKSLHCKS PSEVH+PK+RR +G ILTRK  GGRSGCSRSIANLKDVIHGGSKRHLDKPPSCSPRSIGSSEFLNPITHEVIL+NSRCELKIT
Subjt:  MPTVWISLKKSLHCKSQPSEVHEPKTRRQIGEILTRK-AGGRSGCSRSIANLKDVIHGGSKRHLDKPPSCSPRSIGSSEFLNPITHEVILSNSRCELKIT

Query:  GFRSG---NAAAAADPP---------FVGTLTPGTPGPGGHPAMHYFSPSLRSSSRK---RDGF--------GGG----GGGGGVHMSNRISLETEFGGN
        GF      +  ++AD P         FVGTLTPGTPGPGGHP MHYF+PS RSSSRK   R+GF        GGG    GGGGGVH SNRISLETE GGN
Subjt:  GFRSG---NAAAAADPP---------FVGTLTPGTPGPGGHPAMHYFSPSLRSSSRK---RDGF--------GGG----GGGGGVHMSNRISLETEFGGN

Query:  GSSSAVTCHKCGDHFNKWNDAEAHHLSKHAVTELVEGDSSRKIVEIICRTNWLKSENQSGRIERVLKVHNMQRTLARFEEHRETVKTKASKLPKKHPRCL
         SSSAVTCHKCG+ FNKW+ AEAHHLSKHAVTELVEGDSSRKIVEIICRT+WLKSENQSGRIERVLKVHNMQRTLARFEEHRETVK KASKLPKKHPRCL
Subjt:  GSSSAVTCHKCGDHFNKWNDAEAHHLSKHAVTELVEGDSSRKIVEIICRTNWLKSENQSGRIERVLKVHNMQRTLARFEEHRETVKTKASKLPKKHPRCL

Query:  ADGNELLRFFGTTVECPLGLNGSSSLCPSEKCCVCRIIRNGFSAKTEMKEGIGVFTTSTSVKAFNSIEASGG--DGAMMRKAMIVCRVIAGRVHRPLENI
        ADGNELLRF+GTTV C LGLNGSSSLC SEKCCVCRIIRNGFSAKTEMKEGIGVFTTSTS +AF SIE SGG  +G M+RKAMIVCRVIAGRVHRPLENI
Subjt:  ADGNELLRFFGTTVECPLGLNGSSSLCPSEKCCVCRIIRNGFSAKTEMKEGIGVFTTSTSVKAFNSIEASGG--DGAMMRKAMIVCRVIAGRVHRPLENI

Query:  QEMAGQTGFDSLAGKVGLHSNIEELYLLNPRALLPCFVVICK
        QEMAGQTGFDSLAGKVGLHSNIEELYLLNPRALLPCFVVICK
Subjt:  QEMAGQTGFDSLAGKVGLHSNIEELYLLNPRALLPCFVVICK

XP_022993855.1 uncharacterized protein LOC111489735 [Cucurbita maxima]1.8e-20283.15Show/hide
Query:  MPTVWISLKKSLHCKSQPSEVHEPKTRRQIGEILTRK-AGGRSGCSRSIANLKDVIHGGSKRHLDKPPSCSPRSIGSSEFLNPITHEVILSNSRCELKIT
        MPTVW+SLKKSLHCKS PSEVH+PK+RR +G ILTRK  GGRSGCSRSIANLKDVIHGGSKRHLDKPPSCSPRSIGSSEFLNPITHEVIL+NSRCELKIT
Subjt:  MPTVWISLKKSLHCKSQPSEVHEPKTRRQIGEILTRK-AGGRSGCSRSIANLKDVIHGGSKRHLDKPPSCSPRSIGSSEFLNPITHEVILSNSRCELKIT

Query:  GFRSG---NAAAAADPP---------FVGTLTPGTPGPGGHPAMHYFSPSLRSSSRK---RDGFG---------------GGGGGGGVHMSNRISLETEF
        GF      +  A+ D P         FVGTLTPGTPGPGGHP MHYF PS RSSSRK   R+GFG               GGGGGGGVH  NRISLETE 
Subjt:  GFRSG---NAAAAADPP---------FVGTLTPGTPGPGGHPAMHYFSPSLRSSSRK---RDGFG---------------GGGGGGGVHMSNRISLETEF

Query:  GGNGSSSAVTCHKCGDHFNKWNDAEAHHLSKHAVTELVEGDSSRKIVEIICRTNWLKSENQSGRIERVLKVHNMQRTLARFEEHRETVKTKASKLPKKHP
         GN SSSAVTCHKCG+ FNKW+ AEAHHLSKHAVTELVEGDSSRKIVEIICRT+WLKSENQSGRIERVLKVHNMQRTLARFEEHRETVK KASKLPKKHP
Subjt:  GGNGSSSAVTCHKCGDHFNKWNDAEAHHLSKHAVTELVEGDSSRKIVEIICRTNWLKSENQSGRIERVLKVHNMQRTLARFEEHRETVKTKASKLPKKHP

Query:  RCLADGNELLRFFGTTVECPLGLNGSSSLCPSEKCCVCRIIRNGFSAKTEMKEGIGVFTTSTSVKAFNSIEASGG--DGAMMRKAMIVCRVIAGRVHRPL
        RCLADGNELLRFFGTTV C LGLNGSSSLC SEKCCVCRIIRNGFSAKTEMKEGIGVFTTSTS +AF SIEASGG  +G M+RKAMIVCRVIAGRVHRPL
Subjt:  RCLADGNELLRFFGTTVECPLGLNGSSSLCPSEKCCVCRIIRNGFSAKTEMKEGIGVFTTSTSVKAFNSIEASGG--DGAMMRKAMIVCRVIAGRVHRPL

Query:  ENIQEMAGQTGFDSLAGKVGLHSNIEELYLLNPRALLPCFVVICK
        ENIQEMAGQTGFDSLAGKVGLHSNIEELYLLNPRALLPCFVVICK
Subjt:  ENIQEMAGQTGFDSLAGKVGLHSNIEELYLLNPRALLPCFVVICK

XP_038906351.1 uncharacterized protein LOC120092187 [Benincasa hispida]3.2e-20484.09Show/hide
Query:  MPTVWISLKKSLHCKSQPSEVHEPKTRRQIGEILTRK-AGGRSGCSRSIANLKDVIHGGSKRHLDKPPSCSPRSIGSSEFLNPITHEVILSNSRCELKIT
        MPTVW+SLKKSLHCKS+PSEVH+PK R+ +G ILTRK  GGRSGCSRSIANLKDVIHGGSKRHLDKPPSCSPRSIGSSEFLNPITHEVILSNS+CELKIT
Subjt:  MPTVWISLKKSLHCKSQPSEVHEPKTRRQIGEILTRK-AGGRSGCSRSIANLKDVIHGGSKRHLDKPPSCSPRSIGSSEFLNPITHEVILSNSRCELKIT

Query:  GFRSG--NAAAAADP--------PFVGTLTPGTPGPGGHPAMHYFSPSLRSSSRK---RDGFGG-------------GGGGGGVHMSNRISLETEFGGNG
        GF  G      AADP         FVGTLTPGTPGPGGHP+MHYF+PSLRSSS+K   R+GFG              GGGGGGVH+SNRI LETE   NG
Subjt:  GFRSG--NAAAAADP--------PFVGTLTPGTPGPGGHPAMHYFSPSLRSSSRK---RDGFGG-------------GGGGGGVHMSNRISLETEFGGNG

Query:  SSSAVTCHKCGDHFNKWNDAEAHHLSKHAVTELVEGDSSRKIVEIICRTNWLKSENQSGRIERVLKVHNMQRTLARFEEHRETVKTKASKLPKKHPRCLA
        SSSAVTCHKCG+ FNKW+ AEAHHLSKHAVTELVEGDSSRKIVEIICRT+WLKSENQSGRIERVLKVHNMQRTLARFEE+RETVK KASKLPKKHPRCLA
Subjt:  SSSAVTCHKCGDHFNKWNDAEAHHLSKHAVTELVEGDSSRKIVEIICRTNWLKSENQSGRIERVLKVHNMQRTLARFEEHRETVKTKASKLPKKHPRCLA

Query:  DGNELLRFFGTTVECPLGLNGSSSLCPSEKCCVCRIIRNGFSAKTEMKEGIGVFTTSTSVKAFNSIEASGGDGAMMRKAMIVCRVIAGRVHRPLENIQEM
        DGNELLRF+GTTV C LGLNGSSSLC SEKCCVCRIIRNGFSAKTEMKEGIGVFTTSTS +AFNSIEASGGD A  RKAMIVCRVIAGRVHRPLENIQEM
Subjt:  DGNELLRFFGTTVECPLGLNGSSSLCPSEKCCVCRIIRNGFSAKTEMKEGIGVFTTSTSVKAFNSIEASGGDGAMMRKAMIVCRVIAGRVHRPLENIQEM

Query:  AGQTGFDSLAGKVGLHSNIEELYLLNPRALLPCFVVICKP
        AGQTGFDSLAGKVGLHSNIEELYLLNPRALLPCFVVICKP
Subjt:  AGQTGFDSLAGKVGLHSNIEELYLLNPRALLPCFVVICKP

TrEMBL top hitse value%identityAlignment
A0A5D3BUU3 Transcription factor4.4e-19982.99Show/hide
Query:  MPTVWISLKKSLHCKSQPSEVHEPKTRRQIGEILTRK--AGGRSGCSRSIANLKDVIHGGSKRHLDKPPSCSPRSIGSSEFLNPITHEVILSNSRCELKI
        MPTVW+SLKKSLHCKS+PSEVH+PK+++ +G ILTRK   GGRSGCSRSIANLKDVI+GGS+RHLDKPPSCSPRSIGSSEFLNPITHEVILSNS+CELKI
Subjt:  MPTVWISLKKSLHCKSQPSEVHEPKTRRQIGEILTRK--AGGRSGCSRSIANLKDVIHGGSKRHLDKPPSCSPRSIGSSEFLNPITHEVILSNSRCELKI

Query:  TGFRSG--NAAAAAD-----PPFVGTLTPGTPGPGGHPAMHYFSPSLRSSSRK---RDGF------GGGGGGG----GVHMSNRISLETEFGGNGSSSAV
        TGF  G       AD       FVGTLTPGTPGPGGHP+MHYF+PSLRSSS+K   R+GF      GGG GGG    GVH+SNRISLETE   NG SSAV
Subjt:  TGFRSG--NAAAAAD-----PPFVGTLTPGTPGPGGHPAMHYFSPSLRSSSRK---RDGF------GGGGGGG----GVHMSNRISLETEFGGNGSSSAV

Query:  TCHKCGDHFNKWNDAEAHHLSKHAVTELVEGDSSRKIVEIICRTNWLKSENQSGRIERVLKVHNMQRTLARFEEHRETVKTKASKLPKKHPRCLADGNEL
        TCHKCG+ FNKW+ AEAHHLSKHAVTELVEGDSSRKIVEIICRT+WLKSENQSGRIERVLKVHNMQRTLARFEE+RETVK KASKLPKKHPRCLADGNEL
Subjt:  TCHKCGDHFNKWNDAEAHHLSKHAVTELVEGDSSRKIVEIICRTNWLKSENQSGRIERVLKVHNMQRTLARFEEHRETVKTKASKLPKKHPRCLADGNEL

Query:  LRFFGTTVECPLGLNGSSSLCPSEKCCVCRIIRNGFSAKTEMKEGIGVFTTSTSVKAFNSIEASGGDGAMMRKAMIVCRVIAGRVHRPLENIQEMAGQTG
        LRF+GTTV C LGLNGSSSLC SEKCCVCRIIRNGFSAKTEM+EGIGVFTTSTS +AF+SI+ SGGD  M RKAMIVCRVIAGRVHRPLENIQEMAGQTG
Subjt:  LRFFGTTVECPLGLNGSSSLCPSEKCCVCRIIRNGFSAKTEMKEGIGVFTTSTSVKAFNSIEASGGDGAMMRKAMIVCRVIAGRVHRPLENIQEMAGQTG

Query:  FDSLAGKVGLHSNIEELYLLNPRALLPCFVVICKP
        FDSLAGKVGLHSNIEELYLLNPRALLPCFVVICKP
Subjt:  FDSLAGKVGLHSNIEELYLLNPRALLPCFVVICKP

A0A6J1CLR5 uncharacterized protein LOC1110127721.4e-20082.58Show/hide
Query:  MPTVWISLKKSLHCKSQPSEVHEPKTRRQIGEILTRK--AGGRSGCSRSIANLKDVIHGGSKRHLDKPPSCSPRSIGSSEFLNPITHEVILSNSRCELKI
        MPTVW+SLKKSLHCKS+PSEVH+PK+R+ +G ILTRK   GGRSGCSRSIANLKDVIHGGS+RHL+KPPSCSPRSIGSSEFLNPITHEVILSNSRCELKI
Subjt:  MPTVWISLKKSLHCKSQPSEVHEPKTRRQIGEILTRK--AGGRSGCSRSIANLKDVIHGGSKRHLDKPPSCSPRSIGSSEFLNPITHEVILSNSRCELKI

Query:  TGFRSG-----NAAAAAD--------PPFVGTLTPGTPGPGGHPAMHYFSPSLRSSSRK---RDGF-----------GGGGGGGGVHMSNRISLETEFGG
        TGF  G        AAAD          FVGTLTPGTPGPGGHP MHYF    R+ SRK   R+GF           GGGGGGGGV+ SNRISLETE GG
Subjt:  TGFRSG-----NAAAAAD--------PPFVGTLTPGTPGPGGHPAMHYFSPSLRSSSRK---RDGF-----------GGGGGGGGVHMSNRISLETEFGG

Query:  NGSSSAVTCHKCGDHFNKWNDAEAHHLSKHAVTELVEGDSSRKIVEIICRTNWLKSENQSGRIERVLKVHNMQRTLARFEEHRETVKTKASKLPKKHPRC
        NGSSS VTCHKCG+ FNKW  AEAHHLSKHAVTELVEGDSSRKIVEIICRT+WLKSENQ+GRIERVLKVHNMQRTLARFEE+RETVKTKASKLPKKHPRC
Subjt:  NGSSSAVTCHKCGDHFNKWNDAEAHHLSKHAVTELVEGDSSRKIVEIICRTNWLKSENQSGRIERVLKVHNMQRTLARFEEHRETVKTKASKLPKKHPRC

Query:  LADGNELLRFFGTTVECPLGLNGSSSLCPSEKCCVCRIIRNGFSAKTEMKEGIGVFTTSTSVKAFNSIEASGGDGAMMRKAMIVCRVIAGRVHRPLENIQ
        LADGNELLRF+GTTV C LGLNGSSSLC SEKCCVCRIIRNGFSAKTEMKEGIGVFTTSTS +AF+SIEASGGD  + RKAMIVCRVIAGRVHRPLENIQ
Subjt:  LADGNELLRFFGTTVECPLGLNGSSSLCPSEKCCVCRIIRNGFSAKTEMKEGIGVFTTSTSVKAFNSIEASGGDGAMMRKAMIVCRVIAGRVHRPLENIQ

Query:  EMAGQTGFDSLAGKVGLHSNIEELYLLNPRALLPCFVVICKP
        EMAGQTGFDSLAGKVGLHSNIEELYLLNPRALLPCFVVICKP
Subjt:  EMAGQTGFDSLAGKVGLHSNIEELYLLNPRALLPCFVVICKP

A0A6J1FS53 uncharacterized protein LOC111447669 isoform X21.1e-20283.71Show/hide
Query:  MPTVWISLKKSLHCKSQPSEVHEPKTRRQIGEILTRK-AGGRSGCSRSIANLKDVIHGGSKRHLDKPPSCSPRSIGSSEFLNPITHEVILSNSRCELKIT
        MPTVW+SLKKSLHCKS PSEVH+PK+RR +G ILTRK  GGRSGCSRSIANLKDVIHGGSKRHLDKPPSCSPRSIGSSEFLNPITHEVIL+NSRCELKIT
Subjt:  MPTVWISLKKSLHCKSQPSEVHEPKTRRQIGEILTRK-AGGRSGCSRSIANLKDVIHGGSKRHLDKPPSCSPRSIGSSEFLNPITHEVILSNSRCELKIT

Query:  GFRSG---NAAAAADPP---------FVGTLTPGTPGPGGHPAMHYFSPSLRSSSRK---RDGF--------GGG----GGGGGVHMSNRISLETEFGGN
        GF      +  ++AD P         FVGTLTPGTPGPGGHP MHYF+PS RSSSRK   R+GF        GGG    GGGGGVH SNRISLETE GGN
Subjt:  GFRSG---NAAAAADPP---------FVGTLTPGTPGPGGHPAMHYFSPSLRSSSRK---RDGF--------GGG----GGGGGVHMSNRISLETEFGGN

Query:  GSSSAVTCHKCGDHFNKWNDAEAHHLSKHAVTELVEGDSSRKIVEIICRTNWLKSENQSGRIERVLKVHNMQRTLARFEEHRETVKTKASKLPKKHPRCL
         SSSAVTCHKCG+ FNKW+ AEAHHLSKHAVTELVEGDSSRKIVEIICRT+WLKSENQSGRIERVLKVHNMQRTLARFEEHRETVK KASKLPKKHPRCL
Subjt:  GSSSAVTCHKCGDHFNKWNDAEAHHLSKHAVTELVEGDSSRKIVEIICRTNWLKSENQSGRIERVLKVHNMQRTLARFEEHRETVKTKASKLPKKHPRCL

Query:  ADGNELLRFFGTTVECPLGLNGSSSLCPSEKCCVCRIIRNGFSAKTEMKEGIGVFTTSTSVKAFNSIEASGG--DGAMMRKAMIVCRVIAGRVHRPLENI
        ADGNELLRF+GTTV C LGLNGSSSLC SEKCCVCRIIRNGFSAKTEMKEGIGVFTTSTS +AF SIE SGG  +G M+RKAMIVCRVIAGRVHRPLENI
Subjt:  ADGNELLRFFGTTVECPLGLNGSSSLCPSEKCCVCRIIRNGFSAKTEMKEGIGVFTTSTSVKAFNSIEASGG--DGAMMRKAMIVCRVIAGRVHRPLENI

Query:  QEMAGQTGFDSLAGKVGLHSNIEELYLLNPRALLPCFVVICK
        QEMAGQTGFDSLAGKVGLHSNIEELYLLNPRALLPCFVVICK
Subjt:  QEMAGQTGFDSLAGKVGLHSNIEELYLLNPRALLPCFVVICK

A0A6J1FVH1 uncharacterized protein LOC111447669 isoform X11.1e-20283.11Show/hide
Query:  MPTVWISLKKSLHCKSQPSEVHEPKTRRQIGEILTRK-AGGRSGCSRSIANLKDVIHGGSKRHLDKPPSCSPRSIGSSEFLNPITHEVILSNSRCELKIT
        MPTVW+SLKKSLHCKS PSEVH+PK+RR +G ILTRK  GGRSGCSRSIANLKDVIHGGSKRHLDKPPSCSPRSIGSSEFLNPITHEVIL+NSRCELKIT
Subjt:  MPTVWISLKKSLHCKSQPSEVHEPKTRRQIGEILTRK-AGGRSGCSRSIANLKDVIHGGSKRHLDKPPSCSPRSIGSSEFLNPITHEVILSNSRCELKIT

Query:  GFRSG---NAAAAADPP---------FVGTLTPGTPGPGGHPAMHYFSPSLRSSSRK---RDGFG--------------GGGGGGGVHMSNRISLETEFG
        GF      +  ++AD P         FVGTLTPGTPGPGGHP MHYF+PS RSSSRK   R+GF               GGGGGGGVH SNRISLETE G
Subjt:  GFRSG---NAAAAADPP---------FVGTLTPGTPGPGGHPAMHYFSPSLRSSSRK---RDGFG--------------GGGGGGGVHMSNRISLETEFG

Query:  GNGSSSAVTCHKCGDHFNKWNDAEAHHLSKHAVTELVEGDSSRKIVEIICRTNWLKSENQSGRIERVLKVHNMQRTLARFEEHRETVKTKASKLPKKHPR
        GN SSSAVTCHKCG+ FNKW+ AEAHHLSKHAVTELVEGDSSRKIVEIICRT+WLKSENQSGRIERVLKVHNMQRTLARFEEHRETVK KASKLPKKHPR
Subjt:  GNGSSSAVTCHKCGDHFNKWNDAEAHHLSKHAVTELVEGDSSRKIVEIICRTNWLKSENQSGRIERVLKVHNMQRTLARFEEHRETVKTKASKLPKKHPR

Query:  CLADGNELLRFFGTTVECPLGLNGSSSLCPSEKCCVCRIIRNGFSAKTEMKEGIGVFTTSTSVKAFNSIEASGG--DGAMMRKAMIVCRVIAGRVHRPLE
        CLADGNELLRF+GTTV C LGLNGSSSLC SEKCCVCRIIRNGFSAKTEMKEGIGVFTTSTS +AF SIE SGG  +G M+RKAMIVCRVIAGRVHRPLE
Subjt:  CLADGNELLRFFGTTVECPLGLNGSSSLCPSEKCCVCRIIRNGFSAKTEMKEGIGVFTTSTSVKAFNSIEASGG--DGAMMRKAMIVCRVIAGRVHRPLE

Query:  NIQEMAGQTGFDSLAGKVGLHSNIEELYLLNPRALLPCFVVICK
        NIQEMAGQTGFDSLAGKVGLHSNIEELYLLNPRALLPCFVVICK
Subjt:  NIQEMAGQTGFDSLAGKVGLHSNIEELYLLNPRALLPCFVVICK

A0A6J1JZN5 uncharacterized protein LOC1114897358.6e-20383.15Show/hide
Query:  MPTVWISLKKSLHCKSQPSEVHEPKTRRQIGEILTRK-AGGRSGCSRSIANLKDVIHGGSKRHLDKPPSCSPRSIGSSEFLNPITHEVILSNSRCELKIT
        MPTVW+SLKKSLHCKS PSEVH+PK+RR +G ILTRK  GGRSGCSRSIANLKDVIHGGSKRHLDKPPSCSPRSIGSSEFLNPITHEVIL+NSRCELKIT
Subjt:  MPTVWISLKKSLHCKSQPSEVHEPKTRRQIGEILTRK-AGGRSGCSRSIANLKDVIHGGSKRHLDKPPSCSPRSIGSSEFLNPITHEVILSNSRCELKIT

Query:  GFRSG---NAAAAADPP---------FVGTLTPGTPGPGGHPAMHYFSPSLRSSSRK---RDGFG---------------GGGGGGGVHMSNRISLETEF
        GF      +  A+ D P         FVGTLTPGTPGPGGHP MHYF PS RSSSRK   R+GFG               GGGGGGGVH  NRISLETE 
Subjt:  GFRSG---NAAAAADPP---------FVGTLTPGTPGPGGHPAMHYFSPSLRSSSRK---RDGFG---------------GGGGGGGVHMSNRISLETEF

Query:  GGNGSSSAVTCHKCGDHFNKWNDAEAHHLSKHAVTELVEGDSSRKIVEIICRTNWLKSENQSGRIERVLKVHNMQRTLARFEEHRETVKTKASKLPKKHP
         GN SSSAVTCHKCG+ FNKW+ AEAHHLSKHAVTELVEGDSSRKIVEIICRT+WLKSENQSGRIERVLKVHNMQRTLARFEEHRETVK KASKLPKKHP
Subjt:  GGNGSSSAVTCHKCGDHFNKWNDAEAHHLSKHAVTELVEGDSSRKIVEIICRTNWLKSENQSGRIERVLKVHNMQRTLARFEEHRETVKTKASKLPKKHP

Query:  RCLADGNELLRFFGTTVECPLGLNGSSSLCPSEKCCVCRIIRNGFSAKTEMKEGIGVFTTSTSVKAFNSIEASGG--DGAMMRKAMIVCRVIAGRVHRPL
        RCLADGNELLRFFGTTV C LGLNGSSSLC SEKCCVCRIIRNGFSAKTEMKEGIGVFTTSTS +AF SIEASGG  +G M+RKAMIVCRVIAGRVHRPL
Subjt:  RCLADGNELLRFFGTTVECPLGLNGSSSLCPSEKCCVCRIIRNGFSAKTEMKEGIGVFTTSTSVKAFNSIEASGG--DGAMMRKAMIVCRVIAGRVHRPL

Query:  ENIQEMAGQTGFDSLAGKVGLHSNIEELYLLNPRALLPCFVVICK
        ENIQEMAGQTGFDSLAGKVGLHSNIEELYLLNPRALLPCFVVICK
Subjt:  ENIQEMAGQTGFDSLAGKVGLHSNIEELYLLNPRALLPCFVVICK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G11490.1 zinc finger (C2H2 type) family protein3.1e-6440.33Show/hide
Query:  MPTVWISLKKSLH-CKSQPSEVHEPKTRRQIGEILTRKAGGRSGCS-RSIANLKDV-IHGGSKRHLDKPPSCSPRSIGSSEFLNPITHEVILSNSRCELK
        M  VW+ LKKSL  CK+Q S V     + QI     +K    SGCS RS++NL+DV +  G +  +     CS RS+ SS F+N +  E    N+    +
Subjt:  MPTVWISLKKSLH-CKSQPSEVHEPKTRRQIGEILTRKAGGRSGCS-RSIANLKDV-IHGGSKRHLDKPPSCSPRSIGSSEFLNPITHEVILSNSRCELK

Query:  ITGFRSGNAAAAADPPFVGTLTPGTPGPGGHPAMHYFSPSLRSSSRKRDGFGGGGGGGGVHMSNRISLETEFGGNGSSSAVTCHKCGDHFNKWNDAEAHH
          G  SG  A+++D      L PG                    S + D  G    G GV                    + C KC +     +  EAH+
Subjt:  ITGFRSGNAAAAADPPFVGTLTPGTPGPGGHPAMHYFSPSLRSSSRKRDGFGGGGGGGGVHMSNRISLETEFGGNGSSSAVTCHKCGDHFNKWNDAEAHH

Query:  LSKHAVTELVEGDSSRKIVEIICRTNWLK--SENQSGRIERVLKVHNMQRTLARFEEHRETVKTKASKLPKKHPRCLADGNELLRFFGTTVECPLGL-NG
        LS H+V  L+ GD SR  VE+IC T +     + +   I  + K+ N+QR +A FE++RE VK +A+KL KKH RC+ADGNE L F GTT+ C LG  N 
Subjt:  LSKHAVTELVEGDSSRKIVEIICRTNWLK--SENQSGRIERVLKVHNMQRTLARFEEHRETVKTKASKLPKKHPRCLADGNELLRFFGTTVECPLGL-NG

Query:  SSSLCPSEKCCVCRIIRNGFSAKTEMKEGIGVFTTSTSVKAFNSIEASGGDGAMMRKAMIVCRVIAGRVHRPLENIQEMAGQTGFDSLAGKVGLHSNIEE
        SS+LC S+ C VC I+R+GFS KT      GV T STS  A  SIE   G       A+++CRVIAGRVH+P++  +   G + FDSLA KVG +S IEE
Subjt:  SSSLCPSEKCCVCRIIRNGFSAKTEMKEGIGVFTTSTSVKAFNSIEASGGDGAMMRKAMIVCRVIAGRVHRPLENIQEMAGQTGFDSLAGKVGLHSNIEE

Query:  LYLLNPRALLPCFVVICKP
        LYLL+ +ALLPCFV+I KP
Subjt:  LYLLNPRALLPCFVVICKP

AT1G75710.1 C2H2-like zinc finger protein2.6e-5035.81Show/hide
Query:  PTVWISLKKSLHCKS-QPSEVHEPKTRRQIGEILTRKAGGR---SGCSRSIANLKDVIHGGSK--RHLDKPPSCSPRSIGSSEFLNPITHEVILSNSRCE
        P+ W  +K  L CK  + S VH+P    Q G  +T         S CS SI + +DV HG ++     D  P      +G+S   N        S +R  
Subjt:  PTVWISLKKSLHCKS-QPSEVHEPKTRRQIGEILTRKAGGR---SGCSRSIANLKDVIHGGSK--RHLDKPPSCSPRSIGSSEFLNPITHEVILSNSRCE

Query:  LKITGFRSGNAAAAADPPFVGTLTPGTPGPGGHPAMHYFSPSLRSSSRKRDGFGGGGGGGGVHM---SNRISLETEFGGNGSSSAVTCHKCGDHFNKWND
         +  G    +++ +            T G     A   ++ S  +S R    F    G    HM    +R  +        S     C +CG+ F K   
Subjt:  LKITGFRSGNAAAAADPPFVGTLTPGTPGPGGHPAMHYFSPSLRSSSRKRDGFGGGGGGGGVHM---SNRISLETEFGGNGSSSAVTCHKCGDHFNKWND

Query:  AEAHHLSKHAVTELVEGDSSRKIVEIICRTNWLKSENQSGRIERVLKVHNMQRTLARFEEHRETVKTKASKLPKKHPRCLADGNELLRFFGTTVECPLGL
         E H   +HAV+EL   DS R IVEII +++WLK ++   +IER+LKVHN QRT+ RFE+ R+ VK +A +  +K  RC ADGNELLRF  TT+ C LG 
Subjt:  AEAHHLSKHAVTELVEGDSSRKIVEIICRTNWLKSENQSGRIERVLKVHNMQRTLARFEEHRETVKTKASKLPKKHPRCLADGNELLRFFGTTVECPLGL

Query:  NGSSSLCPSEKCC-VCRIIRNGFSAKT----EMKEGIGVFTTSTSVKAFNSIEASGGDGAMMRKAMIVCRVIAGRVHR---PLENIQEMAGQTG------
         GSSSLC +   C VC +IR+GF  K+          GV TT++S +A + +  S  D A  R+ M+VCRVIAGRV R   P  +    A +        
Subjt:  NGSSSLCPSEKCC-VCRIIRNGFSAKT----EMKEGIGVFTTSTSVKAFNSIEASGGDGAMMRKAMIVCRVIAGRVHR---PLENIQEMAGQTG------

Query:  ----------FDSLAGKVGLHSNIEELYLLNPRALLPCFVVICK
                  FDS+A   G++SN+EEL + NPRA+LPCFVVI K
Subjt:  ----------FDSLAGKVGLHSNIEELYLLNPRALLPCFVVICK

AT2G29660.1 zinc finger (C2H2 type) family protein3.6e-4441.35Show/hide
Query:  RISLETEFGGNGSSSAVTCHKCGDHFNKWNDAEAHHLSKHAVTELVEGDSSRKIVEIICRTNWLKSEN-QSGRIERVLKVHNMQRTLARFEEHRETVKTK
        RI  +TEF  + S     C+ CG+ F K N  E H   KHAV+EL+ G+SS  IV+II ++ W +  N +S  I R+LK+HN  + L RFEE+RE VK K
Subjt:  RISLETEFGGNGSSSAVTCHKCGDHFNKWNDAEAHHLSKHAVTELVEGDSSRKIVEIICRTNWLKSEN-QSGRIERVLKVHNMQRTLARFEEHRETVKTK

Query:  ASKLPK-----KHPRCLADGNELLRFFGTTVECPLGLNGSSSLCPSEKCCVCRIIRNGFSAKTEMKEGIGVFTTSTSVKAFNSI--EASGGDGAM-MRKA
        A++           RC+ADGNELLRF+ +T  C LG NG S+LC  + C +C II +GFS K +     G+ T +T  +   ++  E     G M +++A
Subjt:  ASKLPK-----KHPRCLADGNELLRFFGTTVECPLGLNGSSSLCPSEKCCVCRIIRNGFSAKTEMKEGIGVFTTSTSVKAFNSI--EASGGDGAM-MRKA

Query:  MIVCRVIAGRVHRPL--ENIQEMAGQTGFDSLAGKVG------LHSNIEELYLLNPRALLPCFVVI
        M+VCRV+AGRV   L  ++  + +   G+DSL G+ G      L  + +EL + NPRA+LPCFV++
Subjt:  MIVCRVIAGRVHRPL--ENIQEMAGQTGFDSLAGKVG------LHSNIEELYLLNPRALLPCFVVI

AT4G27240.1 zinc finger (C2H2 type) family protein7.3e-14664.93Show/hide
Query:  MPTVWISLKKSLHCKSQPSEVHEPKTRRQIGEILTRK--------AGGRSGCSRSIANLKDVIHGGSKRHLDKPPSCSPRSIGSSEFLNPITHEVILSNS
        +P+VW SLKKSL CKS  S+VH P++++++  I T++         GGRSGCSRSIANLKDVIH G++RHL+KP   SPRSIGSSEFLNPITH+VI SNS
Subjt:  MPTVWISLKKSLHCKSQPSEVHEPKTRRQIGEILTRK--------AGGRSGCSRSIANLKDVIHGGSKRHLDKPPSCSPRSIGSSEFLNPITHEVILSNS

Query:  RCELKITGFRSGNAAAAADPPFVGTLTPGTPGPGGHPAMHYFSPSLRSSSRKRDGFGGGGGGGGVHMSNRISLETEFGGNGSSSAVTCHKCGDHFNKWND
         CELKIT        AA    FVG L PGTP       ++Y S     +SRK         G G H S R + + E   NG +S+V+CHKCG+ F+K   
Subjt:  RCELKITGFRSGNAAAAADPPFVGTLTPGTPGPGGHPAMHYFSPSLRSSSRKRDGFGGGGGGGGVHMSNRISLETEFGGNGSSSAVTCHKCGDHFNKWND

Query:  AEAHHLSKHAVTELVEGDSSRKIVEIICRTNWLKSENQSGRIERVLKVHNMQRTLARFEEHRETVKTKASKLPKKHPRCLADGNELLRFFGTTVECPLGL
        AEAHHL+KHAVTEL+EGDSSR+IVEIICRT+WLK+ENQ GRI+R+LKVHNMQ+TLARFEE+R+TVK +ASKL KKHPRC+ADGNELLRF GTTV C LG+
Subjt:  AEAHHLSKHAVTELVEGDSSRKIVEIICRTNWLKSENQSGRIERVLKVHNMQRTLARFEEHRETVKTKASKLPKKHPRCLADGNELLRFFGTTVECPLGL

Query:  NGSSSLCPSEKCCVCRIIRNGFSAKTEMKEGIGVFTTSTSVKAFNSIEASGGDGAMMRKAMIVCRVIAGRVHRPLENIQEMAG-QTGFDSLAGKVGLHSN
        NGS+SLC SEKCCVCRIIRNGFSAK EM  GIGVFT STS +AF SI    G G   RKA+IVCRVIAGRVHRP+EN++EM G  +GFDSLAGKVGL++N
Subjt:  NGSSSLCPSEKCCVCRIIRNGFSAKTEMKEGIGVFTTSTSVKAFNSIEASGGDGAMMRKAMIVCRVIAGRVHRPLENIQEMAG-QTGFDSLAGKVGLHSN

Query:  IEELYLLNPRALLPCFVVICKP
        +EELYLLN RALLPCFV+ICKP
Subjt:  IEELYLLNPRALLPCFVVICKP

AT5G54630.1 zinc finger protein-related1.2e-15364.58Show/hide
Query:  MPTVWISLKKSLHCKSQPSEVHEP----KTRRQIGEILTRK------------AGGRSGCSRSIANLKDVIHGGSKRHLDKPPSCSPRSIGSSEFLNPIT
        +PTVW SLKKSLHCKS+PS+VH+P    K ++ +  I T+K             GG SGCSRSIANLKDVIH GSKRH +KPP  SPRSIGS+EFLNPIT
Subjt:  MPTVWISLKKSLHCKSQPSEVHEP----KTRRQIGEILTRK------------AGGRSGCSRSIANLKDVIHGGSKRHLDKPPSCSPRSIGSSEFLNPIT

Query:  HEVILSNSRCELKITGF-------------RSGNAAAAADPPFVGTLTPGTPGPGGHPAMHYF--SPSLRSSSRK-------RDGFGGGGGGG-GVHMSN
        HEVILSNS CELKITG                G         +VG L PGTP       MHY   S S RS +RK       RD  GGGGG G G H + 
Subjt:  HEVILSNSRCELKITGF-------------RSGNAAAAADPPFVGTLTPGTPGPGGHPAMHYF--SPSLRSSSRK-------RDGFGGGGGGG-GVHMSN

Query:  RISLE-----TEFGGNGSSSAVTCHKCGDHFNKWNDAEAHHLSKHAVTELVEGDSSRKIVEIICRTNWLKSENQSGRIERVLKVHNMQRTLARFEEHRET
        R+SLE     T  GGN SS  V+CHKCG+ FNK   AEAHHLSKHAVTELVEGDSSRKIVEIICRT+WLKSENQ GRI+RVLKVHNMQ+TLARFEE+RET
Subjt:  RISLE-----TEFGGNGSSSAVTCHKCGDHFNKWNDAEAHHLSKHAVTELVEGDSSRKIVEIICRTNWLKSENQSGRIERVLKVHNMQRTLARFEEHRET

Query:  VKTKASKLPKKHPRCLADGNELLRFFGTTVECPLGLNGSSSLCPSEKCCVCRIIRNGFSAKTEMKEGIGVFTTSTSVKAFNSIEASGGDGA-----MMRK
        VK +ASKL KKHPRCLADGNELLRF GTTV C LG+NGS+S+C +EKCCVCRIIRNGFS+K E   G+GVFT STS +AF SI  +GGD +      +RK
Subjt:  VKTKASKLPKKHPRCLADGNELLRFFGTTVECPLGLNGSSSLCPSEKCCVCRIIRNGFSAKTEMKEGIGVFTTSTSVKAFNSIEASGGDGA-----MMRK

Query:  AMIVCRVIAGRVHRPLENIQEMAG-QTGFDSLAGKVGLHSNIEELYLLNPRALLPCFVVICKP
         +IVCRVIAGRVHRP+EN++EM G  +GFDSLAGKVGL++N+EELYLLNP+ALLPCFVVICKP
Subjt:  AMIVCRVIAGRVHRPLENIQEMAG-QTGFDSLAGKVGLHSNIEELYLLNPRALLPCFVVICKP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCAACAGTTTGGATATCCTTGAAGAAATCCTTGCACTGCAAATCACAGCCATCAGAAGTTCATGAGCCAAAAACCAGAAGACAAATAGGAGAAATATTGACAAGAAA
AGCAGGAGGGAGATCAGGATGCTCAAGGTCCATAGCAAATCTGAAAGATGTCATCCATGGGGGGAGCAAGAGGCATTTGGATAAGCCCCCAAGTTGCAGCCCAAGATCCA
TCGGCAGCAGTGAGTTTCTGAACCCAATAACCCACGAAGTGATTCTAAGCAACTCAAGATGCGAGCTCAAAATCACAGGCTTTCGGAGCGGAAACGCCGCCGCCGCTGCA
GATCCTCCCTTTGTGGGTACTCTCACACCGGGTACGCCGGGACCGGGTGGGCACCCCGCAATGCATTACTTCAGCCCTTCGCTGAGGTCTTCTTCCAGGAAAAGAGACGG
ATTTGGTGGTGGTGGCGGCGGTGGTGGAGTTCATATGAGTAACAGAATCTCTTTGGAAACGGAGTTTGGTGGCAACGGATCTTCTTCTGCGGTTACTTGTCATAAATGTG
GAGACCACTTCAACAAATGGAATGATGCTGAAGCACACCATCTTTCTAAACATGCTGTGACTGAACTTGTGGAAGGAGATTCATCAAGAAAAATTGTGGAGATCATATGC
AGGACAAACTGGTTAAAGTCTGAGAATCAATCTGGTAGAATTGAAAGAGTTCTTAAAGTTCACAACATGCAACGGACACTAGCCCGATTTGAGGAGCATCGCGAGACGGT
AAAGACCAAAGCCAGCAAGCTCCCGAAGAAGCATCCTCGGTGTCTTGCAGATGGGAACGAGCTACTGAGATTCTTTGGCACGACAGTTGAATGTCCGCTCGGCCTGAACG
GCTCATCGAGCCTATGCCCATCGGAGAAATGCTGTGTTTGCAGGATCATACGTAATGGGTTCTCTGCAAAGACAGAGATGAAAGAAGGAATAGGCGTGTTCACGACGTCC
ACGAGTGTGAAAGCATTCAACTCGATCGAGGCGTCGGGAGGGGATGGGGCGATGATGAGAAAGGCAATGATAGTTTGCAGGGTGATTGCAGGGAGAGTTCACAGGCCATT
GGAGAACATACAAGAAATGGCTGGGCAGACAGGGTTTGATTCATTGGCAGGCAAAGTTGGGCTGCATTCCAATATAGAAGAGCTTTATTTGCTGAATCCTAGAGCTTTGC
TTCCATGTTTTGTTGTAATTTGCAAGCCATGA
mRNA sequenceShow/hide mRNA sequence
TTGGTTCTTAAATCTCACGGATGGAAAATCTTTGCGAAAAGATTTGTAATGGTAGCAAGAAAATCCAAAAAGAGAAACCCAATAATTCAAAACCAGATAAAAGGCAATGA
ATGAATGCCCTTCAAAAGCCCCCCTTCAGGTTGCCCCATTTGGGGATCTTTCTGTGTCTTTCCTCAGTAACATTTATGAGTTTCACTGATGAAGGATTTTTGGCTCTCAT
TTTTTCAAACAGTCAAAACCCCTTTCCCCAATCCCTTTCTTTCCTCATCCCTTACCCTACCTATATTTAATTTTATTCCCTTGTTTTCCCTTTCTGGGCACATATGAAAA
CTCCTTTCCCTTAGCTCATTGTTTTGAACTCCACCCATCTTCTCCAAATTCTCTCAACCAAATGCCCACAAAATTGAAATAAACCACCACCTCTTTGTGGGTCATGTTTG
AAGAAATGGGGTTAAGTTATCTTTGAAGAAAAGCAAAAGGGGGAAGAAAGTAAAATATGCCAACAGTTTGGATATCCTTGAAGAAATCCTTGCACTGCAAATCACAGCCA
TCAGAAGTTCATGAGCCAAAAACCAGAAGACAAATAGGAGAAATATTGACAAGAAAAGCAGGAGGGAGATCAGGATGCTCAAGGTCCATAGCAAATCTGAAAGATGTCAT
CCATGGGGGGAGCAAGAGGCATTTGGATAAGCCCCCAAGTTGCAGCCCAAGATCCATCGGCAGCAGTGAGTTTCTGAACCCAATAACCCACGAAGTGATTCTAAGCAACT
CAAGATGCGAGCTCAAAATCACAGGCTTTCGGAGCGGAAACGCCGCCGCCGCTGCAGATCCTCCCTTTGTGGGTACTCTCACACCGGGTACGCCGGGACCGGGTGGGCAC
CCCGCAATGCATTACTTCAGCCCTTCGCTGAGGTCTTCTTCCAGGAAAAGAGACGGATTTGGTGGTGGTGGCGGCGGTGGTGGAGTTCATATGAGTAACAGAATCTCTTT
GGAAACGGAGTTTGGTGGCAACGGATCTTCTTCTGCGGTTACTTGTCATAAATGTGGAGACCACTTCAACAAATGGAATGATGCTGAAGCACACCATCTTTCTAAACATG
CTGTGACTGAACTTGTGGAAGGAGATTCATCAAGAAAAATTGTGGAGATCATATGCAGGACAAACTGGTTAAAGTCTGAGAATCAATCTGGTAGAATTGAAAGAGTTCTT
AAAGTTCACAACATGCAACGGACACTAGCCCGATTTGAGGAGCATCGCGAGACGGTAAAGACCAAAGCCAGCAAGCTCCCGAAGAAGCATCCTCGGTGTCTTGCAGATGG
GAACGAGCTACTGAGATTCTTTGGCACGACAGTTGAATGTCCGCTCGGCCTGAACGGCTCATCGAGCCTATGCCCATCGGAGAAATGCTGTGTTTGCAGGATCATACGTA
ATGGGTTCTCTGCAAAGACAGAGATGAAAGAAGGAATAGGCGTGTTCACGACGTCCACGAGTGTGAAAGCATTCAACTCGATCGAGGCGTCGGGAGGGGATGGGGCGATG
ATGAGAAAGGCAATGATAGTTTGCAGGGTGATTGCAGGGAGAGTTCACAGGCCATTGGAGAACATACAAGAAATGGCTGGGCAGACAGGGTTTGATTCATTGGCAGGCAA
AGTTGGGCTGCATTCCAATATAGAAGAGCTTTATTTGCTGAATCCTAGAGCTTTGCTTCCATGTTTTGTTGTAATTTGCAAGCCATGAAGTGGAAATTGATGTGGTACAT
TTCTTTTTCTTTTTTACCTACCTTCCTTCCAAAATTTATATAAAGATTTGAAGCTTTTTATGTGGAAAGGGGACTTTTTAATTTTTTTTTTTTTTTGAATCTCTCTTGAT
CCATGTAATAGAACTGTACAAAGTTTCAAATTCAAAATGAGAGTTGGTTTCGGAATCTTTATAGTTGTAAACACAAGGCGAACCTGAATGTCCTGTACTAAAAAGCATGA
G
Protein sequenceShow/hide protein sequence
MPTVWISLKKSLHCKSQPSEVHEPKTRRQIGEILTRKAGGRSGCSRSIANLKDVIHGGSKRHLDKPPSCSPRSIGSSEFLNPITHEVILSNSRCELKITGFRSGNAAAAA
DPPFVGTLTPGTPGPGGHPAMHYFSPSLRSSSRKRDGFGGGGGGGGVHMSNRISLETEFGGNGSSSAVTCHKCGDHFNKWNDAEAHHLSKHAVTELVEGDSSRKIVEIIC
RTNWLKSENQSGRIERVLKVHNMQRTLARFEEHRETVKTKASKLPKKHPRCLADGNELLRFFGTTVECPLGLNGSSSLCPSEKCCVCRIIRNGFSAKTEMKEGIGVFTTS
TSVKAFNSIEASGGDGAMMRKAMIVCRVIAGRVHRPLENIQEMAGQTGFDSLAGKVGLHSNIEELYLLNPRALLPCFVVICKP