; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr022296 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr022296
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Descriptionzinc finger CCCH domain-containing protein 65
Genome locationtig00154107:247345..254854
RNA-Seq ExpressionSgr022296
SyntenySgr022296
Gene Ontology termsGO:0045892 - negative regulation of transcription, DNA-templated (biological process)
GO:0005634 - nucleus (cellular component)
GO:0003723 - RNA binding (molecular function)
GO:0046872 - metal ion binding (molecular function)
InterPro domainsIPR000571 - Zinc finger, CCCH-type
IPR036855 - Zinc finger, CCCH-type superfamily
IPR041367 - E3 ligase, CCCH-type zinc finger
IPR045124 - Protein suppressor of sable-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6586476.1 Zinc finger CCCH domain-containing protein 65, partial [Cucurbita argyrosperma subsp. sororia]1.9e-28360.52Show/hide
Query:  MEDARIEAPILVESKLVFPPRRRSHLRSQTYRTLVRILSPFCENADVSLATEFDPQELRVACSSHLRQAIVEPAGQQVGEDFVRAQTGDWGIEKVTVVED
        MEDARI AP LVESKL+FPP RRSHL+S TYRTLVRILSPFC + DVSLATEFDPQ+LR A +SHL+Q+ +E +GQ V EDFVR + GD  +EKVTVVED
Subjt:  MEDARIEAPILVESKLVFPPRRRSHLRSQTYRTLVRILSPFCENADVSLATEFDPQELRVACSSHLRQAIVEPAGQQVGEDFVRAQTGDWGIEKVTVVED

Query:  GLGMNSLEDGCFHDFEAKMDVIKDTTIVVRSGNASDERKSSGASSQGNHCVHEENYRDELE--HIVEGNKVLVDDNNSNNMRTSDQKNGGNDQISPKKFQ
        G+GM+SLEDGCFH+ E KM+VIKD +IV RS N S ER+  GASSQ NHCVHEE++RD+LE  ++VEGNKVL+DDN+SN  R+ DQ N GNDQISPKKF 
Subjt:  GLGMNSLEDGCFHDFEAKMDVIKDTTIVVRSGNASDERKSSGASSQGNHCVHEENYRDELE--HIVEGNKVLVDDNNSNNMRTSDQKNGGNDQISPKKFQ

Query:  EEIADHQQVMAEEDAASIVQKLSQTVFSLDGNRIKEASKPKEDNQENRLLIVGKGMHQWPENQLKKLESEGIVHEP--DLERSYVSSFEVMEVGDGEQCS
        EEI +HQQVMAE+DA + VQ LS T+FS +G+ I EASK K DNQEN L IV K M+Q  E Q K+ + EGIV EP  ++E+S+VSSFEVMEV DGEQCS
Subjt:  EEIADHQQVMAEEDAASIVQKLSQTVFSLDGNRIKEASKPKEDNQENRLLIVGKGMHQWPENQLKKLESEGIVHEP--DLERSYVSSFEVMEVGDGEQCS

Query:  EK--------------------SKKSEDKAISMVVDASKVEHKMLSEDMQMDKCMIENEDIEEGEICGDSTYEISEDPMTLAEKVVDIQISEETVNRNKL
        EK                    SK SED A+SM VD S+V HK++SED+QM+KC+IENEDIEEGEICGDST+EISEDP  L E               K 
Subjt:  EK--------------------SKKSEDKAISMVVDASKVEHKMLSEDMQMDKCMIENEDIEEGEICGDSTYEISEDPMTLAEKVVDIQISEETVNRNKL

Query:  PSIQANGEAKKPVSFFSDTAEHASHENKEEFKQNKSSMEAHKSGSIVSEIMVKSTNEDRNDLMLMERGYKVKDSGTKKDLGSSTAIVSNLVLCDQVPNEQ
         SIQANGEAKKP S FSD  E+A+HENKEE KQ KS+++A K GS+ S  MV +  ED+ND ML E GY VKDS   +D  S TA +SN V   QVP+E 
Subjt:  PSIQANGEAKKPVSFFSDTAEHASHENKEEFKQNKSSMEAHKSGSIVSEIMVKSTNEDRNDLMLMERGYKVKDSGTKKDLGSSTAIVSNLVLCDQVPNEQ

Query:  GTGQKGIVFTDKETGVCNKKKRGPLSEGRKERKKILKRKKRAEKNRKLGVKRLKLHPVTVPKPIVYCRHYIKGRCQEGDKCKFSHDTIPLTKST------
         TG+K I  T+KETG CNKKKRG  +EGRKERKKI KRKKRAEKNRKLGVKRLKL PVTVPKP+VYCRHYIKGRC EGDKCKFSHDT P TKS       
Subjt:  GTGQKGIVFTDKETGVCNKKKRGPLSEGRKERKKILKRKKRAEKNRKLGVKRLKLHPVTVPKPIVYCRHYIKGRCQEGDKCKFSHDTIPLTKST------

Query:  ---------------LSKYPCSNFVSKGACPRGDTCMFSHKILQQEESTSTPNLKTEPKPPRILDNSDSQKKLNIS----------------GTSCNRTP
                       L KYPCSNFVSKG+CPRGDTCMFSHKIL QE+S +  NLKT+ KPP I D+ DSQ+K NIS                G SCNRTP
Subjt:  ---------------LSKYPCSNFVSKGACPRGDTCMFSHKILQQEESTSTPNLKTEPKPPRILDNSDSQKKLNIS----------------GTSCNRTP

Query:  QG--------GPPPVGINFLLLGS---------------RNLLNQSGLCPVENSKEISRKIQPETAPKGINFLSFGKISLKSRSEKLATPSSYNDASIPS
        QG        GP P GI+FL  G                RN L +SGLC VENS EISRKIQP+ AP+GI+ LSFG+ SLKS SE LATPSSYN A+I S
Subjt:  QG--------GPPPVGINFLLLGS---------------RNLLNQSGLCPVENSKEISRKIQPETAPKGINFLSFGKISLKSRSEKLATPSSYNDASIPS

Query:  DNQIQAVSKIGHDSDELLQGVQTTTLKQTNFLSPKAPMGNYVHNELLVRSSSSSKAGSDITVQIKESMHDKHESSSAILSRLPSAPICSGQSSDSLASSI
         NQ  + SK GHD  EL + VQTTT K  NF SP                             +KESM+ K E  S I SRLP   + +GQ SDS ASS+
Subjt:  DNQIQAVSKIGHDSDELLQGVQTTTLKQTNFLSPKAPMGNYVHNELLVRSSSSSKAGSDITVQIKESMHDKHESSSAILSRLPSAPICSGQSSDSLASSI

Query:  YKGTPNTAQKALLSTLAFAAKYESLMNKGKSSGSADVSKEGNKKVRNDNISGSLQNDNSKASKLLDFLLG
         KGTPN+AQKALLSTLAFAAKYES MNKGKSS SA  S  GNK+ RN+ ISGSL+ND +KASKLLDFLLG
Subjt:  YKGTPNTAQKALLSTLAFAAKYESLMNKGKSSGSADVSKEGNKKVRNDNISGSLQNDNSKASKLLDFLLG

XP_022154630.1 uncharacterized protein LOC111021849 [Momordica charantia]0.0e+0068.65Show/hide
Query:  MEDARIEAPILVESKLVFPPRRRSHLRSQTYRTLVRILSPFCENADVSLATEFDPQELRVACSSHLRQAIVEPAGQQVGEDFVRAQTGDWGIEKVTVVED
        MEDA  EAPILVES+  FPP RRSHLRS+TYRTLVRILSPFC+N +VSLATEFDP+ELR A SSHLR   VEP GQQVGE FVRA+TG+ G EKVTVVED
Subjt:  MEDARIEAPILVESKLVFPPRRRSHLRSQTYRTLVRILSPFCENADVSLATEFDPQELRVACSSHLRQAIVEPAGQQVGEDFVRAQTGDWGIEKVTVVED

Query:  GLGMNSLEDGCFHDFEAKMDVIKDTTIVVRSGNASDERKSSGASSQGNHCVHEENYRDEL--EHIVEGNKVLVDDNNSNNMRTSDQKNGGNDQISPKKFQ
        GLGMNSLEDG FH  E   DV+ DTT V  SGN  DERK SGA S GNHCVHEEN+R+EL  EHIVEGNKV VD+NNSNNMR SDQK GGNDQI    FQ
Subjt:  GLGMNSLEDGCFHDFEAKMDVIKDTTIVVRSGNASDERKSSGASSQGNHCVHEENYRDEL--EHIVEGNKVLVDDNNSNNMRTSDQKNGGNDQISPKKFQ

Query:  EEIADHQQVMAEEDAASIVQKLSQTVFSLDGNRIKEASKPKEDNQENRLLIVGKGMHQWPENQLKKLESEGIVHEPDLERSYVSSFEVMEVGDGEQCSEK
        EEIADHQ VMAEED  +I+QKLS T+  LDGN I    KPKEDNQENRLL+VG GMH+WPE+QLK+ ESEGIV  PDL+RS+VSSF+ +EV DGE+CSEK
Subjt:  EEIADHQQVMAEEDAASIVQKLSQTVFSLDGNRIKEASKPKEDNQENRLLIVGKGMHQWPENQLKKLESEGIVHEPDLERSYVSSFEVMEVGDGEQCSEK

Query:  --------------------SKKSEDKAISMVVDASKVEHKMLSEDMQMDKCMIENEDIEEGEICGD-STYEISEDPMTLAEKVVDIQISEETVNRNKLP
                             K S+DK ISMVVDASKV+HKML EDMQMDKC  ENEDIEEGEICGD STYEI EDP  LAEKVVDIQISE+ +N N+LP
Subjt:  --------------------SKKSEDKAISMVVDASKVEHKMLSEDMQMDKCMIENEDIEEGEICGD-STYEISEDPMTLAEKVVDIQISEETVNRNKLP

Query:  SIQANGEAKKPVSFFSDTAEHASHENKEEFKQNKSSME-AHKSGSIVSEIMVKSTNEDRNDLMLMERGYKVKDSGTKKDLGSSTAIVSNLVLCDQVPNEQ
        SI++NG A KP S FSDT E+ASHENKEE K N SSME A+K GS VS+ MV STNED+ND ML ER YKVKDSGTKK + S    V N VL +QVP+E 
Subjt:  SIQANGEAKKPVSFFSDTAEHASHENKEEFKQNKSSME-AHKSGSIVSEIMVKSTNEDRNDLMLMERGYKVKDSGTKKDLGSSTAIVSNLVLCDQVPNEQ

Query:  GTGQKGIVFTDKETGVCNKKKRGPLSEGRKERKKILKRKKRAEKNRKLGVKRLKLHPVTV--PKPIVYCRHYIKGRCQEGDKCKFSHDTIPLTKST----
         +G+K IV T+KETG C+KKKRGPLSEGRKERKKILKRKKRAEKNRKLGVKRLKLHPVTV  PKP+VYCRHYIKGRCQEGDKCKFSH TIPLTKST    
Subjt:  GTGQKGIVFTDKETGVCNKKKRGPLSEGRKERKKILKRKKRAEKNRKLGVKRLKLHPVTV--PKPIVYCRHYIKGRCQEGDKCKFSHDTIPLTKST----

Query:  -----------------LSKYPCSNFVSKGACPRGDTCMFSHKILQQEESTSTPNLKTEPKPPRILDNSDSQKKLNISGTSCNRTPQG------------
                         LSKYPCSNFVSKG+CPRGDTCMFSHKIL QEESTSTPNLKTEPKPP+ILD SDSQKKL+ISGTSCNRTP G            
Subjt:  -----------------LSKYPCSNFVSKGACPRGDTCMFSHKILQQEESTSTPNLKTEPKPPRILDNSDSQKKLNISGTSCNRTPQG------------

Query:  ---------GPPPVGINFLLLGSR-------------------NLLNQSGLCPVENSKEISRKIQPETAPKGINFLSFGKISLKSRSEKLATPSSYNDAS
                 GPPPVGINFL  G                     N LN+SG     NS EI R+I PETAPKGINFLSFGK SLKS SEKL T SSYNDA+
Subjt:  ---------GPPPVGINFLLLGSR-------------------NLLNQSGLCPVENSKEISRKIQPETAPKGINFLSFGKISLKSRSEKLATPSSYNDAS

Query:  IPSDNQIQAVSKIGHDSDELLQGVQTTTLKQTNFLSPKAPMGNYVHNELLVRSSSSSKAGSDITVQIKESMHDKHESSSAILSRLPSAPICSGQSSDSLA
        IPS N+I+ VSK GHD +ELL+ VQ  T K TN LSPKAP+GNY   E L  SSS+SK GSD++VQI ES +DK+  SSA+L RLP   I  GQSSDSLA
Subjt:  IPSDNQIQAVSKIGHDSDELLQGVQTTTLKQTNFLSPKAPMGNYVHNELLVRSSSSSKAGSDITVQIKESMHDKHESSSAILSRLPSAPICSGQSSDSLA

Query:  SSIYKGTPNTAQKALLSTLAFAAKYESLMNKGKSSGSADVSKEGNKKVRNDNISGSLQNDNSKASKLLDFLLG
        SS+YKGTPN+AQKALLSTLAFAA+YESLMNKGKSSGS   S EGNK+ +N+N+ GSLQND +KASKLL  LLG
Subjt:  SSIYKGTPNTAQKALLSTLAFAAKYESLMNKGKSSGSADVSKEGNKKVRNDNISGSLQNDNSKASKLLDFLLG

XP_022965565.1 uncharacterized protein LOC111465427 isoform X1 [Cucurbita maxima]1.5e-28561.24Show/hide
Query:  MEDARIEAPILVESKLVFPPRRRSHLRSQTYRTLVRILSPFCENADVSLATEFDPQELRVACSSHLRQAIVEPAGQQVGEDFVRAQTGDWGIEKVTVVED
        MEDARIEA  LVESKL+FPP RRSHL+S TYRTLVRILSPFC + DVSLATEFDPQ+LR A +SHL+Q+ VE +GQQ GEDFVR + GD  IE VTVVED
Subjt:  MEDARIEAPILVESKLVFPPRRRSHLRSQTYRTLVRILSPFCENADVSLATEFDPQELRVACSSHLRQAIVEPAGQQVGEDFVRAQTGDWGIEKVTVVED

Query:  GLGMNSLEDGCFHDFEAKMDVIKDTTIVVRSGNASDERKSSGASSQGNHCVHEENYRDELE--HIVEGNKVLVDDNNSNNMRTSDQKNGGNDQISPKKFQ
        G+GM+SLEDG FH+ E KM+VIKD +IV RS N S ER+  GASSQ N CVHEE++ DELE  ++VEGNKVL+DDN+SN MR+SDQ N  NDQISPKKFQ
Subjt:  GLGMNSLEDGCFHDFEAKMDVIKDTTIVVRSGNASDERKSSGASSQGNHCVHEENYRDELE--HIVEGNKVLVDDNNSNNMRTSDQKNGGNDQISPKKFQ

Query:  EEIADHQQVMAEEDAASIVQKLSQTVFSLDGNRIKEASKPKEDNQENRLLIVGKGMHQWPENQLKKLESEGIVHEP--DLERSYVSSFEVMEVGDGEQCS
        EEI +HQQVMAE+DA +  Q LS T+FS +G+ I+EASK K DNQEN L I  KGM+Q  E Q K+ + EG V EP  ++ERS+VSSFEVMEV DGEQCS
Subjt:  EEIADHQQVMAEEDAASIVQKLSQTVFSLDGNRIKEASKPKEDNQENRLLIVGKGMHQWPENQLKKLESEGIVHEP--DLERSYVSSFEVMEVGDGEQCS

Query:  EK--------------------SKKSEDKAISMVVDASKVEHKMLSEDMQMDKCMIENEDIEEGEICGDSTYEISEDPMTLAEKVVDIQISEETVNRNKL
        EK                    SK SED A+SM VD S+V HK++SE++QMDKC+IENEDIEEGEICGDST+EISEDP  L E               K 
Subjt:  EK--------------------SKKSEDKAISMVVDASKVEHKMLSEDMQMDKCMIENEDIEEGEICGDSTYEISEDPMTLAEKVVDIQISEETVNRNKL

Query:  PSIQANGEAKKPVSFFSDTAEHASHENKEEFKQNKSSMEAHKSGSIVSEIMVKSTNEDRNDLMLMERGYKVKDSGTKKDLGSSTAIVSNLVLCDQVPNEQ
          IQANGEAKKP S FSD  ++A+HENKEE KQ KSS++A K GS+ S  MV +T  D+ND ML E GY VKDS  K+D  S TA +SN V C QVP+E 
Subjt:  PSIQANGEAKKPVSFFSDTAEHASHENKEEFKQNKSSMEAHKSGSIVSEIMVKSTNEDRNDLMLMERGYKVKDSGTKKDLGSSTAIVSNLVLCDQVPNEQ

Query:  GTGQKGIVFTDKETGVCNKKKRGPLSEGRKERKKILKRKKRAEKNRKLGVKRLKLHPVTVPKPIVYCRHYIKGRCQEGDKCKFSHDTIPLTKS-------
         TG+K IV T+KETG CNKKKRG  +EGRKERKKI KRKKRAEKNRKLGVKRLKL PVT+PKP+VYCRHYIKGRC EGDKCKFSHDT P TKS       
Subjt:  GTGQKGIVFTDKETGVCNKKKRGPLSEGRKERKKILKRKKRAEKNRKLGVKRLKLHPVTVPKPIVYCRHYIKGRCQEGDKCKFSHDTIPLTKS-------

Query:  --------------TLSKYPCSNFVSKGACPRGDTCMFSHKILQQEESTSTPNLKTEPKPPRILDNSDSQKKLNIS----------------GTSCNRTP
                       L KYPCSNFVSKG+CPRGDTCMFSHKIL QE S +  NLKT+ KPP   D+SDSQ+K N+S                G SCNRTP
Subjt:  --------------TLSKYPCSNFVSKGACPRGDTCMFSHKILQQEESTSTPNLKTEPKPPRILDNSDSQKKLNIS----------------GTSCNRTP

Query:  QG--------GPPPVGINFLLLGS---------------RNLLNQSGLCPVENSKEISRKIQPETAPKGINFLSFGKISLKSRSEKLATPSSYNDASIPS
        QG        G  P+GI+FL  G                RN L QSGLC VENS EISRKIQP++AP+GI+ LSFG+ SL+S SE  ATPSSYN A+I S
Subjt:  QG--------GPPPVGINFLLLGS---------------RNLLNQSGLCPVENSKEISRKIQPETAPKGINFLSFGKISLKSRSEKLATPSSYNDASIPS

Query:  DNQIQAVSKIGHDSDELLQGVQTTTLKQTNFLSPKAPMGNYVHNELLVRSSSSSKAGSDITVQIKESMHDKHESSSAILSRLPSAPICSGQSSDSLASSI
         NQ  + SK GHD  EL + VQ TT K TNF S   P+GN          +S+SK  SD++V +KESM+ K E  S I SRLP   + +GQ SDS ASS+
Subjt:  DNQIQAVSKIGHDSDELLQGVQTTTLKQTNFLSPKAPMGNYVHNELLVRSSSSSKAGSDITVQIKESMHDKHESSSAILSRLPSAPICSGQSSDSLASSI

Query:  YKGTPNTAQKALLSTLAFAAKYESLMNKGKSSGSADVSKEGNKKVRNDNISGSLQNDNSKASKLLDFLLG
         KGTPN+AQKALLSTLAFAAKYES MNKGKSS SA  S +GNK+ RN+ ISGSL+ND +KASKLLDFLLG
Subjt:  YKGTPNTAQKALLSTLAFAAKYESLMNKGKSSGSADVSKEGNKKVRNDNISGSLQNDNSKASKLLDFLLG

XP_023537273.1 zinc finger CCCH domain-containing protein 65 [Cucurbita pepo subsp. pepo]7.8e-29061.69Show/hide
Query:  MEDARIEAPILVESKLVFPPRRRSHLRSQTYRTLVRILSPFCENADVSLATEFDPQELRVACSSHLRQAIVEPAGQQVGEDFVRAQTGDWGIEKVTVVED
        MEDARIEAP LVESKL+FPP RRSHL+S TYRTLVRILSPFC + DVSLATEFDPQ+LR A +SHL+Q+ VE +GQQV  DFVR + GD  IEKVTVVED
Subjt:  MEDARIEAPILVESKLVFPPRRRSHLRSQTYRTLVRILSPFCENADVSLATEFDPQELRVACSSHLRQAIVEPAGQQVGEDFVRAQTGDWGIEKVTVVED

Query:  GLGMNSLEDGCFHDFEAKMDVIKDTTIVVRSGNASDERKSSGASSQGNHCVHEENYRDELE--HIVEGNKVLVDDNNSNNMRTSDQKNGGNDQISPKKFQ
        G+GM+SLE+GCFH+ EAKM+VIKD +IV  S NA  ER+  GA SQ NHCVHEE++RDELE  ++VEGNKVL+DDN+SN  R+SDQ N GN+QISPKKFQ
Subjt:  GLGMNSLEDGCFHDFEAKMDVIKDTTIVVRSGNASDERKSSGASSQGNHCVHEENYRDELE--HIVEGNKVLVDDNNSNNMRTSDQKNGGNDQISPKKFQ

Query:  EEIADHQQVMAEEDAASIVQKLSQTVFSLDGNRIKEASKPKEDNQENRLLIVGKGMHQWPENQLKKLESEGIVHEP--DLERSYVSSFEVMEVGDGEQCS
        EEI +HQQVMAE+DA + VQ LS T+FS  G+ I EAS+ K DNQENRL IV KGM+Q  E Q K+ + EGIV EP  ++E+S+VSSFEVMEV DG+QCS
Subjt:  EEIADHQQVMAEEDAASIVQKLSQTVFSLDGNRIKEASKPKEDNQENRLLIVGKGMHQWPENQLKKLESEGIVHEP--DLERSYVSSFEVMEVGDGEQCS

Query:  EK--------------------SKKSEDKAISMVVDASKVEHKMLSEDMQMDKCMIENEDIEEGEICGDSTYEISEDPMTLAEKVVDIQISEETVNRNKL
        EK                    SK SED A+SM VDAS+V HK++SED+QM+KC++ENEDIEEGEICGDST+EISEDP  L E               K 
Subjt:  EK--------------------SKKSEDKAISMVVDASKVEHKMLSEDMQMDKCMIENEDIEEGEICGDSTYEISEDPMTLAEKVVDIQISEETVNRNKL

Query:  PSIQANGEAKKPVSFFSDTAEHASHENKEEFKQNKSSMEAHKSGSIVSEIMVKSTNEDRNDLMLMERGYKVKDSGTKKDLGSSTAIVSNLVLCDQVPNEQ
         SIQANGEAKKP S FSD  E+A+HENKEE KQ KS+++A K GS+ S  MV +  ED+ND ML E GY VKDS   +D  S TA +SN V   QVP+E 
Subjt:  PSIQANGEAKKPVSFFSDTAEHASHENKEEFKQNKSSMEAHKSGSIVSEIMVKSTNEDRNDLMLMERGYKVKDSGTKKDLGSSTAIVSNLVLCDQVPNEQ

Query:  GTGQKGIVFTDKETGVCNKKKRGPLSEGRKERKKILKRKKRAEKNRKLGVKRLKLHPVTVPKPIVYCRHYIKGRCQEGDKCKFSHDTIPLTKST------
         TG+K IV T+KETG CNKKKRG  +EGRKERKKI KRKKRAEKNRKLGVKRLKL PVTVPKP+VYCRHYIKGRC EGDKCKFSHDT P TKS       
Subjt:  GTGQKGIVFTDKETGVCNKKKRGPLSEGRKERKKILKRKKRAEKNRKLGVKRLKLHPVTVPKPIVYCRHYIKGRCQEGDKCKFSHDTIPLTKST------

Query:  ---------------LSKYPCSNFVSKGACPRGDTCMFSHKILQQEESTSTPNLKTEPKPPRILDNSDSQKKLNIS----------------GTSCNRTP
                       L KYPCSNFVSKG+CPRGDTCMFSHKIL QE+S +  NLKT+ KPP I D+ DSQ+K NIS                G SCNRTP
Subjt:  ---------------LSKYPCSNFVSKGACPRGDTCMFSHKILQQEESTSTPNLKTEPKPPRILDNSDSQKKLNIS----------------GTSCNRTP

Query:  QG--------GPPPVGINFLLLGS---------------RNLLNQSGLCPVENSKEISRKIQPETAPKGINFLSFGKISLKSRSEKLATPSSYNDASIPS
        QG        GP P GI+FL  G                RN L QSGLC VENS EISRKIQP++AP+GI+ LSFG+ SL+S SE LATPSSYN A+I S
Subjt:  QG--------GPPPVGINFLLLGS---------------RNLLNQSGLCPVENSKEISRKIQPETAPKGINFLSFGKISLKSRSEKLATPSSYNDASIPS

Query:  DNQIQAVSKIGHDSDELLQGVQTTTLKQTNFLSPKAPMGNYVHNELLVRSSSSSKAGSDITVQIKESMHDKHESSSAILSRLPSAPICSGQSSDSLASSI
         NQ    SK GHD  EL + VQTTT K TNF S   P+GN          +S+SK  SD++V +KESM+ K E  S I SRLP   + +GQ SDS ASS+
Subjt:  DNQIQAVSKIGHDSDELLQGVQTTTLKQTNFLSPKAPMGNYVHNELLVRSSSSSKAGSDITVQIKESMHDKHESSSAILSRLPSAPICSGQSSDSLASSI

Query:  -YKGTPNTAQKALLSTLAFAAKYESLMNKGKSSGSADVSKEGNKKVRNDNISGSLQNDNSKASKLLDFLLG
          KGTP +AQKALLSTLAFAAKYES MNKGKSS SA  SK+GNK+  N+NISGSL++D +KASKLLDFLLG
Subjt:  -YKGTPNTAQKALLSTLAFAAKYESLMNKGKSSGSADVSKEGNKKVRNDNISGSLQNDNSKASKLLDFLLG

XP_038898303.1 uncharacterized protein LOC120086002 [Benincasa hispida]9.2e-29162.62Show/hide
Query:  MEDARIEAPILVESKLVFPPRRRSHLRSQTYRTLVRILSPFCENADVSLATEFDPQELRVACSSHLRQAIVEPAGQQVGEDFVRAQTGDWGIEKVTVVED
        ME+A IEAP LVESKL+FPP RR    S+TY TLVRILSPFCEN +VSLATEFDPQ+LR A SSH+RQ  +E + QQV ED VR +TGD   EKVTVVED
Subjt:  MEDARIEAPILVESKLVFPPRRRSHLRSQTYRTLVRILSPFCENADVSLATEFDPQELRVACSSHLRQAIVEPAGQQVGEDFVRAQTGDWGIEKVTVVED

Query:  GLGMNSLEDGCFHDFEAKMDVIKDTTIVVRSGNASDERKSSGASSQGNHCVHEENYRDEL--EHIVEGNKVLVDDNNSNNMRTSDQKNGGNDQISPKKFQ
        GLG N+LED   HD E  MD +KD  IV RS N SD+R++  A S+ N CVHE N+ +E+  EHIVEGNKVLVDDN+ +N RTS+Q NGGNDQISP K  
Subjt:  GLGMNSLEDGCFHDFEAKMDVIKDTTIVVRSGNASDERKSSGASSQGNHCVHEENYRDEL--EHIVEGNKVLVDDNNSNNMRTSDQKNGGNDQISPKKFQ

Query:  EEIADHQQVMAEEDAASIVQKLSQTVFSLDGNRIKEASKPKEDNQENRLLIVGKGMHQWPENQLKKLESEGIVHEPDL--ERSYVSSFEVMEVGDGEQCS
        EEI +H QVM EED ASIVQKLS  V  +DG+ +KEASK KEDNQEN   I+ K M Q  EN  K+L+SE IV + DL  E+SYVSS EVM   + EQC 
Subjt:  EEIADHQQVMAEEDAASIVQKLSQTVFSLDGNRIKEASKPKEDNQENRLLIVGKGMHQWPENQLKKLESEGIVHEPDL--ERSYVSSFEVMEVGDGEQCS

Query:  --------------------EKSKKSEDKAISMVVDASKVEHKMLSEDMQMDKCMIENEDIEEGEICGDSTYEISEDPMTLAEKVVDIQISEETVNRNKL
                            E+SK   DK I++ VDASKVEHKML E M++D  MIEN +IEEGEICGDST EISEDP+ L EKV  IQISEE +NR+KL
Subjt:  --------------------EKSKKSEDKAISMVVDASKVEHKMLSEDMQMDKCMIENEDIEEGEICGDSTYEISEDPMTLAEKVVDIQISEETVNRNKL

Query:  PSIQANGEAKKPVSFFSDTAEHASHENKEE--FKQNKSSMEAHKSGSIVSEIMVKSTNEDRNDLMLMERGYKVKDSGTKKDLGSSTAIVSNLVLCDQVPN
        P IQANGEA K VS   D A +ASHEN+EE   KQNKSSM++ +  SIVSE MV STNED+N  ML ERGY  K++G+KKD+ S TA  SN VLC QVP 
Subjt:  PSIQANGEAKKPVSFFSDTAEHASHENKEE--FKQNKSSMEAHKSGSIVSEIMVKSTNEDRNDLMLMERGYKVKDSGTKKDLGSSTAIVSNLVLCDQVPN

Query:  EQGTGQKGIVFTDKETGVCNKKKRGPLSEGRKERKKILKRKKRAEKNRKLGVKRLKLHPVTVPKPIVYCRHYIKGRCQEGDKCKFSHDTIPLTKST----
        E   G+K IV  +KETG CNKKKRGP SEGRKERKKI KRKKRAEKNRK GVKRLKLH VTVPKP+VYCRHYIKGRCQEGDKCKFSHDT PLTKST    
Subjt:  EQGTGQKGIVFTDKETGVCNKKKRGPLSEGRKERKKILKRKKRAEKNRKLGVKRLKLHPVTVPKPIVYCRHYIKGRCQEGDKCKFSHDTIPLTKST----

Query:  -----------------LSKYPCSNFVSKGACPRGDTCMFSHKILQQ-EESTSTPNLKTEPKPPRILDNSDSQKKLNIS--------------GTSCNRT
                         L KYPCSNFVS G+CPRGDTCMFSHK+L Q +ESTS PNLKTE KPP I D+ D+QK+LNIS              G SCNRT
Subjt:  -----------------LSKYPCSNFVSKGACPRGDTCMFSHKILQQ-EESTSTPNLKTEPKPPRILDNSDSQKKLNIS--------------GTSCNRT

Query:  PQG--------GPPPVGINFLLLGS---------------RNLLNQSGLCPVENSKEISRKIQPETAPKGINFLSFGKISLKSRSEKLATPSSYNDASIP
        P G        GPPPVGINFL  G                RN+L QSGL  VENS EISRKIQP TAPKGINFLSFG+ SL+S  E LATP SYN  SIP
Subjt:  PQG--------GPPPVGINFLLLGS---------------RNLLNQSGLCPVENSKEISRKIQPETAPKGINFLSFGKISLKSRSEKLATPSSYNDASIP

Query:  SDNQIQAVSKIGHDSDELLQGVQTTTLKQTNFLSPKAPMGNYVHNELLVRSSSSSKAGSDITVQIKESMHDKHESSSAILSRLPSAPICSGQSSDSLASS
        S NQIQAVSK GH S+ELL+ VQ TT KQTNF S   P+GNY  NE LV SSS+ KAG+D+ VQ K S+ D+HE  SA+LSRL   P  SGQ SD LASS
Subjt:  SDNQIQAVSKIGHDSDELLQGVQTTTLKQTNFLSPKAPMGNYVHNELLVRSSSSSKAGSDITVQIKESMHDKHESSSAILSRLPSAPICSGQSSDSLASS

Query:  IYKGTPNTAQKALLSTLAFAAKYESLMNKGKSSGSADVSKEGNKKVRNDNISGSLQNDNSKASKLLDFLLG
        + KGTP +AQKALLSTLAFAAK++S MNKGKSS    +  EGNK+  N N+ GSL+ND +KASKLLD LLG
Subjt:  IYKGTPNTAQKALLSTLAFAAKYESLMNKGKSSGSADVSKEGNKKVRNDNISGSLQNDNSKASKLLDFLLG

TrEMBL top hitse value%identityAlignment
A0A1S3B9F1 uncharacterized protein LOC103487622 isoform X12.4e-27660.64Show/hide
Query:  EAPILVESKLVFPPRRRSHLRSQTYRTLVRILSPFCENADVSLATEFDPQELRVACSSHLRQAIVEPAGQQVGEDFVRAQTGDWGIEKVTVVEDGLGMNS
        E   LVESKL+FPP RR    S+TYRTLVRILSPFCENADVSLATEFDPQ+LR A SSH+RQA VEP+ Q+V +D VR +TGD  IEK TVVED      
Subjt:  EAPILVESKLVFPPRRRSHLRSQTYRTLVRILSPFCENADVSLATEFDPQELRVACSSHLRQAIVEPAGQQVGEDFVRAQTGDWGIEKVTVVEDGLGMNS

Query:  LEDGCFHDFEAKMDVIKDTTIVVRSGNASDERKSSGASSQGNHCVHEENYRD--ELEHIVEGNKVLVDDNNSNNMRTSDQKNGGNDQISPKKFQEEIADH
               DF+  MD I+D  IV RS N SD+R++    S+ NHCV+EEN  +  E EHIVEGNK+L DDN+ +N+RTS Q NGGNDQISP+  +EEI +H
Subjt:  LEDGCFHDFEAKMDVIKDTTIVVRSGNASDERKSSGASSQGNHCVHEENYRD--ELEHIVEGNKVLVDDNNSNNMRTSDQKNGGNDQISPKKFQEEIADH

Query:  QQVMAEEDAASIVQKLSQTVFSLDGNRIKEASKPKEDNQENRLLIVGKGMHQWPENQLKKLESEGIVHEPDLER--SYVSSF-EVMEVGDGEQCSEKS--
        Q VM EEDA SIVQKLS T+ SLDG+  +EASK +E NQEN L    K MH+  EN  K+L+SE IV +PDLER  SY+SS  EVM+V D EQCSE    
Subjt:  QQVMAEEDAASIVQKLSQTVFSLDGNRIKEASKPKEDNQENRLLIVGKGMHQWPENQLKKLESEGIVHEPDLER--SYVSSF-EVMEVGDGEQCSEKS--

Query:  ------------------KKSEDKAISMVVDASKVEHKMLSEDMQMDKCMIENEDIEEGEICGDSTYEISEDPMTLAEKVVDIQISEETVNRNKLPSIQA
                          K SEDKAI++ +  SKVEH ML + M+MDK M EN DIEEGEICG+ T EISEDP  L EKV  IQISEET+NR+KLP IQA
Subjt:  ------------------KKSEDKAISMVVDASKVEHKMLSEDMQMDKCMIENEDIEEGEICGDSTYEISEDPMTLAEKVVDIQISEETVNRNKLPSIQA

Query:  NGEAKKPVSFFSDTAEHASHENKEEF--KQNKSSMEAHKSGSIVSEIMVKSTNEDRNDLMLMERGYKVKDSGTKKDLGSSTAIVSNLVLCDQVPNEQGTG
        + EA KPVS F D A +ASHEN+E+F  K+NKS M+A K  SIVSE  V STNED++  +L ERGY  KDSG KKD+ S TA  SN VLC QVP E  TG
Subjt:  NGEAKKPVSFFSDTAEHASHENKEEF--KQNKSSMEAHKSGSIVSEIMVKSTNEDRNDLMLMERGYKVKDSGTKKDLGSSTAIVSNLVLCDQVPNEQGTG

Query:  QKGIVFTDKETGVCNKKKRGPLSEGRKERKKILKRKKRAEKNRKLGVKRLKLHPVTVPKPIVYCRHYIKGRCQEGDKCKFSHDTIPLTKST---------
        +K  V  +KETGVCNKKKRGPLSEG+KERKKI KRKKRAEKNRK GVKRLKLHPVTVPKP+VYCRHY+KGRCQEGDKCKFSHDT PLTKST         
Subjt:  QKGIVFTDKETGVCNKKKRGPLSEGRKERKKILKRKKRAEKNRKLGVKRLKLHPVTVPKPIVYCRHYIKGRCQEGDKCKFSHDTIPLTKST---------

Query:  ------------LSKYPCSNFVSKGACPRGDTCMFSHKILQQEESTSTPNLKTEPKPPRILDNSDSQKKLNIS--------------GTSCNRTPQG---
                    LSKY C+NFVS G+CPRGDTCMFSHKIL Q ESTSTPNLKTE KPP I D+ D++K+ N+S              GTSCNRTP G   
Subjt:  ------------LSKYPCSNFVSKGACPRGDTCMFSHKILQQEESTSTPNLKTEPKPPRILDNSDSQKKLNIS--------------GTSCNRTPQG---

Query:  -----GPPPVGINFLLLGS---------------RNLLNQSGLCPVENSKEISRKIQPETAPKGINFLSFGKISLKSRSEKLATPSSYNDASIPSDNQIQ
             GPPP GI FL  G                 N+L QS +C  E S E SRKIQP  APKGINFLSFG+  L+S  E LATPSS N   I S NQIQ
Subjt:  -----GPPPVGINFLLLGS---------------RNLLNQSGLCPVENSKEISRKIQPETAPKGINFLSFGKISLKSRSEKLATPSSYNDASIPSDNQIQ

Query:  AVSKIGHDSDELLQGVQTTTLKQTNFLSPKAPMGNYVHNELLVRSSSSSKAGSDITVQIKESMHDKHESSSAILSRLPSAPICSGQSSDSLASSIYKGTP
        AVSK GHDS+ELL+ VQ TT KQTNFLS   P GNY  NE LV SSS+ KAG+D+ VQ K ++ DKHE  SA+L R P  PI SGQ SDSLASSI KGTP
Subjt:  AVSKIGHDSDELLQGVQTTTLKQTNFLSPKAPMGNYVHNELLVRSSSSSKAGSDITVQIKESMHDKHESSSAILSRLPSAPICSGQSSDSLASSIYKGTP

Query:  NTAQKALLSTLAFAAKYESLMNKGKSSGSADVSKEGNKKVRNDNISGSLQNDNSKASKLLDFL
         +AQ+ALLSTLAFAAK++S MNKGKSS    +   GNK   N+NI GSL+ND +KASKL+D L
Subjt:  NTAQKALLSTLAFAAKYESLMNKGKSSGSADVSKEGNKKVRNDNISGSLQNDNSKASKLLDFL

A0A6J1DK56 uncharacterized protein LOC1110218490.0e+0068.65Show/hide
Query:  MEDARIEAPILVESKLVFPPRRRSHLRSQTYRTLVRILSPFCENADVSLATEFDPQELRVACSSHLRQAIVEPAGQQVGEDFVRAQTGDWGIEKVTVVED
        MEDA  EAPILVES+  FPP RRSHLRS+TYRTLVRILSPFC+N +VSLATEFDP+ELR A SSHLR   VEP GQQVGE FVRA+TG+ G EKVTVVED
Subjt:  MEDARIEAPILVESKLVFPPRRRSHLRSQTYRTLVRILSPFCENADVSLATEFDPQELRVACSSHLRQAIVEPAGQQVGEDFVRAQTGDWGIEKVTVVED

Query:  GLGMNSLEDGCFHDFEAKMDVIKDTTIVVRSGNASDERKSSGASSQGNHCVHEENYRDEL--EHIVEGNKVLVDDNNSNNMRTSDQKNGGNDQISPKKFQ
        GLGMNSLEDG FH  E   DV+ DTT V  SGN  DERK SGA S GNHCVHEEN+R+EL  EHIVEGNKV VD+NNSNNMR SDQK GGNDQI    FQ
Subjt:  GLGMNSLEDGCFHDFEAKMDVIKDTTIVVRSGNASDERKSSGASSQGNHCVHEENYRDEL--EHIVEGNKVLVDDNNSNNMRTSDQKNGGNDQISPKKFQ

Query:  EEIADHQQVMAEEDAASIVQKLSQTVFSLDGNRIKEASKPKEDNQENRLLIVGKGMHQWPENQLKKLESEGIVHEPDLERSYVSSFEVMEVGDGEQCSEK
        EEIADHQ VMAEED  +I+QKLS T+  LDGN I    KPKEDNQENRLL+VG GMH+WPE+QLK+ ESEGIV  PDL+RS+VSSF+ +EV DGE+CSEK
Subjt:  EEIADHQQVMAEEDAASIVQKLSQTVFSLDGNRIKEASKPKEDNQENRLLIVGKGMHQWPENQLKKLESEGIVHEPDLERSYVSSFEVMEVGDGEQCSEK

Query:  --------------------SKKSEDKAISMVVDASKVEHKMLSEDMQMDKCMIENEDIEEGEICGD-STYEISEDPMTLAEKVVDIQISEETVNRNKLP
                             K S+DK ISMVVDASKV+HKML EDMQMDKC  ENEDIEEGEICGD STYEI EDP  LAEKVVDIQISE+ +N N+LP
Subjt:  --------------------SKKSEDKAISMVVDASKVEHKMLSEDMQMDKCMIENEDIEEGEICGD-STYEISEDPMTLAEKVVDIQISEETVNRNKLP

Query:  SIQANGEAKKPVSFFSDTAEHASHENKEEFKQNKSSME-AHKSGSIVSEIMVKSTNEDRNDLMLMERGYKVKDSGTKKDLGSSTAIVSNLVLCDQVPNEQ
        SI++NG A KP S FSDT E+ASHENKEE K N SSME A+K GS VS+ MV STNED+ND ML ER YKVKDSGTKK + S    V N VL +QVP+E 
Subjt:  SIQANGEAKKPVSFFSDTAEHASHENKEEFKQNKSSME-AHKSGSIVSEIMVKSTNEDRNDLMLMERGYKVKDSGTKKDLGSSTAIVSNLVLCDQVPNEQ

Query:  GTGQKGIVFTDKETGVCNKKKRGPLSEGRKERKKILKRKKRAEKNRKLGVKRLKLHPVTV--PKPIVYCRHYIKGRCQEGDKCKFSHDTIPLTKST----
         +G+K IV T+KETG C+KKKRGPLSEGRKERKKILKRKKRAEKNRKLGVKRLKLHPVTV  PKP+VYCRHYIKGRCQEGDKCKFSH TIPLTKST    
Subjt:  GTGQKGIVFTDKETGVCNKKKRGPLSEGRKERKKILKRKKRAEKNRKLGVKRLKLHPVTV--PKPIVYCRHYIKGRCQEGDKCKFSHDTIPLTKST----

Query:  -----------------LSKYPCSNFVSKGACPRGDTCMFSHKILQQEESTSTPNLKTEPKPPRILDNSDSQKKLNISGTSCNRTPQG------------
                         LSKYPCSNFVSKG+CPRGDTCMFSHKIL QEESTSTPNLKTEPKPP+ILD SDSQKKL+ISGTSCNRTP G            
Subjt:  -----------------LSKYPCSNFVSKGACPRGDTCMFSHKILQQEESTSTPNLKTEPKPPRILDNSDSQKKLNISGTSCNRTPQG------------

Query:  ---------GPPPVGINFLLLGSR-------------------NLLNQSGLCPVENSKEISRKIQPETAPKGINFLSFGKISLKSRSEKLATPSSYNDAS
                 GPPPVGINFL  G                     N LN+SG     NS EI R+I PETAPKGINFLSFGK SLKS SEKL T SSYNDA+
Subjt:  ---------GPPPVGINFLLLGSR-------------------NLLNQSGLCPVENSKEISRKIQPETAPKGINFLSFGKISLKSRSEKLATPSSYNDAS

Query:  IPSDNQIQAVSKIGHDSDELLQGVQTTTLKQTNFLSPKAPMGNYVHNELLVRSSSSSKAGSDITVQIKESMHDKHESSSAILSRLPSAPICSGQSSDSLA
        IPS N+I+ VSK GHD +ELL+ VQ  T K TN LSPKAP+GNY   E L  SSS+SK GSD++VQI ES +DK+  SSA+L RLP   I  GQSSDSLA
Subjt:  IPSDNQIQAVSKIGHDSDELLQGVQTTTLKQTNFLSPKAPMGNYVHNELLVRSSSSSKAGSDITVQIKESMHDKHESSSAILSRLPSAPICSGQSSDSLA

Query:  SSIYKGTPNTAQKALLSTLAFAAKYESLMNKGKSSGSADVSKEGNKKVRNDNISGSLQNDNSKASKLLDFLLG
        SS+YKGTPN+AQKALLSTLAFAA+YESLMNKGKSSGS   S EGNK+ +N+N+ GSLQND +KASKLL  LLG
Subjt:  SSIYKGTPNTAQKALLSTLAFAAKYESLMNKGKSSGSADVSKEGNKKVRNDNISGSLQNDNSKASKLLDFLLG

A0A6J1FC47 zinc finger CCCH domain-containing protein 651.7e-28260.31Show/hide
Query:  MEDARIEAPILVESKLVFPPRRRSHLRSQTYRTLVRILSPFCENADVSLATEFDPQELRVACSSHLRQAIVEPAGQQVGEDFVRAQTGDWGIEKVTVVED
        MEDARIEAP LVESKL+FPP RRSHL+S TYRTLVRI SPFC + DVSLATEFDPQ+LR A +SHL+Q+ +E +GQ V EDFVR + GD  +EKVTVVED
Subjt:  MEDARIEAPILVESKLVFPPRRRSHLRSQTYRTLVRILSPFCENADVSLATEFDPQELRVACSSHLRQAIVEPAGQQVGEDFVRAQTGDWGIEKVTVVED

Query:  GLGMNSLEDGCFHDFEAKMDVIKDTTIVVRSGNASDERKSSGASSQGNHCVHEENYRDELE--HIVEGNKVLVDDNNSNNMRTSDQKNGGNDQISPKKFQ
        G+GM+SL DGCFH+ E KM+VIKD +IV R+ N S ER+  GASSQ NHCVHEE++RD+LE  ++VEGNKVL+DDN+SN  R+SDQ N GNDQISPKKF 
Subjt:  GLGMNSLEDGCFHDFEAKMDVIKDTTIVVRSGNASDERKSSGASSQGNHCVHEENYRDELE--HIVEGNKVLVDDNNSNNMRTSDQKNGGNDQISPKKFQ

Query:  EEIADHQQVMAEEDAASIVQKLSQTVFSLDGNRIKEASKPKEDNQENRLLIVGKGMHQWPENQLKKLESEGIVHEP--DLERSYVSSFEVMEVGDGEQCS
        EEI +HQQVMAE+DA + VQ LS T+FS +G+ I EASK K DNQEN L IV K M+Q  E Q K+ + EGIV EP  ++E+S+VSSFEVMEV DGEQCS
Subjt:  EEIADHQQVMAEEDAASIVQKLSQTVFSLDGNRIKEASKPKEDNQENRLLIVGKGMHQWPENQLKKLESEGIVHEP--DLERSYVSSFEVMEVGDGEQCS

Query:  EK--------------------SKKSEDKAISMVVDASKVEHKMLSEDMQMDKCMIENEDIEEGEICGDSTYEISEDPMTLAEKVVDIQISEETVNRNKL
        EK                    SK SED A+SM VD S+V HK++SED+QM+KC+IENEDIEEGEICGDST+EISEDP  L E               K 
Subjt:  EK--------------------SKKSEDKAISMVVDASKVEHKMLSEDMQMDKCMIENEDIEEGEICGDSTYEISEDPMTLAEKVVDIQISEETVNRNKL

Query:  PSIQANGEAKKPVSFFSDTAEHASHENKEEFKQNKSSMEAHKSGSIVSEIMVKSTNEDRNDLMLMERGYKVKDSGTKKDLGSSTAIVSNLVLCDQVPNEQ
         SIQANGEAKKP S FSD  E+A+HENKEE KQ KS+++A K GS+ S  MV +  E +ND ML E GY VKDS   +D  S TA +SN V   QVP+E 
Subjt:  PSIQANGEAKKPVSFFSDTAEHASHENKEEFKQNKSSMEAHKSGSIVSEIMVKSTNEDRNDLMLMERGYKVKDSGTKKDLGSSTAIVSNLVLCDQVPNEQ

Query:  GTGQKGIVFTDKETGVCNKKKRGPLSEGRKERKKILKRKKRAEKNRKLGVKRLKLHPVTVPKPIVYCRHYIKGRCQEGDKCKFSHDTIPLTKST------
         TG+K I  T+KETG CNKKKRG  +EGRKERKKI KRKKRAEKNRKLGVKRLKL PVTVPKP+VYCRHYIKGRC EGDKCKFSHDT P TKS       
Subjt:  GTGQKGIVFTDKETGVCNKKKRGPLSEGRKERKKILKRKKRAEKNRKLGVKRLKLHPVTVPKPIVYCRHYIKGRCQEGDKCKFSHDTIPLTKST------

Query:  ---------------LSKYPCSNFVSKGACPRGDTCMFSHKILQQEESTSTPNLKTEPKPPRILDNSDSQKKLNIS----------------GTSCNRTP
                       L KYPCSNFVSKG+CPRGDTCMFSHKIL QE+S +  NLKT+ KPP I D+ DSQ+K NIS                G SCNRTP
Subjt:  ---------------LSKYPCSNFVSKGACPRGDTCMFSHKILQQEESTSTPNLKTEPKPPRILDNSDSQKKLNIS----------------GTSCNRTP

Query:  QG--------GPPPVGINFLLLGS---------------RNLLNQSGLCPVENSKEISRKIQPETAPKGINFLSFGKISLKSRSEKLATPSSYNDASIPS
        QG        G  P GI+FL  G                RN L QSGLC VENS EISR IQP+ AP+GI+ LSFG+ SLKS SE LATPSSYN A+I S
Subjt:  QG--------GPPPVGINFLLLGS---------------RNLLNQSGLCPVENSKEISRKIQPETAPKGINFLSFGKISLKSRSEKLATPSSYNDASIPS

Query:  DNQIQAVSKIGHDSDELLQGVQTTTLKQTNFLSPKAPMGNYVHNELLVRSSSSSKAGSDITVQIKESMHDKHESSSAILSRLPSAPICSGQSSDSLASSI
         NQ  + SK GHD  EL + VQTTT K TNF SP                             +KESM+ K E  S I SRLP   + +GQ SDS ASS+
Subjt:  DNQIQAVSKIGHDSDELLQGVQTTTLKQTNFLSPKAPMGNYVHNELLVRSSSSSKAGSDITVQIKESMHDKHESSSAILSRLPSAPICSGQSSDSLASSI

Query:  YKGTPNTAQKALLSTLAFAAKYESLMNKGKSSGSADVSKEGNKKVRNDNISGSLQNDNSKASKLLDFLLG
         KGTPN+AQKALLSTLAFAAKYES MNKGKSS SA  S +GNK+ RN+ ISGSL+ND +KASKLLDFLLG
Subjt:  YKGTPNTAQKALLSTLAFAAKYESLMNKGKSSGSADVSKEGNKKVRNDNISGSLQNDNSKASKLLDFLLG

A0A6J1HKN8 uncharacterized protein LOC111465427 isoform X21.2e-28060.41Show/hide
Query:  MEDARIEAPILVESKLVFPPRRRSHLRSQTYRTLVRILSPFCENADVSLATEFDPQELRVACSSHLRQAIVEPAGQQVGEDFVRAQTGDWGIEKVTVVED
        MEDARIEA  LVESKL+FPP RRSHL+S TYRTLVRILSPFC + DVSLATEFDPQ+LR A +SHL+Q+ VE +GQQ GEDFVR + GD  IE VTVVED
Subjt:  MEDARIEAPILVESKLVFPPRRRSHLRSQTYRTLVRILSPFCENADVSLATEFDPQELRVACSSHLRQAIVEPAGQQVGEDFVRAQTGDWGIEKVTVVED

Query:  GLGMNSLEDGCFHDFEAKMDVIKDTTIVVRSGNASDERKSSGASSQGNHCVHEENYRDELE--HIVEGNKVLVDDNNSNNMRTSDQKNGGNDQISPKKFQ
        G+GM+SLEDG FH+ E KM+VIKD +IV RS N S ER+  GASSQ N CVHEE++ DELE  ++VEGNKVL+DDN+SN MR+SDQ N  NDQISPKKFQ
Subjt:  GLGMNSLEDGCFHDFEAKMDVIKDTTIVVRSGNASDERKSSGASSQGNHCVHEENYRDELE--HIVEGNKVLVDDNNSNNMRTSDQKNGGNDQISPKKFQ

Query:  EEIADHQQVMAEEDAASIVQKLSQTVFSLDGNRIKEASKPKEDNQENRLLIVGKGMHQWPENQLKKLESEGIVHEP--DLERSYVSSFEVMEVGDGEQCS
        EEI +HQQVMAE+DA +  Q LS T+FS +G+ I+EASK K DNQEN L I  KGM+Q  E Q K+ + EG V EP  ++ERS+VSSFEVMEV DGEQCS
Subjt:  EEIADHQQVMAEEDAASIVQKLSQTVFSLDGNRIKEASKPKEDNQENRLLIVGKGMHQWPENQLKKLESEGIVHEP--DLERSYVSSFEVMEVGDGEQCS

Query:  EK--------------------SKKSEDKAISMVVDASKVEHKMLSEDMQMDKCMIENEDIEEGEICGDSTYEISEDPMTLAEKVVDIQISEETVNRNKL
        EK                    SK SED A+SM VD S+V HK++SE++QMDKC+IENEDIEEGEICGDST+EISEDP  L E               K 
Subjt:  EK--------------------SKKSEDKAISMVVDASKVEHKMLSEDMQMDKCMIENEDIEEGEICGDSTYEISEDPMTLAEKVVDIQISEETVNRNKL

Query:  PSIQANGEAKKPVSFFSDTAEHASHENKEEFKQNKSSMEAHKSGSIVSEIMVKSTNEDRNDLMLMERGYKVKDSGTKKDLGSSTAIVSNLVLCDQVPNEQ
          IQANGEAKKP S FSD  ++A+HENKEE KQ KSS++A K GS+ S  MV +T  D+ND ML E GY VKDS  K+D  S TA +SN V C QVP+E 
Subjt:  PSIQANGEAKKPVSFFSDTAEHASHENKEEFKQNKSSMEAHKSGSIVSEIMVKSTNEDRNDLMLMERGYKVKDSGTKKDLGSSTAIVSNLVLCDQVPNEQ

Query:  GTGQKGIVFTDKETGVCNKKKRGPLSEGRKERKKILKRKKRAEKNRKLGVKRLKLHPVTVPKPIVYCRHYIKGRCQEGDKCKFSHDTIPLTKS-------
         TG+K IV T+KETG CNKKKRG  +EGRKERKKI KRKKRAEKNRKLGVKRLKL PVT+PKP+VYCRHYIKGRC EGDKCKFSHDT P TKS       
Subjt:  GTGQKGIVFTDKETGVCNKKKRGPLSEGRKERKKILKRKKRAEKNRKLGVKRLKLHPVTVPKPIVYCRHYIKGRCQEGDKCKFSHDTIPLTKS-------

Query:  --------------TLSKYPCSNFVSKGACPRGDTCMFSHKILQQEESTSTPNLKTEPKPPRILDNSDSQKKLNIS----------------GTSCNRTP
                       L KYPCSNFVSKG+CPRGDTCMFSHKIL QE S +  NLKT+ KPP   D+SDSQ+K N+S                G SCNRTP
Subjt:  --------------TLSKYPCSNFVSKGACPRGDTCMFSHKILQQEESTSTPNLKTEPKPPRILDNSDSQKKLNIS----------------GTSCNRTP

Query:  QG--------GPPPVGINFLLLGS---------------RNLLNQSGLCPVENSKEISRKIQPETAPKGINFLSFGKISLKSRSEKLATPSSYNDASIPS
        QG        G  P+GI+FL  G                RN L QSGLC VENS EISRKIQP++AP+GI+ LSFG+ SL+S SE  ATPSSYN A+I S
Subjt:  QG--------GPPPVGINFLLLGS---------------RNLLNQSGLCPVENSKEISRKIQPETAPKGINFLSFGKISLKSRSEKLATPSSYNDASIPS

Query:  DNQIQAVSKIGHDSDELLQGVQTTTLKQTNFLSPKAPMGNYVHNELLVRSSSSSKAGSDITVQIKESMHDKHESSSAILSRLPSAPICSGQSSDSLASSI
         NQ  + SK GHD  EL + VQ TT K TNF SP                             +KESM+ K E  S I SRLP   + +GQ SDS ASS+
Subjt:  DNQIQAVSKIGHDSDELLQGVQTTTLKQTNFLSPKAPMGNYVHNELLVRSSSSSKAGSDITVQIKESMHDKHESSSAILSRLPSAPICSGQSSDSLASSI

Query:  YKGTPNTAQKALLSTLAFAAKYESLMNKGKSSGSADVSKEGNKKVRNDNISGSLQNDNSKASKLLDFLLG
         KGTPN+AQKALLSTLAFAAKYES MNKGKSS SA  S +GNK+ RN+ ISGSL+ND +KASKLLDFLLG
Subjt:  YKGTPNTAQKALLSTLAFAAKYESLMNKGKSSGSADVSKEGNKKVRNDNISGSLQNDNSKASKLLDFLLG

A0A6J1HM17 uncharacterized protein LOC111465427 isoform X17.3e-28661.24Show/hide
Query:  MEDARIEAPILVESKLVFPPRRRSHLRSQTYRTLVRILSPFCENADVSLATEFDPQELRVACSSHLRQAIVEPAGQQVGEDFVRAQTGDWGIEKVTVVED
        MEDARIEA  LVESKL+FPP RRSHL+S TYRTLVRILSPFC + DVSLATEFDPQ+LR A +SHL+Q+ VE +GQQ GEDFVR + GD  IE VTVVED
Subjt:  MEDARIEAPILVESKLVFPPRRRSHLRSQTYRTLVRILSPFCENADVSLATEFDPQELRVACSSHLRQAIVEPAGQQVGEDFVRAQTGDWGIEKVTVVED

Query:  GLGMNSLEDGCFHDFEAKMDVIKDTTIVVRSGNASDERKSSGASSQGNHCVHEENYRDELE--HIVEGNKVLVDDNNSNNMRTSDQKNGGNDQISPKKFQ
        G+GM+SLEDG FH+ E KM+VIKD +IV RS N S ER+  GASSQ N CVHEE++ DELE  ++VEGNKVL+DDN+SN MR+SDQ N  NDQISPKKFQ
Subjt:  GLGMNSLEDGCFHDFEAKMDVIKDTTIVVRSGNASDERKSSGASSQGNHCVHEENYRDELE--HIVEGNKVLVDDNNSNNMRTSDQKNGGNDQISPKKFQ

Query:  EEIADHQQVMAEEDAASIVQKLSQTVFSLDGNRIKEASKPKEDNQENRLLIVGKGMHQWPENQLKKLESEGIVHEP--DLERSYVSSFEVMEVGDGEQCS
        EEI +HQQVMAE+DA +  Q LS T+FS +G+ I+EASK K DNQEN L I  KGM+Q  E Q K+ + EG V EP  ++ERS+VSSFEVMEV DGEQCS
Subjt:  EEIADHQQVMAEEDAASIVQKLSQTVFSLDGNRIKEASKPKEDNQENRLLIVGKGMHQWPENQLKKLESEGIVHEP--DLERSYVSSFEVMEVGDGEQCS

Query:  EK--------------------SKKSEDKAISMVVDASKVEHKMLSEDMQMDKCMIENEDIEEGEICGDSTYEISEDPMTLAEKVVDIQISEETVNRNKL
        EK                    SK SED A+SM VD S+V HK++SE++QMDKC+IENEDIEEGEICGDST+EISEDP  L E               K 
Subjt:  EK--------------------SKKSEDKAISMVVDASKVEHKMLSEDMQMDKCMIENEDIEEGEICGDSTYEISEDPMTLAEKVVDIQISEETVNRNKL

Query:  PSIQANGEAKKPVSFFSDTAEHASHENKEEFKQNKSSMEAHKSGSIVSEIMVKSTNEDRNDLMLMERGYKVKDSGTKKDLGSSTAIVSNLVLCDQVPNEQ
          IQANGEAKKP S FSD  ++A+HENKEE KQ KSS++A K GS+ S  MV +T  D+ND ML E GY VKDS  K+D  S TA +SN V C QVP+E 
Subjt:  PSIQANGEAKKPVSFFSDTAEHASHENKEEFKQNKSSMEAHKSGSIVSEIMVKSTNEDRNDLMLMERGYKVKDSGTKKDLGSSTAIVSNLVLCDQVPNEQ

Query:  GTGQKGIVFTDKETGVCNKKKRGPLSEGRKERKKILKRKKRAEKNRKLGVKRLKLHPVTVPKPIVYCRHYIKGRCQEGDKCKFSHDTIPLTKS-------
         TG+K IV T+KETG CNKKKRG  +EGRKERKKI KRKKRAEKNRKLGVKRLKL PVT+PKP+VYCRHYIKGRC EGDKCKFSHDT P TKS       
Subjt:  GTGQKGIVFTDKETGVCNKKKRGPLSEGRKERKKILKRKKRAEKNRKLGVKRLKLHPVTVPKPIVYCRHYIKGRCQEGDKCKFSHDTIPLTKS-------

Query:  --------------TLSKYPCSNFVSKGACPRGDTCMFSHKILQQEESTSTPNLKTEPKPPRILDNSDSQKKLNIS----------------GTSCNRTP
                       L KYPCSNFVSKG+CPRGDTCMFSHKIL QE S +  NLKT+ KPP   D+SDSQ+K N+S                G SCNRTP
Subjt:  --------------TLSKYPCSNFVSKGACPRGDTCMFSHKILQQEESTSTPNLKTEPKPPRILDNSDSQKKLNIS----------------GTSCNRTP

Query:  QG--------GPPPVGINFLLLGS---------------RNLLNQSGLCPVENSKEISRKIQPETAPKGINFLSFGKISLKSRSEKLATPSSYNDASIPS
        QG        G  P+GI+FL  G                RN L QSGLC VENS EISRKIQP++AP+GI+ LSFG+ SL+S SE  ATPSSYN A+I S
Subjt:  QG--------GPPPVGINFLLLGS---------------RNLLNQSGLCPVENSKEISRKIQPETAPKGINFLSFGKISLKSRSEKLATPSSYNDASIPS

Query:  DNQIQAVSKIGHDSDELLQGVQTTTLKQTNFLSPKAPMGNYVHNELLVRSSSSSKAGSDITVQIKESMHDKHESSSAILSRLPSAPICSGQSSDSLASSI
         NQ  + SK GHD  EL + VQ TT K TNF S   P+GN          +S+SK  SD++V +KESM+ K E  S I SRLP   + +GQ SDS ASS+
Subjt:  DNQIQAVSKIGHDSDELLQGVQTTTLKQTNFLSPKAPMGNYVHNELLVRSSSSSKAGSDITVQIKESMHDKHESSSAILSRLPSAPICSGQSSDSLASSI

Query:  YKGTPNTAQKALLSTLAFAAKYESLMNKGKSSGSADVSKEGNKKVRNDNISGSLQNDNSKASKLLDFLLG
         KGTPN+AQKALLSTLAFAAKYES MNKGKSS SA  S +GNK+ RN+ ISGSL+ND +KASKLLDFLLG
Subjt:  YKGTPNTAQKALLSTLAFAAKYESLMNKGKSSGSADVSKEGNKKVRNDNISGSLQNDNSKASKLLDFLLG

SwissProt top hitse value%identityAlignment
Q657B3 Zinc finger CCCH domain-containing protein 75.3e-2333.64Show/hide
Query:  DKETGVCNKKKRGPLSEGRKERKKILKRKKRAEKNRKLGVKRLKLHPVTVPKPIVYCRHYIKGRCQEGDKCKFSHDTIPLTKS-----------------
        DK     N +KR  L+E RK +K   KR KRA +    GVKRLKL PV  PK +  C  Y+ G+CQ+G+ CKFSHDT PLTKS                 
Subjt:  DKETGVCNKKKRGPLSEGRKERKKILKRKKRAEKNRKLGVKRLKLHPVTVPKPIVYCRHYIKGRCQEGDKCKFSHDTIPLTKS-----------------

Query:  ----TLSKYPCSNFVSKGACPRGDTCMFSHKILQQEESTSTPNLKTEPKPPRILDNSDSQKKLNISGTSCNRTPQGGPP----PVGINFLLLGSRNLLNQ
             LSKYPC NF+  G C RGD C FSH ++   E  STP+ K +     + + ++ Q++ +   TS   T   G P    P+  + +L   +NL   
Subjt:  ----TLSKYPCSNFVSKGACPRGDTCMFSHKILQQEESTSTPNLKTEPKPPRILDNSDSQKKLNISGTSCNRTPQGGPP----PVGINFLLLGSRNLLNQ

Query:  SGLCPVENSKEISRKIQPETAPKGINFLSFGKISLKS---RSEKLATPSSYNDASIPSDN----QIQAVSKI----GHDSDELLQGVQTTTLKQTNFLSP
        SG     N++++     P   P+GI FL F K    S     + ++T    N    P  N    Q     K+    GH S  LL   +  + KQ N    
Subjt:  SGLCPVENSKEISRKIQPETAPKGINFLSFGKISLKS---RSEKLATPSSYNDASIPSDN----QIQAVSKI----GHDSDELLQGVQTTTLKQTNFLSP

Query:  KAPMGNYVHNELLVRSSSSSK
          P  N +     V SS S++
Subjt:  KAPMGNYVHNELLVRSSSSSK

Q9LTS7 Zinc finger CCCH domain-containing protein 658.4e-2937.94Show/hide
Query:  KKRGPLSEGRKERKKILKRKKRAEKNRKLGVKRLKLHPVT-VPKPIVYCRHYIKGRCQEGDKCKFSHDTIPLTKST---------------------LSK
        KKR   S+  K RK+   R KRA++   LGVK+LKL PV   PKPI YCRHY+KGRC EGDKCKFSHDTIP TK +                     LSK
Subjt:  KKRGPLSEGRKERKKILKRKKRAEKNRKLGVKRLKLHPVT-VPKPIVYCRHYIKGRCQEGDKCKFSHDTIPLTKST---------------------LSK

Query:  YPCSNFVSKGACPRGDTCMFSHKILQQEESTSTPNLKTEPKPPRILDNSDS---QKKLNISGTSCNRTPQGGPPPVGINFLLLGSRNLLNQSGLCPVENS
        YPC+NF++KG C RGD+C+FSHK   Q  S  TP+        +I   S S    KK ++                 + FL   S +  NQ       +S
Subjt:  YPCSNFVSKGACPRGDTCMFSHKILQQEESTSTPNLKTEPKPPRILDNSDS---QKKLNISGTSCNRTPQGGPPPVGINFLLLGSRNLLNQSGLCPVENS

Query:  K--------EISRKIQPETAPKGINFLSFGKISLKS--RSEKLATPSSYNDAS
        K        ++    +P  APKG++FLS  K S +   ++   + P++ N  S
Subjt:  K--------EISRKIQPETAPKGINFLSFGKISLKS--RSEKLATPSSYNDAS

Q9QXP6 E3 ubiquitin-protein ligase makorin-13.4e-0639.19Show/hide
Query:  VYCRHYIKGRCQEGDKCKFSHDTIPLTKSTLSKYPCSNFVSKGACPRGDTCMFSH-KILQQEESTSTPNLKTEP
        V CR+++ G C+EGD C++SHD        + KY       +G C  GD C + H K L+QEE T+T +L  +P
Subjt:  VYCRHYIKGRCQEGDKCKFSHDTIPLTKSTLSKYPCSNFVSKGACPRGDTCMFSH-KILQQEESTSTPNLKTEP

Q9UHC7 E3 ubiquitin-protein ligase makorin-16.9e-0741.79Show/hide
Query:  VYCRHYIKGRCQEGDKCKFSHDTIPLTKSTLSKYPCSNFVSKGACPRGDTCMFSH-KILQQEESTST
        V CR+++ G C+EGD C++SHD      S + KY       +G C  GD C + H K L+QEE+T+T
Subjt:  VYCRHYIKGRCQEGDKCKFSHDTIPLTKSTLSKYPCSNFVSKGACPRGDTCMFSH-KILQQEESTST

Q9UPT8 Zinc finger CCCH domain-containing protein 42.0e-0627.33Show/hide
Query:  KLHPVTVPKPIVYCRHYIKGRCQEGDKCKFSHDTIPLTKSTLSK-------------------YPCSNFVSKGACPRGDTCMFSHKILQQE---------
        K H  +  K  V C+++++GRC  GD C FSHD     K  L K                   +PC  + + G C  GD CMFSH  L +E         
Subjt:  KLHPVTVPKPIVYCRHYIKGRCQEGDKCKFSHDTIPLTKSTLSK-------------------YPCSNFVSKGACPRGDTCMFSHKILQQE---------

Query:  --------------ESTSTPNLKTEPKPP---RILDNSDSQKKLNISGTSCNRTPQGGPPP
                      E      +   PKPP    +L             +   R  QGGPPP
Subjt:  --------------ESTSTPNLKTEPKPP---RILDNSDSQKKLNISGTSCNRTPQGGPPP

Arabidopsis top hitse value%identityAlignment
AT3G47120.1 RNA recognition motif (RRM)-containing protein5.6e-0434.29Show/hide
Query:  CRHYIKGRCQEGDKCKFSHDTIPLT-----------------KSTLSKYPCSNFVSKGACPRGDTCMFSH
        CR + +G C  GD CKFSHD                      K+   +  C  F  +G C RGD+C FSH
Subjt:  CRHYIKGRCQEGDKCKFSHDTIPLT-----------------KSTLSKYPCSNFVSKGACPRGDTCMFSH

AT5G56930.1 CCCH-type zinc finger family protein6.0e-3037.94Show/hide
Query:  KKRGPLSEGRKERKKILKRKKRAEKNRKLGVKRLKLHPVT-VPKPIVYCRHYIKGRCQEGDKCKFSHDTIPLTKST---------------------LSK
        KKR   S+  K RK+   R KRA++   LGVK+LKL PV   PKPI YCRHY+KGRC EGDKCKFSHDTIP TK +                     LSK
Subjt:  KKRGPLSEGRKERKKILKRKKRAEKNRKLGVKRLKLHPVT-VPKPIVYCRHYIKGRCQEGDKCKFSHDTIPLTKST---------------------LSK

Query:  YPCSNFVSKGACPRGDTCMFSHKILQQEESTSTPNLKTEPKPPRILDNSDS---QKKLNISGTSCNRTPQGGPPPVGINFLLLGSRNLLNQSGLCPVENS
        YPC+NF++KG C RGD+C+FSHK   Q  S  TP+        +I   S S    KK ++                 + FL   S +  NQ       +S
Subjt:  YPCSNFVSKGACPRGDTCMFSHKILQQEESTSTPNLKTEPKPPRILDNSDS---QKKLNISGTSCNRTPQGGPPPVGINFLLLGSRNLLNQSGLCPVENS

Query:  K--------EISRKIQPETAPKGINFLSFGKISLKS--RSEKLATPSSYNDAS
        K        ++    +P  APKG++FLS  K S +   ++   + P++ N  S
Subjt:  K--------EISRKIQPETAPKGINFLSFGKISLKS--RSEKLATPSSYNDAS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTCTTCCATGGTTGCTGGGGAGGTTTTGAATCGTGTACAGAAGGTTTAATGGCGGCGCTTATTCGGAATGCGTCTCTTTTTGGATGCTTTCTTTTTCAGCGCGAGGA
ATCTGTGAATTGGAAGTGGAAGGTTCGAGTATTCATGGAGGACGCGCGTATCGAAGCCCCGATATTAGTCGAATCGAAATTAGTTTTCCCACCTCGCCGTCGATCACATT
TAAGGAGCCAGACTTATCGTACGCTGGTACGAATCCTTTCGCCCTTCTGCGAGAACGCCGATGTCTCTCTGGCAACTGAATTTGATCCACAGGAACTAAGAGTTGCTTGT
AGCAGCCACTTGAGGCAAGCAATTGTAGAACCAGCAGGTCAGCAGGTTGGTGAAGATTTTGTGAGGGCACAAACTGGAGATTGGGGTATTGAAAAGGTGACAGTGGTTGA
AGATGGCTTGGGGATGAATAGTCTTGAAGATGGATGTTTTCATGACTTTGAAGCAAAGATGGATGTAATAAAAGATACGACAATAGTAGTAAGAAGTGGCAATGCGAGTG
ATGAAAGGAAATCTTCTGGAGCTAGTTCCCAAGGTAATCATTGTGTTCATGAAGAAAATTATAGAGACGAGTTAGAGCATATTGTGGAAGGAAATAAAGTGCTTGTTGAT
GACAACAATAGTAACAACATGAGAACTTCAGATCAGAAAAATGGTGGAAATGATCAAATATCACCTAAAAAATTCCAAGAAGAAATTGCTGACCATCAACAAGTTATGGC
TGAAGAAGATGCCGCAAGCATAGTCCAGAAACTTTCTCAGACAGTTTTCTCGCTAGATGGCAACAGGATCAAGGAAGCTTCTAAACCTAAGGAGGACAATCAAGAAAACA
GACTTTTAATTGTAGGTAAGGGCATGCACCAATGGCCTGAAAATCAACTAAAAAAATTGGAATCTGAGGGCATAGTTCATGAGCCTGATTTAGAGAGATCCTATGTTTCT
AGTTTTGAGGTCATGGAGGTGGGAGATGGTGAACAATGTAGTGAAAAGAGTAAGAAGAGCGAAGATAAAGCTATTTCAATGGTGGTTGATGCCTCCAAAGTAGAGCACAA
AATGCTGTCCGAAGATATGCAGATGGACAAATGTATGATTGAAAATGAAGACATTGAGGAGGGTGAAATATGTGGTGATTCAACATATGAAATTTCAGAAGATCCCATGA
CATTGGCTGAGAAGGTTGTTGATATTCAAATCTCTGAGGAGACAGTGAATAGAAATAAGCTTCCTTCAATTCAAGCTAATGGAGAAGCAAAGAAGCCTGTTTCCTTCTTC
TCTGACACAGCCGAGCATGCAAGTCATGAGAACAAAGAAGAATTCAAACAAAATAAGAGTAGCATGGAGGCTCATAAAAGCGGAAGTATTGTTTCAGAAATAATGGTGAA
GAGTACAAATGAAGATCGAAATGATCTTATGTTAATGGAAAGAGGATATAAAGTAAAAGACAGTGGAACCAAAAAGGACTTGGGTTCTTCTACAGCAATTGTTAGTAATC
TAGTCCTTTGCGACCAAGTTCCAAATGAACAAGGCACTGGACAGAAAGGAATTGTATTCACAGATAAGGAAACTGGGGTCTGCAATAAGAAAAAACGTGGTCCTTTATCA
GAGGGGAGAAAAGAAAGGAAAAAGATACTGAAAAGAAAGAAGCGGGCAGAGAAGAACAGAAAGCTTGGTGTCAAAAGGTTGAAGCTGCATCCAGTAACAGTGCCTAAACC
TATAGTATACTGTCGTCATTATATCAAGGGGAGGTGCCAAGAGGGTGACAAGTGTAAGTTTTCTCATGATACAATCCCTTTGACAAAATCCACGCTGTCCAAGTATCCTT
GTAGCAATTTTGTTTCTAAAGGTGCATGTCCCAGAGGTGATACTTGCATGTTTTCACACAAGATTTTGCAACAAGAGGAGTCTACTAGTACGCCAAATTTGAAAACCGAA
CCAAAGCCACCACGAATTCTGGATAATTCGGACTCTCAAAAGAAACTGAACATCAGTGGAACTTCTTGCAATAGAACTCCACAGGGAGGACCACCACCTGTTGGAATTAA
TTTTCTTCTTTTGGGAAGTAGGAATCTGTTAAACCAAAGTGGGTTATGTCCGGTTGAAAATTCAAAAGAAATCTCGAGGAAGATTCAGCCAGAAACAGCACCAAAGGGGA
TAAATTTTCTGTCGTTTGGTAAAATTTCACTCAAGTCTAGGAGTGAGAAATTAGCTACACCTTCAAGCTACAATGATGCCAGTATTCCATCTGATAATCAGATTCAAGCT
GTTTCCAAAATTGGACATGATTCTGATGAGCTACTGCAGGGAGTACAAACTACAACGTTAAAACAAACAAACTTTTTGTCTCCGAAAGCTCCAATGGGAAATTATGTTCA
TAATGAATTGTTGGTTAGATCGTCTTCCAGCTCAAAAGCGGGTTCTGATATAACCGTACAGATTAAGGAAAGCATGCATGATAAGCATGAAAGCTCAAGTGCAATATTAT
CAAGGCTGCCTAGTGCTCCAATTTGTTCGGGTCAATCTTCTGACAGCTTAGCTTCTAGCATCTATAAAGGAACGCCAAACACAGCTCAGAAGGCACTATTGTCAACATTG
GCATTTGCAGCAAAATATGAGTCCTTGATGAATAAAGGCAAATCCAGTGGTTCTGCTGATGTCAGTAAGGAGGGTAACAAGAAAGTTAGGAATGACAATATTAGTGGAAG
CTTGCAGAATGATAATTCCAAAGCTTCAAAACTTTTGGATTTCTTGTTAGGCCAAATCAGGAATCGACGAGTTGCATGTTATGGGTTTGATGAAGTTATACTGTATGAGG
CGAATCCTTTTGTTCAGTTCTCTTTTCTTTTTCGCGGTTCTTTCATGGTGAGCGAAATTCAAATGTTCCTACCTGACGTTGGCGGCGGTGGCGGTGGTTGCTGTGACGGT
TGTGTCGGTGCTGCAAGCGGTGACGCATTATGTTTGCAGCCCGTCGTGGATGTATTGAACGAGCCAGGGGAGGGAAAGACTGTTAAGAAAGAGAAGATGGAGGAGCAGGA
GCTTCATCGTAATAACAATGGTAATAGTACCAATGATGTAATTATATTAGGTGGATCGGCTTCTTCTTCTTCTTCTTCATCAGATGGCTTGCTGGAAAAACGCAAGGATT
CGATTCCGGTTTCGAAGGAAGTTCTGTCGGAGACGCCGAAAGGCGAAGTGGTCTTAATTAAAAAGAAAGCACGGCGGCAATGA
mRNA sequenceShow/hide mRNA sequence
ATGCTCTTCCATGGTTGCTGGGGAGGTTTTGAATCGTGTACAGAAGGTTTAATGGCGGCGCTTATTCGGAATGCGTCTCTTTTTGGATGCTTTCTTTTTCAGCGCGAGGA
ATCTGTGAATTGGAAGTGGAAGGTTCGAGTATTCATGGAGGACGCGCGTATCGAAGCCCCGATATTAGTCGAATCGAAATTAGTTTTCCCACCTCGCCGTCGATCACATT
TAAGGAGCCAGACTTATCGTACGCTGGTACGAATCCTTTCGCCCTTCTGCGAGAACGCCGATGTCTCTCTGGCAACTGAATTTGATCCACAGGAACTAAGAGTTGCTTGT
AGCAGCCACTTGAGGCAAGCAATTGTAGAACCAGCAGGTCAGCAGGTTGGTGAAGATTTTGTGAGGGCACAAACTGGAGATTGGGGTATTGAAAAGGTGACAGTGGTTGA
AGATGGCTTGGGGATGAATAGTCTTGAAGATGGATGTTTTCATGACTTTGAAGCAAAGATGGATGTAATAAAAGATACGACAATAGTAGTAAGAAGTGGCAATGCGAGTG
ATGAAAGGAAATCTTCTGGAGCTAGTTCCCAAGGTAATCATTGTGTTCATGAAGAAAATTATAGAGACGAGTTAGAGCATATTGTGGAAGGAAATAAAGTGCTTGTTGAT
GACAACAATAGTAACAACATGAGAACTTCAGATCAGAAAAATGGTGGAAATGATCAAATATCACCTAAAAAATTCCAAGAAGAAATTGCTGACCATCAACAAGTTATGGC
TGAAGAAGATGCCGCAAGCATAGTCCAGAAACTTTCTCAGACAGTTTTCTCGCTAGATGGCAACAGGATCAAGGAAGCTTCTAAACCTAAGGAGGACAATCAAGAAAACA
GACTTTTAATTGTAGGTAAGGGCATGCACCAATGGCCTGAAAATCAACTAAAAAAATTGGAATCTGAGGGCATAGTTCATGAGCCTGATTTAGAGAGATCCTATGTTTCT
AGTTTTGAGGTCATGGAGGTGGGAGATGGTGAACAATGTAGTGAAAAGAGTAAGAAGAGCGAAGATAAAGCTATTTCAATGGTGGTTGATGCCTCCAAAGTAGAGCACAA
AATGCTGTCCGAAGATATGCAGATGGACAAATGTATGATTGAAAATGAAGACATTGAGGAGGGTGAAATATGTGGTGATTCAACATATGAAATTTCAGAAGATCCCATGA
CATTGGCTGAGAAGGTTGTTGATATTCAAATCTCTGAGGAGACAGTGAATAGAAATAAGCTTCCTTCAATTCAAGCTAATGGAGAAGCAAAGAAGCCTGTTTCCTTCTTC
TCTGACACAGCCGAGCATGCAAGTCATGAGAACAAAGAAGAATTCAAACAAAATAAGAGTAGCATGGAGGCTCATAAAAGCGGAAGTATTGTTTCAGAAATAATGGTGAA
GAGTACAAATGAAGATCGAAATGATCTTATGTTAATGGAAAGAGGATATAAAGTAAAAGACAGTGGAACCAAAAAGGACTTGGGTTCTTCTACAGCAATTGTTAGTAATC
TAGTCCTTTGCGACCAAGTTCCAAATGAACAAGGCACTGGACAGAAAGGAATTGTATTCACAGATAAGGAAACTGGGGTCTGCAATAAGAAAAAACGTGGTCCTTTATCA
GAGGGGAGAAAAGAAAGGAAAAAGATACTGAAAAGAAAGAAGCGGGCAGAGAAGAACAGAAAGCTTGGTGTCAAAAGGTTGAAGCTGCATCCAGTAACAGTGCCTAAACC
TATAGTATACTGTCGTCATTATATCAAGGGGAGGTGCCAAGAGGGTGACAAGTGTAAGTTTTCTCATGATACAATCCCTTTGACAAAATCCACGCTGTCCAAGTATCCTT
GTAGCAATTTTGTTTCTAAAGGTGCATGTCCCAGAGGTGATACTTGCATGTTTTCACACAAGATTTTGCAACAAGAGGAGTCTACTAGTACGCCAAATTTGAAAACCGAA
CCAAAGCCACCACGAATTCTGGATAATTCGGACTCTCAAAAGAAACTGAACATCAGTGGAACTTCTTGCAATAGAACTCCACAGGGAGGACCACCACCTGTTGGAATTAA
TTTTCTTCTTTTGGGAAGTAGGAATCTGTTAAACCAAAGTGGGTTATGTCCGGTTGAAAATTCAAAAGAAATCTCGAGGAAGATTCAGCCAGAAACAGCACCAAAGGGGA
TAAATTTTCTGTCGTTTGGTAAAATTTCACTCAAGTCTAGGAGTGAGAAATTAGCTACACCTTCAAGCTACAATGATGCCAGTATTCCATCTGATAATCAGATTCAAGCT
GTTTCCAAAATTGGACATGATTCTGATGAGCTACTGCAGGGAGTACAAACTACAACGTTAAAACAAACAAACTTTTTGTCTCCGAAAGCTCCAATGGGAAATTATGTTCA
TAATGAATTGTTGGTTAGATCGTCTTCCAGCTCAAAAGCGGGTTCTGATATAACCGTACAGATTAAGGAAAGCATGCATGATAAGCATGAAAGCTCAAGTGCAATATTAT
CAAGGCTGCCTAGTGCTCCAATTTGTTCGGGTCAATCTTCTGACAGCTTAGCTTCTAGCATCTATAAAGGAACGCCAAACACAGCTCAGAAGGCACTATTGTCAACATTG
GCATTTGCAGCAAAATATGAGTCCTTGATGAATAAAGGCAAATCCAGTGGTTCTGCTGATGTCAGTAAGGAGGGTAACAAGAAAGTTAGGAATGACAATATTAGTGGAAG
CTTGCAGAATGATAATTCCAAAGCTTCAAAACTTTTGGATTTCTTGTTAGGCCAAATCAGGAATCGACGAGTTGCATGTTATGGGTTTGATGAAGTTATACTGTATGAGG
CGAATCCTTTTGTTCAGTTCTCTTTTCTTTTTCGCGGTTCTTTCATGGTGAGCGAAATTCAAATGTTCCTACCTGACGTTGGCGGCGGTGGCGGTGGTTGCTGTGACGGT
TGTGTCGGTGCTGCAAGCGGTGACGCATTATGTTTGCAGCCCGTCGTGGATGTATTGAACGAGCCAGGGGAGGGAAAGACTGTTAAGAAAGAGAAGATGGAGGAGCAGGA
GCTTCATCGTAATAACAATGGTAATAGTACCAATGATGTAATTATATTAGGTGGATCGGCTTCTTCTTCTTCTTCTTCATCAGATGGCTTGCTGGAAAAACGCAAGGATT
CGATTCCGGTTTCGAAGGAAGTTCTGTCGGAGACGCCGAAAGGCGAAGTGGTCTTAATTAAAAAGAAAGCACGGCGGCAATGA
Protein sequenceShow/hide protein sequence
MLFHGCWGGFESCTEGLMAALIRNASLFGCFLFQREESVNWKWKVRVFMEDARIEAPILVESKLVFPPRRRSHLRSQTYRTLVRILSPFCENADVSLATEFDPQELRVAC
SSHLRQAIVEPAGQQVGEDFVRAQTGDWGIEKVTVVEDGLGMNSLEDGCFHDFEAKMDVIKDTTIVVRSGNASDERKSSGASSQGNHCVHEENYRDELEHIVEGNKVLVD
DNNSNNMRTSDQKNGGNDQISPKKFQEEIADHQQVMAEEDAASIVQKLSQTVFSLDGNRIKEASKPKEDNQENRLLIVGKGMHQWPENQLKKLESEGIVHEPDLERSYVS
SFEVMEVGDGEQCSEKSKKSEDKAISMVVDASKVEHKMLSEDMQMDKCMIENEDIEEGEICGDSTYEISEDPMTLAEKVVDIQISEETVNRNKLPSIQANGEAKKPVSFF
SDTAEHASHENKEEFKQNKSSMEAHKSGSIVSEIMVKSTNEDRNDLMLMERGYKVKDSGTKKDLGSSTAIVSNLVLCDQVPNEQGTGQKGIVFTDKETGVCNKKKRGPLS
EGRKERKKILKRKKRAEKNRKLGVKRLKLHPVTVPKPIVYCRHYIKGRCQEGDKCKFSHDTIPLTKSTLSKYPCSNFVSKGACPRGDTCMFSHKILQQEESTSTPNLKTE
PKPPRILDNSDSQKKLNISGTSCNRTPQGGPPPVGINFLLLGSRNLLNQSGLCPVENSKEISRKIQPETAPKGINFLSFGKISLKSRSEKLATPSSYNDASIPSDNQIQA
VSKIGHDSDELLQGVQTTTLKQTNFLSPKAPMGNYVHNELLVRSSSSSKAGSDITVQIKESMHDKHESSSAILSRLPSAPICSGQSSDSLASSIYKGTPNTAQKALLSTL
AFAAKYESLMNKGKSSGSADVSKEGNKKVRNDNISGSLQNDNSKASKLLDFLLGQIRNRRVACYGFDEVILYEANPFVQFSFLFRGSFMVSEIQMFLPDVGGGGGGCCDG
CVGAASGDALCLQPVVDVLNEPGEGKTVKKEKMEEQELHRNNNGNSTNDVIILGGSASSSSSSSDGLLEKRKDSIPVSKEVLSETPKGEVVLIKKKARRQ