; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Carg18717 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene IDCarg18717
OrganismCucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
DescriptionProtein of Unknown Function (DUF239)
Genome locationCarg_Chr17:7312893..7316005
RNA-Seq ExpressionCarg18717
SyntenyCarg18717
Gene Ontology termsGO:0016874 - ligase activity (molecular function)
InterPro domainsIPR004314 - Neprosin
IPR025521 - Neprosin activation peptide


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6575651.1 hypothetical protein SDJN03_26290, partial [Cucurbita argyrosperma subsp. sororia]2.0e-24999.52Show/hide
Query:  VGEMGTPKLLLLLLLLMLVSLSLMAAAGKSSRHRHRHRWLEAHTHMKKLNKPPLKSIKSPDGDVIDCVHMAHQPAFDHPLLRNHTIQMRPNFHPEGVFSD
        V EMGTPKLLLLLLLLMLVSLSLMAAAGKSSRHRHRHRWLEAHTHMKKLNKPPLKSIKSPDGDVIDCVHMAHQPAFDHPLLRNHTIQMRPNFHPEGVFSD
Subjt:  VGEMGTPKLLLLLLLLMLVSLSLMAAAGKSSRHRHRHRWLEAHTHMKKLNKPPLKSIKSPDGDVIDCVHMAHQPAFDHPLLRNHTIQMRPNFHPEGVFSD

Query:  SKVTSKASITQLWHLKGRCPEGTIPIRRTKRDDILRASSVESYGKKRAHATVKPSSIDIDFNGQNGHQHAITYVEGGQYYGAKATMNVWSPKIEQTNEFS
        SKVTSKASITQLWHLKGRCPEGTIPIRRTKRDDILRASSVESYGKKRAHA VKPSSIDIDFNGQNGHQHAITYVEGGQYYGAKATMNVWSPKIEQTNEFS
Subjt:  SKVTSKASITQLWHLKGRCPEGTIPIRRTKRDDILRASSVESYGKKRAHATVKPSSIDIDFNGQNGHQHAITYVEGGQYYGAKATMNVWSPKIEQTNEFS

Query:  LSQIWILGGTFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNDIAMGASIYPISSYKGSQYDISLLIWKDPKEGNWWMQ
        LSQIWILGGTFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNDIAMGASIYPISSYKGSQYDISLLIWKDPKEGNWWMQ
Subjt:  LSQIWILGGTFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNDIAMGASIYPISSYKGSQYDISLLIWKDPKEGNWWMQ

Query:  FGNSHVLGYWPAFLFSYLTDSASMIEWGGEVVNSEPDGQHTSTQMGSGHFPDQGFANAAYFRNIQIVGGSNNLRAPEDIATFTEQPSCYDVQTGKSDDWG
        FGNSHVLGYWPAFLFSYLTDSASMIEWGGEVVNSEPDGQHTSTQMGSGHFPDQGFANAAYFRNIQIVGGSNNLRAPEDIATFTEQPSCYDVQTGKSDDWG
Subjt:  FGNSHVLGYWPAFLFSYLTDSASMIEWGGEVVNSEPDGQHTSTQMGSGHFPDQGFANAAYFRNIQIVGGSNNLRAPEDIATFTEQPSCYDVQTGKSDDWG

Query:  NYFFYGGPGRNPNCL
        NYFFYGGPGRNPNCL
Subjt:  NYFFYGGPGRNPNCL

KAG7014200.1 hypothetical protein SDJN02_24375, partial [Cucurbita argyrosperma subsp. argyrosperma]1.1e-260100Show/hide
Query:  MRALSESMETERQIAVNELVGEMGTPKLLLLLLLLMLVSLSLMAAAGKSSRHRHRHRWLEAHTHMKKLNKPPLKSIKSPDGDVIDCVHMAHQPAFDHPLL
        MRALSESMETERQIAVNELVGEMGTPKLLLLLLLLMLVSLSLMAAAGKSSRHRHRHRWLEAHTHMKKLNKPPLKSIKSPDGDVIDCVHMAHQPAFDHPLL
Subjt:  MRALSESMETERQIAVNELVGEMGTPKLLLLLLLLMLVSLSLMAAAGKSSRHRHRHRWLEAHTHMKKLNKPPLKSIKSPDGDVIDCVHMAHQPAFDHPLL

Query:  RNHTIQMRPNFHPEGVFSDSKVTSKASITQLWHLKGRCPEGTIPIRRTKRDDILRASSVESYGKKRAHATVKPSSIDIDFNGQNGHQHAITYVEGGQYYG
        RNHTIQMRPNFHPEGVFSDSKVTSKASITQLWHLKGRCPEGTIPIRRTKRDDILRASSVESYGKKRAHATVKPSSIDIDFNGQNGHQHAITYVEGGQYYG
Subjt:  RNHTIQMRPNFHPEGVFSDSKVTSKASITQLWHLKGRCPEGTIPIRRTKRDDILRASSVESYGKKRAHATVKPSSIDIDFNGQNGHQHAITYVEGGQYYG

Query:  AKATMNVWSPKIEQTNEFSLSQIWILGGTFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNDIAMGASIYPISSYKGSQ
        AKATMNVWSPKIEQTNEFSLSQIWILGGTFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNDIAMGASIYPISSYKGSQ
Subjt:  AKATMNVWSPKIEQTNEFSLSQIWILGGTFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNDIAMGASIYPISSYKGSQ

Query:  YDISLLIWKDPKEGNWWMQFGNSHVLGYWPAFLFSYLTDSASMIEWGGEVVNSEPDGQHTSTQMGSGHFPDQGFANAAYFRNIQIVGGSNNLRAPEDIAT
        YDISLLIWKDPKEGNWWMQFGNSHVLGYWPAFLFSYLTDSASMIEWGGEVVNSEPDGQHTSTQMGSGHFPDQGFANAAYFRNIQIVGGSNNLRAPEDIAT
Subjt:  YDISLLIWKDPKEGNWWMQFGNSHVLGYWPAFLFSYLTDSASMIEWGGEVVNSEPDGQHTSTQMGSGHFPDQGFANAAYFRNIQIVGGSNNLRAPEDIAT

Query:  FTEQPSCYDVQTGKSDDWGNYFFYGGPGRNPNCL
        FTEQPSCYDVQTGKSDDWGNYFFYGGPGRNPNCL
Subjt:  FTEQPSCYDVQTGKSDDWGNYFFYGGPGRNPNCL

XP_022953210.1 uncharacterized protein LOC111455822 [Cucurbita moschata]3.7e-24398.3Show/hide
Query:  MGTPKLLLLLLLLMLVSLSLMAAAGKSSRHRHRHRWLEAHTHMKKLNKPPLKSIKSPDGDVIDCVHMAHQPAFDHPLLRNHTIQMRPNFHPEGVFSDSKV
        MGTPK LLLLLLLMLVSL L+AAAGK+SRH HRHR LEAHTHMKKLNKPPLKSIKSPDGDVIDCVHMAHQPAFDHPLLRNHTIQMRPNFHPEGVFSDSKV
Subjt:  MGTPKLLLLLLLLMLVSLSLMAAAGKSSRHRHRHRWLEAHTHMKKLNKPPLKSIKSPDGDVIDCVHMAHQPAFDHPLLRNHTIQMRPNFHPEGVFSDSKV

Query:  TSKASITQLWHLKGRCPEGTIPIRRTKRDDILRASSVESYGKKRAHATVKPSSIDIDFNGQNGHQHAITYVEGGQYYGAKATMNVWSPKIEQTNEFSLSQ
        TSKAS+TQLWHLKGRCPEGTIPIRRTKRDDILRASSVESYGKKRAHATVKPSSIDIDFNGQNGHQHAITYVEGGQYYGAKATMNVWSPKIEQTNEFSLSQ
Subjt:  TSKASITQLWHLKGRCPEGTIPIRRTKRDDILRASSVESYGKKRAHATVKPSSIDIDFNGQNGHQHAITYVEGGQYYGAKATMNVWSPKIEQTNEFSLSQ

Query:  IWILGGTFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNDIAMGASIYPISSYKGSQYDISLLIWKDPKEGNWWMQFGN
        IWILGGTFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNDIAMGASIYPISSYKGSQYDISLLIWKDPKEGNWWMQFGN
Subjt:  IWILGGTFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNDIAMGASIYPISSYKGSQYDISLLIWKDPKEGNWWMQFGN

Query:  SHVLGYWPAFLFSYLTDSASMIEWGGEVVNSEPDGQHTSTQMGSGHFPDQGFANAAYFRNIQIVGGSNNLRAPEDIATFTEQPSCYDVQTGKSDDWGNYF
        SHVLGYWPAFLFSYLTDSASMIEWGGEVVNSEPDGQHTSTQMGSGHFPDQGFANAAYFRNIQIVGGSNNLRAPEDIATFTEQPSCYDVQTGKSDDWGNYF
Subjt:  SHVLGYWPAFLFSYLTDSASMIEWGGEVVNSEPDGQHTSTQMGSGHFPDQGFANAAYFRNIQIVGGSNNLRAPEDIATFTEQPSCYDVQTGKSDDWGNYF

Query:  FYGGPGRNPNCL
        FYGGPGRNPNCL
Subjt:  FYGGPGRNPNCL

XP_022991895.1 uncharacterized protein LOC111488398 [Cucurbita maxima]1.7e-24096.84Show/hide
Query:  MGTPKLLLLLLLLMLVSLSLMAAAGKSSRHRHRHRWLEAHTHMKKLNKPPLKSIKSPDGDVIDCVHMAHQPAFDHPLLRNHTIQMRPNFHPEGVFSDSKV
        MGT K  LLLLLLMLVSLSLMAAAGKS  HRHRHR LEAHTHMKKLNKPPLKSIKSPDGD+IDCVHMAHQPAFDHPLLRNHTIQMRPNFHP+GVFSDSKV
Subjt:  MGTPKLLLLLLLLMLVSLSLMAAAGKSSRHRHRHRWLEAHTHMKKLNKPPLKSIKSPDGDVIDCVHMAHQPAFDHPLLRNHTIQMRPNFHPEGVFSDSKV

Query:  TSKASITQLWHLKGRCPEGTIPIRRTKRDDILRASSVESYGKKRAHATVKPSSIDIDFNGQNGHQHAITYVEGGQYYGAKATMNVWSPKIEQTNEFSLSQ
        +SKAS+TQLWHLKGRCP+GTIPIRRTKRDDILRASSVESYGKKRAHATVKP+SIDIDFNGQNGHQHAITYVEGGQYYGAKATMNVWSPKIEQTNEFSLSQ
Subjt:  TSKASITQLWHLKGRCPEGTIPIRRTKRDDILRASSVESYGKKRAHATVKPSSIDIDFNGQNGHQHAITYVEGGQYYGAKATMNVWSPKIEQTNEFSLSQ

Query:  IWILGGTFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNDIAMGASIYPISSYKGSQYDISLLIWKDPKEGNWWMQFGN
        IWILGGTFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNDIAMGASIYPISSY+GSQYDISLLIWKDPKEGNWWMQFGN
Subjt:  IWILGGTFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNDIAMGASIYPISSYKGSQYDISLLIWKDPKEGNWWMQFGN

Query:  SHVLGYWPAFLFSYLTDSASMIEWGGEVVNSEPDGQHTSTQMGSGHFPDQGFANAAYFRNIQIVGGSNNLRAPEDIATFTEQPSCYDVQTGKSDDWGNYF
        SHVLGYWPAFLFSYLTDSASMIEWGGEVVNSEPDGQHTSTQMGSGHFPDQGFANAAYFRNIQIVGGSNNLRAPEDIATFTEQPSCYDVQTGKSDDWGNYF
Subjt:  SHVLGYWPAFLFSYLTDSASMIEWGGEVVNSEPDGQHTSTQMGSGHFPDQGFANAAYFRNIQIVGGSNNLRAPEDIATFTEQPSCYDVQTGKSDDWGNYF

Query:  FYGGPGRNPNCL
        FYGGPGRNPNCL
Subjt:  FYGGPGRNPNCL

XP_023549236.1 uncharacterized protein LOC111807656 [Cucurbita pepo subsp. pepo]1.6e-24198.52Show/hide
Query:  LLLLLLLMLVSLSLMAAAGKSSRHRHRHRWLEAHTHMKKLNKPPLKSIKSPDGDVIDCVHMAHQPAFDHPLLRNHTIQMRPNFHPEGVFSDSKVTSKASI
        L LLLLL+LVSLSLMAAAGKS RHRHRHRWLEAHTHMKKLNKP LKSIKSPDGDVIDCVHMAHQPAFDHPLLRNHTIQMRPNFHPEGVFSDSKVTS+ASI
Subjt:  LLLLLLLMLVSLSLMAAAGKSSRHRHRHRWLEAHTHMKKLNKPPLKSIKSPDGDVIDCVHMAHQPAFDHPLLRNHTIQMRPNFHPEGVFSDSKVTSKASI

Query:  TQLWHLKGRCPEGTIPIRRTKRDDILRASSVESYGKKRAHATVKPSSIDIDFNGQNGHQHAITYVEGGQYYGAKATMNVWSPKIEQTNEFSLSQIWILGG
        TQLWHLKGRCPEGTIPIRRTKRDDILRASSVESYGKKRAHATVKPSSIDIDFN QNGHQHAITYVEGGQYYGAKATMNVWSPKIEQTNEFSLSQIWILGG
Subjt:  TQLWHLKGRCPEGTIPIRRTKRDDILRASSVESYGKKRAHATVKPSSIDIDFNGQNGHQHAITYVEGGQYYGAKATMNVWSPKIEQTNEFSLSQIWILGG

Query:  TFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNDIAMGASIYPISSYKGSQYDISLLIWKDPKEGNWWMQFGNSHVLGY
        TFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNDIAMGASIYPISSYKGSQYDISLLIWKDPKEGNWWMQFGNSHVLGY
Subjt:  TFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNDIAMGASIYPISSYKGSQYDISLLIWKDPKEGNWWMQFGNSHVLGY

Query:  WPAFLFSYLTDSASMIEWGGEVVNSEPDGQHTSTQMGSGHFPDQGFANAAYFRNIQIVGGSNNLRAPEDIATFTEQPSCYDVQTGKSDDWGNYFFYGGPG
        WPAFLFSYLTDSASMIEWGGEVVNSEPDGQHTSTQMGSGHFPDQGFANAAYFRNIQIVGGSNNLRAPEDIATFTEQPSCYDVQTGKSDDWGNYFFYGGPG
Subjt:  WPAFLFSYLTDSASMIEWGGEVVNSEPDGQHTSTQMGSGHFPDQGFANAAYFRNIQIVGGSNNLRAPEDIATFTEQPSCYDVQTGKSDDWGNYFFYGGPG

Query:  RNPNCL
        RNPNCL
Subjt:  RNPNCL

TrEMBL top hitse value%identityAlignment
A0A1S3CE73 LOW QUALITY PROTEIN: uncharacterized protein LOC1034998722.2e-20984.01Show/hide
Query:  MGTPKLLLLLLLLMLVSLSLMAAAGKSS--RHRH-RHRWLEAHTHMKKLNKPPLKSIKSPDGDVIDCVHMAHQPAFDHPLLRNHTIQMRPNFHPE-GVFS
        MG  K LL L LLMLVSLSL    GK++  RHRH RHR LE H+H+KKLNKP +KSIKSPDGD+IDCVHMAHQPAFDHPLL+NHTIQMRPNFHPE G+ S
Subjt:  MGTPKLLLLLLLLMLVSLSLMAAAGKSS--RHRH-RHRWLEAHTHMKKLNKPPLKSIKSPDGDVIDCVHMAHQPAFDHPLLRNHTIQMRPNFHPE-GVFS

Query:  DSKVTSKAS----ITQLWHLKGRCPEGTIPIRRTKRDDILRASSVESYGKKRAHATVKPSSIDIDFNGQNGHQHAITYVEGGQYYGAKATMNVWSPKIEQ
        +SKV+ K S    ITQLWHLKG+CP+GTIPIRR K++DILR +SV+SYGKK+ +ATVKP+SI+ID NGQNGHQHAI YVEGGQYYGAKAT+NVWSPKI+Q
Subjt:  DSKVTSKAS----ITQLWHLKGRCPEGTIPIRRTKRDDILRASSVESYGKKRAHATVKPSSIDIDFNGQNGHQHAITYVEGGQYYGAKATMNVWSPKIEQ

Query:  TNEFSLSQIWILGGTFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNDIAMGASIYPISSYKGSQYDISLLIWKDPKEG
        TNEFSLSQIWILGGTFG+DLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINN+IAMGASI+PISSYK SQYDISLLIWKDPKEG
Subjt:  TNEFSLSQIWILGGTFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNDIAMGASIYPISSYKGSQYDISLLIWKDPKEG

Query:  NWWMQFGNSHVLGYWPAFLFSYLTDSASMIEWGGEVVNSEPDGQHTSTQMGSGHFPDQGFANAAYFRNIQIVGGSNNLRAPEDIATFTEQPSCYDVQTGK
        NWWMQFGN +VLGYWPAFLFSYLTDSASMIEWGGEVVNSE DGQHTSTQMGSGHFP +GF  A YFRNIQIVG SN+LRAPEDI  FTEQPSCYDVQ GK
Subjt:  NWWMQFGNSHVLGYWPAFLFSYLTDSASMIEWGGEVVNSEPDGQHTSTQMGSGHFPDQGFANAAYFRNIQIVGGSNNLRAPEDIATFTEQPSCYDVQTGK

Query:  SDDWGNYFFYGGPGRNPNC
        SDDWGNYFFYGGPGRNPNC
Subjt:  SDDWGNYFFYGGPGRNPNC

A0A6J1DGY2 uncharacterized protein LOC1110203901.1e-21186.13Show/hide
Query:  LLLLLLLMLVSLSLM-AAAGKSSRHRHRHRW-LEAHTHMKKLNKPPLKSIKSPDGDVIDCVHMAHQPAFDHPLLRNHTIQMRPNFHPEGVFSDSKVTSKA
        L LLL L L+SLSL   AA K+SR RH   W LEAHTH+KKLNKPP+KSIKSPDGD+IDCVHMAHQPAFDHPLLRNHTIQMRPNFHPEG+F D+KV+S  
Subjt:  LLLLLLLMLVSLSLM-AAAGKSSRHRHRHRW-LEAHTHMKKLNKPPLKSIKSPDGDVIDCVHMAHQPAFDHPLLRNHTIQMRPNFHPEGVFSDSKVTSKA

Query:  S----ITQLWHLKGRCPEGTIPIRRTKRDDILRASSVESYGKKRAHATVKPSSIDIDFNGQNGHQHAITYVEGGQYYGAKATMNVWSPKIEQTNEFSLSQ
        S    ITQLWHLKGRCPEGTIPIRRTK+ DILRASS++SYGKK+  ATVKP+SIDID NGQ GHQHAI YVEGG+YYGAKAT+NVWSPKI+QTNEFSLSQ
Subjt:  S----ITQLWHLKGRCPEGTIPIRRTKRDDILRASSVESYGKKRAHATVKPSSIDIDFNGQNGHQHAITYVEGGQYYGAKATMNVWSPKIEQTNEFSLSQ

Query:  IWILGGTFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNDIAMGASIYPISSYKGSQYDISLLIWKDPKEGNWWMQFGN
        IWILGGTFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINN IAMGASIYPISSY+ SQYDISLLIWKDPKEGNWWMQFGN
Subjt:  IWILGGTFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNDIAMGASIYPISSYKGSQYDISLLIWKDPKEGNWWMQFGN

Query:  SHVLGYWPAFLFSYLTDSASMIEWGGEVVNSEPDGQHTSTQMGSGHFPDQGFANAAYFRNIQIVGGSNNLRAPEDIATFTEQPSCYDVQTGKSDDWGNYF
        SHVLGYWPAFLFSYLTDSASMIEWGGEVVNSE DGQHTSTQMGSGHFP++GF  A YFRNIQ+V GSNNL+APEDI TFTEQPSCYDVQ GKS +WGNYF
Subjt:  SHVLGYWPAFLFSYLTDSASMIEWGGEVVNSEPDGQHTSTQMGSGHFPDQGFANAAYFRNIQIVGGSNNLRAPEDIATFTEQPSCYDVQTGKSDDWGNYF

Query:  FYGGPGRNPNC
        FYGGPGRNPNC
Subjt:  FYGGPGRNPNC

A0A6J1GP09 uncharacterized protein LOC1114558221.8e-24398.3Show/hide
Query:  MGTPKLLLLLLLLMLVSLSLMAAAGKSSRHRHRHRWLEAHTHMKKLNKPPLKSIKSPDGDVIDCVHMAHQPAFDHPLLRNHTIQMRPNFHPEGVFSDSKV
        MGTPK LLLLLLLMLVSL L+AAAGK+SRH HRHR LEAHTHMKKLNKPPLKSIKSPDGDVIDCVHMAHQPAFDHPLLRNHTIQMRPNFHPEGVFSDSKV
Subjt:  MGTPKLLLLLLLLMLVSLSLMAAAGKSSRHRHRHRWLEAHTHMKKLNKPPLKSIKSPDGDVIDCVHMAHQPAFDHPLLRNHTIQMRPNFHPEGVFSDSKV

Query:  TSKASITQLWHLKGRCPEGTIPIRRTKRDDILRASSVESYGKKRAHATVKPSSIDIDFNGQNGHQHAITYVEGGQYYGAKATMNVWSPKIEQTNEFSLSQ
        TSKAS+TQLWHLKGRCPEGTIPIRRTKRDDILRASSVESYGKKRAHATVKPSSIDIDFNGQNGHQHAITYVEGGQYYGAKATMNVWSPKIEQTNEFSLSQ
Subjt:  TSKASITQLWHLKGRCPEGTIPIRRTKRDDILRASSVESYGKKRAHATVKPSSIDIDFNGQNGHQHAITYVEGGQYYGAKATMNVWSPKIEQTNEFSLSQ

Query:  IWILGGTFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNDIAMGASIYPISSYKGSQYDISLLIWKDPKEGNWWMQFGN
        IWILGGTFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNDIAMGASIYPISSYKGSQYDISLLIWKDPKEGNWWMQFGN
Subjt:  IWILGGTFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNDIAMGASIYPISSYKGSQYDISLLIWKDPKEGNWWMQFGN

Query:  SHVLGYWPAFLFSYLTDSASMIEWGGEVVNSEPDGQHTSTQMGSGHFPDQGFANAAYFRNIQIVGGSNNLRAPEDIATFTEQPSCYDVQTGKSDDWGNYF
        SHVLGYWPAFLFSYLTDSASMIEWGGEVVNSEPDGQHTSTQMGSGHFPDQGFANAAYFRNIQIVGGSNNLRAPEDIATFTEQPSCYDVQTGKSDDWGNYF
Subjt:  SHVLGYWPAFLFSYLTDSASMIEWGGEVVNSEPDGQHTSTQMGSGHFPDQGFANAAYFRNIQIVGGSNNLRAPEDIATFTEQPSCYDVQTGKSDDWGNYF

Query:  FYGGPGRNPNCL
        FYGGPGRNPNCL
Subjt:  FYGGPGRNPNCL

A0A6J1H5K4 uncharacterized protein LOC1114606891.3e-20984.15Show/hide
Query:  LLLLLLMLVSLSLMAAAGKSSRHRHRHRW--LEAHTHMKKLNKPPLKSIKSPDGDVIDCVHMAHQPAFDHPLLRNHTIQMRPNFHPEGVFSDSKVTSKAS
        L LLLL++VSLS+ AAAGKSS HRHRHR    EA  H+KKLNKP +KSIKSPDGD+IDCVHMAHQPAFDHPLL+NHTIQMRPNFHPEGV SD+K++SK S
Subjt:  LLLLLLMLVSLSLMAAAGKSSRHRHRHRW--LEAHTHMKKLNKPPLKSIKSPDGDVIDCVHMAHQPAFDHPLLRNHTIQMRPNFHPEGVFSDSKVTSKAS

Query:  ----ITQLWHLKGRCPEGTIPIRRTKRDDILRASSVESYGKKRAHATVKPSSIDIDFNGQNGHQHAITYVEGGQYYGAKATMNVWSPKIEQTNEFSLSQI
            ITQ WHLKGRCP+GTIPIRRT + DILRA+SV++YG+K+  AT KP+SIDID NGQ GHQHAITYVEGGQYYGAKAT+NVWSPKI+  NEFSLSQI
Subjt:  ----ITQLWHLKGRCPEGTIPIRRTKRDDILRASSVESYGKKRAHATVKPSSIDIDFNGQNGHQHAITYVEGGQYYGAKATMNVWSPKIEQTNEFSLSQI

Query:  WILGGTFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNDIAMGASIYPISSYKGSQYDISLLIWKDPKEGNWWMQFGNS
        WILGGTFGEDLNSIEAGWQVSPDLYGDNNTR FTYWTSDAYQATGCYNLLCSGFVQINN+IA+GASIYPISSY+GSQYDISLLIWKDPKEGNWWMQFGNS
Subjt:  WILGGTFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNDIAMGASIYPISSYKGSQYDISLLIWKDPKEGNWWMQFGNS

Query:  HVLGYWPAFLFSYLTDSASMIEWGGEVVNSEPDGQHTSTQMGSGHFPDQGFANAAYFRNIQIVGGSNNLRAPEDIATFTEQPSCYDVQTGKSDDWGNYFF
        +VLGYWPAFLFSYLTD ASM+EWGGEVVNSE DGQHTSTQMGSGHFP +GF  A YFRNIQ+V GSNNLR PEDI TFTEQPSCYDVQ GKS DWGNYFF
Subjt:  HVLGYWPAFLFSYLTDSASMIEWGGEVVNSEPDGQHTSTQMGSGHFPDQGFANAAYFRNIQIVGGSNNLRAPEDIATFTEQPSCYDVQTGKSDDWGNYFF

Query:  YGGPGRNPNC
        YGGPGRNPNC
Subjt:  YGGPGRNPNC

A0A6J1JU76 uncharacterized protein LOC1114883988.3e-24196.84Show/hide
Query:  MGTPKLLLLLLLLMLVSLSLMAAAGKSSRHRHRHRWLEAHTHMKKLNKPPLKSIKSPDGDVIDCVHMAHQPAFDHPLLRNHTIQMRPNFHPEGVFSDSKV
        MGT K  LLLLLLMLVSLSLMAAAGKS  HRHRHR LEAHTHMKKLNKPPLKSIKSPDGD+IDCVHMAHQPAFDHPLLRNHTIQMRPNFHP+GVFSDSKV
Subjt:  MGTPKLLLLLLLLMLVSLSLMAAAGKSSRHRHRHRWLEAHTHMKKLNKPPLKSIKSPDGDVIDCVHMAHQPAFDHPLLRNHTIQMRPNFHPEGVFSDSKV

Query:  TSKASITQLWHLKGRCPEGTIPIRRTKRDDILRASSVESYGKKRAHATVKPSSIDIDFNGQNGHQHAITYVEGGQYYGAKATMNVWSPKIEQTNEFSLSQ
        +SKAS+TQLWHLKGRCP+GTIPIRRTKRDDILRASSVESYGKKRAHATVKP+SIDIDFNGQNGHQHAITYVEGGQYYGAKATMNVWSPKIEQTNEFSLSQ
Subjt:  TSKASITQLWHLKGRCPEGTIPIRRTKRDDILRASSVESYGKKRAHATVKPSSIDIDFNGQNGHQHAITYVEGGQYYGAKATMNVWSPKIEQTNEFSLSQ

Query:  IWILGGTFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNDIAMGASIYPISSYKGSQYDISLLIWKDPKEGNWWMQFGN
        IWILGGTFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNDIAMGASIYPISSY+GSQYDISLLIWKDPKEGNWWMQFGN
Subjt:  IWILGGTFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNDIAMGASIYPISSYKGSQYDISLLIWKDPKEGNWWMQFGN

Query:  SHVLGYWPAFLFSYLTDSASMIEWGGEVVNSEPDGQHTSTQMGSGHFPDQGFANAAYFRNIQIVGGSNNLRAPEDIATFTEQPSCYDVQTGKSDDWGNYF
        SHVLGYWPAFLFSYLTDSASMIEWGGEVVNSEPDGQHTSTQMGSGHFPDQGFANAAYFRNIQIVGGSNNLRAPEDIATFTEQPSCYDVQTGKSDDWGNYF
Subjt:  SHVLGYWPAFLFSYLTDSASMIEWGGEVVNSEPDGQHTSTQMGSGHFPDQGFANAAYFRNIQIVGGSNNLRAPEDIATFTEQPSCYDVQTGKSDDWGNYF

Query:  FYGGPGRNPNCL
        FYGGPGRNPNCL
Subjt:  FYGGPGRNPNCL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G55360.1 Protein of Unknown Function (DUF239)1.9e-17667.92Show/hide
Query:  VGEMGTPKL----LLLLLLLMLVSLSLMAAAGKSSRHRHRHRWLEAHTHMKKLNKPPLKSIKSPDGDVIDCVHMAHQPAFDHPLLRNHTIQMRPNFHPEG
        V  + T KL    L+ L L    SLS  A +G S +        E   H+ +LNKP +KSI+S DGDVIDCV ++ QPAFDHP L++H IQM+PN+HPEG
Subjt:  VGEMGTPKL----LLLLLLLMLVSLSLMAAAGKSSRHRHRHRWLEAHTHMKKLNKPPLKSIKSPDGDVIDCVHMAHQPAFDHPLLRNHTIQMRPNFHPEG

Query:  VFSDSKVTSKAS------ITQLWHLKGRCPEGTIPIRRTKRDDILRASSVESYGKKRAHATVKPSSIDIDFNGQNGHQHAITYVEGGQYYGAKATMNVWS
        +F D+KV++  S      I QLWH  G+C EGTIP+RRTK DD+LRASSV+ YGKK+  +   P S + D   Q+GHQHAI YVEG +YYGAKAT+NVW 
Subjt:  VFSDSKVTSKAS------ITQLWHLKGRCPEGTIPIRRTKRDDILRASSVESYGKKRAHATVKPSSIDIDFNGQNGHQHAITYVEGGQYYGAKATMNVWS

Query:  PKIEQTNEFSLSQIWILGGTFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNDIAMGASIYPISSYKGSQYDISLLIWK
        PKI+Q NEFSLSQIW+LGG+FG+DLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGF+QIN+DIAMGASI P+S Y+ SQYDIS+LIWK
Subjt:  PKIEQTNEFSLSQIWILGGTFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNDIAMGASIYPISSYKGSQYDISLLIWK

Query:  DPKEGNWWMQFGNSHVLGYWPAFLFSYLTDSASMIEWGGEVVNSEPDGQHTSTQMGSGHFPDQGFANAAYFRNIQIVGGSNNLRAPEDIATFTEQPSCYD
        DPKEG+WWMQFGN +VLGYWP+FLFSYLT+SASMIEWGGEVVNS+ DGQHTSTQMGSG FP++GF+ A+YFRNIQ+V GSNNL+AP+ + TFTEQ +CYD
Subjt:  DPKEGNWWMQFGNSHVLGYWPAFLFSYLTDSASMIEWGGEVVNSEPDGQHTSTQMGSGHFPDQGFANAAYFRNIQIVGGSNNLRAPEDIATFTEQPSCYD

Query:  VQTGKSDDWGNYFFYGGPGRNPNC
        VQTG +DDWG+YF+YGGPG+N  C
Subjt:  VQTGKSDDWGNYFFYGGPGRNPNC

AT2G44210.1 Protein of Unknown Function (DUF239)6.7e-15862.29Show/hide
Query:  MGTPKLLLLLLLLMLVSLSLMAAAGKSSRHRHRHRWLEAHTHMKKLNKPPLKSIKSPDGDVIDCVHMAHQPAFDHPLLRNHTIQMRPNFHPEGVFSDSKV
        M T     L L++ +V L+    +G++         L+  TH+K+LNKP LKSIKSPDGD+IDCV +  QPAF HPLL NHT+QM P+ +PE VFS+SKV
Subjt:  MGTPKLLLLLLLLMLVSLSLMAAAGKSSRHRHRHRWLEAHTHMKKLNKPPLKSIKSPDGDVIDCVHMAHQPAFDHPLLRNHTIQMRPNFHPEGVFSDSKV

Query:  TSKA------SITQLWHLKGRCPEGTIPIRRTKRDDILRASSVESYGKKRAHATVKP-SSIDIDFNGQNGHQHAITYVEGGQYYGAKATMNVWSPKIEQT
        +SK       +I QLWH+ G+CP+ TIPIRRT+R D+ RASSVE+YG K   +  KP SS   +   QNGHQHAI YVE G +YGAKA +NVW P +E  
Subjt:  TSKA------SITQLWHLKGRCPEGTIPIRRTKRDDILRASSVESYGKKRAHATVKP-SSIDIDFNGQNGHQHAITYVEGGQYYGAKATMNVWSPKIEQT

Query:  NEFSLSQIWILGGTFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNDIAMGASIYPISSYKGSQYDISLLIWKDPKEGN
        NEFSL+QIW+LGG F  DLNSIEAGWQVSP LYGDN TRLFTYWTSDAYQ TGCYNLLCSGFVQIN +IAMG SI P+S+Y  SQYDI++LIWKDPKEG+
Subjt:  NEFSLSQIWILGGTFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNDIAMGASIYPISSYKGSQYDISLLIWKDPKEGN

Query:  WWMQFGNSHVLGYWPAFLFSYLTDSASMIEWGGEVVNSE-PDGQHTSTQMGSGHFPDQGFANAAYFRNIQIVGGSNNLRAPEDIATFTEQPSCYDVQTGK
        WW+QFG  +++GYWPA LFSYL++SASMIEWGGEVVNS+  +GQHT+TQMGSG F ++G+  A+YF+N+Q+V GSN LR PE++  FT+Q +CY+V++G 
Subjt:  WWMQFGNSHVLGYWPAFLFSYLTDSASMIEWGGEVVNSE-PDGQHTSTQMGSGHFPDQGFANAAYFRNIQIVGGSNNLRAPEDIATFTEQPSCYDVQTGK

Query:  SDDWGNYFFYGGPGRNPNC
           WG+YF+YGGPGRNPNC
Subjt:  SDDWGNYFFYGGPGRNPNC

AT3G13510.1 Protein of Unknown Function (DUF239)1.5e-17869.14Show/hide
Query:  LLMLVSLSLMAAAGKSSRHRHRHRWLEAHTHMKKLNKPPLKSIKSPDGDVIDCVHMAHQPAFDHPLLRNHTIQMRPNFHPEGVFSDSKVTS-----KASI
        L +++SLS  AA+  SSR +      E   H+ +LNKPP+K+I+SPDGD+IDC+ ++ QPAFDHP L++H IQMRP++HPEG+F D+KV++     +  I
Subjt:  LLMLVSLSLMAAAGKSSRHRHRHRWLEAHTHMKKLNKPPLKSIKSPDGDVIDCVHMAHQPAFDHPLLRNHTIQMRPNFHPEGVFSDSKVTS-----KASI

Query:  TQLWHLKGRCPEGTIPIRRTKRDDILRASSVESYGKKRAHATVKPSSIDIDFNGQNGHQHAITYVEGGQYYGAKATMNVWSPKIEQTNEFSLSQIWILGG
         QLWH  G+C EGTIP+RRT+ DD+LRASSV+ YGKK+  +   P S + D   QNGHQHAI YVEG +YYGAKAT+NVW PKI+ TNEFSLSQIW+LGG
Subjt:  TQLWHLKGRCPEGTIPIRRTKRDDILRASSVESYGKKRAHATVKPSSIDIDFNGQNGHQHAITYVEGGQYYGAKATMNVWSPKIEQTNEFSLSQIWILGG

Query:  TFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNDIAMGASIYPISSYKGSQYDISLLIWKDPKEGNWWMQFGNSHVLGY
        +FG+DLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGF+QIN+DIAMGASI P+S Y+ SQYDIS+LIWKDPKEG+WWMQFGN +VLGY
Subjt:  TFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNDIAMGASIYPISSYKGSQYDISLLIWKDPKEGNWWMQFGNSHVLGY

Query:  WPAFLFSYLTDSASMIEWGGEVVNSEPDGQHTSTQMGSGHFPDQGFANAAYFRNIQIVGGSNNLRAPEDIATFTEQPSCYDVQTGKSDDWGNYFFYGGPG
        WP+FLFSYLT+SASMIEWGGEVVNS+ +G HT TQMGSGHFP++GF+ A+YFRNIQ+V GSNNL+AP+ + TFTE+ +CYDVQTG +DDWG+YF+YGGPG
Subjt:  WPAFLFSYLTDSASMIEWGGEVVNSEPDGQHTSTQMGSGHFPDQGFANAAYFRNIQIVGGSNNLRAPEDIATFTEQPSCYDVQTGKSDDWGNYFFYGGPG

Query:  RNPNC
        +N NC
Subjt:  RNPNC

AT5G56530.1 Protein of Unknown Function (DUF239)8.1e-17267.42Show/hide
Query:  LSLMAAAGKSSRHRHRHRWLEAHTHMKKLNKPPLKSIKSPDGDVIDCVHMAHQPAFDHPLLRNHTIQMRPNFHPEGVFSDSKVTSKAS-----ITQLWHL
        L  +  AG+ S  R      E H H+ +LNKP +KSI+SPDGD+IDCVH++ QPAFDHP L++H IQM P++ PE +F +SKV+ K       ITQLWH 
Subjt:  LSLMAAAGKSSRHRHRHRWLEAHTHMKKLNKPPLKSIKSPDGDVIDCVHMAHQPAFDHPLLRNHTIQMRPNFHPEGVFSDSKVTSKAS-----ITQLWHL

Query:  KGRCPEGTIPIRRTKRDDILRASSVESYGKKRAHATVKPSSIDIDFNGQNGHQHAITYVEGGQYYGAKATMNVWSPKIEQTNEFSLSQIWILGGTFGEDL
         G C EGTIP+RRTK++D+LRASSV+ YGKK+  +   P S D D   Q+GHQHAI YVEGG++YGAKAT+NVW PK++ +NEFSLSQ+WILGG+FG+DL
Subjt:  KGRCPEGTIPIRRTKRDDILRASSVESYGKKRAHATVKPSSIDIDFNGQNGHQHAITYVEGGQYYGAKATMNVWSPKIEQTNEFSLSQIWILGGTFGEDL

Query:  NSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNDIAMGASIYPISSYKGSQYDISLLIWKDPKEGNWWMQFGNSHVLGYWPAFLF
        NSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGF+QIN+ IAMGASI P+S +   QYDIS+ IWKDPKEG+WWMQFG+ +VLGYWP+FLF
Subjt:  NSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNDIAMGASIYPISSYKGSQYDISLLIWKDPKEGNWWMQFGNSHVLGYWPAFLF

Query:  SYLTDSASMIEWGGEVVNSEPDGQHTSTQMGSGHFPDQGFANAAYFRNIQIVGGSNNLRAPEDIATFTEQPSCYDVQTGKSDDWGNYFFYGGPGRNPNC
        SYL DSAS++EWGGEVVN E DG HT+TQMGSG FPD+GF  A+YFRNIQ+V  SNNL+ P+ + TFTE+ +CYDV+ GK+DDWG+YF+YGGPGRNPNC
Subjt:  SYLTDSASMIEWGGEVVNSEPDGQHTSTQMGSGHFPDQGFANAAYFRNIQIVGGSNNLRAPEDIATFTEQPSCYDVQTGKSDDWGNYFFYGGPGRNPNC

AT5G56530.2 Protein of Unknown Function (DUF239)8.1e-17267.42Show/hide
Query:  LSLMAAAGKSSRHRHRHRWLEAHTHMKKLNKPPLKSIKSPDGDVIDCVHMAHQPAFDHPLLRNHTIQMRPNFHPEGVFSDSKVTSKAS-----ITQLWHL
        L  +  AG+ S  R      E H H+ +LNKP +KSI+SPDGD+IDCVH++ QPAFDHP L++H IQM P++ PE +F +SKV+ K       ITQLWH 
Subjt:  LSLMAAAGKSSRHRHRHRWLEAHTHMKKLNKPPLKSIKSPDGDVIDCVHMAHQPAFDHPLLRNHTIQMRPNFHPEGVFSDSKVTSKAS-----ITQLWHL

Query:  KGRCPEGTIPIRRTKRDDILRASSVESYGKKRAHATVKPSSIDIDFNGQNGHQHAITYVEGGQYYGAKATMNVWSPKIEQTNEFSLSQIWILGGTFGEDL
         G C EGTIP+RRTK++D+LRASSV+ YGKK+  +   P S D D   Q+GHQHAI YVEGG++YGAKAT+NVW PK++ +NEFSLSQ+WILGG+FG+DL
Subjt:  KGRCPEGTIPIRRTKRDDILRASSVESYGKKRAHATVKPSSIDIDFNGQNGHQHAITYVEGGQYYGAKATMNVWSPKIEQTNEFSLSQIWILGGTFGEDL

Query:  NSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNDIAMGASIYPISSYKGSQYDISLLIWKDPKEGNWWMQFGNSHVLGYWPAFLF
        NSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGF+QIN+ IAMGASI P+S +   QYDIS+ IWKDPKEG+WWMQFG+ +VLGYWP+FLF
Subjt:  NSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNDIAMGASIYPISSYKGSQYDISLLIWKDPKEGNWWMQFGNSHVLGYWPAFLF

Query:  SYLTDSASMIEWGGEVVNSEPDGQHTSTQMGSGHFPDQGFANAAYFRNIQIVGGSNNLRAPEDIATFTEQPSCYDVQTGKSDDWGNYFFYGGPGRNPNC
        SYL DSAS++EWGGEVVN E DG HT+TQMGSG FPD+GF  A+YFRNIQ+V  SNNL+ P+ + TFTE+ +CYDV+ GK+DDWG+YF+YGGPGRNPNC
Subjt:  SYLTDSASMIEWGGEVVNSEPDGQHTSTQMGSGHFPDQGFANAAYFRNIQIVGGSNNLRAPEDIATFTEQPSCYDVQTGKSDDWGNYFFYGGPGRNPNC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGGGCATTGTCTGAAAGCATGGAGACTGAAAGACAAATAGCAGTAAATGAGCTTGTCGGTGAAATGGGTACTCCAAAGCTGCTGCTGCTTCTGCTGCTGCTAATGCT
GGTTTCCCTGTCTCTGATGGCGGCGGCCGGGAAATCCTCCCGCCACCGCCACCGCCACCGGTGGCTGGAAGCTCACACCCACATGAAGAAGCTCAATAAACCGCCTCTCA
AGTCCATCAAGAGTCCAGACGGCGATGTAATCGATTGTGTTCATATGGCTCATCAACCAGCTTTTGATCATCCTCTTCTCAGAAACCACACGATTCAGATGAGACCAAAT
TTTCATCCAGAAGGGGTTTTCAGTGACAGTAAAGTAACTTCAAAAGCTTCGATAACTCAATTATGGCACTTGAAAGGAAGGTGCCCAGAAGGAACGATTCCCATTAGAAG
AACGAAAAGAGATGATATTTTAAGAGCAAGCTCTGTGGAAAGCTACGGCAAAAAGAGGGCTCATGCCACGGTGAAACCAAGCTCCATTGATATCGATTTCAATGGCCAAA
ATGGACATCAGCATGCAATAACTTATGTTGAAGGAGGACAATACTATGGAGCTAAGGCAACTATGAACGTTTGGTCACCCAAAATCGAACAGACCAACGAATTCAGCCTC
TCGCAGATTTGGATTCTTGGAGGAACTTTTGGGGAAGATCTTAATAGCATTGAAGCTGGTTGGCAGGTTAGCCCTGATTTGTATGGAGATAACAATACTCGACTTTTCAC
TTATTGGACTAGTGATGCATATCAAGCCACTGGCTGCTACAATCTTCTCTGTTCTGGGTTTGTCCAAATCAACAATGACATAGCCATGGGTGCCAGCATTTATCCCATTT
CTTCTTACAAAGGCTCTCAGTATGACATTAGCTTGCTCATCTGGAAGGACCCTAAAGAAGGAAACTGGTGGATGCAATTCGGGAATAGCCACGTTTTGGGGTATTGGCCG
GCGTTCTTATTCTCATACCTCACCGACAGCGCCTCCATGATCGAGTGGGGCGGCGAAGTCGTCAACTCCGAACCCGACGGCCAACACACTTCCACTCAGATGGGCAGCGG
CCACTTCCCCGACCAGGGCTTCGCCAACGCCGCCTACTTTCGGAACATCCAGATTGTTGGGGGTTCCAACAACCTCCGGGCGCCGGAGGACATTGCAACGTTCACGGAGC
AACCAAGTTGCTACGACGTTCAGACCGGCAAGTCCGACGACTGGGGCAACTACTTCTTCTACGGCGGCCCTGGCAGAAACCCAAACTGCCTTTGA
mRNA sequenceShow/hide mRNA sequence
ATGAGGGCATTGTCTGAAAGCATGGAGACTGAAAGACAAATAGCAGTAAATGAGCTTGTCGGTGAAATGGGTACTCCAAAGCTGCTGCTGCTTCTGCTGCTGCTAATGCT
GGTTTCCCTGTCTCTGATGGCGGCGGCCGGGAAATCCTCCCGCCACCGCCACCGCCACCGGTGGCTGGAAGCTCACACCCACATGAAGAAGCTCAATAAACCGCCTCTCA
AGTCCATCAAGAGTCCAGACGGCGATGTAATCGATTGTGTTCATATGGCTCATCAACCAGCTTTTGATCATCCTCTTCTCAGAAACCACACGATTCAGATGAGACCAAAT
TTTCATCCAGAAGGGGTTTTCAGTGACAGTAAAGTAACTTCAAAAGCTTCGATAACTCAATTATGGCACTTGAAAGGAAGGTGCCCAGAAGGAACGATTCCCATTAGAAG
AACGAAAAGAGATGATATTTTAAGAGCAAGCTCTGTGGAAAGCTACGGCAAAAAGAGGGCTCATGCCACGGTGAAACCAAGCTCCATTGATATCGATTTCAATGGCCAAA
ATGGACATCAGCATGCAATAACTTATGTTGAAGGAGGACAATACTATGGAGCTAAGGCAACTATGAACGTTTGGTCACCCAAAATCGAACAGACCAACGAATTCAGCCTC
TCGCAGATTTGGATTCTTGGAGGAACTTTTGGGGAAGATCTTAATAGCATTGAAGCTGGTTGGCAGGTTAGCCCTGATTTGTATGGAGATAACAATACTCGACTTTTCAC
TTATTGGACTAGTGATGCATATCAAGCCACTGGCTGCTACAATCTTCTCTGTTCTGGGTTTGTCCAAATCAACAATGACATAGCCATGGGTGCCAGCATTTATCCCATTT
CTTCTTACAAAGGCTCTCAGTATGACATTAGCTTGCTCATCTGGAAGGACCCTAAAGAAGGAAACTGGTGGATGCAATTCGGGAATAGCCACGTTTTGGGGTATTGGCCG
GCGTTCTTATTCTCATACCTCACCGACAGCGCCTCCATGATCGAGTGGGGCGGCGAAGTCGTCAACTCCGAACCCGACGGCCAACACACTTCCACTCAGATGGGCAGCGG
CCACTTCCCCGACCAGGGCTTCGCCAACGCCGCCTACTTTCGGAACATCCAGATTGTTGGGGGTTCCAACAACCTCCGGGCGCCGGAGGACATTGCAACGTTCACGGAGC
AACCAAGTTGCTACGACGTTCAGACCGGCAAGTCCGACGACTGGGGCAACTACTTCTTCTACGGCGGCCCTGGCAGAAACCCAAACTGCCTTTGATTTTAAATTATCAAA
TATATATATATATAAATGCATTTCTTGTGCATTATTAATTAATTATATA
Protein sequenceShow/hide protein sequence
MRALSESMETERQIAVNELVGEMGTPKLLLLLLLLMLVSLSLMAAAGKSSRHRHRHRWLEAHTHMKKLNKPPLKSIKSPDGDVIDCVHMAHQPAFDHPLLRNHTIQMRPN
FHPEGVFSDSKVTSKASITQLWHLKGRCPEGTIPIRRTKRDDILRASSVESYGKKRAHATVKPSSIDIDFNGQNGHQHAITYVEGGQYYGAKATMNVWSPKIEQTNEFSL
SQIWILGGTFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNDIAMGASIYPISSYKGSQYDISLLIWKDPKEGNWWMQFGNSHVLGYWP
AFLFSYLTDSASMIEWGGEVVNSEPDGQHTSTQMGSGHFPDQGFANAAYFRNIQIVGGSNNLRAPEDIATFTEQPSCYDVQTGKSDDWGNYFFYGGPGRNPNCL