; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

ClCG09G003600 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene IDClCG09G003600
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
DescriptionProtein of Unknown Function (DUF239)
Genome locationCG_Chr09:3171742..3175193
RNA-Seq ExpressionClCG09G003600
SyntenyClCG09G003600
Gene Ontology termsGO:0016874 - ligase activity (molecular function)
InterPro domainsIPR004314 - Neprosin
IPR025521 - Neprosin activation peptide


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6593345.1 hypothetical protein SDJN03_12821, partial [Cucurbita argyrosperma subsp. sororia]1.1e-21586.33Show/hide
Query:  MGVLKLLLLLFLMLVSLSPATVAGKTNRHRH----RRLEAQSHLKKLNKPAVKSIKSPDGDTIDCVHMAHQPAFDHPLLRNHTIQMKPNFHPEGILRDSK
        MG LKLLLLL + L S++ A VAGK++ HRH    RR EAQ HLKKLNKPAVKSIKSPDGD IDCVHMAHQPAFDHPLL+NHTIQM+PNFHPEG+L D+K
Subjt:  MGVLKLLLLLFLMLVSLSPATVAGKTNRHRH----RRLEAQSHLKKLNKPAVKSIKSPDGDTIDCVHMAHQPAFDHPLLRNHTIQMKPNFHPEGILRDSK

Query:  LSTKASKSDAITQLWHLKGSCPKGTIPIRRTKKEDILRANSVKSYGKKKPHATVKPTSIDIDLNGQNGHQHAIIYVEGGQYYGAKATINVWSPKIQQTNE
        +S+K S+   ITQ WHLKG CP+GTIPIRRT K DILRANSVK+YGKKKP AT KPTSIDIDLNGQ GHQHAI YVEGGQYYGAKATINVWSPKIQ  NE
Subjt:  LSTKASKSDAITQLWHLKGSCPKGTIPIRRTKKEDILRANSVKSYGKKKPHATVKPTSIDIDLNGQNGHQHAIIYVEGGQYYGAKATINVWSPKIQQTNE

Query:  FSLSQIWILGGTFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNEIAMGASIFPISSYKGSQYDISLLIWKDPKEGNWW
        FSLSQIWILGGTFGEDLNSIEAGWQVSPDLYGDNNTR FTYWTSDAYQATGCYNLLCSGFVQINNEIA+GASI+PISSY+GSQYDISLLIWKDPKEGNWW
Subjt:  FSLSQIWILGGTFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNEIAMGASIFPISSYKGSQYDISLLIWKDPKEGNWW

Query:  MQFGNKYVLGYWPAFLFSYLTDSASMIEWGGEVVNSESDGQHTSTQMGSGHFPGEGFGKAGYFRNIQIVGESNSLRAPEDIGTFTEQPSCYDVQNGKSDD
        MQFGN YVLGYWPAFLFSYLTD ASM+EWGGEVVNSESDGQHTSTQMGSGHFPGEGFGKAGYFRNIQ+V  SN+LR PEDIGTFTEQPSCYDVQNGKS D
Subjt:  MQFGNKYVLGYWPAFLFSYLTDSASMIEWGGEVVNSESDGQHTSTQMGSGHFPGEGFGKAGYFRNIQIVGESNSLRAPEDIGTFTEQPSCYDVQNGKSDD

Query:  WGNYFFYGGPGRNPSCP
        WGNYFFYGGPGRNP+CP
Subjt:  WGNYFFYGGPGRNPSCP

XP_004135896.1 uncharacterized protein LOC101218833 [Cucumis sativus]1.5e-22591.15Show/hide
Query:  MGVLKLLLLLFLMLVSLSPATVAGKT----NRHRHRRLEAQSHLKKLNKPAVKSIKSPDGDTIDCVHMAHQPAFDHPLLRNHTIQMKPNFHPE-GILRDS
        MGVLKLL L  LMLVSL+  TV GKT    +RHR RRLE  SHLKKLNKPAVKSIKSPDGD IDCV MAHQPAFDHPLL+NHTIQM+P FHPE GIL DS
Subjt:  MGVLKLLLLLFLMLVSLSPATVAGKT----NRHRHRRLEAQSHLKKLNKPAVKSIKSPDGDTIDCVHMAHQPAFDHPLLRNHTIQMKPNFHPE-GILRDS

Query:  KLSTKASKSDAITQLWHLKGSCPKGTIPIRRTKKEDILRANSVKSYGKKKPHATVKPTSIDIDLNGQNGHQHAIIYVEGGQYYGAKATINVWSPKIQQTN
        K+S K SKS+ ITQLWHLKG CPKGTIPIRRTKKEDILR NSVKSYGKKKP+ATVKP SI++DLNGQNGHQHAIIYVEGGQYYGAKATINVWSPKIQQTN
Subjt:  KLSTKASKSDAITQLWHLKGSCPKGTIPIRRTKKEDILRANSVKSYGKKKPHATVKPTSIDIDLNGQNGHQHAIIYVEGGQYYGAKATINVWSPKIQQTN

Query:  EFSLSQIWILGGTFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNEIAMGASIFPISSYKGSQYDISLLIWKDPKEGNW
        EFSLSQIWILGGTFG+DLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNEIAMGASIFPISSYK SQYDISLLIWKDPKEGNW
Subjt:  EFSLSQIWILGGTFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNEIAMGASIFPISSYKGSQYDISLLIWKDPKEGNW

Query:  WMQFGNKYVLGYWPAFLFSYLTDSASMIEWGGEVVNSESDGQHTSTQMGSGHFPGEGFGKAGYFRNIQIVGESNSLRAPEDIGTFTEQPSCYDVQNGKSD
        WMQFGNKYVLGYWPAFLFSYLTDSASMIEWGGEVVNSESDGQHTSTQMGSGHFPGEGFGKAGYFRNIQIVGESNSLRAPEDIG FTEQPSCYDVQNGKSD
Subjt:  WMQFGNKYVLGYWPAFLFSYLTDSASMIEWGGEVVNSESDGQHTSTQMGSGHFPGEGFGKAGYFRNIQIVGESNSLRAPEDIGTFTEQPSCYDVQNGKSD

Query:  DWGNYFFYGGPGRNPSCP
        DWGNYFFYGGPGRNP+CP
Subjt:  DWGNYFFYGGPGRNPSCP

XP_008461220.1 PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein LOC103499872 [Cucumis melo]2.5e-22591.41Show/hide
Query:  MGVLKLLLLLFLMLVSLSPATVAGKT----NRH-RHRRLEAQSHLKKLNKPAVKSIKSPDGDTIDCVHMAHQPAFDHPLLRNHTIQMKPNFHPE-GILRD
        MGVLKLL L  LMLVSLS  TV GK     +RH RHRRLE  SHLKKLNKPAVKSIKSPDGD IDCVHMAHQPAFDHPLL+NHTIQM+PNFHPE GIL +
Subjt:  MGVLKLLLLLFLMLVSLSPATVAGKT----NRH-RHRRLEAQSHLKKLNKPAVKSIKSPDGDTIDCVHMAHQPAFDHPLLRNHTIQMKPNFHPE-GILRD

Query:  SKLSTKASKSDAITQLWHLKGSCPKGTIPIRRTKKEDILRANSVKSYGKKKPHATVKPTSIDIDLNGQNGHQHAIIYVEGGQYYGAKATINVWSPKIQQT
        SK+S K SKS+ ITQLWHLKG CPKGTIPIRR KKEDILR NSVKSYGKKKP+ATVKP SI+IDLNGQNGHQHAIIYVEGGQYYGAKATINVWSPKIQQT
Subjt:  SKLSTKASKSDAITQLWHLKGSCPKGTIPIRRTKKEDILRANSVKSYGKKKPHATVKPTSIDIDLNGQNGHQHAIIYVEGGQYYGAKATINVWSPKIQQT

Query:  NEFSLSQIWILGGTFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNEIAMGASIFPISSYKGSQYDISLLIWKDPKEGN
        NEFSLSQIWILGGTFG+DLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNEIAMGASIFPISSYK SQYDISLLIWKDPKEGN
Subjt:  NEFSLSQIWILGGTFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNEIAMGASIFPISSYKGSQYDISLLIWKDPKEGN

Query:  WWMQFGNKYVLGYWPAFLFSYLTDSASMIEWGGEVVNSESDGQHTSTQMGSGHFPGEGFGKAGYFRNIQIVGESNSLRAPEDIGTFTEQPSCYDVQNGKS
        WWMQFGNKYVLGYWPAFLFSYLTDSASMIEWGGEVVNSESDGQHTSTQMGSGHFPGEGFGKAGYFRNIQIVGESNSLRAPEDIG FTEQPSCYDVQNGKS
Subjt:  WWMQFGNKYVLGYWPAFLFSYLTDSASMIEWGGEVVNSESDGQHTSTQMGSGHFPGEGFGKAGYFRNIQIVGESNSLRAPEDIGTFTEQPSCYDVQNGKS

Query:  DDWGNYFFYGGPGRNPSCP
        DDWGNYFFYGGPGRNP+CP
Subjt:  DDWGNYFFYGGPGRNPSCP

XP_022959691.1 uncharacterized protein LOC111460689 [Cucurbita moschata]3.7e-21686.57Show/hide
Query:  MGVLKLLLLLFLMLVSLSPATVAGKTNRHRH----RRLEAQSHLKKLNKPAVKSIKSPDGDTIDCVHMAHQPAFDHPLLRNHTIQMKPNFHPEGILRDSK
        MG LKLLLLL   +VSLS A  AGK++ HRH    RR EAQ HLKKLNKPAVKSIKSPDGD IDCVHMAHQPAFDHPLL+NHTIQM+PNFHPEG+L D+K
Subjt:  MGVLKLLLLLFLMLVSLSPATVAGKTNRHRH----RRLEAQSHLKKLNKPAVKSIKSPDGDTIDCVHMAHQPAFDHPLLRNHTIQMKPNFHPEGILRDSK

Query:  LSTKASKSDAITQLWHLKGSCPKGTIPIRRTKKEDILRANSVKSYGKKKPHATVKPTSIDIDLNGQNGHQHAIIYVEGGQYYGAKATINVWSPKIQQTNE
        +S+K S+   ITQ WHLKG CPKGTIPIRRT K DILRANSVK+YG+KKP AT KPTSIDIDLNGQ GHQHAI YVEGGQYYGAKATINVWSPKIQ  NE
Subjt:  LSTKASKSDAITQLWHLKGSCPKGTIPIRRTKKEDILRANSVKSYGKKKPHATVKPTSIDIDLNGQNGHQHAIIYVEGGQYYGAKATINVWSPKIQQTNE

Query:  FSLSQIWILGGTFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNEIAMGASIFPISSYKGSQYDISLLIWKDPKEGNWW
        FSLSQIWILGGTFGEDLNSIEAGWQVSPDLYGDNNTR FTYWTSDAYQATGCYNLLCSGFVQINNEIA+GASI+PISSY+GSQYDISLLIWKDPKEGNWW
Subjt:  FSLSQIWILGGTFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNEIAMGASIFPISSYKGSQYDISLLIWKDPKEGNWW

Query:  MQFGNKYVLGYWPAFLFSYLTDSASMIEWGGEVVNSESDGQHTSTQMGSGHFPGEGFGKAGYFRNIQIVGESNSLRAPEDIGTFTEQPSCYDVQNGKSDD
        MQFGN YVLGYWPAFLFSYLTD ASM+EWGGEVVNSESDGQHTSTQMGSGHFPGEGFGKAGYFRNIQ+V  SN+LR PEDIGTFTEQPSCYDVQNGKS D
Subjt:  MQFGNKYVLGYWPAFLFSYLTDSASMIEWGGEVVNSESDGQHTSTQMGSGHFPGEGFGKAGYFRNIQIVGESNSLRAPEDIGTFTEQPSCYDVQNGKSDD

Query:  WGNYFFYGGPGRNPSCP
        WGNYFFYGGPGRNP+CP
Subjt:  WGNYFFYGGPGRNPSCP

XP_038899904.1 uncharacterized protein LOC120087094 [Benincasa hispida]3.5e-23594.92Show/hide
Query:  MGVLKLLLLLFLMLVSLSPATVAGKTNRHRHRRLEAQSHLKKLNKPAVKSIKSPDGDTIDCVHMAHQPAFDHPLLRNHTIQMKPNFHPEGILRDSKLSTK
        MGVLKLL LL LML+SLS  TVAGKT RHRHRRL+  SHLKKLNKP VKSIKSPDGD IDCVHMAHQPAFDHPLLRNHTIQM PNFHPEGILR+SK+S+K
Subjt:  MGVLKLLLLLFLMLVSLSPATVAGKTNRHRHRRLEAQSHLKKLNKPAVKSIKSPDGDTIDCVHMAHQPAFDHPLLRNHTIQMKPNFHPEGILRDSKLSTK

Query:  ASKSDAITQLWHLKGSCPKGTIPIRRTKKEDILRANSVKSYGKKKPHATVKPTSIDIDLNGQNGHQHAIIYVEGGQYYGAKATINVWSPKIQQTNEFSLS
        ASKS  ITQLWHLKG CPKGTIPIRRTKKEDILRA+SVKSYGKKKPHATVKPTSIDIDLNGQNGHQHAIIYVEGGQYYGAKATINVWSPKIQQTNEFSLS
Subjt:  ASKSDAITQLWHLKGSCPKGTIPIRRTKKEDILRANSVKSYGKKKPHATVKPTSIDIDLNGQNGHQHAIIYVEGGQYYGAKATINVWSPKIQQTNEFSLS

Query:  QIWILGGTFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNEIAMGASIFPISSYKGSQYDISLLIWKDPKEGNWWMQFG
        QIWILGGTFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNEIAMGASIFPISSYKGSQYDISLLIWKDPKEGNWWMQFG
Subjt:  QIWILGGTFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNEIAMGASIFPISSYKGSQYDISLLIWKDPKEGNWWMQFG

Query:  NKYVLGYWPAFLFSYLTDSASMIEWGGEVVNSESDGQHTSTQMGSGHFPGEGFGKAGYFRNIQIVGESNSLRAPEDIGTFTEQPSCYDVQNGKSDDWGNY
        NKYVLGYWPAFLFSYLTDSASMIEWGGEVVNSESDGQHTSTQMGSGHFPGEGFGKAGYFRNIQIVGESNSLRAPEDIGTFTEQPSCYDV+NGKSDDWGNY
Subjt:  NKYVLGYWPAFLFSYLTDSASMIEWGGEVVNSESDGQHTSTQMGSGHFPGEGFGKAGYFRNIQIVGESNSLRAPEDIGTFTEQPSCYDVQNGKSDDWGNY

Query:  FFYGGPGRNPSCP
        FFYGGPGRNP+CP
Subjt:  FFYGGPGRNPSCP

TrEMBL top hitse value%identityAlignment
A0A0A0KBL1 Uncharacterized protein7.2e-22691.15Show/hide
Query:  MGVLKLLLLLFLMLVSLSPATVAGKT----NRHRHRRLEAQSHLKKLNKPAVKSIKSPDGDTIDCVHMAHQPAFDHPLLRNHTIQMKPNFHPE-GILRDS
        MGVLKLL L  LMLVSL+  TV GKT    +RHR RRLE  SHLKKLNKPAVKSIKSPDGD IDCV MAHQPAFDHPLL+NHTIQM+P FHPE GIL DS
Subjt:  MGVLKLLLLLFLMLVSLSPATVAGKT----NRHRHRRLEAQSHLKKLNKPAVKSIKSPDGDTIDCVHMAHQPAFDHPLLRNHTIQMKPNFHPE-GILRDS

Query:  KLSTKASKSDAITQLWHLKGSCPKGTIPIRRTKKEDILRANSVKSYGKKKPHATVKPTSIDIDLNGQNGHQHAIIYVEGGQYYGAKATINVWSPKIQQTN
        K+S K SKS+ ITQLWHLKG CPKGTIPIRRTKKEDILR NSVKSYGKKKP+ATVKP SI++DLNGQNGHQHAIIYVEGGQYYGAKATINVWSPKIQQTN
Subjt:  KLSTKASKSDAITQLWHLKGSCPKGTIPIRRTKKEDILRANSVKSYGKKKPHATVKPTSIDIDLNGQNGHQHAIIYVEGGQYYGAKATINVWSPKIQQTN

Query:  EFSLSQIWILGGTFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNEIAMGASIFPISSYKGSQYDISLLIWKDPKEGNW
        EFSLSQIWILGGTFG+DLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNEIAMGASIFPISSYK SQYDISLLIWKDPKEGNW
Subjt:  EFSLSQIWILGGTFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNEIAMGASIFPISSYKGSQYDISLLIWKDPKEGNW

Query:  WMQFGNKYVLGYWPAFLFSYLTDSASMIEWGGEVVNSESDGQHTSTQMGSGHFPGEGFGKAGYFRNIQIVGESNSLRAPEDIGTFTEQPSCYDVQNGKSD
        WMQFGNKYVLGYWPAFLFSYLTDSASMIEWGGEVVNSESDGQHTSTQMGSGHFPGEGFGKAGYFRNIQIVGESNSLRAPEDIG FTEQPSCYDVQNGKSD
Subjt:  WMQFGNKYVLGYWPAFLFSYLTDSASMIEWGGEVVNSESDGQHTSTQMGSGHFPGEGFGKAGYFRNIQIVGESNSLRAPEDIGTFTEQPSCYDVQNGKSD

Query:  DWGNYFFYGGPGRNPSCP
        DWGNYFFYGGPGRNP+CP
Subjt:  DWGNYFFYGGPGRNPSCP

A0A1S3CE73 LOW QUALITY PROTEIN: uncharacterized protein LOC1034998721.2e-22591.41Show/hide
Query:  MGVLKLLLLLFLMLVSLSPATVAGKT----NRH-RHRRLEAQSHLKKLNKPAVKSIKSPDGDTIDCVHMAHQPAFDHPLLRNHTIQMKPNFHPE-GILRD
        MGVLKLL L  LMLVSLS  TV GK     +RH RHRRLE  SHLKKLNKPAVKSIKSPDGD IDCVHMAHQPAFDHPLL+NHTIQM+PNFHPE GIL +
Subjt:  MGVLKLLLLLFLMLVSLSPATVAGKT----NRH-RHRRLEAQSHLKKLNKPAVKSIKSPDGDTIDCVHMAHQPAFDHPLLRNHTIQMKPNFHPE-GILRD

Query:  SKLSTKASKSDAITQLWHLKGSCPKGTIPIRRTKKEDILRANSVKSYGKKKPHATVKPTSIDIDLNGQNGHQHAIIYVEGGQYYGAKATINVWSPKIQQT
        SK+S K SKS+ ITQLWHLKG CPKGTIPIRR KKEDILR NSVKSYGKKKP+ATVKP SI+IDLNGQNGHQHAIIYVEGGQYYGAKATINVWSPKIQQT
Subjt:  SKLSTKASKSDAITQLWHLKGSCPKGTIPIRRTKKEDILRANSVKSYGKKKPHATVKPTSIDIDLNGQNGHQHAIIYVEGGQYYGAKATINVWSPKIQQT

Query:  NEFSLSQIWILGGTFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNEIAMGASIFPISSYKGSQYDISLLIWKDPKEGN
        NEFSLSQIWILGGTFG+DLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNEIAMGASIFPISSYK SQYDISLLIWKDPKEGN
Subjt:  NEFSLSQIWILGGTFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNEIAMGASIFPISSYKGSQYDISLLIWKDPKEGN

Query:  WWMQFGNKYVLGYWPAFLFSYLTDSASMIEWGGEVVNSESDGQHTSTQMGSGHFPGEGFGKAGYFRNIQIVGESNSLRAPEDIGTFTEQPSCYDVQNGKS
        WWMQFGNKYVLGYWPAFLFSYLTDSASMIEWGGEVVNSESDGQHTSTQMGSGHFPGEGFGKAGYFRNIQIVGESNSLRAPEDIG FTEQPSCYDVQNGKS
Subjt:  WWMQFGNKYVLGYWPAFLFSYLTDSASMIEWGGEVVNSESDGQHTSTQMGSGHFPGEGFGKAGYFRNIQIVGESNSLRAPEDIGTFTEQPSCYDVQNGKS

Query:  DDWGNYFFYGGPGRNPSCP
        DDWGNYFFYGGPGRNP+CP
Subjt:  DDWGNYFFYGGPGRNPSCP

A0A6J1DGY2 uncharacterized protein LOC1110203908.8e-21687.56Show/hide
Query:  LKLLLLLFLMLVSLSPATVAGKTNRHRHRRLEAQSHLKKLNKPAVKSIKSPDGDTIDCVHMAHQPAFDHPLLRNHTIQMKPNFHPEGILRDSKLSTKASK
        L LLL LFL+ +SL+P   A KT+R RH RLEA +HLKKLNKP VKSIKSPDGD IDCVHMAHQPAFDHPLLRNHTIQM+PNFHPEGI +D+K+S+  S+
Subjt:  LKLLLLLFLMLVSLSPATVAGKTNRHRHRRLEAQSHLKKLNKPAVKSIKSPDGDTIDCVHMAHQPAFDHPLLRNHTIQMKPNFHPEGILRDSKLSTKASK

Query:  SDAITQLWHLKGSCPKGTIPIRRTKKEDILRANSVKSYGKKKPHATVKPTSIDIDLNGQNGHQHAIIYVEGGQYYGAKATINVWSPKIQQTNEFSLSQIW
        S  ITQLWHLKG CP+GTIPIRRTKK DILRA+S+KSYGKKKP ATVKPTSIDIDLNGQ GHQHAIIYVEGG+YYGAKATINVWSPKIQQTNEFSLSQIW
Subjt:  SDAITQLWHLKGSCPKGTIPIRRTKKEDILRANSVKSYGKKKPHATVKPTSIDIDLNGQNGHQHAIIYVEGGQYYGAKATINVWSPKIQQTNEFSLSQIW

Query:  ILGGTFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNEIAMGASIFPISSYKGSQYDISLLIWKDPKEGNWWMQFGNKY
        ILGGTFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINN+IAMGASI+PISSY+ SQYDISLLIWKDPKEGNWWMQFGN +
Subjt:  ILGGTFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNEIAMGASIFPISSYKGSQYDISLLIWKDPKEGNWWMQFGNKY

Query:  VLGYWPAFLFSYLTDSASMIEWGGEVVNSESDGQHTSTQMGSGHFPGEGFGKAGYFRNIQIVGESNSLRAPEDIGTFTEQPSCYDVQNGKSDDWGNYFFY
        VLGYWPAFLFSYLTDSASMIEWGGEVVNSESDGQHTSTQMGSGHFP EGFGKAGYFRNIQ+V  SN+L+APEDIGTFTEQPSCYDVQNGKS +WGNYFFY
Subjt:  VLGYWPAFLFSYLTDSASMIEWGGEVVNSESDGQHTSTQMGSGHFPGEGFGKAGYFRNIQIVGESNSLRAPEDIGTFTEQPSCYDVQNGKSDDWGNYFFY

Query:  GGPGRNPSCP
        GGPGRNP+CP
Subjt:  GGPGRNPSCP

A0A6J1H5K4 uncharacterized protein LOC1114606891.8e-21686.57Show/hide
Query:  MGVLKLLLLLFLMLVSLSPATVAGKTNRHRH----RRLEAQSHLKKLNKPAVKSIKSPDGDTIDCVHMAHQPAFDHPLLRNHTIQMKPNFHPEGILRDSK
        MG LKLLLLL   +VSLS A  AGK++ HRH    RR EAQ HLKKLNKPAVKSIKSPDGD IDCVHMAHQPAFDHPLL+NHTIQM+PNFHPEG+L D+K
Subjt:  MGVLKLLLLLFLMLVSLSPATVAGKTNRHRH----RRLEAQSHLKKLNKPAVKSIKSPDGDTIDCVHMAHQPAFDHPLLRNHTIQMKPNFHPEGILRDSK

Query:  LSTKASKSDAITQLWHLKGSCPKGTIPIRRTKKEDILRANSVKSYGKKKPHATVKPTSIDIDLNGQNGHQHAIIYVEGGQYYGAKATINVWSPKIQQTNE
        +S+K S+   ITQ WHLKG CPKGTIPIRRT K DILRANSVK+YG+KKP AT KPTSIDIDLNGQ GHQHAI YVEGGQYYGAKATINVWSPKIQ  NE
Subjt:  LSTKASKSDAITQLWHLKGSCPKGTIPIRRTKKEDILRANSVKSYGKKKPHATVKPTSIDIDLNGQNGHQHAIIYVEGGQYYGAKATINVWSPKIQQTNE

Query:  FSLSQIWILGGTFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNEIAMGASIFPISSYKGSQYDISLLIWKDPKEGNWW
        FSLSQIWILGGTFGEDLNSIEAGWQVSPDLYGDNNTR FTYWTSDAYQATGCYNLLCSGFVQINNEIA+GASI+PISSY+GSQYDISLLIWKDPKEGNWW
Subjt:  FSLSQIWILGGTFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNEIAMGASIFPISSYKGSQYDISLLIWKDPKEGNWW

Query:  MQFGNKYVLGYWPAFLFSYLTDSASMIEWGGEVVNSESDGQHTSTQMGSGHFPGEGFGKAGYFRNIQIVGESNSLRAPEDIGTFTEQPSCYDVQNGKSDD
        MQFGN YVLGYWPAFLFSYLTD ASM+EWGGEVVNSESDGQHTSTQMGSGHFPGEGFGKAGYFRNIQ+V  SN+LR PEDIGTFTEQPSCYDVQNGKS D
Subjt:  MQFGNKYVLGYWPAFLFSYLTDSASMIEWGGEVVNSESDGQHTSTQMGSGHFPGEGFGKAGYFRNIQIVGESNSLRAPEDIGTFTEQPSCYDVQNGKSDD

Query:  WGNYFFYGGPGRNPSCP
        WGNYFFYGGPGRNP+CP
Subjt:  WGNYFFYGGPGRNPSCP

A0A6J1KPT3 uncharacterized protein LOC1114976112.8e-21486.92Show/hide
Query:  MGVLKLLLLLFLMLVSLSPATVAGKTNRHRHRRLEAQSHLKKLNKPAVKSIKSPDGDTIDCVHMAHQPAFDHPLLRNHTIQMKPNFHPEGILRDSKLSTK
        MG LKLLLLL   +VSLS A  AGK++ HR RR EAQ HLKKLNKPAVKSIKSPDGD IDCVHMAHQPAFDHPLL+NHTIQM+PNFHPEG+L  +K+S+K
Subjt:  MGVLKLLLLLFLMLVSLSPATVAGKTNRHRHRRLEAQSHLKKLNKPAVKSIKSPDGDTIDCVHMAHQPAFDHPLLRNHTIQMKPNFHPEGILRDSKLSTK

Query:  ASKSDAITQLWHLKGSCPKGTIPIRRTKKEDILRANSVKSYGKKKPHATVKPTSIDIDLNGQNGHQHAIIYVEGGQYYGAKATINVWSPKIQQTNEFSLS
         S+   ITQ WHLKG CP+GTIPIRRT K DILRANSVKSYGKKKP ATV+PTSIDIDLNGQ GHQHAI YVEGGQYYGAKATINVWSPKIQ  NEFSLS
Subjt:  ASKSDAITQLWHLKGSCPKGTIPIRRTKKEDILRANSVKSYGKKKPHATVKPTSIDIDLNGQNGHQHAIIYVEGGQYYGAKATINVWSPKIQQTNEFSLS

Query:  QIWILGGTFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNEIAMGASIFPISSYKGSQYDISLLIWKDPKEGNWWMQFG
        QIWILGGTFGEDLNSIEAGWQVSPDLYGDNNTR FTYWTSDAYQATGCYNLLCSGFVQINNEIA+GASI+PISSY+GSQYDISLLIWKDPKEGNWWMQFG
Subjt:  QIWILGGTFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNEIAMGASIFPISSYKGSQYDISLLIWKDPKEGNWWMQFG

Query:  NKYVLGYWPAFLFSYLTDSASMIEWGGEVVNSESDGQHTSTQMGSGHFPGEGFGKAGYFRNIQIVGESNSLRAPEDIGTFTEQPSCYDVQNGKSDDWGNY
        N YVLGYWPAFLFSYLTD ASM+EWGGEVVNSES GQHTSTQMGSGHFPGEGFGKAGYFRNIQ+V  SN+LR PEDIGTFTEQPSCYDVQNGKS DWGNY
Subjt:  NKYVLGYWPAFLFSYLTDSASMIEWGGEVVNSESDGQHTSTQMGSGHFPGEGFGKAGYFRNIQIVGESNSLRAPEDIGTFTEQPSCYDVQNGKSDDWGNY

Query:  FFYGGPGRNPSCP
        FFYGGPGRNP+CP
Subjt:  FFYGGPGRNPSCP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G55360.1 Protein of Unknown Function (DUF239)3.0e-17670.49Show/hide
Query:  LLLLLFLMLVSLSPATVAGKTNRHRHRRLEAQSHLKKLNKPAVKSIKSPDGDTIDCVHMAHQPAFDHPLLRNHTIQMKPNFHPEGILRDSKLSTKAS--K
        L+ L      SLS A  +G +     ++ E + HL +LNKPAVKSI+S DGD IDCV ++ QPAFDHP L++H IQMKPN+HPEG+  D+K+S   S  K
Subjt:  LLLLLFLMLVSLSPATVAGKTNRHRHRRLEAQSHLKKLNKPAVKSIKSPDGDTIDCVHMAHQPAFDHPLLRNHTIQMKPNFHPEGILRDSKLSTKAS--K

Query:  SDAITQLWHLKGSCPKGTIPIRRTKKEDILRANSVKSYGKKKPHATVKPTSIDIDLNGQNGHQHAIIYVEGGQYYGAKATINVWSPKIQQTNEFSLSQIW
           I QLWH  G C +GTIP+RRTK++D+LRA+SVK YGKKK  +   P S + DL  Q+GHQHAI YVEG +YYGAKATINVW PKIQQ NEFSLSQIW
Subjt:  SDAITQLWHLKGSCPKGTIPIRRTKKEDILRANSVKSYGKKKPHATVKPTSIDIDLNGQNGHQHAIIYVEGGQYYGAKATINVWSPKIQQTNEFSLSQIW

Query:  ILGGTFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNEIAMGASIFPISSYKGSQYDISLLIWKDPKEGNWWMQFGNKY
        +LGG+FG+DLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGF+QIN++IAMGASI P+S Y+ SQYDIS+LIWKDPKEG+WWMQFGN Y
Subjt:  ILGGTFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNEIAMGASIFPISSYKGSQYDISLLIWKDPKEGNWWMQFGNKY

Query:  VLGYWPAFLFSYLTDSASMIEWGGEVVNSESDGQHTSTQMGSGHFPGEGFGKAGYFRNIQIVGESNSLRAPEDIGTFTEQPSCYDVQNGKSDDWGNYFFY
        VLGYWP+FLFSYLT+SASMIEWGGEVVNS+SDGQHTSTQMGSG FP EGF KA YFRNIQ+V  SN+L+AP+ +GTFTEQ +CYDVQ G +DDWG+YF+Y
Subjt:  VLGYWPAFLFSYLTDSASMIEWGGEVVNSESDGQHTSTQMGSGHFPGEGFGKAGYFRNIQIVGESNSLRAPEDIGTFTEQPSCYDVQNGKSDDWGNYFFY

Query:  GGPGRNPSCP
        GGPG+N  CP
Subjt:  GGPGRNPSCP

AT2G44210.1 Protein of Unknown Function (DUF239)3.1e-16063.53Show/hide
Query:  LKLLLLLFLMLVSLSPATVAGKTNRHRHRRLEAQSHLKKLNKPAVKSIKSPDGDTIDCVHMAHQPAFDHPLLRNHTIQMKPNFHPEGILRDSKLS--TKA
        +   L L + +V L+P+ V+G+        L+ ++HLK+LNKPA+KSIKSPDGD IDCV +  QPAF HPLL NHT+QM P+ +PE +  +SK+S  TK 
Subjt:  LKLLLLLFLMLVSLSPATVAGKTNRHRHRRLEAQSHLKKLNKPAVKSIKSPDGDTIDCVHMAHQPAFDHPLLRNHTIQMKPNFHPEGILRDSKLS--TKA

Query:  SKSDAITQLWHLKGSCPKGTIPIRRTKKEDILRANSVKSYGKKKPHATVKPTSID-IDLNGQNGHQHAIIYVEGGQYYGAKATINVWSPKIQQTNEFSLS
         +S+AI QLWH+ G CPK TIPIRRT+++D+ RA+SV++YG K   +  KP S +  ++  QNGHQHAI+YVE G +YGAKA INVW P ++  NEFSL+
Subjt:  SKSDAITQLWHLKGSCPKGTIPIRRTKKEDILRANSVKSYGKKKPHATVKPTSID-IDLNGQNGHQHAIIYVEGGQYYGAKATINVWSPKIQQTNEFSLS

Query:  QIWILGGTFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNEIAMGASIFPISSYKGSQYDISLLIWKDPKEGNWWMQFG
        QIW+LGG F  DLNSIEAGWQVSP LYGDN TRLFTYWTSDAYQ TGCYNLLCSGFVQIN EIAMG SI P+S+Y  SQYDI++LIWKDPKEG+WW+QFG
Subjt:  QIWILGGTFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNEIAMGASIFPISSYKGSQYDISLLIWKDPKEGNWWMQFG

Query:  NKYVLGYWPAFLFSYLTDSASMIEWGGEVVNSES-DGQHTSTQMGSGHFPGEGFGKAGYFRNIQIVGESNSLRAPEDIGTFTEQPSCYDVQNGKSDDWGN
         KY++GYWPA LFSYL++SASMIEWGGEVVNS+S +GQHT+TQMGSG F  EG+GKA YF+N+Q+V  SN LR PE++  FT+Q +CY+V++G    WG+
Subjt:  NKYVLGYWPAFLFSYLTDSASMIEWGGEVVNSES-DGQHTSTQMGSGHFPGEGFGKAGYFRNIQIVGESNSLRAPEDIGTFTEQPSCYDVQNGKSDDWGN

Query:  YFFYGGPGRNPSCP
        YF+YGGPGRNP+CP
Subjt:  YFFYGGPGRNPSCP

AT3G13510.1 Protein of Unknown Function (DUF239)3.4e-17569.15Show/hide
Query:  MLVSLSPATVAGKTNRHRHRRLEAQSHLKKLNKPAVKSIKSPDGDTIDCVHMAHQPAFDHPLLRNHTIQMKPNFHPEGILRDSKLSTK-ASKSDAITQLW
        +++SLS A  +  ++R   ++ E + HL +LNKP VK+I+SPDGD IDC+ ++ QPAFDHP L++H IQM+P++HPEG+  D+K+S +   K   I QLW
Subjt:  MLVSLSPATVAGKTNRHRHRRLEAQSHLKKLNKPAVKSIKSPDGDTIDCVHMAHQPAFDHPLLRNHTIQMKPNFHPEGILRDSKLSTK-ASKSDAITQLW

Query:  HLKGSCPKGTIPIRRTKKEDILRANSVKSYGKKKPHATVKPTSIDIDLNGQNGHQHAIIYVEGGQYYGAKATINVWSPKIQQTNEFSLSQIWILGGTFGE
        H  G C +GTIP+RRT+++D+LRA+SVK YGKKK  +   P S + DL  QNGHQHAI YVEG +YYGAKAT+NVW PKIQ TNEFSLSQIW+LGG+FG+
Subjt:  HLKGSCPKGTIPIRRTKKEDILRANSVKSYGKKKPHATVKPTSIDIDLNGQNGHQHAIIYVEGGQYYGAKATINVWSPKIQQTNEFSLSQIWILGGTFGE

Query:  DLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNEIAMGASIFPISSYKGSQYDISLLIWKDPKEGNWWMQFGNKYVLGYWPAF
        DLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGF+QIN++IAMGASI P+S Y+ SQYDIS+LIWKDPKEG+WWMQFGN YVLGYWP+F
Subjt:  DLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNEIAMGASIFPISSYKGSQYDISLLIWKDPKEGNWWMQFGNKYVLGYWPAF

Query:  LFSYLTDSASMIEWGGEVVNSESDGQHTSTQMGSGHFPGEGFGKAGYFRNIQIVGESNSLRAPEDIGTFTEQPSCYDVQNGKSDDWGNYFFYGGPGRNPS
        LFSYLT+SASMIEWGGEVVNS+S+G HT TQMGSGHFP EGF KA YFRNIQ+V  SN+L+AP+ +GTFTE+ +CYDVQ G +DDWG+YF+YGGPG+N +
Subjt:  LFSYLTDSASMIEWGGEVVNSESDGQHTSTQMGSGHFPGEGFGKAGYFRNIQIVGESNSLRAPEDIGTFTEQPSCYDVQNGKSDDWGNYFFYGGPGRNPS

Query:  CP
        CP
Subjt:  CP

AT5G56530.1 Protein of Unknown Function (DUF239)4.5e-17268.4Show/hide
Query:  LLFLMLVSLSPATVAGKTNRHRHRRLEAQSHLKKLNKPAVKSIKSPDGDTIDCVHMAHQPAFDHPLLRNHTIQMKPNFHPEGILRDSKLSTKASKS-DAI
        L++     L   T AG+ +  R +  E   HL +LNKPAVKSI+SPDGD IDCVH++ QPAFDHP L++H IQM P++ PE +  +SK+S K  +S + I
Subjt:  LLFLMLVSLSPATVAGKTNRHRHRRLEAQSHLKKLNKPAVKSIKSPDGDTIDCVHMAHQPAFDHPLLRNHTIQMKPNFHPEGILRDSKLSTKASKS-DAI

Query:  TQLWHLKGSCPKGTIPIRRTKKEDILRANSVKSYGKKKPHATVKPTSIDIDLNGQNGHQHAIIYVEGGQYYGAKATINVWSPKIQQTNEFSLSQIWILGG
        TQLWH  G C +GTIP+RRTKKED+LRA+SVK YGKKK  +   P S D DL  Q+GHQHAI YVEGG++YGAKATINVW PK+Q +NEFSLSQ+WILGG
Subjt:  TQLWHLKGSCPKGTIPIRRTKKEDILRANSVKSYGKKKPHATVKPTSIDIDLNGQNGHQHAIIYVEGGQYYGAKATINVWSPKIQQTNEFSLSQIWILGG

Query:  TFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNEIAMGASIFPISSYKGSQYDISLLIWKDPKEGNWWMQFGNKYVLGY
        +FG+DLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGF+QIN++IAMGASI P+S +   QYDIS+ IWKDPKEG+WWMQFG+ YVLGY
Subjt:  TFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNEIAMGASIFPISSYKGSQYDISLLIWKDPKEGNWWMQFGNKYVLGY

Query:  WPAFLFSYLTDSASMIEWGGEVVNSESDGQHTSTQMGSGHFPGEGFGKAGYFRNIQIVGESNSLRAPEDIGTFTEQPSCYDVQNGKSDDWGNYFFYGGPG
        WP+FLFSYL DSAS++EWGGEVVN E DG HT+TQMGSG FP EGF KA YFRNIQ+V  SN+L+ P+ + TFTE+ +CYDV+ GK+DDWG+YF+YGGPG
Subjt:  WPAFLFSYLTDSASMIEWGGEVVNSESDGQHTSTQMGSGHFPGEGFGKAGYFRNIQIVGESNSLRAPEDIGTFTEQPSCYDVQNGKSDDWGNYFFYGGPG

Query:  RNPSC
        RNP+C
Subjt:  RNPSC

AT5G56530.2 Protein of Unknown Function (DUF239)4.5e-17268.4Show/hide
Query:  LLFLMLVSLSPATVAGKTNRHRHRRLEAQSHLKKLNKPAVKSIKSPDGDTIDCVHMAHQPAFDHPLLRNHTIQMKPNFHPEGILRDSKLSTKASKS-DAI
        L++     L   T AG+ +  R +  E   HL +LNKPAVKSI+SPDGD IDCVH++ QPAFDHP L++H IQM P++ PE +  +SK+S K  +S + I
Subjt:  LLFLMLVSLSPATVAGKTNRHRHRRLEAQSHLKKLNKPAVKSIKSPDGDTIDCVHMAHQPAFDHPLLRNHTIQMKPNFHPEGILRDSKLSTKASKS-DAI

Query:  TQLWHLKGSCPKGTIPIRRTKKEDILRANSVKSYGKKKPHATVKPTSIDIDLNGQNGHQHAIIYVEGGQYYGAKATINVWSPKIQQTNEFSLSQIWILGG
        TQLWH  G C +GTIP+RRTKKED+LRA+SVK YGKKK  +   P S D DL  Q+GHQHAI YVEGG++YGAKATINVW PK+Q +NEFSLSQ+WILGG
Subjt:  TQLWHLKGSCPKGTIPIRRTKKEDILRANSVKSYGKKKPHATVKPTSIDIDLNGQNGHQHAIIYVEGGQYYGAKATINVWSPKIQQTNEFSLSQIWILGG

Query:  TFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNEIAMGASIFPISSYKGSQYDISLLIWKDPKEGNWWMQFGNKYVLGY
        +FG+DLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGF+QIN++IAMGASI P+S +   QYDIS+ IWKDPKEG+WWMQFG+ YVLGY
Subjt:  TFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNEIAMGASIFPISSYKGSQYDISLLIWKDPKEGNWWMQFGNKYVLGY

Query:  WPAFLFSYLTDSASMIEWGGEVVNSESDGQHTSTQMGSGHFPGEGFGKAGYFRNIQIVGESNSLRAPEDIGTFTEQPSCYDVQNGKSDDWGNYFFYGGPG
        WP+FLFSYL DSAS++EWGGEVVN E DG HT+TQMGSG FP EGF KA YFRNIQ+V  SN+L+ P+ + TFTE+ +CYDV+ GK+DDWG+YF+YGGPG
Subjt:  WPAFLFSYLTDSASMIEWGGEVVNSESDGQHTSTQMGSGHFPGEGFGKAGYFRNIQIVGESNSLRAPEDIGTFTEQPSCYDVQNGKSDDWGNYFFYGGPG

Query:  RNPSC
        RNP+C
Subjt:  RNPSC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGTGTTCTTAAGCTCCTCCTCCTGCTCTTCTTAATGCTGGTCTCTTTATCTCCTGCCACGGTGGCCGGAAAAACCAACCGCCACCGTCACCGGCGCCTCGAAGCTCA
GTCCCACCTCAAGAAGCTGAATAAACCTGCTGTGAAATCCATCAAGAGTCCGGATGGGGATACAATCGATTGTGTTCATATGGCTCATCAACCAGCTTTTGATCATCCTC
TTCTCAGAAACCACACAATTCAGATGAAACCAAATTTTCATCCAGAAGGGATTTTAAGGGACAGTAAATTGTCTACAAAGGCTTCAAAATCAGATGCTATAACTCAATTA
TGGCACTTGAAAGGAAGCTGTCCAAAAGGGACAATTCCCATTAGAAGAACCAAAAAAGAAGACATTTTAAGAGCAAATTCAGTGAAAAGCTATGGAAAAAAGAAGCCTCA
TGCCACTGTAAAACCAACCTCCATTGATATTGATCTCAATGGACAAAATGGACATCAGCATGCAATCATATATGTTGAAGGAGGACAATACTATGGAGCTAAAGCAACTA
TAAATGTTTGGTCCCCAAAAATTCAACAAACAAATGAATTTAGTCTCTCACAGATCTGGATTCTTGGAGGAACTTTTGGGGAAGATCTTAATAGCATTGAAGCTGGTTGG
CAGGTCAGCCCTGATTTGTATGGAGATAACAACACTAGACTTTTCACTTATTGGACTAGTGATGCGTATCAAGCAACTGGTTGCTATAATCTGCTCTGTTCTGGGTTTGT
TCAAATCAATAATGAAATAGCCATGGGTGCCAGCATTTTTCCTATTTCTTCTTACAAAGGTTCTCAATATGACATCAGCTTGCTCATTTGGAAGGACCCTAAAGAAGGAA
ACTGGTGGATGCAATTTGGAAATAAGTACGTATTGGGGTATTGGCCGGCATTCTTATTTTCATATCTCACCGACAGCGCCTCCATGATCGAATGGGGCGGCGAAGTCGTA
AACTCTGAATCCGACGGCCAACATACTTCCACTCAGATGGGCAGCGGCCACTTCCCCGGCGAGGGCTTTGGCAAAGCCGGCTACTTTCGTAACATTCAGATTGTTGGAGA
ATCCAACAGCCTCCGGGCGCCGGAGGACATAGGAACTTTCACGGAGCAACCAAGTTGCTACGACGTTCAAAACGGCAAGTCCGACGACTGGGGCAATTACTTCTTCTACG
GCGGTCCGGGCAGAAATCCAAGCTGCCCCTGA
mRNA sequenceShow/hide mRNA sequence
GTTTTTATGGTAGTAAGCATTTGTGTGTGTGTTGGTGAAAATTTCTGGGCAGCCACAATTCTTAAAATATTAATATCTCTCTCTCTCCTTTTTCTCTATTTGCTAAGTTC
TGCTAAACACACTTATTTTGAGAAATTTGCTTCTATCCCATTCTACACAATTTCTCAACCAGCCATTTTTGCTTTTCAATTCCATAAAGTTCTCTTACTTTTTCCTCTCA
AAAATTCACTCCAATGGGTGTTCTTAAGCTCCTCCTCCTGCTCTTCTTAATGCTGGTCTCTTTATCTCCTGCCACGGTGGCCGGAAAAACCAACCGCCACCGTCACCGGC
GCCTCGAAGCTCAGTCCCACCTCAAGAAGCTGAATAAACCTGCTGTGAAATCCATCAAGAGTCCGGATGGGGATACAATCGATTGTGTTCATATGGCTCATCAACCAGCT
TTTGATCATCCTCTTCTCAGAAACCACACAATTCAGATGAAACCAAATTTTCATCCAGAAGGGATTTTAAGGGACAGTAAATTGTCTACAAAGGCTTCAAAATCAGATGC
TATAACTCAATTATGGCACTTGAAAGGAAGCTGTCCAAAAGGGACAATTCCCATTAGAAGAACCAAAAAAGAAGACATTTTAAGAGCAAATTCAGTGAAAAGCTATGGAA
AAAAGAAGCCTCATGCCACTGTAAAACCAACCTCCATTGATATTGATCTCAATGGACAAAATGGACATCAGCATGCAATCATATATGTTGAAGGAGGACAATACTATGGA
GCTAAAGCAACTATAAATGTTTGGTCCCCAAAAATTCAACAAACAAATGAATTTAGTCTCTCACAGATCTGGATTCTTGGAGGAACTTTTGGGGAAGATCTTAATAGCAT
TGAAGCTGGTTGGCAGGTCAGCCCTGATTTGTATGGAGATAACAACACTAGACTTTTCACTTATTGGACTAGTGATGCGTATCAAGCAACTGGTTGCTATAATCTGCTCT
GTTCTGGGTTTGTTCAAATCAATAATGAAATAGCCATGGGTGCCAGCATTTTTCCTATTTCTTCTTACAAAGGTTCTCAATATGACATCAGCTTGCTCATTTGGAAGGAC
CCTAAAGAAGGAAACTGGTGGATGCAATTTGGAAATAAGTACGTATTGGGGTATTGGCCGGCATTCTTATTTTCATATCTCACCGACAGCGCCTCCATGATCGAATGGGG
CGGCGAAGTCGTAAACTCTGAATCCGACGGCCAACATACTTCCACTCAGATGGGCAGCGGCCACTTCCCCGGCGAGGGCTTTGGCAAAGCCGGCTACTTTCGTAACATTC
AGATTGTTGGAGAATCCAACAGCCTCCGGGCGCCGGAGGACATAGGAACTTTCACGGAGCAACCAAGTTGCTACGACGTTCAAAACGGCAAGTCCGACGACTGGGGCAAT
TACTTCTTCTACGGCGGTCCGGGCAGAAATCCAAGCTGCCCCTGATTTGGATTCCATTCCATAATATTCTCTTACTTCTCTATAATATATATATATATATGCATTTCTTG
TGATTATTAATTATATTTTTAGTGGCTTTTTCTTTTTCTTTTTCTTTTTCTTTTTCTCTTGAGATGTTTAACTGATGATAAAGCTTCTCTCTCTCTCTCTCTCTCTCTGA
TAGAGCTTCATTTGTTTGTTTTCAGAAGCCTTAGAAACACAGCTGATGATTTGTTTTATCTCTTTGGAAAATGTGTATAAGTGTGCTTATTTTTCTT
Protein sequenceShow/hide protein sequence
MGVLKLLLLLFLMLVSLSPATVAGKTNRHRHRRLEAQSHLKKLNKPAVKSIKSPDGDTIDCVHMAHQPAFDHPLLRNHTIQMKPNFHPEGILRDSKLSTKASKSDAITQL
WHLKGSCPKGTIPIRRTKKEDILRANSVKSYGKKKPHATVKPTSIDIDLNGQNGHQHAIIYVEGGQYYGAKATINVWSPKIQQTNEFSLSQIWILGGTFGEDLNSIEAGW
QVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNEIAMGASIFPISSYKGSQYDISLLIWKDPKEGNWWMQFGNKYVLGYWPAFLFSYLTDSASMIEWGGEVV
NSESDGQHTSTQMGSGHFPGEGFGKAGYFRNIQIVGESNSLRAPEDIGTFTEQPSCYDVQNGKSDDWGNYFFYGGPGRNPSCP