CuGenDBv2

CSPI07G19780 (gene) of Cucumber (PI 183967) v1 genome

Gene ID: CSPI07G19780
Organism: Cucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
Description: Protein of Unknown Function (DUF239)
Genome location: Chr7:17295333..17298392
RNA-Seq Expression: CSPI07G19780
Synteny: CSPI07G19780
Gene Ontology terms: GO:0016874 - ligase activity (molecular function)
InterPro domains: IPR004314 - Neprosin; IPR025521 - Neprosin activation peptide


Homology
GenBank top hits (columns: hit, e value, % identity, alignment)
XP_004135896.1 uncharacterized protein LOC101218833 [Cucumis sativus] | e value: 8.8e-250 | % identity: 99.28
Query:  MGVLKLLFLFLLMLVSLSLPTVGGKTTLHRHRHRRRRLEVHSHLKKLNKPAVKSIKSPDGDIIDCVRMAHQPAFDHPLLKNHTIQMRPTFHPEGGILSDS
        MGVLKLLFLFLLMLVSL+LPTVGGKTTLHRHRHRRRRLEVHSHLKKLNKPAVKSIKSPDGDIIDCVRMAHQPAFDHPLLKNHTIQMRPTFHPEGGILSDS
Subjt:  MGVLKLLFLFLLMLVSLSLPTVGGKTTLHRHRHRRRRLEVHSHLKKLNKPAVKSIKSPDGDIIDCVRMAHQPAFDHPLLKNHTIQMRPTFHPEGGILSDS

Query:  KVSLKGSKSEDITQLWHLKGKCPKGTIPIRRTKKEDILRGNSVKSYGKKKPYATVKPNSIEIDLNGQNGHQHAIIYVEGGQYYGAKATINVWSPKIQQTN
        KVS+KGSKSEDITQLWHLKGKCPKGTIPIRRTKKEDILRGNSVKSYGKKKPYATVKPNSIE+DLNGQNGHQHAIIYVEGGQYYGAKATINVWSPKIQQTN
Subjt:  KVSLKGSKSEDITQLWHLKGKCPKGTIPIRRTKKEDILRGNSVKSYGKKKPYATVKPNSIEIDLNGQNGHQHAIIYVEGGQYYGAKATINVWSPKIQQTN

Query:  EFSLSQIWILGGTFGQDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNEIAMGASIFPISSYKSSQYDISLLIWKDPKEGNW
        EFSLSQIWILGGTFGQDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNEIAMGASIFPISSYKSSQYDISLLIWKDPKEGNW
Subjt:  EFSLSQIWILGGTFGQDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNEIAMGASIFPISSYKSSQYDISLLIWKDPKEGNW

Query:  WMQFGNKYVLGYWPAFLFSYLTDSASMIEWGGEVVNSESDGQHTSTQMGSGHFPGEGFGKAGYFRNIQIVGESNSLRAPEDIGIFTEQPSCYDVQNGKSD
        WMQFGNKYVLGYWPAFLFSYLTDSASMIEWGGEVVNSESDGQHTSTQMGSGHFPGEGFGKAGYFRNIQIVGESNSLRAPEDIGIFTEQPSCYDVQNGKSD
Subjt:  WMQFGNKYVLGYWPAFLFSYLTDSASMIEWGGEVVNSESDGQHTSTQMGSGHFPGEGFGKAGYFRNIQIVGESNSLRAPEDIGIFTEQPSCYDVQNGKSD

Query:  DWGNYFFYGGPGRNPNCP
        DWGNYFFYGGPGRNPNCP
Subjt:  DWGNYFFYGGPGRNPNCP

XP_008461220.1 PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein LOC103499872 [Cucumis melo] | e value: 5.5e-244 | % identity: 98.09
Query:  MGVLKLLFLFLLMLVSLSLPTVGGKTTLHRHRHRR-RRLEVHSHLKKLNKPAVKSIKSPDGDIIDCVRMAHQPAFDHPLLKNHTIQMRPTFHPEGGILSD
        MGVLKLLFLFLLMLVSLSLPTV GK TL RHRHRR RRLEVHSHLKKLNKPAVKSIKSPDGDIIDCV MAHQPAFDHPLLKNHTIQMRP FHPEGGILS+
Subjt:  MGVLKLLFLFLLMLVSLSLPTVGGKTTLHRHRHRR-RRLEVHSHLKKLNKPAVKSIKSPDGDIIDCVRMAHQPAFDHPLLKNHTIQMRPTFHPEGGILSD

Query:  SKVSLKGSKSEDITQLWHLKGKCPKGTIPIRRTKKEDILRGNSVKSYGKKKPYATVKPNSIEIDLNGQNGHQHAIIYVEGGQYYGAKATINVWSPKIQQT
        SKVSLKGSKSEDITQLWHLKGKCPKGTIPIRR KKEDILRGNSVKSYGKKKPYATVKPNSIEIDLNGQNGHQHAIIYVEGGQYYGAKATINVWSPKIQQT
Subjt:  SKVSLKGSKSEDITQLWHLKGKCPKGTIPIRRTKKEDILRGNSVKSYGKKKPYATVKPNSIEIDLNGQNGHQHAIIYVEGGQYYGAKATINVWSPKIQQT

Query:  NEFSLSQIWILGGTFGQDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNEIAMGASIFPISSYKSSQYDISLLIWKDPKEGN
        NEFSLSQIWILGGTFGQDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNEIAMGASIFPISSYKSSQYDISLLIWKDPKEGN
Subjt:  NEFSLSQIWILGGTFGQDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNEIAMGASIFPISSYKSSQYDISLLIWKDPKEGN

Query:  WWMQFGNKYVLGYWPAFLFSYLTDSASMIEWGGEVVNSESDGQHTSTQMGSGHFPGEGFGKAGYFRNIQIVGESNSLRAPEDIGIFTEQPSCYDVQNGKS
        WWMQFGNKYVLGYWPAFLFSYLTDSASMIEWGGEVVNSESDGQHTSTQMGSGHFPGEGFGKAGYFRNIQIVGESNSLRAPEDIGIFTEQPSCYDVQNGKS
Subjt:  WWMQFGNKYVLGYWPAFLFSYLTDSASMIEWGGEVVNSESDGQHTSTQMGSGHFPGEGFGKAGYFRNIQIVGESNSLRAPEDIGIFTEQPSCYDVQNGKS

Query:  DDWGNYFFYGGPGRNPNCP
        DDWGNYFFYGGPGRNPNCP
Subjt:  DDWGNYFFYGGPGRNPNCP

XP_022959691.1 uncharacterized protein LOC111460689 [Cucurbita moschata] | e value: 1.2e-214 | % identity: 85.41
Query:  MGVLKLLFLFLLMLVSLSLPTVGGKTTLHRHRHRRRRLEVHSHLKKLNKPAVKSIKSPDGDIIDCVRMAHQPAFDHPLLKNHTIQMRPTFHPEGGILSDS
        MG LKLL   LL++VSLS+    GK++ HRHRHR RR E   HLKKLNKPAVKSIKSPDGDIIDCV MAHQPAFDHPLLKNHTIQMRP FHPE G+LSD+
Subjt:  MGVLKLLFLFLLMLVSLSLPTVGGKTTLHRHRHRRRRLEVHSHLKKLNKPAVKSIKSPDGDIIDCVRMAHQPAFDHPLLKNHTIQMRPTFHPEGGILSDS

Query:  KVSLKGSKSEDITQLWHLKGKCPKGTIPIRRTKKEDILRGNSVKSYGKKKPYATVKPNSIEIDLNGQNGHQHAIIYVEGGQYYGAKATINVWSPKIQQTN
        K+S KGS+ + ITQ WHLKG+CPKGTIPIRRT K DILR NSVK+YG+KKP AT KP SI+IDLNGQ GHQHAI YVEGGQYYGAKATINVWSPKIQ  N
Subjt:  KVSLKGSKSEDITQLWHLKGKCPKGTIPIRRTKKEDILRGNSVKSYGKKKPYATVKPNSIEIDLNGQNGHQHAIIYVEGGQYYGAKATINVWSPKIQQTN

Query:  EFSLSQIWILGGTFGQDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNEIAMGASIFPISSYKSSQYDISLLIWKDPKEGNW
        EFSLSQIWILGGTFG+DLNSIEAGWQVSPDLYGDNNTR FTYWTSDAYQATGCYNLLCSGFVQINNEIA+GASI+PISSY+ SQYDISLLIWKDPKEGNW
Subjt:  EFSLSQIWILGGTFGQDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNEIAMGASIFPISSYKSSQYDISLLIWKDPKEGNW

Query:  WMQFGNKYVLGYWPAFLFSYLTDSASMIEWGGEVVNSESDGQHTSTQMGSGHFPGEGFGKAGYFRNIQIVGESNSLRAPEDIGIFTEQPSCYDVQNGKSD
        WMQFGN YVLGYWPAFLFSYLTD ASM+EWGGEVVNSESDGQHTSTQMGSGHFPGEGFGKAGYFRNIQ+V  SN+LR PEDIG FTEQPSCYDVQNGKS 
Subjt:  WMQFGNKYVLGYWPAFLFSYLTDSASMIEWGGEVVNSESDGQHTSTQMGSGHFPGEGFGKAGYFRNIQIVGESNSLRAPEDIGIFTEQPSCYDVQNGKSD

Query:  DWGNYFFYGGPGRNPNCP
        DWGNYFFYGGPGRNPNCP
Subjt:  DWGNYFFYGGPGRNPNCP

XP_023514011.1 uncharacterized protein LOC111778432 [Cucurbita pepo subsp. pepo] | e value: 7.8e-214 | % identity: 85.41
Query:  MGVLKLLFLFLLMLVSLSLPTVGGKTTLHRHRHRRRRLEVHSHLKKLNKPAVKSIKSPDGDIIDCVRMAHQPAFDHPLLKNHTIQMRPTFHPEGGILSDS
        MG LKLL   LL++VSLS+    GK++ HRHRHR RR E   HLKKLNKPAVKSIKSPDGDIIDCV MAHQPA DHPLLKNHTIQMRP FHPE G+LSD+
Subjt:  MGVLKLLFLFLLMLVSLSLPTVGGKTTLHRHRHRRRRLEVHSHLKKLNKPAVKSIKSPDGDIIDCVRMAHQPAFDHPLLKNHTIQMRPTFHPEGGILSDS

Query:  KVSLKGSKSEDITQLWHLKGKCPKGTIPIRRTKKEDILRGNSVKSYGKKKPYATVKPNSIEIDLNGQNGHQHAIIYVEGGQYYGAKATINVWSPKIQQTN
        K+S KGS+ + ITQ WHLKG+CPKGTIPIRRT K DILR NSVK+YGKKKP ATVKP SI+IDLNGQ GHQHAI YVEGGQYYGAKATINVWSPKIQ  N
Subjt:  KVSLKGSKSEDITQLWHLKGKCPKGTIPIRRTKKEDILRGNSVKSYGKKKPYATVKPNSIEIDLNGQNGHQHAIIYVEGGQYYGAKATINVWSPKIQQTN

Query:  EFSLSQIWILGGTFGQDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNEIAMGASIFPISSYKSSQYDISLLIWKDPKEGNW
        EFSLSQIWILGGTFG+DLNSIEAGWQVSPDLYGDNNTR FTYWTSDAYQATGCYNLLCS FVQINNEIA+GASI+PISSY+ SQYDISLLIWKDPKEGNW
Subjt:  EFSLSQIWILGGTFGQDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNEIAMGASIFPISSYKSSQYDISLLIWKDPKEGNW

Query:  WMQFGNKYVLGYWPAFLFSYLTDSASMIEWGGEVVNSESDGQHTSTQMGSGHFPGEGFGKAGYFRNIQIVGESNSLRAPEDIGIFTEQPSCYDVQNGKSD
        WMQFGN YVLGYWPAFLFSYLTD ASM+EWGGEVVNSESDGQHTSTQMGSGHFPGEGFGKAGYFRNIQ+V  SN+LR PEDIG FTEQPSCYDVQNGKS 
Subjt:  WMQFGNKYVLGYWPAFLFSYLTDSASMIEWGGEVVNSESDGQHTSTQMGSGHFPGEGFGKAGYFRNIQIVGESNSLRAPEDIGIFTEQPSCYDVQNGKSD

Query:  DWGNYFFYGGPGRNPNCP
        DWGNYFFYGGPGRNPNCP
Subjt:  DWGNYFFYGGPGRNPNCP

XP_038899904.1 uncharacterized protein LOC120087094 [Benincasa hispida] | e value: 2.0e-230 | % identity: 92.58
Query:  MGVLKLLFLFLLMLVSLSLPTVGGKTTLHRHRHRRRRLEVHSHLKKLNKPAVKSIKSPDGDIIDCVRMAHQPAFDHPLLKNHTIQMRPTFHPEGGILSDS
        MGVLKLLFL LLML+SLS PTV GKTT HRH    RRL+VHSHLKKLNKP VKSIKSPDGDIIDCV MAHQPAFDHPLL+NHTIQM P FHPE GIL +S
Subjt:  MGVLKLLFLFLLMLVSLSLPTVGGKTTLHRHRHRRRRLEVHSHLKKLNKPAVKSIKSPDGDIIDCVRMAHQPAFDHPLLKNHTIQMRPTFHPEGGILSDS

Query:  KVSLKGSKSEDITQLWHLKGKCPKGTIPIRRTKKEDILRGNSVKSYGKKKPYATVKPNSIEIDLNGQNGHQHAIIYVEGGQYYGAKATINVWSPKIQQTN
        KVS K SKS+DITQLWHLKG+CPKGTIPIRRTKKEDILR +SVKSYGKKKP+ATVKP SI+IDLNGQNGHQHAIIYVEGGQYYGAKATINVWSPKIQQTN
Subjt:  KVSLKGSKSEDITQLWHLKGKCPKGTIPIRRTKKEDILRGNSVKSYGKKKPYATVKPNSIEIDLNGQNGHQHAIIYVEGGQYYGAKATINVWSPKIQQTN

Query:  EFSLSQIWILGGTFGQDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNEIAMGASIFPISSYKSSQYDISLLIWKDPKEGNW
        EFSLSQIWILGGTFG+DLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNEIAMGASIFPISSYK SQYDISLLIWKDPKEGNW
Subjt:  EFSLSQIWILGGTFGQDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNEIAMGASIFPISSYKSSQYDISLLIWKDPKEGNW

Query:  WMQFGNKYVLGYWPAFLFSYLTDSASMIEWGGEVVNSESDGQHTSTQMGSGHFPGEGFGKAGYFRNIQIVGESNSLRAPEDIGIFTEQPSCYDVQNGKSD
        WMQFGNKYVLGYWPAFLFSYLTDSASMIEWGGEVVNSESDGQHTSTQMGSGHFPGEGFGKAGYFRNIQIVGESNSLRAPEDIG FTEQPSCYDV+NGKSD
Subjt:  WMQFGNKYVLGYWPAFLFSYLTDSASMIEWGGEVVNSESDGQHTSTQMGSGHFPGEGFGKAGYFRNIQIVGESNSLRAPEDIGIFTEQPSCYDVQNGKSD

Query:  DWGNYFFYGGPGRNPNCP
        DWGNYFFYGGPGRNPNCP
Subjt:  DWGNYFFYGGPGRNPNCP

TrEMBL top hits (columns: hit, e value, % identity, alignment)
A0A0A0KBL1 Uncharacterized protein | e value: 4.3e-250 | % identity: 99.28
Query:  MGVLKLLFLFLLMLVSLSLPTVGGKTTLHRHRHRRRRLEVHSHLKKLNKPAVKSIKSPDGDIIDCVRMAHQPAFDHPLLKNHTIQMRPTFHPEGGILSDS
        MGVLKLLFLFLLMLVSL+LPTVGGKTTLHRHRHRRRRLEVHSHLKKLNKPAVKSIKSPDGDIIDCVRMAHQPAFDHPLLKNHTIQMRPTFHPEGGILSDS
Subjt:  MGVLKLLFLFLLMLVSLSLPTVGGKTTLHRHRHRRRRLEVHSHLKKLNKPAVKSIKSPDGDIIDCVRMAHQPAFDHPLLKNHTIQMRPTFHPEGGILSDS

Query:  KVSLKGSKSEDITQLWHLKGKCPKGTIPIRRTKKEDILRGNSVKSYGKKKPYATVKPNSIEIDLNGQNGHQHAIIYVEGGQYYGAKATINVWSPKIQQTN
        KVS+KGSKSEDITQLWHLKGKCPKGTIPIRRTKKEDILRGNSVKSYGKKKPYATVKPNSIE+DLNGQNGHQHAIIYVEGGQYYGAKATINVWSPKIQQTN
Subjt:  KVSLKGSKSEDITQLWHLKGKCPKGTIPIRRTKKEDILRGNSVKSYGKKKPYATVKPNSIEIDLNGQNGHQHAIIYVEGGQYYGAKATINVWSPKIQQTN

Query:  EFSLSQIWILGGTFGQDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNEIAMGASIFPISSYKSSQYDISLLIWKDPKEGNW
        EFSLSQIWILGGTFGQDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNEIAMGASIFPISSYKSSQYDISLLIWKDPKEGNW
Subjt:  EFSLSQIWILGGTFGQDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNEIAMGASIFPISSYKSSQYDISLLIWKDPKEGNW

Query:  WMQFGNKYVLGYWPAFLFSYLTDSASMIEWGGEVVNSESDGQHTSTQMGSGHFPGEGFGKAGYFRNIQIVGESNSLRAPEDIGIFTEQPSCYDVQNGKSD
        WMQFGNKYVLGYWPAFLFSYLTDSASMIEWGGEVVNSESDGQHTSTQMGSGHFPGEGFGKAGYFRNIQIVGESNSLRAPEDIGIFTEQPSCYDVQNGKSD
Subjt:  WMQFGNKYVLGYWPAFLFSYLTDSASMIEWGGEVVNSESDGQHTSTQMGSGHFPGEGFGKAGYFRNIQIVGESNSLRAPEDIGIFTEQPSCYDVQNGKSD

Query:  DWGNYFFYGGPGRNPNCP
        DWGNYFFYGGPGRNPNCP
Subjt:  DWGNYFFYGGPGRNPNCP

A0A1S3CE73 LOW QUALITY PROTEIN: uncharacterized protein LOC103499872 | e value: 2.7e-244 | % identity: 98.09
Query:  MGVLKLLFLFLLMLVSLSLPTVGGKTTLHRHRHRR-RRLEVHSHLKKLNKPAVKSIKSPDGDIIDCVRMAHQPAFDHPLLKNHTIQMRPTFHPEGGILSD
        MGVLKLLFLFLLMLVSLSLPTV GK TL RHRHRR RRLEVHSHLKKLNKPAVKSIKSPDGDIIDCV MAHQPAFDHPLLKNHTIQMRP FHPEGGILS+
Subjt:  MGVLKLLFLFLLMLVSLSLPTVGGKTTLHRHRHRR-RRLEVHSHLKKLNKPAVKSIKSPDGDIIDCVRMAHQPAFDHPLLKNHTIQMRPTFHPEGGILSD

Query:  SKVSLKGSKSEDITQLWHLKGKCPKGTIPIRRTKKEDILRGNSVKSYGKKKPYATVKPNSIEIDLNGQNGHQHAIIYVEGGQYYGAKATINVWSPKIQQT
        SKVSLKGSKSEDITQLWHLKGKCPKGTIPIRR KKEDILRGNSVKSYGKKKPYATVKPNSIEIDLNGQNGHQHAIIYVEGGQYYGAKATINVWSPKIQQT
Subjt:  SKVSLKGSKSEDITQLWHLKGKCPKGTIPIRRTKKEDILRGNSVKSYGKKKPYATVKPNSIEIDLNGQNGHQHAIIYVEGGQYYGAKATINVWSPKIQQT

Query:  NEFSLSQIWILGGTFGQDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNEIAMGASIFPISSYKSSQYDISLLIWKDPKEGN
        NEFSLSQIWILGGTFGQDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNEIAMGASIFPISSYKSSQYDISLLIWKDPKEGN
Subjt:  NEFSLSQIWILGGTFGQDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNEIAMGASIFPISSYKSSQYDISLLIWKDPKEGN

Query:  WWMQFGNKYVLGYWPAFLFSYLTDSASMIEWGGEVVNSESDGQHTSTQMGSGHFPGEGFGKAGYFRNIQIVGESNSLRAPEDIGIFTEQPSCYDVQNGKS
        WWMQFGNKYVLGYWPAFLFSYLTDSASMIEWGGEVVNSESDGQHTSTQMGSGHFPGEGFGKAGYFRNIQIVGESNSLRAPEDIGIFTEQPSCYDVQNGKS
Subjt:  WWMQFGNKYVLGYWPAFLFSYLTDSASMIEWGGEVVNSESDGQHTSTQMGSGHFPGEGFGKAGYFRNIQIVGESNSLRAPEDIGIFTEQPSCYDVQNGKS

Query:  DDWGNYFFYGGPGRNPNCP
        DDWGNYFFYGGPGRNPNCP
Subjt:  DDWGNYFFYGGPGRNPNCP

A0A6J1DGY2 uncharacterized protein LOC111020390 | e value: 2.8e-209 | % identity: 85.27
Query:  LLFLFLLMLVSLSL-PTVGGKTTLHRHRHRRRRLEVHSHLKKLNKPAVKSIKSPDGDIIDCVRMAHQPAFDHPLLKNHTIQMRPTFHPEGGILSDSKVSL
        L  L  L L+SLSL P    KT+  RH     RLE H+HLKKLNKP VKSIKSPDGDIIDCV MAHQPAFDHPLL+NHTIQMRP FHPE GI  D+KVS 
Subjt:  LLFLFLLMLVSLSL-PTVGGKTTLHRHRHRRRRLEVHSHLKKLNKPAVKSIKSPDGDIIDCVRMAHQPAFDHPLLKNHTIQMRPTFHPEGGILSDSKVSL

Query:  KGSKSEDITQLWHLKGKCPKGTIPIRRTKKEDILRGNSVKSYGKKKPYATVKPNSIEIDLNGQNGHQHAIIYVEGGQYYGAKATINVWSPKIQQTNEFSL
          S+S+ ITQLWHLKG+CP+GTIPIRRTKK DILR +S+KSYGKKKP ATVKP SI+IDLNGQ GHQHAIIYVEGG+YYGAKATINVWSPKIQQTNEFSL
Subjt:  KGSKSEDITQLWHLKGKCPKGTIPIRRTKKEDILRGNSVKSYGKKKPYATVKPNSIEIDLNGQNGHQHAIIYVEGGQYYGAKATINVWSPKIQQTNEFSL

Query:  SQIWILGGTFGQDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNEIAMGASIFPISSYKSSQYDISLLIWKDPKEGNWWMQF
        SQIWILGGTFG+DLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINN+IAMGASI+PISSY+SSQYDISLLIWKDPKEGNWWMQF
Subjt:  SQIWILGGTFGQDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNEIAMGASIFPISSYKSSQYDISLLIWKDPKEGNWWMQF

Query:  GNKYVLGYWPAFLFSYLTDSASMIEWGGEVVNSESDGQHTSTQMGSGHFPGEGFGKAGYFRNIQIVGESNSLRAPEDIGIFTEQPSCYDVQNGKSDDWGN
        GN +VLGYWPAFLFSYLTDSASMIEWGGEVVNSESDGQHTSTQMGSGHFP EGFGKAGYFRNIQ+V  SN+L+APEDIG FTEQPSCYDVQNGKS +WGN
Subjt:  GNKYVLGYWPAFLFSYLTDSASMIEWGGEVVNSESDGQHTSTQMGSGHFPGEGFGKAGYFRNIQIVGESNSLRAPEDIGIFTEQPSCYDVQNGKSDDWGN

Query:  YFFYGGPGRNPNCP
        YFFYGGPGRNPNCP
Subjt:  YFFYGGPGRNPNCP

A0A6J1H5K4 uncharacterized protein LOC111460689 | e value: 5.8e-215 | % identity: 85.41
Query:  MGVLKLLFLFLLMLVSLSLPTVGGKTTLHRHRHRRRRLEVHSHLKKLNKPAVKSIKSPDGDIIDCVRMAHQPAFDHPLLKNHTIQMRPTFHPEGGILSDS
        MG LKLL   LL++VSLS+    GK++ HRHRHR RR E   HLKKLNKPAVKSIKSPDGDIIDCV MAHQPAFDHPLLKNHTIQMRP FHPE G+LSD+
Subjt:  MGVLKLLFLFLLMLVSLSLPTVGGKTTLHRHRHRRRRLEVHSHLKKLNKPAVKSIKSPDGDIIDCVRMAHQPAFDHPLLKNHTIQMRPTFHPEGGILSDS

Query:  KVSLKGSKSEDITQLWHLKGKCPKGTIPIRRTKKEDILRGNSVKSYGKKKPYATVKPNSIEIDLNGQNGHQHAIIYVEGGQYYGAKATINVWSPKIQQTN
        K+S KGS+ + ITQ WHLKG+CPKGTIPIRRT K DILR NSVK+YG+KKP AT KP SI+IDLNGQ GHQHAI YVEGGQYYGAKATINVWSPKIQ  N
Subjt:  KVSLKGSKSEDITQLWHLKGKCPKGTIPIRRTKKEDILRGNSVKSYGKKKPYATVKPNSIEIDLNGQNGHQHAIIYVEGGQYYGAKATINVWSPKIQQTN

Query:  EFSLSQIWILGGTFGQDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNEIAMGASIFPISSYKSSQYDISLLIWKDPKEGNW
        EFSLSQIWILGGTFG+DLNSIEAGWQVSPDLYGDNNTR FTYWTSDAYQATGCYNLLCSGFVQINNEIA+GASI+PISSY+ SQYDISLLIWKDPKEGNW
Subjt:  EFSLSQIWILGGTFGQDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNEIAMGASIFPISSYKSSQYDISLLIWKDPKEGNW

Query:  WMQFGNKYVLGYWPAFLFSYLTDSASMIEWGGEVVNSESDGQHTSTQMGSGHFPGEGFGKAGYFRNIQIVGESNSLRAPEDIGIFTEQPSCYDVQNGKSD
        WMQFGN YVLGYWPAFLFSYLTD ASM+EWGGEVVNSESDGQHTSTQMGSGHFPGEGFGKAGYFRNIQ+V  SN+LR PEDIG FTEQPSCYDVQNGKS 
Subjt:  WMQFGNKYVLGYWPAFLFSYLTDSASMIEWGGEVVNSESDGQHTSTQMGSGHFPGEGFGKAGYFRNIQIVGESNSLRAPEDIGIFTEQPSCYDVQNGKSD

Query:  DWGNYFFYGGPGRNPNCP
        DWGNYFFYGGPGRNPNCP
Subjt:  DWGNYFFYGGPGRNPNCP

A0A6J1JU76 uncharacterized protein LOC111488398 | e value: 3.3e-210 | % identity: 84.65
Query:  MGVLKLLFLFLLMLVSLSLPTVGGKTTLHRHRHRRRRLEVHSHLKKLNKPAVKSIKSPDGDIIDCVRMAHQPAFDHPLLKNHTIQMRPTFHPEGGILSDS
        MG  KLL L LLMLVSLSL    GK+  HRHRH  RRLE H+H+KKLNKP +KSIKSPDGDIIDCV MAHQPAFDHPLL+NHTIQMRP FHP+ G+ SDS
Subjt:  MGVLKLLFLFLLMLVSLSLPTVGGKTTLHRHRHRRRRLEVHSHLKKLNKPAVKSIKSPDGDIIDCVRMAHQPAFDHPLLKNHTIQMRPTFHPEGGILSDS

Query:  KVSLKGSKSEDITQLWHLKGKCPKGTIPIRRTKKEDILRGNSVKSYGKKKPYATVKPNSIEIDLNGQNGHQHAIIYVEGGQYYGAKATINVWSPKIQQTN
        KVS K S    +TQLWHLKG+CPKGTIPIRRTK++DILR +SV+SYGKK+ +ATVKPNSI+ID NGQNGHQHAI YVEGGQYYGAKAT+NVWSPKI+QTN
Subjt:  KVSLKGSKSEDITQLWHLKGKCPKGTIPIRRTKKEDILRGNSVKSYGKKKPYATVKPNSIEIDLNGQNGHQHAIIYVEGGQYYGAKATINVWSPKIQQTN

Query:  EFSLSQIWILGGTFGQDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNEIAMGASIFPISSYKSSQYDISLLIWKDPKEGNW
        EFSLSQIWILGGTFG+DLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINN+IAMGASI+PISSY+ SQYDISLLIWKDPKEGNW
Subjt:  EFSLSQIWILGGTFGQDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNEIAMGASIFPISSYKSSQYDISLLIWKDPKEGNW

Query:  WMQFGNKYVLGYWPAFLFSYLTDSASMIEWGGEVVNSESDGQHTSTQMGSGHFPGEGFGKAGYFRNIQIVGESNSLRAPEDIGIFTEQPSCYDVQNGKSD
        WMQFGN +VLGYWPAFLFSYLTDSASMIEWGGEVVNSE DGQHTSTQMGSGHFP +GF  A YFRNIQIVG SN+LRAPEDI  FTEQPSCYDVQ GKSD
Subjt:  WMQFGNKYVLGYWPAFLFSYLTDSASMIEWGGEVVNSESDGQHTSTQMGSGHFPGEGFGKAGYFRNIQIVGESNSLRAPEDIGIFTEQPSCYDVQNGKSD

Query:  DWGNYFFYGGPGRNPNC
        DWGNYFFYGGPGRNPNC
Subjt:  DWGNYFFYGGPGRNPNC

SwissProt top hits (columns: hit, e value, % identity, alignment)
No hits found

Arabidopsis top hits (columns: hit, e value, % identity, alignment)
AT1G55360.1 Protein of Unknown Function (DUF239) | e value: 1.1e-176 | % identity: 73.32
Query:  RRRLEVHSHLKKLNKPAVKSIKSPDGDIIDCVRMAHQPAFDHPLLKNHTIQMRPTFHPEGGILSDSKVSLKGSKSED--ITQLWHLKGKCPKGTIPIRRT
        +++ EV  HL +LNKPAVKSI+S DGD+IDCV ++ QPAFDHP LK+H IQM+P +HPE G+  D+KVS   S  ++  I QLWH  GKC +GTIP+RRT
Subjt:  RRRLEVHSHLKKLNKPAVKSIKSPDGDIIDCVRMAHQPAFDHPLLKNHTIQMRPTFHPEGGILSDSKVSLKGSKSED--ITQLWHLKGKCPKGTIPIRRT

Query:  KKEDILRGNSVKSYGKKKPYATVKPNSIEIDLNGQNGHQHAIIYVEGGQYYGAKATINVWSPKIQQTNEFSLSQIWILGGTFGQDLNSIEAGWQVSPDLY
        K++D+LR +SVK YGKKK  +   P S E DL  Q+GHQHAI YVEG +YYGAKATINVW PKIQQ NEFSLSQIW+LGG+FGQDLNSIEAGWQVSPDLY
Subjt:  KKEDILRGNSVKSYGKKKPYATVKPNSIEIDLNGQNGHQHAIIYVEGGQYYGAKATINVWSPKIQQTNEFSLSQIWILGGTFGQDLNSIEAGWQVSPDLY

Query:  GDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNEIAMGASIFPISSYKSSQYDISLLIWKDPKEGNWWMQFGNKYVLGYWPAFLFSYLTDSASMIEWGG
        GDNNTRLFTYWTSDAYQATGCYNLLCSGF+QIN++IAMGASI P+S Y++SQYDIS+LIWKDPKEG+WWMQFGN YVLGYWP+FLFSYLT+SASMIEWGG
Subjt:  GDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNEIAMGASIFPISSYKSSQYDISLLIWKDPKEGNWWMQFGNKYVLGYWPAFLFSYLTDSASMIEWGG

Query:  EVVNSESDGQHTSTQMGSGHFPGEGFGKAGYFRNIQIVGESNSLRAPEDIGIFTEQPSCYDVQNGKSDDWGNYFFYGGPGRNPNCP
        EVVNS+SDGQHTSTQMGSG FP EGF KA YFRNIQ+V  SN+L+AP+ +G FTEQ +CYDVQ G +DDWG+YF+YGGPG+N  CP
Subjt:  EVVNSESDGQHTSTQMGSGHFPGEGFGKAGYFRNIQIVGESNSLRAPEDIGIFTEQPSCYDVQNGKSDDWGNYFFYGGPGRNPNCP

AT2G44210.1 Protein of Unknown Function (DUF239) | e value: 6.9e-160 | % identity: 63.94
Query:  FLFLLMLVSLSLPT-VGGKTTLHRHRHRRRRLEVHSHLKKLNKPAVKSIKSPDGDIIDCVRMAHQPAFDHPLLKNHTIQMRPTFHPEGGILSDSKVS--L
        FL L+M V +  P+ V G+            L++ +HLK+LNKPA+KSIKSPDGD+IDCV +  QPAF HPLL NHT+QM P+ +PE  + S+SKVS   
Subjt:  FLFLLMLVSLSLPT-VGGKTTLHRHRHRRRRLEVHSHLKKLNKPAVKSIKSPDGDIIDCVRMAHQPAFDHPLLKNHTIQMRPTFHPEGGILSDSKVS--L

Query:  KGSKSEDITQLWHLKGKCPKGTIPIRRTKKEDILRGNSVKSYGKKKPYATVKPNSIE-IDLNGQNGHQHAIIYVEGGQYYGAKATINVWSPKIQQTNEFS
        K  +S  I QLWH+ GKCPK TIPIRRT+++D+ R +SV++YG K   +  KP S E  ++  QNGHQHAI+YVE G +YGAKA INVW P ++  NEFS
Subjt:  KGSKSEDITQLWHLKGKCPKGTIPIRRTKKEDILRGNSVKSYGKKKPYATVKPNSIE-IDLNGQNGHQHAIIYVEGGQYYGAKATINVWSPKIQQTNEFS

Query:  LSQIWILGGTFGQDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNEIAMGASIFPISSYKSSQYDISLLIWKDPKEGNWWMQ
        L+QIW+LGG F  DLNSIEAGWQVSP LYGDN TRLFTYWTSDAYQ TGCYNLLCSGFVQIN EIAMG SI P+S+Y +SQYDI++LIWKDPKEG+WW+Q
Subjt:  LSQIWILGGTFGQDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNEIAMGASIFPISSYKSSQYDISLLIWKDPKEGNWWMQ

Query:  FGNKYVLGYWPAFLFSYLTDSASMIEWGGEVVNSES-DGQHTSTQMGSGHFPGEGFGKAGYFRNIQIVGESNSLRAPEDIGIFTEQPSCYDVQNGKSDDW
        FG KY++GYWPA LFSYL++SASMIEWGGEVVNS+S +GQHT+TQMGSG F  EG+GKA YF+N+Q+V  SN LR PE++ +FT+Q +CY+V++G    W
Subjt:  FGNKYVLGYWPAFLFSYLTDSASMIEWGGEVVNSES-DGQHTSTQMGSGHFPGEGFGKAGYFRNIQIVGESNSLRAPEDIGIFTEQPSCYDVQNGKSDDW

Query:  GNYFFYGGPGRNPNCP
        G+YF+YGGPGRNPNCP
Subjt:  GNYFFYGGPGRNPNCP

AT3G13510.1 Protein of Unknown Function (DUF239) | e value: 2.4e-176 | % identity: 69.76
Query:  FLLMLVSLSLPTVGGKTTLHRHRHRRRRLEVHSHLKKLNKPAVKSIKSPDGDIIDCVRMAHQPAFDHPLLKNHTIQMRPTFHPEGGILSDSKVSLK-GSK
        F+ + V LSL           +   R++ EV  HL +LNKP VK+I+SPDGDIIDC+ ++ QPAFDHP LK+H IQMRP++HPE G+  D+KVS +   K
Subjt:  FLLMLVSLSLPTVGGKTTLHRHRHRRRRLEVHSHLKKLNKPAVKSIKSPDGDIIDCVRMAHQPAFDHPLLKNHTIQMRPTFHPEGGILSDSKVSLK-GSK

Query:  SEDITQLWHLKGKCPKGTIPIRRTKKEDILRGNSVKSYGKKKPYATVKPNSIEIDLNGQNGHQHAIIYVEGGQYYGAKATINVWSPKIQQTNEFSLSQIW
           I QLWH  GKC +GTIP+RRT+++D+LR +SVK YGKKK  +   P S E DL  QNGHQHAI YVEG +YYGAKAT+NVW PKIQ TNEFSLSQIW
Subjt:  SEDITQLWHLKGKCPKGTIPIRRTKKEDILRGNSVKSYGKKKPYATVKPNSIEIDLNGQNGHQHAIIYVEGGQYYGAKATINVWSPKIQQTNEFSLSQIW

Query:  ILGGTFGQDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNEIAMGASIFPISSYKSSQYDISLLIWKDPKEGNWWMQFGNKY
        +LGG+FGQDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGF+QIN++IAMGASI P+S Y++SQYDIS+LIWKDPKEG+WWMQFGN Y
Subjt:  ILGGTFGQDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNEIAMGASIFPISSYKSSQYDISLLIWKDPKEGNWWMQFGNKY

Query:  VLGYWPAFLFSYLTDSASMIEWGGEVVNSESDGQHTSTQMGSGHFPGEGFGKAGYFRNIQIVGESNSLRAPEDIGIFTEQPSCYDVQNGKSDDWGNYFFY
        VLGYWP+FLFSYLT+SASMIEWGGEVVNS+S+G HT TQMGSGHFP EGF KA YFRNIQ+V  SN+L+AP+ +G FTE+ +CYDVQ G +DDWG+YF+Y
Subjt:  VLGYWPAFLFSYLTDSASMIEWGGEVVNSESDGQHTSTQMGSGHFPGEGFGKAGYFRNIQIVGESNSLRAPEDIGIFTEQPSCYDVQNGKSDDWGNYFFY

Query:  GGPGRNPNCP
        GGPG+N NCP
Subjt:  GGPGRNPNCP

AT5G56530.1 Protein of Unknown Function (DUF239) | e value: 2.1e-172 | % identity: 71.61
Query:  RRRLEVHSHLKKLNKPAVKSIKSPDGDIIDCVRMAHQPAFDHPLLKNHTIQMRPTFHPEGGILSDSKVSLKGSKS-EDITQLWHLKGKCPKGTIPIRRTK
        R+  EVH HL +LNKPAVKSI+SPDGDIIDCV ++ QPAFDHP LK+H IQM P++ PE  +  +SKVS K  +S   ITQLWH  G C +GTIP+RRTK
Subjt:  RRRLEVHSHLKKLNKPAVKSIKSPDGDIIDCVRMAHQPAFDHPLLKNHTIQMRPTFHPEGGILSDSKVSLKGSKS-EDITQLWHLKGKCPKGTIPIRRTK

Query:  KEDILRGNSVKSYGKKKPYATVKPNSIEIDLNGQNGHQHAIIYVEGGQYYGAKATINVWSPKIQQTNEFSLSQIWILGGTFGQDLNSIEAGWQVSPDLYG
        KED+LR +SVK YGKKK  +   P S + DL  Q+GHQHAI YVEGG++YGAKATINVW PK+Q +NEFSLSQ+WILGG+FGQDLNSIEAGWQVSPDLYG
Subjt:  KEDILRGNSVKSYGKKKPYATVKPNSIEIDLNGQNGHQHAIIYVEGGQYYGAKATINVWSPKIQQTNEFSLSQIWILGGTFGQDLNSIEAGWQVSPDLYG

Query:  DNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNEIAMGASIFPISSYKSSQYDISLLIWKDPKEGNWWMQFGNKYVLGYWPAFLFSYLTDSASMIEWGGE
        DNNTRLFTYWTSDAYQATGCYNLLCSGF+QIN++IAMGASI P+S + + QYDIS+ IWKDPKEG+WWMQFG+ YVLGYWP+FLFSYL DSAS++EWGGE
Subjt:  DNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNEIAMGASIFPISSYKSSQYDISLLIWKDPKEGNWWMQFGNKYVLGYWPAFLFSYLTDSASMIEWGGE

Query:  VVNSESDGQHTSTQMGSGHFPGEGFGKAGYFRNIQIVGESNSLRAPEDIGIFTEQPSCYDVQNGKSDDWGNYFFYGGPGRNPNC
        VVN E DG HT+TQMGSG FP EGF KA YFRNIQ+V  SN+L+ P+ +  FTE+ +CYDV+ GK+DDWG+YF+YGGPGRNPNC
Subjt:  VVNSESDGQHTSTQMGSGHFPGEGFGKAGYFRNIQIVGESNSLRAPEDIGIFTEQPSCYDVQNGKSDDWGNYFFYGGPGRNPNC

AT5G56530.2 Protein of Unknown Function (DUF239) | e value: 2.1e-172 | % identity: 71.61
Query:  RRRLEVHSHLKKLNKPAVKSIKSPDGDIIDCVRMAHQPAFDHPLLKNHTIQMRPTFHPEGGILSDSKVSLKGSKS-EDITQLWHLKGKCPKGTIPIRRTK
        R+  EVH HL +LNKPAVKSI+SPDGDIIDCV ++ QPAFDHP LK+H IQM P++ PE  +  +SKVS K  +S   ITQLWH  G C +GTIP+RRTK
Subjt:  RRRLEVHSHLKKLNKPAVKSIKSPDGDIIDCVRMAHQPAFDHPLLKNHTIQMRPTFHPEGGILSDSKVSLKGSKS-EDITQLWHLKGKCPKGTIPIRRTK

Query:  KEDILRGNSVKSYGKKKPYATVKPNSIEIDLNGQNGHQHAIIYVEGGQYYGAKATINVWSPKIQQTNEFSLSQIWILGGTFGQDLNSIEAGWQVSPDLYG
        KED+LR +SVK YGKKK  +   P S + DL  Q+GHQHAI YVEGG++YGAKATINVW PK+Q +NEFSLSQ+WILGG+FGQDLNSIEAGWQVSPDLYG
Subjt:  KEDILRGNSVKSYGKKKPYATVKPNSIEIDLNGQNGHQHAIIYVEGGQYYGAKATINVWSPKIQQTNEFSLSQIWILGGTFGQDLNSIEAGWQVSPDLYG

Query:  DNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNEIAMGASIFPISSYKSSQYDISLLIWKDPKEGNWWMQFGNKYVLGYWPAFLFSYLTDSASMIEWGGE
        DNNTRLFTYWTSDAYQATGCYNLLCSGF+QIN++IAMGASI P+S + + QYDIS+ IWKDPKEG+WWMQFG+ YVLGYWP+FLFSYL DSAS++EWGGE
Subjt:  DNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNEIAMGASIFPISSYKSSQYDISLLIWKDPKEGNWWMQFGNKYVLGYWPAFLFSYLTDSASMIEWGGE

Query:  VVNSESDGQHTSTQMGSGHFPGEGFGKAGYFRNIQIVGESNSLRAPEDIGIFTEQPSCYDVQNGKSDDWGNYFFYGGPGRNPNC
        VVN E DG HT+TQMGSG FP EGF KA YFRNIQ+V  SN+L+ P+ +  FTE+ +CYDV+ GK+DDWG+YF+YGGPGRNPNC
Subjt:  VVNSESDGQHTSTQMGSGHFPGEGFGKAGYFRNIQIVGESNSLRAPEDIGIFTEQPSCYDVQNGKSDDWGNYFFYGGPGRNPNC
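
The % identity values listed with each hit can be related to the alignment rows: in the middle line of each block, a repeated letter marks an identical residue, "+" marks a conservative substitution, and a space marks a mismatch or gap. The following is a minimal Python sketch of that calculation. It assumes identity is the number of identical columns divided by the alignment length; the function name percent_identity is hypothetical and is not part of CuGenDBv2 or BLAST. The 10-column excerpt comes from the second block of the first GenBank hit, so its result describes only that excerpt, not the whole hit.

```python
def percent_identity(query: str, midline: str) -> float:
    """Percentage of alignment columns whose midline repeats the query residue."""
    if len(query) != len(midline):
        raise ValueError("query and midline must have the same length")
    if not query:
        raise ValueError("empty alignment")
    # A column is identical when the midline shows the same letter as the query.
    # '+' (conservative substitution), ' ' (mismatch) and gap columns do not count.
    matches = sum(1 for q, m in zip(query, midline) if q == m and q != "-")
    return 100.0 * matches / len(query)


# Illustrative excerpt only (first 10 columns of one alignment block).
query = "KVSLKGSKSE"
midline = "KVS+KGSKSE"
print(f"{percent_identity(query, midline):.2f}")
```

To reproduce a full-hit figure, the Query and midline rows of every block for that hit would need to be concatenated before calling the function.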


Sequences
CDS sequence
ATGGGTGTTCTTAAGCTCCTCTTCCTTTTCCTCTTAATGCTGGTTTCTTTATCTCTCCCCACGGTGGGCGGAAAAACCACGCTTCACCGCCATCGTCACCGCCGCCGTCG
CCTTGAAGTTCACTCCCACTTGAAGAAACTGAATAAACCTGCTGTGAAATCCATAAAGAGTCCAGATGGGGATATTATTGATTGTGTTCGTATGGCTCATCAACCAGCTT
TTGATCATCCTCTTCTCAAAAACCACACAATTCAGATGAGACCAACTTTTCATCCAGAAGGAGGGATTTTAAGTGACAGTAAAGTGTCATTAAAAGGTTCAAAATCAGAG
GATATAACTCAATTATGGCACTTAAAAGGAAAATGTCCAAAAGGGACAATTCCCATTAGAAGAACAAAAAAAGAAGACATTTTAAGAGGAAATTCAGTGAAAAGCTATGG
AAAAAAGAAGCCTTATGCAACTGTGAAACCAAACTCCATTGAAATTGATCTTAATGGACAAAATGGACATCAGCATGCAATCATATATGTTGAAGGAGGACAATACTATG
GAGCTAAAGCAACTATTAATGTTTGGTCACCAAAAATTCAACAAACAAATGAATTTAGCCTCTCACAGATCTGGATTCTTGGAGGAACTTTTGGGCAAGATCTTAATAGT
ATTGAAGCTGGTTGGCAGGTCAGCCCTGATTTGTATGGAGATAATAACACTCGACTTTTCACTTATTGGACCAGTGATGCATATCAAGCAACTGGTTGCTATAATCTGCT
GTGTTCTGGGTTTGTTCAAATCAATAATGAAATAGCAATGGGTGCCAGCATTTTTCCTATTTCTTCCTACAAAAGTTCTCAATATGACATCAGCTTGCTCATTTGGAAGG
ACCCTAAAGAAGGAAACTGGTGGATGCAATTCGGAAATAAGTACGTTTTAGGGTATTGGCCGGCCTTCTTATTTTCTTATCTCACCGACAGCGCCTCCATGATCGAATGG
GGCGGCGAAGTCGTTAACTCTGAATCCGACGGCCAACATACTTCCACCCAGATGGGTAGCGGCCACTTCCCCGGTGAGGGCTTCGGCAAAGCCGGCTACTTCCGTAACAT
TCAGATTGTAGGAGAATCAAACAGCCTAAGGGCGCCGGAGGACATAGGAATTTTCACGGAGCAACCAAGTTGTTACGATGTTCAGAACGGCAAGTCCGACGACTGGGGCA
ATTATTTCTTCTACGGCGGTCCGGGCAGAAATCCTAACTGCCCCTAA
mRNA sequence
TAATTCTTAAAATATTAATATCTCTCTTCTCTCTCTCTCTCTCTCTGTCTCTCTCTGCTAAGTCATGCTCTAAAACACACTTATTGAGAAATTAGCTGCTCCCATTCTTC
ACAATTTCTCAAAATTTTCTTTTCAATTCCATAAAGTTCTCTTACTTTTTAGTTTTTCGTCTCAAAATTTCACTGAAAATGGGTGTTCTTAAGCTCCTCTTCCTTTTCCT
CTTAATGCTGGTTTCTTTATCTCTCCCCACGGTGGGCGGAAAAACCACGCTTCACCGCCATCGTCACCGCCGCCGTCGCCTTGAAGTTCACTCCCACTTGAAGAAACTGA
ATAAACCTGCTGTGAAATCCATAAAGAGTCCAGATGGGGATATTATTGATTGTGTTCGTATGGCTCATCAACCAGCTTTTGATCATCCTCTTCTCAAAAACCACACAATT
CAGATGAGACCAACTTTTCATCCAGAAGGAGGGATTTTAAGTGACAGTAAAGTGTCATTAAAAGGTTCAAAATCAGAGGATATAACTCAATTATGGCACTTAAAAGGAAA
ATGTCCAAAAGGGACAATTCCCATTAGAAGAACAAAAAAAGAAGACATTTTAAGAGGAAATTCAGTGAAAAGCTATGGAAAAAAGAAGCCTTATGCAACTGTGAAACCAA
ACTCCATTGAAATTGATCTTAATGGACAAAATGGACATCAGCATGCAATCATATATGTTGAAGGAGGACAATACTATGGAGCTAAAGCAACTATTAATGTTTGGTCACCA
AAAATTCAACAAACAAATGAATTTAGCCTCTCACAGATCTGGATTCTTGGAGGAACTTTTGGGCAAGATCTTAATAGTATTGAAGCTGGTTGGCAGGTCAGCCCTGATTT
GTATGGAGATAATAACACTCGACTTTTCACTTATTGGACCAGTGATGCATATCAAGCAACTGGTTGCTATAATCTGCTGTGTTCTGGGTTTGTTCAAATCAATAATGAAA
TAGCAATGGGTGCCAGCATTTTTCCTATTTCTTCCTACAAAAGTTCTCAATATGACATCAGCTTGCTCATTTGGAAGGACCCTAAAGAAGGAAACTGGTGGATGCAATTC
GGAAATAAGTACGTTTTAGGGTATTGGCCGGCCTTCTTATTTTCTTATCTCACCGACAGCGCCTCCATGATCGAATGGGGCGGCGAAGTCGTTAACTCTGAATCCGACGG
CCAACATACTTCCACCCAGATGGGTAGCGGCCACTTCCCCGGTGAGGGCTTCGGCAAAGCCGGCTACTTCCGTAACATTCAGATTGTAGGAGAATCAAACAGCCTAAGGG
CGCCGGAGGACATAGGAATTTTCACGGAGCAACCAAGTTGTTACGATGTTCAGAACGGCAAGTCCGACGACTGGGGCAATTATTTCTTCTACGGCGGTCCGGGCAGAAAT
CCTAACTGCCCCTAATTTGGATTCTTCTTCAATATAATTATATATATATAGTGTTATAAATATATATATATATATAAATGCATTTCTTGTGCATTATTATTAATTATATT
TTAGTGGGGG
Protein sequence
MGVLKLLFLFLLMLVSLSLPTVGGKTTLHRHRHRRRRLEVHSHLKKLNKPAVKSIKSPDGDIIDCVRMAHQPAFDHPLLKNHTIQMRPTFHPEGGILSDSKVSLKGSKSE
DITQLWHLKGKCPKGTIPIRRTKKEDILRGNSVKSYGKKKPYATVKPNSIEIDLNGQNGHQHAIIYVEGGQYYGAKATINVWSPKIQQTNEFSLSQIWILGGTFGQDLNS
IEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNEIAMGASIFPISSYKSSQYDISLLIWKDPKEGNWWMQFGNKYVLGYWPAFLFSYLTDSASMIEW
GGEVVNSESDGQHTSTQMGSGHFPGEGFGKAGYFRNIQIVGESNSLRAPEDIGIFTEQPSCYDVQNGKSDDWGNYFFYGGPGRNPNCP
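
The protein sequence above should correspond to the CDS sequence read codon by codon. The following is a minimal Python sketch of that check, assuming the standard genetic code; the translate helper is hypothetical and is not a CuGenDBv2 tool. To keep the example short, it uses only the first 30 and the last 18 nucleotides of the CDS, copied from the record.

```python
from itertools import product

# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {"".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]  # raises KeyError on non-ACGT input
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


cds_start = "ATGGGTGTTCTTAAGCTCCTCTTCCTTTTC"  # first 30 nt of the CDS
cds_end = "AATCCTAACTGCCCCTAA"                # last 18 nt, including the stop codon
print(translate(cds_start))  # MGVLKLLFLF
print(translate(cds_end))    # NPNCP
```

Both results agree with the start (MGVLKLLFLF...) and end (...NPNCP) of the protein sequence listed above.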