; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0002828 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0002828
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionProtein of Unknown Function (DUF239)
Genome locationchr4:46019110..46022472
RNA-Seq ExpressionLag0002828
SyntenyLag0002828
Gene Ontology termsGO:0016874 - ligase activity (molecular function)
InterPro domainsIPR004314 - Neprosin
IPR025521 - Neprosin activation peptide


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6593345.1 hypothetical protein SDJN03_12821, partial [Cucurbita argyrosperma subsp. sororia]7.6e-22289.45Show/hide
Query:  MGSLKLLLLLMLVSFSVAA---AGKTTRHRH----RRLEAQAHLKKLNKPAVKSIKSPDGDIIDCVHMAHQPAFDHPLLKNHTIQMRPNFHPEGIFKDSK
        MG+LKLLLLL +VS SVAA   AGK++ HRH    RR EAQ HLKKLNKPAVKSIKSPDGDIIDCVHMAHQPAFDHPLLKNHTIQMRPNFHPEG+  D+K
Subjt:  MGSLKLLLLLMLVSFSVAA---AGKTTRHRH----RRLEAQAHLKKLNKPAVKSIKSPDGDIIDCVHMAHQPAFDHPLLKNHTIQMRPNFHPEGIFKDSK

Query:  VSSKASKSNPITQLWHLKGRCPKGTIPIRRTKKDDILRANSVKSYGKKKPQATVKPTSIDIDLNGQTGHQHAIIYVEGGQYYGAKATINVWSPKIQQTNE
        +SSK S+  PITQ WHLKGRCP+GTIPIRRT K DILRANSVK+YGKKKPQAT KPTSIDIDLNGQTGHQHAI YVEGGQYYGAKATINVWSPKIQ  NE
Subjt:  VSSKASKSNPITQLWHLKGRCPKGTIPIRRTKKDDILRANSVKSYGKKKPQATVKPTSIDIDLNGQTGHQHAIIYVEGGQYYGAKATINVWSPKIQQTNE

Query:  FSLSQIWILGGTFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNQIAMGASIYPISSYKSSQYDISLLIWKDPKEGNWW
        FSLSQIWILGGTFGEDLNSIEAGWQVSPDLYGDNNTR FTYWTSDAYQATGCYNLLCSGFVQINN+IA+GASIYPISSY+ SQYDISLLIWKDPKEGNWW
Subjt:  FSLSQIWILGGTFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNQIAMGASIYPISSYKSSQYDISLLIWKDPKEGNWW

Query:  MQFGNSYVLGYWPAFLFSYLTDSASMVEWGGEVVNSEADGEHTSTQMGSGHFPDEGFGKAGYFRNIQVVDGSNNLRQPEDIGTFTEQPSCYDVQTGKSGD
        MQFGNSYVLGYWPAFLFSYLTD ASMVEWGGEVVNSE+DG+HTSTQMGSGHFP EGFGKAGYFRNIQVVDGSNNLRQPEDIGTFTEQPSCYDVQ GKSGD
Subjt:  MQFGNSYVLGYWPAFLFSYLTDSASMVEWGGEVVNSEADGEHTSTQMGSGHFPDEGFGKAGYFRNIQVVDGSNNLRQPEDIGTFTEQPSCYDVQTGKSGD

Query:  WGNYFFYGGPGRNPNCP
        WGNYFFYGGPGRNPNCP
Subjt:  WGNYFFYGGPGRNPNCP

XP_022152744.1 uncharacterized protein LOC111020390 [Momordica charantia]2.8e-22491.2Show/hide
Query:  LKLLLLLMLVSFSVA--AAGKTTRHRHRRLEAQAHLKKLNKPAVKSIKSPDGDIIDCVHMAHQPAFDHPLLKNHTIQMRPNFHPEGIFKDSKVSSKASKS
        L LLL L L+S S+A  AA KT+R RH RLEA  HLKKLNKP VKSIKSPDGDIIDCVHMAHQPAFDHPLL+NHTIQMRPNFHPEGIFKD+KVSS  S+S
Subjt:  LKLLLLLMLVSFSVA--AAGKTTRHRHRRLEAQAHLKKLNKPAVKSIKSPDGDIIDCVHMAHQPAFDHPLLKNHTIQMRPNFHPEGIFKDSKVSSKASKS

Query:  NPITQLWHLKGRCPKGTIPIRRTKKDDILRANSVKSYGKKKPQATVKPTSIDIDLNGQTGHQHAIIYVEGGQYYGAKATINVWSPKIQQTNEFSLSQIWI
         PITQLWHLKGRCP+GTIPIRRTKK DILRA+S+KSYGKKKPQATVKPTSIDIDLNGQTGHQHAIIYVEGG+YYGAKATINVWSPKIQQTNEFSLSQIWI
Subjt:  NPITQLWHLKGRCPKGTIPIRRTKKDDILRANSVKSYGKKKPQATVKPTSIDIDLNGQTGHQHAIIYVEGGQYYGAKATINVWSPKIQQTNEFSLSQIWI

Query:  LGGTFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNQIAMGASIYPISSYKSSQYDISLLIWKDPKEGNWWMQFGNSYV
        LGGTFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNQIAMGASIYPISSY+SSQYDISLLIWKDPKEGNWWMQFGNS+V
Subjt:  LGGTFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNQIAMGASIYPISSYKSSQYDISLLIWKDPKEGNWWMQFGNSYV

Query:  LGYWPAFLFSYLTDSASMVEWGGEVVNSEADGEHTSTQMGSGHFPDEGFGKAGYFRNIQVVDGSNNLRQPEDIGTFTEQPSCYDVQTGKSGDWGNYFFYG
        LGYWPAFLFSYLTDSASM+EWGGEVVNSE+DG+HTSTQMGSGHFP+EGFGKAGYFRNIQVVDGSNNL+ PEDIGTFTEQPSCYDVQ GKSG+WGNYFFYG
Subjt:  LGYWPAFLFSYLTDSASMVEWGGEVVNSEADGEHTSTQMGSGHFPDEGFGKAGYFRNIQVVDGSNNLRQPEDIGTFTEQPSCYDVQTGKSGDWGNYFFYG

Query:  GPGRNPNCP
        GPGRNPNCP
Subjt:  GPGRNPNCP

XP_022959691.1 uncharacterized protein LOC111460689 [Cucurbita moschata]3.4e-22289.13Show/hide
Query:  MGSLKLLLLLMLVSFSVAAAGKTTRHRH----RRLEAQAHLKKLNKPAVKSIKSPDGDIIDCVHMAHQPAFDHPLLKNHTIQMRPNFHPEGIFKDSKVSS
        MG+LKLLLLL++     AAAGK++ HRH    RR EAQ HLKKLNKPAVKSIKSPDGDIIDCVHMAHQPAFDHPLLKNHTIQMRPNFHPEG+  D+K+SS
Subjt:  MGSLKLLLLLMLVSFSVAAAGKTTRHRH----RRLEAQAHLKKLNKPAVKSIKSPDGDIIDCVHMAHQPAFDHPLLKNHTIQMRPNFHPEGIFKDSKVSS

Query:  KASKSNPITQLWHLKGRCPKGTIPIRRTKKDDILRANSVKSYGKKKPQATVKPTSIDIDLNGQTGHQHAIIYVEGGQYYGAKATINVWSPKIQQTNEFSL
        K S+  PITQ WHLKGRCPKGTIPIRRT K DILRANSVK+YG+KKPQAT KPTSIDIDLNGQTGHQHAI YVEGGQYYGAKATINVWSPKIQ  NEFSL
Subjt:  KASKSNPITQLWHLKGRCPKGTIPIRRTKKDDILRANSVKSYGKKKPQATVKPTSIDIDLNGQTGHQHAIIYVEGGQYYGAKATINVWSPKIQQTNEFSL

Query:  SQIWILGGTFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNQIAMGASIYPISSYKSSQYDISLLIWKDPKEGNWWMQF
        SQIWILGGTFGEDLNSIEAGWQVSPDLYGDNNTR FTYWTSDAYQATGCYNLLCSGFVQINN+IA+GASIYPISSY+ SQYDISLLIWKDPKEGNWWMQF
Subjt:  SQIWILGGTFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNQIAMGASIYPISSYKSSQYDISLLIWKDPKEGNWWMQF

Query:  GNSYVLGYWPAFLFSYLTDSASMVEWGGEVVNSEADGEHTSTQMGSGHFPDEGFGKAGYFRNIQVVDGSNNLRQPEDIGTFTEQPSCYDVQTGKSGDWGN
        GNSYVLGYWPAFLFSYLTD ASMVEWGGEVVNSE+DG+HTSTQMGSGHFP EGFGKAGYFRNIQVVDGSNNLRQPEDIGTFTEQPSCYDVQ GKSGDWGN
Subjt:  GNSYVLGYWPAFLFSYLTDSASMVEWGGEVVNSEADGEHTSTQMGSGHFPDEGFGKAGYFRNIQVVDGSNNLRQPEDIGTFTEQPSCYDVQTGKSGDWGN

Query:  YFFYGGPGRNPNCP
        YFFYGGPGRNPNCP
Subjt:  YFFYGGPGRNPNCP

XP_023004212.1 uncharacterized protein LOC111497611 [Cucurbita maxima]9.0e-22390.49Show/hide
Query:  MGSLKLLLLLMLVSFSVAAAGKTTRHRHRRLEAQAHLKKLNKPAVKSIKSPDGDIIDCVHMAHQPAFDHPLLKNHTIQMRPNFHPEGIFKDSKVSSKASK
        MG+LKLLLLL +VS SVAAAGK++ HR RR EAQ HLKKLNKPAVKSIKSPDGDIIDCVHMAHQPAFDHPLLKNHTIQMRPNFHPEG+   +K+SSK S+
Subjt:  MGSLKLLLLLMLVSFSVAAAGKTTRHRHRRLEAQAHLKKLNKPAVKSIKSPDGDIIDCVHMAHQPAFDHPLLKNHTIQMRPNFHPEGIFKDSKVSSKASK

Query:  SNPITQLWHLKGRCPKGTIPIRRTKKDDILRANSVKSYGKKKPQATVKPTSIDIDLNGQTGHQHAIIYVEGGQYYGAKATINVWSPKIQQTNEFSLSQIW
          PITQ WHLKGRCP+GTIPIRRT K DILRANSVKSYGKKKPQATV+PTSIDIDLNGQTGHQHAI YVEGGQYYGAKATINVWSPKIQ  NEFSLSQIW
Subjt:  SNPITQLWHLKGRCPKGTIPIRRTKKDDILRANSVKSYGKKKPQATVKPTSIDIDLNGQTGHQHAIIYVEGGQYYGAKATINVWSPKIQQTNEFSLSQIW

Query:  ILGGTFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNQIAMGASIYPISSYKSSQYDISLLIWKDPKEGNWWMQFGNSY
        ILGGTFGEDLNSIEAGWQVSPDLYGDNNTR FTYWTSDAYQATGCYNLLCSGFVQINN+IA+GASIYPISSY+ SQYDISLLIWKDPKEGNWWMQFGNSY
Subjt:  ILGGTFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNQIAMGASIYPISSYKSSQYDISLLIWKDPKEGNWWMQFGNSY

Query:  VLGYWPAFLFSYLTDSASMVEWGGEVVNSEADGEHTSTQMGSGHFPDEGFGKAGYFRNIQVVDGSNNLRQPEDIGTFTEQPSCYDVQTGKSGDWGNYFFY
        VLGYWPAFLFSYLTD ASMVEWGGEVVNSE+ G+HTSTQMGSGHFP EGFGKAGYFRNIQVVDGSNNLRQPEDIGTFTEQPSCYDVQ GKSGDWGNYFFY
Subjt:  VLGYWPAFLFSYLTDSASMVEWGGEVVNSEADGEHTSTQMGSGHFPDEGFGKAGYFRNIQVVDGSNNLRQPEDIGTFTEQPSCYDVQTGKSGDWGNYFFY

Query:  GGPGRNPNCP
        GGPGRNPNCP
Subjt:  GGPGRNPNCP

XP_038899904.1 uncharacterized protein LOC120087094 [Benincasa hispida]9.0e-22389.59Show/hide
Query:  MGSLKLLLLLMLVSFSVAA---AGKTTRHRHRRLEAQAHLKKLNKPAVKSIKSPDGDIIDCVHMAHQPAFDHPLLKNHTIQMRPNFHPEGIFKDSKVSSK
        MG LKLL LL+L+  S++A   AGKTTRHRHRRL+  +HLKKLNKP VKSIKSPDGDIIDCVHMAHQPAFDHPLL+NHTIQM PNFHPEGI ++SKVSSK
Subjt:  MGSLKLLLLLMLVSFSVAA---AGKTTRHRHRRLEAQAHLKKLNKPAVKSIKSPDGDIIDCVHMAHQPAFDHPLLKNHTIQMRPNFHPEGIFKDSKVSSK

Query:  ASKSNPITQLWHLKGRCPKGTIPIRRTKKDDILRANSVKSYGKKKPQATVKPTSIDIDLNGQTGHQHAIIYVEGGQYYGAKATINVWSPKIQQTNEFSLS
        ASKS  ITQLWHLKGRCPKGTIPIRRTKK+DILRA+SVKSYGKKKP ATVKPTSIDIDLNGQ GHQHAIIYVEGGQYYGAKATINVWSPKIQQTNEFSLS
Subjt:  ASKSNPITQLWHLKGRCPKGTIPIRRTKKDDILRANSVKSYGKKKPQATVKPTSIDIDLNGQTGHQHAIIYVEGGQYYGAKATINVWSPKIQQTNEFSLS

Query:  QIWILGGTFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNQIAMGASIYPISSYKSSQYDISLLIWKDPKEGNWWMQFG
        QIWILGGTFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINN+IAMGASI+PISSYK SQYDISLLIWKDPKEGNWWMQFG
Subjt:  QIWILGGTFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNQIAMGASIYPISSYKSSQYDISLLIWKDPKEGNWWMQFG

Query:  NSYVLGYWPAFLFSYLTDSASMVEWGGEVVNSEADGEHTSTQMGSGHFPDEGFGKAGYFRNIQVVDGSNNLRQPEDIGTFTEQPSCYDVQTGKSGDWGNY
        N YVLGYWPAFLFSYLTDSASM+EWGGEVVNSE+DG+HTSTQMGSGHFP EGFGKAGYFRNIQ+V  SN+LR PEDIGTFTEQPSCYDV+ GKS DWGNY
Subjt:  NSYVLGYWPAFLFSYLTDSASMVEWGGEVVNSEADGEHTSTQMGSGHFPDEGFGKAGYFRNIQVVDGSNNLRQPEDIGTFTEQPSCYDVQTGKSGDWGNY

Query:  FFYGGPGRNPNCP
        FFYGGPGRNPNCP
Subjt:  FFYGGPGRNPNCP

TrEMBL top hitse value%identityAlignment
A0A0A0KBL1 Uncharacterized protein1.1e-21587.56Show/hide
Query:  MGSLKL--LLLLMLVSFSV-AAAGKTTRHRH----RRLEAQAHLKKLNKPAVKSIKSPDGDIIDCVHMAHQPAFDHPLLKNHTIQMRPNFHPE-GIFKDS
        MG LKL  L LLMLVS ++    GKTT HRH    RRLE  +HLKKLNKPAVKSIKSPDGDIIDCV MAHQPAFDHPLLKNHTIQMRP FHPE GI  DS
Subjt:  MGSLKL--LLLLMLVSFSV-AAAGKTTRHRH----RRLEAQAHLKKLNKPAVKSIKSPDGDIIDCVHMAHQPAFDHPLLKNHTIQMRPNFHPE-GIFKDS

Query:  KVSSKASKSNPITQLWHLKGRCPKGTIPIRRTKKDDILRANSVKSYGKKKPQATVKPTSIDIDLNGQTGHQHAIIYVEGGQYYGAKATINVWSPKIQQTN
        KVS K SKS  ITQLWHLKG+CPKGTIPIRRTKK+DILR NSVKSYGKKKP ATVKP SI++DLNGQ GHQHAIIYVEGGQYYGAKATINVWSPKIQQTN
Subjt:  KVSSKASKSNPITQLWHLKGRCPKGTIPIRRTKKDDILRANSVKSYGKKKPQATVKPTSIDIDLNGQTGHQHAIIYVEGGQYYGAKATINVWSPKIQQTN

Query:  EFSLSQIWILGGTFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNQIAMGASIYPISSYKSSQYDISLLIWKDPKEGNW
        EFSLSQIWILGGTFG+DLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINN+IAMGASI+PISSYKSSQYDISLLIWKDPKEGNW
Subjt:  EFSLSQIWILGGTFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNQIAMGASIYPISSYKSSQYDISLLIWKDPKEGNW

Query:  WMQFGNSYVLGYWPAFLFSYLTDSASMVEWGGEVVNSEADGEHTSTQMGSGHFPDEGFGKAGYFRNIQVVDGSNNLRQPEDIGTFTEQPSCYDVQTGKSG
        WMQFGN YVLGYWPAFLFSYLTDSASM+EWGGEVVNSE+DG+HTSTQMGSGHFP EGFGKAGYFRNIQ+V  SN+LR PEDIG FTEQPSCYDVQ GKS 
Subjt:  WMQFGNSYVLGYWPAFLFSYLTDSASMVEWGGEVVNSEADGEHTSTQMGSGHFPDEGFGKAGYFRNIQVVDGSNNLRQPEDIGTFTEQPSCYDVQTGKSG

Query:  DWGNYFFYGGPGRNPNCP
        DWGNYFFYGGPGRNPNCP
Subjt:  DWGNYFFYGGPGRNPNCP

A0A1S3CE73 LOW QUALITY PROTEIN: uncharacterized protein LOC1034998722.6e-21587.83Show/hide
Query:  MGSLKL--LLLLMLVSFSV-AAAGKTT----RH-RHRRLEAQAHLKKLNKPAVKSIKSPDGDIIDCVHMAHQPAFDHPLLKNHTIQMRPNFHPE-GIFKD
        MG LKL  L LLMLVS S+    GK T    RH RHRRLE  +HLKKLNKPAVKSIKSPDGDIIDCVHMAHQPAFDHPLLKNHTIQMRPNFHPE GI  +
Subjt:  MGSLKL--LLLLMLVSFSV-AAAGKTT----RH-RHRRLEAQAHLKKLNKPAVKSIKSPDGDIIDCVHMAHQPAFDHPLLKNHTIQMRPNFHPE-GIFKD

Query:  SKVSSKASKSNPITQLWHLKGRCPKGTIPIRRTKKDDILRANSVKSYGKKKPQATVKPTSIDIDLNGQTGHQHAIIYVEGGQYYGAKATINVWSPKIQQT
        SKVS K SKS  ITQLWHLKG+CPKGTIPIRR KK+DILR NSVKSYGKKKP ATVKP SI+IDLNGQ GHQHAIIYVEGGQYYGAKATINVWSPKIQQT
Subjt:  SKVSSKASKSNPITQLWHLKGRCPKGTIPIRRTKKDDILRANSVKSYGKKKPQATVKPTSIDIDLNGQTGHQHAIIYVEGGQYYGAKATINVWSPKIQQT

Query:  NEFSLSQIWILGGTFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNQIAMGASIYPISSYKSSQYDISLLIWKDPKEGN
        NEFSLSQIWILGGTFG+DLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINN+IAMGASI+PISSYKSSQYDISLLIWKDPKEGN
Subjt:  NEFSLSQIWILGGTFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNQIAMGASIYPISSYKSSQYDISLLIWKDPKEGN

Query:  WWMQFGNSYVLGYWPAFLFSYLTDSASMVEWGGEVVNSEADGEHTSTQMGSGHFPDEGFGKAGYFRNIQVVDGSNNLRQPEDIGTFTEQPSCYDVQTGKS
        WWMQFGN YVLGYWPAFLFSYLTDSASM+EWGGEVVNSE+DG+HTSTQMGSGHFP EGFGKAGYFRNIQ+V  SN+LR PEDIG FTEQPSCYDVQ GKS
Subjt:  WWMQFGNSYVLGYWPAFLFSYLTDSASMVEWGGEVVNSEADGEHTSTQMGSGHFPDEGFGKAGYFRNIQVVDGSNNLRQPEDIGTFTEQPSCYDVQTGKS

Query:  GDWGNYFFYGGPGRNPNCP
         DWGNYFFYGGPGRNPNCP
Subjt:  GDWGNYFFYGGPGRNPNCP

A0A6J1DGY2 uncharacterized protein LOC1110203901.4e-22491.2Show/hide
Query:  LKLLLLLMLVSFSVA--AAGKTTRHRHRRLEAQAHLKKLNKPAVKSIKSPDGDIIDCVHMAHQPAFDHPLLKNHTIQMRPNFHPEGIFKDSKVSSKASKS
        L LLL L L+S S+A  AA KT+R RH RLEA  HLKKLNKP VKSIKSPDGDIIDCVHMAHQPAFDHPLL+NHTIQMRPNFHPEGIFKD+KVSS  S+S
Subjt:  LKLLLLLMLVSFSVA--AAGKTTRHRHRRLEAQAHLKKLNKPAVKSIKSPDGDIIDCVHMAHQPAFDHPLLKNHTIQMRPNFHPEGIFKDSKVSSKASKS

Query:  NPITQLWHLKGRCPKGTIPIRRTKKDDILRANSVKSYGKKKPQATVKPTSIDIDLNGQTGHQHAIIYVEGGQYYGAKATINVWSPKIQQTNEFSLSQIWI
         PITQLWHLKGRCP+GTIPIRRTKK DILRA+S+KSYGKKKPQATVKPTSIDIDLNGQTGHQHAIIYVEGG+YYGAKATINVWSPKIQQTNEFSLSQIWI
Subjt:  NPITQLWHLKGRCPKGTIPIRRTKKDDILRANSVKSYGKKKPQATVKPTSIDIDLNGQTGHQHAIIYVEGGQYYGAKATINVWSPKIQQTNEFSLSQIWI

Query:  LGGTFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNQIAMGASIYPISSYKSSQYDISLLIWKDPKEGNWWMQFGNSYV
        LGGTFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNQIAMGASIYPISSY+SSQYDISLLIWKDPKEGNWWMQFGNS+V
Subjt:  LGGTFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNQIAMGASIYPISSYKSSQYDISLLIWKDPKEGNWWMQFGNSYV

Query:  LGYWPAFLFSYLTDSASMVEWGGEVVNSEADGEHTSTQMGSGHFPDEGFGKAGYFRNIQVVDGSNNLRQPEDIGTFTEQPSCYDVQTGKSGDWGNYFFYG
        LGYWPAFLFSYLTDSASM+EWGGEVVNSE+DG+HTSTQMGSGHFP+EGFGKAGYFRNIQVVDGSNNL+ PEDIGTFTEQPSCYDVQ GKSG+WGNYFFYG
Subjt:  LGYWPAFLFSYLTDSASMVEWGGEVVNSEADGEHTSTQMGSGHFPDEGFGKAGYFRNIQVVDGSNNLRQPEDIGTFTEQPSCYDVQTGKSGDWGNYFFYG

Query:  GPGRNPNCP
        GPGRNPNCP
Subjt:  GPGRNPNCP

A0A6J1H5K4 uncharacterized protein LOC1114606891.7e-22289.13Show/hide
Query:  MGSLKLLLLLMLVSFSVAAAGKTTRHRH----RRLEAQAHLKKLNKPAVKSIKSPDGDIIDCVHMAHQPAFDHPLLKNHTIQMRPNFHPEGIFKDSKVSS
        MG+LKLLLLL++     AAAGK++ HRH    RR EAQ HLKKLNKPAVKSIKSPDGDIIDCVHMAHQPAFDHPLLKNHTIQMRPNFHPEG+  D+K+SS
Subjt:  MGSLKLLLLLMLVSFSVAAAGKTTRHRH----RRLEAQAHLKKLNKPAVKSIKSPDGDIIDCVHMAHQPAFDHPLLKNHTIQMRPNFHPEGIFKDSKVSS

Query:  KASKSNPITQLWHLKGRCPKGTIPIRRTKKDDILRANSVKSYGKKKPQATVKPTSIDIDLNGQTGHQHAIIYVEGGQYYGAKATINVWSPKIQQTNEFSL
        K S+  PITQ WHLKGRCPKGTIPIRRT K DILRANSVK+YG+KKPQAT KPTSIDIDLNGQTGHQHAI YVEGGQYYGAKATINVWSPKIQ  NEFSL
Subjt:  KASKSNPITQLWHLKGRCPKGTIPIRRTKKDDILRANSVKSYGKKKPQATVKPTSIDIDLNGQTGHQHAIIYVEGGQYYGAKATINVWSPKIQQTNEFSL

Query:  SQIWILGGTFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNQIAMGASIYPISSYKSSQYDISLLIWKDPKEGNWWMQF
        SQIWILGGTFGEDLNSIEAGWQVSPDLYGDNNTR FTYWTSDAYQATGCYNLLCSGFVQINN+IA+GASIYPISSY+ SQYDISLLIWKDPKEGNWWMQF
Subjt:  SQIWILGGTFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNQIAMGASIYPISSYKSSQYDISLLIWKDPKEGNWWMQF

Query:  GNSYVLGYWPAFLFSYLTDSASMVEWGGEVVNSEADGEHTSTQMGSGHFPDEGFGKAGYFRNIQVVDGSNNLRQPEDIGTFTEQPSCYDVQTGKSGDWGN
        GNSYVLGYWPAFLFSYLTD ASMVEWGGEVVNSE+DG+HTSTQMGSGHFP EGFGKAGYFRNIQVVDGSNNLRQPEDIGTFTEQPSCYDVQ GKSGDWGN
Subjt:  GNSYVLGYWPAFLFSYLTDSASMVEWGGEVVNSEADGEHTSTQMGSGHFPDEGFGKAGYFRNIQVVDGSNNLRQPEDIGTFTEQPSCYDVQTGKSGDWGN

Query:  YFFYGGPGRNPNCP
        YFFYGGPGRNPNCP
Subjt:  YFFYGGPGRNPNCP

A0A6J1KPT3 uncharacterized protein LOC1114976114.3e-22390.49Show/hide
Query:  MGSLKLLLLLMLVSFSVAAAGKTTRHRHRRLEAQAHLKKLNKPAVKSIKSPDGDIIDCVHMAHQPAFDHPLLKNHTIQMRPNFHPEGIFKDSKVSSKASK
        MG+LKLLLLL +VS SVAAAGK++ HR RR EAQ HLKKLNKPAVKSIKSPDGDIIDCVHMAHQPAFDHPLLKNHTIQMRPNFHPEG+   +K+SSK S+
Subjt:  MGSLKLLLLLMLVSFSVAAAGKTTRHRHRRLEAQAHLKKLNKPAVKSIKSPDGDIIDCVHMAHQPAFDHPLLKNHTIQMRPNFHPEGIFKDSKVSSKASK

Query:  SNPITQLWHLKGRCPKGTIPIRRTKKDDILRANSVKSYGKKKPQATVKPTSIDIDLNGQTGHQHAIIYVEGGQYYGAKATINVWSPKIQQTNEFSLSQIW
          PITQ WHLKGRCP+GTIPIRRT K DILRANSVKSYGKKKPQATV+PTSIDIDLNGQTGHQHAI YVEGGQYYGAKATINVWSPKIQ  NEFSLSQIW
Subjt:  SNPITQLWHLKGRCPKGTIPIRRTKKDDILRANSVKSYGKKKPQATVKPTSIDIDLNGQTGHQHAIIYVEGGQYYGAKATINVWSPKIQQTNEFSLSQIW

Query:  ILGGTFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNQIAMGASIYPISSYKSSQYDISLLIWKDPKEGNWWMQFGNSY
        ILGGTFGEDLNSIEAGWQVSPDLYGDNNTR FTYWTSDAYQATGCYNLLCSGFVQINN+IA+GASIYPISSY+ SQYDISLLIWKDPKEGNWWMQFGNSY
Subjt:  ILGGTFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNQIAMGASIYPISSYKSSQYDISLLIWKDPKEGNWWMQFGNSY

Query:  VLGYWPAFLFSYLTDSASMVEWGGEVVNSEADGEHTSTQMGSGHFPDEGFGKAGYFRNIQVVDGSNNLRQPEDIGTFTEQPSCYDVQTGKSGDWGNYFFY
        VLGYWPAFLFSYLTD ASMVEWGGEVVNSE+ G+HTSTQMGSGHFP EGFGKAGYFRNIQVVDGSNNLRQPEDIGTFTEQPSCYDVQ GKSGDWGNYFFY
Subjt:  VLGYWPAFLFSYLTDSASMVEWGGEVVNSEADGEHTSTQMGSGHFPDEGFGKAGYFRNIQVVDGSNNLRQPEDIGTFTEQPSCYDVQTGKSGDWGNYFFY

Query:  GGPGRNPNCP
        GGPGRNPNCP
Subjt:  GGPGRNPNCP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G55360.1 Protein of Unknown Function (DUF239)1.2e-18071.29Show/hide
Query:  GSLKLLLLLMLVSFSVAAAGKTTRHRHRRLEAQAHLKKLNKPAVKSIKSPDGDIIDCVHMAHQPAFDHPLLKNHTIQMRPNFHPEGIFKDSKVSSKAS--
        G L  L L    S S AA    ++   ++ E + HL +LNKPAVKSI+S DGD+IDCV ++ QPAFDHP LK+H IQM+PN+HPEG+F D+KVS+  S  
Subjt:  GSLKLLLLLMLVSFSVAAAGKTTRHRHRRLEAQAHLKKLNKPAVKSIKSPDGDIIDCVHMAHQPAFDHPLLKNHTIQMRPNFHPEGIFKDSKVSSKAS--

Query:  KSNPITQLWHLKGRCPKGTIPIRRTKKDDILRANSVKSYGKKKPQATVKPTSIDIDLNGQTGHQHAIIYVEGGQYYGAKATINVWSPKIQQTNEFSLSQI
        K   I QLWH  G+C +GTIP+RRTK+DD+LRA+SVK YGKKK ++   P S + DL  Q+GHQHAI YVEG +YYGAKATINVW PKIQQ NEFSLSQI
Subjt:  KSNPITQLWHLKGRCPKGTIPIRRTKKDDILRANSVKSYGKKKPQATVKPTSIDIDLNGQTGHQHAIIYVEGGQYYGAKATINVWSPKIQQTNEFSLSQI

Query:  WILGGTFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNQIAMGASIYPISSYKSSQYDISLLIWKDPKEGNWWMQFGNS
        W+LGG+FG+DLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGF+QIN+ IAMGASI P+S Y++SQYDIS+LIWKDPKEG+WWMQFGN 
Subjt:  WILGGTFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNQIAMGASIYPISSYKSSQYDISLLIWKDPKEGNWWMQFGNS

Query:  YVLGYWPAFLFSYLTDSASMVEWGGEVVNSEADGEHTSTQMGSGHFPDEGFGKAGYFRNIQVVDGSNNLRQPEDIGTFTEQPSCYDVQTGKSGDWGNYFF
        YVLGYWP+FLFSYLT+SASM+EWGGEVVNS++DG+HTSTQMGSG FP+EGF KA YFRNIQVVDGSNNL+ P+ +GTFTEQ +CYDVQTG + DWG+YF+
Subjt:  YVLGYWPAFLFSYLTDSASMVEWGGEVVNSEADGEHTSTQMGSGHFPDEGFGKAGYFRNIQVVDGSNNLRQPEDIGTFTEQPSCYDVQTGKSGDWGNYFF

Query:  YGGPGRNPNCP
        YGGPG+N  CP
Subjt:  YGGPGRNPNCP

AT2G44210.1 Protein of Unknown Function (DUF239)1.5e-16266.93Show/hide
Query:  LEAQAHLKKLNKPAVKSIKSPDGDIIDCVHMAHQPAFDHPLLKNHTIQMRPNFHPEGIFKDSKVSSKA--SKSNPITQLWHLKGRCPKGTIPIRRTKKDD
        L+ + HLK+LNKPA+KSIKSPDGD+IDCV +  QPAF HPLL NHT+QM P+ +PE +F +SKVSSK    +SN I QLWH+ G+CPK TIPIRRT++ D
Subjt:  LEAQAHLKKLNKPAVKSIKSPDGDIIDCVHMAHQPAFDHPLLKNHTIQMRPNFHPEGIFKDSKVSSKA--SKSNPITQLWHLKGRCPKGTIPIRRTKKDD

Query:  ILRANSVKSYGKKKPQATVKPTSID-IDLNGQTGHQHAIIYVEGGQYYGAKATINVWSPKIQQTNEFSLSQIWILGGTFGEDLNSIEAGWQVSPDLYGDN
        + RA+SV++YG K  ++  KP S +  ++  Q GHQHAI+YVE G +YGAKA INVW P ++  NEFSL+QIW+LGG F  DLNSIEAGWQVSP LYGDN
Subjt:  ILRANSVKSYGKKKPQATVKPTSID-IDLNGQTGHQHAIIYVEGGQYYGAKATINVWSPKIQQTNEFSLSQIWILGGTFGEDLNSIEAGWQVSPDLYGDN

Query:  NTRLFTYWTSDAYQATGCYNLLCSGFVQINNQIAMGASIYPISSYKSSQYDISLLIWKDPKEGNWWMQFGNSYVLGYWPAFLFSYLTDSASMVEWGGEVV
         TRLFTYWTSDAYQ TGCYNLLCSGFVQIN +IAMG SI P+S+Y +SQYDI++LIWKDPKEG+WW+QFG  Y++GYWPA LFSYL++SASM+EWGGEVV
Subjt:  NTRLFTYWTSDAYQATGCYNLLCSGFVQINNQIAMGASIYPISSYKSSQYDISLLIWKDPKEGNWWMQFGNSYVLGYWPAFLFSYLTDSASMVEWGGEVV

Query:  NSEA-DGEHTSTQMGSGHFPDEGFGKAGYFRNIQVVDGSNNLRQPEDIGTFTEQPSCYDVQTGKSGDWGNYFFYGGPGRNPNCP
        NS++ +G+HT+TQMGSG F +EG+GKA YF+N+QVVDGSN LR PE++  FT+Q +CY+V++G  G WG+YF+YGGPGRNPNCP
Subjt:  NSEA-DGEHTSTQMGSGHFPDEGFGKAGYFRNIQVVDGSNNLRQPEDIGTFTEQPSCYDVQTGKSGDWGNYFFYGGPGRNPNCP

AT3G13510.1 Protein of Unknown Function (DUF239)7.7e-18070.97Show/hide
Query:  LLMLVSFSVAAAGKTTRHRHRRLEAQAHLKKLNKPAVKSIKSPDGDIIDCVHMAHQPAFDHPLLKNHTIQMRPNFHPEGIFKDSKVSSK-ASKSNPITQL
        L +++S S AAA   +    ++ E + HL +LNKP VK+I+SPDGDIIDC+ ++ QPAFDHP LK+H IQMRP++HPEG+F D+KVS++   K   I QL
Subjt:  LLMLVSFSVAAAGKTTRHRHRRLEAQAHLKKLNKPAVKSIKSPDGDIIDCVHMAHQPAFDHPLLKNHTIQMRPNFHPEGIFKDSKVSSK-ASKSNPITQL

Query:  WHLKGRCPKGTIPIRRTKKDDILRANSVKSYGKKKPQATVKPTSIDIDLNGQTGHQHAIIYVEGGQYYGAKATINVWSPKIQQTNEFSLSQIWILGGTFG
        WH  G+C +GTIP+RRT++DD+LRA+SVK YGKKK ++   P S + DL  Q GHQHAI YVEG +YYGAKAT+NVW PKIQ TNEFSLSQIW+LGG+FG
Subjt:  WHLKGRCPKGTIPIRRTKKDDILRANSVKSYGKKKPQATVKPTSIDIDLNGQTGHQHAIIYVEGGQYYGAKATINVWSPKIQQTNEFSLSQIWILGGTFG

Query:  EDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNQIAMGASIYPISSYKSSQYDISLLIWKDPKEGNWWMQFGNSYVLGYWPA
        +DLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGF+QIN+ IAMGASI P+S Y++SQYDIS+LIWKDPKEG+WWMQFGN YVLGYWP+
Subjt:  EDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNQIAMGASIYPISSYKSSQYDISLLIWKDPKEGNWWMQFGNSYVLGYWPA

Query:  FLFSYLTDSASMVEWGGEVVNSEADGEHTSTQMGSGHFPDEGFGKAGYFRNIQVVDGSNNLRQPEDIGTFTEQPSCYDVQTGKSGDWGNYFFYGGPGRNP
        FLFSYLT+SASM+EWGGEVVNS+++G HT TQMGSGHFP+EGF KA YFRNIQVVDGSNNL+ P+ +GTFTE+ +CYDVQTG + DWG+YF+YGGPG+N 
Subjt:  FLFSYLTDSASMVEWGGEVVNSEADGEHTSTQMGSGHFPDEGFGKAGYFRNIQVVDGSNNLRQPEDIGTFTEQPSCYDVQTGKSGDWGNYFFYGGPGRNP

Query:  NCP
        NCP
Subjt:  NCP

AT5G56530.1 Protein of Unknown Function (DUF239)1.1e-17872.41Show/hide
Query:  SVAAAGKTTRHRHRRLEAQAHLKKLNKPAVKSIKSPDGDIIDCVHMAHQPAFDHPLLKNHTIQMRPNFHPEGIFKDSKVSSKASKS-NPITQLWHLKGRC
        S+  AG+ +  R +  E   HL +LNKPAVKSI+SPDGDIIDCVH++ QPAFDHP LK+H IQM P++ PE +F +SKVS K  +S NPITQLWH  G C
Subjt:  SVAAAGKTTRHRHRRLEAQAHLKKLNKPAVKSIKSPDGDIIDCVHMAHQPAFDHPLLKNHTIQMRPNFHPEGIFKDSKVSSKASKS-NPITQLWHLKGRC

Query:  PKGTIPIRRTKKDDILRANSVKSYGKKKPQATVKPTSIDIDLNGQTGHQHAIIYVEGGQYYGAKATINVWSPKIQQTNEFSLSQIWILGGTFGEDLNSIE
         +GTIP+RRTKK+D+LRA+SVK YGKKK  +   P S D DL  Q+GHQHAI YVEGG++YGAKATINVW PK+Q +NEFSLSQ+WILGG+FG+DLNSIE
Subjt:  PKGTIPIRRTKKDDILRANSVKSYGKKKPQATVKPTSIDIDLNGQTGHQHAIIYVEGGQYYGAKATINVWSPKIQQTNEFSLSQIWILGGTFGEDLNSIE

Query:  AGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNQIAMGASIYPISSYKSSQYDISLLIWKDPKEGNWWMQFGNSYVLGYWPAFLFSYLT
        AGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGF+QIN+QIAMGASI P+S + + QYDIS+ IWKDPKEG+WWMQFG+ YVLGYWP+FLFSYL 
Subjt:  AGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNQIAMGASIYPISSYKSSQYDISLLIWKDPKEGNWWMQFGNSYVLGYWPAFLFSYLT

Query:  DSASMVEWGGEVVNSEADGEHTSTQMGSGHFPDEGFGKAGYFRNIQVVDGSNNLRQPEDIGTFTEQPSCYDVQTGKSGDWGNYFFYGGPGRNPNC
        DSAS+VEWGGEVVN E DG HT+TQMGSG FPDEGF KA YFRNIQVVD SNNL++P+ + TFTE+ +CYDV+ GK+ DWG+YF+YGGPGRNPNC
Subjt:  DSASMVEWGGEVVNSEADGEHTSTQMGSGHFPDEGFGKAGYFRNIQVVDGSNNLRQPEDIGTFTEQPSCYDVQTGKSGDWGNYFFYGGPGRNPNC

AT5G56530.2 Protein of Unknown Function (DUF239)1.1e-17872.41Show/hide
Query:  SVAAAGKTTRHRHRRLEAQAHLKKLNKPAVKSIKSPDGDIIDCVHMAHQPAFDHPLLKNHTIQMRPNFHPEGIFKDSKVSSKASKS-NPITQLWHLKGRC
        S+  AG+ +  R +  E   HL +LNKPAVKSI+SPDGDIIDCVH++ QPAFDHP LK+H IQM P++ PE +F +SKVS K  +S NPITQLWH  G C
Subjt:  SVAAAGKTTRHRHRRLEAQAHLKKLNKPAVKSIKSPDGDIIDCVHMAHQPAFDHPLLKNHTIQMRPNFHPEGIFKDSKVSSKASKS-NPITQLWHLKGRC

Query:  PKGTIPIRRTKKDDILRANSVKSYGKKKPQATVKPTSIDIDLNGQTGHQHAIIYVEGGQYYGAKATINVWSPKIQQTNEFSLSQIWILGGTFGEDLNSIE
         +GTIP+RRTKK+D+LRA+SVK YGKKK  +   P S D DL  Q+GHQHAI YVEGG++YGAKATINVW PK+Q +NEFSLSQ+WILGG+FG+DLNSIE
Subjt:  PKGTIPIRRTKKDDILRANSVKSYGKKKPQATVKPTSIDIDLNGQTGHQHAIIYVEGGQYYGAKATINVWSPKIQQTNEFSLSQIWILGGTFGEDLNSIE

Query:  AGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNQIAMGASIYPISSYKSSQYDISLLIWKDPKEGNWWMQFGNSYVLGYWPAFLFSYLT
        AGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGF+QIN+QIAMGASI P+S + + QYDIS+ IWKDPKEG+WWMQFG+ YVLGYWP+FLFSYL 
Subjt:  AGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNQIAMGASIYPISSYKSSQYDISLLIWKDPKEGNWWMQFGNSYVLGYWPAFLFSYLT

Query:  DSASMVEWGGEVVNSEADGEHTSTQMGSGHFPDEGFGKAGYFRNIQVVDGSNNLRQPEDIGTFTEQPSCYDVQTGKSGDWGNYFFYGGPGRNPNC
        DSAS+VEWGGEVVN E DG HT+TQMGSG FPDEGF KA YFRNIQVVD SNNL++P+ + TFTE+ +CYDV+ GK+ DWG+YF+YGGPGRNPNC
Subjt:  DSASMVEWGGEVVNSEADGEHTSTQMGSGHFPDEGFGKAGYFRNIQVVDGSNNLRQPEDIGTFTEQPSCYDVQTGKSGDWGNYFFYGGPGRNPNC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGTTCTCTAAAGCTGCTGCTGCTCTTAATGCTCGTTTCTTTTTCAGTCGCGGCGGCCGGAAAAACCACCCGCCACCGTCACCGGCGGCTCGAAGCTCAGGCCCATCT
GAAGAAGCTCAATAAACCTGCTGTTAAGTCCATCAAGAGTCCAGATGGGGATATTATTGATTGTGTCCATATGGCTCATCAGCCAGCTTTTGATCATCCTCTTCTCAAAA
ACCACACAATTCAGATGAGACCAAATTTTCATCCAGAGGGGATTTTCAAGGACAGTAAAGTGTCTTCAAAAGCTTCAAAATCAAATCCCATAACTCAATTATGGCACTTG
AAAGGCAGATGTCCAAAAGGGACAATTCCCATTAGAAGAACAAAAAAAGATGACATTTTGAGAGCAAACTCTGTGAAAAGCTATGGCAAAAAGAAGCCTCAAGCCACTGT
GAAACCAACCTCCATTGATATTGATCTCAATGGCCAAACTGGACATCAGCATGCAATAATATATGTTGAAGGAGGACAATACTATGGAGCTAAGGCAACTATAAACGTTT
GGTCTCCCAAAATCCAACAGACAAACGAATTTAGCCTCTCGCAGATCTGGATTCTTGGAGGAACTTTTGGGGAAGATCTTAATAGCATTGAAGCTGGTTGGCAGGTCAGC
CCTGATTTGTATGGAGATAACAACACTCGACTTTTCACTTATTGGACTAGTGATGCTTATCAAGCTACTGGTTGCTACAATCTCCTCTGTTCTGGGTTTGTTCAAATCAA
TAATCAAATAGCCATGGGTGCCAGCATTTATCCCATTTCTTCTTACAAAAGTTCTCAATATGACATTAGCTTGCTCATCTGGAAGGACCCTAAAGAAGGAAACTGGTGGA
TGCAATTCGGAAACAGCTACGTATTGGGTTACTGGCCGGCGTTCTTGTTCTCATACCTCACCGACAGCGCCTCCATGGTCGAGTGGGGCGGTGAAGTTGTCAACTCCGAA
GCCGACGGCGAACACACTTCCACTCAGATGGGCAGCGGCCACTTCCCCGACGAGGGCTTCGGCAAGGCCGGCTACTTCCGAAATATTCAGGTGGTCGACGGATCGAACAA
CCTCCGGCAGCCGGAGGACATCGGAACTTTCACAGAGCAGCCCAGTTGCTACGACGTTCAGACCGGGAAGTCCGGCGACTGGGGCAATTACTTCTTCTACGGCGGGCCGG
GCAGAAATCCAAACTGCCCGTGA
mRNA sequenceShow/hide mRNA sequence
ATGGGTTCTCTAAAGCTGCTGCTGCTCTTAATGCTCGTTTCTTTTTCAGTCGCGGCGGCCGGAAAAACCACCCGCCACCGTCACCGGCGGCTCGAAGCTCAGGCCCATCT
GAAGAAGCTCAATAAACCTGCTGTTAAGTCCATCAAGAGTCCAGATGGGGATATTATTGATTGTGTCCATATGGCTCATCAGCCAGCTTTTGATCATCCTCTTCTCAAAA
ACCACACAATTCAGATGAGACCAAATTTTCATCCAGAGGGGATTTTCAAGGACAGTAAAGTGTCTTCAAAAGCTTCAAAATCAAATCCCATAACTCAATTATGGCACTTG
AAAGGCAGATGTCCAAAAGGGACAATTCCCATTAGAAGAACAAAAAAAGATGACATTTTGAGAGCAAACTCTGTGAAAAGCTATGGCAAAAAGAAGCCTCAAGCCACTGT
GAAACCAACCTCCATTGATATTGATCTCAATGGCCAAACTGGACATCAGCATGCAATAATATATGTTGAAGGAGGACAATACTATGGAGCTAAGGCAACTATAAACGTTT
GGTCTCCCAAAATCCAACAGACAAACGAATTTAGCCTCTCGCAGATCTGGATTCTTGGAGGAACTTTTGGGGAAGATCTTAATAGCATTGAAGCTGGTTGGCAGGTCAGC
CCTGATTTGTATGGAGATAACAACACTCGACTTTTCACTTATTGGACTAGTGATGCTTATCAAGCTACTGGTTGCTACAATCTCCTCTGTTCTGGGTTTGTTCAAATCAA
TAATCAAATAGCCATGGGTGCCAGCATTTATCCCATTTCTTCTTACAAAAGTTCTCAATATGACATTAGCTTGCTCATCTGGAAGGACCCTAAAGAAGGAAACTGGTGGA
TGCAATTCGGAAACAGCTACGTATTGGGTTACTGGCCGGCGTTCTTGTTCTCATACCTCACCGACAGCGCCTCCATGGTCGAGTGGGGCGGTGAAGTTGTCAACTCCGAA
GCCGACGGCGAACACACTTCCACTCAGATGGGCAGCGGCCACTTCCCCGACGAGGGCTTCGGCAAGGCCGGCTACTTCCGAAATATTCAGGTGGTCGACGGATCGAACAA
CCTCCGGCAGCCGGAGGACATCGGAACTTTCACAGAGCAGCCCAGTTGCTACGACGTTCAGACCGGGAAGTCCGGCGACTGGGGCAATTACTTCTTCTACGGCGGGCCGG
GCAGAAATCCAAACTGCCCGTGA
Protein sequenceShow/hide protein sequence
MGSLKLLLLLMLVSFSVAAAGKTTRHRHRRLEAQAHLKKLNKPAVKSIKSPDGDIIDCVHMAHQPAFDHPLLKNHTIQMRPNFHPEGIFKDSKVSSKASKSNPITQLWHL
KGRCPKGTIPIRRTKKDDILRANSVKSYGKKKPQATVKPTSIDIDLNGQTGHQHAIIYVEGGQYYGAKATINVWSPKIQQTNEFSLSQIWILGGTFGEDLNSIEAGWQVS
PDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNQIAMGASIYPISSYKSSQYDISLLIWKDPKEGNWWMQFGNSYVLGYWPAFLFSYLTDSASMVEWGGEVVNSE
ADGEHTSTQMGSGHFPDEGFGKAGYFRNIQVVDGSNNLRQPEDIGTFTEQPSCYDVQTGKSGDWGNYFFYGGPGRNPNCP