; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lcy05g003810 (gene) of Sponge gourd (P93075) v1 genome

Gene IDLcy05g003810
OrganismLuffa cylindrica cv. P93075 (Sponge gourd (P93075) v1)
DescriptionProtein of Unknown Function (DUF239)
Genome locationChr05:3416717..3421075
RNA-Seq ExpressionLcy05g003810
SyntenyLcy05g003810
Gene Ontology termsGO:0016874 - ligase activity (molecular function)
InterPro domainsIPR004314 - Neprosin
IPR025521 - Neprosin activation peptide


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6593345.1 hypothetical protein SDJN03_12821, partial [Cucurbita argyrosperma subsp. sororia]7.1e-21884.04Show/hide
Query:  MGSLKLLLLLMLVSFSVAA---AGKPTRHRH----RRLEAQAHLKKLNKPAVKSIKSPDGDIIDCVHMAHQPAFDHPLLKNHTIQMRPNFHPEGILKDSK
        MG+LKLLLLL +VS SVAA   AGK + HRH    RR EAQ HLKKLNKPAVKSIKSPDGDIIDCVHMAHQPAFDHPLLKNHTIQMRPNFHPEG+L D+K
Subjt:  MGSLKLLLLLMLVSFSVAA---AGKPTRHRH----RRLEAQAHLKKLNKPAVKSIKSPDGDIIDCVHMAHQPAFDHPLLKNHTIQMRPNFHPEGILKDSK

Query:  VSSKASKSNPITQLWHLKGRCPKGTIPIRRTKKDDILRANSVKSYGKKKPQATVKPTSIDIDLNGQTGHQHAIIYVEGGQYYGAKATINVWSPKIQQTNE
        +SSK S+  PITQ WHLKGRCP+GTIPIRRT K DILRANSVK+YGKKKPQAT KPTSIDIDLNGQTGHQHAI YVEGGQYYGAKATINVWSPKIQ  NE
Subjt:  VSSKASKSNPITQLWHLKGRCPKGTIPIRRTKKDDILRANSVKSYGKKKPQATVKPTSIDIDLNGQTGHQHAIIYVEGGQYYGAKATINVWSPKIQQTNE

Query:  FSLSQIWILGGTFGEDLNSIEAGWQVFEMFINSYNSNERFRLFEFFFWFCKKQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNQIAMGAS
        FSLSQIWILGGTFGEDLNSIEAGW                            QVSPDLYGDNNTR FTYWTSDAYQATGCYNLLCSGFVQINN+IA+GAS
Subjt:  FSLSQIWILGGTFGEDLNSIEAGWQVFEMFINSYNSNERFRLFEFFFWFCKKQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNQIAMGAS

Query:  IYPISSYKSSQYDISLLIWKDPKEGNWWMQFGNSYVLGYWPAFLFSYLTDSASMVEWGGEVVNSEADGEHTSTQMGSGHFPDEGFGKAGYFRNIQVVDGS
        IYPISSY+ SQYDISLLIWKDPKEGNWWMQFGNSYVLGYWPAFLFSYLTD ASMVEWGGEVVNSE+DG+HTSTQMGSGHFP EGFGKAGYFRNIQVVDGS
Subjt:  IYPISSYKSSQYDISLLIWKDPKEGNWWMQFGNSYVLGYWPAFLFSYLTDSASMVEWGGEVVNSEADGEHTSTQMGSGHFPDEGFGKAGYFRNIQVVDGS

Query:  NNLRQPEDIGTFTEQPSCYDVQTGKSGDWGNYFFYGGPGRNPNCP
        NNLRQPEDIGTFTEQPSCYDVQ GKSGDWGNYFFYGGPGRNPNCP
Subjt:  NNLRQPEDIGTFTEQPSCYDVQTGKSGDWGNYFFYGGPGRNPNCP

XP_022152744.1 uncharacterized protein LOC111020390 [Momordica charantia]1.1e-21884.9Show/hide
Query:  LKLLLLLMLVSFSVA--AAGKPTRHRHRRLEAQAHLKKLNKPAVKSIKSPDGDIIDCVHMAHQPAFDHPLLKNHTIQMRPNFHPEGILKDSKVSSKASKS
        L LLL L L+S S+A  AA K +R RH RLEA  HLKKLNKP VKSIKSPDGDIIDCVHMAHQPAFDHPLL+NHTIQMRPNFHPEGI KD+KVSS  S+S
Subjt:  LKLLLLLMLVSFSVA--AAGKPTRHRHRRLEAQAHLKKLNKPAVKSIKSPDGDIIDCVHMAHQPAFDHPLLKNHTIQMRPNFHPEGILKDSKVSSKASKS

Query:  NPITQLWHLKGRCPKGTIPIRRTKKDDILRANSVKSYGKKKPQATVKPTSIDIDLNGQTGHQHAIIYVEGGQYYGAKATINVWSPKIQQTNEFSLSQIWI
         PITQLWHLKGRCP+GTIPIRRTKK DILRA+S+KSYGKKKPQATVKPTSIDIDLNGQTGHQHAIIYVEGG+YYGAKATINVWSPKIQQTNEFSLSQIWI
Subjt:  NPITQLWHLKGRCPKGTIPIRRTKKDDILRANSVKSYGKKKPQATVKPTSIDIDLNGQTGHQHAIIYVEGGQYYGAKATINVWSPKIQQTNEFSLSQIWI

Query:  LGGTFGEDLNSIEAGWQVFEMFINSYNSNERFRLFEFFFWFCKKQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNQIAMGASIYPISSYK
        LGGTFGEDLNSIEAGW                            QVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNQIAMGASIYPISSY+
Subjt:  LGGTFGEDLNSIEAGWQVFEMFINSYNSNERFRLFEFFFWFCKKQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNQIAMGASIYPISSYK

Query:  SSQYDISLLIWKDPKEGNWWMQFGNSYVLGYWPAFLFSYLTDSASMVEWGGEVVNSEADGEHTSTQMGSGHFPDEGFGKAGYFRNIQVVDGSNNLRQPED
        SSQYDISLLIWKDPKEGNWWMQFGNS+VLGYWPAFLFSYLTDSASM+EWGGEVVNSE+DG+HTSTQMGSGHFP+EGFGKAGYFRNIQVVDGSNNL+ PED
Subjt:  SSQYDISLLIWKDPKEGNWWMQFGNSYVLGYWPAFLFSYLTDSASMVEWGGEVVNSEADGEHTSTQMGSGHFPDEGFGKAGYFRNIQVVDGSNNLRQPED

Query:  IGTFTEQPSCYDVQTGKSGDWGNYFFYGGPGRNPNCP
        IGTFTEQPSCYDVQ GKSG+WGNYFFYGGPGRNPNCP
Subjt:  IGTFTEQPSCYDVQTGKSGDWGNYFFYGGPGRNPNCP

XP_022959691.1 uncharacterized protein LOC111460689 [Cucurbita moschata]3.2e-21883.71Show/hide
Query:  MGSLKLLLLLMLVSFSVAAAGKPTRHRH----RRLEAQAHLKKLNKPAVKSIKSPDGDIIDCVHMAHQPAFDHPLLKNHTIQMRPNFHPEGILKDSKVSS
        MG+LKLLLLL++     AAAGK + HRH    RR EAQ HLKKLNKPAVKSIKSPDGDIIDCVHMAHQPAFDHPLLKNHTIQMRPNFHPEG+L D+K+SS
Subjt:  MGSLKLLLLLMLVSFSVAAAGKPTRHRH----RRLEAQAHLKKLNKPAVKSIKSPDGDIIDCVHMAHQPAFDHPLLKNHTIQMRPNFHPEGILKDSKVSS

Query:  KASKSNPITQLWHLKGRCPKGTIPIRRTKKDDILRANSVKSYGKKKPQATVKPTSIDIDLNGQTGHQHAIIYVEGGQYYGAKATINVWSPKIQQTNEFSL
        K S+  PITQ WHLKGRCPKGTIPIRRT K DILRANSVK+YG+KKPQAT KPTSIDIDLNGQTGHQHAI YVEGGQYYGAKATINVWSPKIQ  NEFSL
Subjt:  KASKSNPITQLWHLKGRCPKGTIPIRRTKKDDILRANSVKSYGKKKPQATVKPTSIDIDLNGQTGHQHAIIYVEGGQYYGAKATINVWSPKIQQTNEFSL

Query:  SQIWILGGTFGEDLNSIEAGWQVFEMFINSYNSNERFRLFEFFFWFCKKQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNQIAMGASIYP
        SQIWILGGTFGEDLNSIEAGW                            QVSPDLYGDNNTR FTYWTSDAYQATGCYNLLCSGFVQINN+IA+GASIYP
Subjt:  SQIWILGGTFGEDLNSIEAGWQVFEMFINSYNSNERFRLFEFFFWFCKKQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNQIAMGASIYP

Query:  ISSYKSSQYDISLLIWKDPKEGNWWMQFGNSYVLGYWPAFLFSYLTDSASMVEWGGEVVNSEADGEHTSTQMGSGHFPDEGFGKAGYFRNIQVVDGSNNL
        ISSY+ SQYDISLLIWKDPKEGNWWMQFGNSYVLGYWPAFLFSYLTD ASMVEWGGEVVNSE+DG+HTSTQMGSGHFP EGFGKAGYFRNIQVVDGSNNL
Subjt:  ISSYKSSQYDISLLIWKDPKEGNWWMQFGNSYVLGYWPAFLFSYLTDSASMVEWGGEVVNSEADGEHTSTQMGSGHFPDEGFGKAGYFRNIQVVDGSNNL

Query:  RQPEDIGTFTEQPSCYDVQTGKSGDWGNYFFYGGPGRNPNCP
        RQPEDIGTFTEQPSCYDVQ GKSGDWGNYFFYGGPGRNPNCP
Subjt:  RQPEDIGTFTEQPSCYDVQTGKSGDWGNYFFYGGPGRNPNCP

XP_023004212.1 uncharacterized protein LOC111497611 [Cucurbita maxima]8.4e-21984.93Show/hide
Query:  MGSLKLLLLLMLVSFSVAAAGKPTRHRHRRLEAQAHLKKLNKPAVKSIKSPDGDIIDCVHMAHQPAFDHPLLKNHTIQMRPNFHPEGILKDSKVSSKASK
        MG+LKLLLLL +VS SVAAAGK + HR RR EAQ HLKKLNKPAVKSIKSPDGDIIDCVHMAHQPAFDHPLLKNHTIQMRPNFHPEG+L  +K+SSK S+
Subjt:  MGSLKLLLLLMLVSFSVAAAGKPTRHRHRRLEAQAHLKKLNKPAVKSIKSPDGDIIDCVHMAHQPAFDHPLLKNHTIQMRPNFHPEGILKDSKVSSKASK

Query:  SNPITQLWHLKGRCPKGTIPIRRTKKDDILRANSVKSYGKKKPQATVKPTSIDIDLNGQTGHQHAIIYVEGGQYYGAKATINVWSPKIQQTNEFSLSQIW
          PITQ WHLKGRCP+GTIPIRRT K DILRANSVKSYGKKKPQATV+PTSIDIDLNGQTGHQHAI YVEGGQYYGAKATINVWSPKIQ  NEFSLSQIW
Subjt:  SNPITQLWHLKGRCPKGTIPIRRTKKDDILRANSVKSYGKKKPQATVKPTSIDIDLNGQTGHQHAIIYVEGGQYYGAKATINVWSPKIQQTNEFSLSQIW

Query:  ILGGTFGEDLNSIEAGWQVFEMFINSYNSNERFRLFEFFFWFCKKQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNQIAMGASIYPISSY
        ILGGTFGEDLNSIEAGW                            QVSPDLYGDNNTR FTYWTSDAYQATGCYNLLCSGFVQINN+IA+GASIYPISSY
Subjt:  ILGGTFGEDLNSIEAGWQVFEMFINSYNSNERFRLFEFFFWFCKKQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNQIAMGASIYPISSY

Query:  KSSQYDISLLIWKDPKEGNWWMQFGNSYVLGYWPAFLFSYLTDSASMVEWGGEVVNSEADGEHTSTQMGSGHFPDEGFGKAGYFRNIQVVDGSNNLRQPE
        + SQYDISLLIWKDPKEGNWWMQFGNSYVLGYWPAFLFSYLTD ASMVEWGGEVVNSE+ G+HTSTQMGSGHFP EGFGKAGYFRNIQVVDGSNNLRQPE
Subjt:  KSSQYDISLLIWKDPKEGNWWMQFGNSYVLGYWPAFLFSYLTDSASMVEWGGEVVNSEADGEHTSTQMGSGHFPDEGFGKAGYFRNIQVVDGSNNLRQPE

Query:  DIGTFTEQPSCYDVQTGKSGDWGNYFFYGGPGRNPNCP
        DIGTFTEQPSCYDVQ GKSGDWGNYFFYGGPGRNPNCP
Subjt:  DIGTFTEQPSCYDVQTGKSGDWGNYFFYGGPGRNPNCP

XP_038899904.1 uncharacterized protein LOC120087094 [Benincasa hispida]2.4e-21883.9Show/hide
Query:  MGSLKLLLLLMLVSFSVAA---AGKPTRHRHRRLEAQAHLKKLNKPAVKSIKSPDGDIIDCVHMAHQPAFDHPLLKNHTIQMRPNFHPEGILKDSKVSSK
        MG LKLL LL+L+  S++A   AGK TRHRHRRL+  +HLKKLNKP VKSIKSPDGDIIDCVHMAHQPAFDHPLL+NHTIQM PNFHPEGIL++SKVSSK
Subjt:  MGSLKLLLLLMLVSFSVAA---AGKPTRHRHRRLEAQAHLKKLNKPAVKSIKSPDGDIIDCVHMAHQPAFDHPLLKNHTIQMRPNFHPEGILKDSKVSSK

Query:  ASKSNPITQLWHLKGRCPKGTIPIRRTKKDDILRANSVKSYGKKKPQATVKPTSIDIDLNGQTGHQHAIIYVEGGQYYGAKATINVWSPKIQQTNEFSLS
        ASKS  ITQLWHLKGRCPKGTIPIRRTKK+DILRA+SVKSYGKKKP ATVKPTSIDIDLNGQ GHQHAIIYVEGGQYYGAKATINVWSPKIQQTNEFSLS
Subjt:  ASKSNPITQLWHLKGRCPKGTIPIRRTKKDDILRANSVKSYGKKKPQATVKPTSIDIDLNGQTGHQHAIIYVEGGQYYGAKATINVWSPKIQQTNEFSLS

Query:  QIWILGGTFGEDLNSIEAGWQVFEMFINSYNSNERFRLFEFFFWFCKKQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNQIAMGASIYPI
        QIWILGGTFGEDLNSIEAGW                            QVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINN+IAMGASI+PI
Subjt:  QIWILGGTFGEDLNSIEAGWQVFEMFINSYNSNERFRLFEFFFWFCKKQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNQIAMGASIYPI

Query:  SSYKSSQYDISLLIWKDPKEGNWWMQFGNSYVLGYWPAFLFSYLTDSASMVEWGGEVVNSEADGEHTSTQMGSGHFPDEGFGKAGYFRNIQVVDGSNNLR
        SSYK SQYDISLLIWKDPKEGNWWMQFGN YVLGYWPAFLFSYLTDSASM+EWGGEVVNSE+DG+HTSTQMGSGHFP EGFGKAGYFRNIQ+V  SN+LR
Subjt:  SSYKSSQYDISLLIWKDPKEGNWWMQFGNSYVLGYWPAFLFSYLTDSASMVEWGGEVVNSEADGEHTSTQMGSGHFPDEGFGKAGYFRNIQVVDGSNNLR

Query:  QPEDIGTFTEQPSCYDVQTGKSGDWGNYFFYGGPGRNPNCP
         PEDIGTFTEQPSCYDV+ GKS DWGNYFFYGGPGRNPNCP
Subjt:  QPEDIGTFTEQPSCYDVQTGKSGDWGNYFFYGGPGRNPNCP

TrEMBL top hitse value%identityAlignment
A0A0A0KBL1 Uncharacterized protein1.8e-21181.84Show/hide
Query:  MGSLKL--LLLLMLVSFSVAAAGKPT-----RHRHRRLEAQAHLKKLNKPAVKSIKSPDGDIIDCVHMAHQPAFDHPLLKNHTIQMRPNFHPE-GILKDS
        MG LKL  L LLMLVS ++   G  T     RHR RRLE  +HLKKLNKPAVKSIKSPDGDIIDCV MAHQPAFDHPLLKNHTIQMRP FHPE GIL DS
Subjt:  MGSLKL--LLLLMLVSFSVAAAGKPT-----RHRHRRLEAQAHLKKLNKPAVKSIKSPDGDIIDCVHMAHQPAFDHPLLKNHTIQMRPNFHPE-GILKDS

Query:  KVSSKASKSNPITQLWHLKGRCPKGTIPIRRTKKDDILRANSVKSYGKKKPQATVKPTSIDIDLNGQTGHQHAIIYVEGGQYYGAKATINVWSPKIQQTN
        KVS K SKS  ITQLWHLKG+CPKGTIPIRRTKK+DILR NSVKSYGKKKP ATVKP SI++DLNGQ GHQHAIIYVEGGQYYGAKATINVWSPKIQQTN
Subjt:  KVSSKASKSNPITQLWHLKGRCPKGTIPIRRTKKDDILRANSVKSYGKKKPQATVKPTSIDIDLNGQTGHQHAIIYVEGGQYYGAKATINVWSPKIQQTN

Query:  EFSLSQIWILGGTFGEDLNSIEAGWQVFEMFINSYNSNERFRLFEFFFWFCKKQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNQIAMGA
        EFSLSQIWILGGTFG+DLNSIEAGW                            QVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINN+IAMGA
Subjt:  EFSLSQIWILGGTFGEDLNSIEAGWQVFEMFINSYNSNERFRLFEFFFWFCKKQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNQIAMGA

Query:  SIYPISSYKSSQYDISLLIWKDPKEGNWWMQFGNSYVLGYWPAFLFSYLTDSASMVEWGGEVVNSEADGEHTSTQMGSGHFPDEGFGKAGYFRNIQVVDG
        SI+PISSYKSSQYDISLLIWKDPKEGNWWMQFGN YVLGYWPAFLFSYLTDSASM+EWGGEVVNSE+DG+HTSTQMGSGHFP EGFGKAGYFRNIQ+V  
Subjt:  SIYPISSYKSSQYDISLLIWKDPKEGNWWMQFGNSYVLGYWPAFLFSYLTDSASMVEWGGEVVNSEADGEHTSTQMGSGHFPDEGFGKAGYFRNIQVVDG

Query:  SNNLRQPEDIGTFTEQPSCYDVQTGKSGDWGNYFFYGGPGRNPNCP
        SN+LR PEDIG FTEQPSCYDVQ GKS DWGNYFFYGGPGRNPNCP
Subjt:  SNNLRQPEDIGTFTEQPSCYDVQTGKSGDWGNYFFYGGPGRNPNCP

A0A1S3CE73 LOW QUALITY PROTEIN: uncharacterized protein LOC1034998721.8e-21182.55Show/hide
Query:  MGSLKL--LLLLMLVSFSV-AAAGKPT----RH-RHRRLEAQAHLKKLNKPAVKSIKSPDGDIIDCVHMAHQPAFDHPLLKNHTIQMRPNFHPE-GILKD
        MG LKL  L LLMLVS S+    GK T    RH RHRRLE  +HLKKLNKPAVKSIKSPDGDIIDCVHMAHQPAFDHPLLKNHTIQMRPNFHPE GIL +
Subjt:  MGSLKL--LLLLMLVSFSV-AAAGKPT----RH-RHRRLEAQAHLKKLNKPAVKSIKSPDGDIIDCVHMAHQPAFDHPLLKNHTIQMRPNFHPE-GILKD

Query:  SKVSSKASKSNPITQLWHLKGRCPKGTIPIRRTKKDDILRANSVKSYGKKKPQATVKPTSIDIDLNGQTGHQHAIIYVEGGQYYGAKATINVWSPKIQQT
        SKVS K SKS  ITQLWHLKG+CPKGTIPIRR KK+DILR NSVKSYGKKKP ATVKP SI+IDLNGQ GHQHAIIYVEGGQYYGAKATINVWSPKIQQT
Subjt:  SKVSSKASKSNPITQLWHLKGRCPKGTIPIRRTKKDDILRANSVKSYGKKKPQATVKPTSIDIDLNGQTGHQHAIIYVEGGQYYGAKATINVWSPKIQQT

Query:  NEFSLSQIWILGGTFGEDLNSIEAGWQVFEMFINSYNSNERFRLFEFFFWFCKKQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNQIAMG
        NEFSLSQIWILGGTFG+DLNSIEAGW                            QVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINN+IAMG
Subjt:  NEFSLSQIWILGGTFGEDLNSIEAGWQVFEMFINSYNSNERFRLFEFFFWFCKKQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNQIAMG

Query:  ASIYPISSYKSSQYDISLLIWKDPKEGNWWMQFGNSYVLGYWPAFLFSYLTDSASMVEWGGEVVNSEADGEHTSTQMGSGHFPDEGFGKAGYFRNIQVVD
        ASI+PISSYKSSQYDISLLIWKDPKEGNWWMQFGN YVLGYWPAFLFSYLTDSASM+EWGGEVVNSE+DG+HTSTQMGSGHFP EGFGKAGYFRNIQ+V 
Subjt:  ASIYPISSYKSSQYDISLLIWKDPKEGNWWMQFGNSYVLGYWPAFLFSYLTDSASMVEWGGEVVNSEADGEHTSTQMGSGHFPDEGFGKAGYFRNIQVVD

Query:  GSNNLRQPEDIGTFTEQPSCYDVQTGKSGDWGNYFFYGGPGRNPNCP
         SN+LR PEDIG FTEQPSCYDVQ GKS DWGNYFFYGGPGRNPNCP
Subjt:  GSNNLRQPEDIGTFTEQPSCYDVQTGKSGDWGNYFFYGGPGRNPNCP

A0A6J1DGY2 uncharacterized protein LOC1110203905.3e-21984.9Show/hide
Query:  LKLLLLLMLVSFSVA--AAGKPTRHRHRRLEAQAHLKKLNKPAVKSIKSPDGDIIDCVHMAHQPAFDHPLLKNHTIQMRPNFHPEGILKDSKVSSKASKS
        L LLL L L+S S+A  AA K +R RH RLEA  HLKKLNKP VKSIKSPDGDIIDCVHMAHQPAFDHPLL+NHTIQMRPNFHPEGI KD+KVSS  S+S
Subjt:  LKLLLLLMLVSFSVA--AAGKPTRHRHRRLEAQAHLKKLNKPAVKSIKSPDGDIIDCVHMAHQPAFDHPLLKNHTIQMRPNFHPEGILKDSKVSSKASKS

Query:  NPITQLWHLKGRCPKGTIPIRRTKKDDILRANSVKSYGKKKPQATVKPTSIDIDLNGQTGHQHAIIYVEGGQYYGAKATINVWSPKIQQTNEFSLSQIWI
         PITQLWHLKGRCP+GTIPIRRTKK DILRA+S+KSYGKKKPQATVKPTSIDIDLNGQTGHQHAIIYVEGG+YYGAKATINVWSPKIQQTNEFSLSQIWI
Subjt:  NPITQLWHLKGRCPKGTIPIRRTKKDDILRANSVKSYGKKKPQATVKPTSIDIDLNGQTGHQHAIIYVEGGQYYGAKATINVWSPKIQQTNEFSLSQIWI

Query:  LGGTFGEDLNSIEAGWQVFEMFINSYNSNERFRLFEFFFWFCKKQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNQIAMGASIYPISSYK
        LGGTFGEDLNSIEAGW                            QVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNQIAMGASIYPISSY+
Subjt:  LGGTFGEDLNSIEAGWQVFEMFINSYNSNERFRLFEFFFWFCKKQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNQIAMGASIYPISSYK

Query:  SSQYDISLLIWKDPKEGNWWMQFGNSYVLGYWPAFLFSYLTDSASMVEWGGEVVNSEADGEHTSTQMGSGHFPDEGFGKAGYFRNIQVVDGSNNLRQPED
        SSQYDISLLIWKDPKEGNWWMQFGNS+VLGYWPAFLFSYLTDSASM+EWGGEVVNSE+DG+HTSTQMGSGHFP+EGFGKAGYFRNIQVVDGSNNL+ PED
Subjt:  SSQYDISLLIWKDPKEGNWWMQFGNSYVLGYWPAFLFSYLTDSASMVEWGGEVVNSEADGEHTSTQMGSGHFPDEGFGKAGYFRNIQVVDGSNNLRQPED

Query:  IGTFTEQPSCYDVQTGKSGDWGNYFFYGGPGRNPNCP
        IGTFTEQPSCYDVQ GKSG+WGNYFFYGGPGRNPNCP
Subjt:  IGTFTEQPSCYDVQTGKSGDWGNYFFYGGPGRNPNCP

A0A6J1H5K4 uncharacterized protein LOC1114606891.5e-21883.71Show/hide
Query:  MGSLKLLLLLMLVSFSVAAAGKPTRHRH----RRLEAQAHLKKLNKPAVKSIKSPDGDIIDCVHMAHQPAFDHPLLKNHTIQMRPNFHPEGILKDSKVSS
        MG+LKLLLLL++     AAAGK + HRH    RR EAQ HLKKLNKPAVKSIKSPDGDIIDCVHMAHQPAFDHPLLKNHTIQMRPNFHPEG+L D+K+SS
Subjt:  MGSLKLLLLLMLVSFSVAAAGKPTRHRH----RRLEAQAHLKKLNKPAVKSIKSPDGDIIDCVHMAHQPAFDHPLLKNHTIQMRPNFHPEGILKDSKVSS

Query:  KASKSNPITQLWHLKGRCPKGTIPIRRTKKDDILRANSVKSYGKKKPQATVKPTSIDIDLNGQTGHQHAIIYVEGGQYYGAKATINVWSPKIQQTNEFSL
        K S+  PITQ WHLKGRCPKGTIPIRRT K DILRANSVK+YG+KKPQAT KPTSIDIDLNGQTGHQHAI YVEGGQYYGAKATINVWSPKIQ  NEFSL
Subjt:  KASKSNPITQLWHLKGRCPKGTIPIRRTKKDDILRANSVKSYGKKKPQATVKPTSIDIDLNGQTGHQHAIIYVEGGQYYGAKATINVWSPKIQQTNEFSL

Query:  SQIWILGGTFGEDLNSIEAGWQVFEMFINSYNSNERFRLFEFFFWFCKKQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNQIAMGASIYP
        SQIWILGGTFGEDLNSIEAGW                            QVSPDLYGDNNTR FTYWTSDAYQATGCYNLLCSGFVQINN+IA+GASIYP
Subjt:  SQIWILGGTFGEDLNSIEAGWQVFEMFINSYNSNERFRLFEFFFWFCKKQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNQIAMGASIYP

Query:  ISSYKSSQYDISLLIWKDPKEGNWWMQFGNSYVLGYWPAFLFSYLTDSASMVEWGGEVVNSEADGEHTSTQMGSGHFPDEGFGKAGYFRNIQVVDGSNNL
        ISSY+ SQYDISLLIWKDPKEGNWWMQFGNSYVLGYWPAFLFSYLTD ASMVEWGGEVVNSE+DG+HTSTQMGSGHFP EGFGKAGYFRNIQVVDGSNNL
Subjt:  ISSYKSSQYDISLLIWKDPKEGNWWMQFGNSYVLGYWPAFLFSYLTDSASMVEWGGEVVNSEADGEHTSTQMGSGHFPDEGFGKAGYFRNIQVVDGSNNL

Query:  RQPEDIGTFTEQPSCYDVQTGKSGDWGNYFFYGGPGRNPNCP
        RQPEDIGTFTEQPSCYDVQ GKSGDWGNYFFYGGPGRNPNCP
Subjt:  RQPEDIGTFTEQPSCYDVQTGKSGDWGNYFFYGGPGRNPNCP

A0A6J1KPT3 uncharacterized protein LOC1114976114.1e-21984.93Show/hide
Query:  MGSLKLLLLLMLVSFSVAAAGKPTRHRHRRLEAQAHLKKLNKPAVKSIKSPDGDIIDCVHMAHQPAFDHPLLKNHTIQMRPNFHPEGILKDSKVSSKASK
        MG+LKLLLLL +VS SVAAAGK + HR RR EAQ HLKKLNKPAVKSIKSPDGDIIDCVHMAHQPAFDHPLLKNHTIQMRPNFHPEG+L  +K+SSK S+
Subjt:  MGSLKLLLLLMLVSFSVAAAGKPTRHRHRRLEAQAHLKKLNKPAVKSIKSPDGDIIDCVHMAHQPAFDHPLLKNHTIQMRPNFHPEGILKDSKVSSKASK

Query:  SNPITQLWHLKGRCPKGTIPIRRTKKDDILRANSVKSYGKKKPQATVKPTSIDIDLNGQTGHQHAIIYVEGGQYYGAKATINVWSPKIQQTNEFSLSQIW
          PITQ WHLKGRCP+GTIPIRRT K DILRANSVKSYGKKKPQATV+PTSIDIDLNGQTGHQHAI YVEGGQYYGAKATINVWSPKIQ  NEFSLSQIW
Subjt:  SNPITQLWHLKGRCPKGTIPIRRTKKDDILRANSVKSYGKKKPQATVKPTSIDIDLNGQTGHQHAIIYVEGGQYYGAKATINVWSPKIQQTNEFSLSQIW

Query:  ILGGTFGEDLNSIEAGWQVFEMFINSYNSNERFRLFEFFFWFCKKQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNQIAMGASIYPISSY
        ILGGTFGEDLNSIEAGW                            QVSPDLYGDNNTR FTYWTSDAYQATGCYNLLCSGFVQINN+IA+GASIYPISSY
Subjt:  ILGGTFGEDLNSIEAGWQVFEMFINSYNSNERFRLFEFFFWFCKKQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNQIAMGASIYPISSY

Query:  KSSQYDISLLIWKDPKEGNWWMQFGNSYVLGYWPAFLFSYLTDSASMVEWGGEVVNSEADGEHTSTQMGSGHFPDEGFGKAGYFRNIQVVDGSNNLRQPE
        + SQYDISLLIWKDPKEGNWWMQFGNSYVLGYWPAFLFSYLTD ASMVEWGGEVVNSE+ G+HTSTQMGSGHFP EGFGKAGYFRNIQVVDGSNNLRQPE
Subjt:  KSSQYDISLLIWKDPKEGNWWMQFGNSYVLGYWPAFLFSYLTDSASMVEWGGEVVNSEADGEHTSTQMGSGHFPDEGFGKAGYFRNIQVVDGSNNLRQPE

Query:  DIGTFTEQPSCYDVQTGKSGDWGNYFFYGGPGRNPNCP
        DIGTFTEQPSCYDVQ GKSGDWGNYFFYGGPGRNPNCP
Subjt:  DIGTFTEQPSCYDVQTGKSGDWGNYFFYGGPGRNPNCP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G55360.1 Protein of Unknown Function (DUF239)1.6e-17566.51Show/hide
Query:  GSLKLLLLLMLVSFSVAAAGKPTRHRHRRLEAQAHLKKLNKPAVKSIKSPDGDIIDCVHMAHQPAFDHPLLKNHTIQMRPNFHPEGILKDSKVSSKAS--
        G L  L L    S S AA    ++   ++ E + HL +LNKPAVKSI+S DGD+IDCV ++ QPAFDHP LK+H IQM+PN+HPEG+  D+KVS+  S  
Subjt:  GSLKLLLLLMLVSFSVAAAGKPTRHRHRRLEAQAHLKKLNKPAVKSIKSPDGDIIDCVHMAHQPAFDHPLLKNHTIQMRPNFHPEGILKDSKVSSKAS--

Query:  KSNPITQLWHLKGRCPKGTIPIRRTKKDDILRANSVKSYGKKKPQATVKPTSIDIDLNGQTGHQHAIIYVEGGQYYGAKATINVWSPKIQQTNEFSLSQI
        K   I QLWH  G+C +GTIP+RRTK+DD+LRA+SVK YGKKK ++   P S + DL  Q+GHQHAI YVEG +YYGAKATINVW PKIQQ NEFSLSQI
Subjt:  KSNPITQLWHLKGRCPKGTIPIRRTKKDDILRANSVKSYGKKKPQATVKPTSIDIDLNGQTGHQHAIIYVEGGQYYGAKATINVWSPKIQQTNEFSLSQI

Query:  WILGGTFGEDLNSIEAGWQVFEMFINSYNSNERFRLFEFFFWFCKKQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNQIAMGASIYPISS
        W+LGG+FG+DLNSIEAGW                            QVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGF+QIN+ IAMGASI P+S 
Subjt:  WILGGTFGEDLNSIEAGWQVFEMFINSYNSNERFRLFEFFFWFCKKQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNQIAMGASIYPISS

Query:  YKSSQYDISLLIWKDPKEGNWWMQFGNSYVLGYWPAFLFSYLTDSASMVEWGGEVVNSEADGEHTSTQMGSGHFPDEGFGKAGYFRNIQVVDGSNNLRQP
        Y++SQYDIS+LIWKDPKEG+WWMQFGN YVLGYWP+FLFSYLT+SASM+EWGGEVVNS++DG+HTSTQMGSG FP+EGF KA YFRNIQVVDGSNNL+ P
Subjt:  YKSSQYDISLLIWKDPKEGNWWMQFGNSYVLGYWPAFLFSYLTDSASMVEWGGEVVNSEADGEHTSTQMGSGHFPDEGFGKAGYFRNIQVVDGSNNLRQP

Query:  EDIGTFTEQPSCYDVQTGKSGDWGNYFFYGGPGRNPNCP
        + +GTFTEQ +CYDVQTG + DWG+YF+YGGPG+N  CP
Subjt:  EDIGTFTEQPSCYDVQTGKSGDWGNYFFYGGPGRNPNCP

AT2G44210.1 Protein of Unknown Function (DUF239)2.0e-15762.14Show/hide
Query:  LEAQAHLKKLNKPAVKSIKSPDGDIIDCVHMAHQPAFDHPLLKNHTIQMRPNFHPEGILKDSKVSSKA--SKSNPITQLWHLKGRCPKGTIPIRRTKKDD
        L+ + HLK+LNKPA+KSIKSPDGD+IDCV +  QPAF HPLL NHT+QM P+ +PE +  +SKVSSK    +SN I QLWH+ G+CPK TIPIRRT++ D
Subjt:  LEAQAHLKKLNKPAVKSIKSPDGDIIDCVHMAHQPAFDHPLLKNHTIQMRPNFHPEGILKDSKVSSKA--SKSNPITQLWHLKGRCPKGTIPIRRTKKDD

Query:  ILRANSVKSYGKKKPQATVKPTSID-IDLNGQTGHQHAIIYVEGGQYYGAKATINVWSPKIQQTNEFSLSQIWILGGTFGEDLNSIEAGWQVFEMFINSY
        + RA+SV++YG K  ++  KP S +  ++  Q GHQHAI+YVE G +YGAKA INVW P ++  NEFSL+QIW+LGG F  DLNSIEAGW          
Subjt:  ILRANSVKSYGKKKPQATVKPTSID-IDLNGQTGHQHAIIYVEGGQYYGAKATINVWSPKIQQTNEFSLSQIWILGGTFGEDLNSIEAGWQVFEMFINSY

Query:  NSNERFRLFEFFFWFCKKQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNQIAMGASIYPISSYKSSQYDISLLIWKDPKEGNWWMQFGNS
                          QVSP LYGDN TRLFTYWTSDAYQ TGCYNLLCSGFVQIN +IAMG SI P+S+Y +SQYDI++LIWKDPKEG+WW+QFG  
Subjt:  NSNERFRLFEFFFWFCKKQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNQIAMGASIYPISSYKSSQYDISLLIWKDPKEGNWWMQFGNS

Query:  YVLGYWPAFLFSYLTDSASMVEWGGEVVNSEA-DGEHTSTQMGSGHFPDEGFGKAGYFRNIQVVDGSNNLRQPEDIGTFTEQPSCYDVQTGKSGDWGNYF
        Y++GYWPA LFSYL++SASM+EWGGEVVNS++ +G+HT+TQMGSG F +EG+GKA YF+N+QVVDGSN LR PE++  FT+Q +CY+V++G  G WG+YF
Subjt:  YVLGYWPAFLFSYLTDSASMVEWGGEVVNSEA-DGEHTSTQMGSGHFPDEGFGKAGYFRNIQVVDGSNNLRQPEDIGTFTEQPSCYDVQTGKSGDWGNYF

Query:  FYGGPGRNPNCP
        +YGGPGRNPNCP
Subjt:  FYGGPGRNPNCP

AT3G13510.1 Protein of Unknown Function (DUF239)6.1e-17566.13Show/hide
Query:  LLMLVSFSVAAAGKPTRHRHRRLEAQAHLKKLNKPAVKSIKSPDGDIIDCVHMAHQPAFDHPLLKNHTIQMRPNFHPEGILKDSKVSSK-ASKSNPITQL
        L +++S S AAA   +    ++ E + HL +LNKP VK+I+SPDGDIIDC+ ++ QPAFDHP LK+H IQMRP++HPEG+  D+KVS++   K   I QL
Subjt:  LLMLVSFSVAAAGKPTRHRHRRLEAQAHLKKLNKPAVKSIKSPDGDIIDCVHMAHQPAFDHPLLKNHTIQMRPNFHPEGILKDSKVSSK-ASKSNPITQL

Query:  WHLKGRCPKGTIPIRRTKKDDILRANSVKSYGKKKPQATVKPTSIDIDLNGQTGHQHAIIYVEGGQYYGAKATINVWSPKIQQTNEFSLSQIWILGGTFG
        WH  G+C +GTIP+RRT++DD+LRA+SVK YGKKK ++   P S + DL  Q GHQHAI YVEG +YYGAKAT+NVW PKIQ TNEFSLSQIW+LGG+FG
Subjt:  WHLKGRCPKGTIPIRRTKKDDILRANSVKSYGKKKPQATVKPTSIDIDLNGQTGHQHAIIYVEGGQYYGAKATINVWSPKIQQTNEFSLSQIWILGGTFG

Query:  EDLNSIEAGWQVFEMFINSYNSNERFRLFEFFFWFCKKQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNQIAMGASIYPISSYKSSQYDI
        +DLNSIEAGW                            QVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGF+QIN+ IAMGASI P+S Y++SQYDI
Subjt:  EDLNSIEAGWQVFEMFINSYNSNERFRLFEFFFWFCKKQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNQIAMGASIYPISSYKSSQYDI

Query:  SLLIWKDPKEGNWWMQFGNSYVLGYWPAFLFSYLTDSASMVEWGGEVVNSEADGEHTSTQMGSGHFPDEGFGKAGYFRNIQVVDGSNNLRQPEDIGTFTE
        S+LIWKDPKEG+WWMQFGN YVLGYWP+FLFSYLT+SASM+EWGGEVVNS+++G HT TQMGSGHFP+EGF KA YFRNIQVVDGSNNL+ P+ +GTFTE
Subjt:  SLLIWKDPKEGNWWMQFGNSYVLGYWPAFLFSYLTDSASMVEWGGEVVNSEADGEHTSTQMGSGHFPDEGFGKAGYFRNIQVVDGSNNLRQPEDIGTFTE

Query:  QPSCYDVQTGKSGDWGNYFFYGGPGRNPNCP
        + +CYDVQTG + DWG+YF+YGGPG+N NCP
Subjt:  QPSCYDVQTGKSGDWGNYFFYGGPGRNPNCP

AT5G56530.1 Protein of Unknown Function (DUF239)2.0e-17367.38Show/hide
Query:  SVAAAGKPTRHRHRRLEAQAHLKKLNKPAVKSIKSPDGDIIDCVHMAHQPAFDHPLLKNHTIQMRPNFHPEGILKDSKVSSKASKS-NPITQLWHLKGRC
        S+  AG+ +  R +  E   HL +LNKPAVKSI+SPDGDIIDCVH++ QPAFDHP LK+H IQM P++ PE +  +SKVS K  +S NPITQLWH  G C
Subjt:  SVAAAGKPTRHRHRRLEAQAHLKKLNKPAVKSIKSPDGDIIDCVHMAHQPAFDHPLLKNHTIQMRPNFHPEGILKDSKVSSKASKS-NPITQLWHLKGRC

Query:  PKGTIPIRRTKKDDILRANSVKSYGKKKPQATVKPTSIDIDLNGQTGHQHAIIYVEGGQYYGAKATINVWSPKIQQTNEFSLSQIWILGGTFGEDLNSIE
         +GTIP+RRTKK+D+LRA+SVK YGKKK  +   P S D DL  Q+GHQHAI YVEGG++YGAKATINVW PK+Q +NEFSLSQ+WILGG+FG+DLNSIE
Subjt:  PKGTIPIRRTKKDDILRANSVKSYGKKKPQATVKPTSIDIDLNGQTGHQHAIIYVEGGQYYGAKATINVWSPKIQQTNEFSLSQIWILGGTFGEDLNSIE

Query:  AGWQVFEMFINSYNSNERFRLFEFFFWFCKKQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNQIAMGASIYPISSYKSSQYDISLLIWKD
        AGW                            QVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGF+QIN+QIAMGASI P+S + + QYDIS+ IWKD
Subjt:  AGWQVFEMFINSYNSNERFRLFEFFFWFCKKQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNQIAMGASIYPISSYKSSQYDISLLIWKD

Query:  PKEGNWWMQFGNSYVLGYWPAFLFSYLTDSASMVEWGGEVVNSEADGEHTSTQMGSGHFPDEGFGKAGYFRNIQVVDGSNNLRQPEDIGTFTEQPSCYDV
        PKEG+WWMQFG+ YVLGYWP+FLFSYL DSAS+VEWGGEVVN E DG HT+TQMGSG FPDEGF KA YFRNIQVVD SNNL++P+ + TFTE+ +CYDV
Subjt:  PKEGNWWMQFGNSYVLGYWPAFLFSYLTDSASMVEWGGEVVNSEADGEHTSTQMGSGHFPDEGFGKAGYFRNIQVVDGSNNLRQPEDIGTFTEQPSCYDV

Query:  QTGKSGDWGNYFFYGGPGRNPNC
        + GK+ DWG+YF+YGGPGRNPNC
Subjt:  QTGKSGDWGNYFFYGGPGRNPNC

AT5G56530.2 Protein of Unknown Function (DUF239)2.0e-17367.38Show/hide
Query:  SVAAAGKPTRHRHRRLEAQAHLKKLNKPAVKSIKSPDGDIIDCVHMAHQPAFDHPLLKNHTIQMRPNFHPEGILKDSKVSSKASKS-NPITQLWHLKGRC
        S+  AG+ +  R +  E   HL +LNKPAVKSI+SPDGDIIDCVH++ QPAFDHP LK+H IQM P++ PE +  +SKVS K  +S NPITQLWH  G C
Subjt:  SVAAAGKPTRHRHRRLEAQAHLKKLNKPAVKSIKSPDGDIIDCVHMAHQPAFDHPLLKNHTIQMRPNFHPEGILKDSKVSSKASKS-NPITQLWHLKGRC

Query:  PKGTIPIRRTKKDDILRANSVKSYGKKKPQATVKPTSIDIDLNGQTGHQHAIIYVEGGQYYGAKATINVWSPKIQQTNEFSLSQIWILGGTFGEDLNSIE
         +GTIP+RRTKK+D+LRA+SVK YGKKK  +   P S D DL  Q+GHQHAI YVEGG++YGAKATINVW PK+Q +NEFSLSQ+WILGG+FG+DLNSIE
Subjt:  PKGTIPIRRTKKDDILRANSVKSYGKKKPQATVKPTSIDIDLNGQTGHQHAIIYVEGGQYYGAKATINVWSPKIQQTNEFSLSQIWILGGTFGEDLNSIE

Query:  AGWQVFEMFINSYNSNERFRLFEFFFWFCKKQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNQIAMGASIYPISSYKSSQYDISLLIWKD
        AGW                            QVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGF+QIN+QIAMGASI P+S + + QYDIS+ IWKD
Subjt:  AGWQVFEMFINSYNSNERFRLFEFFFWFCKKQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNQIAMGASIYPISSYKSSQYDISLLIWKD

Query:  PKEGNWWMQFGNSYVLGYWPAFLFSYLTDSASMVEWGGEVVNSEADGEHTSTQMGSGHFPDEGFGKAGYFRNIQVVDGSNNLRQPEDIGTFTEQPSCYDV
        PKEG+WWMQFG+ YVLGYWP+FLFSYL DSAS+VEWGGEVVN E DG HT+TQMGSG FPDEGF KA YFRNIQVVD SNNL++P+ + TFTE+ +CYDV
Subjt:  PKEGNWWMQFGNSYVLGYWPAFLFSYLTDSASMVEWGGEVVNSEADGEHTSTQMGSGHFPDEGFGKAGYFRNIQVVDGSNNLRQPEDIGTFTEQPSCYDV

Query:  QTGKSGDWGNYFFYGGPGRNPNC
        + GK+ DWG+YF+YGGPGRNPNC
Subjt:  QTGKSGDWGNYFFYGGPGRNPNC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGTTCTCTAAAGCTGCTGCTCCTCTTAATGCTCGTTTCTTTTTCAGTCGCGGCGGCCGGAAAACCCACCCGCCACCGTCACCGGCGGCTCGAAGCTCAGGCCCATCT
GAAGAAGCTCAATAAACCTGCTGTTAAGTCCATTAAGAGTCCAGATGGGGATATAATTGATTGTGTCCATATGGCTCACCAGCCAGCTTTTGATCATCCTCTTCTCAAAA
ACCACACAATTCAGATGAGACCAAATTTTCATCCAGAGGGGATTTTGAAGGACAGTAAAGTGTCTTCAAAAGCTTCAAAATCAAATCCCATAACTCAATTATGGCACTTG
AAAGGCAGATGTCCAAAAGGGACAATTCCCATTAGAAGAACAAAAAAAGATGACATTTTGAGAGCAAACTCTGTGAAAAGCTATGGCAAAAAGAAGCCTCAAGCCACTGT
GAAACCAACCTCCATTGATATTGATCTCAATGGCCAAACGGGACATCAGCATGCAATAATATATGTTGAAGGAGGACAATACTATGGAGCTAAGGCAACTATAAACGTTT
GGTCTCCCAAAATCCAACAGACAAACGAATTTAGCCTCTCGCAGATCTGGATTCTTGGAGGAACTTTTGGGGAAGATCTTAATAGCATTGAAGCTGGTTGGCAGGTCTTT
GAAATGTTCATAAATTCTTATAATTCAAATGAAAGATTTCGATTATTTGAGTTTTTTTTTTGGTTTTGTAAAAAACAGGTCAGCCCTGATTTGTATGGAGATAACAACAC
TCGACTTTTCACTTATTGGACTAGTGATGCATATCAAGCTACTGGCTGCTACAATCTCCTCTGTTCTGGGTTTGTTCAAATCAATAATCAAATAGCCATGGGTGCCAGCA
TTTATCCCATTTCTTCTTACAAAAGTTCTCAATATGACATTAGCTTGCTCATCTGGAAGGACCCTAAAGAAGGAAACTGGTGGATGCAATTCGGAAACAGCTACGTATTG
GGTTACTGGCCGGCGTTCTTGTTCTCATACCTCACCGACAGCGCCTCCATGGTCGAGTGGGGCGGTGAGGTCGTCAACTCCGAAGCCGACGGCGAACACACTTCCACTCA
GATGGGCAGCGGCCACTTCCCTGACGAGGGCTTCGGCAAGGCCGGCTACTTCCGAAATATTCAGGTGGTTGACGGATCGAACAACCTCCGGCAGCCGGAGGACATAGGAA
CTTTCACAGAGCAGCCCAGTTGCTACGACGTTCAGACCGGGAAGTCCGGCGACTGGGGCAATTACTTCTTCTACGGCGGGCCGGGCAGAAATCCAAACTGCCCGTGA
mRNA sequenceShow/hide mRNA sequence
CACAAAATGCCATAGGACAATTGTCCTTAATTAAAAAAAACAAATTACAATTTAAAAATTTTCCCTCTAATTTACTTCCAAATCTCTTAACATAGAAAAAATTATATAAA
TAAAAACTATGCTCTAAAATTTTCTCACTCATTGTGAAAATTTCTGGGCAGCCACACCATAATTCTTACAATAATATTAAATCTCTCTCCTCTTTGCTAAGTGCTAAACA
AATTTATTCTGAAAATTAGCCACTGCCCCCATTATAACACAATCTCAGCCAGCCAATTTTGCTTTTCAATCCCTAAAAGTTCTCCAACTTTTCCACTCAAAATTCAGTGA
AATGGGTTCTCTAAAGCTGCTGCTCCTCTTAATGCTCGTTTCTTTTTCAGTCGCGGCGGCCGGAAAACCCACCCGCCACCGTCACCGGCGGCTCGAAGCTCAGGCCCATC
TGAAGAAGCTCAATAAACCTGCTGTTAAGTCCATTAAGAGTCCAGATGGGGATATAATTGATTGTGTCCATATGGCTCACCAGCCAGCTTTTGATCATCCTCTTCTCAAA
AACCACACAATTCAGATGAGACCAAATTTTCATCCAGAGGGGATTTTGAAGGACAGTAAAGTGTCTTCAAAAGCTTCAAAATCAAATCCCATAACTCAATTATGGCACTT
GAAAGGCAGATGTCCAAAAGGGACAATTCCCATTAGAAGAACAAAAAAAGATGACATTTTGAGAGCAAACTCTGTGAAAAGCTATGGCAAAAAGAAGCCTCAAGCCACTG
TGAAACCAACCTCCATTGATATTGATCTCAATGGCCAAACGGGACATCAGCATGCAATAATATATGTTGAAGGAGGACAATACTATGGAGCTAAGGCAACTATAAACGTT
TGGTCTCCCAAAATCCAACAGACAAACGAATTTAGCCTCTCGCAGATCTGGATTCTTGGAGGAACTTTTGGGGAAGATCTTAATAGCATTGAAGCTGGTTGGCAGGTCTT
TGAAATGTTCATAAATTCTTATAATTCAAATGAAAGATTTCGATTATTTGAGTTTTTTTTTTGGTTTTGTAAAAAACAGGTCAGCCCTGATTTGTATGGAGATAACAACA
CTCGACTTTTCACTTATTGGACTAGTGATGCATATCAAGCTACTGGCTGCTACAATCTCCTCTGTTCTGGGTTTGTTCAAATCAATAATCAAATAGCCATGGGTGCCAGC
ATTTATCCCATTTCTTCTTACAAAAGTTCTCAATATGACATTAGCTTGCTCATCTGGAAGGACCCTAAAGAAGGAAACTGGTGGATGCAATTCGGAAACAGCTACGTATT
GGGTTACTGGCCGGCGTTCTTGTTCTCATACCTCACCGACAGCGCCTCCATGGTCGAGTGGGGCGGTGAGGTCGTCAACTCCGAAGCCGACGGCGAACACACTTCCACTC
AGATGGGCAGCGGCCACTTCCCTGACGAGGGCTTCGGCAAGGCCGGCTACTTCCGAAATATTCAGGTGGTTGACGGATCGAACAACCTCCGGCAGCCGGAGGACATAGGA
ACTTTCACAGAGCAGCCCAGTTGCTACGACGTTCAGACCGGGAAGTCCGGCGACTGGGGCAATTACTTCTTCTACGGCGGGCCGGGCAGAAATCCAAACTGCCCGTGATT
TGATTTTCTCTCTCACTCTAAATATATAAATACATATATATATTTTTATATATGCATTTCTTGTGCATTATTTTTAGTGGGTATCTATTGAGAGATGTTTAACTGGTGAT
GATAAAGCTTTTCTCTCTCTATCTCTCTCTTCAAATCAACTTTGGGAGAGAGAGAGAGAGAAGTTGATTTGTTTGTTTCAAAAGCTTAGAAGCAAAAAACAAGAGCTGAT
GATTTGTTAATTTTATCTCTTTGGAAATGTGTATAAGTGTGTTTAATTTGATTCTTCTCTTCATTCTATCTTTTGCAACACTTTCTTAAACCCTTTGTACGTAGGACCAA
AATCTTTTTTATTGTCTTTAATTCATAGCATTACATATAGTGGAATTTTCATCTCATTTTGGAGGGGTTGATATATAATGGATACAAATGTTTATGTTTATT
Protein sequenceShow/hide protein sequence
MGSLKLLLLLMLVSFSVAAAGKPTRHRHRRLEAQAHLKKLNKPAVKSIKSPDGDIIDCVHMAHQPAFDHPLLKNHTIQMRPNFHPEGILKDSKVSSKASKSNPITQLWHL
KGRCPKGTIPIRRTKKDDILRANSVKSYGKKKPQATVKPTSIDIDLNGQTGHQHAIIYVEGGQYYGAKATINVWSPKIQQTNEFSLSQIWILGGTFGEDLNSIEAGWQVF
EMFINSYNSNERFRLFEFFFWFCKKQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNQIAMGASIYPISSYKSSQYDISLLIWKDPKEGNWWMQFGNSYVL
GYWPAFLFSYLTDSASMVEWGGEVVNSEADGEHTSTQMGSGHFPDEGFGKAGYFRNIQVVDGSNNLRQPEDIGTFTEQPSCYDVQTGKSGDWGNYFFYGGPGRNPNCP