; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0006908 (gene) of Snake gourd v1 genome

Gene IDTan0006908
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionProtein of Unknown Function (DUF239)
Genome locationLG06:3858163..3862007
RNA-Seq ExpressionTan0006908
SyntenyTan0006908
Gene Ontology termsGO:0016874 - ligase activity (molecular function)
InterPro domainsIPR004314 - Neprosin
IPR025521 - Neprosin activation peptide


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6593345.1 hypothetical protein SDJN03_12821, partial [Cucurbita argyrosperma subsp. sororia]5.6e-22590.87Show/hide
Query:  MGALKLLLLLLVSFSVAA---AGKTSRHRH----RRLEAQAHLKKLNKPAVKSIKSPDGDIIDCVHMAHQPAFDHPLLKNHTIQMRPNFHPEGIFSDNKL
        MGALKLLLLL+VS SVAA   AGK+S HRH    RR EAQ HLKKLNKPAVKSIKSPDGDIIDCVHMAHQPAFDHPLLKNHTIQMRPNFHPEG+ SDNK+
Subjt:  MGALKLLLLLLVSFSVAA---AGKTSRHRH----RRLEAQAHLKKLNKPAVKSIKSPDGDIIDCVHMAHQPAFDHPLLKNHTIQMRPNFHPEGIFSDNKL

Query:  SSKASKSKGITQLWHLKGRCPKGTIPIRRTKKDDILRANSVKSYGKKKPQATVKPTSIDIDLNGQTGHQHAIIYVEGGQYYGAKATINVWSPKIQQTNEF
        SSK S+ K ITQ WHLKGRCP+GTIPIRRT K DILRANSVK+YGKKKPQAT KPTSIDIDLNGQTGHQHAI YVEGGQYYGAKATINVWSPKIQ  NEF
Subjt:  SSKASKSKGITQLWHLKGRCPKGTIPIRRTKKDDILRANSVKSYGKKKPQATVKPTSIDIDLNGQTGHQHAIIYVEGGQYYGAKATINVWSPKIQQTNEF

Query:  SLSQIWILGGTFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNEIAMGASIYPISSYRSSQYDISLLIWKDRKEGNWWM
        SLSQIWILGGTFGEDLNSIEAGWQVSPDLYGDNNTR FTYWTSDAYQATGCYNLLCSGFVQINNEIA+GASIYPISSYR SQYDISLLIWKD KEGNWWM
Subjt:  SLSQIWILGGTFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNEIAMGASIYPISSYRSSQYDISLLIWKDRKEGNWWM

Query:  QFGNNYVLGYWPAFLFSYLTDSASMIEWGGEVVNSEADGQHTSTQMGSGHFPDEGFGKAGYFRNIQVVDGSNNLRQPEDIGTFTEQPSCYDVQNGKSGDW
        QFGN+YVLGYWPAFLFSYLTD ASM+EWGGEVVNSE+DGQHTSTQMGSGHFP EGFGKAGYFRNIQVVDGSNNLRQPEDIGTFTEQPSCYDVQNGKSGDW
Subjt:  QFGNNYVLGYWPAFLFSYLTDSASMIEWGGEVVNSEADGQHTSTQMGSGHFPDEGFGKAGYFRNIQVVDGSNNLRQPEDIGTFTEQPSCYDVQNGKSGDW

Query:  GNYFFYGGPGRNPNCP
        GNYFFYGGPGRNPNCP
Subjt:  GNYFFYGGPGRNPNCP

XP_022152744.1 uncharacterized protein LOC111020390 [Momordica charantia]5.2e-22391.63Show/hide
Query:  LLLLLLVSFSVA--AAGKTSRHRHRRLEAQAHLKKLNKPAVKSIKSPDGDIIDCVHMAHQPAFDHPLLKNHTIQMRPNFHPEGIFSDNKLSSKASKSKGI
        LL L L+S S+A  AA KTSR RH RLEA  HLKKLNKP VKSIKSPDGDIIDCVHMAHQPAFDHPLL+NHTIQMRPNFHPEGIF DNK+SS  S+SK I
Subjt:  LLLLLLVSFSVA--AAGKTSRHRHRRLEAQAHLKKLNKPAVKSIKSPDGDIIDCVHMAHQPAFDHPLLKNHTIQMRPNFHPEGIFSDNKLSSKASKSKGI

Query:  TQLWHLKGRCPKGTIPIRRTKKDDILRANSVKSYGKKKPQATVKPTSIDIDLNGQTGHQHAIIYVEGGQYYGAKATINVWSPKIQQTNEFSLSQIWILGG
        TQLWHLKGRCP+GTIPIRRTKK DILRA+S+KSYGKKKPQATVKPTSIDIDLNGQTGHQHAIIYVEGG+YYGAKATINVWSPKIQQTNEFSLSQIWILGG
Subjt:  TQLWHLKGRCPKGTIPIRRTKKDDILRANSVKSYGKKKPQATVKPTSIDIDLNGQTGHQHAIIYVEGGQYYGAKATINVWSPKIQQTNEFSLSQIWILGG

Query:  TFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNEIAMGASIYPISSYRSSQYDISLLIWKDRKEGNWWMQFGNNYVLGY
        TFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINN+IAMGASIYPISSYRSSQYDISLLIWKD KEGNWWMQFGN++VLGY
Subjt:  TFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNEIAMGASIYPISSYRSSQYDISLLIWKDRKEGNWWMQFGNNYVLGY

Query:  WPAFLFSYLTDSASMIEWGGEVVNSEADGQHTSTQMGSGHFPDEGFGKAGYFRNIQVVDGSNNLRQPEDIGTFTEQPSCYDVQNGKSGDWGNYFFYGGPG
        WPAFLFSYLTDSASMIEWGGEVVNSE+DGQHTSTQMGSGHFP+EGFGKAGYFRNIQVVDGSNNL+ PEDIGTFTEQPSCYDVQNGKSG+WGNYFFYGGPG
Subjt:  WPAFLFSYLTDSASMIEWGGEVVNSEADGQHTSTQMGSGHFPDEGFGKAGYFRNIQVVDGSNNLRQPEDIGTFTEQPSCYDVQNGKSGDWGNYFFYGGPG

Query:  RNPNCP
        RNPNCP
Subjt:  RNPNCP

XP_022959691.1 uncharacterized protein LOC111460689 [Cucurbita moschata]2.5e-22591.3Show/hide
Query:  MGALKLLLLLLVSFSV-AAAGKTSRHRH----RRLEAQAHLKKLNKPAVKSIKSPDGDIIDCVHMAHQPAFDHPLLKNHTIQMRPNFHPEGIFSDNKLSS
        MGALKLLLLL+VS SV AAAGK+S HRH    RR EAQ HLKKLNKPAVKSIKSPDGDIIDCVHMAHQPAFDHPLLKNHTIQMRPNFHPEG+ SDNK+SS
Subjt:  MGALKLLLLLLVSFSV-AAAGKTSRHRH----RRLEAQAHLKKLNKPAVKSIKSPDGDIIDCVHMAHQPAFDHPLLKNHTIQMRPNFHPEGIFSDNKLSS

Query:  KASKSKGITQLWHLKGRCPKGTIPIRRTKKDDILRANSVKSYGKKKPQATVKPTSIDIDLNGQTGHQHAIIYVEGGQYYGAKATINVWSPKIQQTNEFSL
        K S+ K ITQ WHLKGRCPKGTIPIRRT K DILRANSVK+YG+KKPQAT KPTSIDIDLNGQTGHQHAI YVEGGQYYGAKATINVWSPKIQ  NEFSL
Subjt:  KASKSKGITQLWHLKGRCPKGTIPIRRTKKDDILRANSVKSYGKKKPQATVKPTSIDIDLNGQTGHQHAIIYVEGGQYYGAKATINVWSPKIQQTNEFSL

Query:  SQIWILGGTFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNEIAMGASIYPISSYRSSQYDISLLIWKDRKEGNWWMQF
        SQIWILGGTFGEDLNSIEAGWQVSPDLYGDNNTR FTYWTSDAYQATGCYNLLCSGFVQINNEIA+GASIYPISSYR SQYDISLLIWKD KEGNWWMQF
Subjt:  SQIWILGGTFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNEIAMGASIYPISSYRSSQYDISLLIWKDRKEGNWWMQF

Query:  GNNYVLGYWPAFLFSYLTDSASMIEWGGEVVNSEADGQHTSTQMGSGHFPDEGFGKAGYFRNIQVVDGSNNLRQPEDIGTFTEQPSCYDVQNGKSGDWGN
        GN+YVLGYWPAFLFSYLTD ASM+EWGGEVVNSE+DGQHTSTQMGSGHFP EGFGKAGYFRNIQVVDGSNNLRQPEDIGTFTEQPSCYDVQNGKSGDWGN
Subjt:  GNNYVLGYWPAFLFSYLTDSASMIEWGGEVVNSEADGQHTSTQMGSGHFPDEGFGKAGYFRNIQVVDGSNNLRQPEDIGTFTEQPSCYDVQNGKSGDWGN

Query:  YFFYGGPGRNPNCP
        YFFYGGPGRNPNCP
Subjt:  YFFYGGPGRNPNCP

XP_023004212.1 uncharacterized protein LOC111497611 [Cucurbita maxima]6.6e-22691.93Show/hide
Query:  MGALKLLLLLLVSFSVAAAGKTSRHRHRRLEAQAHLKKLNKPAVKSIKSPDGDIIDCVHMAHQPAFDHPLLKNHTIQMRPNFHPEGIFSDNKLSSKASKS
        MGALKLLLLL+VS SVAAAGK+S HR RR EAQ HLKKLNKPAVKSIKSPDGDIIDCVHMAHQPAFDHPLLKNHTIQMRPNFHPEG+ S NK+SSK S+ 
Subjt:  MGALKLLLLLLVSFSVAAAGKTSRHRHRRLEAQAHLKKLNKPAVKSIKSPDGDIIDCVHMAHQPAFDHPLLKNHTIQMRPNFHPEGIFSDNKLSSKASKS

Query:  KGITQLWHLKGRCPKGTIPIRRTKKDDILRANSVKSYGKKKPQATVKPTSIDIDLNGQTGHQHAIIYVEGGQYYGAKATINVWSPKIQQTNEFSLSQIWI
        K ITQ WHLKGRCP+GTIPIRRT K DILRANSVKSYGKKKPQATV+PTSIDIDLNGQTGHQHAI YVEGGQYYGAKATINVWSPKIQ  NEFSLSQIWI
Subjt:  KGITQLWHLKGRCPKGTIPIRRTKKDDILRANSVKSYGKKKPQATVKPTSIDIDLNGQTGHQHAIIYVEGGQYYGAKATINVWSPKIQQTNEFSLSQIWI

Query:  LGGTFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNEIAMGASIYPISSYRSSQYDISLLIWKDRKEGNWWMQFGNNYV
        LGGTFGEDLNSIEAGWQVSPDLYGDNNTR FTYWTSDAYQATGCYNLLCSGFVQINNEIA+GASIYPISSYR SQYDISLLIWKD KEGNWWMQFGN+YV
Subjt:  LGGTFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNEIAMGASIYPISSYRSSQYDISLLIWKDRKEGNWWMQFGNNYV

Query:  LGYWPAFLFSYLTDSASMIEWGGEVVNSEADGQHTSTQMGSGHFPDEGFGKAGYFRNIQVVDGSNNLRQPEDIGTFTEQPSCYDVQNGKSGDWGNYFFYG
        LGYWPAFLFSYLTD ASM+EWGGEVVNSE+ GQHTSTQMGSGHFP EGFGKAGYFRNIQVVDGSNNLRQPEDIGTFTEQPSCYDVQNGKSGDWGNYFFYG
Subjt:  LGYWPAFLFSYLTDSASMIEWGGEVVNSEADGQHTSTQMGSGHFPDEGFGKAGYFRNIQVVDGSNNLRQPEDIGTFTEQPSCYDVQNGKSGDWGNYFFYG

Query:  GPGRNPNCP
        GPGRNPNCP
Subjt:  GPGRNPNCP

XP_023514011.1 uncharacterized protein LOC111778432 [Cucurbita pepo subsp. pepo]1.6e-22491.3Show/hide
Query:  MGALKLLLLLLVSFSV-AAAGKTSRHRH----RRLEAQAHLKKLNKPAVKSIKSPDGDIIDCVHMAHQPAFDHPLLKNHTIQMRPNFHPEGIFSDNKLSS
        MGALKLLLLL+VS SV AAAGK+S HRH    RR EAQ HLKKLNKPAVKSIKSPDGDIIDCVHMAHQPA DHPLLKNHTIQMRPNFHPEG+ SDNK+SS
Subjt:  MGALKLLLLLLVSFSV-AAAGKTSRHRH----RRLEAQAHLKKLNKPAVKSIKSPDGDIIDCVHMAHQPAFDHPLLKNHTIQMRPNFHPEGIFSDNKLSS

Query:  KASKSKGITQLWHLKGRCPKGTIPIRRTKKDDILRANSVKSYGKKKPQATVKPTSIDIDLNGQTGHQHAIIYVEGGQYYGAKATINVWSPKIQQTNEFSL
        K S+ K ITQ WHLKGRCPKGTIPIRRT K DILRANSVK+YGKKKPQATVKPTSIDIDLNGQTGHQHAI YVEGGQYYGAKATINVWSPKIQ  NEFSL
Subjt:  KASKSKGITQLWHLKGRCPKGTIPIRRTKKDDILRANSVKSYGKKKPQATVKPTSIDIDLNGQTGHQHAIIYVEGGQYYGAKATINVWSPKIQQTNEFSL

Query:  SQIWILGGTFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNEIAMGASIYPISSYRSSQYDISLLIWKDRKEGNWWMQF
        SQIWILGGTFGEDLNSIEAGWQVSPDLYGDNNTR FTYWTSDAYQATGCYNLLCS FVQINNEIA+GASIYPISSYR SQYDISLLIWKD KEGNWWMQF
Subjt:  SQIWILGGTFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNEIAMGASIYPISSYRSSQYDISLLIWKDRKEGNWWMQF

Query:  GNNYVLGYWPAFLFSYLTDSASMIEWGGEVVNSEADGQHTSTQMGSGHFPDEGFGKAGYFRNIQVVDGSNNLRQPEDIGTFTEQPSCYDVQNGKSGDWGN
        GN+YVLGYWPAFLFSYLTD ASM+EWGGEVVNSE+DGQHTSTQMGSGHFP EGFGKAGYFRNIQVVDGSNNLRQPEDIGTFTEQPSCYDVQNGKSGDWGN
Subjt:  GNNYVLGYWPAFLFSYLTDSASMIEWGGEVVNSEADGQHTSTQMGSGHFPDEGFGKAGYFRNIQVVDGSNNLRQPEDIGTFTEQPSCYDVQNGKSGDWGN

Query:  YFFYGGPGRNPNCP
        YFFYGGPGRNPNCP
Subjt:  YFFYGGPGRNPNCP

TrEMBL top hitse value%identityAlignment
A0A0A0KBL1 Uncharacterized protein3.3e-21587.32Show/hide
Query:  MGALKLL---LLLLVSFSV-AAAGKTSRHRH----RRLEAQAHLKKLNKPAVKSIKSPDGDIIDCVHMAHQPAFDHPLLKNHTIQMRPNFHPE-GIFSDN
        MG LKLL   LL+LVS ++    GKT+ HRH    RRLE  +HLKKLNKPAVKSIKSPDGDIIDCV MAHQPAFDHPLLKNHTIQMRP FHPE GI SD+
Subjt:  MGALKLL---LLLLVSFSV-AAAGKTSRHRH----RRLEAQAHLKKLNKPAVKSIKSPDGDIIDCVHMAHQPAFDHPLLKNHTIQMRPNFHPE-GIFSDN

Query:  KLSSKASKSKGITQLWHLKGRCPKGTIPIRRTKKDDILRANSVKSYGKKKPQATVKPTSIDIDLNGQTGHQHAIIYVEGGQYYGAKATINVWSPKIQQTN
        K+S K SKS+ ITQLWHLKG+CPKGTIPIRRTKK+DILR NSVKSYGKKKP ATVKP SI++DLNGQ GHQHAIIYVEGGQYYGAKATINVWSPKIQQTN
Subjt:  KLSSKASKSKGITQLWHLKGRCPKGTIPIRRTKKDDILRANSVKSYGKKKPQATVKPTSIDIDLNGQTGHQHAIIYVEGGQYYGAKATINVWSPKIQQTN

Query:  EFSLSQIWILGGTFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNEIAMGASIYPISSYRSSQYDISLLIWKDRKEGNW
        EFSLSQIWILGGTFG+DLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNEIAMGASI+PISSY+SSQYDISLLIWKD KEGNW
Subjt:  EFSLSQIWILGGTFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNEIAMGASIYPISSYRSSQYDISLLIWKDRKEGNW

Query:  WMQFGNNYVLGYWPAFLFSYLTDSASMIEWGGEVVNSEADGQHTSTQMGSGHFPDEGFGKAGYFRNIQVVDGSNNLRQPEDIGTFTEQPSCYDVQNGKSG
        WMQFGN YVLGYWPAFLFSYLTDSASMIEWGGEVVNSE+DGQHTSTQMGSGHFP EGFGKAGYFRNIQ+V  SN+LR PEDIG FTEQPSCYDVQNGKS 
Subjt:  WMQFGNNYVLGYWPAFLFSYLTDSASMIEWGGEVVNSEADGQHTSTQMGSGHFPDEGFGKAGYFRNIQVVDGSNNLRQPEDIGTFTEQPSCYDVQNGKSG

Query:  DWGNYFFYGGPGRNPNCP
        DWGNYFFYGGPGRNPNCP
Subjt:  DWGNYFFYGGPGRNPNCP

A0A1S3CE73 LOW QUALITY PROTEIN: uncharacterized protein LOC1034998727.4e-21587.59Show/hide
Query:  MGALKLL---LLLLVSFSV-AAAGKTS----RH-RHRRLEAQAHLKKLNKPAVKSIKSPDGDIIDCVHMAHQPAFDHPLLKNHTIQMRPNFHPE-GIFSD
        MG LKLL   LL+LVS S+    GK +    RH RHRRLE  +HLKKLNKPAVKSIKSPDGDIIDCVHMAHQPAFDHPLLKNHTIQMRPNFHPE GI S+
Subjt:  MGALKLL---LLLLVSFSV-AAAGKTS----RH-RHRRLEAQAHLKKLNKPAVKSIKSPDGDIIDCVHMAHQPAFDHPLLKNHTIQMRPNFHPE-GIFSD

Query:  NKLSSKASKSKGITQLWHLKGRCPKGTIPIRRTKKDDILRANSVKSYGKKKPQATVKPTSIDIDLNGQTGHQHAIIYVEGGQYYGAKATINVWSPKIQQT
        +K+S K SKS+ ITQLWHLKG+CPKGTIPIRR KK+DILR NSVKSYGKKKP ATVKP SI+IDLNGQ GHQHAIIYVEGGQYYGAKATINVWSPKIQQT
Subjt:  NKLSSKASKSKGITQLWHLKGRCPKGTIPIRRTKKDDILRANSVKSYGKKKPQATVKPTSIDIDLNGQTGHQHAIIYVEGGQYYGAKATINVWSPKIQQT

Query:  NEFSLSQIWILGGTFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNEIAMGASIYPISSYRSSQYDISLLIWKDRKEGN
        NEFSLSQIWILGGTFG+DLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNEIAMGASI+PISSY+SSQYDISLLIWKD KEGN
Subjt:  NEFSLSQIWILGGTFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNEIAMGASIYPISSYRSSQYDISLLIWKDRKEGN

Query:  WWMQFGNNYVLGYWPAFLFSYLTDSASMIEWGGEVVNSEADGQHTSTQMGSGHFPDEGFGKAGYFRNIQVVDGSNNLRQPEDIGTFTEQPSCYDVQNGKS
        WWMQFGN YVLGYWPAFLFSYLTDSASMIEWGGEVVNSE+DGQHTSTQMGSGHFP EGFGKAGYFRNIQ+V  SN+LR PEDIG FTEQPSCYDVQNGKS
Subjt:  WWMQFGNNYVLGYWPAFLFSYLTDSASMIEWGGEVVNSEADGQHTSTQMGSGHFPDEGFGKAGYFRNIQVVDGSNNLRQPEDIGTFTEQPSCYDVQNGKS

Query:  GDWGNYFFYGGPGRNPNCP
         DWGNYFFYGGPGRNPNCP
Subjt:  GDWGNYFFYGGPGRNPNCP

A0A6J1DGY2 uncharacterized protein LOC1110203902.5e-22391.63Show/hide
Query:  LLLLLLVSFSVA--AAGKTSRHRHRRLEAQAHLKKLNKPAVKSIKSPDGDIIDCVHMAHQPAFDHPLLKNHTIQMRPNFHPEGIFSDNKLSSKASKSKGI
        LL L L+S S+A  AA KTSR RH RLEA  HLKKLNKP VKSIKSPDGDIIDCVHMAHQPAFDHPLL+NHTIQMRPNFHPEGIF DNK+SS  S+SK I
Subjt:  LLLLLLVSFSVA--AAGKTSRHRHRRLEAQAHLKKLNKPAVKSIKSPDGDIIDCVHMAHQPAFDHPLLKNHTIQMRPNFHPEGIFSDNKLSSKASKSKGI

Query:  TQLWHLKGRCPKGTIPIRRTKKDDILRANSVKSYGKKKPQATVKPTSIDIDLNGQTGHQHAIIYVEGGQYYGAKATINVWSPKIQQTNEFSLSQIWILGG
        TQLWHLKGRCP+GTIPIRRTKK DILRA+S+KSYGKKKPQATVKPTSIDIDLNGQTGHQHAIIYVEGG+YYGAKATINVWSPKIQQTNEFSLSQIWILGG
Subjt:  TQLWHLKGRCPKGTIPIRRTKKDDILRANSVKSYGKKKPQATVKPTSIDIDLNGQTGHQHAIIYVEGGQYYGAKATINVWSPKIQQTNEFSLSQIWILGG

Query:  TFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNEIAMGASIYPISSYRSSQYDISLLIWKDRKEGNWWMQFGNNYVLGY
        TFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINN+IAMGASIYPISSYRSSQYDISLLIWKD KEGNWWMQFGN++VLGY
Subjt:  TFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNEIAMGASIYPISSYRSSQYDISLLIWKDRKEGNWWMQFGNNYVLGY

Query:  WPAFLFSYLTDSASMIEWGGEVVNSEADGQHTSTQMGSGHFPDEGFGKAGYFRNIQVVDGSNNLRQPEDIGTFTEQPSCYDVQNGKSGDWGNYFFYGGPG
        WPAFLFSYLTDSASMIEWGGEVVNSE+DGQHTSTQMGSGHFP+EGFGKAGYFRNIQVVDGSNNL+ PEDIGTFTEQPSCYDVQNGKSG+WGNYFFYGGPG
Subjt:  WPAFLFSYLTDSASMIEWGGEVVNSEADGQHTSTQMGSGHFPDEGFGKAGYFRNIQVVDGSNNLRQPEDIGTFTEQPSCYDVQNGKSGDWGNYFFYGGPG

Query:  RNPNCP
        RNPNCP
Subjt:  RNPNCP

A0A6J1H5K4 uncharacterized protein LOC1114606891.2e-22591.3Show/hide
Query:  MGALKLLLLLLVSFSV-AAAGKTSRHRH----RRLEAQAHLKKLNKPAVKSIKSPDGDIIDCVHMAHQPAFDHPLLKNHTIQMRPNFHPEGIFSDNKLSS
        MGALKLLLLL+VS SV AAAGK+S HRH    RR EAQ HLKKLNKPAVKSIKSPDGDIIDCVHMAHQPAFDHPLLKNHTIQMRPNFHPEG+ SDNK+SS
Subjt:  MGALKLLLLLLVSFSV-AAAGKTSRHRH----RRLEAQAHLKKLNKPAVKSIKSPDGDIIDCVHMAHQPAFDHPLLKNHTIQMRPNFHPEGIFSDNKLSS

Query:  KASKSKGITQLWHLKGRCPKGTIPIRRTKKDDILRANSVKSYGKKKPQATVKPTSIDIDLNGQTGHQHAIIYVEGGQYYGAKATINVWSPKIQQTNEFSL
        K S+ K ITQ WHLKGRCPKGTIPIRRT K DILRANSVK+YG+KKPQAT KPTSIDIDLNGQTGHQHAI YVEGGQYYGAKATINVWSPKIQ  NEFSL
Subjt:  KASKSKGITQLWHLKGRCPKGTIPIRRTKKDDILRANSVKSYGKKKPQATVKPTSIDIDLNGQTGHQHAIIYVEGGQYYGAKATINVWSPKIQQTNEFSL

Query:  SQIWILGGTFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNEIAMGASIYPISSYRSSQYDISLLIWKDRKEGNWWMQF
        SQIWILGGTFGEDLNSIEAGWQVSPDLYGDNNTR FTYWTSDAYQATGCYNLLCSGFVQINNEIA+GASIYPISSYR SQYDISLLIWKD KEGNWWMQF
Subjt:  SQIWILGGTFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNEIAMGASIYPISSYRSSQYDISLLIWKDRKEGNWWMQF

Query:  GNNYVLGYWPAFLFSYLTDSASMIEWGGEVVNSEADGQHTSTQMGSGHFPDEGFGKAGYFRNIQVVDGSNNLRQPEDIGTFTEQPSCYDVQNGKSGDWGN
        GN+YVLGYWPAFLFSYLTD ASM+EWGGEVVNSE+DGQHTSTQMGSGHFP EGFGKAGYFRNIQVVDGSNNLRQPEDIGTFTEQPSCYDVQNGKSGDWGN
Subjt:  GNNYVLGYWPAFLFSYLTDSASMIEWGGEVVNSEADGQHTSTQMGSGHFPDEGFGKAGYFRNIQVVDGSNNLRQPEDIGTFTEQPSCYDVQNGKSGDWGN

Query:  YFFYGGPGRNPNCP
        YFFYGGPGRNPNCP
Subjt:  YFFYGGPGRNPNCP

A0A6J1KPT3 uncharacterized protein LOC1114976113.2e-22691.93Show/hide
Query:  MGALKLLLLLLVSFSVAAAGKTSRHRHRRLEAQAHLKKLNKPAVKSIKSPDGDIIDCVHMAHQPAFDHPLLKNHTIQMRPNFHPEGIFSDNKLSSKASKS
        MGALKLLLLL+VS SVAAAGK+S HR RR EAQ HLKKLNKPAVKSIKSPDGDIIDCVHMAHQPAFDHPLLKNHTIQMRPNFHPEG+ S NK+SSK S+ 
Subjt:  MGALKLLLLLLVSFSVAAAGKTSRHRHRRLEAQAHLKKLNKPAVKSIKSPDGDIIDCVHMAHQPAFDHPLLKNHTIQMRPNFHPEGIFSDNKLSSKASKS

Query:  KGITQLWHLKGRCPKGTIPIRRTKKDDILRANSVKSYGKKKPQATVKPTSIDIDLNGQTGHQHAIIYVEGGQYYGAKATINVWSPKIQQTNEFSLSQIWI
        K ITQ WHLKGRCP+GTIPIRRT K DILRANSVKSYGKKKPQATV+PTSIDIDLNGQTGHQHAI YVEGGQYYGAKATINVWSPKIQ  NEFSLSQIWI
Subjt:  KGITQLWHLKGRCPKGTIPIRRTKKDDILRANSVKSYGKKKPQATVKPTSIDIDLNGQTGHQHAIIYVEGGQYYGAKATINVWSPKIQQTNEFSLSQIWI

Query:  LGGTFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNEIAMGASIYPISSYRSSQYDISLLIWKDRKEGNWWMQFGNNYV
        LGGTFGEDLNSIEAGWQVSPDLYGDNNTR FTYWTSDAYQATGCYNLLCSGFVQINNEIA+GASIYPISSYR SQYDISLLIWKD KEGNWWMQFGN+YV
Subjt:  LGGTFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNEIAMGASIYPISSYRSSQYDISLLIWKDRKEGNWWMQFGNNYV

Query:  LGYWPAFLFSYLTDSASMIEWGGEVVNSEADGQHTSTQMGSGHFPDEGFGKAGYFRNIQVVDGSNNLRQPEDIGTFTEQPSCYDVQNGKSGDWGNYFFYG
        LGYWPAFLFSYLTD ASM+EWGGEVVNSE+ GQHTSTQMGSGHFP EGFGKAGYFRNIQVVDGSNNLRQPEDIGTFTEQPSCYDVQNGKSGDWGNYFFYG
Subjt:  LGYWPAFLFSYLTDSASMIEWGGEVVNSEADGQHTSTQMGSGHFPDEGFGKAGYFRNIQVVDGSNNLRQPEDIGTFTEQPSCYDVQNGKSGDWGNYFFYG

Query:  GPGRNPNCP
        GPGRNPNCP
Subjt:  GPGRNPNCP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G55360.1 Protein of Unknown Function (DUF239)6.9e-18172.93Show/hide
Query:  SFSVAAAGKTSRHRHRRLEAQAHLKKLNKPAVKSIKSPDGDIIDCVHMAHQPAFDHPLLKNHTIQMRPNFHPEGIFSDNKLSSKASKSK--GITQLWHLK
        S S AA    S+   ++ E + HL +LNKPAVKSI+S DGD+IDCV ++ QPAFDHP LK+H IQM+PN+HPEG+F DNK+S+  S  K   I QLWH  
Subjt:  SFSVAAAGKTSRHRHRRLEAQAHLKKLNKPAVKSIKSPDGDIIDCVHMAHQPAFDHPLLKNHTIQMRPNFHPEGIFSDNKLSSKASKSK--GITQLWHLK

Query:  GRCPKGTIPIRRTKKDDILRANSVKSYGKKKPQATVKPTSIDIDLNGQTGHQHAIIYVEGGQYYGAKATINVWSPKIQQTNEFSLSQIWILGGTFGEDLN
        G+C +GTIP+RRTK+DD+LRA+SVK YGKKK ++   P S + DL  Q+GHQHAI YVEG +YYGAKATINVW PKIQQ NEFSLSQIW+LGG+FG+DLN
Subjt:  GRCPKGTIPIRRTKKDDILRANSVKSYGKKKPQATVKPTSIDIDLNGQTGHQHAIIYVEGGQYYGAKATINVWSPKIQQTNEFSLSQIWILGGTFGEDLN

Query:  SIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNEIAMGASIYPISSYRSSQYDISLLIWKDRKEGNWWMQFGNNYVLGYWPAFLFS
        SIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGF+QIN++IAMGASI P+S YR+SQYDIS+LIWKD KEG+WWMQFGN YVLGYWP+FLFS
Subjt:  SIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNEIAMGASIYPISSYRSSQYDISLLIWKDRKEGNWWMQFGNNYVLGYWPAFLFS

Query:  YLTDSASMIEWGGEVVNSEADGQHTSTQMGSGHFPDEGFGKAGYFRNIQVVDGSNNLRQPEDIGTFTEQPSCYDVQNGKSGDWGNYFFYGGPGRNPNCP
        YLT+SASMIEWGGEVVNS++DGQHTSTQMGSG FP+EGF KA YFRNIQVVDGSNNL+ P+ +GTFTEQ +CYDVQ G + DWG+YF+YGGPG+N  CP
Subjt:  YLTDSASMIEWGGEVVNSEADGQHTSTQMGSGHFPDEGFGKAGYFRNIQVVDGSNNLRQPEDIGTFTEQPSCYDVQNGKSGDWGNYFFYGGPGRNPNCP

AT2G44210.1 Protein of Unknown Function (DUF239)6.1e-16163.88Show/hide
Query:  LLLLLVSFSVAAAGKTSRHRHRRLEAQAHLKKLNKPAVKSIKSPDGDIIDCVHMAHQPAFDHPLLKNHTIQMRPNFHPEGIFSDNKLSSKA--SKSKGIT
        L L++    +A +  +  +    L+ + HLK+LNKPA+KSIKSPDGD+IDCV +  QPAF HPLL NHT+QM P+ +PE +FS++K+SSK    +S  I 
Subjt:  LLLLLVSFSVAAAGKTSRHRHRRLEAQAHLKKLNKPAVKSIKSPDGDIIDCVHMAHQPAFDHPLLKNHTIQMRPNFHPEGIFSDNKLSSKA--SKSKGIT

Query:  QLWHLKGRCPKGTIPIRRTKKDDILRANSVKSYGKKKPQATVKPTSID-IDLNGQTGHQHAIIYVEGGQYYGAKATINVWSPKIQQTNEFSLSQIWILGG
        QLWH+ G+CPK TIPIRRT++ D+ RA+SV++YG K  ++  KP S +  ++  Q GHQHAI+YVE G +YGAKA INVW P ++  NEFSL+QIW+LGG
Subjt:  QLWHLKGRCPKGTIPIRRTKKDDILRANSVKSYGKKKPQATVKPTSID-IDLNGQTGHQHAIIYVEGGQYYGAKATINVWSPKIQQTNEFSLSQIWILGG

Query:  TFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNEIAMGASIYPISSYRSSQYDISLLIWKDRKEGNWWMQFGNNYVLGY
         F  DLNSIEAGWQVSP LYGDN TRLFTYWTSDAYQ TGCYNLLCSGFVQIN EIAMG SI P+S+Y +SQYDI++LIWKD KEG+WW+QFG  Y++GY
Subjt:  TFGEDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNEIAMGASIYPISSYRSSQYDISLLIWKDRKEGNWWMQFGNNYVLGY

Query:  WPAFLFSYLTDSASMIEWGGEVVNSEA-DGQHTSTQMGSGHFPDEGFGKAGYFRNIQVVDGSNNLRQPEDIGTFTEQPSCYDVQNGKSGDWGNYFFYGGP
        WPA LFSYL++SASMIEWGGEVVNS++ +GQHT+TQMGSG F +EG+GKA YF+N+QVVDGSN LR PE++  FT+Q +CY+V++G  G WG+YF+YGGP
Subjt:  WPAFLFSYLTDSASMIEWGGEVVNSEA-DGQHTSTQMGSGHFPDEGFGKAGYFRNIQVVDGSNNLRQPEDIGTFTEQPSCYDVQNGKSGDWGNYFFYGGP

Query:  GRNPNCP
        GRNPNCP
Subjt:  GRNPNCP

AT3G13510.1 Protein of Unknown Function (DUF239)1.3e-17971.22Show/hide
Query:  LLLLVSFSVAAAGKTSRHRHRRLEAQAHLKKLNKPAVKSIKSPDGDIIDCVHMAHQPAFDHPLLKNHTIQMRPNFHPEGIFSDNKLSSK-ASKSKGITQL
        L +++S S AAA   S    ++ E + HL +LNKP VK+I+SPDGDIIDC+ ++ QPAFDHP LK+H IQMRP++HPEG+F DNK+S++   K   I QL
Subjt:  LLLLVSFSVAAAGKTSRHRHRRLEAQAHLKKLNKPAVKSIKSPDGDIIDCVHMAHQPAFDHPLLKNHTIQMRPNFHPEGIFSDNKLSSK-ASKSKGITQL

Query:  WHLKGRCPKGTIPIRRTKKDDILRANSVKSYGKKKPQATVKPTSIDIDLNGQTGHQHAIIYVEGGQYYGAKATINVWSPKIQQTNEFSLSQIWILGGTFG
        WH  G+C +GTIP+RRT++DD+LRA+SVK YGKKK ++   P S + DL  Q GHQHAI YVEG +YYGAKAT+NVW PKIQ TNEFSLSQIW+LGG+FG
Subjt:  WHLKGRCPKGTIPIRRTKKDDILRANSVKSYGKKKPQATVKPTSIDIDLNGQTGHQHAIIYVEGGQYYGAKATINVWSPKIQQTNEFSLSQIWILGGTFG

Query:  EDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNEIAMGASIYPISSYRSSQYDISLLIWKDRKEGNWWMQFGNNYVLGYWPA
        +DLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGF+QIN++IAMGASI P+S YR+SQYDIS+LIWKD KEG+WWMQFGN YVLGYWP+
Subjt:  EDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNEIAMGASIYPISSYRSSQYDISLLIWKDRKEGNWWMQFGNNYVLGYWPA

Query:  FLFSYLTDSASMIEWGGEVVNSEADGQHTSTQMGSGHFPDEGFGKAGYFRNIQVVDGSNNLRQPEDIGTFTEQPSCYDVQNGKSGDWGNYFFYGGPGRNP
        FLFSYLT+SASMIEWGGEVVNS+++G HT TQMGSGHFP+EGF KA YFRNIQVVDGSNNL+ P+ +GTFTE+ +CYDVQ G + DWG+YF+YGGPG+N 
Subjt:  FLFSYLTDSASMIEWGGEVVNSEADGQHTSTQMGSGHFPDEGFGKAGYFRNIQVVDGSNNLRQPEDIGTFTEQPSCYDVQNGKSGDWGNYFFYGGPGRNP

Query:  NCP
        NCP
Subjt:  NCP

AT5G56530.1 Protein of Unknown Function (DUF239)3.3e-17570.89Show/hide
Query:  SVAAAGKTSRHRHRRLEAQAHLKKLNKPAVKSIKSPDGDIIDCVHMAHQPAFDHPLLKNHTIQMRPNFHPEGIFSDNKLSSKASKS-KGITQLWHLKGRC
        S+  AG+ S  R +  E   HL +LNKPAVKSI+SPDGDIIDCVH++ QPAFDHP LK+H IQM P++ PE +F ++K+S K  +S   ITQLWH  G C
Subjt:  SVAAAGKTSRHRHRRLEAQAHLKKLNKPAVKSIKSPDGDIIDCVHMAHQPAFDHPLLKNHTIQMRPNFHPEGIFSDNKLSSKASKS-KGITQLWHLKGRC

Query:  PKGTIPIRRTKKDDILRANSVKSYGKKKPQATVKPTSIDIDLNGQTGHQHAIIYVEGGQYYGAKATINVWSPKIQQTNEFSLSQIWILGGTFGEDLNSIE
         +GTIP+RRTKK+D+LRA+SVK YGKKK  +   P S D DL  Q+GHQHAI YVEGG++YGAKATINVW PK+Q +NEFSLSQ+WILGG+FG+DLNSIE
Subjt:  PKGTIPIRRTKKDDILRANSVKSYGKKKPQATVKPTSIDIDLNGQTGHQHAIIYVEGGQYYGAKATINVWSPKIQQTNEFSLSQIWILGGTFGEDLNSIE

Query:  AGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNEIAMGASIYPISSYRSSQYDISLLIWKDRKEGNWWMQFGNNYVLGYWPAFLFSYLT
        AGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGF+QIN++IAMGASI P+S + + QYDIS+ IWKD KEG+WWMQFG+ YVLGYWP+FLFSYL 
Subjt:  AGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNEIAMGASIYPISSYRSSQYDISLLIWKDRKEGNWWMQFGNNYVLGYWPAFLFSYLT

Query:  DSASMIEWGGEVVNSEADGQHTSTQMGSGHFPDEGFGKAGYFRNIQVVDGSNNLRQPEDIGTFTEQPSCYDVQNGKSGDWGNYFFYGGPGRNPNC
        DSAS++EWGGEVVN E DG HT+TQMGSG FPDEGF KA YFRNIQVVD SNNL++P+ + TFTE+ +CYDV+ GK+ DWG+YF+YGGPGRNPNC
Subjt:  DSASMIEWGGEVVNSEADGQHTSTQMGSGHFPDEGFGKAGYFRNIQVVDGSNNLRQPEDIGTFTEQPSCYDVQNGKSGDWGNYFFYGGPGRNPNC

AT5G56530.2 Protein of Unknown Function (DUF239)3.3e-17570.89Show/hide
Query:  SVAAAGKTSRHRHRRLEAQAHLKKLNKPAVKSIKSPDGDIIDCVHMAHQPAFDHPLLKNHTIQMRPNFHPEGIFSDNKLSSKASKS-KGITQLWHLKGRC
        S+  AG+ S  R +  E   HL +LNKPAVKSI+SPDGDIIDCVH++ QPAFDHP LK+H IQM P++ PE +F ++K+S K  +S   ITQLWH  G C
Subjt:  SVAAAGKTSRHRHRRLEAQAHLKKLNKPAVKSIKSPDGDIIDCVHMAHQPAFDHPLLKNHTIQMRPNFHPEGIFSDNKLSSKASKS-KGITQLWHLKGRC

Query:  PKGTIPIRRTKKDDILRANSVKSYGKKKPQATVKPTSIDIDLNGQTGHQHAIIYVEGGQYYGAKATINVWSPKIQQTNEFSLSQIWILGGTFGEDLNSIE
         +GTIP+RRTKK+D+LRA+SVK YGKKK  +   P S D DL  Q+GHQHAI YVEGG++YGAKATINVW PK+Q +NEFSLSQ+WILGG+FG+DLNSIE
Subjt:  PKGTIPIRRTKKDDILRANSVKSYGKKKPQATVKPTSIDIDLNGQTGHQHAIIYVEGGQYYGAKATINVWSPKIQQTNEFSLSQIWILGGTFGEDLNSIE

Query:  AGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNEIAMGASIYPISSYRSSQYDISLLIWKDRKEGNWWMQFGNNYVLGYWPAFLFSYLT
        AGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGF+QIN++IAMGASI P+S + + QYDIS+ IWKD KEG+WWMQFG+ YVLGYWP+FLFSYL 
Subjt:  AGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNEIAMGASIYPISSYRSSQYDISLLIWKDRKEGNWWMQFGNNYVLGYWPAFLFSYLT

Query:  DSASMIEWGGEVVNSEADGQHTSTQMGSGHFPDEGFGKAGYFRNIQVVDGSNNLRQPEDIGTFTEQPSCYDVQNGKSGDWGNYFFYGGPGRNPNC
        DSAS++EWGGEVVN E DG HT+TQMGSG FPDEGF KA YFRNIQVVD SNNL++P+ + TFTE+ +CYDV+ GK+ DWG+YF+YGGPGRNPNC
Subjt:  DSASMIEWGGEVVNSEADGQHTSTQMGSGHFPDEGFGKAGYFRNIQVVDGSNNLRQPEDIGTFTEQPSCYDVQNGKSGDWGNYFFYGGPGRNPNC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGTGCTCTAAAGCTGCTGTTGCTGTTACTGGTTTCTTTCTCAGTGGCGGCGGCCGGAAAAACCAGCCGCCACCGTCACCGGCGGCTCGAAGCTCAGGCTCACCTGAA
GAAGCTCAATAAACCTGCCGTCAAGTCCATCAAGAGTCCAGATGGGGATATAATTGATTGTGTTCATATGGCTCACCAGCCAGCTTTTGATCATCCTCTTCTCAAAAACC
ACACAATTCAGATGAGACCAAATTTTCATCCAGAAGGGATATTCAGTGACAATAAACTGTCTTCAAAAGCTTCAAAATCAAAGGGTATAACTCAATTATGGCACTTGAAA
GGAAGATGCCCAAAAGGGACAATTCCCATTAGAAGAACAAAAAAAGATGACATTTTGAGAGCAAATTCAGTGAAAAGCTATGGCAAAAAGAAGCCTCAAGCCACTGTGAA
ACCAACCTCCATTGATATTGATCTCAATGGCCAAACTGGACATCAGCATGCAATAATATATGTTGAAGGAGGACAATATTATGGAGCCAAGGCAACTATAAACGTTTGGT
CCCCAAAAATCCAACAGACAAACGAATTTAGCCTCTCACAGATCTGGATTCTTGGAGGAACTTTTGGGGAAGATCTTAATAGCATTGAAGCTGGTTGGCAGGTCAGCCCT
GATTTATATGGGGATAACAACACTAGACTTTTCACTTATTGGACTAGTGATGCATATCAAGCTACTGGTTGCTATAATCTCCTCTGTTCTGGGTTTGTTCAAATCAATAA
TGAAATTGCCATGGGTGCTAGCATTTATCCCATTTCTTCTTACAGAAGTTCTCAATATGATATTAGCTTGCTTATTTGGAAGGACCGTAAAGAAGGAAACTGGTGGATGC
AATTCGGAAACAACTATGTATTGGGTTACTGGCCGGCGTTCTTATTCTCATACCTCACCGACAGTGCCTCCATGATCGAGTGGGGCGGCGAAGTCGTCAATTCTGAAGCC
GACGGCCAACACACTTCCACTCAGATGGGCAGCGGCCACTTCCCCGACGAGGGCTTCGGCAAGGCCGGCTACTTTCGCAATATTCAGGTCGTTGACGGATCCAACAACCT
CCGGCAGCCGGAGGACATCGGAACTTTCACAGAGCAACCCAGTTGCTACGACGTTCAGAACGGCAAGTCCGGCGACTGGGGCAATTACTTCTTTTACGGCGGCCCGGGCA
GAAATCCAAACTGCCCGTGA
mRNA sequenceShow/hide mRNA sequence
ATGGGTGCTCTAAAGCTGCTGTTGCTGTTACTGGTTTCTTTCTCAGTGGCGGCGGCCGGAAAAACCAGCCGCCACCGTCACCGGCGGCTCGAAGCTCAGGCTCACCTGAA
GAAGCTCAATAAACCTGCCGTCAAGTCCATCAAGAGTCCAGATGGGGATATAATTGATTGTGTTCATATGGCTCACCAGCCAGCTTTTGATCATCCTCTTCTCAAAAACC
ACACAATTCAGATGAGACCAAATTTTCATCCAGAAGGGATATTCAGTGACAATAAACTGTCTTCAAAAGCTTCAAAATCAAAGGGTATAACTCAATTATGGCACTTGAAA
GGAAGATGCCCAAAAGGGACAATTCCCATTAGAAGAACAAAAAAAGATGACATTTTGAGAGCAAATTCAGTGAAAAGCTATGGCAAAAAGAAGCCTCAAGCCACTGTGAA
ACCAACCTCCATTGATATTGATCTCAATGGCCAAACTGGACATCAGCATGCAATAATATATGTTGAAGGAGGACAATATTATGGAGCCAAGGCAACTATAAACGTTTGGT
CCCCAAAAATCCAACAGACAAACGAATTTAGCCTCTCACAGATCTGGATTCTTGGAGGAACTTTTGGGGAAGATCTTAATAGCATTGAAGCTGGTTGGCAGGTCAGCCCT
GATTTATATGGGGATAACAACACTAGACTTTTCACTTATTGGACTAGTGATGCATATCAAGCTACTGGTTGCTATAATCTCCTCTGTTCTGGGTTTGTTCAAATCAATAA
TGAAATTGCCATGGGTGCTAGCATTTATCCCATTTCTTCTTACAGAAGTTCTCAATATGATATTAGCTTGCTTATTTGGAAGGACCGTAAAGAAGGAAACTGGTGGATGC
AATTCGGAAACAACTATGTATTGGGTTACTGGCCGGCGTTCTTATTCTCATACCTCACCGACAGTGCCTCCATGATCGAGTGGGGCGGCGAAGTCGTCAATTCTGAAGCC
GACGGCCAACACACTTCCACTCAGATGGGCAGCGGCCACTTCCCCGACGAGGGCTTCGGCAAGGCCGGCTACTTTCGCAATATTCAGGTCGTTGACGGATCCAACAACCT
CCGGCAGCCGGAGGACATCGGAACTTTCACAGAGCAACCCAGTTGCTACGACGTTCAGAACGGCAAGTCCGGCGACTGGGGCAATTACTTCTTTTACGGCGGCCCGGGCA
GAAATCCAAACTGCCCGTGA
Protein sequenceShow/hide protein sequence
MGALKLLLLLLVSFSVAAAGKTSRHRHRRLEAQAHLKKLNKPAVKSIKSPDGDIIDCVHMAHQPAFDHPLLKNHTIQMRPNFHPEGIFSDNKLSSKASKSKGITQLWHLK
GRCPKGTIPIRRTKKDDILRANSVKSYGKKKPQATVKPTSIDIDLNGQTGHQHAIIYVEGGQYYGAKATINVWSPKIQQTNEFSLSQIWILGGTFGEDLNSIEAGWQVSP
DLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFVQINNEIAMGASIYPISSYRSSQYDISLLIWKDRKEGNWWMQFGNNYVLGYWPAFLFSYLTDSASMIEWGGEVVNSEA
DGQHTSTQMGSGHFPDEGFGKAGYFRNIQVVDGSNNLRQPEDIGTFTEQPSCYDVQNGKSGDWGNYFFYGGPGRNPNCP