; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0018439 (gene) of Snake gourd v1 genome

Gene IDTan0018439
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionLate embryogenesis abundant protein-related / LEA protein-related protein
Genome locationLG08:30279985..30281754
RNA-Seq ExpressionTan0018439
SyntenyTan0018439
Gene Ontology termsGO:0001505 - regulation of neurotransmitter levels (biological process)
GO:0007186 - G protein-coupled receptor signaling pathway (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0004969 - histamine receptor activity (molecular function)
InterPro domainsIPR009646 - Root cap


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6585982.1 hypothetical protein SDJN03_18715, partial [Cucurbita argyrosperma subsp. sororia]4.7e-21283.51Show/hide
Query:  MAKITMVLFLFFLLFLSAAVEGAPKPKKVKCKDKKFTQCYKSEYYCPSDCPRTCVVDCSSCKPVCTPPPPPPPSPPPPPPKPRKLKSPPPPPYIYSSPPP
        M KI  +LFLFF LFLSAAVE  PKPKKVKCKDK F QCYKSE+YCP+DC RTCVVDCSSCKPVCTPPPPPPPSPPPPPPKPRKLKSPPPPPY+YSSPPP
Subjt:  MAKITMVLFLFFLLFLSAAVEGAPKPKKVKCKDKKFTQCYKSEYYCPSDCPRTCVVDCSSCKPVCTPPPPPPPSPPPPPPKPRKLKSPPPPPYIYSSPPP

Query:  PP------------HIYSS-PPPPYIYSSPPPPATAEPSPPIPPTPTPTPP-TPTPTPPT--SPPPSSEASGQKKVRCKNKGFPHCYLMELTCPSDCPSQ
        PP            H+YSS PPPPYIYSSPPPP     SPP P T  P PP  PTPTPPT  SPPPSSEASGQKKVRCKN+ FPHCY MELTCP+DCP Q
Subjt:  PP------------HIYSS-PPPPYIYSSPPPPATAEPSPPIPPTPTPTPP-TPTPTPPT--SPPPSSEASGQKKVRCKNKGFPHCYLMELTCPSDCPSQ

Query:  CEVDCVTCSPVCNCNRPGAVCQDPRFIGGDGITFYFHGKKDQDFCIVTDSNLHINAHFIGRRNLDMKRDFTWVQSLGILFDSHKLFIGARKTTTWDDTID
        CEVDCVTCS VCNCNRPGAVCQDPRFIGGDGITFYFHGKKD+DFCIVTDSNLHINAHFIGRRN+DMKRDFTWVQSLGILFDSH+LFIGARKT+TWDD  D
Subjt:  CEVDCVTCSPVCNCNRPGAVCQDPRFIGGDGITFYFHGKKDQDFCIVTDSNLHINAHFIGRRNLDMKRDFTWVQSLGILFDSHKLFIGARKTTTWDDTID

Query:  RLSISFNGETLFLPDKEGATWSNSTSYEGITITRTRNTNAVEIEVPGNFKIKAVVVPITEKESRIHKYGITQEDCFAHLDLSFKFYGLSGEVNGVLGQTY
        RLS+ FN +T+ L ++EGATWSNST+YEGITITRTRNTNAVEI VPGNFKIKAVVVPITEKESRIHKYGITQEDCFAHLDLSFKFY LSG+VNGVLGQTY
Subjt:  RLSISFNGETLFLPDKEGATWSNSTSYEGITITRTRNTNAVEIEVPGNFKIKAVVVPITEKESRIHKYGITQEDCFAHLDLSFKFYGLSGEVNGVLGQTY

Query:  GSNYVSRAKMGVAMPVLGGDKEFASSGLFTTDCAVARFNRKLEGKNTSLEAAVAYANMSCGSDIGGEGVVCKR
        GSNYVSRAKMGVAMPVLGGDKEFASSG F TDCAVARFN +LEGK++SLE   AY NMSCGSD+ GEGVVCKR
Subjt:  GSNYVSRAKMGVAMPVLGGDKEFASSGLFTTDCAVARFNRKLEGKNTSLEAAVAYANMSCGSDIGGEGVVCKR

KAG7020779.1 hypothetical protein SDJN02_17467, partial [Cucurbita argyrosperma subsp. argyrosperma]1.8e-21183.3Show/hide
Query:  MAKITMVLFLFFLLFLSAAVEGAPKPKKVKCKDKKFTQCYKSEYYCPSDCPRTCVVDCSSCKPVCTPPPPPPPSPPPPPPKPRKLKSPPPPPYIYSSPPP
        M KI  +LFLFF LFLSAAVE  PKPKKVKCKDK F QCYKSE+YCP+DC RTCVVDCSSCKPVCTPPPPPPPSPPPPPPKPRKLKSPPPPPY+YSSPPP
Subjt:  MAKITMVLFLFFLLFLSAAVEGAPKPKKVKCKDKKFTQCYKSEYYCPSDCPRTCVVDCSSCKPVCTPPPPPPPSPPPPPPKPRKLKSPPPPPYIYSSPPP

Query:  PP------------HI-YSSPPPPYIYSSPPPPATAEPSPPIPPTPTPTPP-TPTPTPPT--SPPPSSEASGQKKVRCKNKGFPHCYLMELTCPSDCPSQ
        PP            H+ YS PPPPYIYSSPPPP     SPP P T  P PP  PTPTPPT  SPPPSSEASGQKKVRCKN+ FPHCY MELTCP+DCP Q
Subjt:  PP------------HI-YSSPPPPYIYSSPPPPATAEPSPPIPPTPTPTPP-TPTPTPPT--SPPPSSEASGQKKVRCKNKGFPHCYLMELTCPSDCPSQ

Query:  CEVDCVTCSPVCNCNRPGAVCQDPRFIGGDGITFYFHGKKDQDFCIVTDSNLHINAHFIGRRNLDMKRDFTWVQSLGILFDSHKLFIGARKTTTWDDTID
        CEVDCVTCS VCNCNRPGAVCQDPRFIGGDGITFYFHGKKD+DFCIVTDSNLHINAHFIGRRN+DMKRDFTWVQSLGILFDSH+LFIGARKT+TWDD  D
Subjt:  CEVDCVTCSPVCNCNRPGAVCQDPRFIGGDGITFYFHGKKDQDFCIVTDSNLHINAHFIGRRNLDMKRDFTWVQSLGILFDSHKLFIGARKTTTWDDTID

Query:  RLSISFNGETLFLPDKEGATWSNSTSYEGITITRTRNTNAVEIEVPGNFKIKAVVVPITEKESRIHKYGITQEDCFAHLDLSFKFYGLSGEVNGVLGQTY
        RLS+ FN +T+ L ++EGATWSNST+YEGITITRTRNTNAVEI VPGNFKIKAVVVPITEKESRIHKYGITQEDCFAHLDLSFKFY LSG+VNGVLGQTY
Subjt:  RLSISFNGETLFLPDKEGATWSNSTSYEGITITRTRNTNAVEIEVPGNFKIKAVVVPITEKESRIHKYGITQEDCFAHLDLSFKFYGLSGEVNGVLGQTY

Query:  GSNYVSRAKMGVAMPVLGGDKEFASSGLFTTDCAVARFNRKLEGKNTSLEAAVAYANMSCGSDIGGEGVVCKR
        GSNYVSRAKMGVAMPVLGGDKEFASSG F TDCAVARFN +LEGK++SLE   AY NMSCGSD+ GEGVVCKR
Subjt:  GSNYVSRAKMGVAMPVLGGDKEFASSGLFTTDCAVARFNRKLEGKNTSLEAAVAYANMSCGSDIGGEGVVCKR

XP_022937516.1 uncharacterized protein LOC111443907 [Cucurbita moschata]6.1e-21283.51Show/hide
Query:  MAKITMVLFLFFLLFLSAAVEGAPKPKKVKCKDKKFTQCYKSEYYCPSDCPRTCVVDCSSCKPVCTPPPPPPPSPPPPPPKPRKLKSPPPPPYIYSSPPP
        M KI  +LFLFF LFLSAAVE  PKPKKVKCKDK F QCYKSE+YCP+DC RTCVVDCSSCKPVCTPPPPPPPSPPPPPPKPRKLKSPPPPPY+YSSPPP
Subjt:  MAKITMVLFLFFLLFLSAAVEGAPKPKKVKCKDKKFTQCYKSEYYCPSDCPRTCVVDCSSCKPVCTPPPPPPPSPPPPPPKPRKLKSPPPPPYIYSSPPP

Query:  PP------------HIYSS-PPPPYIYSSPPPPATAEPSPPIPPTPTPTPP-TPTPTPPT--SPPPSSEASGQKKVRCKNKGFPHCYLMELTCPSDCPSQ
        PP            H+YSS PPPPYIYSSPPPP     SPP P T  P PP  PTPTPPT  SPPPSSEASGQKKVRCKN+ FPHCY MELTCP+DCP Q
Subjt:  PP------------HIYSS-PPPPYIYSSPPPPATAEPSPPIPPTPTPTPP-TPTPTPPT--SPPPSSEASGQKKVRCKNKGFPHCYLMELTCPSDCPSQ

Query:  CEVDCVTCSPVCNCNRPGAVCQDPRFIGGDGITFYFHGKKDQDFCIVTDSNLHINAHFIGRRNLDMKRDFTWVQSLGILFDSHKLFIGARKTTTWDDTID
        CEVDCVTCS VCNCNRPGAVCQDPRFIGGDGITFYFHGKKD+DFCIVTDSNLHINAHFIGRRN+DMKRDFTWVQSLGILFDSH+LFIGARKT+TWDD  D
Subjt:  CEVDCVTCSPVCNCNRPGAVCQDPRFIGGDGITFYFHGKKDQDFCIVTDSNLHINAHFIGRRNLDMKRDFTWVQSLGILFDSHKLFIGARKTTTWDDTID

Query:  RLSISFNGETLFLPDKEGATWSNSTSYEGITITRTRNTNAVEIEVPGNFKIKAVVVPITEKESRIHKYGITQEDCFAHLDLSFKFYGLSGEVNGVLGQTY
        RLS+ FN +T+ L ++EGATWSNST+YEGITITRTRNTNAVEI VPGNFKIKAVVVPITEKESRIHKYGITQEDCFAHLDLSFKFY LSG+VNGVLGQTY
Subjt:  RLSISFNGETLFLPDKEGATWSNSTSYEGITITRTRNTNAVEIEVPGNFKIKAVVVPITEKESRIHKYGITQEDCFAHLDLSFKFYGLSGEVNGVLGQTY

Query:  GSNYVSRAKMGVAMPVLGGDKEFASSGLFTTDCAVARFNRKLEGKNTSLEAAVAYANMSCGSDIGGEGVVCKR
        GSNYVSRAKMGVAMPVLGGDKEFASSG F TDCAVARFN +LEGK++SLE   AY NMSCGSD+ GEGVVCKR
Subjt:  GSNYVSRAKMGVAMPVLGGDKEFASSGLFTTDCAVARFNRKLEGKNTSLEAAVAYANMSCGSDIGGEGVVCKR

XP_022937854.1 uncharacterized protein LOC111444116 [Cucurbita moschata]3.6e-21283.51Show/hide
Query:  MAKITMVLFLFFLLFLSAAVEGAPKPKKVKCKDKKFTQCYKSEYYCPSDCPRTCVVDCSSCKPVCTPPPPPPPSPPPPPPKPRKLKSPPPPPYIYSSPPP
        M KI  +LFLFF LFLSAAVE  PKPKKVKCKDK F QCYKSE+YCP+DC RTCVVDCSSCKPVCTPPPPPPPSPPPPPPKPRKLKSPPPPPY+YSSPPP
Subjt:  MAKITMVLFLFFLLFLSAAVEGAPKPKKVKCKDKKFTQCYKSEYYCPSDCPRTCVVDCSSCKPVCTPPPPPPPSPPPPPPKPRKLKSPPPPPYIYSSPPP

Query:  PP------------HI-YSSPPPPYIYSSPPPPATAEPSPPIPPTPTPTPP-TPTPTPPT--SPPPSSEASGQKKVRCKNKGFPHCYLMELTCPSDCPSQ
        PP            H+ YS PPPPYIYSSPPPP     SPP P T  P PP  PTPTPPT  SPPPSSEASGQKKVRCKN+ FPHCY MELTCP+DCP Q
Subjt:  PP------------HI-YSSPPPPYIYSSPPPPATAEPSPPIPPTPTPTPP-TPTPTPPT--SPPPSSEASGQKKVRCKNKGFPHCYLMELTCPSDCPSQ

Query:  CEVDCVTCSPVCNCNRPGAVCQDPRFIGGDGITFYFHGKKDQDFCIVTDSNLHINAHFIGRRNLDMKRDFTWVQSLGILFDSHKLFIGARKTTTWDDTID
        CEVDCVTCS VCNCNRPGAVCQDPRFIGGDGITFYFHGKKD+DFCIVTDSNLHINAHFIGRRN+DMKRDFTWVQSLGILFDSH+LFIGARKT+TWDD  D
Subjt:  CEVDCVTCSPVCNCNRPGAVCQDPRFIGGDGITFYFHGKKDQDFCIVTDSNLHINAHFIGRRNLDMKRDFTWVQSLGILFDSHKLFIGARKTTTWDDTID

Query:  RLSISFNGETLFLPDKEGATWSNSTSYEGITITRTRNTNAVEIEVPGNFKIKAVVVPITEKESRIHKYGITQEDCFAHLDLSFKFYGLSGEVNGVLGQTY
        RLS+SFN +T+ L ++EGATWSNST+YEGITITRTRNTNAVEI VPGNFKIKAVVVPITEKESRIHKYGITQEDCFAHLDLSFKFY LSG+VNGVLGQTY
Subjt:  RLSISFNGETLFLPDKEGATWSNSTSYEGITITRTRNTNAVEIEVPGNFKIKAVVVPITEKESRIHKYGITQEDCFAHLDLSFKFYGLSGEVNGVLGQTY

Query:  GSNYVSRAKMGVAMPVLGGDKEFASSGLFTTDCAVARFNRKLEGKNTSLEAAVAYANMSCGSDIGGEGVVCKR
        GSNYVSRAKMGVAMPVLGGDKEFASSG F TDCAVARFN +LEGK++SLE   AY NMSCGSD+ GEGVVCKR
Subjt:  GSNYVSRAKMGVAMPVLGGDKEFASSGLFTTDCAVARFNRKLEGKNTSLEAAVAYANMSCGSDIGGEGVVCKR

XP_022969544.1 uncharacterized protein LOC111468530 [Cucurbita maxima]1.1e-21383.76Show/hide
Query:  MAKITMVLFLFFLLFLSAAVEGAPKPKKVKCKDKKFTQCYKSEYYCPSDCPRTCVVDCSSCKPVCTPPPPPPPSPPPPPPKPRKLKSPPPPPYIYSSPPP
        M KI  +LFLFF LFLSAAVE  PKPKKVKCKDKKF QCYKSE+YCP+DC RTCVVDCSSCKPVCTPPPPPPPSPPPPPPKPRKLKSPPPPPY+YSSPPP
Subjt:  MAKITMVLFLFFLLFLSAAVEGAPKPKKVKCKDKKFTQCYKSEYYCPSDCPRTCVVDCSSCKPVCTPPPPPPPSPPPPPPKPRKLKSPPPPPYIYSSPPP

Query:  PP------------HIYSS-PPPPYIYSSPPPPATAEPSPPIPPTPTPTPP--TPTPTPPT--SPPPSSEASGQKKVRCKNKGFPHCYLMELTCPSDCPS
        PP            H+YSS PPPPYIYSSPPPP     SPP PP  T  PP   PTP PPT  SPPPSSEASGQKKVRCKN+ FPHCY MELTCP+DCP 
Subjt:  PP------------HIYSS-PPPPYIYSSPPPPATAEPSPPIPPTPTPTPP--TPTPTPPT--SPPPSSEASGQKKVRCKNKGFPHCYLMELTCPSDCPS

Query:  QCEVDCVTCSPVCNCNRPGAVCQDPRFIGGDGITFYFHGKKDQDFCIVTDSNLHINAHFIGRRNLDMKRDFTWVQSLGILFDSHKLFIGARKTTTWDDTI
        QCEVDCVTCS VCNCNRPGAVCQDPRFIGGDGITFYFHGKKD+DFCIVTDSNLHINAHFIGRRN+DMKRDFTWVQSLGILFDSH+LFIGARKT+TWDD  
Subjt:  QCEVDCVTCSPVCNCNRPGAVCQDPRFIGGDGITFYFHGKKDQDFCIVTDSNLHINAHFIGRRNLDMKRDFTWVQSLGILFDSHKLFIGARKTTTWDDTI

Query:  DRLSISFNGETLFLPDKEGATWSNSTSYEGITITRTRNTNAVEIEVPGNFKIKAVVVPITEKESRIHKYGITQEDCFAHLDLSFKFYGLSGEVNGVLGQT
        DRLS+SFN +T+ L ++EGATWSNST+YEGITITRTRNTNAVEI VPGNFKIKAVVVPITEKESRIHKYGITQEDCFAHLDLSFKFY LSGEVNGVLGQT
Subjt:  DRLSISFNGETLFLPDKEGATWSNSTSYEGITITRTRNTNAVEIEVPGNFKIKAVVVPITEKESRIHKYGITQEDCFAHLDLSFKFYGLSGEVNGVLGQT

Query:  YGSNYVSRAKMGVAMPVLGGDKEFASSGLFTTDCAVARFNRKLEGKNTSLEAAVAYANMSCGSDIGGEGVVCKR
        YGSNYVSRAKMGVAMPVLGGDKEFASSG F TDCAVARFN +LEGK++SLE   AY NMSCGSD+ GEGVVCKR
Subjt:  YGSNYVSRAKMGVAMPVLGGDKEFASSGLFTTDCAVARFNRKLEGKNTSLEAAVAYANMSCGSDIGGEGVVCKR

TrEMBL top hitse value%identityAlignment
A0A0A0LSM1 Uncharacterized protein2.4e-20680.93Show/hide
Query:  MAKITMVLFLFFLLFLSAAVEGAPKPKKVKCKDKKFTQCYKSEYYCPSDCPRTCVVDCSSCKPVCTPPPPPPPSPPPPPPKPRKLKSPPPPPYIYSS-PP
        MA+I + LF F  LFLSA VEGAPK KKVKCKDKKF QCYKSE+YCP+DC RTCVVDCSSC+PVCTPPPPPPPSPPPPPPKPRKLKS PPPPYIYSS PP
Subjt:  MAKITMVLFLFFLLFLSAAVEGAPKPKKVKCKDKKFTQCYKSEYYCPSDCPRTCVVDCSSCKPVCTPPPPPPPSPPPPPPKPRKLKSPPPPPYIYSS-PP

Query:  PPPHIYSS--PPPPYIYSS------------PPPPATAEPSPPIPPTPTPTPPTPTPTPPTSPPPSSEASGQKKVRCKNKGFPHCYLMELTCPSDCPSQC
        PPP IYSS  PPPPYIYSS            PPPP T EPSPP+PP PTP   +P   PP SPPPSSEASGQKKVRCKN+G+PHCY MEL+CPSDCPSQC
Subjt:  PPPHIYSS--PPPPYIYSS------------PPPPATAEPSPPIPPTPTPTPPTPTPTPPTSPPPSSEASGQKKVRCKNKGFPHCYLMELTCPSDCPSQC

Query:  EVDCVTCSPVCNCNRPGAVCQDPRFIGGDGITFYFHGKKDQDFCIVTDSNLHINAHFIGRRNLDMKRDFTWVQSLGILFDSHKLFIGARKTTTWDDTIDR
        EVDCVTCSPVCNCNRPGAVCQDP+FIGGDGITFYFHGK+D+DFCIVTDSNLHINAHFIGRRN+DMKRDFTWVQSLGILFDSH+LFI ARKT+TWDD  DR
Subjt:  EVDCVTCSPVCNCNRPGAVCQDPRFIGGDGITFYFHGKKDQDFCIVTDSNLHINAHFIGRRNLDMKRDFTWVQSLGILFDSHKLFIGARKTTTWDDTIDR

Query:  LSISFNGETLFLPDKEGATWSNSTSYEGITITRTRNTNAVEIEVPGNFKIKAVVVPITEKESRIHKYGITQEDCFAHLDLSFKFYGLSGEVNGVLGQTYG
        L IS + ET+ LP++EGATWSNSTSYEGI ITR+R TNAVEIEVPGNFKIKAVVVPITEKES IHKYGITQEDCFAHLDLSFKFY LSG VNGVLGQTYG
Subjt:  LSISFNGETLFLPDKEGATWSNSTSYEGITITRTRNTNAVEIEVPGNFKIKAVVVPITEKESRIHKYGITQEDCFAHLDLSFKFYGLSGEVNGVLGQTYG

Query:  SNYVSRAKMGVAMPVLGGDKEFASSGLFTTDCAVARFNRKLEGKNTSLEAAVAYANMSCGSDIGGEGVVCKR
         NYVSRAKMGVAMPVLGGDKEFASS +F TDC V RF ++++ K + +EAA AYANMSCGSD+ G+GVVCKR
Subjt:  SNYVSRAKMGVAMPVLGGDKEFASSGLFTTDCAVARFNRKLEGKNTSLEAAVAYANMSCGSDIGGEGVVCKR

A0A1S3CF51 uncharacterized protein LOC1035002226.8e-20982.9Show/hide
Query:  MAKITMVLFLFFLLFLSAAVEGAPKPKKVKCKDKKFTQCYKSEYYCPSDCPRTCVVDCSSCKPVCTPPPPPPPSPPPPPPKPRKLKSPPPPPYIYSS-PP
        MA+I + LF FF LFLSA VEG PK KKVKCKDKKF QCYKS++YCP DC RTCVVDCSSC+PVCT PPPPPPSPPPPPPKPRKL+S PPPPYIYSS PP
Subjt:  MAKITMVLFLFFLLFLSAAVEGAPKPKKVKCKDKKFTQCYKSEYYCPSDCPRTCVVDCSSCKPVCTPPPPPPPSPPPPPPKPRKLKSPPPPPYIYSS-PP

Query:  PPPHIYSS--PPPPYIYSS--PPPPATAEPSPPIPPTPTPTPPTPTPTPPTSPPPSSEASGQKKVRCKNKGFPHCYLMELTCPSDCPSQCEVDCVTCSPV
        PPP +YSS  PPPPYIYSS  PPPPAT EPSPP+PPTPTP    P+  PP SPPPSSEASGQKKVRCKN+G+PHCY MEL+CPSDCPSQCEVDCVTCSPV
Subjt:  PPPHIYSS--PPPPYIYSS--PPPPATAEPSPPIPPTPTPTPPTPTPTPPTSPPPSSEASGQKKVRCKNKGFPHCYLMELTCPSDCPSQCEVDCVTCSPV

Query:  CNCNRPGAVCQDPRFIGGDGITFYFHGKKDQDFCIVTDSNLHINAHFIGRRNLDMKRDFTWVQSLGILFDSHKLFIGARKTTTWDDTIDRLSISFNGETL
        CNCNRPGAVCQDP+FIGGDGITFYFHGKKD+DFCIVTDSNLHINAHFIGRRN+DMKRDFTWVQSLGILF SHKLFI ARKT+TWDD  DRL IS + ET+
Subjt:  CNCNRPGAVCQDPRFIGGDGITFYFHGKKDQDFCIVTDSNLHINAHFIGRRNLDMKRDFTWVQSLGILFDSHKLFIGARKTTTWDDTIDRLSISFNGETL

Query:  FLPDKEGATWSNSTSYEGITITRTRNTNAVEIEVPGNFKIKAVVVPITEKESRIHKYGITQEDCFAHLDLSFKFYGLSGEVNGVLGQTYGSNYVSRAKMG
         LP++EGATWSNSTSYEGI I+R+R TNAVEIEVPGNFKIKAVVVPITEKES IHKYGITQEDCFAHLDLSFKFY LSG V+GVLGQTYG+NYVSRAKMG
Subjt:  FLPDKEGATWSNSTSYEGITITRTRNTNAVEIEVPGNFKIKAVVVPITEKESRIHKYGITQEDCFAHLDLSFKFYGLSGEVNGVLGQTYGSNYVSRAKMG

Query:  VAMPVLGGDKEFASSGLFTTDCAVARFNRKLEGKNTSLEAAVAYANMSCGSDIGGEGVVCKR
        VAMPVLGGDKEFASS +F TDC VARF+R+L+GK +S+EAA AYANMSCG+D+ G+GVVCKR
Subjt:  VAMPVLGGDKEFASSGLFTTDCAVARFNRKLEGKNTSLEAAVAYANMSCGSDIGGEGVVCKR

A0A6J1FAJ8 uncharacterized protein LOC1114439072.9e-21283.51Show/hide
Query:  MAKITMVLFLFFLLFLSAAVEGAPKPKKVKCKDKKFTQCYKSEYYCPSDCPRTCVVDCSSCKPVCTPPPPPPPSPPPPPPKPRKLKSPPPPPYIYSSPPP
        M KI  +LFLFF LFLSAAVE  PKPKKVKCKDK F QCYKSE+YCP+DC RTCVVDCSSCKPVCTPPPPPPPSPPPPPPKPRKLKSPPPPPY+YSSPPP
Subjt:  MAKITMVLFLFFLLFLSAAVEGAPKPKKVKCKDKKFTQCYKSEYYCPSDCPRTCVVDCSSCKPVCTPPPPPPPSPPPPPPKPRKLKSPPPPPYIYSSPPP

Query:  PP------------HIYSS-PPPPYIYSSPPPPATAEPSPPIPPTPTPTPP-TPTPTPPT--SPPPSSEASGQKKVRCKNKGFPHCYLMELTCPSDCPSQ
        PP            H+YSS PPPPYIYSSPPPP     SPP P T  P PP  PTPTPPT  SPPPSSEASGQKKVRCKN+ FPHCY MELTCP+DCP Q
Subjt:  PP------------HIYSS-PPPPYIYSSPPPPATAEPSPPIPPTPTPTPP-TPTPTPPT--SPPPSSEASGQKKVRCKNKGFPHCYLMELTCPSDCPSQ

Query:  CEVDCVTCSPVCNCNRPGAVCQDPRFIGGDGITFYFHGKKDQDFCIVTDSNLHINAHFIGRRNLDMKRDFTWVQSLGILFDSHKLFIGARKTTTWDDTID
        CEVDCVTCS VCNCNRPGAVCQDPRFIGGDGITFYFHGKKD+DFCIVTDSNLHINAHFIGRRN+DMKRDFTWVQSLGILFDSH+LFIGARKT+TWDD  D
Subjt:  CEVDCVTCSPVCNCNRPGAVCQDPRFIGGDGITFYFHGKKDQDFCIVTDSNLHINAHFIGRRNLDMKRDFTWVQSLGILFDSHKLFIGARKTTTWDDTID

Query:  RLSISFNGETLFLPDKEGATWSNSTSYEGITITRTRNTNAVEIEVPGNFKIKAVVVPITEKESRIHKYGITQEDCFAHLDLSFKFYGLSGEVNGVLGQTY
        RLS+ FN +T+ L ++EGATWSNST+YEGITITRTRNTNAVEI VPGNFKIKAVVVPITEKESRIHKYGITQEDCFAHLDLSFKFY LSG+VNGVLGQTY
Subjt:  RLSISFNGETLFLPDKEGATWSNSTSYEGITITRTRNTNAVEIEVPGNFKIKAVVVPITEKESRIHKYGITQEDCFAHLDLSFKFYGLSGEVNGVLGQTY

Query:  GSNYVSRAKMGVAMPVLGGDKEFASSGLFTTDCAVARFNRKLEGKNTSLEAAVAYANMSCGSDIGGEGVVCKR
        GSNYVSRAKMGVAMPVLGGDKEFASSG F TDCAVARFN +LEGK++SLE   AY NMSCGSD+ GEGVVCKR
Subjt:  GSNYVSRAKMGVAMPVLGGDKEFASSGLFTTDCAVARFNRKLEGKNTSLEAAVAYANMSCGSDIGGEGVVCKR

A0A6J1FCE2 uncharacterized protein LOC1114441161.7e-21283.51Show/hide
Query:  MAKITMVLFLFFLLFLSAAVEGAPKPKKVKCKDKKFTQCYKSEYYCPSDCPRTCVVDCSSCKPVCTPPPPPPPSPPPPPPKPRKLKSPPPPPYIYSSPPP
        M KI  +LFLFF LFLSAAVE  PKPKKVKCKDK F QCYKSE+YCP+DC RTCVVDCSSCKPVCTPPPPPPPSPPPPPPKPRKLKSPPPPPY+YSSPPP
Subjt:  MAKITMVLFLFFLLFLSAAVEGAPKPKKVKCKDKKFTQCYKSEYYCPSDCPRTCVVDCSSCKPVCTPPPPPPPSPPPPPPKPRKLKSPPPPPYIYSSPPP

Query:  PP------------HI-YSSPPPPYIYSSPPPPATAEPSPPIPPTPTPTPP-TPTPTPPT--SPPPSSEASGQKKVRCKNKGFPHCYLMELTCPSDCPSQ
        PP            H+ YS PPPPYIYSSPPPP     SPP P T  P PP  PTPTPPT  SPPPSSEASGQKKVRCKN+ FPHCY MELTCP+DCP Q
Subjt:  PP------------HI-YSSPPPPYIYSSPPPPATAEPSPPIPPTPTPTPP-TPTPTPPT--SPPPSSEASGQKKVRCKNKGFPHCYLMELTCPSDCPSQ

Query:  CEVDCVTCSPVCNCNRPGAVCQDPRFIGGDGITFYFHGKKDQDFCIVTDSNLHINAHFIGRRNLDMKRDFTWVQSLGILFDSHKLFIGARKTTTWDDTID
        CEVDCVTCS VCNCNRPGAVCQDPRFIGGDGITFYFHGKKD+DFCIVTDSNLHINAHFIGRRN+DMKRDFTWVQSLGILFDSH+LFIGARKT+TWDD  D
Subjt:  CEVDCVTCSPVCNCNRPGAVCQDPRFIGGDGITFYFHGKKDQDFCIVTDSNLHINAHFIGRRNLDMKRDFTWVQSLGILFDSHKLFIGARKTTTWDDTID

Query:  RLSISFNGETLFLPDKEGATWSNSTSYEGITITRTRNTNAVEIEVPGNFKIKAVVVPITEKESRIHKYGITQEDCFAHLDLSFKFYGLSGEVNGVLGQTY
        RLS+SFN +T+ L ++EGATWSNST+YEGITITRTRNTNAVEI VPGNFKIKAVVVPITEKESRIHKYGITQEDCFAHLDLSFKFY LSG+VNGVLGQTY
Subjt:  RLSISFNGETLFLPDKEGATWSNSTSYEGITITRTRNTNAVEIEVPGNFKIKAVVVPITEKESRIHKYGITQEDCFAHLDLSFKFYGLSGEVNGVLGQTY

Query:  GSNYVSRAKMGVAMPVLGGDKEFASSGLFTTDCAVARFNRKLEGKNTSLEAAVAYANMSCGSDIGGEGVVCKR
        GSNYVSRAKMGVAMPVLGGDKEFASSG F TDCAVARFN +LEGK++SLE   AY NMSCGSD+ GEGVVCKR
Subjt:  GSNYVSRAKMGVAMPVLGGDKEFASSGLFTTDCAVARFNRKLEGKNTSLEAAVAYANMSCGSDIGGEGVVCKR

A0A6J1I078 uncharacterized protein LOC1114685305.4e-21483.76Show/hide
Query:  MAKITMVLFLFFLLFLSAAVEGAPKPKKVKCKDKKFTQCYKSEYYCPSDCPRTCVVDCSSCKPVCTPPPPPPPSPPPPPPKPRKLKSPPPPPYIYSSPPP
        M KI  +LFLFF LFLSAAVE  PKPKKVKCKDKKF QCYKSE+YCP+DC RTCVVDCSSCKPVCTPPPPPPPSPPPPPPKPRKLKSPPPPPY+YSSPPP
Subjt:  MAKITMVLFLFFLLFLSAAVEGAPKPKKVKCKDKKFTQCYKSEYYCPSDCPRTCVVDCSSCKPVCTPPPPPPPSPPPPPPKPRKLKSPPPPPYIYSSPPP

Query:  PP------------HIYSS-PPPPYIYSSPPPPATAEPSPPIPPTPTPTPP--TPTPTPPT--SPPPSSEASGQKKVRCKNKGFPHCYLMELTCPSDCPS
        PP            H+YSS PPPPYIYSSPPPP     SPP PP  T  PP   PTP PPT  SPPPSSEASGQKKVRCKN+ FPHCY MELTCP+DCP 
Subjt:  PP------------HIYSS-PPPPYIYSSPPPPATAEPSPPIPPTPTPTPP--TPTPTPPT--SPPPSSEASGQKKVRCKNKGFPHCYLMELTCPSDCPS

Query:  QCEVDCVTCSPVCNCNRPGAVCQDPRFIGGDGITFYFHGKKDQDFCIVTDSNLHINAHFIGRRNLDMKRDFTWVQSLGILFDSHKLFIGARKTTTWDDTI
        QCEVDCVTCS VCNCNRPGAVCQDPRFIGGDGITFYFHGKKD+DFCIVTDSNLHINAHFIGRRN+DMKRDFTWVQSLGILFDSH+LFIGARKT+TWDD  
Subjt:  QCEVDCVTCSPVCNCNRPGAVCQDPRFIGGDGITFYFHGKKDQDFCIVTDSNLHINAHFIGRRNLDMKRDFTWVQSLGILFDSHKLFIGARKTTTWDDTI

Query:  DRLSISFNGETLFLPDKEGATWSNSTSYEGITITRTRNTNAVEIEVPGNFKIKAVVVPITEKESRIHKYGITQEDCFAHLDLSFKFYGLSGEVNGVLGQT
        DRLS+SFN +T+ L ++EGATWSNST+YEGITITRTRNTNAVEI VPGNFKIKAVVVPITEKESRIHKYGITQEDCFAHLDLSFKFY LSGEVNGVLGQT
Subjt:  DRLSISFNGETLFLPDKEGATWSNSTSYEGITITRTRNTNAVEIEVPGNFKIKAVVVPITEKESRIHKYGITQEDCFAHLDLSFKFYGLSGEVNGVLGQT

Query:  YGSNYVSRAKMGVAMPVLGGDKEFASSGLFTTDCAVARFNRKLEGKNTSLEAAVAYANMSCGSDIGGEGVVCKR
        YGSNYVSRAKMGVAMPVLGGDKEFASSG F TDCAVARFN +LEGK++SLE   AY NMSCGSD+ GEGVVCKR
Subjt:  YGSNYVSRAKMGVAMPVLGGDKEFASSGLFTTDCAVARFNRKLEGKNTSLEAAVAYANMSCGSDIGGEGVVCKR

SwissProt top hitse value%identityAlignment
O65375 Leucine-rich repeat extensin-like protein 11.2e-0859.6Show/hide
Query:  PPPPPPPSPPPP-----PPKPRKLKSPPPPPYIYSSPPPPPHIYSSPPPPYIYSSPPPPATAEPSPPIPPTPTPTPPTPTPTPPT--------SPPPSS
        PPPPP PSPPPP     PP P    SPPPPPY+YSSPPPPP++YSSPPPPY+YSSPPPP      PP PP+P P  P  +P PP         SPPP S
Subjt:  PPPPPPPSPPPP-----PPKPRKLKSPPPPPYIYSSPPPPPHIYSSPPPPYIYSSPPPPATAEPSPPIPPTPTPTPPTPTPTPPT--------SPPPSS

Arabidopsis top hitse value%identityAlignment
AT3G19430.1 late embryogenesis abundant protein-related / LEA protein-related2.1e-11446.93Show/hide
Query:  APKPKKVKCKDKKFTQCYKSEYYCPSDCPRTCVVDCSSCKPVCTPP--------------------------------------------PPPP------
        A  P    CK KK+  CY  E+ CP  CP +C V+C+SCKP+C PP                                            PPPP      
Subjt:  APKPKKVKCKDKKFTQCYKSEYYCPSDCPRTCVVDCSSCKPVCTPP--------------------------------------------PPPP------

Query:  PSP-----PPPP------PKPRKLKSPPPP---PYIYS-----SPPPP----------PHIYSSP---PPPYIYSSPPPPATAEPSP----PIPPTP---
        PSP     PPPP      P P    SPPPP   P + S     SPPPP          P + + P   PPP +   PP P  + PSP    P PPTP   
Subjt:  PSP-----PPPP------PKPRKLKSPPPP---PYIYS-----SPPPP----------PHIYSSP---PPPYIYSSPPPPATAEPSP----PIPPTP---

Query:  -----TPTPPTP--------TPTPPTS-------------PPPS--SEASGQKKVRCKNKGFPHCYLMELTCPSDCPSQCEVDCVTCSPVCNCNRPGAVC
             TPTPPTP        TPTPPT              PPPS   EA+G K+VRCK +  P CY +E TCP+DCP  C+VDCVTC PVCNC++PG+VC
Subjt:  -----TPTPPTP--------TPTPPTS-------------PPPS--SEASGQKKVRCKNKGFPHCYLMELTCPSDCPSQCEVDCVTCSPVCNCNRPGAVC

Query:  QDPRFIGGDGITFYFHGKKDQDFCIVTDSNLHINAHFIGRRNLDMKRDFTWVQSLGILFDSHKLFIGARKTTTWDDTIDRLSISFNGETLFLPDKEGATW
        QDPRFIGGDG+TFYFHGKKD +FC+++D NLHINAHFIG+R   M RDFTWVQS+ ILF +H+L++GA KT TWDD++DR+++SF+G  + LP  +GA W
Subjt:  QDPRFIGGDGITFYFHGKKDQDFCIVTDSNLHINAHFIGRRNLDMKRDFTWVQSLGILFDSHKLFIGARKTTTWDDTIDRLSISFNGETLFLPDKEGATW

Query:  SNSTS-YEGITITRTR-NTNAVEIEVPGNFKIKAVVVPITEKESRIHKYGITQEDCFAHLDLSFKFYGLSGEVNGVLGQTYGSNYVSRAKMGVAMPVLGG
        ++S   Y  +++ R   +TN +E+EV G  KI A VVPIT ++SRIH Y + ++DC AHLDL FKF  LS  V+GVLGQTY SNYVSR K+GV MPV+GG
Subjt:  SNSTS-YEGITITRTR-NTNAVEIEVPGNFKIKAVVVPITEKESRIHKYGITQEDCFAHLDLSFKFYGLSGEVNGVLGQTYGSNYVSRAKMGVAMPVLGG

Query:  DKEFASSGLFTTDCAVARFNRKLEGKNTSLEAAVAYANMSCGSDIGGEGVVCKR
        D+EF ++GLF  DC+ ARF     G + +  + +    MSC S +GG+GVVCKR
Subjt:  DKEFASSGLFTTDCAVARFNRKLEGKNTSLEAAVAYANMSCGSDIGGEGVVCKR

AT4G27400.1 Late embryogenesis abundant (LEA) protein-related4.5e-6438.98Show/hide
Query:  CKNKGFPHCYLMELTCPSDCPSQ---------CEVDCV--TCSPVC-----NCNRPGAVCQDPRFIGGDGITFYFHGKKDQDFCIVTDSNLHINAHFIGR
        C     P C L  + CP +CP++         C VDC    C  VC     NC   G++C DPRFIGGDGI FYFHGK ++ F IV+D +  INA F G 
Subjt:  CKNKGFPHCYLMELTCPSDCPSQ---------CEVDCV--TCSPVC-----NCNRPGAVCQDPRFIGGDGITFYFHGKKDQDFCIVTDSNLHINAHFIGR

Query:  RNLDMKRDFTWVQSLGILFDSHKLFIGARKTTTWDDTIDRLSISFNGETLFLPDKEGATWSNSTSYEGITITRTRNTNAVEIEVPGNFKIKAVVVPITEK
        R     RDFTW+Q+LG LF+SHK  +   K  TWD  +D L  + +G+ L +P +  +TW +S   + I I R    N+V + +    +I   VVP+T++
Subjt:  RNLDMKRDFTWVQSLGILFDSHKLFIGARKTTTWDDTIDRLSISFNGETLFLPDKEGATWSNSTSYEGITITRTRNTNAVEIEVPGNFKIKAVVVPITEK

Query:  ESRIHKYGITQEDCFAHLDLSFKFYGLSGEVNGVLGQTYGSNYVSRAKMGVAMPVLGGDKEFASSGLFTTDCAVARFNRKLEGKNTSLEAAVAYANMSCG
        + RIH Y +  +DCFAH ++ FKF  LS +V+G+LG+TY  ++ + AK GV MPV+GG+  F +S L +  C    F+      + S++    YA + C 
Subjt:  ESRIHKYGITQEDCFAHLDLSFKFYGLSGEVNGVLGQTYGSNYVSRAKMGVAMPVLGGDKEFASSGLFTTDCAVARFNRKLEGKNTSLEAAVAYANMSCG

Query:  SDI-GGEGVVCKR
             G G+VC++
Subjt:  SDI-GGEGVVCKR

AT5G54370.1 Late embryogenesis abundant (LEA) protein-related4.2e-7041.27Show/hide
Query:  VRCKNKGFPHCYLMELTCPSDCPSQ---------CEVDC--VTCSPVC-----NCNRPGAVCQDPRFIGGDGITFYFHGKKDQDFCIVTDSNLHINAHFI
        V C N  +  CY   + CP +CPS+         C  DC   TC   C     NCNRPG+ C DPRFIGGDGI FYFHGK +++F +V+DS+L IN  FI
Subjt:  VRCKNKGFPHCYLMELTCPSDCPSQ---------CEVDC--VTCSPVC-----NCNRPGAVCQDPRFIGGDGITFYFHGKKDQDFCIVTDSNLHINAHFI

Query:  GRRNLDMKRDFTWVQSLGILFDSHKLFIGARKTTTWDDTIDRLSISFNGETLFLPDKEGATWSNSTSYEGITITRTRNTNAVEIEVPGNFKIKAVVVPIT
        G R     RDFTW+Q+LG LF+S+K  + A KT +WD+ ID L  S++G+ L +P++  +TW +    + I I R    N+V + +    +I   VVP+T
Subjt:  GRRNLDMKRDFTWVQSLGILFDSHKLFIGARKTTTWDDTIDRLSISFNGETLFLPDKEGATWSNSTSYEGITITRTRNTNAVEIEVPGNFKIKAVVVPIT

Query:  EKESRIHKYGITQEDCFAHLDLSFKFYGLSGEVNGVLGQTYGSNYVSRAKMGVAMPVLGGDKEFASSGLFTTDCAVARFNRKLEGKNTSLEAAVAYANMS
        +++ RIH Y +  +DCFAHL++ F+F+ LS +V+G+LG+TY  ++ + AK GVAMPV+GG+  F +S L + DC    F+   + +  S+++ + YA + 
Subjt:  EKESRIHKYGITQEDCFAHLDLSFKFYGLSGEVNGVLGQTYGSNYVSRAKMGVAMPVLGGDKEFASSGLFTTDCAVARFNRKLEGKNTSLEAAVAYANMS

Query:  CGSDI-GGEGVVCKR
        C      G G+VC++
Subjt:  CGSDI-GGEGVVCKR

AT5G60520.1 Late embryogenesis abundant (LEA) protein-related3.0e-6844.25Show/hide
Query:  SGQKKVRCKNKGFPHCYLMELTCPSDCPSQ----------CEVDCVT-CSPVC-----NCNRPGAVCQDPRFIGGDGITFYFHGKKDQDFCIVTDSNLHI
        SGQ++V+C  +G   C    LTCP +CP +          C +DC + C   C     NCN  G++C DPRF+GGDG+ FYFHG KD +F IV+D NL I
Subjt:  SGQKKVRCKNKGFPHCYLMELTCPSDCPSQ----------CEVDCVT-CSPVC-----NCNRPGAVCQDPRFIGGDGITFYFHGKKDQDFCIVTDSNLHI

Query:  NAHFIGRRNLDMKRDFTWVQSLGILFDSHKLFIGARKTTTWDDTIDRLSISFNGETLFLPDKEGATWSNSTSYEGITITRTRNTNAVEIEVPGNFKIKAV
        NAHFIG R     RDFTWVQ+  ++FDSH L I A+K  +WDD++D L + +NGE + +P +  A W        + + RT   N V + V G  +I   
Subjt:  NAHFIGRRNLDMKRDFTWVQSLGILFDSHKLFIGARKTTTWDDTIDRLSISFNGETLFLPDKEGATWSNSTSYEGITITRTRNTNAVEIEVPGNFKIKAV

Query:  VVPITEKESRIHKYGITQEDCFAHLDLSFKFYGLSGEVNGVLGQTYGSNYVSRAKMGVAMPVLGGDKEFASSGLFTTDCAVARFNRK
        V PI ++E R+HKY + ++D FAHL+  FKF+ LS  V GVLG+TY   YVS  K GV MP++GG+ ++ +  LF+  C V RF  K
Subjt:  VVPITEKESRIHKYGITQEDCFAHLDLSFKFYGLSGEVNGVLGQTYGSNYVSRAKMGVAMPVLGGDKEFASSGLFTTDCAVARFNRK

AT5G60530.1 late embryogenesis abundant protein-related / LEA protein-related5.7e-6742.57Show/hide
Query:  SGQKKVRCKNKGFPHCYLMELTCPSDCPSQ----------CEVDCVT-CSPVC-----NCNRPGAVCQDPRFIGGDGITFYFHGKKDQDFCIVTDSNLHI
        +GQ++  C+ +G   CY   L CP +CP +          C +DC   C   C     NCN  G++C DPRF+GGDG+ FYFHG K  +F IV+D+NL I
Subjt:  SGQKKVRCKNKGFPHCYLMELTCPSDCPSQ----------CEVDCVT-CSPVC-----NCNRPGAVCQDPRFIGGDGITFYFHGKKDQDFCIVTDSNLHI

Query:  NAHFIGRRNLDMKRDFTWVQSLGILFDSHKLFIGARKTTTWDDTIDRLSISFNGETLFLPDKEGATWSN-STSYEGITITRTRNTNAVEIEVPGNFKIKA
        NAHFIG R +   RDFTWVQ+L ++F++HKL I A +   WD+T D  +I ++GE + LP+ E + W   S   + I I RT   N+V + V    ++  
Subjt:  NAHFIGRRNLDMKRDFTWVQSLGILFDSHKLFIGARKTTTWDDTIDRLSISFNGETLFLPDKEGATWSN-STSYEGITITRTRNTNAVEIEVPGNFKIKA

Query:  VVVPITEKESRIHKYGITQEDCFAHLDLSFKFYGLSGEVNGVLGQTYGSNYVSRAKMGVAMPVLGGDKEFASSGLFTTDCAVARFNRKLEGKNTSL
         V PI ++E+R+H Y + Q+D FAHL+  FKF  LS  V GVLG+TY  +YVS AK GV MPVLGG+ ++ +  LF+  C + RF  + E  +  +
Subjt:  VVVPITEKESRIHKYGITQEDCFAHLDLSFKFYGLSGEVNGVLGQTYGSNYVSRAKMGVAMPVLGGDKEFASSGLFTTDCAVARFNRKLEGKNTSL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAAAAATCACCATGGTCCTTTTCTTGTTCTTTCTTCTCTTCCTCTCAGCCGCTGTTGAGGGAGCTCCAAAACCCAAGAAAGTTAAATGCAAAGACAAGAAATTTAC
TCAATGTTACAAATCTGAGTACTATTGTCCTTCTGATTGTCCTCGAACTTGTGTTGTTGATTGTTCTTCTTGCAAACCTGTTTGTACTCCGCCCCCTCCACCACCACCTT
CCCCTCCTCCGCCGCCACCAAAACCCCGCAAGCTCAAATCTCCGCCGCCGCCACCGTACATTTACTCTTCCCCACCGCCACCCCCTCATATTTACTCTTCTCCTCCGCCT
CCTTACATTTACTCTTCACCACCACCACCAGCTACGGCGGAGCCTTCACCTCCAATCCCTCCGACTCCCACTCCGACCCCTCCAACTCCAACTCCGACTCCTCCAACATC
TCCACCGCCGTCATCTGAGGCGTCGGGACAGAAGAAAGTTAGGTGCAAGAATAAGGGCTTTCCACATTGCTACTTGATGGAGCTAACTTGTCCAAGTGATTGTCCTAGCC
AATGTGAGGTTGATTGTGTTACTTGCAGCCCCGTTTGCAACTGCAACCGTCCGGGCGCAGTGTGCCAAGATCCCAGATTCATTGGAGGAGATGGAATCACCTTCTACTTC
CATGGCAAAAAAGACCAAGATTTCTGTATCGTCACCGACTCCAACCTCCACATCAATGCTCACTTCATCGGCCGACGAAACCTCGACATGAAGAGGGACTTCACTTGGGT
CCAATCCCTCGGCATCCTCTTCGACTCGCACAAGCTCTTCATCGGTGCTCGGAAAACCACAACATGGGATGACACCATCGACCGCCTCTCCATCTCCTTCAACGGCGAAA
CCCTCTTCCTCCCAGATAAGGAGGGTGCCACCTGGAGTAATTCAACCTCGTACGAGGGAATCACAATAACAAGAACTCGCAACACGAATGCCGTCGAGATTGAAGTCCCC
GGGAACTTCAAGATCAAGGCTGTCGTGGTTCCGATAACGGAAAAGGAATCGAGGATCCACAAGTATGGGATTACACAAGAGGATTGCTTTGCCCATTTGGACTTGAGCTT
CAAGTTCTATGGTTTGAGTGGCGAAGTGAATGGGGTTTTGGGACAGACTTATGGTAGCAATTATGTGAGCAGGGCCAAGATGGGAGTGGCAATGCCTGTTTTGGGGGGCG
ATAAGGAGTTTGCTTCATCAGGTCTTTTTACTACAGATTGTGCAGTGGCACGTTTCAATAGGAAGTTAGAAGGAAAAAACACTTCTTTGGAGGCTGCTGTAGCCTATGCC
AATATGAGCTGTGGCAGTGACATCGGAGGTGAAGGAGTTGTTTGCAAACGATAA
mRNA sequenceShow/hide mRNA sequence
ATGGCAAAAATCACCATGGTCCTTTTCTTGTTCTTTCTTCTCTTCCTCTCAGCCGCTGTTGAGGGAGCTCCAAAACCCAAGAAAGTTAAATGCAAAGACAAGAAATTTAC
TCAATGTTACAAATCTGAGTACTATTGTCCTTCTGATTGTCCTCGAACTTGTGTTGTTGATTGTTCTTCTTGCAAACCTGTTTGTACTCCGCCCCCTCCACCACCACCTT
CCCCTCCTCCGCCGCCACCAAAACCCCGCAAGCTCAAATCTCCGCCGCCGCCACCGTACATTTACTCTTCCCCACCGCCACCCCCTCATATTTACTCTTCTCCTCCGCCT
CCTTACATTTACTCTTCACCACCACCACCAGCTACGGCGGAGCCTTCACCTCCAATCCCTCCGACTCCCACTCCGACCCCTCCAACTCCAACTCCGACTCCTCCAACATC
TCCACCGCCGTCATCTGAGGCGTCGGGACAGAAGAAAGTTAGGTGCAAGAATAAGGGCTTTCCACATTGCTACTTGATGGAGCTAACTTGTCCAAGTGATTGTCCTAGCC
AATGTGAGGTTGATTGTGTTACTTGCAGCCCCGTTTGCAACTGCAACCGTCCGGGCGCAGTGTGCCAAGATCCCAGATTCATTGGAGGAGATGGAATCACCTTCTACTTC
CATGGCAAAAAAGACCAAGATTTCTGTATCGTCACCGACTCCAACCTCCACATCAATGCTCACTTCATCGGCCGACGAAACCTCGACATGAAGAGGGACTTCACTTGGGT
CCAATCCCTCGGCATCCTCTTCGACTCGCACAAGCTCTTCATCGGTGCTCGGAAAACCACAACATGGGATGACACCATCGACCGCCTCTCCATCTCCTTCAACGGCGAAA
CCCTCTTCCTCCCAGATAAGGAGGGTGCCACCTGGAGTAATTCAACCTCGTACGAGGGAATCACAATAACAAGAACTCGCAACACGAATGCCGTCGAGATTGAAGTCCCC
GGGAACTTCAAGATCAAGGCTGTCGTGGTTCCGATAACGGAAAAGGAATCGAGGATCCACAAGTATGGGATTACACAAGAGGATTGCTTTGCCCATTTGGACTTGAGCTT
CAAGTTCTATGGTTTGAGTGGCGAAGTGAATGGGGTTTTGGGACAGACTTATGGTAGCAATTATGTGAGCAGGGCCAAGATGGGAGTGGCAATGCCTGTTTTGGGGGGCG
ATAAGGAGTTTGCTTCATCAGGTCTTTTTACTACAGATTGTGCAGTGGCACGTTTCAATAGGAAGTTAGAAGGAAAAAACACTTCTTTGGAGGCTGCTGTAGCCTATGCC
AATATGAGCTGTGGCAGTGACATCGGAGGTGAAGGAGTTGTTTGCAAACGATAA
Protein sequenceShow/hide protein sequence
MAKITMVLFLFFLLFLSAAVEGAPKPKKVKCKDKKFTQCYKSEYYCPSDCPRTCVVDCSSCKPVCTPPPPPPPSPPPPPPKPRKLKSPPPPPYIYSSPPPPPHIYSSPPP
PYIYSSPPPPATAEPSPPIPPTPTPTPPTPTPTPPTSPPPSSEASGQKKVRCKNKGFPHCYLMELTCPSDCPSQCEVDCVTCSPVCNCNRPGAVCQDPRFIGGDGITFYF
HGKKDQDFCIVTDSNLHINAHFIGRRNLDMKRDFTWVQSLGILFDSHKLFIGARKTTTWDDTIDRLSISFNGETLFLPDKEGATWSNSTSYEGITITRTRNTNAVEIEVP
GNFKIKAVVVPITEKESRIHKYGITQEDCFAHLDLSFKFYGLSGEVNGVLGQTYGSNYVSRAKMGVAMPVLGGDKEFASSGLFTTDCAVARFNRKLEGKNTSLEAAVAYA
NMSCGSDIGGEGVVCKR