CuGenDBv2

MS006752 (gene) of Bitter gourd (TR) v1 genome

Gene ID: MS006752
Organism: Momordica charantia cv. TR (Bitter gourd (TR) v1)
Description: Late embryogenesis abundant protein-related / LEA protein-related protein
Genome location: scaffold60:87874..90033
RNA-Seq Expression: MS006752
Synteny: MS006752
Gene Ontology terms:
GO:0001505 - regulation of neurotransmitter levels (biological process)
GO:0007186 - G protein-coupled receptor signaling pathway (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0004969 - histamine receptor activity (molecular function)
InterPro domains:
IPR003980 - Histamine H3 receptor
IPR009646 - Root cap
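
The genome location field above can be split into its parts with a few lines of Python. This is a minimal sketch, not part of the database record: the function names `parse_location` and `span_length` are hypothetical, and the code assumes the coordinates are 1-based and inclusive, which the page itself does not state.

```python
import re

def parse_location(loc):
    """Split 'seqid:start..end' into (seqid, start, end)."""
    m = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", loc)
    if m is None:
        raise ValueError(f"unrecognized location: {loc!r}")
    return m.group(1), int(m.group(2)), int(m.group(3))

def span_length(loc):
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    _, start, end = parse_location(loc)
    return end - start + 1

seqid, start, end = parse_location("scaffold60:87874..90033")
print(seqid, start, end, span_length("scaffold60:87874..90033"))
```

Under that assumption the gene spans 2160 bp on scaffold60.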


Homology
GenBank top hits | e value | % identity
XP_004139573.2 uncharacterized protein LOC101207232 [Cucumis sativus] | 9.0e-200 | 79.87
Query:  MARIAIALFLFILLLSAAVEAAPKPKKEKPKKVKCKDKKYPKCYKSDLYCPDNCPRTCVVDCSTCQPVCIPPPPPPPSPPPPPPKPRKRKSPPPPY----
        MARIAI LF   L LSA VE AP     K KKVKCKDKK+P+CYKS+ YCP +C RTCVVDCS+CQPVC PPPPPPPSPPPPPPKPRK KSPPPPY    
Subjt:  MARIAIALFLFILLLSAAVEAAPKPKKEKPKKVKCKDKKYPKCYKSDLYCPDNCPRTCVVDCSTCQPVCIPPPPPPPSPPPPPPKPRKRKSPPPPY----

Query:  -------IYSSPPPPPPYIYSSPPPPPFIYSSPPPPPPATAEPTPPFPPALTPP---PPSLSPPPSSEASGQKRVRCKSRSFPHCYGMELSCPSDCPSQC
               IYSSPPPPPPYIYSSPPPPP IYSSPPPPPP T EP+PP PPA TPP   PP LSPPPSSEASGQK+VRCK+R +PHCYGMELSCPSDCPSQC
Subjt:  -------IYSSPPPPPPYIYSSPPPPPFIYSSPPPPPPATAEPTPPFPPALTPP---PPSLSPPPSSEASGQKRVRCKSRSFPHCYGMELSCPSDCPSQC

Query:  EVDCVTCSPVCNCDRPGAVCQDPKFIGGDGITFYFHGKKDRDFCIVTDSNLHINARFIGRRNPDMNRDFTWVQSLGILFDSHKLFIGARKTAIWDDAVDR
        EVDCVTCSPVCNC+RPGAVCQDPKFIGGDGITFYFHGK+D+DFCIVTDSNLHINA FIGRRN DM RDFTWVQSLGILFDSH+LFI ARKT+ WDDA DR
Subjt:  EVDCVTCSPVCNCDRPGAVCQDPKFIGGDGITFYFHGKKDRDFCIVTDSNLHINARFIGRRNPDMNRDFTWVQSLGILFDSHKLFIGARKTAIWDDAVDR

Query:  LSLSLNDETILLPNKDASTW-NST---GIAITRTRNTNAVEIEVPGNFKIKAVVVPITEKESRIHKYGITQEDCFAHLDLSFKFYALSGDVNGVLGQTYA
        L +SL+DETI+LPN++ +TW NST   GIAITR+R TNAVEIEVPGNFKIKAVVVPITEKES IHKYGITQEDCFAHLDLSFKFYALSG+VNGVLGQTY 
Subjt:  LSLSLNDETILLPNKDASTW-NST---GIAITRTRNTNAVEIEVPGNFKIKAVVVPITEKESRIHKYGITQEDCFAHLDLSFKFYALSGDVNGVLGQTYA

Query:  TNYVSRAKMGVAMPVLGGDKEFASSSLFATDCAVARFSGE----ETSLEAVAYANMNCGSDYLGAQGVVCKR
         NYVSRAKMGVAMPVLGGDKEFASSS+FATDC V RF+ E    E+ +EA AYANM+CGSD +  QGVVCKR
Subjt:  TNYVSRAKMGVAMPVLGGDKEFASSSLFATDCAVARFSGE----ETSLEAVAYANMNCGSDYLGAQGVVCKR

XP_008461680.1 PREDICTED: uncharacterized protein LOC103500222 [Cucumis melo] | 2.1e-201 | 82
Query:  MARIAIALFLFILLLSAAVEAAPKPKKEKPKKVKCKDKKYPKCYKSDLYCPDNCPRTCVVDCSTCQPVCIPPPPPPPSPPPPPPKPRKRKSPPPPYIYSS
        MARIAI LF F L LSA VE  P     K KKVKCKDKK+P+CYKS  YCPD+C RTCVVDCS+CQPVC  PPPPPPSPPPPPPKPRK +SPPPPYIYSS
Subjt:  MARIAIALFLFILLLSAAVEAAPKPKKEKPKKVKCKDKKYPKCYKSDLYCPDNCPRTCVVDCSTCQPVCIPPPPPPPSPPPPPPKPRKRKSPPPPYIYSS

Query:  PPPPPPYIYSS-PPPPPFIYSSPPPPPPATAEPTPPFPPALTPP--PPSLSPPPSSEASGQKRVRCKSRSFPHCYGMELSCPSDCPSQCEVDCVTCSPVC
        PPPPPP +YSS PPPPP+IYSSPPPPPPAT EP+PP PP  TPP  PP LSPPPSSEASGQK+VRCK+R +PHCYGMELSCPSDCPSQCEVDCVTCSPVC
Subjt:  PPPPPPYIYSS-PPPPPFIYSSPPPPPPATAEPTPPFPPALTPP--PPSLSPPPSSEASGQKRVRCKSRSFPHCYGMELSCPSDCPSQCEVDCVTCSPVC

Query:  NCDRPGAVCQDPKFIGGDGITFYFHGKKDRDFCIVTDSNLHINARFIGRRNPDMNRDFTWVQSLGILFDSHKLFIGARKTAIWDDAVDRLSLSLNDETIL
        NC+RPGAVCQDPKFIGGDGITFYFHGKKDRDFCIVTDSNLHINA FIGRRN DM RDFTWVQSLGILF SHKLFI ARKT+ WDDA DRL +SL+DETIL
Subjt:  NCDRPGAVCQDPKFIGGDGITFYFHGKKDRDFCIVTDSNLHINARFIGRRNPDMNRDFTWVQSLGILFDSHKLFIGARKTAIWDDAVDRLSLSLNDETIL

Query:  LPNKDASTW-NST---GIAITRTRNTNAVEIEVPGNFKIKAVVVPITEKESRIHKYGITQEDCFAHLDLSFKFYALSGDVNGVLGQTYATNYVSRAKMGV
        LPN++ +TW NST   GIAI+R+R TNAVEIEVPGNFKIKAVVVPITEKES IHKYGITQEDCFAHLDLSFKFYALSG+V+GVLGQTY  NYVSRAKMGV
Subjt:  LPNKDASTW-NST---GIAITRTRNTNAVEIEVPGNFKIKAVVVPITEKESRIHKYGITQEDCFAHLDLSFKFYALSGDVNGVLGQTYATNYVSRAKMGV

Query:  AMPVLGGDKEFASSSLFATDCAVARFS----GEETSLEAVAYANMNCGSDYLGAQGVVCKR
        AMPVLGGDKEFASSS+FATDC VARFS    G+E+S+EA AYANM+CG+D  G QGVVCKR
Subjt:  AMPVLGGDKEFASSSLFATDCAVARFS----GEETSLEAVAYANMNCGSDYLGAQGVVCKR

XP_022142605.1 uncharacterized protein LOC111012681 [Momordica charantia] | 2.0e-252 | 99.78
Query:  MARIAIALFLFILLLSAAVEAAPKPKKEKPKKVKCKDKKYPKCYKSDLYCPDNCPRTCVVDCSTCQPVCIPPPPPPPSPPPPPPKPRKRKSPPPPYIYSS
        MARIAIALFLFILLLSAAVEAAPKPKKEKPKKVKCKDKKYPKCYKSDLYCPDNCPRTCVVDCSTCQPVCIPPPPPPPSPPPPPPKPRKRKSPPPPYIYSS
Subjt:  MARIAIALFLFILLLSAAVEAAPKPKKEKPKKVKCKDKKYPKCYKSDLYCPDNCPRTCVVDCSTCQPVCIPPPPPPPSPPPPPPKPRKRKSPPPPYIYSS

Query:  PPPPPPYIYSSPPPPPFIYSSPPPPPPATAEPTPPFPPALTPPPPSLSPPPSSEASGQKRVRCKSRSFPHCYGMELSCPSDCPSQCEVDCVTCSPVCNCD
        PPPPPPYIYSSPPPPPFIYSSPPPPPPATAEPTPPFPPALTPPPPSLSPPPSSEASGQKRVRCKSRSFPHCYGMELSCPSDCPSQCEVDCVTCSPVCNCD
Subjt:  PPPPPPYIYSSPPPPPFIYSSPPPPPPATAEPTPPFPPALTPPPPSLSPPPSSEASGQKRVRCKSRSFPHCYGMELSCPSDCPSQCEVDCVTCSPVCNCD

Query:  RPGAVCQDPKFIGGDGITFYFHGKKDRDFCIVTDSNLHINARFIGRRNPDMNRDFTWVQSLGILFDSHKLFIGARKTAIWDDAVDRLSLSLNDETILLPN
        RPGAVCQDPKFIGGDGITFYFHGKKDRDFCIVTDSNLHINARFIGRRNPDMNRDFTWVQSLGILFDSHKLFIGARKTAIWDDAVDRLSLSLNDETILLPN
Subjt:  RPGAVCQDPKFIGGDGITFYFHGKKDRDFCIVTDSNLHINARFIGRRNPDMNRDFTWVQSLGILFDSHKLFIGARKTAIWDDAVDRLSLSLNDETILLPN

Query:  KDASTWNSTGIAITRTRNTNAVEIEVPGNFKIKAVVVPITEKESRIHKYGITQEDCFAHLDLSFKFYALSGDVNGVLGQTYATNYVSRAKMGVAMPVLGG
        KDASTWNSTGIAITRTRNTNAVEIEVPGNFKIKAVVVPITEKESRIHKYGITQEDCFAHLDLSFKFYALSGDVNGVLGQTYATNYVSRAKMGVAMPVLGG
Subjt:  KDASTWNSTGIAITRTRNTNAVEIEVPGNFKIKAVVVPITEKESRIHKYGITQEDCFAHLDLSFKFYALSGDVNGVLGQTYATNYVSRAKMGVAMPVLGG

Query:  DKEFASSSLFATDCAVARFSGEETSLEAVAYANMNCGSDYLGAQGVVCKR
        DKEFASSSLFATDCAVA+FSGEETSLEAVAYANMNCGSDYLGAQGVVCKR
Subjt:  DKEFASSSLFATDCAVARFSGEETSLEAVAYANMNCGSDYLGAQGVVCKR

XP_022937854.1 uncharacterized protein LOC111444116 [Cucurbita moschata] | 2.5e-194 | 77.82
Query:  MARIAIALFLFILLLSAAVEAAPKPKKEKPKKVKCKDKKYPKCYKSDLYCPDNCPRTCVVDCSTCQPVCIPPPPPPPSPPPPPPKPRKRKS-PPPPYIYS
        M +I   LFLF L LSAAVEA P     KPKKVKCKDK +P+CYKS+ YCP +C RTCVVDCS+C+PVC PPPPPPPSPPPPPPKPRK KS PPPPY+YS
Subjt:  MARIAIALFLFILLLSAAVEAAPKPKKEKPKKVKCKDKKYPKCYKSDLYCPDNCPRTCVVDCSTCQPVCIPPPPPPPSPPPPPPKPRKRKS-PPPPYIYS

Query:  SPPPPPPYIYS-----------SPPPPPFIYSSPP--------PPPPATAEPTPPFPPALTPPPPSLSPPPSSEASGQKRVRCKSRSFPHCYGMELSCPS
        SPPPPPPYIYS           SPPPPP+IYSSPP        PPPPAT EP PP PP  T PP SLSPPPSSEASGQK+VRCK+RSFPHCYGMEL+CP+
Subjt:  SPPPPPPYIYS-----------SPPPPPFIYSSPP--------PPPPATAEPTPPFPPALTPPPPSLSPPPSSEASGQKRVRCKSRSFPHCYGMELSCPS

Query:  DCPSQCEVDCVTCSPVCNCDRPGAVCQDPKFIGGDGITFYFHGKKDRDFCIVTDSNLHINARFIGRRNPDMNRDFTWVQSLGILFDSHKLFIGARKTAIW
        DCP QCEVDCVTCS VCNC+RPGAVCQDP+FIGGDGITFYFHGKKD+DFCIVTDSNLHINA FIGRRN DM RDFTWVQSLGILFDSH+LFIGARKT+ W
Subjt:  DCPSQCEVDCVTCSPVCNCDRPGAVCQDPKFIGGDGITFYFHGKKDRDFCIVTDSNLHINARFIGRRNPDMNRDFTWVQSLGILFDSHKLFIGARKTAIW

Query:  DDAVDRLSLSLNDETILLPNKDASTW-NST---GIAITRTRNTNAVEIEVPGNFKIKAVVVPITEKESRIHKYGITQEDCFAHLDLSFKFYALSGDVNGV
        DDA DRLSLS N++TI+L N++ +TW NST   GI ITRTRNTNAVEI VPGNFKIKAVVVPITEKESRIHKYGITQEDCFAHLDLSFKFYALSG VNGV
Subjt:  DDAVDRLSLSLNDETILLPNKDASTW-NST---GIAITRTRNTNAVEIEVPGNFKIKAVVVPITEKESRIHKYGITQEDCFAHLDLSFKFYALSGDVNGV

Query:  LGQTYATNYVSRAKMGVAMPVLGGDKEFASSSLFATDCAVARFSGE----ETSLEAVAYANMNCGSDYLGAQGVVCKR
        LGQTY +NYVSRAKMGVAMPVLGGDKEFASS  FATDCAVARF+G+    ++SLE  AY NM+CGSD  G +GVVCKR
Subjt:  LGQTYATNYVSRAKMGVAMPVLGGDKEFASSSLFATDCAVARFSGE----ETSLEAVAYANMNCGSDYLGAQGVVCKR

XP_022969544.1 uncharacterized protein LOC111468530 [Cucurbita maxima] | 3.3e-194 | 77.92
Query:  MARIAIALFLFILLLSAAVEAAPKPKKEKPKKVKCKDKKYPKCYKSDLYCPDNCPRTCVVDCSTCQPVCIPPPPPPPSPPPPPPKPRKRKS-PPPPYIYS
        M +I   LFLF L LSAAVEA P     KPKKVKCKDKK+P+CYKS+ YCP +C RTCVVDCS+C+PVC PPPPPPPSPPPPPPKPRK KS PPPPY+YS
Subjt:  MARIAIALFLFILLLSAAVEAAPKPKKEKPKKVKCKDKKYPKCYKSDLYCPDNCPRTCVVDCSTCQPVCIPPPPPPPSPPPPPPKPRKRKS-PPPPYIYS

Query:  SPPPPPPYI-----------YSSPPPPPFIYSSP---------PPPPPATAEPTPPFPPALTP-PPPSLSPPPSSEASGQKRVRCKSRSFPHCYGMELSC
        SPPPPPPYI           YSSPPPPP+IYSSP         PPPPPAT EP PP PP  TP PP S SPPPSSEASGQK+VRCK+RSFPHCYGMEL+C
Subjt:  SPPPPPPYI-----------YSSPPPPPFIYSSP---------PPPPPATAEPTPPFPPALTP-PPPSLSPPPSSEASGQKRVRCKSRSFPHCYGMELSC

Query:  PSDCPSQCEVDCVTCSPVCNCDRPGAVCQDPKFIGGDGITFYFHGKKDRDFCIVTDSNLHINARFIGRRNPDMNRDFTWVQSLGILFDSHKLFIGARKTA
        P+DCP QCEVDCVTCS VCNC+RPGAVCQDP+FIGGDGITFYFHGKKDRDFCIVTDSNLHINA FIGRRN DM RDFTWVQSLGILFDSH+LFIGARKT+
Subjt:  PSDCPSQCEVDCVTCSPVCNCDRPGAVCQDPKFIGGDGITFYFHGKKDRDFCIVTDSNLHINARFIGRRNPDMNRDFTWVQSLGILFDSHKLFIGARKTA

Query:  IWDDAVDRLSLSLNDETILLPNKDASTW-NST---GIAITRTRNTNAVEIEVPGNFKIKAVVVPITEKESRIHKYGITQEDCFAHLDLSFKFYALSGDVN
         WDDA DRLSLS N++TI+L N++ +TW NST   GI ITRTRNTNAVEI VPGNFKIKAVVVPITEKESRIHKYGITQEDCFAHLDLSFKFYALSG+VN
Subjt:  IWDDAVDRLSLSLNDETILLPNKDASTW-NST---GIAITRTRNTNAVEIEVPGNFKIKAVVVPITEKESRIHKYGITQEDCFAHLDLSFKFYALSGDVN

Query:  GVLGQTYATNYVSRAKMGVAMPVLGGDKEFASSSLFATDCAVARFSGE----ETSLEAVAYANMNCGSDYLGAQGVVCKR
        GVLGQTY +NYVSRAKMGVAMPVLGGDKEFASS  FATDCAVARF+G+    ++SLE  AY NM+CGSD  G +GVVCKR
Subjt:  GVLGQTYATNYVSRAKMGVAMPVLGGDKEFASSSLFATDCAVARFSGE----ETSLEAVAYANMNCGSDYLGAQGVVCKR

TrEMBL top hits | e value | % identity
A0A0A0LSM1 Uncharacterized protein | 4.3e-200 | 79.87
Query:  MARIAIALFLFILLLSAAVEAAPKPKKEKPKKVKCKDKKYPKCYKSDLYCPDNCPRTCVVDCSTCQPVCIPPPPPPPSPPPPPPKPRKRKSPPPPY----
        MARIAI LF   L LSA VE AP     K KKVKCKDKK+P+CYKS+ YCP +C RTCVVDCS+CQPVC PPPPPPPSPPPPPPKPRK KSPPPPY    
Subjt:  MARIAIALFLFILLLSAAVEAAPKPKKEKPKKVKCKDKKYPKCYKSDLYCPDNCPRTCVVDCSTCQPVCIPPPPPPPSPPPPPPKPRKRKSPPPPY----

Query:  -------IYSSPPPPPPYIYSSPPPPPFIYSSPPPPPPATAEPTPPFPPALTPP---PPSLSPPPSSEASGQKRVRCKSRSFPHCYGMELSCPSDCPSQC
               IYSSPPPPPPYIYSSPPPPP IYSSPPPPPP T EP+PP PPA TPP   PP LSPPPSSEASGQK+VRCK+R +PHCYGMELSCPSDCPSQC
Subjt:  -------IYSSPPPPPPYIYSSPPPPPFIYSSPPPPPPATAEPTPPFPPALTPP---PPSLSPPPSSEASGQKRVRCKSRSFPHCYGMELSCPSDCPSQC

Query:  EVDCVTCSPVCNCDRPGAVCQDPKFIGGDGITFYFHGKKDRDFCIVTDSNLHINARFIGRRNPDMNRDFTWVQSLGILFDSHKLFIGARKTAIWDDAVDR
        EVDCVTCSPVCNC+RPGAVCQDPKFIGGDGITFYFHGK+D+DFCIVTDSNLHINA FIGRRN DM RDFTWVQSLGILFDSH+LFI ARKT+ WDDA DR
Subjt:  EVDCVTCSPVCNCDRPGAVCQDPKFIGGDGITFYFHGKKDRDFCIVTDSNLHINARFIGRRNPDMNRDFTWVQSLGILFDSHKLFIGARKTAIWDDAVDR

Query:  LSLSLNDETILLPNKDASTW-NST---GIAITRTRNTNAVEIEVPGNFKIKAVVVPITEKESRIHKYGITQEDCFAHLDLSFKFYALSGDVNGVLGQTYA
        L +SL+DETI+LPN++ +TW NST   GIAITR+R TNAVEIEVPGNFKIKAVVVPITEKES IHKYGITQEDCFAHLDLSFKFYALSG+VNGVLGQTY 
Subjt:  LSLSLNDETILLPNKDASTW-NST---GIAITRTRNTNAVEIEVPGNFKIKAVVVPITEKESRIHKYGITQEDCFAHLDLSFKFYALSGDVNGVLGQTYA

Query:  TNYVSRAKMGVAMPVLGGDKEFASSSLFATDCAVARFSGE----ETSLEAVAYANMNCGSDYLGAQGVVCKR
         NYVSRAKMGVAMPVLGGDKEFASSS+FATDC V RF+ E    E+ +EA AYANM+CGSD +  QGVVCKR
Subjt:  TNYVSRAKMGVAMPVLGGDKEFASSSLFATDCAVARFSGE----ETSLEAVAYANMNCGSDYLGAQGVVCKR

A0A1S3CF51 uncharacterized protein LOC103500222 | 1.0e-201 | 82
Query:  MARIAIALFLFILLLSAAVEAAPKPKKEKPKKVKCKDKKYPKCYKSDLYCPDNCPRTCVVDCSTCQPVCIPPPPPPPSPPPPPPKPRKRKSPPPPYIYSS
        MARIAI LF F L LSA VE  P     K KKVKCKDKK+P+CYKS  YCPD+C RTCVVDCS+CQPVC  PPPPPPSPPPPPPKPRK +SPPPPYIYSS
Subjt:  MARIAIALFLFILLLSAAVEAAPKPKKEKPKKVKCKDKKYPKCYKSDLYCPDNCPRTCVVDCSTCQPVCIPPPPPPPSPPPPPPKPRKRKSPPPPYIYSS

Query:  PPPPPPYIYSS-PPPPPFIYSSPPPPPPATAEPTPPFPPALTPP--PPSLSPPPSSEASGQKRVRCKSRSFPHCYGMELSCPSDCPSQCEVDCVTCSPVC
        PPPPPP +YSS PPPPP+IYSSPPPPPPAT EP+PP PP  TPP  PP LSPPPSSEASGQK+VRCK+R +PHCYGMELSCPSDCPSQCEVDCVTCSPVC
Subjt:  PPPPPPYIYSS-PPPPPFIYSSPPPPPPATAEPTPPFPPALTPP--PPSLSPPPSSEASGQKRVRCKSRSFPHCYGMELSCPSDCPSQCEVDCVTCSPVC

Query:  NCDRPGAVCQDPKFIGGDGITFYFHGKKDRDFCIVTDSNLHINARFIGRRNPDMNRDFTWVQSLGILFDSHKLFIGARKTAIWDDAVDRLSLSLNDETIL
        NC+RPGAVCQDPKFIGGDGITFYFHGKKDRDFCIVTDSNLHINA FIGRRN DM RDFTWVQSLGILF SHKLFI ARKT+ WDDA DRL +SL+DETIL
Subjt:  NCDRPGAVCQDPKFIGGDGITFYFHGKKDRDFCIVTDSNLHINARFIGRRNPDMNRDFTWVQSLGILFDSHKLFIGARKTAIWDDAVDRLSLSLNDETIL

Query:  LPNKDASTW-NST---GIAITRTRNTNAVEIEVPGNFKIKAVVVPITEKESRIHKYGITQEDCFAHLDLSFKFYALSGDVNGVLGQTYATNYVSRAKMGV
        LPN++ +TW NST   GIAI+R+R TNAVEIEVPGNFKIKAVVVPITEKES IHKYGITQEDCFAHLDLSFKFYALSG+V+GVLGQTY  NYVSRAKMGV
Subjt:  LPNKDASTW-NST---GIAITRTRNTNAVEIEVPGNFKIKAVVVPITEKESRIHKYGITQEDCFAHLDLSFKFYALSGDVNGVLGQTYATNYVSRAKMGV

Query:  AMPVLGGDKEFASSSLFATDCAVARFS----GEETSLEAVAYANMNCGSDYLGAQGVVCKR
        AMPVLGGDKEFASSS+FATDC VARFS    G+E+S+EA AYANM+CG+D  G QGVVCKR
Subjt:  AMPVLGGDKEFASSSLFATDCAVARFS----GEETSLEAVAYANMNCGSDYLGAQGVVCKR

A0A6J1CLE3 uncharacterized protein LOC111012681 | 9.8e-253 | 99.78
Query:  MARIAIALFLFILLLSAAVEAAPKPKKEKPKKVKCKDKKYPKCYKSDLYCPDNCPRTCVVDCSTCQPVCIPPPPPPPSPPPPPPKPRKRKSPPPPYIYSS
        MARIAIALFLFILLLSAAVEAAPKPKKEKPKKVKCKDKKYPKCYKSDLYCPDNCPRTCVVDCSTCQPVCIPPPPPPPSPPPPPPKPRKRKSPPPPYIYSS
Subjt:  MARIAIALFLFILLLSAAVEAAPKPKKEKPKKVKCKDKKYPKCYKSDLYCPDNCPRTCVVDCSTCQPVCIPPPPPPPSPPPPPPKPRKRKSPPPPYIYSS

Query:  PPPPPPYIYSSPPPPPFIYSSPPPPPPATAEPTPPFPPALTPPPPSLSPPPSSEASGQKRVRCKSRSFPHCYGMELSCPSDCPSQCEVDCVTCSPVCNCD
        PPPPPPYIYSSPPPPPFIYSSPPPPPPATAEPTPPFPPALTPPPPSLSPPPSSEASGQKRVRCKSRSFPHCYGMELSCPSDCPSQCEVDCVTCSPVCNCD
Subjt:  PPPPPPYIYSSPPPPPFIYSSPPPPPPATAEPTPPFPPALTPPPPSLSPPPSSEASGQKRVRCKSRSFPHCYGMELSCPSDCPSQCEVDCVTCSPVCNCD

Query:  RPGAVCQDPKFIGGDGITFYFHGKKDRDFCIVTDSNLHINARFIGRRNPDMNRDFTWVQSLGILFDSHKLFIGARKTAIWDDAVDRLSLSLNDETILLPN
        RPGAVCQDPKFIGGDGITFYFHGKKDRDFCIVTDSNLHINARFIGRRNPDMNRDFTWVQSLGILFDSHKLFIGARKTAIWDDAVDRLSLSLNDETILLPN
Subjt:  RPGAVCQDPKFIGGDGITFYFHGKKDRDFCIVTDSNLHINARFIGRRNPDMNRDFTWVQSLGILFDSHKLFIGARKTAIWDDAVDRLSLSLNDETILLPN

Query:  KDASTWNSTGIAITRTRNTNAVEIEVPGNFKIKAVVVPITEKESRIHKYGITQEDCFAHLDLSFKFYALSGDVNGVLGQTYATNYVSRAKMGVAMPVLGG
        KDASTWNSTGIAITRTRNTNAVEIEVPGNFKIKAVVVPITEKESRIHKYGITQEDCFAHLDLSFKFYALSGDVNGVLGQTYATNYVSRAKMGVAMPVLGG
Subjt:  KDASTWNSTGIAITRTRNTNAVEIEVPGNFKIKAVVVPITEKESRIHKYGITQEDCFAHLDLSFKFYALSGDVNGVLGQTYATNYVSRAKMGVAMPVLGG

Query:  DKEFASSSLFATDCAVARFSGEETSLEAVAYANMNCGSDYLGAQGVVCKR
        DKEFASSSLFATDCAVA+FSGEETSLEAVAYANMNCGSDYLGAQGVVCKR
Subjt:  DKEFASSSLFATDCAVARFSGEETSLEAVAYANMNCGSDYLGAQGVVCKR

A0A6J1FCE2 uncharacterized protein LOC111444116 | 1.2e-194 | 77.82
Query:  MARIAIALFLFILLLSAAVEAAPKPKKEKPKKVKCKDKKYPKCYKSDLYCPDNCPRTCVVDCSTCQPVCIPPPPPPPSPPPPPPKPRKRKS-PPPPYIYS
        M +I   LFLF L LSAAVEA P     KPKKVKCKDK +P+CYKS+ YCP +C RTCVVDCS+C+PVC PPPPPPPSPPPPPPKPRK KS PPPPY+YS
Subjt:  MARIAIALFLFILLLSAAVEAAPKPKKEKPKKVKCKDKKYPKCYKSDLYCPDNCPRTCVVDCSTCQPVCIPPPPPPPSPPPPPPKPRKRKS-PPPPYIYS

Query:  SPPPPPPYIYS-----------SPPPPPFIYSSPP--------PPPPATAEPTPPFPPALTPPPPSLSPPPSSEASGQKRVRCKSRSFPHCYGMELSCPS
        SPPPPPPYIYS           SPPPPP+IYSSPP        PPPPAT EP PP PP  T PP SLSPPPSSEASGQK+VRCK+RSFPHCYGMEL+CP+
Subjt:  SPPPPPPYIYS-----------SPPPPPFIYSSPP--------PPPPATAEPTPPFPPALTPPPPSLSPPPSSEASGQKRVRCKSRSFPHCYGMELSCPS

Query:  DCPSQCEVDCVTCSPVCNCDRPGAVCQDPKFIGGDGITFYFHGKKDRDFCIVTDSNLHINARFIGRRNPDMNRDFTWVQSLGILFDSHKLFIGARKTAIW
        DCP QCEVDCVTCS VCNC+RPGAVCQDP+FIGGDGITFYFHGKKD+DFCIVTDSNLHINA FIGRRN DM RDFTWVQSLGILFDSH+LFIGARKT+ W
Subjt:  DCPSQCEVDCVTCSPVCNCDRPGAVCQDPKFIGGDGITFYFHGKKDRDFCIVTDSNLHINARFIGRRNPDMNRDFTWVQSLGILFDSHKLFIGARKTAIW

Query:  DDAVDRLSLSLNDETILLPNKDASTW-NST---GIAITRTRNTNAVEIEVPGNFKIKAVVVPITEKESRIHKYGITQEDCFAHLDLSFKFYALSGDVNGV
        DDA DRLSLS N++TI+L N++ +TW NST   GI ITRTRNTNAVEI VPGNFKIKAVVVPITEKESRIHKYGITQEDCFAHLDLSFKFYALSG VNGV
Subjt:  DDAVDRLSLSLNDETILLPNKDASTW-NST---GIAITRTRNTNAVEIEVPGNFKIKAVVVPITEKESRIHKYGITQEDCFAHLDLSFKFYALSGDVNGV

Query:  LGQTYATNYVSRAKMGVAMPVLGGDKEFASSSLFATDCAVARFSGE----ETSLEAVAYANMNCGSDYLGAQGVVCKR
        LGQTY +NYVSRAKMGVAMPVLGGDKEFASS  FATDCAVARF+G+    ++SLE  AY NM+CGSD  G +GVVCKR
Subjt:  LGQTYATNYVSRAKMGVAMPVLGGDKEFASSSLFATDCAVARFSGE----ETSLEAVAYANMNCGSDYLGAQGVVCKR

A0A6J1I078 uncharacterized protein LOC111468530 | 1.6e-194 | 77.92
Query:  MARIAIALFLFILLLSAAVEAAPKPKKEKPKKVKCKDKKYPKCYKSDLYCPDNCPRTCVVDCSTCQPVCIPPPPPPPSPPPPPPKPRKRKS-PPPPYIYS
        M +I   LFLF L LSAAVEA P     KPKKVKCKDKK+P+CYKS+ YCP +C RTCVVDCS+C+PVC PPPPPPPSPPPPPPKPRK KS PPPPY+YS
Subjt:  MARIAIALFLFILLLSAAVEAAPKPKKEKPKKVKCKDKKYPKCYKSDLYCPDNCPRTCVVDCSTCQPVCIPPPPPPPSPPPPPPKPRKRKS-PPPPYIYS

Query:  SPPPPPPYI-----------YSSPPPPPFIYSSP---------PPPPPATAEPTPPFPPALTP-PPPSLSPPPSSEASGQKRVRCKSRSFPHCYGMELSC
        SPPPPPPYI           YSSPPPPP+IYSSP         PPPPPAT EP PP PP  TP PP S SPPPSSEASGQK+VRCK+RSFPHCYGMEL+C
Subjt:  SPPPPPPYI-----------YSSPPPPPFIYSSP---------PPPPPATAEPTPPFPPALTP-PPPSLSPPPSSEASGQKRVRCKSRSFPHCYGMELSC

Query:  PSDCPSQCEVDCVTCSPVCNCDRPGAVCQDPKFIGGDGITFYFHGKKDRDFCIVTDSNLHINARFIGRRNPDMNRDFTWVQSLGILFDSHKLFIGARKTA
        P+DCP QCEVDCVTCS VCNC+RPGAVCQDP+FIGGDGITFYFHGKKDRDFCIVTDSNLHINA FIGRRN DM RDFTWVQSLGILFDSH+LFIGARKT+
Subjt:  PSDCPSQCEVDCVTCSPVCNCDRPGAVCQDPKFIGGDGITFYFHGKKDRDFCIVTDSNLHINARFIGRRNPDMNRDFTWVQSLGILFDSHKLFIGARKTA

Query:  IWDDAVDRLSLSLNDETILLPNKDASTW-NST---GIAITRTRNTNAVEIEVPGNFKIKAVVVPITEKESRIHKYGITQEDCFAHLDLSFKFYALSGDVN
         WDDA DRLSLS N++TI+L N++ +TW NST   GI ITRTRNTNAVEI VPGNFKIKAVVVPITEKESRIHKYGITQEDCFAHLDLSFKFYALSG+VN
Subjt:  IWDDAVDRLSLSLNDETILLPNKDASTW-NST---GIAITRTRNTNAVEIEVPGNFKIKAVVVPITEKESRIHKYGITQEDCFAHLDLSFKFYALSGDVN

Query:  GVLGQTYATNYVSRAKMGVAMPVLGGDKEFASSSLFATDCAVARFSGE----ETSLEAVAYANMNCGSDYLGAQGVVCKR
        GVLGQTY +NYVSRAKMGVAMPVLGGDKEFASS  FATDCAVARF+G+    ++SLE  AY NM+CGSD  G +GVVCKR
Subjt:  GVLGQTYATNYVSRAKMGVAMPVLGGDKEFASSSLFATDCAVARFSGE----ETSLEAVAYANMNCGSDYLGAQGVVCKR

SwissProt top hits | e value | % identity
O65375 Leucine-rich repeat extensin-like protein 1 | 8.9e-09 | 63.53
Query:  PPPPPPSPPPPPPKPRKRKSPPPPYIYSSPPPPPPYIYSSPPPPPFIYSSPPPPPPATAEPTPPFPPALTPPPPSLSPPPSSEAS
        PPPPPPSP PPP  P    SPPPPY+YSS PPPPPY+YSSPPPPP++YSS PPPP   + P PP+  +  PPPP   PPP  E+S
Subjt:  PPPPPPSPPPPPPKPRKRKSPPPPYIYSSPPPPPPYIYSSPPPPPFIYSSPPPPPPATAEPTPPFPPALTPPPPSLSPPPSSEAS

Q9LUI1 Leucine-rich repeat extensin-like protein 6 | 1.0e-04 | 56.67
Query:  CSTCQPVCIPPPPPPPSPPPPPPKPRKRKSPPPPYIYSSPPPPPPYIYSSPPPPPFIYSSPPPPPPATAEPTPPFPPALTPPPPSLSPPP
        CS   P   PPPPPPP PPPPPP P     PPPPY+Y SPPPPP      P PPP++Y   PPPPP    P PP PP + PPPP  SP P
Subjt:  CSTCQPVCIPPPPPPPSPPPPPPKPRKRKSPPPPYIYSSPPPPPPYIYSSPPPPPFIYSSPPPPPPATAEPTPPFPPALTPPPPSLSPPP

Q9T0K5 Leucine-rich repeat extensin-like protein 3 | 1.3e-04 | 56.84
Query:  PVCIPPPPPPPSPPPPP--PKPRKRKSPPPPYIYS----SPPPPPPYIYSSPPPPPFIYSSPPPPPPATAEPT----PPFPPALTPPPPSLSPPP
        PV  PPPPPPP PPPPP    P     PPPP +YS     PPPPPP +YS PPPP  +YSSPPPPP     P     PP PP  +PPPP  SPPP
Subjt:  PVCIPPPPPPPSPPPPP--PKPRKRKSPPPPYIYS----SPPPPPPYIYSSPPPPPFIYSSPPPPPPATAEPT----PPFPPALTPPPPSLSPPP

Arabidopsis top hits | e value | % identity
AT3G19430.1 late embryogenesis abundant protein-related / LEA protein-related | 2.0e-104 | 44.75
Query:  EKPKKVKCKDKKYPKCYKSDLYCPDNCPRTCVVDCSTCQPVCIPP----------------------------PPPP----PSP-----PPPP------P
        + P    CK KKY  CY  +  CP  CP +C V+C++C+P+C PP                            PPPP    PSP     PPPP      P
Subjt:  EKPKKVKCKDKKYPKCYKSDLYCPDNCPRTCVVDCSTCQPVCIPP----------------------------PPPP----PSP-----PPPP------P

Query:  KPRKRKSPPPPY-----------IYSSPPPPPPYIYS-----SPPPP---PFIYS-----------SPPP---PPPATAEPTPPFPPALTPPPPSLS---
         P    SPPPP            +   PP P P + S     SPPPP   P + S           SPPP   PPP T  P+ P PP +TP PP+ S   
Subjt:  KPRKRKSPPPPY-----------IYSSPPPPPPYIYS-----SPPPP---PFIYS-----------SPPP---PPPATAEPTPPFPPALTPPPPSLS---

Query:  ---------------------------------------PPPS--SEASGQKRVRCKSRSFPHCYGMELSCPSDCPSQCEVDCVTCSPVCNCDRPGAVCQ
                                               PPPS   EA+G KRVRCK +  P CYG+E +CP+DCP  C+VDCVTC PVCNCD+PG+VCQ
Subjt:  ---------------------------------------PPPS--SEASGQKRVRCKSRSFPHCYGMELSCPSDCPSQCEVDCVTCSPVCNCDRPGAVCQ

Query:  DPKFIGGDGITFYFHGKKDRDFCIVTDSNLHINARFIGRRNPDMNRDFTWVQSLGILFDSHKLFIGARKTAIWDDAVDRLSLSLNDETILLPNKDASTWN
        DP+FIGGDG+TFYFHGKKD +FC+++D NLHINA FIG+R   M RDFTWVQS+ ILF +H+L++GA KTA WDD+VDR+++S +   I LP  D + W 
Subjt:  DPKFIGGDGITFYFHGKKDRDFCIVTDSNLHINARFIGRRNPDMNRDFTWVQSLGILFDSHKLFIGARKTAIWDDAVDRLSLSLNDETILLPNKDASTWN

Query:  ST-----GIAITRTR-NTNAVEIEVPGNFKIKAVVVPITEKESRIHKYGITQEDCFAHLDLSFKFYALSGDVNGVLGQTYATNYVSRAKMGVAMPVLGGD
        S+      +++ R   +TN +E+EV G  KI A VVPIT ++SRIH Y + ++DC AHLDL FKF  LS +V+GVLGQTY +NYVSR K+GV MPV+GGD
Subjt:  ST-----GIAITRTR-NTNAVEIEVPGNFKIKAVVVPITEKESRIHKYGITQEDCFAHLDLSFKFYALSGDVNGVLGQTYATNYVSRAKMGVAMPVLGGD

Query:  KEFASSSLFATDCAVARFSGEETS---LEAVAYANMNCGSDYLGAQGVVCKR
        +EF ++ LFA DC+ ARF+G   S      +    M+C S  LG +GVVCKR
Subjt:  KEFASSSLFATDCAVARFSGEETS---LEAVAYANMNCGSDYLGAQGVVCKR

AT4G27400.1 Late embryogenesis abundant (LEA) protein-related | 1.9e-62 | 38.59
Query:  CKSRSFPHCYGMELSCPSDCPSQ---------CEVDCV--TCSPVC-----NCDRPGAVCQDPKFIGGDGITFYFHGKKDRDFCIVTDSNLHINARFIGR
        C   + P C    + CP +CP++         C VDC    C  VC     NC+  G++C DP+FIGGDGI FYFHGK +  F IV+D +  INARF G 
Subjt:  CKSRSFPHCYGMELSCPSDCPSQ---------CEVDCV--TCSPVC-----NCDRPGAVCQDPKFIGGDGITFYFHGKKDRDFCIVTDSNLHINARFIGR

Query:  RNPDMNRDFTWVQSLGILFDSHKLFIGARKTAIWDDAVDRLSLSLNDETILLPNKDASTWNST--GIAITRTRNTNAVEIEVPGNFKIKAVVVPITEKES
        R     RDFTW+Q+LG LF+SHK  +   K A WD  +D L  +++ + +++P +  STW S+   I I R    N+V + +    +I   VVP+T+++ 
Subjt:  RNPDMNRDFTWVQSLGILFDSHKLFIGARKTAIWDDAVDRLSLSLNDETILLPNKDASTWNST--GIAITRTRNTNAVEIEVPGNFKIKAVVVPITEKES

Query:  RIHKYGITQEDCFAHLDLSFKFYALSGDVNGVLGQTYATNYVSRAKMGVAMPVLGGDKEFASSSLFATDCAVARFSGEETSLEA-----VAYANMNCGSD
        RIH Y +  +DCFAH ++ FKF  LS  V+G+LG+TY  ++ + AK GV MPV+GG+  F +SSL +  C    FS +             YA ++C   
Subjt:  RIHKYGITQEDCFAHLDLSFKFYALSGDVNGVLGQTYATNYVSRAKMGVAMPVLGGDKEFASSSLFATDCAVARFSGEETSLEA-----VAYANMNCGSD

Query:  YLGAQGVVCKR
             G+VC++
Subjt:  YLGAQGVVCKR

AT5G54370.1 Late embryogenesis abundant (LEA) protein-related | 2.7e-69 | 41.56
Query:  SRSFPHCYGMELSCPSDCPSQ---------CEVDC--VTCSPVC-----NCDRPGAVCQDPKFIGGDGITFYFHGKKDRDFCIVTDSNLHINARFIGRRN
        S  +  CY   + CP +CPS+         C  DC   TC   C     NC+RPG+ C DP+FIGGDGI FYFHGK + +F +V+DS+L IN RFIG R 
Subjt:  SRSFPHCYGMELSCPSDCPSQ---------CEVDC--VTCSPVC-----NCDRPGAVCQDPKFIGGDGITFYFHGKKDRDFCIVTDSNLHINARFIGRRN

Query:  PDMNRDFTWVQSLGILFDSHKLFIGARKTAIWDDAVDRLSLSLNDETILLPNKDASTWNSTG--IAITRTRNTNAVEIEVPGNFKIKAVVVPITEKESRI
            RDFTW+Q+LG LF+S+K  + A KTA WD+ +D L  S + + + +P +  STW S    I I R    N+V + +    +I   VVP+T+++ RI
Subjt:  PDMNRDFTWVQSLGILFDSHKLFIGARKTAIWDDAVDRLSLSLNDETILLPNKDASTWNSTG--IAITRTRNTNAVEIEVPGNFKIKAVVVPITEKESRI

Query:  HKYGITQEDCFAHLDLSFKFYALSGDVNGVLGQTYATNYVSRAKMGVAMPVLGGDKEFASSSLFATDCAVARFSGEETSLEAV----AYANMNCGSDYLG
        H Y +  +DCFAHL++ F+F+ LS  V+G+LG+TY  ++ + AK GVAMPV+GG+  F +SSL + DC    FS  +  +++V     YA ++C      
Subjt:  HKYGITQEDCFAHLDLSFKFYALSGDVNGVLGQTYATNYVSRAKMGVAMPVLGGDKEFASSSLFATDCAVARFSGEETSLEAV----AYANMNCGSDYLG

Query:  AQGVVCKR
          G+VC++
Subjt:  AQGVVCKR

AT5G60520.1 Late embryogenesis abundant (LEA) protein-related | 6.9e-65 | 43.55
Query:  SGQKRVRCKSRSFPHCYGMELSCPSDCPSQ----------CEVDCVT-CSPVC-----NCDRPGAVCQDPKFIGGDGITFYFHGKKDRDFCIVTDSNLHI
        SGQ+RV+C +R    C    L+CP +CP +          C +DC + C   C     NC+  G++C DP+F+GGDG+ FYFHG KD +F IV+D NL I
Subjt:  SGQKRVRCKSRSFPHCYGMELSCPSDCPSQ----------CEVDCVT-CSPVC-----NCDRPGAVCQDPKFIGGDGITFYFHGKKDRDFCIVTDSNLHI

Query:  NARFIGRRNPDMNRDFTWVQSLGILFDSHKLFIGARKTAIWDDAVDRLSLSLNDETILLPNKDASTW----NSTGIAITRTRNTNAVEIEVPGNFKIKAV
        NA FIG R     RDFTWVQ+  ++FDSH L I A+K A WDD+VD L +  N E + +P +  + W    +   + + RT   N V + V G  +I   
Subjt:  NARFIGRRNPDMNRDFTWVQSLGILFDSHKLFIGARKTAIWDDAVDRLSLSLNDETILLPNKDASTW----NSTGIAITRTRNTNAVEIEVPGNFKIKAV

Query:  VVPITEKESRIHKYGITQEDCFAHLDLSFKFYALSGDVNGVLGQTYATNYVSRAKMGVAMPVLGGDKEFASSSLFATDCAVARFSGE
        V PI ++E R+HKY + ++D FAHL+  FKF+ LS  V GVLG+TY   YVS  K GV MP++GG+ ++ + SLF+  C V RF G+
Subjt:  VVPITEKESRIHKYGITQEDCFAHLDLSFKFYALSGDVNGVLGQTYATNYVSRAKMGVAMPVLGGDKEFASSSLFATDCAVARFSGE

AT5G60530.1 late embryogenesis abundant protein-related / LEA protein-related | 1.9e-62 | 42.24
Query:  LSPPPSSEASGQKRVRCKSRSFPHCYGMELSCPSDCPSQ----------CEVDCVT-CSPVC-----NCDRPGAVCQDPKFIGGDGITFYFHGKKDRDFC
        LSP P    +GQ++  C+ R    CY   L CP +CP +          C +DC   C   C     NC+  G++C DP+F+GGDG+ FYFHG K  +F 
Subjt:  LSPPPSSEASGQKRVRCKSRSFPHCYGMELSCPSDCPSQ----------CEVDCVT-CSPVC-----NCDRPGAVCQDPKFIGGDGITFYFHGKKDRDFC

Query:  IVTDSNLHINARFIGRRNPDMNRDFTWVQSLGILFDSHKLFIGARKTAIWDDAVDRLSLSLNDETILLPNKDASTWNSTG-----IAITRTRNTNAVEIE
        IV+D+NL INA FIG R     RDFTWVQ+L ++F++HKL I A +   WD+  D  ++  + E I LP  + S W         I I RT   N+V + 
Subjt:  IVTDSNLHINARFIGRRNPDMNRDFTWVQSLGILFDSHKLFIGARKTAIWDDAVDRLSLSLNDETILLPNKDASTWNSTG-----IAITRTRNTNAVEIE

Query:  VPGNFKIKAVVVPITEKESRIHKYGITQEDCFAHLDLSFKFYALSGDVNGVLGQTYATNYVSRAKMGVAMPVLGGDKEFASSSLFATDCAVARFSGEETS
        V    ++   V PI ++E+R+H Y + Q+D FAHL+  FKF  LS  V GVLG+TY  +YVS AK GV MPVLGG+ ++ + SLF+  C + RF  +E S
Subjt:  VPGNFKIKAVVVPITEKESRIHKYGITQEDCFAHLDLSFKFYALSGDVNGVLGQTYATNYVSRAKMGVAMPVLGGDKEFASSSLFATDCAVARFSGEETS

Query:  LEA
        L A
Subjt:  LEA


Sequences
CDS sequence
ATGGCACGAATCGCCATAGCACTCTTCCTCTTCATTCTCCTCCTTTCGGCTGCCGTAGAGGCAGCTCCTAAGCCAAAGAAAGAAAAGCCCAAGAAAGTTAAATGCAAAGA
CAAGAAGTATCCAAAATGTTACAAATCTGACCTCTATTGTCCAGACAATTGCCCTCGAACTTGTGTCGTTGATTGTTCAACTTGTCAACCCGTGTGTATTCCGCCTCCGC
CTCCACCGCCATCCCCTCCTCCACCGCCACCGAAACCCCGCAAGCGCAAGTCTCCACCACCCCCATATATTTACTCTTCACCTCCACCCCCGCCTCCCTACATTTACTCT
TCTCCCCCGCCTCCTCCATTCATATACTCTTCGCCACCTCCTCCTCCACCAGCTACGGCTGAGCCTACACCACCGTTCCCTCCGGCTCTAACTCCCCCACCGCCGTCTCT
TTCTCCACCACCGTCGTCTGAAGCGTCGGGGCAGAAGAGAGTTAGGTGCAAGAGTAGGAGCTTCCCACATTGCTATGGGATGGAGCTAAGTTGTCCAAGTGATTGCCCTA
GCCAATGTGAAGTTGACTGTGTTACTTGCAGCCCCGTTTGCAATTGCGACCGTCCGGGAGCAGTGTGCCAAGACCCAAAATTCATTGGAGGGGATGGAATCACCTTCTAC
TTCCATGGCAAAAAAGACCGAGACTTCTGCATCGTCACCGACTCGAACCTCCACATCAACGCCCGCTTCATCGGCCGACGAAACCCCGACATGAACCGGGACTTCACTTG
GGTCCAATCCCTCGGCATCCTCTTCGACTCCCACAAACTCTTCATCGGCGCCCGCAAAACCGCAATATGGGACGACGCGGTCGACCGCCTGTCCCTCTCCCTCAACGACG
AAACCATCCTCCTCCCCAACAAGGATGCCTCAACATGGAACTCAACAGGGATCGCCATAACCAGGACTCGGAACACGAACGCTGTCGAGATCGAAGTCCCCGGGAATTTC
AAGATCAAGGCGGTCGTGGTCCCGATAACCGAAAAGGAATCGAGGATCCACAAGTATGGGATCACACAAGAGGATTGCTTTGCCCATTTGGATTTGAGCTTCAAGTTCTA
TGCTTTGAGTGGGGATGTGAATGGGGTTTTGGGGCAGACTTATGCTACAAACTATGTGAGTAGGGCAAAGATGGGAGTGGCAATGCCTGTTTTGGGTGGCGATAAGGAAT
TTGCTTCTTCAAGCCTTTTTGCTACCGATTGTGCTGTGGCACGATTTAGTGGAGAAGAAACTTCTTTGGAGGCTGTGGCCTATGCCAATATGAATTGTGGGAGCGATTAT
TTGGGAGCTCAAGGAGTTGTTTGCAAACGA
mRNA sequence
ATGGCACGAATCGCCATAGCACTCTTCCTCTTCATTCTCCTCCTTTCGGCTGCCGTAGAGGCAGCTCCTAAGCCAAAGAAAGAAAAGCCCAAGAAAGTTAAATGCAAAGA
CAAGAAGTATCCAAAATGTTACAAATCTGACCTCTATTGTCCAGACAATTGCCCTCGAACTTGTGTCGTTGATTGTTCAACTTGTCAACCCGTGTGTATTCCGCCTCCGC
CTCCACCGCCATCCCCTCCTCCACCGCCACCGAAACCCCGCAAGCGCAAGTCTCCACCACCCCCATATATTTACTCTTCACCTCCACCCCCGCCTCCCTACATTTACTCT
TCTCCCCCGCCTCCTCCATTCATATACTCTTCGCCACCTCCTCCTCCACCAGCTACGGCTGAGCCTACACCACCGTTCCCTCCGGCTCTAACTCCCCCACCGCCGTCTCT
TTCTCCACCACCGTCGTCTGAAGCGTCGGGGCAGAAGAGAGTTAGGTGCAAGAGTAGGAGCTTCCCACATTGCTATGGGATGGAGCTAAGTTGTCCAAGTGATTGCCCTA
GCCAATGTGAAGTTGACTGTGTTACTTGCAGCCCCGTTTGCAATTGCGACCGTCCGGGAGCAGTGTGCCAAGACCCAAAATTCATTGGAGGGGATGGAATCACCTTCTAC
TTCCATGGCAAAAAAGACCGAGACTTCTGCATCGTCACCGACTCGAACCTCCACATCAACGCCCGCTTCATCGGCCGACGAAACCCCGACATGAACCGGGACTTCACTTG
GGTCCAATCCCTCGGCATCCTCTTCGACTCCCACAAACTCTTCATCGGCGCCCGCAAAACCGCAATATGGGACGACGCGGTCGACCGCCTGTCCCTCTCCCTCAACGACG
AAACCATCCTCCTCCCCAACAAGGATGCCTCAACATGGAACTCAACAGGGATCGCCATAACCAGGACTCGGAACACGAACGCTGTCGAGATCGAAGTCCCCGGGAATTTC
AAGATCAAGGCGGTCGTGGTCCCGATAACCGAAAAGGAATCGAGGATCCACAAGTATGGGATCACACAAGAGGATTGCTTTGCCCATTTGGATTTGAGCTTCAAGTTCTA
TGCTTTGAGTGGGGATGTGAATGGGGTTTTGGGGCAGACTTATGCTACAAACTATGTGAGTAGGGCAAAGATGGGAGTGGCAATGCCTGTTTTGGGTGGCGATAAGGAAT
TTGCTTCTTCAAGCCTTTTTGCTACCGATTGTGCTGTGGCACGATTTAGTGGAGAAGAAACTTCTTTGGAGGCTGTGGCCTATGCCAATATGAATTGTGGGAGCGATTAT
TTGGGAGCTCAAGGAGTTGTTTGCAAACGA
Protein sequence
MARIAIALFLFILLLSAAVEAAPKPKKEKPKKVKCKDKKYPKCYKSDLYCPDNCPRTCVVDCSTCQPVCIPPPPPPPSPPPPPPKPRKRKSPPPPYIYSSPPPPPPYIYS
SPPPPPFIYSSPPPPPPATAEPTPPFPPALTPPPPSLSPPPSSEASGQKRVRCKSRSFPHCYGMELSCPSDCPSQCEVDCVTCSPVCNCDRPGAVCQDPKFIGGDGITFY
FHGKKDRDFCIVTDSNLHINARFIGRRNPDMNRDFTWVQSLGILFDSHKLFIGARKTAIWDDAVDRLSLSLNDETILLPNKDASTWNSTGIAITRTRNTNAVEIEVPGNF
KIKAVVVPITEKESRIHKYGITQEDCFAHLDLSFKFYALSGDVNGVLGQTYATNYVSRAKMGVAMPVLGGDKEFASSSLFATDCAVARFSGEETSLEAVAYANMNCGSDY
LGAQGVVCKR
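
The CDS and protein sequences listed above can be cross-checked by translating the nucleotides with the standard genetic code. The following is a minimal sketch, not the database's own tooling: the names `CODON_TABLE` and `translate` are hypothetical, the standard (nuclear) code is assumed, and only the first and last few codons of the CDS are used as input here for brevity.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order
# (third base varying fastest). '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a CDS string in frame 1; whitespace and line breaks are ignored."""
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

# First seven and last four codons of the CDS shown above.
print(translate("ATGGCACGAATCGCCATAGCA"))
print(translate("GTTTGCAAACGA"))
```

Pasting the full CDS into `translate` and comparing the result with the protein sequence (after joining its lines) gives a quick consistency check for the record.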