CuGenDBv2

Moc01g22400 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc01g22400
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: GATA zinc finger domain-containing protein 10
Genome location: chr1:15590807..15593216
RNA-Seq Expression: Moc01g22400
Synteny: Moc01g22400
Gene Ontology terms:
GO:0022904 - respiratory electron transport chain (biological process)
GO:0016021 - integral component of membrane (cellular component)
InterPro domains:
IPR016174 - Di-haem cytochrome, transmembrane
IPR025067 - Protein of unknown function DUF4079


Homology
GenBank top hits (e value, % identity, alignment)
KAG6604398.1 hypothetical protein SDJN03_05007, partial [Cucurbita argyrosperma subsp. sororia] (e value: 3.3e-114, % identity: 82.69)
Query:  MATLTG-------GASYMCLSKVPPPLPLPSTSSFPISLPNLPKFPRNSCFSLPSPVTGTTKARLNPS-SQDDDFGESVEMKEASETLLYSLAPFPLLFV
        MATLTG        +SYMCL+KV PP    S++SFPI LPNL K PR+S FSL SPVTG  K R +PS ++D +FG+ VEM+E  ET LYSLAPFPLLFV
Subjt:  MATLTG-------GASYMCLSKVPPPLPLPSTSSFPISLPNLPKFPRNSCFSLPSPVTGTTKARLNPS-SQDDDFGESVEMKEASETLLYSLAPFPLLFV

Query:  AALPGAGTVRSLFGPFVELVKSWNLPDWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDDVEEKAKAKDLHPKLLGGMFFFFALGATGGITSLLTSDKP
        AALPGAGTVRSLFGPFVELVKSWNLP+WLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYS+DVEEKA AKDLHPKLLGGMFFFFALGATGGITSLLTSDKP
Subjt:  AALPGAGTVRSLFGPFVELVKSWNLPDWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDDVEEKAKAKDLHPKLLGGMFFFFALGATGGITSLLTSDKP

Query:  IFESPHAVTGFIGLALLTVQTLLPSLFEDNPGLRNVHGILGSGIMTLFLIHAALGLQLGL
        IFESPHAVTGFIGLALLT+Q+LLPSLFEDNPGLRN+HGILGSGIMTLFLIHAALGLQLGL
Subjt:  IFESPHAVTGFIGLALLTVQTLLPSLFEDNPGLRNVHGILGSGIMTLFLIHAALGLQLGL

XP_022143716.1 uncharacterized protein LOC111013557 [Momordica charantia] (e value: 2.8e-142, % identity: 100)
Query:  MATLTGGASYMCLSKVPPPLPLPSTSSFPISLPNLPKFPRNSCFSLPSPVTGTTKARLNPSSQDDDFGESVEMKEASETLLYSLAPFPLLFVAALPGAGT
        MATLTGGASYMCLSKVPPPLPLPSTSSFPISLPNLPKFPRNSCFSLPSPVTGTTKARLNPSSQDDDFGESVEMKEASETLLYSLAPFPLLFVAALPGAGT
Subjt:  MATLTGGASYMCLSKVPPPLPLPSTSSFPISLPNLPKFPRNSCFSLPSPVTGTTKARLNPSSQDDDFGESVEMKEASETLLYSLAPFPLLFVAALPGAGT

Query:  VRSLFGPFVELVKSWNLPDWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDDVEEKAKAKDLHPKLLGGMFFFFALGATGGITSLLTSDKPIFESPHAV
        VRSLFGPFVELVKSWNLPDWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDDVEEKAKAKDLHPKLLGGMFFFFALGATGGITSLLTSDKPIFESPHAV
Subjt:  VRSLFGPFVELVKSWNLPDWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDDVEEKAKAKDLHPKLLGGMFFFFALGATGGITSLLTSDKPIFESPHAV

Query:  TGFIGLALLTVQTLLPSLFEDNPGLRNVHGILGSGIMTLFLIHAALGLQLGLGY
        TGFIGLALLTVQTLLPSLFEDNPGLRNVHGILGSGIMTLFLIHAALGLQLGLGY
Subjt:  TGFIGLALLTVQTLLPSLFEDNPGLRNVHGILGSGIMTLFLIHAALGLQLGLGY

XP_022925916.1 uncharacterized protein LOC111433189 [Cucurbita moschata] (e value: 1.1e-114, % identity: 83.98)
Query:  MATLTG---GASYMCLSKVPPPLPLPSTSSFPISLPNLPKFPRNSCFSLPSPVTGTTKARLNPS-SQDDDFGESVEMKEASETLLYSLAPFPLLFVAALP
        MATLTG    +SYMCL+KV PP    S++SFPI LPNL K PR+S FSL SPVTG  K R +PS ++D +FG+ VEM+E  ET LYSLAPFPLLFVAALP
Subjt:  MATLTG---GASYMCLSKVPPPLPLPSTSSFPISLPNLPKFPRNSCFSLPSPVTGTTKARLNPS-SQDDDFGESVEMKEASETLLYSLAPFPLLFVAALP

Query:  GAGTVRSLFGPFVELVKSWNLPDWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDDVEEKAKAKDLHPKLLGGMFFFFALGATGGITSLLTSDKPIFES
        GAGTVRSLFGPFVELVKSWNLP+WLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYS+DVEEKA AKDLHPKLLGGMFFFFALGATGGITSLLTSDKPIFES
Subjt:  GAGTVRSLFGPFVELVKSWNLPDWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDDVEEKAKAKDLHPKLLGGMFFFFALGATGGITSLLTSDKPIFES

Query:  PHAVTGFIGLALLTVQTLLPSLFEDNPGLRNVHGILGSGIMTLFLIHAALGLQLGL
        PHAVTGFIGLALLT+Q+LLPSLFEDNPGLRN+HGILGSGIMTLFLIHAALGLQLGL
Subjt:  PHAVTGFIGLALLTVQTLLPSLFEDNPGLRNVHGILGSGIMTLFLIHAALGLQLGL

XP_022977650.1 uncharacterized protein LOC111477900 [Cucurbita maxima] (e value: 2.1e-113, % identity: 83.59)
Query:  MATLTG---GASYMCLSKVPPPLPLPSTSSFPISLPNLPKFPRNSCFSLPSPVTGTTKARLNPS-SQDDDFGESVEMKEASETLLYSLAPFPLLFVAALP
        MATLTG    +SYMCL+KV PP    S++SFPI LP L K PR+S FSL SPVTG  K R NPS ++D +FG+ VE +E  ET LYSLAPFPLLFVAALP
Subjt:  MATLTG---GASYMCLSKVPPPLPLPSTSSFPISLPNLPKFPRNSCFSLPSPVTGTTKARLNPS-SQDDDFGESVEMKEASETLLYSLAPFPLLFVAALP

Query:  GAGTVRSLFGPFVELVKSWNLPDWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDDVEEKAKAKDLHPKLLGGMFFFFALGATGGITSLLTSDKPIFES
        GAGTVRSLFGPFVELVKSWNLP+WLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYS+DVEEKA AKDLHPKLLGGMFFFFALGATGGITSLLTSDKPIFES
Subjt:  GAGTVRSLFGPFVELVKSWNLPDWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDDVEEKAKAKDLHPKLLGGMFFFFALGATGGITSLLTSDKPIFES

Query:  PHAVTGFIGLALLTVQTLLPSLFEDNPGLRNVHGILGSGIMTLFLIHAALGLQLGL
        PHAVTGFIGLALLT+Q+LLPSLFEDNPGLRN+HGILGSGIMTLFLIHAALGLQLGL
Subjt:  PHAVTGFIGLALLTVQTLLPSLFEDNPGLRNVHGILGSGIMTLFLIHAALGLQLGL

XP_038880983.1 uncharacterized protein LOC120072635 [Benincasa hispida] (e value: 7.7e-116, % identity: 84.56)
Query:  MATLT----GGASYMCLSKVPPPLPLPSTSSFPISLPNLPKFPRNSCFSLPSPVTGTTKARLNPS-SQDDDFGESVEMKEASETLLYSLAPFPLLFVAAL
        MATLT      +SYMCL+KV  P PLPST SFPIS P  PK PR+S FSL SPVT  T  R NPS ++DD+FG+ VEMKE SET LYSL+PFPLLF+AAL
Subjt:  MATLT----GGASYMCLSKVPPPLPLPSTSSFPISLPNLPKFPRNSCFSLPSPVTGTTKARLNPS-SQDDDFGESVEMKEASETLLYSLAPFPLLFVAAL

Query:  PGAGTVRSLFGPFVELVKSWNLPDWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDDVEEKAKAKDLHPKLLGGMFFFFALGATGGITSLLTSDKPIFE
        PGAGTVRSLFGPFVELVKSWNLP+WLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSD+VEEKA AKDLHPKLLGGMFFFFALGATGGITSLLTSDKPIFE
Subjt:  PGAGTVRSLFGPFVELVKSWNLPDWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDDVEEKAKAKDLHPKLLGGMFFFFALGATGGITSLLTSDKPIFE

Query:  SPHAVTGFIGLALLTVQTLLPSLFEDNPGLRNVHGILGSGIMTLFLIHAALGLQLGLGY
        SPHAVTGFIGL LLTVQTLLPSLFE+NPGLRNVHGILGSGIMTLFLIHAALGLQLGL Y
Subjt:  SPHAVTGFIGLALLTVQTLLPSLFEDNPGLRNVHGILGSGIMTLFLIHAALGLQLGLGY

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KIF5 Uncharacterized protein (e value: 1.2e-106, % identity: 79.47)
Query:  MATLTGGA---SYMCLSKVPPPLPLPSTSSFPISLPNLPKFPRN----SCFSLPSPVTGTTKARLNPS-SQDDDFGESVEMK-EASETLLYSLAPFPLLF
        MATLT  +   SY+CL+KV P  PLPSTS       NLPK PRN    S FS  SP+   T  R NPS +++D+FG+  E K E SE  LYSL+PFPLLF
Subjt:  MATLTGGA---SYMCLSKVPPPLPLPSTSSFPISLPNLPKFPRN----SCFSLPSPVTGTTKARLNPS-SQDDDFGESVEMK-EASETLLYSLAPFPLLF

Query:  VAALPGAGTVRSLFGPFVELVKSWNLPDWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDDVEEKAKAKDLHPKLLGGMFFFFALGATGGITSLLTSDK
        +AALPGAGTVRSLFGPFVELVKSWNLP+WLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDDVEEKA AKDLHPKLLGGMFFFFALGATGG+TSLLTSDK
Subjt:  VAALPGAGTVRSLFGPFVELVKSWNLPDWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDDVEEKAKAKDLHPKLLGGMFFFFALGATGGITSLLTSDK

Query:  PIFESPHAVTGFIGLALLTVQTLLPSLFEDNPGLRNVHGILGSGIMTLFLIHAALGLQLGLGY
        PI ESPHAVTGFIGL LLTVQTLLPSLFEDNPGLRNVHGILGSGIMTLFLIHAALGLQLGL Y
Subjt:  PIFESPHAVTGFIGLALLTVQTLLPSLFEDNPGLRNVHGILGSGIMTLFLIHAALGLQLGLGY

A0A1S3CHJ9 uncharacterized protein LOC103500827 (e value: 5.8e-109, % identity: 80.23)
Query:  MATLT------GGASYMCLSKVPPPLPLPSTSSFPISLPNLPKFPRN-SCFSLPSPVTGTTKARLNPS-SQDDDFGESVEMK-EASETLLYSLAPFPLLF
        MATLT        +SYMCL+KV PPLP   ++SFPI  P LPK PRN S FS  SP+   T  R NP  +++D FG+ VEMK E SE  LYSL+PFPLLF
Subjt:  MATLT------GGASYMCLSKVPPPLPLPSTSSFPISLPNLPKFPRN-SCFSLPSPVTGTTKARLNPS-SQDDDFGESVEMK-EASETLLYSLAPFPLLF

Query:  VAALPGAGTVRSLFGPFVELVKSWNLPDWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDDVEEKAKAKDLHPKLLGGMFFFFALGATGGITSLLTSDK
        +AALPG GTVRSLFGPFVELVKS NLP+WLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDDVEEKA AKDLHPKLLGGMFFFFALGATGGI SLLTSDK
Subjt:  VAALPGAGTVRSLFGPFVELVKSWNLPDWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDDVEEKAKAKDLHPKLLGGMFFFFALGATGGITSLLTSDK

Query:  PIFESPHAVTGFIGLALLTVQTLLPSLFEDNPGLRNVHGILGSGIMTLFLIHAALGLQLGLGY
        PIFESPHAVTGFIGLALLTVQTLLPSLFEDNPGLRNVHGILGSGIMTLFLIHAA GLQLGL Y
Subjt:  PIFESPHAVTGFIGLALLTVQTLLPSLFEDNPGLRNVHGILGSGIMTLFLIHAALGLQLGLGY

A0A6J1CRM9 uncharacterized protein LOC111013557 (e value: 1.4e-142, % identity: 100)
Query:  MATLTGGASYMCLSKVPPPLPLPSTSSFPISLPNLPKFPRNSCFSLPSPVTGTTKARLNPSSQDDDFGESVEMKEASETLLYSLAPFPLLFVAALPGAGT
        MATLTGGASYMCLSKVPPPLPLPSTSSFPISLPNLPKFPRNSCFSLPSPVTGTTKARLNPSSQDDDFGESVEMKEASETLLYSLAPFPLLFVAALPGAGT
Subjt:  MATLTGGASYMCLSKVPPPLPLPSTSSFPISLPNLPKFPRNSCFSLPSPVTGTTKARLNPSSQDDDFGESVEMKEASETLLYSLAPFPLLFVAALPGAGT

Query:  VRSLFGPFVELVKSWNLPDWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDDVEEKAKAKDLHPKLLGGMFFFFALGATGGITSLLTSDKPIFESPHAV
        VRSLFGPFVELVKSWNLPDWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDDVEEKAKAKDLHPKLLGGMFFFFALGATGGITSLLTSDKPIFESPHAV
Subjt:  VRSLFGPFVELVKSWNLPDWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDDVEEKAKAKDLHPKLLGGMFFFFALGATGGITSLLTSDKPIFESPHAV

Query:  TGFIGLALLTVQTLLPSLFEDNPGLRNVHGILGSGIMTLFLIHAALGLQLGLGY
        TGFIGLALLTVQTLLPSLFEDNPGLRNVHGILGSGIMTLFLIHAALGLQLGLGY
Subjt:  TGFIGLALLTVQTLLPSLFEDNPGLRNVHGILGSGIMTLFLIHAALGLQLGLGY

A0A6J1EDG7 uncharacterized protein LOC111433189 (e value: 5.4e-115, % identity: 83.98)
Query:  MATLTG---GASYMCLSKVPPPLPLPSTSSFPISLPNLPKFPRNSCFSLPSPVTGTTKARLNPS-SQDDDFGESVEMKEASETLLYSLAPFPLLFVAALP
        MATLTG    +SYMCL+KV PP    S++SFPI LPNL K PR+S FSL SPVTG  K R +PS ++D +FG+ VEM+E  ET LYSLAPFPLLFVAALP
Subjt:  MATLTG---GASYMCLSKVPPPLPLPSTSSFPISLPNLPKFPRNSCFSLPSPVTGTTKARLNPS-SQDDDFGESVEMKEASETLLYSLAPFPLLFVAALP

Query:  GAGTVRSLFGPFVELVKSWNLPDWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDDVEEKAKAKDLHPKLLGGMFFFFALGATGGITSLLTSDKPIFES
        GAGTVRSLFGPFVELVKSWNLP+WLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYS+DVEEKA AKDLHPKLLGGMFFFFALGATGGITSLLTSDKPIFES
Subjt:  GAGTVRSLFGPFVELVKSWNLPDWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDDVEEKAKAKDLHPKLLGGMFFFFALGATGGITSLLTSDKPIFES

Query:  PHAVTGFIGLALLTVQTLLPSLFEDNPGLRNVHGILGSGIMTLFLIHAALGLQLGL
        PHAVTGFIGLALLT+Q+LLPSLFEDNPGLRN+HGILGSGIMTLFLIHAALGLQLGL
Subjt:  PHAVTGFIGLALLTVQTLLPSLFEDNPGLRNVHGILGSGIMTLFLIHAALGLQLGL

A0A6J1IMY7 uncharacterized protein LOC111477900 (e value: 1.0e-113, % identity: 83.59)
Query:  MATLTG---GASYMCLSKVPPPLPLPSTSSFPISLPNLPKFPRNSCFSLPSPVTGTTKARLNPS-SQDDDFGESVEMKEASETLLYSLAPFPLLFVAALP
        MATLTG    +SYMCL+KV PP    S++SFPI LP L K PR+S FSL SPVTG  K R NPS ++D +FG+ VE +E  ET LYSLAPFPLLFVAALP
Subjt:  MATLTG---GASYMCLSKVPPPLPLPSTSSFPISLPNLPKFPRNSCFSLPSPVTGTTKARLNPS-SQDDDFGESVEMKEASETLLYSLAPFPLLFVAALP

Query:  GAGTVRSLFGPFVELVKSWNLPDWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDDVEEKAKAKDLHPKLLGGMFFFFALGATGGITSLLTSDKPIFES
        GAGTVRSLFGPFVELVKSWNLP+WLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYS+DVEEKA AKDLHPKLLGGMFFFFALGATGGITSLLTSDKPIFES
Subjt:  GAGTVRSLFGPFVELVKSWNLPDWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDDVEEKAKAKDLHPKLLGGMFFFFALGATGGITSLLTSDKPIFES

Query:  PHAVTGFIGLALLTVQTLLPSLFEDNPGLRNVHGILGSGIMTLFLIHAALGLQLGL
        PHAVTGFIGLALLT+Q+LLPSLFEDNPGLRN+HGILGSGIMTLFLIHAALGLQLGL
Subjt:  PHAVTGFIGLALLTVQTLLPSLFEDNPGLRNVHGILGSGIMTLFLIHAALGLQLGL

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT2G36885.1 unknown protein (e value: 8.9e-86, % identity: 64.62)
Query:  MATLTGGASYMCLSKVPPPLPLPST----SSFPIS-LPNLPKFPRNSCFSLPS-PVTGTTKARLNPSSQDDDFGESVEMKEASETLLYSLAPFPLLFVAA
        MA +TG ++    + + P   LPS+    S+  +S   N+  FP  S F     P+T    + ++   + +  G+  +  E  ETL+ S++P PLL VA+
Subjt:  MATLTGGASYMCLSKVPPPLPLPST----SSFPIS-LPNLPKFPRNSCFSLPS-PVTGTTKARLNPSSQDDDFGESVEMKEASETLLYSLAPFPLLFVAA

Query:  LPGAGTVRSLFGPFVELVKSWNLPDWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDDVEEKAKAKDLHPKLLGGMFFFFALGATGGITSLLTSDKPIF
        LPGA TVRS+FGP VE+VKS NLPDWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDD+EEKAKAKDLHPKLL GMFFFFALGATGG+ SLLTSDKPIF
Subjt:  LPGAGTVRSLFGPFVELVKSWNLPDWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDDVEEKAKAKDLHPKLLGGMFFFFALGATGGITSLLTSDKPIF

Query:  ESPHAVTGFIGLALLTVQTLLPSLFEDNPGLRNVHGILGSGIMTLFLIHAALGLQLGLGY
        ESPHAVTG IGL LLTVQT+LPSLF++ P LRNVHGILGSGIM LFL+HAA GLQLGL +
Subjt:  ESPHAVTGFIGLALLTVQTLLPSLFEDNPGLRNVHGILGSGIMTLFLIHAALGLQLGLGY

AT2G36885.2 unknown protein (e value: 4.9e-84, % identity: 64.62)
Query:  MATLTGGASYMCLSKVPPPLPLPST----SSFPIS-LPNLPKFPRNSCFSLPS-PVTGTTKARLNPSSQDDDFGESVEMKEASETLLYSLAPFPLLFVAA
        MA +TG ++    + + P   LPS+    S+  +S   N+  FP  S F     P+T    + ++   + +  G+  +  E  ETL+ S++P PLL VA+
Subjt:  MATLTGGASYMCLSKVPPPLPLPST----SSFPIS-LPNLPKFPRNSCFSLPS-PVTGTTKARLNPSSQDDDFGESVEMKEASETLLYSLAPFPLLFVAA

Query:  LPGAGTVRSLFGPFVELVKSWNLPDWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDDVEEKAKAKDLHPKLLGGMFFFFALGATGGITSLLTSDKPIF
        LPGA TVRS+FGP VE+VKS NLPDWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDD+EEKAKAKDLHPKLL GMFFFFALGATGG+ SLLTSDKPIF
Subjt:  LPGAGTVRSLFGPFVELVKSWNLPDWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDDVEEKAKAKDLHPKLLGGMFFFFALGATGGITSLLTSDKPIF

Query:  ESPHAVTGFIGLALLTVQTLLPSLFEDNPGLRNVHGILGSGIMTLFLIHAALGLQLGLGY
        ESPHAVTG IGL LLTVQT+LPSLF+  P LRNVHGILGSGIM LFL+HAA GLQLGL +
Subjt:  ESPHAVTGFIGLALLTVQTLLPSLFEDNPGLRNVHGILGSGIMTLFLIHAALGLQLGLGY


Sequences
CDS sequence
ATGGCTACGCTTACCGGAGGAGCTTCTTATATGTGTTTATCCAAGGTACCACCACCATTGCCATTACCTTCCACTTCTTCATTCCCCATTTCTTTGCCAAATCTTCCCAA
ATTTCCCCGCAATTCCTGCTTCTCTTTGCCTTCCCCTGTCACTGGAACAACAAAAGCTCGCCTCAATCCGTCTTCTCAGGACGACGACTTCGGCGAATCGGTGGAAATGA
AGGAGGCGAGCGAGACGCTTTTGTACTCTCTCGCGCCTTTTCCTTTACTGTTCGTTGCTGCTCTTCCTGGAGCTGGAACTGTGAGGTCTCTCTTTGGCCCTTTCGTTGAG
CTTGTTAAATCTTGGAATCTTCCTGATTGGCTAGTGCATTGGGGTCATCCAGGCAACATGGCGGTAGTGCTCTTCGCTATGGGTGGTTATGGAACATACCTAGGCTTTAG
AATCCGTTACTCTGATGATGTGGAGGAGAAGGCCAAAGCCAAGGACTTGCATCCAAAGCTTCTAGGTGGAATGTTTTTCTTTTTTGCTCTTGGAGCAACGGGTGGAATCA
CTTCTCTACTTACATCAGACAAACCTATATTTGAGAGTCCACATGCTGTAACGGGGTTCATTGGCCTCGCGCTCTTGACTGTACAAACGCTTCTGCCCTCACTTTTTGAG
GATAATCCTGGACTGAGGAATGTTCATGGGATTTTGGGTAGTGGAATCATGACACTATTTCTCATCCATGCTGCACTTGGACTTCAACTTGGACTCGGTTACTAA
mRNA sequence
ATGGCTACGCTTACCGGAGGAGCTTCTTATATGTGTTTATCCAAGGTACCACCACCATTGCCATTACCTTCCACTTCTTCATTCCCCATTTCTTTGCCAAATCTTCCCAA
ATTTCCCCGCAATTCCTGCTTCTCTTTGCCTTCCCCTGTCACTGGAACAACAAAAGCTCGCCTCAATCCGTCTTCTCAGGACGACGACTTCGGCGAATCGGTGGAAATGA
AGGAGGCGAGCGAGACGCTTTTGTACTCTCTCGCGCCTTTTCCTTTACTGTTCGTTGCTGCTCTTCCTGGAGCTGGAACTGTGAGGTCTCTCTTTGGCCCTTTCGTTGAG
CTTGTTAAATCTTGGAATCTTCCTGATTGGCTAGTGCATTGGGGTCATCCAGGCAACATGGCGGTAGTGCTCTTCGCTATGGGTGGTTATGGAACATACCTAGGCTTTAG
AATCCGTTACTCTGATGATGTGGAGGAGAAGGCCAAAGCCAAGGACTTGCATCCAAAGCTTCTAGGTGGAATGTTTTTCTTTTTTGCTCTTGGAGCAACGGGTGGAATCA
CTTCTCTACTTACATCAGACAAACCTATATTTGAGAGTCCACATGCTGTAACGGGGTTCATTGGCCTCGCGCTCTTGACTGTACAAACGCTTCTGCCCTCACTTTTTGAG
GATAATCCTGGACTGAGGAATGTTCATGGGATTTTGGGTAGTGGAATCATGACACTATTTCTCATCCATGCTGCACTTGGACTTCAACTTGGACTCGGTTACTAA
Protein sequence
MATLTGGASYMCLSKVPPPLPLPSTSSFPISLPNLPKFPRNSCFSLPSPVTGTTKARLNPSSQDDDFGESVEMKEASETLLYSLAPFPLLFVAALPGAGTVRSLFGPFVE
LVKSWNLPDWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDDVEEKAKAKDLHPKLLGGMFFFFALGATGGITSLLTSDKPIFESPHAVTGFIGLALLTVQTLLPSLFE
DNPGLRNVHGILGSGIMTLFLIHAALGLQLGLGY