; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS015811 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS015811
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionGATA zinc finger domain-containing protein 10
Genome locationscaffold943_2:108991..111396
RNA-Seq ExpressionMS015811
SyntenyMS015811
Gene Ontology termsGO:0022904 - respiratory electron transport chain (biological process)
GO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR016174 - Di-haem cytochrome, transmembrane
IPR025067 - Protein of unknown function DUF4079


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6604398.1 hypothetical protein SDJN03_05007, partial [Cucurbita argyrosperma subsp. sororia]1.1e-11482.76Show/hide
Query:  MATLTG-------GASYMCLSKVPPPLPLPSTSSFPISLPNLPKFPRNSCFSLPSPVTGTTKARLNPS-SQDDDFGESVEMKEASETLLYSLAPFPLLFV
        MATLTG        +SYMCL+KV PP    S++SFPI LPNL K PR+S FSL SPVTG  K R +PS ++D +FG+ VEM+E  ET LYSLAPFPLLFV
Subjt:  MATLTG-------GASYMCLSKVPPPLPLPSTSSFPISLPNLPKFPRNSCFSLPSPVTGTTKARLNPS-SQDDDFGESVEMKEASETLLYSLAPFPLLFV

Query:  AALPGAGTVRSLFGPFVELVKSWNLPDWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDDVEEKAKAKDLHPKLLGGMFFFFALGATGGITSLLTSDKP
        AALPGAGTVRSLFGPFVELVKSWNLP+WLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYS+DVEEKA AKDLHPKLLGGMFFFFALGATGGITSLLTSDKP
Subjt:  AALPGAGTVRSLFGPFVELVKSWNLPDWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDDVEEKAKAKDLHPKLLGGMFFFFALGATGGITSLLTSDKP

Query:  IFESPHAVTGFIGLALLTVQTLLPSLFEDNPGLRNVHGILGSGIMTLFLIHAALGLQLGLS
        IFESPHAVTGFIGLALLT+Q+LLPSLFEDNPGLRN+HGILGSGIMTLFLIHAALGLQLGLS
Subjt:  IFESPHAVTGFIGLALLTVQTLLPSLFEDNPGLRNVHGILGSGIMTLFLIHAALGLQLGLS

XP_022143716.1 uncharacterized protein LOC111013557 [Momordica charantia]1.1e-14199.61Show/hide
Query:  MATLTGGASYMCLSKVPPPLPLPSTSSFPISLPNLPKFPRNSCFSLPSPVTGTTKARLNPSSQDDDFGESVEMKEASETLLYSLAPFPLLFVAALPGAGT
        MATLTGGASYMCLSKVPPPLPLPSTSSFPISLPNLPKFPRNSCFSLPSPVTGTTKARLNPSSQDDDFGESVEMKEASETLLYSLAPFPLLFVAALPGAGT
Subjt:  MATLTGGASYMCLSKVPPPLPLPSTSSFPISLPNLPKFPRNSCFSLPSPVTGTTKARLNPSSQDDDFGESVEMKEASETLLYSLAPFPLLFVAALPGAGT

Query:  VRSLFGPFVELVKSWNLPDWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDDVEEKAKAKDLHPKLLGGMFFFFALGATGGITSLLTSDKPIFESPHAV
        VRSLFGPFVELVKSWNLPDWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDDVEEKAKAKDLHPKLLGGMFFFFALGATGGITSLLTSDKPIFESPHAV
Subjt:  VRSLFGPFVELVKSWNLPDWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDDVEEKAKAKDLHPKLLGGMFFFFALGATGGITSLLTSDKPIFESPHAV

Query:  TGFIGLALLTVQTLLPSLFEDNPGLRNVHGILGSGIMTLFLIHAALGLQLGLSY
        TGFIGLALLTVQTLLPSLFEDNPGLRNVHGILGSGIMTLFLIHAALGLQLGL Y
Subjt:  TGFIGLALLTVQTLLPSLFEDNPGLRNVHGILGSGIMTLFLIHAALGLQLGLSY

XP_022925916.1 uncharacterized protein LOC111433189 [Cucurbita moschata]3.8e-11584.05Show/hide
Query:  MATLTG---GASYMCLSKVPPPLPLPSTSSFPISLPNLPKFPRNSCFSLPSPVTGTTKARLNPS-SQDDDFGESVEMKEASETLLYSLAPFPLLFVAALP
        MATLTG    +SYMCL+KV PP    S++SFPI LPNL K PR+S FSL SPVTG  K R +PS ++D +FG+ VEM+E  ET LYSLAPFPLLFVAALP
Subjt:  MATLTG---GASYMCLSKVPPPLPLPSTSSFPISLPNLPKFPRNSCFSLPSPVTGTTKARLNPS-SQDDDFGESVEMKEASETLLYSLAPFPLLFVAALP

Query:  GAGTVRSLFGPFVELVKSWNLPDWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDDVEEKAKAKDLHPKLLGGMFFFFALGATGGITSLLTSDKPIFES
        GAGTVRSLFGPFVELVKSWNLP+WLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYS+DVEEKA AKDLHPKLLGGMFFFFALGATGGITSLLTSDKPIFES
Subjt:  GAGTVRSLFGPFVELVKSWNLPDWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDDVEEKAKAKDLHPKLLGGMFFFFALGATGGITSLLTSDKPIFES

Query:  PHAVTGFIGLALLTVQTLLPSLFEDNPGLRNVHGILGSGIMTLFLIHAALGLQLGLS
        PHAVTGFIGLALLT+Q+LLPSLFEDNPGLRN+HGILGSGIMTLFLIHAALGLQLGLS
Subjt:  PHAVTGFIGLALLTVQTLLPSLFEDNPGLRNVHGILGSGIMTLFLIHAALGLQLGLS

XP_022977650.1 uncharacterized protein LOC111477900 [Cucurbita maxima]7.2e-11483.66Show/hide
Query:  MATLTG---GASYMCLSKVPPPLPLPSTSSFPISLPNLPKFPRNSCFSLPSPVTGTTKARLNPS-SQDDDFGESVEMKEASETLLYSLAPFPLLFVAALP
        MATLTG    +SYMCL+KV PP    S++SFPI LP L K PR+S FSL SPVTG  K R NPS ++D +FG+ VE +E  ET LYSLAPFPLLFVAALP
Subjt:  MATLTG---GASYMCLSKVPPPLPLPSTSSFPISLPNLPKFPRNSCFSLPSPVTGTTKARLNPS-SQDDDFGESVEMKEASETLLYSLAPFPLLFVAALP

Query:  GAGTVRSLFGPFVELVKSWNLPDWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDDVEEKAKAKDLHPKLLGGMFFFFALGATGGITSLLTSDKPIFES
        GAGTVRSLFGPFVELVKSWNLP+WLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYS+DVEEKA AKDLHPKLLGGMFFFFALGATGGITSLLTSDKPIFES
Subjt:  GAGTVRSLFGPFVELVKSWNLPDWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDDVEEKAKAKDLHPKLLGGMFFFFALGATGGITSLLTSDKPIFES

Query:  PHAVTGFIGLALLTVQTLLPSLFEDNPGLRNVHGILGSGIMTLFLIHAALGLQLGLS
        PHAVTGFIGLALLT+Q+LLPSLFEDNPGLRN+HGILGSGIMTLFLIHAALGLQLGLS
Subjt:  PHAVTGFIGLALLTVQTLLPSLFEDNPGLRNVHGILGSGIMTLFLIHAALGLQLGLS

XP_038880983.1 uncharacterized protein LOC120072635 [Benincasa hispida]2.7e-11684.94Show/hide
Query:  MATLT----GGASYMCLSKVPPPLPLPSTSSFPISLPNLPKFPRNSCFSLPSPVTGTTKARLNPS-SQDDDFGESVEMKEASETLLYSLAPFPLLFVAAL
        MATLT      +SYMCL+KV  P PLPST SFPIS P  PK PR+S FSL SPVT  T  R NPS ++DD+FG+ VEMKE SET LYSL+PFPLLF+AAL
Subjt:  MATLT----GGASYMCLSKVPPPLPLPSTSSFPISLPNLPKFPRNSCFSLPSPVTGTTKARLNPS-SQDDDFGESVEMKEASETLLYSLAPFPLLFVAAL

Query:  PGAGTVRSLFGPFVELVKSWNLPDWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDDVEEKAKAKDLHPKLLGGMFFFFALGATGGITSLLTSDKPIFE
        PGAGTVRSLFGPFVELVKSWNLP+WLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSD+VEEKA AKDLHPKLLGGMFFFFALGATGGITSLLTSDKPIFE
Subjt:  PGAGTVRSLFGPFVELVKSWNLPDWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDDVEEKAKAKDLHPKLLGGMFFFFALGATGGITSLLTSDKPIFE

Query:  SPHAVTGFIGLALLTVQTLLPSLFEDNPGLRNVHGILGSGIMTLFLIHAALGLQLGLSY
        SPHAVTGFIGL LLTVQTLLPSLFE+NPGLRNVHGILGSGIMTLFLIHAALGLQLGLSY
Subjt:  SPHAVTGFIGLALLTVQTLLPSLFEDNPGLRNVHGILGSGIMTLFLIHAALGLQLGLSY

TrEMBL top hitse value%identityAlignment
A0A0A0KIF5 Uncharacterized protein4.2e-10779.85Show/hide
Query:  MATLTGGA---SYMCLSKVPPPLPLPSTSSFPISLPNLPKFPRN----SCFSLPSPVTGTTKARLNPS-SQDDDFGESVEMK-EASETLLYSLAPFPLLF
        MATLT  +   SY+CL+KV P  PLPSTS       NLPK PRN    S FS  SP+   T  R NPS +++D+FG+  E K E SE  LYSL+PFPLLF
Subjt:  MATLTGGA---SYMCLSKVPPPLPLPSTSSFPISLPNLPKFPRN----SCFSLPSPVTGTTKARLNPS-SQDDDFGESVEMK-EASETLLYSLAPFPLLF

Query:  VAALPGAGTVRSLFGPFVELVKSWNLPDWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDDVEEKAKAKDLHPKLLGGMFFFFALGATGGITSLLTSDK
        +AALPGAGTVRSLFGPFVELVKSWNLP+WLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDDVEEKA AKDLHPKLLGGMFFFFALGATGG+TSLLTSDK
Subjt:  VAALPGAGTVRSLFGPFVELVKSWNLPDWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDDVEEKAKAKDLHPKLLGGMFFFFALGATGGITSLLTSDK

Query:  PIFESPHAVTGFIGLALLTVQTLLPSLFEDNPGLRNVHGILGSGIMTLFLIHAALGLQLGLSY
        PI ESPHAVTGFIGL LLTVQTLLPSLFEDNPGLRNVHGILGSGIMTLFLIHAALGLQLGLSY
Subjt:  PIFESPHAVTGFIGLALLTVQTLLPSLFEDNPGLRNVHGILGSGIMTLFLIHAALGLQLGLSY

A0A1S3CHJ9 uncharacterized protein LOC1035008272.0e-10980.61Show/hide
Query:  MATLT------GGASYMCLSKVPPPLPLPSTSSFPISLPNLPKFPRN-SCFSLPSPVTGTTKARLNPS-SQDDDFGESVEMK-EASETLLYSLAPFPLLF
        MATLT        +SYMCL+KV PPLP   ++SFPI  P LPK PRN S FS  SP+   T  R NP  +++D FG+ VEMK E SE  LYSL+PFPLLF
Subjt:  MATLT------GGASYMCLSKVPPPLPLPSTSSFPISLPNLPKFPRN-SCFSLPSPVTGTTKARLNPS-SQDDDFGESVEMK-EASETLLYSLAPFPLLF

Query:  VAALPGAGTVRSLFGPFVELVKSWNLPDWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDDVEEKAKAKDLHPKLLGGMFFFFALGATGGITSLLTSDK
        +AALPG GTVRSLFGPFVELVKS NLP+WLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDDVEEKA AKDLHPKLLGGMFFFFALGATGGI SLLTSDK
Subjt:  VAALPGAGTVRSLFGPFVELVKSWNLPDWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDDVEEKAKAKDLHPKLLGGMFFFFALGATGGITSLLTSDK

Query:  PIFESPHAVTGFIGLALLTVQTLLPSLFEDNPGLRNVHGILGSGIMTLFLIHAALGLQLGLSY
        PIFESPHAVTGFIGLALLTVQTLLPSLFEDNPGLRNVHGILGSGIMTLFLIHAA GLQLGLSY
Subjt:  PIFESPHAVTGFIGLALLTVQTLLPSLFEDNPGLRNVHGILGSGIMTLFLIHAALGLQLGLSY

A0A6J1CRM9 uncharacterized protein LOC1110135575.2e-14299.61Show/hide
Query:  MATLTGGASYMCLSKVPPPLPLPSTSSFPISLPNLPKFPRNSCFSLPSPVTGTTKARLNPSSQDDDFGESVEMKEASETLLYSLAPFPLLFVAALPGAGT
        MATLTGGASYMCLSKVPPPLPLPSTSSFPISLPNLPKFPRNSCFSLPSPVTGTTKARLNPSSQDDDFGESVEMKEASETLLYSLAPFPLLFVAALPGAGT
Subjt:  MATLTGGASYMCLSKVPPPLPLPSTSSFPISLPNLPKFPRNSCFSLPSPVTGTTKARLNPSSQDDDFGESVEMKEASETLLYSLAPFPLLFVAALPGAGT

Query:  VRSLFGPFVELVKSWNLPDWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDDVEEKAKAKDLHPKLLGGMFFFFALGATGGITSLLTSDKPIFESPHAV
        VRSLFGPFVELVKSWNLPDWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDDVEEKAKAKDLHPKLLGGMFFFFALGATGGITSLLTSDKPIFESPHAV
Subjt:  VRSLFGPFVELVKSWNLPDWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDDVEEKAKAKDLHPKLLGGMFFFFALGATGGITSLLTSDKPIFESPHAV

Query:  TGFIGLALLTVQTLLPSLFEDNPGLRNVHGILGSGIMTLFLIHAALGLQLGLSY
        TGFIGLALLTVQTLLPSLFEDNPGLRNVHGILGSGIMTLFLIHAALGLQLGL Y
Subjt:  TGFIGLALLTVQTLLPSLFEDNPGLRNVHGILGSGIMTLFLIHAALGLQLGLSY

A0A6J1EDG7 uncharacterized protein LOC1114331891.9e-11584.05Show/hide
Query:  MATLTG---GASYMCLSKVPPPLPLPSTSSFPISLPNLPKFPRNSCFSLPSPVTGTTKARLNPS-SQDDDFGESVEMKEASETLLYSLAPFPLLFVAALP
        MATLTG    +SYMCL+KV PP    S++SFPI LPNL K PR+S FSL SPVTG  K R +PS ++D +FG+ VEM+E  ET LYSLAPFPLLFVAALP
Subjt:  MATLTG---GASYMCLSKVPPPLPLPSTSSFPISLPNLPKFPRNSCFSLPSPVTGTTKARLNPS-SQDDDFGESVEMKEASETLLYSLAPFPLLFVAALP

Query:  GAGTVRSLFGPFVELVKSWNLPDWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDDVEEKAKAKDLHPKLLGGMFFFFALGATGGITSLLTSDKPIFES
        GAGTVRSLFGPFVELVKSWNLP+WLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYS+DVEEKA AKDLHPKLLGGMFFFFALGATGGITSLLTSDKPIFES
Subjt:  GAGTVRSLFGPFVELVKSWNLPDWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDDVEEKAKAKDLHPKLLGGMFFFFALGATGGITSLLTSDKPIFES

Query:  PHAVTGFIGLALLTVQTLLPSLFEDNPGLRNVHGILGSGIMTLFLIHAALGLQLGLS
        PHAVTGFIGLALLT+Q+LLPSLFEDNPGLRN+HGILGSGIMTLFLIHAALGLQLGLS
Subjt:  PHAVTGFIGLALLTVQTLLPSLFEDNPGLRNVHGILGSGIMTLFLIHAALGLQLGLS

A0A6J1IMY7 uncharacterized protein LOC1114779003.5e-11483.66Show/hide
Query:  MATLTG---GASYMCLSKVPPPLPLPSTSSFPISLPNLPKFPRNSCFSLPSPVTGTTKARLNPS-SQDDDFGESVEMKEASETLLYSLAPFPLLFVAALP
        MATLTG    +SYMCL+KV PP    S++SFPI LP L K PR+S FSL SPVTG  K R NPS ++D +FG+ VE +E  ET LYSLAPFPLLFVAALP
Subjt:  MATLTG---GASYMCLSKVPPPLPLPSTSSFPISLPNLPKFPRNSCFSLPSPVTGTTKARLNPS-SQDDDFGESVEMKEASETLLYSLAPFPLLFVAALP

Query:  GAGTVRSLFGPFVELVKSWNLPDWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDDVEEKAKAKDLHPKLLGGMFFFFALGATGGITSLLTSDKPIFES
        GAGTVRSLFGPFVELVKSWNLP+WLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYS+DVEEKA AKDLHPKLLGGMFFFFALGATGGITSLLTSDKPIFES
Subjt:  GAGTVRSLFGPFVELVKSWNLPDWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDDVEEKAKAKDLHPKLLGGMFFFFALGATGGITSLLTSDKPIFES

Query:  PHAVTGFIGLALLTVQTLLPSLFEDNPGLRNVHGILGSGIMTLFLIHAALGLQLGLS
        PHAVTGFIGLALLT+Q+LLPSLFEDNPGLRN+HGILGSGIMTLFLIHAALGLQLGLS
Subjt:  PHAVTGFIGLALLTVQTLLPSLFEDNPGLRNVHGILGSGIMTLFLIHAALGLQLGLS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G36885.1 unknown protein3.1e-8665Show/hide
Query:  MATLTGGASYMCLSKVPPPLPLPST----SSFPIS-LPNLPKFPRNSCFSLPS-PVTGTTKARLNPSSQDDDFGESVEMKEASETLLYSLAPFPLLFVAA
        MA +TG ++    + + P   LPS+    S+  +S   N+  FP  S F     P+T    + ++   + +  G+  +  E  ETL+ S++P PLL VA+
Subjt:  MATLTGGASYMCLSKVPPPLPLPST----SSFPIS-LPNLPKFPRNSCFSLPS-PVTGTTKARLNPSSQDDDFGESVEMKEASETLLYSLAPFPLLFVAA

Query:  LPGAGTVRSLFGPFVELVKSWNLPDWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDDVEEKAKAKDLHPKLLGGMFFFFALGATGGITSLLTSDKPIF
        LPGA TVRS+FGP VE+VKS NLPDWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDD+EEKAKAKDLHPKLL GMFFFFALGATGG+ SLLTSDKPIF
Subjt:  LPGAGTVRSLFGPFVELVKSWNLPDWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDDVEEKAKAKDLHPKLLGGMFFFFALGATGGITSLLTSDKPIF

Query:  ESPHAVTGFIGLALLTVQTLLPSLFEDNPGLRNVHGILGSGIMTLFLIHAALGLQLGLSY
        ESPHAVTG IGL LLTVQT+LPSLF++ P LRNVHGILGSGIM LFL+HAA GLQLGLS+
Subjt:  ESPHAVTGFIGLALLTVQTLLPSLFEDNPGLRNVHGILGSGIMTLFLIHAALGLQLGLSY

AT2G36885.2 unknown protein1.7e-8465Show/hide
Query:  MATLTGGASYMCLSKVPPPLPLPST----SSFPIS-LPNLPKFPRNSCFSLPS-PVTGTTKARLNPSSQDDDFGESVEMKEASETLLYSLAPFPLLFVAA
        MA +TG ++    + + P   LPS+    S+  +S   N+  FP  S F     P+T    + ++   + +  G+  +  E  ETL+ S++P PLL VA+
Subjt:  MATLTGGASYMCLSKVPPPLPLPST----SSFPIS-LPNLPKFPRNSCFSLPS-PVTGTTKARLNPSSQDDDFGESVEMKEASETLLYSLAPFPLLFVAA

Query:  LPGAGTVRSLFGPFVELVKSWNLPDWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDDVEEKAKAKDLHPKLLGGMFFFFALGATGGITSLLTSDKPIF
        LPGA TVRS+FGP VE+VKS NLPDWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDD+EEKAKAKDLHPKLL GMFFFFALGATGG+ SLLTSDKPIF
Subjt:  LPGAGTVRSLFGPFVELVKSWNLPDWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDDVEEKAKAKDLHPKLLGGMFFFFALGATGGITSLLTSDKPIF

Query:  ESPHAVTGFIGLALLTVQTLLPSLFEDNPGLRNVHGILGSGIMTLFLIHAALGLQLGLSY
        ESPHAVTG IGL LLTVQT+LPSLF+  P LRNVHGILGSGIM LFL+HAA GLQLGLS+
Subjt:  ESPHAVTGFIGLALLTVQTLLPSLFEDNPGLRNVHGILGSGIMTLFLIHAALGLQLGLSY


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTACGCTTACCGGAGGAGCTTCTTATATGTGTTTATCCAAGGTACCACCACCATTGCCATTACCTTCCACTTCTTCATTCCCCATTTCTTTGCCAAATCTTCCCAA
ATTTCCCCGCAATTCCTGCTTCTCTTTGCCTTCCCCTGTCACTGGAACAACAAAAGCTCGCCTCAATCCGTCTTCTCAGGACGACGACTTCGGCGAATCGGTGGAAATGA
AGGAGGCGAGCGAGACGCTTTTGTACTCTCTCGCGCCTTTTCCTTTACTGTTCGTTGCTGCTCTTCCTGGAGCTGGAACTGTGAGGTCTCTCTTTGGCCCTTTCGTTGAG
CTTGTTAAATCTTGGAATCTTCCTGATTGGCTAGTGCATTGGGGTCATCCAGGCAACATGGCAGTAGTGCTCTTCGCTATGGGTGGTTATGGAACATACCTAGGCTTTAG
AATCCGTTACTCTGATGATGTGGAGGAGAAGGCCAAAGCCAAGGACTTGCATCCAAAGCTTCTAGGTGGAATGTTTTTCTTTTTTGCTCTTGGAGCAACGGGTGGAATCA
CTTCTCTACTTACATCAGACAAACCTATATTTGAGAGTCCACATGCTGTAACGGGGTTCATTGGCCTCGCGCTCTTGACTGTACAAACGCTTCTGCCCTCACTTTTTGAG
GATAATCCTGGACTGAGGAATGTTCATGGGATTTTGGGTAGTGGAATCATGACACTATTTCTCATCCATGCTGCACTTGGACTTCAACTTGGACTCAGTTAC
mRNA sequenceShow/hide mRNA sequence
ATGGCTACGCTTACCGGAGGAGCTTCTTATATGTGTTTATCCAAGGTACCACCACCATTGCCATTACCTTCCACTTCTTCATTCCCCATTTCTTTGCCAAATCTTCCCAA
ATTTCCCCGCAATTCCTGCTTCTCTTTGCCTTCCCCTGTCACTGGAACAACAAAAGCTCGCCTCAATCCGTCTTCTCAGGACGACGACTTCGGCGAATCGGTGGAAATGA
AGGAGGCGAGCGAGACGCTTTTGTACTCTCTCGCGCCTTTTCCTTTACTGTTCGTTGCTGCTCTTCCTGGAGCTGGAACTGTGAGGTCTCTCTTTGGCCCTTTCGTTGAG
CTTGTTAAATCTTGGAATCTTCCTGATTGGCTAGTGCATTGGGGTCATCCAGGCAACATGGCAGTAGTGCTCTTCGCTATGGGTGGTTATGGAACATACCTAGGCTTTAG
AATCCGTTACTCTGATGATGTGGAGGAGAAGGCCAAAGCCAAGGACTTGCATCCAAAGCTTCTAGGTGGAATGTTTTTCTTTTTTGCTCTTGGAGCAACGGGTGGAATCA
CTTCTCTACTTACATCAGACAAACCTATATTTGAGAGTCCACATGCTGTAACGGGGTTCATTGGCCTCGCGCTCTTGACTGTACAAACGCTTCTGCCCTCACTTTTTGAG
GATAATCCTGGACTGAGGAATGTTCATGGGATTTTGGGTAGTGGAATCATGACACTATTTCTCATCCATGCTGCACTTGGACTTCAACTTGGACTCAGTTAC
Protein sequenceShow/hide protein sequence
MATLTGGASYMCLSKVPPPLPLPSTSSFPISLPNLPKFPRNSCFSLPSPVTGTTKARLNPSSQDDDFGESVEMKEASETLLYSLAPFPLLFVAALPGAGTVRSLFGPFVE
LVKSWNLPDWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDDVEEKAKAKDLHPKLLGGMFFFFALGATGGITSLLTSDKPIFESPHAVTGFIGLALLTVQTLLPSLFE
DNPGLRNVHGILGSGIMTLFLIHAALGLQLGLSY