; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC01g0554 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC01g0554
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionGATA zinc finger domain-containing protein 10
Genome locationMC01:11890212..11893041
RNA-Seq ExpressionMC01g0554
SyntenyMC01g0554
Gene Ontology termsGO:0022904 - respiratory electron transport chain (biological process)
GO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR016174 - Di-haem cytochrome, transmembrane
IPR025067 - Protein of unknown function DUF4079


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6604398.1 hypothetical protein SDJN03_05007, partial [Cucurbita argyrosperma subsp. sororia]2.37e-14882.69Show/hide
Query:  MATLTGGAS-------YMCLSKVPPPLPLPSTSSFPISLPNLPKFPRNSCFSLPSPVTGTTKARLNPS-SQDDDFGESVEMKEASETLLYSLAPFPLLFV
        MATLTG +S       YMCL+KV PP    S++SFPI LPNL K PR+S FSL SPVTG  K R +PS ++D +FG+ VEM+E  ET LYSLAPFPLLFV
Subjt:  MATLTGGAS-------YMCLSKVPPPLPLPSTSSFPISLPNLPKFPRNSCFSLPSPVTGTTKARLNPS-SQDDDFGESVEMKEASETLLYSLAPFPLLFV

Query:  AALPGAGTVRSLFGPFVELVKSWNLPDWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDDVEEKAKAKDLHPKLLGGMFFFFALGATGGITSLLTSDKP
        AALPGAGTVRSLFGPFVELVKSWNLP+WLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYS+DVEEKA AKDLHPKLLGGMFFFFALGATGGITSLLTSDKP
Subjt:  AALPGAGTVRSLFGPFVELVKSWNLPDWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDDVEEKAKAKDLHPKLLGGMFFFFALGATGGITSLLTSDKP

Query:  IFESPHAVTGFIGLALLTVQTLLPSLFEDNPGLRNVHGILGSGIMTLFLIHAALGLQLGL
        IFESPHAVTGFIGLALLT+Q+LLPSLFEDNPGLRN+HGILGSGIMTLFLIHAALGLQLGL
Subjt:  IFESPHAVTGFIGLALLTVQTLLPSLFEDNPGLRNVHGILGSGIMTLFLIHAALGLQLGL

XP_022143716.1 uncharacterized protein LOC111013557 [Momordica charantia]1.82e-185100Show/hide
Query:  MATLTGGASYMCLSKVPPPLPLPSTSSFPISLPNLPKFPRNSCFSLPSPVTGTTKARLNPSSQDDDFGESVEMKEASETLLYSLAPFPLLFVAALPGAGT
        MATLTGGASYMCLSKVPPPLPLPSTSSFPISLPNLPKFPRNSCFSLPSPVTGTTKARLNPSSQDDDFGESVEMKEASETLLYSLAPFPLLFVAALPGAGT
Subjt:  MATLTGGASYMCLSKVPPPLPLPSTSSFPISLPNLPKFPRNSCFSLPSPVTGTTKARLNPSSQDDDFGESVEMKEASETLLYSLAPFPLLFVAALPGAGT

Query:  VRSLFGPFVELVKSWNLPDWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDDVEEKAKAKDLHPKLLGGMFFFFALGATGGITSLLTSDKPIFESPHAV
        VRSLFGPFVELVKSWNLPDWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDDVEEKAKAKDLHPKLLGGMFFFFALGATGGITSLLTSDKPIFESPHAV
Subjt:  VRSLFGPFVELVKSWNLPDWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDDVEEKAKAKDLHPKLLGGMFFFFALGATGGITSLLTSDKPIFESPHAV

Query:  TGFIGLALLTVQTLLPSLFEDNPGLRNVHGILGSGIMTLFLIHAALGLQLGLGY
        TGFIGLALLTVQTLLPSLFEDNPGLRNVHGILGSGIMTLFLIHAALGLQLGLGY
Subjt:  TGFIGLALLTVQTLLPSLFEDNPGLRNVHGILGSGIMTLFLIHAALGLQLGLGY

XP_022925916.1 uncharacterized protein LOC111433189 [Cucurbita moschata]5.05e-14983.98Show/hide
Query:  MATLTGGAS---YMCLSKVPPPLPLPSTSSFPISLPNLPKFPRNSCFSLPSPVTGTTKARLNPS-SQDDDFGESVEMKEASETLLYSLAPFPLLFVAALP
        MATLTG +S   YMCL+KV PP    S++SFPI LPNL K PR+S FSL SPVTG  K R +PS ++D +FG+ VEM+E  ET LYSLAPFPLLFVAALP
Subjt:  MATLTGGAS---YMCLSKVPPPLPLPSTSSFPISLPNLPKFPRNSCFSLPSPVTGTTKARLNPS-SQDDDFGESVEMKEASETLLYSLAPFPLLFVAALP

Query:  GAGTVRSLFGPFVELVKSWNLPDWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDDVEEKAKAKDLHPKLLGGMFFFFALGATGGITSLLTSDKPIFES
        GAGTVRSLFGPFVELVKSWNLP+WLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYS+DVEEKA AKDLHPKLLGGMFFFFALGATGGITSLLTSDKPIFES
Subjt:  GAGTVRSLFGPFVELVKSWNLPDWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDDVEEKAKAKDLHPKLLGGMFFFFALGATGGITSLLTSDKPIFES

Query:  PHAVTGFIGLALLTVQTLLPSLFEDNPGLRNVHGILGSGIMTLFLIHAALGLQLGL
        PHAVTGFIGLALLT+Q+LLPSLFEDNPGLRN+HGILGSGIMTLFLIHAALGLQLGL
Subjt:  PHAVTGFIGLALLTVQTLLPSLFEDNPGLRNVHGILGSGIMTLFLIHAALGLQLGL

XP_022977650.1 uncharacterized protein LOC111477900 [Cucurbita maxima]2.40e-14783.59Show/hide
Query:  MATLTGGAS---YMCLSKVPPPLPLPSTSSFPISLPNLPKFPRNSCFSLPSPVTGTTKARLNPS-SQDDDFGESVEMKEASETLLYSLAPFPLLFVAALP
        MATLTG +S   YMCL+KV PP    S++SFPI LP L K PR+S FSL SPVTG  K R NPS ++D +FG+ VE +E  ET LYSLAPFPLLFVAALP
Subjt:  MATLTGGAS---YMCLSKVPPPLPLPSTSSFPISLPNLPKFPRNSCFSLPSPVTGTTKARLNPS-SQDDDFGESVEMKEASETLLYSLAPFPLLFVAALP

Query:  GAGTVRSLFGPFVELVKSWNLPDWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDDVEEKAKAKDLHPKLLGGMFFFFALGATGGITSLLTSDKPIFES
        GAGTVRSLFGPFVELVKSWNLP+WLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYS+DVEEKA AKDLHPKLLGGMFFFFALGATGGITSLLTSDKPIFES
Subjt:  GAGTVRSLFGPFVELVKSWNLPDWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDDVEEKAKAKDLHPKLLGGMFFFFALGATGGITSLLTSDKPIFES

Query:  PHAVTGFIGLALLTVQTLLPSLFEDNPGLRNVHGILGSGIMTLFLIHAALGLQLGL
        PHAVTGFIGLALLT+Q+LLPSLFEDNPGLRN+HGILGSGIMTLFLIHAALGLQLGL
Subjt:  PHAVTGFIGLALLTVQTLLPSLFEDNPGLRNVHGILGSGIMTLFLIHAALGLQLGL

XP_038880983.1 uncharacterized protein LOC120072635 [Benincasa hispida]1.63e-15084.56Show/hide
Query:  MATLTGGAS----YMCLSKVPPPLPLPSTSSFPISLPNLPKFPRNSCFSLPSPVTGTTKARLNPS-SQDDDFGESVEMKEASETLLYSLAPFPLLFVAAL
        MATLT  +S    YMCL+KV  P PLPSTS FPIS P  PK PR+S FSL SPVT  T  R NPS ++DD+FG+ VEMKE SET LYSL+PFPLLF+AAL
Subjt:  MATLTGGAS----YMCLSKVPPPLPLPSTSSFPISLPNLPKFPRNSCFSLPSPVTGTTKARLNPS-SQDDDFGESVEMKEASETLLYSLAPFPLLFVAAL

Query:  PGAGTVRSLFGPFVELVKSWNLPDWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDDVEEKAKAKDLHPKLLGGMFFFFALGATGGITSLLTSDKPIFE
        PGAGTVRSLFGPFVELVKSWNLP+WLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSD+VEEKA AKDLHPKLLGGMFFFFALGATGGITSLLTSDKPIFE
Subjt:  PGAGTVRSLFGPFVELVKSWNLPDWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDDVEEKAKAKDLHPKLLGGMFFFFALGATGGITSLLTSDKPIFE

Query:  SPHAVTGFIGLALLTVQTLLPSLFEDNPGLRNVHGILGSGIMTLFLIHAALGLQLGLGY
        SPHAVTGFIGL LLTVQTLLPSLFE+NPGLRNVHGILGSGIMTLFLIHAALGLQLGL Y
Subjt:  SPHAVTGFIGLALLTVQTLLPSLFEDNPGLRNVHGILGSGIMTLFLIHAALGLQLGLGY

TrEMBL top hitse value%identityAlignment
A0A0A0KIF5 Uncharacterized protein1.60e-13879.47Show/hide
Query:  MATLTGGAS---YMCLSKVPPPLPLPSTSSFPISLPNLPKFPRNSC----FSLPSPVTGTTKARLNPS-SQDDDFGESVEMKE-ASETLLYSLAPFPLLF
        MATLT  +S   Y+CL+KV PPLP  STS       NLPK PRNS     FS  SP+   T  R NPS +++D+FG+  E KE  SE  LYSL+PFPLLF
Subjt:  MATLTGGAS---YMCLSKVPPPLPLPSTSSFPISLPNLPKFPRNSC----FSLPSPVTGTTKARLNPS-SQDDDFGESVEMKE-ASETLLYSLAPFPLLF

Query:  VAALPGAGTVRSLFGPFVELVKSWNLPDWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDDVEEKAKAKDLHPKLLGGMFFFFALGATGGITSLLTSDK
        +AALPGAGTVRSLFGPFVELVKSWNLP+WLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDDVEEKA AKDLHPKLLGGMFFFFALGATGG+TSLLTSDK
Subjt:  VAALPGAGTVRSLFGPFVELVKSWNLPDWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDDVEEKAKAKDLHPKLLGGMFFFFALGATGGITSLLTSDK

Query:  PIFESPHAVTGFIGLALLTVQTLLPSLFEDNPGLRNVHGILGSGIMTLFLIHAALGLQLGLGY
        PI ESPHAVTGFIGL LLTVQTLLPSLFEDNPGLRNVHGILGSGIMTLFLIHAALGLQLGL Y
Subjt:  PIFESPHAVTGFIGLALLTVQTLLPSLFEDNPGLRNVHGILGSGIMTLFLIHAALGLQLGLGY

A0A1S3CHJ9 uncharacterized protein LOC1035008272.44e-14180.23Show/hide
Query:  MATLTGGA------SYMCLSKVPPPLPLPSTSSFPISLPNLPKFPRNSC-FSLPSPVTGTTKARLNPS-SQDDDFGESVEMKE-ASETLLYSLAPFPLLF
        MATLT  +      SYMCL+KV PPLP   ++SFPI  P LPK PRNS  FS  SP+   T  R NP  +++D FG+ VEMKE  SE  LYSL+PFPLLF
Subjt:  MATLTGGA------SYMCLSKVPPPLPLPSTSSFPISLPNLPKFPRNSC-FSLPSPVTGTTKARLNPS-SQDDDFGESVEMKE-ASETLLYSLAPFPLLF

Query:  VAALPGAGTVRSLFGPFVELVKSWNLPDWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDDVEEKAKAKDLHPKLLGGMFFFFALGATGGITSLLTSDK
        +AALPG GTVRSLFGPFVELVKS NLP+WLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDDVEEKA AKDLHPKLLGGMFFFFALGATGGI SLLTSDK
Subjt:  VAALPGAGTVRSLFGPFVELVKSWNLPDWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDDVEEKAKAKDLHPKLLGGMFFFFALGATGGITSLLTSDK

Query:  PIFESPHAVTGFIGLALLTVQTLLPSLFEDNPGLRNVHGILGSGIMTLFLIHAALGLQLGLGY
        PIFESPHAVTGFIGLALLTVQTLLPSLFEDNPGLRNVHGILGSGIMTLFLIHAA GLQLGL Y
Subjt:  PIFESPHAVTGFIGLALLTVQTLLPSLFEDNPGLRNVHGILGSGIMTLFLIHAALGLQLGLGY

A0A6J1CRM9 uncharacterized protein LOC1110135578.80e-186100Show/hide
Query:  MATLTGGASYMCLSKVPPPLPLPSTSSFPISLPNLPKFPRNSCFSLPSPVTGTTKARLNPSSQDDDFGESVEMKEASETLLYSLAPFPLLFVAALPGAGT
        MATLTGGASYMCLSKVPPPLPLPSTSSFPISLPNLPKFPRNSCFSLPSPVTGTTKARLNPSSQDDDFGESVEMKEASETLLYSLAPFPLLFVAALPGAGT
Subjt:  MATLTGGASYMCLSKVPPPLPLPSTSSFPISLPNLPKFPRNSCFSLPSPVTGTTKARLNPSSQDDDFGESVEMKEASETLLYSLAPFPLLFVAALPGAGT

Query:  VRSLFGPFVELVKSWNLPDWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDDVEEKAKAKDLHPKLLGGMFFFFALGATGGITSLLTSDKPIFESPHAV
        VRSLFGPFVELVKSWNLPDWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDDVEEKAKAKDLHPKLLGGMFFFFALGATGGITSLLTSDKPIFESPHAV
Subjt:  VRSLFGPFVELVKSWNLPDWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDDVEEKAKAKDLHPKLLGGMFFFFALGATGGITSLLTSDKPIFESPHAV

Query:  TGFIGLALLTVQTLLPSLFEDNPGLRNVHGILGSGIMTLFLIHAALGLQLGLGY
        TGFIGLALLTVQTLLPSLFEDNPGLRNVHGILGSGIMTLFLIHAALGLQLGLGY
Subjt:  TGFIGLALLTVQTLLPSLFEDNPGLRNVHGILGSGIMTLFLIHAALGLQLGLGY

A0A6J1EDG7 uncharacterized protein LOC1114331892.45e-14983.98Show/hide
Query:  MATLTGGAS---YMCLSKVPPPLPLPSTSSFPISLPNLPKFPRNSCFSLPSPVTGTTKARLNPS-SQDDDFGESVEMKEASETLLYSLAPFPLLFVAALP
        MATLTG +S   YMCL+KV PP    S++SFPI LPNL K PR+S FSL SPVTG  K R +PS ++D +FG+ VEM+E  ET LYSLAPFPLLFVAALP
Subjt:  MATLTGGAS---YMCLSKVPPPLPLPSTSSFPISLPNLPKFPRNSCFSLPSPVTGTTKARLNPS-SQDDDFGESVEMKEASETLLYSLAPFPLLFVAALP

Query:  GAGTVRSLFGPFVELVKSWNLPDWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDDVEEKAKAKDLHPKLLGGMFFFFALGATGGITSLLTSDKPIFES
        GAGTVRSLFGPFVELVKSWNLP+WLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYS+DVEEKA AKDLHPKLLGGMFFFFALGATGGITSLLTSDKPIFES
Subjt:  GAGTVRSLFGPFVELVKSWNLPDWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDDVEEKAKAKDLHPKLLGGMFFFFALGATGGITSLLTSDKPIFES

Query:  PHAVTGFIGLALLTVQTLLPSLFEDNPGLRNVHGILGSGIMTLFLIHAALGLQLGL
        PHAVTGFIGLALLT+Q+LLPSLFEDNPGLRN+HGILGSGIMTLFLIHAALGLQLGL
Subjt:  PHAVTGFIGLALLTVQTLLPSLFEDNPGLRNVHGILGSGIMTLFLIHAALGLQLGL

A0A6J1IMY7 uncharacterized protein LOC1114779001.16e-14783.59Show/hide
Query:  MATLTGGAS---YMCLSKVPPPLPLPSTSSFPISLPNLPKFPRNSCFSLPSPVTGTTKARLNPS-SQDDDFGESVEMKEASETLLYSLAPFPLLFVAALP
        MATLTG +S   YMCL+KV PP    S++SFPI LP L K PR+S FSL SPVTG  K R NPS ++D +FG+ VE +E  ET LYSLAPFPLLFVAALP
Subjt:  MATLTGGAS---YMCLSKVPPPLPLPSTSSFPISLPNLPKFPRNSCFSLPSPVTGTTKARLNPS-SQDDDFGESVEMKEASETLLYSLAPFPLLFVAALP

Query:  GAGTVRSLFGPFVELVKSWNLPDWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDDVEEKAKAKDLHPKLLGGMFFFFALGATGGITSLLTSDKPIFES
        GAGTVRSLFGPFVELVKSWNLP+WLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYS+DVEEKA AKDLHPKLLGGMFFFFALGATGGITSLLTSDKPIFES
Subjt:  GAGTVRSLFGPFVELVKSWNLPDWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDDVEEKAKAKDLHPKLLGGMFFFFALGATGGITSLLTSDKPIFES

Query:  PHAVTGFIGLALLTVQTLLPSLFEDNPGLRNVHGILGSGIMTLFLIHAALGLQLGL
        PHAVTGFIGLALLT+Q+LLPSLFEDNPGLRN+HGILGSGIMTLFLIHAALGLQLGL
Subjt:  PHAVTGFIGLALLTVQTLLPSLFEDNPGLRNVHGILGSGIMTLFLIHAALGLQLGL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G36885.1 unknown protein8.9e-8664.62Show/hide
Query:  MATLTGGASYMCLSKVPPPLPLPST----SSFPIS-LPNLPKFPRNSCFSLPS-PVTGTTKARLNPSSQDDDFGESVEMKEASETLLYSLAPFPLLFVAA
        MA +TG ++    + + P   LPS+    S+  +S   N+  FP  S F     P+T    + ++   + +  G+  +  E  ETL+ S++P PLL VA+
Subjt:  MATLTGGASYMCLSKVPPPLPLPST----SSFPIS-LPNLPKFPRNSCFSLPS-PVTGTTKARLNPSSQDDDFGESVEMKEASETLLYSLAPFPLLFVAA

Query:  LPGAGTVRSLFGPFVELVKSWNLPDWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDDVEEKAKAKDLHPKLLGGMFFFFALGATGGITSLLTSDKPIF
        LPGA TVRS+FGP VE+VKS NLPDWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDD+EEKAKAKDLHPKLL GMFFFFALGATGG+ SLLTSDKPIF
Subjt:  LPGAGTVRSLFGPFVELVKSWNLPDWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDDVEEKAKAKDLHPKLLGGMFFFFALGATGGITSLLTSDKPIF

Query:  ESPHAVTGFIGLALLTVQTLLPSLFEDNPGLRNVHGILGSGIMTLFLIHAALGLQLGLGY
        ESPHAVTG IGL LLTVQT+LPSLF++ P LRNVHGILGSGIM LFL+HAA GLQLGL +
Subjt:  ESPHAVTGFIGLALLTVQTLLPSLFEDNPGLRNVHGILGSGIMTLFLIHAALGLQLGLGY

AT2G36885.2 unknown protein4.9e-8464.62Show/hide
Query:  MATLTGGASYMCLSKVPPPLPLPST----SSFPIS-LPNLPKFPRNSCFSLPS-PVTGTTKARLNPSSQDDDFGESVEMKEASETLLYSLAPFPLLFVAA
        MA +TG ++    + + P   LPS+    S+  +S   N+  FP  S F     P+T    + ++   + +  G+  +  E  ETL+ S++P PLL VA+
Subjt:  MATLTGGASYMCLSKVPPPLPLPST----SSFPIS-LPNLPKFPRNSCFSLPS-PVTGTTKARLNPSSQDDDFGESVEMKEASETLLYSLAPFPLLFVAA

Query:  LPGAGTVRSLFGPFVELVKSWNLPDWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDDVEEKAKAKDLHPKLLGGMFFFFALGATGGITSLLTSDKPIF
        LPGA TVRS+FGP VE+VKS NLPDWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDD+EEKAKAKDLHPKLL GMFFFFALGATGG+ SLLTSDKPIF
Subjt:  LPGAGTVRSLFGPFVELVKSWNLPDWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDDVEEKAKAKDLHPKLLGGMFFFFALGATGGITSLLTSDKPIF

Query:  ESPHAVTGFIGLALLTVQTLLPSLFEDNPGLRNVHGILGSGIMTLFLIHAALGLQLGLGY
        ESPHAVTG IGL LLTVQT+LPSLF+  P LRNVHGILGSGIM LFL+HAA GLQLGL +
Subjt:  ESPHAVTGFIGLALLTVQTLLPSLFEDNPGLRNVHGILGSGIMTLFLIHAALGLQLGLGY


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTACGCTTACCGGAGGAGCTTCTTATATGTGTTTATCCAAGGTACCACCACCATTGCCATTACCTTCCACTTCTTCATTCCCCATTTCTTTGCCAAATCTTCCCAA
ATTTCCCCGCAATTCCTGCTTCTCTTTGCCTTCCCCTGTCACTGGAACAACAAAAGCTCGCCTCAATCCGTCTTCTCAGGACGACGACTTCGGCGAATCGGTGGAAATGA
AGGAGGCGAGCGAGACGCTTTTGTACTCTCTCGCGCCTTTTCCTTTACTGTTCGTTGCTGCTCTTCCTGGAGCTGGAACTGTGAGGTCTCTCTTTGGCCCTTTCGTTGAG
CTTGTTAAATCTTGGAATCTTCCTGATTGGCTAGTGCATTGGGGTCATCCAGGCAACATGGCGGTAGTGCTCTTCGCTATGGGTGGTTATGGAACATACCTAGGCTTTAG
AATCCGTTACTCTGATGATGTGGAGGAGAAGGCCAAAGCCAAGGACTTGCATCCAAAGCTTCTAGGTGGAATGTTTTTCTTTTTTGCTCTTGGAGCAACGGGTGGAATCA
CTTCTCTACTTACATCAGACAAACCTATATTTGAGAGTCCACATGCTGTAACGGGGTTCATTGGCCTCGCGCTCTTGACTGTACAAACGCTTCTGCCCTCACTTTTTGAG
GATAATCCTGGACTGAGGAATGTTCATGGGATTTTGGGTAGTGGAATCATGACACTATTTCTCATCCATGCTGCACTTGGACTTCAACTTGGACTCGGTTACTAA
mRNA sequenceShow/hide mRNA sequence
GTGGGGCCATATCGTAGAATGTTCAGTAGGATCAGCGTGTCTCTTTCTGGACCTTTCCCTCCATTTTCTGCACTTTTTTCTTCCTCTATCAAATCATTTTCTCTCTCACG
ATTCAAAACCAATCCCTGTCTCTGTGGAAGCCACACTCCTCACTCTCTGAACTCCAGAACTTCCTTATCGCTCTCCATCCATGGCTACGCTTACCGGAGGAGCTTCTTAT
ATGTGTTTATCCAAGGTACCACCACCATTGCCATTACCTTCCACTTCTTCATTCCCCATTTCTTTGCCAAATCTTCCCAAATTTCCCCGCAATTCCTGCTTCTCTTTGCC
TTCCCCTGTCACTGGAACAACAAAAGCTCGCCTCAATCCGTCTTCTCAGGACGACGACTTCGGCGAATCGGTGGAAATGAAGGAGGCGAGCGAGACGCTTTTGTACTCTC
TCGCGCCTTTTCCTTTACTGTTCGTTGCTGCTCTTCCTGGAGCTGGAACTGTGAGGTCTCTCTTTGGCCCTTTCGTTGAGCTTGTTAAATCTTGGAATCTTCCTGATTGG
CTAGTGCATTGGGGTCATCCAGGCAACATGGCGGTAGTGCTCTTCGCTATGGGTGGTTATGGAACATACCTAGGCTTTAGAATCCGTTACTCTGATGATGTGGAGGAGAA
GGCCAAAGCCAAGGACTTGCATCCAAAGCTTCTAGGTGGAATGTTTTTCTTTTTTGCTCTTGGAGCAACGGGTGGAATCACTTCTCTACTTACATCAGACAAACCTATAT
TTGAGAGTCCACATGCTGTAACGGGGTTCATTGGCCTCGCGCTCTTGACTGTACAAACGCTTCTGCCCTCACTTTTTGAGGATAATCCTGGACTGAGGAATGTTCATGGG
ATTTTGGGTAGTGGAATCATGACACTATTTCTCATCCATGCTGCACTTGGACTTCAACTTGGACTCGGTTACTAAACCATAGCTTATCCATGTGTAGCACCATATATAGG
TTATAGCTCTGAATTTTTCAGTATGAAAACTTCTTTCGTCACTCGACCTACGATTGTTGGAAAGAATGACGAAAACTTCATATTACTGCGTTTTCTTCAGATATACATGA
AAAGTGGATATGGCTGCATTACTTGGAAATAATAAGAGCAATGATTTGAAGTAATAACCTTTCCTGTTGGCAGTTTATTCATTAT
Protein sequenceShow/hide protein sequence
MATLTGGASYMCLSKVPPPLPLPSTSSFPISLPNLPKFPRNSCFSLPSPVTGTTKARLNPSSQDDDFGESVEMKEASETLLYSLAPFPLLFVAALPGAGTVRSLFGPFVE
LVKSWNLPDWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDDVEEKAKAKDLHPKLLGGMFFFFALGATGGITSLLTSDKPIFESPHAVTGFIGLALLTVQTLLPSLFE
DNPGLRNVHGILGSGIMTLFLIHAALGLQLGLGY