; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS006334 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS006334
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionRieske domain-containing protein
Genome locationscaffold123_1:187162..189492
RNA-Seq ExpressionMS006334
SyntenyMS006334
Gene Ontology termsGO:0042128 - nitrate assimilation (biological process)
GO:0031967 - organelle envelope (cellular component)
GO:0008942 - nitrite reductase [NAD(P)H] activity (molecular function)
GO:0046872 - metal ion binding (molecular function)
GO:0051213 - dioxygenase activity (molecular function)
GO:0051537 - 2 iron, 2 sulfur cluster binding (molecular function)
InterPro domainsIPR012748 - Rieske-like [2Fe-2S] domain, NirD-type
IPR017941 - Rieske [2Fe-2S] iron-sulphur domain
IPR036922 - Rieske [2Fe-2S] iron-sulphur domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004141552.1 uncharacterized protein LOC101206141 [Cucumis sativus]4.7e-13087.2Show/hide
Query:  MAVNPTNLTSHFAPTFRRSHPISAPWTA----LKPSFRRSLFASLAGSYRCFSPPAARKIACKASEISVAEEPSAPGDSGNWVPVVPLAALPRGERRVII
        MA+N TN TSHFA TFRR HPISAP TA    LKP+  RSLFAS          P ARKI+CKASEISVAEE SA   SGNWVPVVPL+ALPRGERRVII
Subjt:  MAVNPTNLTSHFAPTFRRSHPISAPWTA----LKPSFRRSLFASLAGSYRCFSPPAARKIACKASEISVAEEPSAPGDSGNWVPVVPLAALPRGERRVII

Query:  QGGETILLLWYKDVIFAIENRSPAEGAYTEGLLNAKLTKDGCIVCPTTDSTFDLQTGEIKEWYPKNPVLRVLTPALRKLFIYPVKIDQDNIYINMGGNVA
        QGGETILLLWYKD IFAIENRSPAEGAYTEGLLNAKLTKDGCIVCPTTDSTFDLQTGEIKEWYPKNPVLRVLTPALRKLFIY VK D++NIYINM GNV 
Subjt:  QGGETILLLWYKDVIFAIENRSPAEGAYTEGLLNAKLTKDGCIVCPTTDSTFDLQTGEIKEWYPKNPVLRVLTPALRKLFIYPVKIDQDNIYINMGGNVA

Query:  ISDSSAEIVFSGKAQPGVTATDVNVDEVRMVVDEDLEGFGFTGKNEVINGKAAVIGFLLLLDFELLTGKGLLKGTGFLDFIYSVSDAFK
        ISDSSAEIVFSGKAQPGVTATDVNVDEVRMVVDEDLEGFGFTGKNEVINGKAAVIGFLLLLDFELLTGKGLLKGTGFLDFIYSVSDAFK
Subjt:  ISDSSAEIVFSGKAQPGVTATDVNVDEVRMVVDEDLEGFGFTGKNEVINGKAAVIGFLLLLDFELLTGKGLLKGTGFLDFIYSVSDAFK

XP_008459624.1 PREDICTED: uncharacterized protein LOC103498694 [Cucumis melo]4.4e-12885.47Show/hide
Query:  MAVNPTNLTSHFAPTFRRSHPISAPWTA----LKPSFRRSLFASLAGSYRCFSPPAARKIACKASEISVAEEPSAPGDSGNWVPVVPLAALPRGERRVII
        MA+N TN TSHF  TFRR HPISAP TA    LKP+  RSL AS          P ARKI+CKASEISVAEE SA   SGNWVPVVPL+ALPRGERRVII
Subjt:  MAVNPTNLTSHFAPTFRRSHPISAPWTA----LKPSFRRSLFASLAGSYRCFSPPAARKIACKASEISVAEEPSAPGDSGNWVPVVPLAALPRGERRVII

Query:  QGGETILLLWYKDVIFAIENRSPAEGAYTEGLLNAKLTKDGCIVCPTTDSTFDLQTGEIKEWYPKNPVLRVLTPALRKLFIYPVKIDQDNIYINMGGNVA
        QGGETILLLWYKD IFAIENRSPAEGAYTEGLLNAKLTKDGCIVCPTTDSTFDLQTG+IKEWYP NPVLRVLTPALRKLFIYPVK D++NIYINM GNV 
Subjt:  QGGETILLLWYKDVIFAIENRSPAEGAYTEGLLNAKLTKDGCIVCPTTDSTFDLQTGEIKEWYPKNPVLRVLTPALRKLFIYPVKIDQDNIYINMGGNVA

Query:  ISDSSAEIVFSGKAQPGVTATDVNVDEVRMVVDEDLEGFGFTGKNEVINGKAAVIGFLLLLDFELLTGKGLLKGTGFLDFIYSVSDAFK
        ISD+SAEIVFSGKAQPGVTATDVNVDEV+MVVDEDLEGFGFTGKNEVINGKAAVIGFLLLLDFELLTGKGLLKGTGFLDFIYSVSDAFK
Subjt:  ISDSSAEIVFSGKAQPGVTATDVNVDEVRMVVDEDLEGFGFTGKNEVINGKAAVIGFLLLLDFELLTGKGLLKGTGFLDFIYSVSDAFK

XP_022141822.1 uncharacterized protein LOC111012095 [Momordica charantia]1.7e-159100Show/hide
Query:  MAVNPTNLTSHFAPTFRRSHPISAPWTALKPSFRRSLFASLAGSYRCFSPPAARKIACKASEISVAEEPSAPGDSGNWVPVVPLAALPRGERRVIIQGGE
        MAVNPTNLTSHFAPTFRRSHPISAPWTALKPSFRRSLFASLAGSYRCFSPPAARKIACKASEISVAEEPSAPGDSGNWVPVVPLAALPRGERRVIIQGGE
Subjt:  MAVNPTNLTSHFAPTFRRSHPISAPWTALKPSFRRSLFASLAGSYRCFSPPAARKIACKASEISVAEEPSAPGDSGNWVPVVPLAALPRGERRVIIQGGE

Query:  TILLLWYKDVIFAIENRSPAEGAYTEGLLNAKLTKDGCIVCPTTDSTFDLQTGEIKEWYPKNPVLRVLTPALRKLFIYPVKIDQDNIYINMGGNVAISDS
        TILLLWYKDVIFAIENRSPAEGAYTEGLLNAKLTKDGCIVCPTTDSTFDLQTGEIKEWYPKNPVLRVLTPALRKLFIYPVKIDQDNIYINMGGNVAISDS
Subjt:  TILLLWYKDVIFAIENRSPAEGAYTEGLLNAKLTKDGCIVCPTTDSTFDLQTGEIKEWYPKNPVLRVLTPALRKLFIYPVKIDQDNIYINMGGNVAISDS

Query:  SAEIVFSGKAQPGVTATDVNVDEVRMVVDEDLEGFGFTGKNEVINGKAAVIGFLLLLDFELLTGKGLLKGTGFLDFIYSVSDAFK
        SAEIVFSGKAQPGVTATDVNVDEVRMVVDEDLEGFGFTGKNEVINGKAAVIGFLLLLDFELLTGKGLLKGTGFLDFIYSVSDAFK
Subjt:  SAEIVFSGKAQPGVTATDVNVDEVRMVVDEDLEGFGFTGKNEVINGKAAVIGFLLLLDFELLTGKGLLKGTGFLDFIYSVSDAFK

XP_022925511.1 uncharacterized protein LOC111432790 [Cucurbita moschata]9.9e-12884.08Show/hide
Query:  MAVNPTNLTSHFAPTFRRSHPISAPWTA----LKPSFRRSLFASLAGSYRCFSPPAARKIACKASEISVAEEPSAPGDSGNWVPVVPLAALPRGERRVII
        MAVN TNLTSHFA  FRR+HPIS+P TA    LKP+ RRSLFAS         P   RKI CKASE+SVAEE SA   SGNWVPVVPL+ALP+GERRVII
Subjt:  MAVNPTNLTSHFAPTFRRSHPISAPWTA----LKPSFRRSLFASLAGSYRCFSPPAARKIACKASEISVAEEPSAPGDSGNWVPVVPLAALPRGERRVII

Query:  QGGETILLLWYKDVIFAIENRSPAEGAYTEGLLNAKLTKDGCIVCPTTDSTFDLQTGEIKEWYPKNPVLRVLTPALRKLFIYPVKIDQDNIYINMGGNVA
        QGGET+LLLWYK+ IFAIENRSPAEGAYTEGLLNAKLTKDGCIVCPTTDSTFDLQTG+IKEWYP NPVLRVLTPALRKLF+YPVK D++NIYINM GNV 
Subjt:  QGGETILLLWYKDVIFAIENRSPAEGAYTEGLLNAKLTKDGCIVCPTTDSTFDLQTGEIKEWYPKNPVLRVLTPALRKLFIYPVKIDQDNIYINMGGNVA

Query:  ISDSSAEIVFSGKAQPGVTATDVNVDEVRMVVDEDLEGFGFTGKNEVINGKAAVIGFLLLLDFELLTGKGLLKGTGFLDFIYSVSDAFK
        ISDSSAEIVFSGKAQPGVTATDVNVDEV+MVVDE+LEGFGFTGKNEVINGKAAVIGFLLLLDFELLTGKGLLKGTGFLDFIYS SDAFK
Subjt:  ISDSSAEIVFSGKAQPGVTATDVNVDEVRMVVDEDLEGFGFTGKNEVINGKAAVIGFLLLLDFELLTGKGLLKGTGFLDFIYSVSDAFK

XP_038891275.1 uncharacterized protein LOC120080616 [Benincasa hispida]1.9e-13488.58Show/hide
Query:  MAVNPTNLTSHFAPTFRRSHPISAPWTA----LKPSFRRSLFASLAGSYRCFSPPAARKIACKASEISVAEEPSAPGDSGNWVPVVPLAALPRGERRVII
        MAVN TNLTSHFAPTFRR+HPISAP TA    LKP+  RSLFAS         PPA+RKI+CKASEISVAEE SA   SGNWVPVVPL+ALPRGERRVII
Subjt:  MAVNPTNLTSHFAPTFRRSHPISAPWTA----LKPSFRRSLFASLAGSYRCFSPPAARKIACKASEISVAEEPSAPGDSGNWVPVVPLAALPRGERRVII

Query:  QGGETILLLWYKDVIFAIENRSPAEGAYTEGLLNAKLTKDGCIVCPTTDSTFDLQTGEIKEWYPKNPVLRVLTPALRKLFIYPVKIDQDNIYINMGGNVA
        QGGETILLLWYKD IFAIENRSPAEGAYTEGLLNAKLTKDGCIVCPTTDSTFDLQTG+IKEWYPKNPVLRVLTPALRKLFIYPVK D+DNIYINM GNV 
Subjt:  QGGETILLLWYKDVIFAIENRSPAEGAYTEGLLNAKLTKDGCIVCPTTDSTFDLQTGEIKEWYPKNPVLRVLTPALRKLFIYPVKIDQDNIYINMGGNVA

Query:  ISDSSAEIVFSGKAQPGVTATDVNVDEVRMVVDEDLEGFGFTGKNEVINGKAAVIGFLLLLDFELLTGKGLLKGTGFLDFIYSVSDAFK
        ISDSSAEIVFSGKAQPGVTATDVNVDEV+MVVDEDLEGFGFTGKNEVINGKAAVIGFLLLLDFELLTGKGLLKGTGFLDFIYSVSDAFK
Subjt:  ISDSSAEIVFSGKAQPGVTATDVNVDEVRMVVDEDLEGFGFTGKNEVINGKAAVIGFLLLLDFELLTGKGLLKGTGFLDFIYSVSDAFK

TrEMBL top hitse value%identityAlignment
A0A0A0KXX5 Rieske domain-containing protein2.3e-13087.2Show/hide
Query:  MAVNPTNLTSHFAPTFRRSHPISAPWTA----LKPSFRRSLFASLAGSYRCFSPPAARKIACKASEISVAEEPSAPGDSGNWVPVVPLAALPRGERRVII
        MA+N TN TSHFA TFRR HPISAP TA    LKP+  RSLFAS          P ARKI+CKASEISVAEE SA   SGNWVPVVPL+ALPRGERRVII
Subjt:  MAVNPTNLTSHFAPTFRRSHPISAPWTA----LKPSFRRSLFASLAGSYRCFSPPAARKIACKASEISVAEEPSAPGDSGNWVPVVPLAALPRGERRVII

Query:  QGGETILLLWYKDVIFAIENRSPAEGAYTEGLLNAKLTKDGCIVCPTTDSTFDLQTGEIKEWYPKNPVLRVLTPALRKLFIYPVKIDQDNIYINMGGNVA
        QGGETILLLWYKD IFAIENRSPAEGAYTEGLLNAKLTKDGCIVCPTTDSTFDLQTGEIKEWYPKNPVLRVLTPALRKLFIY VK D++NIYINM GNV 
Subjt:  QGGETILLLWYKDVIFAIENRSPAEGAYTEGLLNAKLTKDGCIVCPTTDSTFDLQTGEIKEWYPKNPVLRVLTPALRKLFIYPVKIDQDNIYINMGGNVA

Query:  ISDSSAEIVFSGKAQPGVTATDVNVDEVRMVVDEDLEGFGFTGKNEVINGKAAVIGFLLLLDFELLTGKGLLKGTGFLDFIYSVSDAFK
        ISDSSAEIVFSGKAQPGVTATDVNVDEVRMVVDEDLEGFGFTGKNEVINGKAAVIGFLLLLDFELLTGKGLLKGTGFLDFIYSVSDAFK
Subjt:  ISDSSAEIVFSGKAQPGVTATDVNVDEVRMVVDEDLEGFGFTGKNEVINGKAAVIGFLLLLDFELLTGKGLLKGTGFLDFIYSVSDAFK

A0A1S3CBV1 uncharacterized protein LOC1034986942.2e-12885.47Show/hide
Query:  MAVNPTNLTSHFAPTFRRSHPISAPWTA----LKPSFRRSLFASLAGSYRCFSPPAARKIACKASEISVAEEPSAPGDSGNWVPVVPLAALPRGERRVII
        MA+N TN TSHF  TFRR HPISAP TA    LKP+  RSL AS          P ARKI+CKASEISVAEE SA   SGNWVPVVPL+ALPRGERRVII
Subjt:  MAVNPTNLTSHFAPTFRRSHPISAPWTA----LKPSFRRSLFASLAGSYRCFSPPAARKIACKASEISVAEEPSAPGDSGNWVPVVPLAALPRGERRVII

Query:  QGGETILLLWYKDVIFAIENRSPAEGAYTEGLLNAKLTKDGCIVCPTTDSTFDLQTGEIKEWYPKNPVLRVLTPALRKLFIYPVKIDQDNIYINMGGNVA
        QGGETILLLWYKD IFAIENRSPAEGAYTEGLLNAKLTKDGCIVCPTTDSTFDLQTG+IKEWYP NPVLRVLTPALRKLFIYPVK D++NIYINM GNV 
Subjt:  QGGETILLLWYKDVIFAIENRSPAEGAYTEGLLNAKLTKDGCIVCPTTDSTFDLQTGEIKEWYPKNPVLRVLTPALRKLFIYPVKIDQDNIYINMGGNVA

Query:  ISDSSAEIVFSGKAQPGVTATDVNVDEVRMVVDEDLEGFGFTGKNEVINGKAAVIGFLLLLDFELLTGKGLLKGTGFLDFIYSVSDAFK
        ISD+SAEIVFSGKAQPGVTATDVNVDEV+MVVDEDLEGFGFTGKNEVINGKAAVIGFLLLLDFELLTGKGLLKGTGFLDFIYSVSDAFK
Subjt:  ISDSSAEIVFSGKAQPGVTATDVNVDEVRMVVDEDLEGFGFTGKNEVINGKAAVIGFLLLLDFELLTGKGLLKGTGFLDFIYSVSDAFK

A0A5D3BMM3 Rieske domain-containing protein2.2e-12885.47Show/hide
Query:  MAVNPTNLTSHFAPTFRRSHPISAPWTA----LKPSFRRSLFASLAGSYRCFSPPAARKIACKASEISVAEEPSAPGDSGNWVPVVPLAALPRGERRVII
        MA+N TN TSHF  TFRR HPISAP TA    LKP+  RSL AS          P ARKI+CKASEISVAEE SA   SGNWVPVVPL+ALPRGERRVII
Subjt:  MAVNPTNLTSHFAPTFRRSHPISAPWTA----LKPSFRRSLFASLAGSYRCFSPPAARKIACKASEISVAEEPSAPGDSGNWVPVVPLAALPRGERRVII

Query:  QGGETILLLWYKDVIFAIENRSPAEGAYTEGLLNAKLTKDGCIVCPTTDSTFDLQTGEIKEWYPKNPVLRVLTPALRKLFIYPVKIDQDNIYINMGGNVA
        QGGETILLLWYKD IFAIENRSPAEGAYTEGLLNAKLTKDGCIVCPTTDSTFDLQTG+IKEWYP NPVLRVLTPALRKLFIYPVK D++NIYINM GNV 
Subjt:  QGGETILLLWYKDVIFAIENRSPAEGAYTEGLLNAKLTKDGCIVCPTTDSTFDLQTGEIKEWYPKNPVLRVLTPALRKLFIYPVKIDQDNIYINMGGNVA

Query:  ISDSSAEIVFSGKAQPGVTATDVNVDEVRMVVDEDLEGFGFTGKNEVINGKAAVIGFLLLLDFELLTGKGLLKGTGFLDFIYSVSDAFK
        ISD+SAEIVFSGKAQPGVTATDVNVDEV+MVVDEDLEGFGFTGKNEVINGKAAVIGFLLLLDFELLTGKGLLKGTGFLDFIYSVSDAFK
Subjt:  ISDSSAEIVFSGKAQPGVTATDVNVDEVRMVVDEDLEGFGFTGKNEVINGKAAVIGFLLLLDFELLTGKGLLKGTGFLDFIYSVSDAFK

A0A6J1CKX8 uncharacterized protein LOC1110120958.1e-160100Show/hide
Query:  MAVNPTNLTSHFAPTFRRSHPISAPWTALKPSFRRSLFASLAGSYRCFSPPAARKIACKASEISVAEEPSAPGDSGNWVPVVPLAALPRGERRVIIQGGE
        MAVNPTNLTSHFAPTFRRSHPISAPWTALKPSFRRSLFASLAGSYRCFSPPAARKIACKASEISVAEEPSAPGDSGNWVPVVPLAALPRGERRVIIQGGE
Subjt:  MAVNPTNLTSHFAPTFRRSHPISAPWTALKPSFRRSLFASLAGSYRCFSPPAARKIACKASEISVAEEPSAPGDSGNWVPVVPLAALPRGERRVIIQGGE

Query:  TILLLWYKDVIFAIENRSPAEGAYTEGLLNAKLTKDGCIVCPTTDSTFDLQTGEIKEWYPKNPVLRVLTPALRKLFIYPVKIDQDNIYINMGGNVAISDS
        TILLLWYKDVIFAIENRSPAEGAYTEGLLNAKLTKDGCIVCPTTDSTFDLQTGEIKEWYPKNPVLRVLTPALRKLFIYPVKIDQDNIYINMGGNVAISDS
Subjt:  TILLLWYKDVIFAIENRSPAEGAYTEGLLNAKLTKDGCIVCPTTDSTFDLQTGEIKEWYPKNPVLRVLTPALRKLFIYPVKIDQDNIYINMGGNVAISDS

Query:  SAEIVFSGKAQPGVTATDVNVDEVRMVVDEDLEGFGFTGKNEVINGKAAVIGFLLLLDFELLTGKGLLKGTGFLDFIYSVSDAFK
        SAEIVFSGKAQPGVTATDVNVDEVRMVVDEDLEGFGFTGKNEVINGKAAVIGFLLLLDFELLTGKGLLKGTGFLDFIYSVSDAFK
Subjt:  SAEIVFSGKAQPGVTATDVNVDEVRMVVDEDLEGFGFTGKNEVINGKAAVIGFLLLLDFELLTGKGLLKGTGFLDFIYSVSDAFK

A0A6J1ECE7 uncharacterized protein LOC1114327904.8e-12884.08Show/hide
Query:  MAVNPTNLTSHFAPTFRRSHPISAPWTA----LKPSFRRSLFASLAGSYRCFSPPAARKIACKASEISVAEEPSAPGDSGNWVPVVPLAALPRGERRVII
        MAVN TNLTSHFA  FRR+HPIS+P TA    LKP+ RRSLFAS         P   RKI CKASE+SVAEE SA   SGNWVPVVPL+ALP+GERRVII
Subjt:  MAVNPTNLTSHFAPTFRRSHPISAPWTA----LKPSFRRSLFASLAGSYRCFSPPAARKIACKASEISVAEEPSAPGDSGNWVPVVPLAALPRGERRVII

Query:  QGGETILLLWYKDVIFAIENRSPAEGAYTEGLLNAKLTKDGCIVCPTTDSTFDLQTGEIKEWYPKNPVLRVLTPALRKLFIYPVKIDQDNIYINMGGNVA
        QGGET+LLLWYK+ IFAIENRSPAEGAYTEGLLNAKLTKDGCIVCPTTDSTFDLQTG+IKEWYP NPVLRVLTPALRKLF+YPVK D++NIYINM GNV 
Subjt:  QGGETILLLWYKDVIFAIENRSPAEGAYTEGLLNAKLTKDGCIVCPTTDSTFDLQTGEIKEWYPKNPVLRVLTPALRKLFIYPVKIDQDNIYINMGGNVA

Query:  ISDSSAEIVFSGKAQPGVTATDVNVDEVRMVVDEDLEGFGFTGKNEVINGKAAVIGFLLLLDFELLTGKGLLKGTGFLDFIYSVSDAFK
        ISDSSAEIVFSGKAQPGVTATDVNVDEV+MVVDE+LEGFGFTGKNEVINGKAAVIGFLLLLDFELLTGKGLLKGTGFLDFIYS SDAFK
Subjt:  ISDSSAEIVFSGKAQPGVTATDVNVDEVRMVVDEDLEGFGFTGKNEVINGKAAVIGFLLLLDFELLTGKGLLKGTGFLDFIYSVSDAFK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G71500.1 Rieske (2Fe-2S) domain-containing protein4.9e-10177.39Show/hide
Query:  IACKASEISVAEEPSAPGDSGNWVPVVPLAALPRGERRVIIQGGETILLLWYKDVIFAIENRSPAEGAYTEGLLNAKLTKDGCIVCPTTDSTFDLQTGEI
        + C+A+E+S +   S PG   NWVPVVPL+ALP+GERRV+IQ  ETILLLWYK+ +FAIENRSPAEGAY+EGLLNA+LT+DGCIVCP+TDSTFDL+TGEI
Subjt:  IACKASEISVAEEPSAPGDSGNWVPVVPLAALPRGERRVIIQGGETILLLWYKDVIFAIENRSPAEGAYTEGLLNAKLTKDGCIVCPTTDSTFDLQTGEI

Query:  KEWYPKNPVLRVLTPALRKLFIYPVKIDQDNIYINMGGNVAISDSSAEIVFSGKAQPGVTATDVNVDEVRMVVDEDLEGFGFTGKNEVINGKAAVIGFLL
        +EWYPKNPVLRVLTPALRKLF+YPVK D++NIYI++  +   ++++AEIVFSGKAQPG+TAT+VNVDEVRM+VDE  EGFGFT KNEVINGKAAVIGFLL
Subjt:  KEWYPKNPVLRVLTPALRKLFIYPVKIDQDNIYINMGGNVAISDSSAEIVFSGKAQPGVTATDVNVDEVRMVVDEDLEGFGFTGKNEVINGKAAVIGFLL

Query:  LLDFELLTGKGLLKGTGFLDFIYSVSDAFK
        LLDFELLTGKGLLKGTGFLDF+YS SDAFK
Subjt:  LLDFELLTGKGLLKGTGFLDFIYSVSDAFK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCGTCAATCCCACAAATCTCACTTCCCATTTTGCCCCCACATTCCGCCGGAGCCACCCCATCTCCGCCCCATGGACCGCCCTGAAACCCTCTTTCCGCCGCTCCCT
TTTCGCCTCTCTCGCTGGGAGTTACCGTTGTTTCTCTCCGCCTGCTGCCCGCAAAATCGCGTGCAAGGCATCCGAGATCTCGGTGGCGGAGGAACCGTCGGCACCGGGGG
ACAGTGGTAACTGGGTGCCGGTGGTGCCGCTGGCGGCCCTCCCGAGAGGGGAGCGGCGCGTGATTATTCAGGGCGGCGAGACGATTCTGCTTCTTTGGTATAAGGATGTG
ATTTTCGCTATTGAGAATAGATCTCCTGCTGAAGGTGCTTACACGGAAGGCCTCCTCAATGCCAAACTCACTAAGGATGGGTGCATTGTCTGTCCAACGACAGATAGCAC
ATTCGACCTCCAAACAGGAGAGATCAAGGAATGGTATCCAAAGAACCCAGTTCTCAGAGTCCTCACCCCAGCCTTGAGGAAGCTTTTTATATACCCGGTAAAAATTGATC
AAGACAACATTTACATCAACATGGGAGGAAATGTAGCAATATCAGATTCATCAGCCGAGATCGTTTTCAGTGGAAAGGCTCAACCCGGTGTCACTGCAACCGATGTCAAT
GTCGACGAGGTGAGAATGGTGGTTGATGAAGATCTTGAAGGGTTTGGCTTCACTGGGAAAAATGAAGTGATAAATGGAAAAGCAGCAGTGATTGGGTTCTTGTTGCTGTT
GGATTTTGAGCTCTTAACTGGCAAGGGTCTTCTCAAGGGAACTGGCTTCTTGGACTTCATTTATTCTGTTTCAGATGCTTTCAAA
mRNA sequenceShow/hide mRNA sequence
ATGGCCGTCAATCCCACAAATCTCACTTCCCATTTTGCCCCCACATTCCGCCGGAGCCACCCCATCTCCGCCCCATGGACCGCCCTGAAACCCTCTTTCCGCCGCTCCCT
TTTCGCCTCTCTCGCTGGGAGTTACCGTTGTTTCTCTCCGCCTGCTGCCCGCAAAATCGCGTGCAAGGCATCCGAGATCTCGGTGGCGGAGGAACCGTCGGCACCGGGGG
ACAGTGGTAACTGGGTGCCGGTGGTGCCGCTGGCGGCCCTCCCGAGAGGGGAGCGGCGCGTGATTATTCAGGGCGGCGAGACGATTCTGCTTCTTTGGTATAAGGATGTG
ATTTTCGCTATTGAGAATAGATCTCCTGCTGAAGGTGCTTACACGGAAGGCCTCCTCAATGCCAAACTCACTAAGGATGGGTGCATTGTCTGTCCAACGACAGATAGCAC
ATTCGACCTCCAAACAGGAGAGATCAAGGAATGGTATCCAAAGAACCCAGTTCTCAGAGTCCTCACCCCAGCCTTGAGGAAGCTTTTTATATACCCGGTAAAAATTGATC
AAGACAACATTTACATCAACATGGGAGGAAATGTAGCAATATCAGATTCATCAGCCGAGATCGTTTTCAGTGGAAAGGCTCAACCCGGTGTCACTGCAACCGATGTCAAT
GTCGACGAGGTGAGAATGGTGGTTGATGAAGATCTTGAAGGGTTTGGCTTCACTGGGAAAAATGAAGTGATAAATGGAAAAGCAGCAGTGATTGGGTTCTTGTTGCTGTT
GGATTTTGAGCTCTTAACTGGCAAGGGTCTTCTCAAGGGAACTGGCTTCTTGGACTTCATTTATTCTGTTTCAGATGCTTTCAAA
Protein sequenceShow/hide protein sequence
MAVNPTNLTSHFAPTFRRSHPISAPWTALKPSFRRSLFASLAGSYRCFSPPAARKIACKASEISVAEEPSAPGDSGNWVPVVPLAALPRGERRVIIQGGETILLLWYKDV
IFAIENRSPAEGAYTEGLLNAKLTKDGCIVCPTTDSTFDLQTGEIKEWYPKNPVLRVLTPALRKLFIYPVKIDQDNIYINMGGNVAISDSSAEIVFSGKAQPGVTATDVN
VDEVRMVVDEDLEGFGFTGKNEVINGKAAVIGFLLLLDFELLTGKGLLKGTGFLDFIYSVSDAFK