; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC11g1414 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC11g1414
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionRieske domain-containing protein
Genome locationMC11:18687652..18692802
RNA-Seq ExpressionMC11g1414
SyntenyMC11g1414
Gene Ontology termsGO:0042128 - nitrate assimilation (biological process)
GO:0031967 - organelle envelope (cellular component)
GO:0008942 - nitrite reductase [NAD(P)H] activity (molecular function)
GO:0046872 - metal ion binding (molecular function)
GO:0051213 - dioxygenase activity (molecular function)
GO:0051537 - 2 iron, 2 sulfur cluster binding (molecular function)
InterPro domainsIPR012748 - Rieske-like [2Fe-2S] domain, NirD-type
IPR017941 - Rieske [2Fe-2S] iron-sulphur domain
IPR036922 - Rieske [2Fe-2S] iron-sulphur domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004141552.1 uncharacterized protein LOC101206141 [Cucumis sativus]1.73e-16787.2Show/hide
Query:  MAVNPTNLTSHFAPTFRRSHPISAPWTA----LKPSFRRSLFASLAGSYRCFSPPAARKIACKASEISVAEEPSAPGDSGNWVPVVPLAALPRGERRVII
        MA+N TN TSHFA TFRR HPISAP TA    LKP+  RSLFAS          P ARKI+CKASEISVAEE SA   SGNWVPVVPL+ALPRGERRVII
Subjt:  MAVNPTNLTSHFAPTFRRSHPISAPWTA----LKPSFRRSLFASLAGSYRCFSPPAARKIACKASEISVAEEPSAPGDSGNWVPVVPLAALPRGERRVII

Query:  QGGETILLLWYKDVIFAIENRSPAEGAYTEGLLNAKLTKDGCIVCPTTDSTFDLQTGEIKEWYPKNPVLRVLTPALRKLFIYPVKIDQDNIYINMGGNVA
        QGGETILLLWYKD IFAIENRSPAEGAYTEGLLNAKLTKDGCIVCPTTDSTFDLQTGEIKEWYPKNPVLRVLTPALRKLFIY VK D++NIYINM GNV 
Subjt:  QGGETILLLWYKDVIFAIENRSPAEGAYTEGLLNAKLTKDGCIVCPTTDSTFDLQTGEIKEWYPKNPVLRVLTPALRKLFIYPVKIDQDNIYINMGGNVA

Query:  ISDSSAEIVFSGKAQPGVTATDVNVDEVRMVVDEDLEGFGFTGKNEVINGKAAVIGFLLLLDFELLTGKGLLKGTGFLDFIYSVSDAFK
        ISDSSAEIVFSGKAQPGVTATDVNVDEVRMVVDEDLEGFGFTGKNEVINGKAAVIGFLLLLDFELLTGKGLLKGTGFLDFIYSVSDAFK
Subjt:  ISDSSAEIVFSGKAQPGVTATDVNVDEVRMVVDEDLEGFGFTGKNEVINGKAAVIGFLLLLDFELLTGKGLLKGTGFLDFIYSVSDAFK

XP_008459624.1 PREDICTED: uncharacterized protein LOC103498694 [Cucumis melo]6.72e-16585.47Show/hide
Query:  MAVNPTNLTSHFAPTFRRSHPISAPWTA----LKPSFRRSLFASLAGSYRCFSPPAARKIACKASEISVAEEPSAPGDSGNWVPVVPLAALPRGERRVII
        MA+N TN TSHF  TFRR HPISAP TA    LKP+  RSL AS          P ARKI+CKASEISVAEE SA   SGNWVPVVPL+ALPRGERRVII
Subjt:  MAVNPTNLTSHFAPTFRRSHPISAPWTA----LKPSFRRSLFASLAGSYRCFSPPAARKIACKASEISVAEEPSAPGDSGNWVPVVPLAALPRGERRVII

Query:  QGGETILLLWYKDVIFAIENRSPAEGAYTEGLLNAKLTKDGCIVCPTTDSTFDLQTGEIKEWYPKNPVLRVLTPALRKLFIYPVKIDQDNIYINMGGNVA
        QGGETILLLWYKD IFAIENRSPAEGAYTEGLLNAKLTKDGCIVCPTTDSTFDLQTG+IKEWYP NPVLRVLTPALRKLFIYPVK D++NIYINM GNV 
Subjt:  QGGETILLLWYKDVIFAIENRSPAEGAYTEGLLNAKLTKDGCIVCPTTDSTFDLQTGEIKEWYPKNPVLRVLTPALRKLFIYPVKIDQDNIYINMGGNVA

Query:  ISDSSAEIVFSGKAQPGVTATDVNVDEVRMVVDEDLEGFGFTGKNEVINGKAAVIGFLLLLDFELLTGKGLLKGTGFLDFIYSVSDAFK
        ISD+SAEIVFSGKAQPGVTATDVNVDEV+MVVDEDLEGFGFTGKNEVINGKAAVIGFLLLLDFELLTGKGLLKGTGFLDFIYSVSDAFK
Subjt:  ISDSSAEIVFSGKAQPGVTATDVNVDEVRMVVDEDLEGFGFTGKNEVINGKAAVIGFLLLLDFELLTGKGLLKGTGFLDFIYSVSDAFK

XP_022141822.1 uncharacterized protein LOC111012095 [Momordica charantia]9.09e-206100Show/hide
Query:  MAVNPTNLTSHFAPTFRRSHPISAPWTALKPSFRRSLFASLAGSYRCFSPPAARKIACKASEISVAEEPSAPGDSGNWVPVVPLAALPRGERRVIIQGGE
        MAVNPTNLTSHFAPTFRRSHPISAPWTALKPSFRRSLFASLAGSYRCFSPPAARKIACKASEISVAEEPSAPGDSGNWVPVVPLAALPRGERRVIIQGGE
Subjt:  MAVNPTNLTSHFAPTFRRSHPISAPWTALKPSFRRSLFASLAGSYRCFSPPAARKIACKASEISVAEEPSAPGDSGNWVPVVPLAALPRGERRVIIQGGE

Query:  TILLLWYKDVIFAIENRSPAEGAYTEGLLNAKLTKDGCIVCPTTDSTFDLQTGEIKEWYPKNPVLRVLTPALRKLFIYPVKIDQDNIYINMGGNVAISDS
        TILLLWYKDVIFAIENRSPAEGAYTEGLLNAKLTKDGCIVCPTTDSTFDLQTGEIKEWYPKNPVLRVLTPALRKLFIYPVKIDQDNIYINMGGNVAISDS
Subjt:  TILLLWYKDVIFAIENRSPAEGAYTEGLLNAKLTKDGCIVCPTTDSTFDLQTGEIKEWYPKNPVLRVLTPALRKLFIYPVKIDQDNIYINMGGNVAISDS

Query:  SAEIVFSGKAQPGVTATDVNVDEVRMVVDEDLEGFGFTGKNEVINGKAAVIGFLLLLDFELLTGKGLLKGTGFLDFIYSVSDAFK
        SAEIVFSGKAQPGVTATDVNVDEVRMVVDEDLEGFGFTGKNEVINGKAAVIGFLLLLDFELLTGKGLLKGTGFLDFIYSVSDAFK
Subjt:  SAEIVFSGKAQPGVTATDVNVDEVRMVVDEDLEGFGFTGKNEVINGKAAVIGFLLLLDFELLTGKGLLKGTGFLDFIYSVSDAFK

XP_022925511.1 uncharacterized protein LOC111432790 [Cucurbita moschata]1.92e-16484.08Show/hide
Query:  MAVNPTNLTSHFAPTFRRSHPISAPWTA----LKPSFRRSLFASLAGSYRCFSPPAARKIACKASEISVAEEPSAPGDSGNWVPVVPLAALPRGERRVII
        MAVN TNLTSHFA  FRR+HPIS+P TA    LKP+ RRSLFAS         P   RKI CKASE+SVAEE SA   SGNWVPVVPL+ALP+GERRVII
Subjt:  MAVNPTNLTSHFAPTFRRSHPISAPWTA----LKPSFRRSLFASLAGSYRCFSPPAARKIACKASEISVAEEPSAPGDSGNWVPVVPLAALPRGERRVII

Query:  QGGETILLLWYKDVIFAIENRSPAEGAYTEGLLNAKLTKDGCIVCPTTDSTFDLQTGEIKEWYPKNPVLRVLTPALRKLFIYPVKIDQDNIYINMGGNVA
        QGGET+LLLWYK+ IFAIENRSPAEGAYTEGLLNAKLTKDGCIVCPTTDSTFDLQTG+IKEWYP NPVLRVLTPALRKLF+YPVK D++NIYINM GNV 
Subjt:  QGGETILLLWYKDVIFAIENRSPAEGAYTEGLLNAKLTKDGCIVCPTTDSTFDLQTGEIKEWYPKNPVLRVLTPALRKLFIYPVKIDQDNIYINMGGNVA

Query:  ISDSSAEIVFSGKAQPGVTATDVNVDEVRMVVDEDLEGFGFTGKNEVINGKAAVIGFLLLLDFELLTGKGLLKGTGFLDFIYSVSDAFK
        ISDSSAEIVFSGKAQPGVTATDVNVDEV+MVVDE+LEGFGFTGKNEVINGKAAVIGFLLLLDFELLTGKGLLKGTGFLDFIYS SDAFK
Subjt:  ISDSSAEIVFSGKAQPGVTATDVNVDEVRMVVDEDLEGFGFTGKNEVINGKAAVIGFLLLLDFELLTGKGLLKGTGFLDFIYSVSDAFK

XP_038891275.1 uncharacterized protein LOC120080616 [Benincasa hispida]2.81e-17388.58Show/hide
Query:  MAVNPTNLTSHFAPTFRRSHPISAPWTA----LKPSFRRSLFASLAGSYRCFSPPAARKIACKASEISVAEEPSAPGDSGNWVPVVPLAALPRGERRVII
        MAVN TNLTSHFAPTFRR+HPISAP TA    LKP+  RSLFAS         PPA+RKI+CKASEISVAEE SA   SGNWVPVVPL+ALPRGERRVII
Subjt:  MAVNPTNLTSHFAPTFRRSHPISAPWTA----LKPSFRRSLFASLAGSYRCFSPPAARKIACKASEISVAEEPSAPGDSGNWVPVVPLAALPRGERRVII

Query:  QGGETILLLWYKDVIFAIENRSPAEGAYTEGLLNAKLTKDGCIVCPTTDSTFDLQTGEIKEWYPKNPVLRVLTPALRKLFIYPVKIDQDNIYINMGGNVA
        QGGETILLLWYKD IFAIENRSPAEGAYTEGLLNAKLTKDGCIVCPTTDSTFDLQTG+IKEWYPKNPVLRVLTPALRKLFIYPVK D+DNIYINM GNV 
Subjt:  QGGETILLLWYKDVIFAIENRSPAEGAYTEGLLNAKLTKDGCIVCPTTDSTFDLQTGEIKEWYPKNPVLRVLTPALRKLFIYPVKIDQDNIYINMGGNVA

Query:  ISDSSAEIVFSGKAQPGVTATDVNVDEVRMVVDEDLEGFGFTGKNEVINGKAAVIGFLLLLDFELLTGKGLLKGTGFLDFIYSVSDAFK
        ISDSSAEIVFSGKAQPGVTATDVNVDEV+MVVDEDLEGFGFTGKNEVINGKAAVIGFLLLLDFELLTGKGLLKGTGFLDFIYSVSDAFK
Subjt:  ISDSSAEIVFSGKAQPGVTATDVNVDEVRMVVDEDLEGFGFTGKNEVINGKAAVIGFLLLLDFELLTGKGLLKGTGFLDFIYSVSDAFK

TrEMBL top hitse value%identityAlignment
A0A0A0KXX5 Rieske domain-containing protein8.37e-16887.2Show/hide
Query:  MAVNPTNLTSHFAPTFRRSHPISAPWTA----LKPSFRRSLFASLAGSYRCFSPPAARKIACKASEISVAEEPSAPGDSGNWVPVVPLAALPRGERRVII
        MA+N TN TSHFA TFRR HPISAP TA    LKP+  RSLFAS          P ARKI+CKASEISVAEE SA   SGNWVPVVPL+ALPRGERRVII
Subjt:  MAVNPTNLTSHFAPTFRRSHPISAPWTA----LKPSFRRSLFASLAGSYRCFSPPAARKIACKASEISVAEEPSAPGDSGNWVPVVPLAALPRGERRVII

Query:  QGGETILLLWYKDVIFAIENRSPAEGAYTEGLLNAKLTKDGCIVCPTTDSTFDLQTGEIKEWYPKNPVLRVLTPALRKLFIYPVKIDQDNIYINMGGNVA
        QGGETILLLWYKD IFAIENRSPAEGAYTEGLLNAKLTKDGCIVCPTTDSTFDLQTGEIKEWYPKNPVLRVLTPALRKLFIY VK D++NIYINM GNV 
Subjt:  QGGETILLLWYKDVIFAIENRSPAEGAYTEGLLNAKLTKDGCIVCPTTDSTFDLQTGEIKEWYPKNPVLRVLTPALRKLFIYPVKIDQDNIYINMGGNVA

Query:  ISDSSAEIVFSGKAQPGVTATDVNVDEVRMVVDEDLEGFGFTGKNEVINGKAAVIGFLLLLDFELLTGKGLLKGTGFLDFIYSVSDAFK
        ISDSSAEIVFSGKAQPGVTATDVNVDEVRMVVDEDLEGFGFTGKNEVINGKAAVIGFLLLLDFELLTGKGLLKGTGFLDFIYSVSDAFK
Subjt:  ISDSSAEIVFSGKAQPGVTATDVNVDEVRMVVDEDLEGFGFTGKNEVINGKAAVIGFLLLLDFELLTGKGLLKGTGFLDFIYSVSDAFK

A0A1S3CBV1 uncharacterized protein LOC1034986943.25e-16585.47Show/hide
Query:  MAVNPTNLTSHFAPTFRRSHPISAPWTA----LKPSFRRSLFASLAGSYRCFSPPAARKIACKASEISVAEEPSAPGDSGNWVPVVPLAALPRGERRVII
        MA+N TN TSHF  TFRR HPISAP TA    LKP+  RSL AS          P ARKI+CKASEISVAEE SA   SGNWVPVVPL+ALPRGERRVII
Subjt:  MAVNPTNLTSHFAPTFRRSHPISAPWTA----LKPSFRRSLFASLAGSYRCFSPPAARKIACKASEISVAEEPSAPGDSGNWVPVVPLAALPRGERRVII

Query:  QGGETILLLWYKDVIFAIENRSPAEGAYTEGLLNAKLTKDGCIVCPTTDSTFDLQTGEIKEWYPKNPVLRVLTPALRKLFIYPVKIDQDNIYINMGGNVA
        QGGETILLLWYKD IFAIENRSPAEGAYTEGLLNAKLTKDGCIVCPTTDSTFDLQTG+IKEWYP NPVLRVLTPALRKLFIYPVK D++NIYINM GNV 
Subjt:  QGGETILLLWYKDVIFAIENRSPAEGAYTEGLLNAKLTKDGCIVCPTTDSTFDLQTGEIKEWYPKNPVLRVLTPALRKLFIYPVKIDQDNIYINMGGNVA

Query:  ISDSSAEIVFSGKAQPGVTATDVNVDEVRMVVDEDLEGFGFTGKNEVINGKAAVIGFLLLLDFELLTGKGLLKGTGFLDFIYSVSDAFK
        ISD+SAEIVFSGKAQPGVTATDVNVDEV+MVVDEDLEGFGFTGKNEVINGKAAVIGFLLLLDFELLTGKGLLKGTGFLDFIYSVSDAFK
Subjt:  ISDSSAEIVFSGKAQPGVTATDVNVDEVRMVVDEDLEGFGFTGKNEVINGKAAVIGFLLLLDFELLTGKGLLKGTGFLDFIYSVSDAFK

A0A5D3BMM3 Rieske domain-containing protein3.25e-16585.47Show/hide
Query:  MAVNPTNLTSHFAPTFRRSHPISAPWTA----LKPSFRRSLFASLAGSYRCFSPPAARKIACKASEISVAEEPSAPGDSGNWVPVVPLAALPRGERRVII
        MA+N TN TSHF  TFRR HPISAP TA    LKP+  RSL AS          P ARKI+CKASEISVAEE SA   SGNWVPVVPL+ALPRGERRVII
Subjt:  MAVNPTNLTSHFAPTFRRSHPISAPWTA----LKPSFRRSLFASLAGSYRCFSPPAARKIACKASEISVAEEPSAPGDSGNWVPVVPLAALPRGERRVII

Query:  QGGETILLLWYKDVIFAIENRSPAEGAYTEGLLNAKLTKDGCIVCPTTDSTFDLQTGEIKEWYPKNPVLRVLTPALRKLFIYPVKIDQDNIYINMGGNVA
        QGGETILLLWYKD IFAIENRSPAEGAYTEGLLNAKLTKDGCIVCPTTDSTFDLQTG+IKEWYP NPVLRVLTPALRKLFIYPVK D++NIYINM GNV 
Subjt:  QGGETILLLWYKDVIFAIENRSPAEGAYTEGLLNAKLTKDGCIVCPTTDSTFDLQTGEIKEWYPKNPVLRVLTPALRKLFIYPVKIDQDNIYINMGGNVA

Query:  ISDSSAEIVFSGKAQPGVTATDVNVDEVRMVVDEDLEGFGFTGKNEVINGKAAVIGFLLLLDFELLTGKGLLKGTGFLDFIYSVSDAFK
        ISD+SAEIVFSGKAQPGVTATDVNVDEV+MVVDEDLEGFGFTGKNEVINGKAAVIGFLLLLDFELLTGKGLLKGTGFLDFIYSVSDAFK
Subjt:  ISDSSAEIVFSGKAQPGVTATDVNVDEVRMVVDEDLEGFGFTGKNEVINGKAAVIGFLLLLDFELLTGKGLLKGTGFLDFIYSVSDAFK

A0A6J1CKX8 uncharacterized protein LOC1110120954.40e-206100Show/hide
Query:  MAVNPTNLTSHFAPTFRRSHPISAPWTALKPSFRRSLFASLAGSYRCFSPPAARKIACKASEISVAEEPSAPGDSGNWVPVVPLAALPRGERRVIIQGGE
        MAVNPTNLTSHFAPTFRRSHPISAPWTALKPSFRRSLFASLAGSYRCFSPPAARKIACKASEISVAEEPSAPGDSGNWVPVVPLAALPRGERRVIIQGGE
Subjt:  MAVNPTNLTSHFAPTFRRSHPISAPWTALKPSFRRSLFASLAGSYRCFSPPAARKIACKASEISVAEEPSAPGDSGNWVPVVPLAALPRGERRVIIQGGE

Query:  TILLLWYKDVIFAIENRSPAEGAYTEGLLNAKLTKDGCIVCPTTDSTFDLQTGEIKEWYPKNPVLRVLTPALRKLFIYPVKIDQDNIYINMGGNVAISDS
        TILLLWYKDVIFAIENRSPAEGAYTEGLLNAKLTKDGCIVCPTTDSTFDLQTGEIKEWYPKNPVLRVLTPALRKLFIYPVKIDQDNIYINMGGNVAISDS
Subjt:  TILLLWYKDVIFAIENRSPAEGAYTEGLLNAKLTKDGCIVCPTTDSTFDLQTGEIKEWYPKNPVLRVLTPALRKLFIYPVKIDQDNIYINMGGNVAISDS

Query:  SAEIVFSGKAQPGVTATDVNVDEVRMVVDEDLEGFGFTGKNEVINGKAAVIGFLLLLDFELLTGKGLLKGTGFLDFIYSVSDAFK
        SAEIVFSGKAQPGVTATDVNVDEVRMVVDEDLEGFGFTGKNEVINGKAAVIGFLLLLDFELLTGKGLLKGTGFLDFIYSVSDAFK
Subjt:  SAEIVFSGKAQPGVTATDVNVDEVRMVVDEDLEGFGFTGKNEVINGKAAVIGFLLLLDFELLTGKGLLKGTGFLDFIYSVSDAFK

A0A6J1ECE7 uncharacterized protein LOC1114327909.32e-16584.08Show/hide
Query:  MAVNPTNLTSHFAPTFRRSHPISAPWTA----LKPSFRRSLFASLAGSYRCFSPPAARKIACKASEISVAEEPSAPGDSGNWVPVVPLAALPRGERRVII
        MAVN TNLTSHFA  FRR+HPIS+P TA    LKP+ RRSLFAS         P   RKI CKASE+SVAEE SA   SGNWVPVVPL+ALP+GERRVII
Subjt:  MAVNPTNLTSHFAPTFRRSHPISAPWTA----LKPSFRRSLFASLAGSYRCFSPPAARKIACKASEISVAEEPSAPGDSGNWVPVVPLAALPRGERRVII

Query:  QGGETILLLWYKDVIFAIENRSPAEGAYTEGLLNAKLTKDGCIVCPTTDSTFDLQTGEIKEWYPKNPVLRVLTPALRKLFIYPVKIDQDNIYINMGGNVA
        QGGET+LLLWYK+ IFAIENRSPAEGAYTEGLLNAKLTKDGCIVCPTTDSTFDLQTG+IKEWYP NPVLRVLTPALRKLF+YPVK D++NIYINM GNV 
Subjt:  QGGETILLLWYKDVIFAIENRSPAEGAYTEGLLNAKLTKDGCIVCPTTDSTFDLQTGEIKEWYPKNPVLRVLTPALRKLFIYPVKIDQDNIYINMGGNVA

Query:  ISDSSAEIVFSGKAQPGVTATDVNVDEVRMVVDEDLEGFGFTGKNEVINGKAAVIGFLLLLDFELLTGKGLLKGTGFLDFIYSVSDAFK
        ISDSSAEIVFSGKAQPGVTATDVNVDEV+MVVDE+LEGFGFTGKNEVINGKAAVIGFLLLLDFELLTGKGLLKGTGFLDFIYS SDAFK
Subjt:  ISDSSAEIVFSGKAQPGVTATDVNVDEVRMVVDEDLEGFGFTGKNEVINGKAAVIGFLLLLDFELLTGKGLLKGTGFLDFIYSVSDAFK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G71500.1 Rieske (2Fe-2S) domain-containing protein4.9e-10177.39Show/hide
Query:  IACKASEISVAEEPSAPGDSGNWVPVVPLAALPRGERRVIIQGGETILLLWYKDVIFAIENRSPAEGAYTEGLLNAKLTKDGCIVCPTTDSTFDLQTGEI
        + C+A+E+S +   S PG   NWVPVVPL+ALP+GERRV+IQ  ETILLLWYK+ +FAIENRSPAEGAY+EGLLNA+LT+DGCIVCP+TDSTFDL+TGEI
Subjt:  IACKASEISVAEEPSAPGDSGNWVPVVPLAALPRGERRVIIQGGETILLLWYKDVIFAIENRSPAEGAYTEGLLNAKLTKDGCIVCPTTDSTFDLQTGEI

Query:  KEWYPKNPVLRVLTPALRKLFIYPVKIDQDNIYINMGGNVAISDSSAEIVFSGKAQPGVTATDVNVDEVRMVVDEDLEGFGFTGKNEVINGKAAVIGFLL
        +EWYPKNPVLRVLTPALRKLF+YPVK D++NIYI++  +   ++++AEIVFSGKAQPG+TAT+VNVDEVRM+VDE  EGFGFT KNEVINGKAAVIGFLL
Subjt:  KEWYPKNPVLRVLTPALRKLFIYPVKIDQDNIYINMGGNVAISDSSAEIVFSGKAQPGVTATDVNVDEVRMVVDEDLEGFGFTGKNEVINGKAAVIGFLL

Query:  LLDFELLTGKGLLKGTGFLDFIYSVSDAFK
        LLDFELLTGKGLLKGTGFLDF+YS SDAFK
Subjt:  LLDFELLTGKGLLKGTGFLDFIYSVSDAFK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCGTCAATCCCACAAATCTCACTTCCCATTTTGCCCCCACATTCCGCCGGAGCCACCCCATCTCCGCCCCATGGACCGCCCTGAAACCCTCCTTCCGCCGCTCCCT
TTTCGCCTCTCTCGCTGGGAGTTACCGTTGTTTCTCTCCGCCTGCTGCCCGCAAAATCGCGTGCAAGGCATCCGAGATCTCGGTGGCGGAGGAACCGTCGGCACCGGGGG
ACAGTGGTAACTGGGTGCCGGTGGTGCCGCTGGCGGCCCTCCCGAGAGGGGAGCGGCGCGTGATTATTCAGGGCGGCGAGACGATTCTGCTTCTTTGGTATAAGGATGTG
ATTTTCGCTATTGAGAATAGATCTCCTGCTGAAGGTGCTTACACGGAAGGCCTCCTCAATGCCAAACTCACTAAGGATGGGTGCATTGTCTGTCCAACGACAGATAGCAC
ATTCGACCTCCAAACAGGAGAGATCAAGGAATGGTATCCAAAGAACCCAGTTCTCAGAGTCCTCACCCCAGCCTTGAGGAAGCTTTTTATATACCCGGTAAAAATTGATC
AAGACAACATTTACATCAACATGGGAGGAAATGTAGCAATATCAGATTCATCAGCCGAGATCGTTTTCAGTGGAAAGGCTCAACCCGGTGTCACTGCAACCGATGTCAAT
GTCGACGAGGTGAGAATGGTGGTTGATGAAGATCTTGAAGGGTTTGGCTTCACTGGGAAAAATGAAGTGATAAATGGAAAAGCAGCAGTGATTGGGTTCTTGTTGCTGTT
GGATTTTGAGCTCTTAACTGGCAAGGGTCTTCTCAAGGGAACTGGCTTCTTGGACTTCATTTATTCTGTTTCAGATGCTTTCAAATAG
mRNA sequenceShow/hide mRNA sequence
AGTGGATTGAAGTTATGAAACGAGTCCTGCGGGCCAACGAAAGCGCCCATTCGCACCCAATTACACAACGTATAAGAATTTGGTGCACAAATTTGGGTTTCTGCATACGC
CTCGATTACTGGTTCTGCTTTTTCCGTTATCAACTCTCTGTGGTCAATGGAAAATTCTCCCCTCTCTCTCTCCTCCTCTGCTTCTTCTCTCTCTATCTCTCACTCCCCAC
CCCCTCTCTCTCTTCCTCTTCCACGCTCTAATGGCCGTCAATCCCACAAATCTCACTTCCCATTTTGCCCCCACATTCCGCCGGAGCCACCCCATCTCCGCCCCATGGAC
CGCCCTGAAACCCTCCTTCCGCCGCTCCCTTTTCGCCTCTCTCGCTGGGAGTTACCGTTGTTTCTCTCCGCCTGCTGCCCGCAAAATCGCGTGCAAGGCATCCGAGATCT
CGGTGGCGGAGGAACCGTCGGCACCGGGGGACAGTGGTAACTGGGTGCCGGTGGTGCCGCTGGCGGCCCTCCCGAGAGGGGAGCGGCGCGTGATTATTCAGGGCGGCGAG
ACGATTCTGCTTCTTTGGTATAAGGATGTGATTTTCGCTATTGAGAATAGATCTCCTGCTGAAGGTGCTTACACGGAAGGCCTCCTCAATGCCAAACTCACTAAGGATGG
GTGCATTGTCTGTCCAACGACAGATAGCACATTCGACCTCCAAACAGGAGAGATCAAGGAATGGTATCCAAAGAACCCAGTTCTCAGAGTCCTCACCCCAGCCTTGAGGA
AGCTTTTTATATACCCGGTAAAAATTGATCAAGACAACATTTACATCAACATGGGAGGAAATGTAGCAATATCAGATTCATCAGCCGAGATCGTTTTCAGTGGAAAGGCT
CAACCCGGTGTCACTGCAACCGATGTCAATGTCGACGAGGTGAGAATGGTGGTTGATGAAGATCTTGAAGGGTTTGGCTTCACTGGGAAAAATGAAGTGATAAATGGAAA
AGCAGCAGTGATTGGGTTCTTGTTGCTGTTGGATTTTGAGCTCTTAACTGGCAAGGGTCTTCTCAAGGGAACTGGCTTCTTGGACTTCATTTATTCTGTTTCAGATGCTT
TCAAATAGATGGGTAACCAGGTCTCAACCGATGCATTAACGATGACTCGACCATTGGTTTGACGCAGGTATGGAAGAGCCACCAATGTTGGATAAACATTTCCCCAAAAG
TTTATGTCCTGCAAACGATACGATAAACCAGGTCTCAGGTCTTCAGGATAATATTAGGCAATACTAACACAACTACTAGGTTTTAGGCGAATTTGTAAAACCACAAAAAA
CTACTCTCAACTCGTCTTCCCTACCATCAAGTGGGGAAATATAGATGTGTCGGTAACTTCGTCGAAGTAGAATGTATGTCCCAGACTTGCAGTATTTACTAGATGATCCA
CTGCACAAGAACAACAGTTACAAGATAAAGAAAACAACCATTGCTCATAAAGAATATATGGTTCATGAACTACAAATTCTTCATTTTTATGATATCCATGAGTTGGTCTC
CACATTTAAGAACACAAGTTTTGAAAGAGTATATTGACTAACGCTGAGCAATATGAAACCAACTGTGAACTACATGGTTTTTTGAAATAACGACTTTAAAGTTTATTATC
CAAAAAATGGAATTAAAATCACTTTTATGATTTTTCAGTTCAATATCTAACTTCCCAATCATTTTCTTTTCGGATTACTAAGCCCCAATCGATAATCACTGTATTCCTTT
ATGAAGATCTAAGCCAAATTTTGGTTTTCTCGTTTTTGTTTTTAAAATTTAGCTAAGAATTCAAATCCTATACCAAGGAAATGGGGACAGAGTTTGTTTTAACATTTAAT
AATATATAAAATCTAAATAATATTCTAACATCAAAGCTGGAATAGCTCAGTTGGTTAGAGTTTGCGGCTGTTAACCACAAGGTCGGAGGTTCAAGCCCTCTTTCTAGCGG
AAGATTATTTATTTTTTCTTTTTTCGTAATTTCAATAAAATGAAGATTTTTTTTAGAAAGACGCAAGGATAACACACTGCTAATTATACTGACACGTAGGCACTTAGCCA
ATTAAATGAAGATTATTGGTGACTGATTAGAAATATGCATACCACGTCCAAAGAAATTAACAGCCTCAGAAACAAATCTTCGACAATCATCTTCCTTAACAACATCCGCC
GCCATAACCAAAACTCTCTTGGCTCCCATCAGCCTCGCGTTCTCGCTTATCACTCTCAGCCTGTTCTCCCTCCTCGCCACCAGCATCAGATTCGCCCCCCTCTTCGCGTA
CTCGTACGCTATTTGCTGCACTCACTCTAAATCATTAGAAGCACAGCCATAAACAAAAAACATGAAACAGTAACGTTATTATGTTTAACGACCTCTCCAATGCCGGAGGA
GGCTCCGGTGATGATGACGACTTTATCGAGCATGACTTCGGAGTTGAGGGAGCTGTAAATCCACTCGCAGGCGTTGATGAAGGACAATGCCGGCCATGAGAAAGCCAGCA
GCACCAAGCTCGCCGGCGGCACCACCAGGTTCAGGAATGAGTTTATCAACTCCATCTCTCATCAACACACAGTACAAAGACAGACAAAAAAAAATGATCTCTAAAAGGGC
A
Protein sequenceShow/hide protein sequence
MAVNPTNLTSHFAPTFRRSHPISAPWTALKPSFRRSLFASLAGSYRCFSPPAARKIACKASEISVAEEPSAPGDSGNWVPVVPLAALPRGERRVIIQGGETILLLWYKDV
IFAIENRSPAEGAYTEGLLNAKLTKDGCIVCPTTDSTFDLQTGEIKEWYPKNPVLRVLTPALRKLFIYPVKIDQDNIYINMGGNVAISDSSAEIVFSGKAQPGVTATDVN
VDEVRMVVDEDLEGFGFTGKNEVINGKAAVIGFLLLLDFELLTGKGLLKGTGFLDFIYSVSDAFK