; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10003436 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10003436
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionRieske domain-containing protein
Genome locationChr08:1294694..1296558
RNA-Seq ExpressionHG10003436
SyntenyHG10003436
Gene Ontology termsGO:0042128 - nitrate assimilation (biological process)
GO:0031967 - organelle envelope (cellular component)
GO:0008942 - nitrite reductase [NAD(P)H] activity (molecular function)
GO:0046872 - metal ion binding (molecular function)
GO:0051537 - 2 iron, 2 sulfur cluster binding (molecular function)
InterPro domainsIPR012748 - Rieske-like [2Fe-2S] domain, NirD-type
IPR017941 - Rieske [2Fe-2S] iron-sulphur domain
IPR036922 - Rieske [2Fe-2S] iron-sulphur domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004141552.1 uncharacterized protein LOC101206141 [Cucumis sativus]1.5e-14494.58Show/hide
Query:  MAVNSTNLTSHFAPTFRRTHPISAPCTAALSLLKPALHRSLFASPPPAARKISCKASEISVAEESSASGNWVPVVPLSALPRGERRVIIQGGETILLLWY
        MA+NSTN TSHFA TFRR HPISAPCTAAL LLKPALHRSLFASP P ARKISCKASEISVAEESSASGNWVPVVPLSALPRGERRVIIQGGETILLLWY
Subjt:  MAVNSTNLTSHFAPTFRRTHPISAPCTAALSLLKPALHRSLFASPPPAARKISCKASEISVAEESSASGNWVPVVPLSALPRGERRVIIQGGETILLLWY

Query:  KDKIFAIENRSPAEGAYTEGLLNAKLTQDGCIVCPTTDSTFDLQTGDIKEWYPKNPVLRVLTPALRKLFIYPVKTDEDNIYINMRGNVIPDSSAEIVFSG
        KD+IFAIENRSPAEGAYTEGLLNAKLT+DGCIVCPTTDSTFDLQTG+IKEWYPKNPVLRVLTPALRKLFIY VKTDE+NIYINMRGNVI DSSAEIVFSG
Subjt:  KDKIFAIENRSPAEGAYTEGLLNAKLTQDGCIVCPTTDSTFDLQTGDIKEWYPKNPVLRVLTPALRKLFIYPVKTDEDNIYINMRGNVIPDSSAEIVFSG

Query:  KAQPGVTATDVNVDEVKMVVDEDLEGFGFTGKNEVINGKAAVIGFLLLLDFELLTGKGLLKGTGFLDFIYSISDAFK
        KAQPGVTATDVNVDEV+MVVDEDLEGFGFTGKNEVINGKAAVIGFLLLLDFELLTGKGLLKGTGFLDFIYS+SDAFK
Subjt:  KAQPGVTATDVNVDEVKMVVDEDLEGFGFTGKNEVINGKAAVIGFLLLLDFELLTGKGLLKGTGFLDFIYSISDAFK

XP_008459624.1 PREDICTED: uncharacterized protein LOC103498694 [Cucumis melo]4.3e-14494.22Show/hide
Query:  MAVNSTNLTSHFAPTFRRTHPISAPCTAALSLLKPALHRSLFASPPPAARKISCKASEISVAEESSASGNWVPVVPLSALPRGERRVIIQGGETILLLWY
        MA+NSTN TSHF  TFRR HPISAPCTAAL LLKPALHRSL ASP P ARKISCKASEISVAEESSASGNWVPVVPLSALPRGERRVIIQGGETILLLWY
Subjt:  MAVNSTNLTSHFAPTFRRTHPISAPCTAALSLLKPALHRSLFASPPPAARKISCKASEISVAEESSASGNWVPVVPLSALPRGERRVIIQGGETILLLWY

Query:  KDKIFAIENRSPAEGAYTEGLLNAKLTQDGCIVCPTTDSTFDLQTGDIKEWYPKNPVLRVLTPALRKLFIYPVKTDEDNIYINMRGNVIPDSSAEIVFSG
        KD+IFAIENRSPAEGAYTEGLLNAKLT+DGCIVCPTTDSTFDLQTGDIKEWYP NPVLRVLTPALRKLFIYPVKTDE+NIYINMRGNVI D+SAEIVFSG
Subjt:  KDKIFAIENRSPAEGAYTEGLLNAKLTQDGCIVCPTTDSTFDLQTGDIKEWYPKNPVLRVLTPALRKLFIYPVKTDEDNIYINMRGNVIPDSSAEIVFSG

Query:  KAQPGVTATDVNVDEVKMVVDEDLEGFGFTGKNEVINGKAAVIGFLLLLDFELLTGKGLLKGTGFLDFIYSISDAFK
        KAQPGVTATDVNVDEVKMVVDEDLEGFGFTGKNEVINGKAAVIGFLLLLDFELLTGKGLLKGTGFLDFIYS+SDAFK
Subjt:  KAQPGVTATDVNVDEVKMVVDEDLEGFGFTGKNEVINGKAAVIGFLLLLDFELLTGKGLLKGTGFLDFIYSISDAFK

XP_022925511.1 uncharacterized protein LOC111432790 [Cucurbita moschata]2.6e-14191.7Show/hide
Query:  MAVNSTNLTSHFAPTFRRTHPISAPCTAALSLLKPALHRSLFASPPPAARKISCKASEISVAEESSASGNWVPVVPLSALPRGERRVIIQGGETILLLWY
        MAVNSTNLTSHFA  FRR HPIS+PCTAAL +LKPAL RSLFASPP   RKI CKASE+SVAEESSASGNWVPVVPLSALP+GERRVIIQGGET+LLLWY
Subjt:  MAVNSTNLTSHFAPTFRRTHPISAPCTAALSLLKPALHRSLFASPPPAARKISCKASEISVAEESSASGNWVPVVPLSALPRGERRVIIQGGETILLLWY

Query:  KDKIFAIENRSPAEGAYTEGLLNAKLTQDGCIVCPTTDSTFDLQTGDIKEWYPKNPVLRVLTPALRKLFIYPVKTDEDNIYINMRGNVIPDSSAEIVFSG
        K++IFAIENRSPAEGAYTEGLLNAKLT+DGCIVCPTTDSTFDLQTGDIKEWYP NPVLRVLTPALRKLF+YPVKTDE+NIYINMRGNVI DSSAEIVFSG
Subjt:  KDKIFAIENRSPAEGAYTEGLLNAKLTQDGCIVCPTTDSTFDLQTGDIKEWYPKNPVLRVLTPALRKLFIYPVKTDEDNIYINMRGNVIPDSSAEIVFSG

Query:  KAQPGVTATDVNVDEVKMVVDEDLEGFGFTGKNEVINGKAAVIGFLLLLDFELLTGKGLLKGTGFLDFIYSISDAFK
        KAQPGVTATDVNVDEVKMVVDE+LEGFGFTGKNEVINGKAAVIGFLLLLDFELLTGKGLLKGTGFLDFIYS SDAFK
Subjt:  KAQPGVTATDVNVDEVKMVVDEDLEGFGFTGKNEVINGKAAVIGFLLLLDFELLTGKGLLKGTGFLDFIYSISDAFK

XP_022973986.1 uncharacterized protein LOC111472599 [Cucurbita maxima]1.2e-14192.06Show/hide
Query:  MAVNSTNLTSHFAPTFRRTHPISAPCTAALSLLKPALHRSLFASPPPAARKISCKASEISVAEESSASGNWVPVVPLSALPRGERRVIIQGGETILLLWY
        MAVNSTNLTSHFA  FRRTHPIS+PCTAAL +LKP L RSLFASPP   RKI CKASE+SVAEESSASGNWVPVVPLSALP+GERRVIIQGGET+LLLWY
Subjt:  MAVNSTNLTSHFAPTFRRTHPISAPCTAALSLLKPALHRSLFASPPPAARKISCKASEISVAEESSASGNWVPVVPLSALPRGERRVIIQGGETILLLWY

Query:  KDKIFAIENRSPAEGAYTEGLLNAKLTQDGCIVCPTTDSTFDLQTGDIKEWYPKNPVLRVLTPALRKLFIYPVKTDEDNIYINMRGNVIPDSSAEIVFSG
        K+KIFAIENRSPAEGAYTEGLLNAKLT+DGCIVCPTTDSTFDLQTGDIKEWYP NPVLRVLTPALRKLF+YPVKTDE+NIYINMRGNVI DSSAEIVFSG
Subjt:  KDKIFAIENRSPAEGAYTEGLLNAKLTQDGCIVCPTTDSTFDLQTGDIKEWYPKNPVLRVLTPALRKLFIYPVKTDEDNIYINMRGNVIPDSSAEIVFSG

Query:  KAQPGVTATDVNVDEVKMVVDEDLEGFGFTGKNEVINGKAAVIGFLLLLDFELLTGKGLLKGTGFLDFIYSISDAFK
        KAQPGVTATDVNVDEVKMVVDE+LEGFGFTGKNEVINGKAAVIGFLLLLDFELLTGKGLLKGTGFLDFIYS SDAFK
Subjt:  KAQPGVTATDVNVDEVKMVVDEDLEGFGFTGKNEVINGKAAVIGFLLLLDFELLTGKGLLKGTGFLDFIYSISDAFK

XP_038891275.1 uncharacterized protein LOC120080616 [Benincasa hispida]8.1e-15197.83Show/hide
Query:  MAVNSTNLTSHFAPTFRRTHPISAPCTAALSLLKPALHRSLFASPPPAARKISCKASEISVAEESSASGNWVPVVPLSALPRGERRVIIQGGETILLLWY
        MAVNSTNLTSHFAPTFRRTHPISAPCTAAL LLKPA+HRSLFASPPPA+RKISCKASEISVAEESSASGNWVPVVPLSALPRGERRVIIQGGETILLLWY
Subjt:  MAVNSTNLTSHFAPTFRRTHPISAPCTAALSLLKPALHRSLFASPPPAARKISCKASEISVAEESSASGNWVPVVPLSALPRGERRVIIQGGETILLLWY

Query:  KDKIFAIENRSPAEGAYTEGLLNAKLTQDGCIVCPTTDSTFDLQTGDIKEWYPKNPVLRVLTPALRKLFIYPVKTDEDNIYINMRGNVIPDSSAEIVFSG
        KDKIFAIENRSPAEGAYTEGLLNAKLT+DGCIVCPTTDSTFDLQTGDIKEWYPKNPVLRVLTPALRKLFIYPVKTDEDNIYINMRGNVI DSSAEIVFSG
Subjt:  KDKIFAIENRSPAEGAYTEGLLNAKLTQDGCIVCPTTDSTFDLQTGDIKEWYPKNPVLRVLTPALRKLFIYPVKTDEDNIYINMRGNVIPDSSAEIVFSG

Query:  KAQPGVTATDVNVDEVKMVVDEDLEGFGFTGKNEVINGKAAVIGFLLLLDFELLTGKGLLKGTGFLDFIYSISDAFK
        KAQPGVTATDVNVDEVKMVVDEDLEGFGFTGKNEVINGKAAVIGFLLLLDFELLTGKGLLKGTGFLDFIYS+SDAFK
Subjt:  KAQPGVTATDVNVDEVKMVVDEDLEGFGFTGKNEVINGKAAVIGFLLLLDFELLTGKGLLKGTGFLDFIYSISDAFK

TrEMBL top hitse value%identityAlignment
A0A0A0KXX5 Rieske domain-containing protein7.1e-14594.58Show/hide
Query:  MAVNSTNLTSHFAPTFRRTHPISAPCTAALSLLKPALHRSLFASPPPAARKISCKASEISVAEESSASGNWVPVVPLSALPRGERRVIIQGGETILLLWY
        MA+NSTN TSHFA TFRR HPISAPCTAAL LLKPALHRSLFASP P ARKISCKASEISVAEESSASGNWVPVVPLSALPRGERRVIIQGGETILLLWY
Subjt:  MAVNSTNLTSHFAPTFRRTHPISAPCTAALSLLKPALHRSLFASPPPAARKISCKASEISVAEESSASGNWVPVVPLSALPRGERRVIIQGGETILLLWY

Query:  KDKIFAIENRSPAEGAYTEGLLNAKLTQDGCIVCPTTDSTFDLQTGDIKEWYPKNPVLRVLTPALRKLFIYPVKTDEDNIYINMRGNVIPDSSAEIVFSG
        KD+IFAIENRSPAEGAYTEGLLNAKLT+DGCIVCPTTDSTFDLQTG+IKEWYPKNPVLRVLTPALRKLFIY VKTDE+NIYINMRGNVI DSSAEIVFSG
Subjt:  KDKIFAIENRSPAEGAYTEGLLNAKLTQDGCIVCPTTDSTFDLQTGDIKEWYPKNPVLRVLTPALRKLFIYPVKTDEDNIYINMRGNVIPDSSAEIVFSG

Query:  KAQPGVTATDVNVDEVKMVVDEDLEGFGFTGKNEVINGKAAVIGFLLLLDFELLTGKGLLKGTGFLDFIYSISDAFK
        KAQPGVTATDVNVDEV+MVVDEDLEGFGFTGKNEVINGKAAVIGFLLLLDFELLTGKGLLKGTGFLDFIYS+SDAFK
Subjt:  KAQPGVTATDVNVDEVKMVVDEDLEGFGFTGKNEVINGKAAVIGFLLLLDFELLTGKGLLKGTGFLDFIYSISDAFK

A0A1S3CBV1 uncharacterized protein LOC1034986942.1e-14494.22Show/hide
Query:  MAVNSTNLTSHFAPTFRRTHPISAPCTAALSLLKPALHRSLFASPPPAARKISCKASEISVAEESSASGNWVPVVPLSALPRGERRVIIQGGETILLLWY
        MA+NSTN TSHF  TFRR HPISAPCTAAL LLKPALHRSL ASP P ARKISCKASEISVAEESSASGNWVPVVPLSALPRGERRVIIQGGETILLLWY
Subjt:  MAVNSTNLTSHFAPTFRRTHPISAPCTAALSLLKPALHRSLFASPPPAARKISCKASEISVAEESSASGNWVPVVPLSALPRGERRVIIQGGETILLLWY

Query:  KDKIFAIENRSPAEGAYTEGLLNAKLTQDGCIVCPTTDSTFDLQTGDIKEWYPKNPVLRVLTPALRKLFIYPVKTDEDNIYINMRGNVIPDSSAEIVFSG
        KD+IFAIENRSPAEGAYTEGLLNAKLT+DGCIVCPTTDSTFDLQTGDIKEWYP NPVLRVLTPALRKLFIYPVKTDE+NIYINMRGNVI D+SAEIVFSG
Subjt:  KDKIFAIENRSPAEGAYTEGLLNAKLTQDGCIVCPTTDSTFDLQTGDIKEWYPKNPVLRVLTPALRKLFIYPVKTDEDNIYINMRGNVIPDSSAEIVFSG

Query:  KAQPGVTATDVNVDEVKMVVDEDLEGFGFTGKNEVINGKAAVIGFLLLLDFELLTGKGLLKGTGFLDFIYSISDAFK
        KAQPGVTATDVNVDEVKMVVDEDLEGFGFTGKNEVINGKAAVIGFLLLLDFELLTGKGLLKGTGFLDFIYS+SDAFK
Subjt:  KAQPGVTATDVNVDEVKMVVDEDLEGFGFTGKNEVINGKAAVIGFLLLLDFELLTGKGLLKGTGFLDFIYSISDAFK

A0A5D3BMM3 Rieske domain-containing protein2.1e-14494.22Show/hide
Query:  MAVNSTNLTSHFAPTFRRTHPISAPCTAALSLLKPALHRSLFASPPPAARKISCKASEISVAEESSASGNWVPVVPLSALPRGERRVIIQGGETILLLWY
        MA+NSTN TSHF  TFRR HPISAPCTAAL LLKPALHRSL ASP P ARKISCKASEISVAEESSASGNWVPVVPLSALPRGERRVIIQGGETILLLWY
Subjt:  MAVNSTNLTSHFAPTFRRTHPISAPCTAALSLLKPALHRSLFASPPPAARKISCKASEISVAEESSASGNWVPVVPLSALPRGERRVIIQGGETILLLWY

Query:  KDKIFAIENRSPAEGAYTEGLLNAKLTQDGCIVCPTTDSTFDLQTGDIKEWYPKNPVLRVLTPALRKLFIYPVKTDEDNIYINMRGNVIPDSSAEIVFSG
        KD+IFAIENRSPAEGAYTEGLLNAKLT+DGCIVCPTTDSTFDLQTGDIKEWYP NPVLRVLTPALRKLFIYPVKTDE+NIYINMRGNVI D+SAEIVFSG
Subjt:  KDKIFAIENRSPAEGAYTEGLLNAKLTQDGCIVCPTTDSTFDLQTGDIKEWYPKNPVLRVLTPALRKLFIYPVKTDEDNIYINMRGNVIPDSSAEIVFSG

Query:  KAQPGVTATDVNVDEVKMVVDEDLEGFGFTGKNEVINGKAAVIGFLLLLDFELLTGKGLLKGTGFLDFIYSISDAFK
        KAQPGVTATDVNVDEVKMVVDEDLEGFGFTGKNEVINGKAAVIGFLLLLDFELLTGKGLLKGTGFLDFIYS+SDAFK
Subjt:  KAQPGVTATDVNVDEVKMVVDEDLEGFGFTGKNEVINGKAAVIGFLLLLDFELLTGKGLLKGTGFLDFIYSISDAFK

A0A6J1ECE7 uncharacterized protein LOC1114327901.3e-14191.7Show/hide
Query:  MAVNSTNLTSHFAPTFRRTHPISAPCTAALSLLKPALHRSLFASPPPAARKISCKASEISVAEESSASGNWVPVVPLSALPRGERRVIIQGGETILLLWY
        MAVNSTNLTSHFA  FRR HPIS+PCTAAL +LKPAL RSLFASPP   RKI CKASE+SVAEESSASGNWVPVVPLSALP+GERRVIIQGGET+LLLWY
Subjt:  MAVNSTNLTSHFAPTFRRTHPISAPCTAALSLLKPALHRSLFASPPPAARKISCKASEISVAEESSASGNWVPVVPLSALPRGERRVIIQGGETILLLWY

Query:  KDKIFAIENRSPAEGAYTEGLLNAKLTQDGCIVCPTTDSTFDLQTGDIKEWYPKNPVLRVLTPALRKLFIYPVKTDEDNIYINMRGNVIPDSSAEIVFSG
        K++IFAIENRSPAEGAYTEGLLNAKLT+DGCIVCPTTDSTFDLQTGDIKEWYP NPVLRVLTPALRKLF+YPVKTDE+NIYINMRGNVI DSSAEIVFSG
Subjt:  KDKIFAIENRSPAEGAYTEGLLNAKLTQDGCIVCPTTDSTFDLQTGDIKEWYPKNPVLRVLTPALRKLFIYPVKTDEDNIYINMRGNVIPDSSAEIVFSG

Query:  KAQPGVTATDVNVDEVKMVVDEDLEGFGFTGKNEVINGKAAVIGFLLLLDFELLTGKGLLKGTGFLDFIYSISDAFK
        KAQPGVTATDVNVDEVKMVVDE+LEGFGFTGKNEVINGKAAVIGFLLLLDFELLTGKGLLKGTGFLDFIYS SDAFK
Subjt:  KAQPGVTATDVNVDEVKMVVDEDLEGFGFTGKNEVINGKAAVIGFLLLLDFELLTGKGLLKGTGFLDFIYSISDAFK

A0A6J1IA65 uncharacterized protein LOC1114725995.7e-14292.06Show/hide
Query:  MAVNSTNLTSHFAPTFRRTHPISAPCTAALSLLKPALHRSLFASPPPAARKISCKASEISVAEESSASGNWVPVVPLSALPRGERRVIIQGGETILLLWY
        MAVNSTNLTSHFA  FRRTHPIS+PCTAAL +LKP L RSLFASPP   RKI CKASE+SVAEESSASGNWVPVVPLSALP+GERRVIIQGGET+LLLWY
Subjt:  MAVNSTNLTSHFAPTFRRTHPISAPCTAALSLLKPALHRSLFASPPPAARKISCKASEISVAEESSASGNWVPVVPLSALPRGERRVIIQGGETILLLWY

Query:  KDKIFAIENRSPAEGAYTEGLLNAKLTQDGCIVCPTTDSTFDLQTGDIKEWYPKNPVLRVLTPALRKLFIYPVKTDEDNIYINMRGNVIPDSSAEIVFSG
        K+KIFAIENRSPAEGAYTEGLLNAKLT+DGCIVCPTTDSTFDLQTGDIKEWYP NPVLRVLTPALRKLF+YPVKTDE+NIYINMRGNVI DSSAEIVFSG
Subjt:  KDKIFAIENRSPAEGAYTEGLLNAKLTQDGCIVCPTTDSTFDLQTGDIKEWYPKNPVLRVLTPALRKLFIYPVKTDEDNIYINMRGNVIPDSSAEIVFSG

Query:  KAQPGVTATDVNVDEVKMVVDEDLEGFGFTGKNEVINGKAAVIGFLLLLDFELLTGKGLLKGTGFLDFIYSISDAFK
        KAQPGVTATDVNVDEVKMVVDE+LEGFGFTGKNEVINGKAAVIGFLLLLDFELLTGKGLLKGTGFLDFIYS SDAFK
Subjt:  KAQPGVTATDVNVDEVKMVVDEDLEGFGFTGKNEVINGKAAVIGFLLLLDFELLTGKGLLKGTGFLDFIYSISDAFK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G71500.1 Rieske (2Fe-2S) domain-containing protein1.1e-10278.85Show/hide
Query:  ISCKASEISVAEESSASG-NWVPVVPLSALPRGERRVIIQGGETILLLWYKDKIFAIENRSPAEGAYTEGLLNAKLTQDGCIVCPTTDSTFDLQTGDIKE
        + C+A+E+S +   S  G NWVPVVPLSALP+GERRV+IQ  ETILLLWYK+ +FAIENRSPAEGAY+EGLLNA+LTQDGCIVCP+TDSTFDL+TG+I+E
Subjt:  ISCKASEISVAEESSASG-NWVPVVPLSALPRGERRVIIQGGETILLLWYKDKIFAIENRSPAEGAYTEGLLNAKLTQDGCIVCPTTDSTFDLQTGDIKE

Query:  WYPKNPVLRVLTPALRKLFIYPVKTDEDNIYINMRGNVIPDSSAEIVFSGKAQPGVTATDVNVDEVKMVVDEDLEGFGFTGKNEVINGKAAVIGFLLLLD
        WYPKNPVLRVLTPALRKLF+YPVK DE+NIYI++R +   +++AEIVFSGKAQPG+TAT+VNVDEV+M+VDE  EGFGFT KNEVINGKAAVIGFLLLLD
Subjt:  WYPKNPVLRVLTPALRKLFIYPVKTDEDNIYINMRGNVIPDSSAEIVFSGKAQPGVTATDVNVDEVKMVVDEDLEGFGFTGKNEVINGKAAVIGFLLLLD

Query:  FELLTGKGLLKGTGFLDFIYSISDAFK
        FELLTGKGLLKGTGFLDF+YS SDAFK
Subjt:  FELLTGKGLLKGTGFLDFIYSISDAFK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCGTCAATTCCACTAATCTCACTTCCCATTTTGCCCCTACATTCCGCCGGACCCACCCCATCTCTGCACCATGCACCGCCGCCCTGTCCCTTCTGAAACCCGCCCT
TCACCGCTCCCTTTTCGCTTCTCCTCCGCCCGCTGCCCGGAAAATCTCCTGCAAAGCCTCCGAGATCTCGGTGGCCGAGGAATCTTCGGCATCTGGTAACTGGGTTCCGG
TGGTTCCATTGTCGGCGCTGCCCAGAGGGGAGCGGCGCGTGATTATTCAGGGCGGTGAAACTATTTTGCTTCTTTGGTATAAGGATAAGATTTTTGCTATTGAGAATCGG
TCCCCTGCTGAAGGTGCTTACACTGAAGGTCTCCTCAATGCCAAGCTTACTCAGGATGGCTGTATTGTCTGTCCAACGACAGATAGCACATTTGACCTCCAAACTGGAGA
TATCAAGGAATGGTATCCAAAGAACCCAGTCCTCAGAGTCCTCACGCCAGCCTTAAGGAAGCTTTTCATATACCCTGTTAAAACTGATGAAGATAACATCTATATCAACA
TGAGAGGAAATGTAATACCAGATTCATCTGCTGAAATTGTCTTCAGTGGGAAAGCTCAACCTGGTGTAACTGCGACTGATGTCAATGTGGACGAGGTGAAAATGGTGGTT
GATGAAGATCTTGAAGGGTTTGGCTTCACTGGAAAGAATGAAGTGATAAATGGAAAGGCAGCAGTGATTGGCTTTTTGCTGTTGTTGGATTTTGAGCTTCTAACTGGTAA
GGGTCTTCTCAAGGGAACTGGTTTCTTGGACTTTATTTATTCTATTTCAGATGCTTTCAAATAG
mRNA sequenceShow/hide mRNA sequence
ATGGCCGTCAATTCCACTAATCTCACTTCCCATTTTGCCCCTACATTCCGCCGGACCCACCCCATCTCTGCACCATGCACCGCCGCCCTGTCCCTTCTGAAACCCGCCCT
TCACCGCTCCCTTTTCGCTTCTCCTCCGCCCGCTGCCCGGAAAATCTCCTGCAAAGCCTCCGAGATCTCGGTGGCCGAGGAATCTTCGGCATCTGGTAACTGGGTTCCGG
TGGTTCCATTGTCGGCGCTGCCCAGAGGGGAGCGGCGCGTGATTATTCAGGGCGGTGAAACTATTTTGCTTCTTTGGTATAAGGATAAGATTTTTGCTATTGAGAATCGG
TCCCCTGCTGAAGGTGCTTACACTGAAGGTCTCCTCAATGCCAAGCTTACTCAGGATGGCTGTATTGTCTGTCCAACGACAGATAGCACATTTGACCTCCAAACTGGAGA
TATCAAGGAATGGTATCCAAAGAACCCAGTCCTCAGAGTCCTCACGCCAGCCTTAAGGAAGCTTTTCATATACCCTGTTAAAACTGATGAAGATAACATCTATATCAACA
TGAGAGGAAATGTAATACCAGATTCATCTGCTGAAATTGTCTTCAGTGGGAAAGCTCAACCTGGTGTAACTGCGACTGATGTCAATGTGGACGAGGTGAAAATGGTGGTT
GATGAAGATCTTGAAGGGTTTGGCTTCACTGGAAAGAATGAAGTGATAAATGGAAAGGCAGCAGTGATTGGCTTTTTGCTGTTGTTGGATTTTGAGCTTCTAACTGGTAA
GGGTCTTCTCAAGGGAACTGGTTTCTTGGACTTTATTTATTCTATTTCAGATGCTTTCAAATAG
Protein sequenceShow/hide protein sequence
MAVNSTNLTSHFAPTFRRTHPISAPCTAALSLLKPALHRSLFASPPPAARKISCKASEISVAEESSASGNWVPVVPLSALPRGERRVIIQGGETILLLWYKDKIFAIENR
SPAEGAYTEGLLNAKLTQDGCIVCPTTDSTFDLQTGDIKEWYPKNPVLRVLTPALRKLFIYPVKTDEDNIYINMRGNVIPDSSAEIVFSGKAQPGVTATDVNVDEVKMVV
DEDLEGFGFTGKNEVINGKAAVIGFLLLLDFELLTGKGLLKGTGFLDFIYSISDAFK