CuGenDBv2

Lag0039660 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0039660
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Rieske domain-containing protein
Genome location: chr2:47889113..47890957
RNA-Seq Expression: Lag0039660
Synteny: Lag0039660
Gene Ontology terms:
GO:0042128 - nitrate assimilation (biological process)
GO:0031967 - organelle envelope (cellular component)
GO:0008942 - nitrite reductase [NAD(P)H] activity (molecular function)
GO:0046872 - metal ion binding (molecular function)
GO:0051537 - 2 iron, 2 sulfur cluster binding (molecular function)
InterPro domains:
IPR012748 - Rieske-like [2Fe-2S] domain, NirD-type
IPR017941 - Rieske [2Fe-2S] iron-sulphur domain
IPR036922 - Rieske [2Fe-2S] iron-sulphur domain superfamily
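
The genome location field follows a `seqid:start..end` pattern. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not state its convention), the span covered by the gene could be derived as below. The names `parse_location` and `span_length` are hypothetical helpers, not part of CuGenDB.

```python
import re

def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'seqid:start..end' string into its three parts."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    seqid, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    return seqid, start, end

def span_length(start: int, end: int) -> int:
    # Assumption: 1-based, inclusive coordinates.
    return end - start + 1

seqid, start, end = parse_location("chr2:47889113..47890957")
print(seqid, span_length(start, end))  # chr2 1845
```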


Homology
GenBank top hits
XP_004141552.1 uncharacterized protein LOC101206141 [Cucumis sativus]; e value: 1.6e-138; % identity: 92.42
Query:  MAVNPTNLTSHFPPTFRRTHPISAPCTAALPLLKPALRPSLFAS---AARKISCKASEISVAEESSASGNWVPVVPLSALPRGERRVIIQGGETILLLWY
        MA+N TN TSHF  TFRR HPISAPCTAALPLLKPAL  SLFAS    ARKISCKASEISVAEESSASGNWVPVVPLSALPRGERRVIIQGGETILLLWY
Subjt:  MAVNPTNLTSHFPPTFRRTHPISAPCTAALPLLKPALRPSLFAS---AARKISCKASEISVAEESSASGNWVPVVPLSALPRGERRVIIQGGETILLLWY

Query:  KDKIFAIENRSPAEGAYTEGLLNAKLTKDGCIVCPTTDSAFDLQTGDIKEWYPKNPVLRVLTPALRKLFIYPVKTDDDNIYINMRGNVISGSSAEIVFSG
        KD+IFAIENRSPAEGAYTEGLLNAKLTKDGCIVCPTTDS FDLQTG+IKEWYPKNPVLRVLTPALRKLFIY VKTD++NIYINMRGNVIS SSAEIVFSG
Subjt:  KDKIFAIENRSPAEGAYTEGLLNAKLTKDGCIVCPTTDSAFDLQTGDIKEWYPKNPVLRVLTPALRKLFIYPVKTDDDNIYINMRGNVISGSSAEIVFSG

Query:  KAQPGVTATDVNVDEVKMVVDEDLEGFGFTGKNEVINGKAAVIGFLLLLDFELLTGKGLLKGTGFLDFIYSASDAFK
        KAQPGVTATDVNVDEV+MVVDEDLEGFGFTGKNEVINGKAAVIGFLLLLDFELLTGKGLLKGTGFLDFIYS SDAFK
Subjt:  KAQPGVTATDVNVDEVKMVVDEDLEGFGFTGKNEVINGKAAVIGFLLLLDFELLTGKGLLKGTGFLDFIYSASDAFK

XP_008459624.1 PREDICTED: uncharacterized protein LOC103498694 [Cucumis melo]; e value: 1.4e-139; % identity: 92.78
Query:  MAVNPTNLTSHFPPTFRRTHPISAPCTAALPLLKPALRPSLFAS---AARKISCKASEISVAEESSASGNWVPVVPLSALPRGERRVIIQGGETILLLWY
        MA+N TN TSHFP TFRR HPISAPCTAALPLLKPAL  SL AS    ARKISCKASEISVAEESSASGNWVPVVPLSALPRGERRVIIQGGETILLLWY
Subjt:  MAVNPTNLTSHFPPTFRRTHPISAPCTAALPLLKPALRPSLFAS---AARKISCKASEISVAEESSASGNWVPVVPLSALPRGERRVIIQGGETILLLWY

Query:  KDKIFAIENRSPAEGAYTEGLLNAKLTKDGCIVCPTTDSAFDLQTGDIKEWYPKNPVLRVLTPALRKLFIYPVKTDDDNIYINMRGNVISGSSAEIVFSG
        KD+IFAIENRSPAEGAYTEGLLNAKLTKDGCIVCPTTDS FDLQTGDIKEWYP NPVLRVLTPALRKLFIYPVKTD++NIYINMRGNVIS +SAEIVFSG
Subjt:  KDKIFAIENRSPAEGAYTEGLLNAKLTKDGCIVCPTTDSAFDLQTGDIKEWYPKNPVLRVLTPALRKLFIYPVKTDDDNIYINMRGNVISGSSAEIVFSG

Query:  KAQPGVTATDVNVDEVKMVVDEDLEGFGFTGKNEVINGKAAVIGFLLLLDFELLTGKGLLKGTGFLDFIYSASDAFK
        KAQPGVTATDVNVDEVKMVVDEDLEGFGFTGKNEVINGKAAVIGFLLLLDFELLTGKGLLKGTGFLDFIYS SDAFK
Subjt:  KAQPGVTATDVNVDEVKMVVDEDLEGFGFTGKNEVINGKAAVIGFLLLLDFELLTGKGLLKGTGFLDFIYSASDAFK

XP_022925511.1 uncharacterized protein LOC111432790 [Cucurbita moschata]; e value: 1.7e-137; % identity: 90.61
Query:  MAVNPTNLTSHFPPTFRRTHPISAPCTAALPLLKPALRPSLFA---SAARKISCKASEISVAEESSASGNWVPVVPLSALPRGERRVIIQGGETILLLWY
        MAVN TNLTSHF   FRR HPIS+PCTAALP+LKPALR SLFA   S  RKI CKASE+SVAEESSASGNWVPVVPLSALP+GERRVIIQGGET+LLLWY
Subjt:  MAVNPTNLTSHFPPTFRRTHPISAPCTAALPLLKPALRPSLFA---SAARKISCKASEISVAEESSASGNWVPVVPLSALPRGERRVIIQGGETILLLWY

Query:  KDKIFAIENRSPAEGAYTEGLLNAKLTKDGCIVCPTTDSAFDLQTGDIKEWYPKNPVLRVLTPALRKLFIYPVKTDDDNIYINMRGNVISGSSAEIVFSG
        K++IFAIENRSPAEGAYTEGLLNAKLTKDGCIVCPTTDS FDLQTGDIKEWYP NPVLRVLTPALRKLF+YPVKTD++NIYINMRGNVIS SSAEIVFSG
Subjt:  KDKIFAIENRSPAEGAYTEGLLNAKLTKDGCIVCPTTDSAFDLQTGDIKEWYPKNPVLRVLTPALRKLFIYPVKTDDDNIYINMRGNVISGSSAEIVFSG

Query:  KAQPGVTATDVNVDEVKMVVDEDLEGFGFTGKNEVINGKAAVIGFLLLLDFELLTGKGLLKGTGFLDFIYSASDAFK
        KAQPGVTATDVNVDEVKMVVDE+LEGFGFTGKNEVINGKAAVIGFLLLLDFELLTGKGLLKGTGFLDFIYSASDAFK
Subjt:  KAQPGVTATDVNVDEVKMVVDEDLEGFGFTGKNEVINGKAAVIGFLLLLDFELLTGKGLLKGTGFLDFIYSASDAFK

XP_022973986.1 uncharacterized protein LOC111472599 [Cucurbita maxima]; e value: 7.8e-138; % identity: 90.97
Query:  MAVNPTNLTSHFPPTFRRTHPISAPCTAALPLLKPALRPSLFA---SAARKISCKASEISVAEESSASGNWVPVVPLSALPRGERRVIIQGGETILLLWY
        MAVN TNLTSHF   FRRTHPIS+PCTAALP+LKP LR SLFA   S  RKI CKASE+SVAEESSASGNWVPVVPLSALP+GERRVIIQGGET+LLLWY
Subjt:  MAVNPTNLTSHFPPTFRRTHPISAPCTAALPLLKPALRPSLFA---SAARKISCKASEISVAEESSASGNWVPVVPLSALPRGERRVIIQGGETILLLWY

Query:  KDKIFAIENRSPAEGAYTEGLLNAKLTKDGCIVCPTTDSAFDLQTGDIKEWYPKNPVLRVLTPALRKLFIYPVKTDDDNIYINMRGNVISGSSAEIVFSG
        K+KIFAIENRSPAEGAYTEGLLNAKLTKDGCIVCPTTDS FDLQTGDIKEWYP NPVLRVLTPALRKLF+YPVKTD++NIYINMRGNVIS SSAEIVFSG
Subjt:  KDKIFAIENRSPAEGAYTEGLLNAKLTKDGCIVCPTTDSAFDLQTGDIKEWYPKNPVLRVLTPALRKLFIYPVKTDDDNIYINMRGNVISGSSAEIVFSG

Query:  KAQPGVTATDVNVDEVKMVVDEDLEGFGFTGKNEVINGKAAVIGFLLLLDFELLTGKGLLKGTGFLDFIYSASDAFK
        KAQPGVTATDVNVDEVKMVVDE+LEGFGFTGKNEVINGKAAVIGFLLLLDFELLTGKGLLKGTGFLDFIYSASDAFK
Subjt:  KAQPGVTATDVNVDEVKMVVDEDLEGFGFTGKNEVINGKAAVIGFLLLLDFELLTGKGLLKGTGFLDFIYSASDAFK

XP_038891275.1 uncharacterized protein LOC120080616 [Benincasa hispida]; e value: 1.2e-143; % identity: 95.31
Query:  MAVNPTNLTSHFPPTFRRTHPISAPCTAALPLLKPALRPSLFAS---AARKISCKASEISVAEESSASGNWVPVVPLSALPRGERRVIIQGGETILLLWY
        MAVN TNLTSHF PTFRRTHPISAPCTAALPLLKPA+  SLFAS   A+RKISCKASEISVAEESSASGNWVPVVPLSALPRGERRVIIQGGETILLLWY
Subjt:  MAVNPTNLTSHFPPTFRRTHPISAPCTAALPLLKPALRPSLFAS---AARKISCKASEISVAEESSASGNWVPVVPLSALPRGERRVIIQGGETILLLWY

Query:  KDKIFAIENRSPAEGAYTEGLLNAKLTKDGCIVCPTTDSAFDLQTGDIKEWYPKNPVLRVLTPALRKLFIYPVKTDDDNIYINMRGNVISGSSAEIVFSG
        KDKIFAIENRSPAEGAYTEGLLNAKLTKDGCIVCPTTDS FDLQTGDIKEWYPKNPVLRVLTPALRKLFIYPVKTD+DNIYINMRGNVIS SSAEIVFSG
Subjt:  KDKIFAIENRSPAEGAYTEGLLNAKLTKDGCIVCPTTDSAFDLQTGDIKEWYPKNPVLRVLTPALRKLFIYPVKTDDDNIYINMRGNVISGSSAEIVFSG

Query:  KAQPGVTATDVNVDEVKMVVDEDLEGFGFTGKNEVINGKAAVIGFLLLLDFELLTGKGLLKGTGFLDFIYSASDAFK
        KAQPGVTATDVNVDEVKMVVDEDLEGFGFTGKNEVINGKAAVIGFLLLLDFELLTGKGLLKGTGFLDFIYS SDAFK
Subjt:  KAQPGVTATDVNVDEVKMVVDEDLEGFGFTGKNEVINGKAAVIGFLLLLDFELLTGKGLLKGTGFLDFIYSASDAFK

TrEMBL top hits
A0A0A0KXX5 Rieske domain-containing protein; e value: 7.6e-139; % identity: 92.42
Query:  MAVNPTNLTSHFPPTFRRTHPISAPCTAALPLLKPALRPSLFAS---AARKISCKASEISVAEESSASGNWVPVVPLSALPRGERRVIIQGGETILLLWY
        MA+N TN TSHF  TFRR HPISAPCTAALPLLKPAL  SLFAS    ARKISCKASEISVAEESSASGNWVPVVPLSALPRGERRVIIQGGETILLLWY
Subjt:  MAVNPTNLTSHFPPTFRRTHPISAPCTAALPLLKPALRPSLFAS---AARKISCKASEISVAEESSASGNWVPVVPLSALPRGERRVIIQGGETILLLWY

Query:  KDKIFAIENRSPAEGAYTEGLLNAKLTKDGCIVCPTTDSAFDLQTGDIKEWYPKNPVLRVLTPALRKLFIYPVKTDDDNIYINMRGNVISGSSAEIVFSG
        KD+IFAIENRSPAEGAYTEGLLNAKLTKDGCIVCPTTDS FDLQTG+IKEWYPKNPVLRVLTPALRKLFIY VKTD++NIYINMRGNVIS SSAEIVFSG
Subjt:  KDKIFAIENRSPAEGAYTEGLLNAKLTKDGCIVCPTTDSAFDLQTGDIKEWYPKNPVLRVLTPALRKLFIYPVKTDDDNIYINMRGNVISGSSAEIVFSG

Query:  KAQPGVTATDVNVDEVKMVVDEDLEGFGFTGKNEVINGKAAVIGFLLLLDFELLTGKGLLKGTGFLDFIYSASDAFK
        KAQPGVTATDVNVDEV+MVVDEDLEGFGFTGKNEVINGKAAVIGFLLLLDFELLTGKGLLKGTGFLDFIYS SDAFK
Subjt:  KAQPGVTATDVNVDEVKMVVDEDLEGFGFTGKNEVINGKAAVIGFLLLLDFELLTGKGLLKGTGFLDFIYSASDAFK

A0A1S3CBV1 uncharacterized protein LOC103498694; e value: 6.8e-140; % identity: 92.78
Query:  MAVNPTNLTSHFPPTFRRTHPISAPCTAALPLLKPALRPSLFAS---AARKISCKASEISVAEESSASGNWVPVVPLSALPRGERRVIIQGGETILLLWY
        MA+N TN TSHFP TFRR HPISAPCTAALPLLKPAL  SL AS    ARKISCKASEISVAEESSASGNWVPVVPLSALPRGERRVIIQGGETILLLWY
Subjt:  MAVNPTNLTSHFPPTFRRTHPISAPCTAALPLLKPALRPSLFAS---AARKISCKASEISVAEESSASGNWVPVVPLSALPRGERRVIIQGGETILLLWY

Query:  KDKIFAIENRSPAEGAYTEGLLNAKLTKDGCIVCPTTDSAFDLQTGDIKEWYPKNPVLRVLTPALRKLFIYPVKTDDDNIYINMRGNVISGSSAEIVFSG
        KD+IFAIENRSPAEGAYTEGLLNAKLTKDGCIVCPTTDS FDLQTGDIKEWYP NPVLRVLTPALRKLFIYPVKTD++NIYINMRGNVIS +SAEIVFSG
Subjt:  KDKIFAIENRSPAEGAYTEGLLNAKLTKDGCIVCPTTDSAFDLQTGDIKEWYPKNPVLRVLTPALRKLFIYPVKTDDDNIYINMRGNVISGSSAEIVFSG

Query:  KAQPGVTATDVNVDEVKMVVDEDLEGFGFTGKNEVINGKAAVIGFLLLLDFELLTGKGLLKGTGFLDFIYSASDAFK
        KAQPGVTATDVNVDEVKMVVDEDLEGFGFTGKNEVINGKAAVIGFLLLLDFELLTGKGLLKGTGFLDFIYS SDAFK
Subjt:  KAQPGVTATDVNVDEVKMVVDEDLEGFGFTGKNEVINGKAAVIGFLLLLDFELLTGKGLLKGTGFLDFIYSASDAFK

A0A5D3BMM3 Rieske domain-containing protein; e value: 6.8e-140; % identity: 92.78
Query:  MAVNPTNLTSHFPPTFRRTHPISAPCTAALPLLKPALRPSLFAS---AARKISCKASEISVAEESSASGNWVPVVPLSALPRGERRVIIQGGETILLLWY
        MA+N TN TSHFP TFRR HPISAPCTAALPLLKPAL  SL AS    ARKISCKASEISVAEESSASGNWVPVVPLSALPRGERRVIIQGGETILLLWY
Subjt:  MAVNPTNLTSHFPPTFRRTHPISAPCTAALPLLKPALRPSLFAS---AARKISCKASEISVAEESSASGNWVPVVPLSALPRGERRVIIQGGETILLLWY

Query:  KDKIFAIENRSPAEGAYTEGLLNAKLTKDGCIVCPTTDSAFDLQTGDIKEWYPKNPVLRVLTPALRKLFIYPVKTDDDNIYINMRGNVISGSSAEIVFSG
        KD+IFAIENRSPAEGAYTEGLLNAKLTKDGCIVCPTTDS FDLQTGDIKEWYP NPVLRVLTPALRKLFIYPVKTD++NIYINMRGNVIS +SAEIVFSG
Subjt:  KDKIFAIENRSPAEGAYTEGLLNAKLTKDGCIVCPTTDSAFDLQTGDIKEWYPKNPVLRVLTPALRKLFIYPVKTDDDNIYINMRGNVISGSSAEIVFSG

Query:  KAQPGVTATDVNVDEVKMVVDEDLEGFGFTGKNEVINGKAAVIGFLLLLDFELLTGKGLLKGTGFLDFIYSASDAFK
        KAQPGVTATDVNVDEVKMVVDEDLEGFGFTGKNEVINGKAAVIGFLLLLDFELLTGKGLLKGTGFLDFIYS SDAFK
Subjt:  KAQPGVTATDVNVDEVKMVVDEDLEGFGFTGKNEVINGKAAVIGFLLLLDFELLTGKGLLKGTGFLDFIYSASDAFK

A0A6J1ECE7 uncharacterized protein LOC111432790; e value: 8.4e-138; % identity: 90.61
Query:  MAVNPTNLTSHFPPTFRRTHPISAPCTAALPLLKPALRPSLFA---SAARKISCKASEISVAEESSASGNWVPVVPLSALPRGERRVIIQGGETILLLWY
        MAVN TNLTSHF   FRR HPIS+PCTAALP+LKPALR SLFA   S  RKI CKASE+SVAEESSASGNWVPVVPLSALP+GERRVIIQGGET+LLLWY
Subjt:  MAVNPTNLTSHFPPTFRRTHPISAPCTAALPLLKPALRPSLFA---SAARKISCKASEISVAEESSASGNWVPVVPLSALPRGERRVIIQGGETILLLWY

Query:  KDKIFAIENRSPAEGAYTEGLLNAKLTKDGCIVCPTTDSAFDLQTGDIKEWYPKNPVLRVLTPALRKLFIYPVKTDDDNIYINMRGNVISGSSAEIVFSG
        K++IFAIENRSPAEGAYTEGLLNAKLTKDGCIVCPTTDS FDLQTGDIKEWYP NPVLRVLTPALRKLF+YPVKTD++NIYINMRGNVIS SSAEIVFSG
Subjt:  KDKIFAIENRSPAEGAYTEGLLNAKLTKDGCIVCPTTDSAFDLQTGDIKEWYPKNPVLRVLTPALRKLFIYPVKTDDDNIYINMRGNVISGSSAEIVFSG

Query:  KAQPGVTATDVNVDEVKMVVDEDLEGFGFTGKNEVINGKAAVIGFLLLLDFELLTGKGLLKGTGFLDFIYSASDAFK
        KAQPGVTATDVNVDEVKMVVDE+LEGFGFTGKNEVINGKAAVIGFLLLLDFELLTGKGLLKGTGFLDFIYSASDAFK
Subjt:  KAQPGVTATDVNVDEVKMVVDEDLEGFGFTGKNEVINGKAAVIGFLLLLDFELLTGKGLLKGTGFLDFIYSASDAFK

A0A6J1IA65 uncharacterized protein LOC111472599; e value: 3.8e-138; % identity: 90.97
Query:  MAVNPTNLTSHFPPTFRRTHPISAPCTAALPLLKPALRPSLFA---SAARKISCKASEISVAEESSASGNWVPVVPLSALPRGERRVIIQGGETILLLWY
        MAVN TNLTSHF   FRRTHPIS+PCTAALP+LKP LR SLFA   S  RKI CKASE+SVAEESSASGNWVPVVPLSALP+GERRVIIQGGET+LLLWY
Subjt:  MAVNPTNLTSHFPPTFRRTHPISAPCTAALPLLKPALRPSLFA---SAARKISCKASEISVAEESSASGNWVPVVPLSALPRGERRVIIQGGETILLLWY

Query:  KDKIFAIENRSPAEGAYTEGLLNAKLTKDGCIVCPTTDSAFDLQTGDIKEWYPKNPVLRVLTPALRKLFIYPVKTDDDNIYINMRGNVISGSSAEIVFSG
        K+KIFAIENRSPAEGAYTEGLLNAKLTKDGCIVCPTTDS FDLQTGDIKEWYP NPVLRVLTPALRKLF+YPVKTD++NIYINMRGNVIS SSAEIVFSG
Subjt:  KDKIFAIENRSPAEGAYTEGLLNAKLTKDGCIVCPTTDSAFDLQTGDIKEWYPKNPVLRVLTPALRKLFIYPVKTDDDNIYINMRGNVISGSSAEIVFSG

Query:  KAQPGVTATDVNVDEVKMVVDEDLEGFGFTGKNEVINGKAAVIGFLLLLDFELLTGKGLLKGTGFLDFIYSASDAFK
        KAQPGVTATDVNVDEVKMVVDE+LEGFGFTGKNEVINGKAAVIGFLLLLDFELLTGKGLLKGTGFLDFIYSASDAFK
Subjt:  KAQPGVTATDVNVDEVKMVVDEDLEGFGFTGKNEVINGKAAVIGFLLLLDFELLTGKGLLKGTGFLDFIYSASDAFK

SwissProt top hits
No hits found

Arabidopsis top hits
AT1G71500.1 Rieske (2Fe-2S) domain-containing protein; e value: 1.2e-101; % identity: 77.97
Query:  ISCKASEISVAEESSASG-NWVPVVPLSALPRGERRVIIQGGETILLLWYKDKIFAIENRSPAEGAYTEGLLNAKLTKDGCIVCPTTDSAFDLQTGDIKE
        + C+A+E+S +   S  G NWVPVVPLSALP+GERRV+IQ  ETILLLWYK+ +FAIENRSPAEGAY+EGLLNA+LT+DGCIVCP+TDS FDL+TG+I+E
Subjt:  ISCKASEISVAEESSASG-NWVPVVPLSALPRGERRVIIQGGETILLLWYKDKIFAIENRSPAEGAYTEGLLNAKLTKDGCIVCPTTDSAFDLQTGDIKE

Query:  WYPKNPVLRVLTPALRKLFIYPVKTDDDNIYINMRGNVISGSSAEIVFSGKAQPGVTATDVNVDEVKMVVDEDLEGFGFTGKNEVINGKAAVIGFLLLLD
        WYPKNPVLRVLTPALRKLF+YPVK D++NIYI++R +  + ++AEIVFSGKAQPG+TAT+VNVDEV+M+VDE  EGFGFT KNEVINGKAAVIGFLLLLD
Subjt:  WYPKNPVLRVLTPALRKLFIYPVKTDDDNIYINMRGNVISGSSAEIVFSGKAQPGVTATDVNVDEVKMVVDEDLEGFGFTGKNEVINGKAAVIGFLLLLD

Query:  FELLTGKGLLKGTGFLDFIYSASDAFK
        FELLTGKGLLKGTGFLDF+YSASDAFK
Subjt:  FELLTGKGLLKGTGFLDFIYSASDAFK
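
In the alignments above, the line between Query and Subjt appears to follow the usual BLAST midline convention: a letter where the two residues are identical, `+` for a similar residue, and a space otherwise. A rough sketch, assuming that convention, of how identity could be recomputed for one alignment segment. The helper `segment_identity` is hypothetical, and the reported % identity values cover the whole alignment rather than a single segment.

```python
def segment_identity(query: str, midline: str) -> tuple[int, int]:
    """Return (identical positions, aligned columns) for one segment."""
    if len(query) != len(midline):
        raise ValueError("query and midline must have the same length")
    # Letters in the midline mark identical residues; '+' and ' ' do not.
    identical = sum(1 for ch in midline if ch.isalpha())
    return identical, len(query)

# Last segment of the AT1G71500.1 alignment, copied from the record.
query = "FELLTGKGLLKGTGFLDFIYSASDAFK"
midline = "FELLTGKGLLKGTGFLDF+YSASDAFK"
identical, columns = segment_identity(query, midline)
print(f"{identical}/{columns} = {100 * identical / columns:.2f}%")  # 26/27 = 96.30%
```

Summing the two counts over every segment of a hit, then dividing, would give a whole-alignment figure comparable to the % identity column.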


Sequences
CDS sequence
ATGGCCGTCAATCCCACAAATCTCACTTCCCATTTTCCCCCCACATTCCGCCGGACCCACCCGATCTCCGCCCCATGCACAGCCGCCCTGCCCCTTCTCAAACCCGCCCT
CCGCCCCTCCCTTTTCGCATCCGCTGCCCGGAAAATCTCCTGCAAAGCGTCCGAGATCTCCGTGGCCGAGGAATCGTCCGCGTCTGGTAACTGGGTGCCGGTGGTTCCCT
TGTCGGCGCTGCCGAGAGGGGAGCGGCGCGTGATTATTCAGGGCGGTGAGACGATTTTGCTCCTTTGGTATAAGGATAAGATTTTTGCTATTGAGAATAGGTCTCCTGCT
GAAGGTGCTTACACTGAAGGCCTCCTCAATGCCAAGCTAACCAAGGATGGCTGTATTGTCTGTCCAACTACGGATAGCGCATTTGACCTCCAAACTGGAGACATTAAGGA
ATGGTATCCAAAGAACCCAGTCCTCAGAGTCCTCACACCAGCCTTAAGGAAGCTTTTCATATACCCTGTAAAAACTGATGATGACAACATCTATATCAACATGAGAGGAA
ATGTAATATCAGGTTCATCTGCTGAGATTGTCTTCAGTGGTAAAGCTCAACCTGGTGTAACTGCAACCGATGTCAATGTTGACGAGGTGAAAATGGTGGTCGATGAAGAT
CTTGAAGGGTTTGGCTTTACTGGAAAGAATGAAGTGATAAATGGAAAGGCAGCAGTGATTGGCTTCTTGTTGTTGTTGGATTTTGAGCTCCTAACTGGTAAGGGTCTTCT
CAAGGGAACTGGCTTCTTGGACTTCATTTATTCTGCTTCAGATGCTTTCAAATAG
mRNA sequence
ATGGCCGTCAATCCCACAAATCTCACTTCCCATTTTCCCCCCACATTCCGCCGGACCCACCCGATCTCCGCCCCATGCACAGCCGCCCTGCCCCTTCTCAAACCCGCCCT
CCGCCCCTCCCTTTTCGCATCCGCTGCCCGGAAAATCTCCTGCAAAGCGTCCGAGATCTCCGTGGCCGAGGAATCGTCCGCGTCTGGTAACTGGGTGCCGGTGGTTCCCT
TGTCGGCGCTGCCGAGAGGGGAGCGGCGCGTGATTATTCAGGGCGGTGAGACGATTTTGCTCCTTTGGTATAAGGATAAGATTTTTGCTATTGAGAATAGGTCTCCTGCT
GAAGGTGCTTACACTGAAGGCCTCCTCAATGCCAAGCTAACCAAGGATGGCTGTATTGTCTGTCCAACTACGGATAGCGCATTTGACCTCCAAACTGGAGACATTAAGGA
ATGGTATCCAAAGAACCCAGTCCTCAGAGTCCTCACACCAGCCTTAAGGAAGCTTTTCATATACCCTGTAAAAACTGATGATGACAACATCTATATCAACATGAGAGGAA
ATGTAATATCAGGTTCATCTGCTGAGATTGTCTTCAGTGGTAAAGCTCAACCTGGTGTAACTGCAACCGATGTCAATGTTGACGAGGTGAAAATGGTGGTCGATGAAGAT
CTTGAAGGGTTTGGCTTTACTGGAAAGAATGAAGTGATAAATGGAAAGGCAGCAGTGATTGGCTTCTTGTTGTTGTTGGATTTTGAGCTCCTAACTGGTAAGGGTCTTCT
CAAGGGAACTGGCTTCTTGGACTTCATTTATTCTGCTTCAGATGCTTTCAAATAG
Protein sequence
MAVNPTNLTSHFPPTFRRTHPISAPCTAALPLLKPALRPSLFASAARKISCKASEISVAEESSASGNWVPVVPLSALPRGERRVIIQGGETILLLWYKDKIFAIENRSPA
EGAYTEGLLNAKLTKDGCIVCPTTDSAFDLQTGDIKEWYPKNPVLRVLTPALRKLFIYPVKTDDDNIYINMRGNVISGSSAEIVFSGKAQPGVTATDVNVDEVKMVVDED
LEGFGFTGKNEVINGKAAVIGFLLLLDFELLTGKGLLKGTGFLDFIYSASDAFK
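
The CDS and protein sequences above can be cross-checked by translating one into the other. A minimal sketch, assuming the standard genetic code (NCBI translation table 1); `translate` is an illustrative helper rather than a CuGenDB function, and only the first and last codons of the CDS are used here.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard genetic code).
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO)
)

def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        residue = CODON_TABLE[cds[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)

print(translate("ATGGCCGTCAATCCC"))  # MAVNP (start of the CDS)
print(translate("TTCAAATAG"))        # FK (end of the CDS, before the stop)
```

Both fragments agree with the start (`MAVNP`) and end (`FK`) of the listed protein sequence. Running the same function over the full CDS, with line breaks removed, would extend the check to every residue.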