CuGenDBv2

Tan0005148 (gene) of Snake gourd v1 genome

Gene ID: Tan0005148
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Rieske domain-containing protein
Genome location: LG10:63670357..63672531
RNA-Seq Expression: Tan0005148
Synteny: Tan0005148
Gene Ontology terms:
GO:0042128 - nitrate assimilation (biological process)
GO:0031967 - organelle envelope (cellular component)
GO:0008942 - nitrite reductase [NAD(P)H] activity (molecular function)
GO:0046872 - metal ion binding (molecular function)
GO:0051213 - dioxygenase activity (molecular function)
GO:0051537 - 2 iron, 2 sulfur cluster binding (molecular function)
InterPro domains:
IPR012748 - Rieske-like [2Fe-2S] domain, NirD-type
IPR017941 - Rieske [2Fe-2S] iron-sulphur domain
IPR036922 - Rieske [2Fe-2S] iron-sulphur domain superfamily
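
The genome location field in this record has the form scaffold:start..end. A minimal sketch of reading it, assuming the coordinates are 1-based and inclusive (a common convention that this page does not state); `parse_location` is a hypothetical helper, not part of any CuGenDB tool:

```python
import re

def parse_location(loc):
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    m = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", loc)
    if m is None:
        raise ValueError(f"unrecognised location: {loc!r}")
    return m.group(1), int(m.group(2)), int(m.group(3))

seqid, start, end = parse_location("LG10:63670357..63672531")
# Assumption: 1-based inclusive coordinates, so the span is end - start + 1.
span = end - start + 1
print(seqid, start, end, span)
```

Under that assumption the gene spans 2175 bp on LG10.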


Homology
GenBank top hits (e value, % identity, alignment)
KAG6592369.1 hypothetical protein SDJN03_14715, partial [Cucurbita argyrosperma subsp. sororia]; e value 1.1e-142; % identity 92.42
Query:  MAVNPTNLTSHFAPTFRRTHPISTPCTAALPLLKPALRRSLFVSPPSGARKISCKASEISVAEESSASGNWVPVVPLSALPRGERRVIIQGGETILLLWY
        MAVN TNLTSHFA  FRR HPIS+PCTAALP+LKPALRRSLF SPPSG RKI CKASE+SVAEESSASGNWVPVVPLSALP+GERRVIIQGGET+LLLWY
Subjt:  MAVNPTNLTSHFAPTFRRTHPISTPCTAALPLLKPALRRSLFVSPPSGARKISCKASEISVAEESSASGNWVPVVPLSALPRGERRVIIQGGETILLLWY

Query:  KDKIFAIENRSPAEGAYTEGLLNAKLTMDGCIVCPTTDSTFDLQTGDIKEWYPKNPVLRVLTPALRKLFIYPVKTDEDNIYINMRGNVISGSSAEIVFSG
        K++IFAIENRSPAEGAYTEGLLNAKLT DGCIVCPTTDSTFDLQTGDIKEWYP NPVLRVLTPALRKLF+YPVKTDE+NIYINMRGNVIS SSAEIVFSG
Subjt:  KDKIFAIENRSPAEGAYTEGLLNAKLTMDGCIVCPTTDSTFDLQTGDIKEWYPKNPVLRVLTPALRKLFIYPVKTDEDNIYINMRGNVISGSSAEIVFSG

Query:  KAQPGVTATDVNVDEVKMVVDEDLEGFGFTGKNEVINGKAAVIGFLLLLDFELLTGKGLLKGTGFLDFIYSVSDAFK
        KAQPGVTATDVNVDEVKMVVDE+LEGFGFTGKNEVINGKAAVIGFLLLLDFELLTGKGLLKGTGFLDFIYS SDAFK
Subjt:  KAQPGVTATDVNVDEVKMVVDEDLEGFGFTGKNEVINGKAAVIGFLLLLDFELLTGKGLLKGTGFLDFIYSVSDAFK

XP_004141552.1 uncharacterized protein LOC101206141 [Cucumis sativus]; e value 1.5e-141; % identity 93.5
Query:  MAVNPTNLTSHFAPTFRRTHPISTPCTAALPLLKPALRRSLFVSPPSGARKISCKASEISVAEESSASGNWVPVVPLSALPRGERRVIIQGGETILLLWY
        MA+N TN TSHFA TFRR HPIS PCTAALPLLKPAL RSLF SP   ARKISCKASEISVAEESSASGNWVPVVPLSALPRGERRVIIQGGETILLLWY
Subjt:  MAVNPTNLTSHFAPTFRRTHPISTPCTAALPLLKPALRRSLFVSPPSGARKISCKASEISVAEESSASGNWVPVVPLSALPRGERRVIIQGGETILLLWY

Query:  KDKIFAIENRSPAEGAYTEGLLNAKLTMDGCIVCPTTDSTFDLQTGDIKEWYPKNPVLRVLTPALRKLFIYPVKTDEDNIYINMRGNVISGSSAEIVFSG
        KD+IFAIENRSPAEGAYTEGLLNAKLT DGCIVCPTTDSTFDLQTG+IKEWYPKNPVLRVLTPALRKLFIY VKTDE+NIYINMRGNVIS SSAEIVFSG
Subjt:  KDKIFAIENRSPAEGAYTEGLLNAKLTMDGCIVCPTTDSTFDLQTGDIKEWYPKNPVLRVLTPALRKLFIYPVKTDEDNIYINMRGNVISGSSAEIVFSG

Query:  KAQPGVTATDVNVDEVKMVVDEDLEGFGFTGKNEVINGKAAVIGFLLLLDFELLTGKGLLKGTGFLDFIYSVSDAFK
        KAQPGVTATDVNVDEV+MVVDEDLEGFGFTGKNEVINGKAAVIGFLLLLDFELLTGKGLLKGTGFLDFIYSVSDAFK
Subjt:  KAQPGVTATDVNVDEVKMVVDEDLEGFGFTGKNEVINGKAAVIGFLLLLDFELLTGKGLLKGTGFLDFIYSVSDAFK

XP_022925511.1 uncharacterized protein LOC111432790 [Cucurbita moschata]; e value 8.1e-143; % identity 92.42
Query:  MAVNPTNLTSHFAPTFRRTHPISTPCTAALPLLKPALRRSLFVSPPSGARKISCKASEISVAEESSASGNWVPVVPLSALPRGERRVIIQGGETILLLWY
        MAVN TNLTSHFA  FRR HPIS+PCTAALP+LKPALRRSLF SPPSG RKI CKASE+SVAEESSASGNWVPVVPLSALP+GERRVIIQGGET+LLLWY
Subjt:  MAVNPTNLTSHFAPTFRRTHPISTPCTAALPLLKPALRRSLFVSPPSGARKISCKASEISVAEESSASGNWVPVVPLSALPRGERRVIIQGGETILLLWY

Query:  KDKIFAIENRSPAEGAYTEGLLNAKLTMDGCIVCPTTDSTFDLQTGDIKEWYPKNPVLRVLTPALRKLFIYPVKTDEDNIYINMRGNVISGSSAEIVFSG
        K++IFAIENRSPAEGAYTEGLLNAKLT DGCIVCPTTDSTFDLQTGDIKEWYP NPVLRVLTPALRKLF+YPVKTDE+NIYINMRGNVIS SSAEIVFSG
Subjt:  KDKIFAIENRSPAEGAYTEGLLNAKLTMDGCIVCPTTDSTFDLQTGDIKEWYPKNPVLRVLTPALRKLFIYPVKTDEDNIYINMRGNVISGSSAEIVFSG

Query:  KAQPGVTATDVNVDEVKMVVDEDLEGFGFTGKNEVINGKAAVIGFLLLLDFELLTGKGLLKGTGFLDFIYSVSDAFK
        KAQPGVTATDVNVDEVKMVVDE+LEGFGFTGKNEVINGKAAVIGFLLLLDFELLTGKGLLKGTGFLDFIYS SDAFK
Subjt:  KAQPGVTATDVNVDEVKMVVDEDLEGFGFTGKNEVINGKAAVIGFLLLLDFELLTGKGLLKGTGFLDFIYSVSDAFK

XP_022973986.1 uncharacterized protein LOC111472599 [Cucurbita maxima]; e value 3.6e-143; % identity 92.78
Query:  MAVNPTNLTSHFAPTFRRTHPISTPCTAALPLLKPALRRSLFVSPPSGARKISCKASEISVAEESSASGNWVPVVPLSALPRGERRVIIQGGETILLLWY
        MAVN TNLTSHFA  FRRTHPIS+PCTAALP+LKP LRRSLF SPPSG RKI CKASE+SVAEESSASGNWVPVVPLSALP+GERRVIIQGGET+LLLWY
Subjt:  MAVNPTNLTSHFAPTFRRTHPISTPCTAALPLLKPALRRSLFVSPPSGARKISCKASEISVAEESSASGNWVPVVPLSALPRGERRVIIQGGETILLLWY

Query:  KDKIFAIENRSPAEGAYTEGLLNAKLTMDGCIVCPTTDSTFDLQTGDIKEWYPKNPVLRVLTPALRKLFIYPVKTDEDNIYINMRGNVISGSSAEIVFSG
        K+KIFAIENRSPAEGAYTEGLLNAKLT DGCIVCPTTDSTFDLQTGDIKEWYP NPVLRVLTPALRKLF+YPVKTDE+NIYINMRGNVIS SSAEIVFSG
Subjt:  KDKIFAIENRSPAEGAYTEGLLNAKLTMDGCIVCPTTDSTFDLQTGDIKEWYPKNPVLRVLTPALRKLFIYPVKTDEDNIYINMRGNVISGSSAEIVFSG

Query:  KAQPGVTATDVNVDEVKMVVDEDLEGFGFTGKNEVINGKAAVIGFLLLLDFELLTGKGLLKGTGFLDFIYSVSDAFK
        KAQPGVTATDVNVDEVKMVVDE+LEGFGFTGKNEVINGKAAVIGFLLLLDFELLTGKGLLKGTGFLDFIYS SDAFK
Subjt:  KAQPGVTATDVNVDEVKMVVDEDLEGFGFTGKNEVINGKAAVIGFLLLLDFELLTGKGLLKGTGFLDFIYSVSDAFK

XP_038891275.1 uncharacterized protein LOC120080616 [Benincasa hispida]; e value 1.1e-147; % identity 96.39
Query:  MAVNPTNLTSHFAPTFRRTHPISTPCTAALPLLKPALRRSLFVSPPSGARKISCKASEISVAEESSASGNWVPVVPLSALPRGERRVIIQGGETILLLWY
        MAVN TNLTSHFAPTFRRTHPIS PCTAALPLLKPA+ RSLF SPP  +RKISCKASEISVAEESSASGNWVPVVPLSALPRGERRVIIQGGETILLLWY
Subjt:  MAVNPTNLTSHFAPTFRRTHPISTPCTAALPLLKPALRRSLFVSPPSGARKISCKASEISVAEESSASGNWVPVVPLSALPRGERRVIIQGGETILLLWY

Query:  KDKIFAIENRSPAEGAYTEGLLNAKLTMDGCIVCPTTDSTFDLQTGDIKEWYPKNPVLRVLTPALRKLFIYPVKTDEDNIYINMRGNVISGSSAEIVFSG
        KDKIFAIENRSPAEGAYTEGLLNAKLT DGCIVCPTTDSTFDLQTGDIKEWYPKNPVLRVLTPALRKLFIYPVKTDEDNIYINMRGNVIS SSAEIVFSG
Subjt:  KDKIFAIENRSPAEGAYTEGLLNAKLTMDGCIVCPTTDSTFDLQTGDIKEWYPKNPVLRVLTPALRKLFIYPVKTDEDNIYINMRGNVISGSSAEIVFSG

Query:  KAQPGVTATDVNVDEVKMVVDEDLEGFGFTGKNEVINGKAAVIGFLLLLDFELLTGKGLLKGTGFLDFIYSVSDAFK
        KAQPGVTATDVNVDEVKMVVDEDLEGFGFTGKNEVINGKAAVIGFLLLLDFELLTGKGLLKGTGFLDFIYSVSDAFK
Subjt:  KAQPGVTATDVNVDEVKMVVDEDLEGFGFTGKNEVINGKAAVIGFLLLLDFELLTGKGLLKGTGFLDFIYSVSDAFK

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KXX5 Rieske domain-containing protein; e value 7.4e-142; % identity 93.5
Query:  MAVNPTNLTSHFAPTFRRTHPISTPCTAALPLLKPALRRSLFVSPPSGARKISCKASEISVAEESSASGNWVPVVPLSALPRGERRVIIQGGETILLLWY
        MA+N TN TSHFA TFRR HPIS PCTAALPLLKPAL RSLF SP   ARKISCKASEISVAEESSASGNWVPVVPLSALPRGERRVIIQGGETILLLWY
Subjt:  MAVNPTNLTSHFAPTFRRTHPISTPCTAALPLLKPALRRSLFVSPPSGARKISCKASEISVAEESSASGNWVPVVPLSALPRGERRVIIQGGETILLLWY

Query:  KDKIFAIENRSPAEGAYTEGLLNAKLTMDGCIVCPTTDSTFDLQTGDIKEWYPKNPVLRVLTPALRKLFIYPVKTDEDNIYINMRGNVISGSSAEIVFSG
        KD+IFAIENRSPAEGAYTEGLLNAKLT DGCIVCPTTDSTFDLQTG+IKEWYPKNPVLRVLTPALRKLFIY VKTDE+NIYINMRGNVIS SSAEIVFSG
Subjt:  KDKIFAIENRSPAEGAYTEGLLNAKLTMDGCIVCPTTDSTFDLQTGDIKEWYPKNPVLRVLTPALRKLFIYPVKTDEDNIYINMRGNVISGSSAEIVFSG

Query:  KAQPGVTATDVNVDEVKMVVDEDLEGFGFTGKNEVINGKAAVIGFLLLLDFELLTGKGLLKGTGFLDFIYSVSDAFK
        KAQPGVTATDVNVDEV+MVVDEDLEGFGFTGKNEVINGKAAVIGFLLLLDFELLTGKGLLKGTGFLDFIYSVSDAFK
Subjt:  KAQPGVTATDVNVDEVKMVVDEDLEGFGFTGKNEVINGKAAVIGFLLLLDFELLTGKGLLKGTGFLDFIYSVSDAFK

A0A1S3CBV1 uncharacterized protein LOC103498694; e value 1.3e-141; % identity 93.14
Query:  MAVNPTNLTSHFAPTFRRTHPISTPCTAALPLLKPALRRSLFVSPPSGARKISCKASEISVAEESSASGNWVPVVPLSALPRGERRVIIQGGETILLLWY
        MA+N TN TSHF  TFRR HPIS PCTAALPLLKPAL RSL  SP   ARKISCKASEISVAEESSASGNWVPVVPLSALPRGERRVIIQGGETILLLWY
Subjt:  MAVNPTNLTSHFAPTFRRTHPISTPCTAALPLLKPALRRSLFVSPPSGARKISCKASEISVAEESSASGNWVPVVPLSALPRGERRVIIQGGETILLLWY

Query:  KDKIFAIENRSPAEGAYTEGLLNAKLTMDGCIVCPTTDSTFDLQTGDIKEWYPKNPVLRVLTPALRKLFIYPVKTDEDNIYINMRGNVISGSSAEIVFSG
        KD+IFAIENRSPAEGAYTEGLLNAKLT DGCIVCPTTDSTFDLQTGDIKEWYP NPVLRVLTPALRKLFIYPVKTDE+NIYINMRGNVIS +SAEIVFSG
Subjt:  KDKIFAIENRSPAEGAYTEGLLNAKLTMDGCIVCPTTDSTFDLQTGDIKEWYPKNPVLRVLTPALRKLFIYPVKTDEDNIYINMRGNVISGSSAEIVFSG

Query:  KAQPGVTATDVNVDEVKMVVDEDLEGFGFTGKNEVINGKAAVIGFLLLLDFELLTGKGLLKGTGFLDFIYSVSDAFK
        KAQPGVTATDVNVDEVKMVVDEDLEGFGFTGKNEVINGKAAVIGFLLLLDFELLTGKGLLKGTGFLDFIYSVSDAFK
Subjt:  KAQPGVTATDVNVDEVKMVVDEDLEGFGFTGKNEVINGKAAVIGFLLLLDFELLTGKGLLKGTGFLDFIYSVSDAFK

A0A5D3BMM3 Rieske domain-containing protein; e value 1.3e-141; % identity 93.14
Query:  MAVNPTNLTSHFAPTFRRTHPISTPCTAALPLLKPALRRSLFVSPPSGARKISCKASEISVAEESSASGNWVPVVPLSALPRGERRVIIQGGETILLLWY
        MA+N TN TSHF  TFRR HPIS PCTAALPLLKPAL RSL  SP   ARKISCKASEISVAEESSASGNWVPVVPLSALPRGERRVIIQGGETILLLWY
Subjt:  MAVNPTNLTSHFAPTFRRTHPISTPCTAALPLLKPALRRSLFVSPPSGARKISCKASEISVAEESSASGNWVPVVPLSALPRGERRVIIQGGETILLLWY

Query:  KDKIFAIENRSPAEGAYTEGLLNAKLTMDGCIVCPTTDSTFDLQTGDIKEWYPKNPVLRVLTPALRKLFIYPVKTDEDNIYINMRGNVISGSSAEIVFSG
        KD+IFAIENRSPAEGAYTEGLLNAKLT DGCIVCPTTDSTFDLQTGDIKEWYP NPVLRVLTPALRKLFIYPVKTDE+NIYINMRGNVIS +SAEIVFSG
Subjt:  KDKIFAIENRSPAEGAYTEGLLNAKLTMDGCIVCPTTDSTFDLQTGDIKEWYPKNPVLRVLTPALRKLFIYPVKTDEDNIYINMRGNVISGSSAEIVFSG

Query:  KAQPGVTATDVNVDEVKMVVDEDLEGFGFTGKNEVINGKAAVIGFLLLLDFELLTGKGLLKGTGFLDFIYSVSDAFK
        KAQPGVTATDVNVDEVKMVVDEDLEGFGFTGKNEVINGKAAVIGFLLLLDFELLTGKGLLKGTGFLDFIYSVSDAFK
Subjt:  KAQPGVTATDVNVDEVKMVVDEDLEGFGFTGKNEVINGKAAVIGFLLLLDFELLTGKGLLKGTGFLDFIYSVSDAFK

A0A6J1ECE7 uncharacterized protein LOC111432790; e value 3.9e-143; % identity 92.42
Query:  MAVNPTNLTSHFAPTFRRTHPISTPCTAALPLLKPALRRSLFVSPPSGARKISCKASEISVAEESSASGNWVPVVPLSALPRGERRVIIQGGETILLLWY
        MAVN TNLTSHFA  FRR HPIS+PCTAALP+LKPALRRSLF SPPSG RKI CKASE+SVAEESSASGNWVPVVPLSALP+GERRVIIQGGET+LLLWY
Subjt:  MAVNPTNLTSHFAPTFRRTHPISTPCTAALPLLKPALRRSLFVSPPSGARKISCKASEISVAEESSASGNWVPVVPLSALPRGERRVIIQGGETILLLWY

Query:  KDKIFAIENRSPAEGAYTEGLLNAKLTMDGCIVCPTTDSTFDLQTGDIKEWYPKNPVLRVLTPALRKLFIYPVKTDEDNIYINMRGNVISGSSAEIVFSG
        K++IFAIENRSPAEGAYTEGLLNAKLT DGCIVCPTTDSTFDLQTGDIKEWYP NPVLRVLTPALRKLF+YPVKTDE+NIYINMRGNVIS SSAEIVFSG
Subjt:  KDKIFAIENRSPAEGAYTEGLLNAKLTMDGCIVCPTTDSTFDLQTGDIKEWYPKNPVLRVLTPALRKLFIYPVKTDEDNIYINMRGNVISGSSAEIVFSG

Query:  KAQPGVTATDVNVDEVKMVVDEDLEGFGFTGKNEVINGKAAVIGFLLLLDFELLTGKGLLKGTGFLDFIYSVSDAFK
        KAQPGVTATDVNVDEVKMVVDE+LEGFGFTGKNEVINGKAAVIGFLLLLDFELLTGKGLLKGTGFLDFIYS SDAFK
Subjt:  KAQPGVTATDVNVDEVKMVVDEDLEGFGFTGKNEVINGKAAVIGFLLLLDFELLTGKGLLKGTGFLDFIYSVSDAFK

A0A6J1IA65 uncharacterized protein LOC111472599; e value 1.8e-143; % identity 92.78
Query:  MAVNPTNLTSHFAPTFRRTHPISTPCTAALPLLKPALRRSLFVSPPSGARKISCKASEISVAEESSASGNWVPVVPLSALPRGERRVIIQGGETILLLWY
        MAVN TNLTSHFA  FRRTHPIS+PCTAALP+LKP LRRSLF SPPSG RKI CKASE+SVAEESSASGNWVPVVPLSALP+GERRVIIQGGET+LLLWY
Subjt:  MAVNPTNLTSHFAPTFRRTHPISTPCTAALPLLKPALRRSLFVSPPSGARKISCKASEISVAEESSASGNWVPVVPLSALPRGERRVIIQGGETILLLWY

Query:  KDKIFAIENRSPAEGAYTEGLLNAKLTMDGCIVCPTTDSTFDLQTGDIKEWYPKNPVLRVLTPALRKLFIYPVKTDEDNIYINMRGNVISGSSAEIVFSG
        K+KIFAIENRSPAEGAYTEGLLNAKLT DGCIVCPTTDSTFDLQTGDIKEWYP NPVLRVLTPALRKLF+YPVKTDE+NIYINMRGNVIS SSAEIVFSG
Subjt:  KDKIFAIENRSPAEGAYTEGLLNAKLTMDGCIVCPTTDSTFDLQTGDIKEWYPKNPVLRVLTPALRKLFIYPVKTDEDNIYINMRGNVISGSSAEIVFSG

Query:  KAQPGVTATDVNVDEVKMVVDEDLEGFGFTGKNEVINGKAAVIGFLLLLDFELLTGKGLLKGTGFLDFIYSVSDAFK
        KAQPGVTATDVNVDEVKMVVDE+LEGFGFTGKNEVINGKAAVIGFLLLLDFELLTGKGLLKGTGFLDFIYS SDAFK
Subjt:  KAQPGVTATDVNVDEVKMVVDEDLEGFGFTGKNEVINGKAAVIGFLLLLDFELLTGKGLLKGTGFLDFIYSVSDAFK

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT1G71500.1 Rieske (2Fe-2S) domain-containing protein; e value 5.7e-102; % identity 78.41
Query:  ISCKASEISVAEESSASG-NWVPVVPLSALPRGERRVIIQGGETILLLWYKDKIFAIENRSPAEGAYTEGLLNAKLTMDGCIVCPTTDSTFDLQTGDIKE
        + C+A+E+S +   S  G NWVPVVPLSALP+GERRV+IQ  ETILLLWYK+ +FAIENRSPAEGAY+EGLLNA+LT DGCIVCP+TDSTFDL+TG+I+E
Subjt:  ISCKASEISVAEESSASG-NWVPVVPLSALPRGERRVIIQGGETILLLWYKDKIFAIENRSPAEGAYTEGLLNAKLTMDGCIVCPTTDSTFDLQTGDIKE

Query:  WYPKNPVLRVLTPALRKLFIYPVKTDEDNIYINMRGNVISGSSAEIVFSGKAQPGVTATDVNVDEVKMVVDEDLEGFGFTGKNEVINGKAAVIGFLLLLD
        WYPKNPVLRVLTPALRKLF+YPVK DE+NIYI++R +  + ++AEIVFSGKAQPG+TAT+VNVDEV+M+VDE  EGFGFT KNEVINGKAAVIGFLLLLD
Subjt:  WYPKNPVLRVLTPALRKLFIYPVKTDEDNIYINMRGNVISGSSAEIVFSGKAQPGVTATDVNVDEVKMVVDEDLEGFGFTGKNEVINGKAAVIGFLLLLD

Query:  FELLTGKGLLKGTGFLDFIYSVSDAFK
        FELLTGKGLLKGTGFLDF+YS SDAFK
Subjt:  FELLTGKGLLKGTGFLDFIYSVSDAFK
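
The % identity values can be related to the alignment text. In BLAST-style pairwise output, the middle line repeats the residue where query and subject are identical, shows `+` for a conservative substitution, and is blank for other mismatches. A minimal sketch, assuming that convention applies to the alignments above and that identity is counted over aligned columns; `midline_stats` is a hypothetical helper, and the example uses a short excerpt of one alignment rather than a full hit:

```python
def midline_stats(query, midline):
    """Return (identities, positives, aligned_length) for one alignment block."""
    # Trailing blanks in the midline are often stripped, so pad to query length.
    midline = midline.ljust(len(query))
    identities = sum(1 for c in midline if c.isalpha())
    # Positives include identities plus conservative substitutions ('+').
    positives = identities + midline.count("+")
    return identities, positives, len(query)

# Short excerpt from one of the alignments above (not a full hit).
query = "KAQPGVTATDVNVDEVKMVVDE"
midline = "KAQPGVTATDVNVDEV+MVVDE"
identities, positives, length = midline_stats(query, midline)
print(f"{100 * identities / length:.2f}% identity over {length} columns")
```

To reproduce a reported value, the counts from every block of a hit would be summed before dividing.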


Sequences
CDS sequence
ATGGCCGTCAATCCCACAAATCTCACTTCCCATTTTGCCCCCACATTCCGGCGGACCCACCCGATCTCAACTCCATGTACCGCTGCCCTTCCCCTTCTGAAGCCCGCCCT
CCGCCGCTCCCTTTTTGTCTCTCCTCCTTCCGGTGCCCGGAAAATCTCCTGCAAAGCGTCGGAGATCTCCGTGGCCGAGGAATCGTCGGCGTCTGGTAACTGGGTCCCGG
TGGTTCCATTGTCGGCGCTGCCGAGAGGGGAGCGGCGCGTGATTATTCAGGGCGGTGAGACGATTTTGCTTCTTTGGTATAAGGATAAGATTTTTGCTATTGAGAATCGG
TCGCCTGCTGAAGGTGCTTACACCGAAGGTCTCCTCAATGCCAAGCTAACGATGGATGGCTGTATTGTCTGTCCAACGACAGATAGCACATTTGACCTGCAAACTGGAGA
CATCAAGGAATGGTATCCAAAGAACCCAGTCCTCAGAGTCCTCACACCAGCCTTAAGGAAGCTTTTCATATACCCTGTGAAAACTGATGAAGACAACATCTATATCAACA
TGAGAGGAAACGTAATATCAGGTTCATCTGCTGAGATTGTCTTTAGTGGTAAAGCTCAACCCGGTGTAACCGCAACTGATGTCAATGTCGATGAGGTGAAAATGGTGGTT
GATGAAGATCTTGAGGGGTTTGGCTTCACTGGAAAGAATGAAGTGATAAATGGAAAGGCAGCAGTGATTGGCTTCTTGTTGTTGTTGGATTTTGAGCTCCTAACTGGTAA
GGGTCTTCTAAAGGGAACTGGCTTCTTGGACTTCATTTATTCTGTTTCAGATGCTTTCAAATAG
mRNA sequence
ATGGCCGTCAATCCCACAAATCTCACTTCCCATTTTGCCCCCACATTCCGGCGGACCCACCCGATCTCAACTCCATGTACCGCTGCCCTTCCCCTTCTGAAGCCCGCCCT
CCGCCGCTCCCTTTTTGTCTCTCCTCCTTCCGGTGCCCGGAAAATCTCCTGCAAAGCGTCGGAGATCTCCGTGGCCGAGGAATCGTCGGCGTCTGGTAACTGGGTCCCGG
TGGTTCCATTGTCGGCGCTGCCGAGAGGGGAGCGGCGCGTGATTATTCAGGGCGGTGAGACGATTTTGCTTCTTTGGTATAAGGATAAGATTTTTGCTATTGAGAATCGG
TCGCCTGCTGAAGGTGCTTACACCGAAGGTCTCCTCAATGCCAAGCTAACGATGGATGGCTGTATTGTCTGTCCAACGACAGATAGCACATTTGACCTGCAAACTGGAGA
CATCAAGGAATGGTATCCAAAGAACCCAGTCCTCAGAGTCCTCACACCAGCCTTAAGGAAGCTTTTCATATACCCTGTGAAAACTGATGAAGACAACATCTATATCAACA
TGAGAGGAAACGTAATATCAGGTTCATCTGCTGAGATTGTCTTTAGTGGTAAAGCTCAACCCGGTGTAACCGCAACTGATGTCAATGTCGATGAGGTGAAAATGGTGGTT
GATGAAGATCTTGAGGGGTTTGGCTTCACTGGAAAGAATGAAGTGATAAATGGAAAGGCAGCAGTGATTGGCTTCTTGTTGTTGTTGGATTTTGAGCTCCTAACTGGTAA
GGGTCTTCTAAAGGGAACTGGCTTCTTGGACTTCATTTATTCTGTTTCAGATGCTTTCAAATAG
Protein sequence
MAVNPTNLTSHFAPTFRRTHPISTPCTAALPLLKPALRRSLFVSPPSGARKISCKASEISVAEESSASGNWVPVVPLSALPRGERRVIIQGGETILLLWYKDKIFAIENR
SPAEGAYTEGLLNAKLTMDGCIVCPTTDSTFDLQTGDIKEWYPKNPVLRVLTPALRKLFIYPVKTDEDNIYINMRGNVISGSSAEIVFSGKAQPGVTATDVNVDEVKMVV
DEDLEGFGFTGKNEVINGKAAVIGFLLLLDFELLTGKGLLKGTGFLDFIYSVSDAFK
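
The protein sequence should follow from the CDS by codon translation. A minimal sketch, assuming the standard genetic code (the page does not name a translation table); `translate` and `CODON_TABLE` are hypothetical names, and only the first and last few codons of the CDS are used here:

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds):
    """Translate an in-frame CDS, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[pos:pos + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First eight codons and last five codons of the CDS listed above.
print(translate("ATGGCCGTCAATCCCACAAATCTC"))
print(translate("GATGCTTTCAAATAG"))
```

Passing the full CDS, with line breaks removed, to `translate` allows the result to be compared with the protein sequence listed above.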