; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0005380 (gene) of Snake gourd v1 genome

Gene IDTan0005380
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionHydroxymethylbilane hydrolyase [cyclizing]
Genome locationLG09:67180883..67182299
RNA-Seq ExpressionTan0005380
SyntenyTan0005380
Gene Ontology termsGO:0006782 - protoporphyrinogen IX biosynthetic process (biological process)
GO:0004852 - uroporphyrinogen-III synthase activity (molecular function)
InterPro domainsIPR003754 - Tetrapyrrole biosynthesis, uroporphyrinogen III synthase
IPR036108 - Tetrapyrrole biosynthesis, uroporphyrinogen III synthase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6587847.1 hypothetical protein SDJN03_16412, partial [Cucurbita argyrosperma subsp. sororia]5.0e-15287.46Show/hide
Query:  MSTGAAHPSPI-HLNPVTSSPFPA---NSPPRTAAFTTPQNYAGSLSNLLSLKGFVPVWCPTLTVHPTPLAIKSHLLPPNLHSFSAVAFTSRSGITALLD
        MSTGAAHPSPI HLNP+ SSP PA   NSP RT AFTTPQNYAGSLSNLLSLKGF P+WCPT+TVHPTPLAIKSHLLPPNLH +SAVAFTSRSGITALLD
Subjt:  MSTGAAHPSPI-HLNPVTSSPFPA---NSPPRTAAFTTPQNYAGSLSNLLSLKGFVPVWCPTLTVHPTPLAIKSHLLPPNLHSFSAVAFTSRSGITALLD

Query:  AATEIGEPLLPSHGDAFLIAALGKDSELLDHGFLSKICPNASRIRIVIPEIATPSGLVEALGVGNHRRVLCPVPRVVGLNEPPVVPNFLRDLEANGWVPV
        AATEI EPLL   GD FLIAALGKDSELLDHG LSK CPNASRIR+V+P+IA+PSGLVEALG+GNHRRVLCPVPRVVGLNEPPVVPNFLRDL A+GWVPV
Subjt:  AATEIGEPLLPSHGDAFLIAALGKDSELLDHGFLSKICPNASRIRIVIPEIATPSGLVEALGVGNHRRVLCPVPRVVGLNEPPVVPNFLRDLEANGWVPV

Query:  RVDAYETRWAGPDCARKLMERGEDEKLDAIVFTSTGEVEGLLKSLRDVGLEWEAMRKRWPEMVVAAHGPVTAAGAERLGVKVDLVSSKFDSFNGVVDALH
        RVDAYETRW GP+CARKL ERGEDEKLDAIVFTSTGEVEGLLKSLR +GL+WE MRK+WPEMV+AAHGPVTAAGAERLGVK+DLVSSKFDSFNGVVDALH
Subjt:  RVDAYETRWAGPDCARKLMERGEDEKLDAIVFTSTGEVEGLLKSLRDVGLEWEAMRKRWPEMVVAAHGPVTAAGAERLGVKVDLVSSKFDSFNGVVDALH

Query:  SRWQSLEQQNP
        SRWQSLE QNP
Subjt:  SRWQSLEQQNP

TYK03635.1 Tetrapyrrole biosynthesis, uroporphyrinogen III synthase [Cucumis melo var. makuwa]9.4e-15187.3Show/hide
Query:  MSTGAAHPS-PIHLNPVTSSPFPA---NSPPRTAAFTTPQNYAGSLSNLLSLKGFVPVWCPTLTVHPTPLAIKSHLLPPNLHSFSAVAFTSRSGITALLD
        MSTGAAHP+ P H++P+TSSP PA   +S PRT AFTTPQNYAGSLS+LLSLKGF P+WCPTLTV PTPLAIKSHLLPP LHSFSAVAFTSRSGITALLD
Subjt:  MSTGAAHPS-PIHLNPVTSSPFPA---NSPPRTAAFTTPQNYAGSLSNLLSLKGFVPVWCPTLTVHPTPLAIKSHLLPPNLHSFSAVAFTSRSGITALLD

Query:  AATEIGEPLLPSHGDAFLIAALGKDSELLDHGFLSKICPNASRIRIVIPEIATPSGLVEALGVGNHRRVLCPVPRVVGLNEPPVVPNFLRDLEANGWVPV
        AATEIGEPLLPSHGD FLIAALGKDSELLDH FL+ ICPN SRIR+V+PEIATP+GLVEALGVGNHRRVLCPVPRVVGLNEPPVVPNFLRDLEA GWVPV
Subjt:  AATEIGEPLLPSHGDAFLIAALGKDSELLDHGFLSKICPNASRIRIVIPEIATPSGLVEALGVGNHRRVLCPVPRVVGLNEPPVVPNFLRDLEANGWVPV

Query:  RVDAYETRWAGPDCARKLMERGEDEKLDAIVFTSTGEVEGLLKSLRDVGLEWEAMRKRWPEMVVAAHGPVTAAGAERLGVKVDLVSSKFDSFNGVVDALH
        RVDAYETRWAGPDCARKL+ERG+DEKLDAIVFTSTGEVEGLLKSL  +GLEW+ M+KRWPEMVVAAHGPVTAAGAERLGVKVDLVS KFDSFNGVVD+LH
Subjt:  RVDAYETRWAGPDCARKLMERGEDEKLDAIVFTSTGEVEGLLKSLRDVGLEWEAMRKRWPEMVVAAHGPVTAAGAERLGVKVDLVSSKFDSFNGVVDALH

Query:  SRWQSLE
         RWQSL+
Subjt:  SRWQSLE

XP_022926996.1 uncharacterized protein LOC111433956 [Cucurbita moschata]1.4e-15187.14Show/hide
Query:  MSTGAAHPSPI-HLNPVTSSPFPA---NSPPRTAAFTTPQNYAGSLSNLLSLKGFVPVWCPTLTVHPTPLAIKSHLLPPNLHSFSAVAFTSRSGITALLD
        MSTGAAHPSPI HLNP+ SSP PA   NSP RT AFTTPQNYAGSLSNLLSLKGF P+WCPT+TVHPTPLAIKSHLLPPNLH +SAVAFTSRSGITALLD
Subjt:  MSTGAAHPSPI-HLNPVTSSPFPA---NSPPRTAAFTTPQNYAGSLSNLLSLKGFVPVWCPTLTVHPTPLAIKSHLLPPNLHSFSAVAFTSRSGITALLD

Query:  AATEIGEPLLPSHGDAFLIAALGKDSELLDHGFLSKICPNASRIRIVIPEIATPSGLVEALGVGNHRRVLCPVPRVVGLNEPPVVPNFLRDLEANGWVPV
        AATEI EPLL   GD FLIAALGKDSELLDHG  SK CPNASRIR+V+P+IA+PSGLVEALG+GNHRRVLCPVPRVVGLNEPPVVPNFLRDL A+GWVPV
Subjt:  AATEIGEPLLPSHGDAFLIAALGKDSELLDHGFLSKICPNASRIRIVIPEIATPSGLVEALGVGNHRRVLCPVPRVVGLNEPPVVPNFLRDLEANGWVPV

Query:  RVDAYETRWAGPDCARKLMERGEDEKLDAIVFTSTGEVEGLLKSLRDVGLEWEAMRKRWPEMVVAAHGPVTAAGAERLGVKVDLVSSKFDSFNGVVDALH
        RVDAYETRW GP+CARKL ERGEDEKLDAIVFTSTGEVEGLLKSLR +GL+WE MRK+WPEMV+AAHGPVTAAGAERLGVK+DLVSSKFDSFNGVVDALH
Subjt:  RVDAYETRWAGPDCARKLMERGEDEKLDAIVFTSTGEVEGLLKSLRDVGLEWEAMRKRWPEMVVAAHGPVTAAGAERLGVKVDLVSSKFDSFNGVVDALH

Query:  SRWQSLEQQNP
        SRWQSLE QNP
Subjt:  SRWQSLEQQNP

XP_023003818.1 uncharacterized protein LOC111497286 [Cucurbita maxima]1.4e-15186.77Show/hide
Query:  MSTGAAHPSPIHLNPVTSSPFPA---NSPPRTAAFTTPQNYAGSLSNLLSLKGFVPVWCPTLTVHPTPLAIKSHLLPPNLHSFSAVAFTSRSGITALLDA
        MSTGAAHPS IHLNP+ SSP PA   NSP RT AFTTPQNYAGSLSNLLSLKGF P+WCPT+TVHPTPLAIKSHLLPPNLH +SAVAFTSRSGITALLDA
Subjt:  MSTGAAHPSPIHLNPVTSSPFPA---NSPPRTAAFTTPQNYAGSLSNLLSLKGFVPVWCPTLTVHPTPLAIKSHLLPPNLHSFSAVAFTSRSGITALLDA

Query:  ATEIGEPLLPSHGDAFLIAALGKDSELLDHGFLSKICPNASRIRIVIPEIATPSGLVEALGVGNHRRVLCPVPRVVGLNEPPVVPNFLRDLEANGWVPVR
        ATEI +PLL   GD FLIAALGKDSELLDHG LSK CPN SRIR+V+P+IATPSGLVEALG+GNHRRVLCPVPRVVGLNEPPVVPNFLRDL A+GWVPVR
Subjt:  ATEIGEPLLPSHGDAFLIAALGKDSELLDHGFLSKICPNASRIRIVIPEIATPSGLVEALGVGNHRRVLCPVPRVVGLNEPPVVPNFLRDLEANGWVPVR

Query:  VDAYETRWAGPDCARKLMERGEDEKLDAIVFTSTGEVEGLLKSLRDVGLEWEAMRKRWPEMVVAAHGPVTAAGAERLGVKVDLVSSKFDSFNGVVDALHS
        VDAYETRWAGP+CAR L++RGEDEKLDAIVFTSTGEVEGLLKSLR +GL+WE MRK+WPEMV+AAHGPVTAAGAERLGVK+DLVSSKFDSFNGVVDALHS
Subjt:  VDAYETRWAGPDCARKLMERGEDEKLDAIVFTSTGEVEGLLKSLRDVGLEWEAMRKRWPEMVVAAHGPVTAAGAERLGVKVDLVSSKFDSFNGVVDALHS

Query:  RWQSLEQQNP
        RWQSLE QNP
Subjt:  RWQSLEQQNP

XP_023530963.1 uncharacterized protein LOC111793350 [Cucurbita pepo subsp. pepo]3.2e-15186.5Show/hide
Query:  MSTGAAHPSPI-HLNPVTSSPFPA---NSPPRTAAFTTPQNYAGSLSNLLSLKGFVPVWCPTLTVHPTPLAIKSHLLPPNLHSFSAVAFTSRSGITALLD
        MSTGAAHPSPI HLNP+ SSP PA   NSP RT AFTTPQNYAGSLSNLLSLKGF P+WCPT+TVHPTP+AIKSHL+PPNLH +SAVAFTSRSGITALLD
Subjt:  MSTGAAHPSPI-HLNPVTSSPFPA---NSPPRTAAFTTPQNYAGSLSNLLSLKGFVPVWCPTLTVHPTPLAIKSHLLPPNLHSFSAVAFTSRSGITALLD

Query:  AATEIGEPLLPSHGDAFLIAALGKDSELLDHGFLSKICPNASRIRIVIPEIATPSGLVEALGVGNHRRVLCPVPRVVGLNEPPVVPNFLRDLEANGWVPV
        AATEI EPLL   GD FLIAALGKDSELLDHG LSK CPNA+RIR+V+P+IA+PSGLVEALG+GNHRRVLCPVPRVVGLNEPPVVPNFLRDL A+GWVPV
Subjt:  AATEIGEPLLPSHGDAFLIAALGKDSELLDHGFLSKICPNASRIRIVIPEIATPSGLVEALGVGNHRRVLCPVPRVVGLNEPPVVPNFLRDLEANGWVPV

Query:  RVDAYETRWAGPDCARKLMERGEDEKLDAIVFTSTGEVEGLLKSLRDVGLEWEAMRKRWPEMVVAAHGPVTAAGAERLGVKVDLVSSKFDSFNGVVDALH
        RVDAYETRW GP+CARKL ERGEDEKLDAIVFTSTGEVEGLLKSLR +GL+WE MRK+WPEMV+AAHGPVTAAGAERLGVK+DLVSSKFDSFNGVVDALH
Subjt:  RVDAYETRWAGPDCARKLMERGEDEKLDAIVFTSTGEVEGLLKSLRDVGLEWEAMRKRWPEMVVAAHGPVTAAGAERLGVKVDLVSSKFDSFNGVVDALH

Query:  SRWQSLEQQNP
        SRWQSLE QNP
Subjt:  SRWQSLEQQNP

TrEMBL top hitse value%identityAlignment
A0A1S3BA78 Hydroxymethylbilane hydrolyase [cyclizing]4.5e-15187.3Show/hide
Query:  MSTGAAHPS-PIHLNPVTSSPFPA---NSPPRTAAFTTPQNYAGSLSNLLSLKGFVPVWCPTLTVHPTPLAIKSHLLPPNLHSFSAVAFTSRSGITALLD
        MSTGAAHP+ P H++P+TSSP PA   +S PRT AFTTPQNYAGSLS+LLSLKGF P+WCPTLTV PTPLAIKSHLLPP LHSFSAVAFTSRSGITALLD
Subjt:  MSTGAAHPS-PIHLNPVTSSPFPA---NSPPRTAAFTTPQNYAGSLSNLLSLKGFVPVWCPTLTVHPTPLAIKSHLLPPNLHSFSAVAFTSRSGITALLD

Query:  AATEIGEPLLPSHGDAFLIAALGKDSELLDHGFLSKICPNASRIRIVIPEIATPSGLVEALGVGNHRRVLCPVPRVVGLNEPPVVPNFLRDLEANGWVPV
        AATEIGEPLLPSHGD FLIAALGKDSELLDH FL+ ICPN SRIR+V+PEIATP+GLVEALGVGNHRRVLCPVPRVVGLNEPPVVPNFLRDLEA GWVPV
Subjt:  AATEIGEPLLPSHGDAFLIAALGKDSELLDHGFLSKICPNASRIRIVIPEIATPSGLVEALGVGNHRRVLCPVPRVVGLNEPPVVPNFLRDLEANGWVPV

Query:  RVDAYETRWAGPDCARKLMERGEDEKLDAIVFTSTGEVEGLLKSLRDVGLEWEAMRKRWPEMVVAAHGPVTAAGAERLGVKVDLVSSKFDSFNGVVDALH
        RVDAYETRWAGPDCARKL+ERG+DEKLDAIVFTSTGEVEGLLKSL  +GLEW+ M+KRWPEMVVAAHGPVTAAGAERLGVKVDLVS KFDSFNGVVD+LH
Subjt:  RVDAYETRWAGPDCARKLMERGEDEKLDAIVFTSTGEVEGLLKSLRDVGLEWEAMRKRWPEMVVAAHGPVTAAGAERLGVKVDLVSSKFDSFNGVVDALH

Query:  SRWQSLE
         RWQSL+
Subjt:  SRWQSLE

A0A5A7U8V3 Hydroxymethylbilane hydrolyase [cyclizing]4.5e-15187.3Show/hide
Query:  MSTGAAHPS-PIHLNPVTSSPFPA---NSPPRTAAFTTPQNYAGSLSNLLSLKGFVPVWCPTLTVHPTPLAIKSHLLPPNLHSFSAVAFTSRSGITALLD
        MSTGAAHP+ P H++P+TSSP PA   +S PRT AFTTPQNYAGSLS+LLSLKGF P+WCPTLTV PTPLAIKSHLLPP LHSFSAVAFTSRSGITALLD
Subjt:  MSTGAAHPS-PIHLNPVTSSPFPA---NSPPRTAAFTTPQNYAGSLSNLLSLKGFVPVWCPTLTVHPTPLAIKSHLLPPNLHSFSAVAFTSRSGITALLD

Query:  AATEIGEPLLPSHGDAFLIAALGKDSELLDHGFLSKICPNASRIRIVIPEIATPSGLVEALGVGNHRRVLCPVPRVVGLNEPPVVPNFLRDLEANGWVPV
        AATEIGEPLLPSHGD FLIAALGKDSELLDH FL+ ICPN SRIR+V+PEIATP+GLVEALGVGNHRRVLCPVPRVVGLNEPPVVPNFLRDLEA GWVPV
Subjt:  AATEIGEPLLPSHGDAFLIAALGKDSELLDHGFLSKICPNASRIRIVIPEIATPSGLVEALGVGNHRRVLCPVPRVVGLNEPPVVPNFLRDLEANGWVPV

Query:  RVDAYETRWAGPDCARKLMERGEDEKLDAIVFTSTGEVEGLLKSLRDVGLEWEAMRKRWPEMVVAAHGPVTAAGAERLGVKVDLVSSKFDSFNGVVDALH
        RVDAYETRWAGPDCARKL+ERG+DEKLDAIVFTSTGEVEGLLKSL  +GLEW+ M+KRWPEMVVAAHGPVTAAGAERLGVKVDLVS KFDSFNGVVD+LH
Subjt:  RVDAYETRWAGPDCARKLMERGEDEKLDAIVFTSTGEVEGLLKSLRDVGLEWEAMRKRWPEMVVAAHGPVTAAGAERLGVKVDLVSSKFDSFNGVVDALH

Query:  SRWQSLE
         RWQSL+
Subjt:  SRWQSLE

A0A5D3BX13 Hydroxymethylbilane hydrolyase [cyclizing]4.5e-15187.3Show/hide
Query:  MSTGAAHPS-PIHLNPVTSSPFPA---NSPPRTAAFTTPQNYAGSLSNLLSLKGFVPVWCPTLTVHPTPLAIKSHLLPPNLHSFSAVAFTSRSGITALLD
        MSTGAAHP+ P H++P+TSSP PA   +S PRT AFTTPQNYAGSLS+LLSLKGF P+WCPTLTV PTPLAIKSHLLPP LHSFSAVAFTSRSGITALLD
Subjt:  MSTGAAHPS-PIHLNPVTSSPFPA---NSPPRTAAFTTPQNYAGSLSNLLSLKGFVPVWCPTLTVHPTPLAIKSHLLPPNLHSFSAVAFTSRSGITALLD

Query:  AATEIGEPLLPSHGDAFLIAALGKDSELLDHGFLSKICPNASRIRIVIPEIATPSGLVEALGVGNHRRVLCPVPRVVGLNEPPVVPNFLRDLEANGWVPV
        AATEIGEPLLPSHGD FLIAALGKDSELLDH FL+ ICPN SRIR+V+PEIATP+GLVEALGVGNHRRVLCPVPRVVGLNEPPVVPNFLRDLEA GWVPV
Subjt:  AATEIGEPLLPSHGDAFLIAALGKDSELLDHGFLSKICPNASRIRIVIPEIATPSGLVEALGVGNHRRVLCPVPRVVGLNEPPVVPNFLRDLEANGWVPV

Query:  RVDAYETRWAGPDCARKLMERGEDEKLDAIVFTSTGEVEGLLKSLRDVGLEWEAMRKRWPEMVVAAHGPVTAAGAERLGVKVDLVSSKFDSFNGVVDALH
        RVDAYETRWAGPDCARKL+ERG+DEKLDAIVFTSTGEVEGLLKSL  +GLEW+ M+KRWPEMVVAAHGPVTAAGAERLGVKVDLVS KFDSFNGVVD+LH
Subjt:  RVDAYETRWAGPDCARKLMERGEDEKLDAIVFTSTGEVEGLLKSLRDVGLEWEAMRKRWPEMVVAAHGPVTAAGAERLGVKVDLVSSKFDSFNGVVDALH

Query:  SRWQSLE
         RWQSL+
Subjt:  SRWQSLE

A0A6J1EGG3 Hydroxymethylbilane hydrolyase [cyclizing]7.0e-15287.14Show/hide
Query:  MSTGAAHPSPI-HLNPVTSSPFPA---NSPPRTAAFTTPQNYAGSLSNLLSLKGFVPVWCPTLTVHPTPLAIKSHLLPPNLHSFSAVAFTSRSGITALLD
        MSTGAAHPSPI HLNP+ SSP PA   NSP RT AFTTPQNYAGSLSNLLSLKGF P+WCPT+TVHPTPLAIKSHLLPPNLH +SAVAFTSRSGITALLD
Subjt:  MSTGAAHPSPI-HLNPVTSSPFPA---NSPPRTAAFTTPQNYAGSLSNLLSLKGFVPVWCPTLTVHPTPLAIKSHLLPPNLHSFSAVAFTSRSGITALLD

Query:  AATEIGEPLLPSHGDAFLIAALGKDSELLDHGFLSKICPNASRIRIVIPEIATPSGLVEALGVGNHRRVLCPVPRVVGLNEPPVVPNFLRDLEANGWVPV
        AATEI EPLL   GD FLIAALGKDSELLDHG  SK CPNASRIR+V+P+IA+PSGLVEALG+GNHRRVLCPVPRVVGLNEPPVVPNFLRDL A+GWVPV
Subjt:  AATEIGEPLLPSHGDAFLIAALGKDSELLDHGFLSKICPNASRIRIVIPEIATPSGLVEALGVGNHRRVLCPVPRVVGLNEPPVVPNFLRDLEANGWVPV

Query:  RVDAYETRWAGPDCARKLMERGEDEKLDAIVFTSTGEVEGLLKSLRDVGLEWEAMRKRWPEMVVAAHGPVTAAGAERLGVKVDLVSSKFDSFNGVVDALH
        RVDAYETRW GP+CARKL ERGEDEKLDAIVFTSTGEVEGLLKSLR +GL+WE MRK+WPEMV+AAHGPVTAAGAERLGVK+DLVSSKFDSFNGVVDALH
Subjt:  RVDAYETRWAGPDCARKLMERGEDEKLDAIVFTSTGEVEGLLKSLRDVGLEWEAMRKRWPEMVVAAHGPVTAAGAERLGVKVDLVSSKFDSFNGVVDALH

Query:  SRWQSLEQQNP
        SRWQSLE QNP
Subjt:  SRWQSLEQQNP

A0A6J1KXQ2 Hydroxymethylbilane hydrolyase [cyclizing]7.0e-15286.77Show/hide
Query:  MSTGAAHPSPIHLNPVTSSPFPA---NSPPRTAAFTTPQNYAGSLSNLLSLKGFVPVWCPTLTVHPTPLAIKSHLLPPNLHSFSAVAFTSRSGITALLDA
        MSTGAAHPS IHLNP+ SSP PA   NSP RT AFTTPQNYAGSLSNLLSLKGF P+WCPT+TVHPTPLAIKSHLLPPNLH +SAVAFTSRSGITALLDA
Subjt:  MSTGAAHPSPIHLNPVTSSPFPA---NSPPRTAAFTTPQNYAGSLSNLLSLKGFVPVWCPTLTVHPTPLAIKSHLLPPNLHSFSAVAFTSRSGITALLDA

Query:  ATEIGEPLLPSHGDAFLIAALGKDSELLDHGFLSKICPNASRIRIVIPEIATPSGLVEALGVGNHRRVLCPVPRVVGLNEPPVVPNFLRDLEANGWVPVR
        ATEI +PLL   GD FLIAALGKDSELLDHG LSK CPN SRIR+V+P+IATPSGLVEALG+GNHRRVLCPVPRVVGLNEPPVVPNFLRDL A+GWVPVR
Subjt:  ATEIGEPLLPSHGDAFLIAALGKDSELLDHGFLSKICPNASRIRIVIPEIATPSGLVEALGVGNHRRVLCPVPRVVGLNEPPVVPNFLRDLEANGWVPVR

Query:  VDAYETRWAGPDCARKLMERGEDEKLDAIVFTSTGEVEGLLKSLRDVGLEWEAMRKRWPEMVVAAHGPVTAAGAERLGVKVDLVSSKFDSFNGVVDALHS
        VDAYETRWAGP+CAR L++RGEDEKLDAIVFTSTGEVEGLLKSLR +GL+WE MRK+WPEMV+AAHGPVTAAGAERLGVK+DLVSSKFDSFNGVVDALHS
Subjt:  VDAYETRWAGPDCARKLMERGEDEKLDAIVFTSTGEVEGLLKSLRDVGLEWEAMRKRWPEMVVAAHGPVTAAGAERLGVKVDLVSSKFDSFNGVVDALHS

Query:  RWQSLEQQNP
        RWQSLE QNP
Subjt:  RWQSLEQQNP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAAAACGGTGACTGCGTGATTTCCAAAGAACAAAAAATGAGCACCGGCGCCGCCCACCCCAGTCCAATCCACCTAAATCCAGTTACTTCGTCTCCTTTTCCCGCCAA
CTCCCCTCCGCGTACGGCGGCGTTCACGACGCCTCAGAACTACGCTGGCAGCCTCTCCAATCTCCTCTCTCTCAAAGGCTTCGTCCCCGTCTGGTGCCCCACTCTTACCG
TCCACCCCACTCCCCTCGCCATCAAATCCCATCTCCTTCCCCCAAATCTCCATTCCTTCTCCGCCGTCGCTTTCACCTCCCGCTCCGGCATCACAGCCCTCCTCGACGCC
GCTACTGAAATCGGCGAGCCCTTGCTGCCGTCGCACGGCGACGCTTTTCTAATCGCAGCCCTAGGTAAGGACTCCGAGCTTCTCGATCATGGATTTCTTTCCAAAATTTG
CCCTAACGCGAGTCGAATTAGAATCGTGATACCTGAAATTGCAACGCCGAGTGGTCTAGTGGAGGCTCTTGGAGTTGGAAACCACCGTAGGGTTCTGTGTCCGGTTCCTC
GCGTCGTGGGGCTGAACGAGCCTCCGGTGGTTCCGAACTTCCTCCGCGACCTGGAGGCGAACGGCTGGGTTCCGGTTCGTGTCGATGCGTACGAGACCCGATGGGCCGGA
CCGGATTGCGCGAGGAAGCTGATGGAGAGAGGGGAGGATGAGAAATTGGATGCCATTGTGTTTACGAGTACTGGGGAAGTGGAGGGGCTGCTTAAAAGCTTGAGAGATGT
GGGATTGGAGTGGGAGGCGATGAGAAAGAGGTGGCCGGAAATGGTGGTGGCCGCGCACGGGCCGGTGACGGCGGCGGGAGCTGAGAGGCTCGGCGTTAAGGTTGATTTGG
TGAGTTCAAAATTTGATAGCTTCAATGGCGTGGTTGATGCTCTTCATTCGAGATGGCAGAGCTTAGAACAACAGAACCCTGCGTAA
mRNA sequenceShow/hide mRNA sequence
AAACATCTTAAATCACCTCATCATTGCCACTATTTTTTTTATCTAACATTCTAATAAATTTCAAAATAGTTTTAAAAAATTCAAAAGGCAATATTATAACTTTGGAAAGG
GCAAACTTATTCTAATAATCTGGCAAAACTCAAATGGTCGGGAAAAGGGTGGTTCAGTTTCATGAAAAACGGTGACTGCGTGATTTCCAAAGAACAAAAAATGAGCACCG
GCGCCGCCCACCCCAGTCCAATCCACCTAAATCCAGTTACTTCGTCTCCTTTTCCCGCCAACTCCCCTCCGCGTACGGCGGCGTTCACGACGCCTCAGAACTACGCTGGC
AGCCTCTCCAATCTCCTCTCTCTCAAAGGCTTCGTCCCCGTCTGGTGCCCCACTCTTACCGTCCACCCCACTCCCCTCGCCATCAAATCCCATCTCCTTCCCCCAAATCT
CCATTCCTTCTCCGCCGTCGCTTTCACCTCCCGCTCCGGCATCACAGCCCTCCTCGACGCCGCTACTGAAATCGGCGAGCCCTTGCTGCCGTCGCACGGCGACGCTTTTC
TAATCGCAGCCCTAGGTAAGGACTCCGAGCTTCTCGATCATGGATTTCTTTCCAAAATTTGCCCTAACGCGAGTCGAATTAGAATCGTGATACCTGAAATTGCAACGCCG
AGTGGTCTAGTGGAGGCTCTTGGAGTTGGAAACCACCGTAGGGTTCTGTGTCCGGTTCCTCGCGTCGTGGGGCTGAACGAGCCTCCGGTGGTTCCGAACTTCCTCCGCGA
CCTGGAGGCGAACGGCTGGGTTCCGGTTCGTGTCGATGCGTACGAGACCCGATGGGCCGGACCGGATTGCGCGAGGAAGCTGATGGAGAGAGGGGAGGATGAGAAATTGG
ATGCCATTGTGTTTACGAGTACTGGGGAAGTGGAGGGGCTGCTTAAAAGCTTGAGAGATGTGGGATTGGAGTGGGAGGCGATGAGAAAGAGGTGGCCGGAAATGGTGGTG
GCCGCGCACGGGCCGGTGACGGCGGCGGGAGCTGAGAGGCTCGGCGTTAAGGTTGATTTGGTGAGTTCAAAATTTGATAGCTTCAATGGCGTGGTTGATGCTCTTCATTC
GAGATGGCAGAGCTTAGAACAACAGAACCCTGCGTAAGTGCAAAACCATTCCAAAATTTTGACAGTGACGTTGATCGCCATTTTATATTCAAAATCTTCAAGCAGAATCA
TCTTCTTCAGTTGGAATCGTTTCATCTTTCATGTTTTTTGCTTTGAAGCAGAATCATAAATATCAGTACATTCAAATCTTCAGCCCTTTGTTGCCTTATTGATTGTTTTG
TATCGGTTTCTTCTCTTGGCTCTGCCATTTCATCAATAAAATTCACACTAATACGAATTTAGAGATATTAAAATTGGAAAAACCTAAGTCTACGAGT
Protein sequenceShow/hide protein sequence
MKNGDCVISKEQKMSTGAAHPSPIHLNPVTSSPFPANSPPRTAAFTTPQNYAGSLSNLLSLKGFVPVWCPTLTVHPTPLAIKSHLLPPNLHSFSAVAFTSRSGITALLDA
ATEIGEPLLPSHGDAFLIAALGKDSELLDHGFLSKICPNASRIRIVIPEIATPSGLVEALGVGNHRRVLCPVPRVVGLNEPPVVPNFLRDLEANGWVPVRVDAYETRWAG
PDCARKLMERGEDEKLDAIVFTSTGEVEGLLKSLRDVGLEWEAMRKRWPEMVVAAHGPVTAAGAERLGVKVDLVSSKFDSFNGVVDALHSRWQSLEQQNPA