; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS002367 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS002367
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionHydroxymethylbilane hydrolyase [cyclizing]
Genome locationscaffold30:4433035..4433958
RNA-Seq ExpressionMS002367
SyntenyMS002367
Gene Ontology termsGO:0006782 - protoporphyrinogen IX biosynthetic process (biological process)
GO:0004852 - uroporphyrinogen-III synthase activity (molecular function)
InterPro domainsIPR003754 - Tetrapyrrole biosynthesis, uroporphyrinogen III synthase
IPR036108 - Tetrapyrrole biosynthesis, uroporphyrinogen III synthase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6587847.1 hypothetical protein SDJN03_16412, partial [Cucurbita argyrosperma subsp. sororia]3.4e-14282.26Show/hide
Query:  MSTGAGPPGPIRQ-NPLGSSPSPAIPGHSIRRTAAFTTPQNYAGSLSRLLTLKGMDPLWCPTLTVQPTPHAIKSHILPPNLESFSAVAFTSRTGISALLD
        MSTGA  P PI   NPL SSPSPAIP +S  RT AFTTPQNYAGSLS LL+LKG DPLWCPT+TV PTP AIKSH+LPPNL  +SAVAFTSR+GI+ALLD
Subjt:  MSTGAGPPGPIRQ-NPLGSSPSPAIPGHSIRRTAAFTTPQNYAGSLSRLLTLKGMDPLWCPTLTVQPTPHAIKSHILPPNLESFSAVAFTSRTGISALLD

Query:  AATEIGEPLLPSQGETFLIAALGKDSELLDHGFISQICPNASRIQVVLPKIATPSGLVEALGVGNRRRVLCPVPRVVGLDEPPVVPNFLRDLEANGWVPV
        AATEI EPLL  QG+TFLIAALGKDSELLDHG +S+ CPNASRI+VV+PKIA+PSGLVEALG+GN RRVLCPVPRVVGL+EPPVVPNFLRDL A+GWVPV
Subjt:  AATEIGEPLLPSQGETFLIAALGKDSELLDHGFISQICPNASRIQVVLPKIATPSGLVEALGVGNRRRVLCPVPRVVGLDEPPVVPNFLRDLEANGWVPV

Query:  RVDAYETRWAGPGCARQLVER--DEKLDAIVFTSTGEVEGLLKSLRDLGFEWETMRKRWPEMVVAAHGPVTAAGAERLGVKVDLVSCRFDSFNGVVDALH
        RVDAYETRW GP CAR+L ER  DEKLDAIVFTSTGEVEGLLKSLR LG +WE MRK+WPEMV+AAHGPVTAAGAERLGVK+DLVS +FDSFNGVVDALH
Subjt:  RVDAYETRWAGPGCARQLVER--DEKLDAIVFTSTGEVEGLLKSLRDLGFEWETMRKRWPEMVVAAHGPVTAAGAERLGVKVDLVSCRFDSFNGVVDALH

Query:  SRWQSLEQNP
        SRWQSLEQNP
Subjt:  SRWQSLEQNP

XP_022134809.1 uncharacterized protein LOC111006990 [Momordica charantia]9.2e-17298.05Show/hide
Query:  MSTGAGPPGPIRQNPLGSSPSPAIPGHSIRRTAAFTTPQNYAGSLSRLLTLKGMDPLWCPTLTVQPTPHAIKSHILPPNLESFSAVAFTSRTGISALLDA
        MST AG PGPIRQNPLGSSPS AIPGHSIRRTAAFTTPQNYAGSLSRLLTLKGMDPLWCPTLTVQPTPHAIKSHILPPNLESFSAVAFTSR GISALLDA
Subjt:  MSTGAGPPGPIRQNPLGSSPSPAIPGHSIRRTAAFTTPQNYAGSLSRLLTLKGMDPLWCPTLTVQPTPHAIKSHILPPNLESFSAVAFTSRTGISALLDA

Query:  ATEIGEPLLPSQGETFLIAALGKDSELLDHGFISQICPNASRIQVVLPKIATPSGLVEALGVGNRRRVLCPVPRVVGLDEPPVVPNFLRDLEANGWVPVR
        ATEIGEPLLPSQGETFLIAALGKDSELLDHGFISQICPNASRIQVVLPKIATPSGLVEALGVGNRRRVLCPVPRVVGL+EPPVVPNFL DLEANGWVPVR
Subjt:  ATEIGEPLLPSQGETFLIAALGKDSELLDHGFISQICPNASRIQVVLPKIATPSGLVEALGVGNRRRVLCPVPRVVGLDEPPVVPNFLRDLEANGWVPVR

Query:  VDAYETRWAGPGCARQLVERDEKLDAIVFTSTGEVEGLLKSLRDLGFEWETMRKRWPEMVVAAHGPVTAAGAERLGVKVDLVSCRFDSFNGVVDALHSRW
        VDAYETRWAGPGCARQLVERDEKLDAIVFTSTGEVEGLLKSLRDLGFEWETMRKRWPEMVVAAHGPVTAAGAERLGVKVDLVSCRFDSFNGVVDALHSRW
Subjt:  VDAYETRWAGPGCARQLVERDEKLDAIVFTSTGEVEGLLKSLRDLGFEWETMRKRWPEMVVAAHGPVTAAGAERLGVKVDLVSCRFDSFNGVVDALHSRW

Query:  QSLEQNPN
        QSLEQNPN
Subjt:  QSLEQNPN

XP_022926996.1 uncharacterized protein LOC111433956 [Cucurbita moschata]1.3e-14182.26Show/hide
Query:  MSTGAGPPGPIRQ-NPLGSSPSPAIPGHSIRRTAAFTTPQNYAGSLSRLLTLKGMDPLWCPTLTVQPTPHAIKSHILPPNLESFSAVAFTSRTGISALLD
        MSTGA  P PIR  NPL SSPSPAI  +S  RT AFTTPQNYAGSLS LL+LKG DPLWCPT+TV PTP AIKSH+LPPNL  +SAVAFTSR+GI+ALLD
Subjt:  MSTGAGPPGPIRQ-NPLGSSPSPAIPGHSIRRTAAFTTPQNYAGSLSRLLTLKGMDPLWCPTLTVQPTPHAIKSHILPPNLESFSAVAFTSRTGISALLD

Query:  AATEIGEPLLPSQGETFLIAALGKDSELLDHGFISQICPNASRIQVVLPKIATPSGLVEALGVGNRRRVLCPVPRVVGLDEPPVVPNFLRDLEANGWVPV
        AATEI EPLL  QG+TFLIAALGKDSELLDHG  S+ CPNASRI+VV+PKIA+PSGLVEALG+GN RRVLCPVPRVVGL+EPPVVPNFLRDL A+GWVPV
Subjt:  AATEIGEPLLPSQGETFLIAALGKDSELLDHGFISQICPNASRIQVVLPKIATPSGLVEALGVGNRRRVLCPVPRVVGLDEPPVVPNFLRDLEANGWVPV

Query:  RVDAYETRWAGPGCARQLVER--DEKLDAIVFTSTGEVEGLLKSLRDLGFEWETMRKRWPEMVVAAHGPVTAAGAERLGVKVDLVSCRFDSFNGVVDALH
        RVDAYETRW GP CAR+L ER  DEKLDAIVFTSTGEVEGLLKSLR LG +WE MRK+WPEMV+AAHGPVTAAGAERLGVK+DLVS +FDSFNGVVDALH
Subjt:  RVDAYETRWAGPGCARQLVER--DEKLDAIVFTSTGEVEGLLKSLRDLGFEWETMRKRWPEMVVAAHGPVTAAGAERLGVKVDLVSCRFDSFNGVVDALH

Query:  SRWQSLEQNP
        SRWQSLEQNP
Subjt:  SRWQSLEQNP

XP_023003818.1 uncharacterized protein LOC111497286 [Cucurbita maxima]2.6e-14282.2Show/hide
Query:  MSTGAGPPGPIRQNPLGSSPSPAIPGHSIRRTAAFTTPQNYAGSLSRLLTLKGMDPLWCPTLTVQPTPHAIKSHILPPNLESFSAVAFTSRTGISALLDA
        MSTGA  P  I  NPL SSPSPAIP +S  RT AFTTPQNYAGSLS LL+LKG DPLWCPT+TV PTP AIKSH+LPPNL  +SAVAFTSR+GI+ALLDA
Subjt:  MSTGAGPPGPIRQNPLGSSPSPAIPGHSIRRTAAFTTPQNYAGSLSRLLTLKGMDPLWCPTLTVQPTPHAIKSHILPPNLESFSAVAFTSRTGISALLDA

Query:  ATEIGEPLLPSQGETFLIAALGKDSELLDHGFISQICPNASRIQVVLPKIATPSGLVEALGVGNRRRVLCPVPRVVGLDEPPVVPNFLRDLEANGWVPVR
        ATEI +PLL  QG+TFLIAALGKDSELLDHG +S+ CPN SRI+VV+PKIATPSGLVEALG+GN RRVLCPVPRVVGL+EPPVVPNFLRDL A+GWVPVR
Subjt:  ATEIGEPLLPSQGETFLIAALGKDSELLDHGFISQICPNASRIQVVLPKIATPSGLVEALGVGNRRRVLCPVPRVVGLDEPPVVPNFLRDLEANGWVPVR

Query:  VDAYETRWAGPGCARQLVER--DEKLDAIVFTSTGEVEGLLKSLRDLGFEWETMRKRWPEMVVAAHGPVTAAGAERLGVKVDLVSCRFDSFNGVVDALHS
        VDAYETRWAGP CAR LV+R  DEKLDAIVFTSTGEVEGLLKSLR LG +WE MRK+WPEMV+AAHGPVTAAGAERLGVK+DLVS +FDSFNGVVDALHS
Subjt:  VDAYETRWAGPGCARQLVER--DEKLDAIVFTSTGEVEGLLKSLRDLGFEWETMRKRWPEMVVAAHGPVTAAGAERLGVKVDLVSCRFDSFNGVVDALHS

Query:  RWQSLEQNP
        RWQSLEQNP
Subjt:  RWQSLEQNP

XP_023530963.1 uncharacterized protein LOC111793350 [Cucurbita pepo subsp. pepo]1.3e-14181.61Show/hide
Query:  MSTGAGPPGPIRQ-NPLGSSPSPAIPGHSIRRTAAFTTPQNYAGSLSRLLTLKGMDPLWCPTLTVQPTPHAIKSHILPPNLESFSAVAFTSRTGISALLD
        MSTGA  P PI   NPL SSPSPAIP +S  RT AFTTPQNYAGSLS LL+LKG DPLWCPT+TV PTP AIKSH++PPNL  +SAVAFTSR+GI+ALLD
Subjt:  MSTGAGPPGPIRQ-NPLGSSPSPAIPGHSIRRTAAFTTPQNYAGSLSRLLTLKGMDPLWCPTLTVQPTPHAIKSHILPPNLESFSAVAFTSRTGISALLD

Query:  AATEIGEPLLPSQGETFLIAALGKDSELLDHGFISQICPNASRIQVVLPKIATPSGLVEALGVGNRRRVLCPVPRVVGLDEPPVVPNFLRDLEANGWVPV
        AATEI EPLL  QG+TFLIAALGKDSELLDHG +S+ CPNA+RI+VV+PKIA+PSGLVEALG+GN RRVLCPVPRVVGL+EPPVVPNFLRDL A+GWVPV
Subjt:  AATEIGEPLLPSQGETFLIAALGKDSELLDHGFISQICPNASRIQVVLPKIATPSGLVEALGVGNRRRVLCPVPRVVGLDEPPVVPNFLRDLEANGWVPV

Query:  RVDAYETRWAGPGCARQLVER--DEKLDAIVFTSTGEVEGLLKSLRDLGFEWETMRKRWPEMVVAAHGPVTAAGAERLGVKVDLVSCRFDSFNGVVDALH
        RVDAYETRW GP CAR+L ER  DEKLDAIVFTSTGEVEGLLKSLR LG +WE MRK+WPEMV+AAHGPVTAAGAERLGVK+DLVS +FDSFNGVVDALH
Subjt:  RVDAYETRWAGPGCARQLVER--DEKLDAIVFTSTGEVEGLLKSLRDLGFEWETMRKRWPEMVVAAHGPVTAAGAERLGVKVDLVSCRFDSFNGVVDALH

Query:  SRWQSLEQNP
        SRWQSLEQNP
Subjt:  SRWQSLEQNP

TrEMBL top hitse value%identityAlignment
A0A5A7U8V3 Hydroxymethylbilane hydrolyase [cyclizing]1.4e-14183.39Show/hide
Query:  MSTGAG-PPGPIRQNPLGSSPSPAIPGHSIRRTAAFTTPQNYAGSLSRLLTLKGMDPLWCPTLTVQPTPHAIKSHILPPNLESFSAVAFTSRTGISALLD
        MSTGA  P GP   +PL SSPSPAIP HS  RT AFTTPQNYAGSLS LL+LKG +PLWCPTLTVQPTP AIKSH+LPP L SFSAVAFTSR+GI+ALLD
Subjt:  MSTGAG-PPGPIRQNPLGSSPSPAIPGHSIRRTAAFTTPQNYAGSLSRLLTLKGMDPLWCPTLTVQPTPHAIKSHILPPNLESFSAVAFTSRTGISALLD

Query:  AATEIGEPLLPSQGETFLIAALGKDSELLDHGFISQICPNASRIQVVLPKIATPSGLVEALGVGNRRRVLCPVPRVVGLDEPPVVPNFLRDLEANGWVPV
        AATEIGEPLLPS G+TFLIAALGKDSELLDH F++ ICPN SRI+VV+P+IATP+GLVEALGVGN RRVLCPVPRVVGL+EPPVVPNFLRDLEA GWVPV
Subjt:  AATEIGEPLLPSQGETFLIAALGKDSELLDHGFISQICPNASRIQVVLPKIATPSGLVEALGVGNRRRVLCPVPRVVGLDEPPVVPNFLRDLEANGWVPV

Query:  RVDAYETRWAGPGCARQLVER--DEKLDAIVFTSTGEVEGLLKSLRDLGFEWETMRKRWPEMVVAAHGPVTAAGAERLGVKVDLVSCRFDSFNGVVDALH
        RVDAYETRWAGP CAR+LVER  DEKLDAIVFTSTGEVEGLLKSL  LG EW+ M+KRWPEMVVAAHGPVTAAGAERLGVKVDLVS +FDSFNGVVD+LH
Subjt:  RVDAYETRWAGPGCARQLVER--DEKLDAIVFTSTGEVEGLLKSLRDLGFEWETMRKRWPEMVVAAHGPVTAAGAERLGVKVDLVSCRFDSFNGVVDALH

Query:  SRWQSLE
         RWQSL+
Subjt:  SRWQSLE

A0A5D3BX13 Hydroxymethylbilane hydrolyase [cyclizing]1.4e-14183.39Show/hide
Query:  MSTGAG-PPGPIRQNPLGSSPSPAIPGHSIRRTAAFTTPQNYAGSLSRLLTLKGMDPLWCPTLTVQPTPHAIKSHILPPNLESFSAVAFTSRTGISALLD
        MSTGA  P GP   +PL SSPSPAIP HS  RT AFTTPQNYAGSLS LL+LKG +PLWCPTLTVQPTP AIKSH+LPP L SFSAVAFTSR+GI+ALLD
Subjt:  MSTGAG-PPGPIRQNPLGSSPSPAIPGHSIRRTAAFTTPQNYAGSLSRLLTLKGMDPLWCPTLTVQPTPHAIKSHILPPNLESFSAVAFTSRTGISALLD

Query:  AATEIGEPLLPSQGETFLIAALGKDSELLDHGFISQICPNASRIQVVLPKIATPSGLVEALGVGNRRRVLCPVPRVVGLDEPPVVPNFLRDLEANGWVPV
        AATEIGEPLLPS G+TFLIAALGKDSELLDH F++ ICPN SRI+VV+P+IATP+GLVEALGVGN RRVLCPVPRVVGL+EPPVVPNFLRDLEA GWVPV
Subjt:  AATEIGEPLLPSQGETFLIAALGKDSELLDHGFISQICPNASRIQVVLPKIATPSGLVEALGVGNRRRVLCPVPRVVGLDEPPVVPNFLRDLEANGWVPV

Query:  RVDAYETRWAGPGCARQLVER--DEKLDAIVFTSTGEVEGLLKSLRDLGFEWETMRKRWPEMVVAAHGPVTAAGAERLGVKVDLVSCRFDSFNGVVDALH
        RVDAYETRWAGP CAR+LVER  DEKLDAIVFTSTGEVEGLLKSL  LG EW+ M+KRWPEMVVAAHGPVTAAGAERLGVKVDLVS +FDSFNGVVD+LH
Subjt:  RVDAYETRWAGPGCARQLVER--DEKLDAIVFTSTGEVEGLLKSLRDLGFEWETMRKRWPEMVVAAHGPVTAAGAERLGVKVDLVSCRFDSFNGVVDALH

Query:  SRWQSLE
         RWQSL+
Subjt:  SRWQSLE

A0A6J1BZU3 Hydroxymethylbilane hydrolyase [cyclizing]4.5e-17298.05Show/hide
Query:  MSTGAGPPGPIRQNPLGSSPSPAIPGHSIRRTAAFTTPQNYAGSLSRLLTLKGMDPLWCPTLTVQPTPHAIKSHILPPNLESFSAVAFTSRTGISALLDA
        MST AG PGPIRQNPLGSSPS AIPGHSIRRTAAFTTPQNYAGSLSRLLTLKGMDPLWCPTLTVQPTPHAIKSHILPPNLESFSAVAFTSR GISALLDA
Subjt:  MSTGAGPPGPIRQNPLGSSPSPAIPGHSIRRTAAFTTPQNYAGSLSRLLTLKGMDPLWCPTLTVQPTPHAIKSHILPPNLESFSAVAFTSRTGISALLDA

Query:  ATEIGEPLLPSQGETFLIAALGKDSELLDHGFISQICPNASRIQVVLPKIATPSGLVEALGVGNRRRVLCPVPRVVGLDEPPVVPNFLRDLEANGWVPVR
        ATEIGEPLLPSQGETFLIAALGKDSELLDHGFISQICPNASRIQVVLPKIATPSGLVEALGVGNRRRVLCPVPRVVGL+EPPVVPNFL DLEANGWVPVR
Subjt:  ATEIGEPLLPSQGETFLIAALGKDSELLDHGFISQICPNASRIQVVLPKIATPSGLVEALGVGNRRRVLCPVPRVVGLDEPPVVPNFLRDLEANGWVPVR

Query:  VDAYETRWAGPGCARQLVERDEKLDAIVFTSTGEVEGLLKSLRDLGFEWETMRKRWPEMVVAAHGPVTAAGAERLGVKVDLVSCRFDSFNGVVDALHSRW
        VDAYETRWAGPGCARQLVERDEKLDAIVFTSTGEVEGLLKSLRDLGFEWETMRKRWPEMVVAAHGPVTAAGAERLGVKVDLVSCRFDSFNGVVDALHSRW
Subjt:  VDAYETRWAGPGCARQLVERDEKLDAIVFTSTGEVEGLLKSLRDLGFEWETMRKRWPEMVVAAHGPVTAAGAERLGVKVDLVSCRFDSFNGVVDALHSRW

Query:  QSLEQNPN
        QSLEQNPN
Subjt:  QSLEQNPN

A0A6J1EGG3 Hydroxymethylbilane hydrolyase [cyclizing]6.3e-14282.26Show/hide
Query:  MSTGAGPPGPIRQ-NPLGSSPSPAIPGHSIRRTAAFTTPQNYAGSLSRLLTLKGMDPLWCPTLTVQPTPHAIKSHILPPNLESFSAVAFTSRTGISALLD
        MSTGA  P PIR  NPL SSPSPAI  +S  RT AFTTPQNYAGSLS LL+LKG DPLWCPT+TV PTP AIKSH+LPPNL  +SAVAFTSR+GI+ALLD
Subjt:  MSTGAGPPGPIRQ-NPLGSSPSPAIPGHSIRRTAAFTTPQNYAGSLSRLLTLKGMDPLWCPTLTVQPTPHAIKSHILPPNLESFSAVAFTSRTGISALLD

Query:  AATEIGEPLLPSQGETFLIAALGKDSELLDHGFISQICPNASRIQVVLPKIATPSGLVEALGVGNRRRVLCPVPRVVGLDEPPVVPNFLRDLEANGWVPV
        AATEI EPLL  QG+TFLIAALGKDSELLDHG  S+ CPNASRI+VV+PKIA+PSGLVEALG+GN RRVLCPVPRVVGL+EPPVVPNFLRDL A+GWVPV
Subjt:  AATEIGEPLLPSQGETFLIAALGKDSELLDHGFISQICPNASRIQVVLPKIATPSGLVEALGVGNRRRVLCPVPRVVGLDEPPVVPNFLRDLEANGWVPV

Query:  RVDAYETRWAGPGCARQLVER--DEKLDAIVFTSTGEVEGLLKSLRDLGFEWETMRKRWPEMVVAAHGPVTAAGAERLGVKVDLVSCRFDSFNGVVDALH
        RVDAYETRW GP CAR+L ER  DEKLDAIVFTSTGEVEGLLKSLR LG +WE MRK+WPEMV+AAHGPVTAAGAERLGVK+DLVS +FDSFNGVVDALH
Subjt:  RVDAYETRWAGPGCARQLVER--DEKLDAIVFTSTGEVEGLLKSLRDLGFEWETMRKRWPEMVVAAHGPVTAAGAERLGVKVDLVSCRFDSFNGVVDALH

Query:  SRWQSLEQNP
        SRWQSLEQNP
Subjt:  SRWQSLEQNP

A0A6J1KXQ2 Hydroxymethylbilane hydrolyase [cyclizing]1.3e-14282.2Show/hide
Query:  MSTGAGPPGPIRQNPLGSSPSPAIPGHSIRRTAAFTTPQNYAGSLSRLLTLKGMDPLWCPTLTVQPTPHAIKSHILPPNLESFSAVAFTSRTGISALLDA
        MSTGA  P  I  NPL SSPSPAIP +S  RT AFTTPQNYAGSLS LL+LKG DPLWCPT+TV PTP AIKSH+LPPNL  +SAVAFTSR+GI+ALLDA
Subjt:  MSTGAGPPGPIRQNPLGSSPSPAIPGHSIRRTAAFTTPQNYAGSLSRLLTLKGMDPLWCPTLTVQPTPHAIKSHILPPNLESFSAVAFTSRTGISALLDA

Query:  ATEIGEPLLPSQGETFLIAALGKDSELLDHGFISQICPNASRIQVVLPKIATPSGLVEALGVGNRRRVLCPVPRVVGLDEPPVVPNFLRDLEANGWVPVR
        ATEI +PLL  QG+TFLIAALGKDSELLDHG +S+ CPN SRI+VV+PKIATPSGLVEALG+GN RRVLCPVPRVVGL+EPPVVPNFLRDL A+GWVPVR
Subjt:  ATEIGEPLLPSQGETFLIAALGKDSELLDHGFISQICPNASRIQVVLPKIATPSGLVEALGVGNRRRVLCPVPRVVGLDEPPVVPNFLRDLEANGWVPVR

Query:  VDAYETRWAGPGCARQLVER--DEKLDAIVFTSTGEVEGLLKSLRDLGFEWETMRKRWPEMVVAAHGPVTAAGAERLGVKVDLVSCRFDSFNGVVDALHS
        VDAYETRWAGP CAR LV+R  DEKLDAIVFTSTGEVEGLLKSLR LG +WE MRK+WPEMV+AAHGPVTAAGAERLGVK+DLVS +FDSFNGVVDALHS
Subjt:  VDAYETRWAGPGCARQLVER--DEKLDAIVFTSTGEVEGLLKSLRDLGFEWETMRKRWPEMVVAAHGPVTAAGAERLGVKVDLVSCRFDSFNGVVDALHS

Query:  RWQSLEQNP
        RWQSLEQNP
Subjt:  RWQSLEQNP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGCACCGGCGCCGGCCCTCCCGGCCCAATCCGCCAAAATCCACTCGGTTCATCTCCTTCTCCCGCCATTCCCGGCCACTCCATTCGGCGTACGGCGGCGTTCACGAC
GCCTCAGAACTACGCCGGCAGCCTCTCCCGCCTCCTCACTCTCAAAGGCATGGATCCCCTGTGGTGCCCCACCCTCACCGTCCAGCCCACTCCCCACGCCATCAAATCCC
ACATCCTTCCCCCAAATCTCGAATCCTTCTCCGCCGTCGCTTTCACTTCCCGCACCGGAATTTCAGCGCTCCTCGACGCTGCAACTGAAATCGGCGAACCCTTGCTACCG
TCGCAGGGCGAGACTTTTCTAATCGCTGCCCTAGGTAAGGATTCGGAGCTTCTCGACCATGGATTTATTTCCCAAATTTGCCCTAACGCGAGTAGAATTCAAGTCGTGCT
CCCTAAAATAGCAACTCCGAGTGGTTTAGTGGAGGCTCTTGGAGTTGGAAACCGCCGGCGGGTTCTGTGTCCGGTTCCTCGCGTCGTGGGGCTGGACGAGCCTCCGGTAG
TTCCGAACTTCCTGCGCGACCTGGAGGCGAACGGATGGGTTCCGGTTCGGGTCGATGCGTACGAGACCCGATGGGCCGGACCCGGGTGCGCGAGGCAGCTCGTGGAGAGA
GATGAGAAATTGGATGCCATTGTGTTCACAAGTACTGGGGAAGTGGAGGGCCTGCTGAAAAGCTTGAGGGATTTAGGGTTCGAGTGGGAGACGATGAGAAAGAGGTGGCC
GGAAATGGTGGTGGCCGCGCACGGGCCGGTGACGGCGGCCGGAGCCGAGAGGCTTGGCGTCAAGGTTGATTTGGTGAGTTGTAGATTCGATAGCTTTAACGGTGTGGTTG
ATGCTCTTCATTCCAGATGGCAGAGCTTAGAACAGAACCCTAAC
mRNA sequenceShow/hide mRNA sequence
ATGAGCACCGGCGCCGGCCCTCCCGGCCCAATCCGCCAAAATCCACTCGGTTCATCTCCTTCTCCCGCCATTCCCGGCCACTCCATTCGGCGTACGGCGGCGTTCACGAC
GCCTCAGAACTACGCCGGCAGCCTCTCCCGCCTCCTCACTCTCAAAGGCATGGATCCCCTGTGGTGCCCCACCCTCACCGTCCAGCCCACTCCCCACGCCATCAAATCCC
ACATCCTTCCCCCAAATCTCGAATCCTTCTCCGCCGTCGCTTTCACTTCCCGCACCGGAATTTCAGCGCTCCTCGACGCTGCAACTGAAATCGGCGAACCCTTGCTACCG
TCGCAGGGCGAGACTTTTCTAATCGCTGCCCTAGGTAAGGATTCGGAGCTTCTCGACCATGGATTTATTTCCCAAATTTGCCCTAACGCGAGTAGAATTCAAGTCGTGCT
CCCTAAAATAGCAACTCCGAGTGGTTTAGTGGAGGCTCTTGGAGTTGGAAACCGCCGGCGGGTTCTGTGTCCGGTTCCTCGCGTCGTGGGGCTGGACGAGCCTCCGGTAG
TTCCGAACTTCCTGCGCGACCTGGAGGCGAACGGATGGGTTCCGGTTCGGGTCGATGCGTACGAGACCCGATGGGCCGGACCCGGGTGCGCGAGGCAGCTCGTGGAGAGA
GATGAGAAATTGGATGCCATTGTGTTCACAAGTACTGGGGAAGTGGAGGGCCTGCTGAAAAGCTTGAGGGATTTAGGGTTCGAGTGGGAGACGATGAGAAAGAGGTGGCC
GGAAATGGTGGTGGCCGCGCACGGGCCGGTGACGGCGGCCGGAGCCGAGAGGCTTGGCGTCAAGGTTGATTTGGTGAGTTGTAGATTCGATAGCTTTAACGGTGTGGTTG
ATGCTCTTCATTCCAGATGGCAGAGCTTAGAACAGAACCCTAAC
Protein sequenceShow/hide protein sequence
MSTGAGPPGPIRQNPLGSSPSPAIPGHSIRRTAAFTTPQNYAGSLSRLLTLKGMDPLWCPTLTVQPTPHAIKSHILPPNLESFSAVAFTSRTGISALLDAATEIGEPLLP
SQGETFLIAALGKDSELLDHGFISQICPNASRIQVVLPKIATPSGLVEALGVGNRRRVLCPVPRVVGLDEPPVVPNFLRDLEANGWVPVRVDAYETRWAGPGCARQLVER
DEKLDAIVFTSTGEVEGLLKSLRDLGFEWETMRKRWPEMVVAAHGPVTAAGAERLGVKVDLVSCRFDSFNGVVDALHSRWQSLEQNPN