; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg002803 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg002803
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
Descriptionendonuclease MutS2 isoform X1
Genome locationscaffold6:457442..460487
RNA-Seq ExpressionSpg002803
SyntenySpg002803
Gene Ontology termsGO:0006298 - mismatch repair (biological process)
GO:0045910 - negative regulation of DNA recombination (biological process)
GO:0090305 - nucleic acid phosphodiester bond hydrolysis (biological process)
GO:0004519 - endonuclease activity (molecular function)
GO:0005524 - ATP binding (molecular function)
GO:0016887 - ATPase activity (molecular function)
GO:0030983 - mismatched DNA binding (molecular function)
InterPro domainsIPR000432 - DNA mismatch repair protein MutS, C-terminal
IPR027417 - P-loop containing nucleoside triphosphate hydrolase
IPR045076 - DNA mismatch repair MutS family


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_038893999.1 endonuclease MutS2 isoform X5 [Benincasa hispida]5.0e-9888.35Show/hide
Query:  NQKVNARASYGLSFGGTCPNLVLPEGCNSCITNVCLSGDRTSEASNLKKNEWVLYLPNAHHPLLLQQYRVNLENAKRDVRNAFTEMGRKLPGGNMSWKEK
        + KVNARASYGLSFGGTCPNL+LPEGCNS I NVCLSGD+TSEAS+ KKNEWVLYL NAHHPLLLQQYR NLENAKRDV+NAFTEMGRKLPGGNMSWKEK
Subjt:  NQKVNARASYGLSFGGTCPNLVLPEGCNSCITNVCLSGDRTSEASNLKKNEWVLYLPNAHHPLLLQQYRVNLENAKRDVRNAFTEMGRKLPGGNMSWKEK

Query:  EVVDISFLKMKVQKLEQAHPVSVDFSISQRIRVLVITGPNTGGKTVCLKTIGLAAMMAKSGIHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGNLR
        EVVDIS LKMKV++LE+A PVSVDFSIS+RI+VLVITGPNTGGKTVCLKTIGLAAMMAKSG+HVLASES QIPWFDS+FADIGDEQSLTQSLSTFSG+LR
Subjt:  EVVDISFLKMKVQKLEQAHPVSVDFSISQRIRVLVITGPNTGGKTVCLKTIGLAAMMAKSGIHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGNLR

Query:  KISLIK
        KIS I+
Subjt:  KISLIK

XP_038894006.1 endonuclease MutS2 isoform X6 [Benincasa hispida]5.0e-9888.35Show/hide
Query:  NQKVNARASYGLSFGGTCPNLVLPEGCNSCITNVCLSGDRTSEASNLKKNEWVLYLPNAHHPLLLQQYRVNLENAKRDVRNAFTEMGRKLPGGNMSWKEK
        + KVNARASYGLSFGGTCPNL+LPEGCNS I NVCLSGD+TSEAS+ KKNEWVLYL NAHHPLLLQQYR NLENAKRDV+NAFTEMGRKLPGGNMSWKEK
Subjt:  NQKVNARASYGLSFGGTCPNLVLPEGCNSCITNVCLSGDRTSEASNLKKNEWVLYLPNAHHPLLLQQYRVNLENAKRDVRNAFTEMGRKLPGGNMSWKEK

Query:  EVVDISFLKMKVQKLEQAHPVSVDFSISQRIRVLVITGPNTGGKTVCLKTIGLAAMMAKSGIHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGNLR
        EVVDIS LKMKV++LE+A PVSVDFSIS+RI+VLVITGPNTGGKTVCLKTIGLAAMMAKSG+HVLASES QIPWFDS+FADIGDEQSLTQSLSTFSG+LR
Subjt:  EVVDISFLKMKVQKLEQAHPVSVDFSISQRIRVLVITGPNTGGKTVCLKTIGLAAMMAKSGIHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGNLR

Query:  KISLIK
        KIS I+
Subjt:  KISLIK

XP_038894074.1 endonuclease MutS2 isoform X11 [Benincasa hispida]1.3e-9887.98Show/hide
Query:  MANQKVNARASYGLSFGGTCPNLVLPEGCNSCITNVCLSGDRTSEASNLKKNEWVLYLPNAHHPLLLQQYRVNLENAKRDVRNAFTEMGRKLPGGNMSWK
        M  +KVNARASYGLSFGGTCPNL+LPEGCNS I NVCLSGD+TSEAS+ KKNEWVLYL NAHHPLLLQQYR NLENAKRDV+NAFTEMGRKLPGGNMSWK
Subjt:  MANQKVNARASYGLSFGGTCPNLVLPEGCNSCITNVCLSGDRTSEASNLKKNEWVLYLPNAHHPLLLQQYRVNLENAKRDVRNAFTEMGRKLPGGNMSWK

Query:  EKEVVDISFLKMKVQKLEQAHPVSVDFSISQRIRVLVITGPNTGGKTVCLKTIGLAAMMAKSGIHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGN
        EKEVVDIS LKMKV++LE+A PVSVDFSIS+RI+VLVITGPNTGGKTVCLKTIGLAAMMAKSG+HVLASES QIPWFDS+FADIGDEQSLTQSLSTFSG+
Subjt:  EKEVVDISFLKMKVQKLEQAHPVSVDFSISQRIRVLVITGPNTGGKTVCLKTIGLAAMMAKSGIHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGN

Query:  LRKISLIK
        LRKIS I+
Subjt:  LRKISLIK

XP_038894082.1 endonuclease MutS2 isoform X12 [Benincasa hispida]3.2e-9789.16Show/hide
Query:  VNARASYGLSFGGTCPNLVLPEGCNSCITNVCLSGDRTSEASNLKKNEWVLYLPNAHHPLLLQQYRVNLENAKRDVRNAFTEMGRKLPGGNMSWKEKEVV
        VNARASYGLSFGGTCPNL+LPEGCNS I NVCLSGD+TSEAS+ KKNEWVLYL NAHHPLLLQQYR NLENAKRDV+NAFTEMGRKLPGGNMSWKEKEVV
Subjt:  VNARASYGLSFGGTCPNLVLPEGCNSCITNVCLSGDRTSEASNLKKNEWVLYLPNAHHPLLLQQYRVNLENAKRDVRNAFTEMGRKLPGGNMSWKEKEVV

Query:  DISFLKMKVQKLEQAHPVSVDFSISQRIRVLVITGPNTGGKTVCLKTIGLAAMMAKSGIHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGNLRKIS
        DIS LKMKV++LE+A PVSVDFSIS+RI+VLVITGPNTGGKTVCLKTIGLAAMMAKSG+HVLASES QIPWFDS+FADIGDEQSLTQSLSTFSG+LRKIS
Subjt:  DISFLKMKVQKLEQAHPVSVDFSISQRIRVLVITGPNTGGKTVCLKTIGLAAMMAKSGIHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGNLRKIS

Query:  LIK
         I+
Subjt:  LIK

XP_038894089.1 endonuclease MutS2 isoform X13 [Benincasa hispida]3.2e-9789.16Show/hide
Query:  VNARASYGLSFGGTCPNLVLPEGCNSCITNVCLSGDRTSEASNLKKNEWVLYLPNAHHPLLLQQYRVNLENAKRDVRNAFTEMGRKLPGGNMSWKEKEVV
        VNARASYGLSFGGTCPNL+LPEGCNS I NVCLSGD+TSEAS+ KKNEWVLYL NAHHPLLLQQYR NLENAKRDV+NAFTEMGRKLPGGNMSWKEKEVV
Subjt:  VNARASYGLSFGGTCPNLVLPEGCNSCITNVCLSGDRTSEASNLKKNEWVLYLPNAHHPLLLQQYRVNLENAKRDVRNAFTEMGRKLPGGNMSWKEKEVV

Query:  DISFLKMKVQKLEQAHPVSVDFSISQRIRVLVITGPNTGGKTVCLKTIGLAAMMAKSGIHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGNLRKIS
        DIS LKMKV++LE+A PVSVDFSIS+RI+VLVITGPNTGGKTVCLKTIGLAAMMAKSG+HVLASES QIPWFDS+FADIGDEQSLTQSLSTFSG+LRKIS
Subjt:  DISFLKMKVQKLEQAHPVSVDFSISQRIRVLVITGPNTGGKTVCLKTIGLAAMMAKSGIHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGNLRKIS

Query:  LIK
         I+
Subjt:  LIK

TrEMBL top hitse value%identityAlignment
A0A0A0KVU2 Uncharacterized protein1.1e-9387.13Show/hide
Query:  VNARASYGLSFGGTCPNLVLPEGCNSCITNVCLSGDRTSEASNLKKNEWVLYLPNAHHPLLLQQYRVNLENAKRDVRNAFTEMGRKLPGGNMSWKEKEVV
        VNARASYGLSFGGTCPNLVL EGCNS I NVCLSGD+ SEAS+LKKNEWVLYL N HHPLLLQQYR NL+NAKRDV+NAF EMGRK PGGNMSWKEKEV+
Subjt:  VNARASYGLSFGGTCPNLVLPEGCNSCITNVCLSGDRTSEASNLKKNEWVLYLPNAHHPLLLQQYRVNLENAKRDVRNAFTEMGRKLPGGNMSWKEKEVV

Query:  DISFLKMKVQKLEQAHPVSVDFSISQRIRVLVITGPNTGGKTVCLKTIGLAAMMAKSGIHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGNLRKIS
        DIS  KMKV +LEQA PVSVDFSIS+RI+VLVITGPNTGGKTVCLKTIGLAAMMAKSG+HVLASESVQIPWFDS+FADIGDEQSLTQSLSTFSG+LRKIS
Subjt:  DISFLKMKVQKLEQAHPVSVDFSISQRIRVLVITGPNTGGKTVCLKTIGLAAMMAKSGIHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGNLRKIS

Query:  LI
        ++
Subjt:  LI

A0A1S3CHZ5 endonuclease MutS2 isoform X18.0e-9486.7Show/hide
Query:  VNARASYGLSFGGTCPNLVLPEGCNSCITNVCLSGDRTSEASNLKKNEWVLYLPNAHHPLLLQQYRVNLENAKRDVRNAFTEMGRKLPGGNMSWKEKEVV
        VNARASYGLSFGGTCPNL+L EGCNS I NVCLSGD+ SEAS+ KKNEWVLYL N HHPLLLQQYR NLENAKRDV+NAF E+GRKLPGGNMSWKEKEVV
Subjt:  VNARASYGLSFGGTCPNLVLPEGCNSCITNVCLSGDRTSEASNLKKNEWVLYLPNAHHPLLLQQYRVNLENAKRDVRNAFTEMGRKLPGGNMSWKEKEVV

Query:  DISFLKMKVQKLEQAHPVSVDFSISQRIRVLVITGPNTGGKTVCLKTIGLAAMMAKSGIHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGNLRKIS
        DIS  KMKV++LEQAHPVSVDFSIS+R++VLVITGPNTGGKTVCLKTIGLAAMMAKSG+HVLASES QIPWFDS+FADIGDEQSLTQSLSTFSG+LRKIS
Subjt:  DISFLKMKVQKLEQAHPVSVDFSISQRIRVLVITGPNTGGKTVCLKTIGLAAMMAKSGIHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGNLRKIS

Query:  LIK
         I+
Subjt:  LIK

A0A1S3CJJ8 endonuclease MutS2 isoform X48.0e-9486.7Show/hide
Query:  VNARASYGLSFGGTCPNLVLPEGCNSCITNVCLSGDRTSEASNLKKNEWVLYLPNAHHPLLLQQYRVNLENAKRDVRNAFTEMGRKLPGGNMSWKEKEVV
        VNARASYGLSFGGTCPNL+L EGCNS I NVCLSGD+ SEAS+ KKNEWVLYL N HHPLLLQQYR NLENAKRDV+NAF E+GRKLPGGNMSWKEKEVV
Subjt:  VNARASYGLSFGGTCPNLVLPEGCNSCITNVCLSGDRTSEASNLKKNEWVLYLPNAHHPLLLQQYRVNLENAKRDVRNAFTEMGRKLPGGNMSWKEKEVV

Query:  DISFLKMKVQKLEQAHPVSVDFSISQRIRVLVITGPNTGGKTVCLKTIGLAAMMAKSGIHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGNLRKIS
        DIS  KMKV++LEQAHPVSVDFSIS+R++VLVITGPNTGGKTVCLKTIGLAAMMAKSG+HVLASES QIPWFDS+FADIGDEQSLTQSLSTFSG+LRKIS
Subjt:  DISFLKMKVQKLEQAHPVSVDFSISQRIRVLVITGPNTGGKTVCLKTIGLAAMMAKSGIHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGNLRKIS

Query:  LIK
         I+
Subjt:  LIK

A0A5D3BZK6 Endonuclease MutS2 isoform X18.0e-9486.7Show/hide
Query:  VNARASYGLSFGGTCPNLVLPEGCNSCITNVCLSGDRTSEASNLKKNEWVLYLPNAHHPLLLQQYRVNLENAKRDVRNAFTEMGRKLPGGNMSWKEKEVV
        VNARASYGLSFGGTCPNL+L EGCNS I NVCLSGD+ SEAS+ KKNEWVLYL N HHPLLLQQYR NLENAKRDV+NAF E+GRKLPGGNMSWKEKEVV
Subjt:  VNARASYGLSFGGTCPNLVLPEGCNSCITNVCLSGDRTSEASNLKKNEWVLYLPNAHHPLLLQQYRVNLENAKRDVRNAFTEMGRKLPGGNMSWKEKEVV

Query:  DISFLKMKVQKLEQAHPVSVDFSISQRIRVLVITGPNTGGKTVCLKTIGLAAMMAKSGIHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGNLRKIS
        DIS  KMKV++LEQAHPVSVDFSIS+R++VLVITGPNTGGKTVCLKTIGLAAMMAKSG+HVLASES QIPWFDS+FADIGDEQSLTQSLSTFSG+LRKIS
Subjt:  DISFLKMKVQKLEQAHPVSVDFSISQRIRVLVITGPNTGGKTVCLKTIGLAAMMAKSGIHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGNLRKIS

Query:  LIK
         I+
Subjt:  LIK

A0A6J1JC76 uncharacterized protein LOC111483140 isoform X51.8e-9384.62Show/hide
Query:  MANQKVNARASYGLSFGGTCPNLVLPEGCNSCITNVCLSGDRTSEASNLKKNEWVLYLPNAHHPLLLQQYRVNLENAKRDVRNAFTEMGRKLPGGNMSWK
        M  +KVNARASYGLSFGG CPNL+LP GCNS I NV LSGD+ SEAS+ K+N+WVLYLPNAHHPLL QQYR +LENAKRDVRNA TE+GRKLPGGNMSWK
Subjt:  MANQKVNARASYGLSFGGTCPNLVLPEGCNSCITNVCLSGDRTSEASNLKKNEWVLYLPNAHHPLLLQQYRVNLENAKRDVRNAFTEMGRKLPGGNMSWK

Query:  EKEVVDISFLKMKVQKLEQAHPVSVDFSISQRIRVLVITGPNTGGKTVCLKTIGLAAMMAKSGIHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGN
        EKEV DIS LKMKV++LEQA PVSVDF+IS RIRVLVITGPNTGGKTVCLKTIGLAAMMAKSG+HVLASESVQIPWFDSV ADIGDEQSLTQSLSTFSG+
Subjt:  EKEVVDISFLKMKVQKLEQAHPVSVDFSISQRIRVLVITGPNTGGKTVCLKTIGLAAMMAKSGIHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGN

Query:  LRKISLIK
        LRKIS I+
Subjt:  LRKISLIK

SwissProt top hitse value%identityAlignment
B8D298 Endonuclease MutS22.8e-1951.14Show/hide
Query:  LEQAHPVSVDFSISQRIRVLVITGPNTGGKTVCLKTIGLAAMMAKSGIHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGNLRKI
        L +  PV +D ++    + LVITGPNTGGKTV LKT+GL  +M ++G+H+ A E   I  F+ V+ADIGDEQS+ Q+LSTFS ++ +I
Subjt:  LEQAHPVSVDFSISQRIRVLVITGPNTGGKTVCLKTIGLAAMMAKSGIHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGNLRKI

B9DVK7 Endonuclease MutS24.8e-1954.76Show/hide
Query:  HPVSVDFSISQRIRVLVITGPNTGGKTVCLKTIGLAAMMAKSGIHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGNLRKI
        +PV+ D   +  + V+VITGPNTGGKT+ LKT+GLA +M +SG+ +LA E  +I  FD+++ADIGDEQS+ QSLSTFS ++  I
Subjt:  HPVSVDFSISQRIRVLVITGPNTGGKTVCLKTIGLAAMMAKSGIHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGNLRKI

B9KYW4 Endonuclease MutS24.8e-1951.14Show/hide
Query:  LEQAHPVSVDFSISQRIRVLVITGPNTGGKTVCLKTIGLAAMMAKSGIHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGNLRKI
        L++   V +D  + +R R+LVITGPNTGGKTV LKT+GL A+MA++G+ + A+    +  F ++F DIGDEQS+ Q+LSTFS ++R+I
Subjt:  LEQAHPVSVDFSISQRIRVLVITGPNTGGKTVCLKTIGLAAMMAKSGIHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGNLRKI

P73625 Endonuclease MutS21.3e-2159.76Show/hide
Query:  VSVDFSISQRIRVLVITGPNTGGKTVCLKTIGLAAMMAKSGIHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGNLRKI
        V +  +I  +IRV+ ITGPNTGGKTV LKT+GL A+MAK G+++ A E+V++PWF  + ADIGDEQSL Q+LSTFSG++ +I
Subjt:  VSVDFSISQRIRVLVITGPNTGGKTVCLKTIGLAAMMAKSGIHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGNLRKI

Q5WEK0 Endonuclease MutS22.8e-1958.54Show/hide
Query:  VSVDFSISQRIRVLVITGPNTGGKTVCLKTIGLAAMMAKSGIHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGNLRKI
        V  D +I  ++R LVITGPNTGGKTV LKTIGL  +MA+SG+ V A+E  ++  F+ +FADIGDEQS+ QSLSTFS +++ I
Subjt:  VSVDFSISQRIRVLVITGPNTGGKTVCLKTIGLAAMMAKSGIHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGNLRKI

Arabidopsis top hitse value%identityAlignment
AT1G65070.1 DNA mismatch repair protein MutS, type 23.1e-2154.65Show/hide
Query:  PVSVDFSISQRIRVLVITGPNTGGKTVCLKTIGLAAMMAKSGIHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGNLRKISLI
        PV VD  +    +V+VI+GPNTGGKT  LKT+GL ++M+KSG+++ A    ++PWFD + ADIGD QSL QSLSTFSG++ +I  I
Subjt:  PVSVDFSISQRIRVLVITGPNTGGKTVCLKTIGLAAMMAKSGIHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGNLRKISLI

AT1G65070.2 DNA mismatch repair protein MutS, type 23.1e-2154.65Show/hide
Query:  PVSVDFSISQRIRVLVITGPNTGGKTVCLKTIGLAAMMAKSGIHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGNLRKISLI
        PV VD  +    +V+VI+GPNTGGKT  LKT+GL ++M+KSG+++ A    ++PWFD + ADIGD QSL QSLSTFSG++ +I  I
Subjt:  PVSVDFSISQRIRVLVITGPNTGGKTVCLKTIGLAAMMAKSGIHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGNLRKISLI

AT3G24320.1 MUTL protein homolog 14.5e-0436.99Show/hide
Query:  VLVITGPNTGGKTVCLKTIGLAAMMAKSGIHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGNLRKISLI
        + ++TGPN GGK+  L++I  AA++  SG+ V A ES  IP FDS+   +    S     S+F   + +I  I
Subjt:  VLVITGPNTGGKTVCLKTIGLAAMMAKSGIHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGNLRKISLI

AT3G24495.1 MUTS homolog 75.8e-0437.5Show/hide
Query:  RVLVITGPNTGGKTVCLKTIGLAAMMAKSGIHVLASESVQIPWFDSVFADIGDEQSLTQSLSTF
        R L++TGPN GGK+  L+   LA + A+ G +V   ES +I   D++F  +G    +    STF
Subjt:  RVLVITGPNTGGKTVCLKTIGLAAMMAKSGIHVLASESVQIPWFDSVFADIGDEQSLTQSLSTF

AT5G54090.1 DNA mismatch repair protein MutS, type 21.2e-4145.59Show/hide
Query:  VNARASYGLSFGGTCPNLVLP--EGCNSCITNVCLSGDRTSEASNLKKNEWVLYLPNAHHPLLLQQYRVNLENAKRDVRNAFTEMGRKLPGGNMSWKEKE
        +NARA+Y  ++GG  P++ LP  +   S                 L K EW+LYLP  +HPLLL Q++  +   +  V+                     
Subjt:  VNARASYGLSFGGTCPNLVLP--EGCNSCITNVCLSGDRTSEASNLKKNEWVLYLPNAHHPLLLQQYRVNLENAKRDVRNAFTEMGRKLPGGNMSWKEKE

Query:  VVDISFLKMKVQKLEQAHPVSVDFSISQRIRVLVITGPNTGGKTVCLKTIGLAAMMAKSGIHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGNLRK
             F K     L  A P+  DF IS+  RVLVITGPNTGGKT+CLK++GLAAMMAKSG++VLA+ES +IPWFD+++ADIGDEQSL QSLSTFSG+L++
Subjt:  VVDISFLKMKVQKLEQAHPVSVDFSISQRIRVLVITGPNTGGKTVCLKTIGLAAMMAKSGIHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGNLRK

Query:  ISLI
        IS I
Subjt:  ISLI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCAATCAGAAAGTCAATGCTCGAGCATCTTATGGTCTTTCATTTGGGGGGACATGTCCCAATTTAGTTCTACCAGAAGGGTGCAACTCTTGTATCACTAATGTCTG
CTTATCAGGAGACCGAACATCTGAGGCATCAAACCTGAAGAAGAATGAATGGGTCCTCTATTTACCTAATGCCCATCACCCTTTACTACTACAGCAATATAGAGTAAATT
TGGAGAATGCCAAGAGGGATGTCAGAAATGCTTTTACTGAGATGGGGAGAAAACTTCCTGGGGGGAATATGTCATGGAAAGAAAAAGAAGTTGTAGATATTTCGTTCTTA
AAAATGAAGGTTCAGAAATTGGAGCAAGCTCATCCTGTTTCGGTTGATTTTTCAATATCTCAAAGAATTCGAGTTTTAGTTATAACTGGTCCTAATACTGGGGGTAAGAC
AGTTTGTTTGAAAACCATTGGATTGGCAGCCATGATGGCAAAATCAGGGATTCATGTCTTAGCTTCAGAGTCTGTACAAATCCCTTGGTTTGATTCTGTTTTTGCTGATA
TCGGTGATGAACAGTCCCTAACCCAATCTTTGTCTACCTTTTCTGGCAATTTGAGAAAAATAAGCTTGATTAAGCTGTGGAAGTTCTTAGCTACTGATGTCTTGTGGGGG
ATAGTGCCTTTTTCTGATTTCATAAGTGGCTTTGGAATTTTCAAGGTTTCTTGGGTTTTTTCATCCTCTTTGAGTGATAATTTGTCTCATCTGCCTCTTGGTTTATCCTT
GCCAGAAAAATCTCATCTTTTATGGATTAATGCAATTAAAGCTCTTCTTTCGGAGACGAGACACAGTGCTCTTTTGGAGCAATCAGAAAATACCCAAGAAGTAGATGAGA
CACGGTGCTCTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCCAATCAGAAAGTCAATGCTCGAGCATCTTATGGTCTTTCATTTGGGGGGACATGTCCCAATTTAGTTCTACCAGAAGGGTGCAACTCTTGTATCACTAATGTCTG
CTTATCAGGAGACCGAACATCTGAGGCATCAAACCTGAAGAAGAATGAATGGGTCCTCTATTTACCTAATGCCCATCACCCTTTACTACTACAGCAATATAGAGTAAATT
TGGAGAATGCCAAGAGGGATGTCAGAAATGCTTTTACTGAGATGGGGAGAAAACTTCCTGGGGGGAATATGTCATGGAAAGAAAAAGAAGTTGTAGATATTTCGTTCTTA
AAAATGAAGGTTCAGAAATTGGAGCAAGCTCATCCTGTTTCGGTTGATTTTTCAATATCTCAAAGAATTCGAGTTTTAGTTATAACTGGTCCTAATACTGGGGGTAAGAC
AGTTTGTTTGAAAACCATTGGATTGGCAGCCATGATGGCAAAATCAGGGATTCATGTCTTAGCTTCAGAGTCTGTACAAATCCCTTGGTTTGATTCTGTTTTTGCTGATA
TCGGTGATGAACAGTCCCTAACCCAATCTTTGTCTACCTTTTCTGGCAATTTGAGAAAAATAAGCTTGATTAAGCTGTGGAAGTTCTTAGCTACTGATGTCTTGTGGGGG
ATAGTGCCTTTTTCTGATTTCATAAGTGGCTTTGGAATTTTCAAGGTTTCTTGGGTTTTTTCATCCTCTTTGAGTGATAATTTGTCTCATCTGCCTCTTGGTTTATCCTT
GCCAGAAAAATCTCATCTTTTATGGATTAATGCAATTAAAGCTCTTCTTTCGGAGACGAGACACAGTGCTCTTTTGGAGCAATCAGAAAATACCCAAGAAGTAGATGAGA
CACGGTGCTCTTAA
Protein sequenceShow/hide protein sequence
MANQKVNARASYGLSFGGTCPNLVLPEGCNSCITNVCLSGDRTSEASNLKKNEWVLYLPNAHHPLLLQQYRVNLENAKRDVRNAFTEMGRKLPGGNMSWKEKEVVDISFL
KMKVQKLEQAHPVSVDFSISQRIRVLVITGPNTGGKTVCLKTIGLAAMMAKSGIHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGNLRKISLIKLWKFLATDVLWG
IVPFSDFISGFGIFKVSWVFSSSLSDNLSHLPLGLSLPEKSHLLWINAIKALLSETRHSALLEQSENTQEVDETRCS