; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0000320 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0000320
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionDNA-directed DNA polymerase
Genome locationchr4:3907649..3913724
RNA-Seq ExpressionLag0000320
SyntenyLag0000320
Gene Ontology termsNA
InterPro domainsIPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_017227899.1 PREDICTED: uncharacterized protein LOC108203467 [Daucus carota subsp. sativus]1.7e-6943.9Show/hide
Query:  MMKEYM-------TSTDAAIQSNQASMRALEVQVGQLAIELKARPQGKLSSDTEHPRREGKEQVHVVTLRRGKPLEERKEPSTPQD-VEKNSDKNTVVEK
        M+KEY+       + T+A + S  AS+R LE QVGQLA EL+ RP G L SDTE P+  G E    +TL+ GK L      +   D VE + ++    +K
Subjt:  MMKEYM-------TSTDAAIQSNQASMRALEVQVGQLAIELKARPQGKLSSDTEHPRREGKEQVHVVTLRRGKPLEERKEPSTPQD-VEKNSDKNTVVEK

Query:  ELESGEGAGGSNNDAGASGFVPDVKPPYVPPPPYVPPLPFQQRQRPKNHDGQFKK---------------------------------------------
        E E+ +           S     ++  ++ P P     PF QR + +  D QFKK                                             
Subjt:  ELESGEGAGGSNNDAGASGFVPDVKPPYVPPPPYVPPLPFQQRQRPKNHDGQFKK---------------------------------------------

Query:  ----------NGLPTKAKDPWSFTIPVSIGEKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTFQLTDRSITYPEGKIEDVLVKVDKFIFPVDFII
                  + LPTK KDP SFTIP +IG+   G ALCDLGASINLMP+SV+RKLGIGE RPTTVT QL DRS+ +PEGKIEDVLVKVDKFIFP DFI+
Subjt:  ----------NGLPTKAKDPWSFTIPVSIGEKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTFQLTDRSITYPEGKIEDVLVKVDKFIFPVDFII

Query:  LDYEADKDVPIILGRPFFATGRTLINVQKGELTMRVYNEEVKFNVFKAMKYLDEVEDCSFIRILENTIVETTIQDLTNKHLED--HGESKKNPISISISF
        LDYEAD++VPIILGRPF ATGRTLI+VQ GELTMRV +E+V FNVFKAMKY D+VEDCS I I +  I +    + ++  LE      S++   ++ +  
Subjt:  LDYEADKDVPIILGRPFFATGRTLINVQKGELTMRVYNEEVKFNVFKAMKYLDEVEDCSFIRILENTIVETTIQDLTNKHLED--HGESKKNPISISISF

Query:  FFESNRRPAR
        + E+N +  R
Subjt:  FFESNRRPAR

XP_022951570.1 uncharacterized protein LOC111454344 [Cucurbita moschata]7.0e-7145.45Show/hide
Query:  MMKEYMTSTDAAIQSNQASMRALEVQVGQLAIELKARPQGKLSSDTEHPRREGKEQVHVVTLRRGKPLEERKEPSTPQDVEKNSDKNTVVEKELESGEGA
        ++KEYM   D  IQS QAS++ LEVQVGQLA EL+ RP GKL +DTE P+REGKEQ   + LR GK +    E +  Q  + +S +    ++  +     
Subjt:  MMKEYMTSTDAAIQSNQASMRALEVQVGQLAIELKARPQGKLSSDTEHPRREGKEQVHVVTLRRGKPLEERKEPSTPQDVEKNSDKNTVVEKELESGEGA

Query:  GGSNNDAGASGFVPDVKPPYVPPP---PYVPPLPFQQRQRPKNHDGQFK---------------------------------------------------
           + D       P ++           Y P  PF QR + K  +  F+                                                   
Subjt:  GGSNNDAGASGFVPDVKPPYVPPP---PYVPPLPFQQRQRPKNHDGQFK---------------------------------------------------

Query:  ----KNGLPTKAKDPWSFTIPVSIGEKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTFQLTDRSITYPEGKIEDVLVKVDKFIFPVDFIILDYEA
            KN +P K KDP SFTIPVSIG KELGRALCDLGASINLMPLS+Y+KLGIGEARPTTVT QL DRSITYPEGKIED+L++VDKFIFP DFIILDYEA
Subjt:  ----KNGLPTKAKDPWSFTIPVSIGEKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTFQLTDRSITYPEGKIEDVLVKVDKFIFPVDFIILDYEA

Query:  DKDVPIILGRPFFATGRTLINVQKGELTMRVYNEEVKFNVFKAMKYLDEVEDCSFIRILENTIVETTIQDLTNK
        D DVPIILGRPF  TGRTL++V KG +T+R+ +++V+FN+  +MKY   +E+CS +     ++ E T Q  T +
Subjt:  DKDVPIILGRPFFATGRTLINVQKGELTMRVYNEEVKFNVFKAMKYLDEVEDCSFIRILENTIVETTIQDLTNK

XP_023522102.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111785979 [Cucurbita pepo subsp. pepo]9.1e-7144.42Show/hide
Query:  MMKEYMTSTDAAIQSNQASMRALEVQVGQLAIELKARPQGKLSSDTEHPRREGKEQVHVVTLR-------RGKPLEERKEPSTPQDVEKNSDKNTVVEKE
        ++KEYM   DAAIQS QAS+R LEVQVGQLA EL+ RP  KL +DTE P+REG EQ   + LR       RG+ ++E  +  + +  +    K   V +E
Subjt:  MMKEYMTSTDAAIQSNQASMRALEVQVGQLAIELKARPQGKLSSDTEHPRREGKEQVHVVTLR-------RGKPLEERKEPSTPQDVEKNSDKNTVVEKE

Query:  LESGEGAGGSNNDAGASGFVPDVKPPYVPPPP--------YVPPLPFQQRQRPKNHDGQFK---------------------------------------
          +   A  +     +  +   VKPP              Y P  PF QR + K  +  F+                                       
Subjt:  LESGEGAGGSNNDAGASGFVPDVKPPYVPPPP--------YVPPLPFQQRQRPKNHDGQFK---------------------------------------

Query:  ----------------KNGLPTKAKDPWSFTIPVSIGEKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTFQLTDRSITYPEGKIEDVLVKVDKFI
                        KN +P K KDP SFTIP+SIG K+LGRALCDLG+SINLMPLS+Y+KLGIGEARPTTVT QL DRS TYPEGKIED+L++VDKFI
Subjt:  ----------------KNGLPTKAKDPWSFTIPVSIGEKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTFQLTDRSITYPEGKIEDVLVKVDKFI

Query:  FPVDFIILDYEADKDVPIILGRPFFATGRTLINVQKGELTMRVYNEEVKFNVFKAMKYLDEVEDCSFIRILENTIVETTIQDLTNKHLEDHGES
        FP DFIILDYEAD DVPIILGRPF  TGRTL++V KG +T+R+ +++V+FN+  +MKY    E+CS        + E T Q  T +   D GES
Subjt:  FPVDFIILDYEADKDVPIILGRPFFATGRTLINVQKGELTMRVYNEEVKFNVFKAMKYLDEVEDCSFIRILENTIVETTIQDLTNKHLEDHGES

XP_024022362.1 uncharacterized protein LOC112091881 [Morus notabilis]1.2e-7352.74Show/hide
Query:  MMKEYMTSTD-------AAIQSNQASMRALEVQVGQLAIELKARPQGKLSSDTEHPRREGKEQ----VHVVTLRRGKPLEERKEPSTPQDVEKNSDKNTV
        ++KEYM   D       A +QS  AS+R LE QVGQLA  L  RPQG L SDTE+PRR+GKEQ       +TLR G+ +E   +P+      ++S   T 
Subjt:  MMKEYMTSTD-------AAIQSNQASMRALEVQVGQLAIELKARPQGKLSSDTEHPRREGKEQ----VHVVTLRRGKPLEERKEPSTPQDVEKNSDKNTV

Query:  VEKELESGEGAGGSNNDAGASGFVPDVKPPYVPPPPYVPPLPFQQRQRPKNHDGQFK------------KNGLPTKAKDPWSFTIPVSIGEKELGRALCD
          ++  +       + DA     +P V+      P YV  +     +  K   G+F+            KN LP K KDP SFTIP SIG++ +G+ALCD
Subjt:  VEKELESGEGAGGSNNDAGASGFVPDVKPPYVPPPPYVPPLPFQQRQRPKNHDGQFK------------KNGLPTKAKDPWSFTIPVSIGEKELGRALCD

Query:  LGASINLMPLSVYRKLGIGEARPTTVTFQLTDRSITYPEGKIEDVLVKVDKFIFPVDFIILDYEADKDVPIILGRPFFATGRTLINVQKGELTMRVYNEE
        LGASINLMP+S++RKLGIGEARPTTVT QL DRS  +PEGKIEDVLV+VDKFIFP DFI+LDYEADK+VPIILGRPF ATGRTLI+VQKGELTMRV++++
Subjt:  LGASINLMPLSVYRKLGIGEARPTTVTFQLTDRSITYPEGKIEDVLVKVDKFIFPVDFIILDYEADKDVPIILGRPFFATGRTLINVQKGELTMRVYNEE

Query:  VKFNVFKAMKYLDEVEDCSFIRILENTI
        V FNVFKAM++ DEVE+CS + IL++ +
Subjt:  VKFNVFKAMKYLDEVEDCSFIRILENTI

XP_030509265.1 uncharacterized protein LOC115723943 [Cannabis sativa]4.8e-7247.97Show/hide
Query:  MMKEYMTSTDAAIQSNQASMRALEVQVGQLAIELKARPQGKLSSDTEHPRREGKEQVHVVTLRRGKPL----EERK---EPSTPQDVEKNSDKNTVVEKE
        +M++YM   DA IQS  AS+R LE+Q+G LA ELKARPQG L SDTE+PRR+GKEQ   + LR GK L    EE K   EP++ Q+ EK S K      +
Subjt:  MMKEYMTSTDAAIQSNQASMRALEVQVGQLAIELKARPQGKLSSDTEHPRREGKEQVHVVTLRRGKPL----EERK---EPSTPQDVEKNSDKNTVVEKE

Query:  LESGEGAGGSNNDAGASGFVPDVKPPYVPPPPYVPPLPFQQRQRPKNHDGQFK-----------------------------------------------
            + A G  +D+  S            P    PPLPF QR R +  DGQFK                                               
Subjt:  LESGEGAGGSNNDAGASGFVPDVKPPYVPPPPYVPPLPFQQRQRPKNHDGQFK-----------------------------------------------

Query:  --------KNGLPTKAKDPWSFTIPVSIGEKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTFQLTDRSITYPEGKIEDVLVKVDKFIFPVDFIIL
                K+ +P K KDP SFTIP SIG +++GRALCDLGASINLMP+S+++KLGIGEARPTTVT QL DRS+ +PEGKIEDVLV+VDKFIFP DFIIL
Subjt:  --------KNGLPTKAKDPWSFTIPVSIGEKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTFQLTDRSITYPEGKIEDVLVKVDKFIFPVDFIIL

Query:  DYEADKDVPIILGRPFFATGRTLINVQKGELTMRVYNEEVKFNVFKAMKYLDEVEDCSFIRILENTIVE
        DYEAD+DVPIILGRPF ATGRTLI+VQ GELTMR+                DE+E+CS I ++++ + E
Subjt:  DYEADKDVPIILGRPFFATGRTLINVQKGELTMRVYNEEVKFNVFKAMKYLDEVEDCSFIRILENTIVE

TrEMBL top hitse value%identityAlignment
A0A2G9IA86 DNA-directed DNA polymerase2.9e-6244.6Show/hide
Query:  SNQASMRALEVQVGQLAIELKARPQGKLSSDTE-HPRREGKEQVHVVTLRRGKPLEE-RKEPSTPQDVEKNSDKNTVVEKELESG-EGAGGSNNDAGASG
        S   +++ +E Q+GQLA  + +RPQG LSS+TE +PR++GK Q   VTLR G+ L+E  KEP+  +  E  S+K    EKE+E+  E             
Subjt:  SNQASMRALEVQVGQLAIELKARPQGKLSSDTE-HPRREGKEQVHVVTLRRGKPLEE-RKEPSTPQDVEKNSDKNTVVEKELESG-EGAGGSNNDAGASG

Query:  FVPDVKPPYV---------PPPPYVPPLPF--QQRQRPKNHD--------GQFKKNGLPTKAKDPWSFTIPVSIGEKELGRALCDLGASINLMPLSVYRK
        F+   K  ++           P YV  + +   +++R  +++            +N LP K KDP SFTIP +IG    GRALCDLGASINLMP S+YR 
Subjt:  FVPDVKPPYV---------PPPPYVPPLPF--QQRQRPKNHD--------GQFKKNGLPTKAKDPWSFTIPVSIGEKELGRALCDLGASINLMPLSVYRK

Query:  LGIGEARPTTVTFQLTDRSITYPEGKIEDVLVKVDKFIFPVDFIILDYEADKDVPIILGRPFFATGRTLINVQKGELTMRVYNEEVKFNVFKAMKYLDEV
        LG+GEA+PT++T QL +RS+TYP+G IED+LVKVDKFIFP DF++LD E D +VPIILGRPF ATGRTLI+VQKG+LTMRV ++++ FNVFKAMK+ +E 
Subjt:  LGIGEARPTTVTFQLTDRSITYPEGKIEDVLVKVDKFIFPVDFIILDYEADKDVPIILGRPFFATGRTLINVQKGELTMRVYNEEVKFNVFKAMKYLDEV

Query:  EDCSFIRILENTIVETTIQDLTNKHL-----EDHGESKKNPISISISFFFES
        ++C  + + +N     +I D   + L     ED+ E ++   ++  S +F+S
Subjt:  EDCSFIRILENTIVETTIQDLTNKHL-----EDHGESKKNPISISISFFFES

A0A6J1DY39 uncharacterized protein LOC1110256531.1e-6441.69Show/hide
Query:  MKEYMTSTDAAIQS----------NQASMRALEVQVGQLAIELKARPQGKLSSDTEHPRREGKEQVHVVTLRRGKPLEERKEP-STPQDVEKNSDKNTVV
        MKE MT TD  ++           N  ++R LE+Q+GQL  E++ RPQG L S TE PRR GKE  + +  R G   E  + P  +     +  D   V 
Subjt:  MKEYMTSTDAAIQS----------NQASMRALEVQVGQLAIELKARPQGKLSSDTEHPRREGKEQVHVVTLRRGKPLEERKEP-STPQDVEKNSDKNTVV

Query:  EKELESGEGAGGSNNDAGASGFVPDVKPPYVPPPPYV-PPLPFQQRQRPKNHDGQFK-------------------------------------------
        +K +E                  P V  P  P      PP PF QR   KN D  F+                                           
Subjt:  EKELESGEGAGGSNNDAGASGFVPDVKPPYVPPPPYV-PPLPFQQRQRPKNHDGQFK-------------------------------------------

Query:  ------------KNGLPTKAKDPWSFTIPVSIGEKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTFQLTDRSITYPEGKIEDVLVKVDKFIFPVD
                    K+ +P K KDP SFTIP  IG K++GRALCDLGASINLMPLS+++K  IG+A PTTVT QL DRSIT PEGKIEDVLVKVDKFIFP D
Subjt:  ------------KNGLPTKAKDPWSFTIPVSIGEKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTFQLTDRSITYPEGKIEDVLVKVDKFIFPVD

Query:  FIILDYEADKDVPIILGRPFFATGRTLINVQKGELTMRVYNEEVKFNVFKAMKYLDEVEDCSFIRILENTIVETTIQDLTNKHLEDHGESKKNPISISIS
        FIILD EADKDVPIILGRPF ATG TLI+V+KGELTMRV +++V FN+  AMKY D++E+C+ I I +  I    + DL N  +E   E  +    I+ +
Subjt:  FIILDYEADKDVPIILGRPFFATGRTLINVQKGELTMRVYNEEVKFNVFKAMKYLDEVEDCSFIRILENTIVETTIQDLTNKHLEDHGESKKNPISISIS

Query:  FFFESNRRPARHLHL
           +  R+  + L +
Subjt:  FFFESNRRPARHLHL

A0A6J1DZC3 uncharacterized protein LOC1110244497.6e-6346.78Show/hide
Query:  MKEYMTSTDAAIQS----NQASMRALEVQVGQLAIELKARPQGKLSSDTEHPRREGKEQVHVVTLRRGKPLEERKEPSTPQDVEKNSDKNTVVEKELESG
        M+E+ T  D AI+     N A+MR LE Q+GQLA ELK RP+G L S TE P+ EG+E    +T R G   EE K P          +K T  E      
Subjt:  MKEYMTSTDAAIQS----NQASMRALEVQVGQLAIELKARPQGKLSSDTEHPRREGKEQVHVVTLRRGKPLEERKEPSTPQDVEKNSDKNTVVEKELESG

Query:  EGAGGSNNDAGASGFVPDVKPPYVPPPPYVPPLPFQQRQRPKNHDGQFK------------KNGLPTKAKDPWSFTIPVSIGEKELGRALCDLGASINLM
                        PD +P  +  P Y   L     ++ K   G+++            K+ +  K KDP SFTIP SIG K++GRALCDL ASINLM
Subjt:  EGAGGSNNDAGASGFVPDVKPPYVPPPPYVPPLPFQQRQRPKNHDGQFK------------KNGLPTKAKDPWSFTIPVSIGEKELGRALCDLGASINLM

Query:  PLSVYRKLGIGEARPTTVTFQLTDRSITYPEGKIEDVLVKVDKFIFPVDFIILDYEADKDVPIILGRPFFATGRTLINVQKGELTMRVYNEEVKFNVFKA
        PLS+++KL IG+A PTTVT QL DRSIT PEGKIEDVLVKVDKFIFP DFIIL+ EADKDVPIILGRPF +TG TLI+V+KGELTM V +++V FN+  A
Subjt:  PLSVYRKLGIGEARPTTVTFQLTDRSITYPEGKIEDVLVKVDKFIFPVDFIILDYEADKDVPIILGRPFFATGRTLINVQKGELTMRVYNEEVKFNVFKA

Query:  MKYLDEVEDCSFIRILENTIVETTIQDLTNKHLEDH-GESKKNPISISISFFFESNR
        MKY D++E+C+ I I    +    + DL N  +E    E++K  I  +I+   E  +
Subjt:  MKYLDEVEDCSFIRILENTIVETTIQDLTNKHLEDH-GESKKNPISISISFFFESNR

A0A6J1GJ68 uncharacterized protein LOC1114543443.4e-7145.45Show/hide
Query:  MMKEYMTSTDAAIQSNQASMRALEVQVGQLAIELKARPQGKLSSDTEHPRREGKEQVHVVTLRRGKPLEERKEPSTPQDVEKNSDKNTVVEKELESGEGA
        ++KEYM   D  IQS QAS++ LEVQVGQLA EL+ RP GKL +DTE P+REGKEQ   + LR GK +    E +  Q  + +S +    ++  +     
Subjt:  MMKEYMTSTDAAIQSNQASMRALEVQVGQLAIELKARPQGKLSSDTEHPRREGKEQVHVVTLRRGKPLEERKEPSTPQDVEKNSDKNTVVEKELESGEGA

Query:  GGSNNDAGASGFVPDVKPPYVPPP---PYVPPLPFQQRQRPKNHDGQFK---------------------------------------------------
           + D       P ++           Y P  PF QR + K  +  F+                                                   
Subjt:  GGSNNDAGASGFVPDVKPPYVPPP---PYVPPLPFQQRQRPKNHDGQFK---------------------------------------------------

Query:  ----KNGLPTKAKDPWSFTIPVSIGEKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTFQLTDRSITYPEGKIEDVLVKVDKFIFPVDFIILDYEA
            KN +P K KDP SFTIPVSIG KELGRALCDLGASINLMPLS+Y+KLGIGEARPTTVT QL DRSITYPEGKIED+L++VDKFIFP DFIILDYEA
Subjt:  ----KNGLPTKAKDPWSFTIPVSIGEKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTFQLTDRSITYPEGKIEDVLVKVDKFIFPVDFIILDYEA

Query:  DKDVPIILGRPFFATGRTLINVQKGELTMRVYNEEVKFNVFKAMKYLDEVEDCSFIRILENTIVETTIQDLTNK
        D DVPIILGRPF  TGRTL++V KG +T+R+ +++V+FN+  +MKY   +E+CS +     ++ E T Q  T +
Subjt:  DKDVPIILGRPFFATGRTLINVQKGELTMRVYNEEVKFNVFKAMKYLDEVEDCSFIRILENTIVETTIQDLTNK

A0A6J1H7K8 uncharacterized protein LOC1114611671.4e-6941.77Show/hide
Query:  MMKEYMTSTDAAIQSNQASMRALEVQVGQLAIELKARPQGKLSSDTEHPRREGKEQVHVVTLRRGKPLEERKEPSTPQDVEKNSDKNTVVEKELESGEGA
        ++KEYM   D  IQS QAS+R LEVQVGQLA EL+ RP GKL SDTE P+REG EQ   + LR GK +  R+E        ++ +     +++ E+    
Subjt:  MMKEYMTSTDAAIQSNQASMRALEVQVGQLAIELKARPQGKLSSDTEHPRREGKEQVHVVTLRRGKPLEERKEPSTPQDVEKNSDKNTVVEKELESGEGA

Query:  GGSNNDAGA-------SGFVPDVKPPYVPPPP--------YVPPLPFQQRQRPKNHDGQFK---------------------------------------
          + NDA A         +   +KPP              Y P  PF QR + K  +  F+                                       
Subjt:  GGSNNDAGA-------SGFVPDVKPPYVPPPP--------YVPPLPFQQRQRPKNHDGQFK---------------------------------------

Query:  ----------------KNGLPTKAKDPWSFTIPVSIGEKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTFQLTDRSITYPEGKIEDVLVKVDKFI
                        KN +P K KDP SFTIP+SIG K+LGRALCDLG+SINLMPLS+Y+KLGIGEARPTTVT QL DRS T+PEGKIED+L++VDKFI
Subjt:  ----------------KNGLPTKAKDPWSFTIPVSIGEKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTFQLTDRSITYPEGKIEDVLVKVDKFI

Query:  FPVDFIILDYEADKDVPIILGRPFFATGRTLINVQKGELTMRVYNEEVKFNVFKAMKYLDEVEDCSFI-RILENTIVETTIQDLTNKHLEDHGESKKNPI
        FP DFIILDYEAD DVP ILGRPF  TGRTL++V KG + +R+ +++++F++   MKY   VE+CS I ++ EN   ++  Q        ++G + +  +
Subjt:  FPVDFIILDYEADKDVPIILGRPFFATGRTLINVQKGELTMRVYNEEVKFNVFKAMKYLDEVEDCSFI-RILENTIVETTIQDLTNKHLEDHGESKKNPI

Query:  SISISFF
          ++  F
Subjt:  SISISFF

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATGAAAGAATATATGACTAGTACAGATGCCGCAATTCAAAGTAATCAAGCTTCAATGAGAGCCCTTGAAGTGCAAGTGGGTCAGCTAGCTATTGAGCTGAAGGCAAG
GCCTCAAGGGAAACTTTCTTCGGATACTGAGCACCCTAGAAGGGAAGGTAAGGAGCAGGTACATGTAGTGACTCTAAGGAGAGGTAAGCCACTAGAAGAGAGAAAAGAGC
CTAGTACACCCCAAGATGTAGAGAAGAATAGTGATAAAAATACTGTTGTTGAGAAAGAGTTGGAGTCTGGTGAAGGTGCTGGAGGCAGTAATAATGATGCTGGAGCATCT
GGTTTTGTACCAGATGTGAAACCACCTTATGTGCCGCCCCCACCTTATGTACCACCTCTACCTTTTCAACAAAGGCAAAGGCCTAAGAATCATGATGGTCAATTTAAGAA
GAATGGGCTACCAACCAAGGCTAAGGATCCATGGTCATTTACTATTCCTGTATCAATAGGTGAAAAAGAGTTAGGTAGAGCACTTTGTGATTTAGGTGCAAGCATTAACC
TTATGCCTCTTTCGGTCTATCGGAAGCTAGGGATTGGTGAAGCTAGGCCTACCACAGTCACATTCCAATTAACTGATAGGTCCATCACATATCCTGAAGGTAAAATTGAG
GATGTCTTAGTGAAGGTCGATAAATTCATATTTCCTGTTGATTTTATTATCTTAGATTATGAGGCAGACAAAGATGTCCCCATTATTCTTGGTCGTCCATTTTTTGCTAC
TGGTAGGACATTGATAAATGTTCAAAAAGGGGAATTAACAATGAGGGTTTATAATGAGGAAGTAAAATTTAATGTGTTTAAGGCCATGAAGTATCTAGACGAAGTGGAAG
ATTGTTCTTTCATTAGGATTCTGGAGAACACAATTGTTGAGACAACAATCCAGGATTTGACAAATAAACATTTGGAAGATCATGGAGAGTCGAAAAAAAACCCCATCTCC
ATATCCATCTCCTTCTTCTTTGAATCCAACCGCCGACCAGCTCGTCACCTTCACCTCCGGCGACGTGATCTTCCTCTCCAGCGAGCAGTTCCGGCAAACGGAAATGGTAG
AGTAGCGGCGACGTCGTGCAGAAAATTCTGGCGACAAAAGTTGAATTTTCGCGTTTCCTTCCGGTGGGTACTGTTTAAGCTCGCGAACAGCCTATTTTGGGGCGCTTTTT
GGCGTTCTCTCGGCGTTTTCATTAAGTTTACAGAGTGGCAAGCGTTTCTCATTAGTTTTGCGATTCGAGCGCAAGATTTTCAAGTTTCAGCATCTTTGAGCGAGTTCGTC
ATTAGGTTTGATCGTTGGTGTAGAGTGAATTTAGAACCCCGGACAACAAGCTATGGTTCATTTTTTGGCATGTTTGGCTTGGATGGCTATGGTTTTTCTCTGGATTTTAT
TTATGATATTGGAGGATGGGTTGAATGTCTGTTGGAGTTTGGATTGATGTTTGGATGCGAGATCTGGCTTGTTGTGACTATAGAAGTGCCTTATTTTGTTCCCAGGGTTA
AAATTGGATGGATGATTTGTTTAGACCACGAGTTCTCCAAGCCTGTTTCAGGCTATAAAGCACCGACACTTCAAAAATGGCGAAGTGTCGGTGTCGGACACGTGTTGGAC
ACCGACACGCCCCGAAACGCGTCCGACACGTGTCCGACACGCCACGTGGCGTTTGACTGTTATAGCCCAACTTCAGGCACGTTATCGTGCACTCTCTTAGTTCAGGACAG
TGTTGAGCTTATGATCAAGCACGTTATTCGTGCGCCAGATGGCACCAGGCATGTTTTCATGCGAAAAAGAGGAAAGTTGGAATTTACCCCGAAATGCGACCGCATTTCTG
GGAAGGCTAAAACCAAATGCGACCACATTTCTAGAAAAACAGAGACCGTCCCGAGTCTGTTGCGGGTCGTTTTGAACGAGACTTATGCGCACTTAACTGGCTGTTTTGAC
CCCGAAAATGACTGGGCTACCTACCAGCACCTATTTCGTGGCTTCTCAACCACTTCTCATACTATATAA
mRNA sequenceShow/hide mRNA sequence
ATGATGAAAGAATATATGACTAGTACAGATGCCGCAATTCAAAGTAATCAAGCTTCAATGAGAGCCCTTGAAGTGCAAGTGGGTCAGCTAGCTATTGAGCTGAAGGCAAG
GCCTCAAGGGAAACTTTCTTCGGATACTGAGCACCCTAGAAGGGAAGGTAAGGAGCAGGTACATGTAGTGACTCTAAGGAGAGGTAAGCCACTAGAAGAGAGAAAAGAGC
CTAGTACACCCCAAGATGTAGAGAAGAATAGTGATAAAAATACTGTTGTTGAGAAAGAGTTGGAGTCTGGTGAAGGTGCTGGAGGCAGTAATAATGATGCTGGAGCATCT
GGTTTTGTACCAGATGTGAAACCACCTTATGTGCCGCCCCCACCTTATGTACCACCTCTACCTTTTCAACAAAGGCAAAGGCCTAAGAATCATGATGGTCAATTTAAGAA
GAATGGGCTACCAACCAAGGCTAAGGATCCATGGTCATTTACTATTCCTGTATCAATAGGTGAAAAAGAGTTAGGTAGAGCACTTTGTGATTTAGGTGCAAGCATTAACC
TTATGCCTCTTTCGGTCTATCGGAAGCTAGGGATTGGTGAAGCTAGGCCTACCACAGTCACATTCCAATTAACTGATAGGTCCATCACATATCCTGAAGGTAAAATTGAG
GATGTCTTAGTGAAGGTCGATAAATTCATATTTCCTGTTGATTTTATTATCTTAGATTATGAGGCAGACAAAGATGTCCCCATTATTCTTGGTCGTCCATTTTTTGCTAC
TGGTAGGACATTGATAAATGTTCAAAAAGGGGAATTAACAATGAGGGTTTATAATGAGGAAGTAAAATTTAATGTGTTTAAGGCCATGAAGTATCTAGACGAAGTGGAAG
ATTGTTCTTTCATTAGGATTCTGGAGAACACAATTGTTGAGACAACAATCCAGGATTTGACAAATAAACATTTGGAAGATCATGGAGAGTCGAAAAAAAACCCCATCTCC
ATATCCATCTCCTTCTTCTTTGAATCCAACCGCCGACCAGCTCGTCACCTTCACCTCCGGCGACGTGATCTTCCTCTCCAGCGAGCAGTTCCGGCAAACGGAAATGGTAG
AGTAGCGGCGACGTCGTGCAGAAAATTCTGGCGACAAAAGTTGAATTTTCGCGTTTCCTTCCGGTGGGTACTGTTTAAGCTCGCGAACAGCCTATTTTGGGGCGCTTTTT
GGCGTTCTCTCGGCGTTTTCATTAAGTTTACAGAGTGGCAAGCGTTTCTCATTAGTTTTGCGATTCGAGCGCAAGATTTTCAAGTTTCAGCATCTTTGAGCGAGTTCGTC
ATTAGGTTTGATCGTTGGTGTAGAGTGAATTTAGAACCCCGGACAACAAGCTATGGTTCATTTTTTGGCATGTTTGGCTTGGATGGCTATGGTTTTTCTCTGGATTTTAT
TTATGATATTGGAGGATGGGTTGAATGTCTGTTGGAGTTTGGATTGATGTTTGGATGCGAGATCTGGCTTGTTGTGACTATAGAAGTGCCTTATTTTGTTCCCAGGGTTA
AAATTGGATGGATGATTTGTTTAGACCACGAGTTCTCCAAGCCTGTTTCAGGCTATAAAGCACCGACACTTCAAAAATGGCGAAGTGTCGGTGTCGGACACGTGTTGGAC
ACCGACACGCCCCGAAACGCGTCCGACACGTGTCCGACACGCCACGTGGCGTTTGACTGTTATAGCCCAACTTCAGGCACGTTATCGTGCACTCTCTTAGTTCAGGACAG
TGTTGAGCTTATGATCAAGCACGTTATTCGTGCGCCAGATGGCACCAGGCATGTTTTCATGCGAAAAAGAGGAAAGTTGGAATTTACCCCGAAATGCGACCGCATTTCTG
GGAAGGCTAAAACCAAATGCGACCACATTTCTAGAAAAACAGAGACCGTCCCGAGTCTGTTGCGGGTCGTTTTGAACGAGACTTATGCGCACTTAACTGGCTGTTTTGAC
CCCGAAAATGACTGGGCTACCTACCAGCACCTATTTCGTGGCTTCTCAACCACTTCTCATACTATATAA
Protein sequenceShow/hide protein sequence
MMKEYMTSTDAAIQSNQASMRALEVQVGQLAIELKARPQGKLSSDTEHPRREGKEQVHVVTLRRGKPLEERKEPSTPQDVEKNSDKNTVVEKELESGEGAGGSNNDAGAS
GFVPDVKPPYVPPPPYVPPLPFQQRQRPKNHDGQFKKNGLPTKAKDPWSFTIPVSIGEKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTFQLTDRSITYPEGKIE
DVLVKVDKFIFPVDFIILDYEADKDVPIILGRPFFATGRTLINVQKGELTMRVYNEEVKFNVFKAMKYLDEVEDCSFIRILENTIVETTIQDLTNKHLEDHGESKKNPIS
ISISFFFESNRRPARHLHLRRRDLPLQRAVPANGNGRVAATSCRKFWRQKLNFRVSFRWVLFKLANSLFWGAFWRSLGVFIKFTEWQAFLISFAIRAQDFQVSASLSEFV
IRFDRWCRVNLEPRTTSYGSFFGMFGLDGYGFSLDFIYDIGGWVECLLEFGLMFGCEIWLVVTIEVPYFVPRVKIGWMICLDHEFSKPVSGYKAPTLQKWRSVGVGHVLD
TDTPRNASDTCPTRHVAFDCYSPTSGTLSCTLLVQDSVELMIKHVIRAPDGTRHVFMRKRGKLEFTPKCDRISGKAKTKCDHISRKTETVPSLLRVVLNETYAHLTGCFD
PENDWATYQHLFRGFSTTSHTI