; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10006115 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10006115
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionProtein ALP1-like
Genome locationChr07:13884435..13885471
RNA-Seq ExpressionHG10006115
SyntenyHG10006115
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAF3433792.1 hypothetical protein FNV43_RR24895 [Rhamnella rubrinervis]1.3e-6351.31Show/hide
Query:  MDLIQDNDFENFDFDDKDDVMYIFFNLLYIGFHKIQSSGQPCRTSTLKDHDCVIELLNRNDRRCFDCFRMKRATFIRFCEDLKSKTDLKASKYLTVQEKV
        MDL  D D   F+ DD D++++IF  L+   + +I S  QPCRTS L+ HD V+ELLN ++ RC+DCFRM ++ FI FCE+LKSKT+LK S+++TVQE+V
Subjt:  MDLIQDNDFENFDFDDKDDVMYIFFNLLYIGFHKIQSSGQPCRTSTLKDHDCVIELLNRNDRRCFDCFRMKRATFIRFCEDLKSKTDLKASKYLTVQEKV

Query:  AIFLIIISHNESNRIAAKRFQHSGQTISQAFNLVLRKVCKLGVEIICPPNMDTVPIKIISKSKYYPFFKNLFMFMLIIRDCIGIIDSTHIAACILQNEQI
        AIFL+ I HNE NR+ A+RFQHSG+TIS+ FN VL+KVC LGVE+IC  N D V  +I    KYYPFFKN          CIG ID TH+ A I Q++QI
Subjt:  AIFLIIISHNESNRIAAKRFQHSGQTISQAFNLVLRKVCKLGVEIICPPNMDTVPIKIISKSKYYPFFKNLFMFMLIIRDCIGIIDSTHIAACILQNEQI

Query:  SFHGRKTNTTWNIMCVCSFDMLFTYVKSDRYYLVDSRYSNMLGLLAPFRGQRYHLRDFRERRHRPRG
         F                          D+YY+VDS Y NM G L+P+RG+RYHLRDFR RR +P G
Subjt:  SFHGRKTNTTWNIMCVCSFDMLFTYVKSDRYYLVDSRYSNMLGLLAPFRGQRYHLRDFRERRHRPRG

KAF7123090.1 hypothetical protein RHSIM_Rhsim12G0067000 [Rhododendron simsii]1.9e-5944.22Show/hide
Query:  MDLIQDNDFENFDFDDKDDVMYIFFNLLYIGFHKIQSSGQPCRTSTLKDHDCVIELLNRNDRRCFDCFRMKRATFIRFCEDLKSKTDLKASKYLTVQEKV
        MD I D+     D D +D+ + +  +L    ++ +    +PCRTS L+ HD V+E+LN ++ RC   FRMK   FI FCE LK   +LK S+YLT+QE+V
Subjt:  MDLIQDNDFENFDFDDKDDVMYIFFNLLYIGFHKIQSSGQPCRTSTLKDHDCVIELLNRNDRRCFDCFRMKRATFIRFCEDLKSKTDLKASKYLTVQEKV

Query:  AIFLIIISHNESNRIAAKRFQHSGQTISQAFNLVLRKVCKLGVEIICPPNMDTVPIKIISKSKYYPFFKNLFMFMLIIRDCIGIIDSTHIAACILQNEQI
         IFL+ I HNE NR+  +RFQHSG TIS  F+ VL+ VCKLGV II PP+ D++P +I   +KY+PFFK          DC+G ID THI+A +  ++QI
Subjt:  AIFLIIISHNESNRIAAKRFQHSGQTISQAFNLVLRKVCKLGVEIICPPNMDTVPIKIISKSKYYPFFKNLFMFMLIIRDCIGIIDSTHIAACILQNEQI

Query:  SFHGRKTNTTWNIMCVCSFDMLFTYVKS----------------------------DRYYLVDSRYSNMLGLLAPFRGQRYHLRDFRERRHRPR
         + G+ T TT N+M  CSFDM FTYV S                             +YY+VDS Y+NM G L+P+RG+RYHL  FR    RP+
Subjt:  SFHGRKTNTTWNIMCVCSFDMLFTYVKS----------------------------DRYYLVDSRYSNMLGLLAPFRGQRYHLRDFRERRHRPR

XP_028067161.1 uncharacterized protein LOC114269968 [Camellia sinensis]2.5e-5944.1Show/hide
Query:  NFDFDDKDDVMYIFFNLLYIGFHKIQSSGQPCRTSTLKDHDCVIELLNRNDRRCFDCFRMKRATFIRFCEDLKSKTDLKASKYLTVQEKVAIFLIIISHN
        +FD D  D+ +     +    +       +PCRTS L+ HD V+E+LN ++RR  + FRM+   FI  CE LK    L+ S+YLTVQE+V IFL+ I HN
Subjt:  NFDFDDKDDVMYIFFNLLYIGFHKIQSSGQPCRTSTLKDHDCVIELLNRNDRRCFDCFRMKRATFIRFCEDLKSKTDLKASKYLTVQEKVAIFLIIISHN

Query:  ESNRIAAKRFQHSGQTISQAFNLVLRKVCKLGVEIICPPNMDTVPIKIISKSKYYPFFKNLFMFMLIIRDCIGIIDSTHIAACILQNEQISFHGRKTNTT
        E NR+  +RFQHSGQTIS+ FN VL+ VC+LG ++I PP+ D VP +I    ++YPFFK          DC+G ID THI+A +  +EQI + G+ T TT
Subjt:  ESNRIAAKRFQHSGQTISQAFNLVLRKVCKLGVEIICPPNMDTVPIKIISKSKYYPFFKNLFMFMLIIRDCIGIIDSTHIAACILQNEQISFHGRKTNTT

Query:  WNIMCVCSFDMLFTYV----------------------------KSDRYYLVDSRYSNMLGLLAPFRGQRYHLRDFRERRHRPRGRER
         N+MCVCSFDM FTYV                             +D+YY+VDS Y+NM G L+P+RG+RYHL +FR +R +PR +++
Subjt:  WNIMCVCSFDMLFTYV----------------------------KSDRYYLVDSRYSNMLGLLAPFRGQRYHLRDFRERRHRPRGRER

XP_028094390.1 uncharacterized protein LOC114294454 [Camellia sinensis]2.7e-5843.75Show/hide
Query:  NFDFDDKDDVMYIFFNLLYIGFHKIQSSGQPCRTSTLKDHDCVIELLNRNDRRCFDCFRMKRATFIRFCEDLKSKTDLKASKYLTVQEKVAIFLIIISHN
        +FD D  D+ +     +    +       +PCRTS L+ HD V+E+LN ++RR  + FRM+   FI  CE LK    L+ S+YLTVQE+V IFL+ I HN
Subjt:  NFDFDDKDDVMYIFFNLLYIGFHKIQSSGQPCRTSTLKDHDCVIELLNRNDRRCFDCFRMKRATFIRFCEDLKSKTDLKASKYLTVQEKVAIFLIIISHN

Query:  ESNRIAAKRFQHSGQTISQAFNLVLRKVCKLGVEIICPPNMDTVPIKIISKSKYYPFFKNLFMFMLIIRDCIGIIDSTHIAACILQNEQISFHGRKTNTT
        E NR+  +RFQHSGQTIS+ FN VL+ VC+LG ++I PP+ D VP +I    ++YPFFK          DC+G ID THI+A +  +EQI + G+ T TT
Subjt:  ESNRIAAKRFQHSGQTISQAFNLVLRKVCKLGVEIICPPNMDTVPIKIISKSKYYPFFKNLFMFMLIIRDCIGIIDSTHIAACILQNEQISFHGRKTNTT

Query:  WNIMCVCSFDMLFTYV----------------------------KSDRYYLVDSRYSNMLGLLAPFRGQRYHLRDFRERRHRPRGRER
         N+M VCSFDM FTYV                             +D+YY+VDS Y+NM G L+P+RG+RYHL +FR +R +PR + +
Subjt:  WNIMCVCSFDMLFTYV----------------------------KSDRYYLVDSRYSNMLGLLAPFRGQRYHLRDFRERRHRPRGRER

XP_028100667.1 uncharacterized protein LOC114300013 [Camellia sinensis]2.5e-5944.1Show/hide
Query:  NFDFDDKDDVMYIFFNLLYIGFHKIQSSGQPCRTSTLKDHDCVIELLNRNDRRCFDCFRMKRATFIRFCEDLKSKTDLKASKYLTVQEKVAIFLIIISHN
        +FD D  D+ +     +    +       +PCRTS L+ HD V+E+LN ++RR  + FRM+   FI  CE LK    L+ S+YLTVQE+V IFL+ I HN
Subjt:  NFDFDDKDDVMYIFFNLLYIGFHKIQSSGQPCRTSTLKDHDCVIELLNRNDRRCFDCFRMKRATFIRFCEDLKSKTDLKASKYLTVQEKVAIFLIIISHN

Query:  ESNRIAAKRFQHSGQTISQAFNLVLRKVCKLGVEIICPPNMDTVPIKIISKSKYYPFFKNLFMFMLIIRDCIGIIDSTHIAACILQNEQISFHGRKTNTT
        E NR+  +RFQHSGQTIS+ FN VL+ VC+LG ++I PP+ D VP +I    ++YPFFK          DC+G ID THI+A +  +EQI + G+ T TT
Subjt:  ESNRIAAKRFQHSGQTISQAFNLVLRKVCKLGVEIICPPNMDTVPIKIISKSKYYPFFKNLFMFMLIIRDCIGIIDSTHIAACILQNEQISFHGRKTNTT

Query:  WNIMCVCSFDMLFTYV----------------------------KSDRYYLVDSRYSNMLGLLAPFRGQRYHLRDFRERRHRPRGRER
         N+MCVCSFDM FTYV                             +D+YY+VDS Y+NM G L+P+RG+RYHL +FR +R +PR +++
Subjt:  WNIMCVCSFDMLFTYV----------------------------KSDRYYLVDSRYSNMLGLLAPFRGQRYHLRDFRERRHRPRGRER

TrEMBL top hitse value%identityAlignment
A0A0B2SJL0 Putative nuclease HARBI1 (Fragment)7.5e-4641.8Show/hide
Query:  PCRTSTLKDHDCVIELLNRNDRRCFDCFRMKRATFIRFCEDLKSKTDLKASKYLTVQEKVAIFLIIISHNESNRIAAKRFQHSGQTISQAFNLVLRKVCK
        PCRTS L      I+ L  ++ RC++ F MK+  F+ FCE LK   +L   K ++++E +A+FLIII HN  +R+ A+RFQHS  T+S+ F ++L+ VCK
Subjt:  PCRTSTLKDHDCVIELLNRNDRRCFDCFRMKRATFIRFCEDLKSKTDLKASKYLTVQEKVAIFLIIISHNESNRIAAKRFQHSGQTISQAFNLVLRKVCK

Query:  LGVEIICPPNMDTVPIKIISKSKYYPFFKNLFMFMLIIRDCIGIIDSTHIAACILQNEQISFHGRKTNTTWNIMCVCSFDMLFTYVKS------------
        LG  II   N  +    I    KYYP+FK          DCIG ID  H++A    ++Q +F GRK   T N++ VC FDMLFT+V S            
Subjt:  LGVEIICPPNMDTVPIKIISKSKYYPFFKNLFMFMLIIRDCIGIIDSTHIAACILQNEQISFHGRKTNTTWNIMCVCSFDMLFTYVKS------------

Query:  ---------------DRYYLVDSRYSNMLGLLAPFRGQRYHLRDFRERRHRPRGRE
                       D++YL+DS +SNM G LAPFR  +YHL DFRE   RPRG+E
Subjt:  ---------------DRYYLVDSRYSNMLGLLAPFRGQRYHLRDFRERRHRPRGRE

A0A1S3E695 protein ANTAGONIST OF LIKE HETEROCHROMATIN PROTEIN 1-like4.2e-4940.48Show/hide
Query:  IQDNDFENFDFDDKDDVMYIFFNLLYIGFHKIQSSGQPCRTSTLKDHDCVIELLNRNDRRCFDCFRMKRATFIRFCEDLKSKTDLKASKYLTVQEKVAIF
        +  +D    D  D D V ++  N++Y   HK  + G+   TS+L   + V E+LN ++  CFD FRMK+  F+ FC +L+ K  L  S+ + V+EKVA F
Subjt:  IQDNDFENFDFDDKDDVMYIFFNLLYIGFHKIQSSGQPCRTSTLKDHDCVIELLNRNDRRCFDCFRMKRATFIRFCEDLKSKTDLKASKYLTVQEKVAIF

Query:  LIIISHNESNRIAAKRFQHSGQTISQAFNLVLRKVCKLGVEIICPPNMDTVPIKIISKSKYYPFFKNLFMFMLIIRDCIGIIDSTHIAACILQNEQISFH
        L II HN  +R+A+ RFQHS +TIS+ F  VLR VC+LG E+I   +++ +P +I + SKYYP+FKN          CIG ID THI+A +   +QIS  
Subjt:  LIIISHNESNRIAAKRFQHSGQTISQAFNLVLRKVCKLGVEIICPPNMDTVPIKIISKSKYYPFFKNLFMFMLIIRDCIGIIDSTHIAACILQNEQISFH

Query:  GRKTNTTWNIMCVCSFDMLFTYVKS----------------------------DRYYLVDSRYSNMLGLLAPFRGQRYHLRDFRERRHRPRGRE
        GRKT  T N+MC C F+M+FTYV S                              +YLVDS Y    GLL P+RG+RYH +++R +  +PR  E
Subjt:  GRKTNTTWNIMCVCSFDMLFTYVKS----------------------------DRYYLVDSRYSNMLGLLAPFRGQRYHLRDFRERRHRPRGRE

A0A3Q7Y331 protein ANTAGONIST OF LIKE HETEROCHROMATIN PROTEIN 1-like4.2e-4940.48Show/hide
Query:  IQDNDFENFDFDDKDDVMYIFFNLLYIGFHKIQSSGQPCRTSTLKDHDCVIELLNRNDRRCFDCFRMKRATFIRFCEDLKSKTDLKASKYLTVQEKVAIF
        +  +D    D  D D V ++  N++Y   HK  + G+   TS+L   + V E+LN ++  CFD FRMK+  F+ FC +L+ K  L  S+ + V+EKVA F
Subjt:  IQDNDFENFDFDDKDDVMYIFFNLLYIGFHKIQSSGQPCRTSTLKDHDCVIELLNRNDRRCFDCFRMKRATFIRFCEDLKSKTDLKASKYLTVQEKVAIF

Query:  LIIISHNESNRIAAKRFQHSGQTISQAFNLVLRKVCKLGVEIICPPNMDTVPIKIISKSKYYPFFKNLFMFMLIIRDCIGIIDSTHIAACILQNEQISFH
        L II HN  +R+A+ RFQHS +TIS+ F  VLR VC+LG E+I   +++ +P +I + SKYYP+FKN          CIG ID THI+A +   +QIS  
Subjt:  LIIISHNESNRIAAKRFQHSGQTISQAFNLVLRKVCKLGVEIICPPNMDTVPIKIISKSKYYPFFKNLFMFMLIIRDCIGIIDSTHIAACILQNEQISFH

Query:  GRKTNTTWNIMCVCSFDMLFTYVKS----------------------------DRYYLVDSRYSNMLGLLAPFRGQRYHLRDFRERRHRPRGRE
        GRKT  T N+MC C F+M+FTYV S                              +YLVDS Y    GLL P+RG+RYH +++R +  +PR  E
Subjt:  GRKTNTTWNIMCVCSFDMLFTYVKS----------------------------DRYYLVDSRYSNMLGLLAPFRGQRYHLRDFRERRHRPRGRE

A0A4Y7J673 Uncharacterized protein1.2e-4841.57Show/hide
Query:  PCRTSTLKDHDCVIELLNRNDRRCFDCFRMKRATFIRFCEDLKSKTDLKASKYLTVQEKVAIFLIIISHNESNRIAAKRFQHSGQTISQAFNLVLRKVCK
        P  TS L   + + ELLN + RR ++  RM  +TF+  C  L++   L+  + ++V+E V IFL  +S +  NR+ A+ FQHS +T+ + F  VL+ +C+
Subjt:  PCRTSTLKDHDCVIELLNRNDRRCFDCFRMKRATFIRFCEDLKSKTDLKASKYLTVQEKVAIFLIIISHNESNRIAAKRFQHSGQTISQAFNLVLRKVCK

Query:  LGVEIICPPNMDTVPIKIISKSKYYPFFKNLFMFMLIIRDCIGIIDSTHIAACILQNEQISFHGRKTNTTWNIMCVCSFDMLFTYV--------------
        LG  II PPNMD VP +I++  K+YP+F           DC+G ID THI+AC+  ++QI F GRK   T NIMC CSFDMLFT+V              
Subjt:  LGVEIICPPNMDTVPIKIISKSKYYPFFKNLFMFMLIIRDCIGIIDSTHIAACILQNEQISFHGRKTNTTWNIMCVCSFDMLFTYV--------------

Query:  --------------KSDRYYLVDSRYSNMLGLLAPFRGQRYHLRDFRERRHRPRG
                      +  RYY+VDS Y+NM G L P+RG+RYHLRDFR R  + +G
Subjt:  --------------KSDRYYLVDSRYSNMLGLLAPFRGQRYHLRDFRERRHRPRG

A0A5D3C7F6 Protein ALP1-like4.0e-5566.3Show/hide
Query:  AKRFQHSGQTISQAFNLVLRKVCKLGVEIICPPNMDTVPIKIISKSKYYPFFKNLFMFMLIIRDCIGIIDSTHIAACILQNEQISFHGRKTNTTWNIMCV
        A+RFQHSG TIS AFN VLRKVCKLG EII PPNMDTV  KI+S SKYYPFFK          DCIG ID TH+AA I QNEQI F GRKTNTTWNIMCV
Subjt:  AKRFQHSGQTISQAFNLVLRKVCKLGVEIICPPNMDTVPIKIISKSKYYPFFKNLFMFMLIIRDCIGIIDSTHIAACILQNEQISFHGRKTNTTWNIMCV

Query:  CSFDMLFTYV----------------------------KSDRYYLVDSRYSNMLGLLAPFRGQRYHLRDFRERRHRPRGRE
        CSFDMLFTYV                            K D+YYLV+S YSNM G LAPFRGQRYHLRDFRERRHRPRGRE
Subjt:  CSFDMLFTYV----------------------------KSDRYYLVDSRYSNMLGLLAPFRGQRYHLRDFRERRHRPRGRE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G43722.1 unknown protein6.3e-2128.11Show/hide
Query:  LNRNDRRCFDCFRMKRATFIRFCEDLKSKTDLKASKYLTVQEKVAIFLIIISHNESNRIAAKRFQHSGQTISQAFNLVLRKVCKLGVEIICPP---NMDT
        L ++   C    RM    F   C  L++  DL+ +  ++++E VA+FL I  HNE  R    RF  + +T+ + F  VL     L  + I  P    +  
Subjt:  LNRNDRRCFDCFRMKRATFIRFCEDLKSKTDLKASKYLTVQEKVAIFLIIISHNESNRIAAKRFQHSGQTISQAFNLVLRKVCKLGVEIICPP---NMDT

Query:  VPIKIISKSKYYPFFKNLFMFMLIIRDCIGIIDSTHIAACILQNEQISFHGRKTNTTWNIMCVCSFDMLFTYV---------------------------
        +P ++    +Y+P+F             +G +D TH+   +  + Q  +  R  N + NIM +C   MLFTY+                           
Subjt:  VPIKIISKSKYYPFFKNLFMFMLIIRDCIGIIDSTHIAACILQNEQISFHGRKTNTTWNIMCVCSFDMLFTYV---------------------------

Query:  KSDRYYLVDSRYSNMLGLLAPFRGQ-----RYHLRDFRERRHRPRGRER
         S++YYLVDS Y N  GLLAP+R       RYH+  F    + PR R +
Subjt:  KSDRYYLVDSRYSNMLGLLAPFRGQ-----RYHLRDFRERRHRPRGRER

AT5G28730.1 unknown protein9.1e-2030.99Show/hide
Query:  DCVIELLNRNDRRCFDCFRMKRATFIRFCEDLKSKTDLKASKYLTVQEKVAIFLIIISHNESNRIAAKRFQHSGQTISQAFNLVLRKVCKLGVEIICPPN
        +C+   +  N+  C    RM    F + CE L  K  L++S  +++ E VAIFLII + N++ R  A RF H+ +TI + F+ VL+ + +L VE I P  
Subjt:  DCVIELLNRNDRRCFDCFRMKRATFIRFCEDLKSKTDLKASKYLTVQEKVAIFLIIISHNESNRIAAKRFQHSGQTISQAFNLVLRKVCKLGVEIICPPN

Query:  MD---TVPIKIISKSKYYPFFKNLFMFMLIIRDCIGIIDSTHIAACILQN-EQISFHGR--KTNTTWNIMCVCSFDMLFTYVKSDRYYLVDSRYSNMLGL
        ++    +  ++   ++Y+PF          + D +GI     +A C L       F G    T+    +    S D LF      +YYLVDS Y+N  G 
Subjt:  MD---TVPIKIISKSKYYPFFKNLFMFMLIIRDCIGIIDSTHIAACILQN-EQISFHGR--KTNTTWNIMCVCSFDMLFTYVKSDRYYLVDSRYSNMLGL

Query:  LAPFRGQRYHLRD
        LAP+R +    +D
Subjt:  LAPFRGQRYHLRD

AT5G28950.1 unknown protein4.2e-0940Show/hide
Query:  VPIKIISKSKYYPFFKNLFMFMLIIRDCIGIIDSTHIAACILQNEQISFHGRKTNTTWNIMCVCSFDMLFTYVKS
        VP KI   ++ YP+FK          DC+G ID THI A + Q +  SF  RK + + N++  C+FD+ F YV S
Subjt:  VPIKIISKSKYYPFFKNLFMFMLIIRDCIGIIDSTHIAACILQNEQISFHGRKTNTTWNIMCVCSFDMLFTYVKS

AT5G35695.1 CONTAINS InterPro DOMAIN/s: Putative harbinger transposase-derived nuclease (InterPro:IPR006912)3.1e-0452.78Show/hide
Query:  RYYLVDSRYSNMLGLLAPFRGQRYHLRDFRERRHRP
        ++YLVD  ++N L  LAPFRG RYHL++F  +R  P
Subjt:  RYYLVDSRYSNMLGLLAPFRGQRYHLRDFRERRHRP

AT5G41980.1 CONTAINS InterPro DOMAIN/s: Putative harbinger transposase-derived nuclease (InterPro:IPR006912)1.8e-2328.81Show/hide
Query:  VIELLNRNDRRCFDCFRMKRATFIRFCEDLKSKTDLKASKYLTVQEKVAIFLIIISHNESNRIAAKRFQHSGQTISQAFNLVLRKVCKLGVEIICP-PNM
        V ++LN  + +CF+ FRM +  F + C+ L+++  L+ +  + ++ ++AIFL II HN   R   + F +SG+TIS+ FN VL  V  +  +   P  N 
Subjt:  VIELLNRNDRRCFDCFRMKRATFIRFCEDLKSKTDLKASKYLTVQEKVAIFLIIISHNESNRIAAKRFQHSGQTISQAFNLVLRKVCKLGVEIICP-PNM

Query:  DTVPIKIISKSKYYPFFKNLFMFMLIIRDCIGIIDSTHIAACILQNEQISFHGRKTNTTWNIMCVCSFDMLFTY--------------------------
        DT+           P+FK          DC+G++DS HI   +  +EQ  F       T N++   SFD+ F Y                          
Subjt:  DTVPIKIISKSKYYPFFKNLFMFMLIIRDCIGIIDSTHIAACILQNEQISFHGRKTNTTWNIMCVCSFDMLFTY--------------------------

Query:  VKSDRYYLVDSRYSNMLGLLAPFRG----QRYHLRDFRERRHR
        V   +YY+VD++Y N+ G +AP+ G     R   ++    RH+
Subjt:  VKSDRYYLVDSRYSNMLGLLAPFRG----QRYHLRDFRERRHR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATTTGATTCAAGACAATGACTTTGAGAATTTTGATTTTGACGACAAAGATGATGTCATGTACATCTTTTTCAATTTATTGTACATTGGCTTTCACAAGATACAATC
TTCTGGACAACCATGTAGAACCTCTACACTAAAAGATCATGACTGTGTGATTGAGTTGTTAAATAGAAATGATAGAAGATGTTTTGATTGTTTTAGGATGAAAAGAGCAA
CATTCATAAGATTTTGTGAAGATTTAAAATCTAAGACAGATTTGAAAGCATCTAAGTATCTCACTGTTCAAGAAAAAGTTGCTATATTTTTAATAATCATATCACATAAT
GAAAGCAATCGTATAGCAGCAAAAAGGTTTCAACATTCAGGCCAAACCATTTCTCAAGCTTTTAACCTTGTTTTGAGGAAGGTTTGTAAGCTTGGAGTAGAAATTATTTG
CCCACCCAACATGGACACTGTACCAATAAAGATCATATCAAAATCGAAATATTACCCTTTCTTTAAGAATTTATTTATGTTTATGCTTATTATTCGGGATTGTATTGGTA
TTATTGATAGTACTCATATTGCTGCATGTATTCTCCAAAACGAACAAATATCGTTTCATGGAAGAAAAACTAACACAACGTGGAATATAATGTGTGTTTGTTCATTTGAT
ATGTTGTTCACGTATGTCAAATCTGACCGGTACTATCTTGTTGATTCAAGATATTCAAATATGCTTGGACTTTTGGCACCATTTCGCGGTCAAAGATATCATTTACGAGA
TTTTAGAGAAAGGAGACATCGCCCTCGAGGTAGAGAAAGAAGTGTTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGATTTGATTCAAGACAATGACTTTGAGAATTTTGATTTTGACGACAAAGATGATGTCATGTACATCTTTTTCAATTTATTGTACATTGGCTTTCACAAGATACAATC
TTCTGGACAACCATGTAGAACCTCTACACTAAAAGATCATGACTGTGTGATTGAGTTGTTAAATAGAAATGATAGAAGATGTTTTGATTGTTTTAGGATGAAAAGAGCAA
CATTCATAAGATTTTGTGAAGATTTAAAATCTAAGACAGATTTGAAAGCATCTAAGTATCTCACTGTTCAAGAAAAAGTTGCTATATTTTTAATAATCATATCACATAAT
GAAAGCAATCGTATAGCAGCAAAAAGGTTTCAACATTCAGGCCAAACCATTTCTCAAGCTTTTAACCTTGTTTTGAGGAAGGTTTGTAAGCTTGGAGTAGAAATTATTTG
CCCACCCAACATGGACACTGTACCAATAAAGATCATATCAAAATCGAAATATTACCCTTTCTTTAAGAATTTATTTATGTTTATGCTTATTATTCGGGATTGTATTGGTA
TTATTGATAGTACTCATATTGCTGCATGTATTCTCCAAAACGAACAAATATCGTTTCATGGAAGAAAAACTAACACAACGTGGAATATAATGTGTGTTTGTTCATTTGAT
ATGTTGTTCACGTATGTCAAATCTGACCGGTACTATCTTGTTGATTCAAGATATTCAAATATGCTTGGACTTTTGGCACCATTTCGCGGTCAAAGATATCATTTACGAGA
TTTTAGAGAAAGGAGACATCGCCCTCGAGGTAGAGAAAGAAGTGTTTAA
Protein sequenceShow/hide protein sequence
MDLIQDNDFENFDFDDKDDVMYIFFNLLYIGFHKIQSSGQPCRTSTLKDHDCVIELLNRNDRRCFDCFRMKRATFIRFCEDLKSKTDLKASKYLTVQEKVAIFLIIISHN
ESNRIAAKRFQHSGQTISQAFNLVLRKVCKLGVEIICPPNMDTVPIKIISKSKYYPFFKNLFMFMLIIRDCIGIIDSTHIAACILQNEQISFHGRKTNTTWNIMCVCSFD
MLFTYVKSDRYYLVDSRYSNMLGLLAPFRGQRYHLRDFRERRHRPRGRERSV