; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi09G014580 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi09G014580
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionButirosin biosynthesis
Genome locationchr09:22562332..22576707
RNA-Seq ExpressionLsi09G014580
SyntenyLsi09G014580
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7026329.1 hypothetical protein SDJN02_12830, partial [Cucurbita argyrosperma subsp. argyrosperma]8.3e-10871.99Show/hide
Query:  IPIRSTFQFQLAFPF---SLCIAYSPLCSGSGSG------LLLSSNLRRISPINA-----LSNSMVNEAKPKELRDESDFEAIFSDSDYISVCGFGSLLS
        IP+R   Q  L F F   S CI +S LCSGSGSG      L+  SNLR + PINA     +SNS+VN+ +PKELRDESDFE++FS+  YISVCGFGSLLS
Subjt:  IPIRSTFQFQLAFPF---SLCIAYSPLCSGSGSG------LLLSSNLRRISPINA-----LSNSMVNEAKPKELRDESDFEAIFSDSDYISVCGFGSLLS

Query:  ERSARSTFPDLINFRVARLNGFRRLFGNVAPVFFERGIAKPETKCLTTCYSSEYQEISSLCAEPCEGENIIVTVFEIKKSE---------------VFPE
        ERSARSTFP+LINFRVARLNGFRRLFGNVAPVFFERGIAKPETK           EISSLCAEPCEGE IIVTVFEIKKSE               V PE
Subjt:  ERSARSTFPDLINFRVARLNGFRRLFGNVAPVFFERGIAKPETKCLTTCYSSEYQEISSLCAEPCEGENIIVTVFEIKKSE---------------VFPE

Query:  TLDGKLYHKPAVLCSRSTDEEFFQGIRTFFFIIMAVIILIRYGEMISYLVASIFDTAAKNLGDKAYNNFLDHTFLGDRSTTIREYLTGNGSGIMEEEPPE
        TLDGKLY KP+VLCSRSTDEEFF+ ++TF F IM VIILIR GEMISY VASIFDTAAKNLGD AYNNFLDHTFLGDR TTIREYL  NGSGIMEEEPPE
Subjt:  TLDGKLYHKPAVLCSRSTDEEFFQGIRTFFFIIMAVIILIRYGEMISYLVASIFDTAAKNLGDKAYNNFLDHTFLGDRSTTIREYLTGNGSGIMEEEPPE

Query:  SLKFRYG
        SLKFRYG
Subjt:  SLKFRYG

XP_004150088.1 uncharacterized protein LOC101211371 [Cucumis sativus]4.7e-10372.17Show/hide
Query:  GHSASHRIPIRSTFQFQLAFPFSLCIAYSPLCSGSGSGLLLSSNLRRISPINALSNSMVNEAKPKELRDESDFEAIFSDSDYISVCGFGSLLSERSARST
        G SAS+RIPI S  QFQL FPFS  IA+SPLCSGSG        L+RISPIN LSNS+VNEA+PKELRDESDFEAIFSDSDYISVCGFGSLLSERSARST
Subjt:  GHSASHRIPIRSTFQFQLAFPFSLCIAYSPLCSGSGSGLLLSSNLRRISPINALSNSMVNEAKPKELRDESDFEAIFSDSDYISVCGFGSLLSERSARST

Query:  FPDLINFRVARLNGFRRLFGNVAPVFFERGIAKPETKCLTTCYSSEYQEISSLCAEPCEGENIIVTVFEIKKSE---------------VFPETLDGKLY
        FPDLINFRVARLNGFRR+FGNVAPVFFERGIAKPETK           EISSLCAEPCEGENIIVTVFEIKKSE               VFPETL GK Y
Subjt:  FPDLINFRVARLNGFRRLFGNVAPVFFERGIAKPETKCLTTCYSSEYQEISSLCAEPCEGENIIVTVFEIKKSE---------------VFPETLDGKLY

Query:  HKPAVLCSRSTDEEFFQ----GIRTFFFIIMA--VIILIRYGEMIS---YLVASIFDTAAKNLGDKAYNNFLDHTFLGDRSTTIREYLTGNGSGIMEEEP
         KPAVLCSRSTDEEFFQ    G +  FF       I  I   ++     YL   I   AAKNLGDKAYNNFLDHTFLGDRSTTIREYLTG G GIMEEEP
Subjt:  HKPAVLCSRSTDEEFFQ----GIRTFFFIIMA--VIILIRYGEMIS---YLVASIFDTAAKNLGDKAYNNFLDHTFLGDRSTTIREYLTGNGSGIMEEEP

Query:  PESLKFRYG
        PESLKFRYG
Subjt:  PESLKFRYG

XP_008458400.1 PREDICTED: uncharacterized protein LOC103497824 [Cucumis melo]1.2e-10170.19Show/hide
Query:  TANGHSASHRIPIRSTFQFQLAFPFSLCIAYSPLCSGSGSGLLLSSNLRRISPINALSNSMVNEAKPKELRDESDFEAIFSDSDYISVCGFGSLLSERSA
        T+ G SAS+RIPI S FQFQL FPFS   A+SPLCSGSG        LRRISPIN  S S+V+EA+PKELRDESDFEA+F DSDYISVCGFGSLLSERSA
Subjt:  TANGHSASHRIPIRSTFQFQLAFPFSLCIAYSPLCSGSGSGLLLSSNLRRISPINALSNSMVNEAKPKELRDESDFEAIFSDSDYISVCGFGSLLSERSA

Query:  RSTFPDLINFRVARLNGFRRLFGNVAPVFFERGIAKPETKCLTTCYSSEYQEISSLCAEPCEGENIIVTVFEIKKSE---------------VFPETLDG
        RSTFPDLINFR+ARLNGFRR+FGNVAPVFFERGIAKPETK           EISSLCAEPCEGENII+TVFEIKKSE               VFPETLDG
Subjt:  RSTFPDLINFRVARLNGFRRLFGNVAPVFFERGIAKPETKCLTTCYSSEYQEISSLCAEPCEGENIIVTVFEIKKSE---------------VFPETLDG

Query:  KLYHKPAVLCSRSTDEEFFQ----GIRTFFFIIMA--VIILIRYGEMIS---YLVASIFDTAAKNLGDKAYNNFLDHTFLGDRSTTIREYLTGNGSGIME
        K Y KPAVLCSRSTDEEFFQ    G +  FF       I  I   ++     YL   I   AAKNLGD+AYNNFLDHTFLGDRSTTIREYLTG+GSGIME
Subjt:  KLYHKPAVLCSRSTDEEFFQ----GIRTFFFIIMA--VIILIRYGEMIS---YLVASIFDTAAKNLGDKAYNNFLDHTFLGDRSTTIREYLTGNGSGIME

Query:  EEPPESLKFRYG
        EEPPESLKFRYG
Subjt:  EEPPESLKFRYG

XP_023548382.1 uncharacterized protein LOC111807043 isoform X1 [Cucurbita pepo subsp. pepo]1.4e-9967.39Show/hide
Query:  GHSASHRIPIRSTFQFQLAFPFSLCIAYSPLCSGSGSGLLLSSNLRRISPINA-----LSNSMVNEAKPKELRDESDFEAIFSDSDYISVCGFGSLLSER
        GHSAS RI IRS FQ Q+A PFS C A+SPLC  SGS L+L S LR ISPINA     +SNSMVNEA+PKELRDESDFEAIFS  D+ SVCGFGSLLSER
Subjt:  GHSASHRIPIRSTFQFQLAFPFSLCIAYSPLCSGSGSGLLLSSNLRRISPINA-----LSNSMVNEAKPKELRDESDFEAIFSDSDYISVCGFGSLLSER

Query:  SARSTFPDLINFRVARLNGFRRLFGNVAPVFFERGIAKPETKCLTTCYSSEYQEISSLCAEPCEGENIIVTVFEIKKSE---------------VFPETL
        SARSTFP+LINFRVARLNGFRR+FGNVAP+FFERGIAKPETK           EISSLCAEPCEGE IIVTVFEIKKSE               V PETL
Subjt:  SARSTFPDLINFRVARLNGFRRLFGNVAPVFFERGIAKPETKCLTTCYSSEYQEISSLCAEPCEGENIIVTVFEIKKSE---------------VFPETL

Query:  DGKLYHKPAVLCSRSTDEEFFQ----GIRTFFFIIMAVIILIRYGE-------------MISYLVASIFDTAAKNLGDKAYNNFLDHTFLGDRSTTIREY
        DGKLY KPAVLCSRSTDEEFFQ    G +  F        L  YG                 YL   I   AAKNLG+ AYNNFLDHTFLGDRSTTIREY
Subjt:  DGKLYHKPAVLCSRSTDEEFFQ----GIRTFFFIIMAVIILIRYGE-------------MISYLVASIFDTAAKNLGDKAYNNFLDHTFLGDRSTTIREY

Query:  LTGNGSGIMEEEPPESLKFRYG
        L  +GSGIMEEEPPESLKFRYG
Subjt:  LTGNGSGIMEEEPPESLKFRYG

XP_038876042.1 uncharacterized protein LOC120068372 isoform X1 [Benincasa hispida]6.0e-10669.88Show/hide
Query:  GHSASHRIPIRST----FQFQLAFPFSLCIAYSPLCSGSGSGLLLSSNLRRISPINALSNSMVNEAKPKELRDESDFEAIFSDSDYISVCGFGSLLSERS
        GHSAS+RIPIRST    FQFQL FPFS  IA+SP+C  SGSGLL  SNLRRISPINA  NS++NE  P ELRDESDFEAIFSDSDYISVCGFGSLLSERS
Subjt:  GHSASHRIPIRST----FQFQLAFPFSLCIAYSPLCSGSGSGLLLSSNLRRISPINALSNSMVNEAKPKELRDESDFEAIFSDSDYISVCGFGSLLSERS

Query:  ARSTFPDLINFRVARLNGFRRLFGNVAPVFFERGIAKPETKCLTTCYSSEYQEISSLCAEPCEGENIIVTVFEIKKSE---------------VFPETLD
        ARSTFPDLINFRVARLNGFRRLFGNVAPVFFERGIAKPETK           EISSLCAEPCEGENII+TVFEIKKSE               V PETLD
Subjt:  ARSTFPDLINFRVARLNGFRRLFGNVAPVFFERGIAKPETKCLTTCYSSEYQEISSLCAEPCEGENIIVTVFEIKKSE---------------VFPETLD

Query:  GKLYHKPAVLCSRSTDEEFFQ----GIRTFFF--------------IIMAVIILIRYGEMISYLVASIFDTAAKNLGDKAYNNFLDHTFLGDRSTTIREY
        GKLY KPAVLCSRSTDEEFFQ    G +  FF               I+   I +R+  +           AAKNLGD AYNNFLDHTFLGDRSTTIREY
Subjt:  GKLYHKPAVLCSRSTDEEFFQ----GIRTFFF--------------IIMAVIILIRYGEMISYLVASIFDTAAKNLGDKAYNNFLDHTFLGDRSTTIREY

Query:  LTGNGSGIMEEEPPESLKFRYG
        L  NGSGIMEEEPPESLKFRYG
Subjt:  LTGNGSGIMEEEPPESLKFRYG

TrEMBL top hitse value%identityAlignment
A0A0A0KD12 Uncharacterized protein2.3e-10372.17Show/hide
Query:  GHSASHRIPIRSTFQFQLAFPFSLCIAYSPLCSGSGSGLLLSSNLRRISPINALSNSMVNEAKPKELRDESDFEAIFSDSDYISVCGFGSLLSERSARST
        G SAS+RIPI S  QFQL FPFS  IA+SPLCSGSG        L+RISPIN LSNS+VNEA+PKELRDESDFEAIFSDSDYISVCGFGSLLSERSARST
Subjt:  GHSASHRIPIRSTFQFQLAFPFSLCIAYSPLCSGSGSGLLLSSNLRRISPINALSNSMVNEAKPKELRDESDFEAIFSDSDYISVCGFGSLLSERSARST

Query:  FPDLINFRVARLNGFRRLFGNVAPVFFERGIAKPETKCLTTCYSSEYQEISSLCAEPCEGENIIVTVFEIKKSE---------------VFPETLDGKLY
        FPDLINFRVARLNGFRR+FGNVAPVFFERGIAKPETK           EISSLCAEPCEGENIIVTVFEIKKSE               VFPETL GK Y
Subjt:  FPDLINFRVARLNGFRRLFGNVAPVFFERGIAKPETKCLTTCYSSEYQEISSLCAEPCEGENIIVTVFEIKKSE---------------VFPETLDGKLY

Query:  HKPAVLCSRSTDEEFFQ----GIRTFFFIIMA--VIILIRYGEMIS---YLVASIFDTAAKNLGDKAYNNFLDHTFLGDRSTTIREYLTGNGSGIMEEEP
         KPAVLCSRSTDEEFFQ    G +  FF       I  I   ++     YL   I   AAKNLGDKAYNNFLDHTFLGDRSTTIREYLTG G GIMEEEP
Subjt:  HKPAVLCSRSTDEEFFQ----GIRTFFFIIMA--VIILIRYGEMIS---YLVASIFDTAAKNLGDKAYNNFLDHTFLGDRSTTIREYLTGNGSGIMEEEP

Query:  PESLKFRYG
        PESLKFRYG
Subjt:  PESLKFRYG

A0A1S3C8D5 uncharacterized protein LOC1034978245.6e-10270.19Show/hide
Query:  TANGHSASHRIPIRSTFQFQLAFPFSLCIAYSPLCSGSGSGLLLSSNLRRISPINALSNSMVNEAKPKELRDESDFEAIFSDSDYISVCGFGSLLSERSA
        T+ G SAS+RIPI S FQFQL FPFS   A+SPLCSGSG        LRRISPIN  S S+V+EA+PKELRDESDFEA+F DSDYISVCGFGSLLSERSA
Subjt:  TANGHSASHRIPIRSTFQFQLAFPFSLCIAYSPLCSGSGSGLLLSSNLRRISPINALSNSMVNEAKPKELRDESDFEAIFSDSDYISVCGFGSLLSERSA

Query:  RSTFPDLINFRVARLNGFRRLFGNVAPVFFERGIAKPETKCLTTCYSSEYQEISSLCAEPCEGENIIVTVFEIKKSE---------------VFPETLDG
        RSTFPDLINFR+ARLNGFRR+FGNVAPVFFERGIAKPETK           EISSLCAEPCEGENII+TVFEIKKSE               VFPETLDG
Subjt:  RSTFPDLINFRVARLNGFRRLFGNVAPVFFERGIAKPETKCLTTCYSSEYQEISSLCAEPCEGENIIVTVFEIKKSE---------------VFPETLDG

Query:  KLYHKPAVLCSRSTDEEFFQ----GIRTFFFIIMA--VIILIRYGEMIS---YLVASIFDTAAKNLGDKAYNNFLDHTFLGDRSTTIREYLTGNGSGIME
        K Y KPAVLCSRSTDEEFFQ    G +  FF       I  I   ++     YL   I   AAKNLGD+AYNNFLDHTFLGDRSTTIREYLTG+GSGIME
Subjt:  KLYHKPAVLCSRSTDEEFFQ----GIRTFFFIIMA--VIILIRYGEMIS---YLVASIFDTAAKNLGDKAYNNFLDHTFLGDRSTTIREYLTGNGSGIME

Query:  EEPPESLKFRYG
        EEPPESLKFRYG
Subjt:  EEPPESLKFRYG

A0A5D3BTC8 Uncharacterized protein2.8e-9366.13Show/hide
Query:  TANGHSASHRIPIRSTFQFQLAFPFSLCIAYSPLCSGSGSGLLLSSNLRRISPINALSNSMVNEAKPKELRDESDFEAIFSDSDYISVCGFGSLLSERSA
        T+ G SAS+RIPI S FQFQL FPFS   A+SPLCSGSG        LRRISPIN  S S+V+EA+PKELRDESDFEA+F DSDYISVCGFGSLLSERSA
Subjt:  TANGHSASHRIPIRSTFQFQLAFPFSLCIAYSPLCSGSGSGLLLSSNLRRISPINALSNSMVNEAKPKELRDESDFEAIFSDSDYISVCGFGSLLSERSA

Query:  RSTFPDLINFRVARLNGFRRLFGNVAPVFFERGIAKPETKCLTTCYSSEYQEISSLCAEPCEGENIIVTVFEIKKSEVFPETLDGKLYHK----------
        RSTFPDLINFR+ARLNGFRR+FGNVAPVFFERGIAKPETK           EISSLCAEPCEGENII+TVFEIKKSE+ P  +  ++  +          
Subjt:  RSTFPDLINFRVARLNGFRRLFGNVAPVFFERGIAKPETKCLTTCYSSEYQEISSLCAEPCEGENIIVTVFEIKKSEVFPETLDGKLYHK----------

Query:  ------PAVLCSRSTDEEFFQ----GIRTFFFIIMA--VIILIRYGEMIS---YLVASIFDTAAKNLGDKAYNNFLDHTFLGDRSTTIREYLTGNGSGIM
               +VLCSRSTDEEFFQ    G +  FF       I  I   ++     YL   I   AAKNLGD+AYNNFLDHTFLGDRSTTIREYLTG+GSGIM
Subjt:  ------PAVLCSRSTDEEFFQ----GIRTFFFIIMA--VIILIRYGEMIS---YLVASIFDTAAKNLGDKAYNNFLDHTFLGDRSTTIREYLTGNGSGIM

Query:  EEEPPESLKFRYG
        EEEPPESLKFRYG
Subjt:  EEEPPESLKFRYG

A0A6J1H5P5 uncharacterized protein LOC111460273 isoform X21.6e-9366.56Show/hide
Query:  GHSASHRIPIRSTFQFQLAFPFSLCIAYSPLCSGSGSGLLLSSNLRRISPINA-----LSNSMVNEAKPKELRDESDFEAIFSDSDYISVCGFGSLLSER
        GHSAS RIP+RS FQ Q+A PFS C A+SPLC  SGS L+L S LR ISPINA     +SNS+VNEA+PKELRDESDFEAIFS  D+ISVCGFGSLLSER
Subjt:  GHSASHRIPIRSTFQFQLAFPFSLCIAYSPLCSGSGSGLLLSSNLRRISPINA-----LSNSMVNEAKPKELRDESDFEAIFSDSDYISVCGFGSLLSER

Query:  SARSTFPDLINFRVARLNGFRRLFGNVAPVFFERGIAKPETKCLTTCYSSEYQEISSLCAEPCEGENIIVTVFEIKKSEVFPETLDGKL-YHKPAVLCSR
        SARSTFP+LINFRVARLNGFRR+FGNVAP+FFERGIAKPETK           EISSLCAEPCEGE IIVTVFEIKKSE+ P  +  ++ +   AVLC R
Subjt:  SARSTFPDLINFRVARLNGFRRLFGNVAPVFFERGIAKPETKCLTTCYSSEYQEISSLCAEPCEGENIIVTVFEIKKSEVFPETLDGKL-YHKPAVLCSR

Query:  STDEEFFQ----GIRTFFFIIMAVIILIRYGE-------------MISYLVASIFDTAAKNLGDKAYNNFLDHTFLGDRSTTIREYLTGNGSGIMEEEPP
        STDEEFFQ    G    F        L  YG                 YL   I   AAKNLG+ AYNNFLDHTFLGDRSTTIREYL  +GSGIMEEEPP
Subjt:  STDEEFFQ----GIRTFFFIIMAVIILIRYGE-------------MISYLVASIFDTAAKNLGDKAYNNFLDHTFLGDRSTTIREYLTGNGSGIMEEEPP

Query:  ESLKFRYG
        ESLKFRYG
Subjt:  ESLKFRYG

A0A6J1H7C8 uncharacterized protein LOC111460273 isoform X16.9e-10067.08Show/hide
Query:  GHSASHRIPIRSTFQFQLAFPFSLCIAYSPLCSGSGSGLLLSSNLRRISPINA-----LSNSMVNEAKPKELRDESDFEAIFSDSDYISVCGFGSLLSER
        GHSAS RIP+RS FQ Q+A PFS C A+SPLC  SGS L+L S LR ISPINA     +SNS+VNEA+PKELRDESDFEAIFS  D+ISVCGFGSLLSER
Subjt:  GHSASHRIPIRSTFQFQLAFPFSLCIAYSPLCSGSGSGLLLSSNLRRISPINA-----LSNSMVNEAKPKELRDESDFEAIFSDSDYISVCGFGSLLSER

Query:  SARSTFPDLINFRVARLNGFRRLFGNVAPVFFERGIAKPETKCLTTCYSSEYQEISSLCAEPCEGENIIVTVFEIKKSE---------------VFPETL
        SARSTFP+LINFRVARLNGFRR+FGNVAP+FFERGIAKPETK           EISSLCAEPCEGE IIVTVFEIKKSE               V PETL
Subjt:  SARSTFPDLINFRVARLNGFRRLFGNVAPVFFERGIAKPETKCLTTCYSSEYQEISSLCAEPCEGENIIVTVFEIKKSE---------------VFPETL

Query:  DGKLYHKPAVLCSRSTDEEFFQ----GIRTFFFIIMAVIILIRYGE-------------MISYLVASIFDTAAKNLGDKAYNNFLDHTFLGDRSTTIREY
        DGKLY KPAVLC RSTDEEFFQ    G    F        L  YG                 YL   I   AAKNLG+ AYNNFLDHTFLGDRSTTIREY
Subjt:  DGKLYHKPAVLCSRSTDEEFFQ----GIRTFFFIIMAVIILIRYGE-------------MISYLVASIFDTAAKNLGDKAYNNFLDHTFLGDRSTTIREY

Query:  LTGNGSGIMEEEPPESLKFRYG
        L  +GSGIMEEEPPESLKFRYG
Subjt:  LTGNGSGIMEEEPPESLKFRYG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G16060.1 unknown protein3.2e-6552.2Show/hide
Query:  AFPFSLCIAYSPLCSGSG---SGLLLSSNLRRIS---PINALSNSMVNEAKPKELRDESDFEAIFSDSDYISVCGFGSLLSERSARSTFPDLINFRVARL
        + PFS   + +P  S +    S + LSS+ RR      I A+S  M       EL DESDFE + S  + IS+ GFGSLLSERSARSTFPDL NFR+A+L
Subjt:  AFPFSLCIAYSPLCSGSG---SGLLLSSNLRRIS---PINALSNSMVNEAKPKELRDESDFEAIFSDSDYISVCGFGSLLSERSARSTFPDLINFRVARL

Query:  NGFRRLFGNVAPVFFERGIAKPETKCLTTCYSSEYQEISSLCAEPCEGENIIVTVFEIKKSE---------------VFPETLDGKLYHKPAVLCSRSTD
         GFRR+F + AP+FFERGIA PETK           EISSL  EPCEGE+++VTVFEIK SE               V PETL+GK Y   AVLC R +D
Subjt:  NGFRRLFGNVAPVFFERGIAKPETKCLTTCYSSEYQEISSLCAEPCEGENIIVTVFEIKKSE---------------VFPETLDGKLYHKPAVLCSRSTD

Query:  EEFFQ-------GIRTFFFIIMAVIILIRYGEMISYLVASIFDTAAKNLGDKAYNNFLDHTFLGDRSTTIREYLTGNGSGIMEEEPPESLKFRYG
        EEFFQ       GI    +    +  + R   +   L       AAKNLGD+AYNNFLDHTFLGDR TTIREYL+  GSGIMEEEPPE+LK RYG
Subjt:  EEFFQ-------GIRTFFFIIMAVIILIRYGEMISYLVASIFDTAAKNLGDKAYNNFLDHTFLGDRSTTIREYLTGNGSGIMEEEPPESLKFRYG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAACAATCGTCCCGCTCTCCAATTCACTCGTCTCAAAGCCATTCCCCTTCGCATCATCCCAATCCGAATTAAGTTCGAGATGAGACCGATTTTGATTGAAGCAGAACC
GAAGGAACTGCGAGATGAGTCCGATTTTGAAGCAATCTTCTCCGACGGCGATTACATTTCCGTTTGTGGCTTTGGATCTCTCCTCTCTGAGAGGAGTGCGCGAAGTACTT
TTCCTGATCTGATCAACTTTAGAGTTGCAAGATTGAAAGGATTCAGACGCGTTTTCGGAATTATAGCTCCTATATTCTTCGAGCACGGCATTGCTAAATCTGAAACCAAG
GAGATTTCAAGCTTATTTGCGGAGCCTTGTGAAGGAGAAACTATCATCGTTACGGTTTTCGAGATTAAGAAGTCTGAGGTTCTTCCTCAAACATTAGATGGAAAGCTATA
TCATAAACCAGCGCTGCAAAAAAGTTGGGATAACAACTTTCTGGATCACACTTTTCTTGGAGATCGTAGTACAATCATTCGTGAATATTTGACTGCTAATGGGCATTCCG
CTTCGCATCGCATCCCAATCCGCTCTACGTTTCAGTTTCAGCTTGCGTTCCCTTTCTCTCTATGTATTGCGTACTCTCCTCTTTGCTCCGGCTCCGGCTCCGGCCTGCTT
CTTTCAAGTAACCTCCGGCGAATATCTCCGATCAACGCGTTGTCGAACTCCATGGTGAATGAAGCAAAACCAAAGGAACTGCGAGATGAGTCCGATTTTGAAGCTATCTT
CTCCGACAGCGATTATATTTCCGTTTGTGGCTTTGGTTCTCTCCTCTCTGAGAGGAGTGCGCGAAGTACCTTTCCTGATCTGATCAACTTTAGAGTGGCGCGATTGAACG
GCTTCAGACGCCTTTTCGGAAATGTAGCTCCTGTATTCTTTGAGCGCGGCATTGCTAAACCTGAAACCAAGTGTTTAACGACGTGCTATTCATCTGAATATCAGGAGATA
TCCAGTTTGTGTGCGGAGCCTTGCGAAGGAGAAAATATCATAGTTACGGTTTTCGAGATTAAGAAGTCTGAGGTTTTTCCTGAAACATTAGATGGAAAGCTATACCATAA
ACCAGCGGTGCTTTGTTCTCGATCCACTGATGAGGAGTTTTTCCAAGGAATAAGGACATTTTTTTTCATCATTATGGCCGTCATAATATTGATAAGATATGGAGAGATGA
TATCTTACCTTGTCGCGTCTATCTTCGACACTGCTGCAAAAAACCTGGGTGACAAAGCTTATAACAACTTTCTGGATCACACTTTCCTTGGAGATCGTAGTACAACCATC
CGTGAATATTTGACCGGTAATGGTTCAGGCATTATGGAAGAGGAGCCTCCAGAATCCCTCAAGTTTCGATATGGCGTTATTAGAGGAACTTCGGGCAACTCCTCCGTGGG
ATTGGATAAAACGCCACCGTTTCTTCTGCGGGCGCCCATGCAAGACCAGGAAGACAGCGAAGAGCGAAAAGAAACCTGGGTGGTGGTAGAAACTGGGCGGAGAAGGGCGC
CGGCGCCGGCGACCTTAGCTTTGGCAGTAGCTATAGATGGAAGGCGTGAGCCCAGAGCACTTGCAGAGCGATACACAACATTGAGAACCCTGTATAGTAAGATTGAAGTT
AAAAGAGGGAGGAGCAAAAATCGCAAGTCATCAATAGAAGCCGCTGAGAAGCTGGGAACCAGATCTGAACCATTAATAATTGTAGTTATGAACTGCTTACCCGATTCTGC
CAACTCCCATGTCATACAAGCAGCTACACCAACAGTAGTAAAATACACGTTGGCCAAGAATACATAA
mRNA sequenceShow/hide mRNA sequence
ATGAACAATCGTCCCGCTCTCCAATTCACTCGTCTCAAAGCCATTCCCCTTCGCATCATCCCAATCCGAATTAAGTTCGAGATGAGACCGATTTTGATTGAAGCAGAACC
GAAGGAACTGCGAGATGAGTCCGATTTTGAAGCAATCTTCTCCGACGGCGATTACATTTCCGTTTGTGGCTTTGGATCTCTCCTCTCTGAGAGGAGTGCGCGAAGTACTT
TTCCTGATCTGATCAACTTTAGAGTTGCAAGATTGAAAGGATTCAGACGCGTTTTCGGAATTATAGCTCCTATATTCTTCGAGCACGGCATTGCTAAATCTGAAACCAAG
GAGATTTCAAGCTTATTTGCGGAGCCTTGTGAAGGAGAAACTATCATCGTTACGGTTTTCGAGATTAAGAAGTCTGAGGTTCTTCCTCAAACATTAGATGGAAAGCTATA
TCATAAACCAGCGCTGCAAAAAAGTTGGGATAACAACTTTCTGGATCACACTTTTCTTGGAGATCGTAGTACAATCATTCGTGAATATTTGACTGCTAATGGGCATTCCG
CTTCGCATCGCATCCCAATCCGCTCTACGTTTCAGTTTCAGCTTGCGTTCCCTTTCTCTCTATGTATTGCGTACTCTCCTCTTTGCTCCGGCTCCGGCTCCGGCCTGCTT
CTTTCAAGTAACCTCCGGCGAATATCTCCGATCAACGCGTTGTCGAACTCCATGGTGAATGAAGCAAAACCAAAGGAACTGCGAGATGAGTCCGATTTTGAAGCTATCTT
CTCCGACAGCGATTATATTTCCGTTTGTGGCTTTGGTTCTCTCCTCTCTGAGAGGAGTGCGCGAAGTACCTTTCCTGATCTGATCAACTTTAGAGTGGCGCGATTGAACG
GCTTCAGACGCCTTTTCGGAAATGTAGCTCCTGTATTCTTTGAGCGCGGCATTGCTAAACCTGAAACCAAGTGTTTAACGACGTGCTATTCATCTGAATATCAGGAGATA
TCCAGTTTGTGTGCGGAGCCTTGCGAAGGAGAAAATATCATAGTTACGGTTTTCGAGATTAAGAAGTCTGAGGTTTTTCCTGAAACATTAGATGGAAAGCTATACCATAA
ACCAGCGGTGCTTTGTTCTCGATCCACTGATGAGGAGTTTTTCCAAGGAATAAGGACATTTTTTTTCATCATTATGGCCGTCATAATATTGATAAGATATGGAGAGATGA
TATCTTACCTTGTCGCGTCTATCTTCGACACTGCTGCAAAAAACCTGGGTGACAAAGCTTATAACAACTTTCTGGATCACACTTTCCTTGGAGATCGTAGTACAACCATC
CGTGAATATTTGACCGGTAATGGTTCAGGCATTATGGAAGAGGAGCCTCCAGAATCCCTCAAGTTTCGATATGGCGTTATTAGAGGAACTTCGGGCAACTCCTCCGTGGG
ATTGGATAAAACGCCACCGTTTCTTCTGCGGGCGCCCATGCAAGACCAGGAAGACAGCGAAGAGCGAAAAGAAACCTGGGTGGTGGTAGAAACTGGGCGGAGAAGGGCGC
CGGCGCCGGCGACCTTAGCTTTGGCAGTAGCTATAGATGGAAGGCGTGAGCCCAGAGCACTTGCAGAGCGATACACAACATTGAGAACCCTGTATAGTAAGATTGAAGTT
AAAAGAGGGAGGAGCAAAAATCGCAAGTCATCAATAGAAGCCGCTGAGAAGCTGGGAACCAGATCTGAACCATTAATAATTGTAGTTATGAACTGCTTACCCGATTCTGC
CAACTCCCATGTCATACAAGCAGCTACACCAACAGTAGTAAAATACACGTTGGCCAAGAATACATAATTATAACTTGCAAATCAGTCAAAATAATAATGGCAAGGCAAAA
GGATGTAGTAAGAAAATACACGTCAGATAATGGAAAATTATGATATCTTGTAGGGGGAAAAAAATGATGGTGGCTATGGATGAGGATAAGAAA
Protein sequenceShow/hide protein sequence
MNNRPALQFTRLKAIPLRIIPIRIKFEMRPILIEAEPKELRDESDFEAIFSDGDYISVCGFGSLLSERSARSTFPDLINFRVARLKGFRRVFGIIAPIFFEHGIAKSETK
EISSLFAEPCEGETIIVTVFEIKKSEVLPQTLDGKLYHKPALQKSWDNNFLDHTFLGDRSTIIREYLTANGHSASHRIPIRSTFQFQLAFPFSLCIAYSPLCSGSGSGLL
LSSNLRRISPINALSNSMVNEAKPKELRDESDFEAIFSDSDYISVCGFGSLLSERSARSTFPDLINFRVARLNGFRRLFGNVAPVFFERGIAKPETKCLTTCYSSEYQEI
SSLCAEPCEGENIIVTVFEIKKSEVFPETLDGKLYHKPAVLCSRSTDEEFFQGIRTFFFIIMAVIILIRYGEMISYLVASIFDTAAKNLGDKAYNNFLDHTFLGDRSTTI
REYLTGNGSGIMEEEPPESLKFRYGVIRGTSGNSSVGLDKTPPFLLRAPMQDQEDSEERKETWVVVETGRRRAPAPATLALAVAIDGRREPRALAERYTTLRTLYSKIEV
KRGRSKNRKSSIEAAEKLGTRSEPLIIVVMNCLPDSANSHVIQAATPTVVKYTLAKNT