; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc04g07940 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc04g07940
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Descriptionbasic form of pathogenesis-related protein 1-like
Genome locationchr4:5775526..5780481
RNA-Seq ExpressionMoc04g07940
SyntenyMoc04g07940
Gene Ontology termsGO:0005615 - extracellular space (cellular component)
InterPro domainsIPR001283 - Cysteine-rich secretory protein-related
IPR014044 - CAP domain
IPR035940 - CAP superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6571903.1 hypothetical protein SDJN03_28631, partial [Cucurbita argyrosperma subsp. sororia]3.1e-4565.52Show/hide
Query:  MAVPKSASTICLVGLTLALTLTMTATVAVANSSPKDFVDVHNAIRAENGVGPVAWNTTLADYAENFAKTRVDTCEMEHSMGPYAENLAEAFESTTAEATV
        MA     S   +VG+ L L   ++ +  +A SSPKDFVD HNAIRAENGVGPV+WNTTLA YA ++AKTR+ TCEMEHS GPYAENLAEA+E TTAE TV
Subjt:  MAVPKSASTICLVGLTLALTLTMTATVAVANSSPKDFVDVHNAIRAENGVGPVAWNTTLADYAENFAKTRVDTCEMEHSMGPYAENLAEAFESTTAEATV

Query:  KFWATEKEFYDPKANKCVNDECGHFMNVVGKDTKYIGCAEAQMAN
        KFWA+EKEFYDP   KCV +ECGHF+N+V KDT  IGCAE +  N
Subjt:  KFWATEKEFYDPKANKCVNDECGHFMNVVGKDTKYIGCAEAQMAN

XP_022148832.1 basic form of pathogenesis-related protein 1-like [Momordica charantia]7.3e-6383.67Show/hide
Query:  MAVPKSASTICLVGLTLALTLTMTATVAVANSSPKDFVDVHNAIRAENGVGPVAWNTTLADYAENFAKTRVDTCEMEHSMGPYAENLAEAFESTTAEATV
        MA PK   T+CLVGLTLALTLT+TA VAVA SSPKDFVD HN IRAE GVGPVAWNTTLADYAENFAKTRVDTCEMEHSMGPY E+LAEAFESTTAEATV
Subjt:  MAVPKSASTICLVGLTLALTLTMTATVAVANSSPKDFVDVHNAIRAENGVGPVAWNTTLADYAENFAKTRVDTCEMEHSMGPYAENLAEAFESTTAEATV

Query:  KFWATEKEFYDPKANKCVNDECGHFMNVVGKDTKYIGCAEAQMANRH
        K+WA+EKEFYD KANKCVNDECGHF+NVV KDTKYIGCAE +  N +
Subjt:  KFWATEKEFYDPKANKCVNDECGHFMNVVGKDTKYIGCAEAQMANRH

XP_022148837.1 basic form of pathogenesis-related protein 1-like [Momordica charantia]3.5e-7395.92Show/hide
Query:  MAVPKSASTICLVGLTLALTLTMTATVAVANSSPKDFVDVHNAIRAENGVGPVAWNTTLADYAENFAKTRVDTCEMEHSMGPYAENLAEAFESTTAEATV
        MAVPKSASTICLVGLTLALTLTMTATVAVANSSPKDFVDVHNAIRAENGVGPVAWNTTLADYAENFAKTRVDTCEMEHSMGPYAENLAEAFESTTAEATV
Subjt:  MAVPKSASTICLVGLTLALTLTMTATVAVANSSPKDFVDVHNAIRAENGVGPVAWNTTLADYAENFAKTRVDTCEMEHSMGPYAENLAEAFESTTAEATV

Query:  KFWATEKEFYDPKANKCVNDECGHFMNVVGKDTKYIGCAEAQMANRH
        KFWATEKEFYDPKANKCVNDECGHFMNVVGKDTKYIGCAE +  N +
Subjt:  KFWATEKEFYDPKANKCVNDECGHFMNVVGKDTKYIGCAEAQMANRH

XP_022158629.1 basic form of pathogenesis-related protein 1-like [Momordica charantia]1.3e-7295.24Show/hide
Query:  MAVPKSASTICLVGLTLALTLTMTATVAVANSSPKDFVDVHNAIRAENGVGPVAWNTTLADYAENFAKTRVDTCEMEHSMGPYAENLAEAFESTTAEATV
        MAVPK ASTICLVGLTLALTLTMTATVAVANSSPKDFVDVHNAIRAENGVGPVAWNTTLADYAENFAKTRVDTCEMEHSMGPYAENLAEAFESTTAEATV
Subjt:  MAVPKSASTICLVGLTLALTLTMTATVAVANSSPKDFVDVHNAIRAENGVGPVAWNTTLADYAENFAKTRVDTCEMEHSMGPYAENLAEAFESTTAEATV

Query:  KFWATEKEFYDPKANKCVNDECGHFMNVVGKDTKYIGCAEAQMANRH
        KFWATEKEFYDPKANKCVNDECGHFMNVVGKDTKYIGCAE +  N +
Subjt:  KFWATEKEFYDPKANKCVNDECGHFMNVVGKDTKYIGCAEAQMANRH

XP_022952850.1 basic form of pathogenesis-related protein 1-like [Cucurbita moschata]5.3e-4565.52Show/hide
Query:  MAVPKSASTICLVGLTLALTLTMTATVAVANSSPKDFVDVHNAIRAENGVGPVAWNTTLADYAENFAKTRVDTCEMEHSMGPYAENLAEAFESTTAEATV
        MA     S   +VG+ L L   +  +  +A SSPKDFVD HNAIRAENGVGPV+WNTTLA YA ++AKTR+ TCEMEHS GPYAENLAEA+E TTAE TV
Subjt:  MAVPKSASTICLVGLTLALTLTMTATVAVANSSPKDFVDVHNAIRAENGVGPVAWNTTLADYAENFAKTRVDTCEMEHSMGPYAENLAEAFESTTAEATV

Query:  KFWATEKEFYDPKANKCVNDECGHFMNVVGKDTKYIGCAEAQMAN
        KFWA+EKEFYDP   KCV +ECGHF+N+V KDT  IGCAE +  N
Subjt:  KFWATEKEFYDPKANKCVNDECGHFMNVVGKDTKYIGCAEAQMAN

TrEMBL top hitse value%identityAlignment
A0A5D3C4I8 Basic form of pathogenesis-related protein 1-like1.7e-4465.03Show/hide
Query:  KSASTICLVGLTLALTLTMTATVAVANSSPKDFVDVHNAIRAENGVGPVAWNTTLADYAENFAKTRVDTCEMEHSMGPYAENLAEAFESTTAEATVKFWA
        K  S  C+VGL+L L      +  +A SSP++FVD HNAIRA+ GVGPV WN TLA YAEN+AKTRV TCEMEHSMGPY ENLAEAFE TTAE TV +WA
Subjt:  KSASTICLVGLTLALTLTMTATVAVANSSPKDFVDVHNAIRAENGVGPVAWNTTLADYAENFAKTRVDTCEMEHSMGPYAENLAEAFESTTAEATVKFWA

Query:  TEKEFYDPKANKCVNDECGHFMNVVGKDTKYIGCAEAQMANRH
        TE +FYD K+NKCV +ECGHF+ VV KDT  IGCAE +  N +
Subjt:  TEKEFYDPKANKCVNDECGHFMNVVGKDTKYIGCAEAQMANRH

A0A6J1D642 basic form of pathogenesis-related protein 1-like3.5e-6383.67Show/hide
Query:  MAVPKSASTICLVGLTLALTLTMTATVAVANSSPKDFVDVHNAIRAENGVGPVAWNTTLADYAENFAKTRVDTCEMEHSMGPYAENLAEAFESTTAEATV
        MA PK   T+CLVGLTLALTLT+TA VAVA SSPKDFVD HN IRAE GVGPVAWNTTLADYAENFAKTRVDTCEMEHSMGPY E+LAEAFESTTAEATV
Subjt:  MAVPKSASTICLVGLTLALTLTMTATVAVANSSPKDFVDVHNAIRAENGVGPVAWNTTLADYAENFAKTRVDTCEMEHSMGPYAENLAEAFESTTAEATV

Query:  KFWATEKEFYDPKANKCVNDECGHFMNVVGKDTKYIGCAEAQMANRH
        K+WA+EKEFYD KANKCVNDECGHF+NVV KDTKYIGCAE +  N +
Subjt:  KFWATEKEFYDPKANKCVNDECGHFMNVVGKDTKYIGCAEAQMANRH

A0A6J1D646 basic form of pathogenesis-related protein 1-like1.7e-7395.92Show/hide
Query:  MAVPKSASTICLVGLTLALTLTMTATVAVANSSPKDFVDVHNAIRAENGVGPVAWNTTLADYAENFAKTRVDTCEMEHSMGPYAENLAEAFESTTAEATV
        MAVPKSASTICLVGLTLALTLTMTATVAVANSSPKDFVDVHNAIRAENGVGPVAWNTTLADYAENFAKTRVDTCEMEHSMGPYAENLAEAFESTTAEATV
Subjt:  MAVPKSASTICLVGLTLALTLTMTATVAVANSSPKDFVDVHNAIRAENGVGPVAWNTTLADYAENFAKTRVDTCEMEHSMGPYAENLAEAFESTTAEATV

Query:  KFWATEKEFYDPKANKCVNDECGHFMNVVGKDTKYIGCAEAQMANRH
        KFWATEKEFYDPKANKCVNDECGHFMNVVGKDTKYIGCAE +  N +
Subjt:  KFWATEKEFYDPKANKCVNDECGHFMNVVGKDTKYIGCAEAQMANRH

A0A6J1DZZ1 basic form of pathogenesis-related protein 1-like6.4e-7395.24Show/hide
Query:  MAVPKSASTICLVGLTLALTLTMTATVAVANSSPKDFVDVHNAIRAENGVGPVAWNTTLADYAENFAKTRVDTCEMEHSMGPYAENLAEAFESTTAEATV
        MAVPK ASTICLVGLTLALTLTMTATVAVANSSPKDFVDVHNAIRAENGVGPVAWNTTLADYAENFAKTRVDTCEMEHSMGPYAENLAEAFESTTAEATV
Subjt:  MAVPKSASTICLVGLTLALTLTMTATVAVANSSPKDFVDVHNAIRAENGVGPVAWNTTLADYAENFAKTRVDTCEMEHSMGPYAENLAEAFESTTAEATV

Query:  KFWATEKEFYDPKANKCVNDECGHFMNVVGKDTKYIGCAEAQMANRH
        KFWATEKEFYDPKANKCVNDECGHFMNVVGKDTKYIGCAE +  N +
Subjt:  KFWATEKEFYDPKANKCVNDECGHFMNVVGKDTKYIGCAEAQMANRH

A0A6J1GMW8 basic form of pathogenesis-related protein 1-like2.6e-4565.52Show/hide
Query:  MAVPKSASTICLVGLTLALTLTMTATVAVANSSPKDFVDVHNAIRAENGVGPVAWNTTLADYAENFAKTRVDTCEMEHSMGPYAENLAEAFESTTAEATV
        MA     S   +VG+ L L   +  +  +A SSPKDFVD HNAIRAENGVGPV+WNTTLA YA ++AKTR+ TCEMEHS GPYAENLAEA+E TTAE TV
Subjt:  MAVPKSASTICLVGLTLALTLTMTATVAVANSSPKDFVDVHNAIRAENGVGPVAWNTTLADYAENFAKTRVDTCEMEHSMGPYAENLAEAFESTTAEATV

Query:  KFWATEKEFYDPKANKCVNDECGHFMNVVGKDTKYIGCAEAQMAN
        KFWA+EKEFYDP   KCV +ECGHF+N+V KDT  IGCAE +  N
Subjt:  KFWATEKEFYDPKANKCVNDECGHFMNVVGKDTKYIGCAEAQMAN

SwissProt top hitse value%identityAlignment
P08299 Pathogenesis-related protein 1A3.0e-1938.64Show/hide
Query:  TLALTLTMTATVAVANSSPKDFVDVHNAIRAENGVGPVAWNTTLADYAENFAKTRVDTCEMEHSMGPYAENLAE-AFESTTAEATVKFWATEKEFYDPKA
        TL L L ++ +    NS  +D++D HN  RA+ GV P+ W+  +A YA+N+A      C + HS G Y ENLAE + +  TA   V+ W  EK++YD  +
Subjt:  TLALTLTMTATVAVANSSPKDFVDVHNAIRAENGVGPVAWNTTLADYAENFAKTRVDTCEMEHSMGPYAENLAE-AFESTTAEATVKFWATEKEFYDPKA

Query:  NKCVNDE-CGHFMNVVGKDTKYIGCAEAQMAN
        N C   + CGH+  VV +++  +GCA  Q  N
Subjt:  NKCVNDE-CGHFMNVVGKDTKYIGCAEAQMAN

P11670 Basic form of pathogenesis-related protein 15.7e-2643.31Show/hide
Query:  LTLTMTATVAVANSSPKDFVDVHNAIRAENGVGPVAWNTTLADYAENFAKTRVDTCEMEHSMGPYAENLAEAFESTTAEATVKFWATEKEFYDPKANKCV
        +T  +    + A +SP+D+++ HNA R + GVGP+ W+  LA YA+N+A  R+  C M HS GPY ENLA AF    A   VK W  EK FYD  +N CV
Subjt:  LTLTMTATVAVANSSPKDFVDVHNAIRAENGVGPVAWNTTLADYAENFAKTRVDTCEMEHSMGPYAENLAEAFESTTAEATVKFWATEKEFYDPKANKCV

Query:  NDECGHFMNVVGKDTKYIGCAEAQMAN
           CGH+  VV +++  +GCA  +  N
Subjt:  NDECGHFMNVVGKDTKYIGCAEAQMAN

P35793 Pathogenesis-related protein PRB1-31.8e-1937.33Show/hide
Query:  LTLALTLTMTA---TVAVANSSPKDFVDVHNAIRAENGVGPVAWNTTLADYAENFAKTRVDTCEMEHSMGPYAENL--AEAFESTTAEATVKFWATEKEF
        L + L L M+A    ++ A +SP+D+V  HNA RA  GVG V+W+T L  +A+N+A  R++ C+++HS GPY EN+    A     A   V  W +EK+ 
Subjt:  LTLALTLTMTA---TVAVANSSPKDFVDVHNAIRAENGVGPVAWNTTLADYAENFAKTRVDTCEMEHSMGPYAENL--AEAFESTTAEATVKFWATEKEF

Query:  YDPKANKCVNDE-CGHFMNVVGKDTKYIGCAEAQMANRHTGAPSCTLASR
        YD  +N C   + CGH+  VV + +  IGCA     N      +C    R
Subjt:  YDPKANKCVNDE-CGHFMNVVGKDTKYIGCAEAQMANRHTGAPSCTLASR

Q05968 Pathogenesis-related protein 15.2e-1936.67Show/hide
Query:  LTLALTLTMTA---TVAVANSSPKDFVDVHNAIRAENGVGPVAWNTTLADYAENFAKTRVDTCEMEHSMGPYAENL--AEAFESTTAEATVKFWATEKEF
        L + L L M A    ++ A +SP+D+V  HNA R+  GVG V+W+T L  +A+N+A  R++ C+++HS GPY EN+    A     A   V  W +EK+ 
Subjt:  LTLALTLTMTA---TVAVANSSPKDFVDVHNAIRAENGVGPVAWNTTLADYAENFAKTRVDTCEMEHSMGPYAENL--AEAFESTTAEATVKFWATEKEF

Query:  YDPKANKCVNDE-CGHFMNVVGKDTKYIGCAEAQMANRHTGAPSCTLASR
        YD  +N C   + CGH+  VV + +  IGCA     N      +C    R
Subjt:  YDPKANKCVNDE-CGHFMNVVGKDTKYIGCAEAQMANRHTGAPSCTLASR

Q08697 Pathogenesis-related protein 1A11.1e-2143.86Show/hide
Query:  ANSSPKDFVDVHNAIRAENGVGPVAWNTTLADYAENFAKTRVDTCEMEHSMGPYAENLAEAFESTTAEATVKFWATEKEFYDPKANKCVNDE-CGHFMNV
        A +  ++F++ HNA R   GVGP+ W+  LA YA+N+A  R D C M HS GPY ENLA AF    A   VK W  EK++YD  +N C   + CGH+  V
Subjt:  ANSSPKDFVDVHNAIRAENGVGPVAWNTTLADYAENFAKTRVDTCEMEHSMGPYAENLAEAFESTTAEATVKFWATEKEFYDPKANKCVNDE-CGHFMNV

Query:  VGKDTKYIGCAEAQ
        V + +  +GCA  +
Subjt:  VGKDTKYIGCAEAQ

Arabidopsis top hitse value%identityAlignment
AT1G50060.1 CAP (Cysteine-rich secretory proteins, Antigen 5, and Pathogenesis-related 1 protein) superfamily protein2.2e-2040.46Show/hide
Query:  LALTLTMTATVAVANSSPKDFVDVHNAIRAENGVGPVAWNTTLADYAENFAKTRVDTCEMEHSMGPYAENLAEAFEST-TAEATVKFWATEKEFYDPKAN
        +A++  + AT   A ++P+D+++ HN  RA+ GV  V W+TTLA YA N++  R   C + HS GPY ENLA+   S+ +A + VK W  EK +Y    N
Subjt:  LALTLTMTATVAVANSSPKDFVDVHNAIRAENGVGPVAWNTTLADYAENFAKTRVDTCEMEHSMGPYAENLAEAFEST-TAEATVKFWATEKEFYDPKAN

Query:  KCV-NDECGHFMNVVGKDTKYIGCAEAQMAN
         C    +C H+  VV +D+  IGCA  Q  N
Subjt:  KCV-NDECGHFMNVVGKDTKYIGCAEAQMAN

AT2G14580.1 basic pathogenesis-related protein 13.7e-2039.5Show/hide
Query:  ANSSPKDFVDVHNAIRAENGVGPVAWNTTLADYAENFAKTRVDTCEMEHSMGPYAENLAEAFESTTAEATVKFWATEKEFYDPKANKCVNDECGHFMNVV
        A  S +D+V+ HN  R++ GVGP+ W+  LA YA N+A      C + HS GPY ENLA++    +  A V  W  EK  Y+   N C N  CGH+  VV
Subjt:  ANSSPKDFVDVHNAIRAENGVGPVAWNTTLADYAENFAKTRVDTCEMEHSMGPYAENLAEAFESTTAEATVKFWATEKEFYDPKANKCVNDECGHFMNVV

Query:  GKDTKYIGCAEAQMANRHT
         +++  +GCA+ +  N  T
Subjt:  GKDTKYIGCAEAQMANRHT

AT2G14610.1 pathogenesis-related gene 15.9e-1837.41Show/hide
Query:  ICLVGLTLALTLTMTATVAVANSSPKDFVDVHNAIRAENGVGPVAWNTTLADYAENFAKTRVDTCEMEHSMGPYAENLAEAFESTTAEATVKFWATEKEF
        I  V L  AL L      + A  SP+D++ VHN  R   GVGP+ W+  +A YA ++A+     C + HS GPY ENLA      +  + V  W +EK  
Subjt:  ICLVGLTLALTLTMTATVAVANSSPKDFVDVHNAIRAENGVGPVAWNTTLADYAENFAKTRVDTCEMEHSMGPYAENLAEAFESTTAEATVKFWATEKEF

Query:  YDPKANKCVNDECGHFMNVVGKDTKYIGCAEAQMANRHT
        Y+  AN C N  CGH+  VV + +  +GCA+ +  N  T
Subjt:  YDPKANKCVNDECGHFMNVVGKDTKYIGCAEAQMANRHT

AT2G19990.1 pathogenesis-related protein-1-like4.6e-2342.48Show/hide
Query:  PKDFVDVHNAIRAENGVGPVAWNTTLADYAENFAKTRVDTCEMEHSMGPYAENLAEAFESTTAEATVKFWATEKEFYDPKANKCVND-ECGHFMNVVGKD
        P++ + VHN  RA  GVGP+ WN TLA YA+++A  R   C M+HS+GP+ ENLA  + + +     ++W TEKE YD  +N C  D  CGH+  +V +D
Subjt:  PKDFVDVHNAIRAENGVGPVAWNTTLADYAENFAKTRVDTCEMEHSMGPYAENLAEAFESTTAEATVKFWATEKEFYDPKANKCVND-ECGHFMNVVGKD

Query:  TKYIGCAEAQMAN
        +  +GCA  +  N
Subjt:  TKYIGCAEAQMAN

AT4G33720.1 CAP (Cysteine-rich secretory proteins, Antigen 5, and Pathogenesis-related 1 protein) superfamily protein1.0e-2241.91Show/hide
Query:  LTLALTLTMTATVAV-ANSSPKDFVDVHNAIRAENGVGPVAWNTTLADYAENFAKTRVDTCEMEHSMGPYAENLAEAFESTTAEATVKFWATEKEFYDPK
        L LA+T  +   V + A  SP+DF+ VHN  RAE GVGP+ W+  +A YA N+A  R   C M+HS G Y EN+A +  S T  A V  W  E+  YD  
Subjt:  LTLALTLTMTATVAV-ANSSPKDFVDVHNAIRAENGVGPVAWNTTLADYAENFAKTRVDTCEMEHSMGPYAENLAEAFESTTAEATVKFWATEKEFYDPK

Query:  ANKCVND-ECGHFMNVVGKDTKYIGCAEAQMANRHT
        +N C  D +CGH+  VV ++++ +GCA+ +  N  T
Subjt:  ANKCVND-ECGHFMNVVGKDTKYIGCAEAQMANRHT


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAGTCCCAAAGTCTGCCTCGACGATTTGTTTGGTGGGGCTAACCCTAGCCCTAACCCTAACCATGACTGCAACCGTAGCGGTTGCGAATAGCAGCCCGAAGGACTT
TGTGGATGTCCACAATGCGATTCGTGCCGAGAACGGCGTTGGCCCTGTGGCTTGGAATACGACGTTGGCTGACTATGCCGAGAACTTCGCAAAGACAAGGGTTGATACCT
GCGAGATGGAGCATTCGATGGGACCTTATGCCGAAAACTTGGCGGAGGCGTTCGAGTCGACGACGGCGGAGGCGACGGTGAAGTTCTGGGCTACTGAGAAGGAATTCTAC
GACCCCAAGGCCAACAAGTGTGTGAACGATGAGTGTGGCCATTTTATGAATGTGGTGGGAAAGGACACAAAATACATTGGTTGTGCTGAGGCGCAGATGGCTAACCGACA
TACAGGTGCACCGAGCTGTACGTTGGCCTCACGGAGAGGATCAAAATATATCGATCCACCAGAGCGAGAAGGTCGATCAAGATCCACCAGAGCGAGAAGGTCGATCAAGA
TCCACCAGAGCGAGAAGGTCGATCAAGATCCACCAGAGCGAGAAGGTCGATCAAGATCAACCAGAGCGAGAAGGTCGATCAAGATCAACCAGAGCGAGAAGGTCGATCAA
GATCAACCAGAGCGAGAAGGTCGATCAAGATCCACCAGAGCGAGAAGGTCGATCAAGATCCACCAGAGCGAGAAGGTCGATCAAGATCCACCAGAGCGAGAAGGTCGATC
AAGATCAACCACAAGATCAACCGGAAGCCAGAAGGTCGATCAAGATCAACCACAAGATCAACCGGAAGCCAGAAGGTCGATCAAGATCAACGAGAAGCCAGAAGGCCGCC
AAAAGGCCGATCAAGATCAACGAGAAGCCAGAAGGCCGATTAAGATCCACAAGAAGCCAGAAGGCCGATCAAGATCAACAAGAAGCCAGAAGGCCGATCAAGATCCACAA
GAAGCATCAAGATCCACAAGAAGCCAGAAGGCCGCCATGAGGCCGATCAAGATCCACAAGAAGCCAGAAGGCCGATCAAGATCCACAAGAAGCCAGAAGGCCGATCAAGA
TCCACAAGCCACTAAGAGGTCGATCAAGATCAACAAGCTGTCAAAGAGGAAGGCCGATCAAGATCCACAAGCCACTAAGAGGTCGATCAAGATCAACAAGTTGTCAAAGA
GGCCGATCCAGATCAACACACCATCAAGAGGACGATACAGGTCAACACGTCGCCAAGAGGTCGATCAAGATCAACACGCCGCCAACAGACCAATCAATATCCACAAGCGG
CAAGAGCCGAATTTAGAGAAGTTGCGGAGGAGCAGTAGAATGCCAGAATTAGAGAAGAAATGTCGGAGCATTGTTGTGCCAAAGCATGTCAATGCTGGAGTATCACTTGC
TGCAGAAATTCCCAGGCACAACAAAATAAGGATTGTCTCCCCATCAATCCCCAAGGGGGGAATTGATCCCAAGAGAGAAAATACTCCAAAAATGGTGCTGAAGTCGAGGA
GAGCAGACCCATTTCGTGGAAGAGCATATGACAGGGAAGAGCTGCTGCTGCCATTGCGTGCTGGTGGTCTCCATCTGCATTTTTCTCTGTTTCTCTAG
mRNA sequenceShow/hide mRNA sequence
ATGGCAGTCCCAAAGTCTGCCTCGACGATTTGTTTGGTGGGGCTAACCCTAGCCCTAACCCTAACCATGACTGCAACCGTAGCGGTTGCGAATAGCAGCCCGAAGGACTT
TGTGGATGTCCACAATGCGATTCGTGCCGAGAACGGCGTTGGCCCTGTGGCTTGGAATACGACGTTGGCTGACTATGCCGAGAACTTCGCAAAGACAAGGGTTGATACCT
GCGAGATGGAGCATTCGATGGGACCTTATGCCGAAAACTTGGCGGAGGCGTTCGAGTCGACGACGGCGGAGGCGACGGTGAAGTTCTGGGCTACTGAGAAGGAATTCTAC
GACCCCAAGGCCAACAAGTGTGTGAACGATGAGTGTGGCCATTTTATGAATGTGGTGGGAAAGGACACAAAATACATTGGTTGTGCTGAGGCGCAGATGGCTAACCGACA
TACAGGTGCACCGAGCTGTACGTTGGCCTCACGGAGAGGATCAAAATATATCGATCCACCAGAGCGAGAAGGTCGATCAAGATCCACCAGAGCGAGAAGGTCGATCAAGA
TCCACCAGAGCGAGAAGGTCGATCAAGATCCACCAGAGCGAGAAGGTCGATCAAGATCAACCAGAGCGAGAAGGTCGATCAAGATCAACCAGAGCGAGAAGGTCGATCAA
GATCAACCAGAGCGAGAAGGTCGATCAAGATCCACCAGAGCGAGAAGGTCGATCAAGATCCACCAGAGCGAGAAGGTCGATCAAGATCCACCAGAGCGAGAAGGTCGATC
AAGATCAACCACAAGATCAACCGGAAGCCAGAAGGTCGATCAAGATCAACCACAAGATCAACCGGAAGCCAGAAGGTCGATCAAGATCAACGAGAAGCCAGAAGGCCGCC
AAAAGGCCGATCAAGATCAACGAGAAGCCAGAAGGCCGATTAAGATCCACAAGAAGCCAGAAGGCCGATCAAGATCAACAAGAAGCCAGAAGGCCGATCAAGATCCACAA
GAAGCATCAAGATCCACAAGAAGCCAGAAGGCCGCCATGAGGCCGATCAAGATCCACAAGAAGCCAGAAGGCCGATCAAGATCCACAAGAAGCCAGAAGGCCGATCAAGA
TCCACAAGCCACTAAGAGGTCGATCAAGATCAACAAGCTGTCAAAGAGGAAGGCCGATCAAGATCCACAAGCCACTAAGAGGTCGATCAAGATCAACAAGTTGTCAAAGA
GGCCGATCCAGATCAACACACCATCAAGAGGACGATACAGGTCAACACGTCGCCAAGAGGTCGATCAAGATCAACACGCCGCCAACAGACCAATCAATATCCACAAGCGG
CAAGAGCCGAATTTAGAGAAGTTGCGGAGGAGCAGTAGAATGCCAGAATTAGAGAAGAAATGTCGGAGCATTGTTGTGCCAAAGCATGTCAATGCTGGAGTATCACTTGC
TGCAGAAATTCCCAGGCACAACAAAATAAGGATTGTCTCCCCATCAATCCCCAAGGGGGGAATTGATCCCAAGAGAGAAAATACTCCAAAAATGGTGCTGAAGTCGAGGA
GAGCAGACCCATTTCGTGGAAGAGCATATGACAGGGAAGAGCTGCTGCTGCCATTGCGTGCTGGTGGTCTCCATCTGCATTTTTCTCTGTTTCTCTAG
Protein sequenceShow/hide protein sequence
MAVPKSASTICLVGLTLALTLTMTATVAVANSSPKDFVDVHNAIRAENGVGPVAWNTTLADYAENFAKTRVDTCEMEHSMGPYAENLAEAFESTTAEATVKFWATEKEFY
DPKANKCVNDECGHFMNVVGKDTKYIGCAEAQMANRHTGAPSCTLASRRGSKYIDPPEREGRSRSTRARRSIKIHQSEKVDQDPPEREGRSRSTRARRSIKINQSEKVDQ
DQPEREGRSRSTRARRSIKIHQSEKVDQDPPEREGRSRSTTRSTGSQKVDQDQPQDQPEARRSIKINEKPEGRQKADQDQREARRPIKIHKKPEGRSRSTRSQKADQDPQ
EASRSTRSQKAAMRPIKIHKKPEGRSRSTRSQKADQDPQATKRSIKINKLSKRKADQDPQATKRSIKINKLSKRPIQINTPSRGRYRSTRRQEVDQDQHAANRPINIHKR
QEPNLEKLRRSSRMPELEKKCRSIVVPKHVNAGVSLAAEIPRHNKIRIVSPSIPKGGIDPKRENTPKMVLKSRRADPFRGRAYDREELLLPLRAGGLHLHFSLFL