; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC04g1266 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC04g1266
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionUsp domain-containing protein
Genome locationMC04:20660445..20662986
RNA-Seq ExpressionMC04g1266
SyntenyMC04g1266
Gene Ontology termsNA
InterPro domainsIPR006016 - UspA
IPR014729 - Rossmann-like alpha/beta/alpha sandwich fold


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6578971.1 hypothetical protein SDJN03_23419, partial [Cucurbita argyrosperma subsp. sororia]4.36e-11882.3Show/hide
Query:  MPSTESFLRQISTREGSKSTSRRWGGE------------GSHWGQKMEGGVNNLYGIESGGMSRRKRVMVVVDQTSQANHATMWALTHVANKGDVLTLLH
        MPSTESFLRQIS REGS+S SRRWGGE            G  W QKMEGGVNN+YGI++GGMSR+KRVMVVVDQTSQ+NHATMWALTHVANKGDVLTLLH
Subjt:  MPSTESFLRQISTREGSKSTSRRWGGE------------GSHWGQKMEGGVNNLYGIESGGMSRRKRVMVVVDQTSQANHATMWALTHVANKGDVLTLLH

Query:  VIPTSSAHSSSSADSSSSSSFSATSLGSLCKASRPEVEVEVLVIQGPKLATVMNQVKKLEVSVLVLGQRRPS-FLSCFRGSGGAGDLVDHCINNAECLTI
        VI T S  S + + SSSSSSF ATSLGSLCKASRPEVEVEVLV++GPKLATVMNQVKKLEVSVLVLGQRRPS F SCF GSGGAGDLV+ CINNAECLTI
Subjt:  VIPTSSAHSSSSADSSSSSSFSATSLGSLCKASRPEVEVEVLVIQGPKLATVMNQVKKLEVSVLVLGQRRPS-FLSCFRGSGGAGDLVDHCINNAECLTI

Query:  GVRKQSRDMGGYVINTRWQKNFWLLA
        GVRKQSRDMGGYVINTRWQKNFWLLA
Subjt:  GVRKQSRDMGGYVINTRWQKNFWLLA

KAG7016492.1 hypothetical protein SDJN02_21601, partial [Cucurbita argyrosperma subsp. argyrosperma]1.91e-11781.86Show/hide
Query:  MPSTESFLRQISTREGSKSTSRRWGGE------------GSHWGQKMEGGVNNLYGIESGGMSRRKRVMVVVDQTSQANHATMWALTHVANKGDVLTLLH
        MPSTESFLRQIS RE S+S SRRWGGE            G  W QKMEGGVNN+YGI++GGMSR+KRVMVVVDQTSQ+NHATMWALTHVANKGDVLTLLH
Subjt:  MPSTESFLRQISTREGSKSTSRRWGGE------------GSHWGQKMEGGVNNLYGIESGGMSRRKRVMVVVDQTSQANHATMWALTHVANKGDVLTLLH

Query:  VIPTSSAHSSSSADSSSSSSFSATSLGSLCKASRPEVEVEVLVIQGPKLATVMNQVKKLEVSVLVLGQRRPS-FLSCFRGSGGAGDLVDHCINNAECLTI
        VI T S  S + + SSSSSSF ATSLGSLCKASRPEVEVEVLV++GPKLATVMNQVKKLEVSVLVLGQRRPS F SCF GSGGAGDLV+ CINNAECLTI
Subjt:  VIPTSSAHSSSSADSSSSSSFSATSLGSLCKASRPEVEVEVLVIQGPKLATVMNQVKKLEVSVLVLGQRRPS-FLSCFRGSGGAGDLVDHCINNAECLTI

Query:  GVRKQSRDMGGYVINTRWQKNFWLLA
        GVRKQSRDMGGYVINTRWQKNFWLLA
Subjt:  GVRKQSRDMGGYVINTRWQKNFWLLA

XP_022141362.1 universal stress protein PHOS32 [Momordica charantia]1.13e-145100Show/hide
Query:  MPSTESFLRQISTREGSKSTSRRWGGEGSHWGQKMEGGVNNLYGIESGGMSRRKRVMVVVDQTSQANHATMWALTHVANKGDVLTLLHVIPTSSAHSSSS
        MPSTESFLRQISTREGSKSTSRRWGGEGSHWGQKMEGGVNNLYGIESGGMSRRKRVMVVVDQTSQANHATMWALTHVANKGDVLTLLHVIPTSSAHSSSS
Subjt:  MPSTESFLRQISTREGSKSTSRRWGGEGSHWGQKMEGGVNNLYGIESGGMSRRKRVMVVVDQTSQANHATMWALTHVANKGDVLTLLHVIPTSSAHSSSS

Query:  ADSSSSSSFSATSLGSLCKASRPEVEVEVLVIQGPKLATVMNQVKKLEVSVLVLGQRRPSFLSCFRGSGGAGDLVDHCINNAECLTIGVRKQSRDMGGYV
        ADSSSSSSFSATSLGSLCKASRPEVEVEVLVIQGPKLATVMNQVKKLEVSVLVLGQRRPSFLSCFRGSGGAGDLVDHCINNAECLTIGVRKQSRDMGGYV
Subjt:  ADSSSSSSFSATSLGSLCKASRPEVEVEVLVIQGPKLATVMNQVKKLEVSVLVLGQRRPSFLSCFRGSGGAGDLVDHCINNAECLTIGVRKQSRDMGGYV

Query:  INTRWQKNFWLLA
        INTRWQKNFWLLA
Subjt:  INTRWQKNFWLLA

XP_022993828.1 uncharacterized protein LOC111489724 [Cucurbita maxima]4.81e-11983.78Show/hide
Query:  MPSTESFLRQISTREGSKSTSRRWGGE--------GSHWGQKMEGGVNNLYGIESGGMSRRKRVMVVVDQTSQANHATMWALTHVANKGDVLTLLHVIPT
        MPSTESFLRQIS REGS+S SRRWGGE        G  W QKMEGGVNN+YGI++GGMSR+KRVMVVVDQTSQ+NHATMWALTHVANKGDVLTLLHVI  
Subjt:  MPSTESFLRQISTREGSKSTSRRWGGE--------GSHWGQKMEGGVNNLYGIESGGMSRRKRVMVVVDQTSQANHATMWALTHVANKGDVLTLLHVIPT

Query:  SSAHSSSSADSSSSSSFSATSLGSLCKASRPEVEVEVLVIQGPKLATVMNQVKKLEVSVLVLGQRRPS-FLSCFRGSGGAGDLVDHCINNAECLTIGVRK
        S+  S+ S  SSSSSSF ATSLGSLCKASRPEVEVEVLV++GPKLATVMNQVKKLEVSVLVLGQRRPS F SCF GSGGAGDLV+ CINNAECLTIGVRK
Subjt:  SSAHSSSSADSSSSSSFSATSLGSLCKASRPEVEVEVLVIQGPKLATVMNQVKKLEVSVLVLGQRRPS-FLSCFRGSGGAGDLVDHCINNAECLTIGVRK

Query:  QSRDMGGYVINTRWQKNFWLLA
        QSRDMGGYVINTRWQKNFWLLA
Subjt:  QSRDMGGYVINTRWQKNFWLLA

XP_023550879.1 uncharacterized protein LOC111808882 [Cucurbita pepo subsp. pepo]3.05e-11982.38Show/hide
Query:  MPSTESFLRQISTREGSKSTSRRWGGE------------GSHWGQKMEGGVNNLYGIESGGMSRRKRVMVVVDQTSQANHATMWALTHVANKGDVLTLLH
        MPSTESFLRQIS REGS+S SRRWGGE            G  W QKMEGGVNN+YGI++GGMSR+KRVMVVVDQTSQ+NHATMWALTHVANKGDVLTLLH
Subjt:  MPSTESFLRQISTREGSKSTSRRWGGE------------GSHWGQKMEGGVNNLYGIESGGMSRRKRVMVVVDQTSQANHATMWALTHVANKGDVLTLLH

Query:  VIPTS-SAHSSSSADSSSSSSFSATSLGSLCKASRPEVEVEVLVIQGPKLATVMNQVKKLEVSVLVLGQRRPS-FLSCFRGSGGAGDLVDHCINNAECLT
        VI  S S+ + SS+ SSSSSSF ATSLGSLCKASRPEVEVEVLV++GPKLATVMNQVKKLEVSVLVLGQRRPS F SCF GSGGAGDLV+ CINNAECLT
Subjt:  VIPTS-SAHSSSSADSSSSSSFSATSLGSLCKASRPEVEVEVLVIQGPKLATVMNQVKKLEVSVLVLGQRRPS-FLSCFRGSGGAGDLVDHCINNAECLT

Query:  IGVRKQSRDMGGYVINTRWQKNFWLLA
        IGVRKQSRDMGGYVINTRWQKNFWLLA
Subjt:  IGVRKQSRDMGGYVINTRWQKNFWLLA

TrEMBL top hitse value%identityAlignment
A0A1S3CRX0 uncharacterized protein LOC1035040533.99e-11580.35Show/hide
Query:  MPSTESFLRQISTREG---SKSTSRRWGGE------------GSHWGQKMEGG-VNNLYGIESGGMSRRKRVMVVVDQTSQANHATMWALTHVANKGDVL
        MPSTESFLRQIS+R G   S+STSRRWGGE            GS W QKMEGG VN++YGI++GGMSRRKRVMVVVD TSQ++HATMWALTH+ANKGDVL
Subjt:  MPSTESFLRQISTREG---SKSTSRRWGGE------------GSHWGQKMEGG-VNNLYGIESGGMSRRKRVMVVVDQTSQANHATMWALTHVANKGDVL

Query:  TLLHVIPTSSAHSSSSADSSSSSSFSATSLGSLCKASRPEVEVEVLVIQGPKLATVMNQVKKLEVSVLVLGQRRPSFLSCFRGSGGAGDLVDHCINNAEC
        TLLHVI  SS  SSSSA + SSSSF A+SLGSLCKASRPEVEVEVLVI+GPKLATVMNQVKKLEVSVLV+GQRRPS  SCF GSGGAGDLV+ CINNAEC
Subjt:  TLLHVIPTSSAHSSSSADSSSSSSFSATSLGSLCKASRPEVEVEVLVIQGPKLATVMNQVKKLEVSVLVLGQRRPSFLSCFRGSGGAGDLVDHCINNAEC

Query:  LTIGVRKQSRDMGGYVINTRWQKNFWLLA
        LTIGVRKQSRDMGGYVINTRWQKNFWLLA
Subjt:  LTIGVRKQSRDMGGYVINTRWQKNFWLLA

A0A6J1CJN1 universal stress protein PHOS325.49e-146100Show/hide
Query:  MPSTESFLRQISTREGSKSTSRRWGGEGSHWGQKMEGGVNNLYGIESGGMSRRKRVMVVVDQTSQANHATMWALTHVANKGDVLTLLHVIPTSSAHSSSS
        MPSTESFLRQISTREGSKSTSRRWGGEGSHWGQKMEGGVNNLYGIESGGMSRRKRVMVVVDQTSQANHATMWALTHVANKGDVLTLLHVIPTSSAHSSSS
Subjt:  MPSTESFLRQISTREGSKSTSRRWGGEGSHWGQKMEGGVNNLYGIESGGMSRRKRVMVVVDQTSQANHATMWALTHVANKGDVLTLLHVIPTSSAHSSSS

Query:  ADSSSSSSFSATSLGSLCKASRPEVEVEVLVIQGPKLATVMNQVKKLEVSVLVLGQRRPSFLSCFRGSGGAGDLVDHCINNAECLTIGVRKQSRDMGGYV
        ADSSSSSSFSATSLGSLCKASRPEVEVEVLVIQGPKLATVMNQVKKLEVSVLVLGQRRPSFLSCFRGSGGAGDLVDHCINNAECLTIGVRKQSRDMGGYV
Subjt:  ADSSSSSSFSATSLGSLCKASRPEVEVEVLVIQGPKLATVMNQVKKLEVSVLVLGQRRPSFLSCFRGSGGAGDLVDHCINNAECLTIGVRKQSRDMGGYV

Query:  INTRWQKNFWLLA
        INTRWQKNFWLLA
Subjt:  INTRWQKNFWLLA

A0A6J1FKL3 uncharacterized protein LOC1114448691.48e-11681.42Show/hide
Query:  MPSTESFLRQISTREGSKSTSRRWGGE------------GSHWGQKMEGGVNNLYGIESGGMSRRKRVMVVVDQTSQANHATMWALTHVANKGDVLTLLH
        M STESFLRQIS REGS+S SRRWGGE            G  W QKMEGGVNN+YGI++GGMSR+KRVMVVVDQTSQ+NHATMWALTHVANKGDVLTLLH
Subjt:  MPSTESFLRQISTREGSKSTSRRWGGE------------GSHWGQKMEGGVNNLYGIESGGMSRRKRVMVVVDQTSQANHATMWALTHVANKGDVLTLLH

Query:  VIPTSSAHSSSSADSSSSSSFSATSLGSLCKASRPEVEVEVLVIQGPKLATVMNQVKKLEVSVLVLGQRRPS-FLSCFRGSGGAGDLVDHCINNAECLTI
        VI  S+  SS +  SSSSSSF ATSLGS+CKASRPEVEVEVLV++GPKLATVMNQVKKLEVSVLVLGQRRPS F SCF GSGGAGDLV+ CINNAECLTI
Subjt:  VIPTSSAHSSSSADSSSSSSFSATSLGSLCKASRPEVEVEVLVIQGPKLATVMNQVKKLEVSVLVLGQRRPS-FLSCFRGSGGAGDLVDHCINNAECLTI

Query:  GVRKQSRDMGGYVINTRWQKNFWLLA
        GVRKQSRDMGGYVINTRWQKNFWLLA
Subjt:  GVRKQSRDMGGYVINTRWQKNFWLLA

A0A6J1JPP1 uncharacterized protein LOC1114877676.30e-11580.09Show/hide
Query:  MPSTESFLRQISTREGSKSTSRRWGGE-------------GSHWGQKMEGGVNNLYGIESGGMSRRKRVMVVVDQTSQANHATMWALTHVANKGDVLTLL
        MP  ESFLRQIS  EGS+STS+RWGGE             GSHW +KMEGGVN +YG++ GG+SRRKRVMVVVD TSQ+NHATMWALTHVANKGDVLTLL
Subjt:  MPSTESFLRQISTREGSKSTSRRWGGE-------------GSHWGQKMEGGVNNLYGIESGGMSRRKRVMVVVDQTSQANHATMWALTHVANKGDVLTLL

Query:  HVIPTSSAHSSSSADSSSSSSFSATSLGSLCKASRPEVEVEVLVIQGPKLATVMNQVKKLEVSVLVLGQRRPSFLSCFRGSGGAGDLVDHCINNAECLTI
        HVI  SS  SSSS    SSS F ATSLGSLCKASRPEVEVEVLVI+GPKL TVMNQVKKLEVSVLVLGQRRPSFLSCF GSGGAGDLV+ CINNAECLTI
Subjt:  HVIPTSSAHSSSSADSSSSSSFSATSLGSLCKASRPEVEVEVLVIQGPKLATVMNQVKKLEVSVLVLGQRRPSFLSCFRGSGGAGDLVDHCINNAECLTI

Query:  GVRKQSRDMGGYVINTRWQKNFWLLA
        GVRKQSRDMGGYVINTRWQ+NFWLLA
Subjt:  GVRKQSRDMGGYVINTRWQKNFWLLA

A0A6J1K3E8 uncharacterized protein LOC1114897242.33e-11983.78Show/hide
Query:  MPSTESFLRQISTREGSKSTSRRWGGE--------GSHWGQKMEGGVNNLYGIESGGMSRRKRVMVVVDQTSQANHATMWALTHVANKGDVLTLLHVIPT
        MPSTESFLRQIS REGS+S SRRWGGE        G  W QKMEGGVNN+YGI++GGMSR+KRVMVVVDQTSQ+NHATMWALTHVANKGDVLTLLHVI  
Subjt:  MPSTESFLRQISTREGSKSTSRRWGGE--------GSHWGQKMEGGVNNLYGIESGGMSRRKRVMVVVDQTSQANHATMWALTHVANKGDVLTLLHVIPT

Query:  SSAHSSSSADSSSSSSFSATSLGSLCKASRPEVEVEVLVIQGPKLATVMNQVKKLEVSVLVLGQRRPS-FLSCFRGSGGAGDLVDHCINNAECLTIGVRK
        S+  S+ S  SSSSSSF ATSLGSLCKASRPEVEVEVLV++GPKLATVMNQVKKLEVSVLVLGQRRPS F SCF GSGGAGDLV+ CINNAECLTIGVRK
Subjt:  SSAHSSSSADSSSSSSFSATSLGSLCKASRPEVEVEVLVIQGPKLATVMNQVKKLEVSVLVLGQRRPS-FLSCFRGSGGAGDLVDHCINNAECLTIGVRK

Query:  QSRDMGGYVINTRWQKNFWLLA
        QSRDMGGYVINTRWQKNFWLLA
Subjt:  QSRDMGGYVINTRWQKNFWLLA

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G44760.1 Adenine nucleotide alpha hydrolases-like superfamily protein1.5e-5756.36Show/hide
Query:  STESFLRQISTREGSKSTSRRW--GGEGSHWGQKMEGG----VNNLYGIESGG--MSRRKRVMVVVDQTSQANHATMWALTHVANKGDVLTLLHVIPTSS
        S  S LRQ+S +EG +S S+RW  G   + +     GG    +  LYG+ SGG   +R KRVMVVVD++S++ HA MWALTH+ NKGD++TLLHV+    
Subjt:  STESFLRQISTREGSKSTSRRW--GGEGSHWGQKMEGG----VNNLYGIESGG--MSRRKRVMVVVDQTSQANHATMWALTHVANKGDVLTLLHVIPTSS

Query:  AHSSSSADSSSSSSFSATSLGSLCKASRPEVEVEVLVIQGPKLATVMNQVKKLEVSVLVLGQRRPS-FLSCFRGSGGAGDLVDHCINNAECLTIGVRKQS
             S D  ++ S  A SLGSLCKA +PEV+VE LVIQGPKLATV++QVKKLEVSVLVLGQ++ +  +SC  G   + +LV+ CIN A+CLTIGVRKQ 
Subjt:  AHSSSSADSSSSSSFSATSLGSLCKASRPEVEVEVLVIQGPKLATVMNQVKKLEVSVLVLGQRRPS-FLSCFRGSGGAGDLVDHCINNAECLTIGVRKQS

Query:  RDMGGYVINTRWQKNFWLLA
        + +GGY+INTRWQKNFWLLA
Subjt:  RDMGGYVINTRWQKNFWLLA

AT1G69080.1 Adenine nucleotide alpha hydrolases-like superfamily protein1.3e-2132.78Show/hide
Query:  KRVMVVVDQTSQANHATMWALTHVANKGDVLTLLHVIPTSSAHSSSSAD------------SSSSSSFSATSLGSLCKASRPEVEVEVLVIQG-PKLATV
        +R++VVVD  S+A +A +W L+H A   D + LLH +   ++ S   A+            ++S +    ++L ++C+  RPEV+ EV+ ++G  K  T+
Subjt:  KRVMVVVDQTSQANHATMWALTHVANKGDVLTLLHVIPTSSAHSSSSAD------------SSSSSSFSATSLGSLCKASRPEVEVEVLVIQG-PKLATV

Query:  MNQVKKLEVSVLVLGQRRP----SFLSCFRGSG---GAGDLVDHCINNAECLTIGVRKQSRDMGGYVINTRWQKNFWLLA
        + + ++ E S+LVLGQ++       L  +          D V++CINN+ C+ I VRK+ + +GGY + T+  K+FWLLA
Subjt:  MNQVKKLEVSVLVLGQRRP----SFLSCFRGSG---GAGDLVDHCINNAECLTIGVRKQSRDMGGYVINTRWQKNFWLLA

AT1G69080.2 Adenine nucleotide alpha hydrolases-like superfamily protein5.1e-1832.14Show/hide
Query:  KRVMVVVDQTSQANHATMWALTHVANKGDVLTLLHVIPTSSAHSSSSADSSSSSSFSATSLGSLCKASRPEVEVEVLVIQG-PKLATVMNQVKKLEVSVL
        +R++VVVD  S+A +A +W L+H A   D + LLH +   ++ S   A+       S          +  +V+ EV+ ++G  K  T++ + ++ E S+L
Subjt:  KRVMVVVDQTSQANHATMWALTHVANKGDVLTLLHVIPTSSAHSSSSADSSSSSSFSATSLGSLCKASRPEVEVEVLVIQG-PKLATVMNQVKKLEVSVL

Query:  VLGQRRP----SFLSCFRGSG---GAGDLVDHCINNAECLTIGVRKQSRDMGGYVINTRWQKNFWLLA
        VLGQ++       L  +          D V++CINN+ C+ I VRK+ + +GGY + T+  K+FWLLA
Subjt:  VLGQRRP----SFLSCFRGSG---GAGDLVDHCINNAECLTIGVRKQSRDMGGYVINTRWQKNFWLLA

AT2G03720.1 Adenine nucleotide alpha hydrolases-like superfamily protein5.8e-2239.16Show/hide
Query:  MVVVDQTSQANHATMWALTHVANKGDVLTLLHV--IPTSSAHSSSSADSSSSSSFSATSLGSLCKASRPEVEVEVLVIQ--GPKLATVMNQVKKLEVSVL
        MVVVD TSQ  +A  WALTH     D +TLLHV   P   A   +  + +S +      L + C+  +P V+ E++V++    K  T++ + KK    VL
Subjt:  MVVVDQTSQANHATMWALTHVANKGDVLTLLHV--IPTSSAHSSSSADSSSSSSFSATSLGSLCKASRPEVEVEVLVIQ--GPKLATVMNQVKKLEVSVL

Query:  VLGQRRPS----FLSCFRGSGG-AGDLVDHCINNAECLTIGVRKQSRDMGGYVINTRWQKNFWLLA
        VLGQR+ +     +  +R  GG  G +V++CI+N++C+ I VRK+S + GGY+I T+  K+FWLLA
Subjt:  VLGQRRPS----FLSCFRGSGG-AGDLVDHCINNAECLTIGVRKQSRDMGGYVINTRWQKNFWLLA

AT5G17390.1 Adenine nucleotide alpha hydrolases-like superfamily protein1.1e-1733.33Show/hide
Query:  RVMVVVDQTSQANHATMWALTHVANKGDVLTLLHVIPTSSAHSSSSADSSSSSSFSATSLGSLCKASRPEVEVEVLVIQG---PKLATVMNQVKKLEVSV
        RVMVVVD+   +  A  WA+TH     D L LL+           +      +     +L  LC+  RP +EVE+  ++G    K   ++ + KK +VS+
Subjt:  RVMVVVDQTSQANHATMWALTHVANKGDVLTLLHVIPTSSAHSSSSADSSSSSSFSATSLGSLCKASRPEVEVEVLVIQG---PKLATVMNQVKKLEVSV

Query:  LVLGQ-RRPSFLS-----CFRGSGGAGDLVDHCINNAECLTIGVRKQSRDMGGYVINTRWQKNFWLLA
        LV+GQ ++P          ++   G   ++ +C+ NA C+TI V+ ++R +GGY+I T+  KNFWLLA
Subjt:  LVLGQ-RRPSFLS-----CFRGSGGAGDLVDHCINNAECLTIGVRKQSRDMGGYVINTRWQKNFWLLA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCAAGTACCGAATCGTTTCTGAGGCAGATAAGTACAAGGGAGGGCTCCAAATCTACATCTAGGAGGTGGGGTGGCGAGGGAAGCCATTGGGGCCAGAAGATGGAGGG
CGGTGTTAACAACTTGTATGGGATCGAGAGCGGCGGAATGTCTAGGAGGAAGAGGGTGATGGTGGTGGTGGATCAGACTTCACAGGCCAACCATGCAACCATGTGGGCGC
TAACTCATGTGGCTAACAAGGGCGATGTTCTTACTCTTCTTCATGTCATCCCTACCTCCTCTGCCCACTCTTCTTCTTCTGCAGATTCTTCTTCCTCTTCCTCTTTCTCT
GCTACTTCTCTTGGTTCCCTCTGCAAGGCTTCTAGACCTGAGGTGGAGGTGGAAGTGCTGGTGATCCAGGGGCCAAAGCTGGCCACAGTGATGAATCAAGTGAAGAAGTT
GGAGGTGTCGGTGCTGGTTCTGGGACAGAGAAGACCATCGTTTCTCAGTTGCTTTCGTGGAAGTGGTGGGGCGGGAGATCTGGTGGATCATTGCATAAATAACGCAGAGT
GCTTGACCATTGGAGTGAGAAAGCAGAGCAGAGACATGGGTGGCTATGTAATCAACACCAGATGGCAGAAGAACTTCTGGCTTCTGGCTTAG
mRNA sequenceShow/hide mRNA sequence
GTATAGAAATTTTGGAAATAACCAAAAACAAATTTCTTCTTTCTTATTCCCACTTGAAACAAAAAACAAGTTTATCATACTTAAAACTGAAACAAAAAACATAAACGTTG
TCAAAAGAGACCCCAGATGAAAACACCAATCTAATTTAATGTTTTTGGCCTATCCTAGAAAACAAATAGTATAACATCAAACGTCTAATTTTTTTAAATTTAAAATTCTA
AAACCCTAAACAAATCATCAAGCAACAATCTAATTTATTTATATTTTGTCAGAAATAATATTGATATTCATCTGACGGTACTTGCCCACTTTAGGTTGCAGATTCTGGTG
AAAAGTTCCAAAAGAAATATTCAAAAAAGAAGTAAAATATTTTGAAAGAGAGAGAGTACAGAGGGAATCCTTACCTTTCATGATGATACGCAGACAGCAGTGATTGTGTT
TCATTTCTTTGTGTGTGCTGTTCAGTAGAGCCACGCCTGCCCACCCAAGCACTTCCCGTGACGCTTTAAGATCTGTTCATAGAGAGCTCGAAATTATGGCTTTCTTTGCT
TCTCCTCAGCACTTCCCTCAGTCCTTTCCCAACCATTAGACCATAGCTCCTCTCTTAATTTAACTTAACCCCACATCTCCCAGGGCTCAGCCCAGTCCCATCCATCAATT
TTTTGTCTTTCTCCTCCCTTAAGATTAATGACCCATTACCCAGATCTAGGCCACTGATTCTGCTCAGCTCTATAAAGCAGGAGCTCCTGCTGCTACTGCTGCAGACGTGG
TGTTCGATAAAATGCCAAGTACCGAATCGTTTCTGAGGCAGATAAGTACAAGGGAGGGCTCCAAATCTACATCTAGGAGGTGGGGTGGCGAGGGAAGCCATTGGGGCCAG
AAGATGGAGGGCGGTGTTAACAACTTGTATGGGATCGAGAGCGGCGGAATGTCTAGGAGGAAGAGGGTGATGGTGGTGGTGGATCAGACTTCACAGGCCAACCATGCAAC
CATGTGGGCGCTAACTCATGTGGCTAACAAGGGCGATGTTCTTACTCTTCTTCATGTCATCCCTACCTCCTCTGCCCACTCTTCTTCTTCTGCAGATTCTTCTTCCTCTT
CCTCTTTCTCTGCTACTTCTCTTGGTTCCCTCTGCAAGGCTTCTAGACCTGAGGTGGAGGTGGAAGTGCTGGTGATCCAGGGGCCAAAGCTGGCCACAGTGATGAATCAA
GTGAAGAAGTTGGAGGTGTCGGTGCTGGTTCTGGGACAGAGAAGACCATCGTTTCTCAGTTGCTTTCGTGGAAGTGGTGGGGCGGGAGATCTGGTGGATCATTGCATAAA
TAACGCAGAGTGCTTGACCATTGGAGTGAGAAAGCAGAGCAGAGACATGGGTGGCTATGTAATCAACACCAGATGGCAGAAGAACTTCTGGCTTCTGGCTTAGTTGGTTT
AGTTAGAGAGTTACGACAGAAGCCTGACCCTTGGAAGTTCATAATGAATTGCAACTTTTTAGCTAGCTCGATGGGAATTCAGAATAAGTGTTCACATCATGTGAACTATC
TCTTTGCACCTTAGAAAACAGAGCTTGGAAGTTGAGAAAATTATTCTAATTCCATTCAATGTTGGAAGGTCCTGGATTGCATTCTTCTTCTTTTTGGTTTCTATTAAGCC
CACTGAAAATGGATCTATACCTTTTGGGTGAACAGATGCAGCACCAAATTGGCGTTCCCTATTCAAAAGGGATATTTAACGCAGTATGAGCATGAATCTAACATCCTAAT
GTTTTCAGATCACTCAAACTACCTACTGAAAAAAAAAGGTGCTAGATCTTCGAGCCAATTCTTAATGACGCAATCTAGAAAAAGGGAGAGTGAAGTCAAACCAACTTATT
GATGTCCAACCACTTAATCATTTATAAGAGTTAGTAATGTATGACCATTGTAGACGTACGGTTTGAAATCATATGGTTACAATATTCATTGATAATGTTTCTATTTTTTG
TTTTTTGTTTTTATTTTTCTTTAGAAATCTTAGCATTTTGCAGTTATGTTTTTTGTTTTTCATGTTTTAGGAATATTTTCAAAATTATATAGAAATTTTGGAACCATGTT
TTTTTTTTTCCAATTTCAATTTGTTCCAAAACACAAACAAGAAAATTATTTTATCAC
Protein sequenceShow/hide protein sequence
MPSTESFLRQISTREGSKSTSRRWGGEGSHWGQKMEGGVNNLYGIESGGMSRRKRVMVVVDQTSQANHATMWALTHVANKGDVLTLLHVIPTSSAHSSSSADSSSSSSFS
ATSLGSLCKASRPEVEVEVLVIQGPKLATVMNQVKKLEVSVLVLGQRRPSFLSCFRGSGGAGDLVDHCINNAECLTIGVRKQSRDMGGYVINTRWQKNFWLLA