; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc04g31200 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc04g31200
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUsp domain-containing protein
Genome locationchr4:23420032..23424755
RNA-Seq ExpressionMoc04g31200
SyntenyMoc04g31200
Gene Ontology termsNA
InterPro domainsIPR006016 - UspA
IPR014729 - Rossmann-like alpha/beta/alpha sandwich fold


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6578971.1 hypothetical protein SDJN03_23419, partial [Cucurbita argyrosperma subsp. sororia]1.1e-9382.17Show/hide
Query:  VFDKMPSTESFLRQISTREGSKSTSRRWGG------------EGSHWGQKMEGGVNNLYGIESGGMSRRKRVMVVVDQTSQANHATMWALTHVANKGDVL
        VF+KMPSTESFLRQIS REGS+S SRRWGG            EG  W QKMEGGVNN+YGI++GGMSR+KRVMVVVDQTSQ+NHATMWALTHVANKGDVL
Subjt:  VFDKMPSTESFLRQISTREGSKSTSRRWGG------------EGSHWGQKMEGGVNNLYGIESGGMSRRKRVMVVVDQTSQANHATMWALTHVANKGDVL

Query:  TLLHVIPTSSAHSSSSADSSSSSSFSATSLGSLCKASRPEVEVEVLVIQGPKLATVMNQVKKLEVSVLVLGQRRP-SFLSCFRGSGGAGDLVDHCINNAE
        TLLHVI T S  S + + SSSSSSF ATSLGSLCKASRPEVEVEVLV++GPKLATVMNQVKKLEVSVLVLGQRRP SF SCF GSGGAGDLV+ CINNAE
Subjt:  TLLHVIPTSSAHSSSSADSSSSSSFSATSLGSLCKASRPEVEVEVLVIQGPKLATVMNQVKKLEVSVLVLGQRRP-SFLSCFRGSGGAGDLVDHCINNAE

Query:  CLTIGVRKQSRDMGGYVINTRWQKNFWLLA
        CLTIGVRKQSRDMGGYVINTRWQKNFWLLA
Subjt:  CLTIGVRKQSRDMGGYVINTRWQKNFWLLA

KAG7016492.1 hypothetical protein SDJN02_21601, partial [Cucurbita argyrosperma subsp. argyrosperma]8.5e-9181.86Show/hide
Query:  MPSTESFLRQISTREGSKSTSRRWGG------------EGSHWGQKMEGGVNNLYGIESGGMSRRKRVMVVVDQTSQANHATMWALTHVANKGDVLTLLH
        MPSTESFLRQIS RE S+S SRRWGG            EG  W QKMEGGVNN+YGI++GGMSR+KRVMVVVDQTSQ+NHATMWALTHVANKGDVLTLLH
Subjt:  MPSTESFLRQISTREGSKSTSRRWGG------------EGSHWGQKMEGGVNNLYGIESGGMSRRKRVMVVVDQTSQANHATMWALTHVANKGDVLTLLH

Query:  VIPTSSAHSSSSADSSSSSSFSATSLGSLCKASRPEVEVEVLVIQGPKLATVMNQVKKLEVSVLVLGQRRP-SFLSCFRGSGGAGDLVDHCINNAECLTI
        VI T S  S + + SSSSSSF ATSLGSLCKASRPEVEVEVLV++GPKLATVMNQVKKLEVSVLVLGQRRP SF SCF GSGGAGDLV+ CINNAECLTI
Subjt:  VIPTSSAHSSSSADSSSSSSFSATSLGSLCKASRPEVEVEVLVIQGPKLATVMNQVKKLEVSVLVLGQRRP-SFLSCFRGSGGAGDLVDHCINNAECLTI

Query:  GVRKQSRDMGGYVINTRWQKNFWLLA
        GVRKQSRDMGGYVINTRWQKNFWLLA
Subjt:  GVRKQSRDMGGYVINTRWQKNFWLLA

XP_022141362.1 universal stress protein PHOS32 [Momordica charantia]3.9e-112100Show/hide
Query:  MPSTESFLRQISTREGSKSTSRRWGGEGSHWGQKMEGGVNNLYGIESGGMSRRKRVMVVVDQTSQANHATMWALTHVANKGDVLTLLHVIPTSSAHSSSS
        MPSTESFLRQISTREGSKSTSRRWGGEGSHWGQKMEGGVNNLYGIESGGMSRRKRVMVVVDQTSQANHATMWALTHVANKGDVLTLLHVIPTSSAHSSSS
Subjt:  MPSTESFLRQISTREGSKSTSRRWGGEGSHWGQKMEGGVNNLYGIESGGMSRRKRVMVVVDQTSQANHATMWALTHVANKGDVLTLLHVIPTSSAHSSSS

Query:  ADSSSSSSFSATSLGSLCKASRPEVEVEVLVIQGPKLATVMNQVKKLEVSVLVLGQRRPSFLSCFRGSGGAGDLVDHCINNAECLTIGVRKQSRDMGGYV
        ADSSSSSSFSATSLGSLCKASRPEVEVEVLVIQGPKLATVMNQVKKLEVSVLVLGQRRPSFLSCFRGSGGAGDLVDHCINNAECLTIGVRKQSRDMGGYV
Subjt:  ADSSSSSSFSATSLGSLCKASRPEVEVEVLVIQGPKLATVMNQVKKLEVSVLVLGQRRPSFLSCFRGSGGAGDLVDHCINNAECLTIGVRKQSRDMGGYV

Query:  INTRWQKNFWLLA
        INTRWQKNFWLLA
Subjt:  INTRWQKNFWLLA

XP_022993828.1 uncharacterized protein LOC111489724 [Cucurbita maxima]5.9e-9283.78Show/hide
Query:  MPSTESFLRQISTREGSKSTSRRWGG--------EGSHWGQKMEGGVNNLYGIESGGMSRRKRVMVVVDQTSQANHATMWALTHVANKGDVLTLLHVIPT
        MPSTESFLRQIS REGS+S SRRWGG        EG  W QKMEGGVNN+YGI++GGMSR+KRVMVVVDQTSQ+NHATMWALTHVANKGDVLTLLHVI  
Subjt:  MPSTESFLRQISTREGSKSTSRRWGG--------EGSHWGQKMEGGVNNLYGIESGGMSRRKRVMVVVDQTSQANHATMWALTHVANKGDVLTLLHVIPT

Query:  SSAHSSSSADSSSSSSFSATSLGSLCKASRPEVEVEVLVIQGPKLATVMNQVKKLEVSVLVLGQRRP-SFLSCFRGSGGAGDLVDHCINNAECLTIGVRK
        S+  S+ S  SSSSSSF ATSLGSLCKASRPEVEVEVLV++GPKLATVMNQVKKLEVSVLVLGQRRP SF SCF GSGGAGDLV+ CINNAECLTIGVRK
Subjt:  SSAHSSSSADSSSSSSFSATSLGSLCKASRPEVEVEVLVIQGPKLATVMNQVKKLEVSVLVLGQRRP-SFLSCFRGSGGAGDLVDHCINNAECLTIGVRK

Query:  QSRDMGGYVINTRWQKNFWLLA
        QSRDMGGYVINTRWQKNFWLLA
Subjt:  QSRDMGGYVINTRWQKNFWLLA

XP_023550879.1 uncharacterized protein LOC111808882 [Cucurbita pepo subsp. pepo]3.5e-9282.38Show/hide
Query:  MPSTESFLRQISTREGSKSTSRRWGG------------EGSHWGQKMEGGVNNLYGIESGGMSRRKRVMVVVDQTSQANHATMWALTHVANKGDVLTLLH
        MPSTESFLRQIS REGS+S SRRWGG            EG  W QKMEGGVNN+YGI++GGMSR+KRVMVVVDQTSQ+NHATMWALTHVANKGDVLTLLH
Subjt:  MPSTESFLRQISTREGSKSTSRRWGG------------EGSHWGQKMEGGVNNLYGIESGGMSRRKRVMVVVDQTSQANHATMWALTHVANKGDVLTLLH

Query:  VIPTS-SAHSSSSADSSSSSSFSATSLGSLCKASRPEVEVEVLVIQGPKLATVMNQVKKLEVSVLVLGQRRP-SFLSCFRGSGGAGDLVDHCINNAECLT
        VI  S S+ + SS+ SSSSSSF ATSLGSLCKASRPEVEVEVLV++GPKLATVMNQVKKLEVSVLVLGQRRP SF SCF GSGGAGDLV+ CINNAECLT
Subjt:  VIPTS-SAHSSSSADSSSSSSFSATSLGSLCKASRPEVEVEVLVIQGPKLATVMNQVKKLEVSVLVLGQRRP-SFLSCFRGSGGAGDLVDHCINNAECLT

Query:  IGVRKQSRDMGGYVINTRWQKNFWLLA
        IGVRKQSRDMGGYVINTRWQKNFWLLA
Subjt:  IGVRKQSRDMGGYVINTRWQKNFWLLA

TrEMBL top hitse value%identityAlignment
A0A1S3CRX0 uncharacterized protein LOC1035040533.9e-8980.35Show/hide
Query:  MPSTESFLRQISTREG---SKSTSRRWGG------------EGSHWGQKME-GGVNNLYGIESGGMSRRKRVMVVVDQTSQANHATMWALTHVANKGDVL
        MPSTESFLRQIS+R G   S+STSRRWGG            EGS W QKME GGVN++YGI++GGMSRRKRVMVVVD TSQ++HATMWALTH+ANKGDVL
Subjt:  MPSTESFLRQISTREG---SKSTSRRWGG------------EGSHWGQKME-GGVNNLYGIESGGMSRRKRVMVVVDQTSQANHATMWALTHVANKGDVL

Query:  TLLHVIPTSSAHSSSSADSSSSSSFSATSLGSLCKASRPEVEVEVLVIQGPKLATVMNQVKKLEVSVLVLGQRRPSFLSCFRGSGGAGDLVDHCINNAEC
        TLLHVI  SS  SSSSA + SSSSF A+SLGSLCKASRPEVEVEVLVI+GPKLATVMNQVKKLEVSVLV+GQRRPS  SCF GSGGAGDLV+ CINNAEC
Subjt:  TLLHVIPTSSAHSSSSADSSSSSSFSATSLGSLCKASRPEVEVEVLVIQGPKLATVMNQVKKLEVSVLVLGQRRPSFLSCFRGSGGAGDLVDHCINNAEC

Query:  LTIGVRKQSRDMGGYVINTRWQKNFWLLA
        LTIGVRKQSRDMGGYVINTRWQKNFWLLA
Subjt:  LTIGVRKQSRDMGGYVINTRWQKNFWLLA

A0A6J1CJN1 universal stress protein PHOS321.9e-112100Show/hide
Query:  MPSTESFLRQISTREGSKSTSRRWGGEGSHWGQKMEGGVNNLYGIESGGMSRRKRVMVVVDQTSQANHATMWALTHVANKGDVLTLLHVIPTSSAHSSSS
        MPSTESFLRQISTREGSKSTSRRWGGEGSHWGQKMEGGVNNLYGIESGGMSRRKRVMVVVDQTSQANHATMWALTHVANKGDVLTLLHVIPTSSAHSSSS
Subjt:  MPSTESFLRQISTREGSKSTSRRWGGEGSHWGQKMEGGVNNLYGIESGGMSRRKRVMVVVDQTSQANHATMWALTHVANKGDVLTLLHVIPTSSAHSSSS

Query:  ADSSSSSSFSATSLGSLCKASRPEVEVEVLVIQGPKLATVMNQVKKLEVSVLVLGQRRPSFLSCFRGSGGAGDLVDHCINNAECLTIGVRKQSRDMGGYV
        ADSSSSSSFSATSLGSLCKASRPEVEVEVLVIQGPKLATVMNQVKKLEVSVLVLGQRRPSFLSCFRGSGGAGDLVDHCINNAECLTIGVRKQSRDMGGYV
Subjt:  ADSSSSSSFSATSLGSLCKASRPEVEVEVLVIQGPKLATVMNQVKKLEVSVLVLGQRRPSFLSCFRGSGGAGDLVDHCINNAECLTIGVRKQSRDMGGYV

Query:  INTRWQKNFWLLA
        INTRWQKNFWLLA
Subjt:  INTRWQKNFWLLA

A0A6J1FKL3 uncharacterized protein LOC1114448693.5e-9081.42Show/hide
Query:  MPSTESFLRQISTREGSKSTSRRWGG------------EGSHWGQKMEGGVNNLYGIESGGMSRRKRVMVVVDQTSQANHATMWALTHVANKGDVLTLLH
        M STESFLRQIS REGS+S SRRWGG            EG  W QKMEGGVNN+YGI++GGMSR+KRVMVVVDQTSQ+NHATMWALTHVANKGDVLTLLH
Subjt:  MPSTESFLRQISTREGSKSTSRRWGG------------EGSHWGQKMEGGVNNLYGIESGGMSRRKRVMVVVDQTSQANHATMWALTHVANKGDVLTLLH

Query:  VIPTSSAHSSSSADSSSSSSFSATSLGSLCKASRPEVEVEVLVIQGPKLATVMNQVKKLEVSVLVLGQRRP-SFLSCFRGSGGAGDLVDHCINNAECLTI
        VI  S+  SS +  SSSSSSF ATSLGS+CKASRPEVEVEVLV++GPKLATVMNQVKKLEVSVLVLGQRRP SF SCF GSGGAGDLV+ CINNAECLTI
Subjt:  VIPTSSAHSSSSADSSSSSSFSATSLGSLCKASRPEVEVEVLVIQGPKLATVMNQVKKLEVSVLVLGQRRP-SFLSCFRGSGGAGDLVDHCINNAECLTI

Query:  GVRKQSRDMGGYVINTRWQKNFWLLA
        GVRKQSRDMGGYVINTRWQKNFWLLA
Subjt:  GVRKQSRDMGGYVINTRWQKNFWLLA

A0A6J1JPP1 uncharacterized protein LOC1114877675.0e-8980.09Show/hide
Query:  MPSTESFLRQISTREGSKSTSRRWGG-------------EGSHWGQKMEGGVNNLYGIESGGMSRRKRVMVVVDQTSQANHATMWALTHVANKGDVLTLL
        MP  ESFLRQIS  EGS+STS+RWGG             EGSHW +KMEGGV N+YG++ GG+SRRKRVMVVVD TSQ+NHATMWALTHVANKGDVLTLL
Subjt:  MPSTESFLRQISTREGSKSTSRRWGG-------------EGSHWGQKMEGGVNNLYGIESGGMSRRKRVMVVVDQTSQANHATMWALTHVANKGDVLTLL

Query:  HVIPTSSAHSSSSADSSSSSSFSATSLGSLCKASRPEVEVEVLVIQGPKLATVMNQVKKLEVSVLVLGQRRPSFLSCFRGSGGAGDLVDHCINNAECLTI
        HVI  SS  SS    SSSSS F ATSLGSLCKASRPEVEVEVLVI+GPKL TVMNQVKKLEVSVLVLGQRRPSFLSCF GSGGAGDLV+ CINNAECLTI
Subjt:  HVIPTSSAHSSSSADSSSSSSFSATSLGSLCKASRPEVEVEVLVIQGPKLATVMNQVKKLEVSVLVLGQRRPSFLSCFRGSGGAGDLVDHCINNAECLTI

Query:  GVRKQSRDMGGYVINTRWQKNFWLLA
        GVRKQSRDMGGYVINTRWQ+NFWLLA
Subjt:  GVRKQSRDMGGYVINTRWQKNFWLLA

A0A6J1K3E8 uncharacterized protein LOC1114897242.9e-9283.78Show/hide
Query:  MPSTESFLRQISTREGSKSTSRRWGG--------EGSHWGQKMEGGVNNLYGIESGGMSRRKRVMVVVDQTSQANHATMWALTHVANKGDVLTLLHVIPT
        MPSTESFLRQIS REGS+S SRRWGG        EG  W QKMEGGVNN+YGI++GGMSR+KRVMVVVDQTSQ+NHATMWALTHVANKGDVLTLLHVI  
Subjt:  MPSTESFLRQISTREGSKSTSRRWGG--------EGSHWGQKMEGGVNNLYGIESGGMSRRKRVMVVVDQTSQANHATMWALTHVANKGDVLTLLHVIPT

Query:  SSAHSSSSADSSSSSSFSATSLGSLCKASRPEVEVEVLVIQGPKLATVMNQVKKLEVSVLVLGQRRP-SFLSCFRGSGGAGDLVDHCINNAECLTIGVRK
        S+  S+ S  SSSSSSF ATSLGSLCKASRPEVEVEVLV++GPKLATVMNQVKKLEVSVLVLGQRRP SF SCF GSGGAGDLV+ CINNAECLTIGVRK
Subjt:  SSAHSSSSADSSSSSSFSATSLGSLCKASRPEVEVEVLVIQGPKLATVMNQVKKLEVSVLVLGQRRP-SFLSCFRGSGGAGDLVDHCINNAECLTIGVRK

Query:  QSRDMGGYVINTRWQKNFWLLA
        QSRDMGGYVINTRWQKNFWLLA
Subjt:  QSRDMGGYVINTRWQKNFWLLA

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G44760.1 Adenine nucleotide alpha hydrolases-like superfamily protein1.3e-5756.36Show/hide
Query:  STESFLRQISTREGSKSTSRRW--GGEGSHWGQKMEGG----VNNLYGIESGG--MSRRKRVMVVVDQTSQANHATMWALTHVANKGDVLTLLHVIPTSS
        S  S LRQ+S +EG +S S+RW  G   + +     GG    +  LYG+ SGG   +R KRVMVVVD++S++ HA MWALTH+ NKGD++TLLHV+    
Subjt:  STESFLRQISTREGSKSTSRRW--GGEGSHWGQKMEGG----VNNLYGIESGG--MSRRKRVMVVVDQTSQANHATMWALTHVANKGDVLTLLHVIPTSS

Query:  AHSSSSADSSSSSSFSATSLGSLCKASRPEVEVEVLVIQGPKLATVMNQVKKLEVSVLVLGQRRPS-FLSCFRGSGGAGDLVDHCINNAECLTIGVRKQS
             S D  ++ S  A SLGSLCKA +PEV+VE LVIQGPKLATV++QVKKLEVSVLVLGQ++ +  +SC  G   + +LV+ CIN A+CLTIGVRKQ 
Subjt:  AHSSSSADSSSSSSFSATSLGSLCKASRPEVEVEVLVIQGPKLATVMNQVKKLEVSVLVLGQRRPS-FLSCFRGSGGAGDLVDHCINNAECLTIGVRKQS

Query:  RDMGGYVINTRWQKNFWLLA
        + +GGY+INTRWQKNFWLLA
Subjt:  RDMGGYVINTRWQKNFWLLA

AT1G69080.1 Adenine nucleotide alpha hydrolases-like superfamily protein1.5e-2132.78Show/hide
Query:  KRVMVVVDQTSQANHATMWALTHVANKGDVLTLLHVIPTSSAHSSSSAD------------SSSSSSFSATSLGSLCKASRPEVEVEVLVIQG-PKLATV
        +R++VVVD  S+A +A +W L+H A   D + LLH +   ++ S   A+            ++S +    ++L ++C+  RPEV+ EV+ ++G  K  T+
Subjt:  KRVMVVVDQTSQANHATMWALTHVANKGDVLTLLHVIPTSSAHSSSSAD------------SSSSSSFSATSLGSLCKASRPEVEVEVLVIQG-PKLATV

Query:  MNQVKKLEVSVLVLGQRRP----SFLSCFRGSG---GAGDLVDHCINNAECLTIGVRKQSRDMGGYVINTRWQKNFWLLA
        + + ++ E S+LVLGQ++       L  +          D V++CINN+ C+ I VRK+ + +GGY + T+  K+FWLLA
Subjt:  MNQVKKLEVSVLVLGQRRP----SFLSCFRGSG---GAGDLVDHCINNAECLTIGVRKQSRDMGGYVINTRWQKNFWLLA

AT1G69080.2 Adenine nucleotide alpha hydrolases-like superfamily protein6.0e-1832.14Show/hide
Query:  KRVMVVVDQTSQANHATMWALTHVANKGDVLTLLHVIPTSSAHSSSSADSSSSSSFSATSLGSLCKASRPEVEVEVLVIQG-PKLATVMNQVKKLEVSVL
        +R++VVVD  S+A +A +W L+H A   D + LLH +   ++ S   A+       S          +  +V+ EV+ ++G  K  T++ + ++ E S+L
Subjt:  KRVMVVVDQTSQANHATMWALTHVANKGDVLTLLHVIPTSSAHSSSSADSSSSSSFSATSLGSLCKASRPEVEVEVLVIQG-PKLATVMNQVKKLEVSVL

Query:  VLGQRRP----SFLSCFRGSG---GAGDLVDHCINNAECLTIGVRKQSRDMGGYVINTRWQKNFWLLA
        VLGQ++       L  +          D V++CINN+ C+ I VRK+ + +GGY + T+  K+FWLLA
Subjt:  VLGQRRP----SFLSCFRGSG---GAGDLVDHCINNAECLTIGVRKQSRDMGGYVINTRWQKNFWLLA

AT2G03720.1 Adenine nucleotide alpha hydrolases-like superfamily protein6.9e-2239.16Show/hide
Query:  MVVVDQTSQANHATMWALTHVANKGDVLTLLHV--IPTSSAHSSSSADSSSSSSFSATSLGSLCKASRPEVEVEVLVIQ--GPKLATVMNQVKKLEVSVL
        MVVVD TSQ  +A  WALTH     D +TLLHV   P   A   +  + +S +      L + C+  +P V+ E++V++    K  T++ + KK    VL
Subjt:  MVVVDQTSQANHATMWALTHVANKGDVLTLLHV--IPTSSAHSSSSADSSSSSSFSATSLGSLCKASRPEVEVEVLVIQ--GPKLATVMNQVKKLEVSVL

Query:  VLGQRRPS----FLSCFRGSGG-AGDLVDHCINNAECLTIGVRKQSRDMGGYVINTRWQKNFWLLA
        VLGQR+ +     +  +R  GG  G +V++CI+N++C+ I VRK+S + GGY+I T+  K+FWLLA
Subjt:  VLGQRRPS----FLSCFRGSGG-AGDLVDHCINNAECLTIGVRKQSRDMGGYVINTRWQKNFWLLA

AT5G17390.1 Adenine nucleotide alpha hydrolases-like superfamily protein1.3e-1733.33Show/hide
Query:  RVMVVVDQTSQANHATMWALTHVANKGDVLTLLHVIPTSSAHSSSSADSSSSSSFSATSLGSLCKASRPEVEVEVLVIQG---PKLATVMNQVKKLEVSV
        RVMVVVD+   +  A  WA+TH     D L LL+           +      +     +L  LC+  RP +EVE+  ++G    K   ++ + KK +VS+
Subjt:  RVMVVVDQTSQANHATMWALTHVANKGDVLTLLHVIPTSSAHSSSSADSSSSSSFSATSLGSLCKASRPEVEVEVLVIQG---PKLATVMNQVKKLEVSV

Query:  LVLGQ-RRPSFLS-----CFRGSGGAGDLVDHCINNAECLTIGVRKQSRDMGGYVINTRWQKNFWLLA
        LV+GQ ++P          ++   G   ++ +C+ NA C+TI V+ ++R +GGY+I T+  KNFWLLA
Subjt:  LVLGQ-RRPSFLS-----CFRGSGGAGDLVDHCINNAECLTIGVRKQSRDMGGYVINTRWQKNFWLLA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCGCGGAACGCTAATTTGTCCCTACTTGCAAATTGTTTTGCCACTGATTCTGCTCAGCTCTATAAAGCAGGAGCTCCTGCTGCTACTGCTGCAGACGTGGTGTTCGA
TAAAATGCCAAGTACCGAATCGTTTCTGAGGCAGATAAGTACAAGGGAGGGCTCCAAATCTACATCTAGGAGGTGGGGTGGCGAGGGAAGCCATTGGGGCCAGAAGATGG
AGGGCGGTGTTAACAACTTGTATGGGATCGAGAGCGGCGGAATGTCTAGGAGGAAGAGGGTGATGGTGGTGGTGGATCAGACTTCACAGGCCAACCATGCAACCATGTGG
GCGCTAACTCATGTGGCTAACAAGGGCGATGTTCTTACTCTTCTTCATGTCATCCCTACCTCCTCTGCCCACTCTTCTTCTTCTGCAGATTCTTCTTCCTCTTCCTCTTT
CTCTGCTACTTCTCTTGGTTCCCTCTGCAAGGCTTCTAGACCTGAGGTGGAGGTGGAAGTGCTGGTGATCCAGGGGCCAAAGCTGGCCACAGTGATGAATCAAGTGAAGA
AGTTGGAGGTGTCGGTGCTGGTTCTGGGACAGAGAAGACCATCGTTTCTCAGTTGCTTTCGTGGAAGTGGTGGGGCGGGAGATCTGGTGGATCATTGCATAAATAACGCA
GAGTGCTTGACCATTGGAGTGAGAAAGCAGAGCAGAGACATGGGTGGCTATGTAATCAACACCAGATGGCAGAAGAACTTCTGGCTTCTGGCTTAG
mRNA sequenceShow/hide mRNA sequence
ATGCCGCGGAACGCTAATTTGTCCCTACTTGCAAATTGTTTTGCCACTGATTCTGCTCAGCTCTATAAAGCAGGAGCTCCTGCTGCTACTGCTGCAGACGTGGTGTTCGA
TAAAATGCCAAGTACCGAATCGTTTCTGAGGCAGATAAGTACAAGGGAGGGCTCCAAATCTACATCTAGGAGGTGGGGTGGCGAGGGAAGCCATTGGGGCCAGAAGATGG
AGGGCGGTGTTAACAACTTGTATGGGATCGAGAGCGGCGGAATGTCTAGGAGGAAGAGGGTGATGGTGGTGGTGGATCAGACTTCACAGGCCAACCATGCAACCATGTGG
GCGCTAACTCATGTGGCTAACAAGGGCGATGTTCTTACTCTTCTTCATGTCATCCCTACCTCCTCTGCCCACTCTTCTTCTTCTGCAGATTCTTCTTCCTCTTCCTCTTT
CTCTGCTACTTCTCTTGGTTCCCTCTGCAAGGCTTCTAGACCTGAGGTGGAGGTGGAAGTGCTGGTGATCCAGGGGCCAAAGCTGGCCACAGTGATGAATCAAGTGAAGA
AGTTGGAGGTGTCGGTGCTGGTTCTGGGACAGAGAAGACCATCGTTTCTCAGTTGCTTTCGTGGAAGTGGTGGGGCGGGAGATCTGGTGGATCATTGCATAAATAACGCA
GAGTGCTTGACCATTGGAGTGAGAAAGCAGAGCAGAGACATGGGTGGCTATGTAATCAACACCAGATGGCAGAAGAACTTCTGGCTTCTGGCTTAG
Protein sequenceShow/hide protein sequence
MPRNANLSLLANCFATDSAQLYKAGAPAATAADVVFDKMPSTESFLRQISTREGSKSTSRRWGGEGSHWGQKMEGGVNNLYGIESGGMSRRKRVMVVVDQTSQANHATMW
ALTHVANKGDVLTLLHVIPTSSAHSSSSADSSSSSSFSATSLGSLCKASRPEVEVEVLVIQGPKLATVMNQVKKLEVSVLVLGQRRPSFLSCFRGSGGAGDLVDHCINNA
ECLTIGVRKQSRDMGGYVINTRWQKNFWLLA