; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS015681 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS015681
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionUsp domain-containing protein
Genome locationscaffold983:230104..231041
RNA-Seq ExpressionMS015681
SyntenyMS015681
Gene Ontology termsNA
InterPro domainsIPR006016 - UspA
IPR014729 - Rossmann-like alpha/beta/alpha sandwich fold


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6578971.1 hypothetical protein SDJN03_23419, partial [Cucurbita argyrosperma subsp. sororia]5.9e-9382.74Show/hide
Query:  MPSTESFLRQISTREGSKSTSRRWGG------------EGSHWGQKMEGGVNNLYGIESGGMSRRKRVMVVVDQTSQANHATMWALTHVANKGDVLTLLH
        MPSTESFLRQIS REGS+S SRRWGG            EG  W QKMEGGVNN+YGI++GGMSR+KRVMVVVDQTSQ+NHATMWALTHVANKGDVLTLLH
Subjt:  MPSTESFLRQISTREGSKSTSRRWGG------------EGSHWGQKMEGGVNNLYGIESGGMSRRKRVMVVVDQTSQANHATMWALTHVANKGDVLTLLH

Query:  VIPTSSAHSSSSADSSSSSSFSATSLGSLCKASRPEVEVEVLVIQGPKLATVMNQVKKLEVSVLVLGQRRP-SFLSCFCGSGGAGDLVDHCINNAECLTI
        VI T S  S + + SSSSSSF ATSLGSLCKASRPEVEVEVLV++GPKLATVMNQVKKLEVSVLVLGQRRP SF SCFCGSGGAGDLV+ CINNAECLTI
Subjt:  VIPTSSAHSSSSADSSSSSSFSATSLGSLCKASRPEVEVEVLVIQGPKLATVMNQVKKLEVSVLVLGQRRP-SFLSCFCGSGGAGDLVDHCINNAECLTI

Query:  GVRKQSRDMGGYVINTRWQKNFWLLA
        GVRKQSRDMGGYVINTRWQKNFWLLA
Subjt:  GVRKQSRDMGGYVINTRWQKNFWLLA

KAG7016492.1 hypothetical protein SDJN02_21601, partial [Cucurbita argyrosperma subsp. argyrosperma]5.0e-9282.3Show/hide
Query:  MPSTESFLRQISTREGSKSTSRRWGG------------EGSHWGQKMEGGVNNLYGIESGGMSRRKRVMVVVDQTSQANHATMWALTHVANKGDVLTLLH
        MPSTESFLRQIS RE S+S SRRWGG            EG  W QKMEGGVNN+YGI++GGMSR+KRVMVVVDQTSQ+NHATMWALTHVANKGDVLTLLH
Subjt:  MPSTESFLRQISTREGSKSTSRRWGG------------EGSHWGQKMEGGVNNLYGIESGGMSRRKRVMVVVDQTSQANHATMWALTHVANKGDVLTLLH

Query:  VIPTSSAHSSSSADSSSSSSFSATSLGSLCKASRPEVEVEVLVIQGPKLATVMNQVKKLEVSVLVLGQRRP-SFLSCFCGSGGAGDLVDHCINNAECLTI
        VI T S  S + + SSSSSSF ATSLGSLCKASRPEVEVEVLV++GPKLATVMNQVKKLEVSVLVLGQRRP SF SCFCGSGGAGDLV+ CINNAECLTI
Subjt:  VIPTSSAHSSSSADSSSSSSFSATSLGSLCKASRPEVEVEVLVIQGPKLATVMNQVKKLEVSVLVLGQRRP-SFLSCFCGSGGAGDLVDHCINNAECLTI

Query:  GVRKQSRDMGGYVINTRWQKNFWLLA
        GVRKQSRDMGGYVINTRWQKNFWLLA
Subjt:  GVRKQSRDMGGYVINTRWQKNFWLLA

XP_022141362.1 universal stress protein PHOS32 [Momordica charantia]4.8e-11199.53Show/hide
Query:  MPSTESFLRQISTREGSKSTSRRWGGEGSHWGQKMEGGVNNLYGIESGGMSRRKRVMVVVDQTSQANHATMWALTHVANKGDVLTLLHVIPTSSAHSSSS
        MPSTESFLRQISTREGSKSTSRRWGGEGSHWGQKMEGGVNNLYGIESGGMSRRKRVMVVVDQTSQANHATMWALTHVANKGDVLTLLHVIPTSSAHSSSS
Subjt:  MPSTESFLRQISTREGSKSTSRRWGGEGSHWGQKMEGGVNNLYGIESGGMSRRKRVMVVVDQTSQANHATMWALTHVANKGDVLTLLHVIPTSSAHSSSS

Query:  ADSSSSSSFSATSLGSLCKASRPEVEVEVLVIQGPKLATVMNQVKKLEVSVLVLGQRRPSFLSCFCGSGGAGDLVDHCINNAECLTIGVRKQSRDMGGYV
        ADSSSSSSFSATSLGSLCKASRPEVEVEVLVIQGPKLATVMNQVKKLEVSVLVLGQRRPSFLSCF GSGGAGDLVDHCINNAECLTIGVRKQSRDMGGYV
Subjt:  ADSSSSSSFSATSLGSLCKASRPEVEVEVLVIQGPKLATVMNQVKKLEVSVLVLGQRRPSFLSCFCGSGGAGDLVDHCINNAECLTIGVRKQSRDMGGYV

Query:  INTRWQKNFWLLA
        INTRWQKNFWLLA
Subjt:  INTRWQKNFWLLA

XP_022993828.1 uncharacterized protein LOC111489724 [Cucurbita maxima]3.5e-9384.23Show/hide
Query:  MPSTESFLRQISTREGSKSTSRRWGG--------EGSHWGQKMEGGVNNLYGIESGGMSRRKRVMVVVDQTSQANHATMWALTHVANKGDVLTLLHVIPT
        MPSTESFLRQIS REGS+S SRRWGG        EG  W QKMEGGVNN+YGI++GGMSR+KRVMVVVDQTSQ+NHATMWALTHVANKGDVLTLLHVI  
Subjt:  MPSTESFLRQISTREGSKSTSRRWGG--------EGSHWGQKMEGGVNNLYGIESGGMSRRKRVMVVVDQTSQANHATMWALTHVANKGDVLTLLHVIPT

Query:  SSAHSSSSADSSSSSSFSATSLGSLCKASRPEVEVEVLVIQGPKLATVMNQVKKLEVSVLVLGQRRP-SFLSCFCGSGGAGDLVDHCINNAECLTIGVRK
        S+  S+ S  SSSSSSF ATSLGSLCKASRPEVEVEVLV++GPKLATVMNQVKKLEVSVLVLGQRRP SF SCFCGSGGAGDLV+ CINNAECLTIGVRK
Subjt:  SSAHSSSSADSSSSSSFSATSLGSLCKASRPEVEVEVLVIQGPKLATVMNQVKKLEVSVLVLGQRRP-SFLSCFCGSGGAGDLVDHCINNAECLTIGVRK

Query:  QSRDMGGYVINTRWQKNFWLLA
        QSRDMGGYVINTRWQKNFWLLA
Subjt:  QSRDMGGYVINTRWQKNFWLLA

XP_023550879.1 uncharacterized protein LOC111808882 [Cucurbita pepo subsp. pepo]2.0e-9382.82Show/hide
Query:  MPSTESFLRQISTREGSKSTSRRWGG------------EGSHWGQKMEGGVNNLYGIESGGMSRRKRVMVVVDQTSQANHATMWALTHVANKGDVLTLLH
        MPSTESFLRQIS REGS+S SRRWGG            EG  W QKMEGGVNN+YGI++GGMSR+KRVMVVVDQTSQ+NHATMWALTHVANKGDVLTLLH
Subjt:  MPSTESFLRQISTREGSKSTSRRWGG------------EGSHWGQKMEGGVNNLYGIESGGMSRRKRVMVVVDQTSQANHATMWALTHVANKGDVLTLLH

Query:  VIPTS-SAHSSSSADSSSSSSFSATSLGSLCKASRPEVEVEVLVIQGPKLATVMNQVKKLEVSVLVLGQRRP-SFLSCFCGSGGAGDLVDHCINNAECLT
        VI  S S+ + SS+ SSSSSSF ATSLGSLCKASRPEVEVEVLV++GPKLATVMNQVKKLEVSVLVLGQRRP SF SCFCGSGGAGDLV+ CINNAECLT
Subjt:  VIPTS-SAHSSSSADSSSSSSFSATSLGSLCKASRPEVEVEVLVIQGPKLATVMNQVKKLEVSVLVLGQRRP-SFLSCFCGSGGAGDLVDHCINNAECLT

Query:  IGVRKQSRDMGGYVINTRWQKNFWLLA
        IGVRKQSRDMGGYVINTRWQKNFWLLA
Subjt:  IGVRKQSRDMGGYVINTRWQKNFWLLA

TrEMBL top hitse value%identityAlignment
A0A1S3CRX0 uncharacterized protein LOC1035040532.3e-9080.79Show/hide
Query:  MPSTESFLRQISTREG---SKSTSRRWGG------------EGSHWGQKME-GGVNNLYGIESGGMSRRKRVMVVVDQTSQANHATMWALTHVANKGDVL
        MPSTESFLRQIS+R G   S+STSRRWGG            EGS W QKME GGVN++YGI++GGMSRRKRVMVVVD TSQ++HATMWALTH+ANKGDVL
Subjt:  MPSTESFLRQISTREG---SKSTSRRWGG------------EGSHWGQKME-GGVNNLYGIESGGMSRRKRVMVVVDQTSQANHATMWALTHVANKGDVL

Query:  TLLHVIPTSSAHSSSSADSSSSSSFSATSLGSLCKASRPEVEVEVLVIQGPKLATVMNQVKKLEVSVLVLGQRRPSFLSCFCGSGGAGDLVDHCINNAEC
        TLLHVI  SS  SSSSA + SSSSF A+SLGSLCKASRPEVEVEVLVI+GPKLATVMNQVKKLEVSVLV+GQRRPS  SCFCGSGGAGDLV+ CINNAEC
Subjt:  TLLHVIPTSSAHSSSSADSSSSSSFSATSLGSLCKASRPEVEVEVLVIQGPKLATVMNQVKKLEVSVLVLGQRRPSFLSCFCGSGGAGDLVDHCINNAEC

Query:  LTIGVRKQSRDMGGYVINTRWQKNFWLLA
        LTIGVRKQSRDMGGYVINTRWQKNFWLLA
Subjt:  LTIGVRKQSRDMGGYVINTRWQKNFWLLA

A0A6J1CJN1 universal stress protein PHOS322.3e-11199.53Show/hide
Query:  MPSTESFLRQISTREGSKSTSRRWGGEGSHWGQKMEGGVNNLYGIESGGMSRRKRVMVVVDQTSQANHATMWALTHVANKGDVLTLLHVIPTSSAHSSSS
        MPSTESFLRQISTREGSKSTSRRWGGEGSHWGQKMEGGVNNLYGIESGGMSRRKRVMVVVDQTSQANHATMWALTHVANKGDVLTLLHVIPTSSAHSSSS
Subjt:  MPSTESFLRQISTREGSKSTSRRWGGEGSHWGQKMEGGVNNLYGIESGGMSRRKRVMVVVDQTSQANHATMWALTHVANKGDVLTLLHVIPTSSAHSSSS

Query:  ADSSSSSSFSATSLGSLCKASRPEVEVEVLVIQGPKLATVMNQVKKLEVSVLVLGQRRPSFLSCFCGSGGAGDLVDHCINNAECLTIGVRKQSRDMGGYV
        ADSSSSSSFSATSLGSLCKASRPEVEVEVLVIQGPKLATVMNQVKKLEVSVLVLGQRRPSFLSCF GSGGAGDLVDHCINNAECLTIGVRKQSRDMGGYV
Subjt:  ADSSSSSSFSATSLGSLCKASRPEVEVEVLVIQGPKLATVMNQVKKLEVSVLVLGQRRPSFLSCFCGSGGAGDLVDHCINNAECLTIGVRKQSRDMGGYV

Query:  INTRWQKNFWLLA
        INTRWQKNFWLLA
Subjt:  INTRWQKNFWLLA

A0A6J1FKL3 uncharacterized protein LOC1114448692.1e-9181.86Show/hide
Query:  MPSTESFLRQISTREGSKSTSRRWGG------------EGSHWGQKMEGGVNNLYGIESGGMSRRKRVMVVVDQTSQANHATMWALTHVANKGDVLTLLH
        M STESFLRQIS REGS+S SRRWGG            EG  W QKMEGGVNN+YGI++GGMSR+KRVMVVVDQTSQ+NHATMWALTHVANKGDVLTLLH
Subjt:  MPSTESFLRQISTREGSKSTSRRWGG------------EGSHWGQKMEGGVNNLYGIESGGMSRRKRVMVVVDQTSQANHATMWALTHVANKGDVLTLLH

Query:  VIPTSSAHSSSSADSSSSSSFSATSLGSLCKASRPEVEVEVLVIQGPKLATVMNQVKKLEVSVLVLGQRRP-SFLSCFCGSGGAGDLVDHCINNAECLTI
        VI  S+  SS +  SSSSSSF ATSLGS+CKASRPEVEVEVLV++GPKLATVMNQVKKLEVSVLVLGQRRP SF SCFCGSGGAGDLV+ CINNAECLTI
Subjt:  VIPTSSAHSSSSADSSSSSSFSATSLGSLCKASRPEVEVEVLVIQGPKLATVMNQVKKLEVSVLVLGQRRP-SFLSCFCGSGGAGDLVDHCINNAECLTI

Query:  GVRKQSRDMGGYVINTRWQKNFWLLA
        GVRKQSRDMGGYVINTRWQKNFWLLA
Subjt:  GVRKQSRDMGGYVINTRWQKNFWLLA

A0A6J1JPP1 uncharacterized protein LOC1114877673.0e-9080.53Show/hide
Query:  MPSTESFLRQISTREGSKSTSRRWGG-------------EGSHWGQKMEGGVNNLYGIESGGMSRRKRVMVVVDQTSQANHATMWALTHVANKGDVLTLL
        MP  ESFLRQIS  EGS+STS+RWGG             EGSHW +KMEGGV N+YG++ GG+SRRKRVMVVVD TSQ+NHATMWALTHVANKGDVLTLL
Subjt:  MPSTESFLRQISTREGSKSTSRRWGG-------------EGSHWGQKMEGGVNNLYGIESGGMSRRKRVMVVVDQTSQANHATMWALTHVANKGDVLTLL

Query:  HVIPTSSAHSSSSADSSSSSSFSATSLGSLCKASRPEVEVEVLVIQGPKLATVMNQVKKLEVSVLVLGQRRPSFLSCFCGSGGAGDLVDHCINNAECLTI
        HVI  SS  SS    SSSSS F ATSLGSLCKASRPEVEVEVLVI+GPKL TVMNQVKKLEVSVLVLGQRRPSFLSCFCGSGGAGDLV+ CINNAECLTI
Subjt:  HVIPTSSAHSSSSADSSSSSSFSATSLGSLCKASRPEVEVEVLVIQGPKLATVMNQVKKLEVSVLVLGQRRPSFLSCFCGSGGAGDLVDHCINNAECLTI

Query:  GVRKQSRDMGGYVINTRWQKNFWLLA
        GVRKQSRDMGGYVINTRWQ+NFWLLA
Subjt:  GVRKQSRDMGGYVINTRWQKNFWLLA

A0A6J1K3E8 uncharacterized protein LOC1114897241.7e-9384.23Show/hide
Query:  MPSTESFLRQISTREGSKSTSRRWGG--------EGSHWGQKMEGGVNNLYGIESGGMSRRKRVMVVVDQTSQANHATMWALTHVANKGDVLTLLHVIPT
        MPSTESFLRQIS REGS+S SRRWGG        EG  W QKMEGGVNN+YGI++GGMSR+KRVMVVVDQTSQ+NHATMWALTHVANKGDVLTLLHVI  
Subjt:  MPSTESFLRQISTREGSKSTSRRWGG--------EGSHWGQKMEGGVNNLYGIESGGMSRRKRVMVVVDQTSQANHATMWALTHVANKGDVLTLLHVIPT

Query:  SSAHSSSSADSSSSSSFSATSLGSLCKASRPEVEVEVLVIQGPKLATVMNQVKKLEVSVLVLGQRRP-SFLSCFCGSGGAGDLVDHCINNAECLTIGVRK
        S+  S+ S  SSSSSSF ATSLGSLCKASRPEVEVEVLV++GPKLATVMNQVKKLEVSVLVLGQRRP SF SCFCGSGGAGDLV+ CINNAECLTIGVRK
Subjt:  SSAHSSSSADSSSSSSFSATSLGSLCKASRPEVEVEVLVIQGPKLATVMNQVKKLEVSVLVLGQRRP-SFLSCFCGSGGAGDLVDHCINNAECLTIGVRK

Query:  QSRDMGGYVINTRWQKNFWLLA
        QSRDMGGYVINTRWQKNFWLLA
Subjt:  QSRDMGGYVINTRWQKNFWLLA

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G44760.1 Adenine nucleotide alpha hydrolases-like superfamily protein7.8e-5956.82Show/hide
Query:  STESFLRQISTREGSKSTSRRW--GGEGSHWGQKMEGG----VNNLYGIESGG--MSRRKRVMVVVDQTSQANHATMWALTHVANKGDVLTLLHVIPTSS
        S  S LRQ+S +EG +S S+RW  G   + +     GG    +  LYG+ SGG   +R KRVMVVVD++S++ HA MWALTH+ NKGD++TLLHV+    
Subjt:  STESFLRQISTREGSKSTSRRW--GGEGSHWGQKMEGG----VNNLYGIESGG--MSRRKRVMVVVDQTSQANHATMWALTHVANKGDVLTLLHVIPTSS

Query:  AHSSSSADSSSSSSFSATSLGSLCKASRPEVEVEVLVIQGPKLATVMNQVKKLEVSVLVLGQRRPS-FLSCFCGSGGAGDLVDHCINNAECLTIGVRKQS
             S D  ++ S  A SLGSLCKA +PEV+VE LVIQGPKLATV++QVKKLEVSVLVLGQ++ +  +SC CG   + +LV+ CIN A+CLTIGVRKQ 
Subjt:  AHSSSSADSSSSSSFSATSLGSLCKASRPEVEVEVLVIQGPKLATVMNQVKKLEVSVLVLGQRRPS-FLSCFCGSGGAGDLVDHCINNAECLTIGVRKQS

Query:  RDMGGYVINTRWQKNFWLLA
        + +GGY+INTRWQKNFWLLA
Subjt:  RDMGGYVINTRWQKNFWLLA

AT1G69080.1 Adenine nucleotide alpha hydrolases-like superfamily protein1.3e-2132.78Show/hide
Query:  KRVMVVVDQTSQANHATMWALTHVANKGDVLTLLHVIPTSSAHSSSSAD------------SSSSSSFSATSLGSLCKASRPEVEVEVLVIQG-PKLATV
        +R++VVVD  S+A +A +W L+H A   D + LLH +   ++ S   A+            ++S +    ++L ++C+  RPEV+ EV+ ++G  K  T+
Subjt:  KRVMVVVDQTSQANHATMWALTHVANKGDVLTLLHVIPTSSAHSSSSAD------------SSSSSSFSATSLGSLCKASRPEVEVEVLVIQG-PKLATV

Query:  MNQVKKLEVSVLVLGQRRP----SFLSCFCGSG---GAGDLVDHCINNAECLTIGVRKQSRDMGGYVINTRWQKNFWLLA
        + + ++ E S+LVLGQ++       L  +          D V++CINN+ C+ I VRK+ + +GGY + T+  K+FWLLA
Subjt:  MNQVKKLEVSVLVLGQRRP----SFLSCFCGSG---GAGDLVDHCINNAECLTIGVRKQSRDMGGYVINTRWQKNFWLLA

AT1G69080.2 Adenine nucleotide alpha hydrolases-like superfamily protein5.1e-1832.14Show/hide
Query:  KRVMVVVDQTSQANHATMWALTHVANKGDVLTLLHVIPTSSAHSSSSADSSSSSSFSATSLGSLCKASRPEVEVEVLVIQG-PKLATVMNQVKKLEVSVL
        +R++VVVD  S+A +A +W L+H A   D + LLH +   ++ S   A+       S          +  +V+ EV+ ++G  K  T++ + ++ E S+L
Subjt:  KRVMVVVDQTSQANHATMWALTHVANKGDVLTLLHVIPTSSAHSSSSADSSSSSSFSATSLGSLCKASRPEVEVEVLVIQG-PKLATVMNQVKKLEVSVL

Query:  VLGQRRP----SFLSCFCGSG---GAGDLVDHCINNAECLTIGVRKQSRDMGGYVINTRWQKNFWLLA
        VLGQ++       L  +          D V++CINN+ C+ I VRK+ + +GGY + T+  K+FWLLA
Subjt:  VLGQRRP----SFLSCFCGSG---GAGDLVDHCINNAECLTIGVRKQSRDMGGYVINTRWQKNFWLLA

AT2G03720.1 Adenine nucleotide alpha hydrolases-like superfamily protein4.9e-2137.95Show/hide
Query:  MVVVDQTSQANHATMWALTHVANKGDVLTLLHV--IPTSSAHSSSSADSSSSSSFSATSLGSLCKASRPEVEVEVLVIQ--GPKLATVMNQVKKLEVSVL
        MVVVD TSQ  +A  WALTH     D +TLLHV   P   A   +  + +S +      L + C+  +P V+ E++V++    K  T++ + KK    VL
Subjt:  MVVVDQTSQANHATMWALTHVANKGDVLTLLHV--IPTSSAHSSSSADSSSSSSFSATSLGSLCKASRPEVEVEVLVIQ--GPKLATVMNQVKKLEVSVL

Query:  VLGQRRPS-----FLSCFCGSGGAGDLVDHCINNAECLTIGVRKQSRDMGGYVINTRWQKNFWLLA
        VLGQR+ +             G  G +V++CI+N++C+ I VRK+S + GGY+I T+  K+FWLLA
Subjt:  VLGQRRPS-----FLSCFCGSGGAGDLVDHCINNAECLTIGVRKQSRDMGGYVINTRWQKNFWLLA

AT5G17390.1 Adenine nucleotide alpha hydrolases-like superfamily protein5.6e-1733.33Show/hide
Query:  RVMVVVDQTSQANHATMWALTHVANKGDVLTLLHVIPTSSAHSSSSADSSSSSSFSATSLGSLCKASRPEVEVEVLVIQG---PKLATVMNQVKKLEVSV
        RVMVVVD+   +  A  WA+TH     D L LL+           +      +     +L  LC+  RP +EVE+  ++G    K   ++ + KK +VS+
Subjt:  RVMVVVDQTSQANHATMWALTHVANKGDVLTLLHVIPTSSAHSSSSADSSSSSSFSATSLGSLCKASRPEVEVEVLVIQG---PKLATVMNQVKKLEVSV

Query:  LVLGQ-RRPSFLS-----CFCGSGGAGDLVDHCINNAECLTIGVRKQSRDMGGYVINTRWQKNFWLLA
        LV+GQ ++P          +    G   ++ +C+ NA C+TI V+ ++R +GGY+I T+  KNFWLLA
Subjt:  LVLGQ-RRPSFLS-----CFCGSGGAGDLVDHCINNAECLTIGVRKQSRDMGGYVINTRWQKNFWLLA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCAAGTACGGAATCGTTTCTGAGGCAGATAAGTACAAGGGAGGGCTCCAAATCTACATCTAGGAGGTGGGGTGGCGAGGGAAGCCATTGGGGCCAGAAGATGGAGGG
CGGTGTTAACAACTTGTATGGGATCGAGAGCGGCGGAATGTCTAGGAGGAAGAGGGTGATGGTGGTGGTGGATCAGACTTCACAGGCCAACCATGCAACCATGTGGGCGC
TAACTCATGTGGCTAACAAGGGCGATGTTCTTACTCTTCTTCATGTCATCCCTACCTCCTCTGCCCACTCTTCTTCTTCTGCAGATTCTTCTTCCTCTTCCTCTTTCTCT
GCTACTTCTCTTGGTTCCCTCTGCAAGGCTTCTAGACCTGAGGTGGAGGTGGAAGTGCTGGTGATCCAGGGGCCAAAGCTGGCCACAGTGATGAATCAAGTGAAGAAGTT
GGAGGTGTCGGTGCTGGTTCTGGGACAGAGAAGACCATCGTTTCTCAGTTGCTTTTGTGGAAGTGGTGGGGCGGGAGATCTGGTGGATCATTGCATAAATAACGCCGAGT
GCTTGACCATTGGAGTGAGAAAGCAGAGCAGAGACATGGGTGGCTATGTAATCAACACCAGATGGCAGAAGAACTTCTGGCTTCTGGCT
mRNA sequenceShow/hide mRNA sequence
ATGCCAAGTACGGAATCGTTTCTGAGGCAGATAAGTACAAGGGAGGGCTCCAAATCTACATCTAGGAGGTGGGGTGGCGAGGGAAGCCATTGGGGCCAGAAGATGGAGGG
CGGTGTTAACAACTTGTATGGGATCGAGAGCGGCGGAATGTCTAGGAGGAAGAGGGTGATGGTGGTGGTGGATCAGACTTCACAGGCCAACCATGCAACCATGTGGGCGC
TAACTCATGTGGCTAACAAGGGCGATGTTCTTACTCTTCTTCATGTCATCCCTACCTCCTCTGCCCACTCTTCTTCTTCTGCAGATTCTTCTTCCTCTTCCTCTTTCTCT
GCTACTTCTCTTGGTTCCCTCTGCAAGGCTTCTAGACCTGAGGTGGAGGTGGAAGTGCTGGTGATCCAGGGGCCAAAGCTGGCCACAGTGATGAATCAAGTGAAGAAGTT
GGAGGTGTCGGTGCTGGTTCTGGGACAGAGAAGACCATCGTTTCTCAGTTGCTTTTGTGGAAGTGGTGGGGCGGGAGATCTGGTGGATCATTGCATAAATAACGCCGAGT
GCTTGACCATTGGAGTGAGAAAGCAGAGCAGAGACATGGGTGGCTATGTAATCAACACCAGATGGCAGAAGAACTTCTGGCTTCTGGCT
Protein sequenceShow/hide protein sequence
MPSTESFLRQISTREGSKSTSRRWGGEGSHWGQKMEGGVNNLYGIESGGMSRRKRVMVVVDQTSQANHATMWALTHVANKGDVLTLLHVIPTSSAHSSSSADSSSSSSFS
ATSLGSLCKASRPEVEVEVLVIQGPKLATVMNQVKKLEVSVLVLGQRRPSFLSCFCGSGGAGDLVDHCINNAECLTIGVRKQSRDMGGYVINTRWQKNFWLLA