; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0029237 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0029237
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionUsp domain-containing protein
Genome locationchr8:36804647..36805619
RNA-Seq ExpressionLag0029237
SyntenyLag0029237
Gene Ontology termsNA
InterPro domainsIPR006016 - UspA
IPR014729 - Rossmann-like alpha/beta/alpha sandwich fold


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6578971.1 hypothetical protein SDJN03_23419, partial [Cucurbita argyrosperma subsp. sororia]7.1e-10590.22Show/hide
Query:  MPSTESFLRQISRREGSRSTSKRWGGEFRRSSAEERVSEGSRWNQKMDGGVNSMYGMDNGGMSRRKRVMVVVDQTSQSNHATMWALTHVANKGDVLTLLH
        MPSTESFLRQISRREGSRS S+RWGGEFRRS  EE VSEG RW+QKM+GGVN+MYG+DNGGMSR+KRVMVVVDQTSQSNHATMWALTHVANKGDVLTLLH
Subjt:  MPSTESFLRQISRREGSRSTSKRWGGEFRRSSAEERVSEGSRWNQKMDGGVNSMYGMDNGGMSRRKRVMVVVDQTSQSNHATMWALTHVANKGDVLTLLH

Query:  VITDSIDS-SSSADSSSSSFCASSLGSLCKASRPEVEVEVLVIEGPKLATVMNQVKKLEVSVLVLGQRRP-SFLSCFCGSGGAGDLVEQCINNAECLTIG
        VITDS  S + S+ SSSSSFCA+SLGSLCKASRPEVEVEVLV+EGPKLATVMNQVKKLEVSVLVLGQRRP SF SCFCGSGGAGDLVEQCINNAECLTIG
Subjt:  VITDSIDS-SSSADSSSSSFCASSLGSLCKASRPEVEVEVLVIEGPKLATVMNQVKKLEVSVLVLGQRRP-SFLSCFCGSGGAGDLVEQCINNAECLTIG

Query:  VRKQSRDMGGYVINTRWQKNFWLLA
        VRKQSRDMGGYVINTRWQKNFWLLA
Subjt:  VRKQSRDMGGYVINTRWQKNFWLLA

KAG7016492.1 hypothetical protein SDJN02_21601, partial [Cucurbita argyrosperma subsp. argyrosperma]6.0e-10489.78Show/hide
Query:  MPSTESFLRQISRREGSRSTSKRWGGEFRRSSAEERVSEGSRWNQKMDGGVNSMYGMDNGGMSRRKRVMVVVDQTSQSNHATMWALTHVANKGDVLTLLH
        MPSTESFLRQISRRE SRS S+RWGGEFRRS  EE VSEG RW+QKM+GGVN+MYG+DNGGMSR+KRVMVVVDQTSQSNHATMWALTHVANKGDVLTLLH
Subjt:  MPSTESFLRQISRREGSRSTSKRWGGEFRRSSAEERVSEGSRWNQKMDGGVNSMYGMDNGGMSRRKRVMVVVDQTSQSNHATMWALTHVANKGDVLTLLH

Query:  VITDSIDS-SSSADSSSSSFCASSLGSLCKASRPEVEVEVLVIEGPKLATVMNQVKKLEVSVLVLGQRRP-SFLSCFCGSGGAGDLVEQCINNAECLTIG
        VITDS  S + S+ SSSSSFCA+SLGSLCKASRPEVEVEVLV+EGPKLATVMNQVKKLEVSVLVLGQRRP SF SCFCGSGGAGDLVEQCINNAECLTIG
Subjt:  VITDSIDS-SSSADSSSSSFCASSLGSLCKASRPEVEVEVLVIEGPKLATVMNQVKKLEVSVLVLGQRRP-SFLSCFCGSGGAGDLVEQCINNAECLTIG

Query:  VRKQSRDMGGYVINTRWQKNFWLLA
        VRKQSRDMGGYVINTRWQKNFWLLA
Subjt:  VRKQSRDMGGYVINTRWQKNFWLLA

XP_022938730.1 uncharacterized protein LOC111444869 [Cucurbita moschata]3.5e-10489.29Show/hide
Query:  MPSTESFLRQISRREGSRSTSKRWGGEFRRSSAEERVSEGSRWNQKMDGGVNSMYGMDNGGMSRRKRVMVVVDQTSQSNHATMWALTHVANKGDVLTLLH
        M STESFLRQISRREGSRS S+RWGGEFRRS  EE VSEG RW+QKM+GGVN+MYG+DNGGMSR+KRVMVVVDQTSQSNHATMWALTHVANKGDVLTLLH
Subjt:  MPSTESFLRQISRREGSRSTSKRWGGEFRRSSAEERVSEGSRWNQKMDGGVNSMYGMDNGGMSRRKRVMVVVDQTSQSNHATMWALTHVANKGDVLTLLH

Query:  VITDSIDSSSSADSSSSSFCASSLGSLCKASRPEVEVEVLVIEGPKLATVMNQVKKLEVSVLVLGQRRP-SFLSCFCGSGGAGDLVEQCINNAECLTIGV
        VITDS  S + + SSSSSFCA+SLGS+CKASRPEVEVEVLV+EGPKLATVMNQVKKLEVSVLVLGQRRP SF SCFCGSGGAGDLVEQCINNAECLTIGV
Subjt:  VITDSIDSSSSADSSSSSFCASSLGSLCKASRPEVEVEVLVIEGPKLATVMNQVKKLEVSVLVLGQRRP-SFLSCFCGSGGAGDLVEQCINNAECLTIGV

Query:  RKQSRDMGGYVINTRWQKNFWLLA
        RKQSRDMGGYVINTRWQKNFWLLA
Subjt:  RKQSRDMGGYVINTRWQKNFWLLA

XP_023550879.1 uncharacterized protein LOC111808882 [Cucurbita pepo subsp. pepo]1.1e-10590.31Show/hide
Query:  MPSTESFLRQISRREGSRSTSKRWGGEFRRSSAEERVSEGSRWNQKMDGGVNSMYGMDNGGMSRRKRVMVVVDQTSQSNHATMWALTHVANKGDVLTLLH
        MPSTESFLRQIS+REGSRS S+RWGGEFRRS  EE VSEG RWNQKM+GGVN+MYG+DNGGMSR+KRVMVVVDQTSQSNHATMWALTHVANKGDVLTLLH
Subjt:  MPSTESFLRQISRREGSRSTSKRWGGEFRRSSAEERVSEGSRWNQKMDGGVNSMYGMDNGGMSRRKRVMVVVDQTSQSNHATMWALTHVANKGDVLTLLH

Query:  VITDSIDS---SSSADSSSSSFCASSLGSLCKASRPEVEVEVLVIEGPKLATVMNQVKKLEVSVLVLGQRRP-SFLSCFCGSGGAGDLVEQCINNAECLT
        VITDS  S   SSS+ SSSSSFCA+SLGSLCKASRPEVEVEVLV+EGPKLATVMNQVKKLEVSVLVLGQRRP SF SCFCGSGGAGDLVEQCINNAECLT
Subjt:  VITDSIDS---SSSADSSSSSFCASSLGSLCKASRPEVEVEVLVIEGPKLATVMNQVKKLEVSVLVLGQRRP-SFLSCFCGSGGAGDLVEQCINNAECLT

Query:  IGVRKQSRDMGGYVINTRWQKNFWLLA
        IGVRKQSRDMGGYVINTRWQKNFWLLA
Subjt:  IGVRKQSRDMGGYVINTRWQKNFWLLA

XP_038884412.1 uncharacterized protein LOC120075267 [Benincasa hispida]1.2e-10490.75Show/hide
Query:  MPSTESFLRQISRR-EGSRSTSKRWGGEFRRSSAEERVSEGSRWNQKMDGGVNSMYGMDNGGMSRRKRVMVVVDQTSQSNHATMWALTHVANKGDVLTLL
        MPSTESF+RQISRR EGSRSTS+RWGGEFRRS  EERVSEG+RWNQKM+GGVN MYG+DNGGMSRRKRVMVVVD TSQSNHATMWALTHVANKGDVLTLL
Subjt:  MPSTESFLRQISRR-EGSRSTSKRWGGEFRRSSAEERVSEGSRWNQKMDGGVNSMYGMDNGGMSRRKRVMVVVDQTSQSNHATMWALTHVANKGDVLTLL

Query:  HVITD-SIDSSSSAD--SSSSSFCASSLGSLCKASRPEVEVEVLVIEGPKLATVMNQVKKLEVSVLVLGQRRPSFLSCFCGSGGAGDLVEQCINNAECLT
        HVIT+ S DSSS+AD  SSSSSFCA+SLGSLCKASRPEVEVEVLVIEGPKLATV+NQVKKLEVSVLV+GQR+PSFLSCFCGSGGAGDLVEQCINN ECLT
Subjt:  HVITD-SIDSSSSAD--SSSSSFCASSLGSLCKASRPEVEVEVLVIEGPKLATVMNQVKKLEVSVLVLGQRRPSFLSCFCGSGGAGDLVEQCINNAECLT

Query:  IGVRKQSRDMGGYVINTRWQKNFWLLA
        IGVRKQSRDMGGYVINTRWQKNFWLLA
Subjt:  IGVRKQSRDMGGYVINTRWQKNFWLLA

TrEMBL top hitse value%identityAlignment
A0A0A0KDD8 Usp domain-containing protein2.7e-10288.11Show/hide
Query:  MPSTESFLRQISRREG---SRSTSKRWGGEFRRSSAEERVSEGSRWNQKMD-GGVNSMYGMDNGGMSRRKRVMVVVDQTSQSNHATMWALTHVANKGDVL
        MPSTESFLRQIS R G   SRSTS+RWGGEFRR+  EERVSEGS W+QKM+ GGVNSM+G+DNGGMSRRKRVMVVVD TSQSNHATMWALTH+ANKGDVL
Subjt:  MPSTESFLRQISRREG---SRSTSKRWGGEFRRSSAEERVSEGSRWNQKMD-GGVNSMYGMDNGGMSRRKRVMVVVDQTSQSNHATMWALTHVANKGDVL

Query:  TLLHVITDSIDSSSSADSSSSSFCASSLGSLCKASRPEVEVEVLVIEGPKLATVMNQVKKLEVSVLVLGQRRPSFLSCFCGSGGAGDLVEQCINNAECLT
        TLLHVIT+S   SSSA  S+SSFCASSLGSLCKASRPEVEVEVLVIEGPKLATVMNQVKKLEVSVLV+GQRRPS  SCFCGSGGAGDLVEQCINNAECLT
Subjt:  TLLHVITDSIDSSSSADSSSSSFCASSLGSLCKASRPEVEVEVLVIEGPKLATVMNQVKKLEVSVLVLGQRRPSFLSCFCGSGGAGDLVEQCINNAECLT

Query:  IGVRKQSRDMGGYVINTRWQKNFWLLA
        IGVRKQSRDMGGYVINTRWQKNFWLLA
Subjt:  IGVRKQSRDMGGYVINTRWQKNFWLLA

A0A1S3CRX0 uncharacterized protein LOC1035040537.1e-10389.47Show/hide
Query:  MPSTESFLRQISRREG---SRSTSKRWGGEFRRSSAEERVSEGSRWNQKMD-GGVNSMYGMDNGGMSRRKRVMVVVDQTSQSNHATMWALTHVANKGDVL
        MPSTESFLRQIS R G   SRSTS+RWGGEFR S  EERVSEGS WNQKM+ GGVNSMYG+DNGGMSRRKRVMVVVD TSQS+HATMWALTH+ANKGDVL
Subjt:  MPSTESFLRQISRREG---SRSTSKRWGGEFRRSSAEERVSEGSRWNQKMD-GGVNSMYGMDNGGMSRRKRVMVVVDQTSQSNHATMWALTHVANKGDVL

Query:  TLLHVITD-SIDSSSSADSSSSSFCASSLGSLCKASRPEVEVEVLVIEGPKLATVMNQVKKLEVSVLVLGQRRPSFLSCFCGSGGAGDLVEQCINNAECL
        TLLHVIT+ S DSSSSA  SSSSFCASSLGSLCKASRPEVEVEVLVIEGPKLATVMNQVKKLEVSVLV+GQRRPS  SCFCGSGGAGDLVEQCINNAECL
Subjt:  TLLHVITD-SIDSSSSADSSSSSFCASSLGSLCKASRPEVEVEVLVIEGPKLATVMNQVKKLEVSVLVLGQRRPSFLSCFCGSGGAGDLVEQCINNAECL

Query:  TIGVRKQSRDMGGYVINTRWQKNFWLLA
        TIGVRKQSRDMGGYVINTRWQKNFWLLA
Subjt:  TIGVRKQSRDMGGYVINTRWQKNFWLLA

A0A6J1EA58 uncharacterized protein LOC1114307484.2e-10388.84Show/hide
Query:  MPSTESFLRQISRREGSRSTSKRWGGEFRRSSA-EERVSEGSRWNQKMDGGVNSMYGMDNGGMSRRKRVMVVVDQTSQSNHATMWALTHVANKGDVLTLL
        MP  ESFLRQISR EGSRSTSKRWGGEFRRSSA EERVSEGS WN+KM+GGVN MYGMD+GG+SRRKRVMVVVD TSQSNHATMWALTHVANKGDVLTLL
Subjt:  MPSTESFLRQISRREGSRSTSKRWGGEFRRSSA-EERVSEGSRWNQKMDGGVNSMYGMDNGGMSRRKRVMVVVDQTSQSNHATMWALTHVANKGDVLTLL

Query:  HVITDSIDSSSSADSSSSSFCASSLGSLCKASRPEVEVEVLVIEGPKLATVMNQVKKLEVSVLVLGQRRPSFLSCFCGSGGAGDLVEQCINNAECLTIGV
        H+IT++   SSS+ SSSS FCA+SLGSLCKASRPEVEVEVLVIEGP+L+TVMNQVKKLEVSVLV+GQRRPSFLSCFCGSGGAGDLVEQCINNAECLTIGV
Subjt:  HVITDSIDSSSSADSSSSSFCASSLGSLCKASRPEVEVEVLVIEGPKLATVMNQVKKLEVSVLVLGQRRPSFLSCFCGSGGAGDLVEQCINNAECLTIGV

Query:  RKQSRDMGGYVINTRWQKNFWLLA
        RKQSRDMGGYVINTRWQ+NFWLLA
Subjt:  RKQSRDMGGYVINTRWQKNFWLLA

A0A6J1FKL3 uncharacterized protein LOC1114448691.7e-10489.29Show/hide
Query:  MPSTESFLRQISRREGSRSTSKRWGGEFRRSSAEERVSEGSRWNQKMDGGVNSMYGMDNGGMSRRKRVMVVVDQTSQSNHATMWALTHVANKGDVLTLLH
        M STESFLRQISRREGSRS S+RWGGEFRRS  EE VSEG RW+QKM+GGVN+MYG+DNGGMSR+KRVMVVVDQTSQSNHATMWALTHVANKGDVLTLLH
Subjt:  MPSTESFLRQISRREGSRSTSKRWGGEFRRSSAEERVSEGSRWNQKMDGGVNSMYGMDNGGMSRRKRVMVVVDQTSQSNHATMWALTHVANKGDVLTLLH

Query:  VITDSIDSSSSADSSSSSFCASSLGSLCKASRPEVEVEVLVIEGPKLATVMNQVKKLEVSVLVLGQRRP-SFLSCFCGSGGAGDLVEQCINNAECLTIGV
        VITDS  S + + SSSSSFCA+SLGS+CKASRPEVEVEVLV+EGPKLATVMNQVKKLEVSVLVLGQRRP SF SCFCGSGGAGDLVEQCINNAECLTIGV
Subjt:  VITDSIDSSSSADSSSSSFCASSLGSLCKASRPEVEVEVLVIEGPKLATVMNQVKKLEVSVLVLGQRRP-SFLSCFCGSGGAGDLVEQCINNAECLTIGV

Query:  RKQSRDMGGYVINTRWQKNFWLLA
        RKQSRDMGGYVINTRWQKNFWLLA
Subjt:  RKQSRDMGGYVINTRWQKNFWLLA

A0A6J1K3E8 uncharacterized protein LOC1114897241.4e-10389.73Show/hide
Query:  MPSTESFLRQISRREGSRSTSKRWGGEFRRSSAEERVSEGSRWNQKMDGGVNSMYGMDNGGMSRRKRVMVVVDQTSQSNHATMWALTHVANKGDVLTLLH
        MPSTESFLRQISRREGSRS S+RWGGEFRRS  E    EG RWNQKM+GGVN+MYG+DNGGMSR+KRVMVVVDQTSQSNHATMWALTHVANKGDVLTLLH
Subjt:  MPSTESFLRQISRREGSRSTSKRWGGEFRRSSAEERVSEGSRWNQKMDGGVNSMYGMDNGGMSRRKRVMVVVDQTSQSNHATMWALTHVANKGDVLTLLH

Query:  VITDSIDSSSSADSSSSSFCASSLGSLCKASRPEVEVEVLVIEGPKLATVMNQVKKLEVSVLVLGQRRP-SFLSCFCGSGGAGDLVEQCINNAECLTIGV
        VITDS  SS+ + SSSSSFCA+SLGSLCKASRPEVEVEVLV+EGPKLATVMNQVKKLEVSVLVLGQRRP SF SCFCGSGGAGDLVEQCINNAECLTIGV
Subjt:  VITDSIDSSSSADSSSSSFCASSLGSLCKASRPEVEVEVLVIEGPKLATVMNQVKKLEVSVLVLGQRRP-SFLSCFCGSGGAGDLVEQCINNAECLTIGV

Query:  RKQSRDMGGYVINTRWQKNFWLLA
        RKQSRDMGGYVINTRWQKNFWLLA
Subjt:  RKQSRDMGGYVINTRWQKNFWLLA

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G44760.1 Adenine nucleotide alpha hydrolases-like superfamily protein6.2e-5956.25Show/hide
Query:  STESFLRQISRREGSRSTSKRWGGEFRRSSAEERVSEGSRWNQKMDGGVNSMYGMDNGG--MSRRKRVMVVVDQTSQSNHATMWALTHVANKGDVLTLLH
        S  S LRQ+SR+EG RS SKRW      ++  +  S G      M+G    +YG+ +GG   +R KRVMVVVD++S+S HA MWALTH+ NKGD++TLLH
Subjt:  STESFLRQISRREGSRSTSKRWGGEFRRSSAEERVSEGSRWNQKMDGGVNSMYGMDNGG--MSRRKRVMVVVDQTSQSNHATMWALTHVANKGDVLTLLH

Query:  VITDSIDSSSSADSSSSSFCASSLGSLCKASRPEVEVEVLVIEGPKLATVMNQVKKLEVSVLVLGQRRPS-FLSCFCGSGGAGDLVEQCINNAECLTIGV
        V+        S D  ++   A SLGSLCKA +PEV+VE LVI+GPKLATV++QVKKLEVSVLVLGQ++ +  +SC CG   + +LV +CIN A+CLTIGV
Subjt:  VITDSIDSSSSADSSSSSFCASSLGSLCKASRPEVEVEVLVIEGPKLATVMNQVKKLEVSVLVLGQRRPS-FLSCFCGSGGAGDLVEQCINNAECLTIGV

Query:  RKQSRDMGGYVINTRWQKNFWLLA
        RKQ + +GGY+INTRWQKNFWLLA
Subjt:  RKQSRDMGGYVINTRWQKNFWLLA

AT1G69080.1 Adenine nucleotide alpha hydrolases-like superfamily protein5.2e-2133.89Show/hide
Query:  KRVMVVVDQTSQSNHATMWALTHVANKGDVLTLLHVITDSIDSS-----------SSADSSSSSFC---ASSLGSLCKASRPEVEVEVLVIEG-PKLATV
        +R++VVVD  S++ +A +W L+H A   D + LLH +      S            S D  ++S      S+L ++C+  RPEV+ EV+ ++G  K  T+
Subjt:  KRVMVVVDQTSQSNHATMWALTHVANKGDVLTLLHVITDSIDSS-----------SSADSSSSSFC---ASSLGSLCKASRPEVEVEVLVIEG-PKLATV

Query:  MNQVKKLEVSVLVLGQRRP----SFLSCFCGSG---GAGDLVEQCINNAECLTIGVRKQSRDMGGYVINTRWQKNFWLLA
        + + ++ E S+LVLGQ++       L  +          D VE CINN+ C+ I VRK+ + +GGY + T+  K+FWLLA
Subjt:  MNQVKKLEVSVLVLGQRRP----SFLSCFCGSG---GAGDLVEQCINNAECLTIGVRKQSRDMGGYVINTRWQKNFWLLA

AT1G69080.2 Adenine nucleotide alpha hydrolases-like superfamily protein1.2e-1731.93Show/hide
Query:  KRVMVVVDQTSQSNHATMWALTHVANKGDVLTLLHVITDSIDSSSSADSSSSSFCASSLGSLCKASRPEVEVEVLVIEG-PKLATVMNQVKKLEVSVLVL
        +R++VVVD  S++ +A +W L+H A   D + LLH +      S    +       S        +  +V+ EV+ ++G  K  T++ + ++ E S+LVL
Subjt:  KRVMVVVDQTSQSNHATMWALTHVANKGDVLTLLHVITDSIDSSSSADSSSSSFCASSLGSLCKASRPEVEVEVLVIEG-PKLATVMNQVKKLEVSVLVL

Query:  GQRRP----SFLSCFCGSG---GAGDLVEQCINNAECLTIGVRKQSRDMGGYVINTRWQKNFWLLA
        GQ++       L  +          D VE CINN+ C+ I VRK+ + +GGY + T+  K+FWLLA
Subjt:  GQRRP----SFLSCFCGSG---GAGDLVEQCINNAECLTIGVRKQSRDMGGYVINTRWQKNFWLLA

AT2G03720.1 Adenine nucleotide alpha hydrolases-like superfamily protein2.1e-2239.16Show/hide
Query:  MVVVDQTSQSNHATMWALTHVANKGDVLTLLHV----ITDSIDSSSSADSSSSSFCASSLGSLCKASRPEVEVEVLVIE--GPKLATVMNQVKKLEVSVL
        MVVVD TSQ+ +A  WALTH     D +TLLHV    +  +ID +    +S +      L + C+  +P V+ E++V+E    K  T++ + KK    VL
Subjt:  MVVVDQTSQSNHATMWALTHVANKGDVLTLLHV----ITDSIDSSSSADSSSSSFCASSLGSLCKASRPEVEVEVLVIE--GPKLATVMNQVKKLEVSVL

Query:  VLGQRRPS-----FLSCFCGSGGAGDLVEQCINNAECLTIGVRKQSRDMGGYVINTRWQKNFWLLA
        VLGQR+ +             G  G +VE CI+N++C+ I VRK+S + GGY+I T+  K+FWLLA
Subjt:  VLGQRRPS-----FLSCFCGSGGAGDLVEQCINNAECLTIGVRKQSRDMGGYVINTRWQKNFWLLA

AT5G17390.1 Adenine nucleotide alpha hydrolases-like superfamily protein2.0e-1735.12Show/hide
Query:  RVMVVVDQTSQSNHATMWALTHVANKGDVLTLLHVITDSIDS--SSSADSSSSSFCASSLGSLCKASRPEVEVEVLVIEG---PKLATVMNQVKKLEVSV
        RVMVVVD+   S  A  WA+TH     D L LL+       S   +      +     +L  LC+  RP +EVE+  +EG    K   ++ + KK +VS+
Subjt:  RVMVVVDQTSQSNHATMWALTHVANKGDVLTLLHVITDSIDS--SSSADSSSSSFCASSLGSLCKASRPEVEVEVLVIEG---PKLATVMNQVKKLEVSV

Query:  LVLGQ-RRPSFLS-----CFCGSGGAGDLVEQCINNAECLTIGVRKQSRDMGGYVINTRWQKNFWLLA
        LV+GQ ++P          +    G   +++ C+ NA C+TI V+ ++R +GGY+I T+  KNFWLLA
Subjt:  LVLGQ-RRPSFLS-----CFCGSGGAGDLVEQCINNAECLTIGVRKQSRDMGGYVINTRWQKNFWLLA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCAAGTACGGAATCGTTTCTGAGGCAGATAAGTAGAAGGGAGGGCTCTAGATCAACATCGAAGAGGTGGGGTGGGGAGTTTCGGAGAAGCTCTGCTGAAGAGCGTGT
CAGTGAGGGGAGCCGTTGGAATCAGAAGATGGACGGCGGTGTTAACAGCATGTACGGGATGGACAATGGCGGAATGTCGAGAAGGAAGAGGGTGATGGTAGTGGTGGATC
AGACTTCACAATCCAACCATGCAACCATGTGGGCGCTTACTCATGTGGCTAACAAGGGCGATGTTCTTACTCTTCTTCATGTCATCACTGACTCTATCGACTCTTCTTCC
TCTGCTGATTCTTCTTCTTCCTCTTTCTGTGCTAGCTCTCTGGGTTCCCTCTGCAAGGCTTCTAGACCTGAGGTAGAAGTGGAGGTGTTGGTGATCGAGGGACCAAAGCT
GGCCACAGTGATGAATCAAGTTAAGAAGCTGGAGGTGTCGGTGCTGGTTCTGGGACAGAGAAGGCCATCATTTCTCAGCTGCTTTTGTGGGAGTGGTGGGGCGGGAGATC
TGGTGGAACAGTGCATAAACAACGCAGAGTGCTTGACTATTGGAGTTAGAAAGCAGAGCAGAGACATGGGAGGGTATGTAATCAACACCAGATGGCAGAAGAATTTCTGG
CTCCTTGCTTAG
mRNA sequenceShow/hide mRNA sequence
ATGCCAAGTACGGAATCGTTTCTGAGGCAGATAAGTAGAAGGGAGGGCTCTAGATCAACATCGAAGAGGTGGGGTGGGGAGTTTCGGAGAAGCTCTGCTGAAGAGCGTGT
CAGTGAGGGGAGCCGTTGGAATCAGAAGATGGACGGCGGTGTTAACAGCATGTACGGGATGGACAATGGCGGAATGTCGAGAAGGAAGAGGGTGATGGTAGTGGTGGATC
AGACTTCACAATCCAACCATGCAACCATGTGGGCGCTTACTCATGTGGCTAACAAGGGCGATGTTCTTACTCTTCTTCATGTCATCACTGACTCTATCGACTCTTCTTCC
TCTGCTGATTCTTCTTCTTCCTCTTTCTGTGCTAGCTCTCTGGGTTCCCTCTGCAAGGCTTCTAGACCTGAGGTAGAAGTGGAGGTGTTGGTGATCGAGGGACCAAAGCT
GGCCACAGTGATGAATCAAGTTAAGAAGCTGGAGGTGTCGGTGCTGGTTCTGGGACAGAGAAGGCCATCATTTCTCAGCTGCTTTTGTGGGAGTGGTGGGGCGGGAGATC
TGGTGGAACAGTGCATAAACAACGCAGAGTGCTTGACTATTGGAGTTAGAAAGCAGAGCAGAGACATGGGAGGGTATGTAATCAACACCAGATGGCAGAAGAATTTCTGG
CTCCTTGCTTAG
Protein sequenceShow/hide protein sequence
MPSTESFLRQISRREGSRSTSKRWGGEFRRSSAEERVSEGSRWNQKMDGGVNSMYGMDNGGMSRRKRVMVVVDQTSQSNHATMWALTHVANKGDVLTLLHVITDSIDSSS
SADSSSSSFCASSLGSLCKASRPEVEVEVLVIEGPKLATVMNQVKKLEVSVLVLGQRRPSFLSCFCGSGGAGDLVEQCINNAECLTIGVRKQSRDMGGYVINTRWQKNFW
LLA