; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Carg00458 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene IDCarg00458
OrganismCucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
DescriptionUbiquitin-like domain-containing protein
Genome locationCarg_Chr04:7407528..7411618
RNA-Seq ExpressionCarg00458
SyntenyCarg00458
Gene Ontology termsGO:0000398 - mRNA splicing, via spliceosome (biological process)
GO:0005689 - U12-type spliceosomal complex (cellular component)
InterPro domainsIPR029071 - Ubiquitin-like domain superfamily
IPR039690 - U11/U12 small nuclear ribonucleoprotein 25kDa protein
IPR040610 - SNRNP25, ubiquitin-like domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7031799.1 hypothetical protein SDJN02_05840, partial [Cucurbita argyrosperma subsp. argyrosperma]1.3e-131100Show/hide
Query:  MAIDGVCTLDVADFPPLPVVEKRRRSFSLLLSSLMIVGFSRKSGLYRQLPHQPLELSVLKLDGSCFDIRVKRSATVAELKGAVEAKLIDDTGYIAKLGIK
        MAIDGVCTLDVADFPPLPVVEKRRRSFSLLLSSLMIVGFSRKSGLYRQLPHQPLELSVLKLDGSCFDIRVKRSATVAELKGAVEAKLIDDTGYIAKLGIK
Subjt:  MAIDGVCTLDVADFPPLPVVEKRRRSFSLLLSSLMIVGFSRKSGLYRQLPHQPLELSVLKLDGSCFDIRVKRSATVAELKGAVEAKLIDDTGYIAKLGIK

Query:  DGDQLQFVRHVTTGYNVIRKQSKKGQVSPKLHNRTSSLSRNYDQDDIESGRIQHHGNNQLFKHHEPKMVVFLGGWFSHIKLASAGKTCIESLVRSSGTQH
        DGDQLQFVRHVTTGYNVIRKQSKKGQVSPKLHNRTSSLSRNYDQDDIESGRIQHHGNNQLFKHHEPKMVVFLGGWFSHIKLASAGKTCIESLVRSSGTQH
Subjt:  DGDQLQFVRHVTTGYNVIRKQSKKGQVSPKLHNRTSSLSRNYDQDDIESGRIQHHGNNQLFKHHEPKMVVFLGGWFSHIKLASAGKTCIESLVRSSGTQH

Query:  SLVHGFKNLIQLCRQKQYYQKVSKRKRSILAMKLEFAFEKKVK
        SLVHGFKNLIQLCRQKQYYQKVSKRKRSILAMKLEFAFEKKVK
Subjt:  SLVHGFKNLIQLCRQKQYYQKVSKRKRSILAMKLEFAFEKKVK

XP_022940425.1 uncharacterized protein LOC111446036 [Cucurbita moschata]5.5e-11987.36Show/hide
Query:  MAIDGVCTLDVADFPPLPVVEKRRRSFSLLLSSLMIVGFSRKSGLYRQLPHQPLELSVLKLDGSCFDIRVKRSATVAELKGAVEA---------------
        MAIDGVCTLDVADFPPLPVVEKRRRSFSLLLSSLMIVGFSRKSGLYRQLPHQPLELSVLKLDGSCFDIRVKRSATVAELKGAVEA               
Subjt:  MAIDGVCTLDVADFPPLPVVEKRRRSFSLLLSSLMIVGFSRKSGLYRQLPHQPLELSVLKLDGSCFDIRVKRSATVAELKGAVEA---------------

Query:  --------------KLIDDTGYIAKLGIKDGDQLQFVRHVTTGYNVIRKQSKKGQVSPKLHNRTSSLSRNYDQDDIESGRIQHHGNNQLFKHHEPKMVVF
                      KLIDDTGYIAKLGIKDGDQLQFVRHVTTGYNVIRKQSKKGQVSPKLHNRTSSLSRNYDQDDIESGRIQHHGNNQLFKHHEPKMVVF
Subjt:  --------------KLIDDTGYIAKLGIKDGDQLQFVRHVTTGYNVIRKQSKKGQVSPKLHNRTSSLSRNYDQDDIESGRIQHHGNNQLFKHHEPKMVVF

Query:  LGGWFSHIKLASAGKTCIESLVRSSGTQHSLVHGFKNLIQLCRQKQYYQKVSKRKRSILAM
        LGGWFSHIKLASAGKTCIESLVRSSGT+HSLVHGFKNLIQLCRQKQ YQKVSKRKRSIL +
Subjt:  LGGWFSHIKLASAGKTCIESLVRSSGTQHSLVHGFKNLIQLCRQKQYYQKVSKRKRSILAM

XP_022982434.1 uncharacterized protein LOC111481262 [Cucurbita maxima]2.7e-11886.59Show/hide
Query:  MAIDGVCTLDVADFPPLPVVEKRRRSFSLLLSSLMIVGFSRKSGLYRQLPHQPLELSVLKLDGSCFDIRVKRSATVAELKGAVEA---------------
        MAIDGVCTLDV+DFPPLPVVEKRRRSFSLLLSSLMIVGFSRKSGLYRQLPHQPLELSVLKLDGSCFDIRVKRSATVAELKGAVEA               
Subjt:  MAIDGVCTLDVADFPPLPVVEKRRRSFSLLLSSLMIVGFSRKSGLYRQLPHQPLELSVLKLDGSCFDIRVKRSATVAELKGAVEA---------------

Query:  --------------KLIDDTGYIAKLGIKDGDQLQFVRHVTTGYNVIRKQSKKGQVSPKLHNRTSSLSRNYDQDDIESGRIQHHGNNQLFKHHEPKMVVF
                      KLIDDTGYIAKLGIKDGDQLQFVRHVTTGYNVIRKQSKKGQVSPKLHNRTSSLSRNYDQDDIESGRIQHHGNNQLFKHHEPKMVVF
Subjt:  --------------KLIDDTGYIAKLGIKDGDQLQFVRHVTTGYNVIRKQSKKGQVSPKLHNRTSSLSRNYDQDDIESGRIQHHGNNQLFKHHEPKMVVF

Query:  LGGWFSHIKLASAGKTCIESLVRSSGTQHSLVHGFKNLIQLCRQKQYYQKVSKRKRSILAM
        LGGWFSHIKLASAGKTCIESLVRSSGT HSLV GFKNLIQLCRQKQYYQKVSK+KRSIL +
Subjt:  LGGWFSHIKLASAGKTCIESLVRSSGTQHSLVHGFKNLIQLCRQKQYYQKVSKRKRSILAM

XP_023524342.1 U11/U12 small nuclear ribonucleoprotein 25 kDa protein-like [Cucurbita pepo subsp. pepo]6.1e-11886.59Show/hide
Query:  MAIDGVCTLDVADFPPLPVVEKRRRSFSLLLSSLMIVGFSRKSGLYRQLPHQPLELSVLKLDGSCFDIRVKRSATVAELKGAVEA---------------
        MAIDGVCTLDVADFPP PVVEKRR+SFSLLLSSLMIVGFSRKSGLYRQLPHQPLELSVLKLDGSCFDIRVKRSATVAELKGAVEA               
Subjt:  MAIDGVCTLDVADFPPLPVVEKRRRSFSLLLSSLMIVGFSRKSGLYRQLPHQPLELSVLKLDGSCFDIRVKRSATVAELKGAVEA---------------

Query:  --------------KLIDDTGYIAKLGIKDGDQLQFVRHVTTGYNVIRKQSKKGQVSPKLHNRTSSLSRNYDQDDIESGRIQHHGNNQLFKHHEPKMVVF
                      KLIDDTGYIAKLGIKDGDQLQFVRHVTTGYNVIRKQSKKGQVSPKL+NRTSSLSRNYDQDDIESGRIQHHGNNQLFKHHEPKMVVF
Subjt:  --------------KLIDDTGYIAKLGIKDGDQLQFVRHVTTGYNVIRKQSKKGQVSPKLHNRTSSLSRNYDQDDIESGRIQHHGNNQLFKHHEPKMVVF

Query:  LGGWFSHIKLASAGKTCIESLVRSSGTQHSLVHGFKNLIQLCRQKQYYQKVSKRKRSILAM
        LGGWFSHIKLASAGKTCIESLVRSSGT+HSLVHGFKNLIQLCRQKQYYQKVSKRKRSIL +
Subjt:  LGGWFSHIKLASAGKTCIESLVRSSGTQHSLVHGFKNLIQLCRQKQYYQKVSKRKRSILAM

XP_038899214.1 uncharacterized protein LOC120086568 [Benincasa hispida]1.6e-8665.7Show/hide
Query:  MAIDGVCTLDVADFPPLP-----VVEKRRRSFSLLLSSLMIVGFSRKSGLYRQLPHQPLELSVLKLDGSCFDIRVKRSATVAELKGAVEA----------
        MAIDGVCTLDV +FPP P      VEK RRSFSLL S LMIVG SRK+ LYRQLPHQPL LSVLKLDGSCFDI+VKRSATVAELKGAVE+          
Subjt:  MAIDGVCTLDVADFPPLP-----VVEKRRRSFSLLLSSLMIVGFSRKSGLYRQLPHQPLELSVLKLDGSCFDIRVKRSATVAELKGAVEA----------

Query:  -------------------KLIDDTGYIAKLGIKDGDQLQFVRHVTTGYNVIRKQSKKGQVSPKLHNRTSSLSRNYDQ-----------DDIESGRIQHH
                           KL+DD  YIA  GIKDGDQLQFVRHVTTGYN IRKQSKK  VS KL +R S  S+NY+Q           +DIESGR QHH
Subjt:  -------------------KLIDDTGYIAKLGIKDGDQLQFVRHVTTGYNVIRKQSKKGQVSPKLHNRTSSLSRNYDQ-----------DDIESGRIQHH

Query:  GNNQLFKHHEPKMVVFLGGWFSHIKLASAGKTCIESLVRSSGTQHSLVHGFKNLIQLCRQKQYYQKVSKRKRSILAM
        GNN+LF +HEPKMVVFLGGWFSH KLASAGKT I SLVR SGT+ SLV GFKNLIQLCR+K++Y+KV+K+KRSI+ +
Subjt:  GNNQLFKHHEPKMVVFLGGWFSHIKLASAGKTCIESLVRSSGTQHSLVHGFKNLIQLCRQKQYYQKVSKRKRSILAM

TrEMBL top hitse value%identityAlignment
A0A0A0LH97 Ubiquitin-like domain-containing protein1.2e-7963.18Show/hide
Query:  MAIDGVCTLDVADFPPL---PVVEKRRRSFSLLLSSLMIVGFSRKSGLYRQLPHQPLELSVLKLDGSCFDIRVKRSATVAELKGAVE-------------
        MA+DGVC LDV DFPPL      EK RRSFSLL S LMIVG SRK+ LYRQLPHQPL LSVLKLDGS FDI+V+RSATVAELKGAVE             
Subjt:  MAIDGVCTLDVADFPPL---PVVEKRRRSFSLLLSSLMIVGFSRKSGLYRQLPHQPLELSVLKLDGSCFDIRVKRSATVAELKGAVE-------------

Query:  ----------------AKLIDDTGYIAKLGIKDGDQLQFVRHVTTGYNVIRKQSKKGQVSPKLHNRTSSLSRN-----------YDQDDIESGRIQHHG-
                         KL+DD  YIA  GIKDGDQLQFVRHVTTGYNVIRKQSKK  VS KL +R SS S++           Y+ +DIESGR QH G 
Subjt:  ----------------AKLIDDTGYIAKLGIKDGDQLQFVRHVTTGYNVIRKQSKKGQVSPKLHNRTSSLSRN-----------YDQDDIESGRIQHHG-

Query:  -NNQLFKHHEPKMVVFLGGWFSHIKLASAGKTCIESLVRSSGTQHSLVHGFKNLIQLCRQKQYYQKVSKRKRSILAM
         NN LF +HEPKMVVFLGGWFSH KLASAGK  I+SLVR S T+ SLV GFKNLIQLCR+K++Y+KV+K+KRSI+ +
Subjt:  -NNQLFKHHEPKMVVFLGGWFSHIKLASAGKTCIESLVRSSGTQHSLVHGFKNLIQLCRQKQYYQKVSKRKRSILAM

A0A5A7T9I3 U11/U12 small nuclear ribonucleoprotein 25 kDa protein-like4.1e-8063.18Show/hide
Query:  MAIDGVCTLDVADFPP---LPVVEKRRRSFSLLLSSLMIVGFSRKSGLYRQLPHQPLELSVLKLDGSCFDIRVKRSATVAELKGAVE-------------
        MA+DGVCTLDV DFPP       EK RRSFSLL S LMIVG SRK+ LYRQLPHQPL LSVLKLDGSCFDI+V+RSATVAELKGAVE             
Subjt:  MAIDGVCTLDVADFPP---LPVVEKRRRSFSLLLSSLMIVGFSRKSGLYRQLPHQPLELSVLKLDGSCFDIRVKRSATVAELKGAVE-------------

Query:  ----------------AKLIDDTGYIAKLGIKDGDQLQFVRHVTTGYNVIRKQSKKGQVSPKLHNRTSSLSR-------------NYDQDDIESGRIQHH
                         KLIDD  YIA  GIKDGDQLQFVRHVTTGYNVIRKQSKK +VS KL +R SS S+             NY  +DIESGR QH+
Subjt:  ----------------AKLIDDTGYIAKLGIKDGDQLQFVRHVTTGYNVIRKQSKKGQVSPKLHNRTSSLSR-------------NYDQDDIESGRIQHH

Query:  GNNQLFKHHEPKMVVFLGGWFSHIKLASAGKTCIESLVRSSGTQHSLVHGFKNLIQLCRQKQYYQKVSKRKRSILAM
        GNN LF  ++PKMVVFLGGWFSH KLASAGK  I+SLVR S T+ SLV G KNLIQLCR+K++Y+KV+K+KRSI+ +
Subjt:  GNNQLFKHHEPKMVVFLGGWFSHIKLASAGKTCIESLVRSSGTQHSLVHGFKNLIQLCRQKQYYQKVSKRKRSILAM

A0A6J1BY98 uncharacterized protein LOC1110068317.5e-8264.34Show/hide
Query:  MAIDGVCTLDVADFPPLP----VVEKRRRSFSLLLSS-LMIVGFSRKSGLYRQLPHQPLELSVLKLDGSCFDIRVKRSATVAELKGAVEA----------
        MA+DGVC LD+ DFP  P     + KRRRSFSLL+SS LMIVG SRK+  YRQLPHQPL LSVLKLDGSCFDI+VK+SATVAELKGAVEA          
Subjt:  MAIDGVCTLDVADFPPLP----VVEKRRRSFSLLLSS-LMIVGFSRKSGLYRQLPHQPLELSVLKLDGSCFDIRVKRSATVAELKGAVEA----------

Query:  -------------------KLIDDTGYIAKLGIKDGDQLQFVRHVTTGYNVIRKQSKKGQVSPKLHNRTSSLSRNY---DQD---DIESGRIQHHGNNQL
                           KL+DDT YIA  GIKDGDQLQFVRHVTTGYNVIRKQS +  VS KLH+RTSS S+++   DQD   DIE GR QHH N +L
Subjt:  -------------------KLIDDTGYIAKLGIKDGDQLQFVRHVTTGYNVIRKQSKKGQVSPKLHNRTSSLSRNY---DQD---DIESGRIQHHGNNQL

Query:  FKHHEPKMVVFLGGWFSHIKLASAGKTCIESLVRSSGTQHSLVHGFKNLIQLCRQKQYYQKVSKRKRSILAM
        F   EPKMVVFLGGWFSH KLASAG+  I SLVR SG++ SLV GFKNL+QLCR+K++YQKVSK+KRSI+ +
Subjt:  FKHHEPKMVVFLGGWFSHIKLASAGKTCIESLVRSSGTQHSLVHGFKNLIQLCRQKQYYQKVSKRKRSILAM

A0A6J1FQJ1 uncharacterized protein LOC1114460362.7e-11987.36Show/hide
Query:  MAIDGVCTLDVADFPPLPVVEKRRRSFSLLLSSLMIVGFSRKSGLYRQLPHQPLELSVLKLDGSCFDIRVKRSATVAELKGAVEA---------------
        MAIDGVCTLDVADFPPLPVVEKRRRSFSLLLSSLMIVGFSRKSGLYRQLPHQPLELSVLKLDGSCFDIRVKRSATVAELKGAVEA               
Subjt:  MAIDGVCTLDVADFPPLPVVEKRRRSFSLLLSSLMIVGFSRKSGLYRQLPHQPLELSVLKLDGSCFDIRVKRSATVAELKGAVEA---------------

Query:  --------------KLIDDTGYIAKLGIKDGDQLQFVRHVTTGYNVIRKQSKKGQVSPKLHNRTSSLSRNYDQDDIESGRIQHHGNNQLFKHHEPKMVVF
                      KLIDDTGYIAKLGIKDGDQLQFVRHVTTGYNVIRKQSKKGQVSPKLHNRTSSLSRNYDQDDIESGRIQHHGNNQLFKHHEPKMVVF
Subjt:  --------------KLIDDTGYIAKLGIKDGDQLQFVRHVTTGYNVIRKQSKKGQVSPKLHNRTSSLSRNYDQDDIESGRIQHHGNNQLFKHHEPKMVVF

Query:  LGGWFSHIKLASAGKTCIESLVRSSGTQHSLVHGFKNLIQLCRQKQYYQKVSKRKRSILAM
        LGGWFSHIKLASAGKTCIESLVRSSGT+HSLVHGFKNLIQLCRQKQ YQKVSKRKRSIL +
Subjt:  LGGWFSHIKLASAGKTCIESLVRSSGTQHSLVHGFKNLIQLCRQKQYYQKVSKRKRSILAM

A0A6J1J4J2 uncharacterized protein LOC1114812621.3e-11886.59Show/hide
Query:  MAIDGVCTLDVADFPPLPVVEKRRRSFSLLLSSLMIVGFSRKSGLYRQLPHQPLELSVLKLDGSCFDIRVKRSATVAELKGAVEA---------------
        MAIDGVCTLDV+DFPPLPVVEKRRRSFSLLLSSLMIVGFSRKSGLYRQLPHQPLELSVLKLDGSCFDIRVKRSATVAELKGAVEA               
Subjt:  MAIDGVCTLDVADFPPLPVVEKRRRSFSLLLSSLMIVGFSRKSGLYRQLPHQPLELSVLKLDGSCFDIRVKRSATVAELKGAVEA---------------

Query:  --------------KLIDDTGYIAKLGIKDGDQLQFVRHVTTGYNVIRKQSKKGQVSPKLHNRTSSLSRNYDQDDIESGRIQHHGNNQLFKHHEPKMVVF
                      KLIDDTGYIAKLGIKDGDQLQFVRHVTTGYNVIRKQSKKGQVSPKLHNRTSSLSRNYDQDDIESGRIQHHGNNQLFKHHEPKMVVF
Subjt:  --------------KLIDDTGYIAKLGIKDGDQLQFVRHVTTGYNVIRKQSKKGQVSPKLHNRTSSLSRNYDQDDIESGRIQHHGNNQLFKHHEPKMVVF

Query:  LGGWFSHIKLASAGKTCIESLVRSSGTQHSLVHGFKNLIQLCRQKQYYQKVSKRKRSILAM
        LGGWFSHIKLASAGKTCIESLVRSSGT HSLV GFKNLIQLCRQKQYYQKVSK+KRSIL +
Subjt:  LGGWFSHIKLASAGKTCIESLVRSSGTQHSLVHGFKNLIQLCRQKQYYQKVSKRKRSILAM

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G80060.1 Ubiquitin-like superfamily protein1.5e-1027.23Show/hide
Query:  RKSGLYRQLPHQPLELSVLKLDGSCFDIRVKRSATVAELKGAVE-----------------------------AKLIDDTGYIAKLGIKDGDQLQFVRHV
        R+S   +  P   ++LSV+KL+GS FD+ V +  +VAELK AVE                              +L++D   I  LG+ DGDQL FVRH+
Subjt:  RKSGLYRQLPHQPLELSVLKLDGSCFDIRVKRSATVAELKGAVE-----------------------------AKLIDDTGYIAKLGIKDGDQLQFVRHV

Query:  TTGYNVIRKQSKKGQVSPKLHNRTSSL-----SRNYDQDDIES-GRIQHHGNNQLFKHHEPKMVVFLGGWFSHI---KLASAGKTCIESLVRSSGTQHSL
        +  ++ + K+SK       L    SS+      +N +Q+ ++      + G        E ++V  + GW  +     ++  G  C     RS  ++ SL
Subjt:  TTGYNVIRKQSKKGQVSPKLHNRTSSL-----SRNYDQDDIES-GRIQHHGNNQLFKHHEPKMVVFLGGWFSHI---KLASAGKTCIESLVRSSGTQHSL

Query:  VH
         H
Subjt:  VH

AT4G32270.1 Ubiquitin-like superfamily protein2.7e-1536.36Show/hide
Query:  DFPPLPVVEKRRRSFSLLLSSLMIVGF--SRKSGLYRQLPHQPLELSVLKLDGSCFDIRVKRSATVAELKGAVEA-------------------------
        D PP       RRS +  LS L ++     R+S  Y Q+P +P++L+VLKLDGS F I+V ++ATV ELK AVEA                         
Subjt:  DFPPLPVVEKRRRSFSLLLSSLMIVGF--SRKSGLYRQLPHQPLELSVLKLDGSCFDIRVKRSATVAELKGAVEA-------------------------

Query:  ----KLIDDTGYIAKLGIKDGDQLQFVRHVTTGYNVIRKQSKK
            +LI+++ Y+ + GIKDGDQL+F+RH++    ++ K   K
Subjt:  ----KLIDDTGYIAKLGIKDGDQLQFVRHVTTGYNVIRKQSKK

AT4G32270.2 Ubiquitin-like superfamily protein5.8e-1036.19Show/hide
Query:  LPHQPLELSVLKLDGSCFDIRVKRSATVAELKGAVEA-----------------------------KLIDDTGYIAKLGIKDGDQLQFVRHVTTGYNVIR
        +P +P++L+VLKLDGS F   V ++ATV ELK AVEA                             +LI+++ Y+ + GIKDGDQL+F+RH++    ++ 
Subjt:  LPHQPLELSVLKLDGSCFDIRVKRSATVAELKGAVEA-----------------------------KLIDDTGYIAKLGIKDGDQLQFVRHVTTGYNVIR

Query:  KQSKK
        K   K
Subjt:  KQSKK

AT5G25340.1 Ubiquitin-like superfamily protein2.5e-1335.1Show/hide
Query:  FSRKSGLYRQLPHQPLELSVLKLDGSCFDIRVKRSATVAELKGAVEA-----------------------------KLIDDTGYIAKLGIKDGDQLQFVR
        F R++  Y +LP++P+ LSVLKLDGS FD+ V  SATV +LK A+E                              KLI DT  I   G+KDGD+++F  
Subjt:  FSRKSGLYRQLPHQPLELSVLKLDGSCFDIRVKRSATVAELKGAVEA-----------------------------KLIDDTGYIAKLGIKDGDQLQFVR

Query:  HV------TTGYNVIRKQSKKGQVSPK----LHNRTSSLSRNYDQDDIESG
        HV      + GY+   KQ    +V PK    + NR   +  ++  DD+E G
Subjt:  HV------TTGYNVIRKQSKKGQVSPK----LHNRTSSLSRNYDQDDIESG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAATCGACGGCGTTTGCACACTCGATGTCGCAGATTTTCCGCCGCTTCCGGTGGTCGAGAAGCGTCGCCGGTCATTCTCGCTGCTCTTGTCTTCTTTGATGATTGT
TGGATTCTCGAGGAAGAGTGGTTTGTATCGGCAGCTTCCGCATCAGCCTCTCGAACTCTCCGTCCTTAAATTGGATGGCTCTTGTTTTGATATTCGAGTTAAGAGATCTG
CTACTGTTGCTGAATTAAAGGGGGCAGTGGAGGCAAAGTTGATTGATGATACGGGCTACATTGCAAAATTAGGCATCAAGGATGGTGATCAGCTTCAATTTGTCCGGCAT
GTCACAACTGGCTACAATGTTATAAGAAAGCAATCAAAGAAAGGCCAGGTTTCCCCAAAACTGCACAATAGGACATCATCTCTATCAAGAAACTATGATCAGGACGATAT
AGAGAGTGGAAGGATTCAACATCATGGCAACAACCAACTCTTCAAACACCACGAGCCAAAGATGGTGGTCTTTTTGGGAGGGTGGTTCTCCCACATCAAGCTGGCATCTG
CAGGGAAAACGTGCATCGAAAGCTTGGTTCGTTCGTCGGGAACTCAGCATAGTCTGGTTCATGGTTTTAAGAACTTAATTCAACTATGCCGCCAGAAACAATATTATCAG
AAAGTGAGTAAAAGGAAGAGATCCATCCTTGCAATGAAGCTGGAATTTGCATTTGAGAAGAAAGTGAAG
mRNA sequenceShow/hide mRNA sequence
ATGGCAATCGACGGCGTTTGCACACTCGATGTCGCAGATTTTCCGCCGCTTCCGGTGGTCGAGAAGCGTCGCCGGTCATTCTCGCTGCTCTTGTCTTCTTTGATGATTGT
TGGATTCTCGAGGAAGAGTGGTTTGTATCGGCAGCTTCCGCATCAGCCTCTCGAACTCTCCGTCCTTAAATTGGATGGCTCTTGTTTTGATATTCGAGTTAAGAGATCTG
CTACTGTTGCTGAATTAAAGGGGGCAGTGGAGGCAAAGTTGATTGATGATACGGGCTACATTGCAAAATTAGGCATCAAGGATGGTGATCAGCTTCAATTTGTCCGGCAT
GTCACAACTGGCTACAATGTTATAAGAAAGCAATCAAAGAAAGGCCAGGTTTCCCCAAAACTGCACAATAGGACATCATCTCTATCAAGAAACTATGATCAGGACGATAT
AGAGAGTGGAAGGATTCAACATCATGGCAACAACCAACTCTTCAAACACCACGAGCCAAAGATGGTGGTCTTTTTGGGAGGGTGGTTCTCCCACATCAAGCTGGCATCTG
CAGGGAAAACGTGCATCGAAAGCTTGGTTCGTTCGTCGGGAACTCAGCATAGTCTGGTTCATGGTTTTAAGAACTTAATTCAACTATGCCGCCAGAAACAATATTATCAG
AAAGTGAGTAAAAGGAAGAGATCCATCCTTGCAATGAAGCTGGAATTTGCATTTGAGAAGAAAGTGAAG
Protein sequenceShow/hide protein sequence
MAIDGVCTLDVADFPPLPVVEKRRRSFSLLLSSLMIVGFSRKSGLYRQLPHQPLELSVLKLDGSCFDIRVKRSATVAELKGAVEAKLIDDTGYIAKLGIKDGDQLQFVRH
VTTGYNVIRKQSKKGQVSPKLHNRTSSLSRNYDQDDIESGRIQHHGNNQLFKHHEPKMVVFLGGWFSHIKLASAGKTCIESLVRSSGTQHSLVHGFKNLIQLCRQKQYYQ
KVSKRKRSILAMKLEFAFEKKVK