; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmaCh02G018050 (gene) of Cucurbita maxima (Rimu) v1.1 genome

Gene IDCmaCh02G018050
OrganismCucurbita maxima Rimu (Cucurbita maxima (Rimu) v1.1)
DescriptionSmall nuclear ribonucleoprotein Sm D1
Genome locationCma_Chr02:10041144..10045843
RNA-Seq ExpressionCmaCh02G018050
SyntenyCmaCh02G018050
Gene Ontology termsGO:0000387 - spliceosomal snRNP assembly (biological process)
GO:0034715 - pICln-Sm protein complex (cellular component)
GO:0097526 - spliceosomal tri-snRNP complex (cellular component)
GO:0071013 - catalytic step 2 spliceosome (cellular component)
GO:0071011 - precatalytic spliceosome (cellular component)
GO:0034719 - SMN-Sm protein complex (cellular component)
GO:0000243 - commitment complex (cellular component)
GO:0005689 - U12-type spliceosomal complex (cellular component)
GO:0005687 - U4 snRNP (cellular component)
GO:0005686 - U2 snRNP (cellular component)
GO:0005685 - U1 snRNP (cellular component)
GO:0005682 - U5 snRNP (cellular component)
GO:0003723 - RNA binding (molecular function)
InterPro domainsIPR001163 - LSM domain, eukaryotic/archaea-type
IPR010920 - LSM domain superfamily
IPR027141 - Like-Sm (LSM) domain containing protein, LSm4/SmD1/SmD3
IPR034102 - Small nuclear ribonucleoprotein D1


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAF9686976.1 hypothetical protein SADUNF_Sadunf02G0046000 [Salix dunnii]1.6e-5562.79Show/hide
Query:  LQFLMKLNNETVSIELKNGTVVHGTITGVDISMNTHLKAVKLTLKGKNPVTMDHLSVRGNNIRYYILPDSLNLETLLVEETPRVKPKKPTAGMRCPCILP
        ++FLMKLNNETVSIELKNGTVVHGTITGVDISMNTHLK VKLT+KGKNPVT+DHLSVRGNNIRYYILPDSLNLETLLVEETPRVKPKKPTA         
Subjt:  LQFLMKLNNETVSIELKNGTVVHGTITGVDISMNTHLKAVKLTLKGKNPVTMDHLSVRGNNIRYYILPDSLNLETLLVEETPRVKPKKPTAGMRCPCILP

Query:  GSLWGEGEDEAVGVVVVVDARFVTVSNFDLHSCDKVLGVIQPTQQEKMGKKQSDTKQAAIVFGALALGWLAIEMAFKPFLDRVRSAMDNSDPAR---DTD
                  A G V  +              C K L   Q  ++EKMG K +  KQA   FGALA GWLA E+AFKPFLD+ RSA+D SDPAR   D D
Subjt:  GSLWGEGEDEAVGVVVVVDARFVTVSNFDLHSCDKVLGVIQPTQQEKMGKKQSDTKQAAIVFGALALGWLAIEMAFKPFLDRVRSAMDNSDPAR---DTD

Query:  DLADNQSDQRPSDDD
        D AD+     PS+ D
Subjt:  DLADNQSDQRPSDDD

KAG6606428.1 Small nuclear ribonucleoprotein SmD1b, partial [Cucurbita argyrosperma subsp. sororia]5.5e-6458.52Show/hide
Query:  LQFLMKLNNETVSIELKNGTVVHGTITGVDISMNTHLKAVKLTLKGKNPVTMDHLSVRGNNIRYYILPDSLNLETLLVEETPRVKPKKPTAGMRCPCILP
        ++FLMKLNNETVSIELKNGTVVHGTITGVDISMNTHLKAVKLTLKGKNPVTMDHLS                         P   P+          +L 
Subjt:  LQFLMKLNNETVSIELKNGTVVHGTITGVDISMNTHLKAVKLTLKGKNPVTMDHLSVRGNNIRYYILPDSLNLETLLVEETPRVKPKKPTAGMRCPCILP

Query:  GSLWGEGEDEAVGVVVVVDARFVTVSNFDLHSCDK-VLGVI-----------------------------------------------------QPTQQE
        GSLWG  EDEAVGVVVVVDARFV VSNFDL S +  +LG +                                                       T QE
Subjt:  GSLWGEGEDEAVGVVVVVDARFVTVSNFDLHSCDK-VLGVI-----------------------------------------------------QPTQQE

Query:  KMGKKQSDTKQAAIVFGALALGWLAIEMAFKPFLDRVRSAMDNSDPARDTDDLADNQSDQRPSDDDQSTT
        KMGKKQSDTKQAAIVFGALALGWLAIEMAFKP LDRVRSAMDN+DPARDTDDLADNQSDQRPSDDDQSTT
Subjt:  KMGKKQSDTKQAAIVFGALALGWLAIEMAFKPFLDRVRSAMDNSDPARDTDDLADNQSDQRPSDDDQSTT

OMP05040.1 hypothetical protein COLO4_09117 [Corchorus olitorius]2.3e-5460.19Show/hide
Query:  LQFLMKLNNETVSIELKNGTVVHGTITGVDISMNTHLKAVKLTLKGKNPVTMDHLSVRGNNIRYYILPDSLNLETLLVEETPRVKPKKPTAGMRCPCILP
        ++FLMKLNNETVSIELKNGTVV+GTITGVDISMNTHLK VKLTLKGKNPV++DHLSVRGNNIRYYILPDS+NLETLLVEETPRVKPKKPTA         
Subjt:  LQFLMKLNNETVSIELKNGTVVHGTITGVDISMNTHLKAVKLTLKGKNPVTMDHLSVRGNNIRYYILPDSLNLETLLVEETPRVKPKKPTAGMRCPCILP

Query:  GSLWGEGEDEAVGVVVVVDARFVTVSNFDLHSCDKVLGVIQPTQQEKMGKKQSDTKQAAIVFGALALGWLAIEMAFKPFLDRVRSAMDNSDPARDTDDLA
               E+E                              +  ++ KMGK  +  KQAA+VFGALA GWLAIEMAFKPFLD+ R +MD SDP RD DD  
Subjt:  GSLWGEGEDEAVGVVVVVDARFVTVSNFDLHSCDKVLGVIQPTQQEKMGKKQSDTKQAAIVFGALALGWLAIEMAFKPFLDRVRSAMDNSDPARDTDDLA

Query:  DNQSDQ-RPSDDDQST
         ++ D  R +DDD S+
Subjt:  DNQSDQ-RPSDDDQST

TXG63624.1 hypothetical protein EZV62_010618 [Acer yangbiense]1.0e-5462.62Show/hide
Query:  LQFLMKLNNETVSIELKNGTVVHGTITGVDISMNTHLKAVKLTLKGKNPVTMDHLSVRGNNIRYYILPDSLNLETLLVEETPRVKPKKPTAGMRCPCILP
        ++FLMKLNNETVSIELKNGT+VHGTITGVDISMNTHLKAVKLTLKGKNPVT+DHLSVRGNNIRYYILPDSLNLETLLVEETPRVKPKKPTA        P
Subjt:  LQFLMKLNNETVSIELKNGTVVHGTITGVDISMNTHLKAVKLTLKGKNPVTMDHLSVRGNNIRYYILPDSLNLETLLVEETPRVKPKKPTAGMRCPCILP

Query:  GSLWGEGEDEAVGVVVVVDARFVTVSNFDLHSCDKVLGVIQPTQQEKMGKKQSDTKQAAIVFGALALGWLAIEMAFKPFLDRVRSAMDNSDPARDTDDLA
        G +        VG  V  D R V                                K+A  VFGALALGWLAIE+A KPFLD+ R+AMD SDPARD DD+ 
Subjt:  GSLWGEGEDEAVGVVVVVDARFVTVSNFDLHSCDKVLGVIQPTQQEKMGKKQSDTKQAAIVFGALALGWLAIEMAFKPFLDRVRSAMDNSDPARDTDDLA

Query:  DNQSDQRPSDDDQS
        D  +++ PSD D S
Subjt:  DNQSDQRPSDDDQS

VFR02113.1 unnamed protein product [Cuscuta campestris]9.4e-5660.63Show/hide
Query:  LQFLMKLNNETVSIELKNGTVVHGTITGVDISMNTHLKAVKLTLKGKNPVTMDHLSVRGNNIRYYILPDSLNLETLLVEETPRVKPKKPTAGMRCPCILP
        ++FLMKLNNETVSIELKNGTVV+GTITGVD+SMNTHLK VKLT KGKNPVTMDHLSVRGNNIRYYILPDS+NLETLLVE+TP+VKPKKPTAG     +  
Subjt:  LQFLMKLNNETVSIELKNGTVVHGTITGVDISMNTHLKAVKLTLKGKNPVTMDHLSVRGNNIRYYILPDSLNLETLLVEETPRVKPKKPTAGMRCPCILP

Query:  GSLWGEGEDEA-----VGVVVVVDARFVTVSNFDLHSCDKVLGVIQPTQQEKMGKKQSDTKQAAIVFGALALGWLAIEMAFKPFLDRVRSAMDNSDPARD
        G   G G         V   +V   RF T+                   +EKMG+K S  K AA+VFGALA GWLAIE+AFKPFLD+ R+A+  SDP  D
Subjt:  GSLWGEGEDEA-----VGVVVVVDARFVTVSNFDLHSCDKVLGVIQPTQQEKMGKKQSDTKQAAIVFGALALGWLAIEMAFKPFLDRVRSAMDNSDPARD

Query:  TDDLADNQSDQRPSD-DDQST
         DD  +N     PSD DD S+
Subjt:  TDDLADNQSDQRPSD-DDQST

TrEMBL top hitse value%identityAlignment
A0A1R3KD71 Small nuclear ribonucleoprotein Sm D11.1e-5460.19Show/hide
Query:  LQFLMKLNNETVSIELKNGTVVHGTITGVDISMNTHLKAVKLTLKGKNPVTMDHLSVRGNNIRYYILPDSLNLETLLVEETPRVKPKKPTAGMRCPCILP
        ++FLMKLNNETVSIELKNGTVV+GTITGVDISMNTHLK VKLTLKGKNPV++DHLSVRGNNIRYYILPDS+NLETLLVEETPRVKPKKPTA         
Subjt:  LQFLMKLNNETVSIELKNGTVVHGTITGVDISMNTHLKAVKLTLKGKNPVTMDHLSVRGNNIRYYILPDSLNLETLLVEETPRVKPKKPTAGMRCPCILP

Query:  GSLWGEGEDEAVGVVVVVDARFVTVSNFDLHSCDKVLGVIQPTQQEKMGKKQSDTKQAAIVFGALALGWLAIEMAFKPFLDRVRSAMDNSDPARDTDDLA
               E+E                              +  ++ KMGK  +  KQAA+VFGALA GWLAIEMAFKPFLD+ R +MD SDP RD DD  
Subjt:  GSLWGEGEDEAVGVVVVVDARFVTVSNFDLHSCDKVLGVIQPTQQEKMGKKQSDTKQAAIVFGALALGWLAIEMAFKPFLDRVRSAMDNSDPARDTDDLA

Query:  DNQSDQ-RPSDDDQST
         ++ D  R +DDD S+
Subjt:  DNQSDQ-RPSDDDQST

A0A371G0W1 Small nuclear ribonucleoprotein Sm D1 (Fragment)7.3e-4675.37Show/hide
Query:  MKLNNETVSIELKNGTVVHGTITGVDISMNTHLKAVKLTLKGKNPVTMDHLSVRGNNIRYYILPDSLNLETLLVEETPRVKPKKPTAGMRCPCILPGSLW
        MKLNNETVSIELKNGTVVHGTITGVDISMNTHLK VKLTLKGKNPVT+DHLSVRGNNIRYYILPDSLNLETLLVEE PR+KPKKPTAG        GSLW
Subjt:  MKLNNETVSIELKNGTVVHGTITGVDISMNTHLKAVKLTLKGKNPVTMDHLSVRGNNIRYYILPDSLNLETLLVEETPRVKPKKPTAGMRCPCILPGSLW

Query:  GEGEDEAVGVVVVVDARFVTVSNFDLHSCDKVLG
        GEG  EAV V VVV  + + ++ F    C  ++G
Subjt:  GEGEDEAVGVVVVVDARFVTVSNFDLHSCDKVLG

A0A484NMA3 Small nuclear ribonucleoprotein Sm D14.5e-5660.63Show/hide
Query:  LQFLMKLNNETVSIELKNGTVVHGTITGVDISMNTHLKAVKLTLKGKNPVTMDHLSVRGNNIRYYILPDSLNLETLLVEETPRVKPKKPTAGMRCPCILP
        ++FLMKLNNETVSIELKNGTVV+GTITGVD+SMNTHLK VKLT KGKNPVTMDHLSVRGNNIRYYILPDS+NLETLLVE+TP+VKPKKPTAG     +  
Subjt:  LQFLMKLNNETVSIELKNGTVVHGTITGVDISMNTHLKAVKLTLKGKNPVTMDHLSVRGNNIRYYILPDSLNLETLLVEETPRVKPKKPTAGMRCPCILP

Query:  GSLWGEGEDEA-----VGVVVVVDARFVTVSNFDLHSCDKVLGVIQPTQQEKMGKKQSDTKQAAIVFGALALGWLAIEMAFKPFLDRVRSAMDNSDPARD
        G   G G         V   +V   RF T+                   +EKMG+K S  K AA+VFGALA GWLAIE+AFKPFLD+ R+A+  SDP  D
Subjt:  GSLWGEGEDEA-----VGVVVVVDARFVTVSNFDLHSCDKVLGVIQPTQQEKMGKKQSDTKQAAIVFGALALGWLAIEMAFKPFLDRVRSAMDNSDPARD

Query:  TDDLADNQSDQRPSD-DDQST
         DD  +N     PSD DD S+
Subjt:  TDDLADNQSDQRPSD-DDQST

A0A5C7I3G1 Small nuclear ribonucleoprotein Sm D15.0e-5562.62Show/hide
Query:  LQFLMKLNNETVSIELKNGTVVHGTITGVDISMNTHLKAVKLTLKGKNPVTMDHLSVRGNNIRYYILPDSLNLETLLVEETPRVKPKKPTAGMRCPCILP
        ++FLMKLNNETVSIELKNGT+VHGTITGVDISMNTHLKAVKLTLKGKNPVT+DHLSVRGNNIRYYILPDSLNLETLLVEETPRVKPKKPTA        P
Subjt:  LQFLMKLNNETVSIELKNGTVVHGTITGVDISMNTHLKAVKLTLKGKNPVTMDHLSVRGNNIRYYILPDSLNLETLLVEETPRVKPKKPTAGMRCPCILP

Query:  GSLWGEGEDEAVGVVVVVDARFVTVSNFDLHSCDKVLGVIQPTQQEKMGKKQSDTKQAAIVFGALALGWLAIEMAFKPFLDRVRSAMDNSDPARDTDDLA
        G +        VG  V  D R V                                K+A  VFGALALGWLAIE+A KPFLD+ R+AMD SDPARD DD+ 
Subjt:  GSLWGEGEDEAVGVVVVVDARFVTVSNFDLHSCDKVLGVIQPTQQEKMGKKQSDTKQAAIVFGALALGWLAIEMAFKPFLDRVRSAMDNSDPARDTDDLA

Query:  DNQSDQRPSDDDQS
        D  +++ PSD D S
Subjt:  DNQSDQRPSDDDQS

A0A5N6PG97 Small nuclear ribonucleoprotein Sm D14.2e-5458.99Show/hide
Query:  LQFLMKLNNETVSIELKNGTVVHGTITGVDISMNTHLKAVKLTLKGKNPVTMDHLSVRGNNIRYYILPDSLNLETLLVEETPRVKPKKPTAGMRCPCILP
        ++FLMKLNNETVSIELKNGTVVHGTITGVDISMNTHLK VKLTLKGKNPVT+DHLSVRGNNIRYYILPDSLNLETLLVEETPRVKPKKPTA         
Subjt:  LQFLMKLNNETVSIELKNGTVVHGTITGVDISMNTHLKAVKLTLKGKNPVTMDHLSVRGNNIRYYILPDSLNLETLLVEETPRVKPKKPTAGMRCPCILP

Query:  GSLWGEGEDEAVGVVVVVDARFVTVSN----FDLHSCDKVLGVIQPTQQEKMGKKQS-DTKQAAIVFGALALGWLAIEMAFKPFLDRVRSAMDNSDPARD
            G+ E     V  +VD +  ++ +        + +  + V   + +  M K +S   K A +V GALA GW AIE+AFKP+LD+ R++M+ SDP RD
Subjt:  GSLWGEGEDEAVGVVVVVDARFVTVSN----FDLHSCDKVLGVIQPTQQEKMGKKQS-DTKQAAIVFGALALGWLAIEMAFKPFLDRVRSAMDNSDPARD

Query:  TDDLADNQSDQRPSDDD
         DD AD  +D + S D+
Subjt:  TDDLADNQSDQRPSDDD

SwissProt top hitse value%identityAlignment
P62314 Small nuclear ribonucleoprotein Sm D11.8e-3071.43Show/hide
Query:  LQFLMKLNNETVSIELKNGTVVHGTITGVDISMNTHLKAVKLTLKGKNPVTMDHLSVRGNNIRYYILPDSLNLETLLVEETPRVKPKKPTA
        ++FLMKL++ETV+IELKNGT VHGTITGVD+SMNTHLKAVK+TLK + PV ++ LS+RGNNIRY+ILPDSL L+TLLV+  P+VK KK  A
Subjt:  LQFLMKLNNETVSIELKNGTVVHGTITGVDISMNTHLKAVKLTLKGKNPVTMDHLSVRGNNIRYYILPDSLNLETLLVEETPRVKPKKPTA

Q3ZC10 Small nuclear ribonucleoprotein Sm D11.8e-3071.43Show/hide
Query:  LQFLMKLNNETVSIELKNGTVVHGTITGVDISMNTHLKAVKLTLKGKNPVTMDHLSVRGNNIRYYILPDSLNLETLLVEETPRVKPKKPTA
        ++FLMKL++ETV+IELKNGT VHGTITGVD+SMNTHLKAVK+TLK + PV ++ LS+RGNNIRY+ILPDSL L+TLLV+  P+VK KK  A
Subjt:  LQFLMKLNNETVSIELKNGTVVHGTITGVDISMNTHLKAVKLTLKGKNPVTMDHLSVRGNNIRYYILPDSLNLETLLVEETPRVKPKKPTA

Q4R5F6 Small nuclear ribonucleoprotein Sm D11.8e-3071.43Show/hide
Query:  LQFLMKLNNETVSIELKNGTVVHGTITGVDISMNTHLKAVKLTLKGKNPVTMDHLSVRGNNIRYYILPDSLNLETLLVEETPRVKPKKPTA
        ++FLMKL++ETV+IELKNGT VHGTITGVD+SMNTHLKAVK+TLK + PV ++ LS+RGNNIRY+ILPDSL L+TLLV+  P+VK KK  A
Subjt:  LQFLMKLNNETVSIELKNGTVVHGTITGVDISMNTHLKAVKLTLKGKNPVTMDHLSVRGNNIRYYILPDSLNLETLLVEETPRVKPKKPTA

Q9SSF1 Small nuclear ribonucleoprotein SmD1a6.1e-4289.13Show/hide
Query:  LQFLMKLNNETVSIELKNGTVVHGTITGVDISMNTHLKAVKLTLKGKNPVTMDHLSVRGNNIRYYILPDSLNLETLLVEETPRVKPKKPTAG
        ++FLMKLNNETVSIELKNGTVVHGTITGVD+SMNTHLK VK++LKGKNPVT+DHLS+RGNNIRYYILPDSLNLETLLVE+TPRVKPKKP AG
Subjt:  LQFLMKLNNETVSIELKNGTVVHGTITGVDISMNTHLKAVKLTLKGKNPVTMDHLSVRGNNIRYYILPDSLNLETLLVEETPRVKPKKPTAG

Q9SY09 Small nuclear ribonucleoprotein SmD1b8.5e-4492.39Show/hide
Query:  LQFLMKLNNETVSIELKNGTVVHGTITGVDISMNTHLKAVKLTLKGKNPVTMDHLSVRGNNIRYYILPDSLNLETLLVEETPRVKPKKPTAG
        ++FLMKLNNETVSIELKNGT+VHGTITGVD+SMNTHLKAVKLTLKGKNPVT+DHLSVRGNNIRYYILPDSLNLETLLVE+TPR+KPKKPTAG
Subjt:  LQFLMKLNNETVSIELKNGTVVHGTITGVDISMNTHLKAVKLTLKGKNPVTMDHLSVRGNNIRYYILPDSLNLETLLVEETPRVKPKKPTAG

Arabidopsis top hitse value%identityAlignment
AT3G07590.1 Small nuclear ribonucleoprotein family protein4.3e-4389.13Show/hide
Query:  LQFLMKLNNETVSIELKNGTVVHGTITGVDISMNTHLKAVKLTLKGKNPVTMDHLSVRGNNIRYYILPDSLNLETLLVEETPRVKPKKPTAG
        ++FLMKLNNETVSIELKNGTVVHGTITGVD+SMNTHLK VK++LKGKNPVT+DHLS+RGNNIRYYILPDSLNLETLLVE+TPRVKPKKP AG
Subjt:  LQFLMKLNNETVSIELKNGTVVHGTITGVDISMNTHLKAVKLTLKGKNPVTMDHLSVRGNNIRYYILPDSLNLETLLVEETPRVKPKKPTAG

AT3G07590.2 Small nuclear ribonucleoprotein family protein4.3e-4389.13Show/hide
Query:  LQFLMKLNNETVSIELKNGTVVHGTITGVDISMNTHLKAVKLTLKGKNPVTMDHLSVRGNNIRYYILPDSLNLETLLVEETPRVKPKKPTAG
        ++FLMKLNNETVSIELKNGTVVHGTITGVD+SMNTHLK VK++LKGKNPVT+DHLS+RGNNIRYYILPDSLNLETLLVE+TPRVKPKKP AG
Subjt:  LQFLMKLNNETVSIELKNGTVVHGTITGVDISMNTHLKAVKLTLKGKNPVTMDHLSVRGNNIRYYILPDSLNLETLLVEETPRVKPKKPTAG

AT3G52420.1 outer envelope membrane protein 72.6e-1164.58Show/hide
Query:  KQSDTKQAAIVFGALALGWLAIEMAFKPFLDRVRSAMDNSDPARDTDD
        K S  KQA +V  A+ALGWLAIE+AFKPFLD+ RS++D SDP +D DD
Subjt:  KQSDTKQAAIVFGALALGWLAIEMAFKPFLDRVRSAMDNSDPARDTDD

AT4G02840.1 Small nuclear ribonucleoprotein family protein6.1e-4592.39Show/hide
Query:  LQFLMKLNNETVSIELKNGTVVHGTITGVDISMNTHLKAVKLTLKGKNPVTMDHLSVRGNNIRYYILPDSLNLETLLVEETPRVKPKKPTAG
        ++FLMKLNNETVSIELKNGT+VHGTITGVD+SMNTHLKAVKLTLKGKNPVT+DHLSVRGNNIRYYILPDSLNLETLLVE+TPR+KPKKPTAG
Subjt:  LQFLMKLNNETVSIELKNGTVVHGTITGVDISMNTHLKAVKLTLKGKNPVTMDHLSVRGNNIRYYILPDSLNLETLLVEETPRVKPKKPTAG

AT4G02840.2 Small nuclear ribonucleoprotein family protein1.7e-4283.33Show/hide
Query:  LQFLMKLNNETVSIELKNGTVVHGTIT----------GVDISMNTHLKAVKLTLKGKNPVTMDHLSVRGNNIRYYILPDSLNLETLLVEETPRVKPKKPT
        ++FLMKLNNETVSIELKNGT+VHGTIT          GVD+SMNTHLKAVKLTLKGKNPVT+DHLSVRGNNIRYYILPDSLNLETLLVE+TPR+KPKKPT
Subjt:  LQFLMKLNNETVSIELKNGTVVHGTIT----------GVDISMNTHLKAVKLTLKGKNPVTMDHLSVRGNNIRYYILPDSLNLETLLVEETPRVKPKKPT

Query:  AG
        AG
Subjt:  AG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGAACGGCAAAAAGGAAGGAAGCTCACTGATAAAAATGGTGAAACCAGGAGGACGAAGGGAAAGACGAGCGGAAATCGGAGCAGCAGAGACGGAGAGTGTA
GGGAGCGACGGTAATGGTTGTACCTGTACGCGCCAGCTGCAGTTTTTGATGAAGCTCAACAATGAGACAGTCTCAATCGAGCTGAAAAATGGAACCGTTGTCCAT
GGCACCATCACAGGTGTGGATATCAGCATGAATACACATTTAAAGGCTGTGAAGCTTACTCTAAAGGGGAAAAATCCAGTTACCATGGATCATTTAAGTGTGAGG
GGAAACAACATCAGATATTATATTCTACCTGACAGCTTGAATCTTGAGACTTTACTTGTTGAAGAGACACCCAGGGTCAAGCCCAAGAAACCAACTGCAGGTATG
CGCTGTCCTTGCATTTTGCCTGGAAGCCTTTGGGGCGAGGGCGAGGACGAGGCCGTGGGCGTGGTCGTGGTCGTGGACGCTAGATTTGTTACAGTTTCAAATTTT
GATCTCCATAGTTGTGATAAAGTTCTTGGTGTGATACAGCCGACACAGCAAGAAAAAATGGGAAAGAAACAGTCTGATACCAAGCAAGCAGCTATCGTCTTTGGA
GCCTTAGCGCTGGGTTGGCTCGCCATTGAGATGGCTTTCAAGCCCTTCCTCGATAGGGTCCGCTCCGCCATGGACAACTCTGATCCGGCTCGGGATACCGACGAC
CTTGCCGATAACCAATCTGATCAGAGGCCATCTGATGACGATCAAAGTACCACCGCTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGAGAACGGCAAAAAGGAAGGAAGCTCACTGATAAAAATGGTGAAACCAGGAGGACGAAGGGAAAGACGAGCGGAAATCGGAGCAGCAGAGACGGAGAGTGTA
GGGAGCGACGGTAATGGTTGTACCTGTACGCGCCAGCTGCAGTTTTTGATGAAGCTCAACAATGAGACAGTCTCAATCGAGCTGAAAAATGGAACCGTTGTCCAT
GGCACCATCACAGGTGTGGATATCAGCATGAATACACATTTAAAGGCTGTGAAGCTTACTCTAAAGGGGAAAAATCCAGTTACCATGGATCATTTAAGTGTGAGG
GGAAACAACATCAGATATTATATTCTACCTGACAGCTTGAATCTTGAGACTTTACTTGTTGAAGAGACACCCAGGGTCAAGCCCAAGAAACCAACTGCAGGTATG
CGCTGTCCTTGCATTTTGCCTGGAAGCCTTTGGGGCGAGGGCGAGGACGAGGCCGTGGGCGTGGTCGTGGTCGTGGACGCTAGATTTGTTACAGTTTCAAATTTT
GATCTCCATAGTTGTGATAAAGTTCTTGGTGTGATACAGCCGACACAGCAAGAAAAAATGGGAAAGAAACAGTCTGATACCAAGCAAGCAGCTATCGTCTTTGGA
GCCTTAGCGCTGGGTTGGCTCGCCATTGAGATGGCTTTCAAGCCCTTCCTCGATAGGGTCCGCTCCGCCATGGACAACTCTGATCCGGCTCGGGATACCGACGAC
CTTGCCGATAACCAATCTGATCAGAGGCCATCTGATGACGATCAAAGTACCACCGCTTAA
Protein sequenceShow/hide protein sequence
MENGKKEGSSLIKMVKPGGRRERRAEIGAAETESVGSDGNGCTCTRQLQFLMKLNNETVSIELKNGTVVHGTITGVDISMNTHLKAVKLTLKGKNPVTMDHLSVR
GNNIRYYILPDSLNLETLLVEETPRVKPKKPTAGMRCPCILPGSLWGEGEDEAVGVVVVVDARFVTVSNFDLHSCDKVLGVIQPTQQEKMGKKQSDTKQAAIVFG
ALALGWLAIEMAFKPFLDRVRSAMDNSDPARDTDDLADNQSDQRPSDDDQSTTA