CuGenDBv2

Carg18146 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene ID: Carg18146
Organism: Cucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
Description: DUF4228 domain-containing protein
Genome location: Carg_Chr11:633226..634185
RNA-Seq Expression: Carg18146
Synteny: Carg18146
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: IPR025322 - Protein of unknown function DUF4228, plant
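
The genome location field uses a `seqid:start..end` layout. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not state this), the span can be derived as follows. The helper name `parse_location` is hypothetical and not part of any CuGenDB tool.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


seqid, start, end = parse_location("Carg_Chr11:633226..634185")
# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end - start + 1
print(seqid, start, end, span)
```

Under that assumption the locus covers 960 bp, which is longer than the 720-nt mRNA listed under Sequences.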


Homology
GenBank top hits (e value, % identity, alignment)
KAG6587486.1 hypothetical protein SDJN03_16051, partial [Cucurbita argyrosperma subsp. sororia] | e value: 8.3e-103 | % identity: 100
Query:  MGNCQAIDAATLVIQHPSGKVDKLYWPVTAREIMKMNPGHYVALLISTTMVTPNESSNNNETSKVEKKENEASKVEKKENEASKVEKKENEGSSNSVRLT
        MGNCQAIDAATLVIQHPSGKVDKLYWPVTAREIMKMNPGHYVALLISTTMVTPNESSNNNETSKVEKKENEASKVEKKENEASKVEKKENEGSSNSVRLT
Subjt:  MGNCQAIDAATLVIQHPSGKVDKLYWPVTAREIMKMNPGHYVALLISTTMVTPNESSNNNETSKVEKKENEASKVEKKENEASKVEKKENEGSSNSVRLT

Query:  RIKLLRPADTLVLGQVYRLITSKEVMSGLSAKKQAKVKQSQLEAAEKAGRRKERAARGSDSAAAAAAGRSVSEDPFQATKHEKNNRPRTSTSTTSAIARS
        RIKLLRPADTLVLGQVYRLITSKEVMSGLSAKKQAKVKQSQLEAAEKAGRRKERAARGSDSAAAAAAGRSVSEDPFQATKHEKNNRPRTSTSTTSAIARS
Subjt:  RIKLLRPADTLVLGQVYRLITSKEVMSGLSAKKQAKVKQSQLEAAEKAGRRKERAARGSDSAAAAAAGRSVSEDPFQATKHEKNNRPRTSTSTTSAIARS

Query:  RTWQPSLHSISEAGS
        RTWQPSLHSISEAGS
Subjt:  RTWQPSLHSISEAGS

XP_022933525.1 uncharacterized protein LOC111440926 isoform X1 [Cucurbita moschata] | e value: 4.1e-102 | % identity: 99.53
Query:  MGNCQAIDAATLVIQHPSGKVDKLYWPVTAREIMKMNPGHYVALLISTTMVTPNESSNNNETSKVEKKENEASKVEKKENEASKVEKKENEGSSNSVRLT
        MGNCQAIDAATLVIQHPSGKVDKLYWPVTAREIMKMNPGHYVALLISTTMVTPNESSNNNETSKVEKKENEASKVEKKENEASKVEKKENE SSNSVRLT
Subjt:  MGNCQAIDAATLVIQHPSGKVDKLYWPVTAREIMKMNPGHYVALLISTTMVTPNESSNNNETSKVEKKENEASKVEKKENEASKVEKKENEGSSNSVRLT

Query:  RIKLLRPADTLVLGQVYRLITSKEVMSGLSAKKQAKVKQSQLEAAEKAGRRKERAARGSDSAAAAAAGRSVSEDPFQATKHEKNNRPRTSTSTTSAIARS
        RIKLLRPADTLVLGQVYRLITSKEVMSGLSAKKQAKVKQSQLEAAEKAGRRKERAARGSDSAAAAAAGRSVSEDPFQATKHEKNNRPRTSTSTTSAIARS
Subjt:  RIKLLRPADTLVLGQVYRLITSKEVMSGLSAKKQAKVKQSQLEAAEKAGRRKERAARGSDSAAAAAAGRSVSEDPFQATKHEKNNRPRTSTSTTSAIARS

Query:  RTWQPSLHSISEAGS
        RTWQPSLHSISEAGS
Subjt:  RTWQPSLHSISEAGS

XP_022933526.1 uncharacterized protein LOC111440926 isoform X2 [Cucurbita moschata] | e value: 1.7e-95 | % identity: 95.35
Query:  MGNCQAIDAATLVIQHPSGKVDKLYWPVTAREIMKMNPGHYVALLISTTMVTPNESSNNNETSKVEKKENEASKVEKKENEASKVEKKENEGSSNSVRLT
        MGNCQAIDAATLVIQHPSGKVDKLYWPVTAREIMKMNPGHYVALLISTTMVTPNESSNNNETSKVEKKENEASKVEKKENEA          SSNSVRLT
Subjt:  MGNCQAIDAATLVIQHPSGKVDKLYWPVTAREIMKMNPGHYVALLISTTMVTPNESSNNNETSKVEKKENEASKVEKKENEASKVEKKENEGSSNSVRLT

Query:  RIKLLRPADTLVLGQVYRLITSKEVMSGLSAKKQAKVKQSQLEAAEKAGRRKERAARGSDSAAAAAAGRSVSEDPFQATKHEKNNRPRTSTSTTSAIARS
        RIKLLRPADTLVLGQVYRLITSKEVMSGLSAKKQAKVKQSQLEAAEKAGRRKERAARGSDSAAAAAAGRSVSEDPFQATKHEKNNRPRTSTSTTSAIARS
Subjt:  RIKLLRPADTLVLGQVYRLITSKEVMSGLSAKKQAKVKQSQLEAAEKAGRRKERAARGSDSAAAAAAGRSVSEDPFQATKHEKNNRPRTSTSTTSAIARS

Query:  RTWQPSLHSISEAGS
        RTWQPSLHSISEAGS
Subjt:  RTWQPSLHSISEAGS

XP_022965761.1 uncharacterized protein LOC111465553 isoform X1 [Cucurbita maxima] | e value: 3.5e-101 | % identity: 98.6
Query:  MGNCQAIDAATLVIQHPSGKVDKLYWPVTAREIMKMNPGHYVALLISTTMVTPNESSNNNETSKVEKKENEASKVEKKENEASKVEKKENEGSSNSVRLT
        MGNCQAIDAATLVIQHPSGKVDKLYWPVTAREIMKMNPGHYVALLISTTMVTPNESSNNNETSKVEKKENEA KVEKKENEA KVEKKENEGSSNSVRLT
Subjt:  MGNCQAIDAATLVIQHPSGKVDKLYWPVTAREIMKMNPGHYVALLISTTMVTPNESSNNNETSKVEKKENEASKVEKKENEASKVEKKENEGSSNSVRLT

Query:  RIKLLRPADTLVLGQVYRLITSKEVMSGLSAKKQAKVKQSQLEAAEKAGRRKERAARGSDSAAAAAAGRSVSEDPFQATKHEKNNRPRTSTSTTSAIARS
        RIKLLRPADTLVLGQVYRLITSKEVMSGLSAKKQAKVKQSQLEAAEKAGRRKERAARGSDSAA AAAGRSVSEDPFQATKHEKNNRPRTSTSTTSAIARS
Subjt:  RIKLLRPADTLVLGQVYRLITSKEVMSGLSAKKQAKVKQSQLEAAEKAGRRKERAARGSDSAAAAAAGRSVSEDPFQATKHEKNNRPRTSTSTTSAIARS

Query:  RTWQPSLHSISEAGS
        RTWQPSLHSISEAGS
Subjt:  RTWQPSLHSISEAGS

XP_023529445.1 uncharacterized protein LOC111792303 [Cucurbita pepo subsp. pepo] | e value: 2.0e-101 | % identity: 98.6
Query:  MGNCQAIDAATLVIQHPSGKVDKLYWPVTAREIMKMNPGHYVALLISTTMVTPNESSNNNETSKVEKKENEASKVEKKENEASKVEKKENEGSSNSVRLT
        MGNCQAIDAATLVIQHPSGKVDKLYWPVTAREIMKMNPGHYVALLISTTMVTPNESSNNNETSKVEKKENEASKVEKKENE SKVEKKENEGSSNSVRLT
Subjt:  MGNCQAIDAATLVIQHPSGKVDKLYWPVTAREIMKMNPGHYVALLISTTMVTPNESSNNNETSKVEKKENEASKVEKKENEASKVEKKENEGSSNSVRLT

Query:  RIKLLRPADTLVLGQVYRLITSKEVMSGLSAKKQAKVKQSQLEAAEKAGRRKERAARGSDSAAAAAAGRSVSEDPFQATKHEKNNRPRTSTSTTSAIARS
        RIKLLRPADTLVLGQVYRLITSKEVMSGLSAKKQAKVKQSQLEAAEKAGRRKERAARGSDSAAA AAGRSVSEDPFQATKHEKNNRPRTSTSTTSAI RS
Subjt:  RIKLLRPADTLVLGQVYRLITSKEVMSGLSAKKQAKVKQSQLEAAEKAGRRKERAARGSDSAAAAAAGRSVSEDPFQATKHEKNNRPRTSTSTTSAIARS

Query:  RTWQPSLHSISEAGS
        RTWQPSLHSISEAGS
Subjt:  RTWQPSLHSISEAGS

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LRI4 Uncharacterized protein | e value: 2.3e-74 | % identity: 78.14
Query:  MGNCQAIDAATLVIQHPSGKVDKLYWPVTAREIMKMNPGHYVALLISTTMVTPNESSNNNETSKVEKKENEASKVEKKENEASKVEKKENEGSSNSVRLT
        MGNCQAIDAATLVIQHPSGK DKLYWPVTAREIMKMNPGHYVALLISTTM TPNES+NNN+TS                          NE SSNSVRLT
Subjt:  MGNCQAIDAATLVIQHPSGKVDKLYWPVTAREIMKMNPGHYVALLISTTMVTPNESSNNNETSKVEKKENEASKVEKKENEASKVEKKENEGSSNSVRLT

Query:  RIKLLRPADTLVLGQVYRLITSKEVMSGLSAKKQAKVKQSQLEAAEKAGRRKERAARGSDSAAAAAAGRSVSEDPFQATKHEKNNRPRTSTSTTSAIARS
        RIKLLRPAD LVLGQVYRLIT++EVM GLSAKKQAKVKQSQLEAA+K  RRK+R  R SD AAAAAAGRSVSED  QA KHEKNNRPRTSTSTTSA ARS
Subjt:  RIKLLRPADTLVLGQVYRLITSKEVMSGLSAKKQAKVKQSQLEAAEKAGRRKERAARGSDSAAAAAAGRSVSEDPFQATKHEKNNRPRTSTSTTSAIARS

Query:  RTWQPSLHSISEAGS
        RTWQPSLHSISEAGS
Subjt:  RTWQPSLHSISEAGS

A0A6J1EZA8 uncharacterized protein LOC111440926 isoform X2 | e value: 8.1e-96 | % identity: 95.35
Query:  MGNCQAIDAATLVIQHPSGKVDKLYWPVTAREIMKMNPGHYVALLISTTMVTPNESSNNNETSKVEKKENEASKVEKKENEASKVEKKENEGSSNSVRLT
        MGNCQAIDAATLVIQHPSGKVDKLYWPVTAREIMKMNPGHYVALLISTTMVTPNESSNNNETSKVEKKENEASKVEKKENEA          SSNSVRLT
Subjt:  MGNCQAIDAATLVIQHPSGKVDKLYWPVTAREIMKMNPGHYVALLISTTMVTPNESSNNNETSKVEKKENEASKVEKKENEASKVEKKENEGSSNSVRLT

Query:  RIKLLRPADTLVLGQVYRLITSKEVMSGLSAKKQAKVKQSQLEAAEKAGRRKERAARGSDSAAAAAAGRSVSEDPFQATKHEKNNRPRTSTSTTSAIARS
        RIKLLRPADTLVLGQVYRLITSKEVMSGLSAKKQAKVKQSQLEAAEKAGRRKERAARGSDSAAAAAAGRSVSEDPFQATKHEKNNRPRTSTSTTSAIARS
Subjt:  RIKLLRPADTLVLGQVYRLITSKEVMSGLSAKKQAKVKQSQLEAAEKAGRRKERAARGSDSAAAAAAGRSVSEDPFQATKHEKNNRPRTSTSTTSAIARS

Query:  RTWQPSLHSISEAGS
        RTWQPSLHSISEAGS
Subjt:  RTWQPSLHSISEAGS

A0A6J1F544 uncharacterized protein LOC111440926 isoform X1 | e value: 2.0e-102 | % identity: 99.53
Query:  MGNCQAIDAATLVIQHPSGKVDKLYWPVTAREIMKMNPGHYVALLISTTMVTPNESSNNNETSKVEKKENEASKVEKKENEASKVEKKENEGSSNSVRLT
        MGNCQAIDAATLVIQHPSGKVDKLYWPVTAREIMKMNPGHYVALLISTTMVTPNESSNNNETSKVEKKENEASKVEKKENEASKVEKKENE SSNSVRLT
Subjt:  MGNCQAIDAATLVIQHPSGKVDKLYWPVTAREIMKMNPGHYVALLISTTMVTPNESSNNNETSKVEKKENEASKVEKKENEASKVEKKENEGSSNSVRLT

Query:  RIKLLRPADTLVLGQVYRLITSKEVMSGLSAKKQAKVKQSQLEAAEKAGRRKERAARGSDSAAAAAAGRSVSEDPFQATKHEKNNRPRTSTSTTSAIARS
        RIKLLRPADTLVLGQVYRLITSKEVMSGLSAKKQAKVKQSQLEAAEKAGRRKERAARGSDSAAAAAAGRSVSEDPFQATKHEKNNRPRTSTSTTSAIARS
Subjt:  RIKLLRPADTLVLGQVYRLITSKEVMSGLSAKKQAKVKQSQLEAAEKAGRRKERAARGSDSAAAAAAGRSVSEDPFQATKHEKNNRPRTSTSTTSAIARS

Query:  RTWQPSLHSISEAGS
        RTWQPSLHSISEAGS
Subjt:  RTWQPSLHSISEAGS

A0A6J1HMJ8 uncharacterized protein LOC111465553 isoform X1 | e value: 1.7e-101 | % identity: 98.6
Query:  MGNCQAIDAATLVIQHPSGKVDKLYWPVTAREIMKMNPGHYVALLISTTMVTPNESSNNNETSKVEKKENEASKVEKKENEASKVEKKENEGSSNSVRLT
        MGNCQAIDAATLVIQHPSGKVDKLYWPVTAREIMKMNPGHYVALLISTTMVTPNESSNNNETSKVEKKENEA KVEKKENEA KVEKKENEGSSNSVRLT
Subjt:  MGNCQAIDAATLVIQHPSGKVDKLYWPVTAREIMKMNPGHYVALLISTTMVTPNESSNNNETSKVEKKENEASKVEKKENEASKVEKKENEGSSNSVRLT

Query:  RIKLLRPADTLVLGQVYRLITSKEVMSGLSAKKQAKVKQSQLEAAEKAGRRKERAARGSDSAAAAAAGRSVSEDPFQATKHEKNNRPRTSTSTTSAIARS
        RIKLLRPADTLVLGQVYRLITSKEVMSGLSAKKQAKVKQSQLEAAEKAGRRKERAARGSDSAA AAAGRSVSEDPFQATKHEKNNRPRTSTSTTSAIARS
Subjt:  RIKLLRPADTLVLGQVYRLITSKEVMSGLSAKKQAKVKQSQLEAAEKAGRRKERAARGSDSAAAAAAGRSVSEDPFQATKHEKNNRPRTSTSTTSAIARS

Query:  RTWQPSLHSISEAGS
        RTWQPSLHSISEAGS
Subjt:  RTWQPSLHSISEAGS

A0A6J1HRW5 uncharacterized protein LOC111465553 isoform X2 | e value: 3.1e-95 | % identity: 94.42
Query:  MGNCQAIDAATLVIQHPSGKVDKLYWPVTAREIMKMNPGHYVALLISTTMVTPNESSNNNETSKVEKKENEASKVEKKENEASKVEKKENEGSSNSVRLT
        MGNCQAIDAATLVIQHPSGKVDKLYWPVTAREIMKMNPGHYVALLISTTMVTPNESSNNNET          SKVEKKENEA KVEKKENEGSSNSVRLT
Subjt:  MGNCQAIDAATLVIQHPSGKVDKLYWPVTAREIMKMNPGHYVALLISTTMVTPNESSNNNETSKVEKKENEASKVEKKENEASKVEKKENEGSSNSVRLT

Query:  RIKLLRPADTLVLGQVYRLITSKEVMSGLSAKKQAKVKQSQLEAAEKAGRRKERAARGSDSAAAAAAGRSVSEDPFQATKHEKNNRPRTSTSTTSAIARS
        RIKLLRPADTLVLGQVYRLITSKEVMSGLSAKKQAKVKQSQLEAAEKAGRRKERAARGSDSAA AAAGRSVSEDPFQATKHEKNNRPRTSTSTTSAIARS
Subjt:  RIKLLRPADTLVLGQVYRLITSKEVMSGLSAKKQAKVKQSQLEAAEKAGRRKERAARGSDSAAAAAAGRSVSEDPFQATKHEKNNRPRTSTSTTSAIARS

Query:  RTWQPSLHSISEAGS
        RTWQPSLHSISEAGS
Subjt:  RTWQPSLHSISEAGS

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT1G10530.1 unknown protein | e value: 1.4e-23 | % identity: 37.5
Query:  MGNCQAIDAATLVIQHPSGKVDKLYWPVTAREIMKMNPGHYVALLISTTMVTPNESSNNNETSKVEKKENEASKVEKKENEASKVEKKENEGSSNSVRLT
        MGNCQA++AA LV+QHP G +D+ Y  V+  E+M M PGHYV+L+I  +                          E++E      EK +++    +VR T
Subjt:  MGNCQAIDAATLVIQHPSGKVDKLYWPVTAREIMKMNPGHYVALLISTTMVTPNESSNNNETSKVEKKENEASKVEKKENEASKVEKKENEGSSNSVRLT

Query:  RIKLLRPADTLVLGQVYRLITSKEVMSGLSAKKQAKVKQSQLEAAEKAGRRKERAARGSDSAAAAAAGRSVSEDPFQATKHEKNNRP-RTSTSTTSAIAR
        R++LLRP + LVLG  YRLITS+EVM  L  KK AK K+ Q+E    A                    +  S+      K  K  R  R STS    + +
Subjt:  RIKLLRPADTLVLGQVYRLITSKEVMSGLSAKKQAKVKQSQLEAAEKAGRRKERAARGSDSAAAAAAGRSVSEDPFQATKHEKNNRP-RTSTSTTSAIAR

Query:  SRTWQPSLHSISEAGS
        S+TW+PSL SISEA S
Subjt:  SRTWQPSLHSISEAGS

AT1G60010.1 unknown protein | e value: 1.9e-28 | % identity: 40.93
Query:  MGNCQAIDAATLVIQHPSGKVDKLYWPVTAREIMKMNPGHYVALLISTTMVTPNESSNNNETSKVEKKENEASKVEKKENEASKVEKKENEGSSNSVRLT
        MGNCQA+DAA LV+QHP GK+D+ Y PV+  EIM+M PGHYV+L+I      P ++     T+  +K E +                         VR T
Subjt:  MGNCQAIDAATLVIQHPSGKVDKLYWPVTAREIMKMNPGHYVALLISTTMVTPNESSNNNETSKVEKKENEASKVEKKENEASKVEKKENEGSSNSVRLT

Query:  RIKLLRPADTLVLGQVYRLITSKEVMSGLSAKKQAKVKQSQLEAAEKAGRRKERAARGSDSAAAAAAGRSVSEDPFQATKHEKNNRPRTSTSTTSAIARS
        R+KLLRP + LVLG  YRLITS+EVM  L AKK AK K+ Q E +      KE+    S+        + + E+  +    E  +  + S  T SA +RS
Subjt:  RIKLLRPADTLVLGQVYRLITSKEVMSGLSAKKQAKVKQSQLEAAEKAGRRKERAARGSDSAAAAAAGRSVSEDPFQATKHEKNNRPRTSTSTTSAIARS

Query:  RTWQPSLHSISEAGS
        +TW+PSL SISEA S
Subjt:  RTWQPSLHSISEAGS

AT5G50090.1 unknown protein | e value: 3.8e-29 | % identity: 42.79
Query:  MGNCQAIDAATLVIQHPSGKVDKLYWPVTAREIMKMNPGHYVALLISTTMVTPNESSNNNETSKVEKKENEASKVEKKENEASKVEKKENEGSSNSVRLT
        MGNCQA+D A +VIQHP+GK +KL  PV+A  +MKMNPGH V+LLISTT ++   S                                   G    +RLT
Subjt:  MGNCQAIDAATLVIQHPSGKVDKLYWPVTAREIMKMNPGHYVALLISTTMVTPNESSNNNETSKVEKKENEASKVEKKENEASKVEKKENEGSSNSVRLT

Query:  RIKLLRPADTLVLGQVYRLITSKEVMSGLSAKKQAKVKQSQLEAAEKAGRRKERAARGSDSAAAAAAGRSVSEDPFQATKHEKNNRPRTSTSTTSAIARS
        RIKLLRP DTLVLG VYRLIT+KEVM GL AKK +K+K+    + +K            +   A  + +  +ED  Q  K EK  R R           S
Subjt:  RIKLLRPADTLVLGQVYRLITSKEVMSGLSAKKQAKVKQSQLEAAEKAGRRKERAARGSDSAAAAAAGRSVSEDPFQATKHEKNNRPRTSTSTTSAIARS

Query:  RTWQPSLHSISEAGS
        R+WQPSL SISE GS
Subjt:  RTWQPSLHSISEAGS

AT5G50090.2 unknown protein | e value: 7.2e-28 | % identity: 40.47
Query:  MGNCQAIDAATLVIQHPSGKVDKLYWPVTAREIMKMNPGHYVALLISTTMVTPNESSNNNETSKVEKKENEASKVEKKENEASKVEKKENEGSSNSVRLT
        MGNCQA+D A +VIQHP+GK +KL  PV+A  +MKMNPGH V+LLISTT ++   S                                   G    +RLT
Subjt:  MGNCQAIDAATLVIQHPSGKVDKLYWPVTAREIMKMNPGHYVALLISTTMVTPNESSNNNETSKVEKKENEASKVEKKENEASKVEKKENEGSSNSVRLT

Query:  RIKLLRPADTLVLGQVYRLITSKEVMSGLSAKKQAKVKQSQLEAAEKAGRRKERAARGSDSAAAAAAGRSVSEDPFQATKHEKNNRPRTSTSTTSAIARS
        RIKLLRP DTLVLG VYRLIT+KEVM GL AKK +K+K               + ++GSD             D  +  K   + +             S
Subjt:  RIKLLRPADTLVLGQVYRLITSKEVMSGLSAKKQAKVKQSQLEAAEKAGRRKERAARGSDSAAAAAAGRSVSEDPFQATKHEKNNRPRTSTSTTSAIARS

Query:  RTWQPSLHSISEAGS
        R+WQPSL SISE GS
Subjt:  RTWQPSLHSISEAGS

AT5G62900.1 unknown protein | e value: 1.0e-21 | % identity: 36.41
Query:  MGNCQAIDAATLVIQHPSGKVDKLYWPVTAREIMKMNPGHYVALLISTTMVTPNESSNNNETSKVEKKENEASKVEKKENEASKVEKKENEGSSNSVRLT
        MGNCQA +AAT VIQ P GK  + Y  V A E++K +PGH+VALL+S+ +                                             S+R+T
Subjt:  MGNCQAIDAATLVIQHPSGKVDKLYWPVTAREIMKMNPGHYVALLISTTMVTPNESSNNNETSKVEKKENEASKVEKKENEASKVEKKENEGSSNSVRLT

Query:  RIKLLRPADTLVLGQVYRLITSKEVMSGLSAKKQAKVKQ--SQLEAAEKAGRRKERAARGSDSAAAAAAGRSVSEDPFQATKHEKNNRPRTSTSTTSAIA
        RIKLLRP+D L+LG VYRLI+S+EVM G+ AKK  K+K+   +   AE+            +         S S+   Q   HEK    R   +T  A  
Subjt:  RIKLLRPADTLVLGQVYRLITSKEVMSGLSAKKQAKVKQ--SQLEAAEKAGRRKERAARGSDSAAAAAAGRSVSEDPFQATKHEKNNRPRTSTSTTSAIA

Query:  RSRTWQPSLHSISEAGS
        + R WQPSL SISE+ S
Subjt:  RSRTWQPSLHSISEAGS
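
In the alignment blocks above, the unlabeled middle line repeats the query residue where the two sequences are identical, shows `+` for a conservative substitution, and is blank otherwise. The following is a minimal sketch of how a percent identity could be computed from one such block, assuming identity is the number of identical columns divided by the block length. The function name `percent_identity` is hypothetical, and the figure covers only the block passed in, not the whole hit.

```python
def percent_identity(query: str, midline: str) -> float:
    """Percent of columns where the match line repeats the query residue."""
    if len(query) != len(midline):
        raise ValueError("query and match line must have the same length")
    # A column is identical when the match line shows the same letter as the query.
    identical = sum(1 for q, m in zip(query, midline) if m == q and q != "-")
    return 100.0 * identical / len(query)


# Final alignment block of the AT1G10530.1 hit, copied from the record.
query = "SRTWQPSLHSISEAGS"
midline = "S+TW+PSL SISEA S"
print(percent_identity(query, midline))
```

For this 16-column block, 12 columns are identical, giving 75.0. The lower whole-hit value reported for AT1G10530.1 reflects the less conserved blocks earlier in the alignment.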


Sequences
CDS sequence
ATGGGAAATTGCCAAGCCATTGATGCAGCAACACTTGTGATACAACACCCAAGTGGGAAAGTTGACAAACTATATTGGCCTGTGACTGCTAGAGAGATCATGAAGATGAA
TCCTGGTCATTATGTTGCTCTTCTTATCTCCACCACCATGGTTACACCGAATGAAAGTTCTAATAACAATGAAACCAGCAAGGTTGAGAAAAAGGAGAATGAAGCCAGCA
AGGTTGAGAAAAAGGAGAATGAAGCTAGCAAGGTTGAGAAAAAGGAGAATGAAGGCAGTAGTAATTCGGTTCGTTTAACTAGAATCAAGCTTCTTCGCCCAGCTGACACG
CTTGTTCTTGGCCAAGTTTACAGGCTCATCACTTCTAAAGAGGTTATGAGTGGTTTATCCGCAAAGAAACAAGCAAAGGTTAAACAAAGCCAGTTAGAAGCCGCTGAGAA
GGCAGGGAGGAGGAAAGAACGTGCAGCCAGAGGCTCAGATTCAGCAGCCGCCGCCGCAGCTGGAAGATCTGTATCTGAAGATCCTTTTCAGGCGACCAAACACGAGAAGA
ACAACAGACCAAGAACAAGTACATCGACAACCTCGGCCATAGCCAGGTCAAGAACATGGCAACCTTCATTACATAGCATCTCAGAAGCTGGTAGCTAA
mRNA sequence
ATGGGAAATTGCCAAGCCATTGATGCAGCAACACTTGTGATACAACACCCAAGTGGGAAAGTTGACAAACTATATTGGCCTGTGACTGCTAGAGAGATCATGAAGATGAA
TCCTGGTCATTATGTTGCTCTTCTTATCTCCACCACCATGGTTACACCGAATGAAAGTTCTAATAACAATGAAACCAGCAAGGTTGAGAAAAAGGAGAATGAAGCCAGCA
AGGTTGAGAAAAAGGAGAATGAAGCTAGCAAGGTTGAGAAAAAGGAGAATGAAGGCAGTAGTAATTCGGTTCGTTTAACTAGAATCAAGCTTCTTCGCCCAGCTGACACG
CTTGTTCTTGGCCAAGTTTACAGGCTCATCACTTCTAAAGAGGTTATGAGTGGTTTATCCGCAAAGAAACAAGCAAAGGTTAAACAAAGCCAGTTAGAAGCCGCTGAGAA
GGCAGGGAGGAGGAAAGAACGTGCAGCCAGAGGCTCAGATTCAGCAGCCGCCGCCGCAGCTGGAAGATCTGTATCTGAAGATCCTTTTCAGGCGACCAAACACGAGAAGA
ACAACAGACCAAGAACAAGTACATCGACAACCTCGGCCATAGCCAGGTCAAGAACATGGCAACCTTCATTACATAGCATCTCAGAAGCTGGTAGCTAATCATTTATGTTC
CCTATTTCCCACATGCTATTGAGAGACAGGGTGGCACAGATTGTCCAAGGCTCTTTAAAC
Protein sequence
MGNCQAIDAATLVIQHPSGKVDKLYWPVTAREIMKMNPGHYVALLISTTMVTPNESSNNNETSKVEKKENEASKVEKKENEASKVEKKENEGSSNSVRLTRIKLLRPADT
LVLGQVYRLITSKEVMSGLSAKKQAKVKQSQLEAAEKAGRRKERAARGSDSAAAAAAGRSVSEDPFQATKHEKNNRPRTSTSTTSAIARSRTWQPSLHSISEAGS
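
The CDS and protein sequences above can be cross-checked by translation. This is a minimal sketch, assuming the standard nuclear genetic code applies and that the CDS starts in frame at its first base. The names `CODON_TABLE` and `translate` are hypothetical and written for illustration only.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper().replace("\n", "")
    protein = []
    for i in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First six and last four codons of the CDS listed in the record.
print(translate("ATGGGAAATTGCCAAGCC"))
print(translate("GCTGGTAGCTAA"))
```

The two fragments translate to MGNCQA and AGS, matching the start and end of the listed protein sequence. Passing the full CDS with its line breaks joined should reproduce all 215 residues.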