CuGenDBv2

Spg004597 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg004597
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: DUF1985 domain-containing protein
Genome location: scaffold9:11589197..11592605
RNA-Seq Expression: Spg004597
Synteny: Spg004597
Gene Ontology terms:
  GO:0048856 - anatomical structure development (biological process)
  GO:0016020 - membrane (cellular component)
InterPro domains: IPR015410 - Domain of unknown function DUF1985
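
The genome location of this record is written as a scaffold name followed by a start..end coordinate pair. A minimal sketch of splitting that string and deriving the span of the gene is given here. The function name `parse_location` is hypothetical, and the span assumes that the coordinates are 1-based and inclusive, which the page itself does not state.

```python
import re


def parse_location(location):
    """Split 'scaffold:start..end' into (scaffold, start, end, span)."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    scaffold = match.group(1)
    start, end = int(match.group(2)), int(match.group(3))
    # Assumption: coordinates are 1-based and inclusive, so span = end - start + 1.
    return scaffold, start, end, end - start + 1


print(parse_location("scaffold9:11589197..11592605"))
# ('scaffold9', 11589197, 11592605, 3409)
```

Under that assumption the gene spans 3409 bp on scaffold9; if the coordinates were 0-based and half-open, the span would be one base shorter.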


Homology
GenBank top hits
KAA0060374.1 uncharacterized protein E6C27_scaffold22G001730 [Cucumis melo var. makuwa] (e value: 4.4e-54; % identity: 53.65)
Query:  KGRVVKFGIKEFAKITGLNCGELPPLDMVKLKGRFLHKYFDTEKPIKRSIVSELFTSMEGVKRKDRVRLAKLYFLSNFLLGKQISTRIEMDFITLLDDEQ
        +G + KFGI++F  ITGLNC  LP +D  K+KG+FL KYF  E PI RS VS LF   + +K +D++++AK+YFL NFLLGKQ +T  + + I LLDDEQ
Subjt:  KGRVVKFGIKEFAKITGLNCGELPPLDMVKLKGRFLHKYFDTEKPIKRSIVSELFTSMEGVKRKDRVRLAKLYFLSNFLLGKQISTRIEMDFITLLDDEQ

Query:  LFDSYPWGRIAYTTTIESIRKSIKNPEALVVGITGFPYALLVWAYECILLLSGPSMICAQRVSSTMPLMNNGVADSHPEWKDLAIKCFENEN
         FD+YPWGRI Y   I+SI+KSIKNP AL VGI+GF Y+LLVW Y+C+ LL  PS+ CAQ++      + N + ++HPEWK+LA + F +E+
Subjt:  LFDSYPWGRIAYTTTIESIRKSIKNPEALVVGITGFPYALLVWAYECILLLSGPSMICAQRVSSTMPLMNNGVADSHPEWKDLAIKCFENEN

KGN48800.2 hypothetical protein Csa_003918 [Cucumis sativus] (e value: 3.5e-51; % identity: 53.33)
Query:  KGRVVKFGIKEFAKITGLNCGELPPLDMVKL-KGRFLHKYFDTEKPIKRSIVSELFTSMEGVKRKDRVRLAKLYFLSNFLLGKQISTRIEMDFITLLDDE
        +GR+ KFG+K+FA ITGLNCGELP +DM K+ KG+F  +YF  EK I+R+ + ++FT M+  + KD V++AKLY L  F+LGKQI T I  ++  L+DD+
Subjt:  KGRVVKFGIKEFAKITGLNCGELPPLDMVKL-KGRFLHKYFDTEKPIKRSIVSELFTSMEGVKRKDRVRLAKLYFLSNFLLGKQISTRIEMDFITLLDDE

Query:  QLFDSYPWGRIAYTTTIESIRKSIKNPEALVVGITGFPYALLVWAYECILLLSGPSMICAQRVSSTMPLMNNGVADSHPEWKDLAIKCFENENFE
        + FDSYPWGRI+Y  T++ ++KSIK+ +A  +G+ GFPYALLVWAYE I LL+  S   A R+S   P MNN  A  HPEWKDL+ K F++E F+
Subjt:  QLFDSYPWGRIAYTTTIESIRKSIKNPEALVVGITGFPYALLVWAYECILLLSGPSMICAQRVSSTMPLMNNGVADSHPEWKDLAIKCFENENFE

XP_031743197.1 uncharacterized protein LOC101221625 isoform X9 [Cucumis sativus] (e value: 3.5e-51; % identity: 53.33)
Query:  KGRVVKFGIKEFAKITGLNCGELPPLDMVKL-KGRFLHKYFDTEKPIKRSIVSELFTSMEGVKRKDRVRLAKLYFLSNFLLGKQISTRIEMDFITLLDDE
        +GR+ KFG+K+FA ITGLNCGELP +DM K+ KG+F  +YF  EK I+R+ + ++FT M+  + KD V++AKLY L  F+LGKQI T I  ++  L+DD+
Subjt:  KGRVVKFGIKEFAKITGLNCGELPPLDMVKL-KGRFLHKYFDTEKPIKRSIVSELFTSMEGVKRKDRVRLAKLYFLSNFLLGKQISTRIEMDFITLLDDE

Query:  QLFDSYPWGRIAYTTTIESIRKSIKNPEALVVGITGFPYALLVWAYECILLLSGPSMICAQRVSSTMPLMNNGVADSHPEWKDLAIKCFENENFE
        + FDSYPWGRI+Y  T++ ++KSIK+ +A  +G+ GFPYALLVWAYE I LL+  S   A R+S   P MNN  A  HPEWKDL+ K F++E F+
Subjt:  QLFDSYPWGRIAYTTTIESIRKSIKNPEALVVGITGFPYALLVWAYECILLLSGPSMICAQRVSSTMPLMNNGVADSHPEWKDLAIKCFENENFE

XP_031743205.1 uncharacterized protein LOC101221625 isoform X17 [Cucumis sativus] (e value: 3.5e-51; % identity: 53.33)
Query:  KGRVVKFGIKEFAKITGLNCGELPPLDMVKL-KGRFLHKYFDTEKPIKRSIVSELFTSMEGVKRKDRVRLAKLYFLSNFLLGKQISTRIEMDFITLLDDE
        +GR+ KFG+K+FA ITGLNCGELP +DM K+ KG+F  +YF  EK I+R+ + ++FT M+  + KD V++AKLY L  F+LGKQI T I  ++  L+DD+
Subjt:  KGRVVKFGIKEFAKITGLNCGELPPLDMVKL-KGRFLHKYFDTEKPIKRSIVSELFTSMEGVKRKDRVRLAKLYFLSNFLLGKQISTRIEMDFITLLDDE

Query:  QLFDSYPWGRIAYTTTIESIRKSIKNPEALVVGITGFPYALLVWAYECILLLSGPSMICAQRVSSTMPLMNNGVADSHPEWKDLAIKCFENENFE
        + FDSYPWGRI+Y  T++ ++KSIK+ +A  +G+ GFPYALLVWAYE I LL+  S   A R+S   P MNN  A  HPEWKDL+ K F++E F+
Subjt:  QLFDSYPWGRIAYTTTIESIRKSIKNPEALVVGITGFPYALLVWAYECILLLSGPSMICAQRVSSTMPLMNNGVADSHPEWKDLAIKCFENENFE

XP_031743208.1 uncharacterized protein LOC101221625 isoform X20 [Cucumis sativus] (e value: 3.5e-51; % identity: 53.33)
Query:  KGRVVKFGIKEFAKITGLNCGELPPLDMVKL-KGRFLHKYFDTEKPIKRSIVSELFTSMEGVKRKDRVRLAKLYFLSNFLLGKQISTRIEMDFITLLDDE
        +GR+ KFG+K+FA ITGLNCGELP +DM K+ KG+F  +YF  EK I+R+ + ++FT M+  + KD V++AKLY L  F+LGKQI T I  ++  L+DD+
Subjt:  KGRVVKFGIKEFAKITGLNCGELPPLDMVKL-KGRFLHKYFDTEKPIKRSIVSELFTSMEGVKRKDRVRLAKLYFLSNFLLGKQISTRIEMDFITLLDDE

Query:  QLFDSYPWGRIAYTTTIESIRKSIKNPEALVVGITGFPYALLVWAYECILLLSGPSMICAQRVSSTMPLMNNGVADSHPEWKDLAIKCFENENFE
        + FDSYPWGRI+Y  T++ ++KSIK+ +A  +G+ GFPYALLVWAYE I LL+  S   A R+S   P MNN  A  HPEWKDL+ K F++E F+
Subjt:  QLFDSYPWGRIAYTTTIESIRKSIKNPEALVVGITGFPYALLVWAYECILLLSGPSMICAQRVSSTMPLMNNGVADSHPEWKDLAIKCFENENFE

TrEMBL top hits
A0A0A0KI50 TF-B3 domain-containing protein (e value: 1.7e-51; % identity: 53.33)
Query:  KGRVVKFGIKEFAKITGLNCGELPPLDMVKL-KGRFLHKYFDTEKPIKRSIVSELFTSMEGVKRKDRVRLAKLYFLSNFLLGKQISTRIEMDFITLLDDE
        +GR+ KFG+K+FA ITGLNCGELP +DM K+ KG+F  +YF  EK I+R+ + ++FT M+  + KD V++AKLY L  F+LGKQI T I  ++  L+DD+
Subjt:  KGRVVKFGIKEFAKITGLNCGELPPLDMVKL-KGRFLHKYFDTEKPIKRSIVSELFTSMEGVKRKDRVRLAKLYFLSNFLLGKQISTRIEMDFITLLDDE

Query:  QLFDSYPWGRIAYTTTIESIRKSIKNPEALVVGITGFPYALLVWAYECILLLSGPSMICAQRVSSTMPLMNNGVADSHPEWKDLAIKCFENENFE
        + FDSYPWGRI+Y  T++ ++KSIK+ +A  +G+ GFPYALLVWAYE I LL+  S   A R+S   P MNN  A  HPEWKDL+ K F++E F+
Subjt:  QLFDSYPWGRIAYTTTIESIRKSIKNPEALVVGITGFPYALLVWAYECILLLSGPSMICAQRVSSTMPLMNNGVADSHPEWKDLAIKCFENENFE

A0A1S3B0L9 uncharacterized protein LOC103484737 isoform X5 (e value: 1.3e-43; % identity: 39.53)
Query:  KGRVVKFGIKEFAKITGLNCGELPPLDMVKL-KGRFLHKYFDTEKPIKRSIVSELFTSMEGVKRKDRVRLAKLYFLSNFLLGKQISTRIEMDFITLLDDE
        +GR+ KFG+K+FA ITGLNCGELP +DM K+ KG+F  +YF  EK I+R+ + E+FT M+  + KD V++AKLY L  F+LGKQ+ T I  ++  L+DD+
Subjt:  KGRVVKFGIKEFAKITGLNCGELPPLDMVKL-KGRFLHKYFDTEKPIKRSIVSELFTSMEGVKRKDRVRLAKLYFLSNFLLGKQISTRIEMDFITLLDDE

Query:  QLFDSYPWGRIAYTTTIESIRKSIKNPEALVVGITGFPYALLVWAYECILLLSGPSMICAQRVSSTMPLMNNGVADSHPEWKDLAIKCFENENFEFFEMT
        + FDSYPWGRI+Y  TI+ ++K+IK+ +A  +G+ GFP+AL VWAYE I LL+  S   A R+S   P MNN  AD HPEWKDL+ K F++E F+   + 
Subjt:  QLFDSYPWGRIAYTTTIESIRKSIKNPEALVVGITGFPYALLVWAYECILLLSGPSMICAQRVSSTMPLMNNGVADSHPEWKDLAIKCFENENFEFFEMT

Query:  DEENESMLFGEMEKLKKMMNKENEKMGGQEKKDKEDHNDKDDDGQNNDEGDQNDENDNEQIPQEDAGDNSLGKEINMPSQDDVVSTSFLREVDKIE
          E E     EM  +           GG +  ++++    D +  ++     N ++ N + PQ  + D +     N     + +  + + ++D ++
Subjt:  DEENESMLFGEMEKLKKMMNKENEKMGGQEKKDKEDHNDKDDDGQNNDEGDQNDENDNEQIPQEDAGDNSLGKEINMPSQDDVVSTSFLREVDKIE

A0A1S3B181 uncharacterized protein LOC103484737 isoform X7 (e value: 1.3e-43; % identity: 39.53)
Query:  KGRVVKFGIKEFAKITGLNCGELPPLDMVKL-KGRFLHKYFDTEKPIKRSIVSELFTSMEGVKRKDRVRLAKLYFLSNFLLGKQISTRIEMDFITLLDDE
        +GR+ KFG+K+FA ITGLNCGELP +DM K+ KG+F  +YF  EK I+R+ + E+FT M+  + KD V++AKLY L  F+LGKQ+ T I  ++  L+DD+
Subjt:  KGRVVKFGIKEFAKITGLNCGELPPLDMVKL-KGRFLHKYFDTEKPIKRSIVSELFTSMEGVKRKDRVRLAKLYFLSNFLLGKQISTRIEMDFITLLDDE

Query:  QLFDSYPWGRIAYTTTIESIRKSIKNPEALVVGITGFPYALLVWAYECILLLSGPSMICAQRVSSTMPLMNNGVADSHPEWKDLAIKCFENENFEFFEMT
        + FDSYPWGRI+Y  TI+ ++K+IK+ +A  +G+ GFP+AL VWAYE I LL+  S   A R+S   P MNN  AD HPEWKDL+ K F++E F+   + 
Subjt:  QLFDSYPWGRIAYTTTIESIRKSIKNPEALVVGITGFPYALLVWAYECILLLSGPSMICAQRVSSTMPLMNNGVADSHPEWKDLAIKCFENENFEFFEMT

Query:  DEENESMLFGEMEKLKKMMNKENEKMGGQEKKDKEDHNDKDDDGQNNDEGDQNDENDNEQIPQEDAGDNSLGKEINMPSQDDVVSTSFLREVDKIE
          E E     EM  +           GG +  ++++    D +  ++     N ++ N + PQ  + D +     N     + +  + + ++D ++
Subjt:  DEENESMLFGEMEKLKKMMNKENEKMGGQEKKDKEDHNDKDDDGQNNDEGDQNDENDNEQIPQEDAGDNSLGKEINMPSQDDVVSTSFLREVDKIE

A0A5A7UZA2 DUF1985 domain-containing protein (e value: 2.1e-54; % identity: 53.65)
Query:  KGRVVKFGIKEFAKITGLNCGELPPLDMVKLKGRFLHKYFDTEKPIKRSIVSELFTSMEGVKRKDRVRLAKLYFLSNFLLGKQISTRIEMDFITLLDDEQ
        +G + KFGI++F  ITGLNC  LP +D  K+KG+FL KYF  E PI RS VS LF   + +K +D++++AK+YFL NFLLGKQ +T  + + I LLDDEQ
Subjt:  KGRVVKFGIKEFAKITGLNCGELPPLDMVKLKGRFLHKYFDTEKPIKRSIVSELFTSMEGVKRKDRVRLAKLYFLSNFLLGKQISTRIEMDFITLLDDEQ

Query:  LFDSYPWGRIAYTTTIESIRKSIKNPEALVVGITGFPYALLVWAYECILLLSGPSMICAQRVSSTMPLMNNGVADSHPEWKDLAIKCFENEN
         FD+YPWGRI Y   I+SI+KSIKNP AL VGI+GF Y+LLVW Y+C+ LL  PS+ CAQ++      + N + ++HPEWK+LA + F +E+
Subjt:  LFDSYPWGRIAYTTTIESIRKSIKNPEALVVGITGFPYALLVWAYECILLLSGPSMICAQRVSSTMPLMNNGVADSHPEWKDLAIKCFENEN

A0A6J1BX50 uncharacterized protein LOC111005524 (e value: 3.1e-45; % identity: 44.85)
Query:  SSKTANK----RKGRVVKFGIKEFAKITGLNCGELPPLDMVKL-KGRFLHKYFDTEKPIKRSIVSELFTSMEGVKRKDRVRLAKLYFLSNFLLGKQISTR
        SSK  N+     +GRV KFGIK+FA ITG+NCGELP +DM K+ +  F  +YF  E+ IKR+ + E+F  M+  + KD V++AKLY L  FLLGKQI+T 
Subjt:  SSKTANK----RKGRVVKFGIKEFAKITGLNCGELPPLDMVKL-KGRFLHKYFDTEKPIKRSIVSELFTSMEGVKRKDRVRLAKLYFLSNFLLGKQISTR

Query:  IEMDFITLLDDEQLFDSYPWGRIAYTTTIESIRKSIKNPEALVVGITGFPYALLVWAYECILLLSGPSMICAQRVSSTMPLMNNGVADSHPEWKDLAIKC
        I  ++  L+DD++ F+ YPWGR++Y  TI+ ++K+IK+ +A  +GI GFPYALLVWAYE I LLS  S + A ++SS MP MNN VA+ HPEW+DL+ K 
Subjt:  IEMDFITLLDDEQLFDSYPWGRIAYTTTIESIRKSIKNPEALVVGITGFPYALLVWAYECILLLSGPSMICAQRVSSTMPLMNNGVADSHPEWKDLAIKC

Query:  FENENFEF--FEMTDEE----NESMLFGEMEKLKKMMNKENEKMGGQEKKDKEDHNDKDDDGQNNDEGDQND
        F +++F+    E TD E      S+  G +E+ K   +K    +  ++  D     +KD D         ND
Subjt:  FENENFEF--FEMTDEE----NESMLFGEMEKLKKMMNKENEKMGGQEKKDKEDHNDKDDDGQNNDEGDQND

SwissProt top hits
No hits found

Arabidopsis top hits
AT1G31150.1 Domain of unknown function (DUF1985) (e value: 3.9e-08; % identity: 27.78)
Query:  GRVVKFGIKEFAKITGLNCGELPPLDMVK--LKGRFL---HKYFDTEKPIKRSIVSELFTSMEGVKRKDRVRLAKLYFLSNFLLGKQISTRIEMDFITLL
        G  ++F I+EF  +TGL CG+LP  D VK     ++L   ++ F  ++ +    V E+    + +    ++ LA +  +   ++    S  + +DF+ +L
Subjt:  GRVVKFGIKEFAKITGLNCGELPPLDMVK--LKGRFL---HKYFDTEKPIKRSIVSELFTSMEGVKRKDRVRLAKLYFLSNFLLGKQISTRIEMDFITLL

Query:  DDEQLFDSYPWGRIAYTTTIESI--RKSIKNP--------EALVVGITGFPYALLVWAYECI
        +D   F  YPWGR A+  TI      K   NP        +       GFP AL +  +E I
Subjt:  DDEQLFDSYPWGRIAYTTTIESI--RKSIKNP--------EALVVGITGFPYALLVWAYECI

AT4G08430.1 Ulp1 protease family protein (e value: 5.1e-08; % identity: 25.74)
Query:  RVVKFGIKEFAKITGLNCGELPPLDMVKLKGRFLHKYFDTEKPIKRSIVSELFTSMEGV-------KRKDRVRLAKLYFLSNFLLGKQISTRIEMDFITL
        R ++F + EF  ITGLNC      D     G   +K F  E  +  S V  LFT +E V         + R+ + +L      + G    +R+ +     
Subjt:  RVVKFGIKEFAKITGLNCGELPPLDMVKLKGRFLHKYFDTEKPIKRSIVSELFTSMEGV-------KRKDRVRLAKLYFLSNFLLGKQISTRIEMDFITL

Query:  LDDEQLFDSYPWGRIAYTTTIESIRKSIKNPEALVVGITGFPYALLVWAYECILLLSGPSMICAQRVSST--MPLMNNGVADSHPEWKDLAIKCFENENF
        + D   F+ YPWGR+A+ + + S++    + ++ V  I G   ALLVW YE +    G    C  R ++   +PL+         +W+         + F
Subjt:  LDDEQLFDSYPWGRIAYTTTIESIRKSIKNPEALVVGITGFPYALLVWAYECILLLSGPSMICAQRVSST--MPLMNNGVADSHPEWKDLAIKCFENENF

Query:  EFFEMTDEENESMLFGEMEKLKKMMNKENEKMGGQEK
         F      E E    G++ + K+  N+E+    G+ K
Subjt:  EFFEMTDEENESMLFGEMEKLKKMMNKENEKMGGQEK

AT5G28810.1 Domain of unknown function (DUF1985) (e value: 1.9e-07; % identity: 28.47)
Query:  VKFGIKEFAKITGLNCGELPPLDMVKLKGRFLHKYFDTEKPIKRSIVSELFTSMEGVKRKDRVRLAKLYFLSNFLLGKQISTRIEMDFITLLDDEQLFDS
        ++F + EF  ITGLNC      DM       L + F+  K            S+E      R+ + +L  LS  + G    +R+ +     + D   F+ 
Subjt:  VKFGIKEFAKITGLNCGELPPLDMVKLKGRFLHKYFDTEKPIKRSIVSELFTSMEGVKRKDRVRLAKLYFLSNFLLGKQISTRIEMDFITLLDDEQLFDS

Query:  YPWGRIAYTTTIESIRKSIKNPEALVVGITGFPYALLVWAYECI
        YPWGR+A+ + + S++    + ++ V  I G   ALLVW YE +
Subjt:  YPWGRIAYTTTIESIRKSIKNPEALVVGITGFPYALLVWAYECI

AT5G45570.1 Ulp1 protease family protein (e value: 1.5e-04; % identity: 25.7)
Query:  RVVKFGIKEFAKITGLNCGELPPLDMVKLKGRFLHKYFDTEKPIKRSIVSELFTSMEGV-------KRKDRVRLAKLYFLSNFLLGKQISTRIEMDFITL
        R ++F + EF  ITGLNC      D     G   +K F  E  +  S V  LFT +E V         + R+ + +L  LS  + G    +R+ +     
Subjt:  RVVKFGIKEFAKITGLNCGELPPLDMVKLKGRFLHKYFDTEKPIKRSIVSELFTSMEGV-------KRKDRVRLAKLYFLSNFLLGKQISTRIEMDFITL

Query:  LDDEQLFDSYPWGRIAYTTTIESIRKSIKNPEALVVGITGFPYALLVWAYECILLLSGPSMICAQRVSST--MPLMNNGVADSHPEWKDLAIKCFENENF
        + D   F+ YPWGR+A+ +   S++    + ++ V  I G    LLVW YE +    G    C  R ++   +PL+         +W+         + F
Subjt:  LDDEQLFDSYPWGRIAYTTTIESIRKSIKNPEALVVGITGFPYALLVWAYECILLLSGPSMICAQRVSST--MPLMNNGVADSHPEWKDLAIKCFENENF

Query:  EFFEMTDEENESMLFGEMEKLKKMMNKENEKMGGQEKKDKEDHNDKDDD
         F      E E    G++ +++ M+    E M  Q     E+H+   D+
Subjt:  EFFEMTDEENESMLFGEMEKLKKMMNKENEKMGGQEKKDKEDHNDKDDD


Sequences
CDS sequence
ATGGACAACAGAAACCCACATGAAGATGTTGAACAAGAACAGACCGAAGAGGTTCCCGGGACCCAGATAGTCCCGTATGGTTATTGTTTCGATGATGTGTTTCCTGAGGA
AACAGAAGAAATAGAGAATACATCAGATGGCGATGAGGTTTTACAGCAAGAAACAGTCGATGAACAAACTGAAGTTGTTTATGAGGAAGATACTCGCATGGAGACTGATA
AAGAAACGAGCCCACTAGGAGGTGGGGAGAAACGAAAACAGCCACAAGACCATCAAAAGGAGAAAACCCCAAAAAGGAAGAGAACCGAGCAAGATGCTACTGCTTGTAGG
AGGGAAACCCGACAATCAACGTCTTCCAGAACCCTTAAGCAAACAGGCGCTCCTCCAAAGCCTAAAGGAAACAAATTTGGAAAGAAGGCAAACACTTCTTCCAAAACAGC
CAACAAGAGGAAGGGGAGGGTAGTTAAATTTGGCATAAAGGAATTTGCCAAAATCACAGGTCTGAATTGTGGAGAACTCCCACCATTGGATATGGTAAAGTTGAAAGGGA
GGTTCCTCCACAAGTACTTCGACACAGAGAAACCCATCAAAAGATCAATAGTGAGTGAACTATTCACTTCGATGGAGGGGGTCAAAAGGAAGGATAGGGTTAGGTTGGCA
AAATTGTATTTCCTATCGAATTTTCTTCTAGGAAAACAAATTAGTACGAGAATAGAAATGGATTTCATAACTTTGCTTGATGATGAGCAACTATTTGACAGCTACCCTTG
GGGGAGAATCGCATATACCACTACCATAGAATCTATAAGGAAATCAATTAAAAATCCTGAAGCTCTAGTGGTAGGAATCACAGGATTCCCATATGCCTTGCTTGTTTGGG
CATATGAGTGCATTCTCCTTCTATCCGGCCCTTCCATGATCTGTGCACAAAGAGTATCCTCCACAATGCCGTTGATGAACAATGGGGTAGCTGATAGCCACCCTGAGTGG
AAAGATCTTGCTATCAAATGCTTTGAAAATGAAAATTTTGAGTTTTTTGAGATGACGGATGAAGAGAATGAGTCTATGCTTTTTGGAGAAATGGAAAAACTGAAAAAAAT
GATGAACAAGGAGAATGAAAAAATGGGGGGACAGGAGAAAAAAGACAAAGAAGATCACAATGATAAAGATGACGACGGACAAAACAATGATGAAGGTGACCAAAATGATG
AGAATGATAACGAGCAAATCCCTCAAGAAGATGCTGGAGACAACTCATTAGGAAAGGAAATAAATATGCCAAGTCAGGATGATGTCGTAAGCACATCTTTCTTGAGGGAA
GTTGACAAAATTGAGAAGGAGGCTAAGTTTATTAAGGCCAAATTTGTTCATATCAAGAAAGAGATTGTGACTTCTACCGATTCAAGATTTATTCATGCTATGAAAGGAAC
AGGACCGTTTGCTCAAACAAACAACATGTTCACTAAACGAGAACGAAGGGTCATCGTTCCTTCTGTGATCTTGAGATTAGAGTTCTGGTAG
mRNA sequence
ATGGACAACAGAAACCCACATGAAGATGTTGAACAAGAACAGACCGAAGAGGTTCCCGGGACCCAGATAGTCCCGTATGGTTATTGTTTCGATGATGTGTTTCCTGAGGA
AACAGAAGAAATAGAGAATACATCAGATGGCGATGAGGTTTTACAGCAAGAAACAGTCGATGAACAAACTGAAGTTGTTTATGAGGAAGATACTCGCATGGAGACTGATA
AAGAAACGAGCCCACTAGGAGGTGGGGAGAAACGAAAACAGCCACAAGACCATCAAAAGGAGAAAACCCCAAAAAGGAAGAGAACCGAGCAAGATGCTACTGCTTGTAGG
AGGGAAACCCGACAATCAACGTCTTCCAGAACCCTTAAGCAAACAGGCGCTCCTCCAAAGCCTAAAGGAAACAAATTTGGAAAGAAGGCAAACACTTCTTCCAAAACAGC
CAACAAGAGGAAGGGGAGGGTAGTTAAATTTGGCATAAAGGAATTTGCCAAAATCACAGGTCTGAATTGTGGAGAACTCCCACCATTGGATATGGTAAAGTTGAAAGGGA
GGTTCCTCCACAAGTACTTCGACACAGAGAAACCCATCAAAAGATCAATAGTGAGTGAACTATTCACTTCGATGGAGGGGGTCAAAAGGAAGGATAGGGTTAGGTTGGCA
AAATTGTATTTCCTATCGAATTTTCTTCTAGGAAAACAAATTAGTACGAGAATAGAAATGGATTTCATAACTTTGCTTGATGATGAGCAACTATTTGACAGCTACCCTTG
GGGGAGAATCGCATATACCACTACCATAGAATCTATAAGGAAATCAATTAAAAATCCTGAAGCTCTAGTGGTAGGAATCACAGGATTCCCATATGCCTTGCTTGTTTGGG
CATATGAGTGCATTCTCCTTCTATCCGGCCCTTCCATGATCTGTGCACAAAGAGTATCCTCCACAATGCCGTTGATGAACAATGGGGTAGCTGATAGCCACCCTGAGTGG
AAAGATCTTGCTATCAAATGCTTTGAAAATGAAAATTTTGAGTTTTTTGAGATGACGGATGAAGAGAATGAGTCTATGCTTTTTGGAGAAATGGAAAAACTGAAAAAAAT
GATGAACAAGGAGAATGAAAAAATGGGGGGACAGGAGAAAAAAGACAAAGAAGATCACAATGATAAAGATGACGACGGACAAAACAATGATGAAGGTGACCAAAATGATG
AGAATGATAACGAGCAAATCCCTCAAGAAGATGCTGGAGACAACTCATTAGGAAAGGAAATAAATATGCCAAGTCAGGATGATGTCGTAAGCACATCTTTCTTGAGGGAA
GTTGACAAAATTGAGAAGGAGGCTAAGTTTATTAAGGCCAAATTTGTTCATATCAAGAAAGAGATTGTGACTTCTACCGATTCAAGATTTATTCATGCTATGAAAGGAAC
AGGACCGTTTGCTCAAACAAACAACATGTTCACTAAACGAGAACGAAGGGTCATCGTTCCTTCTGTGATCTTGAGATTAGAGTTCTGGTAG
Protein sequence
MDNRNPHEDVEQEQTEEVPGTQIVPYGYCFDDVFPEETEEIENTSDGDEVLQQETVDEQTEVVYEEDTRMETDKETSPLGGGEKRKQPQDHQKEKTPKRKRTEQDATACR
RETRQSTSSRTLKQTGAPPKPKGNKFGKKANTSSKTANKRKGRVVKFGIKEFAKITGLNCGELPPLDMVKLKGRFLHKYFDTEKPIKRSIVSELFTSMEGVKRKDRVRLA
KLYFLSNFLLGKQISTRIEMDFITLLDDEQLFDSYPWGRIAYTTTIESIRKSIKNPEALVVGITGFPYALLVWAYECILLLSGPSMICAQRVSSTMPLMNNGVADSHPEW
KDLAIKCFENENFEFFEMTDEENESMLFGEMEKLKKMMNKENEKMGGQEKKDKEDHNDKDDDGQNNDEGDQNDENDNEQIPQEDAGDNSLGKEINMPSQDDVVSTSFLRE
VDKIEKEAKFIKAKFVHIKKEIVTSTDSRFIHAMKGTGPFAQTNNMFTKRERRVIVPSVILRLEFW
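
The CDS and protein sequences listed for Spg004597 can be cross-checked by translating the CDS codon by codon. The sketch below is illustrative only: it assumes the standard nuclear genetic code, and the names `CODON_TABLE` and `translate` are hypothetical helpers rather than anything provided by the database.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS (whitespace and line breaks allowed) up to the first stop codon."""
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(seq), 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 14 codons of the CDS listed above.
print(translate("ATGGACAACAGAAACCCACATGAAGATGTTGAACAAGAACAG"))
# MDNRNPHEDVEQEQ
```

Passing the full CDS (with its line breaks) to `translate` and comparing the result with the protein sequence, after removing line breaks from both, is a quick consistency check on the record.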