; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0003021 (gene) of Chayote v1 genome

Gene IDSed0003021
OrganismSechium edule (Chayote v1)
DescriptionUnknown protein
Genome locationLG05:23912200..23914991
RNA-Seq ExpressionSed0003021
SyntenySed0003021
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6606925.1 hypothetical protein SDJN03_00267, partial [Cucurbita argyrosperma subsp. sororia]7.2e-8570.66Show/hide
Query:  MSGLDCMATKDRECEIDLESGSNTSEEDLNSETEPISKPHSRKTFGRLRSGFLCTDGSTNRDSSFASSSSSSKLVKLGVEENVEFLMDKSLDEEKRREHG
        MSGLDCMAT+DRE EIDLE G  TSE+DL+SE++  SKPH RK   RLRSGFLCTDGS NR  SFASSS+S+KL KLGV+ENVE L+DKSLD EKRRE G
Subjt:  MSGLDCMATKDRECEIDLESGSNTSEEDLNSETEPISKPHSRKTFGRLRSGFLCTDGSTNRDSSFASSSSSSKLVKLGVEENVEFLMDKSLDEEKRREHG

Query:  ALAEKKKNVKEKIKTNGKVHKPPRPPRGPSLDAADRMFVKEIAELAEKKRATVERVKALKKMKAEKTSSFNSSFPALFVTLLFFAIIIFQGMSAKGSIMA
        ALAEKKKN+KE IK NGK+HKPPRPPRGPSLDAADR+FVKE+ +LA KKRA VER+KALKKMKAEKTSSFNS+ PALF+TLLFF +IIFQGMSA GS + 
Subjt:  ALAEKKKNVKEKIKTNGKVHKPPRPPRGPSLDAADRMFVKEIAELAEKKRATVERVKALKKMKAEKTSSFNSSFPALFVTLLFFAIIIFQGMSAKGSIMA

Query:  LESPVPSIGGRSSLITDQHSSKSTSKVNEHHSRILNFEGKRPSDPATIGEASAVEDSKN
         ESPVPS    + LI+ QHSSKS   VN   S IL+F           GEAS+VED KN
Subjt:  LESPVPSIGGRSSLITDQHSSKSTSKVNEHHSRILNFEGKRPSDPATIGEASAVEDSKN

KAG7036629.1 hypothetical protein SDJN02_00248 [Cucurbita argyrosperma subsp. argyrosperma]3.6e-8470.27Show/hide
Query:  MSGLDCMATKDRECEIDLESGSNTSEEDLNSETEPISKPHSRKTFGRLRSGFLCTDGSTNRDSSFASSSSSSKLVKLGVEENVEFLMDKSLDEEKRREHG
        MSGLDCMAT+DRE EIDLE G  TSE+DL+SE++  SKPH RK   RLRSGFLCTDGS N+  SFASSS+S+KL KLGV+ENVE L+DKSLD EKRRE G
Subjt:  MSGLDCMATKDRECEIDLESGSNTSEEDLNSETEPISKPHSRKTFGRLRSGFLCTDGSTNRDSSFASSSSSSKLVKLGVEENVEFLMDKSLDEEKRREHG

Query:  ALAEKKKNVKEKIKTNGKVHKPPRPPRGPSLDAADRMFVKEIAELAEKKRATVERVKALKKMKAEKTSSFNSSFPALFVTLLFFAIIIFQGMSAKGSIMA
        ALAEKKKN+KE IK NGK+HKPPRPPRGPSLDAADR+FVKE+ +LA KKRA VER+KALKKMKAEKTSSFNS+ PALF+TLLFF +IIFQGMSA GS + 
Subjt:  ALAEKKKNVKEKIKTNGKVHKPPRPPRGPSLDAADRMFVKEIAELAEKKRATVERVKALKKMKAEKTSSFNSSFPALFVTLLFFAIIIFQGMSAKGSIMA

Query:  LESPVPSIGGRSSLITDQHSSKSTSKVNEHHSRILNFEGKRPSDPATIGEASAVEDSKN
         ESPVPS    + LI  QHSSKS   VN   S IL+F           GEAS+VED KN
Subjt:  LESPVPSIGGRSSLITDQHSSKSTSKVNEHHSRILNFEGKRPSDPATIGEASAVEDSKN

XP_004147994.1 uncharacterized protein LOC101214824 [Cucumis sativus]2.1e-8471.97Show/hide
Query:  MSGLDCMATKDRECEIDLESGSNTSEEDLNSETEPISKPHSRKTFGRLRSGFLCTDGSTNRDSSFASSSSSSKLVKLGVEENVEFLMDKSLDEEKRREHG
        M  +DCMATKDRE EIDLE G NTSE+DL+SET+  SKPH+RKTFGRLRSGFL +D S +R   FASSS+S+KLVKLGV+ENVE LM+ S D EKRRE G
Subjt:  MSGLDCMATKDRECEIDLESGSNTSEEDLNSETEPISKPHSRKTFGRLRSGFLCTDGSTNRDSSFASSSSSSKLVKLGVEENVEFLMDKSLDEEKRREHG

Query:  ALAEKKKNVKEKIKTNGKVHKPPRPPRGPSLDAADRMFVKEIAELAEKKRATVERVKALKKMKAEKTSSFNSSFPALFVTLLFFAIIIFQGMSAKGSIMA
        A AE K NVK KIK NGKVHKPPRPPRGPSLDAADR+FV+EIAELA KKRATVER+KALKKMKAEKTSSFNSS PALF+TLLFF +IIFQGMSAKGS M 
Subjt:  ALAEKKKNVKEKIKTNGKVHKPPRPPRGPSLDAADRMFVKEIAELAEKKRATVERVKALKKMKAEKTSSFNSSFPALFVTLLFFAIIIFQGMSAKGSIMA

Query:  L--ESPVPSIGGRSSLITDQHS--SKSTSKVNEHHSRILNFEGKRPSDPAT-IGEASAVEDSKN
           +SP PS+GG + LI  QHS   +S+  VNE  S ILNF GK+ SDPAT + EAS VE+ KN
Subjt:  L--ESPVPSIGGRSSLITDQHS--SKSTSKVNEHHSRILNFEGKRPSDPAT-IGEASAVEDSKN

XP_022157030.1 uncharacterized protein LOC111023856 [Momordica charantia]5.0e-8671.26Show/hide
Query:  MSGLDCMATKDRECEIDLESGSNTSEEDLNSETEPISKPHSRKTFGRLRSGFLCTDGSTNRDSSFASSSSSSKLVKLGVEENVEFLMDKSLDEEKRREHG
        M GL+ MATK+RE  IDLE G ++SEED N+E E ISK HSRKTFGRLRSGFLC DGS +R SSFASSS++++L KL V+ENVE LMDKS + EKRRE G
Subjt:  MSGLDCMATKDRECEIDLESGSNTSEEDLNSETEPISKPHSRKTFGRLRSGFLCTDGSTNRDSSFASSSSSSKLVKLGVEENVEFLMDKSLDEEKRREHG

Query:  ALAEKKKNVKEKIKTNGKVHKPPRPPRGPSLDAADRMFVKEIAELAEKKRATVERVKALKKMKAEKTSSFNSSFPALFVTLLFFAIIIFQGMSAKGSIMA
        AL EKKKN+KEKIK NGK+HKPPRPP GPSLDAADR+ V+EI ELA KKRATVER+KALKKMKAEKTSSFNSS PALF+TLLFF IIIFQGMSAKGS M 
Subjt:  ALAEKKKNVKEKIKTNGKVHKPPRPPRGPSLDAADRMFVKEIAELAEKKRATVERVKALKKMKAEKTSSFNSSFPALFVTLLFFAIIIFQGMSAKGSIMA

Query:  LESPVPSIGGRSSLITD-QHSSKS-TSKVNEHHSRILNFEGKRPSDPATIGEASAVEDSKN
         ESPVP++GG +SLI+  +HSS+S   KVN   SR LNF  K  SD + +GEA  VED KN
Subjt:  LESPVPSIGGRSSLITD-QHSSKS-TSKVNEHHSRILNFEGKRPSDPATIGEASAVEDSKN

XP_038903260.1 uncharacterized protein LOC120089897 [Benincasa hispida]2.3e-8372.03Show/hide
Query:  MSGLDCMATKDRECEIDLESGSNTSEEDLNSETEPISKPHSRKTFGRLRSGFLCTDGSTNRDSSFASSSSSSKLVKLGVEENVEFLMDKSLDEEKRREHG
        M GLDCMATKDRE EIDLE G NTSE+DL+SET+  SK H+RKTF RLRSGFL +DGS +R  SFASSS+S+KLVKLGV+ENVE  MD S D EKRRE G
Subjt:  MSGLDCMATKDRECEIDLESGSNTSEEDLNSETEPISKPHSRKTFGRLRSGFLCTDGSTNRDSSFASSSSSSKLVKLGVEENVEFLMDKSLDEEKRREHG

Query:  ALAEKKKNVKEKIKTNGKVHKPPRPPRGPSLDAADRMFVKEIAELAEKKRATVERVKALKKMKAEKTSSFNSSFPALFVTLLFFAIIIFQGMSAKGSIMA
        A AE KKNVK KI   GKVHKPPRPPRGPSLDAADR+FVKE+ ELA KKRATVER+KALKKMKAEK SSFNSS PALF+TLLFF +IIFQGMSAKGS M 
Subjt:  ALAEKKKNVKEKIKTNGKVHKPPRPPRGPSLDAADRMFVKEIAELAEKKRATVERVKALKKMKAEKTSSFNSSFPALFVTLLFFAIIIFQGMSAKGSIMA

Query:  L--ESPVPSIGGRSSLITDQHSSKSTSKVNEHHSRILNFEGKRPSDPATIGEASAVEDSKN
        L  +SP PS+GG + LI  QHS +S+  VNE  S ILNF GK+ SDP  I EA +VE+ KN
Subjt:  L--ESPVPSIGGRSSLITDQHSSKSTSKVNEHHSRILNFEGKRPSDPATIGEASAVEDSKN

TrEMBL top hitse value%identityAlignment
A0A0A0LB07 Uncharacterized protein1.0e-8471.97Show/hide
Query:  MSGLDCMATKDRECEIDLESGSNTSEEDLNSETEPISKPHSRKTFGRLRSGFLCTDGSTNRDSSFASSSSSSKLVKLGVEENVEFLMDKSLDEEKRREHG
        M  +DCMATKDRE EIDLE G NTSE+DL+SET+  SKPH+RKTFGRLRSGFL +D S +R   FASSS+S+KLVKLGV+ENVE LM+ S D EKRRE G
Subjt:  MSGLDCMATKDRECEIDLESGSNTSEEDLNSETEPISKPHSRKTFGRLRSGFLCTDGSTNRDSSFASSSSSSKLVKLGVEENVEFLMDKSLDEEKRREHG

Query:  ALAEKKKNVKEKIKTNGKVHKPPRPPRGPSLDAADRMFVKEIAELAEKKRATVERVKALKKMKAEKTSSFNSSFPALFVTLLFFAIIIFQGMSAKGSIMA
        A AE K NVK KIK NGKVHKPPRPPRGPSLDAADR+FV+EIAELA KKRATVER+KALKKMKAEKTSSFNSS PALF+TLLFF +IIFQGMSAKGS M 
Subjt:  ALAEKKKNVKEKIKTNGKVHKPPRPPRGPSLDAADRMFVKEIAELAEKKRATVERVKALKKMKAEKTSSFNSSFPALFVTLLFFAIIIFQGMSAKGSIMA

Query:  L--ESPVPSIGGRSSLITDQHS--SKSTSKVNEHHSRILNFEGKRPSDPAT-IGEASAVEDSKN
           +SP PS+GG + LI  QHS   +S+  VNE  S ILNF GK+ SDPAT + EAS VE+ KN
Subjt:  L--ESPVPSIGGRSSLITDQHS--SKSTSKVNEHHSRILNFEGKRPSDPAT-IGEASAVEDSKN

A0A1S3BHI9 uncharacterized protein LOC1034899099.5e-8370.45Show/hide
Query:  MSGLDCMATKDRECEIDLESGSNTSEEDLNSETEPISKPHSRKTFGRLRSGFLCTDGSTNRDSSFASSSSSSKLVKLGVEENVEFLMDKSLDEEKRREHG
        M  LDCMATKDRE EIDLE G NTSE+DL+SET+  SK H+RKTFGRLRSGFL +D S +R  +FASSS+S+KLVKLGV++NVE LM+ S D EKRRE G
Subjt:  MSGLDCMATKDRECEIDLESGSNTSEEDLNSETEPISKPHSRKTFGRLRSGFLCTDGSTNRDSSFASSSSSSKLVKLGVEENVEFLMDKSLDEEKRREHG

Query:  ALAEKKKNVKEKIKTNGKVHKPPRPPRGPSLDAADRMFVKEIAELAEKKRATVERVKALKKMKAEKTSSFNSSFPALFVTLLFFAIIIFQGMSAKGS--I
        A AE K NVK KIK NGKVHKPPRPPRGPSLDAADR+FV+EIAELA KKRATVER+KALKKMKAEK SSFNSS PALF+TLLFF +IIFQGMSAKGS  +
Subjt:  ALAEKKKNVKEKIKTNGKVHKPPRPPRGPSLDAADRMFVKEIAELAEKKRATVERVKALKKMKAEKTSSFNSSFPALFVTLLFFAIIIFQGMSAKGS--I

Query:  MALESPVPSIGGRSSLITDQHS--SKSTSKVNEHHSRILNFEGKRPSDP-ATIGEASAVEDSKN
        M  +SP PS+GG + LI  QHS   +S+  VNE  S ILNF GK+ SDP A + E S+VE+ KN
Subjt:  MALESPVPSIGGRSSLITDQHS--SKSTSKVNEHHSRILNFEGKRPSDP-ATIGEASAVEDSKN

A0A5D3DAG0 Putative transmembrane protein9.5e-8370.45Show/hide
Query:  MSGLDCMATKDRECEIDLESGSNTSEEDLNSETEPISKPHSRKTFGRLRSGFLCTDGSTNRDSSFASSSSSSKLVKLGVEENVEFLMDKSLDEEKRREHG
        M  LDCMATKDRE EIDLE G NTSE+DL+SET+  SK H+RKTFGRLRSGFL +D S +R  +FASSS+S+KLVKLGV++NVE LM+ S D EKRRE G
Subjt:  MSGLDCMATKDRECEIDLESGSNTSEEDLNSETEPISKPHSRKTFGRLRSGFLCTDGSTNRDSSFASSSSSSKLVKLGVEENVEFLMDKSLDEEKRREHG

Query:  ALAEKKKNVKEKIKTNGKVHKPPRPPRGPSLDAADRMFVKEIAELAEKKRATVERVKALKKMKAEKTSSFNSSFPALFVTLLFFAIIIFQGMSAKGS--I
        A AE K NVK KIK NGKVHKPPRPPRGPSLDAADR+FV+EIAELA KKRATVER+KALKKMKAEK SSFNSS PALF+TLLFF +IIFQGMSAKGS  +
Subjt:  ALAEKKKNVKEKIKTNGKVHKPPRPPRGPSLDAADRMFVKEIAELAEKKRATVERVKALKKMKAEKTSSFNSSFPALFVTLLFFAIIIFQGMSAKGS--I

Query:  MALESPVPSIGGRSSLITDQHS--SKSTSKVNEHHSRILNFEGKRPSDP-ATIGEASAVEDSKN
        M  +SP PS+GG + LI  QHS   +S+  VNE  S ILNF GK+ SDP A + E S+VE+ KN
Subjt:  MALESPVPSIGGRSSLITDQHS--SKSTSKVNEHHSRILNFEGKRPSDP-ATIGEASAVEDSKN

A0A6J1DS03 uncharacterized protein LOC1110238562.4e-8671.26Show/hide
Query:  MSGLDCMATKDRECEIDLESGSNTSEEDLNSETEPISKPHSRKTFGRLRSGFLCTDGSTNRDSSFASSSSSSKLVKLGVEENVEFLMDKSLDEEKRREHG
        M GL+ MATK+RE  IDLE G ++SEED N+E E ISK HSRKTFGRLRSGFLC DGS +R SSFASSS++++L KL V+ENVE LMDKS + EKRRE G
Subjt:  MSGLDCMATKDRECEIDLESGSNTSEEDLNSETEPISKPHSRKTFGRLRSGFLCTDGSTNRDSSFASSSSSSKLVKLGVEENVEFLMDKSLDEEKRREHG

Query:  ALAEKKKNVKEKIKTNGKVHKPPRPPRGPSLDAADRMFVKEIAELAEKKRATVERVKALKKMKAEKTSSFNSSFPALFVTLLFFAIIIFQGMSAKGSIMA
        AL EKKKN+KEKIK NGK+HKPPRPP GPSLDAADR+ V+EI ELA KKRATVER+KALKKMKAEKTSSFNSS PALF+TLLFF IIIFQGMSAKGS M 
Subjt:  ALAEKKKNVKEKIKTNGKVHKPPRPPRGPSLDAADRMFVKEIAELAEKKRATVERVKALKKMKAEKTSSFNSSFPALFVTLLFFAIIIFQGMSAKGSIMA

Query:  LESPVPSIGGRSSLITD-QHSSKS-TSKVNEHHSRILNFEGKRPSDPATIGEASAVEDSKN
         ESPVP++GG +SLI+  +HSS+S   KVN   SR LNF  K  SD + +GEA  VED KN
Subjt:  LESPVPSIGGRSSLITD-QHSSKS-TSKVNEHHSRILNFEGKRPSDPATIGEASAVEDSKN

A0A6J1G9J4 uncharacterized protein LOC1114521943.3e-8369.88Show/hide
Query:  MSGLDCMATKDRECEIDLESGSNTSEEDLNSETEPISKPHSRKTFGRLRSGFLCTDGSTNRDSSFASSSSSSKLVKLGVEENVEFLMDKSLDEEKRREHG
        MSGLDCMAT+DRE EIDLE G  TSE+DL+SE++  SKPH RK   RLRSGFLCTDGS NR  SFASSS+S+KL KLGV+ENVE L+DKSLD EKRRE G
Subjt:  MSGLDCMATKDRECEIDLESGSNTSEEDLNSETEPISKPHSRKTFGRLRSGFLCTDGSTNRDSSFASSSSSSKLVKLGVEENVEFLMDKSLDEEKRREHG

Query:  ALAEKKKNVKEKIKTNGKVHKPPRPPRGPSLDAADRMFVKEIAELAEKKRATVERVKALKKMKAEKTSSFNSSFPALFVTLLFFAIIIFQGMSAKGSIMA
        ALAEKKKN+KE IK NGK+HKPPRPPR PSLDAADR+FVKE+ +LA KKRA VER+KALKK KAEKTSSFNS+ PALF+TLLFF +IIFQGMSA GS + 
Subjt:  ALAEKKKNVKEKIKTNGKVHKPPRPPRGPSLDAADRMFVKEIAELAEKKRATVERVKALKKMKAEKTSSFNSSFPALFVTLLFFAIIIFQGMSAKGSIMA

Query:  LESPVPSIGGRSSLITDQHSSKSTSKVNEHHSRILNFEGKRPSDPATIGEASAVEDSKN
         ESPVPS    + LI  QHSSKS   VN   S IL+F           GEAS+VED KN
Subjt:  LESPVPSIGGRSSLITDQHSSKSTSKVNEHHSRILNFEGKRPSDPATIGEASAVEDSKN

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G02380.1 unknown protein3.5e-1332.27Show/hide
Query:  LDCMATKDRECEIDLESGSNTSEEDLNSETEPISKPHS-RKTFGRLRSGFLCTDGSTNRDSSFASSSSSSKLVKLGVEENVEFLMDKSLDEEKRREHGAL
        +D M + +++ E+D+E+G +   ++  S+T   +   S R  FG                S   +   S  L++   +EN      +SLD         L
Subjt:  LDCMATKDRECEIDLESGSNTSEEDLNSETEPISKPHS-RKTFGRLRSGFLCTDGSTNRDSSFASSSSSSKLVKLGVEENVEFLMDKSLDEEKRREHGAL

Query:  AEKKKNVKEKIKTNGKVHKPPRPPRGPSLDAADRMFVKEIAELAEKKRATVERV-KALKKMKAEKTSSFNSSFP--ALFVTLLFFAIIIFQGMSAKGSIM
        +EKK     K K + K  KPPRPP+GPSL   DR  +++I ELA +KRA +ER+ K+LK++KA KTS  +      ++ +T +FFA ++FQG S   S M
Subjt:  AEKKKNVKEKIKTNGKVHKPPRPPRGPSLDAADRMFVKEIAELAEKKRATVERV-KALKKMKAEKTSSFNSSFP--ALFVTLLFFAIIIFQGMSAKGSIM

Query:  -ALESPVPSIGGRSSLITDQ
         + +SP P++   + +I+ Q
Subjt:  -ALESPVPSIGGRSSLITDQ

AT3G17120.1 unknown protein4.9e-1551.65Show/hide
Query:  KEKIKTNGKVHKPPRPPRGPSLDAADRMFVKEIAELAEKKRATVERVKALKKMKAEKTSSFNSSFPALFVTL---LFFAIIIFQGMSAKGS
        KEK K +    KPPRPPRGPSLDAAD+  ++EIAELA  KRA +ER++ALKK +A K +S  SS   +  TL   +FF +++FQG+S + +
Subjt:  KEKIKTNGKVHKPPRPPRGPSLDAADRMFVKEIAELAEKKRATVERVKALKKMKAEKTSSFNSSFPALFVTL---LFFAIIIFQGMSAKGS

AT3G17120.2 unknown protein4.9e-1551.65Show/hide
Query:  KEKIKTNGKVHKPPRPPRGPSLDAADRMFVKEIAELAEKKRATVERVKALKKMKAEKTSSFNSSFPALFVTL---LFFAIIIFQGMSAKGS
        KEK K +    KPPRPPRGPSLDAAD+  ++EIAELA  KRA +ER++ALKK +A K +S  SS   +  TL   +FF +++FQG+S + +
Subjt:  KEKIKTNGKVHKPPRPPRGPSLDAADRMFVKEIAELAEKKRATVERVKALKKMKAEKTSSFNSSFPALFVTL---LFFAIIIFQGMSAKGS

AT4G01960.1 unknown protein5.3e-1729.83Show/hide
Query:  KDRECEIDLESGSNTSEEDLNSETEPISKPHSRKTFGRLRSGFLCTDGSTNRDSSFASSSSSSKLVKLGVEENVEFLMDKSLDEEKRREHGALAEKKKNV
        ++ + ++D+E G     +++ +     ++  S      + SG L  DGS                     E++ + L+D  + E KRRE  + +    ++
Subjt:  KDRECEIDLESGSNTSEEDLNSETEPISKPHSRKTFGRLRSGFLCTDGSTNRDSSFASSSSSSKLVKLGVEENVEFLMDKSLDEEKRREHGALAEKKKNV

Query:  K---EKIKTNGKVHKPPRPPRGPSLDAADRMFVKEIAELAEKKRATVERVKALKKMKAEKTSSFNSSFPALFVTLLFFAIIIFQG-MSAKGSIMALESPV
        K   +K K   K  KPPRPP+GP L A D+  ++EI ELA +KRA +ER+K L+++KA K+SS  SS  A+ VT++FF  +IFQG  ++  S+ +  SP 
Subjt:  K---EKIKTNGKVHKPPRPPRGPSLDAADRMFVKEIAELAEKKRATVERVKALKKMKAEKTSSFNSSFPALFVTLLFFAIIIFQG-MSAKGSIMALESPV

Query:  PSIGGRSSLITDQHSSKSTSKVNEHHSRILNFEGKRPS
        P+    + +++ Q  ++   +     S   +F  KR S
Subjt:  PSIGGRSSLITDQHSSKSTSKVNEHHSRILNFEGKRPS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGTGGTTTAGATTGTATGGCTACAAAAGATCGAGAGTGTGAAATAGATCTTGAAAGTGGTAGTAATACTAGTGAAGAGGATTTGAACAGTGAAACTGAACCAATATC
AAAACCACATTCTAGAAAAACCTTCGGCAGGCTTCGGAGTGGGTTTCTGTGTACCGATGGATCTACAAATAGAGATTCTAGCTTTGCTTCAAGTAGTAGCTCCTCCAAAC
TCGTTAAGCTTGGTGTTGAAGAGAATGTCGAATTTTTGATGGACAAGAGTTTGGATGAAGAAAAGAGAAGAGAACACGGGGCTCTTGCAGAGAAGAAGAAGAATGTCAAA
GAGAAGATTAAGACAAATGGAAAGGTTCACAAGCCACCACGACCTCCGAGGGGTCCATCGTTGGATGCTGCTGACAGAATGTTTGTTAAAGAGATCGCAGAGTTGGCAGA
GAAGAAGCGTGCCACGGTTGAGCGAGTTAAAGCTTTGAAGAAGATGAAAGCAGAGAAAACTTCTTCATTCAACAGCAGCTTTCCTGCCTTGTTTGTCACATTGCTTTTCT
TTGCAATCATTATCTTTCAAGGAATGAGTGCTAAAGGTAGCATAATGGCATTGGAGTCTCCTGTGCCTTCCATTGGCGGAAGATCGAGCTTGATAACTGATCAGCATTCA
TCCAAGTCGACCTCTAAAGTTAATGAACATCACTCCCGCATTCTCAATTTTGAAGGTAAGCGACCTTCTGACCCAGCTACCATAGGAGAAGCGAGCGCAGTGGAAGATTC
GAAGAACCGTTGA
mRNA sequenceShow/hide mRNA sequence
TTAGAAAATGGCGAAATGAACTCGTTTTAGAATTGCAATTGCAATTTAAGCGCCGCACCTCACGATTTTAGATCTGAAATTGAAACTCCATTTCTGATTATGCCCCAAAT
TTCTTTCCTCAATCTCACGAATTGGGTTCGCCCTTTTTCTTCTTCCTCTTCTAATTCTCCAGATCTCTGTGTGTTGTTCGTCTGTTCGTTGGATCTCTCCATGCACACTC
TAGCTTGAAAATGGCAGAAACGGCTTCTACTTCTTGATTTTTTATCTGTCGGTTCATCAGTGTGTTTAAGAAGATGAGAACAAAAGAAGGCCTCTTTGATTTGGGATCTT
GGTAGAATGAATGCCCTGTGGAAATGGGAAGTAAAACCTTATTCTTAGATGAGTGGTTTAGATTGTATGGCTACAAAAGATCGAGAGTGTGAAATAGATCTTGAAAGTGG
TAGTAATACTAGTGAAGAGGATTTGAACAGTGAAACTGAACCAATATCAAAACCACATTCTAGAAAAACCTTCGGCAGGCTTCGGAGTGGGTTTCTGTGTACCGATGGAT
CTACAAATAGAGATTCTAGCTTTGCTTCAAGTAGTAGCTCCTCCAAACTCGTTAAGCTTGGTGTTGAAGAGAATGTCGAATTTTTGATGGACAAGAGTTTGGATGAAGAA
AAGAGAAGAGAACACGGGGCTCTTGCAGAGAAGAAGAAGAATGTCAAAGAGAAGATTAAGACAAATGGAAAGGTTCACAAGCCACCACGACCTCCGAGGGGTCCATCGTT
GGATGCTGCTGACAGAATGTTTGTTAAAGAGATCGCAGAGTTGGCAGAGAAGAAGCGTGCCACGGTTGAGCGAGTTAAAGCTTTGAAGAAGATGAAAGCAGAGAAAACTT
CTTCATTCAACAGCAGCTTTCCTGCCTTGTTTGTCACATTGCTTTTCTTTGCAATCATTATCTTTCAAGGAATGAGTGCTAAAGGTAGCATAATGGCATTGGAGTCTCCT
GTGCCTTCCATTGGCGGAAGATCGAGCTTGATAACTGATCAGCATTCATCCAAGTCGACCTCTAAAGTTAATGAACATCACTCCCGCATTCTCAATTTTGAAGGTAAGCG
ACCTTCTGACCCAGCTACCATAGGAGAAGCGAGCGCAGTGGAAGATTCGAAGAACCGTTGAAGTTAAGAGTCAATTATCTGGTAAACCATTGCCAGCTCCTTCACACTGA
AATTTTCATATGTTATCACTTGATTGTCATCATGTTGGTGTCTGTAAATATACAAAGGCCTCAAACCTCTTTTGTTACTCTTTTCTTGTACAAAGAGAATTCACATTTGA
TTGATCCAATATTATTTTTGTTGTTCTCAAATTGGCTTGCTCTCACCAGAACCCTGAACTTTGAACTTAGCCAAAATGGACTTCAAAAAATTGGTGGTGCTTACAAGATT
TATATAAATATATTGATATTTGAAGAGG
Protein sequenceShow/hide protein sequence
MSGLDCMATKDRECEIDLESGSNTSEEDLNSETEPISKPHSRKTFGRLRSGFLCTDGSTNRDSSFASSSSSSKLVKLGVEENVEFLMDKSLDEEKRREHGALAEKKKNVK
EKIKTNGKVHKPPRPPRGPSLDAADRMFVKEIAELAEKKRATVERVKALKKMKAEKTSSFNSSFPALFVTLLFFAIIIFQGMSAKGSIMALESPVPSIGGRSSLITDQHS
SKSTSKVNEHHSRILNFEGKRPSDPATIGEASAVEDSKNR