; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0011016 (gene) of Chayote v1 genome

Gene IDSed0011016
OrganismSechium edule (Chayote v1)
DescriptionProtein Ycf2-like
Genome locationLG01:62274291..62277476
RNA-Seq ExpressionSed0011016
SyntenySed0011016
Gene Ontology termsNA
InterPro domainsIPR015410 - Domain of unknown function DUF1985


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0047596.1 protein Ycf2-like [Cucumis melo var. makuwa]6.9e-4528.02Show/hide
Query:  KIPAKKEVKTLDDDYEMTEQRRSHPFKINLCCKSNIMTAILNNLGEELAPLFRGMCFGRLLDFSVTKTSSQLLLHLIQRQCKATKFPELTFKIGGRV---
        K P   E K+ D  Y M  +RR+ P KINL  KS ++  I  NLG+ L   FR   FG  L+ S+T  SSQLLLHLIQR CK     +L F IGGRV   
Subjt:  KIPAKKEVKTLDDDYEMTEQRRSHPFKINLCCKSNIMTAILNNLGEELAPLFRGMCFGRLLDFSVTKTSSQLLLHLIQRQCKATKFPELTFKIGGRV---

Query:  ----------------------------------------------------NKRAPEEHMIKMSLLYRLESFVLAKHDKVNIEDKHILMVDDLDMFNTY
                                                            +    ++  IKM+ LY LESF++ K +  +++  HI+MVDD ++F+ Y
Subjt:  ----------------------------------------------------NKRAPEEHMIKMSLLYRLESFVLAKHDKVNIEDKHILMVDDLDMFNTY

Query:  PWGMVAYKLLVSSIRNAGAARGNHTVGMGGMVYALLTWAYEVIPALSSASVFYAKRTGDLLPKMLSWDFETHPEWRELLTGIFECPPLEVYPLIASEEEM
        PWG VA++LLV  +     ++G   + MGG ++ +L WAYEVIP LS+   F+  R  + +P++++   +T P+W++L   +F+ P LEV P++A+ +E+
Subjt:  PWGMVAYKLLVSSIRNAGAARGNHTVGMGGMVYALLTWAYEVIPALSSASVFYAKRTGDLLPKMLSWDFETHPEWRELLTGIFECPPLEVYPLIASEEEM

Query:  AMPYFAPFVAKE-------------------LAS----RGKYIDSRFEKMEKGMDEI---REQLSLLIRSFQTFTNYV-------------------TTI
         MP+FAPF+  E                   +AS    RG    S    + K +++I   ++++   +     F  +V                    T 
Subjt:  AMPYFAPFVAKE-------------------LAS----RGKYIDSRFEKMEKGMDEI---REQLSLLIRSFQTFTNYV-------------------TTI

Query:  CKSKVASPPPDQKTKPENEEDKKDTEHEVGDDHTMHKQVCHPRNDDEDGPTGGNARGTGLPDPIRA----GHLQGRN-TTNDPTPPKTQPGAGGQ--PIE
         + +       ++   + EED+++ + E  +  T ++ V   R+DDED        G GL D  +     GH  G++      TPPK       Q    E
Subjt:  CKSKVASPPPDQKTKPENEEDKKDTEHEVGDDHTMHKQVCHPRNDDEDGPTGGNARGTGLPDPIRA----GHLQGRN-TTNDPTPPKTQPGAGGQ--PIE

Query:  DSDPKINDVILSISERTVLSRLAVEKRQAEIAKVNQIVTKNLVPGV
        ++D +IN +I SI E  +   +  ++++    + +   T  ++  V
Subjt:  DSDPKINDVILSISERTVLSRLAVEKRQAEIAKVNQIVTKNLVPGV

XP_038883715.1 uncharacterized protein LOC120074618 isoform X1 [Benincasa hispida]3.7e-3832.38Show/hide
Query:  KINLCCKSNIMTAILNNLGEELAPLFRGMCFGRLLDFSVTKTSSQLLLHLIQRQCKATKFPELTFKIGGRVNK---------------------------
        +INL  K ++++ I N L E     F+  CFG  LD  +TK SSQL  HL++RQC +T   EL F + GR++K                           
Subjt:  KINLCCKSNIMTAILNNLGEELAPLFRGMCFGRLLDFSVTKTSSQLLLHLIQRQCKATKFPELTFKIGGRVNK---------------------------

Query:  ---------------------------RAPEEHMIKMSLLYRLESFVLAKHDKVNIEDKHILMVDDLDMFNTYPWGMVAYKLLVSSIRNAGAARGNHTVG
                                   +   + ++KM+ LY LE F+L K  +  I  ++ L+VDD + F+ YPWG ++Y++ +  ++ A  +     +G
Subjt:  ---------------------------RAPEEHMIKMSLLYRLESFVLAKHDKVNIEDKHILMVDDLDMFNTYPWGMVAYKLLVSSIRNAGAARGNHTVG

Query:  MGGMVYALLTWAYEVIPALSSASVFYAKRTGDLLPKMLSWDFETHPEWRELLTGIFECPPLEVYPLIASEEEMAMPYFAPF
        +GG  +ALL WAYE IP L   S F A R     P+M +W  + HPEW++L   +F+    +V PLIA++ EM MPY  PF
Subjt:  MGGMVYALLTWAYEVIPALSSASVFYAKRTGDLLPKMLSWDFETHPEWRELLTGIFECPPLEVYPLIASEEEMAMPYFAPF

XP_038883717.1 uncharacterized protein LOC120074618 isoform X3 [Benincasa hispida]3.7e-3832.38Show/hide
Query:  KINLCCKSNIMTAILNNLGEELAPLFRGMCFGRLLDFSVTKTSSQLLLHLIQRQCKATKFPELTFKIGGRVNK---------------------------
        +INL  K ++++ I N L E     F+  CFG  LD  +TK SSQL  HL++RQC +T   EL F + GR++K                           
Subjt:  KINLCCKSNIMTAILNNLGEELAPLFRGMCFGRLLDFSVTKTSSQLLLHLIQRQCKATKFPELTFKIGGRVNK---------------------------

Query:  ---------------------------RAPEEHMIKMSLLYRLESFVLAKHDKVNIEDKHILMVDDLDMFNTYPWGMVAYKLLVSSIRNAGAARGNHTVG
                                   +   + ++KM+ LY LE F+L K  +  I  ++ L+VDD + F+ YPWG ++Y++ +  ++ A  +     +G
Subjt:  ---------------------------RAPEEHMIKMSLLYRLESFVLAKHDKVNIEDKHILMVDDLDMFNTYPWGMVAYKLLVSSIRNAGAARGNHTVG

Query:  MGGMVYALLTWAYEVIPALSSASVFYAKRTGDLLPKMLSWDFETHPEWRELLTGIFECPPLEVYPLIASEEEMAMPYFAPF
        +GG  +ALL WAYE IP L   S F A R     P+M +W  + HPEW++L   +F+    +V PLIA++ EM MPY  PF
Subjt:  MGGMVYALLTWAYEVIPALSSASVFYAKRTGDLLPKMLSWDFETHPEWRELLTGIFECPPLEVYPLIASEEEMAMPYFAPF

XP_038883718.1 uncharacterized protein LOC120074618 isoform X4 [Benincasa hispida]3.7e-3832.38Show/hide
Query:  KINLCCKSNIMTAILNNLGEELAPLFRGMCFGRLLDFSVTKTSSQLLLHLIQRQCKATKFPELTFKIGGRVNK---------------------------
        +INL  K ++++ I N L E     F+  CFG  LD  +TK SSQL  HL++RQC +T   EL F + GR++K                           
Subjt:  KINLCCKSNIMTAILNNLGEELAPLFRGMCFGRLLDFSVTKTSSQLLLHLIQRQCKATKFPELTFKIGGRVNK---------------------------

Query:  ---------------------------RAPEEHMIKMSLLYRLESFVLAKHDKVNIEDKHILMVDDLDMFNTYPWGMVAYKLLVSSIRNAGAARGNHTVG
                                   +   + ++KM+ LY LE F+L K  +  I  ++ L+VDD + F+ YPWG ++Y++ +  ++ A  +     +G
Subjt:  ---------------------------RAPEEHMIKMSLLYRLESFVLAKHDKVNIEDKHILMVDDLDMFNTYPWGMVAYKLLVSSIRNAGAARGNHTVG

Query:  MGGMVYALLTWAYEVIPALSSASVFYAKRTGDLLPKMLSWDFETHPEWRELLTGIFECPPLEVYPLIASEEEMAMPYFAPF
        +GG  +ALL WAYE IP L   S F A R     P+M +W  + HPEW++L   +F+    +V PLIA++ EM MPY  PF
Subjt:  MGGMVYALLTWAYEVIPALSSASVFYAKRTGDLLPKMLSWDFETHPEWRELLTGIFECPPLEVYPLIASEEEMAMPYFAPF

XP_038883719.1 uncharacterized protein LOC120074618 isoform X5 [Benincasa hispida]3.7e-3832.38Show/hide
Query:  KINLCCKSNIMTAILNNLGEELAPLFRGMCFGRLLDFSVTKTSSQLLLHLIQRQCKATKFPELTFKIGGRVNK---------------------------
        +INL  K ++++ I N L E     F+  CFG  LD  +TK SSQL  HL++RQC +T   EL F + GR++K                           
Subjt:  KINLCCKSNIMTAILNNLGEELAPLFRGMCFGRLLDFSVTKTSSQLLLHLIQRQCKATKFPELTFKIGGRVNK---------------------------

Query:  ---------------------------RAPEEHMIKMSLLYRLESFVLAKHDKVNIEDKHILMVDDLDMFNTYPWGMVAYKLLVSSIRNAGAARGNHTVG
                                   +   + ++KM+ LY LE F+L K  +  I  ++ L+VDD + F+ YPWG ++Y++ +  ++ A  +     +G
Subjt:  ---------------------------RAPEEHMIKMSLLYRLESFVLAKHDKVNIEDKHILMVDDLDMFNTYPWGMVAYKLLVSSIRNAGAARGNHTVG

Query:  MGGMVYALLTWAYEVIPALSSASVFYAKRTGDLLPKMLSWDFETHPEWRELLTGIFECPPLEVYPLIASEEEMAMPYFAPF
        +GG  +ALL WAYE IP L   S F A R     P+M +W  + HPEW++L   +F+    +V PLIA++ EM MPY  PF
Subjt:  MGGMVYALLTWAYEVIPALSSASVFYAKRTGDLLPKMLSWDFETHPEWRELLTGIFECPPLEVYPLIASEEEMAMPYFAPF

TrEMBL top hitse value%identityAlignment
A0A0A0KI50 TF-B3 domain-containing protein9.8e-3732.03Show/hide
Query:  KINLCCKSNIMTAILNNLGEELAPLFRGMCFGRLLDFSVTKTSSQLLLHLIQRQCKATKFPELTFKIGGRVNK---------------------------
        +INL  K ++++ I N L E     F+  CFG  LD  ++K SSQL  HLI+RQC +    EL F + GR++K                           
Subjt:  KINLCCKSNIMTAILNNLGEELAPLFRGMCFGRLLDFSVTKTSSQLLLHLIQRQCKATKFPELTFKIGGRVNK---------------------------

Query:  ---------------------------RAPEEHMIKMSLLYRLESFVLAKHDKVNIEDKHILMVDDLDMFNTYPWGMVAYKLLVSSIRNAGAARGNHTVG
                                   +   + ++KM+ LY LE F+L K  +  I  ++ L++DD   F++YPWG ++Y++ V  ++ +  +     +G
Subjt:  ---------------------------RAPEEHMIKMSLLYRLESFVLAKHDKVNIEDKHILMVDDLDMFNTYPWGMVAYKLLVSSIRNAGAARGNHTVG

Query:  MGGMVYALLTWAYEVIPALSSASVFYAKRTGDLLPKMLSWDFETHPEWRELLTGIFECPPLEVYPLIASEEEMAMPYFAPF
        +GG  YALL WAYE IP L+  S F A R     P+M +W    HPEW++L   +F+    +V PLIA+  EM MPY  PF
Subjt:  MGGMVYALLTWAYEVIPALSSASVFYAKRTGDLLPKMLSWDFETHPEWRELLTGIFECPPLEVYPLIASEEEMAMPYFAPF

A0A1S3B065 uncharacterized protein LOC103484737 isoform X49.8e-3731.67Show/hide
Query:  KINLCCKSNIMTAILNNLGEELAPLFRGMCFGRLLDFSVTKTSSQLLLHLIQRQCKATKFPELTFKIGGRVNK---------------------------
        +INL  K ++++ I N L E     F+  CFG  LD  V+K SSQL  HLI+RQC +    EL F + GR++K                           
Subjt:  KINLCCKSNIMTAILNNLGEELAPLFRGMCFGRLLDFSVTKTSSQLLLHLIQRQCKATKFPELTFKIGGRVNK---------------------------

Query:  ---------------------------RAPEEHMIKMSLLYRLESFVLAKHDKVNIEDKHILMVDDLDMFNTYPWGMVAYKLLVSSIRNAGAARGNHTVG
                                   +   + ++KM+ LY LE F+L K  +  I  ++ L++DD + F++YPWG ++Y++ +  ++ A  +     +G
Subjt:  ---------------------------RAPEEHMIKMSLLYRLESFVLAKHDKVNIEDKHILMVDDLDMFNTYPWGMVAYKLLVSSIRNAGAARGNHTVG

Query:  MGGMVYALLTWAYEVIPALSSASVFYAKRTGDLLPKMLSWDFETHPEWRELLTGIFECPPLEVYPLIASEEEMAMPYFAPF
        +GG  +AL  WAYE IP L+  S F+A R     P+M +W  + HPEW++L   +F+    +V PLIA+E EM M Y  PF
Subjt:  MGGMVYALLTWAYEVIPALSSASVFYAKRTGDLLPKMLSWDFETHPEWRELLTGIFECPPLEVYPLIASEEEMAMPYFAPF

A0A1S3B0L9 uncharacterized protein LOC103484737 isoform X59.8e-3731.67Show/hide
Query:  KINLCCKSNIMTAILNNLGEELAPLFRGMCFGRLLDFSVTKTSSQLLLHLIQRQCKATKFPELTFKIGGRVNK---------------------------
        +INL  K ++++ I N L E     F+  CFG  LD  V+K SSQL  HLI+RQC +    EL F + GR++K                           
Subjt:  KINLCCKSNIMTAILNNLGEELAPLFRGMCFGRLLDFSVTKTSSQLLLHLIQRQCKATKFPELTFKIGGRVNK---------------------------

Query:  ---------------------------RAPEEHMIKMSLLYRLESFVLAKHDKVNIEDKHILMVDDLDMFNTYPWGMVAYKLLVSSIRNAGAARGNHTVG
                                   +   + ++KM+ LY LE F+L K  +  I  ++ L++DD + F++YPWG ++Y++ +  ++ A  +     +G
Subjt:  ---------------------------RAPEEHMIKMSLLYRLESFVLAKHDKVNIEDKHILMVDDLDMFNTYPWGMVAYKLLVSSIRNAGAARGNHTVG

Query:  MGGMVYALLTWAYEVIPALSSASVFYAKRTGDLLPKMLSWDFETHPEWRELLTGIFECPPLEVYPLIASEEEMAMPYFAPF
        +GG  +AL  WAYE IP L+  S F+A R     P+M +W  + HPEW++L   +F+    +V PLIA+E EM M Y  PF
Subjt:  MGGMVYALLTWAYEVIPALSSASVFYAKRTGDLLPKMLSWDFETHPEWRELLTGIFECPPLEVYPLIASEEEMAMPYFAPF

A0A1S3B181 uncharacterized protein LOC103484737 isoform X79.8e-3731.67Show/hide
Query:  KINLCCKSNIMTAILNNLGEELAPLFRGMCFGRLLDFSVTKTSSQLLLHLIQRQCKATKFPELTFKIGGRVNK---------------------------
        +INL  K ++++ I N L E     F+  CFG  LD  V+K SSQL  HLI+RQC +    EL F + GR++K                           
Subjt:  KINLCCKSNIMTAILNNLGEELAPLFRGMCFGRLLDFSVTKTSSQLLLHLIQRQCKATKFPELTFKIGGRVNK---------------------------

Query:  ---------------------------RAPEEHMIKMSLLYRLESFVLAKHDKVNIEDKHILMVDDLDMFNTYPWGMVAYKLLVSSIRNAGAARGNHTVG
                                   +   + ++KM+ LY LE F+L K  +  I  ++ L++DD + F++YPWG ++Y++ +  ++ A  +     +G
Subjt:  ---------------------------RAPEEHMIKMSLLYRLESFVLAKHDKVNIEDKHILMVDDLDMFNTYPWGMVAYKLLVSSIRNAGAARGNHTVG

Query:  MGGMVYALLTWAYEVIPALSSASVFYAKRTGDLLPKMLSWDFETHPEWRELLTGIFECPPLEVYPLIASEEEMAMPYFAPF
        +GG  +AL  WAYE IP L+  S F+A R     P+M +W  + HPEW++L   +F+    +V PLIA+E EM M Y  PF
Subjt:  MGGMVYALLTWAYEVIPALSSASVFYAKRTGDLLPKMLSWDFETHPEWRELLTGIFECPPLEVYPLIASEEEMAMPYFAPF

A0A5A7U047 Protein Ycf2-like3.4e-4528.02Show/hide
Query:  KIPAKKEVKTLDDDYEMTEQRRSHPFKINLCCKSNIMTAILNNLGEELAPLFRGMCFGRLLDFSVTKTSSQLLLHLIQRQCKATKFPELTFKIGGRV---
        K P   E K+ D  Y M  +RR+ P KINL  KS ++  I  NLG+ L   FR   FG  L+ S+T  SSQLLLHLIQR CK     +L F IGGRV   
Subjt:  KIPAKKEVKTLDDDYEMTEQRRSHPFKINLCCKSNIMTAILNNLGEELAPLFRGMCFGRLLDFSVTKTSSQLLLHLIQRQCKATKFPELTFKIGGRV---

Query:  ----------------------------------------------------NKRAPEEHMIKMSLLYRLESFVLAKHDKVNIEDKHILMVDDLDMFNTY
                                                            +    ++  IKM+ LY LESF++ K +  +++  HI+MVDD ++F+ Y
Subjt:  ----------------------------------------------------NKRAPEEHMIKMSLLYRLESFVLAKHDKVNIEDKHILMVDDLDMFNTY

Query:  PWGMVAYKLLVSSIRNAGAARGNHTVGMGGMVYALLTWAYEVIPALSSASVFYAKRTGDLLPKMLSWDFETHPEWRELLTGIFECPPLEVYPLIASEEEM
        PWG VA++LLV  +     ++G   + MGG ++ +L WAYEVIP LS+   F+  R  + +P++++   +T P+W++L   +F+ P LEV P++A+ +E+
Subjt:  PWGMVAYKLLVSSIRNAGAARGNHTVGMGGMVYALLTWAYEVIPALSSASVFYAKRTGDLLPKMLSWDFETHPEWRELLTGIFECPPLEVYPLIASEEEM

Query:  AMPYFAPFVAKE-------------------LAS----RGKYIDSRFEKMEKGMDEI---REQLSLLIRSFQTFTNYV-------------------TTI
         MP+FAPF+  E                   +AS    RG    S    + K +++I   ++++   +     F  +V                    T 
Subjt:  AMPYFAPFVAKE-------------------LAS----RGKYIDSRFEKMEKGMDEI---REQLSLLIRSFQTFTNYV-------------------TTI

Query:  CKSKVASPPPDQKTKPENEEDKKDTEHEVGDDHTMHKQVCHPRNDDEDGPTGGNARGTGLPDPIRA----GHLQGRN-TTNDPTPPKTQPGAGGQ--PIE
         + +       ++   + EED+++ + E  +  T ++ V   R+DDED        G GL D  +     GH  G++      TPPK       Q    E
Subjt:  CKSKVASPPPDQKTKPENEEDKKDTEHEVGDDHTMHKQVCHPRNDDEDGPTGGNARGTGLPDPIRA----GHLQGRN-TTNDPTPPKTQPGAGGQ--PIE

Query:  DSDPKINDVILSISERTVLSRLAVEKRQAEIAKVNQIVTKNLVPGV
        ++D +IN +I SI E  +   +  ++++    + +   T  ++  V
Subjt:  DSDPKINDVILSISERTVLSRLAVEKRQAEIAKVNQIVTKNLVPGV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G07240.1 cysteine-type peptidases;cysteine-type peptidases3.7e-0426.97Show/hide
Query:  EHMIKMSLLYRLESFVLAKHDKVNIEDKHILMVDDLDMFNTYPWGMVAYKLLVSSIRNAGAAR-GNHTVGMGGMVYALLTWAYEVIPAL
        E  ++ + L  ++ F+L       I   H  M +DL  F +YPWG ++++++++SI+     +     V + G++YAL     E +PA+
Subjt:  EHMIKMSLLYRLESFVLAKHDKVNIEDKHILMVDDLDMFNTYPWGMVAYKLLVSSIRNAGAAR-GNHTVGMGGMVYALLTWAYEVIPAL

AT2G10260.1 FUNCTIONS IN: molecular_function unknown8.3e-0430.84Show/hide
Query:  MIKMSLLYRLESFVLAKHDKVNIEDKHILMVDDLDMFNTYPWGMVAYKLLVSSIRNAGAARGNHTVGMGGMVYALLTWAYEVIPALSSASVFYAKRTGDL
        M+ +  L  L   +   H    I     + V DL  F  YPWG+VA++ L+ S++     +    V + G V+ALL W YE I  L  A  F  +R    
Subjt:  MIKMSLLYRLESFVLAKHDKVNIEDKHILMVDDLDMFNTYPWGMVAYKLLVSSIRNAGAARGNHTVGMGGMVYALLTWAYEVIPALSSASVFYAKRTGDL

Query:  LPKMLSW
           +L W
Subjt:  LPKMLSW

AT2G10260.2 FUNCTIONS IN: molecular_function unknown8.3e-0430.84Show/hide
Query:  MIKMSLLYRLESFVLAKHDKVNIEDKHILMVDDLDMFNTYPWGMVAYKLLVSSIRNAGAARGNHTVGMGGMVYALLTWAYEVIPALSSASVFYAKRTGDL
        M+ +  L  L   +   H    I     + V DL  F  YPWG+VA++ L+ S++     +    V + G V+ALL W YE I  L  A  F  +R    
Subjt:  MIKMSLLYRLESFVLAKHDKVNIEDKHILMVDDLDMFNTYPWGMVAYKLLVSSIRNAGAARGNHTVGMGGMVYALLTWAYEVIPALSSASVFYAKRTGDL

Query:  LPKMLSW
           +L W
Subjt:  LPKMLSW

AT5G28810.1 Domain of unknown function (DUF1985)8.3e-0435.21Show/hide
Query:  FNTYPWGMVAYKLLVSSIRNAGAARGNHTVGMGGMVYALLTWAYEVIPALSSASVFYAKRTGDLLPKMLSW
        F  YPWG VA+  L+ S++       ++ +   G V ALL W YE +P +  A  F  K T   +P +L W
Subjt:  FNTYPWGMVAYKLLVSSIRNAGAARGNHTVGMGGMVYALLTWAYEVIPALSSASVFYAKRTGDLLPKMLSW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTAAAGGGTAACCCTCCTTCGCGAGGGTTTTTCCCTGGCAGAGTGTTGATGTTGGTTCTAGAGAATCAACCGCAAAGATGGAGATGGCTAAAGGGTGTAAGAGGTCT
AGAGAAGTCACGCAACGAAAATGAAATGGATTCTAACCACGAGGAAAGTGACGCACCTGAAGACCAGGAAGTCGCTGACTCCGAAGATGAGGAGGTTGTACCCTTGGCAA
AAATACCTGCCAAGAAAGAAGTGAAAACACTGGATGATGATTATGAAATGACAGAACAGAGGAGATCGCACCCATTCAAGATAAACTTGTGTTGCAAGAGTAATATAATG
ACTGCAATCCTAAATAATCTAGGCGAGGAGTTAGCACCACTATTCAGAGGGATGTGTTTTGGGCGTTTGTTAGATTTTTCTGTAACCAAAACATCGTCACAATTGTTGTT
GCATTTGATCCAACGACAATGCAAGGCAACGAAGTTCCCTGAGCTAACATTCAAGATAGGGGGGAGGGTTAATAAGAGGGCCCCCGAAGAGCATATGATAAAGATGTCCT
TATTGTACCGCTTAGAGAGCTTTGTACTAGCTAAGCATGACAAAGTTAACATTGAGGATAAACACATACTCATGGTGGATGATCTAGACATGTTTAACACATACCCGTGG
GGTATGGTCGCGTACAAGCTACTTGTTTCAAGCATCCGTAATGCGGGGGCCGCTAGGGGTAATCATACAGTAGGGATGGGTGGCATGGTTTATGCCCTCCTGACATGGGC
GTACGAGGTAATACCAGCTCTGAGTTCTGCCTCGGTATTCTATGCAAAGAGAACAGGTGACCTGCTACCCAAGATGTTGAGCTGGGACTTTGAAACGCATCCTGAATGGA
GAGAACTCCTTACTGGCATATTCGAATGCCCACCGCTTGAGGTCTATCCACTTATTGCTAGCGAAGAAGAAATGGCGATGCCTTACTTTGCTCCGTTTGTTGCTAAGGAG
TTGGCTAGTCGAGGAAAATATATCGACAGTCGCTTTGAGAAGATGGAAAAAGGGATGGATGAGATTCGCGAACAACTATCCTTGCTAATAAGGTCATTCCAAACGTTCAC
TAATTATGTGACAACAATATGCAAGTCAAAAGTCGCCTCCCCTCCACCGGACCAAAAGACTAAACCAGAGAATGAGGAGGACAAGAAAGATACAGAACATGAAGTTGGGG
ATGACCACACAATGCATAAGCAAGTGTGCCACCCCAGGAATGATGATGAAGATGGGCCAACAGGTGGCAACGCTAGAGGTACGGGTCTGCCAGACCCTATACGTGCAGGA
CATTTACAAGGCAGGAACACTACAAATGATCCAACACCACCCAAGACCCAACCAGGTGCCGGTGGGCAGCCAATTGAGGATTCAGATCCGAAGATAAATGATGTGATTTT
ATCCATAAGTGAACGAACTGTTCTAAGTAGGTTGGCGGTAGAAAAGAGGCAAGCTGAGATCGCAAAGGTGAACCAAATTGTTACAAAGAATCTTGTGCCAGGAGTGGCAT
CGTCCCCATATGTTGCTGCATCGAGGGAGGAGAATTGGACCCCGCCATCGAACGATATGCTGAAAGAATTTGACGCTCCTTCATTCGACCTAAAGCTCAGCCAGCAATAG
mRNA sequenceShow/hide mRNA sequence
ATGGTAAAGGGTAACCCTCCTTCGCGAGGGTTTTTCCCTGGCAGAGTGTTGATGTTGGTTCTAGAGAATCAACCGCAAAGATGGAGATGGCTAAAGGGTGTAAGAGGTCT
AGAGAAGTCACGCAACGAAAATGAAATGGATTCTAACCACGAGGAAAGTGACGCACCTGAAGACCAGGAAGTCGCTGACTCCGAAGATGAGGAGGTTGTACCCTTGGCAA
AAATACCTGCCAAGAAAGAAGTGAAAACACTGGATGATGATTATGAAATGACAGAACAGAGGAGATCGCACCCATTCAAGATAAACTTGTGTTGCAAGAGTAATATAATG
ACTGCAATCCTAAATAATCTAGGCGAGGAGTTAGCACCACTATTCAGAGGGATGTGTTTTGGGCGTTTGTTAGATTTTTCTGTAACCAAAACATCGTCACAATTGTTGTT
GCATTTGATCCAACGACAATGCAAGGCAACGAAGTTCCCTGAGCTAACATTCAAGATAGGGGGGAGGGTTAATAAGAGGGCCCCCGAAGAGCATATGATAAAGATGTCCT
TATTGTACCGCTTAGAGAGCTTTGTACTAGCTAAGCATGACAAAGTTAACATTGAGGATAAACACATACTCATGGTGGATGATCTAGACATGTTTAACACATACCCGTGG
GGTATGGTCGCGTACAAGCTACTTGTTTCAAGCATCCGTAATGCGGGGGCCGCTAGGGGTAATCATACAGTAGGGATGGGTGGCATGGTTTATGCCCTCCTGACATGGGC
GTACGAGGTAATACCAGCTCTGAGTTCTGCCTCGGTATTCTATGCAAAGAGAACAGGTGACCTGCTACCCAAGATGTTGAGCTGGGACTTTGAAACGCATCCTGAATGGA
GAGAACTCCTTACTGGCATATTCGAATGCCCACCGCTTGAGGTCTATCCACTTATTGCTAGCGAAGAAGAAATGGCGATGCCTTACTTTGCTCCGTTTGTTGCTAAGGAG
TTGGCTAGTCGAGGAAAATATATCGACAGTCGCTTTGAGAAGATGGAAAAAGGGATGGATGAGATTCGCGAACAACTATCCTTGCTAATAAGGTCATTCCAAACGTTCAC
TAATTATGTGACAACAATATGCAAGTCAAAAGTCGCCTCCCCTCCACCGGACCAAAAGACTAAACCAGAGAATGAGGAGGACAAGAAAGATACAGAACATGAAGTTGGGG
ATGACCACACAATGCATAAGCAAGTGTGCCACCCCAGGAATGATGATGAAGATGGGCCAACAGGTGGCAACGCTAGAGGTACGGGTCTGCCAGACCCTATACGTGCAGGA
CATTTACAAGGCAGGAACACTACAAATGATCCAACACCACCCAAGACCCAACCAGGTGCCGGTGGGCAGCCAATTGAGGATTCAGATCCGAAGATAAATGATGTGATTTT
ATCCATAAGTGAACGAACTGTTCTAAGTAGGTTGGCGGTAGAAAAGAGGCAAGCTGAGATCGCAAAGGTGAACCAAATTGTTACAAAGAATCTTGTGCCAGGAGTGGCAT
CGTCCCCATATGTTGCTGCATCGAGGGAGGAGAATTGGACCCCGCCATCGAACGATATGCTGAAAGAATTTGACGCTCCTTCATTCGACCTAAAGCTCAGCCAGCAATAG
Protein sequenceShow/hide protein sequence
MVKGNPPSRGFFPGRVLMLVLENQPQRWRWLKGVRGLEKSRNENEMDSNHEESDAPEDQEVADSEDEEVVPLAKIPAKKEVKTLDDDYEMTEQRRSHPFKINLCCKSNIM
TAILNNLGEELAPLFRGMCFGRLLDFSVTKTSSQLLLHLIQRQCKATKFPELTFKIGGRVNKRAPEEHMIKMSLLYRLESFVLAKHDKVNIEDKHILMVDDLDMFNTYPW
GMVAYKLLVSSIRNAGAARGNHTVGMGGMVYALLTWAYEVIPALSSASVFYAKRTGDLLPKMLSWDFETHPEWRELLTGIFECPPLEVYPLIASEEEMAMPYFAPFVAKE
LASRGKYIDSRFEKMEKGMDEIREQLSLLIRSFQTFTNYVTTICKSKVASPPPDQKTKPENEEDKKDTEHEVGDDHTMHKQVCHPRNDDEDGPTGGNARGTGLPDPIRAG
HLQGRNTTNDPTPPKTQPGAGGQPIEDSDPKINDVILSISERTVLSRLAVEKRQAEIAKVNQIVTKNLVPGVASSPYVAASREENWTPPSNDMLKEFDAPSFDLKLSQQ