CuGenDBv2

Sed0017413 (gene) of Chayote v1 genome

Gene ID: Sed0017413
Organism: Sechium edule (Chayote v1)
Description: Protein Ycf2-like
Genome location: LG11:6974011..6977435
RNA-Seq Expression: Sed0017413
Synteny: Sed0017413
Gene Ontology terms: NA
InterPro domains: IPR015410 - Domain of unknown function DUF1985


Homology
GenBank top hits (e value, % identity, alignment)
KAA0047596.1 protein Ycf2-like [Cucumis melo var. makuwa] (e value: 2.4e-42; identity: 37.58%)
Query:  TKTVKKCKKPEAVVPEAETKYDSKPKGKG-LRGEKVKPVEALDDPDLDDDYVMTAYHRSRAIKINLCCRSGVMAAIQKNLGEHLMEKFKNTCFGHLLHFS
        +K  +K KK +AV  E + + D + KGK  +  EK +          D  Y+M +  R+  +KINL  +S V+  I++NLG+ L+ +F+   FGH L+ S
Subjt:  TKTVKKCKKPEAVVPEAETKYDSKPKGKG-LRGEKVKPVEALDDPDLDDDYVMTAYHRSRAIKINLCCRSGVMAAIQKNLGEHLMEKFKNTCFGHLLHFS

Query:  VKQASSQLLLRIIQRQCDPQKYPKLTFKIGGKLLTFGLREFALITGLNYGPLVKIDTTTIKDAGRLRKDYFEEDEVIKWFVLNMTFRVNRRAPEVNMLKM
        +   SSQLLL +IQR C P+   +L F IGG++L FGLREFALITGL    +  I+   I   GRL+  YFE  + +    LN+ F ++    + + +KM
Subjt:  VKQASSQLLLRIIQRQCDPQKYPKLTFKIGGKLLTFGLREFALITGLNYGPLVKIDTTTIKDAGRLRKDYFEEDEVIKWFVLNMTFRVNRRAPEVNMLKM

Query:  SLLYCLESFLLAKQD--KVAW------------------RVAYQLLGSNVRNAGVGQGNCTVGMAGFVYALLVWAYEVIPALSAAPAYYATRIDNTLP
        + LY LESFL+ KQ+   V W                  RVA++LL   +      +G   + M GF++ +L WAYEVIP LS  P ++ TRI N +P
Subjt:  SLLYCLESFLLAKQD--KVAW------------------RVAYQLLGSNVRNAGVGQGNCTVGMAGFVYALLVWAYEVIPALSAAPAYYATRIDNTLP

XP_031743189.1 uncharacterized protein LOC101221625 isoform X2 [Cucumis sativus] (e value: 2.3e-29; identity: 31.76%)
Query:  KKPEAVVPEAETKYDSKPKGKGLRGEKV----KPVEALDDPDLDDDYVMTAYHRSRAI--KINLCCRSGVMAAIQKNLGEHLMEKFKNTCFGHLLHFSVK
        ++ + +  +++   D K K +G RG+K        E  D+ D+  +Y +     S A   +INL  +  V++ I+  L E  ++KFK +CFG+ L   + 
Subjt:  KKPEAVVPEAETKYDSKPKGKGLRGEKV----KPVEALDDPDLDDDYVMTAYHRSRAI--KINLCCRSGVMAAIQKNLGEHLMEKFKNTCFGHLLHFSVK

Query:  QASSQLLLRIIQRQCDPQKYPKLTFKIGGKLLTFGLREFALITGLNYGPLVKIDTTTIKDAGRLRKDYFEEDEVIKWFVLNMTFRVNRRAPEVNMLKMSL
        + SSQL   +I+RQC  +   +L F + G++  FG+++FALITGLN G L  ID + I+  G+  K YF  ++ I+   L+  F    +    +++KM+ 
Subjt:  QASSQLLLRIIQRQCDPQKYPKLTFKIGGKLLTFGLREFALITGLNYGPLVKIDTTTIKDAGRLRKDYFEEDEVIKWFVLNMTFRVNRRAPEVNMLKMSL

Query:  LYCLESFLLAKQ-------------------DKVAW-RVAYQLLGSNVRNAGVGQGNCTVGMAGFVYALLVWAYEVIPALSAAPAYYATRIDNTLP
        LY LE F+L KQ                   D   W R++Y++    V+ +        +G+ GF YALLVWAYE IP L+    + A RI    P
Subjt:  LYCLESFLLAKQ-------------------DKVAW-RVAYQLLGSNVRNAGVGQGNCTVGMAGFVYALLVWAYEVIPALSAAPAYYATRIDNTLP

XP_031743190.1 uncharacterized protein LOC101221625 isoform X3 [Cucumis sativus] (e value: 1.4e-29; identity: 32.76%)
Query:  VPEAETKYDSKPKGKGLRGEKV----KPVEALDDPDLDDDYVMTAYHRSRAI--KINLCCRSGVMAAIQKNLGEHLMEKFKNTCFGHLLHFSVKQASSQL
        + E     D K K +G RG+K        E  D+ D+  +Y +     S A   +INL  +  V++ I+  L E  ++KFK +CFG+ L   + + SSQL
Subjt:  VPEAETKYDSKPKGKGLRGEKV----KPVEALDDPDLDDDYVMTAYHRSRAI--KINLCCRSGVMAAIQKNLGEHLMEKFKNTCFGHLLHFSVKQASSQL

Query:  LLRIIQRQCDPQKYPKLTFKIGGKLLTFGLREFALITGLNYGPLVKIDTTTIKDAGRLRKDYFEEDEVIKWFVLNMTFRVNRRAPEVNMLKMSLLYCLES
           +I+RQC  +   +L F + G++  FG+++FALITGLN G L  ID + I+  G+  K YF  ++ I+   L+  F    +    +++KM+ LY LE 
Subjt:  LLRIIQRQCDPQKYPKLTFKIGGKLLTFGLREFALITGLNYGPLVKIDTTTIKDAGRLRKDYFEEDEVIKWFVLNMTFRVNRRAPEVNMLKMSLLYCLES

Query:  FLLAKQ-------------------DKVAW-RVAYQLLGSNVRNAGVGQGNCTVGMAGFVYALLVWAYEVIPALSAAPAYYATRIDNTLP
        F+L KQ                   D   W R++Y++    V+ +        +G+ GF YALLVWAYE IP L+    + A RI    P
Subjt:  FLLAKQ-------------------DKVAW-RVAYQLLGSNVRNAGVGQGNCTVGMAGFVYALLVWAYEVIPALSAAPAYYATRIDNTLP

XP_031743194.1 uncharacterized protein LOC101221625 isoform X7 [Cucumis sativus] (e value: 1.8e-29; identity: 31.51%)
Query:  KKPEAVVPEAETKYDSKPKGKGLRGEKVKPVEALDDPDLDDDYVMTAYHRSRAI--KINLCCRSGVMAAIQKNLGEHLMEKFKNTCFGHLLHFSVKQASS
        ++ + +  +++   D K K +G RG+K   +        D++Y +     S A   +INL  +  V++ I+  L E  ++KFK +CFG+ L   + + SS
Subjt:  KKPEAVVPEAETKYDSKPKGKGLRGEKVKPVEALDDPDLDDDYVMTAYHRSRAI--KINLCCRSGVMAAIQKNLGEHLMEKFKNTCFGHLLHFSVKQASS

Query:  QLLLRIIQRQCDPQKYPKLTFKIGGKLLTFGLREFALITGLNYGPLVKIDTTTIKDAGRLRKDYFEEDEVIKWFVLNMTFRVNRRAPEVNMLKMSLLYCL
        QL   +I+RQC  +   +L F + G++  FG+++FALITGLN G L  ID + I+  G+  K YF  ++ I+   L+  F    +    +++KM+ LY L
Subjt:  QLLLRIIQRQCDPQKYPKLTFKIGGKLLTFGLREFALITGLNYGPLVKIDTTTIKDAGRLRKDYFEEDEVIKWFVLNMTFRVNRRAPEVNMLKMSLLYCL

Query:  ESFLLAKQ-------------------DKVAW-RVAYQLLGSNVRNAGVGQGNCTVGMAGFVYALLVWAYEVIPALSAAPAYYATRIDNTLP
        E F+L KQ                   D   W R++Y++    V+ +        +G+ GF YALLVWAYE IP L+    + A RI    P
Subjt:  ESFLLAKQ-------------------DKVAW-RVAYQLLGSNVRNAGVGQGNCTVGMAGFVYALLVWAYEVIPALSAAPAYYATRIDNTLP

XP_031743195.1 uncharacterized protein LOC101221625 isoform X8 [Cucumis sativus] (e value: 1.0e-29; identity: 32.52%)
Query:  VPEAETKYDSKPKGKGLRGEKVKPVEALDDPDLDDDYVMTAYHRSRAI--KINLCCRSGVMAAIQKNLGEHLMEKFKNTCFGHLLHFSVKQASSQLLLRI
        + E     D K K +G RG+K   +        D++Y +     S A   +INL  +  V++ I+  L E  ++KFK +CFG+ L   + + SSQL   +
Subjt:  VPEAETKYDSKPKGKGLRGEKVKPVEALDDPDLDDDYVMTAYHRSRAI--KINLCCRSGVMAAIQKNLGEHLMEKFKNTCFGHLLHFSVKQASSQLLLRI

Query:  IQRQCDPQKYPKLTFKIGGKLLTFGLREFALITGLNYGPLVKIDTTTIKDAGRLRKDYFEEDEVIKWFVLNMTFRVNRRAPEVNMLKMSLLYCLESFLLA
        I+RQC  +   +L F + G++  FG+++FALITGLN G L  ID + I+  G+  K YF  ++ I+   L+  F    +    +++KM+ LY LE F+L 
Subjt:  IQRQCDPQKYPKLTFKIGGKLLTFGLREFALITGLNYGPLVKIDTTTIKDAGRLRKDYFEEDEVIKWFVLNMTFRVNRRAPEVNMLKMSLLYCLESFLLA

Query:  KQ-------------------DKVAW-RVAYQLLGSNVRNAGVGQGNCTVGMAGFVYALLVWAYEVIPALSAAPAYYATRIDNTLP
        KQ                   D   W R++Y++    V+ +        +G+ GF YALLVWAYE IP L+    + A RI    P
Subjt:  KQ-------------------DKVAW-RVAYQLLGSNVRNAGVGQGNCTVGMAGFVYALLVWAYEVIPALSAAPAYYATRIDNTLP

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KI50 TF-B3 domain-containing protein (e value: 3.3e-29; identity: 31.21%)
Query:  EEAVLTTPGPSTKTVKKCKKPEAVVPEAETKYDSKPKGKGLRGEKV----KPVEALDDPDLDDDYVMTAYHRSRAI--KINLCCRSGVMAAIQKNLGEHL
        E + + T   S + V+  ++ +      +   D K K +G RG+K        E  D+ D+  +Y +     S A   +INL  +  V++ I+  L E  
Subjt:  EEAVLTTPGPSTKTVKKCKKPEAVVPEAETKYDSKPKGKGLRGEKV----KPVEALDDPDLDDDYVMTAYHRSRAI--KINLCCRSGVMAAIQKNLGEHL

Query:  MEKFKNTCFGHLLHFSVKQASSQLLLRIIQRQCDPQKYPKLTFKIGGKLLTFGLREFALITGLNYGPLVKIDTTTIKDAGRLRKDYFEEDEVIKWFVLNM
        ++KFK +CFG+ L   + + SSQL   +I+RQC  +   +L F + G++  FG+++FALITGLN G L  ID + I+  G+  K YF  ++ I+   L+ 
Subjt:  MEKFKNTCFGHLLHFSVKQASSQLLLRIIQRQCDPQKYPKLTFKIGGKLLTFGLREFALITGLNYGPLVKIDTTTIKDAGRLRKDYFEEDEVIKWFVLNM

Query:  TFRVNRRAPEVNMLKMSLLYCLESFLLAKQ-------------------DKVAW-RVAYQLLGSNVRNAGVGQGNCTVGMAGFVYALLVWAYEVIPALSA
         F    +    +++KM+ LY LE F+L KQ                   D   W R++Y++    V+ +        +G+ GF YALLVWAYE IP L+ 
Subjt:  TFRVNRRAPEVNMLKMSLLYCLESFLLAKQ-------------------DKVAW-RVAYQLLGSNVRNAGVGQGNCTVGMAGFVYALLVWAYEVIPALSA

Query:  APAYYATRIDNTLP
           + A RI    P
Subjt:  APAYYATRIDNTLP

A0A1S3B0L9 uncharacterized protein LOC103484737 isoform X5 (e value: 2.1e-28; identity: 30.89%)
Query:  EEAVLTTPGPSTKTVKKCKKPEAVVPEAETKYDSKPKGKGLRGEK----VKPVEALDDPDLDDDY--VMTAYHRSRAIKINLCCRSGVMAAIQKNLGEHL
        E + + T   S + V+  +K +      +     K K +G +G K      P E  D+ D+  +Y  ++     S   +INL  +  V++ I+  L E  
Subjt:  EEAVLTTPGPSTKTVKKCKKPEAVVPEAETKYDSKPKGKGLRGEK----VKPVEALDDPDLDDDY--VMTAYHRSRAIKINLCCRSGVMAAIQKNLGEHL

Query:  MEKFKNTCFGHLLHFSVKQASSQLLLRIIQRQCDPQKYPKLTFKIGGKLLTFGLREFALITGLNYGPLVKIDTTTIKDAGRLRKDYFEEDEVIKWFVLNM
        ++KFK +CFG+ L   V + SSQL   +I+RQC  +   +L F + G++  FG+++FALITGLN G L  ID + I+  G+  K YF  ++ I+   L+ 
Subjt:  MEKFKNTCFGHLLHFSVKQASSQLLLRIIQRQCDPQKYPKLTFKIGGKLLTFGLREFALITGLNYGPLVKIDTTTIKDAGRLRKDYFEEDEVIKWFVLNM

Query:  TFRVNRRAPEVNMLKMSLLYCLESFLLAKQ-------------------DKVAW-RVAYQLLGSNVRNAGVGQGNCTVGMAGFVYALLVWAYEVIPALSA
         F    +    +++KM+ LY LE F+L KQ                   D   W R++Y++    V+ A        +G+ GF +AL VWAYE IP L+ 
Subjt:  TFRVNRRAPEVNMLKMSLLYCLESFLLAKQ-------------------DKVAW-RVAYQLLGSNVRNAGVGQGNCTVGMAGFVYALLVWAYEVIPALSA

Query:  APAYYATRIDNTLP
           ++A RI    P
Subjt:  APAYYATRIDNTLP

A0A1S3B181 uncharacterized protein LOC103484737 isoform X7 (e value: 2.1e-28; identity: 30.89%)
Query:  EEAVLTTPGPSTKTVKKCKKPEAVVPEAETKYDSKPKGKGLRGEK----VKPVEALDDPDLDDDY--VMTAYHRSRAIKINLCCRSGVMAAIQKNLGEHL
        E + + T   S + V+  +K +      +     K K +G +G K      P E  D+ D+  +Y  ++     S   +INL  +  V++ I+  L E  
Subjt:  EEAVLTTPGPSTKTVKKCKKPEAVVPEAETKYDSKPKGKGLRGEK----VKPVEALDDPDLDDDY--VMTAYHRSRAIKINLCCRSGVMAAIQKNLGEHL

Query:  MEKFKNTCFGHLLHFSVKQASSQLLLRIIQRQCDPQKYPKLTFKIGGKLLTFGLREFALITGLNYGPLVKIDTTTIKDAGRLRKDYFEEDEVIKWFVLNM
        ++KFK +CFG+ L   V + SSQL   +I+RQC  +   +L F + G++  FG+++FALITGLN G L  ID + I+  G+  K YF  ++ I+   L+ 
Subjt:  MEKFKNTCFGHLLHFSVKQASSQLLLRIIQRQCDPQKYPKLTFKIGGKLLTFGLREFALITGLNYGPLVKIDTTTIKDAGRLRKDYFEEDEVIKWFVLNM

Query:  TFRVNRRAPEVNMLKMSLLYCLESFLLAKQ-------------------DKVAW-RVAYQLLGSNVRNAGVGQGNCTVGMAGFVYALLVWAYEVIPALSA
         F    +    +++KM+ LY LE F+L KQ                   D   W R++Y++    V+ A        +G+ GF +AL VWAYE IP L+ 
Subjt:  TFRVNRRAPEVNMLKMSLLYCLESFLLAKQ-------------------DKVAW-RVAYQLLGSNVRNAGVGQGNCTVGMAGFVYALLVWAYEVIPALSA

Query:  APAYYATRIDNTLP
           ++A RI    P
Subjt:  APAYYATRIDNTLP

A0A5A7U047 Protein Ycf2-like (e value: 1.2e-42; identity: 37.58%)
Query:  TKTVKKCKKPEAVVPEAETKYDSKPKGKG-LRGEKVKPVEALDDPDLDDDYVMTAYHRSRAIKINLCCRSGVMAAIQKNLGEHLMEKFKNTCFGHLLHFS
        +K  +K KK +AV  E + + D + KGK  +  EK +          D  Y+M +  R+  +KINL  +S V+  I++NLG+ L+ +F+   FGH L+ S
Subjt:  TKTVKKCKKPEAVVPEAETKYDSKPKGKG-LRGEKVKPVEALDDPDLDDDYVMTAYHRSRAIKINLCCRSGVMAAIQKNLGEHLMEKFKNTCFGHLLHFS

Query:  VKQASSQLLLRIIQRQCDPQKYPKLTFKIGGKLLTFGLREFALITGLNYGPLVKIDTTTIKDAGRLRKDYFEEDEVIKWFVLNMTFRVNRRAPEVNMLKM
        +   SSQLLL +IQR C P+   +L F IGG++L FGLREFALITGL    +  I+   I   GRL+  YFE  + +    LN+ F ++    + + +KM
Subjt:  VKQASSQLLLRIIQRQCDPQKYPKLTFKIGGKLLTFGLREFALITGLNYGPLVKIDTTTIKDAGRLRKDYFEEDEVIKWFVLNMTFRVNRRAPEVNMLKM

Query:  SLLYCLESFLLAKQD--KVAW------------------RVAYQLLGSNVRNAGVGQGNCTVGMAGFVYALLVWAYEVIPALSAAPAYYATRIDNTLP
        + LY LESFL+ KQ+   V W                  RVA++LL   +      +G   + M GF++ +L WAYEVIP LS  P ++ TRI N +P
Subjt:  SLLYCLESFLLAKQD--KVAW------------------RVAYQLLGSNVRNAGVGQGNCTVGMAGFVYALLVWAYEVIPALSAAPAYYATRIDNTLP

A0A5D3CNI7 TF-B3 domain-containing protein (e value: 2.1e-28; identity: 30.89%)
Query:  EEAVLTTPGPSTKTVKKCKKPEAVVPEAETKYDSKPKGKGLRGEK----VKPVEALDDPDLDDDY--VMTAYHRSRAIKINLCCRSGVMAAIQKNLGEHL
        E + + T   S + V+  +K +      +     K K +G +G K      P E  D+ D+  +Y  ++     S   +INL  +  V++ I+  L E  
Subjt:  EEAVLTTPGPSTKTVKKCKKPEAVVPEAETKYDSKPKGKGLRGEK----VKPVEALDDPDLDDDY--VMTAYHRSRAIKINLCCRSGVMAAIQKNLGEHL

Query:  MEKFKNTCFGHLLHFSVKQASSQLLLRIIQRQCDPQKYPKLTFKIGGKLLTFGLREFALITGLNYGPLVKIDTTTIKDAGRLRKDYFEEDEVIKWFVLNM
        ++KFK +CFG+ L   V + SSQL   +I+RQC  +   +L F + G++  FG+++FALITGLN G L  ID + I+  G+  K YF  ++ I+   L+ 
Subjt:  MEKFKNTCFGHLLHFSVKQASSQLLLRIIQRQCDPQKYPKLTFKIGGKLLTFGLREFALITGLNYGPLVKIDTTTIKDAGRLRKDYFEEDEVIKWFVLNM

Query:  TFRVNRRAPEVNMLKMSLLYCLESFLLAKQ-------------------DKVAW-RVAYQLLGSNVRNAGVGQGNCTVGMAGFVYALLVWAYEVIPALSA
         F    +    +++KM+ LY LE F+L KQ                   D   W R++Y++    V+ A        +G+ GF +AL VWAYE IP L+ 
Subjt:  TFRVNRRAPEVNMLKMSLLYCLESFLLAKQ-------------------DKVAW-RVAYQLLGSNVRNAGVGQGNCTVGMAGFVYALLVWAYEVIPALSA

Query:  APAYYATRIDNTLP
           ++A RI    P
Subjt:  APAYYATRIDNTLP

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT1G31150.1 Domain of unknown function (DUF1985) (e value: 1.9e-05; identity: 32.26%)
Query:  KINLCCRSGVMAAIQKNL-GEHLMEKFKNTCFGHLLHFSVKQA--SSQLLLRIIQRQCDPQKYPKLTFKIGGKLLTFGLREFALITGLNYGPL
        ++N+  R   +  I   L G    E+ K++ FG L  F V +   S +L+  ++ RQ   +K  +L F  GG  + F +REF ++TGL  G L
Subjt:  KINLCCRSGVMAAIQKNL-GEHLMEKFKNTCFGHLLHFSVKQA--SSQLLLRIIQRQCDPQKYPKLTFKIGGKLLTFGLREFALITGLNYGPL


Sequences
CDS sequence
ATGGCTCGAACCAAGCGAGTAAGGGAAGAGGCTGTACTTACTACACCTGGGCCATCGACCAAAACTGTGAAAAAATGTAAAAAGCCAGAGGCAGTTGTACCCGAGGCTGA
GACCAAGTATGACAGCAAACCAAAGGGGAAGGGACTAAGGGGAGAAAAGGTCAAACCTGTTGAAGCATTGGATGACCCTGATCTGGATGATGACTATGTTATGACGGCCT
ACCATAGATCAAGAGCAATCAAGATTAACTTGTGCTGCCGGAGTGGTGTAATGGCAGCTATTCAAAAGAACTTGGGAGAGCATCTCATGGAGAAGTTTAAAAACACCTGT
TTTGGGCACCTACTACATTTCTCGGTTAAACAAGCATCATCCCAACTGTTGTTGCGCATAATACAACGACAATGTGACCCTCAGAAGTACCCAAAGTTGACATTTAAGAT
TGGTGGAAAATTGTTGACTTTCGGGCTACGAGAGTTTGCACTTATTACAGGGCTGAACTATGGGCCCTTGGTGAAAATAGATACAACAACAATCAAGGATGCGGGGCGCC
TCAGAAAAGACTACTTTGAAGAGGATGAGGTGATCAAGTGGTTTGTATTGAACATGACTTTTAGGGTCAATAGGCGAGCTCCTGAAGTAAACATGTTGAAGATGTCTTTG
TTGTACTGCCTTGAGAGTTTCTTATTAGCAAAACAAGACAAGGTTGCATGGCGCGTCGCATATCAATTGCTGGGGTCGAATGTGCGAAATGCTGGAGTCGGACAAGGAAA
TTGTACAGTAGGAATGGCTGGATTTGTCTATGCATTGTTGGTATGGGCGTATGAGGTCATACCCGCTCTGAGTGCAGCCCCTGCGTATTATGCAACAAGGATTGACAATA
CATTGCCTCATGAGAAGTTAGCAGCTCGTGCACGAAGAAACGGACTAGGTGGACATGAGGATGTACCATTGAACTTCTTCCACGGATCTCCATCAACTACAACACTCAAT
AAAGAGATTTCCTTGAGGGTGGATGCTTACGAAAGACAACAGGAAGCCCTACAACAACAAATATCCTTAATGATGGTGACTTTGGATAATGCTATGAAATATATCAAAAT
GATAAGCAGCGCAGTGGTGAGCCATACAACAATGCCACCACCAGCATCAGTTGATCCGGAGGACAAGACACAGAAAGAGGATGACATGGTGGATAAAGGGACAACGAGTA
ATGTTGACGACCATAGCATGCACAAGGATCCAGATGATGACAATGGTCCTATCAGTGGACCATCTGGTGCCCATGGACTAAATATCCAGGGGACTCAATTAACAAAAATT
CAGACTTCCTCAACGAATGAACAACACACATCGCGTTCATCAGGGGGGGAGCAACAGTACGCCGATGATAACAATAAGTCAATGAACAATGCGATCAACCCAATCAGTGA
ACGAGAGGTACTTAGCAAACTTGCAGCAAAGGCGAGATTTGCTGAAACTTTAAAGCGCCCCCGGTGGGTCACCAAAGCTGAAGCACTGCCACCAAAATTTGATGCCCCGT
CGTTCGACCTACATCTCATCCATCTATTAGGAACTTATGTTGTTCTTTAA
mRNA sequence
ATGGCTCGAACCAAGCGAGTAAGGGAAGAGGCTGTACTTACTACACCTGGGCCATCGACCAAAACTGTGAAAAAATGTAAAAAGCCAGAGGCAGTTGTACCCGAGGCTGA
GACCAAGTATGACAGCAAACCAAAGGGGAAGGGACTAAGGGGAGAAAAGGTCAAACCTGTTGAAGCATTGGATGACCCTGATCTGGATGATGACTATGTTATGACGGCCT
ACCATAGATCAAGAGCAATCAAGATTAACTTGTGCTGCCGGAGTGGTGTAATGGCAGCTATTCAAAAGAACTTGGGAGAGCATCTCATGGAGAAGTTTAAAAACACCTGT
TTTGGGCACCTACTACATTTCTCGGTTAAACAAGCATCATCCCAACTGTTGTTGCGCATAATACAACGACAATGTGACCCTCAGAAGTACCCAAAGTTGACATTTAAGAT
TGGTGGAAAATTGTTGACTTTCGGGCTACGAGAGTTTGCACTTATTACAGGGCTGAACTATGGGCCCTTGGTGAAAATAGATACAACAACAATCAAGGATGCGGGGCGCC
TCAGAAAAGACTACTTTGAAGAGGATGAGGTGATCAAGTGGTTTGTATTGAACATGACTTTTAGGGTCAATAGGCGAGCTCCTGAAGTAAACATGTTGAAGATGTCTTTG
TTGTACTGCCTTGAGAGTTTCTTATTAGCAAAACAAGACAAGGTTGCATGGCGCGTCGCATATCAATTGCTGGGGTCGAATGTGCGAAATGCTGGAGTCGGACAAGGAAA
TTGTACAGTAGGAATGGCTGGATTTGTCTATGCATTGTTGGTATGGGCGTATGAGGTCATACCCGCTCTGAGTGCAGCCCCTGCGTATTATGCAACAAGGATTGACAATA
CATTGCCTCATGAGAAGTTAGCAGCTCGTGCACGAAGAAACGGACTAGGTGGACATGAGGATGTACCATTGAACTTCTTCCACGGATCTCCATCAACTACAACACTCAAT
AAAGAGATTTCCTTGAGGGTGGATGCTTACGAAAGACAACAGGAAGCCCTACAACAACAAATATCCTTAATGATGGTGACTTTGGATAATGCTATGAAATATATCAAAAT
GATAAGCAGCGCAGTGGTGAGCCATACAACAATGCCACCACCAGCATCAGTTGATCCGGAGGACAAGACACAGAAAGAGGATGACATGGTGGATAAAGGGACAACGAGTA
ATGTTGACGACCATAGCATGCACAAGGATCCAGATGATGACAATGGTCCTATCAGTGGACCATCTGGTGCCCATGGACTAAATATCCAGGGGACTCAATTAACAAAAATT
CAGACTTCCTCAACGAATGAACAACACACATCGCGTTCATCAGGGGGGGAGCAACAGTACGCCGATGATAACAATAAGTCAATGAACAATGCGATCAACCCAATCAGTGA
ACGAGAGGTACTTAGCAAACTTGCAGCAAAGGCGAGATTTGCTGAAACTTTAAAGCGCCCCCGGTGGGTCACCAAAGCTGAAGCACTGCCACCAAAATTTGATGCCCCGT
CGTTCGACCTACATCTCATCCATCTATTAGGAACTTATGTTGTTCTTTAA
Protein sequence
MARTKRVREEAVLTTPGPSTKTVKKCKKPEAVVPEAETKYDSKPKGKGLRGEKVKPVEALDDPDLDDDYVMTAYHRSRAIKINLCCRSGVMAAIQKNLGEHLMEKFKNTC
FGHLLHFSVKQASSQLLLRIIQRQCDPQKYPKLTFKIGGKLLTFGLREFALITGLNYGPLVKIDTTTIKDAGRLRKDYFEEDEVIKWFVLNMTFRVNRRAPEVNMLKMSL
LYCLESFLLAKQDKVAWRVAYQLLGSNVRNAGVGQGNCTVGMAGFVYALLVWAYEVIPALSAAPAYYATRIDNTLPHEKLAARARRNGLGGHEDVPLNFFHGSPSTTTLN
KEISLRVDAYERQQEALQQQISLMMVTLDNAMKYIKMISSAVVSHTTMPPPASVDPEDKTQKEDDMVDKGTTSNVDDHSMHKDPDDDNGPISGPSGAHGLNIQGTQLTKI
QTSSTNEQHTSRSSGGEQQYADDNNKSMNNAINPISEREVLSKLAAKARFAETLKRPRWVTKAEALPPKFDAPSFDLHLIHLLGTYVVL
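
The protein sequence above can be cross-checked against the CDS by translating it codon by codon. The following is a minimal Python sketch, not part of the database record: it assumes the standard nuclear genetic code applies to this gene, and the helper name `translate` is hypothetical. Only the first 30 and last 18 nucleotides of the CDS are used here; the full CDS can be pasted in the same way.

```python
from itertools import product

# Standard genetic code with codons enumerated in TCAG order
# (assumption: the standard nuclear code applies to this gene).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nt and last 18 nt of the CDS sequence listed in this record.
cds_start = "ATGGCTCGAACCAAGCGAGTAAGGGAAGAG"
cds_end = "ACTTATGTTGTTCTTTAA"

print(translate(cds_start))  # MARTKRVREE
print(translate(cds_end))    # TYVVL
```

Both fragments agree with the start (MARTKRVREE) and end (TYVVL) of the protein sequence listed above.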