; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0004223 (gene) of Chayote v1 genome

Gene IDSed0004223
OrganismSechium edule (Chayote v1)
DescriptionUnknown protein
Genome locationLG10:31434714..31436183
RNA-Seq ExpressionSed0004223
SyntenySed0004223
Gene Ontology termsNA
InterPro domainsIPR008480 - Protein of unknown function DUF761, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6588851.1 hypothetical protein SDJN03_17416, partial [Cucurbita argyrosperma subsp. sororia]7.2e-7169.91Show/hide
Query:  GLQSSVRAKRLWNVLKVTLLLMAKKGLISKRRFIMDMNFMVKRGKLFRKSVTNYFMFHHHHS----RDTTRGGFGIQDYEFSCSNTPINPVFSHMSK---
        GLQSSVRAKRLWNVLKVTL+LMA+KGLISKRRFIMDMN MVKRGKLFRKS T+YFMFHHHH      D T GGFGIQDYEFSCSNTPINPVFSHMSK   
Subjt:  GLQSSVRAKRLWNVLKVTLLLMAKKGLISKRRFIMDMNFMVKRGKLFRKSVTNYFMFHHHHS----RDTTRGGFGIQDYEFSCSNTPINPVFSHMSK---

Query:  EHHTYFPCINLPMEIIETKQQEND-------------------FT-----------------LSPAMEGGSPLVSSFNVRVSNYSSEEENEISSESGLVD
        ++H YFPCINLP+E +ETKQQE +                   FT                 LSPAMEG SPLVSSFNVR+SNYSSEEEN+ISSESG VD
Subjt:  EHHTYFPCINLPMEIIETKQQEND-------------------FT-----------------LSPAMEGGSPLVSSFNVRVSNYSSEEENEISSESGLVD

Query:  NQAEEFIRRFYEQLKMQKRLQLLQYS
        NQAEEFIRRFYEQL+ Q+RLQ LQYS
Subjt:  NQAEEFIRRFYEQLKMQKRLQLLQYS

KAG7022606.1 hypothetical protein SDJN02_16340, partial [Cucurbita argyrosperma subsp. argyrosperma]1.9e-7171.17Show/hide
Query:  GLQSSVRAKRLWNVLKVTLLLMAKKGLISKRRFIMDMNFMVKRGKLFRKSVTNYFMFHHHHSR----DTTRGGFGIQDYEFSCSNTPINPVFSHMSK---
        GLQSSVRAKRLWNVLKVTL+LMA+KGLISKRRFIMDMN MVKRGKLFRKS T+YFMFHHHH      D T GGFGIQDYEFSCSNTPINPVFSHMSK   
Subjt:  GLQSSVRAKRLWNVLKVTLLLMAKKGLISKRRFIMDMNFMVKRGKLFRKSVTNYFMFHHHHSR----DTTRGGFGIQDYEFSCSNTPINPVFSHMSK---

Query:  EHHTYFPCINLPMEIIETKQQEND---------------FT-----------------LSPAMEGGSPLVSSFNVRVSNYSSEEENEISSESGLVDNQAE
        ++H YFPCINLP+E +ETKQQE +               FT                 LSPAMEG SPLVSSFNVR+SNYSSEEEN+ISSESG VDNQAE
Subjt:  EHHTYFPCINLPMEIIETKQQEND---------------FT-----------------LSPAMEGGSPLVSSFNVRVSNYSSEEENEISSESGLVDNQAE

Query:  EFIRRFYEQLKMQKRLQLLQYS
        EFIRRFYEQL+ Q+RLQ LQYS
Subjt:  EFIRRFYEQLKMQKRLQLLQYS

XP_022928225.1 uncharacterized protein LOC111435119 [Cucurbita moschata]1.6e-7071.04Show/hide
Query:  GLQSSVRAKRLWNVLKVTLLLMAKKGLISKRRFIMDMNFMVKRGKLFRKSVTNYFMFHHHH----SRDTTRGGFGIQDYEFSCSNTPINPVFSHMSK---
        GLQSSVRAKRLWNVLKVTL+LMA+KGLISKRRFIMDMN MVKRGKLFRKS T+YFMFHHHH    + D T GGFGIQDYEFSCSNTPIN VFSHMSK   
Subjt:  GLQSSVRAKRLWNVLKVTLLLMAKKGLISKRRFIMDMNFMVKRGKLFRKSVTNYFMFHHHH----SRDTTRGGFGIQDYEFSCSNTPINPVFSHMSK---

Query:  EHHTYFPCINLPMEIIETKQQEND--------------FT-----------------LSPAMEGGSPLVSSFNVRVSNYSSEEENEISSESGLVDNQAEE
        ++H YFPCINLP+E +ETKQQE +              FT                 LSPAMEG SPLVS FNVR+SNYSSEEENEISSESG VDNQAEE
Subjt:  EHHTYFPCINLPMEIIETKQQEND--------------FT-----------------LSPAMEGGSPLVSSFNVRVSNYSSEEENEISSESGLVDNQAEE

Query:  FIRRFYEQLKMQKRLQLLQYS
        FIRRFYEQL+ Q+RLQ LQYS
Subjt:  FIRRFYEQLKMQKRLQLLQYS

XP_022989722.1 uncharacterized protein LOC111486722 [Cucurbita maxima]3.8e-7272.48Show/hide
Query:  GLQSSVRAKRLWNVLKVTLLLMAKKGLISKRRFIMDMNFMVKRGKLFRKSVTNYFMFHHHH----SRDTTRGGFGIQDYEFSCSNTPINPVFSHMSK---
        GLQSSVRAKRLWNVLKVTL+LMA+KGLISKRRFIMDMN M+KRGKLFRKS T+YFMFHHHH    + D T GGFGIQDYEFSCSNTPINPVFSHMSK   
Subjt:  GLQSSVRAKRLWNVLKVTLLLMAKKGLISKRRFIMDMNFMVKRGKLFRKSVTNYFMFHHHH----SRDTTRGGFGIQDYEFSCSNTPINPVFSHMSK---

Query:  EHHTYFPCINLPMEIIETKQQEND-----------FT-----------------LSPAMEGGSPLVSSFNVRVSNYSSEEENEISSESGLVDNQAEEFIR
        ++H YFPCINLP+E +ETKQQE +           FT                 LSPAMEG SPLVS FNVR+SNYSSEEENEISSESG VDNQAEEFIR
Subjt:  EHHTYFPCINLPMEIIETKQQEND-----------FT-----------------LSPAMEGGSPLVSSFNVRVSNYSSEEENEISSESGLVDNQAEEFIR

Query:  RFYEQLKMQKRLQLLQYS
        RFYEQL+ Q+RLQLLQYS
Subjt:  RFYEQLKMQKRLQLLQYS

XP_038887134.1 uncharacterized protein LOC120077324 [Benincasa hispida]1.2e-7369.47Show/hide
Query:  MGLQSSVRAKRLWNVLKVTLLLMAKKGLISKRRFIMDMNFMVKRGKLFRKSVTNYFMFHHHH----SRDTTRGGFGIQDYEFSCSNTPINPVFSHMSK--
        MGLQSSVRAKRLWNVLKVTL+LMA+KGLISKRRFIMDMN MVKRGKLFRKSVTNYFMFHHHH     RD TRG FGIQDYEFSCSNTPINPVFSH SK  
Subjt:  MGLQSSVRAKRLWNVLKVTLLLMAKKGLISKRRFIMDMNFMVKRGKLFRKSVTNYFMFHHHH----SRDTTRGGFGIQDYEFSCSNTPINPVFSHMSK--

Query:  -EHHTYFPCINLPMEIIETKQQEND-----------------------------------FTLSPAMEGGSPLVSSFNVRVSNYSSEEENEISSESGLVD
         ++HTYFPCINLP+EI   +QQE +                                   + LSPAMEG SPLVS FNVR+SNYSSEE+NEIS ESG VD
Subjt:  -EHHTYFPCINLPMEIIETKQQEND-----------------------------------FTLSPAMEGGSPLVSSFNVRVSNYSSEEENEISSESGLVD

Query:  NQAEEFIRRFYEQLKMQKRLQLLQYS
        NQAEEFI+RFYEQLKMQ+RLQLLQY+
Subjt:  NQAEEFIRRFYEQLKMQKRLQLLQYS

TrEMBL top hitse value%identityAlignment
A0A5D3DYM4 Uncharacterized protein4.2e-6966.96Show/hide
Query:  MGLQSSVRAKRLWNVLKVTLLLMAKKGLISKRRFIMDMNFMVKRGKLFRKSVTNYFMFHHHHS----RDTTRGGFGIQDYEFSCSNTPINPVFSHMSK--
        MGLQS VRAKRLWNVLKVTL+LMA+KGLISKRRFIMDMN MVKRGK+FRKSVTNYFMFHHHH+    RD T GGFGIQDYEFSCSNTPIN VFSHMSK  
Subjt:  MGLQSSVRAKRLWNVLKVTLLLMAKKGLISKRRFIMDMNFMVKRGKLFRKSVTNYFMFHHHHS----RDTTRGGFGIQDYEFSCSNTPINPVFSHMSK--

Query:  -EHHTYFPCINLPMEIIETKQQEND---------------------------------------FTLSPAMEGGSPLVSSFNVRVSNYSSEEENEISSES
         ++ TYFPCINLP+E+   +QQ++D                                       F LSPAME  SP    FNVR+SNYSSEEENEISSES
Subjt:  -EHHTYFPCINLPMEIIETKQQEND---------------------------------------FTLSPAMEGGSPLVSSFNVRVSNYSSEEENEISSES

Query:  GLVDNQAEEFIRRFYEQLKMQKRLQLLQYS
        G VDNQAEEFIRRFYEQLKMQKRLQLLQY+
Subjt:  GLVDNQAEEFIRRFYEQLKMQKRLQLLQYS

A0A6J1ER45 uncharacterized protein LOC1114351197.7e-7171.04Show/hide
Query:  GLQSSVRAKRLWNVLKVTLLLMAKKGLISKRRFIMDMNFMVKRGKLFRKSVTNYFMFHHHH----SRDTTRGGFGIQDYEFSCSNTPINPVFSHMSK---
        GLQSSVRAKRLWNVLKVTL+LMA+KGLISKRRFIMDMN MVKRGKLFRKS T+YFMFHHHH    + D T GGFGIQDYEFSCSNTPIN VFSHMSK   
Subjt:  GLQSSVRAKRLWNVLKVTLLLMAKKGLISKRRFIMDMNFMVKRGKLFRKSVTNYFMFHHHH----SRDTTRGGFGIQDYEFSCSNTPINPVFSHMSK---

Query:  EHHTYFPCINLPMEIIETKQQEND--------------FT-----------------LSPAMEGGSPLVSSFNVRVSNYSSEEENEISSESGLVDNQAEE
        ++H YFPCINLP+E +ETKQQE +              FT                 LSPAMEG SPLVS FNVR+SNYSSEEENEISSESG VDNQAEE
Subjt:  EHHTYFPCINLPMEIIETKQQEND--------------FT-----------------LSPAMEGGSPLVSSFNVRVSNYSSEEENEISSESGLVDNQAEE

Query:  FIRRFYEQLKMQKRLQLLQYS
        FIRRFYEQL+ Q+RLQ LQYS
Subjt:  FIRRFYEQLKMQKRLQLLQYS

A0A6J1F4V9 uncharacterized protein LOC1114421853.8e-7071.98Show/hide
Query:  MGLQSSVRAKRLWNVLKVTLLLMAKKGLISKRRFIMDMNFMVKRGKLFRKSVTNYFMFHHHH----SRDTTRGGFGIQDYEFSCSNTPINPVFSHMSK--
        MGLQSSVR KRLWNVLK TL+LMA+KGLISKRRFIMDMN MVKRGK+FRKS+TNYFMFHHHH     R+ TRGGF IQDYEF  SNTPINPVFSHMSK  
Subjt:  MGLQSSVRAKRLWNVLKVTLLLMAKKGLISKRRFIMDMNFMVKRGKLFRKSVTNYFMFHHHH----SRDTTRGGFGIQDYEFSCSNTPINPVFSHMSK--

Query:  -EHHTYFPCINLPMEI--IETKQQEND--------------FTLSPAMEGGSPLVSSFNVRVSNYSSEEENEISSESGLVDNQAEEFIRRFYEQLKMQKR
         ++ TYFPCINLP+EI   E K++E D              + LSPAME GSPLV+ FNVR+SNYSSEEEN+I+ ESG VDNQAEEFIRRFYEQLKMQ+ 
Subjt:  -EHHTYFPCINLPMEI--IETKQQEND--------------FTLSPAMEGGSPLVSSFNVRVSNYSSEEENEISSESGLVDNQAEEFIRRFYEQLKMQKR

Query:  LQLLQYS
        LQLLQY+
Subjt:  LQLLQYS

A0A6J1J771 uncharacterized protein LOC1114823801.5e-6971.63Show/hide
Query:  MGLQSSVRAKRLWNVLKVTLLLMAKKGLISKRRFIMDMNFMVKRGKLFRKSVTNYFMFHHHH-----SRDTTRGGFGIQDYEFSCSNTPINPVFSHMSK-
        MGLQSSVR KRLWNVLK TL+LMA+KGLISKRRFIMDMN MVKRGK+FRKS+TNYFMFHHHH      R  TRGGF IQDYEF  SNTPINPVFSHMSK 
Subjt:  MGLQSSVRAKRLWNVLKVTLLLMAKKGLISKRRFIMDMNFMVKRGKLFRKSVTNYFMFHHHH-----SRDTTRGGFGIQDYEFSCSNTPINPVFSHMSK-

Query:  --EHHTYFPCINLPMEI--IETKQQEND--------------FTLSPAMEGGSPLVSSFNVRVSNYSSEEENEISSESGLVDNQAEEFIRRFYEQLKMQK
           + TYFPCINLP+EI   E K++E D              + LSPAME GSPLV+ FNVR+SNYSSEEEN+I+ ESG VDNQAEEFIRRFYEQLKMQ+
Subjt:  --EHHTYFPCINLPMEI--IETKQQEND--------------FTLSPAMEGGSPLVSSFNVRVSNYSSEEENEISSESGLVDNQAEEFIRRFYEQLKMQK

Query:  RLQLLQYS
         LQLLQY+
Subjt:  RLQLLQYS

A0A6J1JGM3 uncharacterized protein LOC1114867221.8e-7272.48Show/hide
Query:  GLQSSVRAKRLWNVLKVTLLLMAKKGLISKRRFIMDMNFMVKRGKLFRKSVTNYFMFHHHH----SRDTTRGGFGIQDYEFSCSNTPINPVFSHMSK---
        GLQSSVRAKRLWNVLKVTL+LMA+KGLISKRRFIMDMN M+KRGKLFRKS T+YFMFHHHH    + D T GGFGIQDYEFSCSNTPINPVFSHMSK   
Subjt:  GLQSSVRAKRLWNVLKVTLLLMAKKGLISKRRFIMDMNFMVKRGKLFRKSVTNYFMFHHHH----SRDTTRGGFGIQDYEFSCSNTPINPVFSHMSK---

Query:  EHHTYFPCINLPMEIIETKQQEND-----------FT-----------------LSPAMEGGSPLVSSFNVRVSNYSSEEENEISSESGLVDNQAEEFIR
        ++H YFPCINLP+E +ETKQQE +           FT                 LSPAMEG SPLVS FNVR+SNYSSEEENEISSESG VDNQAEEFIR
Subjt:  EHHTYFPCINLPMEIIETKQQEND-----------FT-----------------LSPAMEGGSPLVSSFNVRVSNYSSEEENEISSESGLVDNQAEEFIR

Query:  RFYEQLKMQKRLQLLQYS
        RFYEQL+ Q+RLQLLQYS
Subjt:  RFYEQLKMQKRLQLLQYS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G52140.1 unknown protein1.5e-1031.25Show/hide
Query:  AKRLWNVLKVTLLLMAKKGLISKRRFIMDMNFMVKRGKLFRKSVTNYFMFHHH---HSRDTTRGGFGI--------QDYEFSCSNTP-INPVFSHMS---
        +K+LWN+++  LL M +KG +SK + I D N  +KRGK          MFH     H+  T               Q+YEFSCSNTP  +  FS+M+   
Subjt:  AKRLWNVLKVTLLLMAKKGLISKRRFIMDMNFMVKRGKLFRKSVTNYFMFHHH---HSRDTTRGGFGI--------QDYEFSCSNTP-INPVFSHMS---

Query:  -KEHHTYFPCINLP-------------MEIIETKQQENDFT-------LSPAME--GGSPLVSSFNVRVSNYSSEEENEISSESGLVDNQAEEFIRRFYE
         K H+  F C   P             +E++    ++ + T       LSP     G +PLV    V  S +    EN     +G VD  A++FI++FY+
Subjt:  -KEHHTYFPCINLP-------------MEIIETKQQENDFT-------LSPAME--GGSPLVSSFNVRVSNYSSEEENEISSESGLVDNQAEEFIRRFYE

Query:  QLKMQKRL
         L  QK++
Subjt:  QLKMQKRL

AT3G16330.1 unknown protein8.6e-0626.47Show/hide
Query:  AKRLWNVLKVTLLLMAKKGLISKRRFIMDMNFMVKRGKLFRKSVTNYFMFHHHHS-RDTTRGGFGIQDYEFSCSNT-----PINPVFSHMSKEHHTYFPC
        +K+L N+++  L ++ K   ISK++ + D N  +KRGK          MFH+      +        +YEFSCS+T     P N         H++ F C
Subjt:  AKRLWNVLKVTLLLMAKKGLISKRRFIMDMNFMVKRGKLFRKSVTNYFMFHHHHS-RDTTRGGFGIQDYEFSCSNT-----PINPVFSHMSKEHHTYFPC

Query:  INLP----------MEIIETKQQENDF---------------TLSPAMEGGSPLVSS---FNVRVSNYSSEEENEISSESGLVDNQAEEFIRRFYEQLKM
           P            ++E      D                 LSP + G     SS     V  S +   EE +++  +G VD  A+EFI++FY+ L  
Subjt:  INLP----------MEIIETKQQENDF---------------TLSPAMEGGSPLVSS---FNVRVSNYSSEEENEISSESGLVDNQAEEFIRRFYEQLKM

Query:  QKRL
        QK++
Subjt:  QKRL

AT4G29110.1 unknown protein1.9e-0526.92Show/hide
Query:  MGLQSSVRAKRLWNVLKVTLLLMAKKGLISKRRFIMDMNFMVKRGKLFRKSVTNYFMFHHHHSRDTTRGGFGIQDYE-FSCSNTPINPVFSHMSKEHHTY
        M   + V AKRLW V+++   ++ K G + K + ++D+N M+KRG    K++TN                  ++DY+ F+  +     V      E    
Subjt:  MGLQSSVRAKRLWNVLKVTLLLMAKKGLISKRRFIMDMNFMVKRGKLFRKSVTNYFMFHHHHSRDTTRGGFGIQDYE-FSCSNTPINPVFSHMSKEHHTY

Query:  FPCINLPMEIIETKQQEND-FTLSPAMEGGSPLVSSFNVRVSNYSSE--EENEISSESGLVDNQAEEFIRRFYEQLKMQKRL
           +    E++     END  T++      SPL+ S  VR    +      ++      +VD  AEEFI++FY+ LK+QK++
Subjt:  FPCINLPMEIIETKQQEND-FTLSPAMEGGSPLVSSFNVRVSNYSSE--EENEISSESGLVDNQAEEFIRRFYEQLKMQKRL

AT4G32860.1 unknown protein9.5e-0526.7Show/hide
Query:  MGLQSSVRAKRLWNVLKVTLLLMAKKGLISKRRFI--MDMNFMVKRGKLFRKSVTNYFMFHHHHSRDTTR------------GGFGIQ-DYEFSCSNTPI
        M + S+V  K+L ++ K+ L  + K    S+ + +  +D + + KRGK+ RKS+         HSR T R                +Q +YEFSCS+TP 
Subjt:  MGLQSSVRAKRLWNVLKVTLLLMAKKGLISKRRFI--MDMNFMVKRGKLFRKSVTNYFMFHHHHSRDTTR------------GGFGIQ-DYEFSCSNTPI

Query:  NPVFSHMSKEHHTYFPCINLPMEIIETKQQENDFTLSPAMEGGSPLVSSFNVRVSNYSSEEENEISSESGLVDNQAEEFIRRFYEQLKMQK
           ++    +        N P+     +Q    +   P +      +   +V  + +     +  + ES  VD  AEEFI+ FY QL++QK
Subjt:  NPVFSHMSKEHHTYFPCINLPMEIIETKQQENDFTLSPAMEGGSPLVSSFNVRVSNYSSEEENEISSESGLVDNQAEEFIRRFYEQLKMQK

AT5G38700.1 unknown protein9.2e-0830.05Show/hide
Query:  SVRAKRLWNVLKVTLLLMAKKGLISKRRFIMDMNFMVKRGKLFRKSVTNYFMFHHHHSRDTTRGGFGIQDYEFSCSNTPINPVFSHMSKEHHTYFPCINL
        S+  +R W+++++  LL  +KG I KR  + ++        LF K +      +HH+S D  R      + + S   TPI  V  H         PCI  
Subjt:  SVRAKRLWNVLKVTLLLMAKKGLISKRRFIMDMNFMVKRGKLFRKSVTNYFMFHHHHSRDTTRGGFGIQDYEFSCSNTPINPVFSHMSKEHHTYFPCINL

Query:  PMEI-----IETKQQENDFTLSPAMEGGSPLVSSFNVRVSNYSSEEENEISSESGLVDNQAEEFIRRFYEQLKMQKRLQLLQY
        P +      ++  Q + D   S     GS   +    R +N   EEE E       VD +AEEFI +FYEQ+K+Q+++  LQY
Subjt:  PMEI-----IETKQQENDFTLSPAMEGGSPLVSSFNVRVSNYSSEEENEISSESGLVDNQAEEFIRRFYEQLKMQKRLQLLQY


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGACTCCAATCCTCTGTAAGAGCCAAGAGGCTTTGGAATGTTCTTAAAGTGACACTTCTTCTAATGGCCAAAAAGGGCCTAATCTCAAAGAGAAGGTTCATCATGGA
CATGAACTTCATGGTCAAGAGAGGCAAGCTCTTCAGAAAATCTGTCACCAATTACTTCATGTTCCATCACCACCACTCCCGCGATACGACACGTGGCGGTTTCGGCATAC
AGGACTATGAATTCTCTTGCAGCAACACTCCCATTAATCCTGTTTTTTCCCACATGTCAAAGGAGCACCATACTTACTTCCCCTGCATCAACCTTCCTATGGAGATAATA
GAAACAAAGCAACAAGAAAATGACTTTACTTTGAGCCCTGCAATGGAGGGGGGAAGTCCACTCGTTTCGTCGTTCAACGTGCGGGTATCGAACTATTCGTCCGAGGAAGA
AAATGAAATTAGCAGTGAAAGTGGACTGGTGGATAATCAGGCTGAGGAGTTCATCAGAAGGTTCTATGAACAGCTGAAGATGCAAAAACGCCTGCAATTACTGCAATATA
GTTAA
mRNA sequenceShow/hide mRNA sequence
CACACCCTTTTGGTGTTTCCCATCATCTTCCCTTCCTCCTCCAGTTTCGTCTTCTCGTTCGGGTTTTCTTTGCTTCTGCGAGGCCTCTCAACCCGACCTCGCAGGAACAA
AGAAACCCATCTCCGAGCCTAGCAACCAACCATGGGACTCCAATCCTCTGTAAGAGCCAAGAGGCTTTGGAATGTTCTTAAAGTGACACTTCTTCTAATGGCCAAAAAGG
GCCTAATCTCAAAGAGAAGGTTCATCATGGACATGAACTTCATGGTCAAGAGAGGCAAGCTCTTCAGAAAATCTGTCACCAATTACTTCATGTTCCATCACCACCACTCC
CGCGATACGACACGTGGCGGTTTCGGCATACAGGACTATGAATTCTCTTGCAGCAACACTCCCATTAATCCTGTTTTTTCCCACATGTCAAAGGAGCACCATACTTACTT
CCCCTGCATCAACCTTCCTATGGAGATAATAGAAACAAAGCAACAAGAAAATGACTTTACTTTGAGCCCTGCAATGGAGGGGGGAAGTCCACTCGTTTCGTCGTTCAACG
TGCGGGTATCGAACTATTCGTCCGAGGAAGAAAATGAAATTAGCAGTGAAAGTGGACTGGTGGATAATCAGGCTGAGGAGTTCATCAGAAGGTTCTATGAACAGCTGAAG
ATGCAAAAACGCCTGCAATTACTGCAATATAGTTAATATTCATTCCTCTTTCTGTTGTGTGATTTCTTGAAATTCAACATCATATTCTTTTATCCATTATGCAGCATACT
AAATGTATGGAAAAGGAAAAATACACAAGAGGATTTGTTGTGTGTTACATCTAAATGTTGAGTTTTCATGGATGAATCATGAGTCCTTTCTGAGTTTGGTTTCAGGAGGA
TATTATTGAAAGTGACAGCATTAATGGGGTTTATATACATATTTATATATAAATATATATACACAAATACTCATTTAGCATCTCATCTCATGTTTTTGTGGGGCACGGAA
CTTTTCAATGAATAGAGATTCTTAAAAGATACAAACTCCACAAATGGGATGGGGATCAGGAAACATGGATGAGAAATACATTACCTGCTTAGCCCGGTAAGAATTTATTT
TCATCCCTGTTCTCCACCTAAACAGGGATTCTCCGTCCCGTTATGGTCTGTACGCCCACACCACATTTGATACCCCACAATTGAAGGTTAAAGACAAATTCGTGTAAACA
CTAGACATGCAATATTAGTTCAAGTCTTGAATTTGGAAGAGTTTCAAATAATAGAGAATGTGCCTGATCTCTTTTCTTTCTATAATAAAAATTCTATCTTTCATGTATAT
CTACCTTGATAAGCAAAGCTACAAACTACCTAGACTTTATTAACATTTGAGGTCTTTGTGAGTCGATGGTACATGCCAATTATTGTTGAGTCAAGTTCACTTCTAGGTGA
CTTCATATAAACAACAAGTAATCCAAGAAAATGAATATCA
Protein sequenceShow/hide protein sequence
MGLQSSVRAKRLWNVLKVTLLLMAKKGLISKRRFIMDMNFMVKRGKLFRKSVTNYFMFHHHHSRDTTRGGFGIQDYEFSCSNTPINPVFSHMSKEHHTYFPCINLPMEII
ETKQQENDFTLSPAMEGGSPLVSSFNVRVSNYSSEEENEISSESGLVDNQAEEFIRRFYEQLKMQKRLQLLQYS