; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0018084 (gene) of Chayote v1 genome

Gene IDSed0018084
OrganismSechium edule (Chayote v1)
DescriptionUnknown protein
Genome locationLG08:3157151..3158301
RNA-Seq ExpressionSed0018084
SyntenySed0018084
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6589708.1 hypothetical protein SDJN03_15131, partial [Cucurbita argyrosperma subsp. sororia]2.6e-9070.44Show/hide
Query:  MDDAGE-----SNRRKDHHGGDGDGEEETEECASLSDLPLDNS-------ETFRKNPRRSSSEPLDLFEFFTTGFIASEISPAEDLIFRGRLLPLNDHS-
        MDD  E     S  R+DHH  + + E E EE  S SDLPLD         E+FRKNPRRSSSEPLDLFEFF+ GFI SEISPAEDLIF GRLLPLNDHS 
Subjt:  MDDAGE-----SNRRKDHHGGDGDGEEETEECASLSDLPLDNS-------ETFRKNPRRSSSEPLDLFEFFTTGFIASEISPAEDLIFRGRLLPLNDHS-

Query:  ---------DKNFF-----KQTGFRKRSESLSGLQSSVSRSNSAKINLKRNSRSLDYRKLYRQSDSIFSPTVEFDRKFPIKSGFKADPMNKKPSSKPRWY
                 +K+F+     KQT FRKRSESLSGLQSSVSRSN+AKINLKRNSRSLDYRKLYRQ++SIFSPT E DR   IK+G K DP+NKK SSKPRWY
Subjt:  ---------DKNFF-----KQTGFRKRSESLSGLQSSVSRSNSAKINLKRNSRSLDYRKLYRQSDSIFSPTVEFDRKFPIKSGFKADPMNKKPSSKPRWY

Query:  FLMFGMVKFPAEMELSDIKSRQVRRRPSAVFTANESKGKFPC-NRSSSEAAWRILRALSCKNHVSVDVRASLTA
         LMFGMVKFPAEM+LSDIKSRQVRR  SA+F ANESKGK+ C NRSS EA WRILRALSCKN+ SVDV ASLTA
Subjt:  FLMFGMVKFPAEMELSDIKSRQVRRRPSAVFTANESKGKFPC-NRSSSEAAWRILRALSCKNHVSVDVRASLTA

KAG7023388.1 hypothetical protein SDJN02_14413 [Cucurbita argyrosperma subsp. argyrosperma]1.1e-9170.65Show/hide
Query:  MFMDDAGE-----SNRRKDHHGGDGDGEEETEECASLSDLPLDNS-------ETFRKNPRRSSSEPLDLFEFFTTGFIASEISPAEDLIFRGRLLPLNDH
        MFMDD  E     S  R+DHH  + + E E EE  S SDLPLD         E+FRKNPRRSSSEPLDLFEFF+ GFI SEISPAEDLIF GRLLPLNDH
Subjt:  MFMDDAGE-----SNRRKDHHGGDGDGEEETEECASLSDLPLDNS-------ETFRKNPRRSSSEPLDLFEFFTTGFIASEISPAEDLIFRGRLLPLNDH

Query:  S----------DKNFF-----KQTGFRKRSESLSGLQSSVSRSNSAKINLKRNSRSLDYRKLYRQSDSIFSPTVEFDRKFPIKSGFKADPMNKKPSSKPR
        S          +K+F+     KQT FRKRSESLSGLQSSVSRSN+AKINLKRNSRSLDYRKLYRQ++SIFSPT E DR   IK+G K DP+NKK SSKPR
Subjt:  S----------DKNFF-----KQTGFRKRSESLSGLQSSVSRSNSAKINLKRNSRSLDYRKLYRQSDSIFSPTVEFDRKFPIKSGFKADPMNKKPSSKPR

Query:  WYFLMFGMVKFPAEMELSDIKSRQVRRRPSAVFTANESKGKFPC-NRSSSEAAWRILRALSCKNHVSVDVRASLTA
        WY LMFGMVKFPAEM+LSDIKSRQVRR  SA+F ANESKGK+ C NRSS EA WRILRALSCKN+ SVDV ASLTA
Subjt:  WYFLMFGMVKFPAEMELSDIKSRQVRRRPSAVFTANESKGKFPC-NRSSSEAAWRILRALSCKNHVSVDVRASLTA

XP_022921809.1 uncharacterized protein LOC111429953 [Cucurbita moschata]2.4e-9170.65Show/hide
Query:  MFMDDAGE-----SNRRKDHHGGDGDGEEETEECASLSDLPLDNS-------ETFRKNPRRSSSEPLDLFEFFTTGFIASEISPAEDLIFRGRLLPLNDH
        MFMDD  E     S  R+DHH  +   E E EE  S SDLPLD         E+FRKNPRRSSSEPLDLFEFF+ GFI SEISPAEDLIF GRLLPLNDH
Subjt:  MFMDDAGE-----SNRRKDHHGGDGDGEEETEECASLSDLPLDNS-------ETFRKNPRRSSSEPLDLFEFFTTGFIASEISPAEDLIFRGRLLPLNDH

Query:  S----------DKNFF-----KQTGFRKRSESLSGLQSSVSRSNSAKINLKRNSRSLDYRKLYRQSDSIFSPTVEFDRKFPIKSGFKADPMNKKPSSKPR
        S          +K+F+     KQT FRKRSESLSGLQSSVSRSN+AKINLKRNSRSLDYRKLYRQ++SIFSPT E DR   IK+G K DP+NKK SSKPR
Subjt:  S----------DKNFF-----KQTGFRKRSESLSGLQSSVSRSNSAKINLKRNSRSLDYRKLYRQSDSIFSPTVEFDRKFPIKSGFKADPMNKKPSSKPR

Query:  WYFLMFGMVKFPAEMELSDIKSRQVRRRPSAVFTANESKGKFPC-NRSSSEAAWRILRALSCKNHVSVDVRASLTA
        WY LMFGMVKFPAEM+LSDIKSRQVRR  SA+F ANESKGK+ C NRSS EA WRILRALSCKN+ SVDV ASLTA
Subjt:  WYFLMFGMVKFPAEMELSDIKSRQVRRRPSAVFTANESKGKFPC-NRSSSEAAWRILRALSCKNHVSVDVRASLTA

XP_023516073.1 uncharacterized protein LOC111780044 [Cucurbita pepo subsp. pepo]1.2e-9070.04Show/hide
Query:  MFMDDAGE-----SNRRKDHHGGDGDGEEETEECASLSDLPLDNS-------ETFRKNPRRSSSEPLDLFEFFTTGFIASEISPAEDLIFRGRLLPLNDH
        MFMDD  E     S  R+DHH  +   E E EE  S SDLPLD         E+FRKNPRRSSSEPLDLFEFF+ GFI SEISPAEDLIF GRLLPLNDH
Subjt:  MFMDDAGE-----SNRRKDHHGGDGDGEEETEECASLSDLPLDNS-------ETFRKNPRRSSSEPLDLFEFFTTGFIASEISPAEDLIFRGRLLPLNDH

Query:  S-----------DKNFF-----KQTGFRKRSESLSGLQSSVSRSNSAKINLKRNSRSLDYRKLYRQSDSIFSPTVEFDRKFPIKSGFKADPMNKKPSSKP
        S           +K+F+     KQ  FRKRSESLSGLQSSVSRSN+AKINLKRNSRSLDYRKLYRQ++SIFSPT E DR   IK+G K DP+NKK SSKP
Subjt:  S-----------DKNFF-----KQTGFRKRSESLSGLQSSVSRSNSAKINLKRNSRSLDYRKLYRQSDSIFSPTVEFDRKFPIKSGFKADPMNKKPSSKP

Query:  RWYFLMFGMVKFPAEMELSDIKSRQVRRRPSAVFTANESKGKFPC-NRSSSEAAWRILRALSCKNHVSVDVRASLTA
        RWY LMFGMVKFPAEM+LSDIKSRQVRR  SA+F ANESKGK+ C NRSS EA WRILRALSCKN+ SVDV ASLTA
Subjt:  RWYFLMFGMVKFPAEMELSDIKSRQVRRRPSAVFTANESKGKFPC-NRSSSEAAWRILRALSCKNHVSVDVRASLTA

XP_038879756.1 uncharacterized protein LOC120071507 [Benincasa hispida]4.4e-9071Show/hide
Query:  MFMDDAGESNRRKDHHGGDGDGEEETEECASLSDLPLD--------NSETFRKNPRRSSSEPLDLFEFFTTGFIASEISPAEDLIFRGRLLPLNDHS---
        MFMDD+ ESN     H  D D ++E+EE  S SDLPLD        + E+ RKNPRRSSSEPLDLFEFF+ GFI SEISPAEDLIF GRLLPLNDHS   
Subjt:  MFMDDAGESNRRKDHHGGDGDGEEETEECASLSDLPLD--------NSETFRKNPRRSSSEPLDLFEFFTTGFIASEISPAEDLIFRGRLLPLNDHS---

Query:  ----DKNFF------KQTGFRKRSESLSGLQSSVSRSNSAKINLKRNSRSLDYRKLYRQSDSIFSPTVEFDRKFPIKSGFKADPMNKKPSSKPRWYFLMF
            +K+F+      KQT FRKRSESLSGLQSSVSRSNSAK NLKRNSRSLDYRKLYRQ +SIFSPT E DR   I +G K D +NKK SSKPRWY LMF
Subjt:  ----DKNFF------KQTGFRKRSESLSGLQSSVSRSNSAKINLKRNSRSLDYRKLYRQSDSIFSPTVEFDRKFPIKSGFKADPMNKKPSSKPRWYFLMF

Query:  GMVKFPAEMELSDIKSRQVRRRPSAVFTANESKGKFPCNRSSSEAAWRILRALSCKNHVSVDVRASLTA
        GMVKFPAEM+L DIKSRQVRR  S +F ANE+KGKF CNRSS EAAWRILRALSCKNH SVDV ASLTA
Subjt:  GMVKFPAEMELSDIKSRQVRRRPSAVFTANESKGKFPCNRSSSEAAWRILRALSCKNHVSVDVRASLTA

TrEMBL top hitse value%identityAlignment
A0A1S3BW36 uncharacterized protein LOC1034942651.7e-8769.14Show/hide
Query:  MDDAGESN----RRKDHHGGDGDGEEETEECASLSDLPLD--------NSETFRKNPRRSSSEPLDLFEFFTTGFIASEISPAEDLIFRGRLLPLNDHS-
        MDD  ESN    +R  H   + D + E++E  S SDLP+D        + ++FRKNPRRSSSEPLDLFEFF+ GFI SEISPAEDLIF GRLLPLNDHS 
Subjt:  MDDAGESN----RRKDHHGGDGDGEEETEECASLSDLPLD--------NSETFRKNPRRSSSEPLDLFEFFTTGFIASEISPAEDLIFRGRLLPLNDHS-

Query:  ----DKNFF------KQTGFRKRSESLSGLQSSVSRSNSAKINLKRNSRSLDYRKLYRQSDSIFSPTVEFDRKFPIKSGFKADPMNKKPSSKPRWYFLMF
            DK+F+      KQT FRKRSESLSGLQSSVSRSNSAK NLKRNSRSLDYR+LYRQ++SIFSPT E DR   IK+G K D +NKK SSKPRWY LMF
Subjt:  ----DKNFF------KQTGFRKRSESLSGLQSSVSRSNSAKINLKRNSRSLDYRKLYRQSDSIFSPTVEFDRKFPIKSGFKADPMNKKPSSKPRWYFLMF

Query:  GMVKFPAEMELSDIKSRQVRRRPSAVFTANESKGKFPCNRSSSEAAWRILRALSCKNHVSVDVRASLTA
        GMVKFPAEMELSDIKSRQVRR  S +F +NE+K KF C RSS EA WRILRALSCKNH SVDV ASLTA
Subjt:  GMVKFPAEMELSDIKSRQVRRRPSAVFTANESKGKFPCNRSSSEAAWRILRALSCKNHVSVDVRASLTA

A0A5A7UVS6 Uncharacterized protein1.7e-8769.14Show/hide
Query:  MDDAGESN----RRKDHHGGDGDGEEETEECASLSDLPLD--------NSETFRKNPRRSSSEPLDLFEFFTTGFIASEISPAEDLIFRGRLLPLNDHS-
        MDD  ESN    +R  H   + D + E++E  S SDLP+D        + ++FRKNPRRSSSEPLDLFEFF+ GFI SEISPAEDLIF GRLLPLNDHS 
Subjt:  MDDAGESN----RRKDHHGGDGDGEEETEECASLSDLPLD--------NSETFRKNPRRSSSEPLDLFEFFTTGFIASEISPAEDLIFRGRLLPLNDHS-

Query:  ----DKNFF------KQTGFRKRSESLSGLQSSVSRSNSAKINLKRNSRSLDYRKLYRQSDSIFSPTVEFDRKFPIKSGFKADPMNKKPSSKPRWYFLMF
            DK+F+      KQT FRKRSESLSGLQSSVSRSNSAK NLKRNSRSLDYR+LYRQ++SIFSPT E DR   IK+G K D +NKK SSKPRWY LMF
Subjt:  ----DKNFF------KQTGFRKRSESLSGLQSSVSRSNSAKINLKRNSRSLDYRKLYRQSDSIFSPTVEFDRKFPIKSGFKADPMNKKPSSKPRWYFLMF

Query:  GMVKFPAEMELSDIKSRQVRRRPSAVFTANESKGKFPCNRSSSEAAWRILRALSCKNHVSVDVRASLTA
        GMVKFPAEMELSDIKSRQVRR  S +F +NE+K KF C RSS EA WRILRALSCKNH SVDV ASLTA
Subjt:  GMVKFPAEMELSDIKSRQVRRRPSAVFTANESKGKFPCNRSSSEAAWRILRALSCKNHVSVDVRASLTA

A0A5D3CL04 Uncharacterized protein7.2e-8668.77Show/hide
Query:  MDDAGESN----RRKDHHGGDGDGEEETEECASLSDLPLD--------NSETFRKNPRRSSSEPLDLFEFFTTGFIASEISPAEDLIFRGRLLPLNDHS-
        MDD  ESN    +R  H   + D + E++E  S SDLP+D        + ++FRKNPRRSSSEPLDLFEFF+ GFI SEISPAEDLIF GRLLPLNDHS 
Subjt:  MDDAGESN----RRKDHHGGDGDGEEETEECASLSDLPLD--------NSETFRKNPRRSSSEPLDLFEFFTTGFIASEISPAEDLIFRGRLLPLNDHS-

Query:  ----DKNFF------KQTGFRKRSESLSGLQSSVSRSNSAKINLKRNSRSLDYRKLYRQSDSIFSPTVEFDRKFPIKSGFKADPMNKKPSSKPRWYFLMF
            DK+F+      KQT FRKRSESLSGLQSSVSRSNSAK NLKRNSRSLDYR+LYRQ++SIFSPT E DR   IK+G K D +NKK +SKPRWY LMF
Subjt:  ----DKNFF------KQTGFRKRSESLSGLQSSVSRSNSAKINLKRNSRSLDYRKLYRQSDSIFSPTVEFDRKFPIKSGFKADPMNKKPSSKPRWYFLMF

Query:  GMVKFPAEMELSDIKSRQVRRRPSAVFTANESKGKFPCNRSSSEAAWRILRALSCKNHVSVDVRASLTA
        GMVKFPAEMELSDIKSRQVRR  S +F +NE+K KF C RSS EA WRILRALSCKNH SVDV ASLTA
Subjt:  GMVKFPAEMELSDIKSRQVRRRPSAVFTANESKGKFPCNRSSSEAAWRILRALSCKNHVSVDVRASLTA

A0A6J1E1L2 uncharacterized protein LOC1114299531.1e-9170.65Show/hide
Query:  MFMDDAGE-----SNRRKDHHGGDGDGEEETEECASLSDLPLDNS-------ETFRKNPRRSSSEPLDLFEFFTTGFIASEISPAEDLIFRGRLLPLNDH
        MFMDD  E     S  R+DHH  +   E E EE  S SDLPLD         E+FRKNPRRSSSEPLDLFEFF+ GFI SEISPAEDLIF GRLLPLNDH
Subjt:  MFMDDAGE-----SNRRKDHHGGDGDGEEETEECASLSDLPLDNS-------ETFRKNPRRSSSEPLDLFEFFTTGFIASEISPAEDLIFRGRLLPLNDH

Query:  S----------DKNFF-----KQTGFRKRSESLSGLQSSVSRSNSAKINLKRNSRSLDYRKLYRQSDSIFSPTVEFDRKFPIKSGFKADPMNKKPSSKPR
        S          +K+F+     KQT FRKRSESLSGLQSSVSRSN+AKINLKRNSRSLDYRKLYRQ++SIFSPT E DR   IK+G K DP+NKK SSKPR
Subjt:  S----------DKNFF-----KQTGFRKRSESLSGLQSSVSRSNSAKINLKRNSRSLDYRKLYRQSDSIFSPTVEFDRKFPIKSGFKADPMNKKPSSKPR

Query:  WYFLMFGMVKFPAEMELSDIKSRQVRRRPSAVFTANESKGKFPC-NRSSSEAAWRILRALSCKNHVSVDVRASLTA
        WY LMFGMVKFPAEM+LSDIKSRQVRR  SA+F ANESKGK+ C NRSS EA WRILRALSCKN+ SVDV ASLTA
Subjt:  WYFLMFGMVKFPAEMELSDIKSRQVRRRPSAVFTANESKGKFPC-NRSSSEAAWRILRALSCKNHVSVDVRASLTA

A0A6J1JAL3 uncharacterized protein LOC1114850632.8e-9070.29Show/hide
Query:  MFMDDAGE-----SNRRKDHHGGDGDGEEETEECASLSDLPLDNS-------ETFRKNPRRSSSEPLDLFEFFTTGFIASEISPAEDLIFRGRLLPLNDH
        MFMDD  E     S  R+DHH  +   E E EE  S SDLPLD         E+FRKNPRRSSSEPLDLFEFF+ GFI SEISPAEDLIF GRLLPLND 
Subjt:  MFMDDAGE-----SNRRKDHHGGDGDGEEETEECASLSDLPLDNS-------ETFRKNPRRSSSEPLDLFEFFTTGFIASEISPAEDLIFRGRLLPLNDH

Query:  S----------DKNFF-----KQTGFRKRSESLSGLQSSVSRSNSAKINLKRNSRSLDYRKLYRQSDSIFSPTVEFDRKFPIKSGFKADPMNKKPSSKPR
        S          DK+F+     KQT FRKRSESLSGLQSSVSRSN+AKINLKRNSRSLDYRKLYRQ++SIFSPT E DR   IK+G K DP+NKK SSKPR
Subjt:  S----------DKNFF-----KQTGFRKRSESLSGLQSSVSRSNSAKINLKRNSRSLDYRKLYRQSDSIFSPTVEFDRKFPIKSGFKADPMNKKPSSKPR

Query:  WYFLMFGMVKFPAEMELSDIKSRQVRRRPSAVFTANESKGKFPC-NRSSSEAAWRILRALSCKNHVSVDVRASLTA
        WY LMFGMVKFPAEM+LSDIKSRQVRR  SA+F A+ESKGK+ C NRSS EA WRILRALSCKN+ SVDV ASLTA
Subjt:  WYFLMFGMVKFPAEMELSDIKSRQVRRRPSAVFTANESKGKFPC-NRSSSEAAWRILRALSCKNHVSVDVRASLTA

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G30230.1 unknown protein3.9e-2838.84Show/hide
Query:  EEETEECASLSDLPLDNSETFRKNPRRSSSE-----PLDLFEFFTTGFIASEISPAEDLIFRGRLLPLNDHSDKNFFK-----QTGFRKRSESLSGLQS-
        EEE E+  SL DLPL       KNP  +++E       +LFEF T+   + +++PAE++IF G+L+PLN      FF          R RSESLS +Q  
Subjt:  EEETEECASLSDLPLDNSETFRKNPRRSSSE-----PLDLFEFFTTGFIASEISPAEDLIFRGRLLPLNDHSDKNFFK-----QTGFRKRSESLSGLQS-

Query:  SVSRSNSAKINLKRN------SRSLDYRKLYRQSDSIFSPTVEFDRKFPIKSGFKADPMN--KKPSSKPRWYFLMFGMVKFPAEMELSDIKSRQVRRR-P
         ++R  S  +  + N      SRSLDYRKL R   ++ SP    +     K+  K +  +     S +PRWY +MFGMVKFP E+EL DIKSRQ+RR  P
Subjt:  SVSRSNSAKINLKRN------SRSLDYRKLYRQSDSIFSPTVEFDRKFPIKSGFKADPMN--KKPSSKPRWYFLMFGMVKFPAEMELSDIKSRQVRRR-P

Query:  SAVFTANESKGKFPCNRSSSEAAWRILRALSCKNHVSVDVRA
          +F +  ++        S   +WR L ALSCK   SV   A
Subjt:  SAVFTANESKGKFPCNRSSSEAAWRILRALSCKNHVSVDVRA

AT5G67350.1 unknown protein2.1e-0529.54Show/hide
Query:  EEETEECASLSDLPLDNSETFRKNPRRSSSEPLDLFEF-----FTTGF----IASEISPAEDLIFRGRLLPLNDHS---DKNFFKQTGFRKRSESLSGLQ
        EEE EE  SL DLP +  E  R   +    E    FEF     F  G      A E+S A++L F+GR+LPL  HS   D    + T    RSES+   +
Subjt:  EEETEECASLSDLPLDNSETFRKNPRRSSSEPLDLFEF-----FTTGF----IASEISPAEDLIFRGRLLPLNDHS---DKNFFKQTGFRKRSESLSGLQ

Query:  SSVSRSNSAKINLKRNSRSLDYRKLYRQSDSIFSPTVEFDRKFPIKSGFKADPMNKKPSSKPRWYFLMFGMVKFPAEMELSDIKSR---QVRRRPSAVFT
        + + RS+      K  +  +DY        S  SP  +  R   + +   +    + P S   W FL  G+V+ P E+EL          V R  S   T
Subjt:  SSVSRSNSAKINLKRNSRSLDYRKLYRQSDSIFSPTVEFDRKFPIKSGFKADPMNKKPSSKPRWYFLMFGMVKFPAEMELSDIKSR---QVRRRPSAVFT

Query:  ANESKG-KFPCNRSSSEAAWRILRALSCKNHVSVDVR
        +  S   K     S S    R      CK  VS + +
Subjt:  ANESKG-KFPCNRSSSEAAWRILRALSCKNHVSVDVR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCATAATTCCAAACACAGCCTTTTCCTTTATCTTCCCTTCCACACAATGTTCATGGACGACGCCGGCGAATCCAATCGGAGAAAAGATCATCACGGCGGCGACGGCGA
CGGCGAAGAAGAAACAGAGGAGTGCGCTTCTCTCTCCGATCTTCCGCTCGACAATTCGGAAACCTTCCGCAAGAATCCACGCAGATCCTCCTCCGAGCCGCTCGATCTGT
TCGAGTTCTTCACCACTGGATTCATCGCCTCTGAAATTTCTCCGGCCGAGGATTTGATCTTCCGCGGCAGATTGCTTCCTCTCAACGATCACTCCGACAAGAATTTCTTC
AAACAGACTGGATTTCGAAAACGATCCGAGTCGCTGTCCGGATTGCAGAGCTCTGTTTCTCGATCCAACAGTGCGAAGATCAATCTCAAGCGGAATAGCCGATCGCTCGA
TTACCGCAAGCTATATCGCCAATCGGACTCGATTTTCTCTCCGACGGTTGAATTCGATCGAAAATTTCCGATCAAGAGCGGATTCAAGGCGGATCCGATGAACAAAAAGC
CGTCGTCGAAGCCGCGGTGGTACTTTCTAATGTTCGGAATGGTGAAATTTCCGGCGGAGATGGAACTCAGCGACATTAAGAGCAGACAAGTCCGCCGCCGTCCGTCGGCA
GTTTTTACGGCGAATGAGAGTAAAGGTAAGTTTCCGTGTAACCGGAGCTCCAGCGAGGCGGCCTGGAGAATTCTCCGGGCGCTGAGCTGCAAGAACCACGTTAGTGTAGA
TGTAAGGGCGTCGTTAACCGCCTGA
mRNA sequenceShow/hide mRNA sequence
CAATTTCGCACCCAAAATCCAATATTAAAAAAATAAAAAATAAAAGCGTTGATATCATTGTCTGCATATAAATATAATTCTGATGCATAATTCCAAACACAGCCTTTTCC
TTTATCTTCCCTTCCACACAATGTTCATGGACGACGCCGGCGAATCCAATCGGAGAAAAGATCATCACGGCGGCGACGGCGACGGCGAAGAAGAAACAGAGGAGTGCGCT
TCTCTCTCCGATCTTCCGCTCGACAATTCGGAAACCTTCCGCAAGAATCCACGCAGATCCTCCTCCGAGCCGCTCGATCTGTTCGAGTTCTTCACCACTGGATTCATCGC
CTCTGAAATTTCTCCGGCCGAGGATTTGATCTTCCGCGGCAGATTGCTTCCTCTCAACGATCACTCCGACAAGAATTTCTTCAAACAGACTGGATTTCGAAAACGATCCG
AGTCGCTGTCCGGATTGCAGAGCTCTGTTTCTCGATCCAACAGTGCGAAGATCAATCTCAAGCGGAATAGCCGATCGCTCGATTACCGCAAGCTATATCGCCAATCGGAC
TCGATTTTCTCTCCGACGGTTGAATTCGATCGAAAATTTCCGATCAAGAGCGGATTCAAGGCGGATCCGATGAACAAAAAGCCGTCGTCGAAGCCGCGGTGGTACTTTCT
AATGTTCGGAATGGTGAAATTTCCGGCGGAGATGGAACTCAGCGACATTAAGAGCAGACAAGTCCGCCGCCGTCCGTCGGCAGTTTTTACGGCGAATGAGAGTAAAGGTA
AGTTTCCGTGTAACCGGAGCTCCAGCGAGGCGGCCTGGAGAATTCTCCGGGCGCTGAGCTGCAAGAACCACGTTAGTGTAGATGTAAGGGCGTCGTTAACCGCCTGAGTT
TGTCACGTGCATTGCACGTGACAGGTGGTTGCATTTGTTACGATATTGCCCTCTCGTGATGCCGTGAAGTTCCATTACCGTGAGAAGGTCATTTTGGGAATTTATGAATT
AAAATGAAAAACGTCATTATGATTCACGTCGAAGGGACGGTACTGTACAGTAGCGGGGCGTGACCGATGATGATGGTGCGGTTTCTTTTAAATCTTTTTTTTTTTTTTTT
TAATTTTTGTTTTAATTTGAAAGAATTGTTTGGTTTGTTGGTGACAAAGAG
Protein sequenceShow/hide protein sequence
MHNSKHSLFLYLPFHTMFMDDAGESNRRKDHHGGDGDGEEETEECASLSDLPLDNSETFRKNPRRSSSEPLDLFEFFTTGFIASEISPAEDLIFRGRLLPLNDHSDKNFF
KQTGFRKRSESLSGLQSSVSRSNSAKINLKRNSRSLDYRKLYRQSDSIFSPTVEFDRKFPIKSGFKADPMNKKPSSKPRWYFLMFGMVKFPAEMELSDIKSRQVRRRPSA
VFTANESKGKFPCNRSSSEAAWRILRALSCKNHVSVDVRASLTA