; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

IVF0005882 (gene) of Melon (IVF77) v1 genome

Gene IDIVF0005882
OrganismCucumis melo ssp. agrestis cv. IVF77 (Melon (IVF77) v1)
DescriptionTesmin/TSO1-like CXC domain-containing protein
Genome locationchr07:23710918..23713183
RNA-Seq ExpressionIVF0005882
SyntenyIVF0005882
Gene Ontology termsGO:0110165 - cellular anatomical structure (cellular component)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
InterPro domainsIPR005172 - CRC domain
IPR033467 - Tesmin/TSO1-like CXC domain
IPR044522 - CRC domain-containing protein TSO1-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAB4101286.1 unnamed protein product [Lactuca saligna]1.27e-1640.32Show/hide
Query:  GCKCKNSKCIKLYCDCFASEMFCDENFL---------YVDTVVAAKKKLRTKKPSAFDRE-----------NVGESST------ERKGCNCKNSSCKRII
        GCKCK SKC+KLYCDCFA E +C E+           Y DTV  A++K++ + P AFD +           N G +         RKGC CKNS C ++ 
Subjt:  GCKCKNSKCIKLYCDCFASEMFCDENFL---------YVDTVVAAKKKLRTKKPSAFDRE-----------NVGESST------ERKGCNCKNSSCKRII

Query:  -----AGVACTEACNCQGCQNPCG
             A V CT  C C+GC+N  G
Subjt:  -----AGVACTEACNCQGCQNPCG

KAF8088985.1 hypothetical protein N665_0522s0014 [Sinapis alba]3.33e-1635.92Show/hide
Query:  SGSTKRKRKKSTVENKRKSSPGGCKCKNSKCIKLYCDCFASEMFCDENF---------LYVDTVVAAKKKLRTKKPSAF-------------DRENVGES
        S S K+KR+KS    + +SS   C CK SKC+KLYC+CFA+ ++C E           ++ DTV+A +K++ ++ P AF             D      S
Subjt:  SGSTKRKRKKSTVENKRKSSPGGCKCKNSKCIKLYCDCFASEMFCDENF---------LYVDTVVAAKKKLRTKKPSAF-------------DRENVGES

Query:  STERKGCNCKNSSCKRII-----AGVACTEACNCQGCQNPCG
        +  ++GCNCK S+C +       +GV C+  C C+GC+NP G
Subjt:  STERKGCNCKNSSCKRII-----AGVACTEACNCQGCQNPCG

KGN52869.1 hypothetical protein Csa_014688 [Cucumis sativus]3.12e-9467.69Show/hide
Query:  MTSN-NREAGIGDDHFDLSNFHETPVLFADPT-IEINSDIGYQNWNDEVRYLNFQSNVEIENLNNMEMVSVGVSANEINNDG-TSGSTKRKRKKSTVENK
        MTSN NREAGIGDDH ++SNFHE+PV FADP+ IEINSDI Y NWND++ YLNFQSN EI+N N+ E+VS+GVS NEINNDG +SGSTKRKRKKS VEN+
Subjt:  MTSN-NREAGIGDDHFDLSNFHETPVLFADPT-IEINSDIGYQNWNDEVRYLNFQSNVEIENLNNMEMVSVGVSANEINNDG-TSGSTKRKRKKSTVENK

Query:  RKSSPGGCKCKNSKCIKLYCDCFASEMFCDE----------NFLYVDTVVAAKKKLRTKKPSAFDRENVGE-------SSTERKGCNCKNSSCKR-----
          SSP  C CK S+C+KLYC+CFAS  FC+E          N  Y+DTVVAAK+KL+TKK SAFDRENV E       SSTER+GCNCKNS C++     
Subjt:  RKSSPGGCKCKNSKCIKLYCDCFASEMFCDE----------NFLYVDTVVAAKKKLRTKKPSAFDRENVGE-------SSTERKGCNCKNSSCKR-----

Query:  IIAGVACTEACNCQGCQNPCGTACAGNLN
          AGVACTEACNCQGCQNPCGTAC GN N
Subjt:  IIAGVACTEACNCQGCQNPCGTACAGNLN

XP_008454271.1 PREDICTED: protein tesmin/TSO1-like CXC 2 [Cucumis melo]2.88e-2838.43Show/hide
Query:  MTSNNREAGIGDDHFDLSNFHETPVLFADPTIEINSDIGYQNWNDEVRYLNFQSNVEIENLNNMEMVSVGV-SANEINNDGT-SGSTKRKRKKSTVENKR
         ++ N+EA IGD+H  +SNFH+T  + A  ++E N  + Y N N +          E  N+ + +  +  + S N++N + T + +TKRK  K+ VE+ +
Subjt:  MTSNNREAGIGDDHFDLSNFHETPVLFADPTIEINSDIGYQNWNDEVRYLNFQSNVEIENLNNMEMVSVGV-SANEINNDGT-SGSTKRKRKKSTVENKR

Query:  KSSPGGCKC--KNSKCIKLYCDCFASEMFCDENF-----------LYVDTVVAAKKKLRTKKPSAFDRENVGESSTERKGCNCKNSSCKRII-----AGV
        K+SP  C C  KN++C+ L C+CFA EMFC E             LY DTV  AK+++R+  PSAFD+  V      R  CNC  S+C++I      AG+
Subjt:  KSSPGGCKC--KNSKCIKLYCDCFASEMFCDENF-----------LYVDTVVAAKKKLRTKKPSAFDRENVGESSTERKGCNCKNSSCKRII-----AGV

Query:  ACTEACNCQGCQNPCG
         CTEAC+CQ C NP G
Subjt:  ACTEACNCQGCQNPCG

XP_038878069.1 protein tesmin/TSO1-like CXC 2 [Benincasa hispida]4.44e-4442.92Show/hide
Query:  SNNREAGIGDDHFDLSNFHETPVLFADPTIEINSDIGYQNWNDEVRYLNFQSNVEIENLNN---MEMVSVGVSANEINNDGTSGSTKRKRKKSTVENKRK
        SN+REA IG DHF++S FHET  + A+  +E N  + YQNW D+V YLNF+S  +  N NN      +S  VS + +NN  +S   KRKR+KS VE + K
Subjt:  SNNREAGIGDDHFDLSNFHETPVLFADPTIEINSDIGYQNWNDEVRYLNFQSNVEIENLNN---MEMVSVGVSANEINNDGTSGSTKRKRKKSTVENKRK

Query:  SSPGG-CKCKNSKCIKLYCDCFASEMFCDE---------NFLYVDTVVAAKKKLRTKKPSAFDRENVGE-----SSTERKGCNCKNSSCKR-----IIAG
           G  C C  S+C+ L C+CF S M+C E         N LY DTV   K+++ +  PSAF  + V E     SS+ER+GCNC +S C++       AG
Subjt:  SSPGG-CKCKNSKCIKLYCDCFASEMFCDE---------NFLYVDTVVAAKKKLRTKKPSAFDRENVGE-----SSTERKGCNCKNSSCKR-----IIAG

Query:  VACTEACNCQGCQNPCG----TACAGNLNSSIVLHEIVKN
        V CTEAC+CQ CQNPCG     + AG+ + + +  E+  N
Subjt:  VACTEACNCQGCQNPCG----TACAGNLNSSIVLHEIVKN

TrEMBL top hitse value%identityAlignment
A0A0A0KVD1 CRC domain-containing protein3.8e-7367.69Show/hide
Query:  MTSN-NREAGIGDDHFDLSNFHETPVLFADPT-IEINSDIGYQNWNDEVRYLNFQSNVEIENLNNMEMVSVGVSANEINNDG-TSGSTKRKRKKSTVENK
        MTSN NREAGIGDDH ++SNFHE+PV FADP+ IEINSDI Y NWND++ YLNFQSN EI+N N+ E+VS+GVS NEINNDG +SGSTKRKRKKS VEN+
Subjt:  MTSN-NREAGIGDDHFDLSNFHETPVLFADPT-IEINSDIGYQNWNDEVRYLNFQSNVEIENLNNMEMVSVGVSANEINNDG-TSGSTKRKRKKSTVENK

Query:  RKSSPGGCKCKNSKCIKLYCDCFASEMFCDE----------NFLYVDTVVAAKKKLRTKKPSAFDRENVGE-------SSTERKGCNCKNSSCKR-----
          SSP  C CK S+C+KLYC+CFAS  FC+E          N  Y+DTVVAAK+KL+TKK SAFDRENV E       SSTER+GCNCKNS C++     
Subjt:  RKSSPGGCKCKNSKCIKLYCDCFASEMFCDE----------NFLYVDTVVAAKKKLRTKKPSAFDRENVGE-------SSTERKGCNCKNSSCKR-----

Query:  IIAGVACTEACNCQGCQNPCGTACAGNLN
          AGVACTEACNCQGCQNPCGTAC GN N
Subjt:  IIAGVACTEACNCQGCQNPCGTACAGNLN

A0A1S3BY78 protein tesmin/TSO1-like CXC 21.4e-2239.07Show/hide
Query:  TSNNREAGIGDDHFDLSNFHETPVLFADPTIEINSDIGYQNWNDEVRYLNFQSNVEIENL-NNMEMVSVGVSANEINNDGT-SGSTKRKRKKSTVENKRK
        ++ N+EA IGD+H  +SNFH+T  + A  ++E N  + Y N N +          E  N+ ++ +  S   S N++N + T + +TKRK  K+ VE+ +K
Subjt:  TSNNREAGIGDDHFDLSNFHETPVLFADPTIEINSDIGYQNWNDEVRYLNFQSNVEIENL-NNMEMVSVGVSANEINNDGT-SGSTKRKRKKSTVENKRK

Query:  SSPGGCKC--KNSKCIKLYCDCFASEMFCDENF-----------LYVDTVVAAKKKLRTKKPSAFDRENVGESSTERKGCNCKNSSCKRI-----IAGVA
        +SP  C C  KN++C+ L C+CFA EMFC E             LY DTV  AK+++R+  PSAFD+  V      R  CNC  S+C++I      AG+ 
Subjt:  SSPGGCKC--KNSKCIKLYCDCFASEMFCDENF-----------LYVDTVVAAKKKLRTKKPSAFDRENVGESSTERKGCNCKNSSCKRI-----IAGVA

Query:  CTEACNCQGCQNPCG
        CTEAC+CQ C NP G
Subjt:  CTEACNCQGCQNPCG

A0A4Y7L837 CRC domain-containing protein3.8e-1232.76Show/hide
Query:  SANEINNDGTSGSTKRKRKKSTVENKRKSSPGGCKCKNSKCIKLYCDCFASEMFCDENF---------LYVDTVVAAKKKLRTKKPSAF------DRENV
        +A  I+ +   G+ K+K+     ++        C CK SKC+KLYC+CFA+ ++C E+           + DTV+A +K++ ++ P AF      + ++V
Subjt:  SANEINNDGTSGSTKRKRKKSTVENKRKSSPGGCKCKNSKCIKLYCDCFASEMFCDENF---------LYVDTVVAAKKKLRTKKPSAF------DRENV

Query:  GE--------SSTERKGCNCKNSSC-KRII----AGVACTEACNCQGCQNPCGTACAGNLNSSIVLHEIVKNDK
         E        S+  +KGCNCK S C KR       GV CT  C CQGCQN  G   +  L   + LH+    +K
Subjt:  GE--------SSTERKGCNCKNSSC-KRII----AGVACTEACNCQGCQNPCGTACAGNLNSSIVLHEIVKNDK

A0A6S7NJR0 CRC domain-containing protein1.5e-1340.32Show/hide
Query:  GCKCKNSKCIKLYCDCFASEMFCDEN---------FLYVDTVVAAKKKLRTKKPSAFD-----------RENVGESST------ERKGCNCKNSSCKRII
        GCKCK SKC+KLYCDCFA E +C E+           Y DTV  A++K++ + P AFD             N G +         RKGC CKNS C ++ 
Subjt:  GCKCKNSKCIKLYCDCFASEMFCDEN---------FLYVDTVVAAKKKLRTKKPSAFD-----------RENVGESST------ERKGCNCKNSSCKRII

Query:  -----AGVACTEACNCQGCQNPCG
             A V CT  C C+GC+N  G
Subjt:  -----AGVACTEACNCQGCQNPCG

A0A7I4ESU3 CRC domain-containing protein9.9e-1334.25Show/hide
Query:  STKRKRKKSTVENKRKSSPGGCK---CKNSKCIKLYCDCFASEMFCDENFL---------YVDTVVAAKKKLRTKKPSAFDRE----------------N
        S KR+R+KSTV ++ + S  GCK   CK SKC+KLYC+CFA+ ++C  +           Y++TV+  ++++ ++ P AF  +                +
Subjt:  STKRKRKKSTVENKRKSSPGGCK---CKNSKCIKLYCDCFASEMFCDENFL---------YVDTVVAAKKKLRTKKPSAFDRE----------------N

Query:  VGESSTERKGCNCKNSSCKR-----IIAGVACTEACNCQGCQNPCG
           S+  ++GCNCK S C +       AGV C+E C C+GC N  G
Subjt:  VGESSTERKGCNCKNSSCKR-----IIAGVACTEACNCQGCQNPCG

SwissProt top hitse value%identityAlignment
A1Z9E2 Protein lin-54 homolog4.4e-1036.79Show/hide
Query:  CKCKNSKCIKLYCDCFASEMFCD-----ENFLYVDTVVAAKKKLRT---KKPSAFDRE----NVGESSTERKGCNCKNSSCKR-----IIAGVACTEACN
        C C  S+C+KLYCDCFA+  FC      + F  +D  V  ++ +R+   + PSAF  +    N G+     KGCNCK S C +       A + C+  C 
Subjt:  CKCKNSKCIKLYCDCFASEMFCD-----ENFLYVDTVVAAKKKLRT---KKPSAFDRE----NVGESSTERKGCNCKNSSCKR-----IIAGVACTEACN

Query:  CQGCQN
        C GC+N
Subjt:  CQGCQN

F4JIF5 Protein tesmin/TSO1-like CXC 28.1e-1230.94Show/hide
Query:  KRKKSTVENKRKSSPGGCKCKNSKCIKLYCDCFASEMFCDE---------NFLYVDTVVAAKKKLRTKKPSAF----------------DRENVGESSTE
        K+K+  +++    S   C CK SKC+KLYC+CFA+ ++C E           ++ D V+A +K++ ++ P AF                D      S+  
Subjt:  KRKKSTVENKRKSSPGGCKCKNSKCIKLYCDCFASEMFCDE---------NFLYVDTVVAAKKKLRTKKPSAF----------------DRENVGESSTE

Query:  RKGCNCKNSSCKR-----IIAGVACTEACNCQGCQNPCG
        ++GCNCK S+C +        GV C+  C C+GC+N  G
Subjt:  RKGCNCKNSSCKR-----IIAGVACTEACNCQGCQNPCG

Q84JZ8 Protein tesmin/TSO1-like CXC 43.1e-1130.53Show/hide
Query:  CKCKNSKCIKLYCDCFASEMFCDE---------NFLYVDTVVAAKKKLRTKKPSAFDRENVGESST----------------ERKGCNCKNSSCKR----
        CKC+ S+C+KLYC+CF++ +FC E           ++ D V+ +++ ++ + P AF  + V  S T                 ++GCNC+ S C +    
Subjt:  CKCKNSKCIKLYCDCFASEMFCDE---------NFLYVDTVVAAKKKLRTKKPSAFDRENVGESST----------------ERKGCNCKNSSCKR----

Query:  -IIAGVACTEACNCQGCQNPCG---TACAGN
          + GV C+  C C GC+N  G     CAG+
Subjt:  -IIAGVACTEACNCQGCQNPCG---TACAGN

Q8L548 Protein tesmin/TSO1-like CXC 31.5e-1333.78Show/hide
Query:  DGTSGSTKRKRKKSTVENKRKSSPGGCKCKNSKCIKLYCDCFASEMFCDE---------NFLYVDTVVAAKKKLRTKKPSAF----------------DR
        D T  S K+KR+KS    +  SS   C CK SKC+KLYC+CFA+  +C E           ++ D V+A +K++ ++ P AF                D 
Subjt:  DGTSGSTKRKRKKSTVENKRKSSPGGCKCKNSKCIKLYCDCFASEMFCDE---------NFLYVDTVVAAKKKLRTKKPSAF----------------DR

Query:  ENVGESSTERKGCNCKNSSCKR-----IIAGVACTEACNCQGCQNPCG
             S+  ++GCNCK S+C +        GV C+  C C+GC+N  G
Subjt:  ENVGESSTERKGCNCKNSSCKR-----IIAGVACTEACNCQGCQNPCG

Q9LUI3 CRC domain-containing protein TSO12.1e-1233.33Show/hide
Query:  GSTKRKRKKSTVENKRKSSPGGCKCKNSKCIKLYCDCFASEMFCDENFLYVD---------TVVAAKKKLRTKKPSAF----------------DRENVG
        GS K+K +KS    + +S    C CK SKC+KLYC+CFA+ ++C E    +D         TV+A +K++ ++ P AF                D     
Subjt:  GSTKRKRKKSTVENKRKSSPGGCKCKNSKCIKLYCDCFASEMFCDENFLYVD---------TVVAAKKKLRTKKPSAF----------------DRENVG

Query:  ESSTERKGCNCKNSSCKR-----IIAGVACTEACNCQGCQNPCG
         S+  ++GCNCK S+C +        GV C+  C C+GC N  G
Subjt:  ESSTERKGCNCKNSSCKR-----IIAGVACTEACNCQGCQNPCG

Arabidopsis top hitse value%identityAlignment
AT2G20110.2 Tesmin/TSO1-like CXC domain-containing protein1.7e-0933.05Show/hide
Query:  CKCKNSKCIKLYCDCFASEMFCD-----ENFLYVDTVVAAKKKLRT---KKPSAF-------------DRENVGE---SSTERKGCNCKNSSCKR-----
        C CK+S+C+KLYC+CFAS  +CD       F  V+   A ++ + +   + P+AF             +RE VG+    +   KGC+CK S C +     
Subjt:  CKCKNSKCIKLYCDCFASEMFCD-----ENFLYVDTVVAAKKKLRT---KKPSAF-------------DRENVGE---SSTERKGCNCKNSSCKR-----

Query:  IIAGVACTEACNCQGCQN
          A + C+E C C  C+N
Subjt:  IIAGVACTEACNCQGCQN

AT3G04850.1 Tesmin/TSO1-like CXC domain-containing protein2.2e-1230.53Show/hide
Query:  CKCKNSKCIKLYCDCFASEMFCDE---------NFLYVDTVVAAKKKLRTKKPSAFDRENVGESST----------------ERKGCNCKNSSCKR----
        CKC+ S+C+KLYC+CF++ +FC E           ++ D V+ +++ ++ + P AF  + V  S T                 ++GCNC+ S C +    
Subjt:  CKCKNSKCIKLYCDCFASEMFCDE---------NFLYVDTVVAAKKKLRTKKPSAFDRENVGESST----------------ERKGCNCKNSSCKR----

Query:  -IIAGVACTEACNCQGCQNPCG---TACAGN
          + GV C+  C C GC+N  G     CAG+
Subjt:  -IIAGVACTEACNCQGCQNPCG---TACAGN

AT3G22760.1 Tesmin/TSO1-like CXC domain-containing protein1.0e-1433.78Show/hide
Query:  DGTSGSTKRKRKKSTVENKRKSSPGGCKCKNSKCIKLYCDCFASEMFCDE---------NFLYVDTVVAAKKKLRTKKPSAF----------------DR
        D T  S K+KR+KS    +  SS   C CK SKC+KLYC+CFA+  +C E           ++ D V+A +K++ ++ P AF                D 
Subjt:  DGTSGSTKRKRKKSTVENKRKSSPGGCKCKNSKCIKLYCDCFASEMFCDE---------NFLYVDTVVAAKKKLRTKKPSAF----------------DR

Query:  ENVGESSTERKGCNCKNSSCKR-----IIAGVACTEACNCQGCQNPCG
             S+  ++GCNCK S+C +        GV C+  C C+GC+N  G
Subjt:  ENVGESSTERKGCNCKNSSCKR-----IIAGVACTEACNCQGCQNPCG

AT3G22780.1 Tesmin/TSO1-like CXC domain-containing protein1.5e-1333.33Show/hide
Query:  GSTKRKRKKSTVENKRKSSPGGCKCKNSKCIKLYCDCFASEMFCDENFLYVD---------TVVAAKKKLRTKKPSAF----------------DRENVG
        GS K+K +KS    + +S    C CK SKC+KLYC+CFA+ ++C E    +D         TV+A +K++ ++ P AF                D     
Subjt:  GSTKRKRKKSTVENKRKSSPGGCKCKNSKCIKLYCDCFASEMFCDENFLYVD---------TVVAAKKKLRTKKPSAF----------------DRENVG

Query:  ESSTERKGCNCKNSSCKR-----IIAGVACTEACNCQGCQNPCG
         S+  ++GCNCK S+C +        GV C+  C C+GC N  G
Subjt:  ESSTERKGCNCKNSSCKR-----IIAGVACTEACNCQGCQNPCG

AT4G14770.1 TESMIN/TSO1-like CXC 25.8e-1330.94Show/hide
Query:  KRKKSTVENKRKSSPGGCKCKNSKCIKLYCDCFASEMFCDE---------NFLYVDTVVAAKKKLRTKKPSAF----------------DRENVGESSTE
        K+K+  +++    S   C CK SKC+KLYC+CFA+ ++C E           ++ D V+A +K++ ++ P AF                D      S+  
Subjt:  KRKKSTVENKRKSSPGGCKCKNSKCIKLYCDCFASEMFCDE---------NFLYVDTVVAAKKKLRTKKPSAF----------------DRENVGESSTE

Query:  RKGCNCKNSSCKR-----IIAGVACTEACNCQGCQNPCG
        ++GCNCK S+C +        GV C+  C C+GC+N  G
Subjt:  RKGCNCKNSSCKR-----IIAGVACTEACNCQGCQNPCG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACGAGCAACAATAGAGAAGCTGGAATTGGTGATGATCACTTCGACTTGAGCAATTTCCACGAGACTCCAGTTCTTTTTGCTGATCCGACTATCGAGATAAATAGCGA
TATTGGATATCAAAACTGGAATGATGAGGTTCGATACTTAAATTTTCAGAGCAATGTGGAAATTGAAAATCTTAATAATATGGAAATGGTGTCTGTAGGTGTTTCCGCTA
ATGAGATAAACAATGACGGCACTAGTGGTAGCACCAAAAGAAAGAGGAAGAAATCAACCGTGGAAAATAAGAGGAAATCCAGCCCAGGTGGTTGCAAATGCAAAAACAGC
AAGTGCATAAAACTATATTGCGACTGCTTTGCGAGTGAAATGTTTTGCGACGAAAACTTTCTGTACGTAGACACTGTTGTTGCTGCTAAAAAAAAACTTAGAACCAAGAA
ACCCTCGGCCTTTGATAGAGAAAATGTAGGAGAGTCATCTACAGAAAGAAAAGGATGCAACTGCAAGAACTCATCGTGTAAAAGAATTATTGCAGGAGTTGCTTGCACGG
AAGCATGTAATTGTCAAGGTTGTCAAAATCCTTGCGGCACAGCCTGTGCAGGTAATTTGAATAGCAGTATTGTTTTACATGAAATCGTCAAAAATGACAAATTTACTGTA
CGTTATAAATATTTAATTTATTTTATTATATTTAAAAATGTTCATTGTTTTATTTTTTTTAAGGGTTTTAAGTTTAATTTTTTATAA
mRNA sequenceShow/hide mRNA sequence
ATGACGAGCAACAATAGAGAAGCTGGAATTGGTGATGATCACTTCGACTTGAGCAATTTCCACGAGACTCCAGTTCTTTTTGCTGATCCGACTATCGAGATAAATAGCGA
TATTGGATATCAAAACTGGAATGATGAGGTTCGATACTTAAATTTTCAGAGCAATGTGGAAATTGAAAATCTTAATAATATGGAAATGGTGTCTGTAGGTGTTTCCGCTA
ATGAGATAAACAATGACGGCACTAGTGGTAGCACCAAAAGAAAGAGGAAGAAATCAACCGTGGAAAATAAGAGGAAATCCAGCCCAGGTGGTTGCAAATGCAAAAACAGC
AAGTGCATAAAACTATATTGCGACTGCTTTGCGAGTGAAATGTTTTGCGACGAAAACTTTCTGTACGTAGACACTGTTGTTGCTGCTAAAAAAAAACTTAGAACCAAGAA
ACCCTCGGCCTTTGATAGAGAAAATGTAGGAGAGTCATCTACAGAAAGAAAAGGATGCAACTGCAAGAACTCATCGTGTAAAAGAATTATTGCAGGAGTTGCTTGCACGG
AAGCATGTAATTGTCAAGGTTGTCAAAATCCTTGCGGCACAGCCTGTGCAGGTAATTTGAATAGCAGTATTGTTTTACATGAAATCGTCAAAAATGACAAATTTACTGTA
CGTTATAAATATTTAATTTATTTTATTATATTTAAAAATGTTCATTGTTTTATTTTTTTTAAGGGTTTTAAGTTTAATTTTTTATAA
Protein sequenceShow/hide protein sequence
MTSNNREAGIGDDHFDLSNFHETPVLFADPTIEINSDIGYQNWNDEVRYLNFQSNVEIENLNNMEMVSVGVSANEINNDGTSGSTKRKRKKSTVENKRKSSPGGCKCKNS
KCIKLYCDCFASEMFCDENFLYVDTVVAAKKKLRTKKPSAFDRENVGESSTERKGCNCKNSSCKRIIAGVACTEACNCQGCQNPCGTACAGNLNSSIVLHEIVKNDKFTV
RYKYLIYFIIFKNVHCFIFFKGFKFNFL