; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI06G22510 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI06G22510
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionProtein of unknown function DUF455
Genome locationChr6:20367836..20369727
RNA-Seq ExpressionCSPI06G22510
SyntenyCSPI06G22510
Gene Ontology termsNA
InterPro domainsIPR007402 - Protein of unknown function DUF455


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004140115.1 uncharacterized protein LOC101207410 isoform X3 [Cucumis sativus]2.0e-13899.2Show/hide
Query:  MMQRLQLKALHLWPTLRSSSSPHLHSQTLNLSSSSSLQYTPWSGLKAWKQSPLNENRFWGPNGPEPLLESSSTGVFFDSRIESASSLAELGALVLSTSDP
        M+QRLQLKALHLWPTLRSSSSPHLHSQTLNLSSSSSLQYTPWSGLKAWKQSPLNENRFWGPNGPEPLLESSSTGVFFDSRIESASSLAELGALVLSTSDP
Subjt:  MMQRLQLKALHLWPTLRSSSSPHLHSQTLNLSSSSSLQYTPWSGLKAWKQSPLNENRFWGPNGPEPLLESSSTGVFFDSRIESASSLAELGALVLSTSDP

Query:  LTKSKLSHLAYSRWSQEGLPIGVFEVPSHPARPSLPKLVSPKEIPAPKNSGLPLNAYMLHNLAHVELNAIDLAWDTVVRFSLFSDVLGEGFFADFAHVAD
        LTKSKLSHLAYSRWSQEGLPIGVFE PSHPARPSLPKLVSPKEIPAPKNSGLPLNAYMLHNLAHVELNAIDLAWDTVVRFSLFSDVLGEGFFADFAHVAD
Subjt:  LTKSKLSHLAYSRWSQEGLPIGVFEVPSHPARPSLPKLVSPKEIPAPKNSGLPLNAYMLHNLAHVELNAIDLAWDTVVRFSLFSDVLGEGFFADFAHVAD

Query:  DESRHFMWCSQRLAELGFKYGDMAAHNLLWRECEKSSNNVAARLAAIPLVQ
        DESRHFMWCSQRLAELGFKYGDMAAHNLLWRECEKSSNNVAARLAAIPLVQ
Subjt:  DESRHFMWCSQRLAELGFKYGDMAAHNLLWRECEKSSNNVAARLAAIPLVQ

XP_008449389.1 PREDICTED: uncharacterized protein HI_0077 [Cucumis melo]1.1e-13195.24Show/hide
Query:  MMQRLQLKALHLWPTLRSSSSPHLHSQTLNL-SSSSSLQYTPWSGLKAWKQSPLNENRFWGPNGPEPLLESSSTGVFFDSRIESASSLAELGALVLSTSD
        M+QRLQLKALHLWPTLRSSS  H HSQTLN+ SSSSS QYTPWSGLKAWKQSPLNENRFWGPNGPEPL+ESSSTGVFFDSRIESASSLAELGALVLSTSD
Subjt:  MMQRLQLKALHLWPTLRSSSSPHLHSQTLNL-SSSSSLQYTPWSGLKAWKQSPLNENRFWGPNGPEPLLESSSTGVFFDSRIESASSLAELGALVLSTSD

Query:  PLTKSKLSHLAYSRWSQEGLPIGVFEVPSHPARPSLPKLVSPKEIPAPKNSGLPLNAYMLHNLAHVELNAIDLAWDTVVRFSLFSDVLGEGFFADFAHVA
        PLTKS+LSHLAYSRWSQE LPIGVFE PSHPARP LPKLVSPKEIPAPKNSGLPLNAYMLHNLAHVELNAIDLAWDTVVRFSLFSDVLGEGFFADFAHVA
Subjt:  PLTKSKLSHLAYSRWSQEGLPIGVFEVPSHPARPSLPKLVSPKEIPAPKNSGLPLNAYMLHNLAHVELNAIDLAWDTVVRFSLFSDVLGEGFFADFAHVA

Query:  DDESRHFMWCSQRLAELGFKYGDMAAHNLLWRECEKSSNNVAARLAAIPLVQ
        DDESRHFMWCSQRLAELGFKYGDMAAHNLLWRECEKSSNNVAARLAAIPLVQ
Subjt:  DDESRHFMWCSQRLAELGFKYGDMAAHNLLWRECEKSSNNVAARLAAIPLVQ

XP_011657545.1 uncharacterized protein LOC101207410 isoform X1 [Cucumis sativus]2.0e-13899.2Show/hide
Query:  MMQRLQLKALHLWPTLRSSSSPHLHSQTLNLSSSSSLQYTPWSGLKAWKQSPLNENRFWGPNGPEPLLESSSTGVFFDSRIESASSLAELGALVLSTSDP
        M+QRLQLKALHLWPTLRSSSSPHLHSQTLNLSSSSSLQYTPWSGLKAWKQSPLNENRFWGPNGPEPLLESSSTGVFFDSRIESASSLAELGALVLSTSDP
Subjt:  MMQRLQLKALHLWPTLRSSSSPHLHSQTLNLSSSSSLQYTPWSGLKAWKQSPLNENRFWGPNGPEPLLESSSTGVFFDSRIESASSLAELGALVLSTSDP

Query:  LTKSKLSHLAYSRWSQEGLPIGVFEVPSHPARPSLPKLVSPKEIPAPKNSGLPLNAYMLHNLAHVELNAIDLAWDTVVRFSLFSDVLGEGFFADFAHVAD
        LTKSKLSHLAYSRWSQEGLPIGVFE PSHPARPSLPKLVSPKEIPAPKNSGLPLNAYMLHNLAHVELNAIDLAWDTVVRFSLFSDVLGEGFFADFAHVAD
Subjt:  LTKSKLSHLAYSRWSQEGLPIGVFEVPSHPARPSLPKLVSPKEIPAPKNSGLPLNAYMLHNLAHVELNAIDLAWDTVVRFSLFSDVLGEGFFADFAHVAD

Query:  DESRHFMWCSQRLAELGFKYGDMAAHNLLWRECEKSSNNVAARLAAIPLVQ
        DESRHFMWCSQRLAELGFKYGDMAAHNLLWRECEKSSNNVAARLAAIPLVQ
Subjt:  DESRHFMWCSQRLAELGFKYGDMAAHNLLWRECEKSSNNVAARLAAIPLVQ

XP_031743610.1 uncharacterized protein LOC101207410 isoform X2 [Cucumis sativus]2.0e-13899.2Show/hide
Query:  MMQRLQLKALHLWPTLRSSSSPHLHSQTLNLSSSSSLQYTPWSGLKAWKQSPLNENRFWGPNGPEPLLESSSTGVFFDSRIESASSLAELGALVLSTSDP
        M+QRLQLKALHLWPTLRSSSSPHLHSQTLNLSSSSSLQYTPWSGLKAWKQSPLNENRFWGPNGPEPLLESSSTGVFFDSRIESASSLAELGALVLSTSDP
Subjt:  MMQRLQLKALHLWPTLRSSSSPHLHSQTLNLSSSSSLQYTPWSGLKAWKQSPLNENRFWGPNGPEPLLESSSTGVFFDSRIESASSLAELGALVLSTSDP

Query:  LTKSKLSHLAYSRWSQEGLPIGVFEVPSHPARPSLPKLVSPKEIPAPKNSGLPLNAYMLHNLAHVELNAIDLAWDTVVRFSLFSDVLGEGFFADFAHVAD
        LTKSKLSHLAYSRWSQEGLPIGVFE PSHPARPSLPKLVSPKEIPAPKNSGLPLNAYMLHNLAHVELNAIDLAWDTVVRFSLFSDVLGEGFFADFAHVAD
Subjt:  LTKSKLSHLAYSRWSQEGLPIGVFEVPSHPARPSLPKLVSPKEIPAPKNSGLPLNAYMLHNLAHVELNAIDLAWDTVVRFSLFSDVLGEGFFADFAHVAD

Query:  DESRHFMWCSQRLAELGFKYGDMAAHNLLWRECEKSSNNVAARLAAIPLVQ
        DESRHFMWCSQRLAELGFKYGDMAAHNLLWRECEKSSNNVAARLAAIPLVQ
Subjt:  DESRHFMWCSQRLAELGFKYGDMAAHNLLWRECEKSSNNVAARLAAIPLVQ

XP_038887628.1 uncharacterized protein HI_0077 [Benincasa hispida]2.8e-12490.44Show/hide
Query:  MMQRLQLKALHLWPTLRSSSSPHLHSQTLNLSSSSSLQYTPWSGLKAWKQSPLNENRFWGPNGPEPLLESSSTGVFFDSRIESASSLAELGALVLSTSDP
        MMQRL+LK+L LWPTLRSSS PHLHSQTL L +SSSLQYTPWSG+KAW+QSPLNENRFWGPNGPEPL ESSSTG  FDSRIESASSLAELGALVLSTSDP
Subjt:  MMQRLQLKALHLWPTLRSSSSPHLHSQTLNLSSSSSLQYTPWSGLKAWKQSPLNENRFWGPNGPEPLLESSSTGVFFDSRIESASSLAELGALVLSTSDP

Query:  LTKSKLSHLAYSRWSQEGLPIGVFEVPSHPARPSLPKLVSPKEIPAPKNSGLPLNAYMLHNLAHVELNAIDLAWDTVVRFSLFSDVLGEGFFADFAHVAD
        LTKS LSHLAYSRWSQE LPIGVF+ PS PARP +PKLVSPKEIPAPKN+GLPLNAYMLHNLAHVELNAIDLAWDTVVRFS FS+VLGEGFFADFAHVAD
Subjt:  LTKSKLSHLAYSRWSQEGLPIGVFEVPSHPARPSLPKLVSPKEIPAPKNSGLPLNAYMLHNLAHVELNAIDLAWDTVVRFSLFSDVLGEGFFADFAHVAD

Query:  DESRHFMWCSQRLAELGFKYGDMAAHNLLWRECEKSSNNVAARLAAIPLVQ
        DESRHFMWCSQRLAELGFKYGDMAAHNLLWRECEKSS+NVAARLA IPLVQ
Subjt:  DESRHFMWCSQRLAELGFKYGDMAAHNLLWRECEKSSNNVAARLAAIPLVQ

TrEMBL top hitse value%identityAlignment
A0A0A0KHU6 Uncharacterized protein1.1e-14799.25Show/hide
Query:  MMQRLQLKALHLWPTLRSSSSPHLHSQTLNLSSSSSLQYTPWSGLKAWKQSPLNENRFWGPNGPEPLLESSSTGVFFDSRIESASSLAELGALVLSTSDP
        M+QRLQLKALHLWPTLRSSSSPHLHSQTLNLSSSSSLQYTPWSGLKAWKQSPLNENRFWGPNGPEPLLESSSTGVFFDSRIESASSLAELGALVLSTSDP
Subjt:  MMQRLQLKALHLWPTLRSSSSPHLHSQTLNLSSSSSLQYTPWSGLKAWKQSPLNENRFWGPNGPEPLLESSSTGVFFDSRIESASSLAELGALVLSTSDP

Query:  LTKSKLSHLAYSRWSQEGLPIGVFEVPSHPARPSLPKLVSPKEIPAPKNSGLPLNAYMLHNLAHVELNAIDLAWDTVVRFSLFSDVLGEGFFADFAHVAD
        LTKSKLSHLAYSRWSQEGLPIGVFE PSHPARPSLPKLVSPKEIPAPKNSGLPLNAYMLHNLAHVELNAIDLAWDTVVRFSLFSDVLGEGFFADFAHVAD
Subjt:  LTKSKLSHLAYSRWSQEGLPIGVFEVPSHPARPSLPKLVSPKEIPAPKNSGLPLNAYMLHNLAHVELNAIDLAWDTVVRFSLFSDVLGEGFFADFAHVAD

Query:  DESRHFMWCSQRLAELGFKYGDMAAHNLLWRECEKSSNNVAARLAAIPLVQACFAYSSSTPYVFD
        DESRHFMWCSQRLAELGFKYGDMAAHNLLWRECEKSSNNVAARLAAIPLVQACFAYSSSTPYVFD
Subjt:  DESRHFMWCSQRLAELGFKYGDMAAHNLLWRECEKSSNNVAARLAAIPLVQACFAYSSSTPYVFD

A0A1S3BMU7 uncharacterized protein HI_00775.1e-13295.24Show/hide
Query:  MMQRLQLKALHLWPTLRSSSSPHLHSQTLNL-SSSSSLQYTPWSGLKAWKQSPLNENRFWGPNGPEPLLESSSTGVFFDSRIESASSLAELGALVLSTSD
        M+QRLQLKALHLWPTLRSSS  H HSQTLN+ SSSSS QYTPWSGLKAWKQSPLNENRFWGPNGPEPL+ESSSTGVFFDSRIESASSLAELGALVLSTSD
Subjt:  MMQRLQLKALHLWPTLRSSSSPHLHSQTLNL-SSSSSLQYTPWSGLKAWKQSPLNENRFWGPNGPEPLLESSSTGVFFDSRIESASSLAELGALVLSTSD

Query:  PLTKSKLSHLAYSRWSQEGLPIGVFEVPSHPARPSLPKLVSPKEIPAPKNSGLPLNAYMLHNLAHVELNAIDLAWDTVVRFSLFSDVLGEGFFADFAHVA
        PLTKS+LSHLAYSRWSQE LPIGVFE PSHPARP LPKLVSPKEIPAPKNSGLPLNAYMLHNLAHVELNAIDLAWDTVVRFSLFSDVLGEGFFADFAHVA
Subjt:  PLTKSKLSHLAYSRWSQEGLPIGVFEVPSHPARPSLPKLVSPKEIPAPKNSGLPLNAYMLHNLAHVELNAIDLAWDTVVRFSLFSDVLGEGFFADFAHVA

Query:  DDESRHFMWCSQRLAELGFKYGDMAAHNLLWRECEKSSNNVAARLAAIPLVQ
        DDESRHFMWCSQRLAELGFKYGDMAAHNLLWRECEKSSNNVAARLAAIPLVQ
Subjt:  DDESRHFMWCSQRLAELGFKYGDMAAHNLLWRECEKSSNNVAARLAAIPLVQ

A0A5D3E2D9 DUF455 domain-containing protein5.1e-13295.24Show/hide
Query:  MMQRLQLKALHLWPTLRSSSSPHLHSQTLNL-SSSSSLQYTPWSGLKAWKQSPLNENRFWGPNGPEPLLESSSTGVFFDSRIESASSLAELGALVLSTSD
        M+QRLQLKALHLWPTLRSSS  H HSQTLN+ SSSSS QYTPWSGLKAWKQSPLNENRFWGPNGPEPL+ESSSTGVFFDSRIESASSLAELGALVLSTSD
Subjt:  MMQRLQLKALHLWPTLRSSSSPHLHSQTLNL-SSSSSLQYTPWSGLKAWKQSPLNENRFWGPNGPEPLLESSSTGVFFDSRIESASSLAELGALVLSTSD

Query:  PLTKSKLSHLAYSRWSQEGLPIGVFEVPSHPARPSLPKLVSPKEIPAPKNSGLPLNAYMLHNLAHVELNAIDLAWDTVVRFSLFSDVLGEGFFADFAHVA
        PLTKS+LSHLAYSRWSQE LPIGVFE PSHPARP LPKLVSPKEIPAPKNSGLPLNAYMLHNLAHVELNAIDLAWDTVVRFSLFSDVLGEGFFADFAHVA
Subjt:  PLTKSKLSHLAYSRWSQEGLPIGVFEVPSHPARPSLPKLVSPKEIPAPKNSGLPLNAYMLHNLAHVELNAIDLAWDTVVRFSLFSDVLGEGFFADFAHVA

Query:  DDESRHFMWCSQRLAELGFKYGDMAAHNLLWRECEKSSNNVAARLAAIPLVQ
        DDESRHFMWCSQRLAELGFKYGDMAAHNLLWRECEKSSNNVAARLAAIPLVQ
Subjt:  DDESRHFMWCSQRLAELGFKYGDMAAHNLLWRECEKSSNNVAARLAAIPLVQ

A0A6J1EXG4 uncharacterized protein LOC1114370703.0e-11683.66Show/hide
Query:  MQRLQLKALHLWPTLRSSSSPHLHSQTLNL-------SSSSSLQYTPWSGLKAWKQSPLNENRFWGPNGPEPLLESSSTGVFFDSRIESASSLAELGALV
        MQRLQLKAL +WPT R S   ++HSQ++ L       SSSSSL+YTPWSGLKAW+QSP+NENRFWG NGPE L+ESSS G  FDSRIESASSLAELGALV
Subjt:  MQRLQLKALHLWPTLRSSSSPHLHSQTLNL-------SSSSSLQYTPWSGLKAWKQSPLNENRFWGPNGPEPLLESSSTGVFFDSRIESASSLAELGALV

Query:  LSTSDPLTKSKLSHLAYSRWSQEGLPIGVFEVPSHPARPSLPKLVSPKEIPAPKNSGLPLNAYMLHNLAHVELNAIDLAWDTVVRFSLFSDVLGEGFFAD
        LSTSDPL KS+LSHLAYSRWS E LPIGVFE P  PARP  PKLVSP+EIPAPKNSGLPLNAYMLHNLAHVELNAIDLAWDTVVRFS FS+VLGEGFFAD
Subjt:  LSTSDPLTKSKLSHLAYSRWSQEGLPIGVFEVPSHPARPSLPKLVSPKEIPAPKNSGLPLNAYMLHNLAHVELNAIDLAWDTVVRFSLFSDVLGEGFFAD

Query:  FAHVADDESRHFMWCSQRLAELGFKYGDMAAHNLLWRECEKSSNNVAARLAAIPLVQ
        FAHVADDESRHF WCSQRLAELGFKYGDMAAHNLLWRECEKSS+NVAARLAAIPLVQ
Subjt:  FAHVADDESRHFMWCSQRLAELGFKYGDMAAHNLLWRECEKSSNNVAARLAAIPLVQ

A0A6J1JES7 uncharacterized protein LOC1114862572.3e-11684.58Show/hide
Query:  MQRLQLKALHLWPTLRSSSSPHLHSQTLNL---SSSSSLQYTPWSGLKAWKQSPLNENRFWGPNGPEPLLESSSTGVFFDSRIESASSLAELGALVLSTS
        MQRLQLKAL +WPT R S   ++HSQ++ L   SSSSSL+YTPWSGLKAW+QSP+NENRFWG NGPE L+ESSS G  FDSRIESASSLAELGALVLSTS
Subjt:  MQRLQLKALHLWPTLRSSSSPHLHSQTLNL---SSSSSLQYTPWSGLKAWKQSPLNENRFWGPNGPEPLLESSSTGVFFDSRIESASSLAELGALVLSTS

Query:  DPLTKSKLSHLAYSRWSQEGLPIGVFEVPSHPARPSLPKLVSPKEIPAPKNSGLPLNAYMLHNLAHVELNAIDLAWDTVVRFSLFSDVLGEGFFADFAHV
        DPL KS+LSHLAYSRWS E LPIGVFE P  PARP  PKLVSP+EIPAPKNSGLPLNAYMLHNLAHVELNAIDLAWDTVVRFS FS++LGEGFFADFAHV
Subjt:  DPLTKSKLSHLAYSRWSQEGLPIGVFEVPSHPARPSLPKLVSPKEIPAPKNSGLPLNAYMLHNLAHVELNAIDLAWDTVVRFSLFSDVLGEGFFADFAHV

Query:  ADDESRHFMWCSQRLAELGFKYGDMAAHNLLWRECEKSSNNVAARLAAIPLVQ
        ADDESRHF WCSQRLAELGFKYGDMAAHNLLWRECEKSS+NVAARLAAIPLVQ
Subjt:  ADDESRHFMWCSQRLAELGFKYGDMAAHNLLWRECEKSSNNVAARLAAIPLVQ

SwissProt top hitse value%identityAlignment
P43935 Uncharacterized protein HI_00776.4e-1533.33Show/hide
Query:  LSTSDPLTKSKLSHLAYSRWSQEGLPIGVFEVP------SHPARPSLPKLVSPKEIPAPKNSGLPLNAYMLHNLAHVELNAIDLAWDTVVRFSL-FSDVL
        L T++P  K +L +  Y     +   I + + P      +  A P  P LV+PK++P    +     A  LH +AH+E NAI+L  D   RF     + L
Subjt:  LSTSDPLTKSKLSHLAYSRWSQEGLPIGVFEVP------SHPARPSLPKLVSPKEIPAPKNSGLPLNAYMLHNLAHVELNAIDLAWDTVVRFSL-FSDVL

Query:  GEG--FFADFAHVADDESRHFMWCSQRLAELGFKYGDMAAHNLLWRECEKSSNNVAARLAAIPLV
        GEG  F  D+  VA +ES HF   ++ L  LG++YGD  AH  LW   + +++++  R+A +P V
Subjt:  GEG--FFADFAHVADDESRHFMWCSQRLAELGFKYGDMAAHNLLWRECEKSSNNVAARLAAIPLV

Arabidopsis top hitse value%identityAlignment
AT1G06240.1 Protein of unknown function DUF4551.6e-8265.31Show/hide
Query:  HLWPTLRSSSSPH-LHSQTLNLSSSSSLQYTPWSGLKAWKQSPLNENRFWGPNGPEPLLESSSTGVFFD--SRIESASSLAELGALVLSTSDPLTKSKLS
        HL P   +S SP  L   T   SSS+  Q+  WSGL+ W++SP+N+ R WGP G  PLL SSS  +  D    + +ASSLA+LGALVLSTSDPL+KS +S
Subjt:  HLWPTLRSSSSPH-LHSQTLNLSSSSSLQYTPWSGLKAWKQSPLNENRFWGPNGPEPLLESSSTGVFFD--SRIESASSLAELGALVLSTSDPLTKSKLS

Query:  HLAYSRWSQEGLPIG-VFEVPSHPARPSLPKLVSPKEIPAPKNSGLPLNAYMLHNLAHVELNAIDLAWDTVVRFSLFSDVLGEGFFADFAHVADDESRHF
        HLA+SRW +E LP+G +  +PS PARP  P LV+  ++P PK+S LPLNA+MLHNLAHVELNAIDLAWDTV RFS F D+LG  FF DFAHVADDESRHF
Subjt:  HLAYSRWSQEGLPIG-VFEVPSHPARPSLPKLVSPKEIPAPKNSGLPLNAYMLHNLAHVELNAIDLAWDTVVRFSLFSDVLGEGFFADFAHVADDESRHF

Query:  MWCSQRLAELGFKYGDMAAHNLLWRECEKSSNNVAARLAAIPLVQ
        +WCSQRLAELGFKYGD+ A+NLL RECEK+SNNVAARLA IPLVQ
Subjt:  MWCSQRLAELGFKYGDMAAHNLLWRECEKSSNNVAARLAAIPLVQ

AT5G04520.1 Protein of unknown function DUF4558.6e-2340.12Show/hide
Query:  SLAELGALVLSTSDPLTKSKLSHLAYSRWSQEGL-----PIGVFEVPSHPARPSLP-KLVSPKEIPAPKNSG-LPLNAYMLHNLAHVELNAIDLAWDTVV
        +L E    +L+TSDP  K++L      +W Q  +     P   F VP  PAR  LP KLVSP  +P    +G L     ++H+LAH E  AIDL+WD + 
Subjt:  SLAELGALVLSTSDPLTKSKLSHLAYSRWSQEGL-----PIGVFEVPSHPARPSLP-KLVSPKEIPAPKNSG-LPLNAYMLHNLAHVELNAIDLAWDTVV

Query:  RFSLFSDVLGEGFFADFAHVADDESRHFMWCSQRLAELGFKYGDMAAHNLLWRECEKSSNNVAARLA
        RF    + +   FF DF  VA DE RHF   + RL E+G  YG + AH+ LW     +S+++ ARLA
Subjt:  RFSLFSDVLGEGFFADFAHVADDESRHFMWCSQRLAELGFKYGDMAAHNLLWRECEKSSNNVAARLA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATGCAGCGTCTTCAACTCAAGGCTCTACATTTATGGCCTACTCTTCGCTCTTCGTCCTCTCCCCATCTCCATTCTCAAACCCTCAACCTAAGTTCCTCTTCTTCTCT
TCAATACACGCCTTGGTCTGGTCTCAAAGCTTGGAAACAGAGTCCCCTTAATGAGAATCGATTCTGGGGACCCAATGGACCAGAGCCTCTGCTTGAATCTTCATCAACTG
GGGTTTTCTTTGATAGCCGAATCGAATCGGCTTCGTCTCTTGCGGAATTGGGTGCATTGGTTCTCTCTACAAGTGACCCTTTAACCAAATCCAAACTCTCTCATCTTGCT
TACTCCAGATGGTCTCAGGAAGGTCTTCCCATTGGCGTTTTCGAAGTTCCTTCTCATCCTGCTCGACCTTCGTTGCCGAAATTGGTCTCTCCAAAGGAAATTCCGGCTCC
TAAAAACTCAGGATTACCTCTGAATGCTTATATGTTACACAATCTTGCTCATGTTGAGCTTAATGCAATTGATTTGGCATGGGACACTGTCGTTCGATTTTCTCTCTTCA
GTGATGTTCTTGGGGAGGGTTTTTTTGCTGACTTTGCTCATGTTGCTGATGATGAGAGTCGCCATTTTATGTGGTGTTCACAGAGACTTGCTGAACTTGGTTTCAAATAT
GGAGATATGGCTGCTCATAATTTGCTTTGGAGGGAGTGTGAGAAATCATCCAACAATGTAGCTGCACGCTTGGCAGCAATACCGCTTGTCCAGGCTTGCTTCGCTTATTC
AAGCTCCACCCCTTATGTTTTTGATTAG
mRNA sequenceShow/hide mRNA sequence
AAGGGATTGAATGTCACTTTCCACAAAAGTTTAGCTCAAGGAAGATTAGAAGTTGAAACGAACTTAGCTCAAAGCTACAAAGTTGGTTTCTCCATCTCAATTTGGCCATA
GATATGAGCATGACCACAGAAATTTTGGCGCTGAAGGAAGAATGATGCAGCGTCTTCAACTCAAGGCTCTACATTTATGGCCTACTCTTCGCTCTTCGTCCTCTCCCCAT
CTCCATTCTCAAACCCTCAACCTAAGTTCCTCTTCTTCTCTTCAATACACGCCTTGGTCTGGTCTCAAAGCTTGGAAACAGAGTCCCCTTAATGAGAATCGATTCTGGGG
ACCCAATGGACCAGAGCCTCTGCTTGAATCTTCATCAACTGGGGTTTTCTTTGATAGCCGAATCGAATCGGCTTCGTCTCTTGCGGAATTGGGTGCATTGGTTCTCTCTA
CAAGTGACCCTTTAACCAAATCCAAACTCTCTCATCTTGCTTACTCCAGATGGTCTCAGGAAGGTCTTCCCATTGGCGTTTTCGAAGTTCCTTCTCATCCTGCTCGACCT
TCGTTGCCGAAATTGGTCTCTCCAAAGGAAATTCCGGCTCCTAAAAACTCAGGATTACCTCTGAATGCTTATATGTTACACAATCTTGCTCATGTTGAGCTTAATGCAAT
TGATTTGGCATGGGACACTGTCGTTCGATTTTCTCTCTTCAGTGATGTTCTTGGGGAGGGTTTTTTTGCTGACTTTGCTCATGTTGCTGATGATGAGAGTCGCCATTTTA
TGTGGTGTTCACAGAGACTTGCTGAACTTGGTTTCAAATATGGAGATATGGCTGCTCATAATTTGCTTTGGAGGGAGTGTGAGAAATCATCCAACAATGTAGCTGCACGC
TTGGCAGCAATACCGCTTGTCCAGGCTTGCTTCGCTTATTCAAGCTCCACCCCTTATGTTTTTGATTAG
Protein sequenceShow/hide protein sequence
MMQRLQLKALHLWPTLRSSSSPHLHSQTLNLSSSSSLQYTPWSGLKAWKQSPLNENRFWGPNGPEPLLESSSTGVFFDSRIESASSLAELGALVLSTSDPLTKSKLSHLA
YSRWSQEGLPIGVFEVPSHPARPSLPKLVSPKEIPAPKNSGLPLNAYMLHNLAHVELNAIDLAWDTVVRFSLFSDVLGEGFFADFAHVADDESRHFMWCSQRLAELGFKY
GDMAAHNLLWRECEKSSNNVAARLAAIPLVQACFAYSSSTPYVFD