; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI04G12400 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI04G12400
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionUnknown protein
Genome locationChr4:10844194..10853125
RNA-Seq ExpressionCSPI04G12400
SyntenyCSPI04G12400
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004149633.1 uncharacterized protein LOC101215314 isoform X1 [Cucumis sativus]3.1e-13099.59Show/hide
Query:  MLCNSLRDRLRPWLRDYDRLQSFAVILIYIQIGCALIGSLGALYNGVLLINLAIALFALVAIESSSQSLGRTYAALLFSAIFLDVFWFILFAYDTWNISS
        MLCNSLRDRLRPWLRDYDRLQSFAVILIYIQIGCALIGSLGALYNGVLLINLAIALFALVAIESSSQSLGRTYA LLFSAIFLDVFWFILFAYDTWNISS
Subjt:  MLCNSLRDRLRPWLRDYDRLQSFAVILIYIQIGCALIGSLGALYNGVLLINLAIALFALVAIESSSQSLGRTYAALLFSAIFLDVFWFILFAYDTWNISS

Query:  KQYGPLFTFSVKLTLAMQIIGFSVRLSSSLLWIQIYRLGISYMETSVPREADYDLRNSFLSPATPVVVRQPSGSDDMIGGSIYDPTYYSSLFEDGQDSKC
        KQYGPLFTFSVKLTLAMQIIGFSVRLSSSLLWIQIYRLGISYMETSVPREADYDLRNSFLSPATPVVVRQPSGSDDMIGGSIYDPTYYSSLFEDGQDSKC
Subjt:  KQYGPLFTFSVKLTLAMQIIGFSVRLSSSLLWIQIYRLGISYMETSVPREADYDLRNSFLSPATPVVVRQPSGSDDMIGGSIYDPTYYSSLFEDGQDSKC

Query:  LSGISHFGNGDNGSTSGPDVSRSKLSRHFQVADDEYADGHQQTV
        LSGISHFGNGDNGSTSGPDVSRSKLSRHFQVADDEYADGHQQTV
Subjt:  LSGISHFGNGDNGSTSGPDVSRSKLSRHFQVADDEYADGHQQTV

XP_008449916.1 PREDICTED: uncharacterized protein LOC103491645 isoform X4 [Cucumis melo]2.9e-12898.77Show/hide
Query:  MLCNSLRDRLRPWLRDYDRLQSFAVILIYIQIGCALIGSLGALYNGVLLINLAIALFALVAIESSSQSLGRTYAALLFSAIFLDVFWFILFAYDTWNISS
        MLCNSLRDRLRP LRDYDRLQSFAVILIYIQIGCALIGSLGALYNGVLLINLAIALFALVAIESSSQSLGRTYA LLFSAIFLDVFWFILFAYDTWNISS
Subjt:  MLCNSLRDRLRPWLRDYDRLQSFAVILIYIQIGCALIGSLGALYNGVLLINLAIALFALVAIESSSQSLGRTYAALLFSAIFLDVFWFILFAYDTWNISS

Query:  KQYGPLFTFSVKLTLAMQIIGFSVRLSSSLLWIQIYRLGISYMETSVPREADYDLRNSFLSPATPVVVRQPSGSDDMIGGSIYDPTYYSSLFEDGQDSKC
        +QYGPLFTFSVKLTLAMQIIGFSVRLSSSLLWIQIYRLGISYMETSVPREADYDLRNSFLSPATPVVVRQPSGSDDMIGGSIYDPTYYSSLFEDGQDSKC
Subjt:  KQYGPLFTFSVKLTLAMQIIGFSVRLSSSLLWIQIYRLGISYMETSVPREADYDLRNSFLSPATPVVVRQPSGSDDMIGGSIYDPTYYSSLFEDGQDSKC

Query:  LSGISHFGNGDNGSTSGPDVSRSKLSRHFQVADDEYADGHQQTV
        LSGISHFGNGDNGSTSGPDVSRSKLSRHFQVADDEYADGHQQTV
Subjt:  LSGISHFGNGDNGSTSGPDVSRSKLSRHFQVADDEYADGHQQTV

XP_011653507.1 uncharacterized protein LOC101215314 isoform X2 [Cucumis sativus]2.8e-12399.57Show/hide
Query:  MLCNSLRDRLRPWLRDYDRLQSFAVILIYIQIGCALIGSLGALYNGVLLINLAIALFALVAIESSSQSLGRTYAALLFSAIFLDVFWFILFAYDTWNISS
        MLCNSLRDRLRPWLRDYDRLQSFAVILIYIQIGCALIGSLGALYNGVLLINLAIALFALVAIESSSQSLGRTYA LLFSAIFLDVFWFILFAYDTWNISS
Subjt:  MLCNSLRDRLRPWLRDYDRLQSFAVILIYIQIGCALIGSLGALYNGVLLINLAIALFALVAIESSSQSLGRTYAALLFSAIFLDVFWFILFAYDTWNISS

Query:  KQYGPLFTFSVKLTLAMQIIGFSVRLSSSLLWIQIYRLGISYMETSVPREADYDLRNSFLSPATPVVVRQPSGSDDMIGGSIYDPTYYSSLFEDGQDSKC
        KQYGPLFTFSVKLTLAMQIIGFSVRLSSSLLWIQIYRLGISYMETSVPREADYDLRNSFLSPATPVVVRQPSGSDDMIGGSIYDPTYYSSLFEDGQDSKC
Subjt:  KQYGPLFTFSVKLTLAMQIIGFSVRLSSSLLWIQIYRLGISYMETSVPREADYDLRNSFLSPATPVVVRQPSGSDDMIGGSIYDPTYYSSLFEDGQDSKC

Query:  LSGISHFGNGDNGSTSGPDVSRSKLSRHFQVAD
        LSGISHFGNGDNGSTSGPDVSRSKLSRHFQVAD
Subjt:  LSGISHFGNGDNGSTSGPDVSRSKLSRHFQVAD

XP_038901807.1 uncharacterized protein LOC120088513 isoform X1 [Benincasa hispida]2.8e-12394.05Show/hide
Query:  MLCNSLRDRLRPWLRDYDRLQSFAVILIYIQIGCALIGSLGALYNGVLLINLAIALFALVAIESSSQSLGRTYAALLFSAIFLDVFWFILFAYDTWNISS
        MLCNSLRDRLRPWLRDYDRLQSFAVILIYIQIGCALIGSLGALYNGVLLINLAIALFALVAIESSSQSLGRTYA LLFSAIFLDVFWFILFAYDTWN SS
Subjt:  MLCNSLRDRLRPWLRDYDRLQSFAVILIYIQIGCALIGSLGALYNGVLLINLAIALFALVAIESSSQSLGRTYAALLFSAIFLDVFWFILFAYDTWNISS

Query:  KQYGPLFTFSVKLTLAMQIIGFSVRLSSSLLWIQIYRLGISYMETSVPREADYDLRNSFLSPATPVVVRQPSGSDDMIGGSIYDPTYYSSLFEDGQDSKC
        +QYG LFTFSVKLTLAMQIIGFSVRLSSSLLWIQIYRLGISYMETSVPREADYDLRNSFLSPATPVVVRQPSGSDDMIGGSIYDP+YYSSLF+DGQDSKC
Subjt:  KQYGPLFTFSVKLTLAMQIIGFSVRLSSSLLWIQIYRLGISYMETSVPREADYDLRNSFLSPATPVVVRQPSGSDDMIGGSIYDPTYYSSLFEDGQDSKC

Query:  LSG-------ISHFGNGDNGSTSGPDVSRSKLSRHFQVADDEYADGHQ-QTV
        LSG       ISHFGNGDNGSTSGPDVSRSKLSRHFQVADDEYADG+Q QTV
Subjt:  LSG-------ISHFGNGDNGSTSGPDVSRSKLSRHFQVADDEYADGHQ-QTV

XP_038901808.1 uncharacterized protein LOC120088513 isoform X2 [Benincasa hispida]2.3e-12596.73Show/hide
Query:  MLCNSLRDRLRPWLRDYDRLQSFAVILIYIQIGCALIGSLGALYNGVLLINLAIALFALVAIESSSQSLGRTYAALLFSAIFLDVFWFILFAYDTWNISS
        MLCNSLRDRLRPWLRDYDRLQSFAVILIYIQIGCALIGSLGALYNGVLLINLAIALFALVAIESSSQSLGRTYA LLFSAIFLDVFWFILFAYDTWN SS
Subjt:  MLCNSLRDRLRPWLRDYDRLQSFAVILIYIQIGCALIGSLGALYNGVLLINLAIALFALVAIESSSQSLGRTYAALLFSAIFLDVFWFILFAYDTWNISS

Query:  KQYGPLFTFSVKLTLAMQIIGFSVRLSSSLLWIQIYRLGISYMETSVPREADYDLRNSFLSPATPVVVRQPSGSDDMIGGSIYDPTYYSSLFEDGQDSKC
        +QYG LFTFSVKLTLAMQIIGFSVRLSSSLLWIQIYRLGISYMETSVPREADYDLRNSFLSPATPVVVRQPSGSDDMIGGSIYDP+YYSSLF+DGQDSKC
Subjt:  KQYGPLFTFSVKLTLAMQIIGFSVRLSSSLLWIQIYRLGISYMETSVPREADYDLRNSFLSPATPVVVRQPSGSDDMIGGSIYDPTYYSSLFEDGQDSKC

Query:  LSGISHFGNGDNGSTSGPDVSRSKLSRHFQVADDEYADGHQ-QTV
        LSGISHFGNGDNGSTSGPDVSRSKLSRHFQVADDEYADG+Q QTV
Subjt:  LSGISHFGNGDNGSTSGPDVSRSKLSRHFQVADDEYADGHQ-QTV

TrEMBL top hitse value%identityAlignment
A0A0A0KWY7 Uncharacterized protein1.5e-13099.59Show/hide
Query:  MLCNSLRDRLRPWLRDYDRLQSFAVILIYIQIGCALIGSLGALYNGVLLINLAIALFALVAIESSSQSLGRTYAALLFSAIFLDVFWFILFAYDTWNISS
        MLCNSLRDRLRPWLRDYDRLQSFAVILIYIQIGCALIGSLGALYNGVLLINLAIALFALVAIESSSQSLGRTYA LLFSAIFLDVFWFILFAYDTWNISS
Subjt:  MLCNSLRDRLRPWLRDYDRLQSFAVILIYIQIGCALIGSLGALYNGVLLINLAIALFALVAIESSSQSLGRTYAALLFSAIFLDVFWFILFAYDTWNISS

Query:  KQYGPLFTFSVKLTLAMQIIGFSVRLSSSLLWIQIYRLGISYMETSVPREADYDLRNSFLSPATPVVVRQPSGSDDMIGGSIYDPTYYSSLFEDGQDSKC
        KQYGPLFTFSVKLTLAMQIIGFSVRLSSSLLWIQIYRLGISYMETSVPREADYDLRNSFLSPATPVVVRQPSGSDDMIGGSIYDPTYYSSLFEDGQDSKC
Subjt:  KQYGPLFTFSVKLTLAMQIIGFSVRLSSSLLWIQIYRLGISYMETSVPREADYDLRNSFLSPATPVVVRQPSGSDDMIGGSIYDPTYYSSLFEDGQDSKC

Query:  LSGISHFGNGDNGSTSGPDVSRSKLSRHFQVADDEYADGHQQTV
        LSGISHFGNGDNGSTSGPDVSRSKLSRHFQVADDEYADGHQQTV
Subjt:  LSGISHFGNGDNGSTSGPDVSRSKLSRHFQVADDEYADGHQQTV

A0A1S3BMI4 uncharacterized protein LOC103491645 isoform X41.4e-12898.77Show/hide
Query:  MLCNSLRDRLRPWLRDYDRLQSFAVILIYIQIGCALIGSLGALYNGVLLINLAIALFALVAIESSSQSLGRTYAALLFSAIFLDVFWFILFAYDTWNISS
        MLCNSLRDRLRP LRDYDRLQSFAVILIYIQIGCALIGSLGALYNGVLLINLAIALFALVAIESSSQSLGRTYA LLFSAIFLDVFWFILFAYDTWNISS
Subjt:  MLCNSLRDRLRPWLRDYDRLQSFAVILIYIQIGCALIGSLGALYNGVLLINLAIALFALVAIESSSQSLGRTYAALLFSAIFLDVFWFILFAYDTWNISS

Query:  KQYGPLFTFSVKLTLAMQIIGFSVRLSSSLLWIQIYRLGISYMETSVPREADYDLRNSFLSPATPVVVRQPSGSDDMIGGSIYDPTYYSSLFEDGQDSKC
        +QYGPLFTFSVKLTLAMQIIGFSVRLSSSLLWIQIYRLGISYMETSVPREADYDLRNSFLSPATPVVVRQPSGSDDMIGGSIYDPTYYSSLFEDGQDSKC
Subjt:  KQYGPLFTFSVKLTLAMQIIGFSVRLSSSLLWIQIYRLGISYMETSVPREADYDLRNSFLSPATPVVVRQPSGSDDMIGGSIYDPTYYSSLFEDGQDSKC

Query:  LSGISHFGNGDNGSTSGPDVSRSKLSRHFQVADDEYADGHQQTV
        LSGISHFGNGDNGSTSGPDVSRSKLSRHFQVADDEYADGHQQTV
Subjt:  LSGISHFGNGDNGSTSGPDVSRSKLSRHFQVADDEYADGHQQTV

A0A1S3BP27 uncharacterized protein LOC103491645 isoform X29.8e-12295.47Show/hide
Query:  MLCNSLRDRLRPWLRDYDRLQSFAVILIYIQIGCALIGSLGALYNGVLLINLAIALFALVAIESSSQSLGRTYAALLFSAIFLDVFWFILFAYDTWNISS
        MLCNSLRDRLRP LRDYDRLQSFAVILIYIQIGCALIGSLGALYNGVLLINLAIALFALVAIESSSQSLGRTYA LLFSAIFLDVFWFILFAYDTWNISS
Subjt:  MLCNSLRDRLRPWLRDYDRLQSFAVILIYIQIGCALIGSLGALYNGVLLINLAIALFALVAIESSSQSLGRTYAALLFSAIFLDVFWFILFAYDTWNISS

Query:  KQYGPLFTFSVKLTLAMQIIGFSVRLSSSLLWIQIYRLGISYMETSVPREADYDLRNSFLSPATPVVVRQPSGSDDMIGGSIYDPTYYSSLFEDGQDSKC
        +QYGPLFTFSVKLTLAMQIIGFSVRLSSSLLWIQIYRLGISYMETSVPREADYDLRNSFLSPATPVVVRQPSGSDDMIGGSIYDPTYYSSLFEDGQDSKC
Subjt:  KQYGPLFTFSVKLTLAMQIIGFSVRLSSSLLWIQIYRLGISYMETSVPREADYDLRNSFLSPATPVVVRQPSGSDDMIGGSIYDPTYYSSLFEDGQDSKC

Query:  LSGISHFGNGDNGSTSGPDVSRSKLSRHFQVADDEYADGHQQT
        LSGISHFGNGDNGSTSGPDVSRSKLSRHFQVAD +   G   T
Subjt:  LSGISHFGNGDNGSTSGPDVSRSKLSRHFQVADDEYADGHQQT

A0A1S4DXR3 uncharacterized protein LOC103491645 isoform X12.0e-12296.69Show/hide
Query:  MLCNSLRDRLRPWLRDYDRLQSFAVILIYIQIGCALIGSLGALYNGVLLINLAIALFALVAIESSSQSLGRTYAALLFSAIFLDVFWFILFAYDTWNISS
        MLCNSLRDRLRP LRDYDRLQSFAVILIYIQIGCALIGSLGALYNGVLLINLAIALFALVAIESSSQSLGRTYA LLFSAIFLDVFWFILFAYDTWNISS
Subjt:  MLCNSLRDRLRPWLRDYDRLQSFAVILIYIQIGCALIGSLGALYNGVLLINLAIALFALVAIESSSQSLGRTYAALLFSAIFLDVFWFILFAYDTWNISS

Query:  KQYGPLFTFSVKLTLAMQIIGFSVRLSSSLLWIQIYRLGISYMETSVPREADYDLRNSFLSPATPVVVRQPSGSDDMIGGSIYDPTYYSSLFEDGQDSKC
        +QYGPLFTFSVKLTLAMQIIGFSVRLSSSLLWIQIYRLGISYMETSVPREADYDLRNSFLSPATPVVVRQPSGSDDMIGGSIYDPTYYSSLFEDGQDSKC
Subjt:  KQYGPLFTFSVKLTLAMQIIGFSVRLSSSLLWIQIYRLGISYMETSVPREADYDLRNSFLSPATPVVVRQPSGSDDMIGGSIYDPTYYSSLFEDGQDSKC

Query:  LSGISHFGNGDNGSTSGPDVSRSKLSRHFQVADDEYADGHQQ
        LSGISHFGNGDNGSTSGPDVSRSKLSRHFQVAD  Y   HQQ
Subjt:  LSGISHFGNGDNGSTSGPDVSRSKLSRHFQVADDEYADGHQQ

A0A6J1CT50 uncharacterized protein LOC111014335 isoform X11.5e-12293.85Show/hide
Query:  MLCNSLRDRLRPWLRDYDRLQSFAVILIYIQIGCALIGSLGALYNGVLLINLAIALFALVAIESSSQSLGRTYAALLFSAIFLDVFWFILFAYDTWNISS
        MLCNSLRDRLRPWLRDYDRLQSFAVILIYIQIGCALIGSLGALYNGVLLINLAIALFALVAIESSSQSLGRTYA LL SAIFLD+FWFILFAYDTWNISS
Subjt:  MLCNSLRDRLRPWLRDYDRLQSFAVILIYIQIGCALIGSLGALYNGVLLINLAIALFALVAIESSSQSLGRTYAALLFSAIFLDVFWFILFAYDTWNISS

Query:  KQYGPLFTFSVKLTLAMQIIGFSVRLSSSLLWIQIYRLGISYMETSVPREADYDLRNSFLSPATPVVVRQPSGSDDMIGGSIYDPTYYSSLFEDGQDSKC
        KQYGPLF+FSVKLTLAMQIIGFS+RLSSSLLWIQIYRLGISYMETSVPREADYDLRNSFLSP TPVV RQ SGSDDMIGGSIYDP YYSSLFEDGQDSKC
Subjt:  KQYGPLFTFSVKLTLAMQIIGFSVRLSSSLLWIQIYRLGISYMETSVPREADYDLRNSFLSPATPVVVRQPSGSDDMIGGSIYDPTYYSSLFEDGQDSKC

Query:  LSGISHFGNGDNGSTSGPDVSRSKLSRHFQVADDEYADGHQQTV
        LSGISHFGNGDNGSTSG DVSRSK+SRHFQV DDE+A G+QQTV
Subjt:  LSGISHFGNGDNGSTSGPDVSRSKLSRHFQVADDEYADGHQQTV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G55535.1 unknown protein2.4e-8064.17Show/hide
Query:  MLCNSLRDRLRPWLRDYDRLQSFAVILIYIQIGCALIGSLGALYNGVLLINLAIALFALVAIESSSQSLGRTYAALLFSAIFLDVFWFILFAYDTWNISS
        MLC SLRDR+ PWLRDY +LQS AV LIY QIGCALIGSLGALYNGVLLINLAIALFALVAIES+SQSLGRTYA LLF A+ LD+ WFILF  + W+IS+
Subjt:  MLCNSLRDRLRPWLRDYDRLQSFAVILIYIQIGCALIGSLGALYNGVLLINLAIALFALVAIESSSQSLGRTYAALLFSAIFLDVFWFILFAYDTWNISS

Query:  KQYGPLFTFSVKLTLAMQIIGFSVRLSSSLLWIQIYRLGISYMETSVPREADYDLRNSFLSPATPVVVRQPSGSDDMIGGSIYDPTYYSSLFEDGQ---D
        + YG  F FSVKLT+AM++IGF VRLSSSLLW QIYRLG + ++TS+PRE D DLRNSFL+P TP + RQ SG+++++GGSIYDP YY+SLFE+ Q   +
Subjt:  KQYGPLFTFSVKLTLAMQIIGFSVRLSSSLLWIQIYRLGISYMETSVPREADYDLRNSFLSPATPVVVRQPSGSDDMIGGSIYDPTYYSSLFEDGQ---D

Query:  SKCLSGISHFGNGDNGSTSGPDVS--RSKLSRHFQVADDE
        S   + ++H+  G+NGS S  + S  +S + R     D+E
Subjt:  SKCLSGISHFGNGDNGSTSGPDVS--RSKLSRHFQVADDE

AT1G55535.2 unknown protein2.4e-8064.17Show/hide
Query:  MLCNSLRDRLRPWLRDYDRLQSFAVILIYIQIGCALIGSLGALYNGVLLINLAIALFALVAIESSSQSLGRTYAALLFSAIFLDVFWFILFAYDTWNISS
        MLC SLRDR+ PWLRDY +LQS AV LIY QIGCALIGSLGALYNGVLLINLAIALFALVAIES+SQSLGRTYA LLF A+ LD+ WFILF  + W+IS+
Subjt:  MLCNSLRDRLRPWLRDYDRLQSFAVILIYIQIGCALIGSLGALYNGVLLINLAIALFALVAIESSSQSLGRTYAALLFSAIFLDVFWFILFAYDTWNISS

Query:  KQYGPLFTFSVKLTLAMQIIGFSVRLSSSLLWIQIYRLGISYMETSVPREADYDLRNSFLSPATPVVVRQPSGSDDMIGGSIYDPTYYSSLFEDGQ---D
        + YG  F FSVKLT+AM++IGF VRLSSSLLW QIYRLG + ++TS+PRE D DLRNSFL+P TP + RQ SG+++++GGSIYDP YY+SLFE+ Q   +
Subjt:  KQYGPLFTFSVKLTLAMQIIGFSVRLSSSLLWIQIYRLGISYMETSVPREADYDLRNSFLSPATPVVVRQPSGSDDMIGGSIYDPTYYSSLFEDGQ---D

Query:  SKCLSGISHFGNGDNGSTSGPDVS--RSKLSRHFQVADDE
        S   + ++H+  G+NGS S  + S  +S + R     D+E
Subjt:  SKCLSGISHFGNGDNGSTSGPDVS--RSKLSRHFQVADDE

AT3G13420.1 unknown protein6.6e-6261.03Show/hide
Query:  MLCNSLRDRLRPWLRDYDRLQSFAVILIYIQIGCALIGSLGALYNGVLLINLAIALFALVAIESSSQSLGRTYAALLFSAIFLDVFWFILFAYDTWNISS
        MLC SLR+R+  WLRDY RLQS  +ILIY QIGCALIGSLGALYNGV+LINLAIALF LVAIES+SQSLGRTYA LLF AI LDV WFILF+ + WNISS
Subjt:  MLCNSLRDRLRPWLRDYDRLQSFAVILIYIQIGCALIGSLGALYNGVLLINLAIALFALVAIESSSQSLGRTYAALLFSAIFLDVFWFILFAYDTWNISS

Query:  KQYGPLFTFSVKLTLAMQIIGFSVRLSSSLLWIQIYRLGISYMETSVPREADYDLRNSFLSP------------------ATPVVVRQPSGSDDMIGGSI
          Y   + FSVKLTLAM+I GF VRLSSSLLW QIYRLG S +++  PR++D DLRNSFL P                    P + +Q S SD+++  SI
Subjt:  KQYGPLFTFSVKLTLAMQIIGFSVRLSSSLLWIQIYRLGISYMETSVPREADYDLRNSFLSP------------------ATPVVVRQPSGSDDMIGGSI

Query:  YDPTYYSSLFEDG
         +P  Y+ L + G
Subjt:  YDPTYYSSLFEDG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTTTGCAATTCATTGAGAGATCGACTTCGGCCATGGCTTCGTGATTATGATAGGCTTCAGTCTTTCGCCGTCATTCTCATTTATATTCAGATCGGGTGCGCATTGAT
TGGATCCCTAGGGGCATTGTACAATGGTGTTTTGCTTATAAATTTGGCAATCGCATTGTTCGCGTTGGTAGCCATAGAGAGCAGCAGTCAGAGTCTTGGTCGTACATACG
CTGCTCTCCTCTTCTCTGCCATTTTCCTCGACGTCTTCTGGTTTATTCTTTTCGCCTACGACACATGGAACATCTCATCCAAGCAATATGGACCCCTTTTTACCTTTTCA
GTGAAGCTTACTCTGGCTATGCAGATTATTGGATTTTCTGTTAGGCTATCATCTTCTCTACTTTGGATTCAAATTTACAGGTTGGGGATTTCATACATGGAAACTTCAGT
TCCCCGAGAGGCAGACTACGATCTGAGAAATAGTTTTCTTAGCCCGGCTACTCCTGTTGTAGTTAGACAACCATCAGGTTCCGATGACATGATTGGGGGCTCTATCTATG
ATCCAACTTACTACTCGTCCCTATTTGAAGATGGTCAAGATAGTAAATGTTTGTCTGGGATCTCTCATTTCGGTAATGGTGATAATGGTTCTACCTCTGGGCCAGATGTA
TCTCGATCAAAGCTGTCCAGACATTTCCAAGTAGCAGATGATGAGTATGCAGATGGACATCAGCAGACGGTTTAG
mRNA sequenceShow/hide mRNA sequence
GCAAGAAAGAAATCCAACGCTAGTTTCCAAATTCCGTTGCCACACAGTTTAAAATCTTCAACCTAAAAACTACTTAATTGCCTTTTCAGAATTGATTCTTCTTTCAATAG
ATTACTCCCTCGTCCTCGCTGCATCGCGCTTGTACTCGTCTCACTCGCTCATGAATGATTTGATCATACGGTGTTCTTGCGGTTTAGGAAAAAGAAACAAAGAAGAGAGA
AATTCGAAATCGAGCGGCCAACAGGAGCGGTTAAGAGGTTAGGACGGCTCGTTGGTGGGGTTAGGTGGATTTTTCAAATGGTATATCATGCTTTGCAATTCATTGAGAGA
TCGACTTCGGCCATGGCTTCGTGATTATGATAGGCTTCAGTCTTTCGCCGTCATTCTCATTTATATTCAGATCGGGTGCGCATTGATTGGATCCCTAGGGGCATTGTACA
ATGGTGTTTTGCTTATAAATTTGGCAATCGCATTGTTCGCGTTGGTAGCCATAGAGAGCAGCAGTCAGAGTCTTGGTCGTACATACGCTGCTCTCCTCTTCTCTGCCATT
TTCCTCGACGTCTTCTGGTTTATTCTTTTCGCCTACGACACATGGAACATCTCATCCAAGCAATATGGACCCCTTTTTACCTTTTCAGTGAAGCTTACTCTGGCTATGCA
GATTATTGGATTTTCTGTTAGGCTATCATCTTCTCTACTTTGGATTCAAATTTACAGGTTGGGGATTTCATACATGGAAACTTCAGTTCCCCGAGAGGCAGACTACGATC
TGAGAAATAGTTTTCTTAGCCCGGCTACTCCTGTTGTAGTTAGACAACCATCAGGTTCCGATGACATGATTGGGGGCTCTATCTATGATCCAACTTACTACTCGTCCCTA
TTTGAAGATGGTCAAGATAGTAAATGTTTGTCTGGGATCTCTCATTTCGGTAATGGTGATAATGGTTCTACCTCTGGGCCAGATGTATCTCGATCAAAGCTGTCCAGACA
TTTCCAAGTAGCAGATGATGAGTATGCAGATGGACATCAGCAGACGGTTTAGAGCATTGAGTTGGTGAATTTGTTGACTATCTGCTCCTATTATGATATCGCAAATATTG
TCGTTTACAACTATCTACTCAGTACTCAGGTGCGGCTGGGTAATGTAGAGCGATGTTGTACAGCTTCACATTATTAGGTATAATTCTTTCTTCCCGGATAAATCTTTTTT
CCATTGAAGGAAATGGCTTAAGCGATGCGTTGGCTTTGGCGTCTCAAGAGGGAAACAAATTCATAAAAAGAACTATGACTTTCACTCTTTCTGAATTCCGTAGATAAAGG
GCCAGTTAGATCTTGGAGCGGAAGAAATCTGTGGCTCATGGAAACGCATTTTTTCTATTGTAGTATTTTCAAATTTTGGGGAGATTGTGGAAATTAACGTACCTTACAAG
GTGAACTATCATTTTGTTTTGTTATATCTAAATAATACGTCTCGTGGTTCTCGACTGAATGTGTTTTCTTC
Protein sequenceShow/hide protein sequence
MLCNSLRDRLRPWLRDYDRLQSFAVILIYIQIGCALIGSLGALYNGVLLINLAIALFALVAIESSSQSLGRTYAALLFSAIFLDVFWFILFAYDTWNISSKQYGPLFTFS
VKLTLAMQIIGFSVRLSSSLLWIQIYRLGISYMETSVPREADYDLRNSFLSPATPVVVRQPSGSDDMIGGSIYDPTYYSSLFEDGQDSKCLSGISHFGNGDNGSTSGPDV
SRSKLSRHFQVADDEYADGHQQTV