; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI01G10360 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI01G10360
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionAnkyrin repeat-containing protein
Genome locationChr1:6533319..6534648
RNA-Seq ExpressionCSPI01G10360
SyntenyCSPI01G10360
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR002110 - Ankyrin repeat
IPR020683 - Ankyrin repeat-containing domain
IPR036770 - Ankyrin repeat-containing domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0055403.1 ankyrin repeat-containing protein [Cucumis melo var. makuwa]1.9e-11486.82Show/hide
Query:  MGSSALTSDDPISYFPSSSPSQT---SVADTVVINIIDGQGASMESKENIKNAVKLHEAALKGDWEAANNIFKKDSSWITKKITIRENTALHIAAAGKHI
        MGSS LTSDD ISYFPSSS SQT   SVAD VVI I D Q       ENIKNAVKLH+AALKGDWEAAN+IFKKD+SWITKKITIRENTALHIAAA KHI
Subjt:  MGSSALTSDDPISYFPSSSPSQT---SVADTVVINIIDGQGASMESKENIKNAVKLHEAALKGDWEAANNIFKKDSSWITKKITIRENTALHIAAAGKHI

Query:  SFVEKLVKLYSSNGFDLAIKNRDGRTALAYAAVSGIVRIAETIVDNDHKLRDPVDDAHLKYVPLLSSVFYKLKDMASYLFSQTNFNDLQTNQQLDLLLAT
        SFVEKLVKLYSSN FDLAIKNRDGR ALAYAA+SGIVRIAETIVDNDHKLRDPVDDAHLKY+PLLSSVFYKLKDMASYLFSQTNF+ LQT QQL+LLLAT
Subjt:  SFVEKLVKLYSSNGFDLAIKNRDGRTALAYAAVSGIVRIAETIVDNDHKLRDPVDDAHLKYVPLLSSVFYKLKDMASYLFSQTNFNDLQTNQQLDLLLAT

Query:  VDSDYYDIALDILKKKPDLAKERVGGTGETALHLLSRKPNAIGSSNKLCFWKKYINSR
        VDSDYYDIALDILKKKP+LAKERV   GETALHLL+RKPNAIGSSNKLCFWKKYINSR
Subjt:  VDSDYYDIALDILKKKPDLAKERVGGTGETALHLLSRKPNAIGSSNKLCFWKKYINSR

XP_008440746.1 PREDICTED: uncharacterized protein LOC103485063 isoform X1 [Cucumis melo]1.2e-6861.15Show/hide
Query:  TSDDPISYFPSSSPSQ----TSVADTVVINIIDGQGASMESK--ENIKNAVKLHEAALKGDWEAANNIFKKDSSWITKKITIRENTALHIAAAGKHISFV
        TSD   S F SS+ S     TS   +  +NI D QG +ME+K   NIK A++L++AALKGDWEAA  IFK+DSSWITKKITI++NTALH+AAA K I FV
Subjt:  TSDDPISYFPSSSPSQ----TSVADTVVINIIDGQGASMESK--ENIKNAVKLHEAALKGDWEAANNIFKKDSSWITKKITIRENTALHIAAAGKHISFV

Query:  EKLVKLYSSNGFDLAIKNRDGRTALAYAAVSGIVRIAETIVDNDHKL--RDPVDDAHL--KYVPLLSSVFYKLKDMASYLFSQTNFNDLQTNQQLDLLLA
        E+LVKLY SN  DL IKN +G TAL YAA+SG+VRIAE IV ND +      V+ AHL   YVPLL++V YK K+MASYLFS +    LQ  QQ  LLLA
Subjt:  EKLVKLYSSNGFDLAIKNRDGRTALAYAAVSGIVRIAETIVDNDHKL--RDPVDDAHL--KYVPLLSSVFYKLKDMASYLFSQTNFNDLQTNQQLDLLLA

Query:  TVDSDYYDIALDILKKKPDLAKERVGGTGETALHLLSRKPNAIGSSNKLCFWKKYINSRN
        T+DSDYYDIALDILK K  LAK R    G+TALHLL+RKP AIGSSN+L   KKYI+  N
Subjt:  TVDSDYYDIALDILKKKPDLAKERVGGTGETALHLLSRKPNAIGSSNKLCFWKKYINSRN

XP_008440754.1 PREDICTED: uncharacterized protein LOC103485063 isoform X2 [Cucumis melo]1.2e-6861.15Show/hide
Query:  TSDDPISYFPSSSPSQ----TSVADTVVINIIDGQGASMESK--ENIKNAVKLHEAALKGDWEAANNIFKKDSSWITKKITIRENTALHIAAAGKHISFV
        TSD   S F SS+ S     TS   +  +NI D QG +ME+K   NIK A++L++AALKGDWEAA  IFK+DSSWITKKITI++NTALH+AAA K I FV
Subjt:  TSDDPISYFPSSSPSQ----TSVADTVVINIIDGQGASMESK--ENIKNAVKLHEAALKGDWEAANNIFKKDSSWITKKITIRENTALHIAAAGKHISFV

Query:  EKLVKLYSSNGFDLAIKNRDGRTALAYAAVSGIVRIAETIVDNDHKL--RDPVDDAHL--KYVPLLSSVFYKLKDMASYLFSQTNFNDLQTNQQLDLLLA
        E+LVKLY SN  DL IKN +G TAL YAA+SG+VRIAE IV ND +      V+ AHL   YVPLL++V YK K+MASYLFS +    LQ  QQ  LLLA
Subjt:  EKLVKLYSSNGFDLAIKNRDGRTALAYAAVSGIVRIAETIVDNDHKL--RDPVDDAHL--KYVPLLSSVFYKLKDMASYLFSQTNFNDLQTNQQLDLLLA

Query:  TVDSDYYDIALDILKKKPDLAKERVGGTGETALHLLSRKPNAIGSSNKLCFWKKYINSRN
        T+DSDYYDIALDILK K  LAK R    G+TALHLL+RKP AIGSSN+L   KKYI+  N
Subjt:  TVDSDYYDIALDILKKKPDLAKERVGGTGETALHLLSRKPNAIGSSNKLCFWKKYINSRN

XP_008440794.1 PREDICTED: ankyrin repeat-containing protein At3g12360-like [Cucumis melo]3.1e-12089.53Show/hide
Query:  MGSSALTSDDPISYFPSSSPSQT---SVADTVVINIIDGQGASMESKENIKNAVKLHEAALKGDWEAANNIFKKDSSWITKKITIRENTALHIAAAGKHI
        MGSS LTSDD ISYFPSSS SQT   SVAD VVI I D QGASMESKENIKNAVKLH+AALKGDWEAAN+IFKKD+SWITKKITIRENTALHIAAA KHI
Subjt:  MGSSALTSDDPISYFPSSSPSQT---SVADTVVINIIDGQGASMESKENIKNAVKLHEAALKGDWEAANNIFKKDSSWITKKITIRENTALHIAAAGKHI

Query:  SFVEKLVKLYSSNGFDLAIKNRDGRTALAYAAVSGIVRIAETIVDNDHKLRDPVDDAHLKYVPLLSSVFYKLKDMASYLFSQTNFNDLQTNQQLDLLLAT
        SFVEKLVKLYSSN FDLAIKNRDGR ALAYAA+SGIVRIAETIVDNDHKLRDPVDDAHLKY+PLLSSVFYKLKDMASYLFSQTNF+ LQT QQL+LLLAT
Subjt:  SFVEKLVKLYSSNGFDLAIKNRDGRTALAYAAVSGIVRIAETIVDNDHKLRDPVDDAHLKYVPLLSSVFYKLKDMASYLFSQTNFNDLQTNQQLDLLLAT

Query:  VDSDYYDIALDILKKKPDLAKERVGGTGETALHLLSRKPNAIGSSNKLCFWKKYINSR
        VDSDYYDIALDILKKKP+LAKERV   GETALHLL+RKPNAIGSSNKLCFWKKYINSR
Subjt:  VDSDYYDIALDILKKKPDLAKERVGGTGETALHLLSRKPNAIGSSNKLCFWKKYINSR

XP_011648415.2 ankyrin repeat-containing protein ITN1 [Cucumis sativus]2.8e-137100Show/hide
Query:  MGSSALTSDDPISYFPSSSPSQTSVADTVVINIIDGQGASMESKENIKNAVKLHEAALKGDWEAANNIFKKDSSWITKKITIRENTALHIAAAGKHISFV
        MGSSALTSDDPISYFPSSSPSQTSVADTVVINIIDGQGASMESKENIKNAVKLHEAALKGDWEAANNIFKKDSSWITKKITIRENTALHIAAAGKHISFV
Subjt:  MGSSALTSDDPISYFPSSSPSQTSVADTVVINIIDGQGASMESKENIKNAVKLHEAALKGDWEAANNIFKKDSSWITKKITIRENTALHIAAAGKHISFV

Query:  EKLVKLYSSNGFDLAIKNRDGRTALAYAAVSGIVRIAETIVDNDHKLRDPVDDAHLKYVPLLSSVFYKLKDMASYLFSQTNFNDLQTNQQLDLLLATVDS
        EKLVKLYSSNGFDLAIKNRDGRTALAYAAVSGIVRIAETIVDNDHKLRDPVDDAHLKYVPLLSSVFYKLKDMASYLFSQTNFNDLQTNQQLDLLLATVDS
Subjt:  EKLVKLYSSNGFDLAIKNRDGRTALAYAAVSGIVRIAETIVDNDHKLRDPVDDAHLKYVPLLSSVFYKLKDMASYLFSQTNFNDLQTNQQLDLLLATVDS

Query:  DYYDIALDILKKKPDLAKERVGGTGETALHLLSRKPNAIGSSNKLCFWKKYINSR
        DYYDIALDILKKKPDLAKERVGGTGETALHLLSRKPNAIGSSNKLCFWKKYINSR
Subjt:  DYYDIALDILKKKPDLAKERVGGTGETALHLLSRKPNAIGSSNKLCFWKKYINSR

TrEMBL top hitse value%identityAlignment
A0A0A0LU57 ANK_REP_REGION domain-containing protein4.6e-106100Show/hide
Query:  MGSSALTSDDPISYFPSSSPSQTSVADTVVINIIDGQGASMESKENIKNAVKLHEAALKGDWEAANNIFKKDSSWITKKITIRENTALHIAAAGKHISFV
        MGSSALTSDDPISYFPSSSPSQTSVADTVVINIIDGQGASMESKENIKNAVKLHEAALKGDWEAANNIFKKDSSWITKKITIRENTALHIAAAGKHISFV
Subjt:  MGSSALTSDDPISYFPSSSPSQTSVADTVVINIIDGQGASMESKENIKNAVKLHEAALKGDWEAANNIFKKDSSWITKKITIRENTALHIAAAGKHISFV

Query:  EKLVKLYSSNGFDLAIKNRDGRTALAYAAVSGIVRIAETIVDNDHKLRDPVDDAHLKYVPLLSSVFYKLKDMASYLFSQTNFNDLQTNQQLDLLLATVDS
        EKLVKLYSSNGFDLAIKNRDGRTALAYAAVSGIVRIAETIVDNDHKLRDPVDDAHLKYVPLLSSVFYKLKDMASYLFSQTNFNDLQTNQQLDLLLATVDS
Subjt:  EKLVKLYSSNGFDLAIKNRDGRTALAYAAVSGIVRIAETIVDNDHKLRDPVDDAHLKYVPLLSSVFYKLKDMASYLFSQTNFNDLQTNQQLDLLLATVDS

Query:  DYY
        DYY
Subjt:  DYY

A0A1S3B1U8 uncharacterized protein LOC103485063 isoform X15.9e-6961.15Show/hide
Query:  TSDDPISYFPSSSPSQ----TSVADTVVINIIDGQGASMESK--ENIKNAVKLHEAALKGDWEAANNIFKKDSSWITKKITIRENTALHIAAAGKHISFV
        TSD   S F SS+ S     TS   +  +NI D QG +ME+K   NIK A++L++AALKGDWEAA  IFK+DSSWITKKITI++NTALH+AAA K I FV
Subjt:  TSDDPISYFPSSSPSQ----TSVADTVVINIIDGQGASMESK--ENIKNAVKLHEAALKGDWEAANNIFKKDSSWITKKITIRENTALHIAAAGKHISFV

Query:  EKLVKLYSSNGFDLAIKNRDGRTALAYAAVSGIVRIAETIVDNDHKL--RDPVDDAHL--KYVPLLSSVFYKLKDMASYLFSQTNFNDLQTNQQLDLLLA
        E+LVKLY SN  DL IKN +G TAL YAA+SG+VRIAE IV ND +      V+ AHL   YVPLL++V YK K+MASYLFS +    LQ  QQ  LLLA
Subjt:  EKLVKLYSSNGFDLAIKNRDGRTALAYAAVSGIVRIAETIVDNDHKL--RDPVDDAHL--KYVPLLSSVFYKLKDMASYLFSQTNFNDLQTNQQLDLLLA

Query:  TVDSDYYDIALDILKKKPDLAKERVGGTGETALHLLSRKPNAIGSSNKLCFWKKYINSRN
        T+DSDYYDIALDILK K  LAK R    G+TALHLL+RKP AIGSSN+L   KKYI+  N
Subjt:  TVDSDYYDIALDILKKKPDLAKERVGGTGETALHLLSRKPNAIGSSNKLCFWKKYINSRN

A0A1S3B1Y7 ankyrin repeat-containing protein At3g12360-like1.5e-12089.53Show/hide
Query:  MGSSALTSDDPISYFPSSSPSQT---SVADTVVINIIDGQGASMESKENIKNAVKLHEAALKGDWEAANNIFKKDSSWITKKITIRENTALHIAAAGKHI
        MGSS LTSDD ISYFPSSS SQT   SVAD VVI I D QGASMESKENIKNAVKLH+AALKGDWEAAN+IFKKD+SWITKKITIRENTALHIAAA KHI
Subjt:  MGSSALTSDDPISYFPSSSPSQT---SVADTVVINIIDGQGASMESKENIKNAVKLHEAALKGDWEAANNIFKKDSSWITKKITIRENTALHIAAAGKHI

Query:  SFVEKLVKLYSSNGFDLAIKNRDGRTALAYAAVSGIVRIAETIVDNDHKLRDPVDDAHLKYVPLLSSVFYKLKDMASYLFSQTNFNDLQTNQQLDLLLAT
        SFVEKLVKLYSSN FDLAIKNRDGR ALAYAA+SGIVRIAETIVDNDHKLRDPVDDAHLKY+PLLSSVFYKLKDMASYLFSQTNF+ LQT QQL+LLLAT
Subjt:  SFVEKLVKLYSSNGFDLAIKNRDGRTALAYAAVSGIVRIAETIVDNDHKLRDPVDDAHLKYVPLLSSVFYKLKDMASYLFSQTNFNDLQTNQQLDLLLAT

Query:  VDSDYYDIALDILKKKPDLAKERVGGTGETALHLLSRKPNAIGSSNKLCFWKKYINSR
        VDSDYYDIALDILKKKP+LAKERV   GETALHLL+RKPNAIGSSNKLCFWKKYINSR
Subjt:  VDSDYYDIALDILKKKPDLAKERVGGTGETALHLLSRKPNAIGSSNKLCFWKKYINSR

A0A1S3B2L9 uncharacterized protein LOC103485063 isoform X25.9e-6961.15Show/hide
Query:  TSDDPISYFPSSSPSQ----TSVADTVVINIIDGQGASMESK--ENIKNAVKLHEAALKGDWEAANNIFKKDSSWITKKITIRENTALHIAAAGKHISFV
        TSD   S F SS+ S     TS   +  +NI D QG +ME+K   NIK A++L++AALKGDWEAA  IFK+DSSWITKKITI++NTALH+AAA K I FV
Subjt:  TSDDPISYFPSSSPSQ----TSVADTVVINIIDGQGASMESK--ENIKNAVKLHEAALKGDWEAANNIFKKDSSWITKKITIRENTALHIAAAGKHISFV

Query:  EKLVKLYSSNGFDLAIKNRDGRTALAYAAVSGIVRIAETIVDNDHKL--RDPVDDAHL--KYVPLLSSVFYKLKDMASYLFSQTNFNDLQTNQQLDLLLA
        E+LVKLY SN  DL IKN +G TAL YAA+SG+VRIAE IV ND +      V+ AHL   YVPLL++V YK K+MASYLFS +    LQ  QQ  LLLA
Subjt:  EKLVKLYSSNGFDLAIKNRDGRTALAYAAVSGIVRIAETIVDNDHKL--RDPVDDAHL--KYVPLLSSVFYKLKDMASYLFSQTNFNDLQTNQQLDLLLA

Query:  TVDSDYYDIALDILKKKPDLAKERVGGTGETALHLLSRKPNAIGSSNKLCFWKKYINSRN
        T+DSDYYDIALDILK K  LAK R    G+TALHLL+RKP AIGSSN+L   KKYI+  N
Subjt:  TVDSDYYDIALDILKKKPDLAKERVGGTGETALHLLSRKPNAIGSSNKLCFWKKYINSRN

A0A5A7UHI9 Ankyrin repeat-containing protein9.3e-11586.82Show/hide
Query:  MGSSALTSDDPISYFPSSSPSQT---SVADTVVINIIDGQGASMESKENIKNAVKLHEAALKGDWEAANNIFKKDSSWITKKITIRENTALHIAAAGKHI
        MGSS LTSDD ISYFPSSS SQT   SVAD VVI I D Q       ENIKNAVKLH+AALKGDWEAAN+IFKKD+SWITKKITIRENTALHIAAA KHI
Subjt:  MGSSALTSDDPISYFPSSSPSQT---SVADTVVINIIDGQGASMESKENIKNAVKLHEAALKGDWEAANNIFKKDSSWITKKITIRENTALHIAAAGKHI

Query:  SFVEKLVKLYSSNGFDLAIKNRDGRTALAYAAVSGIVRIAETIVDNDHKLRDPVDDAHLKYVPLLSSVFYKLKDMASYLFSQTNFNDLQTNQQLDLLLAT
        SFVEKLVKLYSSN FDLAIKNRDGR ALAYAA+SGIVRIAETIVDNDHKLRDPVDDAHLKY+PLLSSVFYKLKDMASYLFSQTNF+ LQT QQL+LLLAT
Subjt:  SFVEKLVKLYSSNGFDLAIKNRDGRTALAYAAVSGIVRIAETIVDNDHKLRDPVDDAHLKYVPLLSSVFYKLKDMASYLFSQTNFNDLQTNQQLDLLLAT

Query:  VDSDYYDIALDILKKKPDLAKERVGGTGETALHLLSRKPNAIGSSNKLCFWKKYINSR
        VDSDYYDIALDILKKKP+LAKERV   GETALHLL+RKPNAIGSSNKLCFWKKYINSR
Subjt:  VDSDYYDIALDILKKKPDLAKERVGGTGETALHLLSRKPNAIGSSNKLCFWKKYINSR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G18670.1 Ankyrin repeat family protein1.3e-1531.41Show/hide
Query:  GDWEAANNIFKKDSSWITKKITIRENTALHIAAAGKHISFVEKLVKLYSSNGFDLAIKNRDGRTALAYAAVSGIVRIAETIVDNDHKLRDPVDDAHLKYV
        G+ EA  +   ++   +T  +T   +T +H A    HI  VE++++        L IKN +G TAL YAA  GIVRIAE +V+    L   V +A  +++
Subjt:  GDWEAANNIFKKDSSWITKKITIRENTALHIAAAGKHISFVEKLVKLYSSNGFDLAIKNRDGRTALAYAAVSGIVRIAETIVDNDHKLRDPVDDAHLKYV

Query:  PLLSSVFYKLKDMASYLFSQTNFNDLQTNQQLD---------LLLATVDSDYYDIALDILKKKPDLAKERVGGTGETALHLLSRKPNAIGS
        P++ +  Y  K +  YL+S T  +DL      D         L+   +    Y IALD++++ P LA  R     +TA+  L++ P A  S
Subjt:  PLLSSVFYKLKDMASYLFSQTNFNDLQTNQQLD---------LLLATVDSDYYDIALDILKKKPDLAKERVGGTGETALHLLSRKPNAIGS

AT3G54070.1 Ankyrin repeat family protein6.3e-2331.25Show/hide
Query:  LHEAALKGDWEAANNIFKKDSSWITKKITIRENTALHIAAAGKHISFVEKLVKLYSSNGFDLAIKNRDGRTALAYAAVSGIVRIAETIVDNDHKLRDPVD
        +++A L GDW+ A+ +  +    + ++IT     ALHIA A KH  FV  L++    +  DL++KN+DG T L++AA  G +  AE ++   + +RD  D
Subjt:  LHEAALKGDWEAANNIFKKDSSWITKKITIRENTALHIAAAGKHISFVEKLVKLYSSNGFDLAIKNRDGRTALAYAAVSGIVRIAETIVDNDHKLRDPVD

Query:  DAHLK-YVPLLSSVFYKLKDMASYLFSQTNFNDLQTNQQLDLLLATVDSDYYDIALDI---LKKKPDLAKERVG--GTGETALHLLSRKPNAIGSSNKLC
         ++ K   P+  +  Y   +M  YLFS+T+  DL   Q L+L    + +D Y +  D+   + ++ DL ++ +        ALHLL+RK +AI   ++L 
Subjt:  DAHLK-YVPLLSSVFYKLKDMASYLFSQTNFNDLQTNQQLDLLLATVDSDYYDIALDI---LKKKPDLAKERVG--GTGETALHLLSRKPNAIGSSNKLC

Query:  FWKKYINS
         +++  +S
Subjt:  FWKKYINS

AT5G35830.1 Ankyrin repeat family protein4.8e-2338.56Show/hide
Query:  VKLHEAALKGDWEAANNIFKKDSSWITKKITIRENTALHIAAAGKHISFVEKLVKLYSSNGFDLAIKNRDGRTALAYAAVSGIVRIAETIVDNDHKLRDP
        V+L++AALKGDW+AAN I  +    I +KIT +  T LHIA A KH  FV  L+    SN  DLA++N DG TAL +AA SG+V IA+ +++ +  L  P
Subjt:  VKLHEAALKGDWEAANNIFKKDSSWITKKITIRENTALHIAAAGKHISFVEKLVKLYSSNGFDLAIKNRDGRTALAYAAVSGIVRIAETIVDNDHKLRDP

Query:  VDDAHLKYVPLLSSVFYKLKDMASYLFSQTNFNDLQTNQQLDLLLATVDSDYY
        +     K  P+  +  +   +M  YL+  T F +    + ++L  A + +D Y
Subjt:  VDDAHLKYVPLLSSVFYKLKDMASYLFSQTNFNDLQTNQQLDLLLATVDSDYY


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAAGCTCAGCGTTGACTTCCGACGATCCTATTTCTTATTTTCCTTCTTCTTCTCCTTCACAAACCTCTGTTGCCGACACTGTGGTGATCAATATAATCGACGGACA
AGGTGCATCAATGGAGAGTAAAGAAAACATTAAAAATGCTGTTAAGTTACATGAAGCTGCTTTAAAGGGTGATTGGGAAGCTGCCAATAATATATTTAAAAAAGATTCAT
CATGGATCACTAAGAAGATAACTATAAGAGAGAATACGGCACTCCATATTGCTGCTGCTGGAAAGCATATTTCTTTTGTTGAAAAGTTGGTTAAACTTTACTCTTCAAAT
GGCTTTGACTTAGCTATAAAAAATAGAGATGGACGTACTGCCCTTGCTTATGCTGCTGTATCAGGAATTGTAAGGATCGCTGAAACAATTGTTGACAATGATCACAAGCT
TCGAGATCCTGTTGATGATGCTCATCTTAAATATGTTCCACTTCTTAGTTCTGTATTTTACAAACTCAAAGACATGGCTTCTTATCTTTTCTCTCAGACTAATTTTAATG
ATCTACAAACTAATCAGCAACTTGATCTTCTCTTAGCTACAGTGGACAGTGATTATTATGATATAGCATTAGATATTCTGAAAAAGAAACCTGATTTAGCAAAGGAGAGG
GTGGGAGGAACTGGTGAAACAGCTTTGCATTTACTATCTAGAAAGCCAAATGCAATTGGTAGCAGCAACAAGCTTTGCTTCTGGAAAAAATATATAAACTCTCGTAAT
mRNA sequenceShow/hide mRNA sequence
ATGGGAAGCTCAGCGTTGACTTCCGACGATCCTATTTCTTATTTTCCTTCTTCTTCTCCTTCACAAACCTCTGTTGCCGACACTGTGGTGATCAATATAATCGACGGACA
AGGTGCATCAATGGAGAGTAAAGAAAACATTAAAAATGCTGTTAAGTTACATGAAGCTGCTTTAAAGGGTGATTGGGAAGCTGCCAATAATATATTTAAAAAAGATTCAT
CATGGATCACTAAGAAGATAACTATAAGAGAGAATACGGCACTCCATATTGCTGCTGCTGGAAAGCATATTTCTTTTGTTGAAAAGTTGGTTAAACTTTACTCTTCAAAT
GGCTTTGACTTAGCTATAAAAAATAGAGATGGACGTACTGCCCTTGCTTATGCTGCTGTATCAGGAATTGTAAGGATCGCTGAAACAATTGTTGACAATGATCACAAGCT
TCGAGATCCTGTTGATGATGCTCATCTTAAATATGTTCCACTTCTTAGTTCTGTATTTTACAAACTCAAAGACATGGCTTCTTATCTTTTCTCTCAGACTAATTTTAATG
ATCTACAAACTAATCAGCAACTTGATCTTCTCTTAGCTACAGTGGACAGTGATTATTATGATATAGCATTAGATATTCTGAAAAAGAAACCTGATTTAGCAAAGGAGAGG
GTGGGAGGAACTGGTGAAACAGCTTTGCATTTACTATCTAGAAAGCCAAATGCAATTGGTAGCAGCAACAAGCTTTGCTTCTGGAAAAAATATATAAACTCTCGTAAT
Protein sequenceShow/hide protein sequence
MGSSALTSDDPISYFPSSSPSQTSVADTVVINIIDGQGASMESKENIKNAVKLHEAALKGDWEAANNIFKKDSSWITKKITIRENTALHIAAAGKHISFVEKLVKLYSSN
GFDLAIKNRDGRTALAYAAVSGIVRIAETIVDNDHKLRDPVDDAHLKYVPLLSSVFYKLKDMASYLFSQTNFNDLQTNQQLDLLLATVDSDYYDIALDILKKKPDLAKER
VGGTGETALHLLSRKPNAIGSSNKLCFWKKYINSRN