; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI07G06020 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI07G06020
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionBEST Arabidopsis thaliana protein match is: methyltransferases .
Genome locationChr7:4474789..4476561
RNA-Seq ExpressionCSPI07G06020
SyntenyCSPI07G06020
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0031439.1 uncharacterized protein E6C27_scaffold139G001960 [Cucumis melo var. makuwa]3.3e-10490.09Show/hide
Query:  MVILTFPCIVSILGQESGPSEFFSVPDIVDSEKLDLFFRDLGHEGFSNNGHKVLILSSAETNGLIQIRVLDGDEHKLNIVVDSDFDRTGLFSDDSFDFVL
        MVILTFPCIVSILGQESG SEFFSV D+VDS KLDLFFRDLGHEGFS NGHKVLILSSAET GLIQIRVLDGDEHKLNIVVDSDFDRTGLFSDDSFDFVL
Subjt:  MVILTFPCIVSILGQESGPSEFFSVPDIVDSEKLDLFFRDLGHEGFSNNGHKVLILSSAETNGLIQIRVLDGDEHKLNIVVDSDFDRTGLFSDDSFDFVL

Query:  SWGFMDSDFIDRILKIGGIVAFPLVNNNDPSSHFEKKPNYKPVFLNRYTSIIVAMEKTVMADKLVYVSASRRRLLKSSLPTRNAALRDLEDVTKPNELGR
        SW FMDSDFIDRILK GGIVAFPL+NNNDPS+HFEKKPNYKP+FLNRYTSIIVAMEKT +AD LVY SASRRRLLKSSLPT NAALRDLEDVTKPN+LGR
Subjt:  SWGFMDSDFIDRILKIGGIVAFPLVNNNDPSSHFEKKPNYKPVFLNRYTSIIVAMEKTVMADKLVYVSASRRRLLKSSLPTRNAALRDLEDVTKPNELGR

Query:  KIKYLPDVSKL-EETSSSVTSR
        KI YL DV KL EE+SSSVTSR
Subjt:  KIKYLPDVSKL-EETSSSVTSR

XP_008455527.1 PREDICTED: uncharacterized protein LOC103495679 [Cucumis melo]5.0e-12990.26Show/hide
Query:  MDLARFNRPITYAFDNISWNSKTHLVINFPTTQILRVISYSSFFAMVILTFPCIVSILGQESGPSEFFSVPDIVDSEKLDLFFRDLGHEGFSNNGHKVLI
        MDLARFNRP T+AFDNISWNSKTHLVINFP T+ILRVISYSSFFAMVILTFPCIVSILGQESG SEFFSV D+VDS KLDLFFRDLGHEGFS NGHKVLI
Subjt:  MDLARFNRPITYAFDNISWNSKTHLVINFPTTQILRVISYSSFFAMVILTFPCIVSILGQESGPSEFFSVPDIVDSEKLDLFFRDLGHEGFSNNGHKVLI

Query:  LSSAETNGLIQIRVLDGDEHKLNIVVDSDFDRTGLFSDDSFDFVLSWGFMDSDFIDRILKIGGIVAFPLVNNNDPSSHFEKKPNYKPVFLNRYTSIIVAM
        LSSAET GLIQIRVLDGDEHKLNIVVDSDFDRTGLFSDDSFDFVLSW FMDSDFIDRILK GGIVAFPL+NNNDPS+HFEKKPNYKP+FLNRYTSIIVAM
Subjt:  LSSAETNGLIQIRVLDGDEHKLNIVVDSDFDRTGLFSDDSFDFVLSWGFMDSDFIDRILKIGGIVAFPLVNNNDPSSHFEKKPNYKPVFLNRYTSIIVAM

Query:  EKTVMADKLVYVSASRRRLLKSSLPTRNAALRDLEDVTKPNELGRKIKYLPDVSKL-EETSSSVTSR
        EKT +AD LVY SASRRRLLKSSLPT NAALRDLEDVTKPN+LGRKI YL DV KL EE+SSSVTSR
Subjt:  EKTVMADKLVYVSASRRRLLKSSLPTRNAALRDLEDVTKPNELGRKIKYLPDVSKL-EETSSSVTSR

XP_011659719.1 uncharacterized protein LOC105436238 [Cucumis sativus]4.1e-14798.9Show/hide
Query:  MDLARFNRPITYAFDNISWNSKTHLVINFPTTQILRVISYSSFFAMVILTFPCIVSILGQESGPSEFFSVPDIVDSEKLDLFFRDLGHEGFSNNGHKVLI
        MDLARFNRPITYAFDNISWNSKTHLVINFPTTQILRVISYSSFFAMVILTFPCIVSILGQE+GPSEFFSVPD+VDSEKLDLFFRDLGHEGFSNNGHKVLI
Subjt:  MDLARFNRPITYAFDNISWNSKTHLVINFPTTQILRVISYSSFFAMVILTFPCIVSILGQESGPSEFFSVPDIVDSEKLDLFFRDLGHEGFSNNGHKVLI

Query:  LSSAETNGLIQIRVLDGDEHKLNIVVDSDFDRTGLFSDDSFDFVLSWGFMDSDFIDRILKIGGIVAFPLVNNNDPSSHFEKKPNYKPVFLNRYTSIIVAM
        LSSAETNGLIQIRVLDGDEHKLNIVVDSDFDRTGLFSDDSFDFVLSWGFMDSDFIDRILKIGGIVAFPLVNNNDPSSHFEKKPNYKPVFLNRYTSIIVAM
Subjt:  LSSAETNGLIQIRVLDGDEHKLNIVVDSDFDRTGLFSDDSFDFVLSWGFMDSDFIDRILKIGGIVAFPLVNNNDPSSHFEKKPNYKPVFLNRYTSIIVAM

Query:  EKTVMADKLVYVSASRRRLLKSSLPTRNAALRDLEDVTKPNELGRKIKYLPDVSKLEETSSSVTSRQEMLLE
        EKTVMADKLVY SASRRRLLKSSLPTRNAALRDLEDVTKPNELGRKIKYLPDVSKLEETSSSVTSRQEMLLE
Subjt:  EKTVMADKLVYVSASRRRLLKSSLPTRNAALRDLEDVTKPNELGRKIKYLPDVSKLEETSSSVTSRQEMLLE

XP_022141924.1 uncharacterized protein LOC111012177 [Momordica charantia]3.2e-7560.74Show/hide
Query:  MDLARFNRPITY----AFDN---ISWNSKTHLVINFPTTQILRVISYSSFFAMVILTFPCIVSILGQESGPSEFFSVPDIVDSEKLDLFFRDLGHEGFSN
        MD ARFNR         F N    +WNS THLVI FP  +IL VIS S F A+VILT PCIVSILG+ES  SEF SV D+VDS +LDL FRD G+EG + 
Subjt:  MDLARFNRPITY----AFDN---ISWNSKTHLVINFPTTQILRVISYSSFFAMVILTFPCIVSILGQESGPSEFFSVPDIVDSEKLDLFFRDLGHEGFSN

Query:  NGHKVLILSSAETNGLIQIRVLDGDEHKLNIVVDSDFDRTGLFSDDSFDFVLSWGFMDSDFIDRILKIGGIVAFPLVNNNDPSSHFEKKPNYKPVFLNRY
        NG K +ILSS  T+GL Q+RV+D DE KL+IV+DSDFD++GLFSDDSFDFV +WG +DSDF+DRILK GGI+AFP  N+  PS+HF+KKPNY+PVFL+RY
Subjt:  NGHKVLILSSAETNGLIQIRVLDGDEHKLNIVVDSDFDRTGLFSDDSFDFVLSWGFMDSDFIDRILKIGGIVAFPLVNNNDPSSHFEKKPNYKPVFLNRY

Query:  TSIIVAMEKTVMADKLVYVSASRRRLLKSSLPTRNAALRDLED----------VTKPNELGRKIKYLPDV
        +SIIVAMEKT M D +VY SASRR L + S  T  AA+R LE+          V KP+ L RKIKY+ D+
Subjt:  TSIIVAMEKTVMADKLVYVSASRRRLLKSSLPTRNAALRDLED----------VTKPNELGRKIKYLPDV

XP_038889013.1 uncharacterized protein LOC120078778 [Benincasa hispida]3.7e-10877.37Show/hide
Query:  MDLARFNRPITYAFDNISWNSKTHLVINFPTTQILRVISYSSFFAMVILTFPCIVSILGQESGPSEFFSVPDIVDSEKLDLFFRDLGHEGFSNNGHKVLI
        MD   FNRP + AFD +SWNSKTHLVI FP TQILRVISYS FFAM ILTFP IVSILGQESG SEFFSV D++DSE+LDLFFRDLGHEG + NGHK LI
Subjt:  MDLARFNRPITYAFDNISWNSKTHLVINFPTTQILRVISYSSFFAMVILTFPCIVSILGQESGPSEFFSVPDIVDSEKLDLFFRDLGHEGFSNNGHKVLI

Query:  LSSAETNGLIQIRVLDGDEHKLNIVVDSDFDRTGLFSDDSFDFVLSWGFMDSDFIDRILKIGGIVAFPLVNNNDPSSHFEKKPNYKPVFLNRYTSIIVAM
        LSSAET GLIQIRVLDGDEHKLNIVVDSDFDR+GLFSDDSFDFVLS G +DSDFIDRILKIGGIVAFPL NNNDPS+HF+KKPNY+PVFLNRY+SIIV M
Subjt:  LSSAETNGLIQIRVLDGDEHKLNIVVDSDFDRTGLFSDDSFDFVLSWGFMDSDFIDRILKIGGIVAFPLVNNNDPSSHFEKKPNYKPVFLNRYTSIIVAM

Query:  EKTVMADKLVYVSASRRRLLKSSLPTRNAALRDLE---------DVTKPNELGRKIKYLPDVSKLEETSSSVTS
        EKT MAD+LVY S+SRRRL + SLPTRNAALRDLE         DV KPN+LGRK+KYLPD+  +++ SS + S
Subjt:  EKTVMADKLVYVSASRRRLLKSSLPTRNAALRDLE---------DVTKPNELGRKIKYLPDVSKLEETSSSVTS

TrEMBL top hitse value%identityAlignment
A0A0A0K451 Uncharacterized protein2.0e-14798.9Show/hide
Query:  MDLARFNRPITYAFDNISWNSKTHLVINFPTTQILRVISYSSFFAMVILTFPCIVSILGQESGPSEFFSVPDIVDSEKLDLFFRDLGHEGFSNNGHKVLI
        MDLARFNRPITYAFDNISWNSKTHLVINFPTTQILRVISYSSFFAMVILTFPCIVSILGQE+GPSEFFSVPD+VDSEKLDLFFRDLGHEGFSNNGHKVLI
Subjt:  MDLARFNRPITYAFDNISWNSKTHLVINFPTTQILRVISYSSFFAMVILTFPCIVSILGQESGPSEFFSVPDIVDSEKLDLFFRDLGHEGFSNNGHKVLI

Query:  LSSAETNGLIQIRVLDGDEHKLNIVVDSDFDRTGLFSDDSFDFVLSWGFMDSDFIDRILKIGGIVAFPLVNNNDPSSHFEKKPNYKPVFLNRYTSIIVAM
        LSSAETNGLIQIRVLDGDEHKLNIVVDSDFDRTGLFSDDSFDFVLSWGFMDSDFIDRILKIGGIVAFPLVNNNDPSSHFEKKPNYKPVFLNRYTSIIVAM
Subjt:  LSSAETNGLIQIRVLDGDEHKLNIVVDSDFDRTGLFSDDSFDFVLSWGFMDSDFIDRILKIGGIVAFPLVNNNDPSSHFEKKPNYKPVFLNRYTSIIVAM

Query:  EKTVMADKLVYVSASRRRLLKSSLPTRNAALRDLEDVTKPNELGRKIKYLPDVSKLEETSSSVTSRQEMLLE
        EKTVMADKLVY SASRRRLLKSSLPTRNAALRDLEDVTKPNELGRKIKYLPDVSKLEETSSSVTSRQEMLLE
Subjt:  EKTVMADKLVYVSASRRRLLKSSLPTRNAALRDLEDVTKPNELGRKIKYLPDVSKLEETSSSVTSRQEMLLE

A0A1S3C0P0 uncharacterized protein LOC1034956792.4e-12990.26Show/hide
Query:  MDLARFNRPITYAFDNISWNSKTHLVINFPTTQILRVISYSSFFAMVILTFPCIVSILGQESGPSEFFSVPDIVDSEKLDLFFRDLGHEGFSNNGHKVLI
        MDLARFNRP T+AFDNISWNSKTHLVINFP T+ILRVISYSSFFAMVILTFPCIVSILGQESG SEFFSV D+VDS KLDLFFRDLGHEGFS NGHKVLI
Subjt:  MDLARFNRPITYAFDNISWNSKTHLVINFPTTQILRVISYSSFFAMVILTFPCIVSILGQESGPSEFFSVPDIVDSEKLDLFFRDLGHEGFSNNGHKVLI

Query:  LSSAETNGLIQIRVLDGDEHKLNIVVDSDFDRTGLFSDDSFDFVLSWGFMDSDFIDRILKIGGIVAFPLVNNNDPSSHFEKKPNYKPVFLNRYTSIIVAM
        LSSAET GLIQIRVLDGDEHKLNIVVDSDFDRTGLFSDDSFDFVLSW FMDSDFIDRILK GGIVAFPL+NNNDPS+HFEKKPNYKP+FLNRYTSIIVAM
Subjt:  LSSAETNGLIQIRVLDGDEHKLNIVVDSDFDRTGLFSDDSFDFVLSWGFMDSDFIDRILKIGGIVAFPLVNNNDPSSHFEKKPNYKPVFLNRYTSIIVAM

Query:  EKTVMADKLVYVSASRRRLLKSSLPTRNAALRDLEDVTKPNELGRKIKYLPDVSKL-EETSSSVTSR
        EKT +AD LVY SASRRRLLKSSLPT NAALRDLEDVTKPN+LGRKI YL DV KL EE+SSSVTSR
Subjt:  EKTVMADKLVYVSASRRRLLKSSLPTRNAALRDLEDVTKPNELGRKIKYLPDVSKL-EETSSSVTSR

A0A5A7SQ50 Uncharacterized protein1.6e-10490.09Show/hide
Query:  MVILTFPCIVSILGQESGPSEFFSVPDIVDSEKLDLFFRDLGHEGFSNNGHKVLILSSAETNGLIQIRVLDGDEHKLNIVVDSDFDRTGLFSDDSFDFVL
        MVILTFPCIVSILGQESG SEFFSV D+VDS KLDLFFRDLGHEGFS NGHKVLILSSAET GLIQIRVLDGDEHKLNIVVDSDFDRTGLFSDDSFDFVL
Subjt:  MVILTFPCIVSILGQESGPSEFFSVPDIVDSEKLDLFFRDLGHEGFSNNGHKVLILSSAETNGLIQIRVLDGDEHKLNIVVDSDFDRTGLFSDDSFDFVL

Query:  SWGFMDSDFIDRILKIGGIVAFPLVNNNDPSSHFEKKPNYKPVFLNRYTSIIVAMEKTVMADKLVYVSASRRRLLKSSLPTRNAALRDLEDVTKPNELGR
        SW FMDSDFIDRILK GGIVAFPL+NNNDPS+HFEKKPNYKP+FLNRYTSIIVAMEKT +AD LVY SASRRRLLKSSLPT NAALRDLEDVTKPN+LGR
Subjt:  SWGFMDSDFIDRILKIGGIVAFPLVNNNDPSSHFEKKPNYKPVFLNRYTSIIVAMEKTVMADKLVYVSASRRRLLKSSLPTRNAALRDLEDVTKPNELGR

Query:  KIKYLPDVSKL-EETSSSVTSR
        KI YL DV KL EE+SSSVTSR
Subjt:  KIKYLPDVSKL-EETSSSVTSR

A0A6J1CK51 uncharacterized protein LOC1110121771.5e-7560.74Show/hide
Query:  MDLARFNRPITY----AFDN---ISWNSKTHLVINFPTTQILRVISYSSFFAMVILTFPCIVSILGQESGPSEFFSVPDIVDSEKLDLFFRDLGHEGFSN
        MD ARFNR         F N    +WNS THLVI FP  +IL VIS S F A+VILT PCIVSILG+ES  SEF SV D+VDS +LDL FRD G+EG + 
Subjt:  MDLARFNRPITY----AFDN---ISWNSKTHLVINFPTTQILRVISYSSFFAMVILTFPCIVSILGQESGPSEFFSVPDIVDSEKLDLFFRDLGHEGFSN

Query:  NGHKVLILSSAETNGLIQIRVLDGDEHKLNIVVDSDFDRTGLFSDDSFDFVLSWGFMDSDFIDRILKIGGIVAFPLVNNNDPSSHFEKKPNYKPVFLNRY
        NG K +ILSS  T+GL Q+RV+D DE KL+IV+DSDFD++GLFSDDSFDFV +WG +DSDF+DRILK GGI+AFP  N+  PS+HF+KKPNY+PVFL+RY
Subjt:  NGHKVLILSSAETNGLIQIRVLDGDEHKLNIVVDSDFDRTGLFSDDSFDFVLSWGFMDSDFIDRILKIGGIVAFPLVNNNDPSSHFEKKPNYKPVFLNRY

Query:  TSIIVAMEKTVMADKLVYVSASRRRLLKSSLPTRNAALRDLED----------VTKPNELGRKIKYLPDV
        +SIIVAMEKT M D +VY SASRR L + S  T  AA+R LE+          V KP+ L RKIKY+ D+
Subjt:  TSIIVAMEKTVMADKLVYVSASRRRLLKSSLPTRNAALRDLED----------VTKPNELGRKIKYLPDV

A0A6J5W009 Uncharacterized protein1.2e-3541.39Show/hide
Query:  NSKTHLVINFPTTQILRVISYSSFFAMVILTFPCIVSIL-GQESGPSEFFSVPDIVDSEKLDLFFRDLGHEGFSNNGHKVLILSSAETNGLIQIRVLDGD
        +S+THLVI  P +++LR+IS S F  +VILT PCI S+L G      ++ +  +I + E+L   F DL  EG      K LI+S +    +  +     D
Subjt:  NSKTHLVINFPTTQILRVISYSSFFAMVILTFPCIVSIL-GQESGPSEFFSVPDIVDSEKLDLFFRDLGHEGFSNNGHKVLILSSAETNGLIQIRVLDGD

Query:  EHKLNIVVDSDFDRTGLFSDDSFDFVLSWGFMDSDFIDRILKIGGIVAFPLVNNNDPSSHFEKKPNYKPVFLNRYTSIIVAMEKTVMADKLVYVSASRRR
         +  +IV+DSD +R   F D+S DFV ++  +D+ F+DRILKIGGIVA PL  +NDPS+ F  KPNYK V+L RY S  VAM KT  +  L   S   RR
Subjt:  EHKLNIVVDSDFDRTGLFSDDSFDFVLSWGFMDSDFIDRILKIGGIVAFPLVNNNDPSSHFEKKPNYKPVFLNRYTSIIVAMEKTVMADKLVYVSASRRR

Query:  LLKSSLPTRNAALRDLEDV---------TKPNELGRKIKYLPDV
        L +     +   L+ LEDV          K NE  +KIK+LP++
Subjt:  LLKSSLPTRNAALRDLEDV---------TKPNELGRKIKYLPDV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G58120.1 BEST Arabidopsis thaliana protein match is: methyltransferases (TAIR:AT5G01710.1)2.4e-1228.21Show/hide
Query:  TQILRVISYSSFFAMVILTFPCIVSILGQESGPSEFFSVPDIVDSEKLDLFFRDLGHEGFSNNGHKVLILSSAETNGLIQIRVLDGDEHKLNIVVDSDFD
        +++L +   S+  A++ L+F  +  +    +  +   SV   +  E L L   DL  +G    G K L LS  +    +        E  + +V  SD +
Subjt:  TQILRVISYSSFFAMVILTFPCIVSILGQESGPSEFFSVPDIVDSEKLDLFFRDLGHEGFSNNGHKVLILSSAETNGLIQIRVLDGDEHKLNIVVDSDFD

Query:  RTGLFSDDSFDFVLSWG-FMDS-DFIDRILKIGGIVAFPLVNNNDPSSHFEKKPNYKPVFLNRYTSIIVAMEKTVMADKLVYVSASRRRLLK-SSLPTRN
           +  D++FDF  +    +DS +FIDR LK+GGI    L N  D   +F K PNY+ V++      ++ M KT   ++   + A+ R+LL  +    R 
Subjt:  RTGLFSDDSFDFVLSWG-FMDS-DFIDRILKIGGIVAFPLVNNNDPSSHFEKKPNYKPVFLNRYTSIIVAMEKTVMADKLVYVSASRRRLLK-SSLPTRN

Query:  AALRDLEDV---------TKPNELGRKIKYLPDV
         ALR LEDV          K     ++ +YLPD+
Subjt:  AALRDLEDV---------TKPNELGRKIKYLPDV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATTTAGCTCGTTTCAATCGACCCATTACCTATGCATTTGACAATATCAGCTGGAATTCTAAGACCCATTTGGTTATTAACTTTCCTACTACTCAAATTCTTCGTGT
GATTTCTTATTCGTCGTTTTTCGCTATGGTTATTCTCACGTTTCCTTGTATTGTGTCCATTCTTGGGCAAGAAAGTGGGCCGTCTGAGTTTTTTTCTGTACCAGATATAG
TTGATTCTGAGAAATTGGATTTGTTTTTTCGTGATTTGGGTCATGAAGGTTTTTCCAATAACGGCCATAAGGTTCTCATTTTGAGCTCTGCTGAAACAAATGGCTTGATT
CAGATACGTGTGTTGGATGGTGATGAACACAAACTCAACATTGTTGTGGACTCTGATTTTGATCGAACTGGGTTGTTTTCTGACGATTCTTTTGATTTCGTGTTATCTTG
GGGCTTTATGGACTCTGATTTCATTGATAGAATTCTGAAAATAGGTGGCATTGTGGCTTTTCCACTCGTTAATAACAATGACCCATCAAGTCATTTTGAGAAGAAACCAA
ATTACAAACCTGTGTTTCTCAATAGATACACCTCCATTATTGTTGCAATGGAGAAGACAGTCATGGCTGATAAGCTAGTTTATGTTTCAGCTTCAAGAAGACGTCTCTTA
AAATCCTCATTGCCAACTAGAAATGCAGCTTTGAGAGACCTTGAGGACGTGACCAAACCAAACGAACTTGGGAGGAAAATCAAGTACCTTCCCGACGTTTCTAAACTTGA
AGAAACATCAAGCAGCGTCACGTCACGTCAAGAAATGCTGTTGGAGTAG
mRNA sequenceShow/hide mRNA sequence
GCATAATCATAAAACGGTGTCGAGTTGAGCTTTGCATAATCATAAAACGGTGTCGAGAGTCTAATTTTAACACTTTAATTTGAGATTTGTACCAATTGGAATATCCTTTT
TGACTCAAAACTGAAAACACTGCTCTGACACGAACCCTCTCATGGTGAAGGCCTAAATAAAGCAAAGCCCTTTTTTTTTCACGACAACGCACGCTCAACGTACTCAAACC
GCTCAATACCAATCTCGCAAATGGTTATGCGTATTAAATCTCCCAATTTCTCCTCGACCCTTTTCCCATCCAAAACAATCAAAACGAAAGGAAAATAAAGCCTTCACAAA
TCTCCTTATCTTCTTCCTCCGCTCCGGTCGGATCCATGGTGTTTCAGCCGTAATCAGCAATGCGTCATCGTTGATCCGATCTGGGTCGGTGATGAATCTCAAGAAAAAAG
CCATGGAGTTTGAAAGAATTCGAGATTTGTTGCTCATTTGGTGCGGAATTCGAAGCCTCCGAGTGGCCCTCGTGGGTGGAAATCACACCGCTGCCAGGTTTTGCTCTCGC
TAGAATCGCTTTTCTCACTCCTTATCTAACCCCACCTCAATTTTCCTCTTCTTCTTCACCTTATCTTCAACTTTTAGCATTTGGGTATTTCCCTAATTATCAAACCATTC
CACATTCTCCTACGAATTTGCCATTGAAATCATGGATTTAGCTCGTTTCAATCGACCCATTACCTATGCATTTGACAATATCAGCTGGAATTCTAAGACCCATTTGGTTA
TTAACTTTCCTACTACTCAAATTCTTCGTGTGATTTCTTATTCGTCGTTTTTCGCTATGGTTATTCTCACGTTTCCTTGTATTGTGTCCATTCTTGGGCAAGAAAGTGGG
CCGTCTGAGTTTTTTTCTGTACCAGATATAGTTGATTCTGAGAAATTGGATTTGTTTTTTCGTGATTTGGGTCATGAAGGTTTTTCCAATAACGGCCATAAGGTTCTCAT
TTTGAGCTCTGCTGAAACAAATGGCTTGATTCAGATACGTGTGTTGGATGGTGATGAACACAAACTCAACATTGTTGTGGACTCTGATTTTGATCGAACTGGGTTGTTTT
CTGACGATTCTTTTGATTTCGTGTTATCTTGGGGCTTTATGGACTCTGATTTCATTGATAGAATTCTGAAAATAGGTGGCATTGTGGCTTTTCCACTCGTTAATAACAAT
GACCCATCAAGTCATTTTGAGAAGAAACCAAATTACAAACCTGTGTTTCTCAATAGATACACCTCCATTATTGTTGCAATGGAGAAGACAGTCATGGCTGATAAGCTAGT
TTATGTTTCAGCTTCAAGAAGACGTCTCTTAAAATCCTCATTGCCAACTAGAAATGCAGCTTTGAGAGACCTTGAGGACGTGACCAAACCAAACGAACTTGGGAGGAAAA
TCAAGTACCTTCCCGACGTTTCTAAACTTGAAGAAACATCAAGCAGCGTCACGTCACGTCAAGAAATGCTGTTGGAGTAGAAGAATGAATGGTGTGAGAAGAAGAGGAAA
AGAGGTTTTGGGGGATTGTTGAATGTTGATTGATGTGTTTGTATGTCAAGTTTGAAAGATGAGGGGAGTGGGGAGTGTATTTAATGGTGGGGTTAAAAGGGAAAGTGTTT
TTTCTTCTGTAATTGATGTAATGTGGTAGAAAAGCTAGTGCTATGCTTCCTGATGTGGTTTTAAGAAGTTCATCAATGGTGGATGTTTTCAGTTTGTTTGTGTCATATAC
TTGGCTTTCGCAT
Protein sequenceShow/hide protein sequence
MDLARFNRPITYAFDNISWNSKTHLVINFPTTQILRVISYSSFFAMVILTFPCIVSILGQESGPSEFFSVPDIVDSEKLDLFFRDLGHEGFSNNGHKVLILSSAETNGLI
QIRVLDGDEHKLNIVVDSDFDRTGLFSDDSFDFVLSWGFMDSDFIDRILKIGGIVAFPLVNNNDPSSHFEKKPNYKPVFLNRYTSIIVAMEKTVMADKLVYVSASRRRLL
KSSLPTRNAALRDLEDVTKPNELGRKIKYLPDVSKLEETSSSVTSRQEMLLE