CuGenDBv2

CSPI05G01840 (gene) of Cucumber (PI 183967) v1 genome

Gene ID: CSPI05G01840
Organism: Cucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
Description: Integrator complex subunit 7
Genome location: Chr5:2502124..2503945
RNA-Seq Expression: CSPI05G01840
Synteny: CSPI05G01840
Gene Ontology terms:
  GO:0016180 - snRNA processing (biological process)
  GO:0016021 - integral component of membrane (cellular component)
  GO:0032039 - integrator complex (cellular component)
InterPro domains:
  IPR016024 - Armadillo-type fold
  IPR033060 - Integrator complex subunit 7
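
The genome location field uses a `seqid:start..end` pattern. A minimal sketch for parsing it is shown here; the names `Location` and `parse_location` are hypothetical, and the span calculation assumes 1-based inclusive coordinates, which this page does not state explicitly.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A parsed genome location (hypothetical helper type)."""
    seqid: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # Assumes 1-based, inclusive coordinates (not confirmed by the page).
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a string of the form 'Chr5:2502124..2503945'."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    return Location(match.group(1), int(match.group(2)), int(match.group(3)))


loc = parse_location("Chr5:2502124..2503945")
print(loc.seqid, loc.start, loc.end, loc.span)
```

Under that coordinate assumption the gene spans 1822 bp on Chr5.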


Homology
GenBank top hits | e value | % identity
KAA0040971.1 integrator complex subunit 7 [Cucumis melo var. makuwa] | 9.3e-119 | 94.17
Query:  MERSSAACAMEWSIELEKALRLKKPGRAVEAIRQIGCRLQQWSREPEPNVAVYNMFDLVTWEDRLFSNTILLRLADAFKFDDKHIRLAVVRVFLSELYSR
        MER+SAACAMEWSIELEKALR KKPGRAVEAIRQIGCRLQQWSREPEPN+AVYNMFDLVTWED+LFSNTILLRLADAFK DDKHIRLAVVRVFLSELYSR
Subjt:  MERSSAACAMEWSIELEKALRLKKPGRAVEAIRQIGCRLQQWSREPEPNVAVYNMFDLVTWEDRLFSNTILLRLADAFKFDDKHIRLAVVRVFLSELYSR

Query:  DSSRSKQYQGILSKARVQNHHELLTRVKVVLNGGDPEARGLALILLGCWAHFAKDSAQIRYLIFSSLFSSHLSEVKASIFAAACICQLADDFAQVFLAIL
        DSSRSKQYQGILSKARVQN HELLTRVKVVL+GGDPEA+ LALI+LGCWAHFAKDSAQIRYLIF SLFSSHLSEVKASIFAAACI QLADDFAQVFL IL
Subjt:  DSSRSKQYQGILSKARVQNHHELLTRVKVVLNGGDPEARGLALILLGCWAHFAKDSAQIRYLIFSSLFSSHLSEVKASIFAAACICQLADDFAQVFLAIL

Query:  VNIMTSTTSLTIRMAGARVFAKLGCSHSMAKTAYKVMLVF
        VNIMTSTTSL IRMAGARVFAKLGCSHSMAKTAYKVMLVF
Subjt:  VNIMTSTTSLTIRMAGARVFAKLGCSHSMAKTAYKVMLVF

XP_008464722.1 PREDICTED: uncharacterized protein LOC103502541 isoform X1 [Cucumis melo] | 3.3e-116 | 93.28
Query:  MERSSAACAMEWSIELEKALRLKKPGRAVEAIRQIGCRLQQWSREPEPNVAVYNMFDLVTWEDRLFSNTILLRLADAFKFDDKHIRLAVVRVFLSELYSR
        MER+SAACAMEWSIELEKALR KKPGRAVEAIRQIGCRLQQWSREPEPN+AVYNMFDLVTWED+LFSNTILLRLADAFK DDKHIRLAVVRVFLSELYSR
Subjt:  MERSSAACAMEWSIELEKALRLKKPGRAVEAIRQIGCRLQQWSREPEPNVAVYNMFDLVTWEDRLFSNTILLRLADAFKFDDKHIRLAVVRVFLSELYSR

Query:  DSSRSKQYQGILSKARVQNHHELLTRVKVVLNGGDPEARGLALILLGCWAHFAKDSAQIRYLIFSSLFSSHLSEVKASIFAAACICQLADDFAQVFLAIL
        DSSRSKQYQGILSKARVQN HELLTRVKVVL+GGDPEA+ LALI+LGCWAHFAKDSAQIRYLIF SLFSSHLSEVKASIFAAACI QLADDFAQVFL IL
Subjt:  DSSRSKQYQGILSKARVQNHHELLTRVKVVLNGGDPEARGLALILLGCWAHFAKDSAQIRYLIFSSLFSSHLSEVKASIFAAACICQLADDFAQVFLAIL

Query:  VNIMTSTTSLTIRMAGARVFAKLGCSHSMAKTAYKVML
        VNIMTSTTSL IRMAGARVFAKLGCSHSMAKTAYK  L
Subjt:  VNIMTSTTSLTIRMAGARVFAKLGCSHSMAKTAYKVML
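
In BLAST-style pairwise output, the middle line of each block carries a letter for an identical residue, `+` for a conservative substitution, and a space for a mismatch or gap. The sketch below counts these over the three midline segments of the XP_008464722.1 alignment above. The function name `midline_stats` is hypothetical, and it assumes that the reported % identity is identities divided by alignment length, with the leading indent removed but internal spaces preserved.

```python
def midline_stats(midlines):
    """Return (identities, positives, alignment_length) for BLAST midlines."""
    identities = positives = length = 0
    for line in midlines:
        length += len(line)
        for ch in line:
            if ch.isalpha():      # identical residue
                identities += 1
                positives += 1
            elif ch == "+":       # conservative substitution
                positives += 1
    return identities, positives, length


# Midline segments copied from the XP_008464722.1 alignment (indent removed).
segments = [
    "MER+SAACAMEWSIELEKALR KKPGRAVEAIRQIGCRLQQWSREPEPN+AVYNMFDLVTWED+LFSNTILLRLADAFK DDKHIRLAVVRVFLSELYSR",
    "DSSRSKQYQGILSKARVQN HELLTRVKVVL+GGDPEA+ LALI+LGCWAHFAKDSAQIRYLIF SLFSSHLSEVKASIFAAACI QLADDFAQVFL IL",
    "VNIMTSTTSL IRMAGARVFAKLGCSHSMAKTAYK  L",
]

identities, positives, length = midline_stats(segments)
percent_identity = round(100 * identities / length, 2)
print(identities, positives, length, percent_identity)  # 222 228 238 93.28
```

The result agrees with the 93.28 listed for this hit. A segment that ends in a mismatch would have a trailing space that is easy to lose when copying, so the length is safer to take from the Query line in that case.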

XP_011654518.1 uncharacterized protein LOC101204851 isoform X1 [Cucumis sativus] | 3.3e-124 | 99.16
Query:  MERSSAACAMEWSIELEKALRLKKPGRAVEAIRQIGCRLQQWSREPEPNVAVYNMFDLVTWEDRLFSNTILLRLADAFKFDDKHIRLAVVRVFLSELYSR
        MERSSAACAMEWSIELEKALRLKKPGRAVEAIRQIGCRLQQWSREPEPNVAVYNMFDLVTWEDRLFSNTILLRLADAFKFDDKHIRLAVVRVFLSELYSR
Subjt:  MERSSAACAMEWSIELEKALRLKKPGRAVEAIRQIGCRLQQWSREPEPNVAVYNMFDLVTWEDRLFSNTILLRLADAFKFDDKHIRLAVVRVFLSELYSR

Query:  DSSRSKQYQGILSKARVQNHHELLTRVKVVLNGGDPEARGLALILLGCWAHFAKDSAQIRYLIFSSLFSSHLSEVKASIFAAACICQLADDFAQVFLAIL
        DSSRSKQYQGILSKARVQNHHELLTRVKVVLNGGDPEARGLALILLGCWAHFAKDSAQIRYLIFSSLFSSHLSEVKASIFAAACICQLADDFAQVFLAIL
Subjt:  DSSRSKQYQGILSKARVQNHHELLTRVKVVLNGGDPEARGLALILLGCWAHFAKDSAQIRYLIFSSLFSSHLSEVKASIFAAACICQLADDFAQVFLAIL

Query:  VNIMTSTTSLTIRMAGARVFAKLGCSHSMAKTAYKVML
        VNIMTSTTSLTIRMAGARVFAKLGCSHSMAKTAYK  L
Subjt:  VNIMTSTTSLTIRMAGARVFAKLGCSHSMAKTAYKVML

XP_011654519.1 uncharacterized protein LOC101204851 isoform X2 [Cucumis sativus] | 1.1e-124 | 98.74
Query:  MERSSAACAMEWSIELEKALRLKKPGRAVEAIRQIGCRLQQWSREPEPNVAVYNMFDLVTWEDRLFSNTILLRLADAFKFDDKHIRLAVVRVFLSELYSR
        MERSSAACAMEWSIELEKALRLKKPGRAVEAIRQIGCRLQQWSREPEPNVAVYNMFDLVTWEDRLFSNTILLRLADAFKFDDKHIRLAVVRVFLSELYSR
Subjt:  MERSSAACAMEWSIELEKALRLKKPGRAVEAIRQIGCRLQQWSREPEPNVAVYNMFDLVTWEDRLFSNTILLRLADAFKFDDKHIRLAVVRVFLSELYSR

Query:  DSSRSKQYQGILSKARVQNHHELLTRVKVVLNGGDPEARGLALILLGCWAHFAKDSAQIRYLIFSSLFSSHLSEVKASIFAAACICQLADDFAQVFLAIL
        DSSRSKQYQGILSKARVQNHHELLTRVKVVLNGGDPEARGLALILLGCWAHFAKDSAQIRYLIFSSLFSSHLSEVKASIFAAACICQLADDFAQVFLAIL
Subjt:  DSSRSKQYQGILSKARVQNHHELLTRVKVVLNGGDPEARGLALILLGCWAHFAKDSAQIRYLIFSSLFSSHLSEVKASIFAAACICQLADDFAQVFLAIL

Query:  VNIMTSTTSLTIRMAGARVFAKLGCSHSMAKTAYKVMLV
        VNIMTSTTSLTIRMAGARVFAKLGCSHSMAKTAYKV  +
Subjt:  VNIMTSTTSLTIRMAGARVFAKLGCSHSMAKTAYKVMLV

XP_038892419.1 uncharacterized protein LOC120081531 isoform X1 [Benincasa hispida] | 3.1e-114 | 91.6
Query:  MERSSAACAMEWSIELEKALRLKKPGRAVEAIRQIGCRLQQWSREPEPNVAVYNMFDLVTWEDRLFSNTILLRLADAFKFDDKHIRLAVVRVFLSELYSR
        MER+SAACAMEWSIELEKALR KKPG+AVEAI QIG RLQQWSREPEPNVAVYNMFDLVTWEDRLFSNTILLRLADAFKFDDK+IRLAVVRVFLSELYSR
Subjt:  MERSSAACAMEWSIELEKALRLKKPGRAVEAIRQIGCRLQQWSREPEPNVAVYNMFDLVTWEDRLFSNTILLRLADAFKFDDKHIRLAVVRVFLSELYSR

Query:  DSSRSKQYQGILSKARVQNHHELLTRVKVVLNGGDPEARGLALILLGCWAHFAKDSAQIRYLIFSSLFSSHLSEVKASIFAAACICQLADDFAQVFLAIL
        DS+RSKQYQGILSKAR+QNHHELL+RVKVVLNGGDPEAR LALILLGCWAHFAKDSAQIRYLIFSS++SSHLSEVKASIFAAACI QLADDFAQVFL IL
Subjt:  DSSRSKQYQGILSKARVQNHHELLTRVKVVLNGGDPEARGLALILLGCWAHFAKDSAQIRYLIFSSLFSSHLSEVKASIFAAACICQLADDFAQVFLAIL

Query:  VNIMTSTTSLTIRMAGARVFAKLGCSHSMAKTAYKVML
        VNIMTSTTS+ IRMAGARVFAKLGCSHS+AK AYK  L
Subjt:  VNIMTSTTSLTIRMAGARVFAKLGCSHSMAKTAYKVML

TrEMBL top hits | e value | % identity
A0A0A0KJG8 Uncharacterized protein | 4.5e-127 | 100
Query:  MERSSAACAMEWSIELEKALRLKKPGRAVEAIRQIGCRLQQWSREPEPNVAVYNMFDLVTWEDRLFSNTILLRLADAFKFDDKHIRLAVVRVFLSELYSR
        MERSSAACAMEWSIELEKALRLKKPGRAVEAIRQIGCRLQQWSREPEPNVAVYNMFDLVTWEDRLFSNTILLRLADAFKFDDKHIRLAVVRVFLSELYSR
Subjt:  MERSSAACAMEWSIELEKALRLKKPGRAVEAIRQIGCRLQQWSREPEPNVAVYNMFDLVTWEDRLFSNTILLRLADAFKFDDKHIRLAVVRVFLSELYSR

Query:  DSSRSKQYQGILSKARVQNHHELLTRVKVVLNGGDPEARGLALILLGCWAHFAKDSAQIRYLIFSSLFSSHLSEVKASIFAAACICQLADDFAQVFLAIL
        DSSRSKQYQGILSKARVQNHHELLTRVKVVLNGGDPEARGLALILLGCWAHFAKDSAQIRYLIFSSLFSSHLSEVKASIFAAACICQLADDFAQVFLAIL
Subjt:  DSSRSKQYQGILSKARVQNHHELLTRVKVVLNGGDPEARGLALILLGCWAHFAKDSAQIRYLIFSSLFSSHLSEVKASIFAAACICQLADDFAQVFLAIL

Query:  VNIMTSTTSLTIRMAGARVFAKLGCSHSMAKTAYKVMLVF
        VNIMTSTTSLTIRMAGARVFAKLGCSHSMAKTAYKVMLVF
Subjt:  VNIMTSTTSLTIRMAGARVFAKLGCSHSMAKTAYKVMLVF

A0A1S3CMM3 uncharacterized protein LOC103502541 isoform X1 | 1.6e-116 | 93.28
Query:  MERSSAACAMEWSIELEKALRLKKPGRAVEAIRQIGCRLQQWSREPEPNVAVYNMFDLVTWEDRLFSNTILLRLADAFKFDDKHIRLAVVRVFLSELYSR
        MER+SAACAMEWSIELEKALR KKPGRAVEAIRQIGCRLQQWSREPEPN+AVYNMFDLVTWED+LFSNTILLRLADAFK DDKHIRLAVVRVFLSELYSR
Subjt:  MERSSAACAMEWSIELEKALRLKKPGRAVEAIRQIGCRLQQWSREPEPNVAVYNMFDLVTWEDRLFSNTILLRLADAFKFDDKHIRLAVVRVFLSELYSR

Query:  DSSRSKQYQGILSKARVQNHHELLTRVKVVLNGGDPEARGLALILLGCWAHFAKDSAQIRYLIFSSLFSSHLSEVKASIFAAACICQLADDFAQVFLAIL
        DSSRSKQYQGILSKARVQN HELLTRVKVVL+GGDPEA+ LALI+LGCWAHFAKDSAQIRYLIF SLFSSHLSEVKASIFAAACI QLADDFAQVFL IL
Subjt:  DSSRSKQYQGILSKARVQNHHELLTRVKVVLNGGDPEARGLALILLGCWAHFAKDSAQIRYLIFSSLFSSHLSEVKASIFAAACICQLADDFAQVFLAIL

Query:  VNIMTSTTSLTIRMAGARVFAKLGCSHSMAKTAYKVML
        VNIMTSTTSL IRMAGARVFAKLGCSHSMAKTAYK  L
Subjt:  VNIMTSTTSLTIRMAGARVFAKLGCSHSMAKTAYKVML

A0A5A7TH30 Integrator complex subunit 7 | 4.5e-119 | 94.17
Query:  MERSSAACAMEWSIELEKALRLKKPGRAVEAIRQIGCRLQQWSREPEPNVAVYNMFDLVTWEDRLFSNTILLRLADAFKFDDKHIRLAVVRVFLSELYSR
        MER+SAACAMEWSIELEKALR KKPGRAVEAIRQIGCRLQQWSREPEPN+AVYNMFDLVTWED+LFSNTILLRLADAFK DDKHIRLAVVRVFLSELYSR
Subjt:  MERSSAACAMEWSIELEKALRLKKPGRAVEAIRQIGCRLQQWSREPEPNVAVYNMFDLVTWEDRLFSNTILLRLADAFKFDDKHIRLAVVRVFLSELYSR

Query:  DSSRSKQYQGILSKARVQNHHELLTRVKVVLNGGDPEARGLALILLGCWAHFAKDSAQIRYLIFSSLFSSHLSEVKASIFAAACICQLADDFAQVFLAIL
        DSSRSKQYQGILSKARVQN HELLTRVKVVL+GGDPEA+ LALI+LGCWAHFAKDSAQIRYLIF SLFSSHLSEVKASIFAAACI QLADDFAQVFL IL
Subjt:  DSSRSKQYQGILSKARVQNHHELLTRVKVVLNGGDPEARGLALILLGCWAHFAKDSAQIRYLIFSSLFSSHLSEVKASIFAAACICQLADDFAQVFLAIL

Query:  VNIMTSTTSLTIRMAGARVFAKLGCSHSMAKTAYKVMLVF
        VNIMTSTTSL IRMAGARVFAKLGCSHSMAKTAYKVMLVF
Subjt:  VNIMTSTTSLTIRMAGARVFAKLGCSHSMAKTAYKVMLVF

A0A6J1FP76 uncharacterized protein LOC111447255 | 3.8e-110 | 88.24
Query:  MERSSAACAMEWSIELEKALRLKKPGRAVEAIRQIGCRLQQWSREPEPNVAVYNMFDLVTWEDRLFSNTILLRLADAFKFDDKHIRLAVVRVFLSELYSR
        MER++AACAMEWSIELEKALR KKPGRAVEAI QIG RLQQWSREPEPN+AVYNMFDLVTWEDRLFSNTILLRLADAFK DDKHIR+AVV+VFLSEL SR
Subjt:  MERSSAACAMEWSIELEKALRLKKPGRAVEAIRQIGCRLQQWSREPEPNVAVYNMFDLVTWEDRLFSNTILLRLADAFKFDDKHIRLAVVRVFLSELYSR

Query:  DSSRSKQYQGILSKARVQNHHELLTRVKVVLNGGDPEARGLALILLGCWAHFAKDSAQIRYLIFSSLFSSHLSEVKASIFAAACICQLADDFAQVFLAIL
        D ++SKQYQG+LSKARVQNHHELLTRVKVVL GGDPEAR LALILLGCWAHFA+ SAQIRY+I SS+ SSH+SEVKASIFAAACI QLADDFAQVFLAIL
Subjt:  DSSRSKQYQGILSKARVQNHHELLTRVKVVLNGGDPEARGLALILLGCWAHFAKDSAQIRYLIFSSLFSSHLSEVKASIFAAACICQLADDFAQVFLAIL

Query:  VNIMTSTTSLTIRMAGARVFAKLGCSHSMAKTAYKVML
        VNIMTSTTSL I+MAGARVFAKLGCSHSMAKTAYK  L
Subjt:  VNIMTSTTSLTIRMAGARVFAKLGCSHSMAKTAYKVML

A0A6J1JBG3 uncharacterized protein LOC111485260 | 6.1e-108 | 86.97
Query:  MERSSAACAMEWSIELEKALRLKKPGRAVEAIRQIGCRLQQWSREPEPNVAVYNMFDLVTWEDRLFSNTILLRLADAFKFDDKHIRLAVVRVFLSELYSR
        MER++AA AMEWSIELEKALR KKPGRAVEAI QIG RLQQWSREPEPN+AVYNMFDLVTWEDRLFSNTILLRLADAFK DDKHIR+AVVRVFLSEL SR
Subjt:  MERSSAACAMEWSIELEKALRLKKPGRAVEAIRQIGCRLQQWSREPEPNVAVYNMFDLVTWEDRLFSNTILLRLADAFKFDDKHIRLAVVRVFLSELYSR

Query:  DSSRSKQYQGILSKARVQNHHELLTRVKVVLNGGDPEARGLALILLGCWAHFAKDSAQIRYLIFSSLFSSHLSEVKASIFAAACICQLADDFAQVFLAIL
        D ++S+QYQG+LSKARVQNHHELLTRVKVVL GGDPEAR LALILLGCWAHFA+ SAQIRY+I  SL SSH+SEVKASIFAAACI QLADDFA+VFLAIL
Subjt:  DSSRSKQYQGILSKARVQNHHELLTRVKVVLNGGDPEARGLALILLGCWAHFAKDSAQIRYLIFSSLFSSHLSEVKASIFAAACICQLADDFAQVFLAIL

Query:  VNIMTSTTSLTIRMAGARVFAKLGCSHSMAKTAYKVML
        VNIMTSTTSL ++MAGARVFAKLGCSHSMAKTAYK  L
Subjt:  VNIMTSTTSLTIRMAGARVFAKLGCSHSMAKTAYKVML

SwissProt top hits | e value | % identity
Q54PL2 Integrator complex subunit 7 homolog | 8.4e-06 | 26.26
Query:  LFSNTILLRLADAFKFDDKHIRLAVVRVF---LSELYSRDSSRSKQYQGILSKARVQNHHELLTRVKVVLNGGDPEARGLALILLGCWAHFAKDSAQIRY
        L  N+++ RL+D F+     ++  +++VF    SE++                 +V N  E+L R+  V+   DP AR L+L +LG   H   D   I +
Subjt:  LFSNTILLRLADAFKFDDKHIRLAVVRVF---LSELYSRDSSRSKQYQGILSKARVQNHHELLTRVKVVLNGGDPEARGLALILLGCWAHFAKDSAQIRY

Query:  LIFSSLFSSHLSEVKASIFAAACICQLADDFAQVFLAILVNIMTSTTSLTI-RMAGARVFAKLGCSHSMAKTAYKVMLV
         I + + S    E++A+IF    +C+++  F+   +  +  ++ +  +  I ++   R+F  +  SHS+A T  K MLV
Subjt:  LIFSSLFSSHLSEVKASIFAAACICQLADDFAQVFLAILVNIMTSTTSLTI-RMAGARVFAKLGCSHSMAKTAYKVMLV

Arabidopsis top hits | e value | % identity
AT4G20060.1 ARM repeat superfamily protein | 2.4e-64 | 56.02
Query:  MERSSAACAMEWSIELEKALRLKKPGRAVEAIRQIGCRLQQWSREPEPNVAVYNMFDLVTWEDRLFSNTILLRLADAFKFDDKHIRLAVVRVFLSEL-YS
        ME+ SAACAMEWSI+LEK+LR K   +AVEAI + G +L+QWS+EPE  +AVYN+F LV  ED+LFSNTILLRL DAF   DK I+LAVVRVF+S    S
Subjt:  MERSSAACAMEWSIELEKALRLKKPGRAVEAIRQIGCRLQQWSREPEPNVAVYNMFDLVTWEDRLFSNTILLRLADAFKFDDKHIRLAVVRVFLSEL-YS

Query:  RDSSRSKQYQGILSKARVQNHHELLTRVKVVLNGGDPEARGLALILLGCWAHFAKDSAQIRYLIFSSLFSSHLSEVKASIFAAACICQLADDFAQVFLAI
        R  + ++     LSK RV NH ELLTRVK V + GD E++ LALIL GCW  FA + A +RYL+FSS+ S H  E ++++FAAAC C++ADDFA V L +
Subjt:  RDSSRSKQYQGILSKARVQNHHELLTRVKVVLNGGDPEARGLALILLGCWAHFAKDSAQIRYLIFSSLFSSHLSEVKASIFAAACICQLADDFAQVFLAI

Query:  LVNIMTSTTSLT--IRMAGARVFAKLGCSHSMAKTAYKVML
        L N M     +T   R+A  RVFAK+GCSH++A  A+K+ +
Subjt:  LVNIMTSTTSLT--IRMAGARVFAKLGCSHSMAKTAYKVML


Sequences
CDS sequence
ATGGAGAGAAGTTCTGCAGCTTGCGCTATGGAATGGAGTATTGAGCTGGAGAAGGCTCTCCGTTTGAAGAAACCAGGTCGGGCTGTTGAAGCTATACGTCAGATTGGCTG
TCGACTTCAGCAATGGAGTAGAGAGCCAGAACCCAATGTAGCTGTATATAATATGTTTGACCTTGTTACTTGGGAGGATAGGCTATTTTCCAACACTATTCTCCTACGGC
TTGCTGATGCCTTTAAGTTTGATGATAAGCATATTAGACTTGCAGTTGTTAGAGTTTTCTTATCCGAGCTATATAGCCGCGACAGCTCACGAAGTAAACAATACCAAGGG
ATTCTTTCAAAGGCAAGGGTGCAAAATCACCATGAATTACTTACTCGAGTCAAGGTTGTTCTTAATGGAGGGGATCCTGAGGCTAGAGGTCTAGCTTTGATTCTATTGGG
ATGTTGGGCACATTTTGCAAAAGACAGTGCCCAGATACGTTATTTGATATTTTCTAGTCTGTTTTCTTCTCATCTTTCGGAGGTAAAAGCGTCAATATTTGCTGCAGCAT
GCATTTGTCAGTTAGCAGATGACTTTGCGCAAGTTTTCTTAGCGATTTTGGTTAATATAATGACTTCTACTACATCCTTGACCATCAGAATGGCTGGAGCTCGAGTTTTT
GCAAAACTGGGATGCTCACATTCAATGGCCAAAACTGCTTATAAGGTTATGCTCGTCTTTTGA
mRNA sequence
CAAATACTACCAAGCTCTCTACTACTCAACCTTCTTCTTCTTCTTTCTTTCTCTTGATTCCCTAACGTTAAGTTTGTAAAGAAACCAAGGAATACACTTTCTATAATCCC
CTGATTCAAAATTGTACTCCATTTTACCTCCAATTCCACAGGGTTTCTGCTTCCCCATTTGTGTTATAGGGTTCCACAGAAGATGGCATTGGATGAGAATATTGTGTTGC
GCGATGTCACTAATGCTGGCATTGTCATCAGCGACCGCACTGTCAAACACCAGAGGCGCTGTATTAAGTAATTGGGTTTGAAGATTTGGATGCCGTGTGCTGTGGTAGGG
ACTATTTGGTGATGGTTTTATTAAGGGGTTCTATGGTTTCAGAGTTTATGTGTGTCTGTGACTAGCTATCATTTTTGGTCATTTGCTACAAATATGGAGAGAAGTTCTGC
AGCTTGCGCTATGGAATGGAGTATTGAGCTGGAGAAGGCTCTCCGTTTGAAGAAACCAGGTCGGGCTGTTGAAGCTATACGTCAGATTGGCTGTCGACTTCAGCAATGGA
GTAGAGAGCCAGAACCCAATGTAGCTGTATATAATATGTTTGACCTTGTTACTTGGGAGGATAGGCTATTTTCCAACACTATTCTCCTACGGCTTGCTGATGCCTTTAAG
TTTGATGATAAGCATATTAGACTTGCAGTTGTTAGAGTTTTCTTATCCGAGCTATATAGCCGCGACAGCTCACGAAGTAAACAATACCAAGGGATTCTTTCAAAGGCAAG
GGTGCAAAATCACCATGAATTACTTACTCGAGTCAAGGTTGTTCTTAATGGAGGGGATCCTGAGGCTAGAGGTCTAGCTTTGATTCTATTGGGATGTTGGGCACATTTTG
CAAAAGACAGTGCCCAGATACGTTATTTGATATTTTCTAGTCTGTTTTCTTCTCATCTTTCGGAGGTAAAAGCGTCAATATTTGCTGCAGCATGCATTTGTCAGTTAGCA
GATGACTTTGCGCAAGTTTTCTTAGCGATTTTGGTTAATATAATGACTTCTACTACATCCTTGACCATCAGAATGGCTGGAGCTCGAGTTTTTGCAAAACTGGGATGCTC
ACATTCAATGGCCAAAACTGCTTATAAGGTTATGCTCGTCTTTTGA
Protein sequence
MERSSAACAMEWSIELEKALRLKKPGRAVEAIRQIGCRLQQWSREPEPNVAVYNMFDLVTWEDRLFSNTILLRLADAFKFDDKHIRLAVVRVFLSELYSRDSSRSKQYQG
ILSKARVQNHHELLTRVKVVLNGGDPEARGLALILLGCWAHFAKDSAQIRYLIFSSLFSSHLSEVKASIFAAACICQLADDFAQVFLAILVNIMTSTTSLTIRMAGARVF
AKLGCSHSMAKTAYKVMLVF
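
The protein sequence above corresponds to a translation of the CDS. The sketch below shows that check on the first ten and last six codons of the CDS, assuming the standard nuclear genetic code. The `translate` helper and the variable names are hypothetical, and only these short fragments are embedded rather than the full 723-nt sequence.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# Fragments copied from the CDS sequence listed above.
cds_start = "ATGGAGAGAAGTTCTGCAGCTTGCGCTATG"   # first 10 codons
cds_end = "GTTATGCTCGTCTTTTGA"                 # last 6 codons, including stop

print(translate(cds_start))  # MERSSAACAM
print(translate(cds_end))    # VMLVF*
```

Both fragments agree with the start (`MERSSAACAM`) and end (`VMLVF`) of the listed protein. The same function can be applied to the full CDS once it is joined into a single string.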