; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI01G01660 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI01G01660
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionDUF4228 domain-containing protein
Genome locationChr1:1110084..1112053
RNA-Seq ExpressionCSPI01G01660
SyntenyCSPI01G01660
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR025322 - Protein of unknown function DUF4228, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0057887.1 DUF4228 domain-containing protein [Cucumis melo var. makuwa]4.6e-8997.87Show/hide
Query:  MGNCQAIDAATLVIQHPSGKEDKLYWPVTAREIMKMNPGHYVALLISTTMFTPNESNNSNQTSNETSSNSVRLTRIKLLRPADMLVLGQVYRLITTQEVM
        MGNCQAIDAATLVIQHPSGKEDKLYWPVTAREIMKMNPGHYVALLISTTMFTPNESNNSNQTSNETSSNSVRLTRIKLLRPADMLVLGQVYRLITTQEVM
Subjt:  MGNCQAIDAATLVIQHPSGKEDKLYWPVTAREIMKMNPGHYVALLISTTMFTPNESNNSNQTSNETSSNSVRLTRIKLLRPADMLVLGQVYRLITTQEVM

Query:  KGLSAKKQAKVKQSQLEAADKPDRRKQRTTRSSDAAAAAAGRSVSEDQIQANKHEKNNRPRTSTSTTSATARSRTWQPSLHSISEAGS
        KGLSAKKQAKVKQSQLEAADKPDRRK+RTTRSSD   AAAGRSVSEDQIQANKHEKNNRPRTSTSTTSATARSRTWQPSLHSISEAGS
Subjt:  KGLSAKKQAKVKQSQLEAADKPDRRKQRTTRSSDAAAAAAGRSVSEDQIQANKHEKNNRPRTSTSTTSATARSRTWQPSLHSISEAGS

XP_004138125.1 uncharacterized protein LOC101211887 [Cucumis sativus]7.6e-9299.47Show/hide
Query:  MGNCQAIDAATLVIQHPSGKEDKLYWPVTAREIMKMNPGHYVALLISTTMFTPNESNNSNQTSNETSSNSVRLTRIKLLRPADMLVLGQVYRLITTQEVM
        MGNCQAIDAATLVIQHPSGKEDKLYWPVTAREIMKMNPGHYVALLISTTMFTPNESNN+NQTSNETSSNSVRLTRIKLLRPADMLVLGQVYRLITTQEVM
Subjt:  MGNCQAIDAATLVIQHPSGKEDKLYWPVTAREIMKMNPGHYVALLISTTMFTPNESNNSNQTSNETSSNSVRLTRIKLLRPADMLVLGQVYRLITTQEVM

Query:  KGLSAKKQAKVKQSQLEAADKPDRRKQRTTRSSDAAAAAAGRSVSEDQIQANKHEKNNRPRTSTSTTSATARSRTWQPSLHSISEAGS
        KGLSAKKQAKVKQSQLEAADKPDRRKQRTTRSSDAAAAAAGRSVSEDQIQANKHEKNNRPRTSTSTTSATARSRTWQPSLHSISEAGS
Subjt:  KGLSAKKQAKVKQSQLEAADKPDRRKQRTTRSSDAAAAAAGRSVSEDQIQANKHEKNNRPRTSTSTTSATARSRTWQPSLHSISEAGS

XP_008453120.1 PREDICTED: uncharacterized protein LOC103493930 [Cucumis melo]4.6e-8997.87Show/hide
Query:  MGNCQAIDAATLVIQHPSGKEDKLYWPVTAREIMKMNPGHYVALLISTTMFTPNESNNSNQTSNETSSNSVRLTRIKLLRPADMLVLGQVYRLITTQEVM
        MGNCQAIDAATLVIQHPSGKEDKLYWPVTAREIMKMNPGHYVALLISTTMFTPNESNNSNQTSNETSSNSVRLTRIKLLRPADMLVLGQVYRLITTQEVM
Subjt:  MGNCQAIDAATLVIQHPSGKEDKLYWPVTAREIMKMNPGHYVALLISTTMFTPNESNNSNQTSNETSSNSVRLTRIKLLRPADMLVLGQVYRLITTQEVM

Query:  KGLSAKKQAKVKQSQLEAADKPDRRKQRTTRSSDAAAAAAGRSVSEDQIQANKHEKNNRPRTSTSTTSATARSRTWQPSLHSISEAGS
        KGLSAKKQAKVKQSQLEAADKPDRRK+RTTRSSD   AAAGRSVSEDQIQANKHEKNNRPRTSTSTTSATARSRTWQPSLHSISEAGS
Subjt:  KGLSAKKQAKVKQSQLEAADKPDRRKQRTTRSSDAAAAAAGRSVSEDQIQANKHEKNNRPRTSTSTTSATARSRTWQPSLHSISEAGS

XP_022933526.1 uncharacterized protein LOC111440926 isoform X2 [Cucurbita moschata]2.9e-7581.46Show/hide
Query:  MGNCQAIDAATLVIQHPSGKEDKLYWPVTAREIMKMNPGHYVALLISTTMFTPNESNNSNQTS----------------NETSSNSVRLTRIKLLRPADM
        MGNCQAIDAATLVIQHPSGK DKLYWPVTAREIMKMNPGHYVALLISTTM TPNES+N+N+TS                NE SSNSVRLTRIKLLRPAD 
Subjt:  MGNCQAIDAATLVIQHPSGKEDKLYWPVTAREIMKMNPGHYVALLISTTMFTPNESNNSNQTS----------------NETSSNSVRLTRIKLLRPADM

Query:  LVLGQVYRLITTQEVMKGLSAKKQAKVKQSQLEAADKPDRRKQRTTRSSD-AAAAAAGRSVSEDQIQANKHEKNNRPRTSTSTTSATARSRTWQPSLHSI
        LVLGQVYRLIT++EVM GLSAKKQAKVKQSQLEAA+K  RRK+R  R SD AAAAAAGRSVSED  QA KHEKNNRPRTSTSTTSA ARSRTWQPSLHSI
Subjt:  LVLGQVYRLITTQEVMKGLSAKKQAKVKQSQLEAADKPDRRKQRTTRSSD-AAAAAAGRSVSEDQIQANKHEKNNRPRTSTSTTSATARSRTWQPSLHSI

Query:  SEAGS
        SEAGS
Subjt:  SEAGS

XP_038880684.1 uncharacterized protein LOC120072302 [Benincasa hispida]6.0e-8192.06Show/hide
Query:  MGNCQAIDAATLVIQHPSGKEDKLYWPVTAREIMKMNPGHYVALLISTTMFTPNESNNSNQTSNETSSNSVRLTRIKLLRPADMLVLGQVYRLITTQEVM
        MGNCQAIDAATLVIQHPSGKEDKLYWPVTAREIMKMNPGHYVALLISTTMFTPNESN +N+T+N    NSVRLTRIKLLRPADMLVLGQVYRLITTQEVM
Subjt:  MGNCQAIDAATLVIQHPSGKEDKLYWPVTAREIMKMNPGHYVALLISTTMFTPNESNNSNQTSNETSSNSVRLTRIKLLRPADMLVLGQVYRLITTQEVM

Query:  KGLSAKKQAKVKQSQLEAADKPDRRKQRTTRSSDAAAAAAGRSVSEDQIQANKHEKNNRPRT-STSTTSATARSRTWQPSLHSISEAGS
        KGLSAKKQAKVKQSQLEAADKPDRRK+RTTRSSD AAAAAGRSVSED IQA KHEKNNRPRT STSTTS  ARSRTWQPSLHSISEAGS
Subjt:  KGLSAKKQAKVKQSQLEAADKPDRRKQRTTRSSDAAAAAAGRSVSEDQIQANKHEKNNRPRT-STSTTSATARSRTWQPSLHSISEAGS

TrEMBL top hitse value%identityAlignment
A0A0A0LRI4 Uncharacterized protein3.7e-9299.47Show/hide
Query:  MGNCQAIDAATLVIQHPSGKEDKLYWPVTAREIMKMNPGHYVALLISTTMFTPNESNNSNQTSNETSSNSVRLTRIKLLRPADMLVLGQVYRLITTQEVM
        MGNCQAIDAATLVIQHPSGKEDKLYWPVTAREIMKMNPGHYVALLISTTMFTPNESNN+NQTSNETSSNSVRLTRIKLLRPADMLVLGQVYRLITTQEVM
Subjt:  MGNCQAIDAATLVIQHPSGKEDKLYWPVTAREIMKMNPGHYVALLISTTMFTPNESNNSNQTSNETSSNSVRLTRIKLLRPADMLVLGQVYRLITTQEVM

Query:  KGLSAKKQAKVKQSQLEAADKPDRRKQRTTRSSDAAAAAAGRSVSEDQIQANKHEKNNRPRTSTSTTSATARSRTWQPSLHSISEAGS
        KGLSAKKQAKVKQSQLEAADKPDRRKQRTTRSSDAAAAAAGRSVSEDQIQANKHEKNNRPRTSTSTTSATARSRTWQPSLHSISEAGS
Subjt:  KGLSAKKQAKVKQSQLEAADKPDRRKQRTTRSSDAAAAAAGRSVSEDQIQANKHEKNNRPRTSTSTTSATARSRTWQPSLHSISEAGS

A0A1S3BVG8 uncharacterized protein LOC1034939302.2e-8997.87Show/hide
Query:  MGNCQAIDAATLVIQHPSGKEDKLYWPVTAREIMKMNPGHYVALLISTTMFTPNESNNSNQTSNETSSNSVRLTRIKLLRPADMLVLGQVYRLITTQEVM
        MGNCQAIDAATLVIQHPSGKEDKLYWPVTAREIMKMNPGHYVALLISTTMFTPNESNNSNQTSNETSSNSVRLTRIKLLRPADMLVLGQVYRLITTQEVM
Subjt:  MGNCQAIDAATLVIQHPSGKEDKLYWPVTAREIMKMNPGHYVALLISTTMFTPNESNNSNQTSNETSSNSVRLTRIKLLRPADMLVLGQVYRLITTQEVM

Query:  KGLSAKKQAKVKQSQLEAADKPDRRKQRTTRSSDAAAAAAGRSVSEDQIQANKHEKNNRPRTSTSTTSATARSRTWQPSLHSISEAGS
        KGLSAKKQAKVKQSQLEAADKPDRRK+RTTRSSD   AAAGRSVSEDQIQANKHEKNNRPRTSTSTTSATARSRTWQPSLHSISEAGS
Subjt:  KGLSAKKQAKVKQSQLEAADKPDRRKQRTTRSSDAAAAAAGRSVSEDQIQANKHEKNNRPRTSTSTTSATARSRTWQPSLHSISEAGS

A0A5A7UT51 DUF4228 domain-containing protein2.2e-8997.87Show/hide
Query:  MGNCQAIDAATLVIQHPSGKEDKLYWPVTAREIMKMNPGHYVALLISTTMFTPNESNNSNQTSNETSSNSVRLTRIKLLRPADMLVLGQVYRLITTQEVM
        MGNCQAIDAATLVIQHPSGKEDKLYWPVTAREIMKMNPGHYVALLISTTMFTPNESNNSNQTSNETSSNSVRLTRIKLLRPADMLVLGQVYRLITTQEVM
Subjt:  MGNCQAIDAATLVIQHPSGKEDKLYWPVTAREIMKMNPGHYVALLISTTMFTPNESNNSNQTSNETSSNSVRLTRIKLLRPADMLVLGQVYRLITTQEVM

Query:  KGLSAKKQAKVKQSQLEAADKPDRRKQRTTRSSDAAAAAAGRSVSEDQIQANKHEKNNRPRTSTSTTSATARSRTWQPSLHSISEAGS
        KGLSAKKQAKVKQSQLEAADKPDRRK+RTTRSSD   AAAGRSVSEDQIQANKHEKNNRPRTSTSTTSATARSRTWQPSLHSISEAGS
Subjt:  KGLSAKKQAKVKQSQLEAADKPDRRKQRTTRSSDAAAAAAGRSVSEDQIQANKHEKNNRPRTSTSTTSATARSRTWQPSLHSISEAGS

A0A5D3CU41 DUF4228 domain-containing protein2.2e-8997.87Show/hide
Query:  MGNCQAIDAATLVIQHPSGKEDKLYWPVTAREIMKMNPGHYVALLISTTMFTPNESNNSNQTSNETSSNSVRLTRIKLLRPADMLVLGQVYRLITTQEVM
        MGNCQAIDAATLVIQHPSGKEDKLYWPVTAREIMKMNPGHYVALLISTTMFTPNESNNSNQTSNETSSNSVRLTRIKLLRPADMLVLGQVYRLITTQEVM
Subjt:  MGNCQAIDAATLVIQHPSGKEDKLYWPVTAREIMKMNPGHYVALLISTTMFTPNESNNSNQTSNETSSNSVRLTRIKLLRPADMLVLGQVYRLITTQEVM

Query:  KGLSAKKQAKVKQSQLEAADKPDRRKQRTTRSSDAAAAAAGRSVSEDQIQANKHEKNNRPRTSTSTTSATARSRTWQPSLHSISEAGS
        KGLSAKKQAKVKQSQLEAADKPDRRK+RTTRSSD   AAAGRSVSEDQIQANKHEKNNRPRTSTSTTSATARSRTWQPSLHSISEAGS
Subjt:  KGLSAKKQAKVKQSQLEAADKPDRRKQRTTRSSDAAAAAAGRSVSEDQIQANKHEKNNRPRTSTSTTSATARSRTWQPSLHSISEAGS

A0A6J1EZA8 uncharacterized protein LOC111440926 isoform X21.4e-7581.46Show/hide
Query:  MGNCQAIDAATLVIQHPSGKEDKLYWPVTAREIMKMNPGHYVALLISTTMFTPNESNNSNQTS----------------NETSSNSVRLTRIKLLRPADM
        MGNCQAIDAATLVIQHPSGK DKLYWPVTAREIMKMNPGHYVALLISTTM TPNES+N+N+TS                NE SSNSVRLTRIKLLRPAD 
Subjt:  MGNCQAIDAATLVIQHPSGKEDKLYWPVTAREIMKMNPGHYVALLISTTMFTPNESNNSNQTS----------------NETSSNSVRLTRIKLLRPADM

Query:  LVLGQVYRLITTQEVMKGLSAKKQAKVKQSQLEAADKPDRRKQRTTRSSD-AAAAAAGRSVSEDQIQANKHEKNNRPRTSTSTTSATARSRTWQPSLHSI
        LVLGQVYRLIT++EVM GLSAKKQAKVKQSQLEAA+K  RRK+R  R SD AAAAAAGRSVSED  QA KHEKNNRPRTSTSTTSA ARSRTWQPSLHSI
Subjt:  LVLGQVYRLITTQEVMKGLSAKKQAKVKQSQLEAADKPDRRKQRTTRSSD-AAAAAAGRSVSEDQIQANKHEKNNRPRTSTSTTSATARSRTWQPSLHSI

Query:  SEAGS
        SEAGS
Subjt:  SEAGS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G10530.1 unknown protein1.0e-2541.8Show/hide
Query:  MGNCQAIDAATLVIQHPSGKEDKLYWPVTAREIMKMNPGHYVALLISTTMFTPNESNNSNQTSNETSSNSVRLTRIKLLRPADMLVLGQVYRLITTQEVM
        MGNCQA++AA LV+QHP G  D+ Y  V+  E+M M PGHYV+L+I  +         + +  ++    +VR TR++LLRP + LVLG  YRLIT+QEVM
Subjt:  MGNCQAIDAATLVIQHPSGKEDKLYWPVTAREIMKMNPGHYVALLISTTMFTPNESNNSNQTSNETSSNSVRLTRIKLLRPADMLVLGQVYRLITTQEVM

Query:  KGLSAKKQAKVKQSQLEAADKPDRRKQRTTRSSDAAAAAAGRSVSEDQIQANKHEKNNRP-RTSTSTTSATARSRTWQPSLHSISEAGS
        K L  KK AK K+ Q+E          +TT           +  S+ ++   K  K  R  R STS      +S+TW+PSL SISEA S
Subjt:  KGLSAKKQAKVKQSQLEAADKPDRRKQRTTRSSDAAAAAAGRSVSEDQIQANKHEKNNRP-RTSTSTTSATARSRTWQPSLHSISEAGS

AT1G60010.1 unknown protein2.5e-3246.28Show/hide
Query:  MGNCQAIDAATLVIQHPSGKEDKLYWPVTAREIMKMNPGHYVALLISTTMFTPNESNNSNQTSNETSSNSVRLTRIKLLRPADMLVLGQVYRLITTQEVM
        MGNCQA+DAA LV+QHP GK D+ Y PV+  EIM+M PGHYV+L+I   +   N    +  T +++    VR TR+KLLRP + LVLG  YRLIT+QEVM
Subjt:  MGNCQAIDAATLVIQHPSGKEDKLYWPVTAREIMKMNPGHYVALLISTTMFTPNESNNSNQTSNETSSNSVRLTRIKLLRPADMLVLGQVYRLITTQEVM

Query:  KGLSAKKQAKVKQSQLEAADKPDRRKQRTTRSSDAAAAAAGRSVSEDQIQANKHEKNNRPRTSTSTTSATARSRTWQPSLHSISEAGS
        K L AKK AK K+ Q E +      K++   SS+       + + E+  +    E  +  + S  T SA++RS+TW+PSL SISEA S
Subjt:  KGLSAKKQAKVKQSQLEAADKPDRRKQRTTRSSDAAAAAAGRSVSEDQIQANKHEKNNRPRTSTSTTSATARSRTWQPSLHSISEAGS

AT5G50090.1 unknown protein2.6e-3449.47Show/hide
Query:  MGNCQAIDAATLVIQHPSGKEDKLYWPVTAREIMKMNPGHYVALLISTTMFTPNESNNSNQTSNETSSNSVRLTRIKLLRPADMLVLGQVYRLITTQEVM
        MGNCQA+D A +VIQHP+GKE+KL  PV+A  +MKMNPGH V+LLISTT  +   S +            +RLTRIKLLRP D LVLG VYRLITT+EVM
Subjt:  MGNCQAIDAATLVIQHPSGKEDKLYWPVTAREIMKMNPGHYVALLISTTMFTPNESNNSNQTSNETSSNSVRLTRIKLLRPADMLVLGQVYRLITTQEVM

Query:  KGLSAKKQAKVKQSQLEAADKPDRRKQRTTRSSDAAAAAAGRSVSEDQIQANKHEKNNRPRTSTSTTSATARSRTWQPSLHSISEAGS
        KGL AKK +K+K+    + DK +  K   +   D          +EDQ+Q  K EK             +  SR+WQPSL SISE GS
Subjt:  KGLSAKKQAKVKQSQLEAADKPDRRKQRTTRSSDAAAAAAGRSVSEDQIQANKHEKNNRPRTSTSTTSATARSRTWQPSLHSISEAGS

AT5G50090.2 unknown protein1.2e-3147.34Show/hide
Query:  MGNCQAIDAATLVIQHPSGKEDKLYWPVTAREIMKMNPGHYVALLISTTMFTPNESNNSNQTSNETSSNSVRLTRIKLLRPADMLVLGQVYRLITTQEVM
        MGNCQA+D A +VIQHP+GKE+KL  PV+A  +MKMNPGH V+LLISTT  +   S +            +RLTRIKLLRP D LVLG VYRLITT+EVM
Subjt:  MGNCQAIDAATLVIQHPSGKEDKLYWPVTAREIMKMNPGHYVALLISTTMFTPNESNNSNQTSNETSSNSVRLTRIKLLRPADMLVLGQVYRLITTQEVM

Query:  KGLSAKKQAKVKQSQLEAADKPDRRKQRTTRSSDAAAAAAGRSVSEDQIQANKHEKNNRPRTSTSTTSATARSRTWQPSLHSISEAGS
        KGL AKK +K+K+    + DK +  K   +   D                 N+ ++  R R           SR+WQPSL SISE GS
Subjt:  KGLSAKKQAKVKQSQLEAADKPDRRKQRTTRSSDAAAAAAGRSVSEDQIQANKHEKNNRPRTSTSTTSATARSRTWQPSLHSISEAGS

AT5G62900.1 unknown protein4.1e-2743.62Show/hide
Query:  MGNCQAIDAATLVIQHPSGKEDKLYWPVTAREIMKMNPGHYVALLISTTMFTPNESNNSNQTSNETSSNSVRLTRIKLLRPADMLVLGQVYRLITTQEVM
        MGNCQA +AAT VIQ P GK  + Y  V A E++K +PGH+VALL+S+ +                   S+R+TRIKLLRP+D L+LG VYRLI+++EVM
Subjt:  MGNCQAIDAATLVIQHPSGKEDKLYWPVTAREIMKMNPGHYVALLISTTMFTPNESNNSNQTSNETSSNSVRLTRIKLLRPADMLVLGQVYRLITTQEVM

Query:  KGLSAKKQAKVKQSQLEAADKPDRRKQRTTRSSDAAAAAAGRSVSEDQIQANKHEKNNRPRTSTSTTSATARSRTWQPSLHSISEAGS
        KG+ AKK  K+K+   E +   +     T RS          S S+   Q   HEK    R   +T  AT + R WQPSL SISE+ S
Subjt:  KGLSAKKQAKVKQSQLEAADKPDRRKQRTTRSSDAAAAAAGRSVSEDQIQANKHEKNNRPRTSTSTTSATARSRTWQPSLHSISEAGS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAAATTGTCAAGCCATTGATGCAGCAACACTTGTGATACAACACCCAAGTGGGAAAGAAGACAAATTGTATTGGCCTGTAACTGCTAGAGAGATTATGAAGATGAA
TCCTGGTCACTATGTTGCTCTTCTCATCTCTACTACCATGTTTACACCAAATGAAAGTAATAACAGCAATCAAACAAGCAATGAAACCAGTAGTAATTCGGTTCGTTTAA
CTCGAATCAAGCTTCTTCGCCCAGCTGACATGCTTGTTCTTGGCCAAGTTTACAGACTCATCACTACTCAAGAGGTCATGAAAGGTTTATCAGCAAAGAAACAAGCAAAA
GTTAAACAAAGCCAGTTAGAAGCAGCTGATAAGCCAGACAGAAGGAAACAACGTACAACCAGAAGTTCAGATGCAGCAGCAGCAGCAGCTGGAAGATCTGTTTCTGAAGA
TCAAATTCAGGCTAACAAACATGAGAAAAACAACCGACCAAGGACAAGTACGTCGACAACCTCGGCCACAGCCAGATCAAGAACATGGCAACCTTCATTGCATAGCATCT
CAGAAGCTGGAAGCTAA
mRNA sequenceShow/hide mRNA sequence
TCAAAATGTATACATCCCCATTTCATTTATTAATATTCATATATTCTTTTTCATTTACAAAGTGGTGGTCTCAATTTGAATTGATGTGTGGAGAGGGGGTATAAAAAATA
CCCAAATGAAAAGAGAAACTATTGGGAGGTAAAGAGAAAGAAAAGAGAAGTGATGAGAGAGAATGTATTCATCATGTAATGTGTATATCAAGACTTGAAGCTGTATTTTC
ACCTTTGGCACCAGATCCTCTGTTCCTCTACCCACCCAATCATAGCTCACCACCTCATAACTTGTTGGACAATCTTGCCCTTCTTTATCACCTGAGTGAGTAGCTTTTTT
TTTTTTTTACCCTCAATGATCTATTAAGCAGATAAGAATTTAAGGCTGCTGGTGAATGAGTGTTTCTTAAGTGGCTATAAAAAAAGATTGGTTCTCTCTTCACAGCCCCT
CCCCTCTTGCTTTGCTCATTATCCTCATATCTCCCACAATTTCAGAGAGAACTAGAGATGGGAAATTGTCAAGCCATTGATGCAGCAACACTTGTGATACAACACCCAAG
TGGGAAAGAAGACAAATTGTATTGGCCTGTAACTGCTAGAGAGATTATGAAGATGAATCCTGGTCACTATGTTGCTCTTCTCATCTCTACTACCATGTTTACACCAAATG
AAAGTAATAACAGCAATCAAACAAGCAATGAAACCAGTAGTAATTCGGTTCGTTTAACTCGAATCAAGCTTCTTCGCCCAGCTGACATGCTTGTTCTTGGCCAAGTTTAC
AGACTCATCACTACTCAAGAGGTCATGAAAGGTTTATCAGCAAAGAAACAAGCAAAAGTTAAACAAAGCCAGTTAGAAGCAGCTGATAAGCCAGACAGAAGGAAACAACG
TACAACCAGAAGTTCAGATGCAGCAGCAGCAGCAGCTGGAAGATCTGTTTCTGAAGATCAAATTCAGGCTAACAAACATGAGAAAAACAACCGACCAAGGACAAGTACGT
CGACAACCTCGGCCACAGCCAGATCAAGAACATGGCAACCTTCATTGCATAGCATCTCAGAAGCTGGAAGCTAATCATTTATGTTTCCTATTTCCCACATGCTATTGAGA
GACAGGGTGGCACAGATTGTCCTAAGGCTCTTTAAGCTTCTCCTAAGAGGAAAAAAAAAAAAAAAAGCAAAAAAGGAGAGATAGGCCAAATGAGCCTTTTAATTCAAGAA
AAATTCTTCACCAACACGTTACTGATAATGATATTAGGATGGTGGCTGTAAAAGAGTTGTTTAAAACTTTTTCTTCAATCTTGGGGGAAAAAAAGAGAACCAAAACTGTG
TGCATAAGAAAGCAGGGTGGGTATATAGTGAAAACCAGTTGAGATTCTTCAAACATCTTTGTAAATTCATAGTCTTTTTCACCAATTAAATGAGATTTGAGCAGAAAAAG
ATAGCTCAGTTTTTTTTCTTCTTCTTATTACCATTCTGATTATCTGTCTGCAGAACTGACAAATTCTTGTGATTCACTGTTGTTTCTGTTTTACACCTTTGGTTTATGAA
TTATAATCTCGTGAAAAAAAAGGTTTAATGGTTGGATTCTCTATTAAGTCGAGGTTGAGCTCAAAAAAGCAACTTCCCACTTGCTTTAATGCTTACAACTCATGAGGAGC
ACAGTTGAGTTGATGTGGATTTGCATCACTATCATCTCAAGTTTAGCTAAGCA
Protein sequenceShow/hide protein sequence
MGNCQAIDAATLVIQHPSGKEDKLYWPVTAREIMKMNPGHYVALLISTTMFTPNESNNSNQTSNETSSNSVRLTRIKLLRPADMLVLGQVYRLITTQEVMKGLSAKKQAK
VKQSQLEAADKPDRRKQRTTRSSDAAAAAAGRSVSEDQIQANKHEKNNRPRTSTSTTSATARSRTWQPSLHSISEAGS