; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

PI0006131 (gene) of Melon (PI 482460) v1 genome

Gene IDPI0006131
OrganismCucumis metuliferus PI 482460 (Melon (PI 482460) v1)
DescriptionDUF4228 domain-containing protein
Genome locationchr02:24106070..24107769
RNA-Seq ExpressionPI0006131
SyntenyPI0006131
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR025322 - Protein of unknown function DUF4228, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0057887.1 DUF4228 domain-containing protein [Cucumis melo var. makuwa]3.5e-8998.4Show/hide
Query:  MGNCQAIDAATLVIQHPSGKEDKLYWPVTAREIMKMNPGHYVALLISTTMFTPNESNNSNQTSNETSSNSVRLTRIKLLRPADMLVLGQVYRLITTQEVM
        MGNCQAIDAATLVIQHPSGKEDKLYWPVTAREIMKMNPGHYVALLISTTMFTPNESNNSNQTSNETSSNSVRLTRIKLLRPADMLVLGQVYRLITTQEVM
Subjt:  MGNCQAIDAATLVIQHPSGKEDKLYWPVTAREIMKMNPGHYVALLISTTMFTPNESNNSNQTSNETSSNSVRLTRIKLLRPADMLVLGQVYRLITTQEVM

Query:  KGLSAKKQAKVKQSQLEAADKPDRRKERTTRSSDAAAAAAGRSVSEDQIQANKHEKNNRPRTSTSTTSATARSRTWQPSLHSISEAGS
        KGLSAKKQAKVKQSQLEAADKPDRRKERTTRSSD   AAAGRSVSEDQIQANKHEKNNRPRTSTSTTSATARSRTWQPSLHSISEAGS
Subjt:  KGLSAKKQAKVKQSQLEAADKPDRRKERTTRSSDAAAAAAGRSVSEDQIQANKHEKNNRPRTSTSTTSATARSRTWQPSLHSISEAGS

XP_004138125.1 uncharacterized protein LOC101211887 [Cucumis sativus]2.9e-9198.94Show/hide
Query:  MGNCQAIDAATLVIQHPSGKEDKLYWPVTAREIMKMNPGHYVALLISTTMFTPNESNNSNQTSNETSSNSVRLTRIKLLRPADMLVLGQVYRLITTQEVM
        MGNCQAIDAATLVIQHPSGKEDKLYWPVTAREIMKMNPGHYVALLISTTMFTPNESNN+NQTSNETSSNSVRLTRIKLLRPADMLVLGQVYRLITTQEVM
Subjt:  MGNCQAIDAATLVIQHPSGKEDKLYWPVTAREIMKMNPGHYVALLISTTMFTPNESNNSNQTSNETSSNSVRLTRIKLLRPADMLVLGQVYRLITTQEVM

Query:  KGLSAKKQAKVKQSQLEAADKPDRRKERTTRSSDAAAAAAGRSVSEDQIQANKHEKNNRPRTSTSTTSATARSRTWQPSLHSISEAGS
        KGLSAKKQAKVKQSQLEAADKPDRRK+RTTRSSDAAAAAAGRSVSEDQIQANKHEKNNRPRTSTSTTSATARSRTWQPSLHSISEAGS
Subjt:  KGLSAKKQAKVKQSQLEAADKPDRRKERTTRSSDAAAAAAGRSVSEDQIQANKHEKNNRPRTSTSTTSATARSRTWQPSLHSISEAGS

XP_008453120.1 PREDICTED: uncharacterized protein LOC103493930 [Cucumis melo]3.5e-8998.4Show/hide
Query:  MGNCQAIDAATLVIQHPSGKEDKLYWPVTAREIMKMNPGHYVALLISTTMFTPNESNNSNQTSNETSSNSVRLTRIKLLRPADMLVLGQVYRLITTQEVM
        MGNCQAIDAATLVIQHPSGKEDKLYWPVTAREIMKMNPGHYVALLISTTMFTPNESNNSNQTSNETSSNSVRLTRIKLLRPADMLVLGQVYRLITTQEVM
Subjt:  MGNCQAIDAATLVIQHPSGKEDKLYWPVTAREIMKMNPGHYVALLISTTMFTPNESNNSNQTSNETSSNSVRLTRIKLLRPADMLVLGQVYRLITTQEVM

Query:  KGLSAKKQAKVKQSQLEAADKPDRRKERTTRSSDAAAAAAGRSVSEDQIQANKHEKNNRPRTSTSTTSATARSRTWQPSLHSISEAGS
        KGLSAKKQAKVKQSQLEAADKPDRRKERTTRSSD   AAAGRSVSEDQIQANKHEKNNRPRTSTSTTSATARSRTWQPSLHSISEAGS
Subjt:  KGLSAKKQAKVKQSQLEAADKPDRRKERTTRSSDAAAAAAGRSVSEDQIQANKHEKNNRPRTSTSTTSATARSRTWQPSLHSISEAGS

XP_022933526.1 uncharacterized protein LOC111440926 isoform X2 [Cucurbita moschata]2.2e-7581.95Show/hide
Query:  MGNCQAIDAATLVIQHPSGKEDKLYWPVTAREIMKMNPGHYVALLISTTMFTPNESNNSNQTS----------------NETSSNSVRLTRIKLLRPADM
        MGNCQAIDAATLVIQHPSGK DKLYWPVTAREIMKMNPGHYVALLISTTM TPNES+N+N+TS                NE SSNSVRLTRIKLLRPAD 
Subjt:  MGNCQAIDAATLVIQHPSGKEDKLYWPVTAREIMKMNPGHYVALLISTTMFTPNESNNSNQTS----------------NETSSNSVRLTRIKLLRPADM

Query:  LVLGQVYRLITTQEVMKGLSAKKQAKVKQSQLEAADKPDRRKERTTRSSD-AAAAAAGRSVSEDQIQANKHEKNNRPRTSTSTTSATARSRTWQPSLHSI
        LVLGQVYRLIT++EVM GLSAKKQAKVKQSQLEAA+K  RRKER  R SD AAAAAAGRSVSED  QA KHEKNNRPRTSTSTTSA ARSRTWQPSLHSI
Subjt:  LVLGQVYRLITTQEVMKGLSAKKQAKVKQSQLEAADKPDRRKERTTRSSD-AAAAAAGRSVSEDQIQANKHEKNNRPRTSTSTTSATARSRTWQPSLHSI

Query:  SEAGS
        SEAGS
Subjt:  SEAGS

XP_038880684.1 uncharacterized protein LOC120072302 [Benincasa hispida]6.0e-8192.59Show/hide
Query:  MGNCQAIDAATLVIQHPSGKEDKLYWPVTAREIMKMNPGHYVALLISTTMFTPNESNNSNQTSNETSSNSVRLTRIKLLRPADMLVLGQVYRLITTQEVM
        MGNCQAIDAATLVIQHPSGKEDKLYWPVTAREIMKMNPGHYVALLISTTMFTPNESN +N+T+N    NSVRLTRIKLLRPADMLVLGQVYRLITTQEVM
Subjt:  MGNCQAIDAATLVIQHPSGKEDKLYWPVTAREIMKMNPGHYVALLISTTMFTPNESNNSNQTSNETSSNSVRLTRIKLLRPADMLVLGQVYRLITTQEVM

Query:  KGLSAKKQAKVKQSQLEAADKPDRRKERTTRSSDAAAAAAGRSVSEDQIQANKHEKNNRPRT-STSTTSATARSRTWQPSLHSISEAGS
        KGLSAKKQAKVKQSQLEAADKPDRRKERTTRSSD AAAAAGRSVSED IQA KHEKNNRPRT STSTTS  ARSRTWQPSLHSISEAGS
Subjt:  KGLSAKKQAKVKQSQLEAADKPDRRKERTTRSSDAAAAAAGRSVSEDQIQANKHEKNNRPRT-STSTTSATARSRTWQPSLHSISEAGS

TrEMBL top hitse value%identityAlignment
A0A0A0LRI4 Uncharacterized protein1.4e-9198.94Show/hide
Query:  MGNCQAIDAATLVIQHPSGKEDKLYWPVTAREIMKMNPGHYVALLISTTMFTPNESNNSNQTSNETSSNSVRLTRIKLLRPADMLVLGQVYRLITTQEVM
        MGNCQAIDAATLVIQHPSGKEDKLYWPVTAREIMKMNPGHYVALLISTTMFTPNESNN+NQTSNETSSNSVRLTRIKLLRPADMLVLGQVYRLITTQEVM
Subjt:  MGNCQAIDAATLVIQHPSGKEDKLYWPVTAREIMKMNPGHYVALLISTTMFTPNESNNSNQTSNETSSNSVRLTRIKLLRPADMLVLGQVYRLITTQEVM

Query:  KGLSAKKQAKVKQSQLEAADKPDRRKERTTRSSDAAAAAAGRSVSEDQIQANKHEKNNRPRTSTSTTSATARSRTWQPSLHSISEAGS
        KGLSAKKQAKVKQSQLEAADKPDRRK+RTTRSSDAAAAAAGRSVSEDQIQANKHEKNNRPRTSTSTTSATARSRTWQPSLHSISEAGS
Subjt:  KGLSAKKQAKVKQSQLEAADKPDRRKERTTRSSDAAAAAAGRSVSEDQIQANKHEKNNRPRTSTSTTSATARSRTWQPSLHSISEAGS

A0A1S3BVG8 uncharacterized protein LOC1034939301.7e-8998.4Show/hide
Query:  MGNCQAIDAATLVIQHPSGKEDKLYWPVTAREIMKMNPGHYVALLISTTMFTPNESNNSNQTSNETSSNSVRLTRIKLLRPADMLVLGQVYRLITTQEVM
        MGNCQAIDAATLVIQHPSGKEDKLYWPVTAREIMKMNPGHYVALLISTTMFTPNESNNSNQTSNETSSNSVRLTRIKLLRPADMLVLGQVYRLITTQEVM
Subjt:  MGNCQAIDAATLVIQHPSGKEDKLYWPVTAREIMKMNPGHYVALLISTTMFTPNESNNSNQTSNETSSNSVRLTRIKLLRPADMLVLGQVYRLITTQEVM

Query:  KGLSAKKQAKVKQSQLEAADKPDRRKERTTRSSDAAAAAAGRSVSEDQIQANKHEKNNRPRTSTSTTSATARSRTWQPSLHSISEAGS
        KGLSAKKQAKVKQSQLEAADKPDRRKERTTRSSD   AAAGRSVSEDQIQANKHEKNNRPRTSTSTTSATARSRTWQPSLHSISEAGS
Subjt:  KGLSAKKQAKVKQSQLEAADKPDRRKERTTRSSDAAAAAAGRSVSEDQIQANKHEKNNRPRTSTSTTSATARSRTWQPSLHSISEAGS

A0A5A7UT51 DUF4228 domain-containing protein1.7e-8998.4Show/hide
Query:  MGNCQAIDAATLVIQHPSGKEDKLYWPVTAREIMKMNPGHYVALLISTTMFTPNESNNSNQTSNETSSNSVRLTRIKLLRPADMLVLGQVYRLITTQEVM
        MGNCQAIDAATLVIQHPSGKEDKLYWPVTAREIMKMNPGHYVALLISTTMFTPNESNNSNQTSNETSSNSVRLTRIKLLRPADMLVLGQVYRLITTQEVM
Subjt:  MGNCQAIDAATLVIQHPSGKEDKLYWPVTAREIMKMNPGHYVALLISTTMFTPNESNNSNQTSNETSSNSVRLTRIKLLRPADMLVLGQVYRLITTQEVM

Query:  KGLSAKKQAKVKQSQLEAADKPDRRKERTTRSSDAAAAAAGRSVSEDQIQANKHEKNNRPRTSTSTTSATARSRTWQPSLHSISEAGS
        KGLSAKKQAKVKQSQLEAADKPDRRKERTTRSSD   AAAGRSVSEDQIQANKHEKNNRPRTSTSTTSATARSRTWQPSLHSISEAGS
Subjt:  KGLSAKKQAKVKQSQLEAADKPDRRKERTTRSSDAAAAAAGRSVSEDQIQANKHEKNNRPRTSTSTTSATARSRTWQPSLHSISEAGS

A0A5D3CU41 DUF4228 domain-containing protein1.7e-8998.4Show/hide
Query:  MGNCQAIDAATLVIQHPSGKEDKLYWPVTAREIMKMNPGHYVALLISTTMFTPNESNNSNQTSNETSSNSVRLTRIKLLRPADMLVLGQVYRLITTQEVM
        MGNCQAIDAATLVIQHPSGKEDKLYWPVTAREIMKMNPGHYVALLISTTMFTPNESNNSNQTSNETSSNSVRLTRIKLLRPADMLVLGQVYRLITTQEVM
Subjt:  MGNCQAIDAATLVIQHPSGKEDKLYWPVTAREIMKMNPGHYVALLISTTMFTPNESNNSNQTSNETSSNSVRLTRIKLLRPADMLVLGQVYRLITTQEVM

Query:  KGLSAKKQAKVKQSQLEAADKPDRRKERTTRSSDAAAAAAGRSVSEDQIQANKHEKNNRPRTSTSTTSATARSRTWQPSLHSISEAGS
        KGLSAKKQAKVKQSQLEAADKPDRRKERTTRSSD   AAAGRSVSEDQIQANKHEKNNRPRTSTSTTSATARSRTWQPSLHSISEAGS
Subjt:  KGLSAKKQAKVKQSQLEAADKPDRRKERTTRSSDAAAAAAGRSVSEDQIQANKHEKNNRPRTSTSTTSATARSRTWQPSLHSISEAGS

A0A6J1EZA8 uncharacterized protein LOC111440926 isoform X21.1e-7581.95Show/hide
Query:  MGNCQAIDAATLVIQHPSGKEDKLYWPVTAREIMKMNPGHYVALLISTTMFTPNESNNSNQTS----------------NETSSNSVRLTRIKLLRPADM
        MGNCQAIDAATLVIQHPSGK DKLYWPVTAREIMKMNPGHYVALLISTTM TPNES+N+N+TS                NE SSNSVRLTRIKLLRPAD 
Subjt:  MGNCQAIDAATLVIQHPSGKEDKLYWPVTAREIMKMNPGHYVALLISTTMFTPNESNNSNQTS----------------NETSSNSVRLTRIKLLRPADM

Query:  LVLGQVYRLITTQEVMKGLSAKKQAKVKQSQLEAADKPDRRKERTTRSSD-AAAAAAGRSVSEDQIQANKHEKNNRPRTSTSTTSATARSRTWQPSLHSI
        LVLGQVYRLIT++EVM GLSAKKQAKVKQSQLEAA+K  RRKER  R SD AAAAAAGRSVSED  QA KHEKNNRPRTSTSTTSA ARSRTWQPSLHSI
Subjt:  LVLGQVYRLITTQEVMKGLSAKKQAKVKQSQLEAADKPDRRKERTTRSSD-AAAAAAGRSVSEDQIQANKHEKNNRPRTSTSTTSATARSRTWQPSLHSI

Query:  SEAGS
        SEAGS
Subjt:  SEAGS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G10530.1 unknown protein2.2e-2541.8Show/hide
Query:  MGNCQAIDAATLVIQHPSGKEDKLYWPVTAREIMKMNPGHYVALLISTTMFTPNESNNSNQTSNETSSNSVRLTRIKLLRPADMLVLGQVYRLITTQEVM
        MGNCQA++AA LV+QHP G  D+ Y  V+  E+M M PGHYV+L+I  +         + +  ++    +VR TR++LLRP + LVLG  YRLIT+QEVM
Subjt:  MGNCQAIDAATLVIQHPSGKEDKLYWPVTAREIMKMNPGHYVALLISTTMFTPNESNNSNQTSNETSSNSVRLTRIKLLRPADMLVLGQVYRLITTQEVM

Query:  KGLSAKKQAKVKQSQLEAADKPDRRKERTTRSSDAAAAAAGRSVSEDQIQANKHEKNNRP-RTSTSTTSATARSRTWQPSLHSISEAGS
        K L  KK AK K+ Q+          E+TT           +  S+ ++   K  K  R  R STS      +S+TW+PSL SISEA S
Subjt:  KGLSAKKQAKVKQSQLEAADKPDRRKERTTRSSDAAAAAAGRSVSEDQIQANKHEKNNRP-RTSTSTTSATARSRTWQPSLHSISEAGS

AT1G60010.1 unknown protein1.9e-3246.81Show/hide
Query:  MGNCQAIDAATLVIQHPSGKEDKLYWPVTAREIMKMNPGHYVALLISTTMFTPNESNNSNQTSNETSSNSVRLTRIKLLRPADMLVLGQVYRLITTQEVM
        MGNCQA+DAA LV+QHP GK D+ Y PV+  EIM+M PGHYV+L+I   +   N    +  T +++    VR TR+KLLRP + LVLG  YRLIT+QEVM
Subjt:  MGNCQAIDAATLVIQHPSGKEDKLYWPVTAREIMKMNPGHYVALLISTTMFTPNESNNSNQTSNETSSNSVRLTRIKLLRPADMLVLGQVYRLITTQEVM

Query:  KGLSAKKQAKVKQSQLEAADKPDRRKERTTRSSDAAAAAAGRSVSEDQIQANKHEKNNRPRTSTSTTSATARSRTWQPSLHSISEAGS
        K L AKK AK K+ Q E +      KE+   SS+       + + E+  +    E  +  + S  T SA++RS+TW+PSL SISEA S
Subjt:  KGLSAKKQAKVKQSQLEAADKPDRRKERTTRSSDAAAAAAGRSVSEDQIQANKHEKNNRPRTSTSTTSATARSRTWQPSLHSISEAGS

AT5G50090.1 unknown protein5.9e-3449.47Show/hide
Query:  MGNCQAIDAATLVIQHPSGKEDKLYWPVTAREIMKMNPGHYVALLISTTMFTPNESNNSNQTSNETSSNSVRLTRIKLLRPADMLVLGQVYRLITTQEVM
        MGNCQA+D A +VIQHP+GKE+KL  PV+A  +MKMNPGH V+LLISTT  +   S +            +RLTRIKLLRP D LVLG VYRLITT+EVM
Subjt:  MGNCQAIDAATLVIQHPSGKEDKLYWPVTAREIMKMNPGHYVALLISTTMFTPNESNNSNQTSNETSSNSVRLTRIKLLRPADMLVLGQVYRLITTQEVM

Query:  KGLSAKKQAKVKQSQLEAADKPDRRKERTTRSSDAAAAAAGRSVSEDQIQANKHEKNNRPRTSTSTTSATARSRTWQPSLHSISEAGS
        KGL AKK +K+K+    + DK +  K   +   D          +EDQ+Q  K EK             +  SR+WQPSL SISE GS
Subjt:  KGLSAKKQAKVKQSQLEAADKPDRRKERTTRSSDAAAAAAGRSVSEDQIQANKHEKNNRPRTSTSTTSATARSRTWQPSLHSISEAGS

AT5G50090.2 unknown protein2.1e-3147.34Show/hide
Query:  MGNCQAIDAATLVIQHPSGKEDKLYWPVTAREIMKMNPGHYVALLISTTMFTPNESNNSNQTSNETSSNSVRLTRIKLLRPADMLVLGQVYRLITTQEVM
        MGNCQA+D A +VIQHP+GKE+KL  PV+A  +MKMNPGH V+LLISTT  +   S +            +RLTRIKLLRP D LVLG VYRLITT+EVM
Subjt:  MGNCQAIDAATLVIQHPSGKEDKLYWPVTAREIMKMNPGHYVALLISTTMFTPNESNNSNQTSNETSSNSVRLTRIKLLRPADMLVLGQVYRLITTQEVM

Query:  KGLSAKKQAKVKQSQLEAADKPDRRKERTTRSSDAAAAAAGRSVSEDQIQANKHEKNNRPRTSTSTTSATARSRTWQPSLHSISEAGS
        KGL AKK +K+K+    + DK +  K   +   D                 N+ ++  R R           SR+WQPSL SISE GS
Subjt:  KGLSAKKQAKVKQSQLEAADKPDRRKERTTRSSDAAAAAAGRSVSEDQIQANKHEKNNRPRTSTSTTSATARSRTWQPSLHSISEAGS

AT5G62900.1 unknown protein6.9e-2743.62Show/hide
Query:  MGNCQAIDAATLVIQHPSGKEDKLYWPVTAREIMKMNPGHYVALLISTTMFTPNESNNSNQTSNETSSNSVRLTRIKLLRPADMLVLGQVYRLITTQEVM
        MGNCQA +AAT VIQ P GK  + Y  V A E++K +PGH+VALL+S+ +                   S+R+TRIKLLRP+D L+LG VYRLI+++EVM
Subjt:  MGNCQAIDAATLVIQHPSGKEDKLYWPVTAREIMKMNPGHYVALLISTTMFTPNESNNSNQTSNETSSNSVRLTRIKLLRPADMLVLGQVYRLITTQEVM

Query:  KGLSAKKQAKVKQSQLEAADKPDRRKERTTRSSDAAAAAAGRSVSEDQIQANKHEKNNRPRTSTSTTSATARSRTWQPSLHSISEAGS
        KG+ AKK  K+K+   E +   +     T RS          S S+   Q   HEK    R   +T  AT + R WQPSL SISE+ S
Subjt:  KGLSAKKQAKVKQSQLEAADKPDRRKERTTRSSDAAAAAAGRSVSEDQIQANKHEKNNRPRTSTSTTSATARSRTWQPSLHSISEAGS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAAATTGTCAAGCCATTGATGCAGCAACACTTGTGATACAACACCCGAGTGGGAAAGAAGACAAATTGTATTGGCCTGTAACTGCTAGAGAGATTATGAAGATGAA
TCCTGGTCACTATGTTGCTCTTCTCATCTCTACTACCATGTTTACACCAAATGAAAGTAATAACAGCAATCAAACAAGCAATGAAACCAGTAGTAATTCAGTTCGTTTAA
CTCGAATCAAGCTTCTTCGCCCAGCTGACATGCTTGTTCTTGGCCAAGTTTACAGACTCATCACTACTCAAGAGGTCATGAAAGGTTTATCGGCAAAGAAACAAGCAAAA
GTTAAACAAAGCCAGTTAGAAGCAGCTGATAAGCCAGACAGGAGGAAAGAACGTACAACCAGAAGTTCAGATGCAGCAGCAGCAGCAGCTGGAAGATCTGTATCTGAAGA
TCAAATTCAGGCGAACAAACATGAGAAAAACAACCGACCAAGGACAAGTACGTCAACAACCTCGGCCACAGCCAGATCAAGAACATGGCAACCTTCATTACATAGCATCT
CAGAAGCTGGAAGCTAA
mRNA sequenceShow/hide mRNA sequence
TGAAAAGAGAAACTATTGGGAGGTAAAGAGAAAGAAAGAGAAGTGATGAGAGAGAATGTATTCATCATGTAATGTATATATCAAGACTTGAAGCTGTATTTTCACCTTTG
GCACCAGATCCTCTGTTCCTCTACCCACCCAATCATAGCTCACCACCTCATAACTTGTTGGACAATCTTGCCCTTCTTTATCACCTGAGTGAGTAGCTTTTTTTTACCCT
CAATGATCTATTAAACAGATAAGAATTTAAGGCTGCTGGTGAATGAGTGTTTCTTAAGTGGCTATAAAAAAGATTGGTCCTCTCTTCACAGCCCCTCCCCTCTTGCTTTG
CTCATTATCCTCAAATCTCCCACAATTTCAGAGAGAAATAGAGATGGGAAATTGTCAAGCCATTGATGCAGCAACACTTGTGATACAACACCCGAGTGGGAAAGAAGACA
AATTGTATTGGCCTGTAACTGCTAGAGAGATTATGAAGATGAATCCTGGTCACTATGTTGCTCTTCTCATCTCTACTACCATGTTTACACCAAATGAAAGTAATAACAGC
AATCAAACAAGCAATGAAACCAGTAGTAATTCAGTTCGTTTAACTCGAATCAAGCTTCTTCGCCCAGCTGACATGCTTGTTCTTGGCCAAGTTTACAGACTCATCACTAC
TCAAGAGGTCATGAAAGGTTTATCGGCAAAGAAACAAGCAAAAGTTAAACAAAGCCAGTTAGAAGCAGCTGATAAGCCAGACAGGAGGAAAGAACGTACAACCAGAAGTT
CAGATGCAGCAGCAGCAGCAGCTGGAAGATCTGTATCTGAAGATCAAATTCAGGCGAACAAACATGAGAAAAACAACCGACCAAGGACAAGTACGTCAACAACCTCGGCC
ACAGCCAGATCAAGAACATGGCAACCTTCATTACATAGCATCTCAGAAGCTGGAAGCTAATCATTTATGTTTCCTATTTCCCACATGCTATTGAGAGACAGGGTGGCACA
GATTGTCCTAAGGCTCTTTAAGCTTCTCCTAAGAGGAAAAAAAAAAAGCAAAAAAGGAGAGATAGGCCAAATGAGCCTTTTAATTCAAGAAGAATTCTTCACCAACACAT
CACTGATAATGATATTAGGATGGTGGCTGTAAAAGAGTTGTTTAAAACTTTTTTCTTCAATCTTGGGGAAAAAAGGGCACCAAAAACTGTGTGCATAAGAAAGCAGGGTG
GGTATATAGTGCAAACCAGTTGAGATTCTTCAAACATCTTTGTAAATTCATAGTCTTTTTCACTAATGAAATGAGACTTGAGCAAAAAAAGAGAGCTCAGTTTTTTTTTC
TTCTTCTTATTACCATTCTGATTATCTGTTTGCAGACCTGACAAATTCTTGTGATTTACTGTTGTTTCTATTTTACACCTTTGGTTTCACAATAGAATGAGATTATGAAT
TA
Protein sequenceShow/hide protein sequence
MGNCQAIDAATLVIQHPSGKEDKLYWPVTAREIMKMNPGHYVALLISTTMFTPNESNNSNQTSNETSSNSVRLTRIKLLRPADMLVLGQVYRLITTQEVMKGLSAKKQAK
VKQSQLEAADKPDRRKERTTRSSDAAAAAAGRSVSEDQIQANKHEKNNRPRTSTSTTSATARSRTWQPSLHSISEAGS