; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Pay0013973 (gene) of Melon (Payzawat) v1 genome

Gene IDPay0013973
OrganismCucumis melo var. inodorus cv. Payzawat (Melon (Payzawat) v1)
DescriptionGag-proteinase polyprotein
Genome locationchr09:11171690..11173683
RNA-Seq ExpressionPay0013973
SyntenyPay0013973
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR036875 - Zinc finger, CCHC-type superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0034783.1 gag-proteinase polyprotein [Cucumis melo var. makuwa]1.1e-8977.49Show/hide
Query:  NERVLEIANESLLLGERIPKSKIMRKVLRSLPGKFDMKVTAIDEAHDITKLKLDELFGSLLTFEMAISHRENKKGKVIAFKLIYEEETTINKSDNEANMN
        NER+LEIANESLL GERIP+SKI+ K+L+SL GKFDMKVTAI++AH+ITKLKLDELFGSLLT EMAISHRENKKGK IAFK IYEEET +N+SDNEANMN
Subjt:  NERVLEIANESLLLGERIPKSKIMRKVLRSLPGKFDMKVTAIDEAHDITKLKLDELFGSLLTFEMAISHRENKKGKVIAFKLIYEEETTINKSDNEANMN

Query:  ESIALLMKHFSKVVKKFKNLNTTGSNARNLTNYRRRDGKNNTRRFNEISNMRDSDYGRKKKGEGRIFRCRECEKVGHYRAKCPTFSRRQKKNFCANLSDE
        ESIALL K FSKVVKK KNLN T SN RNLTNYR+ +G+NNTRRFNEISNMRDSDYGRKK+G GRIF+CREC +VGHY+A+ PTF RRQKKNF A LSDE
Subjt:  ESIALLMKHFSKVVKKFKNLNTTGSNARNLTNYRRRDGKNNTRRFNEISNMRDSDYGRKKKGEGRIFRCRECEKVGHYRAKCPTFSRRQKKNFCANLSDE

Query:  DTDDSEKDNSMNAFTACTTETNSGEVSKNAK
        DTDDSE+ N MNAFT   T+TNS + S++++
Subjt:  DTDDSEKDNSMNAFTACTTETNSGEVSKNAK

KAA0039550.1 uncharacterized protein E6C27_scaffold744G00190 [Cucumis melo var. makuwa]5.1e-9074.69Show/hide
Query:  MSEDESVSD-NERVLEIANESLLLGERIPKSKIMRKVLRSLPGKFDMKVTAIDEAHDITKLKLDELFGSLLTFEMAISHRENKKGKVIAFKLIYEEETTI
        MSEDESVSD N+RVL+I+NESLL GE+IPKSKI+ KVLRSL GKFDMKVT I+EAH+ITKLKL ELFGSLLTFEMAISHRE+K GK IAFK IY+EETT+
Subjt:  MSEDESVSD-NERVLEIANESLLLGERIPKSKIMRKVLRSLPGKFDMKVTAIDEAHDITKLKLDELFGSLLTFEMAISHRENKKGKVIAFKLIYEEETTI

Query:  NKSDNEANMNESIALLMKHFSKVVKKFKNLNTTGSNARNLTNYRRRDGKNNTRRFNEISNMRDSDYGRKKKGEGRIFRCRECEKVGHYRAKCPTFSRRQK
        N+SDN AN+NESIALL K FSKVVKKFK LNTTGSNA+NLTNYRR+DG+NNTRR+NE+SN R SDYGRK +GEGR F CREC  VGHY+ +CP F RRQK
Subjt:  NKSDNEANMNESIALLMKHFSKVVKKFKNLNTTGSNARNLTNYRRRDGKNNTRRFNEISNMRDSDYGRKKKGEGRIFRCRECEKVGHYRAKCPTFSRRQK

Query:  KNFCANLSDEDTDDSEKDNSMNAFTACTTETNSGEVSKNAK
        K+FCA LSDEDT+D+E+DN M AFT    ETN G+ S+ ++
Subjt:  KNFCANLSDEDTDDSEKDNSMNAFTACTTETNSGEVSKNAK

TYK09581.1 gag-proteinase polyprotein [Cucumis melo var. makuwa]5.0e-9379.22Show/hide
Query:  NERVLEIANESLLLGERIPKSKIMRKVLRSLPGKFDMKVTAIDEAHDITKLKLDELFGSLLTFEMAISHRENKKGKVIAFKLIYEEETTINKSDNEANMN
        NER+LEIANESLL GERIP+SKI+ K+LRSL  KFDMKVTAI++AH+ITKLKLDELFGSLLTFEMAISHRENKKGK IAFK IYEEET +N+SDNEANMN
Subjt:  NERVLEIANESLLLGERIPKSKIMRKVLRSLPGKFDMKVTAIDEAHDITKLKLDELFGSLLTFEMAISHRENKKGKVIAFKLIYEEETTINKSDNEANMN

Query:  ESIALLMKHFSKVVKKFKNLNTTGSNARNLTNYRRRDGKNNTRRFNEISNMRDSDYGRKKKGEGRIFRCRECEKVGHYRAKCPTFSRRQKKNFCANLSDE
        ESIALL K FSKVVKK KNLNTT SN RNLTNYR+ +G+NNTRRFNEISNMRDSDYGRKK+G GRIF+CREC +VGHY+A+ PTF RRQKKNF A LSDE
Subjt:  ESIALLMKHFSKVVKKFKNLNTTGSNARNLTNYRRRDGKNNTRRFNEISNMRDSDYGRKKKGEGRIFRCRECEKVGHYRAKCPTFSRRQKKNFCANLSDE

Query:  DTDDSEKDNSMNAFTACTTETNSGEVSKNAK
        DTDDSE+DN MNAFT C T+TNS + S++++
Subjt:  DTDDSEKDNSMNAFTACTTETNSGEVSKNAK

TYK26468.1 gag-pol polyprotein [Cucumis melo var. makuwa]1.1e-14587.5Show/hide
Query:  MSEDESVSDNERVLEIANESLLLGERIPKSKIMRKVLRSLPGKFDMKVTAIDEAHDITKLKLDELFGSLLTFEMAISHRENKKGKVIAFKLIYEEETTIN
        MSEDESVSDNERVLEIANESLLLGERIPKSKIMRKVLRSLPGKFDMKVTAIDEAHDITKLKLDELFGSLLTFEMAISHRENKKG                
Subjt:  MSEDESVSDNERVLEIANESLLLGERIPKSKIMRKVLRSLPGKFDMKVTAIDEAHDITKLKLDELFGSLLTFEMAISHRENKKGKVIAFKLIYEEETTIN

Query:  KSDNEANMNESIALLMKHFSKVVKKFKNLNTTGSNARNLTNYRRRDGKNNTRRFNEISNMRDSDYGRKKKGEGRIFRCRECEKVGHYRAKCPTFSRRQKK
                            KVVKKFKNLNTTGSNARNLTNYRRRDGKNNTRRFNEISNMRDSDYGRKKKGEGRIFRCRECEKVGHYRAKCPTFSRRQKK
Subjt:  KSDNEANMNESIALLMKHFSKVVKKFKNLNTTGSNARNLTNYRRRDGKNNTRRFNEISNMRDSDYGRKKKGEGRIFRCRECEKVGHYRAKCPTFSRRQKK

Query:  NFCANLSDEDTDDSEKDNSMNAFTACTTETNSGEVSKNAKLWKCKSMHLQQRKLLALQLKLLDEFVITVAREVILGHFAICYKETECINRSGCSKHMTGK
         FCANLSDEDTDDSE+DNSMNAFTACTTETNSGEVSKNAKLWKCKSMHLQQRKLLALQLKLLDEFVITVAREVILGHFAICYKETECIN SGCSKHMTGK
Subjt:  NFCANLSDEDTDDSEKDNSMNAFTACTTETNSGEVSKNAKLWKCKSMHLQQRKLLALQLKLLDEFVITVAREVILGHFAICYKETECINRSGCSKHMTGK

Query:  RSFFSKLKECAQDMLLLEMI
        RSFFSKLKECAQDMLLLEM+
Subjt:  RSFFSKLKECAQDMLLLEMI

XP_008465844.1 PREDICTED: uncharacterized protein LOC103503438 [Cucumis melo]1.5e-21999.5Show/hide
Query:  MSEDESVSDNERVLEIANESLLLGERIPKSKIMRKVLRSLPGKFDMKVTAIDEAHDITKLKLDELFGSLLTFEMAISHRENKKGKVIAFKLIYEEETTIN
        MSEDESVSDNERVLEIANESLLL ERIPKSKIMRKVLRSLPGKFDMKVTAIDEAHDITKLKLDELFGSLLTFEMAISHRENKKGKVIAFKLIYEEETTIN
Subjt:  MSEDESVSDNERVLEIANESLLLGERIPKSKIMRKVLRSLPGKFDMKVTAIDEAHDITKLKLDELFGSLLTFEMAISHRENKKGKVIAFKLIYEEETTIN

Query:  KSDNEANMNESIALLMKHFSKVVKKFKNLNTTGSNARNLTNYRRRDGKNNTRRFNEISNMRDSDYGRKKKGEGRIFRCRECEKVGHYRAKCPTFSRRQKK
        KSDNEANMNESIALLMKHFSKVVKKFKNLNTTGSNARNLTNYRRRDGKNNTRRFNEISNMRDSDYGRKKKGEGRIFRCRECEKVGHYRAKCPTFSRRQKK
Subjt:  KSDNEANMNESIALLMKHFSKVVKKFKNLNTTGSNARNLTNYRRRDGKNNTRRFNEISNMRDSDYGRKKKGEGRIFRCRECEKVGHYRAKCPTFSRRQKK

Query:  NFCANLSDEDTDDSEKDNSMNAFTACTTETNSGEVSKNAKLWKCKSMHLQQRKLLALQLKLLDEFVITVAREVILGHFAICYKETECINRSGCSKHMTGK
         FCANLSDEDTDDSEKDNSMNAFTACTTETNSGEVSKNAKLWKCKSMHLQQRKLLALQLKLLDEFVITVAREVILGHFAICYKETECINRSGCSKHMTGK
Subjt:  NFCANLSDEDTDDSEKDNSMNAFTACTTETNSGEVSKNAKLWKCKSMHLQQRKLLALQLKLLDEFVITVAREVILGHFAICYKETECINRSGCSKHMTGK

Query:  RSFFSKLKECAQDMLLLEMIVEGLKANLISVSQLCDQGYNVKFSNDSCVVMFEFTKGKREKIVRIRSDHGKEFENEDLSNFCKMEGIHHEYSAPLTPQ
        RSFFSKLKECAQDMLLLEMIVEGLKANLISVSQLCDQGYNVKFSNDSCVVMFEFTKGKREKIVRIRSDHGKEFENEDLSNFCKMEGIHHEYSAPLTPQ
Subjt:  RSFFSKLKECAQDMLLLEMIVEGLKANLISVSQLCDQGYNVKFSNDSCVVMFEFTKGKREKIVRIRSDHGKEFENEDLSNFCKMEGIHHEYSAPLTPQ

TrEMBL top hitse value%identityAlignment
A0A1S3CPU7 uncharacterized protein LOC1035034387.4e-22099.5Show/hide
Query:  MSEDESVSDNERVLEIANESLLLGERIPKSKIMRKVLRSLPGKFDMKVTAIDEAHDITKLKLDELFGSLLTFEMAISHRENKKGKVIAFKLIYEEETTIN
        MSEDESVSDNERVLEIANESLLL ERIPKSKIMRKVLRSLPGKFDMKVTAIDEAHDITKLKLDELFGSLLTFEMAISHRENKKGKVIAFKLIYEEETTIN
Subjt:  MSEDESVSDNERVLEIANESLLLGERIPKSKIMRKVLRSLPGKFDMKVTAIDEAHDITKLKLDELFGSLLTFEMAISHRENKKGKVIAFKLIYEEETTIN

Query:  KSDNEANMNESIALLMKHFSKVVKKFKNLNTTGSNARNLTNYRRRDGKNNTRRFNEISNMRDSDYGRKKKGEGRIFRCRECEKVGHYRAKCPTFSRRQKK
        KSDNEANMNESIALLMKHFSKVVKKFKNLNTTGSNARNLTNYRRRDGKNNTRRFNEISNMRDSDYGRKKKGEGRIFRCRECEKVGHYRAKCPTFSRRQKK
Subjt:  KSDNEANMNESIALLMKHFSKVVKKFKNLNTTGSNARNLTNYRRRDGKNNTRRFNEISNMRDSDYGRKKKGEGRIFRCRECEKVGHYRAKCPTFSRRQKK

Query:  NFCANLSDEDTDDSEKDNSMNAFTACTTETNSGEVSKNAKLWKCKSMHLQQRKLLALQLKLLDEFVITVAREVILGHFAICYKETECINRSGCSKHMTGK
         FCANLSDEDTDDSEKDNSMNAFTACTTETNSGEVSKNAKLWKCKSMHLQQRKLLALQLKLLDEFVITVAREVILGHFAICYKETECINRSGCSKHMTGK
Subjt:  NFCANLSDEDTDDSEKDNSMNAFTACTTETNSGEVSKNAKLWKCKSMHLQQRKLLALQLKLLDEFVITVAREVILGHFAICYKETECINRSGCSKHMTGK

Query:  RSFFSKLKECAQDMLLLEMIVEGLKANLISVSQLCDQGYNVKFSNDSCVVMFEFTKGKREKIVRIRSDHGKEFENEDLSNFCKMEGIHHEYSAPLTPQ
        RSFFSKLKECAQDMLLLEMIVEGLKANLISVSQLCDQGYNVKFSNDSCVVMFEFTKGKREKIVRIRSDHGKEFENEDLSNFCKMEGIHHEYSAPLTPQ
Subjt:  RSFFSKLKECAQDMLLLEMIVEGLKANLISVSQLCDQGYNVKFSNDSCVVMFEFTKGKREKIVRIRSDHGKEFENEDLSNFCKMEGIHHEYSAPLTPQ

A0A5A7SW91 Gag-proteinase polyprotein5.5e-9077.49Show/hide
Query:  NERVLEIANESLLLGERIPKSKIMRKVLRSLPGKFDMKVTAIDEAHDITKLKLDELFGSLLTFEMAISHRENKKGKVIAFKLIYEEETTINKSDNEANMN
        NER+LEIANESLL GERIP+SKI+ K+L+SL GKFDMKVTAI++AH+ITKLKLDELFGSLLT EMAISHRENKKGK IAFK IYEEET +N+SDNEANMN
Subjt:  NERVLEIANESLLLGERIPKSKIMRKVLRSLPGKFDMKVTAIDEAHDITKLKLDELFGSLLTFEMAISHRENKKGKVIAFKLIYEEETTINKSDNEANMN

Query:  ESIALLMKHFSKVVKKFKNLNTTGSNARNLTNYRRRDGKNNTRRFNEISNMRDSDYGRKKKGEGRIFRCRECEKVGHYRAKCPTFSRRQKKNFCANLSDE
        ESIALL K FSKVVKK KNLN T SN RNLTNYR+ +G+NNTRRFNEISNMRDSDYGRKK+G GRIF+CREC +VGHY+A+ PTF RRQKKNF A LSDE
Subjt:  ESIALLMKHFSKVVKKFKNLNTTGSNARNLTNYRRRDGKNNTRRFNEISNMRDSDYGRKKKGEGRIFRCRECEKVGHYRAKCPTFSRRQKKNFCANLSDE

Query:  DTDDSEKDNSMNAFTACTTETNSGEVSKNAK
        DTDDSE+ N MNAFT   T+TNS + S++++
Subjt:  DTDDSEKDNSMNAFTACTTETNSGEVSKNAK

A0A5A7T8I2 CCHC-type domain-containing protein2.5e-9074.69Show/hide
Query:  MSEDESVSD-NERVLEIANESLLLGERIPKSKIMRKVLRSLPGKFDMKVTAIDEAHDITKLKLDELFGSLLTFEMAISHRENKKGKVIAFKLIYEEETTI
        MSEDESVSD N+RVL+I+NESLL GE+IPKSKI+ KVLRSL GKFDMKVT I+EAH+ITKLKL ELFGSLLTFEMAISHRE+K GK IAFK IY+EETT+
Subjt:  MSEDESVSD-NERVLEIANESLLLGERIPKSKIMRKVLRSLPGKFDMKVTAIDEAHDITKLKLDELFGSLLTFEMAISHRENKKGKVIAFKLIYEEETTI

Query:  NKSDNEANMNESIALLMKHFSKVVKKFKNLNTTGSNARNLTNYRRRDGKNNTRRFNEISNMRDSDYGRKKKGEGRIFRCRECEKVGHYRAKCPTFSRRQK
        N+SDN AN+NESIALL K FSKVVKKFK LNTTGSNA+NLTNYRR+DG+NNTRR+NE+SN R SDYGRK +GEGR F CREC  VGHY+ +CP F RRQK
Subjt:  NKSDNEANMNESIALLMKHFSKVVKKFKNLNTTGSNARNLTNYRRRDGKNNTRRFNEISNMRDSDYGRKKKGEGRIFRCRECEKVGHYRAKCPTFSRRQK

Query:  KNFCANLSDEDTDDSEKDNSMNAFTACTTETNSGEVSKNAK
        K+FCA LSDEDT+D+E+DN M AFT    ETN G+ S+ ++
Subjt:  KNFCANLSDEDTDDSEKDNSMNAFTACTTETNSGEVSKNAK

A0A5D3CCM8 Gag-proteinase polyprotein2.4e-9379.22Show/hide
Query:  NERVLEIANESLLLGERIPKSKIMRKVLRSLPGKFDMKVTAIDEAHDITKLKLDELFGSLLTFEMAISHRENKKGKVIAFKLIYEEETTINKSDNEANMN
        NER+LEIANESLL GERIP+SKI+ K+LRSL  KFDMKVTAI++AH+ITKLKLDELFGSLLTFEMAISHRENKKGK IAFK IYEEET +N+SDNEANMN
Subjt:  NERVLEIANESLLLGERIPKSKIMRKVLRSLPGKFDMKVTAIDEAHDITKLKLDELFGSLLTFEMAISHRENKKGKVIAFKLIYEEETTINKSDNEANMN

Query:  ESIALLMKHFSKVVKKFKNLNTTGSNARNLTNYRRRDGKNNTRRFNEISNMRDSDYGRKKKGEGRIFRCRECEKVGHYRAKCPTFSRRQKKNFCANLSDE
        ESIALL K FSKVVKK KNLNTT SN RNLTNYR+ +G+NNTRRFNEISNMRDSDYGRKK+G GRIF+CREC +VGHY+A+ PTF RRQKKNF A LSDE
Subjt:  ESIALLMKHFSKVVKKFKNLNTTGSNARNLTNYRRRDGKNNTRRFNEISNMRDSDYGRKKKGEGRIFRCRECEKVGHYRAKCPTFSRRQKKNFCANLSDE

Query:  DTDDSEKDNSMNAFTACTTETNSGEVSKNAK
        DTDDSE+DN MNAFT C T+TNS + S++++
Subjt:  DTDDSEKDNSMNAFTACTTETNSGEVSKNAK

A0A5D3DSI5 Gag-pol polyprotein5.4e-14687.5Show/hide
Query:  MSEDESVSDNERVLEIANESLLLGERIPKSKIMRKVLRSLPGKFDMKVTAIDEAHDITKLKLDELFGSLLTFEMAISHRENKKGKVIAFKLIYEEETTIN
        MSEDESVSDNERVLEIANESLLLGERIPKSKIMRKVLRSLPGKFDMKVTAIDEAHDITKLKLDELFGSLLTFEMAISHRENKKG                
Subjt:  MSEDESVSDNERVLEIANESLLLGERIPKSKIMRKVLRSLPGKFDMKVTAIDEAHDITKLKLDELFGSLLTFEMAISHRENKKGKVIAFKLIYEEETTIN

Query:  KSDNEANMNESIALLMKHFSKVVKKFKNLNTTGSNARNLTNYRRRDGKNNTRRFNEISNMRDSDYGRKKKGEGRIFRCRECEKVGHYRAKCPTFSRRQKK
                            KVVKKFKNLNTTGSNARNLTNYRRRDGKNNTRRFNEISNMRDSDYGRKKKGEGRIFRCRECEKVGHYRAKCPTFSRRQKK
Subjt:  KSDNEANMNESIALLMKHFSKVVKKFKNLNTTGSNARNLTNYRRRDGKNNTRRFNEISNMRDSDYGRKKKGEGRIFRCRECEKVGHYRAKCPTFSRRQKK

Query:  NFCANLSDEDTDDSEKDNSMNAFTACTTETNSGEVSKNAKLWKCKSMHLQQRKLLALQLKLLDEFVITVAREVILGHFAICYKETECINRSGCSKHMTGK
         FCANLSDEDTDDSE+DNSMNAFTACTTETNSGEVSKNAKLWKCKSMHLQQRKLLALQLKLLDEFVITVAREVILGHFAICYKETECIN SGCSKHMTGK
Subjt:  NFCANLSDEDTDDSEKDNSMNAFTACTTETNSGEVSKNAKLWKCKSMHLQQRKLLALQLKLLDEFVITVAREVILGHFAICYKETECINRSGCSKHMTGK

Query:  RSFFSKLKECAQDMLLLEMI
        RSFFSKLKECAQDMLLLEM+
Subjt:  RSFFSKLKECAQDMLLLEMI

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTGAAGATGAATCTGTTTCAGATAATGAGAGGGTTCTGGAAATTGCTAATGAATCACTGTTACTTGGTGAAAGAATACCTAAGTCCAAGATAATGCGCAAAGTATT
ACGATCGTTGCCAGGAAAATTTGACATGAAGGTCACTGCCATAGACGAAGCACACGATATAACCAAATTAAAGCTAGATGAGTTATTTGGGTCTCTGCTTACTTTCGAGA
TGGCTATATCTCACAGAGAGAATAAGAAAGGCAAGGTCATTGCTTTTAAGTTAATATATGAAGAAGAGACAACAATAAATAAATCTGATAACGAAGCAAACATGAACGAA
TCGATAGCTCTTTTGATGAAACATTTTTCTAAGGTTGTCAAGAAATTCAAAAATTTGAATACTACAGGATCAAATGCTCGAAATCTGACCAATTATCGAAGAAGAGATGG
TAAGAACAATACCAGAAGGTTTAATGAAATCTCAAACATGAGGGATAGTGACTATGGACGAAAAAAGAAGGGTGAAGGAAGAATTTTCAGGTGTAGAGAATGTGAGAAAG
TTGGTCATTATCGAGCTAAATGTCCTACATTCTCGAGAAGACAAAAGAAAAATTTTTGTGCTAACTTGTCAGATGAGGACACTGATGATAGTGAAAAAGATAATAGCATG
AATGCATTCACAGCATGCACTACAGAAACCAATTCTGGTGAAGTCAGTAAAAATGCTAAATTATGGAAATGCAAGTCAATGCACCTACAGCAACGAAAGTTGTTAGCCCT
TCAGCTAAAACTACTAGATGAGTTTGTCATTACTGTGGCTAGGGAGGTCATATTAGGCCATTTTGCTATATGTTACAAAGAGACAGAATGTATCAATAGAAGTGGATGCT
CTAAGCACATGACTGGAAAGAGATCCTTCTTCTCTAAATTAAAGGAATGTGCTCAGGACATGTTACTTTTGGAGATGATCGTTGAAGGACTAAAAGCCAATCTAATCAGT
GTAAGTCAGCTATGTGATCAAGGTTACAACGTGAAATTCAGCAACGACAGTTGTGTGGTTATGTTTGAGTTTACAAAGGGAAAAAGGGAGAAGATTGTCAGAATCAGAAG
TGATCATGGTAAGGAATTTGAAAATGAAGATTTGAGTAACTTCTGTAAAATGGAAGGAATACATCATGAATATTCTGCTCCTTTAACTCCTCAGTAG
mRNA sequenceShow/hide mRNA sequence
ATGTCTGAAGATGAATCTGTTTCAGATAATGAGAGGGTTCTGGAAATTGCTAATGAATCACTGTTACTTGGTGAAAGAATACCTAAGTCCAAGATAATGCGCAAAGTATT
ACGATCGTTGCCAGGAAAATTTGACATGAAGGTCACTGCCATAGACGAAGCACACGATATAACCAAATTAAAGCTAGATGAGTTATTTGGGTCTCTGCTTACTTTCGAGA
TGGCTATATCTCACAGAGAGAATAAGAAAGGCAAGGTCATTGCTTTTAAGTTAATATATGAAGAAGAGACAACAATAAATAAATCTGATAACGAAGCAAACATGAACGAA
TCGATAGCTCTTTTGATGAAACATTTTTCTAAGGTTGTCAAGAAATTCAAAAATTTGAATACTACAGGATCAAATGCTCGAAATCTGACCAATTATCGAAGAAGAGATGG
TAAGAACAATACCAGAAGGTTTAATGAAATCTCAAACATGAGGGATAGTGACTATGGACGAAAAAAGAAGGGTGAAGGAAGAATTTTCAGGTGTAGAGAATGTGAGAAAG
TTGGTCATTATCGAGCTAAATGTCCTACATTCTCGAGAAGACAAAAGAAAAATTTTTGTGCTAACTTGTCAGATGAGGACACTGATGATAGTGAAAAAGATAATAGCATG
AATGCATTCACAGCATGCACTACAGAAACCAATTCTGGTGAAGTCAGTAAAAATGCTAAATTATGGAAATGCAAGTCAATGCACCTACAGCAACGAAAGTTGTTAGCCCT
TCAGCTAAAACTACTAGATGAGTTTGTCATTACTGTGGCTAGGGAGGTCATATTAGGCCATTTTGCTATATGTTACAAAGAGACAGAATGTATCAATAGAAGTGGATGCT
CTAAGCACATGACTGGAAAGAGATCCTTCTTCTCTAAATTAAAGGAATGTGCTCAGGACATGTTACTTTTGGAGATGATCGTTGAAGGACTAAAAGCCAATCTAATCAGT
GTAAGTCAGCTATGTGATCAAGGTTACAACGTGAAATTCAGCAACGACAGTTGTGTGGTTATGTTTGAGTTTACAAAGGGAAAAAGGGAGAAGATTGTCAGAATCAGAAG
TGATCATGGTAAGGAATTTGAAAATGAAGATTTGAGTAACTTCTGTAAAATGGAAGGAATACATCATGAATATTCTGCTCCTTTAACTCCTCAGTAG
Protein sequenceShow/hide protein sequence
MSEDESVSDNERVLEIANESLLLGERIPKSKIMRKVLRSLPGKFDMKVTAIDEAHDITKLKLDELFGSLLTFEMAISHRENKKGKVIAFKLIYEEETTINKSDNEANMNE
SIALLMKHFSKVVKKFKNLNTTGSNARNLTNYRRRDGKNNTRRFNEISNMRDSDYGRKKKGEGRIFRCRECEKVGHYRAKCPTFSRRQKKNFCANLSDEDTDDSEKDNSM
NAFTACTTETNSGEVSKNAKLWKCKSMHLQQRKLLALQLKLLDEFVITVAREVILGHFAICYKETECINRSGCSKHMTGKRSFFSKLKECAQDMLLLEMIVEGLKANLIS
VSQLCDQGYNVKFSNDSCVVMFEFTKGKREKIVRIRSDHGKEFENEDLSNFCKMEGIHHEYSAPLTPQ