; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Pay0021609 (gene) of Melon (Payzawat) v1 genome

Gene IDPay0021609
OrganismCucumis melo var. inodorus cv. Payzawat (Melon (Payzawat) v1)
Descriptionocticosapeptide/Phox/Bem1p (PB1) domain-containing protein
Genome locationchr10:3915854..3923274
RNA-Seq ExpressionPay0021609
SyntenyPay0021609
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR000270 - PB1 domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008446162.1 PREDICTED: uncharacterized protein LOC103488970 [Cucumis melo]4.7e-111100Show/hide
Query:  MVGASHRTGGIAARSDKTATIKFLCSYGGKILPRYPDGKLRYIGGETRVLAVDRSIPFSELLSKLGQLCGTCVSLRCQLPSEDLDALVSITSDEDLANLI
        MVGASHRTGGIAARSDKTATIKFLCSYGGKILPRYPDGKLRYIGGETRVLAVDRSIPFSELLSKLGQLCGTCVSLRCQLPSEDLDALVSITSDEDLANLI
Subjt:  MVGASHRTGGIAARSDKTATIKFLCSYGGKILPRYPDGKLRYIGGETRVLAVDRSIPFSELLSKLGQLCGTCVSLRCQLPSEDLDALVSITSDEDLANLI

Query:  EEYDRAASPPSSLKIRAFLSLPKSVKKNPLSSSSASSSSSSSKPSSPITAISSPRISTQVPDYCVHQIPTPVRFSYRWKKSQSKVPHCSYHLQGNPSHIY
        EEYDRAASPPSSLKIRAFLSLPKSVKKNPLSSSSASSSSSSSKPSSPITAISSPRISTQVPDYCVHQIPTPVRFSYRWKKSQSKVPHCSYHLQGNPSHIY
Subjt:  EEYDRAASPPSSLKIRAFLSLPKSVKKNPLSSSSASSSSSSSKPSSPITAISSPRISTQVPDYCVHQIPTPVRFSYRWKKSQSKVPHCSYHLQGNPSHIY

Query:  LIHNGNHWQ
        LIHNGNHWQ
Subjt:  LIHNGNHWQ

XP_022151767.1 uncharacterized protein LOC111019672 [Momordica charantia]2.8e-9588.52Show/hide
Query:  MVGASHRTGGIAARSDKTATIKFLCSYGGKILPRYPDGKLRYIGGETRVLAVDRSIPFSELLSKLGQLCGTCVSLRCQLPSEDLDALVSITSDEDLANLI
        MVGASHR GGIAARSD+TATIKFLCSYGGKILPRYPDGKLRY GGETRVLAVDRSIPFSELL KLGQLCGTCVSLRCQLPSED+DALVSITSDEDLANLI
Subjt:  MVGASHRTGGIAARSDKTATIKFLCSYGGKILPRYPDGKLRYIGGETRVLAVDRSIPFSELLSKLGQLCGTCVSLRCQLPSEDLDALVSITSDEDLANLI

Query:  EEYDRAASPPSSLKIRAFLSLPKSVKKNPLSSSSASSSSSSSKPSSPITAISSPRISTQVPDYCVHQIPTPVRFSYRWKKSQSKVPHCSYHLQGNPSHIY
        EEYDRAASPPSSLKIRAFLS PKS+KK PL SSSASSSSSSSK SSP TA+SSPRI TQVP++CVHQIP+ VRF YR KKS SKV HCSYH QGNPSH Y
Subjt:  EEYDRAASPPSSLKIRAFLSLPKSVKKNPLSSSSASSSSSSSKPSSPITAISSPRISTQVPDYCVHQIPTPVRFSYRWKKSQSKVPHCSYHLQGNPSHIY

Query:  LIHNGNHWQ
        LIH+GNHWQ
Subjt:  LIHNGNHWQ

XP_022966793.1 uncharacterized protein LOC111466393 isoform X1 [Cucurbita maxima]1.1e-9488.57Show/hide
Query:  MVGASHRTGGIAARSDKTATIKFLCSYGGKILPRYPDGKLRYIGGETRVLAVDRSIPFSELLSKLGQLCGTCVSLRCQLPSEDLDALVSITSDEDLANLI
        MVGASH  GGIAARSD+TATIKFLCSYGGKILPRYPDGKLRY+GGETRVLAVDRSI FSELL KLGQLCGTCVSLRCQLPSEDLDALVSITSDEDLANLI
Subjt:  MVGASHRTGGIAARSDKTATIKFLCSYGGKILPRYPDGKLRYIGGETRVLAVDRSIPFSELLSKLGQLCGTCVSLRCQLPSEDLDALVSITSDEDLANLI

Query:  EEYDRAASPPSSLKIRAFLSLPKSVKKNPLSSSSA-SSSSSSSKPSSPITAISSPRISTQVPDYCVHQIPTPVRFSYRWKKSQSKVPHCSYHLQGNPSHI
        EEYDRAASPPSSLKIRAFLS PKSVKK+PLS SSA SSSSSSSK SSP TA SSPRI T VPD+C HQIPTPVRFSYR K S SKVPHCSY+ QG PSH+
Subjt:  EEYDRAASPPSSLKIRAFLSLPKSVKKNPLSSSSA-SSSSSSSKPSSPITAISSPRISTQVPDYCVHQIPTPVRFSYRWKKSQSKVPHCSYHLQGNPSHI

Query:  YLIHNGNHWQ
        YLIHNGNHWQ
Subjt:  YLIHNGNHWQ

XP_023541757.1 uncharacterized protein LOC111801817 isoform X1 [Cucurbita pepo subsp. pepo]8.1e-9588.57Show/hide
Query:  MVGASHRTGGIAARSDKTATIKFLCSYGGKILPRYPDGKLRYIGGETRVLAVDRSIPFSELLSKLGQLCGTCVSLRCQLPSEDLDALVSITSDEDLANLI
        MVGASH  GGIAARSD+TATIKFLCSYGGKILPRYPDGKLRY+GGETRVLAVDRSI FSELL KLGQLCGTCVSLRCQLPSEDLDALVSITSDEDLANLI
Subjt:  MVGASHRTGGIAARSDKTATIKFLCSYGGKILPRYPDGKLRYIGGETRVLAVDRSIPFSELLSKLGQLCGTCVSLRCQLPSEDLDALVSITSDEDLANLI

Query:  EEYDRAASPPSSLKIRAFLSLPKSVKKNPLSSSSA-SSSSSSSKPSSPITAISSPRISTQVPDYCVHQIPTPVRFSYRWKKSQSKVPHCSYHLQGNPSHI
        EEYDRAASPP SLKIRAFLSLPKSVKK+PLS SSA SSSSSSSK SSP TA SSPRI T VPD+C HQIPTPVRFSYR K S SKVPHCSY+ QG PSH+
Subjt:  EEYDRAASPPSSLKIRAFLSLPKSVKKNPLSSSSA-SSSSSSSKPSSPITAISSPRISTQVPDYCVHQIPTPVRFSYRWKKSQSKVPHCSYHLQGNPSHI

Query:  YLIHNGNHWQ
        YLIHNGNHWQ
Subjt:  YLIHNGNHWQ

XP_031742074.1 uncharacterized protein LOC101222057 [Cucumis sativus]1.3e-10898.09Show/hide
Query:  MVGASHRTGGIAARSDKTATIKFLCSYGGKILPRYPDGKLRYIGGETRVLAVDRSIPFSELLSKLGQLCGTCVSLRCQLPSEDLDALVSITSDEDLANLI
        MVGASH TGGIAA+SDKTATIKFLCSYGGKILPRYPDGKLRYIGGETRVLAVDRSIPFSELL KLGQLCGTCVSLRCQLPSEDLDALVSITSDEDLANLI
Subjt:  MVGASHRTGGIAARSDKTATIKFLCSYGGKILPRYPDGKLRYIGGETRVLAVDRSIPFSELLSKLGQLCGTCVSLRCQLPSEDLDALVSITSDEDLANLI

Query:  EEYDRAASPPSSLKIRAFLSLPKSVKKNPLSSSSASSSSSSSKPSSPITAISSPRISTQVPDYCVHQIPTPVRFSYRWKKSQSKVPHCSYHLQGNPSHIY
        EEYDRAASPPS LKIRAFLSLPKSVKKNPLSSSSASSSSSSSKPSSPITAISSPRISTQVPDYCVHQIPTPVRFSYRWKKSQSKVPHCSYHLQGNPSHIY
Subjt:  EEYDRAASPPSSLKIRAFLSLPKSVKKNPLSSSSASSSSSSSKPSSPITAISSPRISTQVPDYCVHQIPTPVRFSYRWKKSQSKVPHCSYHLQGNPSHIY

Query:  LIHNGNHWQ
        LIHNGNHWQ
Subjt:  LIHNGNHWQ

TrEMBL top hitse value%identityAlignment
A0A0A0KQ32 PB1 domain-containing protein6.2e-10998.09Show/hide
Query:  MVGASHRTGGIAARSDKTATIKFLCSYGGKILPRYPDGKLRYIGGETRVLAVDRSIPFSELLSKLGQLCGTCVSLRCQLPSEDLDALVSITSDEDLANLI
        MVGASH TGGIAA+SDKTATIKFLCSYGGKILPRYPDGKLRYIGGETRVLAVDRSIPFSELL KLGQLCGTCVSLRCQLPSEDLDALVSITSDEDLANLI
Subjt:  MVGASHRTGGIAARSDKTATIKFLCSYGGKILPRYPDGKLRYIGGETRVLAVDRSIPFSELLSKLGQLCGTCVSLRCQLPSEDLDALVSITSDEDLANLI

Query:  EEYDRAASPPSSLKIRAFLSLPKSVKKNPLSSSSASSSSSSSKPSSPITAISSPRISTQVPDYCVHQIPTPVRFSYRWKKSQSKVPHCSYHLQGNPSHIY
        EEYDRAASPPS LKIRAFLSLPKSVKKNPLSSSSASSSSSSSKPSSPITAISSPRISTQVPDYCVHQIPTPVRFSYRWKKSQSKVPHCSYHLQGNPSHIY
Subjt:  EEYDRAASPPSSLKIRAFLSLPKSVKKNPLSSSSASSSSSSSKPSSPITAISSPRISTQVPDYCVHQIPTPVRFSYRWKKSQSKVPHCSYHLQGNPSHIY

Query:  LIHNGNHWQ
        LIHNGNHWQ
Subjt:  LIHNGNHWQ

A0A1S3BF29 uncharacterized protein LOC1034889702.3e-111100Show/hide
Query:  MVGASHRTGGIAARSDKTATIKFLCSYGGKILPRYPDGKLRYIGGETRVLAVDRSIPFSELLSKLGQLCGTCVSLRCQLPSEDLDALVSITSDEDLANLI
        MVGASHRTGGIAARSDKTATIKFLCSYGGKILPRYPDGKLRYIGGETRVLAVDRSIPFSELLSKLGQLCGTCVSLRCQLPSEDLDALVSITSDEDLANLI
Subjt:  MVGASHRTGGIAARSDKTATIKFLCSYGGKILPRYPDGKLRYIGGETRVLAVDRSIPFSELLSKLGQLCGTCVSLRCQLPSEDLDALVSITSDEDLANLI

Query:  EEYDRAASPPSSLKIRAFLSLPKSVKKNPLSSSSASSSSSSSKPSSPITAISSPRISTQVPDYCVHQIPTPVRFSYRWKKSQSKVPHCSYHLQGNPSHIY
        EEYDRAASPPSSLKIRAFLSLPKSVKKNPLSSSSASSSSSSSKPSSPITAISSPRISTQVPDYCVHQIPTPVRFSYRWKKSQSKVPHCSYHLQGNPSHIY
Subjt:  EEYDRAASPPSSLKIRAFLSLPKSVKKNPLSSSSASSSSSSSKPSSPITAISSPRISTQVPDYCVHQIPTPVRFSYRWKKSQSKVPHCSYHLQGNPSHIY

Query:  LIHNGNHWQ
        LIHNGNHWQ
Subjt:  LIHNGNHWQ

A0A5A7SUW7 Octicosapeptide/Phox/Bem1p family protein isoform 12.3e-111100Show/hide
Query:  MVGASHRTGGIAARSDKTATIKFLCSYGGKILPRYPDGKLRYIGGETRVLAVDRSIPFSELLSKLGQLCGTCVSLRCQLPSEDLDALVSITSDEDLANLI
        MVGASHRTGGIAARSDKTATIKFLCSYGGKILPRYPDGKLRYIGGETRVLAVDRSIPFSELLSKLGQLCGTCVSLRCQLPSEDLDALVSITSDEDLANLI
Subjt:  MVGASHRTGGIAARSDKTATIKFLCSYGGKILPRYPDGKLRYIGGETRVLAVDRSIPFSELLSKLGQLCGTCVSLRCQLPSEDLDALVSITSDEDLANLI

Query:  EEYDRAASPPSSLKIRAFLSLPKSVKKNPLSSSSASSSSSSSKPSSPITAISSPRISTQVPDYCVHQIPTPVRFSYRWKKSQSKVPHCSYHLQGNPSHIY
        EEYDRAASPPSSLKIRAFLSLPKSVKKNPLSSSSASSSSSSSKPSSPITAISSPRISTQVPDYCVHQIPTPVRFSYRWKKSQSKVPHCSYHLQGNPSHIY
Subjt:  EEYDRAASPPSSLKIRAFLSLPKSVKKNPLSSSSASSSSSSSKPSSPITAISSPRISTQVPDYCVHQIPTPVRFSYRWKKSQSKVPHCSYHLQGNPSHIY

Query:  LIHNGNHWQ
        LIHNGNHWQ
Subjt:  LIHNGNHWQ

A0A6J1DED4 uncharacterized protein LOC1110196721.3e-9588.52Show/hide
Query:  MVGASHRTGGIAARSDKTATIKFLCSYGGKILPRYPDGKLRYIGGETRVLAVDRSIPFSELLSKLGQLCGTCVSLRCQLPSEDLDALVSITSDEDLANLI
        MVGASHR GGIAARSD+TATIKFLCSYGGKILPRYPDGKLRY GGETRVLAVDRSIPFSELL KLGQLCGTCVSLRCQLPSED+DALVSITSDEDLANLI
Subjt:  MVGASHRTGGIAARSDKTATIKFLCSYGGKILPRYPDGKLRYIGGETRVLAVDRSIPFSELLSKLGQLCGTCVSLRCQLPSEDLDALVSITSDEDLANLI

Query:  EEYDRAASPPSSLKIRAFLSLPKSVKKNPLSSSSASSSSSSSKPSSPITAISSPRISTQVPDYCVHQIPTPVRFSYRWKKSQSKVPHCSYHLQGNPSHIY
        EEYDRAASPPSSLKIRAFLS PKS+KK PL SSSASSSSSSSK SSP TA+SSPRI TQVP++CVHQIP+ VRF YR KKS SKV HCSYH QGNPSH Y
Subjt:  EEYDRAASPPSSLKIRAFLSLPKSVKKNPLSSSSASSSSSSSKPSSPITAISSPRISTQVPDYCVHQIPTPVRFSYRWKKSQSKVPHCSYHLQGNPSHIY

Query:  LIHNGNHWQ
        LIH+GNHWQ
Subjt:  LIHNGNHWQ

A0A6J1HT89 uncharacterized protein LOC111466393 isoform X15.1e-9588.57Show/hide
Query:  MVGASHRTGGIAARSDKTATIKFLCSYGGKILPRYPDGKLRYIGGETRVLAVDRSIPFSELLSKLGQLCGTCVSLRCQLPSEDLDALVSITSDEDLANLI
        MVGASH  GGIAARSD+TATIKFLCSYGGKILPRYPDGKLRY+GGETRVLAVDRSI FSELL KLGQLCGTCVSLRCQLPSEDLDALVSITSDEDLANLI
Subjt:  MVGASHRTGGIAARSDKTATIKFLCSYGGKILPRYPDGKLRYIGGETRVLAVDRSIPFSELLSKLGQLCGTCVSLRCQLPSEDLDALVSITSDEDLANLI

Query:  EEYDRAASPPSSLKIRAFLSLPKSVKKNPLSSSSA-SSSSSSSKPSSPITAISSPRISTQVPDYCVHQIPTPVRFSYRWKKSQSKVPHCSYHLQGNPSHI
        EEYDRAASPPSSLKIRAFLS PKSVKK+PLS SSA SSSSSSSK SSP TA SSPRI T VPD+C HQIPTPVRFSYR K S SKVPHCSY+ QG PSH+
Subjt:  EEYDRAASPPSSLKIRAFLSLPKSVKKNPLSSSSA-SSSSSSSKPSSPITAISSPRISTQVPDYCVHQIPTPVRFSYRWKKSQSKVPHCSYHLQGNPSHI

Query:  YLIHNGNHWQ
        YLIHNGNHWQ
Subjt:  YLIHNGNHWQ

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G70640.1 octicosapeptide/Phox/Bem1p (PB1) domain-containing protein1.0e-3457.64Show/hide
Query:  TATIKFLCSYGGKILPRYPDGKLRYIGGETRVLAVDRSIPFSELLSKLGQLCGTCV-SLRCQLPSEDLDALVSITSDEDLANLIEEYDRAASPPSSLKIR
        T T+KFLCSYGG+I PRYPDGKLRY GG+TRVL+V R+I F+EL  KLG++CG  V SLRCQLP++DLDALV++ SDEDL NL+EEYD A +  + +KI 
Subjt:  TATIKFLCSYGGKILPRYPDGKLRYIGGETRVLAVDRSIPFSELLSKLGQLCGTCV-SLRCQLPSEDLDALVSITSDEDLANLIEEYDRAASPPSSLKIR

Query:  AFLSLPKSVKKN------PLSSSSASSSSSSSKPSSPITAISSP
         FLS  KS +        P ++SS+SS S S  P SP T  + P
Subjt:  AFLSLPKSVKKN------PLSSSSASSSSSSSKPSSPITAISSP

AT3G26510.1 Octicosapeptide/Phox/Bem1p family protein3.1e-4452.36Show/hide
Query:  SDKTATIKFLCSYGGKILPRYPDGKLRYIGGETRVLAVDRSIPFSELLSKLGQLCG-----TCVSLRCQLPSEDLDALVSITSDEDLANLIEEYDRAASP
        S   +TIKFLCSYGGKILPRYPDGKLRY GG TRVLAV RS+ FSEL SK+ ++CG       V++RCQLP+EDLDALVSITSDEDL NLIEEYD  +S 
Subjt:  SDKTATIKFLCSYGGKILPRYPDGKLRYIGGETRVLAVDRSIPFSELLSKLGQLCG-----TCVSLRCQLPSEDLDALVSITSDEDLANLIEEYDRAASP

Query:  PSSLKIRAFLSLPKSV---KKNP---------LSSSSASSSSSSSKPSSPITAISSPRISTQVPDYCVHQIPTPVRFSYRWKKSQSKVPHCSYHLQGNPS
         S +KIR FL+ PKS    KK+P          +SSS+++SS+SS P SP  ++S P +            P+P R +     + +K P    ++  N  
Subjt:  PSSLKIRAFLSLPKSV---KKNP---------LSSSSASSSSSSSKPSSPITAISSPRISTQVPDYCVHQIPTPVRFSYRWKKSQSKVPHCSYHLQGNPS

Query:  HIYLIHNGNHWQ
        +IYL+HNGNHWQ
Subjt:  HIYLIHNGNHWQ

AT3G26510.2 Octicosapeptide/Phox/Bem1p family protein3.1e-4452.36Show/hide
Query:  SDKTATIKFLCSYGGKILPRYPDGKLRYIGGETRVLAVDRSIPFSELLSKLGQLCG-----TCVSLRCQLPSEDLDALVSITSDEDLANLIEEYDRAASP
        S   +TIKFLCSYGGKILPRYPDGKLRY GG TRVLAV RS+ FSEL SK+ ++CG       V++RCQLP+EDLDALVSITSDEDL NLIEEYD  +S 
Subjt:  SDKTATIKFLCSYGGKILPRYPDGKLRYIGGETRVLAVDRSIPFSELLSKLGQLCG-----TCVSLRCQLPSEDLDALVSITSDEDLANLIEEYDRAASP

Query:  PSSLKIRAFLSLPKSV---KKNP---------LSSSSASSSSSSSKPSSPITAISSPRISTQVPDYCVHQIPTPVRFSYRWKKSQSKVPHCSYHLQGNPS
         S +KIR FL+ PKS    KK+P          +SSS+++SS+SS P SP  ++S P +            P+P R +     + +K P    ++  N  
Subjt:  PSSLKIRAFLSLPKSV---KKNP---------LSSSSASSSSSSSKPSSPITAISSPRISTQVPDYCVHQIPTPVRFSYRWKKSQSKVPHCSYHLQGNPS

Query:  HIYLIHNGNHWQ
        +IYL+HNGNHWQ
Subjt:  HIYLIHNGNHWQ

AT3G26510.3 Octicosapeptide/Phox/Bem1p family protein3.1e-4452.36Show/hide
Query:  SDKTATIKFLCSYGGKILPRYPDGKLRYIGGETRVLAVDRSIPFSELLSKLGQLCG-----TCVSLRCQLPSEDLDALVSITSDEDLANLIEEYDRAASP
        S   +TIKFLCSYGGKILPRYPDGKLRY GG TRVLAV RS+ FSEL SK+ ++CG       V++RCQLP+EDLDALVSITSDEDL NLIEEYD  +S 
Subjt:  SDKTATIKFLCSYGGKILPRYPDGKLRYIGGETRVLAVDRSIPFSELLSKLGQLCG-----TCVSLRCQLPSEDLDALVSITSDEDLANLIEEYDRAASP

Query:  PSSLKIRAFLSLPKSV---KKNP---------LSSSSASSSSSSSKPSSPITAISSPRISTQVPDYCVHQIPTPVRFSYRWKKSQSKVPHCSYHLQGNPS
         S +KIR FL+ PKS    KK+P          +SSS+++SS+SS P SP  ++S P +            P+P R +     + +K P    ++  N  
Subjt:  PSSLKIRAFLSLPKSV---KKNP---------LSSSSASSSSSSSKPSSPITAISSPRISTQVPDYCVHQIPTPVRFSYRWKKSQSKVPHCSYHLQGNPS

Query:  HIYLIHNGNHWQ
        +IYL+HNGNHWQ
Subjt:  HIYLIHNGNHWQ

AT3G26510.4 Octicosapeptide/Phox/Bem1p family protein3.1e-4452.36Show/hide
Query:  SDKTATIKFLCSYGGKILPRYPDGKLRYIGGETRVLAVDRSIPFSELLSKLGQLCG-----TCVSLRCQLPSEDLDALVSITSDEDLANLIEEYDRAASP
        S   +TIKFLCSYGGKILPRYPDGKLRY GG TRVLAV RS+ FSEL SK+ ++CG       V++RCQLP+EDLDALVSITSDEDL NLIEEYD  +S 
Subjt:  SDKTATIKFLCSYGGKILPRYPDGKLRYIGGETRVLAVDRSIPFSELLSKLGQLCG-----TCVSLRCQLPSEDLDALVSITSDEDLANLIEEYDRAASP

Query:  PSSLKIRAFLSLPKSV---KKNP---------LSSSSASSSSSSSKPSSPITAISSPRISTQVPDYCVHQIPTPVRFSYRWKKSQSKVPHCSYHLQGNPS
         S +KIR FL+ PKS    KK+P          +SSS+++SS+SS P SP  ++S P +            P+P R +     + +K P    ++  N  
Subjt:  PSSLKIRAFLSLPKSV---KKNP---------LSSSSASSSSSSSKPSSPITAISSPRISTQVPDYCVHQIPTPVRFSYRWKKSQSKVPHCSYHLQGNPS

Query:  HIYLIHNGNHWQ
        +IYL+HNGNHWQ
Subjt:  HIYLIHNGNHWQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTCGGAGCCTCCCACCGTACCGGCGGAATCGCTGCCAGATCTGACAAAACTGCGACTATCAAATTTCTCTGCAGTTATGGAGGCAAGATCCTTCCTCGTTAT
CCTGATGGAAAACTCCGTTATATTGGTGGTGAAACTCGTGTCCTTGCCGTCGATCGTTCCATTCCTTTCTCTGAACTGTTGTCGAAGCTGGGACAATTGTGTGGA
ACGTGTGTGAGTCTTCGTTGTCAATTGCCTTCTGAGGATCTGGATGCTTTGGTATCGATAACCTCCGATGAGGACCTAGCGAATCTCATCGAAGAATACGATCGA
GCAGCCTCGCCGCCGTCTTCTTTGAAGATCAGAGCGTTCCTTTCACTGCCTAAATCCGTGAAAAAAAATCCTCTATCTAGTTCGTCGGCCTCTTCATCATCGTCG
TCCTCAAAACCTTCATCTCCAATAACTGCTATATCCTCTCCAAGGATCTCAACGCAGGTTCCTGACTATTGCGTCCATCAGATCCCAACGCCAGTGAGGTTTTCG
TACCGCTGGAAGAAGTCTCAATCAAAGGTTCCTCACTGCTCGTATCATCTTCAAGGGAACCCTAGCCATATCTACCTCATCCACAATGGCAATCACTGGCAATAA
mRNA sequenceShow/hide mRNA sequence
GCGAGCAATTACTTCTATAAATAGTGGGCAAGGCGTACCAATACCAGGCTTGGAGAAGCAACTCCAAATCTTCTTCTTCGGTCTCTTCTTTCTCTACATTCCTAG
TCCCTCCTTTTCAATCTAAACTGAACCGATATCTTCGGCTTCAAACGAAGAGGAAAATGGTCGGAGCCTCCCACCGTACCGGCGGAATCGCTGCCAGATCTGACA
AAACTGCGACTATCAAATTTCTCTGCAGTTATGGAGGCAAGATCCTTCCTCGTTATCCTGATGGAAAACTCCGTTATATTGGTGGTGAAACTCGTGTCCTTGCCG
TCGATCGTTCCATTCCTTTCTCTGAACTGTTGTCGAAGCTGGGACAATTGTGTGGAACGTGTGTGAGTCTTCGTTGTCAATTGCCTTCTGAGGATCTGGATGCTT
TGGTATCGATAACCTCCGATGAGGACCTAGCGAATCTCATCGAAGAATACGATCGAGCAGCCTCGCCGCCGTCTTCTTTGAAGATCAGAGCGTTCCTTTCACTGC
CTAAATCCGTGAAAAAAAATCCTCTATCTAGTTCGTCGGCCTCTTCATCATCGTCGTCCTCAAAACCTTCATCTCCAATAACTGCTATATCCTCTCCAAGGATCT
CAACGCAGGTTCCTGACTATTGCGTCCATCAGATCCCAACGCCAGTGAGGTTTTCGTACCGCTGGAAGAAGTCTCAATCAAAGGTTCCTCACTGCTCGTATCATC
TTCAAGGGAACCCTAGCCATATCTACCTCATCCACAATGGCAATCACTGGCAATAACACCAACTAGATTGAGGGAATTCAAAGCAAACATAAATATAAGATAAAA
CGAAATAGAAAAAAGGAAATTCCAAGCAACAAAAAGGTTGTATGATGTAAGGAGATTGTATATGTCGCTAAGGATTAGATTTTGATCCATTAGCCATTAATGAAA
TTCGGTCCTATTTCGTTTTGCAGCCGTAGCTATTGTGATGGTGAGTTTCTTCGTCGAAATATCAGAATCCATTCCCAACTTGCATTACAGCTTCCATTTACCATT
AGGTCCGTCCATCACACGTATTTCCTCTCTGCCAGTAAGAGTCGAGCAAATTCCACTCATGGAGTATCCGTAGTTTTAGGTCTGATTGTTGCATTACAGCTTGTG
CACTATACGCAACTTTTACTCTGGCACAACTGGCTTCGTTATTGGTTCTTATTTCCTTCATTCTCTTTTGAAAACCTTAGTTTAATTTTACACAATTTTTTTTTT
CAAAATAGGATTATGATTCTTGTGTATTGTGGATTCCAGTTTCCATTGTCTTTGATTAAATGATGGATTATATAGTTCTTTTTTTTGTTGGATGGTCTATCTTAC
AAATCTGATTTTGCAGGATTTAAGAGACGACTCCATTGGACCAAACTAAGAGTTCGAGACGATAAGGAGCAACAGCTGCACTTCAGCAATTGGATGTAAGATCTC
CATCCTTGCCTAACTAAAATATGTGTTTGTGTTGCGTGTTTATAAGTGTTGGTTCTTTTTAGGTCTTGTAACATAGTTCAATTAGTCTTGCATCACTGATCTTTG
CCGTAGCATATTTGTATAGATTAGATAATATAATATTAAATTTACCTTTAATCATCAACTTATGTTAATTGGTGATTTAATATATCAAATCATGTAATCTAAAAT
TCATTTGTTATTTCAACTTTACCTTTAGATTAAGTTGTTGGATCAATGAATGAATTAAATGTATGCGCACTTGTTCACTAATGTTTAGCTCCTTTAAGGCCCTCT
AGCATAGTCCTAACCCCTCAATTCTCGGTTCTTTCAATCCTCTTACACTTGTTTATATCACCACTTCAACCTCACATCGTTTTCTTGATGAAAAGTTCACAAGTC
CTGTTCAAGGCGACATAAATTTTATATTTAAAATGTGAAGTTCCGGGAAGATGAAGGCATGTTGTTTATATTTGAATTCTATTATTAACTTAATAATATGTTAGG
TACATGTGAAGATAATCTGCTATTGTCAGAAACAGAGATCTCTTAGGTCAATTTGCTCTGTTTTCTGTACCAAGAGTTTCTTTCACGATCACAGGTATTCCTCTA
TTCTTTCTCTCTCATTTTTAATGAAATAAGACAAGATCATACTCTGGAAAGTTTTGGGAGGAATAACACTTAAAATTAAAGACCTCGAATCTGGTAGTATTAGGA
GCAAGTTGCATCCATCTATGGAGATCGGTTTGTTTCTGTATTTGGAGATTGGATTCATTCCCTGAACTTGAAAATTAGAGGTAAGTCCAAAAGTGGACAGCAAGG
ACTATGGTTGAGAATGCATTTGGGACCATATTGAGATCAGTTTTTGACAAAAAAGGACAAAGGATAGGACCCATTCATAGCATCAAGGATAAGACTTGTGGGTTA
CGCTCCAATATCTAAATTGTCACCAACAGGGAAACCCATAGAAACCAAGACAAATGAAAGGCACCAGAAATATGTTTCATTTCATCTAAAATTTTACAAGAAACG
AAATCATAATCTTATGAGGACAAGTAAATGAAACAGTTAAGTAATTCATCTCTCCATACTTGGTATGATGAGGTCCAGTCCAACTATAATGGAAATTTATTCCAT
TTCTAAATATTTGTCTTCGTTTCATCTAAAATGACCGGTAAATGTAATCTGTTTTGTAGAACCATTTGATTTTTTATTTTTAGTTTTTTAGTTTTTAGAGATTAA
GATGATGACCACCACACTTCATCACCCCATGTTTTAAAGAAAAGTCAAATTTTGAATACTAAGGTTTTGTTTGGTGCCATCTTTTTTTTTGTTTTTACTTTTTAA
AATTAAATTTACCGGACACTACTTTGTTTATATTTAATTTCTTTCTTTGTTATCCACTTTTGTAATTAATTTTTGAAAAAACAGAAGTAATTTTTAAAAGTTGTT
TATGGAATTTGGACAAAGTTTAATATATGGATGAAATAGATTTAATTTTAAAAACAAAAACAGAAATCTAATCGGTCATTGAATTAGGTCTAAAAAAATAGTTTT
AAATCTTCTTTTAATCGGCTGCTAACGTAAGATGCAAAGTTGCATAATTTTAAAAACAAAGAACTAAAGAAACAAAATGGTTATTGGATTTTAAGTAAAAAGAAA
AAGAAAATAAAAAAATTGGCATCATATGTGTAACGATGAGGTCCATACGTCCGGTCCACCATTTGGTGGGAACTTTCCGAACTTCCGAAGATTTTTAATTATTTT
TTTTAGAAGTAGAGGCTACTGTCAATTATATTTTCGAGGGTGAAAAATATTTTGATTCCAGAAAATAAAAAAGAAGGGTCGTTGTCATAAATAATAAAATATTAA
AACTATTTATAAAATAGTCAAACTCTTTCATTTTTATTTAATCTTTTTTAATATTTTGTTATTTCTTTTTGGCCATTCATAACAAATTTAAAAAGAAAACAAAGA
ATTTCTTTTTCCTAAAGAGAAAAAGATAAAAGTAGTATGACATATGTGTGGACCATGTTTCATGCCTTTTTTAATTAGAGTAAAAAAGAATTAGAAAAAAGAAAA
AGTAAAAGAGAAGAGGAGCTCATGTGCTCACATACAAATATATAAAGCAAGTACTCGAGTTGGATTCCATAATTATTTAGGTCTTCTTTGATTTTATTTTTTTTT
TCTTATTTCCTTTTTTAAGTGAAAAAACTCCTTTTTATAAAGGAAACCATTTATTTTGTTGTTAACCATATAATTTTTGTACGATTTAAGTGTTATTTTCTTCAA
AAAAATTTAGGTGGTCTCCAATTTGTTCTATTTTTATCTATAGAAAATTCGATATGTAATGGTTAGACATACAAATACATAATTTAGGAAATAAATAATGAAAGG
GAAATTAGTGCTTATCGTTAAATGTATGTTTTAGATTATAAAAAGGTTTCTTTTCCTTTTTTTTTTTTTTTTCCTTTTTTGATAGTAAATGGCTTCGTTTGTCGT
ATGCAAACATATGCTAATGCATTTTTTTAGTTTGTCAGTTACTAATTTCACTAATCTGATTATACAACGAATCCCTAAAAAATAGCAAAAGAGAACATTATTCGC
ATCTTTGTTGGCAAGAATAAGATGATTTGCTTTATTTTATATTATCTTTGAAATATCTTTTTTCTTAGCTGTTTTATCTTGTATCTTTTTCTATTTGATAAAAAA
TTCAAGATAATTTAAATGATGATCATGTTAATCCAATCCTCATTTCTAATATATCTTTTTTAGTGAACTAATTATTTTAAATTTCTAATTATTTATTATTATTTA
ATATAATGTTGATGGGTGTGGTCGTAATTATCTTAAGAGTTGCATCATCACATATTATATGGTTTAATTTGCTTCTTCGGCCATAGTTTTGATCTCCCGTCTCTG
TGGACCCATGATAGTCGCAGTAAAGAAAATTTGTCCTTTGTCTTTTGGTTCGGATTATAAGAACCACAAGGGCTTGTGCCATCATATCGGATATCATCTATTTTT
GTGTACCTCTTCTTATAATTCACATAAAAAAAATTAAAGTATTTTAGAAAAAATGTAAAAGAAAAACAAAATTTATCGCACCTAAAAGAGTCAGAAGCAAAAGAT
TTCTATTTGCCGGGTACTTCCATTTGATTTCATTTCCCCTTCTTTTTATTTTTCAGTAAAACTTGTGAGCTTCCCTGTGTATATGTTCTTCACGTTTTCCCTTGA
GAAAACAAAACTTGGATTAGTATAATTTAGGAAAGCCGAAGTGTCATTCGTTCGGCTATGAGCCCTAAAGCTTTCAAAGATTTGAGTCATCTGACTCAATGGGTG
GACCATAATGATAGAAATTAGATGAAATGACATATCATCTAGGATTTGATGATAAGGAGTTGGAATTCCTGTCATGTTTCGTACAAAATAGAAATGGCACCACCC
ACCATCCACAAAGATTTGGTGATGAGGTGAAATTATGTTAGTGGTATCACATCACCTAATCCAAATATCTTTCAACTTTAAAAACTTTGAAGCCACAGTATTATA
ATCTGGACAAAAAATTAACCAGGAGTCATGTCTATGTATCATTATCTACTTTGCTAAATTCAGATTGGTTTGTGACCTTTTAAATTTTCTGCAATTATTAGTAAG
GTCGGATTTAGTTAAAGTCCAAGGGGTTCAAACCCCCTCAACTTTATATACTCTTTATATATGTATACATACATATACATATATTTACTTCGAGCTTCTTCAATT
ATATATATTACAAATTGCCATAATATCAAAAGTGTTAGCCCAATAGTATATCCATCTTCCAGTCCTTTTTAAAGTTTTATATAAAGTCACCGTCAACAAAAAAAA
TTTAATGTCATTGATTGTCAGTCACACATTCAGTTGAGTAAGTTTTATGAGGTACATTGGAACTCAGAAGTAGCCAATGCTCTAATCCAATGAGTTTCCTTTTTG
TTTTGTATAGAGATTAAACACTTTACTGACATCAGCTTGTTCAAATCAACCAAATTATCTAATTGAGTACACCTTGTTAACCCTGCTTGAATCAAGTCAAAAGAT
TATATATAGATACGCCTTGTTAACTATTGAAAGCACATCCAACCATCAATTACCTTCTAGCAGCTAATCAAGATAGTTTCCTCGAATACAACATTGAAAATATAT
ATTGGGTTACACTCTACCTAGCTGTTTCCGTTATGTGAAAATCTAGTCCTTTTTTTCTTTTTCTTGAATAATTATTAGAGTTTTTTCTATTGATGGTTGGGGTGA
AACCACAAAAAGATGAGTACTTTTAAGTATTGGATGTTATGAATGCACTTCCTCAAAATGATGTGGCTGTCATGGATTGGACAACTAGATTTGGATTTAACTAAA
CAAAATAACCTTCAGCCAAAAAAAAAAAAAAACTTTCGAACTATTATAACTCTCATCTATAAGAGTGTTGGGTAATCTAAAATGCTATTATCAACATCAAGAGTT
CTCTTCTAAGTTTAGAAATCATACTTTTACCATGGTTATTATCCAGACAATTAATTTTGTTTATTAGATAAGGATATGCATAATATCAGTTTATGAACTATTTAG
AAATGTGATCATCCCAATTTGAGATAAAGATTCCTACACATAAACAGTTGCCAGTTTGTTACTTGAGTTCCTTCCAGGAAAAAAGTTACCATCTTGTCAAGTAAG
TTTATCGTTGGTTCTGTGAGAGTTTAGGTACCCACTAATTGTTATGACCATCTACGCAAGATTGATTATCAGATGGAATAGTTGTGCGTACAAATGCCACATTCT
AGTACCAAGTCAATCAGATATCATTTTTTATTTGGACCAAGTCAATCTCGTTTGGACATTGATGAAACAACCTTAGCTGAAGACTTTCTAATAGTTTTATCTCCA
GGGGTCATCATAAGATAAATTATAATATGAAGTAACAGTACAAAAGCACTTATGATTATGTGTGCATATATTGTTTGTGGATCAAATAATTAGATGGATACTTGG
CTTAACAATTTATATGCTTTAGGCCCATTTGCGATTGTACTATTTTCTCATGCCTGATTCTTTATTGGACTCAAGTAACCCTCTCAACAAGAGATCCGAGGAACA
CGGGGACTAGTTGAAGTAATGAGCTCTAAATCATTTTGTCTTTTTAGTTGGAGTTGGAACTTTAGGCGGTTCACGAAAAAAGATAATGATCCTAGAGTCGAGACT
AAGAGGAATGTATAAACCAATGCTTCCACGAATTAGTATAGGCAAAATATAGTAATTAGATTCAAAATAATTAAATATATAACAATATTTTTAAAAAGTTACAAA
TATAATATCATTGATAGACTATATTACAAATATTGATCTATCACTAATAAATCATAAAAGTCTGTCATTGATATACTTTGTTATATTTACAATTTTTGAAAAATG
TTCCTATATACTTGATTATTATTTGTACAATTGTTGTCCATTCTAATTACCCTTTAATATAGTGTCGTGTTGAGTTTCAATTCTCAAGCCTCATATTTGAGACTT
TTCAAGAAGATTGATTTAATGACATCACGACAAACACCGTGTTTACATGTTTTCTATTTTTCAATATTCTTCTTTGACCTTGTATGATGTTGGTAAACATTATAT
TGAATACTGAATGTTAATATTTTGGATAAGAAATATCGAATGTTTATTTTTGGGTTATATGTCTTTTGTGTTTCAGAATCTAAAATCTTATTAGTGTAAAGTATT
ATCTAGTTGCAG
Protein sequenceShow/hide protein sequence
MVGASHRTGGIAARSDKTATIKFLCSYGGKILPRYPDGKLRYIGGETRVLAVDRSIPFSELLSKLGQLCGTCVSLRCQLPSEDLDALVSITSDEDLANLIEEYDR
AASPPSSLKIRAFLSLPKSVKKNPLSSSSASSSSSSSKPSSPITAISSPRISTQVPDYCVHQIPTPVRFSYRWKKSQSKVPHCSYHLQGNPSHIYLIHNGNHWQ