; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

PI0006448 (gene) of Melon (PI 482460) v1 genome

Gene IDPI0006448
OrganismCucumis metuliferus PI 482460 (Melon (PI 482460) v1)
Descriptiontrihelix transcription factor ASIL2
Genome locationchr10:22029734..22032763
RNA-Seq ExpressionPI0006448
SyntenyPI0006448
Gene Ontology termsGO:0005634 - nucleus (cellular component)
GO:0000976 - transcription regulatory region sequence-specific DNA binding (molecular function)
InterPro domainsIPR044823 - Trihelix transcription factor ASIL1/2-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6601124.1 Trihelix transcription factor ASIL2, partial [Cucurbita argyrosperma subsp. sororia]8.2e-6771.43Show/hide
Query:  MKKRYRSESASAASSWPLYHRLHLLLRGNTNLTPPPPPPLVDLVDPPPPSPSPPPFLPPQNSHGSNGVDNINPSPKEDGVDNGRGGDELLSEKNKNKNNN
        MKKRYRSESASAASSWPLYHRLHLLLRGNT   PPPPP  V L+DPPPP P+PPPFLPPQNSHGSNGVD IN  PKEDGVDNGR              ++
Subjt:  MKKRYRSESASAASSWPLYHRLHLLLRGNTNLTPPPPPPLVDLVDPPPPSPSPPPFLPPQNSHGSNGVDNINPSPKEDGVDNGRGGDELLSEKNKNKNNN

Query:  NNNNKKKKMVTEKTDSSSSTPAAIVYSTSDEKEKGVAAMRPKLQQSKM----KKKKRRRSGEVDSLEQIAGSIRWLAEVVVRSEQARMEMIKDIEKMRAE
          + K KKMV E   + SSTP AIVYS  D+      +MRPK QQ+KM    KK K  R    DSLEQIAGSIRWLAEVVVRSEQARMEMIKDIEKMRAE
Subjt:  NNNNKKKKMVTEKTDSSSSTPAAIVYSTSDEKEKGVAAMRPKLQQSKM----KKKKRRRSGEVDSLEQIAGSIRWLAEVVVRSEQARMEMIKDIEKMRAE

Query:  AEAKRGELDLKRTQIIANTQLEIAKLFASPTKPLVDHSSLRIART
        AEAKRGE+DLKRTQIIANTQLEIAKLFAS TKP+   SSLRI RT
Subjt:  AEAKRGELDLKRTQIIANTQLEIAKLFASPTKPLVDHSSLRIART

XP_004142481.2 trihelix transcription factor ASIL2 [Cucumis sativus]3.6e-8681.56Show/hide
Query:  MKKRYRSESASAASSWPLYHRLHLLLRGNTNLTPPPPPPLVDLVDPPPPSPSPPPFLPPQNSHGSNGVDNINPSPKEDGVDNGRGGDELLSEKNKNKNNN
        MKK+ R E A   SSWPLYHR+  L+ GNTNLT                 PSPPP LPPQNSHGSNGVDNINPSPKEDGVDNGRG DE+LSEKN+N NN+
Subjt:  MKKRYRSESASAASSWPLYHRLHLLLRGNTNLTPPPPPPLVDLVDPPPPSPSPPPFLPPQNSHGSNGVDNINPSPKEDGVDNGRGGDELLSEKNKNKNNN

Query:  NNNNKKKKMVTEK-TDSSSSTPAAIVYSTSDEKEKGVAAMRPKLQQSKMKKKKRRRSGEVDSLEQIAGSIRWLAEVVVRSEQARMEMIKDIEKMRAEAEA
            KKKKMV EK TDSSSSTPAAI+YS+SDEKEKGVAAMRPKLQQSKMKKKKRRRSGEVDSLEQIAGSIRWLAEVVVRSEQARMEMIKDIEKMRAEAEA
Subjt:  NNNNKKKKMVTEK-TDSSSSTPAAIVYSTSDEKEKGVAAMRPKLQQSKMKKKKRRRSGEVDSLEQIAGSIRWLAEVVVRSEQARMEMIKDIEKMRAEAEA

Query:  KRGELDLKRTQIIANTQLEIAKLFASPT-KPLVDHSS-LRIART
        KRGELDLKRTQIIANTQLEIAKLFASPT KPLVDHSS LRIART
Subjt:  KRGELDLKRTQIIANTQLEIAKLFASPT-KPLVDHSS-LRIART

XP_008446967.1 PREDICTED: LOW QUALITY PROTEIN: trihelix transcription factor ASIL2 [Cucumis melo]7.6e-9788.62Show/hide
Query:  MKKRYRSESASAASSWPLYHRLHLLLRGNTNLTPP--PPPPLVDLVDPPPPSPSPPPFLPPQNSHGSNGVDNINPSPKEDGVDNGRGGDELLSEKNKNKN
        MKK  R+  ASAASSWP  HR+   L GN+NLTPP  PPPPLVDLVDPPPP PSPPPFL PQNSHGSNGVDNINPSPKEDGVDNGRGGDELLSEKNKNKN
Subjt:  MKKRYRSESASAASSWPLYHRLHLLLRGNTNLTPP--PPPPLVDLVDPPPPSPSPPPFLPPQNSHGSNGVDNINPSPKEDGVDNGRGGDELLSEKNKNKN

Query:  NNNNNNKKKKMVTEKTDSSSSTPAAIVYSTSDEK-EKGVAAMRPKLQQSKMKKKK-RRRSGEVDSLEQIAGSIRWLAEVVVRSEQARMEMIKDIEKMRAE
         NN+NN KKKMV EKTDSSSSTPAAI YSTSDEK  KGVAAMR KLQQSKMKKKK RRRSGEVDSLE+IAGSIRWLAEVVVRSEQARMEMIKDIEKMRAE
Subjt:  NNNNNNKKKKMVTEKTDSSSSTPAAIVYSTSDEK-EKGVAAMRPKLQQSKMKKKK-RRRSGEVDSLEQIAGSIRWLAEVVVRSEQARMEMIKDIEKMRAE

Query:  AEAKRGELDLKRTQIIANTQLEIAKLFASP-TKPLVDHSSLRIART
        AEAKRGELDLKRTQIIANTQLEIAKLFASP TKPLVDHSSLRIART
Subjt:  AEAKRGELDLKRTQIIANTQLEIAKLFASP-TKPLVDHSSLRIART

XP_022988920.1 trihelix transcription factor ASIL1-like isoform X1 [Cucurbita maxima]2.0e-6569.8Show/hide
Query:  MKKRYRSESASAASSWPLYHRLHLLLRGNTNLTPPPPPPLVDLVDPPPPSPSPPPFLPPQNSHGSNGVDNINPSPKEDGVDNGRGGDELLSEKNKNKNNN
        MKKRYRSESASAASSWPLYHRLHLLLRGNT   PPPPP  V L+DPPPP P+PPPFLP QNSHGSNGVD IN  PKEDGVDNGR              ++
Subjt:  MKKRYRSESASAASSWPLYHRLHLLLRGNTNLTPPPPPPLVDLVDPPPPSPSPPPFLPPQNSHGSNGVDNINPSPKEDGVDNGRGGDELLSEKNKNKNNN

Query:  NNNNKKKKMVTEKTDSSSSTPAAIVYSTSDEKEKGVAAMRPKLQQSKMKKKKRR----RSGEVDSLEQIAGSIRWLAEVVVRSEQARMEMIKDIEKMRAE
          + + KKMV E   + SSTP AIVYS  D+      +MRPK QQ+KMK  K++    R    DSLEQIAGSIRWLA+VVVRSEQARMEMIKDIEKMRAE
Subjt:  NNNNKKKKMVTEKTDSSSSTPAAIVYSTSDEKEKGVAAMRPKLQQSKMKKKKRR----RSGEVDSLEQIAGSIRWLAEVVVRSEQARMEMIKDIEKMRAE

Query:  AEAKRGELDLKRTQIIANTQLEIAKLFASPTKPLVDHSSLRIART
        AEAKRGE+DLKRTQIIANTQLEIAKLFAS TKP+   SSLRI RT
Subjt:  AEAKRGELDLKRTQIIANTQLEIAKLFASPTKPLVDHSSLRIART

XP_038893233.1 trihelix transcription factor ASIL1 [Benincasa hispida]8.2e-6770.98Show/hide
Query:  MKKRYRSESASAASSWPLYHRLHLLLRGNTNLTPP-------PPPPLVDLVD--PPPPSPSPPPFLPPQNSHGSNGVDNINPSPKEDGVDNGRGGD-ELL
        MKKRYRSESAS AS+WPLY+RLHLLLRGNT LTPP       PPPP + LVD  PPPP PSPPPFLP QNSHGSNG+D IN  PKEDGVDNGRG + + L
Subjt:  MKKRYRSESASAASSWPLYHRLHLLLRGNTNLTPP-------PPPPLVDLVD--PPPPSPSPPPFLPPQNSHGSNGVDNINPSPKEDGVDNGRGGD-ELL

Query:  SEKNKNKNNNNNNNKKKKMVTEKTDSSSSTPAAIVYSTSDEKEKGVAAMRPKLQQSKMKKKKRRRSGEV----DSLEQIAGSIRWLAEVVVRSEQARMEM
        SEKN            KKMV    D+ SSTP AIVYS+  EKEK   AMRPK Q +KMK  K++++  +    DSLEQIAGSIRWLAEVVVRSEQARMEM
Subjt:  SEKNKNKNNNNNNNKKKKMVTEKTDSSSSTPAAIVYSTSDEKEKGVAAMRPKLQQSKMKKKKRRRSGEV----DSLEQIAGSIRWLAEVVVRSEQARMEM

Query:  IKDIEKMRAEAEAKRGELDLKRTQIIANTQLEIAKLFASPTKPLVDHSSLRIART
        IKDIEKMRAEAEAKRGE+DLKRTQIIANTQLEIAKLFAS TKPL   SSLRI RT
Subjt:  IKDIEKMRAEAEAKRGELDLKRTQIIANTQLEIAKLFASPTKPLVDHSSLRIART

TrEMBL top hitse value%identityAlignment
A0A0A0KRQ8 Uncharacterized protein1.7e-8681.56Show/hide
Query:  MKKRYRSESASAASSWPLYHRLHLLLRGNTNLTPPPPPPLVDLVDPPPPSPSPPPFLPPQNSHGSNGVDNINPSPKEDGVDNGRGGDELLSEKNKNKNNN
        MKK+ R E A   SSWPLYHR+  L+ GNTNLT                 PSPPP LPPQNSHGSNGVDNINPSPKEDGVDNGRG DE+LSEKN+N NN+
Subjt:  MKKRYRSESASAASSWPLYHRLHLLLRGNTNLTPPPPPPLVDLVDPPPPSPSPPPFLPPQNSHGSNGVDNINPSPKEDGVDNGRGGDELLSEKNKNKNNN

Query:  NNNNKKKKMVTEK-TDSSSSTPAAIVYSTSDEKEKGVAAMRPKLQQSKMKKKKRRRSGEVDSLEQIAGSIRWLAEVVVRSEQARMEMIKDIEKMRAEAEA
            KKKKMV EK TDSSSSTPAAI+YS+SDEKEKGVAAMRPKLQQSKMKKKKRRRSGEVDSLEQIAGSIRWLAEVVVRSEQARMEMIKDIEKMRAEAEA
Subjt:  NNNNKKKKMVTEK-TDSSSSTPAAIVYSTSDEKEKGVAAMRPKLQQSKMKKKKRRRSGEVDSLEQIAGSIRWLAEVVVRSEQARMEMIKDIEKMRAEAEA

Query:  KRGELDLKRTQIIANTQLEIAKLFASPT-KPLVDHSS-LRIART
        KRGELDLKRTQIIANTQLEIAKLFASPT KPLVDHSS LRIART
Subjt:  KRGELDLKRTQIIANTQLEIAKLFASPT-KPLVDHSS-LRIART

A0A1S3BGZ0 LOW QUALITY PROTEIN: trihelix transcription factor ASIL23.7e-9788.62Show/hide
Query:  MKKRYRSESASAASSWPLYHRLHLLLRGNTNLTPP--PPPPLVDLVDPPPPSPSPPPFLPPQNSHGSNGVDNINPSPKEDGVDNGRGGDELLSEKNKNKN
        MKK  R+  ASAASSWP  HR+   L GN+NLTPP  PPPPLVDLVDPPPP PSPPPFL PQNSHGSNGVDNINPSPKEDGVDNGRGGDELLSEKNKNKN
Subjt:  MKKRYRSESASAASSWPLYHRLHLLLRGNTNLTPP--PPPPLVDLVDPPPPSPSPPPFLPPQNSHGSNGVDNINPSPKEDGVDNGRGGDELLSEKNKNKN

Query:  NNNNNNKKKKMVTEKTDSSSSTPAAIVYSTSDEK-EKGVAAMRPKLQQSKMKKKK-RRRSGEVDSLEQIAGSIRWLAEVVVRSEQARMEMIKDIEKMRAE
         NN+NN KKKMV EKTDSSSSTPAAI YSTSDEK  KGVAAMR KLQQSKMKKKK RRRSGEVDSLE+IAGSIRWLAEVVVRSEQARMEMIKDIEKMRAE
Subjt:  NNNNNNKKKKMVTEKTDSSSSTPAAIVYSTSDEK-EKGVAAMRPKLQQSKMKKKK-RRRSGEVDSLEQIAGSIRWLAEVVVRSEQARMEMIKDIEKMRAE

Query:  AEAKRGELDLKRTQIIANTQLEIAKLFASP-TKPLVDHSSLRIART
        AEAKRGELDLKRTQIIANTQLEIAKLFASP TKPLVDHSSLRIART
Subjt:  AEAKRGELDLKRTQIIANTQLEIAKLFASP-TKPLVDHSSLRIART

A0A6J1CC67 uncharacterized protein LOC1110101441.3e-4657.03Show/hide
Query:  MKKRYRSES-----ASAASSWPLYHRLHLLLRGNTNLTP---PPPPPLVDLVDPPPP-------SPSPPPFLPPQNSHGSNGVDNINPSPKEDGVDNGRG
        MKKRYRSES     A+AAS+WPLY+RL LLLRGNT   P   PPPP    + +  PP        P+PPPF+  QNSHGSNGVD IN  PKED ++    
Subjt:  MKKRYRSES-----ASAASSWPLYHRLHLLLRGNTNLTP---PPPPPLVDLVDPPPP-------SPSPPPFLPPQNSHGSNGVDNINPSPKEDGVDNGRG

Query:  GDELLSEKNKNKNNNNNNNKKKKMVTEKTDSSSSTPAAIVYSTSDEKEKGVAAMRPKLQQSKMKKKKRRRSGEVDSLEQIAGSIRWLAEVVVRSEQARME
            LS+   +KNNN+NNN          ++ SSTP AI+YS  D+K      +R K Q    K KK++R  E   L+QIA SIRWLAEVVVRSEQARM+
Subjt:  GDELLSEKNKNKNNNNNNNKKKKMVTEKTDSSSSTPAAIVYSTSDEKEKGVAAMRPKLQQSKMKKKKRRRSGEVDSLEQIAGSIRWLAEVVVRSEQARME

Query:  MIKDIEKMRAEAEAKRGELDLKRTQIIANTQLEIAKLFASPTKPLVDHSSLRIART
        MI+DIE+MRAEAEAKRGE+DLKRT+IIANTQLEIAKLFA+  K  VD SSLRI R+
Subjt:  MIKDIEKMRAEAEAKRGELDLKRTQIIANTQLEIAKLFASPTKPLVDHSSLRIART

A0A6J1GZB5 trihelix transcription factor ASIL25.4e-6469.8Show/hide
Query:  MKKRYRSESASAASSWPLYHRLHLLLRGNTNLTPPPPPPLVDLVDPPPPSPSPPPFLPPQNSHGSNGVDNINPSPKEDGVDNGRGGDELLSEKNKNKNNN
        MKKRYRSESASAASSWPLYHRLHLLLRGNT   PPPPP  V L+DPPPP P+PPPFLP QNSHGSNG D IN  PKEDGVDNGR              ++
Subjt:  MKKRYRSESASAASSWPLYHRLHLLLRGNTNLTPPPPPPLVDLVDPPPPSPSPPPFLPPQNSHGSNGVDNINPSPKEDGVDNGRGGDELLSEKNKNKNNN

Query:  NNNNKKKKMVTEKTDSSSSTPAAIVYSTSDEKEKGVAAMRPKLQQSKM----KKKKRRRSGEVDSLEQIAGSIRWLAEVVVRSEQARMEMIKDIEKMRAE
          + K KKMV E   + SSTP AIVYS  D+      +MRPK QQ+KM    KK K  R    DSLEQIAGSIRWLAEVVVRSEQARMEMIKDIEKMRAE
Subjt:  NNNNKKKKMVTEKTDSSSSTPAAIVYSTSDEKEKGVAAMRPKLQQSKM----KKKKRRRSGEVDSLEQIAGSIRWLAEVVVRSEQARMEMIKDIEKMRAE

Query:  AEAKRGELDLKRTQIIANTQLEIAKLFASPTKPLVDHSSLRIART
        AEAKRGE+DLKRTQIIANTQLEIAKLFAS T P+   SS RI RT
Subjt:  AEAKRGELDLKRTQIIANTQLEIAKLFASPTKPLVDHSSLRIART

A0A6J1JKX8 trihelix transcription factor ASIL1-like isoform X19.8e-6669.8Show/hide
Query:  MKKRYRSESASAASSWPLYHRLHLLLRGNTNLTPPPPPPLVDLVDPPPPSPSPPPFLPPQNSHGSNGVDNINPSPKEDGVDNGRGGDELLSEKNKNKNNN
        MKKRYRSESASAASSWPLYHRLHLLLRGNT   PPPPP  V L+DPPPP P+PPPFLP QNSHGSNGVD IN  PKEDGVDNGR              ++
Subjt:  MKKRYRSESASAASSWPLYHRLHLLLRGNTNLTPPPPPPLVDLVDPPPPSPSPPPFLPPQNSHGSNGVDNINPSPKEDGVDNGRGGDELLSEKNKNKNNN

Query:  NNNNKKKKMVTEKTDSSSSTPAAIVYSTSDEKEKGVAAMRPKLQQSKMKKKKRR----RSGEVDSLEQIAGSIRWLAEVVVRSEQARMEMIKDIEKMRAE
          + + KKMV E   + SSTP AIVYS  D+      +MRPK QQ+KMK  K++    R    DSLEQIAGSIRWLA+VVVRSEQARMEMIKDIEKMRAE
Subjt:  NNNNKKKKMVTEKTDSSSSTPAAIVYSTSDEKEKGVAAMRPKLQQSKMKKKKRR----RSGEVDSLEQIAGSIRWLAEVVVRSEQARMEMIKDIEKMRAE

Query:  AEAKRGELDLKRTQIIANTQLEIAKLFASPTKPLVDHSSLRIART
        AEAKRGE+DLKRTQIIANTQLEIAKLFAS TKP+   SSLRI RT
Subjt:  AEAKRGELDLKRTQIIANTQLEIAKLFASPTKPLVDHSSLRIART

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G44730.1 Alcohol dehydrogenase transcription factor Myb/SANT-like family protein3.3e-0538.16Show/hide
Query:  DSLEQIAGSIRWLAEVVVRSEQARMEMIKDIEKMRAEAEAKRGELDLKRTQIIANTQLEIAKLFASPTKPLVDHSS
        D + +IA +I+ L + +VR+EQ RMEM ++IE MR + E       +KRT++I  +Q  I + FA   K L D+++
Subjt:  DSLEQIAGSIRWLAEVVVRSEQARMEMIKDIEKMRAEAEAKRGELDLKRTQIIANTQLEIAKLFASPTKPLVDHSS

AT3G54390.1 sequence-specific DNA binding transcription factors9.5e-2945.28Show/hide
Query:  MKKRYRSESASA-ASSWPLYHRLHLLLRGNTNLTPPPPP----------PLVDLVDPPPPSPSPPPFLPPQNSHGSNGVDNINPSPKEDGVDNGRGGDEL
        MKKRYRSESA+A  SSWPLY RL  LLRG     P P P          PL+ L++PP P+ +     PPQ S+GSNGV  I   PKEDG          
Subjt:  MKKRYRSESASA-ASSWPLYHRLHLLLRGNTNLTPPPPP----------PLVDLVDPPPPSPSPPPFLPPQNSHGSNGVDNINPSPKEDGVDNGRGGDEL

Query:  LSEKNKNKNNNNNNNKKKKMVTEKTDSSSSTPAAIVYSTSDEKEKGVAAMRPKLQQSKMKKKKRRRSGEVDSLEQIAGSIRWLAEVVVRSEQARMEMIKD
           K +NK   +           + D+ SSTP                 ++ K++  K+K++ +      +  E+IAGSIRWLAEVV+RSE+ARME +K+
Subjt:  LSEKNKNKNNNNNNNKKKKMVTEKTDSSSSTPAAIVYSTSDEKEKGVAAMRPKLQQSKMKKKKRRRSGEVDSLEQIAGSIRWLAEVVVRSEQARMEMIKD

Query:  IEKMRAEAEAKRGELDLKRTQIIANTQLEIAKLFASPTKPLVD---HSSLRIAR
        IE+MRAEAEAKRGELDLKRT+I+ANTQLEIA++FA+      +    SSLRI R
Subjt:  IEKMRAEAEAKRGELDLKRTQIIANTQLEIAKLFASPTKPLVD---HSSLRIAR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAAAAAAGGTACCGTTCTGAATCTGCTTCCGCTGCCTCTTCTTGGCCTTTGTATCACCGTCTTCATCTTTTGCTTAGAGGAAACACTAATCTCACCCCACCACCACC
ACCACCACTTGTTGATCTTGTCGATCCTCCTCCTCCTTCTCCCTCTCCACCGCCCTTTCTTCCGCCGCAAAATTCCCATGGATCCAATGGTGTTGATAACATTAATCCTT
CCCCTAAGGAAGATGGAGTTGATAATGGAAGAGGAGGAGATGAATTATTATCAGAGAAGAATAAGAATAAGAATAATAATAATAATAATAATAAGAAAAAGAAGATGGTA
ACAGAGAAGACAGATAGTAGTAGCAGCACACCAGCTGCAATAGTATACAGTACTAGTGATGAGAAAGAAAAGGGCGTAGCAGCAATGAGGCCTAAATTGCAACAGTCAAA
AATGAAGAAGAAAAAGAGGAGGAGAAGCGGCGAGGTGGATTCGTTGGAGCAGATTGCGGGTAGCATACGGTGGTTGGCGGAGGTGGTGGTGCGCTCGGAACAAGCTAGAA
TGGAGATGATTAAGGATATAGAAAAGATGAGAGCTGAAGCAGAGGCCAAAAGAGGGGAATTGGATCTCAAAAGAACACAGATCATTGCAAATACTCAATTGGAGATAGCT
AAGCTCTTTGCATCTCCTACCAAACCTCTTGTTGATCATTCTTCACTAAGAATTGCTAGAACTTAA
mRNA sequenceShow/hide mRNA sequence
CTCTCTCTCTCTCTTCATCACAACATATCTATATCAAATTAATTAAAACAATTAACATATACTTATGCTCTAAATTCCTACTTAATTACTTAAACTTACAGTTTTTTTAA
TTTTTAATTTTTTTTAAAAATTTTCTTATACCCTTTTATCTTTTCTGTCCTTATACTTAATATACTAGGTAGCTATATTAGAATTAATAATATCTCTATTTATTTAATCT
ATATATATTATTCTTGATTTGATATCTACGGATTGTAATCTAAACTAAAGCTACCTACCTACCTACTACAATCGCTAATACTTATTGACTAATGCCTATCATTATTGCAT
AAAAAAGAGTATTATTCTTTTTCTTTTCATTTTTCATTCATCCTCATTATTCAAATACAATCCTTGTATAAAAAGAATTATATCTACTCCCACAAACTTCTTCATTCACA
CTTCCTACTCTACCCTCTAACTATTTTTCTCTCTATTTATAATATTATTCACTTTCAAATCTTCCTCCAAATTATTTCATTTACTTTCATCCCTTATTACATATTCTACG
TATTTCTTTTAATTTCATTCTTTATTTTTTTAAACATAATGAATATAGTTGCTGTCATCAAATTGTTTAATTTGATGAAATAAAAAGTTGAAATGTTAATCCACATGCTT
ACATTAACTCAGTCTTTAATATTTACCCACACATAAAAGTTTTTTTTTTTTTTTTTTTTTTTTAAAAAAAAAAACCTTTATACGTATCCCATAAACTAAAAAGTTAGGAA
TTCCTAAACCCATAACCCAAAAAATGAAATTACCCATTTCACTCTCTTTCCTCCCTATCAATCTTTAATTCCATCTCTCTCTCTCTCTGTCTAGTTGGCCACTACCAATG
GAATTCTCTCTCTCTCTCTAGAATGCTAGTGGTTCTTTCTTTCTTTCTTTTTTTTTTTTTTTTTTCCTTTTTTTAGGTTGGTGTCACCCACTACTACCCTCCTTCTTATG
CTCAAAAGCCCCACTGTCCTAAGTTTGATATTGGTAATGCAAGAAACAAAAGAAAGGTATCGAAAGAAAGATGTTAGCAGGTAAGTAACAAGAAAAAAAACTTAGAACTA
AGTTAAAAAAACAAAAACAAAAACAAAAAAACTTGAATTTGGGATTGATCAGTAGGATGCTATAATCCCCGAGTGAGTGATTGACTCCCATGGCCATGGAGATCAAACCC
ACTCCCTCATCACCACCCACAACCTCACTACCACAAACACAAACCTCCCCTTCTCTTTTATTTAACCACCACCACCACCATCAGCAGCTTCCCGACGACAACCCATCTCC
CAAAAAAACCCTGTTTCCACCGGAGGAGGAGACCGGCTAAAACGAGACGAATGGAGCGAAGGGGCAGTGTCGACTCTGTTAGAAGCGTACGAATCAAAATGGGTACTACG
AAACAGAGCAAAATTGAAAGGGCATGATTGGGAAGATGTGGCTCGCCATGTGTCTTCAAGATCTAATTTTACCAAATCTCCCAAAACCCAAACTCAGTGTAAGAATAAAA
TTGAGTCTATGAAAAAAAGGTACCGTTCTGAATCTGCTTCCGCTGCCTCTTCTTGGCCTTTGTATCACCGTCTTCATCTTTTGCTTAGAGGAAACACTAATCTCACCCCA
CCACCACCACCACCACTTGTTGATCTTGTCGATCCTCCTCCTCCTTCTCCCTCTCCACCGCCCTTTCTTCCGCCGCAAAATTCCCATGGATCCAATGGTGTTGATAACAT
TAATCCTTCCCCTAAGGAAGATGGAGTTGATAATGGAAGAGGAGGAGATGAATTATTATCAGAGAAGAATAAGAATAAGAATAATAATAATAATAATAATAAGAAAAAGA
AGATGGTAACAGAGAAGACAGATAGTAGTAGCAGCACACCAGCTGCAATAGTATACAGTACTAGTGATGAGAAAGAAAAGGGCGTAGCAGCAATGAGGCCTAAATTGCAA
CAGTCAAAAATGAAGAAGAAAAAGAGGAGGAGAAGCGGCGAGGTGGATTCGTTGGAGCAGATTGCGGGTAGCATACGGTGGTTGGCGGAGGTGGTGGTGCGCTCGGAACA
AGCTAGAATGGAGATGATTAAGGATATAGAAAAGATGAGAGCTGAAGCAGAGGCCAAAAGAGGGGAATTGGATCTCAAAAGAACACAGATCATTGCAAATACTCAATTGG
AGATAGCTAAGCTCTTTGCATCTCCTACCAAACCTCTTGTTGATCATTCTTCACTAAGAATTGCTAGAACTTAATTAAATTCTTTTTCTTTTTTCTTTTTTCTTTTTTAT
TTAAATCCCATATATATAATGGGGGCATTAAATTATCTTAACTACCTACTACTCTCTCACCCTCATATACTTACTTAACTTTAATAATATTCTAATTATATTACTACCTA
CCACTCCCCCTTTGTTATTCAATGTAAGCTGCTTCCCTCTTTCATTTCAGCATGCATCTAAAAATGCTGTTTTGTTTAAACTTACTTTCAGAATAATTAATAATAAATTA
AGTTTTTATATTA
Protein sequenceShow/hide protein sequence
MKKRYRSESASAASSWPLYHRLHLLLRGNTNLTPPPPPPLVDLVDPPPPSPSPPPFLPPQNSHGSNGVDNINPSPKEDGVDNGRGGDELLSEKNKNKNNNNNNNKKKKMV
TEKTDSSSSTPAAIVYSTSDEKEKGVAAMRPKLQQSKMKKKKRRRSGEVDSLEQIAGSIRWLAEVVVRSEQARMEMIKDIEKMRAEAEAKRGELDLKRTQIIANTQLEIA
KLFASPTKPLVDHSSLRIART