CuGenDBv2

Tan0018148 (gene) of Snake gourd v1 genome

Gene ID: Tan0018148
Organism: Trichosanthes anguina (Snake gourd v1)
Description: dof zinc finger protein DOF5.6-like
Genome location: LG05:85015573..85016753
RNA-Seq Expression: Tan0018148
Synteny: Tan0018148
Gene Ontology terms:
    GO:0006355 - regulation of transcription, DNA-templated (biological process)
    GO:0003677 - DNA binding (molecular function)
InterPro domains: IPR003851 - Zinc finger, Dof-type
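
The genome location field follows a scaffold:start..end layout. As a minimal sketch, the span covered by the gene could be computed as shown here. The function name parse_location is hypothetical, and treating the coordinates as 1-based and inclusive is an assumption that this record does not state.

```python
import re


def parse_location(location):
    """Split 'scaffold:start..end' into (scaffold, start, end, length).

    Assumption: coordinates are 1-based and inclusive, so the length
    is end - start + 1.
    """
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    scaffold = match.group(1)
    start, end = int(match.group(2)), int(match.group(3))
    return scaffold, start, end, end - start + 1


print(parse_location("LG05:85015573..85016753"))
```

Under that assumption, the gene model spans 1181 bp on LG05.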


Homology
GenBank top hits (e value, % identity, alignment)
KAG6573278.1 Dof zinc finger protein, partial [Cucurbita argyrosperma subsp. sororia]; e value 6.9e-37; identity 60%
Query:  MQGATATANEEMKDT----QNKQKCPRCDSFNTKFCYYNNYSLSQPRYFCKSCRRYWTHGGTLRNVPIGGGSRKPSKRSKTLPTSSSSSSAALH-----P
        MQGAT TANEEMK T    Q+ +KCPRC+S NTKFCYYNNYSLSQPRYFCKSCRRYWT GGTLRNVPIGGG RK +KR K +P+ SSSS+AA+       
Subjt:  MQGATATANEEMKDT----QNKQKCPRCDSFNTKFCYYNNYSLSQPRYFCKSCRRYWTHGGTLRNVPIGGGSRKPSKRSKTLPTSSSSSSAALH-----P

Query:  NAPP--------VFVNS---DPGYLPSLEPQILMNQSPLDLGVYSFGFSSDLTWPHRSFGADDRN
        NAPP        VF NS     GY PS E    ++Q+ +D  VYSFG   + +W HRSF  DD N
Subjt:  NAPP--------VFVNS---DPGYLPSLEPQILMNQSPLDLGVYSFGFSSDLTWPHRSFGADDRN

KAG6583987.1 Dof zinc finger protein, partial [Cucurbita argyrosperma subsp. sororia]; e value 5.9e-36; identity 52.02%
Query:  MQGATATANEEMKDTQNKQKCPRCDSFNTKFCYYNNYSLSQPRYFCKSCRRYWTHGGTLRNVPIGGGSRKPSKRSKTLPTSSSSSSAALHPNAPPVFVNS
        MQGA A  NEE K T  +  CPRCDS NTKFCYYNNYSLSQPRYFCKSCRRYWTHGGTLRNVPIGGGSRKP KR KT+P SS S       NAPP     
Subjt:  MQGATATANEEMKDTQNKQKCPRCDSFNTKFCYYNNYSLSQPRYFCKSCRRYWTHGGTLRNVPIGGGSRKPSKRSKTLPTSSSSSSAALHPNAPPVFVNS

Query:  DPGYLPSLEPQILMNQSPLDLGVYSFGFSSDLTWPHRSFGADDRNPTWTAVNDDNATNSSAAAGNNKNSSAQSEINCCNNLVQNQWSHHHHLPAYGHP
          G                 L     G+ S L     +      N  WT +N  N+ N S +AGN +N S ++E  CC NLVQN    H HLP+Y  P
Subjt:  DPGYLPSLEPQILMNQSPLDLGVYSFGFSSDLTWPHRSFGADDRNPTWTAVNDDNATNSSAAAGNNKNSSAQSEINCCNNLVQNQWSHHHHLPAYGHP

XP_022140101.1 dof zinc finger protein DOF5.6-like [Momordica charantia]; e value 2.9e-43; identity 55.81%
Query:  ATATANEEMKDTQ-NKQKCPRCDSFNTKFCYYNNYSLSQPRYFCKSCRRYWTHGGTLRNVPIGGGSRKPSKRSKTLPTSSSSSSAALHPNAPPV----FV
        AT TANE+    +  K+KCPRC SFNTKFCYYNNYSLSQPRYFCKSCRRYWT GGTLRNVP+GGGSRK  KR K   T+++++  +L P  PP     F 
Subjt:  ATATANEEMKDTQ-NKQKCPRCDSFNTKFCYYNNYSLSQPRYFCKSCRRYWTHGGTLRNVPIGGGSRKPSKRSKTLPTSSSSSSAALHPNAPPV----FV

Query:  NSDPGYLPSLEPQILMNQSPLD-LGVYSFGFSSDLTWPHRSFGADDRNP------TWTAVNDDNAT--NSSAAAGNNKNSSAQSEINCCNNLVQNQWS--
        ++  GYL S+E   + NQS LD +GVYSFG      WPHRSFG DD NP      TWT +N   AT  +S+AA+GNN+NSSA S    CN    + WS  
Subjt:  NSDPGYLPSLEPQILMNQSPLD-LGVYSFGFSSDLTWPHRSFGADDRNP------TWTAVNDDNAT--NSSAAAGNNKNSSAQSEINCCNNLVQNQWS--

Query:  ---HHHHLPAYGHPP
           HHH  P YG PP
Subjt:  ---HHHHLPAYGHPP

XP_022954649.1 dof zinc finger protein DOF2.1-like [Cucurbita moschata]; e value 2.6e-36; identity 59.39%
Query:  MQGATATANEEMKDT----QNKQKCPRCDSFNTKFCYYNNYSLSQPRYFCKSCRRYWTHGGTLRNVPIGGGSRKPSKRSKTLPTSSSSSSAALH-----P
        MQGA AT NEEMK T    Q+ +KCPRC+S NTKFCYYNNYSLSQPRYFCKSCRRYWT GGTLRNVPIGGG RK +KR K +P+ SSSS+AA+       
Subjt:  MQGATATANEEMKDT----QNKQKCPRCDSFNTKFCYYNNYSLSQPRYFCKSCRRYWTHGGTLRNVPIGGGSRKPSKRSKTLPTSSSSSSAALH-----P

Query:  NAPP--------VFVNS---DPGYLPSLEPQILMNQSPLDLGVYSFGFSSDLTWPHRSFGADDRN
        NAPP        VF NS     GY PS E    ++Q+ +D  VYSFG   + +W HRSF  DD N
Subjt:  NAPP--------VFVNS---DPGYLPSLEPQILMNQSPLDLGVYSFGFSSDLTWPHRSFGADDRN

XP_023542171.1 dof zinc finger protein DOF2.1-like [Cucurbita pepo subsp. pepo]; e value 2.0e-36; identity 59.39%
Query:  MQGATATANEEMKDT----QNKQKCPRCDSFNTKFCYYNNYSLSQPRYFCKSCRRYWTHGGTLRNVPIGGGSRKPSKRSKTLPTSSSSSSAALH-----P
        MQGA AT NEEMK T    Q+ +KCPRC+S NTKFCYYNNYSLSQPRYFCKSCRRYWT GGTLRNVP+GGG RK +KR K +P+ SSSS+AA+       
Subjt:  MQGATATANEEMKDT----QNKQKCPRCDSFNTKFCYYNNYSLSQPRYFCKSCRRYWTHGGTLRNVPIGGGSRKPSKRSKTLPTSSSSSSAALH-----P

Query:  NAPP--------VFVNS---DPGYLPSLEPQILMNQSPLDLGVYSFGFSSDLTWPHRSFGADDRN
        NAPP        VF NS     GY PS E    ++QS +D  VYSFG   + +W HRSF  DD N
Subjt:  NAPP--------VFVNS---DPGYLPSLEPQILMNQSPLDLGVYSFGFSSDLTWPHRSFGADDRN

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LXF4 Dof-type domain-containing protein; e value 2.7e-31; identity 48.76%
Query:  MQGATATANEEMKDTQNKQKCPRCDSFNTKFCYYNNYSLSQPRYFCKSCRRYWTHGGTLRNVPIGGGSRKPSKRSKTLPTSSSSSSAALHPNAPPVFVNS
        MQGA      +      KQKCPRCDS NTKFCYYNNYSLSQPRYFCKSCRRYWTHGGTLRNVPIGGGSRK  +   TL  SSSSSS       PP     
Subjt:  MQGATATANEEMKDTQNKQKCPRCDSFNTKFCYYNNYSLSQPRYFCKSCRRYWTHGGTLRNVPIGGGSRKPSKRSKTLPTSSSSSSAALHPNAPPVFVNS

Query:  DPGYLPSLEPQILMNQSPLDLGVYSFGFSS------DLTWPHRSFGADDRNPTWTAVNDDNAT----NSSAAAGNNKNSSAQSEINCCNNLVQNQWSHHH
         PG              PLDL   S+  S+      D+   + SFG       WT     N T    N S+++GNN+NSS Q E N  + L+ N  S  H
Subjt:  DPGYLPSLEPQILMNQSPLDLGVYSFGFSS------DLTWPHRSFGADDRNPTWTAVNDDNAT----NSSAAAGNNKNSSAQSEINCCNNLVQNQWSHHH

Query:  H
        H
Subjt:  H

A0A6J1CET6 dof zinc finger protein DOF5.6-like; e value 1.4e-43; identity 55.81%
Query:  ATATANEEMKDTQ-NKQKCPRCDSFNTKFCYYNNYSLSQPRYFCKSCRRYWTHGGTLRNVPIGGGSRKPSKRSKTLPTSSSSSSAALHPNAPPV----FV
        AT TANE+    +  K+KCPRC SFNTKFCYYNNYSLSQPRYFCKSCRRYWT GGTLRNVP+GGGSRK  KR K   T+++++  +L P  PP     F 
Subjt:  ATATANEEMKDTQ-NKQKCPRCDSFNTKFCYYNNYSLSQPRYFCKSCRRYWTHGGTLRNVPIGGGSRKPSKRSKTLPTSSSSSSAALHPNAPPV----FV

Query:  NSDPGYLPSLEPQILMNQSPLD-LGVYSFGFSSDLTWPHRSFGADDRNP------TWTAVNDDNAT--NSSAAAGNNKNSSAQSEINCCNNLVQNQWS--
        ++  GYL S+E   + NQS LD +GVYSFG      WPHRSFG DD NP      TWT +N   AT  +S+AA+GNN+NSSA S    CN    + WS  
Subjt:  NSDPGYLPSLEPQILMNQSPLD-LGVYSFGFSSDLTWPHRSFGADDRNP------TWTAVNDDNAT--NSSAAAGNNKNSSAQSEINCCNNLVQNQWS--

Query:  ---HHHHLPAYGHPP
           HHH  P YG PP
Subjt:  ---HHHHLPAYGHPP

A0A6J1EG92 dof zinc finger protein DOF5.3-like; e value 4.8e-36; identity 49.51%
Query:  MQGATATANEEMKDTQNKQKCPRCDSFNTKFCYYNNYSLSQPRYFCKSCRRYWTHGGTLRNVPIGGGSRKPSKRSKTLPTSSSSSSAALHPNAPPVFV--
        MQGA A  NEE K T  +  CPRCDS NTKFCYYNNYSLSQPRYFCKSCRRYWTHGGTLRNVPIGGGSRK  KR K +P SS S       NAPP     
Subjt:  MQGATATANEEMKDTQNKQKCPRCDSFNTKFCYYNNYSLSQPRYFCKSCRRYWTHGGTLRNVPIGGGSRKPSKRSKTLPTSSSSSSAALHPNAPPVFV--

Query:  ------NSDPGYLPSLEPQILMNQSPLDLGVYSFGFSSDLTWPHRSFGADDRNPTWTAVNDDNATNSSAAAGNNKNSSAQSEINCCNNLVQNQWSHHHHL
               S  GY  +L    +     +D+G Y                    N  WT +N       S +AGN +N S ++E  CC NLVQN   HH HL
Subjt:  ------NSDPGYLPSLEPQILMNQSPLDLGVYSFGFSSDLTWPHRSFGADDRNPTWTAVNDDNATNSSAAAGNNKNSSAQSEINCCNNLVQNQWSHHHHL

Query:  PAYGHP
        P Y  P
Subjt:  PAYGHP

A0A6J1GTK8 dof zinc finger protein DOF2.1-like; e value 1.3e-36; identity 59.39%
Query:  MQGATATANEEMKDT----QNKQKCPRCDSFNTKFCYYNNYSLSQPRYFCKSCRRYWTHGGTLRNVPIGGGSRKPSKRSKTLPTSSSSSSAALH-----P
        MQGA AT NEEMK T    Q+ +KCPRC+S NTKFCYYNNYSLSQPRYFCKSCRRYWT GGTLRNVPIGGG RK +KR K +P+ SSSS+AA+       
Subjt:  MQGATATANEEMKDT----QNKQKCPRCDSFNTKFCYYNNYSLSQPRYFCKSCRRYWTHGGTLRNVPIGGGSRKPSKRSKTLPTSSSSSSAALH-----P

Query:  NAPP--------VFVNS---DPGYLPSLEPQILMNQSPLDLGVYSFGFSSDLTWPHRSFGADDRN
        NAPP        VF NS     GY PS E    ++Q+ +D  VYSFG   + +W HRSF  DD N
Subjt:  NAPP--------VFVNS---DPGYLPSLEPQILMNQSPLDLGVYSFGFSSDLTWPHRSFGADDRN

A0A6J1KNK3 dof zinc finger protein DOF5.3-like; e value 2.8e-36; identity 55.62%
Query:  MQGATATANEEMKDTQNKQKCPRCDSFNTKFCYYNNYSLSQPRYFCKSCRRYWTHGGTLRNVPIGGGSRKPSKRSKTLPTSSSS-------SSAALHPNA
        MQGA A +NEE K T  +  CPRCDS NTKFC+YNNYSLSQPRYFCKSCRRYWTHGGTLRNVPIGGGSRKP KR KT+P SS S        S ALH   
Subjt:  MQGATATANEEMKDTQNKQKCPRCDSFNTKFCYYNNYSLSQPRYFCKSCRRYWTHGGTLRNVPIGGGSRKPSKRSKTLPTSSSS-------SSAALHPNA

Query:  PPVFVNSDPGYLPSLEPQILMNQSPLDLGVYSFGFSSDLTWPHRSFGADDRNPTWTAVNDDNATNSSAA
        P  +  S  GY  +L    +     +D+G Y+FG S       +SFG  D N  WT +N  N+ NS +A
Subjt:  PPVFVNSDPGYLPSLEPQILMNQSPLDLGVYSFGFSSDLTWPHRSFGADDRNPTWTAVNDDNATNSSAA

SwissProt top hits (e value, % identity, alignment)
O24463 Dof zinc finger protein PBF; e value 6.3e-25; identity 69.88%
Query:  KCPRCDSFNTKFCYYNNYSLSQPRYFCKSCRRYWTHGGTLRNVPIGGGSRKPSKRSKTLPTS--SSSSSAALHPNAPPVFVNS
        KCPRCDS NTKFCYYNNYS+SQPRYFCK+CRRYWTHGGTLRNVPIGGG RK    S+ +  S  SSSSSA   P +P    +S
Subjt:  KCPRCDSFNTKFCYYNNYSLSQPRYFCKSCRRYWTHGGTLRNVPIGGGSRKPSKRSKTLPTS--SSSSSAALHPNAPPVFVNS

O82155 Dof zinc finger protein DOF1.7; e value 6.3e-25; identity 61.9%
Query:  KAMQGATATANEEMKDTQNKQKCPRCDSFNTKFCYYNNYSLSQPRYFCKSCRRYWTHGGTLRNVPIGGGSRKPSKRSKTLPTSS
        ++M   TA  N+     Q + KCPRCDS NTKFCYYNNY+LSQPR+FCK+CRRYWT GG LRN+P+GGG+RK +KRS + P+S+
Subjt:  KAMQGATATANEEMKDTQNKQKCPRCDSFNTKFCYYNNYSLSQPRYFCKSCRRYWTHGGTLRNVPIGGGSRKPSKRSKTLPTSS

Q39088 Dof zinc finger protein DOF3.4; e value 1.1e-26; identity 60%
Query:  MKDTQNKQKCPRCDSFNTKFCYYNNYSLSQPRYFCKSCRRYWTHGGTLRNVPIGGGSRKPSKRSKTLPTSSSSSSAALHPNAPPVFVNSDPGYLP
        + D Q +  CPRCDS NTKFCYYNNY+ SQPR+FCK+CRRYWTHGGTLR+VP+GGG+RK +KRS+T   SSSSS + +  N+  V + + P   P
Subjt:  MKDTQNKQKCPRCDSFNTKFCYYNNYSLSQPRYFCKSCRRYWTHGGTLRNVPIGGGSRKPSKRSKTLPTSSSSSSAALHPNAPPVFVNSDPGYLP

Q94AR6 Dof zinc finger protein DOF3.1; e value 3.1e-24; identity 53.7%
Query:  QNKQKCPRCDSFNTKFCYYNNYSLSQPRYFCKSCRRYWTHGGTLRNVPIGGGSRKPSKRSKTLPTSSSSSSAALHPNAPPVFVNSDP----GYLPSLEPQ
        Q + KCPRCDS NTKFCYYNNY+LSQPR+FCKSCRRYWT GG LRNVP+GGGSRK + +  T  +SS+SS +    N      + DP       P L+P 
Subjt:  QNKQKCPRCDSFNTKFCYYNNYSLSQPRYFCKSCRRYWTHGGTLRNVPIGGGSRKPSKRSKTLPTSSSSSSAALHPNAPPVFVNSDP----GYLPSLEPQ

Query:  ILMNQSPL
         ++   P+
Subjt:  ILMNQSPL

Q9ZPY0 Dof zinc finger protein DOF2.5; e value 1.1e-24; identity 51.85%
Query:  TATANEEMKDTQNKQKCPRCDSFNTKFCYYNNYSLSQPRYFCKSCRRYWTHGGTLRNVPIGGGSRKPSKRSKTLPTSSSSSSAALHPNAPPVFVNSDPGY
        TA   E     Q K  CPRC+S NTKFCYYNNYSL+QPRYFCK CRRYWT GG+LRNVP+GG SRK +KRS     SSSSSS  L          + P  
Subjt:  TATANEEMKDTQNKQKCPRCDSFNTKFCYYNNYSLSQPRYFCKSCRRYWTHGGTLRNVPIGGGSRKPSKRSKTLPTSSSSSSAALHPNAPPVFVNSDPGY

Query:  LPSLEPQILMNQ--------SPLDLGVYSFGFSSD
        LP L P IL +         S  DL + SF    D
Subjt:  LPSLEPQILMNQ--------SPLDLGVYSFGFSSD

Arabidopsis top hits (e value, % identity, alignment)
AT1G51700.1 DOF zinc finger protein 1; e value 4.5e-26; identity 61.9%
Query:  KAMQGATATANEEMKDTQNKQKCPRCDSFNTKFCYYNNYSLSQPRYFCKSCRRYWTHGGTLRNVPIGGGSRKPSKRSKTLPTSS
        ++M   TA  N+     Q + KCPRCDS NTKFCYYNNY+LSQPR+FCK+CRRYWT GG LRN+P+GGG+RK +KRS + P+S+
Subjt:  KAMQGATATANEEMKDTQNKQKCPRCDSFNTKFCYYNNYSLSQPRYFCKSCRRYWTHGGTLRNVPIGGGSRKPSKRSKTLPTSS

AT2G46590.1 Dof-type zinc finger DNA-binding family protein; e value 7.6e-26; identity 51.85%
Query:  TATANEEMKDTQNKQKCPRCDSFNTKFCYYNNYSLSQPRYFCKSCRRYWTHGGTLRNVPIGGGSRKPSKRSKTLPTSSSSSSAALHPNAPPVFVNSDPGY
        TA   E     Q K  CPRC+S NTKFCYYNNYSL+QPRYFCK CRRYWT GG+LRNVP+GG SRK +KRS     SSSSSS  L          + P  
Subjt:  TATANEEMKDTQNKQKCPRCDSFNTKFCYYNNYSLSQPRYFCKSCRRYWTHGGTLRNVPIGGGSRKPSKRSKTLPTSSSSSSAALHPNAPPVFVNSDPGY

Query:  LPSLEPQILMNQ--------SPLDLGVYSFGFSSD
        LP L P IL +         S  DL + SF    D
Subjt:  LPSLEPQILMNQ--------SPLDLGVYSFGFSSD

AT2G46590.2 Dof-type zinc finger DNA-binding family protein; e value 7.6e-26; identity 51.85%
Query:  TATANEEMKDTQNKQKCPRCDSFNTKFCYYNNYSLSQPRYFCKSCRRYWTHGGTLRNVPIGGGSRKPSKRSKTLPTSSSSSSAALHPNAPPVFVNSDPGY
        TA   E     Q K  CPRC+S NTKFCYYNNYSL+QPRYFCK CRRYWT GG+LRNVP+GG SRK +KRS     SSSSSS  L          + P  
Subjt:  TATANEEMKDTQNKQKCPRCDSFNTKFCYYNNYSLSQPRYFCKSCRRYWTHGGTLRNVPIGGGSRKPSKRSKTLPTSSSSSSAALHPNAPPVFVNSDPGY

Query:  LPSLEPQILMNQ--------SPLDLGVYSFGFSSD
        LP L P IL +         S  DL + SF    D
Subjt:  LPSLEPQILMNQ--------SPLDLGVYSFGFSSD

AT3G21270.1 DOF zinc finger protein 2; e value 2.2e-25; identity 53.7%
Query:  QNKQKCPRCDSFNTKFCYYNNYSLSQPRYFCKSCRRYWTHGGTLRNVPIGGGSRKPSKRSKTLPTSSSSSSAALHPNAPPVFVNSDP----GYLPSLEPQ
        Q + KCPRCDS NTKFCYYNNY+LSQPR+FCKSCRRYWT GG LRNVP+GGGSRK + +  T  +SS+SS +    N      + DP       P L+P 
Subjt:  QNKQKCPRCDSFNTKFCYYNNYSLSQPRYFCKSCRRYWTHGGTLRNVPIGGGSRKPSKRSKTLPTSSSSSSAALHPNAPPVFVNSDP----GYLPSLEPQ

Query:  ILMNQSPL
         ++   P+
Subjt:  ILMNQSPL

AT3G50410.1 OBF binding protein 1; e value 8.2e-28; identity 60%
Query:  MKDTQNKQKCPRCDSFNTKFCYYNNYSLSQPRYFCKSCRRYWTHGGTLRNVPIGGGSRKPSKRSKTLPTSSSSSSAALHPNAPPVFVNSDPGYLP
        + D Q +  CPRCDS NTKFCYYNNY+ SQPR+FCK+CRRYWTHGGTLR+VP+GGG+RK +KRS+T   SSSSS + +  N+  V + + P   P
Subjt:  MKDTQNKQKCPRCDSFNTKFCYYNNYSLSQPRYFCKSCRRYWTHGGTLRNVPIGGGSRKPSKRSKTLPTSSSSSSAALHPNAPPVFVNSDPGYLP


Sequences
CDS sequence
ATGCGCCCCAAAGTCAATACCCTTCCCTTCAAATCCACGCAACACAAATACCATACCATACCATACCATACCACAGTACAAATTATATATTATATATATATATATCATAA
AACGAAAAATCAAGAGCTAGGTAAAGCAATGCAAGGGGCAACTGCAACAGCAAATGAAGAAATGAAAGACACACAAAACAAACAAAAATGCCCTCGCTGTGACTCTTTCA
ATACCAAGTTTTGTTATTACAATAACTACAGCCTTTCTCAGCCTCGTTACTTTTGCAAGTCTTGTCGAAGGTACTGGACTCATGGTGGCACTCTCCGTAACGTCCCCATC
GGCGGCGGCTCTCGAAAACCATCCAAGCGCTCTAAAACACTGCCCACTTCTTCTTCTTCTTCCTCCGCCGCCCTTCACCCTAATGCCCCGCCAGTTTTTGTCAATTCGGA
TCCTGGGTATTTGCCGTCGTTGGAACCGCAGATTTTGATGAATCAATCGCCTCTGGATCTGGGGGTTTATTCGTTTGGATTCTCGTCGGACCTGACTTGGCCTCATCGGA
GCTTTGGTGCTGATGATCGTAATCCCACGTGGACTGCCGTCAACGACGACAACGCAACTAATTCTTCCGCTGCAGCAGGTAATAATAAAAATTCAAGCGCTCAAAGTGAG
ATTAATTGTTGTAATAATTTGGTCCAAAATCAGTGGTCTCATCATCATCATCTGCCTGCCTATGGTCATCCTCCTTGA
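
A CDS such as the one above can be translated in silico to cross-check it against the listed protein sequence. The sketch below assumes the standard nuclear genetic code and uses a hypothetical helper named translate. It is an illustration only, not the pipeline used to build this record, and it raises KeyError on ambiguous bases such as N.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First seven codons of the CDS listed above.
print(translate("ATGCGCCCCAAAGTCAATACC"))
```

Passing the full CDS, with its line breaks, should give a string that can be compared directly with the protein sequence of this record.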
mRNA sequence
GAACGATCGAAAAGGAGCACCACAAAATCAACAACAAATGTCTCTCCAAATTTCGTCAGTATATATAAATGTTGAGGGCACGTCGATGCCTTTAATCCACCAAAAAAAAA
AACGCTTCCCCTTTGCATGCGCCCCAAAGTCAATACCCTTCCCTTCAAATCCACGCAACACAAATACCATACCATACCATACCATACCACAGTACAAATTATATATTATA
TATATATATATCATAAAACGAAAAATCAAGAGCTAGGTAAAGCAATGCAAGGGGCAACTGCAACAGCAAATGAAGAAATGAAAGACACACAAAACAAACAAAAATGCCCT
CGCTGTGACTCTTTCAATACCAAGTTTTGTTATTACAATAACTACAGCCTTTCTCAGCCTCGTTACTTTTGCAAGTCTTGTCGAAGGTACTGGACTCATGGTGGCACTCT
CCGTAACGTCCCCATCGGCGGCGGCTCTCGAAAACCATCCAAGCGCTCTAAAACACTGCCCACTTCTTCTTCTTCTTCCTCCGCCGCCCTTCACCCTAATGCCCCGCCAG
TTTTTGTCAATTCGGATCCTGGGTATTTGCCGTCGTTGGAACCGCAGATTTTGATGAATCAATCGCCTCTGGATCTGGGGGTTTATTCGTTTGGATTCTCGTCGGACCTG
ACTTGGCCTCATCGGAGCTTTGGTGCTGATGATCGTAATCCCACGTGGACTGCCGTCAACGACGACAACGCAACTAATTCTTCCGCTGCAGCAGGTAATAATAAAAATTC
AAGCGCTCAAAGTGAGATTAATTGTTGTAATAATTTGGTCCAAAATCAGTGGTCTCATCATCATCATCTGCCTGCCTATGGTCATCCTCCTTGATTTCATCATCTCCCAT
TTGTTTCAAATATTATACCGTCATTTTAAAGACATTTTTATCTATGACAAAATAAAAAGAAAAAAAGAAAAACTATAGATTATCATTGGATAATACTTGCAAGCACTTTT
TGATATTTAGTTTTCTAGTACAATAAATAATGGGTGGTGCATGAAGATTTGAATCTCCAATCTCAGTTACTCCTCAATTAAGTTCATGTTAGACCATTTTTTTATATATT
TATTAGTTTAAAAATTTGTGGTCATTCACTTATTTATGAAATGTAATAGTAATAAACTAAAGAAAAATAATTTTGAGTGCA
Protein sequence
MRPKVNTLPFKSTQHKYHTIPYHTTVQIIYYIYIYHKTKNQELGKAMQGATATANEEMKDTQNKQKCPRCDSFNTKFCYYNNYSLSQPRYFCKSCRRYWTHGGTLRNVPI
GGGSRKPSKRSKTLPTSSSSSSAALHPNAPPVFVNSDPGYLPSLEPQILMNQSPLDLGVYSFGFSSDLTWPHRSFGADDRNPTWTAVNDDNATNSSAAAGNNKNSSAQSE
INCCNNLVQNQWSHHHHLPAYGHPP
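
The InterPro annotation (IPR003851, Zinc finger, Dof-type) suggests a single C2-C2 zinc finger in this protein. Dof domains are commonly described by the cysteine spacing CX2CX21CX2C. Assuming that pattern, a hypothetical helper such as find_dof_fingers could locate the candidate finger in the protein sequence above. This is a sketch for illustration, not the method InterPro uses to assign the domain.

```python
import re

# Assumed Dof-type cysteine spacing: C-X2-C-X21-C-X2-C.
DOF_FINGER = re.compile(r"C.{2}C.{21}C.{2}C")


def find_dof_fingers(protein):
    """Return (0-based start, matched segment) for each candidate finger."""
    seq = "".join(protein.split()).upper()  # tolerate wrapped sequences
    return [(m.start(), m.group(0)) for m in DOF_FINGER.finditer(seq)]


# Fragment of the protein sequence listed above.
fragment = "QKCPRCDSFNTKFCYYNNYSLSQPRYFCKSCRRYW"
print(find_dof_fingers(fragment))
```

The same call on the full protein sequence reports where the four cysteines sit relative to the N-terminus.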