CuGenDBv2

Tan0000952 (gene) of Snake gourd v1 genome

Gene ID: Tan0000952
Organism: Trichosanthes anguina (Snake gourd v1)
Description: PR domain zinc finger protein 8, putative isoform 1
Genome location: LG03:62180915..62183652
RNA-Seq Expression: Tan0000952
Synteny: Tan0000952
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: NA
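The genome location above can be split into a linkage group and a coordinate pair. The following is a minimal sketch that assumes the coordinates are 1-based and inclusive, as is common for genome browsers; the page itself does not state the convention. The function names `parse_location` and `span_length` are illustrative only.

```python
import re

def parse_location(loc: str) -> tuple[str, int, int]:
    """Split a 'SEQID:START..END' string into its three parts."""
    m = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", loc.strip())
    if m is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    return m.group(1), int(m.group(2)), int(m.group(3))

def span_length(start: int, end: int) -> int:
    """Length of the interval, assuming 1-based inclusive coordinates."""
    return end - start + 1

seqid, start, end = parse_location("LG03:62180915..62183652")
print(seqid, start, end, span_length(start, end))
```

Under that assumption the gene spans 2738 bp on LG03.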


Homology
GenBank top hits
KAA0055894.1 PR domain zinc finger protein 8, putative isoform 1 [Cucumis melo var. makuwa] (e-value: 6.9e-98; identity: 84.52%)
Query:  MDFARCSDESKESENSSSSKCCGKVEAQPEIVELTRAPLGPSTPDADRESCDFPSDSKSPLTQVVASRALQLTCVDSLGGERIIEAPLEGLDSFDPLCSP
        MDFARC+DESKESEN  S+KCC KVE Q EIVEL R P+ PSTPDADRES DFPSDSKSP+TQVV S+ L LTC+DSL GER IEAPLE  DSFDPLCSP
Subjt:  MDFARCSDESKESENSSSSKCCGKVEAQPEIVELTRAPLGPSTPDADRESCDFPSDSKSPLTQVVASRALQLTCVDSLGGERIIEAPLEGLDSFDPLCSP

Query:  RTPKDGVFDPFSPGPSHLALAPLSRKYFSGSVGFVARRLQFGSSSSSSSLQLVEAEEEQSISDSELLEAVYENLLEVIVSHQAETSLAQLSSSQSDSPDC
        RTPKDGVFDPFSP P+HLALAP+SRKYFSGSVGFV RRLQFG  SSSSSLQLVEAEEEQSISDSELLEAVYENLLEVIVSHQAE+SLAQ SSSQSDSPDC
Subjt:  RTPKDGVFDPFSPGPSHLALAPLSRKYFSGSVGFVARRLQFGSSSSSSSLQLVEAEEEQSISDSELLEAVYENLLEVIVSHQAETSLAQLSSSQSDSPDC

Query:  ITPPASFMSGVAQTCPAAPVKPSGKFRNFDLGLCRKLEF
         TPP SF+SGVAQTCPAAPVKPS K RN D+GLCRKLEF
Subjt:  ITPPASFMSGVAQTCPAAPVKPSGKFRNFDLGLCRKLEF

XP_004142723.1 cyclin-dependent protein kinase inhibitor SMR11 isoform X1 [Cucumis sativus] (e-value: 2.8e-99; identity: 85.36%)
Query:  MDFARCSDESKESENSSSSKCCGKVEAQPEIVELTRAPLGPSTPDADRESCDFPSDSKSPLTQVVASRALQLTCVDSLGGERIIEAPLEGLDSFDPLCSP
        MDFARC+DESKESEN SS+KCC KVE Q EIVEL R P+ PSTPDADRES DFPSDSKSP+TQVV S+ L LTC+DSL GER  EAPLE  DSFDPLCSP
Subjt:  MDFARCSDESKESENSSSSKCCGKVEAQPEIVELTRAPLGPSTPDADRESCDFPSDSKSPLTQVVASRALQLTCVDSLGGERIIEAPLEGLDSFDPLCSP

Query:  RTPKDGVFDPFSPGPSHLALAPLSRKYFSGSVGFVARRLQFGSSSSSSSLQLVEAEEEQSISDSELLEAVYENLLEVIVSHQAETSLAQLSSSQSDSPDC
        RTPKDGVFDPFSP P+HLALAP+SRKYFSGSVGFV RRLQFG  SSSSSLQLVEAEEEQSISDSELLEAVYENLLEVIVSHQAE+SLAQLSSSQSDSPDC
Subjt:  RTPKDGVFDPFSPGPSHLALAPLSRKYFSGSVGFVARRLQFGSSSSSSSLQLVEAEEEQSISDSELLEAVYENLLEVIVSHQAETSLAQLSSSQSDSPDC

Query:  ITPPASFMSGVAQTCPAAPVKPSGKFRNFDLGLCRKLEF
         TPP SFMSGVAQTCPAAPVKPS K RN D+GLCRKLEF
Subjt:  ITPPASFMSGVAQTCPAAPVKPSGKFRNFDLGLCRKLEF

XP_008463308.1 PREDICTED: uncharacterized protein LOC103501496 [Cucumis melo] (e-value: 6.9e-98; identity: 84.52%)
Query:  MDFARCSDESKESENSSSSKCCGKVEAQPEIVELTRAPLGPSTPDADRESCDFPSDSKSPLTQVVASRALQLTCVDSLGGERIIEAPLEGLDSFDPLCSP
        MDFARC+DESKESEN  S+KCC KVE Q EIVEL R P+ PSTPDADRES DFPSDSKSP+TQVV S+ L LTC+DSL GER IEAPLE  DSFDPLCSP
Subjt:  MDFARCSDESKESENSSSSKCCGKVEAQPEIVELTRAPLGPSTPDADRESCDFPSDSKSPLTQVVASRALQLTCVDSLGGERIIEAPLEGLDSFDPLCSP

Query:  RTPKDGVFDPFSPGPSHLALAPLSRKYFSGSVGFVARRLQFGSSSSSSSLQLVEAEEEQSISDSELLEAVYENLLEVIVSHQAETSLAQLSSSQSDSPDC
        RTPKDGVFDPFSP P+HLALAP+SRKYFSGSVGFV RRLQFG  SSSSSLQLVEAEEEQSISDSELLEAVYENLLEVIVSHQAE+SLAQ SSSQSDSPDC
Subjt:  RTPKDGVFDPFSPGPSHLALAPLSRKYFSGSVGFVARRLQFGSSSSSSSLQLVEAEEEQSISDSELLEAVYENLLEVIVSHQAETSLAQLSSSQSDSPDC

Query:  ITPPASFMSGVAQTCPAAPVKPSGKFRNFDLGLCRKLEF
         TPP SF+SGVAQTCPAAPVKPS K RN D+GLCRKLEF
Subjt:  ITPPASFMSGVAQTCPAAPVKPSGKFRNFDLGLCRKLEF

XP_022156261.1 uncharacterized protein LOC111023190 [Momordica charantia] (e-value: 1.4e-98; identity: 82.52%)
Query:  MDFARCSDESKESENSSSSKCCGKVEAQPEIVEL------TRAPLGPSTPDADRESCDFPSDSKSPLTQVVASRALQLTCVDSLGGERIIEAPL-EGLDS
        MDFARCSDESKESE+  SSKCC KVEAQPE+VEL        APLGP+TPDADRES DFPSDSKSP TQVV +R+LQLTC+DSL GER  EAPL EGL+S
Subjt:  MDFARCSDESKESENSSSSKCCGKVEAQPEIVEL------TRAPLGPSTPDADRESCDFPSDSKSPLTQVVASRALQLTCVDSLGGERIIEAPL-EGLDS

Query:  FDPLCSPRTPKDGVFDPFSPGPSHLALAPLSRKYFSGSVGFVARRLQFGSSSSSSSLQLVEAEEEQSISDSELLEAVYENLLEVIVSHQAETSLAQLSSS
         DPLCSPRTPKDGVFDPFSPGP+HLALAPLSRKYFSGSVGFVARRLQFGSSSSSSS+QL E  EEQSISDSELLEAVYENLLEVIVS+QAE SLAQLSSS
Subjt:  FDPLCSPRTPKDGVFDPFSPGPSHLALAPLSRKYFSGSVGFVARRLQFGSSSSSSSLQLVEAEEEQSISDSELLEAVYENLLEVIVSHQAETSLAQLSSS

Query:  QSDSPDCITPPASFMSGVAQTCPAAPVKPSGKFRNFDLGLCRKLEF
        QSDS +CITPPA ++SG+AQTCP AP+KP GKFR+FDLGLCRKLEF
Subjt:  QSDSPDCITPPASFMSGVAQTCPAAPVKPSGKFRNFDLGLCRKLEF

XP_038883894.1 unknown protein 1 [Benincasa hispida] (e-value: 2.1e-99; identity: 84.1%)
Query:  MDFARCSDESKESENSSSSKCCGKVEAQPEIVELTRAPLGPSTPDADRESCDFPSDSKSPLTQVVASRALQLTCVDSLGGERIIEAPLEGLDSFDPLCSP
        MDFARC+DESKESEN  S+KCC KVEAQPEIVEL RAPLGPSTPDADRES DF SDSKSPLTQVV S+ L LTC+DSL  +  IEAPLE  DSFDPLCSP
Subjt:  MDFARCSDESKESENSSSSKCCGKVEAQPEIVELTRAPLGPSTPDADRESCDFPSDSKSPLTQVVASRALQLTCVDSLGGERIIEAPLEGLDSFDPLCSP

Query:  RTPKDGVFDPFSPGPSHLALAPLSRKYFSGSVGFVARRLQFGSSSSSSSLQLVEAEEEQSISDSELLEAVYENLLEVIVSHQAETSLAQLSSSQSDSPDC
        RTPKDGVFDPFSPGP+HLALAP+SRK F+GSVGFVARRLQFGSSSSSS LQ+VEAEEEQSISD+ELLEAVYENLLEVIVSHQAE+SL QLSSSQSDS DC
Subjt:  RTPKDGVFDPFSPGPSHLALAPLSRKYFSGSVGFVARRLQFGSSSSSSSLQLVEAEEEQSISDSELLEAVYENLLEVIVSHQAETSLAQLSSSQSDSPDC

Query:  ITPPASFMSGVAQTCPAAPVKPSGKFRNFDLGLCRKLEF
         TPP SF+SGVAQTCPAAPVKP  K RN D+GLCRKLEF
Subjt:  ITPPASFMSGVAQTCPAAPVKPSGKFRNFDLGLCRKLEF
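For readers unfamiliar with the alignment layout, the unlabelled middle line of each block marks identical residues with the residue letter, conservative substitutions with `+`, and mismatches or gaps with a space. The sketch below shows how identity and positive counts could be derived from such a midline. It uses only the last segment of the first GenBank hit, and it assumes the midline has been padded to the same width as the Query line, since leading whitespace is easily lost when the page is copied. `midline_stats` is a hypothetical helper, not part of any BLAST tool.

```python
def midline_stats(midline: str) -> tuple[int, int, int]:
    """Return (identities, positives, aligned_length) for a BLAST-style midline."""
    identities = sum(ch.isalpha() for ch in midline)   # letters mark identical residues
    positives = identities + midline.count("+")        # '+' marks conservative substitutions
    return identities, positives, len(midline)

# Midline of the final 39-column segment; the leading space is a mismatch column.
midline = " TPP SF+SGVAQTCPAAPVKPS K RN D+GLCRKLEF"
ident, pos, length = midline_stats(midline)
print(f"{ident}/{length} identical ({100 * ident / length:.2f}%), {pos}/{length} positive")
```

The percentage reported for a hit covers the whole alignment, so the midlines of all segments would need to be concatenated before comparing against the values listed for each hit.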

TrEMBL top hits
A0A0A0KYB8 Uncharacterized protein (e-value: 1.3e-99; identity: 85.36%)
Query:  MDFARCSDESKESENSSSSKCCGKVEAQPEIVELTRAPLGPSTPDADRESCDFPSDSKSPLTQVVASRALQLTCVDSLGGERIIEAPLEGLDSFDPLCSP
        MDFARC+DESKESEN SS+KCC KVE Q EIVEL R P+ PSTPDADRES DFPSDSKSP+TQVV S+ L LTC+DSL GER  EAPLE  DSFDPLCSP
Subjt:  MDFARCSDESKESENSSSSKCCGKVEAQPEIVELTRAPLGPSTPDADRESCDFPSDSKSPLTQVVASRALQLTCVDSLGGERIIEAPLEGLDSFDPLCSP

Query:  RTPKDGVFDPFSPGPSHLALAPLSRKYFSGSVGFVARRLQFGSSSSSSSLQLVEAEEEQSISDSELLEAVYENLLEVIVSHQAETSLAQLSSSQSDSPDC
        RTPKDGVFDPFSP P+HLALAP+SRKYFSGSVGFV RRLQFG  SSSSSLQLVEAEEEQSISDSELLEAVYENLLEVIVSHQAE+SLAQLSSSQSDSPDC
Subjt:  RTPKDGVFDPFSPGPSHLALAPLSRKYFSGSVGFVARRLQFGSSSSSSSLQLVEAEEEQSISDSELLEAVYENLLEVIVSHQAETSLAQLSSSQSDSPDC

Query:  ITPPASFMSGVAQTCPAAPVKPSGKFRNFDLGLCRKLEF
         TPP SFMSGVAQTCPAAPVKPS K RN D+GLCRKLEF
Subjt:  ITPPASFMSGVAQTCPAAPVKPSGKFRNFDLGLCRKLEF

A0A1S3CIW8 uncharacterized protein LOC103501496 (e-value: 3.3e-98; identity: 84.52%)
Query:  MDFARCSDESKESENSSSSKCCGKVEAQPEIVELTRAPLGPSTPDADRESCDFPSDSKSPLTQVVASRALQLTCVDSLGGERIIEAPLEGLDSFDPLCSP
        MDFARC+DESKESEN  S+KCC KVE Q EIVEL R P+ PSTPDADRES DFPSDSKSP+TQVV S+ L LTC+DSL GER IEAPLE  DSFDPLCSP
Subjt:  MDFARCSDESKESENSSSSKCCGKVEAQPEIVELTRAPLGPSTPDADRESCDFPSDSKSPLTQVVASRALQLTCVDSLGGERIIEAPLEGLDSFDPLCSP

Query:  RTPKDGVFDPFSPGPSHLALAPLSRKYFSGSVGFVARRLQFGSSSSSSSLQLVEAEEEQSISDSELLEAVYENLLEVIVSHQAETSLAQLSSSQSDSPDC
        RTPKDGVFDPFSP P+HLALAP+SRKYFSGSVGFV RRLQFG  SSSSSLQLVEAEEEQSISDSELLEAVYENLLEVIVSHQAE+SLAQ SSSQSDSPDC
Subjt:  RTPKDGVFDPFSPGPSHLALAPLSRKYFSGSVGFVARRLQFGSSSSSSSLQLVEAEEEQSISDSELLEAVYENLLEVIVSHQAETSLAQLSSSQSDSPDC

Query:  ITPPASFMSGVAQTCPAAPVKPSGKFRNFDLGLCRKLEF
         TPP SF+SGVAQTCPAAPVKPS K RN D+GLCRKLEF
Subjt:  ITPPASFMSGVAQTCPAAPVKPSGKFRNFDLGLCRKLEF

A0A5A7UL34 PR domain zinc finger protein 8, putative isoform 1 (e-value: 3.3e-98; identity: 84.52%)
Query:  MDFARCSDESKESENSSSSKCCGKVEAQPEIVELTRAPLGPSTPDADRESCDFPSDSKSPLTQVVASRALQLTCVDSLGGERIIEAPLEGLDSFDPLCSP
        MDFARC+DESKESEN  S+KCC KVE Q EIVEL R P+ PSTPDADRES DFPSDSKSP+TQVV S+ L LTC+DSL GER IEAPLE  DSFDPLCSP
Subjt:  MDFARCSDESKESENSSSSKCCGKVEAQPEIVELTRAPLGPSTPDADRESCDFPSDSKSPLTQVVASRALQLTCVDSLGGERIIEAPLEGLDSFDPLCSP

Query:  RTPKDGVFDPFSPGPSHLALAPLSRKYFSGSVGFVARRLQFGSSSSSSSLQLVEAEEEQSISDSELLEAVYENLLEVIVSHQAETSLAQLSSSQSDSPDC
        RTPKDGVFDPFSP P+HLALAP+SRKYFSGSVGFV RRLQFG  SSSSSLQLVEAEEEQSISDSELLEAVYENLLEVIVSHQAE+SLAQ SSSQSDSPDC
Subjt:  RTPKDGVFDPFSPGPSHLALAPLSRKYFSGSVGFVARRLQFGSSSSSSSLQLVEAEEEQSISDSELLEAVYENLLEVIVSHQAETSLAQLSSSQSDSPDC

Query:  ITPPASFMSGVAQTCPAAPVKPSGKFRNFDLGLCRKLEF
         TPP SF+SGVAQTCPAAPVKPS K RN D+GLCRKLEF
Subjt:  ITPPASFMSGVAQTCPAAPVKPSGKFRNFDLGLCRKLEF

A0A6J1DUE1 uncharacterized protein LOC111023190 (e-value: 6.7e-99; identity: 82.52%)
Query:  MDFARCSDESKESENSSSSKCCGKVEAQPEIVEL------TRAPLGPSTPDADRESCDFPSDSKSPLTQVVASRALQLTCVDSLGGERIIEAPL-EGLDS
        MDFARCSDESKESE+  SSKCC KVEAQPE+VEL        APLGP+TPDADRES DFPSDSKSP TQVV +R+LQLTC+DSL GER  EAPL EGL+S
Subjt:  MDFARCSDESKESENSSSSKCCGKVEAQPEIVEL------TRAPLGPSTPDADRESCDFPSDSKSPLTQVVASRALQLTCVDSLGGERIIEAPL-EGLDS

Query:  FDPLCSPRTPKDGVFDPFSPGPSHLALAPLSRKYFSGSVGFVARRLQFGSSSSSSSLQLVEAEEEQSISDSELLEAVYENLLEVIVSHQAETSLAQLSSS
         DPLCSPRTPKDGVFDPFSPGP+HLALAPLSRKYFSGSVGFVARRLQFGSSSSSSS+QL E  EEQSISDSELLEAVYENLLEVIVS+QAE SLAQLSSS
Subjt:  FDPLCSPRTPKDGVFDPFSPGPSHLALAPLSRKYFSGSVGFVARRLQFGSSSSSSSLQLVEAEEEQSISDSELLEAVYENLLEVIVSHQAETSLAQLSSS

Query:  QSDSPDCITPPASFMSGVAQTCPAAPVKPSGKFRNFDLGLCRKLEF
        QSDS +CITPPA ++SG+AQTCP AP+KP GKFR+FDLGLCRKLEF
Subjt:  QSDSPDCITPPASFMSGVAQTCPAAPVKPSGKFRNFDLGLCRKLEF

A0A6J1IKP3 uncharacterized protein LOC111478311 (e-value: 5.5e-85; identity: 75.62%)
Query:  MDFARCSDESKESENSSSSKCCGKVEAQPEIVELTRAPLGPSTPDADRESCDFPSDSKSPLTQVVASRALQLTCVDSLGGERIIEAPLEGLDSFDPLCSP
        MDFARC+DESKES+  S SKCC K EAQPEIVEL RAP+ PSTPDADRES DFPSDSKSPLTQVV SRAL L+  D          PLE      PLCSP
Subjt:  MDFARCSDESKESENSSSSKCCGKVEAQPEIVELTRAPLGPSTPDADRESCDFPSDSKSPLTQVVASRALQLTCVDSLGGERIIEAPLEGLDSFDPLCSP

Query:  RTPKDGVFDPFSPGPSHLALAPLSRKYFSGSVGFVARRLQFGSS---SSSSSLQLVEAEEEQSISDSELLEAVYENLLEVIVSHQAETSLAQLSSSQSDS
        RTPKDGVFDPFSPGP+HLALAPL RK FSGSVGFVARRLQF SS   SSSSSLQ VE+EEEQ+I+DSELLEAVYENLLE+I+SHQAE+SLA ++SS +DS
Subjt:  RTPKDGVFDPFSPGPSHLALAPLSRKYFSGSVGFVARRLQFGSS---SSSSSLQLVEAEEEQSISDSELLEAVYENLLEVIVSHQAETSLAQLSSSQSDS

Query:  PDCITPPASFMSGVAQTCPAAPVKPSGKFRNFDLGLCRKLEF
         DC TPPASF++G+AQTCPAAPVKPS K RN DLGLCRKLEF
Subjt:  PDCITPPASFMSGVAQTCPAAPVKPSGKFRNFDLGLCRKLEF

SwissProt top hits
P85192 Unknown protein 1 (e-value: 8.4e-06; identity: 37.86%)
Query:  GSSSSSSSLQLVEAE--EEQSISDSELLEAVYENLLEVIVSHQAETSLAQLSSSQSDSPDCITPP-ASFMSGVAQTCPAAPVKPS--GKFRNFDLGLCRK
        G+S++   L     E   E    +  L+E VYE++LE I+  QAE  LA++  S S      TPP A+   G+ +TCP AP+K       R  D+ LCRK
Subjt:  GSSSSSSSLQLVEAE--EEQSISDSELLEAVYENLLEVIVSHQAETSLAQLSSSQSDSPDCITPP-ASFMSGVAQTCPAAPVKPS--GKFRNFDLGLCRK

Query:  LEF
        L+F
Subjt:  LEF

Q9SKN7 Cyclin-dependent protein kinase inhibitor SMR11 (e-value: 7.1e-05; identity: 33.33%)
Query:  EAPLEGLDSFDPLCSPRTPKDGVFD--PFSPGPSHLALAPLSRKYFSGSVGFVARRLQFGSSSSSSSLQLVEAEEEQSISDSELLEAVYENLLEVIVSHQ
        E P E  ++      P+TP   V D  P     S L+   +  +   G V       Q   S+S S   LV        SD E++E++Y+NLL VI+S Q
Subjt:  EAPLEGLDSFDPLCSPRTPKDGVFD--PFSPGPSHLALAPLSRKYFSGSVGFVARRLQFGSSSSSSSLQLVEAEEEQSISDSELLEAVYENLLEVIVSHQ

Query:  AETSLAQLSSSQSDSPDCITPPASF----MSGVAQTCPAAPVKPSGKFRNFDLGLCRKL
           S+A +         C TPP S      + V+ TCP AP+K +   RN D GL RKL
Subjt:  AETSLAQLSSSQSDSPDCITPPASF----MSGVAQTCPAAPVKPSGKFRNFDLGLCRKL

Arabidopsis top hits
AT2G28330.1 unknown protein (e-value: 5.0e-06; identity: 33.33%)
Query:  EAPLEGLDSFDPLCSPRTPKDGVFD--PFSPGPSHLALAPLSRKYFSGSVGFVARRLQFGSSSSSSSLQLVEAEEEQSISDSELLEAVYENLLEVIVSHQ
        E P E  ++      P+TP   V D  P     S L+   +  +   G V       Q   S+S S   LV        SD E++E++Y+NLL VI+S Q
Subjt:  EAPLEGLDSFDPLCSPRTPKDGVFD--PFSPGPSHLALAPLSRKYFSGSVGFVARRLQFGSSSSSSSLQLVEAEEEQSISDSELLEAVYENLLEVIVSHQ

Query:  AETSLAQLSSSQSDSPDCITPPASF----MSGVAQTCPAAPVKPSGKFRNFDLGLCRKL
           S+A +         C TPP S      + V+ TCP AP+K +   RN D GL RKL
Subjt:  AETSLAQLSSSQSDSPDCITPPASF----MSGVAQTCPAAPVKPSGKFRNFDLGLCRKL


Sequences
CDS sequence
ATGGATTTTGCGCGATGTTCTGATGAATCTAAGGAATCAGAAAATTCTTCGTCTAGTAAATGTTGCGGAAAAGTGGAAGCTCAGCCGGAGATTGTTGAGCTTACACGGGC
TCCTCTTGGACCTTCAACGCCTGATGCTGATCGAGAGAGTTGTGATTTTCCGTCTGATTCAAAGTCCCCACTCACCCAGGTTGTGGCTAGCAGAGCCCTTCAGTTAACTT
GTGTAGATTCTTTAGGTGGTGAAAGGATTATTGAGGCCCCTTTAGAGGGTCTTGATTCTTTTGATCCTCTCTGTAGCCCTCGTACTCCTAAGGATGGTGTTTTTGATCCC
TTTTCCCCTGGCCCTTCTCATTTGGCTTTAGCTCCTCTCTCTAGAAAGTATTTTAGTGGCTCTGTTGGATTTGTTGCTCGTCGCCTTCAGTTTGGGTCTTCTTCTTCTTC
TTCTTCATTGCAATTGGTGGAAGCTGAAGAAGAACAATCCATATCCGATAGTGAGCTGTTAGAGGCTGTTTATGAAAATCTCTTGGAAGTTATTGTCTCCCATCAGGCTG
AGACCTCTCTTGCTCAGCTCTCAAGTTCTCAAAGTGACTCTCCTGATTGTATTACCCCTCCTGCTTCGTTTATGAGTGGGGTTGCTCAAACTTGTCCTGCTGCGCCTGTC
AAGCCATCGGGAAAGTTTCGAAACTTCGACTTGGGTTTGTGCAGAAAGCTTGAGTTCTGA
mRNA sequence
GGAAAATAGAGAGGAAGAAGAAGAAGAAGGTTAAATATGACCGTTCGATGAGTTCTGATCTACAGTTTCCAGTTTCACTTCCAAAAAACTCCATTCCTAATTCACTTCCA
ACCTCCATTTTTCTCTCTCTACAGTCGCTTCTCTTCTAGTTTTTCGAGCCATAATCTCTAACTTTTAGCTTTCCTCTTCCCCTTTCCATCGCCGTCTTCATCTTCAACCT
CAGTTGAATCCCATTTCTCAGCCGCCGGGGTTTCTCATTCGTTTCTCCGCCGTTCAGGTTTCGCTTTGCGCCATTTTGATTTAACGAGGATGGATTTTGCGCGATGTTCT
GATGAATCTAAGGAATCAGAAAATTCTTCGTCTAGTAAATGTTGCGGAAAAGTGGAAGCTCAGCCGGAGATTGTTGAGCTTACACGGGCTCCTCTTGGACCTTCAACGCC
TGATGCTGATCGAGAGAGTTGTGATTTTCCGTCTGATTCAAAGTCCCCACTCACCCAGGTTGTGGCTAGCAGAGCCCTTCAGTTAACTTGTGTAGATTCTTTAGGTGGTG
AAAGGATTATTGAGGCCCCTTTAGAGGGTCTTGATTCTTTTGATCCTCTCTGTAGCCCTCGTACTCCTAAGGATGGTGTTTTTGATCCCTTTTCCCCTGGCCCTTCTCAT
TTGGCTTTAGCTCCTCTCTCTAGAAAGTATTTTAGTGGCTCTGTTGGATTTGTTGCTCGTCGCCTTCAGTTTGGGTCTTCTTCTTCTTCTTCTTCATTGCAATTGGTGGA
AGCTGAAGAAGAACAATCCATATCCGATAGTGAGCTGTTAGAGGCTGTTTATGAAAATCTCTTGGAAGTTATTGTCTCCCATCAGGCTGAGACCTCTCTTGCTCAGCTCT
CAAGTTCTCAAAGTGACTCTCCTGATTGTATTACCCCTCCTGCTTCGTTTATGAGTGGGGTTGCTCAAACTTGTCCTGCTGCGCCTGTCAAGCCATCGGGAAAGTTTCGA
AACTTCGACTTGGGTTTGTGCAGAAAGCTTGAGTTCTGACTGTTGGTGCCCTTCTGGCTAGCTGCAGATCATTGCATAGAAGAATGTAGGTAACTGACTCTCGAGATGAA
TTACTTGCTAATTTTAAATCTGACGTTTCTGTACTGAATTATATGACTAGCAAAATCTCTGCTGAAGCAGGCTAATTATTGTAAATTTTGGAAGTCTTCTTTTGATGAAC
AAACTTGTTGGGTTGGATGTAGTCCTGTGATCTCATCTCCCCCAGTTTGGCTTAGAATAGGATTGTTAAAAATGATATTTACCAATAAGGCAGTTCTTTTGTTAACCTGA
TCGGCAAGTTGTTAATAAGATGTCACTGTTGTTACATTAGTTGGTGTACTTGATCTCCTAGGTAGGGATGCACTTGTAGAATTGCAAAGTGAAGAACTATTGGTTTCTTG
TATCACTGTTGTTATTTGGCAAGGAAACTTGTTGGCTCTTGCATTTGTTCTGTTAGAATTGCACCATTTCTAGCCCACCTCTTAAGTTGTGCCATGTTGCCTATCAGGAA
AAATATATTGAATGTTAAATTCTTATGTCAATAATAACTAAAACCTTTTGGATCTTGTGGTGTTTAACAACCAAGCACCTTGCAAGTTCGAAAGTTCAGTTTGTTCATAT
GCGTCGTTGAGTTCTAAAATAATAAGCTATTATACAAATTAGTTAAGACTGAAATACACTTATGGGAATCAAGCTAACTTCGACCTGTTGAGTATCGAAACAGTTCAATT
CTCATTTGGCCGAGCTCAAGTAAAGGTCATTGTAAAGGATATAAATTGCTGACTTTTCAAAGGAGCCAAGATGCTAGTTTTGCTGTAGTATTACGACATATGAAGCGATG
TAGCATTAGCGGTTGCATTTTTATTCATCGAGTTTCTGTTCATTTGAGCTGTCTCAATCATCTGGTATCCAAAAATGAATGATGGTGGGACTCCCAGGATGATTGTGCAG
TAGAATACTAGATCTTAATTGTTTCTGCAGATCCTTCAGAAATGGACTGATGCAAGCAAACAGAAGTGTTGGGTCTAAAATGATGCTTCCATTTACAGTGGGGGATATGA
CTCTACTTGGCAGTGCATTGAAGTGAGAAATGTTTAGCAACAAGTAGGAACTAAGAAGAGGGATTGGTGGTAGTCAGATTCTCTTTCCTAGGCATTGGCATGGCTGCTGT
TTCTTGTGCAGGTAGAAAGTGTCAACTGTCAACAATGGGTAGGATATAGTTACAATCTCAAAAGCATATTGGCTTTTGAGCCTTGACAAAGTCCTAACTTGTCATGGGTG
TTGCTTTTTATCTAGTTGTTAGTTGGATAAGTTCAGTTTCACCTTAAAGTTTGAACTTTATTAAATTAAATATTGAATTCAAATAAACTACCTCTAGTGGCC
Protein sequence
MDFARCSDESKESENSSSSKCCGKVEAQPEIVELTRAPLGPSTPDADRESCDFPSDSKSPLTQVVASRALQLTCVDSLGGERIIEAPLEGLDSFDPLCSPRTPKDGVFDP
FSPGPSHLALAPLSRKYFSGSVGFVARRLQFGSSSSSSSLQLVEAEEEQSISDSELLEAVYENLLEVIVSHQAETSLAQLSSSQSDSPDCITPPASFMSGVAQTCPAAPV
KPSGKFRNFDLGLCRKLEF
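
The CDS and protein sequences above can be checked against each other by translating the CDS. The sketch below assumes the standard genetic code (NCBI translation table 1), which the page does not state explicitly. `translate` is an illustrative helper rather than a database or library function. Only the first 30 and last 60 nucleotides of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (first base varies slowest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()      # drop line breaks and other whitespace
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' for codons with ambiguous bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

cds_start = "ATGGATTTTGCGCGATGTTCTGATGAATCT"
cds_end = "AAGCCATCGGGAAAGTTTCGAAACTTCGACTTGGGTTTGTGCAGAAAGCTTGAGTTCTGA"
print(translate(cds_start))  # first ten residues
print(translate(cds_end))    # last residues before the stop codon
```

Passing the full CDS, with its line breaks left in, should yield the complete protein sequence, since `translate` removes whitespace before reading codons.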