; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS005896 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS005896
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionCCHC-type domain-containing protein
Genome locationscaffold254:2284036..2286741
RNA-Seq ExpressionMS005896
SyntenyMS005896
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0005634 - nucleus (cellular component)
GO:0003677 - DNA binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR003173 - Transcriptional coactivator p15 (PC4), C-terminal
IPR009044 - ssDNA-binding transcriptional regulator
IPR014876 - DEK, C-terminal
IPR036875 - Zinc finger, CCHC-type superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TYK20834.1 Zinc knuckle family protein, putative isoform 2 [Cucumis melo var. makuwa]2.0e-15764.21Show/hide
Query:  MDPETRRRIEETVIDVLQKSNMEEMTEFKVRAAAAERLGIDLSGMECKRVVRNVVDKFLLSIVEHEDKGKEEGTGPTAPDHAKAVVQEIISKKEINDDGA
        M+ ETRR+IEE VI+VL++SN+E+ TEFKVR+   ER+GIDLS  +CK +VRNVV+ FLLS+ E    GKE+  GP+     +AV Q+II KKE NDDG 
Subjt:  MDPETRRRIEETVIDVLQKSNMEEMTEFKVRAAAAERLGIDLSGMECKRVVRNVVDKFLLSIVEHEDKGKEEGTGPTAPDHAKAVVQEIISKKEINDDGA

Query:  RLICKLSENRNVTIHDFRGRTLVSIRQYYEKGGKQLPSAKGISLLTEQWAALRSSIPAIEEAILQMKKRLRFRHDAEKIDAVSDSTTEVTPPKFPFEIIR
         LIC+LS NR+VTIH F+G  +VSIRQYY K GKQLP+ KGIS+ TEQW+  +S+IPAI EAILQMK+  R  HDA+KI A+S + T VT PKFP E IR
Subjt:  RLICKLSENRNVTIHDFRGRTLVSIRQYYEKGGKQLPSAKGISLLTEQWAALRSSIPAIEEAILQMKKRLRFRHDAEKIDAVSDSTTEVTPPKFPFEIIR

Query:  FDGKNYHVWARQMEFLLQHLKIAYVLFDSCPSAVLGPGSSSGNAAQSMAAEQKWMNDDHMCRHNILNSLSDSLFHQYTKRTMSARELWEELKLLYLFEGF
        FDGKNYH WA QME LLQ LKIAYVL + CP+AVLG  SSSGNAAQS  AEQKWM+DDHMC  NILNSLSD LF++Y+K+ MSA ELW+ELKLLY  E F
Subjt:  FDGKNYHVWARQMEFLLQHLKIAYVLFDSCPSAVLGPGSSSGNAAQSMAAEQKWMNDDHMCRHNILNSLSDSLFHQYTKRTMSARELWEELKLLYLFEGF

Query:  GTKRTQVEKYVEFRLAEEKSILEQVQELNSISDSIVAAGMRIDEDFHVSVIMSKLPSSWKNVWMKLMQEEPLPLWKLMDRLRIEEELRTQQNSHLSGASS
        GTKR+QV+KY+EF++ EEKSILEQV+ELN I+DSI +AG  IDEDFHVS I+SKLP SWKNVWM LM E  LPL KL DRLRIEE+LRTQ+NS LS  S 
Subjt:  GTKRTQVEKYVEFRLAEEKSILEQVQELNSISDSIVAAGMRIDEDFHVSVIMSKLPSSWKNVWMKLMQEEPLPLWKLMDRLRIEEELRTQQNSHLSGASS

Query:  NPVDR--HPAPDRISKMRDPKLTSPQFKRKERQVDNMPLLCFDCGKKGHISRNCPSRKVDN
         P  R  H A +  SKM DP   +   ++KE Q +   LLC DCGK+GH S NCP++KVDN
Subjt:  NPVDR--HPAPDRISKMRDPKLTSPQFKRKERQVDNMPLLCFDCGKKGHISRNCPSRKVDN

XP_004134299.1 uncharacterized protein LOC101205072 [Cucumis sativus]6.7e-16665.74Show/hide
Query:  MDPETRRRIEETVIDVLQKSNMEEMTEFKVRAAAAERLGIDLSGMECKRVVRNVVDKFLLSIVEHEDKGKEEGTGPTAPDHAKAVVQEIISKKEINDDGA
        M+ ETRRRIEE VI+VL+KS+ME+ TEFKVR+   ERLGIDLS  +CK +VRNVV+ FLLS+ E    GKE+  GP+     KAV Q+I+ KKE NDDG 
Subjt:  MDPETRRRIEETVIDVLQKSNMEEMTEFKVRAAAAERLGIDLSGMECKRVVRNVVDKFLLSIVEHEDKGKEEGTGPTAPDHAKAVVQEIISKKEINDDGA

Query:  RLICKLSENRNVTIHDFRGRTLVSIRQYYEKGGKQLPSAKGISLLTEQWAALRSSIPAIEEAILQMKKRLRFRHDAEKIDAVSDSTTEVTPPKFPFEIIR
         LIC+LS NR+VTIH F+G  +VS+RQYYEK GKQLP+ KGIS+ TEQW+  +S+IPAI EAILQMK+  R  HDAEKI A S+ TT VT PK+P E IR
Subjt:  RLICKLSENRNVTIHDFRGRTLVSIRQYYEKGGKQLPSAKGISLLTEQWAALRSSIPAIEEAILQMKKRLRFRHDAEKIDAVSDSTTEVTPPKFPFEIIR

Query:  FDGKNYHVWARQMEFLLQHLKIAYVLFDSCPSAVLGPGSSSGNAAQSMAAEQKWMNDDHMCRHNILNSLSDSLFHQYTKRTMSARELWEELKLLYLFEGF
        FDGKNY+ WA QME LLQ LKIAYVL + CP+AVLG  SSSGNAAQS AAEQKWM DDHMCR NILNSLSD LF++Y+K+TMSA ELW+ELKLLYL E F
Subjt:  FDGKNYHVWARQMEFLLQHLKIAYVLFDSCPSAVLGPGSSSGNAAQSMAAEQKWMNDDHMCRHNILNSLSDSLFHQYTKRTMSARELWEELKLLYLFEGF

Query:  GTKRTQVEKYVEFRLAEEKSILEQVQELNSISDSIVAAGMRIDEDFHVSVIMSKLPSSWKNVWMKLMQEEPLPLWKLMDRLRIEEELRTQQNSHLSGASS
        GTKR+QV+KY+EF++ EEKSILEQV+ELN I+DSI ++G  IDEDFHVS I+SKLP SWKNVW+ LM E+ LPL KL DRLRIEE+LRTQ+NS LSG SS
Subjt:  GTKRTQVEKYVEFRLAEEKSILEQVQELNSISDSIVAAGMRIDEDFHVSVIMSKLPSSWKNVWMKLMQEEPLPLWKLMDRLRIEEELRTQQNSHLSGASS

Query:  NPVDR--HPAPDRISKMRDPKLTSPQFKRKERQVDNMPLLCFDCGKKGHISRNCPSRKVDNEVTRER
        +P  R  H A +  SKM DPK  +   ++KE Q +   LLC DCGK+GH S NCP++KV+NEV R+R
Subjt:  NPVDR--HPAPDRISKMRDPKLTSPQFKRKERQVDNMPLLCFDCGKKGHISRNCPSRKVDNEVTRER

XP_008437880.1 PREDICTED: uncharacterized protein LOC103483179 [Cucumis melo]5.2e-15864.43Show/hide
Query:  MDPETRRRIEETVIDVLQKSNMEEMTEFKVRAAAAERLGIDLSGMECKRVVRNVVDKFLLSIVEHEDKGKEEGTGPTAPDHAKAVVQEIISKKEINDDGA
        M+ ETRR+IEE VI+VL++SN+E+ TEFKVR+   ER+GIDLS  +CK +VRNVV+ FLLS+ E    GKE+  GP+     +AV Q+II KKE NDDG 
Subjt:  MDPETRRRIEETVIDVLQKSNMEEMTEFKVRAAAAERLGIDLSGMECKRVVRNVVDKFLLSIVEHEDKGKEEGTGPTAPDHAKAVVQEIISKKEINDDGA

Query:  RLICKLSENRNVTIHDFRGRTLVSIRQYYEKGGKQLPSAKGISLLTEQWAALRSSIPAIEEAILQMKKRLRFRHDAEKIDAVSDSTTEVTPPKFPFEIIR
         LIC+LS NR+VTIH F+G  +VSIRQYY K GKQLP+ KGIS+ TEQW+  +S+IPAI EAILQMK+  R  HDA+KI A+S + T VT PKFP E IR
Subjt:  RLICKLSENRNVTIHDFRGRTLVSIRQYYEKGGKQLPSAKGISLLTEQWAALRSSIPAIEEAILQMKKRLRFRHDAEKIDAVSDSTTEVTPPKFPFEIIR

Query:  FDGKNYHVWARQMEFLLQHLKIAYVLFDSCPSAVLGPGSSSGNAAQSMAAEQKWMNDDHMCRHNILNSLSDSLFHQYTKRTMSARELWEELKLLYLFEGF
        FDGKNYH WA QME LLQ LKIAYVL + CP+AVLG  SSSGNAAQS  AEQKWM+DDHMC  NILNSLSD LF++Y+K+ MSA ELW+ELKLLY  E F
Subjt:  FDGKNYHVWARQMEFLLQHLKIAYVLFDSCPSAVLGPGSSSGNAAQSMAAEQKWMNDDHMCRHNILNSLSDSLFHQYTKRTMSARELWEELKLLYLFEGF

Query:  GTKRTQVEKYVEFRLAEEKSILEQVQELNSISDSIVAAGMRIDEDFHVSVIMSKLPSSWKNVWMKLMQEEPLPLWKLMDRLRIEEELRTQQNSHLSGASS
        GTKR+QV+KY+EF++ EEKSILEQV+ELN I+DSI +AG  IDEDFHVS I+SKLP SWKNVWM LMQE  LPL KL DRLRIEE+LRTQ+NS LS  S 
Subjt:  GTKRTQVEKYVEFRLAEEKSILEQVQELNSISDSIVAAGMRIDEDFHVSVIMSKLPSSWKNVWMKLMQEEPLPLWKLMDRLRIEEELRTQQNSHLSGASS

Query:  NPVDR--HPAPDRISKMRDPKLTSPQFKRKERQVDNMPLLCFDCGKKGHISRNCPSRKVDN
         P  R  H A +  SKM DP   +   ++KE Q +   LLC DCGK+GH S NCP++KVDN
Subjt:  NPVDR--HPAPDRISKMRDPKLTSPQFKRKERQVDNMPLLCFDCGKKGHISRNCPSRKVDN

XP_022945450.1 uncharacterized protein LOC111449676 [Cucurbita moschata]5.0e-15364.11Show/hide
Query:  MDPETRRRIEETVIDVLQKSNMEEMTEFKVRAAAAERLGIDLSGMECKRVVRNVVDKFLLSIVEHEDKGKEEGTGPTAPDHAKAVVQEIISKKEINDDGA
        MD ETRRRI+ETVID+L+ SNMEEMTE+K+RA A +RLG+DLS ++CK +VR+VV+ FL S  E +DKGKE   GP+     KA  QEI+ KKEIN D  
Subjt:  MDPETRRRIEETVIDVLQKSNMEEMTEFKVRAAAAERLGIDLSGMECKRVVRNVVDKFLLSIVEHEDKGKEEGTGPTAPDHAKAVVQEIISKKEINDDGA

Query:  RLICKLSENRNVTIHDFRGRTLVSIRQYYEKGGKQLPSAKGISLLTEQWAALRSSIPAIEEAILQMKKRL-RFRHDAEKIDAVSDSTTEVTPPKFPFEII
        R+IC+LS NRNVT+H+F+G  LVSIRQYYEK GKQLP  KGISL TEQW+A RS+IPAIEEAILQMK+++ R  HDA    AVS   T  + PKFP E I
Subjt:  RLICKLSENRNVTIHDFRGRTLVSIRQYYEKGGKQLPSAKGISLLTEQWAALRSSIPAIEEAILQMKKRL-RFRHDAEKIDAVSDSTTEVTPPKFPFEII

Query:  RFDGKNYHVWARQMEFLLQHLKIAYVLFDSCPSAVLGPGSSSGNAAQSMAAEQKWMNDDHMCRHNILNSLSDSLFHQYTKRTMSARELWEELKLLYLFEG
        RFDGKNY VWARQMEFLL+ LKIAYVL D  P+++LGP SSSGN ++S A+EQ+WM+DDHMCRH ILNSLSDSLFH+YTKRTMSARELW+EL  LYL + 
Subjt:  RFDGKNYHVWARQMEFLLQHLKIAYVLFDSCPSAVLGPGSSSGNAAQSMAAEQKWMNDDHMCRHNILNSLSDSLFHQYTKRTMSARELWEELKLLYLFEG

Query:  FGTKRTQVEKYVEFRLAEEKSILEQVQELNSISDSIVAAGMRIDEDFHVSVIMSKLPSSWKNVWMKLMQEEPLPLWKLMDRLRIEEELRTQQNSHLSGAS
        +GT+R+QV+KY+EFR+ EEKSILEQV+ELN+I++SI++AGMRIDEDFHVS I+SKLP SW NV++KLM+EE LP   L+DRLR EE+LRTQQNSH SG  
Subjt:  FGTKRTQVEKYVEFRLAEEKSILEQVQELNSISDSIVAAGMRIDEDFHVSVIMSKLPSSWKNVWMKLMQEEPLPLWKLMDRLRIEEELRTQQNSHLSGAS

Query:  SNPVDRHPAPDRISKMRDPKLTSPQFKRKERQVDNMPLLCFDCGKKGHISRNCPSRK
               P  +   KM D    S   +++E ++D   LLC +CGK+GHISR+CPS K
Subjt:  SNPVDRHPAPDRISKMRDPKLTSPQFKRKERQVDNMPLLCFDCGKKGHISRNCPSRK

XP_038878142.1 uncharacterized protein LOC120070296 [Benincasa hispida]2.0e-15764.85Show/hide
Query:  MDPETRRRIEETVIDVLQKSNMEEMTEFKVRAAAAERLGIDLSGMECKRVVRNVVDKFLLSIVEHEDKGKEEGTGPTAPDHAKAVVQEIISKKEINDDGA
        MD ET+ +IEETVIDVL+KSNMEE TEFKVR    ERLGIDLS  E K +VRNVV+ FLLS+ E    GKE+  GP+     K + QE I+ KE NDDG 
Subjt:  MDPETRRRIEETVIDVLQKSNMEEMTEFKVRAAAAERLGIDLSGMECKRVVRNVVDKFLLSIVEHEDKGKEEGTGPTAPDHAKAVVQEIISKKEINDDGA

Query:  RLICKLSENRNVTIHDFRGRTLVSIRQYYEKGGKQLPSAKGISLLTEQWAALRSSIPAIEEAILQMKKRLRFRHDAEKIDAVSDSTTEVTPPKFPFEIIR
         +IC+LS NR+VTIH FRG  +VSIRQ++EK GKQLP+ KGIS+ TEQW+A +S+IPAIE+AILQMK + R  HDA+KI AVS+ T  VTPP FP E IR
Subjt:  RLICKLSENRNVTIHDFRGRTLVSIRQYYEKGGKQLPSAKGISLLTEQWAALRSSIPAIEEAILQMKKRLRFRHDAEKIDAVSDSTTEVTPPKFPFEIIR

Query:  FDGKNYHVWARQMEFLLQHLKIAYVLFDSCPSAVLGPGSSSGNAAQSMAAEQKWMNDDHMCRHNILNSLSDSLFHQYTKRTMSARELWEELKLLYLFEGF
        FDGKNY++WA QME LLQHLKIAYVL + CP+ VLGP SSSGN AQ+ AAEQKWM+DD MC  NILNSLSD LF++Y  +TMSA ELW ELKLLY  E F
Subjt:  FDGKNYHVWARQMEFLLQHLKIAYVLFDSCPSAVLGPGSSSGNAAQSMAAEQKWMNDDHMCRHNILNSLSDSLFHQYTKRTMSARELWEELKLLYLFEGF

Query:  GTKRTQVEKYVEFRLAEEKSILEQVQELNSISDSIVAAGMRIDEDFHVSVIMSKLPSSWKNVWMKLMQEEPLPLWKLMDRLRIEEELRTQQNSHLSGASS
        GTKR+QV+KY+EFR+ EEKSILEQV++LN+I+DSIV+AG  IDEDFHVS I+SKLP SW +VW+ LM E+ L L KL+DRLRIEE+LRTQ+NSHLSG SS
Subjt:  GTKRTQVEKYVEFRLAEEKSILEQVQELNSISDSIVAAGMRIDEDFHVSVIMSKLPSSWKNVWMKLMQEEPLPLWKLMDRLRIEEELRTQQNSHLSGASS

Query:  NP--VDRHPAPDRISKMRDPKLTSPQFKRKERQVDNMPLLCFDCGKKGHISRNCPSRK
         P    +H   + +SKM DPKL+S   +++E Q D   LLC +CGK+GH S NCPSRK
Subjt:  NP--VDRHPAPDRISKMRDPKLTSPQFKRKERQVDNMPLLCFDCGKKGHISRNCPSRK

TrEMBL top hitse value%identityAlignment
A0A0A0L3U5 CCHC-type domain-containing protein3.3e-16665.74Show/hide
Query:  MDPETRRRIEETVIDVLQKSNMEEMTEFKVRAAAAERLGIDLSGMECKRVVRNVVDKFLLSIVEHEDKGKEEGTGPTAPDHAKAVVQEIISKKEINDDGA
        M+ ETRRRIEE VI+VL+KS+ME+ TEFKVR+   ERLGIDLS  +CK +VRNVV+ FLLS+ E    GKE+  GP+     KAV Q+I+ KKE NDDG 
Subjt:  MDPETRRRIEETVIDVLQKSNMEEMTEFKVRAAAAERLGIDLSGMECKRVVRNVVDKFLLSIVEHEDKGKEEGTGPTAPDHAKAVVQEIISKKEINDDGA

Query:  RLICKLSENRNVTIHDFRGRTLVSIRQYYEKGGKQLPSAKGISLLTEQWAALRSSIPAIEEAILQMKKRLRFRHDAEKIDAVSDSTTEVTPPKFPFEIIR
         LIC+LS NR+VTIH F+G  +VS+RQYYEK GKQLP+ KGIS+ TEQW+  +S+IPAI EAILQMK+  R  HDAEKI A S+ TT VT PK+P E IR
Subjt:  RLICKLSENRNVTIHDFRGRTLVSIRQYYEKGGKQLPSAKGISLLTEQWAALRSSIPAIEEAILQMKKRLRFRHDAEKIDAVSDSTTEVTPPKFPFEIIR

Query:  FDGKNYHVWARQMEFLLQHLKIAYVLFDSCPSAVLGPGSSSGNAAQSMAAEQKWMNDDHMCRHNILNSLSDSLFHQYTKRTMSARELWEELKLLYLFEGF
        FDGKNY+ WA QME LLQ LKIAYVL + CP+AVLG  SSSGNAAQS AAEQKWM DDHMCR NILNSLSD LF++Y+K+TMSA ELW+ELKLLYL E F
Subjt:  FDGKNYHVWARQMEFLLQHLKIAYVLFDSCPSAVLGPGSSSGNAAQSMAAEQKWMNDDHMCRHNILNSLSDSLFHQYTKRTMSARELWEELKLLYLFEGF

Query:  GTKRTQVEKYVEFRLAEEKSILEQVQELNSISDSIVAAGMRIDEDFHVSVIMSKLPSSWKNVWMKLMQEEPLPLWKLMDRLRIEEELRTQQNSHLSGASS
        GTKR+QV+KY+EF++ EEKSILEQV+ELN I+DSI ++G  IDEDFHVS I+SKLP SWKNVW+ LM E+ LPL KL DRLRIEE+LRTQ+NS LSG SS
Subjt:  GTKRTQVEKYVEFRLAEEKSILEQVQELNSISDSIVAAGMRIDEDFHVSVIMSKLPSSWKNVWMKLMQEEPLPLWKLMDRLRIEEELRTQQNSHLSGASS

Query:  NPVDR--HPAPDRISKMRDPKLTSPQFKRKERQVDNMPLLCFDCGKKGHISRNCPSRKVDNEVTRER
        +P  R  H A +  SKM DPK  +   ++KE Q +   LLC DCGK+GH S NCP++KV+NEV R+R
Subjt:  NPVDR--HPAPDRISKMRDPKLTSPQFKRKERQVDNMPLLCFDCGKKGHISRNCPSRKVDNEVTRER

A0A1S3AV18 uncharacterized protein LOC1034831792.5e-15864.43Show/hide
Query:  MDPETRRRIEETVIDVLQKSNMEEMTEFKVRAAAAERLGIDLSGMECKRVVRNVVDKFLLSIVEHEDKGKEEGTGPTAPDHAKAVVQEIISKKEINDDGA
        M+ ETRR+IEE VI+VL++SN+E+ TEFKVR+   ER+GIDLS  +CK +VRNVV+ FLLS+ E    GKE+  GP+     +AV Q+II KKE NDDG 
Subjt:  MDPETRRRIEETVIDVLQKSNMEEMTEFKVRAAAAERLGIDLSGMECKRVVRNVVDKFLLSIVEHEDKGKEEGTGPTAPDHAKAVVQEIISKKEINDDGA

Query:  RLICKLSENRNVTIHDFRGRTLVSIRQYYEKGGKQLPSAKGISLLTEQWAALRSSIPAIEEAILQMKKRLRFRHDAEKIDAVSDSTTEVTPPKFPFEIIR
         LIC+LS NR+VTIH F+G  +VSIRQYY K GKQLP+ KGIS+ TEQW+  +S+IPAI EAILQMK+  R  HDA+KI A+S + T VT PKFP E IR
Subjt:  RLICKLSENRNVTIHDFRGRTLVSIRQYYEKGGKQLPSAKGISLLTEQWAALRSSIPAIEEAILQMKKRLRFRHDAEKIDAVSDSTTEVTPPKFPFEIIR

Query:  FDGKNYHVWARQMEFLLQHLKIAYVLFDSCPSAVLGPGSSSGNAAQSMAAEQKWMNDDHMCRHNILNSLSDSLFHQYTKRTMSARELWEELKLLYLFEGF
        FDGKNYH WA QME LLQ LKIAYVL + CP+AVLG  SSSGNAAQS  AEQKWM+DDHMC  NILNSLSD LF++Y+K+ MSA ELW+ELKLLY  E F
Subjt:  FDGKNYHVWARQMEFLLQHLKIAYVLFDSCPSAVLGPGSSSGNAAQSMAAEQKWMNDDHMCRHNILNSLSDSLFHQYTKRTMSARELWEELKLLYLFEGF

Query:  GTKRTQVEKYVEFRLAEEKSILEQVQELNSISDSIVAAGMRIDEDFHVSVIMSKLPSSWKNVWMKLMQEEPLPLWKLMDRLRIEEELRTQQNSHLSGASS
        GTKR+QV+KY+EF++ EEKSILEQV+ELN I+DSI +AG  IDEDFHVS I+SKLP SWKNVWM LMQE  LPL KL DRLRIEE+LRTQ+NS LS  S 
Subjt:  GTKRTQVEKYVEFRLAEEKSILEQVQELNSISDSIVAAGMRIDEDFHVSVIMSKLPSSWKNVWMKLMQEEPLPLWKLMDRLRIEEELRTQQNSHLSGASS

Query:  NPVDR--HPAPDRISKMRDPKLTSPQFKRKERQVDNMPLLCFDCGKKGHISRNCPSRKVDN
         P  R  H A +  SKM DP   +   ++KE Q +   LLC DCGK+GH S NCP++KVDN
Subjt:  NPVDR--HPAPDRISKMRDPKLTSPQFKRKERQVDNMPLLCFDCGKKGHISRNCPSRKVDN

A0A5A7TZ44 Zinc knuckle family protein, putative isoform 22.5e-15864.43Show/hide
Query:  MDPETRRRIEETVIDVLQKSNMEEMTEFKVRAAAAERLGIDLSGMECKRVVRNVVDKFLLSIVEHEDKGKEEGTGPTAPDHAKAVVQEIISKKEINDDGA
        M+ ETRR+IEE VI+VL++SN+E+ TEFKVR+   ER+GIDLS  +CK +VRNVV+ FLLS+ E    GKE+  GP+     +AV Q+II KKE NDDG 
Subjt:  MDPETRRRIEETVIDVLQKSNMEEMTEFKVRAAAAERLGIDLSGMECKRVVRNVVDKFLLSIVEHEDKGKEEGTGPTAPDHAKAVVQEIISKKEINDDGA

Query:  RLICKLSENRNVTIHDFRGRTLVSIRQYYEKGGKQLPSAKGISLLTEQWAALRSSIPAIEEAILQMKKRLRFRHDAEKIDAVSDSTTEVTPPKFPFEIIR
         LIC+LS NR+VTIH F+G  +VSIRQYY K GKQLP+ KGIS+ TEQW+  +S+IPAI EAILQMK+  R  HDA+KI A+S + T VT PKFP E IR
Subjt:  RLICKLSENRNVTIHDFRGRTLVSIRQYYEKGGKQLPSAKGISLLTEQWAALRSSIPAIEEAILQMKKRLRFRHDAEKIDAVSDSTTEVTPPKFPFEIIR

Query:  FDGKNYHVWARQMEFLLQHLKIAYVLFDSCPSAVLGPGSSSGNAAQSMAAEQKWMNDDHMCRHNILNSLSDSLFHQYTKRTMSARELWEELKLLYLFEGF
        FDGKNYH WA QME LLQ LKIAYVL + CP+AVLG  SSSGNAAQS  AEQKWM+DDHMC  NILNSLSD LF++Y+K+ MSA ELW+ELKLLY  E F
Subjt:  FDGKNYHVWARQMEFLLQHLKIAYVLFDSCPSAVLGPGSSSGNAAQSMAAEQKWMNDDHMCRHNILNSLSDSLFHQYTKRTMSARELWEELKLLYLFEGF

Query:  GTKRTQVEKYVEFRLAEEKSILEQVQELNSISDSIVAAGMRIDEDFHVSVIMSKLPSSWKNVWMKLMQEEPLPLWKLMDRLRIEEELRTQQNSHLSGASS
        GTKR+QV+KY+EF++ EEKSILEQV+ELN I+DSI +AG  IDEDFHVS I+SKLP SWKNVWM LMQE  LPL KL DRLRIEE+LRTQ+NS LS  S 
Subjt:  GTKRTQVEKYVEFRLAEEKSILEQVQELNSISDSIVAAGMRIDEDFHVSVIMSKLPSSWKNVWMKLMQEEPLPLWKLMDRLRIEEELRTQQNSHLSGASS

Query:  NPVDR--HPAPDRISKMRDPKLTSPQFKRKERQVDNMPLLCFDCGKKGHISRNCPSRKVDN
         P  R  H A +  SKM DP   +   ++KE Q +   LLC DCGK+GH S NCP++KVDN
Subjt:  NPVDR--HPAPDRISKMRDPKLTSPQFKRKERQVDNMPLLCFDCGKKGHISRNCPSRKVDN

A0A5D3DBA1 Zinc knuckle family protein, putative isoform 29.5e-15864.21Show/hide
Query:  MDPETRRRIEETVIDVLQKSNMEEMTEFKVRAAAAERLGIDLSGMECKRVVRNVVDKFLLSIVEHEDKGKEEGTGPTAPDHAKAVVQEIISKKEINDDGA
        M+ ETRR+IEE VI+VL++SN+E+ TEFKVR+   ER+GIDLS  +CK +VRNVV+ FLLS+ E    GKE+  GP+     +AV Q+II KKE NDDG 
Subjt:  MDPETRRRIEETVIDVLQKSNMEEMTEFKVRAAAAERLGIDLSGMECKRVVRNVVDKFLLSIVEHEDKGKEEGTGPTAPDHAKAVVQEIISKKEINDDGA

Query:  RLICKLSENRNVTIHDFRGRTLVSIRQYYEKGGKQLPSAKGISLLTEQWAALRSSIPAIEEAILQMKKRLRFRHDAEKIDAVSDSTTEVTPPKFPFEIIR
         LIC+LS NR+VTIH F+G  +VSIRQYY K GKQLP+ KGIS+ TEQW+  +S+IPAI EAILQMK+  R  HDA+KI A+S + T VT PKFP E IR
Subjt:  RLICKLSENRNVTIHDFRGRTLVSIRQYYEKGGKQLPSAKGISLLTEQWAALRSSIPAIEEAILQMKKRLRFRHDAEKIDAVSDSTTEVTPPKFPFEIIR

Query:  FDGKNYHVWARQMEFLLQHLKIAYVLFDSCPSAVLGPGSSSGNAAQSMAAEQKWMNDDHMCRHNILNSLSDSLFHQYTKRTMSARELWEELKLLYLFEGF
        FDGKNYH WA QME LLQ LKIAYVL + CP+AVLG  SSSGNAAQS  AEQKWM+DDHMC  NILNSLSD LF++Y+K+ MSA ELW+ELKLLY  E F
Subjt:  FDGKNYHVWARQMEFLLQHLKIAYVLFDSCPSAVLGPGSSSGNAAQSMAAEQKWMNDDHMCRHNILNSLSDSLFHQYTKRTMSARELWEELKLLYLFEGF

Query:  GTKRTQVEKYVEFRLAEEKSILEQVQELNSISDSIVAAGMRIDEDFHVSVIMSKLPSSWKNVWMKLMQEEPLPLWKLMDRLRIEEELRTQQNSHLSGASS
        GTKR+QV+KY+EF++ EEKSILEQV+ELN I+DSI +AG  IDEDFHVS I+SKLP SWKNVWM LM E  LPL KL DRLRIEE+LRTQ+NS LS  S 
Subjt:  GTKRTQVEKYVEFRLAEEKSILEQVQELNSISDSIVAAGMRIDEDFHVSVIMSKLPSSWKNVWMKLMQEEPLPLWKLMDRLRIEEELRTQQNSHLSGASS

Query:  NPVDR--HPAPDRISKMRDPKLTSPQFKRKERQVDNMPLLCFDCGKKGHISRNCPSRKVDN
         P  R  H A +  SKM DP   +   ++KE Q +   LLC DCGK+GH S NCP++KVDN
Subjt:  NPVDR--HPAPDRISKMRDPKLTSPQFKRKERQVDNMPLLCFDCGKKGHISRNCPSRKVDN

A0A6J1G0Z2 uncharacterized protein LOC1114496762.4e-15364.11Show/hide
Query:  MDPETRRRIEETVIDVLQKSNMEEMTEFKVRAAAAERLGIDLSGMECKRVVRNVVDKFLLSIVEHEDKGKEEGTGPTAPDHAKAVVQEIISKKEINDDGA
        MD ETRRRI+ETVID+L+ SNMEEMTE+K+RA A +RLG+DLS ++CK +VR+VV+ FL S  E +DKGKE   GP+     KA  QEI+ KKEIN D  
Subjt:  MDPETRRRIEETVIDVLQKSNMEEMTEFKVRAAAAERLGIDLSGMECKRVVRNVVDKFLLSIVEHEDKGKEEGTGPTAPDHAKAVVQEIISKKEINDDGA

Query:  RLICKLSENRNVTIHDFRGRTLVSIRQYYEKGGKQLPSAKGISLLTEQWAALRSSIPAIEEAILQMKKRL-RFRHDAEKIDAVSDSTTEVTPPKFPFEII
        R+IC+LS NRNVT+H+F+G  LVSIRQYYEK GKQLP  KGISL TEQW+A RS+IPAIEEAILQMK+++ R  HDA    AVS   T  + PKFP E I
Subjt:  RLICKLSENRNVTIHDFRGRTLVSIRQYYEKGGKQLPSAKGISLLTEQWAALRSSIPAIEEAILQMKKRL-RFRHDAEKIDAVSDSTTEVTPPKFPFEII

Query:  RFDGKNYHVWARQMEFLLQHLKIAYVLFDSCPSAVLGPGSSSGNAAQSMAAEQKWMNDDHMCRHNILNSLSDSLFHQYTKRTMSARELWEELKLLYLFEG
        RFDGKNY VWARQMEFLL+ LKIAYVL D  P+++LGP SSSGN ++S A+EQ+WM+DDHMCRH ILNSLSDSLFH+YTKRTMSARELW+EL  LYL + 
Subjt:  RFDGKNYHVWARQMEFLLQHLKIAYVLFDSCPSAVLGPGSSSGNAAQSMAAEQKWMNDDHMCRHNILNSLSDSLFHQYTKRTMSARELWEELKLLYLFEG

Query:  FGTKRTQVEKYVEFRLAEEKSILEQVQELNSISDSIVAAGMRIDEDFHVSVIMSKLPSSWKNVWMKLMQEEPLPLWKLMDRLRIEEELRTQQNSHLSGAS
        +GT+R+QV+KY+EFR+ EEKSILEQV+ELN+I++SI++AGMRIDEDFHVS I+SKLP SW NV++KLM+EE LP   L+DRLR EE+LRTQQNSH SG  
Subjt:  FGTKRTQVEKYVEFRLAEEKSILEQVQELNSISDSIVAAGMRIDEDFHVSVIMSKLPSSWKNVWMKLMQEEPLPLWKLMDRLRIEEELRTQQNSHLSGAS

Query:  SNPVDRHPAPDRISKMRDPKLTSPQFKRKERQVDNMPLLCFDCGKKGHISRNCPSRK
               P  +   KM D    S   +++E ++D   LLC +CGK+GHISR+CPS K
Subjt:  SNPVDRHPAPDRISKMRDPKLTSPQFKRKERQVDNMPLLCFDCGKKGHISRNCPSRK

SwissProt top hitse value%identityAlignment
O65154 RNA polymerase II transcriptional coactivator KIWI1.6e-0841.94Show/hide
Query:  LICKLSENRNVTIHDFRGRTLVSIRQYYEKGGKQLPSAKGISLLTEQWAALRSSIPAIEEAI
        ++C +S+NR V++ ++ G+  + IR++Y K GK LP  KGISL  +QW  LR+    IE+A+
Subjt:  LICKLSENRNVTIHDFRGRTLVSIRQYYEKGGKQLPSAKGISLLTEQWAALRSSIPAIEEAI

O65155 RNA polymerase II transcriptional coactivator KELP1.7e-3447.13Show/hide
Query:  MDPETRRRIEETVIDVLQKSNMEEMTEFKVRAAAAERLGIDLSGMECKRVVRNVVDKFL----LSIVEHEDKGKEEGTGPTAPDHAKAVVQEIISKKEIN
        M+ ET+ +IE+TVI++L +S+M+E+TEFKVR  A+E+L IDLS    K  VR+VV+KFL        E+    KEE  G    D  K         KE +
Subjt:  MDPETRRRIEETVIDVLQKSNMEEMTEFKVRAAAAERLGIDLSGMECKRVVRNVVDKFL----LSIVEHEDKGKEEGTGPTAPDHAKAVVQEIISKKEIN

Query:  DDGARLICKLSENRNVTIHDFRGRTLVSIRQYYEKGGKQLPSAKGISLLTEQWAALRSSIPAIEEAILQMKKRL
        DDG  +IC+LS+ R VTI +F+G++LVSIR+YY+K GK+LP++KGISL  EQW+  + ++PAIE A+ +M+ R+
Subjt:  DDGARLICKLSENRNVTIHDFRGRTLVSIRQYYEKGGKQLPSAKGISLLTEQWAALRSSIPAIEEAILQMKKRL

P87294 Putative RNA polymerase II transcriptional coactivator7.3e-0638.6Show/hide
Query:  SENRNVTIHDFRGRTLVSIRQYYEKGGKQLPSAKGISLLTEQWAALRSSIPAIEEAI
        +E + +T+ +FRG   V IR+YYEK G  LP  KGI+L   +W  L+  I  +++++
Subjt:  SENRNVTIHDFRGRTLVSIRQYYEKGGKQLPSAKGISLLTEQWAALRSSIPAIEEAI

Q872F4 Putative RNA polymerase II transcriptional coactivator1.6e-0534.83Show/hide
Query:  NDDGARLICKLSENRNVTIHDFRGRTLVSIRQYYEKGGKQLPSAKGISLLTEQWAALRSSIPAIEEAILQMKKRLR---FRHDAEKIDA
        +D       +L  NR ++   FR  TLV+IR+YY+ GGK +P  KGISL   Q+  L   IP +   +      +    F   AE+ DA
Subjt:  NDDGARLICKLSENRNVTIHDFRGRTLVSIRQYYEKGGKQLPSAKGISLLTEQWAALRSSIPAIEEAILQMKKRLR---FRHDAEKIDA

Q9VLR5 RNA polymerase II transcriptional coactivator1.6e-0548.15Show/hide
Query:  RNVTIHDFRGRTLVSIRQYYEKGGKQLPSAKGISLLTEQWAALRSSIPAIEEAI
        R V I++FRGR  V IR++Y+KGG+ LP  KGISL   QW  L      +  AI
Subjt:  RNVTIHDFRGRTLVSIRQYYEKGGKQLPSAKGISLLTEQWAALRSSIPAIEEAI

Arabidopsis top hitse value%identityAlignment
AT4G00980.1 zinc knuckle (CCHC-type) family protein5.4e-8138.69Show/hide
Query:  MDPETRRRIEETVIDVLQKSNMEEMTEFKVRAAAAERLGIDLSGMECKRVVRNVVDKFLLSIVEHEDKGKEEGTGPTAPDHAKAVVQEIISKKEINDDGA
        M+    ++IEETV  +L +S+M++MTEFK+R  A+ +LGIDLSG   K++VR+V++ FLLS       G+       AP   + V    ++   +  +  
Subjt:  MDPETRRRIEETVIDVLQKSNMEEMTEFKVRAAAAERLGIDLSGMECKRVVRNVVDKFLLSIVEHEDKGKEEGTGPTAPDHAKAVVQEIISKKEINDDGA

Query:  RLICKLSENRNVTIHDFRGRTLVSIRQYYEKGGKQLPSAKGISLLTEQWAALRSSIPAIEEAILQMKKRLRFRHDAEKIDAVSDSTTEVTPPKFP-FEII
        R ICKLSE +N T+  +RG+  +SI    ++ GK   + +G  L T QW+ ++ +  AIE+ I Q + +L  + +A +    S++  + +   F   +I 
Subjt:  RLICKLSENRNVTIHDFRGRTLVSIRQYYEKGGKQLPSAKGISLLTEQWAALRSSIPAIEEAILQMKKRLRFRHDAEKIDAVSDSTTEVTPPKFP-FEII

Query:  RFDGKNYHVWARQMEFLLQHLKIAYVLFDSCPS--AVLGPGSSSGNAAQSMAAEQKWMNDDHMCRHNILNSLSDSLFHQYTKRTMSARELWEELKLLYLF
        RFDGK+Y  WA QME  L+ LK+ YVL + CPS  +  GP ++     ++ A  +KW+ DD++C  +++NSLSD L+ +Y+++   A+ELW+ELK +Y  
Subjt:  RFDGKNYHVWARQMEFLLQHLKIAYVLFDSCPS--AVLGPGSSSGNAAQSMAAEQKWMNDDHMCRHNILNSLSDSLFHQYTKRTMSARELWEELKLLYLF

Query:  EGFGTKRTQVEKYVEFRLAEEKSILEQVQELNSISDSIVAAGMRIDEDFHVSVIMSKLPSSWKNVWMKLMQEEPLPLWKLMDRLRIEEELRTQQNSHLSG
        +   +KR+QV KY+EFR+ EE+ ILEQVQ  N I+DSIV+AGM +DE FHVS I+SK P SW+    +LM+EE LP+W LM+R++ EEEL        +G
Subjt:  EGFGTKRTQVEKYVEFRLAEEKSILEQVQELNSISDSIVAAGMRIDEDFHVSVIMSKLPSSWKNVWMKLMQEEPLPLWKLMDRLRIEEELRTQQNSHLSG

Query:  ASSNPVDRHPAPDRISKMRDPKL-------TSPQFKRKERQVD-NMPLLCFDCGKKGHISRNCPSRKVDNEVT
        A    V   PA       R P L        S  +KRKE + D  + ++C +CG+KGH++++C   K D   +
Subjt:  ASSNPVDRHPAPDRISKMRDPKL-------TSPQFKRKERQVD-NMPLLCFDCGKKGHISRNCPSRKVDNEVT

AT4G10920.1 transcriptional coactivator p15 (PC4) family protein (KELP)1.2e-3547.13Show/hide
Query:  MDPETRRRIEETVIDVLQKSNMEEMTEFKVRAAAAERLGIDLSGMECKRVVRNVVDKFL----LSIVEHEDKGKEEGTGPTAPDHAKAVVQEIISKKEIN
        M+ ET+ +IE+TVI++L +S+M+E+TEFKVR  A+E+L IDLS    K  VR+VV+KFL        E+    KEE  G    D  K         KE +
Subjt:  MDPETRRRIEETVIDVLQKSNMEEMTEFKVRAAAAERLGIDLSGMECKRVVRNVVDKFL----LSIVEHEDKGKEEGTGPTAPDHAKAVVQEIISKKEIN

Query:  DDGARLICKLSENRNVTIHDFRGRTLVSIRQYYEKGGKQLPSAKGISLLTEQWAALRSSIPAIEEAILQMKKRL
        DDG  +IC+LS+ R VTI +F+G++LVSIR+YY+K GK+LP++KGISL  EQW+  + ++PAIE A+ +M+ R+
Subjt:  DDGARLICKLSENRNVTIHDFRGRTLVSIRQYYEKGGKQLPSAKGISLLTEQWAALRSSIPAIEEAILQMKKRL

AT4G10920.2 transcriptional coactivator p15 (PC4) family protein (KELP)1.2e-3547.13Show/hide
Query:  MDPETRRRIEETVIDVLQKSNMEEMTEFKVRAAAAERLGIDLSGMECKRVVRNVVDKFL----LSIVEHEDKGKEEGTGPTAPDHAKAVVQEIISKKEIN
        M+ ET+ +IE+TVI++L +S+M+E+TEFKVR  A+E+L IDLS    K  VR+VV+KFL        E+    KEE  G    D  K         KE +
Subjt:  MDPETRRRIEETVIDVLQKSNMEEMTEFKVRAAAAERLGIDLSGMECKRVVRNVVDKFL----LSIVEHEDKGKEEGTGPTAPDHAKAVVQEIISKKEIN

Query:  DDGARLICKLSENRNVTIHDFRGRTLVSIRQYYEKGGKQLPSAKGISLLTEQWAALRSSIPAIEEAILQMKKRL
        DDG  +IC+LS+ R VTI +F+G++LVSIR+YY+K GK+LP++KGISL  EQW+  + ++PAIE A+ +M+ R+
Subjt:  DDGARLICKLSENRNVTIHDFRGRTLVSIRQYYEKGGKQLPSAKGISLLTEQWAALRSSIPAIEEAILQMKKRL

AT5G09240.1 ssDNA-binding transcriptional regulator2.6e-0639.71Show/hide
Query:  ICKLSENRNVTIHDFRGRTLVSIRQYYEKGGKQLP--SAKGISLLTEQWAALRSSIPAIEEAILQMKK
        IC L +NR V + +  GR  ++IRQ++ K G  LP  S +GISL  EQW  LR+    I++A+ ++ +
Subjt:  ICKLSENRNVTIHDFRGRTLVSIRQYYEKGGKQLP--SAKGISLLTEQWAALRSSIPAIEEAILQMKK

AT5G09250.1 ssDNA-binding transcriptional regulator1.1e-0941.94Show/hide
Query:  LICKLSENRNVTIHDFRGRTLVSIRQYYEKGGKQLPSAKGISLLTEQWAALRSSIPAIEEAI
        ++C +S+NR V++ ++ G+  + IR++Y K GK LP  KGISL  +QW  LR+    IE+A+
Subjt:  LICKLSENRNVTIHDFRGRTLVSIRQYYEKGGKQLPSAKGISLLTEQWAALRSSIPAIEEAI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGACCCTGAGACCCGGCGGAGAATCGAGGAAACGGTGATTGACGTATTGCAGAAATCGAACATGGAAGAGATGACGGAGTTCAAAGTTCGAGCCGCGGCCGCAGAACG
GCTCGGAATCGATCTCTCCGGAATGGAATGCAAGCGGGTGGTGAGGAACGTGGTCGATAAATTTTTACTTTCTATCGTGGAGCACGAGGACAAGGGCAAGGAGGAAGGGA
CTGGCCCTACTGCTCCTGACCACGCTAAAGCGGTGGTGCAGGAGATAATCTCCAAGAAGGAGATCAACGATGATGGTGCCCGTCTGATTTGCAAGTTATCTGAAAACAGG
AATGTGACAATTCATGATTTTAGAGGGAGAACTTTGGTATCGATTAGGCAGTATTATGAAAAAGGTGGAAAACAACTTCCTAGTGCTAAAGGAATCAGCTTGCTGACTGA
ACAATGGGCGGCCCTTAGGAGTAGTATTCCTGCTATAGAGGAAGCTATTTTGCAGATGAAAAAAAGATTAAGATTTCGACATGATGCTGAAAAAATTGATGCTGTCTCAG
ATTCCACAACTGAGGTTACTCCTCCAAAATTTCCTTTTGAAATTATTCGGTTTGATGGAAAAAACTACCACGTATGGGCACGTCAGATGGAGTTTCTCCTGCAGCACTTA
AAGATTGCTTATGTACTTTTTGATTCATGCCCTAGTGCTGTTCTTGGGCCAGGATCAAGCTCTGGAAATGCTGCTCAGTCCATGGCCGCTGAACAGAAATGGATGAATGA
TGACCACATGTGTCGCCACAACATTCTGAACTCCCTCTCCGATAGTCTTTTTCATCAATACACAAAGAGAACAATGAGTGCCCGAGAACTCTGGGAGGAGCTAAAATTGC
TTTATCTTTTTGAAGGATTTGGCACCAAGAGAACTCAAGTAGAAAAATATGTGGAATTCAGGTTGGCCGAGGAGAAGTCAATATTAGAACAAGTTCAAGAACTTAATAGC
ATTTCTGATTCCATTGTAGCTGCTGGAATGCGGATTGACGAGGATTTTCACGTTAGTGTCATTATGTCAAAGCTTCCTTCTTCTTGGAAGAATGTGTGGATGAAGTTAAT
GCAAGAGGAGCCTCTTCCCCTTTGGAAGTTGATGGATCGATTGAGGATTGAAGAAGAACTACGAACACAACAAAACTCACATCTCTCAGGAGCGTCTTCTAATCCAGTAG
ACCGACATCCTGCCCCGGATCGCATTTCAAAGATGAGAGACCCAAAGCTCACAAGCCCACAGTTCAAGAGAAAGGAAAGGCAAGTGGATAACATGCCTTTACTCTGCTTT
GATTGTGGCAAGAAAGGGCATATATCTCGAAATTGTCCGAGTAGGAAGGTCGATAATGAAGTTACTAGGGAAAGAAGA
mRNA sequenceShow/hide mRNA sequence
ATGGACCCTGAGACCCGGCGGAGAATCGAGGAAACGGTGATTGACGTATTGCAGAAATCGAACATGGAAGAGATGACGGAGTTCAAAGTTCGAGCCGCGGCCGCAGAACG
GCTCGGAATCGATCTCTCCGGAATGGAATGCAAGCGGGTGGTGAGGAACGTGGTCGATAAATTTTTACTTTCTATCGTGGAGCACGAGGACAAGGGCAAGGAGGAAGGGA
CTGGCCCTACTGCTCCTGACCACGCTAAAGCGGTGGTGCAGGAGATAATCTCCAAGAAGGAGATCAACGATGATGGTGCCCGTCTGATTTGCAAGTTATCTGAAAACAGG
AATGTGACAATTCATGATTTTAGAGGGAGAACTTTGGTATCGATTAGGCAGTATTATGAAAAAGGTGGAAAACAACTTCCTAGTGCTAAAGGAATCAGCTTGCTGACTGA
ACAATGGGCGGCCCTTAGGAGTAGTATTCCTGCTATAGAGGAAGCTATTTTGCAGATGAAAAAAAGATTAAGATTTCGACATGATGCTGAAAAAATTGATGCTGTCTCAG
ATTCCACAACTGAGGTTACTCCTCCAAAATTTCCTTTTGAAATTATTCGGTTTGATGGAAAAAACTACCACGTATGGGCACGTCAGATGGAGTTTCTCCTGCAGCACTTA
AAGATTGCTTATGTACTTTTTGATTCATGCCCTAGTGCTGTTCTTGGGCCAGGATCAAGCTCTGGAAATGCTGCTCAGTCCATGGCCGCTGAACAGAAATGGATGAATGA
TGACCACATGTGTCGCCACAACATTCTGAACTCCCTCTCCGATAGTCTTTTTCATCAATACACAAAGAGAACAATGAGTGCCCGAGAACTCTGGGAGGAGCTAAAATTGC
TTTATCTTTTTGAAGGATTTGGCACCAAGAGAACTCAAGTAGAAAAATATGTGGAATTCAGGTTGGCCGAGGAGAAGTCAATATTAGAACAAGTTCAAGAACTTAATAGC
ATTTCTGATTCCATTGTAGCTGCTGGAATGCGGATTGACGAGGATTTTCACGTTAGTGTCATTATGTCAAAGCTTCCTTCTTCTTGGAAGAATGTGTGGATGAAGTTAAT
GCAAGAGGAGCCTCTTCCCCTTTGGAAGTTGATGGATCGATTGAGGATTGAAGAAGAACTACGAACACAACAAAACTCACATCTCTCAGGAGCGTCTTCTAATCCAGTAG
ACCGACATCCTGCCCCGGATCGCATTTCAAAGATGAGAGACCCAAAGCTCACAAGCCCACAGTTCAAGAGAAAGGAAAGGCAAGTGGATAACATGCCTTTACTCTGCTTT
GATTGTGGCAAGAAAGGGCATATATCTCGAAATTGTCCGAGTAGGAAGGTCGATAATGAAGTTACTAGGGAAAGAAGA
Protein sequenceShow/hide protein sequence
MDPETRRRIEETVIDVLQKSNMEEMTEFKVRAAAAERLGIDLSGMECKRVVRNVVDKFLLSIVEHEDKGKEEGTGPTAPDHAKAVVQEIISKKEINDDGARLICKLSENR
NVTIHDFRGRTLVSIRQYYEKGGKQLPSAKGISLLTEQWAALRSSIPAIEEAILQMKKRLRFRHDAEKIDAVSDSTTEVTPPKFPFEIIRFDGKNYHVWARQMEFLLQHL
KIAYVLFDSCPSAVLGPGSSSGNAAQSMAAEQKWMNDDHMCRHNILNSLSDSLFHQYTKRTMSARELWEELKLLYLFEGFGTKRTQVEKYVEFRLAEEKSILEQVQELNS
ISDSIVAAGMRIDEDFHVSVIMSKLPSSWKNVWMKLMQEEPLPLWKLMDRLRIEEELRTQQNSHLSGASSNPVDRHPAPDRISKMRDPKLTSPQFKRKERQVDNMPLLCF
DCGKKGHISRNCPSRKVDNEVTRERR