; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmaCh01G019690 (gene) of Cucurbita maxima (Rimu) v1.1 genome

Gene IDCmaCh01G019690
OrganismCucurbita maxima Rimu (Cucurbita maxima (Rimu) v1.1)
DescriptionC2H2-type domain-containing protein
Genome locationCma_Chr01:12700578..12702761
RNA-Seq ExpressionCmaCh01G019690
SyntenyCmaCh01G019690
Gene Ontology termsGO:0000976 - transcription regulatory region sequence-specific DNA binding (molecular function)
InterPro domainsIPR013087 - Zinc finger C2H2-type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6608592.1 hypothetical protein SDJN03_01934, partial [Cucurbita argyrosperma subsp. sororia]4.6e-21495.79Show/hide
Query:  KPINTERRKKHQNPPPSQPPPPPSAQSSWDQIKNLITCKQVEATRVHEPEKRPPAANSKLGSSYSSICRFRDVVHGNAKIVHRPDNSPETSSLGQETRLL
        KPINT+RRKKHQNPPPSQPPPPPSAQSSWDQIKNLITCKQVEATRVHEP KR P ANSKLGSS SSICRFRDVVHGNAKIVHRPDNSPETSSLGQETRLL
Subjt:  KPINTERRKKHQNPPPSQPPPPPSAQSSWDQIKNLITCKQVEATRVHEPEKRPPAANSKLGSSYSSICRFRDVVHGNAKIVHRPDNSPETSSLGQETRLL

Query:  TTKATNGSSSRSLTAPTRTKNGASASYSSSSRVLQLRKLSGCYECHAIVDPSRYPIPRSSICPCPQCGEMFSKMESLELHQAVRHAVSELGPDDSGRNIV
        TTKATNGSSSR+LTAPTRTKNGASASY SSSR LQLRKLSGCYECH IVDPSRYPIPRSSICPCPQCGEMF K+ESLELHQAVRHAVSELGPDDSGRNIV
Subjt:  TTKATNGSSSRSLTAPTRTKNGASASYSSSSRVLQLRKLSGCYECHAIVDPSRYPIPRSSICPCPQCGEMFSKMESLELHQAVRHAVSELGPDDSGRNIV

Query:  EIIFKSSWLKIDRPICKIERILKVHNTQRTIQRFEDCRDAVKTRALASTRKNPRCAADGNELLRFHCSELLCDLGSRGSTGLCGSIPGCRVCSVIRHGFQ
        EIIFKSSWLKIDRPICKIERILKVHNTQRTIQRFEDCRDAVKTRALA  RKNPRCAADGNELLRFHCSELLCDLGSRGSTGLC SIPGCRVCSVIRHGFQ
Subjt:  EIIFKSSWLKIDRPICKIERILKVHNTQRTIQRFEDCRDAVKTRALASTRKNPRCAADGNELLRFHCSELLCDLGSRGSTGLCGSIPGCRVCSVIRHGFQ

Query:  CKPGEPAGVRTTASSGRAHDSFECGDGRRRAMLVCRVIAGRVKRIAEDLSAAAVEEENLS-AAAASYDSVSRQSGVYSNLEELIIFNPKAILPCFVVIYE
        CKPGEPAGVRTTASSGRAHDSFECGDGRRRA+LVCRVIAGRVKRIAEDLSAAAVEEENLS AAAASYDSVSRQSG YSNLEELIIFNPKAILPCFVVIYE
Subjt:  CKPGEPAGVRTTASSGRAHDSFECGDGRRRAMLVCRVIAGRVKRIAEDLSAAAVEEENLS-AAAASYDSVSRQSGVYSNLEELIIFNPKAILPCFVVIYE

Query:  ALQT
        ALQT
Subjt:  ALQT

KAG7037909.1 hypothetical protein SDJN02_01540 [Cucurbita argyrosperma subsp. argyrosperma]2.1e-21183.58Show/hide
Query:  MAHSTSLAINAEKSRKTKPINTERRKKHQNPPPSQPP---PPPSAQSSWDQIKNLITCKQVEATRVHEPEKRPPAANSKLGSSYSSICRFRDVVHGNAKI
        MA STSLAINAEKSRK KPINT+RRKKHQNPPPSQPP   PPPSAQSSWDQIKNLITCKQVEATRVHEP KR P ANSKLGSS SSICRFRDVVHGNAKI
Subjt:  MAHSTSLAINAEKSRKTKPINTERRKKHQNPPPSQPP---PPPSAQSSWDQIKNLITCKQVEATRVHEPEKRPPAANSKLGSSYSSICRFRDVVHGNAKI

Query:  VHRPDNSPETSSLGQETRLLTTKATNGSSSRSLTAPTRTKNGASASYSSSSRVLQLRKLSGCYECHAIVDPS----------------------------
        VHRPDNSPETSSLGQETRLLTTKATNGSSSR+LTAPTRTKNGASASY SSSR LQLRKLSGCYECH IVDPS                            
Subjt:  VHRPDNSPETSSLGQETRLLTTKATNGSSSRSLTAPTRTKNGASASYSSSSRVLQLRKLSGCYECHAIVDPS----------------------------

Query:  -----------------------------RYPIPRSSICPCPQCGEMFSKMESLELHQAVRHAVSELGPDDSGRNIVEIIFKSSWLKIDRPICKIERILK
                                     RYPIPRSSICPCPQCGEMF K+ESLELHQAVRHAVSELGPDDSGRNIVEIIFKSSWLKIDRPICKIERILK
Subjt:  -----------------------------RYPIPRSSICPCPQCGEMFSKMESLELHQAVRHAVSELGPDDSGRNIVEIIFKSSWLKIDRPICKIERILK

Query:  VHNTQRTIQRFEDCRDAVKTRALASTRKNPRCAADGNELLRFHCSELLCDLGSRGSTGLCGSIPGCRVCSVIRHGFQCKPGEPAGVRTTASSGRAHDSFE
        VHNTQRTIQRFEDCRDAVKTRALA  RKNPRCAADGNELLRFHCSELLCDLGSRGSTGLC SIPGCRVCSVIRHGFQCKPGEPAGVRTTASSGRAHDSFE
Subjt:  VHNTQRTIQRFEDCRDAVKTRALASTRKNPRCAADGNELLRFHCSELLCDLGSRGSTGLCGSIPGCRVCSVIRHGFQCKPGEPAGVRTTASSGRAHDSFE

Query:  CGDGRRRAMLVCRVIAGRVKRIAEDLSAAAVEEENLS-AAAASYDSVSRQSGVYSNLEELIIFNPKAILPCFVVIYEALQT
        CGDGRRRA+LVCRVIAGRVKRIAEDLSAAAVEEENLS AAAASYDSVSRQSG YSNLEELIIFNPKAILPCFVVIYEALQT
Subjt:  CGDGRRRAMLVCRVIAGRVKRIAEDLSAAAVEEENLS-AAAASYDSVSRQSGVYSNLEELIIFNPKAILPCFVVIYEALQT

XP_022940427.1 uncharacterized protein LOC111446039 [Cucurbita moschata]8.6e-22195.04Show/hide
Query:  MAHSTSLAINAEKSRKTKPINTERRKKHQNPPPSQPP---PPPSAQSSWDQIKNLITCKQVEATRVHEPEKRPPAANSKLGSSYSSICRFRDVVHGNAKI
        MA STSLAINAEKSRK KPINT+RRKKHQ+PPPSQPP   PPPSAQSSWDQIKNLITCKQ+EA+RVHEP KR P ANSKLGSS SSICRFRDVVHGNAKI
Subjt:  MAHSTSLAINAEKSRKTKPINTERRKKHQNPPPSQPP---PPPSAQSSWDQIKNLITCKQVEATRVHEPEKRPPAANSKLGSSYSSICRFRDVVHGNAKI

Query:  VHRPDNSPETSSLGQETRLLTTKATNGSSSRSLTAPTRTKNGASASYSSSSRVLQLRKLSGCYECHAIVDPSRYPIPRSSICPCPQCGEMFSKMESLELH
        VHRPDNSPETSSLGQETRLLTTKATNGSSSRSLTAPTRTKNGASASY SSSR LQLRKLSGCYECH IVDPSRYPIPRSSICPCPQCGEMF K+ESLELH
Subjt:  VHRPDNSPETSSLGQETRLLTTKATNGSSSRSLTAPTRTKNGASASYSSSSRVLQLRKLSGCYECHAIVDPSRYPIPRSSICPCPQCGEMFSKMESLELH

Query:  QAVRHAVSELGPDDSGRNIVEIIFKSSWLKIDRPICKIERILKVHNTQRTIQRFEDCRDAVKTRALASTRKNPRCAADGNELLRFHCSELLCDLGSRGST
        QAVRHAVSELGPDDSGRNIVEIIFKSSWLKIDRPICKIERILKVHNTQRTIQRFEDCRDAVKTRALA  RKNPRCAADGNELLRFHCSELLCDLGSRGST
Subjt:  QAVRHAVSELGPDDSGRNIVEIIFKSSWLKIDRPICKIERILKVHNTQRTIQRFEDCRDAVKTRALASTRKNPRCAADGNELLRFHCSELLCDLGSRGST

Query:  GLCGSIPGCRVCSVIRHGFQCKPGEPAGVRTTASSGRAHDSFECGDGRRRAMLVCRVIAGRVKRIAEDLSAAAVEEENLSAAAASYDSVSRQSGVYSNLE
        GLCGSIPGCRVCSVIRHGFQCKPGEPAGVRTTASSGRAHDSFECGDGRRRAMLVCRVIAGRVKRIAEDLSAAAVEEENLSAAAASYDSVSRQSG YSNLE
Subjt:  GLCGSIPGCRVCSVIRHGFQCKPGEPAGVRTTASSGRAHDSFECGDGRRRAMLVCRVIAGRVKRIAEDLSAAAVEEENLSAAAASYDSVSRQSGVYSNLE

Query:  ELIIFNPKAILPCFVVIYEALQT
        ELIIFNPKAILPCFVVIYEALQT
Subjt:  ELIIFNPKAILPCFVVIYEALQT

XP_022982145.1 uncharacterized protein LOC111481067 [Cucurbita maxima]8.6e-237100Show/hide
Query:  MAHSTSLAINAEKSRKTKPINTERRKKHQNPPPSQPPPPPSAQSSWDQIKNLITCKQVEATRVHEPEKRPPAANSKLGSSYSSICRFRDVVHGNAKIVHR
        MAHSTSLAINAEKSRKTKPINTERRKKHQNPPPSQPPPPPSAQSSWDQIKNLITCKQVEATRVHEPEKRPPAANSKLGSSYSSICRFRDVVHGNAKIVHR
Subjt:  MAHSTSLAINAEKSRKTKPINTERRKKHQNPPPSQPPPPPSAQSSWDQIKNLITCKQVEATRVHEPEKRPPAANSKLGSSYSSICRFRDVVHGNAKIVHR

Query:  PDNSPETSSLGQETRLLTTKATNGSSSRSLTAPTRTKNGASASYSSSSRVLQLRKLSGCYECHAIVDPSRYPIPRSSICPCPQCGEMFSKMESLELHQAV
        PDNSPETSSLGQETRLLTTKATNGSSSRSLTAPTRTKNGASASYSSSSRVLQLRKLSGCYECHAIVDPSRYPIPRSSICPCPQCGEMFSKMESLELHQAV
Subjt:  PDNSPETSSLGQETRLLTTKATNGSSSRSLTAPTRTKNGASASYSSSSRVLQLRKLSGCYECHAIVDPSRYPIPRSSICPCPQCGEMFSKMESLELHQAV

Query:  RHAVSELGPDDSGRNIVEIIFKSSWLKIDRPICKIERILKVHNTQRTIQRFEDCRDAVKTRALASTRKNPRCAADGNELLRFHCSELLCDLGSRGSTGLC
        RHAVSELGPDDSGRNIVEIIFKSSWLKIDRPICKIERILKVHNTQRTIQRFEDCRDAVKTRALASTRKNPRCAADGNELLRFHCSELLCDLGSRGSTGLC
Subjt:  RHAVSELGPDDSGRNIVEIIFKSSWLKIDRPICKIERILKVHNTQRTIQRFEDCRDAVKTRALASTRKNPRCAADGNELLRFHCSELLCDLGSRGSTGLC

Query:  GSIPGCRVCSVIRHGFQCKPGEPAGVRTTASSGRAHDSFECGDGRRRAMLVCRVIAGRVKRIAEDLSAAAVEEENLSAAAASYDSVSRQSGVYSNLEELI
        GSIPGCRVCSVIRHGFQCKPGEPAGVRTTASSGRAHDSFECGDGRRRAMLVCRVIAGRVKRIAEDLSAAAVEEENLSAAAASYDSVSRQSGVYSNLEELI
Subjt:  GSIPGCRVCSVIRHGFQCKPGEPAGVRTTASSGRAHDSFECGDGRRRAMLVCRVIAGRVKRIAEDLSAAAVEEENLSAAAASYDSVSRQSGVYSNLEELI

Query:  IFNPKAILPCFVVIYEALQT
        IFNPKAILPCFVVIYEALQT
Subjt:  IFNPKAILPCFVVIYEALQT

XP_023525363.1 uncharacterized protein LOC111788987 [Cucurbita pepo subsp. pepo]3.3e-22094.59Show/hide
Query:  MAHSTSLAINAEKSRKTKPINTERRKKHQNPPPSQPPPPP--SAQSSWDQIKNLITCKQVEATRVHEPEKRPPAANSKLGSSYSSICRFRDVVHGNAKIV
        MA STSLAINAEKSRK KPINT+RRKKHQNPPPSQPPPPP  SAQSSWDQIKNLITCKQVEATRVHEP KR P AN KL SS SSICRFRDVVHGNAKIV
Subjt:  MAHSTSLAINAEKSRKTKPINTERRKKHQNPPPSQPPPPP--SAQSSWDQIKNLITCKQVEATRVHEPEKRPPAANSKLGSSYSSICRFRDVVHGNAKIV

Query:  HRPDNSPETSSLGQETRLLTTKATNGSSSRSLTAPTRTKNGASASY---SSSSRVLQLRKLSGCYECHAIVDPSRYPIPRSSICPCPQCGEMFSKMESLE
        HRPDNSPETSSLGQETRLLTTKATNGSSSRSL+APTRTKNGASASY   SSSSR LQLRKLSGCYECH I++PSRYPIPRSSICPCPQCGEMF KMESLE
Subjt:  HRPDNSPETSSLGQETRLLTTKATNGSSSRSLTAPTRTKNGASASY---SSSSRVLQLRKLSGCYECHAIVDPSRYPIPRSSICPCPQCGEMFSKMESLE

Query:  LHQAVRHAVSELGPDDSGRNIVEIIFKSSWLKIDRPICKIERILKVHNTQRTIQRFEDCRDAVKTRALASTRKNPRCAADGNELLRFHCSELLCDLGSRG
        LHQAVRHAVSELGPDDSGRNIVEIIFKSSWLKIDRPICKIERILKVHNTQRTIQRFEDCRDAVKTRALA  RKNPRCAADGNELLRFHCSELLCDLGSRG
Subjt:  LHQAVRHAVSELGPDDSGRNIVEIIFKSSWLKIDRPICKIERILKVHNTQRTIQRFEDCRDAVKTRALASTRKNPRCAADGNELLRFHCSELLCDLGSRG

Query:  STGLCGSIPGCRVCSVIRHGFQCKPGEPAGVRTTASSGRAHDSFECGDGRRRAMLVCRVIAGRVKRIAEDLSAAAVEEENLSAAAASYDSVSRQSGVYSN
        STGLCGSIPGCRVCSVIRHGFQCKPGEPAGVRTTASSGRAHDSFECGDGRRRAMLVCRVIAGRVKRIAEDLSAAAVEEENLSAAAASYDSVSRQSG YSN
Subjt:  STGLCGSIPGCRVCSVIRHGFQCKPGEPAGVRTTASSGRAHDSFECGDGRRRAMLVCRVIAGRVKRIAEDLSAAAVEEENLSAAAASYDSVSRQSGVYSN

Query:  LEELIIFNPKAILPCFVVIYEALQT
        LEELIIFNPKAILPCFVVIYEALQT
Subjt:  LEELIIFNPKAILPCFVVIYEALQT

TrEMBL top hitse value%identityAlignment
A0A0A0LGH5 C2H2-type domain-containing protein2.2e-18280.65Show/hide
Query:  SLAINAEKSRKTKPINTERRKKHQNPPPSQPPPPPSAQSSWDQIKNLITCKQVEATRVHEPEKRPPAANSKLGSSYSSICRFRDVVHGNAKIVHRPDNSP
        +L  N+ +  K KP++ +RRKKH NP     PPP SAQSSWD IKNLITCKQVE +RV EP KR P A SKLGSS SSIC FRDVVHGNAK+VHR DNSP
Subjt:  SLAINAEKSRKTKPINTERRKKHQNPPPSQPPPPPSAQSSWDQIKNLITCKQVEATRVHEPEKRPPAANSKLGSSYSSICRFRDVVHGNAKIVHRPDNSP

Query:  ETSSLGQETRLLTTKATNGSSSRSLTAPT--RTKNG--ASASY-SSSSRVLQLRKLSGCYECHAIVDPSRYPIPRSSICPCPQCGEMFSKMESLELHQAV
        E+SS+GQETRLLT K+ NGSSSRSLTAPT  RTKNG   SASY SSSSR +QLRKLSGCYECH IVDPSR PIPRSSICPCPQCGE+F K+ESLELHQ V
Subjt:  ETSSLGQETRLLTTKATNGSSSRSLTAPT--RTKNG--ASASY-SSSSRVLQLRKLSGCYECHAIVDPSRYPIPRSSICPCPQCGEMFSKMESLELHQAV

Query:  RHAVSELGPDDSGRNIVEIIFKSSWLKIDRPICKIERILKVHNTQRTIQRFEDCRDAVKTRALASTRKNPRCAADGNELLRFHCSELLCDLGSRGSTGLC
        RHAVSELGP+DSGRNIVEIIFKSSWLK DRPICKIERILKVHNTQRTIQRFEDCRDAVKTRAL STRKNPRCAADGNELLRFHCS L CDLGSRGSTGLC
Subjt:  RHAVSELGPDDSGRNIVEIIFKSSWLKIDRPICKIERILKVHNTQRTIQRFEDCRDAVKTRALASTRKNPRCAADGNELLRFHCSELLCDLGSRGSTGLC

Query:  GSIPGCRVCSVIRHGFQCKPGEPAGVRTTASSGRAHDSFECGDGRRRAMLVCRVIAGRVKRIAEDLSA------AAVEEENL---SAAAASYDSVSRQSG
        GSIP C VC+VIRHGFQ KPG P GVRTTASSGRAHDSF+CGDGRRRAMLVCRVIAGRVKRI++D +A       A EEEN+   +AAAASYDSVSR SG
Subjt:  GSIPGCRVCSVIRHGFQCKPGEPAGVRTTASSGRAHDSFECGDGRRRAMLVCRVIAGRVKRIAEDLSA------AAVEEENL---SAAAASYDSVSRQSG

Query:  VYSNLEELIIFNPKAILPCFVVIYEALQT
        +YSNLEEL+IFNPKAILPCFVVIYEALQT
Subjt:  VYSNLEELIIFNPKAILPCFVVIYEALQT

A0A6J1F8C7 uncharacterized protein LOC1114430319.4e-18182.02Show/hide
Query:  NTERRKKHQNPPPSQPPPPPSAQSSWDQIKNLITCKQVEATRVHEPEKRPPAANSKLGSSYSSICRFRDVVHGNAKIVHRPDNSPETSSLGQETRLLTTK
        + +RRK+H+NP P  PPPPPSAQSSWDQIK+L+TCKQ+E +RVHEP KR P A SKLGSS SSIC FRDVVHGNAK+VHR DNSPE+SS+GQETRLL  K
Subjt:  NTERRKKHQNPPPSQPPPPPSAQSSWDQIKNLITCKQVEATRVHEPEKRPPAANSKLGSSYSSICRFRDVVHGNAKIVHRPDNSPETSSLGQETRLLTTK

Query:  ATNGSSSRSLTAPT--RTKN----GASASYSSSSRVLQLRKLSGCYECHAIVDPSRYPIPRSSICPCPQCGEMFSKMESLELHQAVRHAVSELGPDDSGR
        A NGSSSRSLTAPT  RTK+     ASASYSSSSR +QLRKLSGCYECH IVDP+RYPIPRSSI PCP CGE+F K E+LELHQ VRHAVSELGPDDSGR
Subjt:  ATNGSSSRSLTAPT--RTKN----GASASYSSSSRVLQLRKLSGCYECHAIVDPSRYPIPRSSICPCPQCGEMFSKMESLELHQAVRHAVSELGPDDSGR

Query:  NIVEIIFKSSWLKIDRPICKIERILKVHNTQRTIQRFEDCRDAVKTRALASTRKNPRCAADGNELLRFHCSELLCDLGSRGSTGLCGSIPGCRVCSVIRH
        NIVEIIFKSSWLK DRPICKI+RILKVHNTQRTIQRFEDCRDAVKTRAL ST+KNPRCAADGNELLRFHCS LLCDLGSRGSTGLCGSIPGC VC+VIRH
Subjt:  NIVEIIFKSSWLKIDRPICKIERILKVHNTQRTIQRFEDCRDAVKTRALASTRKNPRCAADGNELLRFHCSELLCDLGSRGSTGLCGSIPGCRVCSVIRH

Query:  GFQCKPGEPAGVRTTASSGRAHDSFECGDGRRRAMLVCRVIAGRVKRIAEDLSAAAVEEENLSAAAASYDSVSRQSGVYSNLEELIIFNPKAILPCFVVI
        GFQCKPG   GV+TTASSGRAHDSF C DGRRRAMLVCRVIAGRVKR+AED +     EE   +AA SYDSVSR SG+YSNLEELIIFNPKAILPCFVVI
Subjt:  GFQCKPGEPAGVRTTASSGRAHDSFECGDGRRRAMLVCRVIAGRVKRIAEDLSAAAVEEENLSAAAASYDSVSRQSGVYSNLEELIIFNPKAILPCFVVI

Query:  YEALQT
        YEALQT
Subjt:  YEALQT

A0A6J1FJK3 uncharacterized protein LOC1114460394.2e-22195.04Show/hide
Query:  MAHSTSLAINAEKSRKTKPINTERRKKHQNPPPSQPP---PPPSAQSSWDQIKNLITCKQVEATRVHEPEKRPPAANSKLGSSYSSICRFRDVVHGNAKI
        MA STSLAINAEKSRK KPINT+RRKKHQ+PPPSQPP   PPPSAQSSWDQIKNLITCKQ+EA+RVHEP KR P ANSKLGSS SSICRFRDVVHGNAKI
Subjt:  MAHSTSLAINAEKSRKTKPINTERRKKHQNPPPSQPP---PPPSAQSSWDQIKNLITCKQVEATRVHEPEKRPPAANSKLGSSYSSICRFRDVVHGNAKI

Query:  VHRPDNSPETSSLGQETRLLTTKATNGSSSRSLTAPTRTKNGASASYSSSSRVLQLRKLSGCYECHAIVDPSRYPIPRSSICPCPQCGEMFSKMESLELH
        VHRPDNSPETSSLGQETRLLTTKATNGSSSRSLTAPTRTKNGASASY SSSR LQLRKLSGCYECH IVDPSRYPIPRSSICPCPQCGEMF K+ESLELH
Subjt:  VHRPDNSPETSSLGQETRLLTTKATNGSSSRSLTAPTRTKNGASASYSSSSRVLQLRKLSGCYECHAIVDPSRYPIPRSSICPCPQCGEMFSKMESLELH

Query:  QAVRHAVSELGPDDSGRNIVEIIFKSSWLKIDRPICKIERILKVHNTQRTIQRFEDCRDAVKTRALASTRKNPRCAADGNELLRFHCSELLCDLGSRGST
        QAVRHAVSELGPDDSGRNIVEIIFKSSWLKIDRPICKIERILKVHNTQRTIQRFEDCRDAVKTRALA  RKNPRCAADGNELLRFHCSELLCDLGSRGST
Subjt:  QAVRHAVSELGPDDSGRNIVEIIFKSSWLKIDRPICKIERILKVHNTQRTIQRFEDCRDAVKTRALASTRKNPRCAADGNELLRFHCSELLCDLGSRGST

Query:  GLCGSIPGCRVCSVIRHGFQCKPGEPAGVRTTASSGRAHDSFECGDGRRRAMLVCRVIAGRVKRIAEDLSAAAVEEENLSAAAASYDSVSRQSGVYSNLE
        GLCGSIPGCRVCSVIRHGFQCKPGEPAGVRTTASSGRAHDSFECGDGRRRAMLVCRVIAGRVKRIAEDLSAAAVEEENLSAAAASYDSVSRQSG YSNLE
Subjt:  GLCGSIPGCRVCSVIRHGFQCKPGEPAGVRTTASSGRAHDSFECGDGRRRAMLVCRVIAGRVKRIAEDLSAAAVEEENLSAAAASYDSVSRQSGVYSNLE

Query:  ELIIFNPKAILPCFVVIYEALQT
        ELIIFNPKAILPCFVVIYEALQT
Subjt:  ELIIFNPKAILPCFVVIYEALQT

A0A6J1IJ58 uncharacterized protein LOC1114767632.7e-18081.62Show/hide
Query:  NTERRKKHQNPPPSQPPPPPSAQSSWDQIKNLITCKQVEATRVHEPEKRPPAANSKLGSSYSSICRFRDVVHGNAKIVHRPDNSPETSSLGQETRLLTTK
        + +RRKKH+NP     PPPPSAQSSWDQIK+L+TCKQ+E +RVHEP KR P A SKLGSS SSIC FRDVVHGNAK+VHR DNSPE+SS+GQETRLL  K
Subjt:  NTERRKKHQNPPPSQPPPPPSAQSSWDQIKNLITCKQVEATRVHEPEKRPPAANSKLGSSYSSICRFRDVVHGNAKIVHRPDNSPETSSLGQETRLLTTK

Query:  ATNGSSSRSLTAPT--RTKN------GASASYSSSSRVLQLRKLSGCYECHAIVDPSRYPIPRSSICPCPQCGEMFSKMESLELHQAVRHAVSELGPDDS
        A NGSSSRSLTAPT  R+K+       ASASYSSSSR +QLRKLSGCYECH IVDP+RYPIPRSSICPCP CGE+F K ESLELHQ VRHAVSELGPDDS
Subjt:  ATNGSSSRSLTAPT--RTKN------GASASYSSSSRVLQLRKLSGCYECHAIVDPSRYPIPRSSICPCPQCGEMFSKMESLELHQAVRHAVSELGPDDS

Query:  GRNIVEIIFKSSWLKIDRPICKIERILKVHNTQRTIQRFEDCRDAVKTRALASTRKNPRCAADGNELLRFHCSELLCDLGSRGSTGLCGSIPGCRVCSVI
        GRNIVEIIFKSSWLK DRPICKI+RILKVHNTQRTIQRFEDCRDAVKTRAL ST+KNPRCAADGNELLRFHCS LLCDLGSRGSTGLCGSIP C VCSVI
Subjt:  GRNIVEIIFKSSWLKIDRPICKIERILKVHNTQRTIQRFEDCRDAVKTRALASTRKNPRCAADGNELLRFHCSELLCDLGSRGSTGLCGSIPGCRVCSVI

Query:  RHGFQCKPGEPAGVRTTASSGRAHDSFECGDGRRRAMLVCRVIAGRVKRIAEDLSAAAVEEENLSAAAASYDSVSRQSGVYSNLEELIIFNPKAILPCFV
        RHGFQCKPG   GV+TTASSGRAHDSF C DGRRRAMLVCRVIAGRVKR+AED +     EE   +AA SYDSVSR SG+YSNLEELIIFNPKAILPCFV
Subjt:  RHGFQCKPGEPAGVRTTASSGRAHDSFECGDGRRRAMLVCRVIAGRVKRIAEDLSAAAVEEENLSAAAASYDSVSRQSGVYSNLEELIIFNPKAILPCFV

Query:  VIYEALQT
        VIYEALQT
Subjt:  VIYEALQT

A0A6J1J1T8 uncharacterized protein LOC1114810674.1e-237100Show/hide
Query:  MAHSTSLAINAEKSRKTKPINTERRKKHQNPPPSQPPPPPSAQSSWDQIKNLITCKQVEATRVHEPEKRPPAANSKLGSSYSSICRFRDVVHGNAKIVHR
        MAHSTSLAINAEKSRKTKPINTERRKKHQNPPPSQPPPPPSAQSSWDQIKNLITCKQVEATRVHEPEKRPPAANSKLGSSYSSICRFRDVVHGNAKIVHR
Subjt:  MAHSTSLAINAEKSRKTKPINTERRKKHQNPPPSQPPPPPSAQSSWDQIKNLITCKQVEATRVHEPEKRPPAANSKLGSSYSSICRFRDVVHGNAKIVHR

Query:  PDNSPETSSLGQETRLLTTKATNGSSSRSLTAPTRTKNGASASYSSSSRVLQLRKLSGCYECHAIVDPSRYPIPRSSICPCPQCGEMFSKMESLELHQAV
        PDNSPETSSLGQETRLLTTKATNGSSSRSLTAPTRTKNGASASYSSSSRVLQLRKLSGCYECHAIVDPSRYPIPRSSICPCPQCGEMFSKMESLELHQAV
Subjt:  PDNSPETSSLGQETRLLTTKATNGSSSRSLTAPTRTKNGASASYSSSSRVLQLRKLSGCYECHAIVDPSRYPIPRSSICPCPQCGEMFSKMESLELHQAV

Query:  RHAVSELGPDDSGRNIVEIIFKSSWLKIDRPICKIERILKVHNTQRTIQRFEDCRDAVKTRALASTRKNPRCAADGNELLRFHCSELLCDLGSRGSTGLC
        RHAVSELGPDDSGRNIVEIIFKSSWLKIDRPICKIERILKVHNTQRTIQRFEDCRDAVKTRALASTRKNPRCAADGNELLRFHCSELLCDLGSRGSTGLC
Subjt:  RHAVSELGPDDSGRNIVEIIFKSSWLKIDRPICKIERILKVHNTQRTIQRFEDCRDAVKTRALASTRKNPRCAADGNELLRFHCSELLCDLGSRGSTGLC

Query:  GSIPGCRVCSVIRHGFQCKPGEPAGVRTTASSGRAHDSFECGDGRRRAMLVCRVIAGRVKRIAEDLSAAAVEEENLSAAAASYDSVSRQSGVYSNLEELI
        GSIPGCRVCSVIRHGFQCKPGEPAGVRTTASSGRAHDSFECGDGRRRAMLVCRVIAGRVKRIAEDLSAAAVEEENLSAAAASYDSVSRQSGVYSNLEELI
Subjt:  GSIPGCRVCSVIRHGFQCKPGEPAGVRTTASSGRAHDSFECGDGRRRAMLVCRVIAGRVKRIAEDLSAAAVEEENLSAAAASYDSVSRQSGVYSNLEELI

Query:  IFNPKAILPCFVVIYEALQT
        IFNPKAILPCFVVIYEALQT
Subjt:  IFNPKAILPCFVVIYEALQT

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G11490.1 zinc finger (C2H2 type) family protein1.7e-3332.03Show/hide
Query:  KNLITCKQVEATRVHEPEK-RPPAANSKLGSSYSSICRFRDVVHGNAKIVHRPDNSPETSSLGQETRLLTTKATNGSSSRSLTAPTRTKNGASASYSSSS
        K+L  CK  ++T   +P+K +     +  G S  S+   RDV   N                G E  +      +  S  S       K   +A YS   
Subjt:  KNLITCKQVEATRVHEPEK-RPPAANSKLGSSYSSICRFRDVVHGNAKIVHRPDNSPETSSLGQETRLLTTKATNGSSSRSLTAPTRTKNGASASYSSSS

Query:  RVLQLRKLSGCYECHAIVD--PSRYPIPRSSIC-----PCPQCGEMFSKMESLELHQAVRHAVSELGPDDSGRNIVEIIFKSSWL-KIDR-PICKIERIL
        + L    LSG      +      R+ +  S IC      C +C E    +++ E H    H+V  L   D  R  VE+I  + +  K+ +     I  I 
Subjt:  RVLQLRKLSGCYECHAIVD--PSRYPIPRSSIC-----PCPQCGEMFSKMESLELHQAVRHAVSELGPDDSGRNIVEIIFKSSWL-KIDR-PICKIERIL

Query:  KVHNTQRTIQRFEDCRDAVKTRALASTRKNPRCAADGNELLRFHCSELLCDLG-SRGSTGLCGSIPGCRVCSVIRHGF--QCKPGEPAGVRTTASSGRAH
        K+ N QR +  FED R+ VK RA   ++K+ RC ADGNE L FH + L C LG S  S+ LC S   C VC ++RHGF  + +P    GV T ++S  A 
Subjt:  KVHNTQRTIQRFEDCRDAVKTRALASTRKNPRCAADGNELLRFHCSELLCDLG-SRGSTGLCGSIPGCRVCSVIRHGF--QCKPGEPAGVRTTASSGRAH

Query:  DSFECGDGRRR----AMLVCRVIAGRVKRIAEDLSAAAVEEENLSAAAASYDSVSRQSGVYSNLEELIIFNPKAILPCFVVIYE
        +S E   GR R    A+++CRVIAGRV +  +         EN S   + +DS++ + G  S +EEL + + KA+LPCFV+I++
Subjt:  DSFECGDGRRR----AMLVCRVIAGRVKRIAEDLSAAAVEEENLSAAAASYDSVSRQSGVYSNLEELIIFNPKAILPCFVVIYE

AT1G75710.1 C2H2-like zinc finger protein1.1e-13860.43Show/hide
Query:  MAHSTSLAINAEKSRKTKPINTERRKKH--------QNPPPSQPPP--PPSAQSSWDQIKNLITCKQVEATRVHEPEKRPPA--------ANSKLGSSYS
        MA  T L  NAE  +K KP +++R+K+         Q   P +P    PP   SSWDQIKNL+TCKQ+E +RVH+P K   +        + SKLGSS S
Subjt:  MAHSTSLAINAEKSRKTKPINTERRKKH--------QNPPPSQPPP--PPSAQSSWDQIKNLITCKQVEATRVHEPEKRPPA--------ANSKLGSSYS

Query:  SICRFRDVVHGNAKIVHRPDNSPE---TSSLGQETRLLTTK--ATNGSSSRSLTAPTRTKNGASASYSSSS----RVLQLRKLSGCYECHAIVDPSRYPI
        SIC FRDV HGN ++VHR D+SP+   +++   ETRLLT K      SSSRSLT+ + T++ AS SY+SSS    R +Q RKLSGCYECH IVDPSRYPI
Subjt:  SICRFRDVVHGNAKIVHRPDNSPE---TSSLGQETRLLTTK--ATNGSSSRSLTAPTRTKNGASASYSSSS----RVLQLRKLSGCYECHAIVDPSRYPI

Query:  -PRSSICPCPQCGEMFSKMESLELHQAVRHAVSELGPDDSGRNIVEIIFKSSWLKIDRPICKIERILKVHNTQRTIQRFEDCRDAVKTRALASTRKNPRC
         PR  +C C QCGE+F K+ESLELHQAVRHAVSELGP+DSGRNIVEIIFKSSWLK D PIC+IERILKVHNTQRTIQRFEDCRDAVK RAL +TRK+ RC
Subjt:  -PRSSICPCPQCGEMFSKMESLELHQAVRHAVSELGPDDSGRNIVEIIFKSSWLKIDRPICKIERILKVHNTQRTIQRFEDCRDAVKTRALASTRKNPRC

Query:  AADGNELLRFHCSELLCDLGSRGSTGLCGSIPGCRVCSVIRHGFQCKPG------EPAGVRTTASSGRAHDSFECGDGRRRAMLVCRVIAGRVKRI---A
        AADGNELLRFHC+ L C LG+RGS+ LC ++P C VC+VIRHGFQ K G        AGVRTTASSGRA D   C D  RR MLVCRVIAGRVKR+   A
Subjt:  AADGNELLRFHCSELLCDLGSRGSTGLCGSIPGCRVCSVIRHGFQCKPG------EPAGVRTTASSGRAHDSFECGDGRRRAMLVCRVIAGRVKRI---A

Query:  EDLSAAA-----VEEEN---LSAAAASYDSVSRQSGVYSNLEELIIFNPKAILPCFVVIYEALQT
         D SA A     VE+ +   +S++  ++DSV+  +GVYSNLEEL+++NP+AILPCFVVIY+ L++
Subjt:  EDLSAAA-----VEEEN---LSAAAASYDSVSRQSGVYSNLEELIIFNPKAILPCFVVIYEALQT

AT2G29660.1 zinc finger (C2H2 type) family protein6.9e-4338.8Show/hide
Query:  TNGSSSRSLTAPTRTKNGASASYSSSSRVLQLRKLSGCYECHAIVDPSRYPIPRS-SICPCPQCGEMFSKMESLELHQAVRHAVSELGPDDSGRNIVEII
        T  S SR+ T P  T   A +S +S   ++Q    +       I   + + I  S  I PC  CGE+F K+  LE H A++HAVSEL   +S  NIV+II
Subjt:  TNGSSSRSLTAPTRTKNGASASYSSSSRVLQLRKLSGCYECHAIVDPSRYPIPRS-SICPCPQCGEMFSKMESLELHQAVRHAVSELGPDDSGRNIVEII

Query:  FKSSWLK---IDRPICKIERILKVHNTQRTIQRFEDCRDAVKTRALASTR-----KNPRCAADGNELLRFHCSELLCDLGSRGSTGLCGSIPGCRVCSVI
        FKS W +      P+  I RILK+HN+ + + RFE+ R+ VK +A  S        + RC ADGNELLRF+CS  +CDLG  G + LCG    C +C +I
Subjt:  FKSSWLK---IDRPICKIERILKVHNTQRTIQRFEDCRDAVKTRALASTR-----KNPRCAADGNELLRFHCSELLCDLGSRGSTGLCGSIPGCRVCSVI

Query:  RHGFQCKPGEPAGVRTTASSGRAHDSF------ECG-DGRRRAMLVCRVIAGRVKRIAEDLSAAAVEEENLSAA-AASYDSVSRQSGVYSNL------EE
          GF  K     G+ T A+  R H +       E G    +RAMLVCRV+AGRV           ++++++  +    YDS+  QSG  S        +E
Subjt:  RHGFQCKPGEPAGVRTTASSGRAHDSF------ECG-DGRRRAMLVCRVIAGRVKRIAEDLSAAAVEEENLSAA-AASYDSVSRQSGVYSNL------EE

Query:  LIIFNPKAILPCFVVIY
        L++FNP+A+LPCFV++Y
Subjt:  LIIFNPKAILPCFVVIY

AT4G27240.1 zinc finger (C2H2 type) family protein1.4e-5937.1Show/hide
Query:  AEKSRKTKPI---NTERRKKHQNPPPSQPPPPPSAQSSWDQIKNLITCKQVEATRVHEPEKR---PPAANSKLGSSYS-----------SICRFRDVVHG
        +EK +K K I   NT+ +KK + P            S W  +K  + CK  + + VH P  +    P +  +  +S             SI   +DV+HG
Subjt:  AEKSRKTKPI---NTERRKKHQNPPPSQPPPPPSAQSSWDQIKNLITCKQVEATRVHEPEKR---PPAANSKLGSSYS-----------SICRFRDVVHG

Query:  NAKIVHRP-DNSPETSSLGQETRLLTTKA--TNGSSSRSLTAPTRT------KNGASASYSSSSRVLQLRKLS-------GCYECHAIVDPSRYPIPRSS
        N + + +P  +SP +    +    +T     +N +    +TA   T      + G   +YSSS R    RK S       G ++     D        +S
Subjt:  NAKIVHRP-DNSPETSSLGQETRLLTTKA--TNGSSSRSLTAPTRT------KNGASASYSSSSRVLQLRKLS-------GCYECHAIVDPSRYPIPRSS

Query:  ICPCPQCGEMFSKMESLELHQAVRHAVSELGPDDSGRNIVEIIFKSSWLKIDRPICKIERILKVHNTQRTIQRFEDCRDAVKTRALASTRKNPRCAADGN
           C +CGE FSK+E+ E H   +HAV+EL   DS R IVEII ++SWLK +    +I+RILKVHN Q+T+ RFE+ RD VK RA    +K+PRC ADGN
Subjt:  ICPCPQCGEMFSKMESLELHQAVRHAVSELGPDDSGRNIVEIIFKSSWLKIDRPICKIERILKVHNTQRTIQRFEDCRDAVKTRALASTRKNPRCAADGN

Query:  ELLRFHCSELLCDLGSRGSTGLCGSIPGCRVCSVIRHGFQCK--PGEPAGVRTTASSGRAHDSFECGD---GRRRAMLVCRVIAGRVKRIAEDLSAAAVE
        ELLRFH + + C LG  GST LC S   C VC +IR+GF  K       GV T ++S RA +S   GD   G R+A++VCRVIAGRV R  E++      
Subjt:  ELLRFHCSELLCDLGSRGSTGLCGSIPGCRVCSVIRHGFQCK--PGEPAGVRTTASSGRAHDSFECGD---GRRRAMLVCRVIAGRVKRIAEDLSAAAVE

Query:  EENLSAAAASYDSVSRQSGVYSNLEELIIFNPKAILPCFVVI
         E +    + +DS++ + G+Y+N+EEL + N +A+LPCFV+I
Subjt:  EENLSAAAASYDSVSRQSGVYSNLEELIIFNPKAILPCFVVI

AT5G54630.1 zinc finger protein-related1.8e-5446.4Show/hide
Query:  SSICPCPQCGEMFSKMESLELHQAVRHAVSELGPDDSGRNIVEIIFKSSWLKIDRPICKIERILKVHNTQRTIQRFEDCRDAVKTRALASTRKNPRCAAD
        +S   C +CGE F+K+E+ E H   +HAV+EL   DS R IVEII ++SWLK +    +I+R+LKVHN Q+T+ RFE+ R+ VK RA    +K+PRC AD
Subjt:  SSICPCPQCGEMFSKMESLELHQAVRHAVSELGPDDSGRNIVEIIFKSSWLKIDRPICKIERILKVHNTQRTIQRFEDCRDAVKTRALASTRKNPRCAAD

Query:  GNELLRFHCSELLCDLGSRGSTGLCGSIPGCRVCSVIRHGFQCK--PGEPAGVRTTASSGRAHDSF------ECGD---GRRRAMLVCRVIAGRVKRIAE
        GNELLRFH + + C LG  GST +C +   C VC +IR+GF  K       GV T ++SGRA +S       E GD     R+ ++VCRVIAGRV R  E
Subjt:  GNELLRFHCSELLCDLGSRGSTGLCGSIPGCRVCSVIRHGFQCK--PGEPAGVRTTASSGRAHDSF------ECGD---GRRRAMLVCRVIAGRVKRIAE

Query:  DLSAAAVEEENLSAAAASYDSVSRQSGVYSNLEELIIFNPKAILPCFVVI
        ++       E ++   + +DS++ + G+Y+N+EEL + NPKA+LPCFVVI
Subjt:  DLSAAAVEEENLSAAAASYDSVSRQSGVYSNLEELIIFNPKAILPCFVVI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCACACTCAACTTCCTTAGCAATAAACGCAGAGAAAAGCAGAAAAACGAAACCCATCAACACCGAACGCAGAAAAAAGCACCAAAACCCACCTCCCTCCCAACCGCC
GCCGCCGCCCTCTGCCCAATCTTCATGGGACCAAATCAAGAACCTCATCACTTGCAAGCAGGTGGAGGCTACGAGAGTTCACGAGCCGGAAAAACGCCCGCCGGCGGCGA
ATTCGAAGTTGGGTTCTTCTTACAGCTCCATTTGTAGGTTTAGAGACGTGGTCCATGGCAACGCCAAAATTGTTCACAGACCCGACAACTCACCGGAAACCAGCTCCCTC
GGACAGGAAACTAGGTTACTCACTACAAAAGCTACAAACGGTTCATCGTCTCGTTCTTTGACAGCTCCGACCAGAACCAAAAACGGTGCTTCTGCTTCATACTCCTCCTC
TTCTAGAGTATTACAATTACGAAAGCTTTCTGGGTGTTACGAATGCCACGCCATCGTTGACCCTAGCAGGTACCCGATTCCGAGGAGTTCTATTTGCCCCTGTCCTCAAT
GTGGAGAGATGTTCTCCAAGATGGAAAGCTTAGAGCTTCACCAAGCGGTTCGCCACGCTGTTTCGGAGTTGGGTCCTGACGATTCGGGTCGAAACATCGTGGAGATCATT
TTCAAATCAAGCTGGCTAAAAATAGACCGTCCAATTTGCAAGATCGAACGGATATTAAAGGTCCACAATACCCAACGCACCATCCAACGGTTTGAAGACTGCCGCGATGC
AGTGAAGACACGTGCGCTCGCAAGCACTAGAAAAAACCCGCGGTGTGCGGCTGACGGTAATGAGCTGTTGCGGTTCCATTGTAGCGAGTTGTTGTGTGACCTCGGCTCAC
GTGGCTCAACCGGCCTGTGTGGCTCGATTCCCGGCTGCCGTGTCTGCAGCGTTATCCGTCATGGATTCCAGTGCAAGCCCGGTGAACCCGCTGGCGTACGAACCACGGCC
AGTAGCGGTAGGGCCCATGATTCGTTCGAATGCGGCGATGGACGGCGGCGGGCGATGTTGGTGTGTCGTGTCATCGCTGGAAGAGTGAAGCGGATCGCGGAGGATTTGTC
GGCCGCAGCGGTGGAGGAGGAGAATTTGTCGGCGGCAGCGGCCTCTTACGATTCCGTTTCACGACAGTCAGGGGTGTATTCGAATCTCGAGGAGTTGATCATTTTCAATC
CAAAGGCTATCCTTCCTTGTTTCGTTGTGATCTACGAAGCCCTCCAAACCTAA
mRNA sequenceShow/hide mRNA sequence
CTCCATTTCATTTCACACCTTCTATAACTCTCTCTCTCTCTCTCTCTCTCTCCGAGAAAAAAGGGAAACTTTCATTATTTTCATGGCACACTCAACTTCCTTAGCAATAA
ACGCAGAGAAAAGCAGAAAAACGAAACCCATCAACACCGAACGCAGAAAAAAGCACCAAAACCCACCTCCCTCCCAACCGCCGCCGCCGCCCTCTGCCCAATCTTCATGG
GACCAAATCAAGAACCTCATCACTTGCAAGCAGGTGGAGGCTACGAGAGTTCACGAGCCGGAAAAACGCCCGCCGGCGGCGAATTCGAAGTTGGGTTCTTCTTACAGCTC
CATTTGTAGGTTTAGAGACGTGGTCCATGGCAACGCCAAAATTGTTCACAGACCCGACAACTCACCGGAAACCAGCTCCCTCGGACAGGAAACTAGGTTACTCACTACAA
AAGCTACAAACGGTTCATCGTCTCGTTCTTTGACAGCTCCGACCAGAACCAAAAACGGTGCTTCTGCTTCATACTCCTCCTCTTCTAGAGTATTACAATTACGAAAGCTT
TCTGGGTGTTACGAATGCCACGCCATCGTTGACCCTAGCAGGTACCCGATTCCGAGGAGTTCTATTTGCCCCTGTCCTCAATGTGGAGAGATGTTCTCCAAGATGGAAAG
CTTAGAGCTTCACCAAGCGGTTCGCCACGCTGTTTCGGAGTTGGGTCCTGACGATTCGGGTCGAAACATCGTGGAGATCATTTTCAAATCAAGCTGGCTAAAAATAGACC
GTCCAATTTGCAAGATCGAACGGATATTAAAGGTCCACAATACCCAACGCACCATCCAACGGTTTGAAGACTGCCGCGATGCAGTGAAGACACGTGCGCTCGCAAGCACT
AGAAAAAACCCGCGGTGTGCGGCTGACGGTAATGAGCTGTTGCGGTTCCATTGTAGCGAGTTGTTGTGTGACCTCGGCTCACGTGGCTCAACCGGCCTGTGTGGCTCGAT
TCCCGGCTGCCGTGTCTGCAGCGTTATCCGTCATGGATTCCAGTGCAAGCCCGGTGAACCCGCTGGCGTACGAACCACGGCCAGTAGCGGTAGGGCCCATGATTCGTTCG
AATGCGGCGATGGACGGCGGCGGGCGATGTTGGTGTGTCGTGTCATCGCTGGAAGAGTGAAGCGGATCGCGGAGGATTTGTCGGCCGCAGCGGTGGAGGAGGAGAATTTG
TCGGCGGCAGCGGCCTCTTACGATTCCGTTTCACGACAGTCAGGGGTGTATTCGAATCTCGAGGAGTTGATCATTTTCAATCCAAAGGCTATCCTTCCTTGTTTCGTTGT
GATCTACGAAGCCCTCCAAACCTAATTAACGACGTCGTTTAATTAGTTCGTTTTCTTTTTGTTAGTTTTATTACGTCAGACAGTTTAGGATATCTATATGGTTTAGTTAG
CGTTTTTTTTTAGTCTTTAAATTTAAAATAATTTTTGTTGGTGGATATTAGATGTGGGTTACAGTAAGCTTATGTAGTATGATATAATTTTAGCTGCTTCACTAATATGG
AATCAATCCTTTCATGTTTGACTCTAA
Protein sequenceShow/hide protein sequence
MAHSTSLAINAEKSRKTKPINTERRKKHQNPPPSQPPPPPSAQSSWDQIKNLITCKQVEATRVHEPEKRPPAANSKLGSSYSSICRFRDVVHGNAKIVHRPDNSPETSSL
GQETRLLTTKATNGSSSRSLTAPTRTKNGASASYSSSSRVLQLRKLSGCYECHAIVDPSRYPIPRSSICPCPQCGEMFSKMESLELHQAVRHAVSELGPDDSGRNIVEII
FKSSWLKIDRPICKIERILKVHNTQRTIQRFEDCRDAVKTRALASTRKNPRCAADGNELLRFHCSELLCDLGSRGSTGLCGSIPGCRVCSVIRHGFQCKPGEPAGVRTTA
SSGRAHDSFECGDGRRRAMLVCRVIAGRVKRIAEDLSAAAVEEENLSAAAASYDSVSRQSGVYSNLEELIIFNPKAILPCFVVIYEALQT