; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg007051 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg007051
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionC2H2-type domain-containing protein
Genome locationscaffold9:47104776..47107586
RNA-Seq ExpressionSpg007051
SyntenySpg007051
Gene Ontology termsGO:0000976 - transcription regulatory region sequence-specific DNA binding (molecular function)
InterPro domainsIPR013087 - Zinc finger C2H2-type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004136493.2 uncharacterized protein LOC101222539 [Cucumis sativus]2.3e-20286.48Show/hide
Query:  MALLTSLAEPSEQKKPLNNKRRKKHQKPTPSPPPPSSAQSSWDQIKNLITCKQVEVSRVHEPAKRSPAYSKLGSSCSSICSFRDVVHGNAKVVHRADNSP
        MALLT+  +  + KKPL+NKRRKKH  PT   PPPSSAQSSWD IKNLITCKQVEVSRV EP KRSPAYSKLGSSCSSICSFRDVVHGNAKVVHRADNSP
Subjt:  MALLTSLAEPSEQKKPLNNKRRKKHQKPTPSPPPPSSAQSSWDQIKNLITCKQVEVSRVHEPAKRSPAYSKLGSSCSSICSFRDVVHGNAKVVHRADNSP

Query:  ESSSVGQETRLLSRKAANGSSSRSLTAPAPTRTKNGASGS---YSSSSRGIQLRKLSGCYECHTIVDPSRYPVPRSSICPCPQCGEVFPKIESLELHQAV
        ESSSVGQETRLL+RK+ANGSSSRSLTAP P RTKNG SGS    SSSSRGIQLRKLSGCYECHTIVDPSR P+PRSSICPCPQCGEVFPKIESLELHQ V
Subjt:  ESSSVGQETRLLSRKAANGSSSRSLTAPAPTRTKNGASGS---YSSSSRGIQLRKLSGCYECHTIVDPSRYPVPRSSICPCPQCGEVFPKIESLELHQAV

Query:  RHAVSELGPDDSGRNIVEIIFKSSWLKKDRPICKIQRILKVHNTQRTIQRFEDCRDAVKTRALASTRKNPRCAADGNELLRFHCSALLCDLGSRGSTGLC
        RHAVSELGP+DSGRNIVEIIFKSSWLKKDRPICKI+RILKVHNTQRTIQRFEDCRDAVKTRAL STRKNPRCAADGNELLRFHCSAL CDLGSRGSTGLC
Subjt:  RHAVSELGPDDSGRNIVEIIFKSSWLKKDRPICKIQRILKVHNTQRTIQRFEDCRDAVKTRALASTRKNPRCAADGNELLRFHCSALLCDLGSRGSTGLC

Query:  GSIPGCGVCSVIRHGFQCKPGGTAGVRTTASSGRAHDSFECGDGRRRAMLVCRVIAGRVKRIAEDTAA----VPVPVEEENL----SAAASYDSVSRHSG
        GSIP CGVC+VIRHGFQ KPGG  GVRTTASSGRAHDSF+CGDGRRRAMLVCRVIAGRVKRI++D AA         EEEN+    +AAASYDSVSRHSG
Subjt:  GSIPGCGVCSVIRHGFQCKPGGTAGVRTTASSGRAHDSFECGDGRRRAMLVCRVIAGRVKRIAEDTAA----VPVPVEEENL----SAAASYDSVSRHSG

Query:  MYSNLEELIIFNPKAILPCFVVIYEALQT
        MYSNLEEL+IFNPKAILPCFVVIYEALQT
Subjt:  MYSNLEELIIFNPKAILPCFVVIYEALQT

XP_008466489.1 PREDICTED: uncharacterized protein LOC103503882 [Cucumis melo]9.8e-20186.42Show/hide
Query:  MALLTSLAEPSEQKKPLNNKRRKKHQKPTPSPPPPSSAQSSWDQIKNLITCKQVEVSRVHEPAKRSPAYSKLGSSCSSICSFRDVVHGNAKVVHRADNSP
        MALLT+  +  + KKPL NKRRKKH  PTPS   PSSAQSSWD IKNLITCKQVEVSRV E  KRSPAYSKLGSSCSSICSFRDVVHGNAKVVHRADNSP
Subjt:  MALLTSLAEPSEQKKPLNNKRRKKHQKPTPSPPPPSSAQSSWDQIKNLITCKQVEVSRVHEPAKRSPAYSKLGSSCSSICSFRDVVHGNAKVVHRADNSP

Query:  ESSSVGQETRLLSRKAANGSSSRSLTAPAPTRTKNGASGS---YSSSSRGIQLRKLSGCYECHTIVDPSRYPVPRSSICPCPQCGEVFPKIESLELHQAV
        ESSSVGQETRLL+RK+ANGSSSRSLTAPAP RTKNG SGS    SSSSRGIQLRKLSGCYECHTIVDPSR+P+PRSSIC CPQCGEVFPKIESLELHQ V
Subjt:  ESSSVGQETRLLSRKAANGSSSRSLTAPAPTRTKNGASGS---YSSSSRGIQLRKLSGCYECHTIVDPSRYPVPRSSICPCPQCGEVFPKIESLELHQAV

Query:  RHAVSELGPDDSGRNIVEIIFKSSWLKKDRPICKIQRILKVHNTQRTIQRFEDCRDAVKTRALASTRKNPRCAADGNELLRFHCSALLCDLGSRGSTGLC
        RHAVSELGP+DSGRNIVEIIFKSSWLKKDRPICKI+RILKVHNTQRTIQRFEDCRDAVKTRAL S+RKNPRCAADGNELLRFHCSALLCDLGSRGSTGLC
Subjt:  RHAVSELGPDDSGRNIVEIIFKSSWLKKDRPICKIQRILKVHNTQRTIQRFEDCRDAVKTRALASTRKNPRCAADGNELLRFHCSALLCDLGSRGSTGLC

Query:  GSIPGCGVCSVIRHGFQCKPGGTAGVRTTASSGRAHDSFECGDGRRRAMLVCRVIAGRVKRIAEDT---AAVPVPVEEENL----SAAASYDSVSRHSGM
        GSIPGCGVC+VIRHGFQCKPGG  GVRTTASSGRAHDSFECGDGRRRAMLVCRVIAGRVKRI+ED           +EEN+    + AASYDSVSRHSGM
Subjt:  GSIPGCGVCSVIRHGFQCKPGGTAGVRTTASSGRAHDSFECGDGRRRAMLVCRVIAGRVKRIAEDT---AAVPVPVEEENL----SAAASYDSVSRHSGM

Query:  YSNLEELIIFNPKAILPCFVVIYEALQ
        YSNLEEL+IFNPKAILPCFVVIYEALQ
Subjt:  YSNLEELIIFNPKAILPCFVVIYEALQ

XP_022936402.1 uncharacterized protein LOC111443031 [Cucurbita moschata]1.7e-20086.02Show/hide
Query:  MALLTSLAEPSEQKKPLNNKRRKKHQKPTPSPPPPSSAQSSWDQIKNLITCKQVEVSRVHEPAKRSPAYSKLGSSCSSICSFRDVVHGNAKVVHRADNSP
        MALLTS           +NKRRK+H+ P+P PPPP SAQSSWDQIK+L+TCKQ+E SRVHEP KRSPAYSKLGSSCSSICSFRDVVHGNAKVVHRADNSP
Subjt:  MALLTSLAEPSEQKKPLNNKRRKKHQKPTPSPPPPSSAQSSWDQIKNLITCKQVEVSRVHEPAKRSPAYSKLGSSCSSICSFRDVVHGNAKVVHRADNSP

Query:  ESSSVGQETRLLSRKAANGSSSRSLTAPAPTRTKN----GASGSYSSSSRGIQLRKLSGCYECHTIVDPSRYPVPRSSICPCPQCGEVFPKIESLELHQA
        ESSSVGQETRLL RKA NGSSSRSLTAP P RTK+     AS SYSSSSRGIQLRKLSGCYECHTIVDP+RYP+PRSSI PCP CGEVFPK E+LELHQ 
Subjt:  ESSSVGQETRLLSRKAANGSSSRSLTAPAPTRTKN----GASGSYSSSSRGIQLRKLSGCYECHTIVDPSRYPVPRSSICPCPQCGEVFPKIESLELHQA

Query:  VRHAVSELGPDDSGRNIVEIIFKSSWLKKDRPICKIQRILKVHNTQRTIQRFEDCRDAVKTRALASTRKNPRCAADGNELLRFHCSALLCDLGSRGSTGL
        VRHAVSELGPDDSGRNIVEIIFKSSWLKKDRPICKIQRILKVHNTQRTIQRFEDCRDAVKTRAL ST+KNPRCAADGNELLRFHCSALLCDLGSRGSTGL
Subjt:  VRHAVSELGPDDSGRNIVEIIFKSSWLKKDRPICKIQRILKVHNTQRTIQRFEDCRDAVKTRALASTRKNPRCAADGNELLRFHCSALLCDLGSRGSTGL

Query:  CGSIPGCGVCSVIRHGFQCKPGGTAGVRTTASSGRAHDSFECGDGRRRAMLVCRVIAGRVKRIAEDTAAVPVPVEEENLSAAASYDSVSRHSGMYSNLEE
        CGSIPGCGVC+VIRHGFQCKPGG  GV+TTASSGRAHDSF C DGRRRAMLVCRVIAGRVKR+AED A      EEEN+SAA SYDSVSRHSGMYSNLEE
Subjt:  CGSIPGCGVCSVIRHGFQCKPGGTAGVRTTASSGRAHDSFECGDGRRRAMLVCRVIAGRVKRIAEDTAAVPVPVEEENLSAAASYDSVSRHSGMYSNLEE

Query:  LIIFNPKAILPCFVVIYEALQT
        LIIFNPKAILPCFVVIYEALQT
Subjt:  LIIFNPKAILPCFVVIYEALQT

XP_022940427.1 uncharacterized protein LOC111446039 [Cucurbita moschata]1.3e-20087.56Show/hide
Query:  MALLTSL---AEPSEQKKPLNNKRRKKHQKPTPS----PPPPSSAQSSWDQIKNLITCKQVEVSRVHEPAKRSPAYSKLGSSCSSICSFRDVVHGNAKVV
        MAL TSL   AE S + KP+N KRRKKHQ P PS    P PP SAQSSWDQIKNLITCKQ+E SRVHEPAKRSPA SKLGSSCSSIC FRDVVHGNAK+V
Subjt:  MALLTSL---AEPSEQKKPLNNKRRKKHQKPTPS----PPPPSSAQSSWDQIKNLITCKQVEVSRVHEPAKRSPAYSKLGSSCSSICSFRDVVHGNAKVV

Query:  HRADNSPESSSVGQETRLLSRKAANGSSSRSLTAPAPTRTKNGASGSYSSSSRGIQLRKLSGCYECHTIVDPSRYPVPRSSICPCPQCGEVFPKIESLEL
        HR DNSPE+SS+GQETRLL+ KA NGSSSRSLT  APTRTKNGAS SY SSSRG+QLRKLSGCYECHTIVDPSRYP+PRSSICPCPQCGE+FPKIESLEL
Subjt:  HRADNSPESSSVGQETRLLSRKAANGSSSRSLTAPAPTRTKNGASGSYSSSSRGIQLRKLSGCYECHTIVDPSRYPVPRSSICPCPQCGEVFPKIESLEL

Query:  HQAVRHAVSELGPDDSGRNIVEIIFKSSWLKKDRPICKIQRILKVHNTQRTIQRFEDCRDAVKTRALASTRKNPRCAADGNELLRFHCSALLCDLGSRGS
        HQAVRHAVSELGPDDSGRNIVEIIFKSSWLK DRPICKI+RILKVHNTQRTIQRFEDCRDAVKTRALA  RKNPRCAADGNELLRFHCS LLCDLGSRGS
Subjt:  HQAVRHAVSELGPDDSGRNIVEIIFKSSWLKKDRPICKIQRILKVHNTQRTIQRFEDCRDAVKTRALASTRKNPRCAADGNELLRFHCSALLCDLGSRGS

Query:  TGLCGSIPGCGVCSVIRHGFQCKPGGTAGVRTTASSGRAHDSFECGDGRRRAMLVCRVIAGRVKRIAEDTAAVPVPVEEENLS-AAASYDSVSRHSGMYS
        TGLCGSIPGC VCSVIRHGFQCKPG  AGVRTTASSGRAHDSFECGDGRRRAMLVCRVIAGRVKRIAED +A    VEEENLS AAASYDSVSR SG YS
Subjt:  TGLCGSIPGCGVCSVIRHGFQCKPGGTAGVRTTASSGRAHDSFECGDGRRRAMLVCRVIAGRVKRIAEDTAAVPVPVEEENLS-AAASYDSVSRHSGMYS

Query:  NLEELIIFNPKAILPCFVVIYEALQT
        NLEELIIFNPKAILPCFVVIYEALQT
Subjt:  NLEELIIFNPKAILPCFVVIYEALQT

XP_038900106.1 uncharacterized protein LOC120087248 [Benincasa hispida]2.3e-20287.94Show/hide
Query:  MALLTSLAEPSEQKKPLNNKRRKKHQKPTPSPPPPSSAQSSWDQIKNLITCKQVEVSRVHEPAKRSPAYSKLGSSCSSICSFRDVVHGNAKVVHRADNSP
        MALLT+  +  + KKPL NKRRKKH  PT   PPPSSAQSSWD IKNLITCKQVEVSRV EPAKRSPAYSKLGSSC SICSFRDVVHGNAKVVHRADNSP
Subjt:  MALLTSLAEPSEQKKPLNNKRRKKHQKPTPSPPPPSSAQSSWDQIKNLITCKQVEVSRVHEPAKRSPAYSKLGSSCSSICSFRDVVHGNAKVVHRADNSP

Query:  ESSSVGQETRLLSRKAANGSSSRSLTAPA--PTRTKNG--ASGSY-SSSSRGIQLRKLSGCYECHTIVDPSRYPVPRSSICPCPQCGEVFPKIESLELHQ
        ESSSVGQETRLL+RK ANGSSSRSLTAPA    RTKNG  ASGSY SSSSRGIQLRKLSGCYECHTIVDPSRYP+PRSSI PCPQCGEVFPKIESLELHQ
Subjt:  ESSSVGQETRLLSRKAANGSSSRSLTAPA--PTRTKNG--ASGSY-SSSSRGIQLRKLSGCYECHTIVDPSRYPVPRSSICPCPQCGEVFPKIESLELHQ

Query:  AVRHAVSELGPDDSGRNIVEIIFKSSWLKKDRPICKIQRILKVHNTQRTIQRFEDCRDAVKTRALASTRKNPRCAADGNELLRFHCSALLCDLGSRGSTG
         VRHAVSELGPDDSGRNIVEIIFKSSWLKKDRPICKI+RILKVHNTQRTIQRFEDCRDAVKTRAL STRKNPRCAADGNELLRFHCSAL CDLGSRGSTG
Subjt:  AVRHAVSELGPDDSGRNIVEIIFKSSWLKKDRPICKIQRILKVHNTQRTIQRFEDCRDAVKTRALASTRKNPRCAADGNELLRFHCSALLCDLGSRGSTG

Query:  LCGSIPGCGVCSVIRHGFQCKPGGTAGVRTTASSGRAHDSFECGDGRRRAMLVCRVIAGRVKRIAEDTAAVPVPVEEENLSAAASYDSVSRHSGMYSNLE
        LCGSIPGCGVCSVIRHGFQC PGG  GVRTTASSGRAHDSFECGDGRRRAMLVCRVIAGRVKRI+ED A      EE   +AAASYDS+SRHSG+YSNLE
Subjt:  LCGSIPGCGVCSVIRHGFQCKPGGTAGVRTTASSGRAHDSFECGDGRRRAMLVCRVIAGRVKRIAEDTAAVPVPVEEENLSAAASYDSVSRHSGMYSNLE

Query:  ELIIFNPKAILPCFVVIYEALQT
        EL++FNPKAILPCFVVIYEALQT
Subjt:  ELIIFNPKAILPCFVVIYEALQT

TrEMBL top hitse value%identityAlignment
A0A0A0LGH5 C2H2-type domain-containing protein1.1e-20286.48Show/hide
Query:  MALLTSLAEPSEQKKPLNNKRRKKHQKPTPSPPPPSSAQSSWDQIKNLITCKQVEVSRVHEPAKRSPAYSKLGSSCSSICSFRDVVHGNAKVVHRADNSP
        MALLT+  +  + KKPL+NKRRKKH  PT   PPPSSAQSSWD IKNLITCKQVEVSRV EP KRSPAYSKLGSSCSSICSFRDVVHGNAKVVHRADNSP
Subjt:  MALLTSLAEPSEQKKPLNNKRRKKHQKPTPSPPPPSSAQSSWDQIKNLITCKQVEVSRVHEPAKRSPAYSKLGSSCSSICSFRDVVHGNAKVVHRADNSP

Query:  ESSSVGQETRLLSRKAANGSSSRSLTAPAPTRTKNGASGS---YSSSSRGIQLRKLSGCYECHTIVDPSRYPVPRSSICPCPQCGEVFPKIESLELHQAV
        ESSSVGQETRLL+RK+ANGSSSRSLTAP P RTKNG SGS    SSSSRGIQLRKLSGCYECHTIVDPSR P+PRSSICPCPQCGEVFPKIESLELHQ V
Subjt:  ESSSVGQETRLLSRKAANGSSSRSLTAPAPTRTKNGASGS---YSSSSRGIQLRKLSGCYECHTIVDPSRYPVPRSSICPCPQCGEVFPKIESLELHQAV

Query:  RHAVSELGPDDSGRNIVEIIFKSSWLKKDRPICKIQRILKVHNTQRTIQRFEDCRDAVKTRALASTRKNPRCAADGNELLRFHCSALLCDLGSRGSTGLC
        RHAVSELGP+DSGRNIVEIIFKSSWLKKDRPICKI+RILKVHNTQRTIQRFEDCRDAVKTRAL STRKNPRCAADGNELLRFHCSAL CDLGSRGSTGLC
Subjt:  RHAVSELGPDDSGRNIVEIIFKSSWLKKDRPICKIQRILKVHNTQRTIQRFEDCRDAVKTRALASTRKNPRCAADGNELLRFHCSALLCDLGSRGSTGLC

Query:  GSIPGCGVCSVIRHGFQCKPGGTAGVRTTASSGRAHDSFECGDGRRRAMLVCRVIAGRVKRIAEDTAA----VPVPVEEENL----SAAASYDSVSRHSG
        GSIP CGVC+VIRHGFQ KPGG  GVRTTASSGRAHDSF+CGDGRRRAMLVCRVIAGRVKRI++D AA         EEEN+    +AAASYDSVSRHSG
Subjt:  GSIPGCGVCSVIRHGFQCKPGGTAGVRTTASSGRAHDSFECGDGRRRAMLVCRVIAGRVKRIAEDTAA----VPVPVEEENL----SAAASYDSVSRHSG

Query:  MYSNLEELIIFNPKAILPCFVVIYEALQT
        MYSNLEEL+IFNPKAILPCFVVIYEALQT
Subjt:  MYSNLEELIIFNPKAILPCFVVIYEALQT

A0A1S3CSN6 uncharacterized protein LOC1035038824.8e-20186.42Show/hide
Query:  MALLTSLAEPSEQKKPLNNKRRKKHQKPTPSPPPPSSAQSSWDQIKNLITCKQVEVSRVHEPAKRSPAYSKLGSSCSSICSFRDVVHGNAKVVHRADNSP
        MALLT+  +  + KKPL NKRRKKH  PTPS   PSSAQSSWD IKNLITCKQVEVSRV E  KRSPAYSKLGSSCSSICSFRDVVHGNAKVVHRADNSP
Subjt:  MALLTSLAEPSEQKKPLNNKRRKKHQKPTPSPPPPSSAQSSWDQIKNLITCKQVEVSRVHEPAKRSPAYSKLGSSCSSICSFRDVVHGNAKVVHRADNSP

Query:  ESSSVGQETRLLSRKAANGSSSRSLTAPAPTRTKNGASGS---YSSSSRGIQLRKLSGCYECHTIVDPSRYPVPRSSICPCPQCGEVFPKIESLELHQAV
        ESSSVGQETRLL+RK+ANGSSSRSLTAPAP RTKNG SGS    SSSSRGIQLRKLSGCYECHTIVDPSR+P+PRSSIC CPQCGEVFPKIESLELHQ V
Subjt:  ESSSVGQETRLLSRKAANGSSSRSLTAPAPTRTKNGASGS---YSSSSRGIQLRKLSGCYECHTIVDPSRYPVPRSSICPCPQCGEVFPKIESLELHQAV

Query:  RHAVSELGPDDSGRNIVEIIFKSSWLKKDRPICKIQRILKVHNTQRTIQRFEDCRDAVKTRALASTRKNPRCAADGNELLRFHCSALLCDLGSRGSTGLC
        RHAVSELGP+DSGRNIVEIIFKSSWLKKDRPICKI+RILKVHNTQRTIQRFEDCRDAVKTRAL S+RKNPRCAADGNELLRFHCSALLCDLGSRGSTGLC
Subjt:  RHAVSELGPDDSGRNIVEIIFKSSWLKKDRPICKIQRILKVHNTQRTIQRFEDCRDAVKTRALASTRKNPRCAADGNELLRFHCSALLCDLGSRGSTGLC

Query:  GSIPGCGVCSVIRHGFQCKPGGTAGVRTTASSGRAHDSFECGDGRRRAMLVCRVIAGRVKRIAEDT---AAVPVPVEEENL----SAAASYDSVSRHSGM
        GSIPGCGVC+VIRHGFQCKPGG  GVRTTASSGRAHDSFECGDGRRRAMLVCRVIAGRVKRI+ED           +EEN+    + AASYDSVSRHSGM
Subjt:  GSIPGCGVCSVIRHGFQCKPGGTAGVRTTASSGRAHDSFECGDGRRRAMLVCRVIAGRVKRIAEDT---AAVPVPVEEENL----SAAASYDSVSRHSGM

Query:  YSNLEELIIFNPKAILPCFVVIYEALQ
        YSNLEEL+IFNPKAILPCFVVIYEALQ
Subjt:  YSNLEELIIFNPKAILPCFVVIYEALQ

A0A6J1F8C7 uncharacterized protein LOC1114430318.1e-20186.02Show/hide
Query:  MALLTSLAEPSEQKKPLNNKRRKKHQKPTPSPPPPSSAQSSWDQIKNLITCKQVEVSRVHEPAKRSPAYSKLGSSCSSICSFRDVVHGNAKVVHRADNSP
        MALLTS           +NKRRK+H+ P+P PPPP SAQSSWDQIK+L+TCKQ+E SRVHEP KRSPAYSKLGSSCSSICSFRDVVHGNAKVVHRADNSP
Subjt:  MALLTSLAEPSEQKKPLNNKRRKKHQKPTPSPPPPSSAQSSWDQIKNLITCKQVEVSRVHEPAKRSPAYSKLGSSCSSICSFRDVVHGNAKVVHRADNSP

Query:  ESSSVGQETRLLSRKAANGSSSRSLTAPAPTRTKN----GASGSYSSSSRGIQLRKLSGCYECHTIVDPSRYPVPRSSICPCPQCGEVFPKIESLELHQA
        ESSSVGQETRLL RKA NGSSSRSLTAP P RTK+     AS SYSSSSRGIQLRKLSGCYECHTIVDP+RYP+PRSSI PCP CGEVFPK E+LELHQ 
Subjt:  ESSSVGQETRLLSRKAANGSSSRSLTAPAPTRTKN----GASGSYSSSSRGIQLRKLSGCYECHTIVDPSRYPVPRSSICPCPQCGEVFPKIESLELHQA

Query:  VRHAVSELGPDDSGRNIVEIIFKSSWLKKDRPICKIQRILKVHNTQRTIQRFEDCRDAVKTRALASTRKNPRCAADGNELLRFHCSALLCDLGSRGSTGL
        VRHAVSELGPDDSGRNIVEIIFKSSWLKKDRPICKIQRILKVHNTQRTIQRFEDCRDAVKTRAL ST+KNPRCAADGNELLRFHCSALLCDLGSRGSTGL
Subjt:  VRHAVSELGPDDSGRNIVEIIFKSSWLKKDRPICKIQRILKVHNTQRTIQRFEDCRDAVKTRALASTRKNPRCAADGNELLRFHCSALLCDLGSRGSTGL

Query:  CGSIPGCGVCSVIRHGFQCKPGGTAGVRTTASSGRAHDSFECGDGRRRAMLVCRVIAGRVKRIAEDTAAVPVPVEEENLSAAASYDSVSRHSGMYSNLEE
        CGSIPGCGVC+VIRHGFQCKPGG  GV+TTASSGRAHDSF C DGRRRAMLVCRVIAGRVKR+AED A      EEEN+SAA SYDSVSRHSGMYSNLEE
Subjt:  CGSIPGCGVCSVIRHGFQCKPGGTAGVRTTASSGRAHDSFECGDGRRRAMLVCRVIAGRVKRIAEDTAAVPVPVEEENLSAAASYDSVSRHSGMYSNLEE

Query:  LIIFNPKAILPCFVVIYEALQT
        LIIFNPKAILPCFVVIYEALQT
Subjt:  LIIFNPKAILPCFVVIYEALQT

A0A6J1FJK3 uncharacterized protein LOC1114460396.2e-20187.56Show/hide
Query:  MALLTSL---AEPSEQKKPLNNKRRKKHQKPTPS----PPPPSSAQSSWDQIKNLITCKQVEVSRVHEPAKRSPAYSKLGSSCSSICSFRDVVHGNAKVV
        MAL TSL   AE S + KP+N KRRKKHQ P PS    P PP SAQSSWDQIKNLITCKQ+E SRVHEPAKRSPA SKLGSSCSSIC FRDVVHGNAK+V
Subjt:  MALLTSL---AEPSEQKKPLNNKRRKKHQKPTPS----PPPPSSAQSSWDQIKNLITCKQVEVSRVHEPAKRSPAYSKLGSSCSSICSFRDVVHGNAKVV

Query:  HRADNSPESSSVGQETRLLSRKAANGSSSRSLTAPAPTRTKNGASGSYSSSSRGIQLRKLSGCYECHTIVDPSRYPVPRSSICPCPQCGEVFPKIESLEL
        HR DNSPE+SS+GQETRLL+ KA NGSSSRSLT  APTRTKNGAS SY SSSRG+QLRKLSGCYECHTIVDPSRYP+PRSSICPCPQCGE+FPKIESLEL
Subjt:  HRADNSPESSSVGQETRLLSRKAANGSSSRSLTAPAPTRTKNGASGSYSSSSRGIQLRKLSGCYECHTIVDPSRYPVPRSSICPCPQCGEVFPKIESLEL

Query:  HQAVRHAVSELGPDDSGRNIVEIIFKSSWLKKDRPICKIQRILKVHNTQRTIQRFEDCRDAVKTRALASTRKNPRCAADGNELLRFHCSALLCDLGSRGS
        HQAVRHAVSELGPDDSGRNIVEIIFKSSWLK DRPICKI+RILKVHNTQRTIQRFEDCRDAVKTRALA  RKNPRCAADGNELLRFHCS LLCDLGSRGS
Subjt:  HQAVRHAVSELGPDDSGRNIVEIIFKSSWLKKDRPICKIQRILKVHNTQRTIQRFEDCRDAVKTRALASTRKNPRCAADGNELLRFHCSALLCDLGSRGS

Query:  TGLCGSIPGCGVCSVIRHGFQCKPGGTAGVRTTASSGRAHDSFECGDGRRRAMLVCRVIAGRVKRIAEDTAAVPVPVEEENLS-AAASYDSVSRHSGMYS
        TGLCGSIPGC VCSVIRHGFQCKPG  AGVRTTASSGRAHDSFECGDGRRRAMLVCRVIAGRVKRIAED +A    VEEENLS AAASYDSVSR SG YS
Subjt:  TGLCGSIPGCGVCSVIRHGFQCKPGGTAGVRTTASSGRAHDSFECGDGRRRAMLVCRVIAGRVKRIAEDTAAVPVPVEEENLS-AAASYDSVSRHSGMYS

Query:  NLEELIIFNPKAILPCFVVIYEALQT
        NLEELIIFNPKAILPCFVVIYEALQT
Subjt:  NLEELIIFNPKAILPCFVVIYEALQT

A0A6J1IJ58 uncharacterized protein LOC1114767635.8e-19985.61Show/hide
Query:  MALLTSLAEPSEQKKPLNNKRRKKHQKPTPSPPPPSSAQSSWDQIKNLITCKQVEVSRVHEPAKRSPAYSKLGSSCSSICSFRDVVHGNAKVVHRADNSP
        MALLTS           +NKRRKKH+ P+   PPP SAQSSWDQIK+L+TCKQ+E SRVHEP KRSPAYSKLGSSCSSICSFRDVVHGNAKVVHRADNSP
Subjt:  MALLTSLAEPSEQKKPLNNKRRKKHQKPTPSPPPPSSAQSSWDQIKNLITCKQVEVSRVHEPAKRSPAYSKLGSSCSSICSFRDVVHGNAKVVHRADNSP

Query:  ESSSVGQETRLLSRKAANGSSSRSLTAPAPTRTKN------GASGSYSSSSRGIQLRKLSGCYECHTIVDPSRYPVPRSSICPCPQCGEVFPKIESLELH
        ESSSVGQETRLL RKA NGSSSRSLTAP P R+K+       AS SYSSSSRGIQLRKLSGCYECHTIVDP+RYP+PRSSICPCP CGEVFPK ESLELH
Subjt:  ESSSVGQETRLLSRKAANGSSSRSLTAPAPTRTKN------GASGSYSSSSRGIQLRKLSGCYECHTIVDPSRYPVPRSSICPCPQCGEVFPKIESLELH

Query:  QAVRHAVSELGPDDSGRNIVEIIFKSSWLKKDRPICKIQRILKVHNTQRTIQRFEDCRDAVKTRALASTRKNPRCAADGNELLRFHCSALLCDLGSRGST
        Q VRHAVSELGPDDSGRNIVEIIFKSSWLKKDRPICKIQRILKVHNTQRTIQRFEDCRDAVKTRAL ST+KNPRCAADGNELLRFHCSALLCDLGSRGST
Subjt:  QAVRHAVSELGPDDSGRNIVEIIFKSSWLKKDRPICKIQRILKVHNTQRTIQRFEDCRDAVKTRALASTRKNPRCAADGNELLRFHCSALLCDLGSRGST

Query:  GLCGSIPGCGVCSVIRHGFQCKPGGTAGVRTTASSGRAHDSFECGDGRRRAMLVCRVIAGRVKRIAEDTAAVPVPVEEENLSAAASYDSVSRHSGMYSNL
        GLCGSIP CGVCSVIRHGFQCKPGG  GV+TTASSGRAHDSF C DGRRRAMLVCRVIAGRVKR+AED A      EEEN+SAA SYDSVSRHSGMYSNL
Subjt:  GLCGSIPGCGVCSVIRHGFQCKPGGTAGVRTTASSGRAHDSFECGDGRRRAMLVCRVIAGRVKRIAEDTAAVPVPVEEENLSAAASYDSVSRHSGMYSNL

Query:  EELIIFNPKAILPCFVVIYEALQT
        EELIIFNPKAILPCFVVIYEALQT
Subjt:  EELIIFNPKAILPCFVVIYEALQT

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G11490.1 zinc finger (C2H2 type) family protein2.6e-3432.74Show/hide
Query:  WDQIKNLITCKQVEVSRVH-EPAKRSPAYSKLGSSCS--SICSFRDVVHGNAKVVHRADNSPESSSVGQETRLLSRKAANGSSSRSLTAPAPTRT-KNGA
        W  +K  ++C + + S V  +P K      +  S CS  S+ + RDV   N                G E  +   +  +  SSRSL +     T K   
Subjt:  WDQIKNLITCKQVEVSRVH-EPAKRSPAYSKLGSSCS--SICSFRDVVHGNAKVVHRADNSPESSSVGQETRLLSRKAANGSSSRSLTAPAPTRT-KNGA

Query:  SGSYSSSSRGIQLRKLSGCYECHTIVD--PSRYPVPRSSIC-----PCPQCGEVFPKIESLELHQAVRHAVSELGPDDSGRNIVEIIFKSSWLKKDRPI-
        +  YS   +G+    LSG      +      R+ V  S IC      C +C E    +++ E H    H+V  L   D  R  VE+I  + +  K   + 
Subjt:  SGSYSSSSRGIQLRKLSGCYECHTIVD--PSRYPVPRSSIC-----PCPQCGEVFPKIESLELHQAVRHAVSELGPDDSGRNIVEIIFKSSWLKKDRPI-

Query:  -CKIQRILKVHNTQRTIQRFEDCRDAVKTRALASTRKNPRCAADGNELLRFHCSALLCDLG-SRGSTGLCGSIPGCGVCSVIRHGF--QCKPGGTAGVRT
           I  I K+ N QR +  FED R+ VK RA   ++K+ RC ADGNE L FH + L C LG S  S+ LC S   C VC ++RHGF  + +P G  GV T
Subjt:  -CKIQRILKVHNTQRTIQRFEDCRDAVKTRALASTRKNPRCAADGNELLRFHCSALLCDLG-SRGSTGLCGSIPGCGVCSVIRHGF--QCKPGGTAGVRT

Query:  TASSGRAHDSFECGDGRRR----AMLVCRVIAGRVKRIAEDTAAVPVPVEE-ENLSAAASYDSVSRHSGMYSNLEELIIFNPKAILPCFVVIYE
         ++S  A +S E   GR R    A+++CRVIAGRV +          P++  EN    + +DS++   G  S +EEL + + KA+LPCFV+I++
Subjt:  TASSGRAHDSFECGDGRRR----AMLVCRVIAGRVKRIAEDTAAVPVPVEE-ENLSAAASYDSVSRHSGMYSNLEELIIFNPKAILPCFVVIYE

AT1G75710.1 C2H2-like zinc finger protein5.7e-14661.88Show/hide
Query:  MALLTSL---AEPSEQKKPLNNKRRK-----------KHQKPTPSPPPPSSAQSSWDQIKNLITCKQVEVSRVHEPAKRSP---------AYSKLGSSCS
        MALLT L   AE  ++ KP ++KR+K           KH+   P    P    SSWDQIKNL+TCKQ+E SRVH+P+K S          + SKLGSSCS
Subjt:  MALLTSL---AEPSEQKKPLNNKRRK-----------KHQKPTPSPPPPSSAQSSWDQIKNLITCKQVEVSRVHEPAKRSP---------AYSKLGSSCS

Query:  SICSFRDVVHGNAKVVHRADNSPE---SSSVGQETRLLSRKAA--NGSSSRSLTAPAPTRTKNGASGSYSSSS----RGIQLRKLSGCYECHTIVDPSRY
        SICSFRDV HGN +VVHRAD+SP+   S++   ETRLL+RK      SSSRSLT+ +   T++ ASGSY+SSS    R +Q RKLSGCYECH IVDPSRY
Subjt:  SICSFRDVVHGNAKVVHRADNSPE---SSSVGQETRLLSRKAA--NGSSSRSLTAPAPTRTKNGASGSYSSSS----RGIQLRKLSGCYECHTIVDPSRY

Query:  PV-PRSSICPCPQCGEVFPKIESLELHQAVRHAVSELGPDDSGRNIVEIIFKSSWLKKDRPICKIQRILKVHNTQRTIQRFEDCRDAVKTRALASTRKNP
        P+ PR  +C C QCGEVFPK+ESLELHQAVRHAVSELGP+DSGRNIVEIIFKSSWLKKD PIC+I+RILKVHNTQRTIQRFEDCRDAVK RAL +TRK+ 
Subjt:  PV-PRSSICPCPQCGEVFPKIESLELHQAVRHAVSELGPDDSGRNIVEIIFKSSWLKKDRPICKIQRILKVHNTQRTIQRFEDCRDAVKTRALASTRKNP

Query:  RCAADGNELLRFHCSALLCDLGSRGSTGLCGSIPGCGVCSVIRHGFQCKPGG------TAGVRTTASSGRAHDSFECGDGRRRAMLVCRVIAGRVKRI--
        RCAADGNELLRFHC+ L C LG+RGS+ LC ++P CGVC+VIRHGFQ K GG       AGVRTTASSGRA D   C D  RR MLVCRVIAGRVKR+  
Subjt:  RCAADGNELLRFHCSALLCDLGSRGSTGLCGSIPGCGVCSVIRHGFQCKPGG------TAGVRTTASSGRAHDSFECGDGRRRAMLVCRVIAGRVKRI--

Query:  ----AEDTAAVPVPVEEENL----SAAASYDSVSRHSGMYSNLEELIIFNPKAILPCFVVIYEALQT
            A  TA     VE+ ++    S+  ++DSV+ ++G+YSNLEEL+++NP+AILPCFVVIY+ L++
Subjt:  ----AEDTAAVPVPVEEENL----SAAASYDSVSRHSGMYSNLEELIIFNPKAILPCFVVIYEALQT

AT2G29660.1 zinc finger (C2H2 type) family protein5.6e-4542.69Show/hide
Query:  ICPCPQCGEVFPKIESLELHQAVRHAVSELGPDDSGRNIVEIIFKSSWLKKDR---PICKIQRILKVHNTQRTIQRFEDCRDAVKTRALASTR-----KN
        I PC  CGE+FPKI  LE H A++HAVSEL   +S  NIV+IIFKS W ++     P+  I RILK+HN+ + + RFE+ R+ VK +A  S        +
Subjt:  ICPCPQCGEVFPKIESLELHQAVRHAVSELGPDDSGRNIVEIIFKSSWLKKDR---PICKIQRILKVHNTQRTIQRFEDCRDAVKTRALASTR-----KN

Query:  PRCAADGNELLRFHCSALLCDLGSRGSTGLCGSIPGCGVCSVIRHGFQCKPGGTAGVRTTASSGRAHDSF------ECG-DGRRRAMLVCRVIAGRVKRI
         RC ADGNELLRF+CS  +CDLG  G + LCG    C +C +I  GF  K     G+ T A+  R H +       E G    +RAMLVCRV+AGRV   
Subjt:  PRCAADGNELLRFHCSALLCDLGSRGSTGLCGSIPGCGVCSVIRHGFQCKPGGTAGVRTTASSGRAHDSF------ECG-DGRRRAMLVCRVIAGRVKRI

Query:  AEDTAAVPVPVEEENLSAAASYDSVSRHSGMYSNL------EELIIFNPKAILPCFVVIY
                +  ++ + S    YDS+   SG  S        +EL++FNP+A+LPCFV++Y
Subjt:  AEDTAAVPVPVEEENLSAAASYDSVSRHSGMYSNL------EELIIFNPKAILPCFVVIY

AT4G27240.1 zinc finger (C2H2 type) family protein7.6e-5838.41Show/hide
Query:  KRRKKHQKPTPSPPPPSSAQSSWDQIKNLITCKQVEVSRVHEP----------AKRSPAYSKLG----SSCS-SICSFRDVVHGNAKVVHR-ADNSPESS
        K++K  Q+            S W  +K  + CK  +VS VH P           KR+   S  G    S CS SI + +DV+HGN + + +   +SP S 
Subjt:  KRRKKHQKPTPSPPPPSSAQSSWDQIKNLITCKQVEVSRVHEP----------AKRSPAYSKLG----SSCS-SICSFRDVVHGNAKVVHR-ADNSPESS

Query:  SVGQETRLLSRKA--ANGSSSRSLTAPAPT----RTKNGASGSYSSSSRGIQLRKLS-------GCYECHTIVDPSRYPVPRSSICPCPQCGEVFPKIES
           +    ++     +N +    +TA   T      + G   +YSSS R    RK S       G ++     D        +S   C +CGE F K+E+
Subjt:  SVGQETRLLSRKA--ANGSSSRSLTAPAPT----RTKNGASGSYSSSSRGIQLRKLS-------GCYECHTIVDPSRYPVPRSSICPCPQCGEVFPKIES

Query:  LELHQAVRHAVSELGPDDSGRNIVEIIFKSSWLKKDRPICKIQRILKVHNTQRTIQRFEDCRDAVKTRALASTRKNPRCAADGNELLRFHCSALLCDLGS
         E H   +HAV+EL   DS R IVEII ++SWLK +    +I RILKVHN Q+T+ RFE+ RD VK RA    +K+PRC ADGNELLRFH + + C LG 
Subjt:  LELHQAVRHAVSELGPDDSGRNIVEIIFKSSWLKKDRPICKIQRILKVHNTQRTIQRFEDCRDAVKTRALASTRKNPRCAADGNELLRFHCSALLCDLGS

Query:  RGSTGLCGSIPGCGVCSVIRHGFQCK--PGGTAGVRTTASSGRAHDSFECGD---GRRRAMLVCRVIAGRVKRIAEDTAAVPVPVEEENLSAAASYDSVS
         GST LC S   C VC +IR+GF  K       GV T ++S RA +S   GD   G R+A++VCRVIAGRV R        PV   EE     + +DS++
Subjt:  RGSTGLCGSIPGCGVCSVIRHGFQCK--PGGTAGVRTTASSGRAHDSFECGD---GRRRAMLVCRVIAGRVKRIAEDTAAVPVPVEEENLSAAASYDSVS

Query:  RHSGMYSNLEELIIFNPKAILPCFVVI
           G+Y+N+EEL + N +A+LPCFV+I
Subjt:  RHSGMYSNLEELIIFNPKAILPCFVVI

AT5G54630.1 zinc finger protein-related3.3e-5347.01Show/hide
Query:  SSICPCPQCGEVFPKIESLELHQAVRHAVSELGPDDSGRNIVEIIFKSSWLKKDRPICKIQRILKVHNTQRTIQRFEDCRDAVKTRALASTRKNPRCAAD
        +S   C +CGE F K+E+ E H   +HAV+EL   DS R IVEII ++SWLK +    +I R+LKVHN Q+T+ RFE+ R+ VK RA    +K+PRC AD
Subjt:  SSICPCPQCGEVFPKIESLELHQAVRHAVSELGPDDSGRNIVEIIFKSSWLKKDRPICKIQRILKVHNTQRTIQRFEDCRDAVKTRALASTRKNPRCAAD

Query:  GNELLRFHCSALLCDLGSRGSTGLCGSIPGCGVCSVIRHGFQCK--PGGTAGVRTTASSGRAHDSF------ECGD---GRRRAMLVCRVIAGRVKRIAE
        GNELLRFH + + C LG  GST +C +   C VC +IR+GF  K       GV T ++SGRA +S       E GD     R+ ++VCRVIAGRV R   
Subjt:  GNELLRFHCSALLCDLGSRGSTGLCGSIPGCGVCSVIRHGFQCK--PGGTAGVRTTASSGRAHDSF------ECGD---GRRRAMLVCRVIAGRVKRIAE

Query:  DTAAVPVPVEEENLSAAASYDSVSRHSGMYSNLEELIIFNPKAILPCFVVI
             PV   EE     + +DS++   G+Y+N+EEL + NPKA+LPCFVVI
Subjt:  DTAAVPVPVEEENLSAAASYDSVSRHSGMYSNLEELIIFNPKAILPCFVVI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTCTCTTAACTTCCTTGGCAGAGCCCTCTGAGCAAAAGAAACCCCTCAACAATAAACGCAGAAAAAAGCACCAAAAGCCAACCCCGTCACCGCCACCGCCGTCCTC
CGCCCAATCCTCATGGGACCAAATCAAGAATCTCATCACCTGCAAACAGGTCGAGGTTTCGAGAGTTCATGAGCCGGCGAAACGCTCGCCGGCGTATTCGAAGTTGGGGT
CTTCATGCAGTTCCATTTGTAGTTTCAGAGACGTGGTCCATGGCAATGCCAAAGTTGTACACCGAGCCGACAACTCACCGGAAAGCAGCTCCGTCGGACAGGAAACTAGG
TTACTCAGTAGAAAAGCTGCAAACGGTTCGTCGTCTCGTTCTTTGACGGCACCGGCGCCGACGAGAACGAAAAACGGTGCGTCTGGTTCGTACTCCTCGTCTTCGAGAGG
AATACAATTGCGAAAGCTTTCTGGGTGTTATGAATGTCACACCATCGTTGACCCTAGCAGGTACCCAGTTCCGAGGAGTTCTATATGTCCTTGTCCTCAATGTGGAGAGG
TCTTCCCCAAGATTGAAAGCTTAGAGCTTCACCAAGCAGTTCGCCATGCTGTTTCCGAGTTGGGTCCTGACGATTCGGGTCGAAATATTGTGGAGATAATTTTCAAGTCA
AGCTGGCTAAAAAAGGACCGCCCCATTTGCAAGATCCAACGGATATTGAAGGTCCACAACACCCAACGCACCATCCAACGCTTCGAGGACTGCCGCGATGCAGTCAAGAC
ACGTGCGCTCGCCAGCACCAGAAAAAACCCGCGCTGTGCGGCCGACGGTAATGAGCTGTTGCGCTTCCACTGTAGCGCCTTGTTGTGCGACCTCGGCTCACGTGGCTCGA
CCGGCTTGTGCGGCTCCATTCCTGGCTGCGGCGTCTGCAGCGTCATCCGCCATGGATTCCAGTGCAAGCCTGGTGGGACCGCCGGCGTACGGACTACTGCCAGTAGTGGT
AGGGCCCACGATTCCTTCGAATGCGGCGACGGGCGACGGCGGGCGATGTTGGTGTGCCGTGTCATCGCCGGGAGAGTGAAGCGGATCGCTGAGGATACGGCGGCGGTGCC
GGTGCCGGTGGAGGAGGAGAATTTGTCGGCGGCGGCATCGTACGACTCCGTTTCGCGACACTCGGGGATGTACTCGAATCTCGAGGAGTTGATCATTTTCAATCCGAAGG
CTATCCTTCCTTGTTTCGTCGTCATCTACGAAGCGCTCCAAACCTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCTCTCTTAACTTCCTTGGCAGAGCCCTCTGAGCAAAAGAAACCCCTCAACAATAAACGCAGAAAAAAGCACCAAAAGCCAACCCCGTCACCGCCACCGCCGTCCTC
CGCCCAATCCTCATGGGACCAAATCAAGAATCTCATCACCTGCAAACAGGTCGAGGTTTCGAGAGTTCATGAGCCGGCGAAACGCTCGCCGGCGTATTCGAAGTTGGGGT
CTTCATGCAGTTCCATTTGTAGTTTCAGAGACGTGGTCCATGGCAATGCCAAAGTTGTACACCGAGCCGACAACTCACCGGAAAGCAGCTCCGTCGGACAGGAAACTAGG
TTACTCAGTAGAAAAGCTGCAAACGGTTCGTCGTCTCGTTCTTTGACGGCACCGGCGCCGACGAGAACGAAAAACGGTGCGTCTGGTTCGTACTCCTCGTCTTCGAGAGG
AATACAATTGCGAAAGCTTTCTGGGTGTTATGAATGTCACACCATCGTTGACCCTAGCAGGTACCCAGTTCCGAGGAGTTCTATATGTCCTTGTCCTCAATGTGGAGAGG
TCTTCCCCAAGATTGAAAGCTTAGAGCTTCACCAAGCAGTTCGCCATGCTGTTTCCGAGTTGGGTCCTGACGATTCGGGTCGAAATATTGTGGAGATAATTTTCAAGTCA
AGCTGGCTAAAAAAGGACCGCCCCATTTGCAAGATCCAACGGATATTGAAGGTCCACAACACCCAACGCACCATCCAACGCTTCGAGGACTGCCGCGATGCAGTCAAGAC
ACGTGCGCTCGCCAGCACCAGAAAAAACCCGCGCTGTGCGGCCGACGGTAATGAGCTGTTGCGCTTCCACTGTAGCGCCTTGTTGTGCGACCTCGGCTCACGTGGCTCGA
CCGGCTTGTGCGGCTCCATTCCTGGCTGCGGCGTCTGCAGCGTCATCCGCCATGGATTCCAGTGCAAGCCTGGTGGGACCGCCGGCGTACGGACTACTGCCAGTAGTGGT
AGGGCCCACGATTCCTTCGAATGCGGCGACGGGCGACGGCGGGCGATGTTGGTGTGCCGTGTCATCGCCGGGAGAGTGAAGCGGATCGCTGAGGATACGGCGGCGGTGCC
GGTGCCGGTGGAGGAGGAGAATTTGTCGGCGGCGGCATCGTACGACTCCGTTTCGCGACACTCGGGGATGTACTCGAATCTCGAGGAGTTGATCATTTTCAATCCGAAGG
CTATCCTTCCTTGTTTCGTCGTCATCTACGAAGCGCTCCAAACCTAA
Protein sequenceShow/hide protein sequence
MALLTSLAEPSEQKKPLNNKRRKKHQKPTPSPPPPSSAQSSWDQIKNLITCKQVEVSRVHEPAKRSPAYSKLGSSCSSICSFRDVVHGNAKVVHRADNSPESSSVGQETR
LLSRKAANGSSSRSLTAPAPTRTKNGASGSYSSSSRGIQLRKLSGCYECHTIVDPSRYPVPRSSICPCPQCGEVFPKIESLELHQAVRHAVSELGPDDSGRNIVEIIFKS
SWLKKDRPICKIQRILKVHNTQRTIQRFEDCRDAVKTRALASTRKNPRCAADGNELLRFHCSALLCDLGSRGSTGLCGSIPGCGVCSVIRHGFQCKPGGTAGVRTTASSG
RAHDSFECGDGRRRAMLVCRVIAGRVKRIAEDTAAVPVPVEEENLSAAASYDSVSRHSGMYSNLEELIIFNPKAILPCFVVIYEALQT