CuGenDBv2

Lag0024255 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0024255
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: C3HC-type domain-containing protein
Genome location: chr10:1630624..1635567
RNA-Seq Expression: Lag0024255
Synteny: Lag0024255
Gene Ontology terms: GO:0005634 - nucleus (cellular component)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains: IPR012935 - Zinc finger, C3HC-like


Homology
GenBank top hits
KAG6584377.1 NIPA-like protein, partial [Cucurbita argyrosperma subsp. sororia] | e value: 0.0e+00 | % identity: 82.4
Query:  MTQDSEKRFHSIMDKLFQNGQATPNSNSASSPSSSSSPSGVQLLRGKKRPYSSSALVVGELRSKSDVIEALQKHSSASAGSTDAPLCRPWDRGDLSKRLT
        M+QDSEKRFHSIMDKLFQN  ATPNSNSASSP  SSSPSG QL RG+KRPYSSS LVVGELR+KSDVIEALQKHS+ASAGS+DAPLCRPWDRGDLSKRLT
Subjt:  MTQDSEKRFHSIMDKLFQNGQATPNSNSASSPSSSSSPSGVQLLRGKKRPYSSSALVVGELRSKSDVIEALQKHSSASAGSTDAPLCRPWDRGDLSKRLT

Query:  TFKSMTWFGKPKVVNAINCSRRGWINVDMDTIACESCGARLLFSTPSSWNQQQVEKAALVFSLKLDNGHKLLCPWIDNACDEALADFPPTPAPILVKKYG
        TFKSMTWFGKPKVVN INC+RRGWINVDMDTIACESCGARLLFSTPSSWNQQQVEKAALVFSLKLDNGHKLLCPWIDNACDEALA+FPPTP P LV K+ 
Subjt:  TFKSMTWFGKPKVVNAINCSRRGWINVDMDTIACESCGARLLFSTPSSWNQQQVEKAALVFSLKLDNGHKLLCPWIDNACDEALADFPPTPAPILVKKYG

Query:  ERCSMLLHLSALPVISSSFLKWMKSSHLKQFLEELSLKEFGNESLNNSEIEFLGDGHDSDTAKVYYQALKLISLFGWEPRSLPYVVDCKTGSDQSLKKST
        ERCSMLLHLSALPVISSSF+KWMKS HLK+FLEELSL+EFGNES   SEIE+LGDGHDS+TA+VYYQALKLISLFGWEPRSLPYVVDCKTGSDQSLKK+T
Subjt:  ERCSMLLHLSALPVISSSFLKWMKSSHLKQFLEELSLKEFGNESLNNSEIEFLGDGHDSDTAKVYYQALKLISLFGWEPRSLPYVVDCKTGSDQSLKKST

Query:  TLDSRPTVNLCTAATKENVNRNRIAEISSELQSLPNSVVLDCRLCGAGVGLWTFHTIPRPVEIIRLVGPTELNGESGTRDSGNKSVVNHSGIGNVGISSK
        TL S PTVNL TAAT ENV+ N IAEISSELQS PNSVVLDCRLCGA VGLW F TIP+PVEIIRLVGPTELN ESGT DSGNKSV+NH+GI NV     
Subjt:  TLDSRPTVNLCTAATKENVNRNRIAEISSELQSLPNSVVLDCRLCGAGVGLWTFHTIPRPVEIIRLVGPTELNGESGTRDSGNKSVVNHSGIGNVGISSK

Query:  ESISNLTSTIAGGPTPARQSFKATITLPVIGQNLRARLFNDEKFIDQMYTDQEMVQADSLDKNMLQESKNDEDTTLTGQTDQSEGIRLFQNQTLDHGCST
          +S L+STIAGGPTPARQSFKATITLPVIGQNLRARLFNDEK   +MYTDQEMVQADSLDKNMLQ+SK+ ED+TLTGQ DQ       QNQT D  CST
Subjt:  ESISNLTSTIAGGPTPARQSFKATITLPVIGQNLRARLFNDEKFIDQMYTDQEMVQADSLDKNMLQESKNDEDTTLTGQTDQSEGIRLFQNQTLDHGCST

Query:  TGDDQTPLLEGTSVTDRGTLPESSLNGSTEEAQEKRTEIVPAQEIEVLENA---------GKAADLHPGPSPVENPLTSTDAVMITSSECSEKELPSVVS
        +GDDQTPLLEG SVTD+GTLPES LNGSTEE Q KRTEIVPAQEIEV+ENA          KA DLH GPSPV+  L STD+VMITSSECSEK+LPS V 
Subjt:  TGDDQTPLLEGTSVTDRGTLPESSLNGSTEEAQEKRTEIVPAQEIEVLENA---------GKAADLHPGPSPVENPLTSTDAVMITSSECSEKELPSVVS

Query:  DQCDSQQVSENDTSNSKDVSLANLQVTTCKSPCLEVDTNIDITSKNESTNDKLGSDNHTTSENQDSEGGGAANDKVHTSVNSEHITHGGEDYPKGAPFGN
        DQCD QQVSENDTSNSK+VSL +LQVT  KS C EVDTN DI S+ EST DKL SDNH TSENQD EGG  ANDKV+TSVNSEHI H GEDYPKG P G 
Subjt:  DQCDSQQVSENDTSNSKDVSLANLQVTTCKSPCLEVDTNIDITSKNESTNDKLGSDNHTTSENQDSEGGGAANDKVHTSVNSEHITHGGEDYPKGAPFGN

Query:  VMEFDPIRQHRHFCSWIATGNASPGWKQTLIALQREKSSSPHSPKNSPSASLIKVDDPVRSVRNLFTSSAKKLKSSLVSNDNTK
        V EFDPIRQHRHFC WIATGN +PGWK TL ALQRE SSSPHSPKNSPSASLIKVDDPV SVRNLFTSSAKKLKSSLVSN++TK
Subjt:  VMEFDPIRQHRHFCSWIATGNASPGWKQTLIALQREKSSSPHSPKNSPSASLIKVDDPVRSVRNLFTSSAKKLKSSLVSNDNTK

XP_008455775.1 PREDICTED: uncharacterized protein LOC103495850 isoform X1 [Cucumis melo] | e value: 0.0e+00 | % identity: 81.04
Query:  MTQDSEKRFHSIMDKLFQNGQATPNSNSASSPSSSSSPSGVQLLRGKKRPYSSSALVVGELRSKSDVIEALQKHSSASAGSTDAPLCRPWDRGDLSKRLT
        M+QDSEKRFHSIMDKLFQN Q++PNSNSASS SSSSSPSGVQL RGKKRPYSSSALVVGELRSKSDVIEALQKHSSAS GS+DAPLCRPWDRGDL KRL 
Subjt:  MTQDSEKRFHSIMDKLFQNGQATPNSNSASSPSSSSSPSGVQLLRGKKRPYSSSALVVGELRSKSDVIEALQKHSSASAGSTDAPLCRPWDRGDLSKRLT

Query:  TFKSMTWFGKPKVVNAINCSRRGWINVDMDTIACESCGARLLFSTPSSWNQQQVEKAALVFSLKLDNGHKLLCPWIDNACDEALADFPPTPAPILVKKYG
        TFKSMTWFGKPKVVNAINC+RRGW+NVD DTIACESCGARLLFSTPSSWNQQQVEKAALVFSLKLDNGHKLLCPWIDNACDEALADFPPTP P+LV K+ 
Subjt:  TFKSMTWFGKPKVVNAINCSRRGWINVDMDTIACESCGARLLFSTPSSWNQQQVEKAALVFSLKLDNGHKLLCPWIDNACDEALADFPPTPAPILVKKYG

Query:  ERCSMLLHLSALPVISSSFLKWMKSSHLKQFLEELSLKEFGNESLNNSEIEFLGDGHDSDTAKVYYQALKLISLFGWEPRSLPYVVDCKT-GSDQSLKKS
        ER SMLLHLSALPVISSSFLKWM S HL QF+EEL+L  FGNESL+ SE+E+LGDGHDSDT KVYYQALKLISLFGWEPRS+PY+V+CK+ GSDQSLKKS
Subjt:  ERCSMLLHLSALPVISSSFLKWMKSSHLKQFLEELSLKEFGNESLNNSEIEFLGDGHDSDTAKVYYQALKLISLFGWEPRSLPYVVDCKT-GSDQSLKKS

Query:  TTLDSRPTVNLCTAATKENVNRNRIAEISSELQSLPNSVVLDCRLCGAGVGLWTFHTIPRPVEIIRLVGPTELNGESGTRDSGNKSVVNHSGIGNVGISS
        TT DS PTV+L T ATKENV+ NRIAE+SSELQS PNSVVLDCRLCGA VGLWTFHTIPRPVEIIRLVGPTELN ESGT DSGNKSV+NH+GIG+VG   
Subjt:  TTLDSRPTVNLCTAATKENVNRNRIAEISSELQSLPNSVVLDCRLCGAGVGLWTFHTIPRPVEIIRLVGPTELNGESGTRDSGNKSVVNHSGIGNVGISS

Query:  KESISNLTSTIAGGPTPARQSFKATITLPVIGQNLRARLFNDEKFIDQMYTDQEMVQADSLDKNMLQESKNDEDTTLTGQTDQSEGIRLFQNQTLDHGCS
           IS LTSTIAGGPTPARQSFKATITLPVIGQ+LRARLFNDEKF DQ+Y DQEMVQADS D+ + + SK++EDTT +GQTDQ E  RL QNQT+D GC 
Subjt:  KESISNLTSTIAGGPTPARQSFKATITLPVIGQNLRARLFNDEKFIDQMYTDQEMVQADSLDKNMLQESKNDEDTTLTGQTDQSEGIRLFQNQTLDHGCS

Query:  TTGDDQTPLLEGTSVTDRGTLPESSLNGSTEEAQEKRTEIVPAQEIEVLENA---------GKAADLHPGPSPVENPLTSTDAVMITSSECSEKELPSVV
        T+GDDQT LLEGTSVTD+GTLP+SSLNGSTEE Q K TE VPAQ+IE LENA          K ADL+P  SPVENPL STDAVMITSSECSEKELPS V
Subjt:  TTGDDQTPLLEGTSVTDRGTLPESSLNGSTEEAQEKRTEIVPAQEIEVLENA---------GKAADLHPGPSPVENPLTSTDAVMITSSECSEKELPSVV

Query:  SDQCDSQQVSENDTSNSKDVSLANLQVTTCKSPCLEVDTNIDITSKNESTNDKLGSDNHTTSENQDSEGGGAANDKVHTSVNSEHITHGGEDYPKGAPFG
        SDQCDSQQVSEND SNSK+VSLA+ QVT CKS  LE DTN D+    ES  DKL SDN TTSENQ  EGG   NDKVHTSVNS H+ HGGEDY KG   G
Subjt:  SDQCDSQQVSENDTSNSKDVSLANLQVTTCKSPCLEVDTNIDITSKNESTNDKLGSDNHTTSENQDSEGGGAANDKVHTSVNSEHITHGGEDYPKGAPFG

Query:  NVMEFDPIRQHRHFCSWIATGNASPGWKQTLIALQREKSSSPHSPKNSPSASLIKVDDPVRSVRNLFTSSAKKLKSSLVSNDNTKH
        + +EFDPIRQHR+FC WIATGN +PGWKQTL ALQREKSSSPHSPKNSPSASLIKV+DPV SVRNLFTSSAKKLKSSL+SN+ TKH
Subjt:  NVMEFDPIRQHRHFCSWIATGNASPGWKQTLIALQREKSSSPHSPKNSPSASLIKVDDPVRSVRNLFTSSAKKLKSSLVSNDNTKH

XP_022924046.1 uncharacterized protein LOC111431594 [Cucurbita moschata] | e value: 0.0e+00 | % identity: 82.53
Query:  MTQDSEKRFHSIMDKLFQNGQATPNSNSASSPSSSSSPSGVQLLRGKKRPYSSSALVVGELRSKSDVIEALQKHSSASAGSTDAPLCRPWDRGDLSKRLT
        M+QDSEKRFHSIMDKLFQN  ATPNSNSASSP  SSSPSG QL RG+KRPYSSS LVVGELR+KSDVIEALQKHS+ASAGS+DAPLCRPWDRGDLSKRLT
Subjt:  MTQDSEKRFHSIMDKLFQNGQATPNSNSASSPSSSSSPSGVQLLRGKKRPYSSSALVVGELRSKSDVIEALQKHSSASAGSTDAPLCRPWDRGDLSKRLT

Query:  TFKSMTWFGKPKVVNAINCSRRGWINVDMDTIACESCGARLLFSTPSSWNQQQVEKAALVFSLKLDNGHKLLCPWIDNACDEALADFPPTPAPILVKKYG
        TFKSMTWFGKPKVVN INC+RRGWINVDMDTIACESCGARLLFSTPSSWNQQQVEKAALVFSLKLDNGHKLLCPWIDNACDEALA+FPPTP P LV K+ 
Subjt:  TFKSMTWFGKPKVVNAINCSRRGWINVDMDTIACESCGARLLFSTPSSWNQQQVEKAALVFSLKLDNGHKLLCPWIDNACDEALADFPPTPAPILVKKYG

Query:  ERCSMLLHLSALPVISSSFLKWMKSSHLKQFLEELSLKEFGNESLNNSEIEFLGDGHDSDTAKVYYQALKLISLFGWEPRSLPYVVDCKTGSDQSLKKST
        ERCSMLLHLSALPVISSSF+KWM+S HLK+FLEELSL+EFGNES   SEIE+LGDGHDS+TA+VYYQALKLISLFGWEPRSLPYVVDCKTGSDQSLKK+T
Subjt:  ERCSMLLHLSALPVISSSFLKWMKSSHLKQFLEELSLKEFGNESLNNSEIEFLGDGHDSDTAKVYYQALKLISLFGWEPRSLPYVVDCKTGSDQSLKKST

Query:  TLDSRPTVNLCTAATKENVNRNRIAEISSELQSLPNSVVLDCRLCGAGVGLWTFHTIPRPVEIIRLVGPTELNGESGTRDSGNKSVVNHSGIGNVGISSK
        TL S PTVNL TAATKENV+ N IAEISSELQS PNSVVLDCRLCGA VGLW F TIP+PVEIIRLVGPTELN ESGT DSGNKSV+NH+GI NV     
Subjt:  TLDSRPTVNLCTAATKENVNRNRIAEISSELQSLPNSVVLDCRLCGAGVGLWTFHTIPRPVEIIRLVGPTELNGESGTRDSGNKSVVNHSGIGNVGISSK

Query:  ESISNLTSTIAGGPTPARQSFKATITLPVIGQNLRARLFNDEKFIDQMYTDQEMVQADSLDKNMLQESKNDEDTTLTGQTDQSEGIRLFQNQTLDHGCST
          +S L+STIAGGPTPARQSFKATITLPVIGQNLRARLFNDEK  D+MYTDQEMVQADSLDKNMLQ+SK+ ED+TLTGQ DQ       QNQT D  CST
Subjt:  ESISNLTSTIAGGPTPARQSFKATITLPVIGQNLRARLFNDEKFIDQMYTDQEMVQADSLDKNMLQESKNDEDTTLTGQTDQSEGIRLFQNQTLDHGCST

Query:  TGDDQTPLLEGTSVTDRGTLPESSLNGSTEEAQEKRTEIVPAQEIEVLENA---------GKAADLHPGPSPVENPLTSTDAVMITSSECSEKELPSVVS
        +GDDQTPLLEG S TD+GTLPES LNGSTEE Q KRTEIVPAQEIEV+ENA          KA DLH GPSPV+  L STD+VMITSSECSEK+LPS V 
Subjt:  TGDDQTPLLEGTSVTDRGTLPESSLNGSTEEAQEKRTEIVPAQEIEVLENA---------GKAADLHPGPSPVENPLTSTDAVMITSSECSEKELPSVVS

Query:  DQCDSQQVSENDTSNSKDVSLANLQVTTCKSPCLEVDTNIDITSKNESTNDKLGSDNHTTSENQDSEGGGAANDKVHTSVNSEHITHGGEDYPKGAPFGN
        DQCD QQVSENDTSNSK+VSL +LQVT  KS C EVDTN DI S+ EST DKL SDNH TSENQD EGG  ANDKV+TSVNSEHI HGGEDYPKG P G 
Subjt:  DQCDSQQVSENDTSNSKDVSLANLQVTTCKSPCLEVDTNIDITSKNESTNDKLGSDNHTTSENQDSEGGGAANDKVHTSVNSEHITHGGEDYPKGAPFGN

Query:  VMEFDPIRQHRHFCSWIATGNASPGWKQTLIALQREKSSSPHSPKNSPSASLIKVDDPVRSVRNLFTSSAKKLKSSLVSNDNTK
        V EFDPIRQHRHFC WIATGN +PGWK TL ALQRE SSSPHSPKNSPSASLIKVDDPV SVRNLFTSSAKKLKSSLVSN++TK
Subjt:  VMEFDPIRQHRHFCSWIATGNASPGWKQTLIALQREKSSSPHSPKNSPSASLIKVDDPVRSVRNLFTSSAKKLKSSLVSNDNTK

XP_023519717.1 uncharacterized protein LOC111783071 [Cucurbita pepo subsp. pepo] | e value: 0.0e+00 | % identity: 82.78
Query:  MTQDSEKRFHSIMDKLFQNGQATPNSNSASSPSSSSSPSGVQLLRGKKRPYSSSALVVGELRSKSDVIEALQKHSSASAGSTDAPLCRPWDRGDLSKRLT
        M+QDSEKRFHSIMDKLFQN  ATPNSNSASSP  SSSPSG QL RG+KRPYSSSALVVGELR+KSDVIEALQKHS+ASAGS+DAPLCRPWDRGDLSKRLT
Subjt:  MTQDSEKRFHSIMDKLFQNGQATPNSNSASSPSSSSSPSGVQLLRGKKRPYSSSALVVGELRSKSDVIEALQKHSSASAGSTDAPLCRPWDRGDLSKRLT

Query:  TFKSMTWFGKPKVVNAINCSRRGWINVDMDTIACESCGARLLFSTPSSWNQQQVEKAALVFSLKLDNGHKLLCPWIDNACDEALADFPPTPAPILVKKYG
        TFKSMTWFGKPKVVN INC+RRGWINVDMDTIACESCG RLLFSTPSSWNQQQVEKAALVFSLKLDNGHKLLCPWIDNACDEALA+FPPTP P LV K+ 
Subjt:  TFKSMTWFGKPKVVNAINCSRRGWINVDMDTIACESCGARLLFSTPSSWNQQQVEKAALVFSLKLDNGHKLLCPWIDNACDEALADFPPTPAPILVKKYG

Query:  ERCSMLLHLSALPVISSSFLKWMKSSHLKQFLEELSLKEFGNESLNNSEIEFLGDGHDSDTAKVYYQALKLISLFGWEPRSLPYVVDCKTGSDQSLKKST
        ERCSMLLHLSALP ISSSF+KWMKS HLK+FLEELSL+EFGNES   SEIE+LGDG DS+TAKVYYQALKLISLFGWEPRSLPYVVDCKTGSDQSLKKST
Subjt:  ERCSMLLHLSALPVISSSFLKWMKSSHLKQFLEELSLKEFGNESLNNSEIEFLGDGHDSDTAKVYYQALKLISLFGWEPRSLPYVVDCKTGSDQSLKKST

Query:  TLDSRPTVNLCTAATKENVNRNRIAEISSELQSLPNSVVLDCRLCGAGVGLWTFHTIPRPVEIIRLVGPTELNGESGTRDSGNKSVVNHSGIGNVGISSK
         L S PT++L TAA KENV+ NRIAEISSELQS PNSVVLDCRLCGA VGLW FHTIPRPVEIIRLVGPTELN ESGT DSGNKSV+N +GI NV     
Subjt:  TLDSRPTVNLCTAATKENVNRNRIAEISSELQSLPNSVVLDCRLCGAGVGLWTFHTIPRPVEIIRLVGPTELNGESGTRDSGNKSVVNHSGIGNVGISSK

Query:  ESISNLTSTIAGGPTPARQSFKATITLPVIGQNLRARLFNDEKFIDQMYTDQEMVQADSLDKNMLQESKNDEDTTLTGQTDQSEGIRLFQNQTLDHGCST
          +S  TSTIAGGPTPARQSFKATITLPVIGQNLRARLFNDEK  D+MYTDQEMVQADSLDKNMLQ+SK++ED+TLTGQ DQ       QNQTLD  CST
Subjt:  ESISNLTSTIAGGPTPARQSFKATITLPVIGQNLRARLFNDEKFIDQMYTDQEMVQADSLDKNMLQESKNDEDTTLTGQTDQSEGIRLFQNQTLDHGCST

Query:  TGDDQTPLLEGTSVTDRGTLPESSLNGSTEEAQEKRTEIVPAQEIEVLENA---------GKAADLHPGPSPVENPLTSTDAVMITSSECSEKELPSVVS
        +GDDQTPL EG SVTD+GTLPES LNGSTEE Q KRTEIVPAQEIEV+ENA          KA DLH GPSPV+  L STDAVMITSSECSEK+LPS VS
Subjt:  TGDDQTPLLEGTSVTDRGTLPESSLNGSTEEAQEKRTEIVPAQEIEVLENA---------GKAADLHPGPSPVENPLTSTDAVMITSSECSEKELPSVVS

Query:  DQCDSQQVSENDTSNSKDVSLANLQVTTCKSPCLEVDTNIDITSKNESTNDKLGSDNHTTSENQDSEGGGAANDKVHTSVNSEHITHGGEDYPKGAPFGN
        DQCD QQVSENDTSNSK+VSL +LQVT  KS C EVDTN DI S+NEST DKL SDNH TSENQD EGG  ANDK++TSVNSEHI HGGEDYPKG P G 
Subjt:  DQCDSQQVSENDTSNSKDVSLANLQVTTCKSPCLEVDTNIDITSKNESTNDKLGSDNHTTSENQDSEGGGAANDKVHTSVNSEHITHGGEDYPKGAPFGN

Query:  VMEFDPIRQHRHFCSWIATGNASPGWKQTLIALQREKSSSPHSPKNSPSASLIKVDDPVRSVRNLFTSSAKKLKSSLVSNDNTK
        V EFDPIRQHRHFC WIATGN +PGWK TL ALQRE SSSPHSPKNSPSASLIKVDDPV SVRNLFTSSAKKLKSSLVSN++TK
Subjt:  VMEFDPIRQHRHFCSWIATGNASPGWKQTLIALQREKSSSPHSPKNSPSASLIKVDDPVRSVRNLFTSSAKKLKSSLVSNDNTK

XP_038895031.1 uncharacterized protein LOC120083371 [Benincasa hispida] | e value: 0.0e+00 | % identity: 84.33
Query:  MTQDSEKRFHSIMDKLFQNGQATPNSNSASSPSSSSSPSGVQLLRGKKRPYSSSALVVGELRSKSDVIEALQKHSSASAGSTDAPLCRPWDRGDLSKRLT
        M+QDSEKRFHSIMDKLFQN Q TPNSNSASSP SSSSPSGVQL RGKKRPYSSSALVVGELRSKSDVIEALQKHSSAS GS+DAPLCRPWDRGDLSKRLT
Subjt:  MTQDSEKRFHSIMDKLFQNGQATPNSNSASSPSSSSSPSGVQLLRGKKRPYSSSALVVGELRSKSDVIEALQKHSSASAGSTDAPLCRPWDRGDLSKRLT

Query:  TFKSMTWFGKPKVVNAINCSRRGWINVDMDTIACESCGARLLFSTPSSWNQQQVEKAALVFSLKLDNGHKLLCPWIDNACDEALADFPPTPAPILVKKYG
        TFKSMTWFGKPKVVNAINC+RRGWINVDMDTIACESCGARLLFSTPSSWNQQQVEKAALVFSLKLDNGHKLLCPWIDNACDEALADFPPTP PILV K+ 
Subjt:  TFKSMTWFGKPKVVNAINCSRRGWINVDMDTIACESCGARLLFSTPSSWNQQQVEKAALVFSLKLDNGHKLLCPWIDNACDEALADFPPTPAPILVKKYG

Query:  ERCSMLLHLSALPVISSSFLKWMKSSHLKQFLEELSLKEFGNESLNNSEIEFLGDGHDSDTAKVYYQALKLISLFGWEPRSLPYVVDCKTGSDQSLKKST
        ER SMLLHLS LPVISSSFLKW KS HLKQFLEEL+ +EFGN+SLN S  E+LGDGHDSDTAKVYYQALKLISLFGWEPRSLPYVVDCKTGSDQSLKKST
Subjt:  ERCSMLLHLSALPVISSSFLKWMKSSHLKQFLEELSLKEFGNESLNNSEIEFLGDGHDSDTAKVYYQALKLISLFGWEPRSLPYVVDCKTGSDQSLKKST

Query:  TLDSRPTVNLCTAATKENVNRNRIAEISSELQSLPNSVVLDCRLCGAGVGLWTFHTIPRPVEIIRLVGPTELNGESGTRDSGNKSVVNHSGIGNVGISSK
        TLDSRPTVNL TAATKENV+ NRIAE+SSELQS PNSVVLDCRLCGA  GLW FHTIPRPVEIIRLVGPTELN ESGT DS N S++NH+GIGNVG    
Subjt:  TLDSRPTVNLCTAATKENVNRNRIAEISSELQSLPNSVVLDCRLCGAGVGLWTFHTIPRPVEIIRLVGPTELNGESGTRDSGNKSVVNHSGIGNVGISSK

Query:  ESISNLTSTIAGGPTPARQSFKATITLPVIGQNLRARLFNDEKFIDQMYTDQEMVQADSLDKNMLQESKNDEDTTLTGQTDQSEGIRLFQNQTLDHGCST
          IS LTSTIAGGPTPARQSFKATITLPVIGQ+LRARLFNDEKF +++Y DQEMVQADS DKNMLQ SK++EDTT TGQ DQ E IRL QNQ LD G  T
Subjt:  ESISNLTSTIAGGPTPARQSFKATITLPVIGQNLRARLFNDEKFIDQMYTDQEMVQADSLDKNMLQESKNDEDTTLTGQTDQSEGIRLFQNQTLDHGCST

Query:  TGDDQTPLLEGTSVTDRGTLPESSLNGSTEEAQEKRTEIVPAQEIEVLENA---------GKAADLHPGPSPVENPLTSTDAVMITSSECSEKELPSVVS
        +GDDQTPLLEGTSVTD+G+LPESSLNGSTEE Q KRTEIVPAQ+ EVLENA          K+ADLHP PSPVENPLTSTDAVMITSSECSEKELPS VS
Subjt:  TGDDQTPLLEGTSVTDRGTLPESSLNGSTEEAQEKRTEIVPAQEIEVLENA---------GKAADLHPGPSPVENPLTSTDAVMITSSECSEKELPSVVS

Query:  DQCDSQQVSENDTSNSKDVSLANLQVTTCKSPCLEVDTNIDITSKNESTNDKLGSDNHTTSENQDSEGGGAANDKVHTSVNSEHITHGGEDYPKGAPFGN
         QCDSQQVSE DTSNSK+VSL + QVT CKS CLEVDTN DI   NES  DKLGSDNHTTSENQD  GGG   DKVHTSVNS+HI HGGEDY KG   G+
Subjt:  DQCDSQQVSENDTSNSKDVSLANLQVTTCKSPCLEVDTNIDITSKNESTNDKLGSDNHTTSENQDSEGGGAANDKVHTSVNSEHITHGGEDYPKGAPFGN

Query:  VMEFDPIRQHRHFCSWIATGNASPGWKQTLIALQREKSSSPHSPKNSPSASLIKVDDPVRSVRNLFTSSAKKLKSSLVSNDNTKH
        +MEFDPIRQHR FC WIATGN +PGWKQTL ALQREK+SSPHSP+N+PSASLIKVDDPV SVRNLFTSSAKKLKSSLVSN++TKH
Subjt:  VMEFDPIRQHRHFCSWIATGNASPGWKQTLIALQREKSSSPHSPKNSPSASLIKVDDPVRSVRNLFTSSAKKLKSSLVSNDNTKH

TrEMBL top hits
A0A0A0LQC5 C3HC-type domain-containing protein | e value: 0.0e+00 | % identity: 80.03
Query:  MTQDSEKRFHSIMDKLFQNGQATPNSNSASSPSSSSSPSGVQLLRGKKRPYSSSALVVGELRSKSDVIEALQKHSSASAGSTDAPLCRPWDRGDLSKRLT
        M+QDSEKRFHSIMDKLFQN Q+TPNSNSASSPSSSSSPSGVQL RG+KRPYSSSALVVGELRSKSDVIEALQKHSSAS GS+DAPLCRPWDRGDL KRL 
Subjt:  MTQDSEKRFHSIMDKLFQNGQATPNSNSASSPSSSSSPSGVQLLRGKKRPYSSSALVVGELRSKSDVIEALQKHSSASAGSTDAPLCRPWDRGDLSKRLT

Query:  TFKSMTWFGKPKVVNAINCSRRGWINVDMDTIACESCGARLLFSTPSSWNQQQVEKAALVFSLKLDNGHKLLCPWIDNACDEALADFPPTPAPILVKKYG
        TFKSMTWFGKPKVVNAINC+RRGW+NVD DTIACESCGARLLFSTPSSWNQQQVEKAALVFSLKLDNGHKLLCPWIDNACDEALADFPPTP P+LV K+ 
Subjt:  TFKSMTWFGKPKVVNAINCSRRGWINVDMDTIACESCGARLLFSTPSSWNQQQVEKAALVFSLKLDNGHKLLCPWIDNACDEALADFPPTPAPILVKKYG

Query:  ERCSMLLHLSALPVISSSFLKWMKSSHLKQFLEELSLKEFGNESLNNSEIEFLGDGHDSDTAKVYYQALKLISLFGWEPRSLPYVVDCKTG-SDQSLKKS
        ER SMLL LSALPVISSSFLKWM S HLKQF+EEL+ + FGNESL+ SE+E+LGDGHDSDT KVYYQALKLISLFGWEPRSLPYVVDCK+G SDQSLKKS
Subjt:  ERCSMLLHLSALPVISSSFLKWMKSSHLKQFLEELSLKEFGNESLNNSEIEFLGDGHDSDTAKVYYQALKLISLFGWEPRSLPYVVDCKTG-SDQSLKKS

Query:  TTLDSRPTVNLCTAATKENVNRNRIAEISSELQSLPNSVVLDCRLCGAGVGLWTFHTIPRPVEIIRLVGPTELNGESGTRDSGNKSVVNHSGIGNVGISS
        TT DSRPTV+L T  TKENV  NRIAE+SSELQS PNSVVLDCRLCGA VGLWTFHTIPRPVEIIRLVG TELN ESGT DSGNKSV+NH+GIGNVG   
Subjt:  TTLDSRPTVNLCTAATKENVNRNRIAEISSELQSLPNSVVLDCRLCGAGVGLWTFHTIPRPVEIIRLVGPTELNGESGTRDSGNKSVVNHSGIGNVGISS

Query:  KESISNLTSTIAGGPTPARQSFKATITLPVIGQNLRARLFNDEKFIDQMYTDQEMVQADSLDKNMLQESKNDEDTTLTGQTDQSEGIRLFQNQTLDHGCS
           IS LTSTIAGGPTPARQSFKATITLPVIGQ+LRARLF+DEKF DQ+Y DQEMVQADS DK M Q SK++ED   TG+TDQ +  RL QNQTLD GC 
Subjt:  KESISNLTSTIAGGPTPARQSFKATITLPVIGQNLRARLFNDEKFIDQMYTDQEMVQADSLDKNMLQESKNDEDTTLTGQTDQSEGIRLFQNQTLDHGCS

Query:  TTGDDQTPLLEGTSVTDRGTLPESSLNGSTEEAQEKRTEIVPAQEIEVLENA---------GKAADLHPGPSPVENPLTSTDAVMITSSECSEKELPSVV
        T+GDDQTPLLEGTSVTD GTLP+SSLNGSTEE + K TE VPAQ+IEV ENA          K ADLHP  SP ENPLTSTDA MITS+ECSEKELPS V
Subjt:  TTGDDQTPLLEGTSVTDRGTLPESSLNGSTEEAQEKRTEIVPAQEIEVLENA---------GKAADLHPGPSPVENPLTSTDAVMITSSECSEKELPSVV

Query:  SDQCDSQQVSENDTSNSKDVSLANLQVTTCKSPCLEVDTNIDITSKNESTNDKLGSDNHTTSENQDSEGGGAANDKVHTSVNSEHITHGGEDYPKGAPFG
        SDQCD+        SNSK++SLA+ Q+T+CKS  LE DT+ DI    ES  DKLGSDNHTT ENQ  EGGG +NDKVHTS+NS H+ HGGEDY KG    
Subjt:  SDQCDSQQVSENDTSNSKDVSLANLQVTTCKSPCLEVDTNIDITSKNESTNDKLGSDNHTTSENQDSEGGGAANDKVHTSVNSEHITHGGEDYPKGAPFG

Query:  NVMEFDPIRQHRHFCSWIATGNASPGWKQTLIALQREKSSSPHSPKNSPSASLIKVDDPVRSVRNLFTSSAKKLKSSLVSNDNTKH
          +EFDPIRQHR+FC WIATGN +PGWKQTL ALQREK SSPHSPKNSPSASLIKV+DPV SVRNLFTSSAKKLKSSLVSN+ TKH
Subjt:  NVMEFDPIRQHRHFCSWIATGNASPGWKQTLIALQREKSSSPHSPKNSPSASLIKVDDPVRSVRNLFTSSAKKLKSSLVSNDNTKH

A0A1S4DWH4 uncharacterized protein LOC103495850 isoform X1 | e value: 0.0e+00 | % identity: 81.04
Query:  MTQDSEKRFHSIMDKLFQNGQATPNSNSASSPSSSSSPSGVQLLRGKKRPYSSSALVVGELRSKSDVIEALQKHSSASAGSTDAPLCRPWDRGDLSKRLT
        M+QDSEKRFHSIMDKLFQN Q++PNSNSASS SSSSSPSGVQL RGKKRPYSSSALVVGELRSKSDVIEALQKHSSAS GS+DAPLCRPWDRGDL KRL 
Subjt:  MTQDSEKRFHSIMDKLFQNGQATPNSNSASSPSSSSSPSGVQLLRGKKRPYSSSALVVGELRSKSDVIEALQKHSSASAGSTDAPLCRPWDRGDLSKRLT

Query:  TFKSMTWFGKPKVVNAINCSRRGWINVDMDTIACESCGARLLFSTPSSWNQQQVEKAALVFSLKLDNGHKLLCPWIDNACDEALADFPPTPAPILVKKYG
        TFKSMTWFGKPKVVNAINC+RRGW+NVD DTIACESCGARLLFSTPSSWNQQQVEKAALVFSLKLDNGHKLLCPWIDNACDEALADFPPTP P+LV K+ 
Subjt:  TFKSMTWFGKPKVVNAINCSRRGWINVDMDTIACESCGARLLFSTPSSWNQQQVEKAALVFSLKLDNGHKLLCPWIDNACDEALADFPPTPAPILVKKYG

Query:  ERCSMLLHLSALPVISSSFLKWMKSSHLKQFLEELSLKEFGNESLNNSEIEFLGDGHDSDTAKVYYQALKLISLFGWEPRSLPYVVDCKT-GSDQSLKKS
        ER SMLLHLSALPVISSSFLKWM S HL QF+EEL+L  FGNESL+ SE+E+LGDGHDSDT KVYYQALKLISLFGWEPRS+PY+V+CK+ GSDQSLKKS
Subjt:  ERCSMLLHLSALPVISSSFLKWMKSSHLKQFLEELSLKEFGNESLNNSEIEFLGDGHDSDTAKVYYQALKLISLFGWEPRSLPYVVDCKT-GSDQSLKKS

Query:  TTLDSRPTVNLCTAATKENVNRNRIAEISSELQSLPNSVVLDCRLCGAGVGLWTFHTIPRPVEIIRLVGPTELNGESGTRDSGNKSVVNHSGIGNVGISS
        TT DS PTV+L T ATKENV+ NRIAE+SSELQS PNSVVLDCRLCGA VGLWTFHTIPRPVEIIRLVGPTELN ESGT DSGNKSV+NH+GIG+VG   
Subjt:  TTLDSRPTVNLCTAATKENVNRNRIAEISSELQSLPNSVVLDCRLCGAGVGLWTFHTIPRPVEIIRLVGPTELNGESGTRDSGNKSVVNHSGIGNVGISS

Query:  KESISNLTSTIAGGPTPARQSFKATITLPVIGQNLRARLFNDEKFIDQMYTDQEMVQADSLDKNMLQESKNDEDTTLTGQTDQSEGIRLFQNQTLDHGCS
           IS LTSTIAGGPTPARQSFKATITLPVIGQ+LRARLFNDEKF DQ+Y DQEMVQADS D+ + + SK++EDTT +GQTDQ E  RL QNQT+D GC 
Subjt:  KESISNLTSTIAGGPTPARQSFKATITLPVIGQNLRARLFNDEKFIDQMYTDQEMVQADSLDKNMLQESKNDEDTTLTGQTDQSEGIRLFQNQTLDHGCS

Query:  TTGDDQTPLLEGTSVTDRGTLPESSLNGSTEEAQEKRTEIVPAQEIEVLENA---------GKAADLHPGPSPVENPLTSTDAVMITSSECSEKELPSVV
        T+GDDQT LLEGTSVTD+GTLP+SSLNGSTEE Q K TE VPAQ+IE LENA          K ADL+P  SPVENPL STDAVMITSSECSEKELPS V
Subjt:  TTGDDQTPLLEGTSVTDRGTLPESSLNGSTEEAQEKRTEIVPAQEIEVLENA---------GKAADLHPGPSPVENPLTSTDAVMITSSECSEKELPSVV

Query:  SDQCDSQQVSENDTSNSKDVSLANLQVTTCKSPCLEVDTNIDITSKNESTNDKLGSDNHTTSENQDSEGGGAANDKVHTSVNSEHITHGGEDYPKGAPFG
        SDQCDSQQVSEND SNSK+VSLA+ QVT CKS  LE DTN D+    ES  DKL SDN TTSENQ  EGG   NDKVHTSVNS H+ HGGEDY KG   G
Subjt:  SDQCDSQQVSENDTSNSKDVSLANLQVTTCKSPCLEVDTNIDITSKNESTNDKLGSDNHTTSENQDSEGGGAANDKVHTSVNSEHITHGGEDYPKGAPFG

Query:  NVMEFDPIRQHRHFCSWIATGNASPGWKQTLIALQREKSSSPHSPKNSPSASLIKVDDPVRSVRNLFTSSAKKLKSSLVSNDNTKH
        + +EFDPIRQHR+FC WIATGN +PGWKQTL ALQREKSSSPHSPKNSPSASLIKV+DPV SVRNLFTSSAKKLKSSL+SN+ TKH
Subjt:  NVMEFDPIRQHRHFCSWIATGNASPGWKQTLIALQREKSSSPHSPKNSPSASLIKVDDPVRSVRNLFTSSAKKLKSSLVSNDNTKH

A0A5D3BI62 C3HC zinc finger-like, putative isoform 1 | e value: 0.0e+00 | % identity: 81.04
Query:  MTQDSEKRFHSIMDKLFQNGQATPNSNSASSPSSSSSPSGVQLLRGKKRPYSSSALVVGELRSKSDVIEALQKHSSASAGSTDAPLCRPWDRGDLSKRLT
        M+QDSEKRFHSIMDKLFQN Q++PNSNSASS SSSSSPSGVQL RGKKRPYSSSALVVGELRSKSDVIEALQKHSSAS GS+DAPLCRPWDRGDL KRL 
Subjt:  MTQDSEKRFHSIMDKLFQNGQATPNSNSASSPSSSSSPSGVQLLRGKKRPYSSSALVVGELRSKSDVIEALQKHSSASAGSTDAPLCRPWDRGDLSKRLT

Query:  TFKSMTWFGKPKVVNAINCSRRGWINVDMDTIACESCGARLLFSTPSSWNQQQVEKAALVFSLKLDNGHKLLCPWIDNACDEALADFPPTPAPILVKKYG
        TFKSMTWFGKPKVVNAINC+RRGW+NVD DTIACESCGARLLFSTPSSWNQQQVEKAALVFSLKLDNGHKLLCPWIDNACDEALADFPPTP P+LV K+ 
Subjt:  TFKSMTWFGKPKVVNAINCSRRGWINVDMDTIACESCGARLLFSTPSSWNQQQVEKAALVFSLKLDNGHKLLCPWIDNACDEALADFPPTPAPILVKKYG

Query:  ERCSMLLHLSALPVISSSFLKWMKSSHLKQFLEELSLKEFGNESLNNSEIEFLGDGHDSDTAKVYYQALKLISLFGWEPRSLPYVVDCKT-GSDQSLKKS
        ER SMLLHLSALPVISSSFLKWM S HL QF+EEL+L  FGNESL+ SE+E+LGDGHDSDT KVYYQALKLISLFGWEPRS+PY+V+CK+ GSDQSLKKS
Subjt:  ERCSMLLHLSALPVISSSFLKWMKSSHLKQFLEELSLKEFGNESLNNSEIEFLGDGHDSDTAKVYYQALKLISLFGWEPRSLPYVVDCKT-GSDQSLKKS

Query:  TTLDSRPTVNLCTAATKENVNRNRIAEISSELQSLPNSVVLDCRLCGAGVGLWTFHTIPRPVEIIRLVGPTELNGESGTRDSGNKSVVNHSGIGNVGISS
        TT DS PTV+L T ATKENV+ NRIAE+SSELQS PNSVVLDCRLCGA VGLWTFHTIPRPVEIIRLVGPTELN ESGT DSGNKSV+NH+GIG+VG   
Subjt:  TTLDSRPTVNLCTAATKENVNRNRIAEISSELQSLPNSVVLDCRLCGAGVGLWTFHTIPRPVEIIRLVGPTELNGESGTRDSGNKSVVNHSGIGNVGISS

Query:  KESISNLTSTIAGGPTPARQSFKATITLPVIGQNLRARLFNDEKFIDQMYTDQEMVQADSLDKNMLQESKNDEDTTLTGQTDQSEGIRLFQNQTLDHGCS
           IS LTSTIAGGPTPARQSFKATITLPVIGQ+LRARLFNDEKF DQ+Y DQEMVQADS D+ + + SK++EDTT +GQTDQ E  RL QNQT+D GC 
Subjt:  KESISNLTSTIAGGPTPARQSFKATITLPVIGQNLRARLFNDEKFIDQMYTDQEMVQADSLDKNMLQESKNDEDTTLTGQTDQSEGIRLFQNQTLDHGCS

Query:  TTGDDQTPLLEGTSVTDRGTLPESSLNGSTEEAQEKRTEIVPAQEIEVLENA---------GKAADLHPGPSPVENPLTSTDAVMITSSECSEKELPSVV
        T+GDDQT LLEGTSVTD+GTLP+SSLNGSTEE Q K TE VPAQ+IE LENA          K ADL+P  SPVENPL STDAVMITSSECSEKELPS V
Subjt:  TTGDDQTPLLEGTSVTDRGTLPESSLNGSTEEAQEKRTEIVPAQEIEVLENA---------GKAADLHPGPSPVENPLTSTDAVMITSSECSEKELPSVV

Query:  SDQCDSQQVSENDTSNSKDVSLANLQVTTCKSPCLEVDTNIDITSKNESTNDKLGSDNHTTSENQDSEGGGAANDKVHTSVNSEHITHGGEDYPKGAPFG
        SDQCDSQQVSEND SNSK+VSLA+ QVT CKS  LE DTN D+    ES  DKL SDN TTSENQ  EGG   NDKVHTSVNS H+ HGGEDY KG   G
Subjt:  SDQCDSQQVSENDTSNSKDVSLANLQVTTCKSPCLEVDTNIDITSKNESTNDKLGSDNHTTSENQDSEGGGAANDKVHTSVNSEHITHGGEDYPKGAPFG

Query:  NVMEFDPIRQHRHFCSWIATGNASPGWKQTLIALQREKSSSPHSPKNSPSASLIKVDDPVRSVRNLFTSSAKKLKSSLVSNDNTKH
        + +EFDPIRQHR+FC WIATGN +PGWKQTL ALQREKSSSPHSPKNSPSASLIKV+DPV SVRNLFTSSAKKLKSSL+SN+ TKH
Subjt:  NVMEFDPIRQHRHFCSWIATGNASPGWKQTLIALQREKSSSPHSPKNSPSASLIKVDDPVRSVRNLFTSSAKKLKSSLVSNDNTKH

A0A6J1E8G0 uncharacterized protein LOC111431594 | e value: 0.0e+00 | % identity: 82.53
Query:  MTQDSEKRFHSIMDKLFQNGQATPNSNSASSPSSSSSPSGVQLLRGKKRPYSSSALVVGELRSKSDVIEALQKHSSASAGSTDAPLCRPWDRGDLSKRLT
        M+QDSEKRFHSIMDKLFQN  ATPNSNSASSP  SSSPSG QL RG+KRPYSSS LVVGELR+KSDVIEALQKHS+ASAGS+DAPLCRPWDRGDLSKRLT
Subjt:  MTQDSEKRFHSIMDKLFQNGQATPNSNSASSPSSSSSPSGVQLLRGKKRPYSSSALVVGELRSKSDVIEALQKHSSASAGSTDAPLCRPWDRGDLSKRLT

Query:  TFKSMTWFGKPKVVNAINCSRRGWINVDMDTIACESCGARLLFSTPSSWNQQQVEKAALVFSLKLDNGHKLLCPWIDNACDEALADFPPTPAPILVKKYG
        TFKSMTWFGKPKVVN INC+RRGWINVDMDTIACESCGARLLFSTPSSWNQQQVEKAALVFSLKLDNGHKLLCPWIDNACDEALA+FPPTP P LV K+ 
Subjt:  TFKSMTWFGKPKVVNAINCSRRGWINVDMDTIACESCGARLLFSTPSSWNQQQVEKAALVFSLKLDNGHKLLCPWIDNACDEALADFPPTPAPILVKKYG

Query:  ERCSMLLHLSALPVISSSFLKWMKSSHLKQFLEELSLKEFGNESLNNSEIEFLGDGHDSDTAKVYYQALKLISLFGWEPRSLPYVVDCKTGSDQSLKKST
        ERCSMLLHLSALPVISSSF+KWM+S HLK+FLEELSL+EFGNES   SEIE+LGDGHDS+TA+VYYQALKLISLFGWEPRSLPYVVDCKTGSDQSLKK+T
Subjt:  ERCSMLLHLSALPVISSSFLKWMKSSHLKQFLEELSLKEFGNESLNNSEIEFLGDGHDSDTAKVYYQALKLISLFGWEPRSLPYVVDCKTGSDQSLKKST

Query:  TLDSRPTVNLCTAATKENVNRNRIAEISSELQSLPNSVVLDCRLCGAGVGLWTFHTIPRPVEIIRLVGPTELNGESGTRDSGNKSVVNHSGIGNVGISSK
        TL S PTVNL TAATKENV+ N IAEISSELQS PNSVVLDCRLCGA VGLW F TIP+PVEIIRLVGPTELN ESGT DSGNKSV+NH+GI NV     
Subjt:  TLDSRPTVNLCTAATKENVNRNRIAEISSELQSLPNSVVLDCRLCGAGVGLWTFHTIPRPVEIIRLVGPTELNGESGTRDSGNKSVVNHSGIGNVGISSK

Query:  ESISNLTSTIAGGPTPARQSFKATITLPVIGQNLRARLFNDEKFIDQMYTDQEMVQADSLDKNMLQESKNDEDTTLTGQTDQSEGIRLFQNQTLDHGCST
          +S L+STIAGGPTPARQSFKATITLPVIGQNLRARLFNDEK  D+MYTDQEMVQADSLDKNMLQ+SK+ ED+TLTGQ DQ       QNQT D  CST
Subjt:  ESISNLTSTIAGGPTPARQSFKATITLPVIGQNLRARLFNDEKFIDQMYTDQEMVQADSLDKNMLQESKNDEDTTLTGQTDQSEGIRLFQNQTLDHGCST

Query:  TGDDQTPLLEGTSVTDRGTLPESSLNGSTEEAQEKRTEIVPAQEIEVLENA---------GKAADLHPGPSPVENPLTSTDAVMITSSECSEKELPSVVS
        +GDDQTPLLEG S TD+GTLPES LNGSTEE Q KRTEIVPAQEIEV+ENA          KA DLH GPSPV+  L STD+VMITSSECSEK+LPS V 
Subjt:  TGDDQTPLLEGTSVTDRGTLPESSLNGSTEEAQEKRTEIVPAQEIEVLENA---------GKAADLHPGPSPVENPLTSTDAVMITSSECSEKELPSVVS

Query:  DQCDSQQVSENDTSNSKDVSLANLQVTTCKSPCLEVDTNIDITSKNESTNDKLGSDNHTTSENQDSEGGGAANDKVHTSVNSEHITHGGEDYPKGAPFGN
        DQCD QQVSENDTSNSK+VSL +LQVT  KS C EVDTN DI S+ EST DKL SDNH TSENQD EGG  ANDKV+TSVNSEHI HGGEDYPKG P G 
Subjt:  DQCDSQQVSENDTSNSKDVSLANLQVTTCKSPCLEVDTNIDITSKNESTNDKLGSDNHTTSENQDSEGGGAANDKVHTSVNSEHITHGGEDYPKGAPFGN

Query:  VMEFDPIRQHRHFCSWIATGNASPGWKQTLIALQREKSSSPHSPKNSPSASLIKVDDPVRSVRNLFTSSAKKLKSSLVSNDNTK
        V EFDPIRQHRHFC WIATGN +PGWK TL ALQRE SSSPHSPKNSPSASLIKVDDPV SVRNLFTSSAKKLKSSLVSN++TK
Subjt:  VMEFDPIRQHRHFCSWIATGNASPGWKQTLIALQREKSSSPHSPKNSPSASLIKVDDPVRSVRNLFTSSAKKLKSSLVSNDNTK

A0A6J1KEG3 uncharacterized protein LOC111495084 | e value: 0.0e+00 | % identity: 82.14
Query:  MTQDSEKRFHSIMDKLFQNGQATPNSNSASSPSSSSSPSGVQLLRGKKRPYSSSALVVGELRSKSDVIEALQKHSSASAGSTDAPLCRPWDRGDLSKRLT
        M+QDSEKRFHSIMDKLFQN  ATPNSNSASSP  SSSPSG QL RG+KRPYSSSALVVGELR+KSDVIEALQKHS+ASAGS+DAPLCRPWDRGDLSKRLT
Subjt:  MTQDSEKRFHSIMDKLFQNGQATPNSNSASSPSSSSSPSGVQLLRGKKRPYSSSALVVGELRSKSDVIEALQKHSSASAGSTDAPLCRPWDRGDLSKRLT

Query:  TFKSMTWFGKPKVVNAINCSRRGWINVDMDTIACESCGARLLFSTPSSWNQQQVEKAALVFSLKLDNGHKLLCPWIDNACDEALADFPPTPAPILVKKYG
        TFKSMTWFGKPKVVN INC+RRGWINVDMDTIACESCGARLLFSTPSSWNQQQVEKAALVFSLKLDNGHKLLCPWIDNACDEALA+FPPTP P LV K+ 
Subjt:  TFKSMTWFGKPKVVNAINCSRRGWINVDMDTIACESCGARLLFSTPSSWNQQQVEKAALVFSLKLDNGHKLLCPWIDNACDEALADFPPTPAPILVKKYG

Query:  ERCSMLLHLSALPVISSSFLKWMKSSHLKQFLEELSLKEFGNESLNNSEIEFLGDGHDSDTAKVYYQALKLISLFGWEPRSLPYVVDCKTGSDQSLKKST
        ERCSMLLHLSALPVI SSF+KWMKS HLK+FLEELSL+E GNES   SEIE+LGDGHDS+TA+VYYQALKLISLFGWEPRSLPYVVDCKTGSDQSLKKST
Subjt:  ERCSMLLHLSALPVISSSFLKWMKSSHLKQFLEELSLKEFGNESLNNSEIEFLGDGHDSDTAKVYYQALKLISLFGWEPRSLPYVVDCKTGSDQSLKKST

Query:  TLDSRPTVNLCTAATKENVNRNRIAEISSELQSLPNSVVLDCRLCGAGVGLWTFHTIPRPVEIIRLVGPTELNGESGTRDSGNKSVVNHSGIGNVGISSK
        TL S PTVNL TAATKENV+ N IAEISSELQS PNSVVLDCRLCGA VGLW F TIP+PVEIIRLVGPTELN ESGT DSGNKSV+NH+GI NV     
Subjt:  TLDSRPTVNLCTAATKENVNRNRIAEISSELQSLPNSVVLDCRLCGAGVGLWTFHTIPRPVEIIRLVGPTELNGESGTRDSGNKSVVNHSGIGNVGISSK

Query:  ESISNLTSTIAGGPTPARQSFKATITLPVIGQNLRARLFNDEKFIDQMYTDQEMVQADSLDKNMLQESKNDEDTTLTGQTDQSEGIRLFQNQTLDHGCST
          +S L+STIAGGPTPARQSFKATITLPVIGQNLRARLF+DEK  D+MYTDQEMVQ DSLDKNMLQ+SK+ ED+TLTGQ DQ       QNQT D  CST
Subjt:  ESISNLTSTIAGGPTPARQSFKATITLPVIGQNLRARLFNDEKFIDQMYTDQEMVQADSLDKNMLQESKNDEDTTLTGQTDQSEGIRLFQNQTLDHGCST

Query:  TGDDQTPLLEGTSVTDRGTLPESSLNGSTEEAQEKRTEIVPAQEIEVLENA---------GKAADLHPGPSPVENPLTSTDAVMITSSECSEKELPSVVS
        +GDDQTPLLEG SVTD+GTLPES LNGSTEE Q KRTEIVPAQEIEV+ENA          KA DLH G SPV+  L STD+VMITSSECSEK+LPS VS
Subjt:  TGDDQTPLLEGTSVTDRGTLPESSLNGSTEEAQEKRTEIVPAQEIEVLENA---------GKAADLHPGPSPVENPLTSTDAVMITSSECSEKELPSVVS

Query:  DQCDSQQVSENDTSNSKDVSLANLQVTTCKSPCLEVDTNIDITSKNESTNDKLGSDNHTTSENQDSEGGGAANDKVHTSVNSEHITHGGEDYPKGAPFGN
        DQCD QQVS NDTSNSK+VSL +LQVT  KS C EVDTN DI S++EST DKL SDNH TSENQD E G  ANDKV+TSVNSEHI HGGEDYPKG P G 
Subjt:  DQCDSQQVSENDTSNSKDVSLANLQVTTCKSPCLEVDTNIDITSKNESTNDKLGSDNHTTSENQDSEGGGAANDKVHTSVNSEHITHGGEDYPKGAPFGN

Query:  VMEFDPIRQHRHFCSWIATGNASPGWKQTLIALQREKSSSPHSPKNSPSASLIKVDDPVRSVRNLFTSSAKKLKSSLVSNDNTK
        V EFDPIRQHRHFC WI+TGN +PGWK TL ALQRE SSSPHSPKNSPSASLIKVDDPV SVRNLFTSSAKKLKSSLVSN++TK
Subjt:  VMEFDPIRQHRHFCSWIATGNASPGWKQTLIALQREKSSSPHSPKNSPSASLIKVDDPVRSVRNLFTSSAKKLKSSLVSNDNTK

SwissProt top hits
No hits found
Arabidopsis top hits
AT1G17210.1 IAP-like protein 1 | e value: 1.0e-47 | % identity: 32.96
Query:  QNGQATPNSNSASSPSSSSSPSGVQLLRGKKR---PYSSSALVVGELRSKSDVIEALQKHSSASAGSTDAPLCRPWDRGDLSKRLTTFKSMTWFGKPKVV
        QN     N NS +S S+S+S + V   R + R   P  ++A       S + ++ A     +    +     CR WDRGDL +RL TFK   W GKPK  
Subjt:  QNGQATPNSNSASSPSSSSSPSGVQLLRGKKR---PYSSSALVVGELRSKSDVIEALQKHSSASAGSTDAPLCRPWDRGDLSKRLTTFKSMTWFGKPKVV

Query:  NAINCSRRGWINVDMDTIACESCGARLLFSTP-SSWNQQQVEKAALVFSLKLDNGHKLLCPWIDNACDEALADFPPTPAPILVKKYGERCSMLLHLSALP
        +++ C+++GW++VD+D + CE CG+ L +S P  S N  + +     FS +LD+ H+  CPW+  +C E+L  FPPTP   L+  Y +RC  LL   +LP
Subjt:  NAINCSRRGWINVDMDTIACESCGARLLFSTP-SSWNQQQVEKAALVFSLKLDNGHKLLCPWIDNACDEALADFPPTPAPILVKKYGERCSMLLHLSALP

Query:  VISSSFLKWMKSSHLKQFLEELSLKEFGNESLNNSEIEFLGDGHDSDTAKVYYQALKLISLFGWEPRSLPYVVDCKTGSDQSLK----------KSTTLD
        ++S S +  M++S   Q ++ L      + S     I    + +  +    Y +A KLISL GWEPR LP + DC+  S QS +          +S   D
Subjt:  VISSSFLKWMKSSHLKQFLEELSLKEFGNESLNNSEIEFLGDGHDSDTAKVYYQALKLISLFGWEPRSLPYVVDCKTGSDQSLK----------KSTTLD

Query:  SRPTVNLCTAATKENVNRNRIAEISSELQSLPNSVVLDCRLCGAGVGLWTFHTIPRPV
          P+    +A++++      +  +  E +S     +LDC LCG  V +  F T  RPV
Subjt:  SRPTVNLCTAATKENVNRNRIAEISSELQSLPNSVVLDCRLCGAGVGLWTFHTIPRPV

AT1G48950.1 C3HC zinc finger-like | e value: 5.6e-115 | % identity: 36.9
Query:  MTQDSEKRFHSIMDKLFQNGQATPNSNSASSPSSSSSPSGVQLLRGKKRPYSSSALVVGELRSKSDVIEALQKHSSA---SAGSTDAPLCRPWDRGDLSK
        M QDSEKRFH IMDKLF     TP+ +    PSSS+S S  Q  RGKKR   SSAL + E +    ++ A    SSA    AG++ + LCRPWDRGDL +
Subjt:  MTQDSEKRFHSIMDKLFQNGQATPNSNSASSPSSSSSPSGVQLLRGKKRPYSSSALVVGELRSKSDVIEALQKHSSA---SAGSTDAPLCRPWDRGDLSK

Query:  RLTTFKSMTWFGKPKVVNAINCSRRGWINVDMDTIACESCGARLLFSTPSSWNQQQVEKAALVFSLKLDNGHKLLCPWIDNACDEALADFPPTPAPILVK
        RL TFKSMTWF KP+V++A+NC+RRGW+N D D+IACESCGA L FS PSSW++QQVEKAA VFSLKL++GHKLLCPWI+N+C+E L++FP      LV 
Subjt:  RLTTFKSMTWFGKPKVVNAINCSRRGWINVDMDTIACESCGARLLFSTPSSWNQQQVEKAALVFSLKLDNGHKLLCPWIDNACDEALADFPPTPAPILVK

Query:  KYGERCSMLLHLSALPVISSSFLKWMKSSHLKQFLEELSLKEFGNESLNNSEIEFLGDGHDSDTAKVYYQALKLISLFGWEPRSLPYVVDCKTGSDQSLK
        ++ ER   LL L ALPVIS S +++M+SS L++FL+        + +  +S+ E L +   +  A+++YQA KLISL GWEPR+LPY+VDCK    ++ +
Subjt:  KYGERCSMLLHLSALPVISSSFLKWMKSSHLKQFLEELSLKEFGNESLNNSEIEFLGDGHDSDTAKVYYQALKLISLFGWEPRSLPYVVDCKTGSDQSLK

Query:  KSTTLDSRP--------TVNLCTAATKENVNRNRIAEISSELQSLPNSVVLDCRLCGAGVGLWTFHTIPRPVEIIRLVGPTELNGESGTRDSGNKSVVNH
         + T+D  P        +++  T         N    +   L S P+SVVLDC+LCGA VGLW F T+PRP+E+ R+ G TE+N E   +          
Subjt:  KSTTLDSRP--------TVNLCTAATKENVNRNRIAEISSELQSLPNSVVLDCRLCGAGVGLWTFHTIPRPVEIIRLVGPTELNGESGTRDSGNKSVVNH

Query:  SGIGNVGISSKESISNLTSTIAGGPTPARQSFKATITLPVIGQNLRARLFNDEKFIDQMYTDQEMVQADSLDKNMLQESKNDEDTTLTGQTDQSEGIRLF
              G + +   S+L  TIAGGP   +Q+FKATI+LP+IG+NLR+R  +  +       D +     S+     + ++N+ D T              
Subjt:  SGIGNVGISSKESISNLTSTIAGGPTPARQSFKATITLPVIGQNLRARLFNDEKFIDQMYTDQEMVQADSLDKNMLQESKNDEDTTLTGQTDQSEGIRLF

Query:  QNQTLDHGCSTTGDDQTPLLEGTSVTDRGTLPESSLNGSTEEAQEKRTEIVPAQEIEVLENAGKAADLHPGPSPVENPLTSTDAVMITSSECSEKELPSV
        QN                                                      +V+ + G+ AD                                 
Subjt:  QNQTLDHGCSTTGDDQTPLLEGTSVTDRGTLPESSLNGSTEEAQEKRTEIVPAQEIEVLENAGKAADLHPGPSPVENPLTSTDAVMITSSECSEKELPSV

Query:  VSDQCDSQQVSENDTSNSKDVSLANLQVTTCKSPCLEVDTNIDITSKNESTNDKLGSDNHTTSENQDSEGGGAANDKVHTSVNSEHITHGGEDYPKGAPF
                    N T    D++L N             D  + +   N   N+K      +T+E                               K A  
Subjt:  VSDQCDSQQVSENDTSNSKDVSLANLQVTTCKSPCLEVDTNIDITSKNESTNDKLGSDNHTTSENQDSEGGGAANDKVHTSVNSEHITHGGEDYPKGAPF

Query:  GNVMEFDPIRQHRHFCSWI-ATGNASPGWKQTLIALQREKSSSPHSPKNSPSASLIKVDDPVRSVRNLFTSSAKK
           MEFDPI+QHRHFC WI +TG   PGW+QTL ALQR K S    P    S+SL KVDDP+ SVRNLF S + K
Subjt:  GNVMEFDPIRQHRHFCSWI-ATGNASPGWKQTLIALQREKSSSPHSPKNSPSASLIKVDDPVRSVRNLFTSSAKK


Sequences
CDS sequence
ATGACACAAGATTCAGAGAAGAGGTTCCATTCCATCATGGACAAGCTCTTTCAGAATGGACAAGCCACTCCAAACTCAAATTCCGCATCTTCCCCATCCTCCTCCTCCAG
TCCGTCAGGAGTACAATTGTTGAGAGGGAAAAAGCGCCCATATTCTTCTTCTGCTCTGGTAGTGGGAGAGCTGAGGTCAAAAAGTGATGTAATTGAGGCATTGCAGAAGC
ATTCTTCAGCTTCTGCTGGATCCACTGATGCTCCATTATGCAGGCCTTGGGACCGTGGAGATCTTTCGAAAAGATTAACCACATTCAAGTCGATGACATGGTTTGGCAAA
CCTAAGGTGGTAAATGCTATAAATTGTTCTAGAAGAGGTTGGATCAATGTAGATATGGATACTATTGCCTGTGAATCATGTGGAGCACGTCTCCTTTTCTCTACTCCATC
TTCCTGGAATCAGCAACAAGTTGAGAAAGCCGCTTTGGTATTTAGCTTAAAGTTGGATAATGGGCACAAGTTACTCTGTCCCTGGATAGATAATGCCTGTGATGAAGCAT
TGGCTGATTTTCCTCCTACCCCTGCTCCAATTTTAGTTAAAAAATATGGAGAGCGTTGTTCTATGTTATTACATCTTTCAGCTCTCCCTGTTATTTCGTCTTCATTTCTC
AAATGGATGAAGAGTTCCCACCTCAAGCAATTTCTTGAAGAATTATCCTTGAAGGAATTTGGTAATGAGTCTCTTAACAACTCTGAAATTGAGTTCCTAGGAGATGGACA
TGATTCAGATACTGCTAAAGTATATTATCAGGCTCTAAAGCTAATTAGCTTGTTTGGATGGGAACCTCGTTCACTGCCCTATGTAGTTGACTGCAAGACAGGGTCAGATC
AATCTCTCAAGAAATCCACCACTTTGGATTCACGTCCTACTGTTAATTTATGCACTGCTGCTACCAAAGAAAATGTTAATAGAAATAGAATTGCTGAGATTTCAAGTGAA
TTGCAATCTCTGCCCAATTCTGTTGTTTTAGATTGCCGGCTCTGTGGAGCTGGCGTTGGATTATGGACTTTCCACACAATTCCTAGACCTGTGGAAATCATAAGATTGGT
TGGACCCACTGAATTGAACGGTGAGTCAGGCACTCGTGATTCAGGCAATAAAAGTGTCGTCAATCATTCAGGTATTGGTAATGTAGGAATATCATCAAAAGAGAGCATAT
CAAATTTAACTTCCACAATCGCAGGGGGACCTACCCCTGCACGACAGAGTTTCAAGGCCACCATCACTTTGCCTGTCATTGGCCAAAACTTAAGGGCTAGGTTATTCAAT
GATGAAAAATTTATTGATCAGATGTATACTGACCAAGAAATGGTTCAAGCCGATTCCTTAGATAAAAATATGTTACAAGAAAGCAAAAACGACGAAGATACCACCCTTAC
TGGACAAACTGATCAGTCAGAAGGCATAAGATTGTTCCAGAATCAAACACTTGATCATGGATGCAGTACTACCGGTGATGATCAGACCCCTTTATTGGAAGGTACAAGTG
TTACTGATCGAGGAACCTTACCTGAATCTAGTTTGAATGGTTCAACTGAAGAAGCTCAAGAAAAGAGAACAGAGATTGTTCCTGCGCAGGAAATTGAAGTGCTGGAGAAT
GCTGGTAAAGCAGCAGACCTGCATCCTGGCCCTTCTCCTGTCGAAAACCCTTTGACGTCAACAGATGCTGTTATGATTACAAGTAGTGAATGCAGTGAAAAGGAGTTGCC
TTCCGTTGTCTCTGACCAATGTGATTCACAACAGGTTTCCGAAAATGATACTTCAAATAGCAAAGATGTTTCTTTAGCTAACTTACAGGTGACCACATGTAAATCCCCAT
GCCTTGAAGTTGATACAAATATAGATATCACCAGTAAGAACGAATCAACGAATGACAAACTTGGTTCTGATAACCACACCACCTCAGAAAACCAGGATAGTGAAGGAGGT
GGTGCTGCCAATGACAAAGTGCATACCTCTGTGAACAGCGAGCATATTACCCATGGTGGAGAGGATTATCCCAAGGGTGCACCATTTGGTAATGTGATGGAGTTCGATCC
AATCAGGCAGCACAGGCATTTTTGCTCTTGGATTGCCACAGGAAATGCGTCACCTGGATGGAAACAAACCCTAATTGCTTTACAGCGTGAAAAAAGCTCTTCGCCACATT
CACCTAAGAACTCTCCATCAGCGTCTCTTATTAAGGTCGATGACCCTGTTAGATCGGTTCGAAATCTATTCACGTCGTCTGCAAAGAAATTGAAAAGTAGTCTTGTCTCT
AACGACAACACCAAGCACTAG
mRNA sequence
ATGACACAAGATTCAGAGAAGAGGTTCCATTCCATCATGGACAAGCTCTTTCAGAATGGACAAGCCACTCCAAACTCAAATTCCGCATCTTCCCCATCCTCCTCCTCCAG
TCCGTCAGGAGTACAATTGTTGAGAGGGAAAAAGCGCCCATATTCTTCTTCTGCTCTGGTAGTGGGAGAGCTGAGGTCAAAAAGTGATGTAATTGAGGCATTGCAGAAGC
ATTCTTCAGCTTCTGCTGGATCCACTGATGCTCCATTATGCAGGCCTTGGGACCGTGGAGATCTTTCGAAAAGATTAACCACATTCAAGTCGATGACATGGTTTGGCAAA
CCTAAGGTGGTAAATGCTATAAATTGTTCTAGAAGAGGTTGGATCAATGTAGATATGGATACTATTGCCTGTGAATCATGTGGAGCACGTCTCCTTTTCTCTACTCCATC
TTCCTGGAATCAGCAACAAGTTGAGAAAGCCGCTTTGGTATTTAGCTTAAAGTTGGATAATGGGCACAAGTTACTCTGTCCCTGGATAGATAATGCCTGTGATGAAGCAT
TGGCTGATTTTCCTCCTACCCCTGCTCCAATTTTAGTTAAAAAATATGGAGAGCGTTGTTCTATGTTATTACATCTTTCAGCTCTCCCTGTTATTTCGTCTTCATTTCTC
AAATGGATGAAGAGTTCCCACCTCAAGCAATTTCTTGAAGAATTATCCTTGAAGGAATTTGGTAATGAGTCTCTTAACAACTCTGAAATTGAGTTCCTAGGAGATGGACA
TGATTCAGATACTGCTAAAGTATATTATCAGGCTCTAAAGCTAATTAGCTTGTTTGGATGGGAACCTCGTTCACTGCCCTATGTAGTTGACTGCAAGACAGGGTCAGATC
AATCTCTCAAGAAATCCACCACTTTGGATTCACGTCCTACTGTTAATTTATGCACTGCTGCTACCAAAGAAAATGTTAATAGAAATAGAATTGCTGAGATTTCAAGTGAA
TTGCAATCTCTGCCCAATTCTGTTGTTTTAGATTGCCGGCTCTGTGGAGCTGGCGTTGGATTATGGACTTTCCACACAATTCCTAGACCTGTGGAAATCATAAGATTGGT
TGGACCCACTGAATTGAACGGTGAGTCAGGCACTCGTGATTCAGGCAATAAAAGTGTCGTCAATCATTCAGGTATTGGTAATGTAGGAATATCATCAAAAGAGAGCATAT
CAAATTTAACTTCCACAATCGCAGGGGGACCTACCCCTGCACGACAGAGTTTCAAGGCCACCATCACTTTGCCTGTCATTGGCCAAAACTTAAGGGCTAGGTTATTCAAT
GATGAAAAATTTATTGATCAGATGTATACTGACCAAGAAATGGTTCAAGCCGATTCCTTAGATAAAAATATGTTACAAGAAAGCAAAAACGACGAAGATACCACCCTTAC
TGGACAAACTGATCAGTCAGAAGGCATAAGATTGTTCCAGAATCAAACACTTGATCATGGATGCAGTACTACCGGTGATGATCAGACCCCTTTATTGGAAGGTACAAGTG
TTACTGATCGAGGAACCTTACCTGAATCTAGTTTGAATGGTTCAACTGAAGAAGCTCAAGAAAAGAGAACAGAGATTGTTCCTGCGCAGGAAATTGAAGTGCTGGAGAAT
GCTGGTAAAGCAGCAGACCTGCATCCTGGCCCTTCTCCTGTCGAAAACCCTTTGACGTCAACAGATGCTGTTATGATTACAAGTAGTGAATGCAGTGAAAAGGAGTTGCC
TTCCGTTGTCTCTGACCAATGTGATTCACAACAGGTTTCCGAAAATGATACTTCAAATAGCAAAGATGTTTCTTTAGCTAACTTACAGGTGACCACATGTAAATCCCCAT
GCCTTGAAGTTGATACAAATATAGATATCACCAGTAAGAACGAATCAACGAATGACAAACTTGGTTCTGATAACCACACCACCTCAGAAAACCAGGATAGTGAAGGAGGT
GGTGCTGCCAATGACAAAGTGCATACCTCTGTGAACAGCGAGCATATTACCCATGGTGGAGAGGATTATCCCAAGGGTGCACCATTTGGTAATGTGATGGAGTTCGATCC
AATCAGGCAGCACAGGCATTTTTGCTCTTGGATTGCCACAGGAAATGCGTCACCTGGATGGAAACAAACCCTAATTGCTTTACAGCGTGAAAAAAGCTCTTCGCCACATT
CACCTAAGAACTCTCCATCAGCGTCTCTTATTAAGGTCGATGACCCTGTTAGATCGGTTCGAAATCTATTCACGTCGTCTGCAAAGAAATTGAAAAGTAGTCTTGTCTCT
AACGACAACACCAAGCACTAG
Protein sequence
MTQDSEKRFHSIMDKLFQNGQATPNSNSASSPSSSSSPSGVQLLRGKKRPYSSSALVVGELRSKSDVIEALQKHSSASAGSTDAPLCRPWDRGDLSKRLTTFKSMTWFGK
PKVVNAINCSRRGWINVDMDTIACESCGARLLFSTPSSWNQQQVEKAALVFSLKLDNGHKLLCPWIDNACDEALADFPPTPAPILVKKYGERCSMLLHLSALPVISSSFL
KWMKSSHLKQFLEELSLKEFGNESLNNSEIEFLGDGHDSDTAKVYYQALKLISLFGWEPRSLPYVVDCKTGSDQSLKKSTTLDSRPTVNLCTAATKENVNRNRIAEISSE
LQSLPNSVVLDCRLCGAGVGLWTFHTIPRPVEIIRLVGPTELNGESGTRDSGNKSVVNHSGIGNVGISSKESISNLTSTIAGGPTPARQSFKATITLPVIGQNLRARLFN
DEKFIDQMYTDQEMVQADSLDKNMLQESKNDEDTTLTGQTDQSEGIRLFQNQTLDHGCSTTGDDQTPLLEGTSVTDRGTLPESSLNGSTEEAQEKRTEIVPAQEIEVLEN
AGKAADLHPGPSPVENPLTSTDAVMITSSECSEKELPSVVSDQCDSQQVSENDTSNSKDVSLANLQVTTCKSPCLEVDTNIDITSKNESTNDKLGSDNHTTSENQDSEGG
GAANDKVHTSVNSEHITHGGEDYPKGAPFGNVMEFDPIRQHRHFCSWIATGNASPGWKQTLIALQREKSSSPHSPKNSPSASLIKVDDPVRSVRNLFTSSAKKLKSSLVS
NDNTKH