CuGenDBv2

CcUC06G120570 (gene) of Watermelon (PI 537277) v1 genome

Gene ID: CcUC06G120570
Organism: Citrullus colocynthis (Watermelon (PI 537277) v1)
Description: ENT domain-containing protein
Genome location: CicolChr06:22369644..22378625
RNA-Seq Expression: CcUC06G120570
Synteny: CcUC06G120570
Gene Ontology terms: GO:0005634 - nucleus (cellular component)
InterPro domains: IPR005491 - ENT domain; IPR014002 - Agenet domain, plant type; IPR036142 - ENT domain-like superfamily
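
The genome location field follows a `chromosome:start..end` pattern. As a minimal sketch, the string can be split into its parts and the span of the locus computed. The helper name `parse_location` is hypothetical, and the span assumes 1-based inclusive coordinates, which this record does not state explicitly.

```python
import re

def parse_location(loc: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", loc)
    if match is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)

chrom, start, end = parse_location("CicolChr06:22369644..22378625")
# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end - start + 1
print(chrom, start, end, span)
```

Under that assumption the locus spans 8982 bp on CicolChr06. That is longer than the mRNA listed further down, which would be consistent with the locus containing introns.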


Homology
GenBank top hits (columns: hit, e value, % identity, alignment)
KAA0046740.1 uncharacterized protein E6C27_scaffold216G00210 [Cucumis melo var. makuwa] | e value: 3.6e-181 | % identity: 85.45
Query:  MEALRSNDPNYMAGHIKFKGVNNHALVDQVSQKVIMPCHTRLDRSEAWAPGDVVEVFDNNSWKMATVSEVLGKMHILVRLLGSSQEFKVRKTDIRVRQSW
        MEALRSN PNY+AGH+K KGV+NHA+V QVS KVI PCHT LD S AWAPGDVVEVFDNNSWK+ATVSEVLGKMHILVRLLGSS+EFKVRKTDIRVRQSW
Subjt:  MEALRSNDPNYMAGHIKFKGVNNHALVDQVSQKVIMPCHTRLDRSEAWAPGDVVEVFDNNSWKMATVSEVLGKMHILVRLLGSSQEFKVRKTDIRVRQSW

Query:  KDDDNAWVMAGKGNKNSNGGKFHANPFLNNSHNSTSQVQKTNSRTTLWKKDDCFAIRDQNLQDNYNVRIQKRSTDCSSKAFYGSSHKVRLIEKEGRYVKV
        KDDD AWVM GKG+KN NGG+ H N  LN+ H+S+SQVQKTNSRTTLWKKDD  AIR+QN+ DNYNVRI KRSTDC SK  YG++HKVRLIEKEGRYVKV
Subjt:  KDDDNAWVMAGKGNKNSNGGKFHANPFLNNSHNSTSQVQKTNSRTTLWKKDDCFAIRDQNLQDNYNVRIQKRSTDCSSKAFYGSSHKVRLIEKEGRYVKV

Query:  VGANPTEIPKLQVAPVSYARDSLGERHRPASLNRRLGRHLELDIKGKEPVSPVRELNDADSIMCSVGSCSISSDNSSEMPCDVSAGVTDQIAGHFCDDRS
        V ANPTE+PKLQV PVSY RDSLGERHR ASLN RLG +LELDIKGKE VSPVRELNDADSIMCSVGSCSI+SDNSSEMPCDVS GVTDQIAGHFCDDRS
Subjt:  VGANPTEIPKLQVAPVSYARDSLGERHRPASLNRRLGRHLELDIKGKEPVSPVRELNDADSIMCSVGSCSISSDNSSEMPCDVSAGVTDQIAGHFCDDRS

Query:  PHHSGYEGGHCLPTTEELAAEIHRLELHAYRCTIEAFHASGPLSWDQEVLITNLRLSLNISNDEHLMQLKYLISADTS
        PH SGYE GHCLP  EELAAEIHRLEL AYRCTIEA HASGPLSWDQEVLITNLRLSLNISNDEHLMQLKYLISADTS
Subjt:  PHHSGYEGGHCLPTTEELAAEIHRLELHAYRCTIEAFHASGPLSWDQEVLITNLRLSLNISNDEHLMQLKYLISADTS

XP_011656195.1 uncharacterized protein LOC101213700 isoform X1 [Cucumis sativus] | e value: 3.4e-195 | % identity: 86.14
Query:  IMKFKKGSKLKISSKKKVALGPQCSMEALRSNDPNYMAGHIKFKGVNNHALVDQVSQKVIMPCHTRLDRSEAWAPGDVVEVFDNNSWKMATVSEVLGKMH
        +MKFKKGSKLKI SKKKV LG QCSMEALRSN PNY+AGH+K KGV+NHA+V QVS+ VIMPCHT LD S AWAPGDVVEVFDNNSWK+ATVSEVLGKMH
Subjt:  IMKFKKGSKLKISSKKKVALGPQCSMEALRSNDPNYMAGHIKFKGVNNHALVDQVSQKVIMPCHTRLDRSEAWAPGDVVEVFDNNSWKMATVSEVLGKMH

Query:  ILVRLLGSSQEFKVRKTDIRVRQSWKDDDNAWVMAGKGNKNSNGGKFHANPFLNNSHNSTSQVQKTNSRTTLWKKDDCFAIRDQNLQDNYNVRIQKRSTD
        ILVRLLGSSQEFKVRKTDIR RQSWKDDD AWVM GKG+KN NGG+ HAN  LN+SH+STSQVQKTNSRTTLWKKDDC AIR+QN+QDNYNV+I KRSTD
Subjt:  ILVRLLGSSQEFKVRKTDIRVRQSWKDDDNAWVMAGKGNKNSNGGKFHANPFLNNSHNSTSQVQKTNSRTTLWKKDDCFAIRDQNLQDNYNVRIQKRSTD

Query:  CSSKAFYGSSHKVRLIEKEGRYVKVVGANPTEIPKLQVAPVSYARDSLGERHRPASLNRRLGRHLELDIKGKE-PVSPVRELNDADSIMCSVGSCSISSD
        CSS+A YG++HKVRLIEKEGRYVKVV ANPTE+PKLQV PVSY RDSLGER R ASLN RLG +LELDIKGKE   SPVRELNDADSI+CSVGSCSISSD
Subjt:  CSSKAFYGSSHKVRLIEKEGRYVKVVGANPTEIPKLQVAPVSYARDSLGERHRPASLNRRLGRHLELDIKGKE-PVSPVRELNDADSIMCSVGSCSISSD

Query:  NSSEMPCDVSAGVTDQIAGHFCDDRSPHHSGYEGGHCLPTTEELAAEIHRLELHAYRCTIEAFHASGPLSWDQEVLITNLRLSLNISNDEHLMQLKYLIS
        NSSEMPCDVS GVTDQIAGHFCDDRSPH SGYE GHCLPT EELAAEIHRLEL AYRCTIEA HASGPLSWDQEVLITNLRLSLNISNDEHLMQLKYLIS
Subjt:  NSSEMPCDVSAGVTDQIAGHFCDDRSPHHSGYEGGHCLPTTEELAAEIHRLELHAYRCTIEAFHASGPLSWDQEVLITNLRLSLNISNDEHLMQLKYLIS

Query:  ADTS
        ADTS
Subjt:  ADTS

XP_016903306.1 PREDICTED: uncharacterized protein LOC103502731 isoform X2 [Cucumis melo] | e value: 3.7e-194 | % identity: 85.86
Query:  IMKFKKGSKLKISSKKKVALGPQCSMEALRSNDPNYMAGHIKFKGVNNHALVDQVSQKVIMPCHTRLDRSEAWAPGDVVEVFDNNSWKMATVSEVLGKMH
        +MKFKKGSKLKI SKKKVALGPQCSMEALRSN PNY+AGH+K KGV+NHA+V QVS KVI PCHT LD S AWAPGDVVEVFDNNSWK+ATVSEVLGKMH
Subjt:  IMKFKKGSKLKISSKKKVALGPQCSMEALRSNDPNYMAGHIKFKGVNNHALVDQVSQKVIMPCHTRLDRSEAWAPGDVVEVFDNNSWKMATVSEVLGKMH

Query:  ILVRLLGSSQEFKVRKTDIRVRQSWKDDDNAWVMAGKGNKNSNGGKFHANPFLNNSHNSTSQVQKTNSRTTLWKKDDCFAIRDQNLQDNYNVRIQKRSTD
        ILVRLLGSS+EFKVRKTDIRVRQSWKDDD AWVM GKG+KN NGG+ H N  LN+ H+S+SQVQKTNSRTTLWKKDD  AIR+QN+ DNYNVRI KRSTD
Subjt:  ILVRLLGSSQEFKVRKTDIRVRQSWKDDDNAWVMAGKGNKNSNGGKFHANPFLNNSHNSTSQVQKTNSRTTLWKKDDCFAIRDQNLQDNYNVRIQKRSTD

Query:  CSSKAFYGSSHKVRLIEKEGRYVKVVGANPTEIPKLQVAPVSYARDSLGERHRPASLNRRLGRHLELDIKGKEPVSPVRELNDADSIMCSVGSCSISSDN
        C SK  YG++HKVRLIEKEGRYVKVV ANPTE+PKLQV PVSY RDSLGERHR ASLN RLG +LELDIKGKE VSPVRELNDADSIMCSVGSCSI+SDN
Subjt:  CSSKAFYGSSHKVRLIEKEGRYVKVVGANPTEIPKLQVAPVSYARDSLGERHRPASLNRRLGRHLELDIKGKEPVSPVRELNDADSIMCSVGSCSISSDN

Query:  SSEMPCDVSAGVTDQIAGHFCDDRSPHHSGYEGGHCLPTTEELAAEIHRLELHAYRCTIEAFHASGPLSWDQEVLITNLRLSLNISNDEHLMQLKYLISA
        SSEMPCDVS GVTDQIAGHFCDDRSPH SGYE GHCLP  EELAAEIHRLEL AYRCTIEA HASGPLSWDQEVLITNLRLSLNISNDEHLMQLKYLISA
Subjt:  SSEMPCDVSAGVTDQIAGHFCDDRSPHHSGYEGGHCLPTTEELAAEIHRLELHAYRCTIEAFHASGPLSWDQEVLITNLRLSLNISNDEHLMQLKYLISA

Query:  DTS
        DTS
Subjt:  DTS

XP_022987746.1 uncharacterized protein LOC111485204 isoform X1 [Cucurbita maxima] | e value: 8.0e-181 | % identity: 80.35
Query:  MKFKKGSKLKISSKKKVALGPQCSMEALRSNDPNYMAGHIKFKGVNNHALVDQVSQKVIMPCHTRLDRSEAWAPGDVVEVFDNNSWKMATVSEVLGKMHI
        MKFKKGSKLK+ SK+KVALGPQCSMEALRSN  NYMAG++  KGVNNH +V++VS+KVI+PCHTRLD SEAWAPGDVVEVFDNN WK+ATVSEVLGKMHI
Subjt:  MKFKKGSKLKISSKKKVALGPQCSMEALRSNDPNYMAGHIKFKGVNNHALVDQVSQKVIMPCHTRLDRSEAWAPGDVVEVFDNNSWKMATVSEVLGKMHI

Query:  LVRLLGSSQEFKVRKTDIRVRQSWKDDDNAWVMAGKGNKNSNGGKFHANPFLNNSHNSTSQVQKTNSRTTLWKKDDCFAIRDQNLQDNYNVRIQKRSTDC
        LVRLLGSSQEFKVRKT IRVRQSW+DDDNAWV+  KGNKNSNGGK HAN FLN   NSTSQVQKTNSRT+ WK+DDCFA R++  QD+YNVR+ KRSTDC
Subjt:  LVRLLGSSQEFKVRKTDIRVRQSWKDDDNAWVMAGKGNKNSNGGKFHANPFLNNSHNSTSQVQKTNSRTTLWKKDDCFAIRDQNLQDNYNVRIQKRSTDC

Query:  SSKAFYGSSHKVRLIEKEGRYVKVVGANPTEIPKLQVAPVSYARDSLGERHRPASLNRRLGRHLELDIKGKEPVSPVRELNDADSIMCSVGSCSISSDNS
        SSKA Y +SHKVR IEK+GRY+KVV AN TE+PKLQV PVS  RD LGERH PASLN RLG HL+LDIK KEPV+ VRE ND DSIMCSVGSCSISSDNS
Subjt:  SSKAFYGSSHKVRLIEKEGRYVKVVGANPTEIPKLQVAPVSYARDSLGERHRPASLNRRLGRHLELDIKGKEPVSPVRELNDADSIMCSVGSCSISSDNS

Query:  SEMPCDVSAGVTDQIAGHFCDDRSPHHSGYEGGHCLPTTEELAAEIHRLELHAYRCTIEAFHASGPLSWDQEVLITNLRLSLNISNDEHLMQLKYLISAD
        +E+PCD    + D+ AGHFC+DRSP+HSGYE GHCLPT EELAAEIHRLELHAYRCTIEA HASGPLSW+QEVLITNLRLSLNISNDEHL+QLKYLISA+
Subjt:  SEMPCDVSAGVTDQIAGHFCDDRSPHHSGYEGGHCLPTTEELAAEIHRLELHAYRCTIEAFHASGPLSWDQEVLITNLRLSLNISNDEHLMQLKYLISAD

Query:  TS
        TS
Subjt:  TS

XP_038880219.1 uncharacterized protein LOC120071880 isoform X3 [Benincasa hispida] | e value: 9.8e-187 | % identity: 88.47
Query:  MKFKKGSKLKISSKKKVALGPQCSMEALRSNDPNYMAGHIKFKGVNNHALVDQVSQKVIMPCHTRLDRSEAWAPGDVVEVFDNNSWKMATVSEVLGKMHI
        MKFKKGSKLKI SKKKVALGPQCS+EALRSN PNYMAGH+K KGVNNHALV+QVSQKV+MPCHT LD SEAWAPGDVVEVFDNNSWKMATVSEVLGKMHI
Subjt:  MKFKKGSKLKISSKKKVALGPQCSMEALRSNDPNYMAGHIKFKGVNNHALVDQVSQKVIMPCHTRLDRSEAWAPGDVVEVFDNNSWKMATVSEVLGKMHI

Query:  LVRLLGSSQEFKVRKTDIRVRQSWKDDDNAWVMAGKGNKNSNGGKFHANPFLNNSHNSTSQVQKTNSRTTLWKKDDCFAIRDQNLQDNYNVRIQKRSTDC
        LVRLLGSSQEFKVRKTDIRVRQSWKDDD+AWVM GKGNKNSN GK  AN FLN+SHNS SQVQKTNSRTT WKKDDCFAIR+QN +D+YNVRI KRSTDC
Subjt:  LVRLLGSSQEFKVRKTDIRVRQSWKDDDNAWVMAGKGNKNSNGGKFHANPFLNNSHNSTSQVQKTNSRTTLWKKDDCFAIRDQNLQDNYNVRIQKRSTDC

Query:  SSKAFYGSSHKVRLIEKEGRYVKVVGANPTEIP-KLQVAPVSYARDSLGERHRPASLNRRLGRHLELDIKGKEPVSPVRELNDADSIMCSVGSCSISSDN
        SSKAFYG+SHKVRLIEKEGRYVKVV ANPTE+P KLQ+ PVSY RDSLGERHRPASLN RLG HL++D+KGKEPVSPVRELNDADSIMCSVGSCSISSDN
Subjt:  SSKAFYGSSHKVRLIEKEGRYVKVVGANPTEIP-KLQVAPVSYARDSLGERHRPASLNRRLGRHLELDIKGKEPVSPVRELNDADSIMCSVGSCSISSDN

Query:  SSEMPCDVSAGVTDQIAGHFCDDRSPHHSGYEGGHCLPTTEELAAEIHRLELHAYRCTIEAFHASGPLSWDQE
        SSEMP DVSAGVTDQI GH CDDRSPHHSG+E G+CLPT EELAAEIHRLELHAYRCTIEA HASGPLSWDQE
Subjt:  SSEMPCDVSAGVTDQIAGHFCDDRSPHHSGYEGGHCLPTTEELAAEIHRLELHAYRCTIEAFHASGPLSWDQE

TrEMBL top hits (columns: hit, e value, % identity, alignment)
A0A0A0LWD3 ENT domain-containing protein | e value: 1.6e-195 | % identity: 86.14
Query:  IMKFKKGSKLKISSKKKVALGPQCSMEALRSNDPNYMAGHIKFKGVNNHALVDQVSQKVIMPCHTRLDRSEAWAPGDVVEVFDNNSWKMATVSEVLGKMH
        +MKFKKGSKLKI SKKKV LG QCSMEALRSN PNY+AGH+K KGV+NHA+V QVS+ VIMPCHT LD S AWAPGDVVEVFDNNSWK+ATVSEVLGKMH
Subjt:  IMKFKKGSKLKISSKKKVALGPQCSMEALRSNDPNYMAGHIKFKGVNNHALVDQVSQKVIMPCHTRLDRSEAWAPGDVVEVFDNNSWKMATVSEVLGKMH

Query:  ILVRLLGSSQEFKVRKTDIRVRQSWKDDDNAWVMAGKGNKNSNGGKFHANPFLNNSHNSTSQVQKTNSRTTLWKKDDCFAIRDQNLQDNYNVRIQKRSTD
        ILVRLLGSSQEFKVRKTDIR RQSWKDDD AWVM GKG+KN NGG+ HAN  LN+SH+STSQVQKTNSRTTLWKKDDC AIR+QN+QDNYNV+I KRSTD
Subjt:  ILVRLLGSSQEFKVRKTDIRVRQSWKDDDNAWVMAGKGNKNSNGGKFHANPFLNNSHNSTSQVQKTNSRTTLWKKDDCFAIRDQNLQDNYNVRIQKRSTD

Query:  CSSKAFYGSSHKVRLIEKEGRYVKVVGANPTEIPKLQVAPVSYARDSLGERHRPASLNRRLGRHLELDIKGKE-PVSPVRELNDADSIMCSVGSCSISSD
        CSS+A YG++HKVRLIEKEGRYVKVV ANPTE+PKLQV PVSY RDSLGER R ASLN RLG +LELDIKGKE   SPVRELNDADSI+CSVGSCSISSD
Subjt:  CSSKAFYGSSHKVRLIEKEGRYVKVVGANPTEIPKLQVAPVSYARDSLGERHRPASLNRRLGRHLELDIKGKE-PVSPVRELNDADSIMCSVGSCSISSD

Query:  NSSEMPCDVSAGVTDQIAGHFCDDRSPHHSGYEGGHCLPTTEELAAEIHRLELHAYRCTIEAFHASGPLSWDQEVLITNLRLSLNISNDEHLMQLKYLIS
        NSSEMPCDVS GVTDQIAGHFCDDRSPH SGYE GHCLPT EELAAEIHRLEL AYRCTIEA HASGPLSWDQEVLITNLRLSLNISNDEHLMQLKYLIS
Subjt:  NSSEMPCDVSAGVTDQIAGHFCDDRSPHHSGYEGGHCLPTTEELAAEIHRLELHAYRCTIEAFHASGPLSWDQEVLITNLRLSLNISNDEHLMQLKYLIS

Query:  ADTS
        ADTS
Subjt:  ADTS

A0A1S3CNA1 uncharacterized protein LOC103502731 isoform X3 | e value: 5.3e-178 | % identity: 84.72
Query:  IMKFKKGSKLKISSKKKVALGPQCSMEALRSNDPNYMAGHIKFKGVNNHALVDQVSQKVIMPCHTRLDRSEAWAPGDVVEVFDNNSWKMATVSEVLGKMH
        +MKFKKGSKLKI SKKKVALGPQCSMEALRSN PNY+AGH+K KGV+NHA+V QVS KVI PCHT LD S AWAPGDVVEVFDNNSWK+ATVSEVLGKMH
Subjt:  IMKFKKGSKLKISSKKKVALGPQCSMEALRSNDPNYMAGHIKFKGVNNHALVDQVSQKVIMPCHTRLDRSEAWAPGDVVEVFDNNSWKMATVSEVLGKMH

Query:  ILVRLLGSSQEFKVRKTDIRVRQSWKDDDNAWVMAGKGNKNSNGGKFHANPFLNNSHNSTSQVQKTNSRTTLWKKDDCFAIRDQNLQDNYNVRIQKRSTD
        ILVRLLGSS+EFKVRKTDIRVRQSWKDDD AWVM GKG+KN NGG+ H N  LN+ H+S+SQVQKTNSRTTLWKKDD  AIR+QN+ DNYNVRI KRSTD
Subjt:  ILVRLLGSSQEFKVRKTDIRVRQSWKDDDNAWVMAGKGNKNSNGGKFHANPFLNNSHNSTSQVQKTNSRTTLWKKDDCFAIRDQNLQDNYNVRIQKRSTD

Query:  CSSKAFYGSSHKVRLIEKEGRYVKVVGANPTEIPKLQVAPVSYARDSLGERHRPASLNRRLGRHLELDIKGKEPVSPVRELNDADSIMCSVGSCSISSDN
        C SK  YG++HKVRLIEKEGRYVKVV ANPTE+PKLQV PVSY RDSLGERHR ASLN RLG +LELDIKGKE VSPVRELNDADSIMCSVGSCSI+SDN
Subjt:  CSSKAFYGSSHKVRLIEKEGRYVKVVGANPTEIPKLQVAPVSYARDSLGERHRPASLNRRLGRHLELDIKGKEPVSPVRELNDADSIMCSVGSCSISSDN

Query:  SSEMPCDVSAGVTDQIAGHFCDDRSPHHSGYEGGHCLPTTEELAAEIHRLELHAYRCTIEAFHASGPLSWDQE
        SSEMPCDVS GVTDQIAGHFCDDRSPH SGYE GHCLP  EELAAEIHRLEL AYRCTIEA HASGPLSWDQE
Subjt:  SSEMPCDVSAGVTDQIAGHFCDDRSPHHSGYEGGHCLPTTEELAAEIHRLELHAYRCTIEAFHASGPLSWDQE

A0A1S4E4Z7 uncharacterized protein LOC103502731 isoform X2 | e value: 1.8e-194 | % identity: 85.86
Query:  IMKFKKGSKLKISSKKKVALGPQCSMEALRSNDPNYMAGHIKFKGVNNHALVDQVSQKVIMPCHTRLDRSEAWAPGDVVEVFDNNSWKMATVSEVLGKMH
        +MKFKKGSKLKI SKKKVALGPQCSMEALRSN PNY+AGH+K KGV+NHA+V QVS KVI PCHT LD S AWAPGDVVEVFDNNSWK+ATVSEVLGKMH
Subjt:  IMKFKKGSKLKISSKKKVALGPQCSMEALRSNDPNYMAGHIKFKGVNNHALVDQVSQKVIMPCHTRLDRSEAWAPGDVVEVFDNNSWKMATVSEVLGKMH

Query:  ILVRLLGSSQEFKVRKTDIRVRQSWKDDDNAWVMAGKGNKNSNGGKFHANPFLNNSHNSTSQVQKTNSRTTLWKKDDCFAIRDQNLQDNYNVRIQKRSTD
        ILVRLLGSS+EFKVRKTDIRVRQSWKDDD AWVM GKG+KN NGG+ H N  LN+ H+S+SQVQKTNSRTTLWKKDD  AIR+QN+ DNYNVRI KRSTD
Subjt:  ILVRLLGSSQEFKVRKTDIRVRQSWKDDDNAWVMAGKGNKNSNGGKFHANPFLNNSHNSTSQVQKTNSRTTLWKKDDCFAIRDQNLQDNYNVRIQKRSTD

Query:  CSSKAFYGSSHKVRLIEKEGRYVKVVGANPTEIPKLQVAPVSYARDSLGERHRPASLNRRLGRHLELDIKGKEPVSPVRELNDADSIMCSVGSCSISSDN
        C SK  YG++HKVRLIEKEGRYVKVV ANPTE+PKLQV PVSY RDSLGERHR ASLN RLG +LELDIKGKE VSPVRELNDADSIMCSVGSCSI+SDN
Subjt:  CSSKAFYGSSHKVRLIEKEGRYVKVVGANPTEIPKLQVAPVSYARDSLGERHRPASLNRRLGRHLELDIKGKEPVSPVRELNDADSIMCSVGSCSISSDN

Query:  SSEMPCDVSAGVTDQIAGHFCDDRSPHHSGYEGGHCLPTTEELAAEIHRLELHAYRCTIEAFHASGPLSWDQEVLITNLRLSLNISNDEHLMQLKYLISA
        SSEMPCDVS GVTDQIAGHFCDDRSPH SGYE GHCLP  EELAAEIHRLEL AYRCTIEA HASGPLSWDQEVLITNLRLSLNISNDEHLMQLKYLISA
Subjt:  SSEMPCDVSAGVTDQIAGHFCDDRSPHHSGYEGGHCLPTTEELAAEIHRLELHAYRCTIEAFHASGPLSWDQEVLITNLRLSLNISNDEHLMQLKYLISA

Query:  DTS
        DTS
Subjt:  DTS

A0A5D3CRF3 ENT domain-containing protein | e value: 1.7e-181 | % identity: 85.45
Query:  MEALRSNDPNYMAGHIKFKGVNNHALVDQVSQKVIMPCHTRLDRSEAWAPGDVVEVFDNNSWKMATVSEVLGKMHILVRLLGSSQEFKVRKTDIRVRQSW
        MEALRSN PNY+AGH+K KGV+NHA+V QVS KVI PCHT LD S AWAPGDVVEVFDNNSWK+ATVSEVLGKMHILVRLLGSS+EFKVRKTDIRVRQSW
Subjt:  MEALRSNDPNYMAGHIKFKGVNNHALVDQVSQKVIMPCHTRLDRSEAWAPGDVVEVFDNNSWKMATVSEVLGKMHILVRLLGSSQEFKVRKTDIRVRQSW

Query:  KDDDNAWVMAGKGNKNSNGGKFHANPFLNNSHNSTSQVQKTNSRTTLWKKDDCFAIRDQNLQDNYNVRIQKRSTDCSSKAFYGSSHKVRLIEKEGRYVKV
        KDDD AWVM GKG+KN NGG+ H N  LN+ H+S+SQVQKTNSRTTLWKKDD  AIR+QN+ DNYNVRI KRSTDC SK  YG++HKVRLIEKEGRYVKV
Subjt:  KDDDNAWVMAGKGNKNSNGGKFHANPFLNNSHNSTSQVQKTNSRTTLWKKDDCFAIRDQNLQDNYNVRIQKRSTDCSSKAFYGSSHKVRLIEKEGRYVKV

Query:  VGANPTEIPKLQVAPVSYARDSLGERHRPASLNRRLGRHLELDIKGKEPVSPVRELNDADSIMCSVGSCSISSDNSSEMPCDVSAGVTDQIAGHFCDDRS
        V ANPTE+PKLQV PVSY RDSLGERHR ASLN RLG +LELDIKGKE VSPVRELNDADSIMCSVGSCSI+SDNSSEMPCDVS GVTDQIAGHFCDDRS
Subjt:  VGANPTEIPKLQVAPVSYARDSLGERHRPASLNRRLGRHLELDIKGKEPVSPVRELNDADSIMCSVGSCSISSDNSSEMPCDVSAGVTDQIAGHFCDDRS

Query:  PHHSGYEGGHCLPTTEELAAEIHRLELHAYRCTIEAFHASGPLSWDQEVLITNLRLSLNISNDEHLMQLKYLISADTS
        PH SGYE GHCLP  EELAAEIHRLEL AYRCTIEA HASGPLSWDQEVLITNLRLSLNISNDEHLMQLKYLISADTS
Subjt:  PHHSGYEGGHCLPTTEELAAEIHRLELHAYRCTIEAFHASGPLSWDQEVLITNLRLSLNISNDEHLMQLKYLISADTS

A0A6J1JF73 uncharacterized protein LOC111485204 isoform X1 | e value: 3.9e-181 | % identity: 80.35
Query:  MKFKKGSKLKISSKKKVALGPQCSMEALRSNDPNYMAGHIKFKGVNNHALVDQVSQKVIMPCHTRLDRSEAWAPGDVVEVFDNNSWKMATVSEVLGKMHI
        MKFKKGSKLK+ SK+KVALGPQCSMEALRSN  NYMAG++  KGVNNH +V++VS+KVI+PCHTRLD SEAWAPGDVVEVFDNN WK+ATVSEVLGKMHI
Subjt:  MKFKKGSKLKISSKKKVALGPQCSMEALRSNDPNYMAGHIKFKGVNNHALVDQVSQKVIMPCHTRLDRSEAWAPGDVVEVFDNNSWKMATVSEVLGKMHI

Query:  LVRLLGSSQEFKVRKTDIRVRQSWKDDDNAWVMAGKGNKNSNGGKFHANPFLNNSHNSTSQVQKTNSRTTLWKKDDCFAIRDQNLQDNYNVRIQKRSTDC
        LVRLLGSSQEFKVRKT IRVRQSW+DDDNAWV+  KGNKNSNGGK HAN FLN   NSTSQVQKTNSRT+ WK+DDCFA R++  QD+YNVR+ KRSTDC
Subjt:  LVRLLGSSQEFKVRKTDIRVRQSWKDDDNAWVMAGKGNKNSNGGKFHANPFLNNSHNSTSQVQKTNSRTTLWKKDDCFAIRDQNLQDNYNVRIQKRSTDC

Query:  SSKAFYGSSHKVRLIEKEGRYVKVVGANPTEIPKLQVAPVSYARDSLGERHRPASLNRRLGRHLELDIKGKEPVSPVRELNDADSIMCSVGSCSISSDNS
        SSKA Y +SHKVR IEK+GRY+KVV AN TE+PKLQV PVS  RD LGERH PASLN RLG HL+LDIK KEPV+ VRE ND DSIMCSVGSCSISSDNS
Subjt:  SSKAFYGSSHKVRLIEKEGRYVKVVGANPTEIPKLQVAPVSYARDSLGERHRPASLNRRLGRHLELDIKGKEPVSPVRELNDADSIMCSVGSCSISSDNS

Query:  SEMPCDVSAGVTDQIAGHFCDDRSPHHSGYEGGHCLPTTEELAAEIHRLELHAYRCTIEAFHASGPLSWDQEVLITNLRLSLNISNDEHLMQLKYLISAD
        +E+PCD    + D+ AGHFC+DRSP+HSGYE GHCLPT EELAAEIHRLELHAYRCTIEA HASGPLSW+QEVLITNLRLSLNISNDEHL+QLKYLISA+
Subjt:  SEMPCDVSAGVTDQIAGHFCDDRSPHHSGYEGGHCLPTTEELAAEIHRLELHAYRCTIEAFHASGPLSWDQEVLITNLRLSLNISNDEHLMQLKYLISAD

Query:  TS
        TS
Subjt:  TS

SwissProt top hits (columns: hit, e value, % identity, alignment)
Q08A72 Protein EMSY-LIKE 4 | e value: 2.3e-05 | % identity: 38.37
Query:  ELAAEIHRLELHAYRCTIEAFHASG-PLSWDQEVLITNLRLSLNISNDEHLMQLKYLISADT-SWLRSRLESSRCQ-AMANAREVI
        ++ A+IH++E  AY   + AF A G  +SW++E +IT LR  L++SN+EH   L  + S DT   +R   +S   Q +M NA +V+
Subjt:  ELAAEIHRLELHAYRCTIEAFHASG-PLSWDQEVLITNLRLSLNISNDEHLMQLKYLISADT-SWLRSRLESSRCQ-AMANAREVI

Q9C7C4 Protein EMSY-LIKE 1 | e value: 1.5e-04 | % identity: 42.62
Query:  LAAEIHRLELHAYRCTIEAFHA-SGPLSWDQEVLITNLRLSLNISNDEHLMQLKYLISADT
        +  +IH+LE  AY   + AF A S  +SW++E LIT LR  L +S+DEH   L  +   DT
Subjt:  LAAEIHRLELHAYRCTIEAFHA-SGPLSWDQEVLITNLRLSLNISNDEHLMQLKYLISADT
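
The % identity column can be related back to the alignment blocks. In each block, the middle line shows a letter where query and subject residues are identical, `+` for a conservative substitution, and a space otherwise. A minimal sketch, assuming % identity is identical columns divided by alignment length (gap columns included), applied to the Q9C7C4 alignment directly above. The function name `percent_identity` is hypothetical, and the middle line is used because the Subjt lines in this record repeat the query text.

```python
def percent_identity(query: str, midline: str) -> float:
    """Percent identity from a BLAST-style middle line.

    Letters in the middle line mark identical residues; '+' and spaces
    are not counted. The denominator is the number of alignment columns.
    """
    if len(query) != len(midline):
        raise ValueError("query and middle line must cover the same columns")
    identities = sum(ch.isalpha() for ch in midline)
    return 100.0 * identities / len(query)

# Query and middle line of the Q9C7C4 alignment, without the 8-character label column.
query   = "LAAEIHRLELHAYRCTIEAFHA-SGPLSWDQEVLITNLRLSLNISNDEHLMQLKYLISADT"
midline = "+  +IH+LE  AY   + AF A S  +SW++E LIT LR  L +S+DEH   L  +   DT"

pct = percent_identity(query, midline)
print(f"{pct:.2f}")
```

For this alignment the middle line holds 26 letters over 61 columns, which reproduces the 42.62 listed for the hit.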

Arabidopsis top hits (columns: hit, e value, % identity, alignment)
AT2G25590.1 Plant Tudor-like protein | e value: 1.9e-23 | % identity: 41.61
Query:  MKFKKGSKLKISSKKKVALGPQCSMEALRSNDPNYMAGHIKFKGVNNHALVDQVSQKVIMPCHTRLDRSEAWAPGDVVEVFDNN-SWKMATVSEVLGKMH
        M+F++GS++++ S K+ + G   S E +  N   Y   +  F+  NN  + D+V +K+I PC  ++D  + W  G++VEV DNN SWK ATV EVL   +
Subjt:  MKFKKGSKLKISSKKKVALGPQCSMEALRSNDPNYMAGHIKFKGVNNHALVDQVSQKVIMPCHTRLDRSEAWAPGDVVEVFDNN-SWKMATVSEVLGKMH

Query:  ILVRLLGSSQEFKVRKTDIRVRQSWKDDDNAWVMAGK
         +VRLLG+  E  V K  +R RQSW+D+   WVM GK
Subjt:  ILVRLLGSSQEFKVRKTDIRVRQSWKDDDNAWVMAGK

AT4G32440.1 Plant Tudor-like RNA-binding protein | e value: 7.8e-41 | % identity: 32.27
Query:  MKFKKGSKLKISSKKKVALGPQCSMEALRSNDPNYMAGHIKFKGVNNHALVDQVSQKVIMPCHTRLDRSEAWAPGDVVEVFDNNSWKMATVSEVLGKMHI
        M+ +KGS++++ S K+   G     E +  N   Y      F+  +  A++++V +K+I PC   +D  E W  G++VEV DN SWK ATV E L   + 
Subjt:  MKFKKGSKLKISSKKKVALGPQCSMEALRSNDPNYMAGHIKFKGVNNHALVDQVSQKVIMPCHTRLDRSEAWAPGDVVEVFDNNSWKMATVSEVLGKMHI

Query:  LVRLLGSSQEFKVRKTDIRVRQSWKDDDNAWVMAGK--GNKNSN---GGKFHANPFLNNSHNSTSQVQKTNSRTTLWKKDDCFAIRDQNLQDNYNVRIQK
        +VRLLG+ +E    K ++R R+SW+D+   WV  GK  G+  S+   G   H    L    NS    + +     L K+   +                 
Subjt:  LVRLLGSSQEFKVRKTDIRVRQSWKDDDNAWVMAGK--GNKNSN---GGKFHANPFLNNSHNSTSQVQKTNSRTTLWKKDDCFAIRDQNLQDNYNVRIQK

Query:  RSTDCSSKAFYGSSHKVRLIEKEGRYVKV--VGANPTEIPKLQVAPVSYARDSLGERHRPASLNRRLGRHLEL-DIKGKEPVSPVRELNDADSIMCSVGS
          ++C +++  G+  K+R +EKEG+  KV  +   P               +  G+ H  ASLN     + ++  ++ K     VR  + +DS +CSVGS
Subjt:  RSTDCSSKAFYGSSHKVRLIEKEGRYVKV--VGANPTEIPKLQVAPVSYARDSLGERHRPASLNRRLGRHLEL-DIKGKEPVSPVRELNDADSIMCSVGS

Query:  CSISSDNSSEMPCDVSAGVTDQIAGHFCDDRSPHHSGYEGGHCLPTTEELAAEIHRLELHAYRCTIEAFHASGPLSWDQEVLITNLRLSLNISNDEHLMQ
        CS +S + S MP  +  G T Q      D  S    G E      +  + A    R EL++YR T+    +SGPLSW+QE  +T+LRLSLNIS+DEHLM+
Subjt:  CSISSDNSSEMPCDVSAGVTDQIAGHFCDDRSPHHSGYEGGHCLPTTEELAAEIHRLELHAYRCTIEAFHASGPLSWDQEVLITNLRLSLNISNDEHLMQ

Query:  LKYLISADT
        ++ LIS  T
Subjt:  LKYLISADT

AT4G32440.2 Plant Tudor-like RNA-binding protein | e value: 7.3e-39 | % identity: 32
Query:  MKFKKGSKLKISSKKKVALGPQCSMEALRSNDPNYMAGHIKFKGVNNHALVDQVSQKVIMPCHTRLDRSEAWAPGDVVEVFDNNSWKMATVSEVLGKMHI
        M+ +KGS++++ S K+   G     E +  N   Y      F+  +  A++++V +K+I PC   +D  E W  G++VEV DN SWK ATV E L   + 
Subjt:  MKFKKGSKLKISSKKKVALGPQCSMEALRSNDPNYMAGHIKFKGVNNHALVDQVSQKVIMPCHTRLDRSEAWAPGDVVEVFDNNSWKMATVSEVLGKMHI

Query:  LVRLLGSSQEFKVRKTDIRVRQSWKDDDNAWVMAGK--GNKNSN---GGKFHANPFLNNSHNSTSQVQKTNSRTTLWKKDDCFAIRDQNLQDNYNVRIQK
        +VRLLG+ +E    K ++R R+SW+D+   WV  GK  G+  S+   G   H    L    NS    + +     L K+   +                 
Subjt:  LVRLLGSSQEFKVRKTDIRVRQSWKDDDNAWVMAGK--GNKNSN---GGKFHANPFLNNSHNSTSQVQKTNSRTTLWKKDDCFAIRDQNLQDNYNVRIQK

Query:  RSTDCSSKAFYGSSHKVRLIEKEGRYVKV--VGANPTEIPKLQVAPVSYARDSLGERHRPASLNRRLGRHLEL-DIKGKEPVSPVRELNDADSIMCSVGS
          ++C +++  G+  K+R +EKEG+  KV  +   P               +  G+ H  ASLN     + ++  ++ K     VR  + +DS +CSVGS
Subjt:  RSTDCSSKAFYGSSHKVRLIEKEGRYVKV--VGANPTEIPKLQVAPVSYARDSLGERHRPASLNRRLGRHLEL-DIKGKEPVSPVRELNDADSIMCSVGS

Query:  CSISSDNSSEMPCDVSAGVTDQIAGHFCDDRSPHHSGYEGGHCLPTTEELAAEIHRLELHAYRCTIEAFHASGPLSWDQEVLITNLRLSLNISNDEHLMQ
        CS +S + S MP  +  G T Q      D  S    G E      +  + A    R EL++YR T+    +SGPLSW+QE  +T+LRLSLNIS+DEHLM+
Subjt:  CSISSDNSSEMPCDVSAGVTDQIAGHFCDDRSPHHSGYEGGHCLPTTEELAAEIHRLELHAYRCTIEAFHASGPLSWDQEVLITNLRLSLNISNDEHLMQ

AT4G32440.3 Plant Tudor-like RNA-binding protein | e value: 1.3e-19 | % identity: 36.76
Query:  MKFKKGSKLKISSKKKVALGPQCSMEALRSNDPNYMAGHIKFKGVNNHALVDQVSQKVIMPCHTRLDRSEAWAPGDVVEVFDNNSWKMATVSEVLGKMHI
        M+ +KGS++++ S K+   G     E +  N   Y      F+  +  A++++V +K+I PC   +D  E W  G++VEV DN SWK ATV E L   + 
Subjt:  MKFKKGSKLKISSKKKVALGPQCSMEALRSNDPNYMAGHIKFKGVNNHALVDQVSQKVIMPCHTRLDRSEAWAPGDVVEVFDNNSWKMATVSEVLGKMHI

Query:  LVRLLGSSQEFKVRKTDIRVRQSWKDDDNAWVMAGK
        +VRLLG+ +E    K ++R R+SW+D+   WV  GK
Subjt:  LVRLLGSSQEFKVRKTDIRVRQSWKDDDNAWVMAGK

AT5G20030.1 Plant Tudor-like RNA-binding protein | e value: 1.3e-40 | % identity: 32.18
Query:  MKFKKGSKLKISSKKKVALGPQCSMEALRSNDPNYMAGHIKFKGVNNHALVDQVSQKVIMPCHTRLDRSEAWAPGDVVEVFDNNSWKMATVSEVLGKMHI
        M+F KG+K+++ SK  V  G   S E +  N      GH      +++   ++V +K + P   RL   +AW PGD++EVF + SWKMA VS+VLG    
Subjt:  MKFKKGSKLKISSKKKVALGPQCSMEALRSNDPNYMAGHIKFKGVNNHALVDQVSQKVIMPCHTRLDRSEAWAPGDVVEVFDNNSWKMATVSEVLGKMHI

Query:  LVRLLGSSQEFKVRKTDIRVRQSWKDDDNAWVMAGKGNKNSNGGKFHANPFLNNSHNSTSQVQKTNSRTTLWKKDDCFAIRDQNLQDNYNVRIQKRSTDC
        LVRLLGSS +FKV K+DIRVRQSW+  DN W+M G+G                ++  ST ++     R  +  K D  +   ++  D  +V +       
Subjt:  LVRLLGSSQEFKVRKTDIRVRQSWKDDDNAWVMAGKGNKNSNGGKFHANPFLNNSHNSTSQVQKTNSRTTLWKKDDCFAIRDQNLQDNYNVRIQKRSTDC

Query:  SSKAFYGSSHKVRLIEKEGRYVKVVGANPTEIPKLQVAPVSYARDSLGERHRPASLNRRLGRHLELDIKGKEPVSPVRELNDADSIMCSVGSCSISSDNS
              G   +   + +     + + A P              R+ + E                             E  D +S+  SVGSC + +D  
Subjt:  SSKAFYGSSHKVRLIEKEGRYVKVVGANPTEIPKLQVAPVSYARDSLGERHRPASLNRRLGRHLELDIKGKEPVSPVRELNDADSIMCSVGSCSISSDNS

Query:  SEMPCD-VSAGVTDQIAGHFCDDRSPHHSGYEGGHCLPTTEELAAEIHRLELHAYRCTIEAFHASGP-LSWDQEVLITNLRLSLNISNDEHLMQLKYLIS
        S +  + +  G +       C        G      +P     AA++HRLEL AYR +IE  HASGP ++W+QE  ITNLRL LNISN+EHLMQ++ LIS
Subjt:  SEMPCD-VSAGVTDQIAGHFCDDRSPHHSGYEGGHCLPTTEELAAEIHRLELHAYRCTIEAFHASGP-LSWDQEVLITNLRLSLNISNDEHLMQLKYLIS

Query:  ADTS
         D S
Subjt:  ADTS


Sequences
CDS sequence
ATGCATAGGTCTTCCAGCACTTCCCGGAGCTTCGAAGAATTCCCAGTTGAGTTGCCGCCAGATTCTATAAGCTCCCCACCATTGAAGAATGAAGCTTCTGGTGCTTTGCC
CATTTTTAAAAGTAATAATGCCTCCAAGAAAGAAATGGGATCACACTTTAGGTCTCCAGGAGAAAATGCAGTTCACTTAATCCCTCTTACTCTGTTTCTTTGTGCTTTAA
TCCTTTGGTACTTTGCGCCACACATCATGAAATTCAAAAAAGGGAGTAAGCTGAAAATATCGAGCAAAAAGAAGGTGGCTTTAGGACCACAATGTTCTATGGAAGCATTG
CGAAGCAATGACCCCAACTACATGGCTGGACATATTAAGTTTAAGGGTGTCAACAACCATGCTCTGGTGGATCAGGTATCTCAAAAGGTCATTATGCCCTGCCACACTCG
TTTAGATCGTTCGGAGGCTTGGGCTCCTGGTGATGTTGTAGAGGTGTTTGATAACAACTCATGGAAAATGGCCACAGTCTCTGAGGTTTTGGGGAAAATGCACATTCTTG
TCAGATTACTTGGATCCTCTCAGGAGTTTAAAGTAAGAAAAACCGATATCCGGGTTAGACAATCGTGGAAAGATGATGATAATGCATGGGTTATGGCTGGAAAGGGAAAT
AAAAATTCTAATGGAGGGAAGTTCCATGCCAATCCATTTTTGAACAATAGTCACAATTCAACCTCTCAAGTTCAGAAGACAAACTCAAGGACAACTCTGTGGAAAAAAGA
TGACTGCTTTGCTATTAGGGACCAAAATCTTCAAGATAACTACAATGTGAGAATTCAGAAGAGATCAACCGATTGCTCGTCTAAAGCATTTTATGGATCTAGCCATAAAG
TTAGATTGATTGAGAAAGAGGGTAGATATGTTAAGGTGGTTGGTGCAAATCCAACAGAGATACCTAAACTGCAGGTAGCTCCTGTTTCTTATGCTAGAGATTCTCTGGGT
GAAAGACATAGGCCTGCATCACTGAATCGCAGACTTGGTCGGCATTTGGAATTGGATATCAAGGGTAAGGAACCAGTTAGTCCAGTTCGAGAATTGAACGATGCAGATAG
CATTATGTGCTCTGTTGGTAGTTGCAGCATCAGCAGTGACAACTCAAGTGAGATGCCGTGTGATGTGTCTGCTGGTGTGACTGATCAAATTGCTGGTCACTTCTGTGATG
ATCGATCACCTCATCATTCGGGATACGAAGGAGGACATTGTCTCCCTACCACTGAAGAATTGGCAGCTGAAATCCATAGGTTAGAGTTACATGCCTACCGTTGCACTATC
GAGGCATTCCATGCATCAGGACCTTTGAGTTGGGATCAAGAAGTATTGATCACAAATCTTCGACTTTCGCTTAACATATCGAATGATGAACATTTAATGCAGCTAAAATA
TTTAATTTCTGCAGATACCAGCTGGTTAAGATCGAGACTGGAGTCTTCTCGATGCCAAGCGATGGCGAATGCTCGAGAGGTTATACTTTAA
mRNA sequence
ATGCATAGGTCTTCCAGCACTTCCCGGAGCTTCGAAGAATTCCCAGTTGAGTTGCCGCCAGATTCTATAAGCTCCCCACCATTGAAGAATGAAGCTTCTGGTGCTTTGCC
CATTTTTAAAAGTAATAATGCCTCCAAGAAAGAAATGGGATCACACTTTAGGTCTCCAGGAGAAAATGCAGTTCACTTAATCCCTCTTACTCTGTTTCTTTGTGCTTTAA
TCCTTTGGTACTTTGCGCCACACATCATGAAATTCAAAAAAGGGAGTAAGCTGAAAATATCGAGCAAAAAGAAGGTGGCTTTAGGACCACAATGTTCTATGGAAGCATTG
CGAAGCAATGACCCCAACTACATGGCTGGACATATTAAGTTTAAGGGTGTCAACAACCATGCTCTGGTGGATCAGGTATCTCAAAAGGTCATTATGCCCTGCCACACTCG
TTTAGATCGTTCGGAGGCTTGGGCTCCTGGTGATGTTGTAGAGGTGTTTGATAACAACTCATGGAAAATGGCCACAGTCTCTGAGGTTTTGGGGAAAATGCACATTCTTG
TCAGATTACTTGGATCCTCTCAGGAGTTTAAAGTAAGAAAAACCGATATCCGGGTTAGACAATCGTGGAAAGATGATGATAATGCATGGGTTATGGCTGGAAAGGGAAAT
AAAAATTCTAATGGAGGGAAGTTCCATGCCAATCCATTTTTGAACAATAGTCACAATTCAACCTCTCAAGTTCAGAAGACAAACTCAAGGACAACTCTGTGGAAAAAAGA
TGACTGCTTTGCTATTAGGGACCAAAATCTTCAAGATAACTACAATGTGAGAATTCAGAAGAGATCAACCGATTGCTCGTCTAAAGCATTTTATGGATCTAGCCATAAAG
TTAGATTGATTGAGAAAGAGGGTAGATATGTTAAGGTGGTTGGTGCAAATCCAACAGAGATACCTAAACTGCAGGTAGCTCCTGTTTCTTATGCTAGAGATTCTCTGGGT
GAAAGACATAGGCCTGCATCACTGAATCGCAGACTTGGTCGGCATTTGGAATTGGATATCAAGGGTAAGGAACCAGTTAGTCCAGTTCGAGAATTGAACGATGCAGATAG
CATTATGTGCTCTGTTGGTAGTTGCAGCATCAGCAGTGACAACTCAAGTGAGATGCCGTGTGATGTGTCTGCTGGTGTGACTGATCAAATTGCTGGTCACTTCTGTGATG
ATCGATCACCTCATCATTCGGGATACGAAGGAGGACATTGTCTCCCTACCACTGAAGAATTGGCAGCTGAAATCCATAGGTTAGAGTTACATGCCTACCGTTGCACTATC
GAGGCATTCCATGCATCAGGACCTTTGAGTTGGGATCAAGAAGTATTGATCACAAATCTTCGACTTTCGCTTAACATATCGAATGATGAACATTTAATGCAGCTAAAATA
TTTAATTTCTGCAGATACCAGCTGGTTAAGATCGAGACTGGAGTCTTCTCGATGCCAAGCGATGGCGAATGCTCGAGAGGTTATACTTTAAGCCTTTCTTTCACTTTGTA
TATTCAACTTGGTATACAGAACCAACTGCATAGCCACCACTACACCACAGTCCACAACGGATATTTGGCTTCATATAGTGAAACTTTTGGTAATGTAATTCACTTAACCA
TGTCAGGGATACATCGTATTCAACCATTTATTTTTCAAATTCTTGTTACTGTTGGATATTGTTATTACCTGGGATATGTTTTTAGACTGTACTTTGTTTTATGGTTTTTG
CCTGCTGAAGACAGCATCTAAACAAAAGGCTTTGGCTTTTTAAAGAGGTTCTGGCAGTGGCCATACTAGAGGTCCCATCTCTAACAGCCCTGCTGAGATTCAAGTAGTAC
CAAATGAAACAGAGGAGCTCTTTCCACACAACCACCCAAGTAGCCCATTTCATGATTCCTCATCAAAATTCAATTTTCCATCATTATCATACTCTTCTCTCTCTTAAACA
GAAGAAAATAAACCAAACCCGCAAACCCCTTTTTTGCCCATTATTTTATTCTTAAAATTTCAAGGAAGATGTCATGGTCTAATATCAGGACAAAGAAAACAATCTCACAG
CTCGAGATAAATGTTATTATTTTTCTTTTTTCTTTTTTAGTATAAAAAAAGAACAAATGGTTGAGAGAGAGAGAGAAAGTATGAAGATTGGCATCTTAAAAAGTGGCCAA
GTAGTCATTGTTTCTGCATACATACCCTGCATTTTAATAAAACAAACAAAAATTGAAAAAGAGAGAGTAAAAGAGTGGGTTTGTTGATGGGAAAGTGAATTGATTTTGAT
TTTGTCTTTAAGATCTTAAGAAAGAGGGGAGTTGATGGGCACGAACCCTACAAGGAAAAGGCCACCAGCATTTATGGGGCTCCACCAAATCCTCTGCTATTGTCTCATTA
GCAACTCTATCTGATTGGCATTCCCATGTGATCATGCTCCCTTTCTCACACTGACACTTAGATATATATATGTATATAATAGAAATGTCTCCTTCCAATTCCAACACTCT
TTCTTCTAATCATCTCATATTAGCCATAATCTTCCCCATTAACAATCTCCTCTTTTCCATCCATATCCAAATTAAGCAAATCCCTTAATTGGGTATTGACAAGGAAATAA
AAGTTTGTATCTTTTCTGTTAAATTTTGAACATAATTTGATTCTTTGAAGCAAGGAAGAAATGGTCAGGAAGTCCCATCAAGATATTGGCATCTCCCCTCTCTTTTGATA
TGTGCTTGTATTTTAGTAACAGTTATAGATTCTTTGTTTGGAGAGAGAGAAATGGTTGTTGTTATTGGTTTTTGTTTTTGTTGTTGTCACCTTTATCTCATCATACAGTG
TTTGTTTTGAAAAGCAAAATAGAAAGTACATGTGGCAGATTGCAGACCCCAACTAAAGTCTTCTCAGCAATACACATACATAATACACACACACACACATACAAGTAATC
ATATGGCAGTTAAAATAAAAGGTATTCAGCTGCCTGTAACTGCAACACTACTGAGATATGGTGGATCATCTTTCTGTTCAGAAATGCCAAACCAGAAGAGAGAGAGAGAG
AGAGAGAGCGAGAGAGAGAGGGAGAGAGAGAGATTTAATCAAAGGGAAGACATTATCTAGGCAGCTGCCTTTTGCTTTGGCATAAGAGCCAATTTTGGGAAACATAAGCC
ATTTACTTATATTGATTAGGGTTTAATGGTTGATATTCCCAAGGGGATTCTTTTTTCAATTAAATTGTATAAACAAATCTCTCTTTTCTCTCTGTTTCTTTGTGTCTGAT
CTGAACACACACAATCATGGCATGGTGGAGAGATAGAGACATATCATATGTAAATAGTACACATGAATTAGTTTCCTCTCAAATAGGAG
Protein sequence
MHRSSSTSRSFEEFPVELPPDSISSPPLKNEASGALPIFKSNNASKKEMGSHFRSPGENAVHLIPLTLFLCALILWYFAPHIMKFKKGSKLKISSKKKVALGPQCSMEAL
RSNDPNYMAGHIKFKGVNNHALVDQVSQKVIMPCHTRLDRSEAWAPGDVVEVFDNNSWKMATVSEVLGKMHILVRLLGSSQEFKVRKTDIRVRQSWKDDDNAWVMAGKGN
KNSNGGKFHANPFLNNSHNSTSQVQKTNSRTTLWKKDDCFAIRDQNLQDNYNVRIQKRSTDCSSKAFYGSSHKVRLIEKEGRYVKVVGANPTEIPKLQVAPVSYARDSLG
ERHRPASLNRRLGRHLELDIKGKEPVSPVRELNDADSIMCSVGSCSISSDNSSEMPCDVSAGVTDQIAGHFCDDRSPHHSGYEGGHCLPTTEELAAEIHRLELHAYRCTI
EAFHASGPLSWDQEVLITNLRLSLNISNDEHLMQLKYLISADTSWLRSRLESSRCQAMANAREVIL
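
The CDS and protein sequences above can be cross-checked by translating the CDS. A minimal sketch, assuming the standard nuclear genetic code (NCBI translation table 1) and reading frame 1. The helper `translate` is hypothetical and is shown only on the first 30 and last 15 nucleotides of the CDS, so that the block stays short.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna: str) -> str:
    """Translate a DNA string in frame 1; '*' marks a stop codon."""
    dna = dna.upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))

# First 30 and last 15 nucleotides of the CDS listed above.
cds_start = "ATGCATAGGTCTTCCAGCACTTCCCGGAGC"
cds_end = "GAGGTTATACTTTAA"

print(translate(cds_start))
print(translate(cds_end))
```

The first fragment translates to MHRSSSTSRS and the last to EVIL followed by a stop, matching the start and end of the protein sequence. Passing the full CDS, with line breaks removed, to the same function would extend the check to the whole record.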