CuGenDBv2

PI0028897 (gene) of Melon (PI 482460) v1 genome

Gene ID: PI0028897
Organism: Cucumis metuliferus PI 482460 (Melon (PI 482460) v1)
Description: HTH myb-type domain-containing protein
Genome location: chr08:16836209..16846248
RNA-Seq Expression: PI0028897
Synteny: PI0028897
Gene Ontology terms: NA
InterPro domains:
IPR001005 - SANT/Myb domain
IPR009057 - Homeobox-like domain superfamily
IPR017930 - Myb domain


Homology
GenBank top hits | e value | % identity | Alignment
XP_004143262.2 uncharacterized protein LOC101219571 isoform X2 [Cucumis sativus] | 8.8e-262 | 94.57
Query:  MDQEVHFCQKFTNMKSHWVKVEGPFLPAPLNDSNEVEDLLVESKSEHVLGNCLRVQDFSCDFGYGIQTNGGGLDSNSKQGGEHELKFGDFDQLLDDANEV
        MDQEVHFCQKFTNMKSHWVKVE PFLP PLNDSNEVEDLLVESKSEHVLG+CLRVQDFSCDFGYGIQTN GGLDSNSKQGGEHELKFGD DQLLDDANEV
Subjt:  MDQEVHFCQKFTNMKSHWVKVEGPFLPAPLNDSNEVEDLLVESKSEHVLGNCLRVQDFSCDFGYGIQTNGGGLDSNSKQGGEHELKFGDFDQLLDDANEV

Query:  GEFHATNNLPNTYAEVAENSFRQNRGFQLGNLSSESKSQGPSRSDTDAFGISELSATMVIEAEFNNTPVERGLTHELSPGLGTKGRCVTPLEGNICGTIL
        GEFHATNNLPNTYAEVAENSFRQNRGFQL N SSE +SQGPSR DTDAFGISELSATMV+EAEF+NTPVERGLTHELSPGLGTKGRCVTPLEGNICGTIL
Subjt:  GEFHATNNLPNTYAEVAENSFRQNRGFQLGNLSSESKSQGPSRSDTDAFGISELSATMVIEAEFNNTPVERGLTHELSPGLGTKGRCVTPLEGNICGTIL

Query:  DNRNIHKFNTNENYIENGDLSDENVKGDIVANELASCSRERRLRKPTRRYIEEFVDSKSENNKGRRKPPTKDKYLKVMSTEESNHIRHEVQMLTPRSDSQ
        DNRNIHKFNTNENYIENGDLSDENVKGDIVANELASCSRERRLRKPTRRYIEE +DSKSE+NKGRRK P KDKYLKVMSTEESNHIRHEVQM TPRSDSQ
Subjt:  DNRNIHKFNTNENYIENGDLSDENVKGDIVANELASCSRERRLRKPTRRYIEEFVDSKSENNKGRRKPPTKDKYLKVMSTEESNHIRHEVQMLTPRSDSQ

Query:  CGTSVPMQSQSERRHPKKHVPVSGFLSEDESSATECKNVYSSAKRCKKYDRRRQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKW
        CGTSVP+Q +SERRHPKKHVP S FLSEDE SATECKNVYSSAKRCKKYDRRRQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKW
Subjt:  CGTSVPMQSQSERRHPKKHVPVSGFLSEDESSATECKNVYSSAKRCKKYDRRRQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKW

Query:  RNLLRASCVNIQNKKGIEGKQTHASRPLPKSLLQRVYELANIYPYPKERGPKSVKAITPPMDLIESNSLSFNWGRKKYE
        RNLLRASCVNIQNKKGIEGKQTHASRPLPKSLLQRVYELANIYPYPKERGPKSVK ITPPMDLIESNSLSFNWGRKKY+
Subjt:  RNLLRASCVNIQNKKGIEGKQTHASRPLPKSLLQRVYELANIYPYPKERGPKSVKAITPPMDLIESNSLSFNWGRKKYE

XP_008449224.1 PREDICTED: uncharacterized protein LOC103491166 isoform X1 [Cucumis melo] | 2.8e-268 | 95.82
Query:  MDQEVHFCQKFTNMKSHWVKVEGPFLPAPLNDSNEVEDLLVESKSEHVLGNCLRVQDFSCDFGYGIQTNGGGLDSNSKQGGEHELKFGDFDQLLDDANEV
        MDQEVHFCQKFTNMKSHWVKVEGPFLPAPLNDSNEVEDLLVESKSEHVLGNCLRVQDFSCDFGYGIQTNGGGLDSNSKQGGEHELKFGDFDQLLDDANEV
Subjt:  MDQEVHFCQKFTNMKSHWVKVEGPFLPAPLNDSNEVEDLLVESKSEHVLGNCLRVQDFSCDFGYGIQTNGGGLDSNSKQGGEHELKFGDFDQLLDDANEV

Query:  GEFHATNNLPNTYAEVAENSFRQNRGFQLGNLSSESKSQGPSRSDTDAFGISELSATMVIEAEFNNTPVERGLTHELSPGLGTKGRCVTPLEGNICGTIL
        GEFHATNNLPNTYAEVAENSFR+NR FQLGN SSE+KS GPSR DTDAFGISELSATMV+EAEFNNTPVERGLTHELSPGLGTKGRCVTPLEGNICGTIL
Subjt:  GEFHATNNLPNTYAEVAENSFRQNRGFQLGNLSSESKSQGPSRSDTDAFGISELSATMVIEAEFNNTPVERGLTHELSPGLGTKGRCVTPLEGNICGTIL

Query:  DNRNIHKFNTNENYIENGDLSDENVKGDIVANELASCSRERRLRKPTRRYIEEFVDSKSENNKGRRKPPTKDKYLKVMSTEESNHIRHEVQMLTPRSDSQ
        DNRNIHKFNTNENYIENGDLSDENVKGDIVANELASCSRERRLRKPTRRYIEEF+DSKSE+NKGRRK P KDKYLKVMSTEES HIRHEVQM+ PRSDSQ
Subjt:  DNRNIHKFNTNENYIENGDLSDENVKGDIVANELASCSRERRLRKPTRRYIEEFVDSKSENNKGRRKPPTKDKYLKVMSTEESNHIRHEVQMLTPRSDSQ

Query:  CGTSVPMQSQSERRHPKKHVPVSGFLSEDESSATECKNVYSSAKRCKKYDRRRQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKW
        CGTSVP+Q +SERRHP KHVPVSGFLSEDESSATECKNVYSSA+RCKKYDRRRQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKW
Subjt:  CGTSVPMQSQSERRHPKKHVPVSGFLSEDESSATECKNVYSSAKRCKKYDRRRQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKW

Query:  RNLLRASCVNIQNKKGIEGKQTHASRPLPKSLLQRVYELANIYPYPKERGPKSVKAITPPMDLIESNSLSFNWGRKKYE
        RNLLRASCVNIQNKKG+EGKQTHASRPLPKSLLQRVYELANIYPYPKERGPKSVKAITPPMDLIESNSLSFNWGRKKYE
Subjt:  RNLLRASCVNIQNKKGIEGKQTHASRPLPKSLLQRVYELANIYPYPKERGPKSVKAITPPMDLIESNSLSFNWGRKKYE

XP_008449225.1 PREDICTED: uncharacterized protein LOC103491166 isoform X2 [Cucumis melo] | 2.6e-266 | 95.62
Query:  MDQEVHFCQKFTNMKSHWVKVEGPFLPAPLNDSNEVEDLLVESKSEHVLGNCLRVQDFSCDFGYGIQTNGGGLDSNSKQGGEHELKFGDFDQLLDDANEV
        MDQEVHFCQKFTNMKSHWVKVEGPFLPAPLNDSNEVEDLLVESKSEHVLGNCLRVQDFSCDFGYGIQTN GGLDSNSKQGGEHELKFGDFDQLLDDANEV
Subjt:  MDQEVHFCQKFTNMKSHWVKVEGPFLPAPLNDSNEVEDLLVESKSEHVLGNCLRVQDFSCDFGYGIQTNGGGLDSNSKQGGEHELKFGDFDQLLDDANEV

Query:  GEFHATNNLPNTYAEVAENSFRQNRGFQLGNLSSESKSQGPSRSDTDAFGISELSATMVIEAEFNNTPVERGLTHELSPGLGTKGRCVTPLEGNICGTIL
        GEFHATNNLPNTYAEVAENSFR+NR FQLGN SSE+KS GPSR DTDAFGISELSATMV+EAEFNNTPVERGLTHELSPGLGTKGRCVTPLEGNICGTIL
Subjt:  GEFHATNNLPNTYAEVAENSFRQNRGFQLGNLSSESKSQGPSRSDTDAFGISELSATMVIEAEFNNTPVERGLTHELSPGLGTKGRCVTPLEGNICGTIL

Query:  DNRNIHKFNTNENYIENGDLSDENVKGDIVANELASCSRERRLRKPTRRYIEEFVDSKSENNKGRRKPPTKDKYLKVMSTEESNHIRHEVQMLTPRSDSQ
        DNRNIHKFNTNENYIENGDLSDENVKGDIVANELASCSRERRLRKPTRRYIEEF+DSKSE+NKGRRK P KDKYLKVMSTEES HIRHEVQM+ PRSDSQ
Subjt:  DNRNIHKFNTNENYIENGDLSDENVKGDIVANELASCSRERRLRKPTRRYIEEFVDSKSENNKGRRKPPTKDKYLKVMSTEESNHIRHEVQMLTPRSDSQ

Query:  CGTSVPMQSQSERRHPKKHVPVSGFLSEDESSATECKNVYSSAKRCKKYDRRRQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKW
        CGTSVP+Q +SERRHP KHVPVSGFLSEDESSATECKNVYSSA+RCKKYDRRRQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKW
Subjt:  CGTSVPMQSQSERRHPKKHVPVSGFLSEDESSATECKNVYSSAKRCKKYDRRRQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKW

Query:  RNLLRASCVNIQNKKGIEGKQTHASRPLPKSLLQRVYELANIYPYPKERGPKSVKAITPPMDLIESNSLSFNWGRKKYE
        RNLLRASCVNIQNKKG+EGKQTHASRPLPKSLLQRVYELANIYPYPKERGPKSVKAITPPMDLIESNSLSFNWGRKKYE
Subjt:  RNLLRASCVNIQNKKGIEGKQTHASRPLPKSLLQRVYELANIYPYPKERGPKSVKAITPPMDLIESNSLSFNWGRKKYE

XP_011657653.1 uncharacterized protein LOC101219571 isoform X1 [Cucumis sativus] | 9.4e-264 | 94.78
Query:  MDQEVHFCQKFTNMKSHWVKVEGPFLPAPLNDSNEVEDLLVESKSEHVLGNCLRVQDFSCDFGYGIQTNGGGLDSNSKQGGEHELKFGDFDQLLDDANEV
        MDQEVHFCQKFTNMKSHWVKVE PFLP PLNDSNEVEDLLVESKSEHVLG+CLRVQDFSCDFGYGIQTNGGGLDSNSKQGGEHELKFGD DQLLDDANEV
Subjt:  MDQEVHFCQKFTNMKSHWVKVEGPFLPAPLNDSNEVEDLLVESKSEHVLGNCLRVQDFSCDFGYGIQTNGGGLDSNSKQGGEHELKFGDFDQLLDDANEV

Query:  GEFHATNNLPNTYAEVAENSFRQNRGFQLGNLSSESKSQGPSRSDTDAFGISELSATMVIEAEFNNTPVERGLTHELSPGLGTKGRCVTPLEGNICGTIL
        GEFHATNNLPNTYAEVAENSFRQNRGFQL N SSE +SQGPSR DTDAFGISELSATMV+EAEF+NTPVERGLTHELSPGLGTKGRCVTPLEGNICGTIL
Subjt:  GEFHATNNLPNTYAEVAENSFRQNRGFQLGNLSSESKSQGPSRSDTDAFGISELSATMVIEAEFNNTPVERGLTHELSPGLGTKGRCVTPLEGNICGTIL

Query:  DNRNIHKFNTNENYIENGDLSDENVKGDIVANELASCSRERRLRKPTRRYIEEFVDSKSENNKGRRKPPTKDKYLKVMSTEESNHIRHEVQMLTPRSDSQ
        DNRNIHKFNTNENYIENGDLSDENVKGDIVANELASCSRERRLRKPTRRYIEE +DSKSE+NKGRRK P KDKYLKVMSTEESNHIRHEVQM TPRSDSQ
Subjt:  DNRNIHKFNTNENYIENGDLSDENVKGDIVANELASCSRERRLRKPTRRYIEEFVDSKSENNKGRRKPPTKDKYLKVMSTEESNHIRHEVQMLTPRSDSQ

Query:  CGTSVPMQSQSERRHPKKHVPVSGFLSEDESSATECKNVYSSAKRCKKYDRRRQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKW
        CGTSVP+Q +SERRHPKKHVP S FLSEDE SATECKNVYSSAKRCKKYDRRRQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKW
Subjt:  CGTSVPMQSQSERRHPKKHVPVSGFLSEDESSATECKNVYSSAKRCKKYDRRRQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKW

Query:  RNLLRASCVNIQNKKGIEGKQTHASRPLPKSLLQRVYELANIYPYPKERGPKSVKAITPPMDLIESNSLSFNWGRKKYE
        RNLLRASCVNIQNKKGIEGKQTHASRPLPKSLLQRVYELANIYPYPKERGPKSVK ITPPMDLIESNSLSFNWGRKKY+
Subjt:  RNLLRASCVNIQNKKGIEGKQTHASRPLPKSLLQRVYELANIYPYPKERGPKSVKAITPPMDLIESNSLSFNWGRKKYE

XP_038881566.1 uncharacterized protein LOC120073047 isoform X1 [Benincasa hispida] | 5.9e-258 | 92.75
Query:  MDQEVHFCQKFTNMKSHWVKVEGPFLPAPLNDSNEVEDLLVESKSEHVLGNCLRVQDFSCDFGYGIQTNG-GGLDSNSKQGGEHELKFGDFDQLLDDANE
        MDQEVHFCQKFTNMKSHWV+VEGPFLPAPLNDSNEVEDLLVE KS+HVLGNCLRVQDFSCDFGYGIQTNG GGLDSNSKQGGEHELKFGD DQLLDDANE
Subjt:  MDQEVHFCQKFTNMKSHWVKVEGPFLPAPLNDSNEVEDLLVESKSEHVLGNCLRVQDFSCDFGYGIQTNG-GGLDSNSKQGGEHELKFGDFDQLLDDANE

Query:  VGEFHATNNLPNTYAEVAENSFRQNRGFQLGNLSSESKSQGPSRSDTDAFGISELSATMVIEAEFNNTPVERGLTHELSPGLGTKGRCV--TPLEGNICG
        VGEFHATNNL +TYAEVAENSFRQNRG QLGN SS SKSQGPSRSDTDAFGISELSATMV+E EFNNTPVERGLTHELSPGL TKGRCV  TPLEGNIC 
Subjt:  VGEFHATNNLPNTYAEVAENSFRQNRGFQLGNLSSESKSQGPSRSDTDAFGISELSATMVIEAEFNNTPVERGLTHELSPGLGTKGRCV--TPLEGNICG

Query:  TILDNRNIHKFNTNENYIENGDLSDENVKGDIVANELASCSRERRLRKPTRRYIEEFVDSKSENNKGRRKPPTKDKYLKVMSTEESNHIRHEVQMLTPRS
        TILDNRNIHKFNTNENYIENGDLSDENVKGDIVAN+LASCSRERRLRKPTRRYIEEF DSKSENNKGRRKPPTKDKYLKV STEESNHIRHEVQMLTPRS
Subjt:  TILDNRNIHKFNTNENYIENGDLSDENVKGDIVANELASCSRERRLRKPTRRYIEEFVDSKSENNKGRRKPPTKDKYLKVMSTEESNHIRHEVQMLTPRS

Query:  DSQCGTSVPMQSQSERRHPKKHVPVSGFLSEDESSATECKNVYSSAKRCKKYDRRR-QKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDL
        +  CGTSVP+QS+S+RRHPKKHVPVSGFLSEDESSATECKNVYSS KRCKKYDRRR QKMW+LTEVMRLVDGIAEYGTGRWT IKKHLFASSPHRTPIDL
Subjt:  DSQCGTSVPMQSQSERRHPKKHVPVSGFLSEDESSATECKNVYSSAKRCKKYDRRR-QKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDL

Query:  RDKWRNLLRASCVNIQNKKGIEGKQTHASRPLPKSLLQRVYELANIYPYPKERGPKSVKAITPPMDLIESNSLSFNWGRKKYE
        RDKWRNLLRASCVNIQN+KGIE KQ+HASRPLPKSLLQRVYELANIYPYPKER PKSVKA TPPM LIESNSLSFNWGRKKYE
Subjt:  RDKWRNLLRASCVNIQNKKGIEGKQTHASRPLPKSLLQRVYELANIYPYPKERGPKSVKAITPPMDLIESNSLSFNWGRKKYE

TrEMBL top hits | e value | % identity | Alignment
A0A1S3BKX9 uncharacterized protein LOC103491166 isoform X1 | 1.4e-268 | 95.82
Query:  MDQEVHFCQKFTNMKSHWVKVEGPFLPAPLNDSNEVEDLLVESKSEHVLGNCLRVQDFSCDFGYGIQTNGGGLDSNSKQGGEHELKFGDFDQLLDDANEV
        MDQEVHFCQKFTNMKSHWVKVEGPFLPAPLNDSNEVEDLLVESKSEHVLGNCLRVQDFSCDFGYGIQTNGGGLDSNSKQGGEHELKFGDFDQLLDDANEV
Subjt:  MDQEVHFCQKFTNMKSHWVKVEGPFLPAPLNDSNEVEDLLVESKSEHVLGNCLRVQDFSCDFGYGIQTNGGGLDSNSKQGGEHELKFGDFDQLLDDANEV

Query:  GEFHATNNLPNTYAEVAENSFRQNRGFQLGNLSSESKSQGPSRSDTDAFGISELSATMVIEAEFNNTPVERGLTHELSPGLGTKGRCVTPLEGNICGTIL
        GEFHATNNLPNTYAEVAENSFR+NR FQLGN SSE+KS GPSR DTDAFGISELSATMV+EAEFNNTPVERGLTHELSPGLGTKGRCVTPLEGNICGTIL
Subjt:  GEFHATNNLPNTYAEVAENSFRQNRGFQLGNLSSESKSQGPSRSDTDAFGISELSATMVIEAEFNNTPVERGLTHELSPGLGTKGRCVTPLEGNICGTIL

Query:  DNRNIHKFNTNENYIENGDLSDENVKGDIVANELASCSRERRLRKPTRRYIEEFVDSKSENNKGRRKPPTKDKYLKVMSTEESNHIRHEVQMLTPRSDSQ
        DNRNIHKFNTNENYIENGDLSDENVKGDIVANELASCSRERRLRKPTRRYIEEF+DSKSE+NKGRRK P KDKYLKVMSTEES HIRHEVQM+ PRSDSQ
Subjt:  DNRNIHKFNTNENYIENGDLSDENVKGDIVANELASCSRERRLRKPTRRYIEEFVDSKSENNKGRRKPPTKDKYLKVMSTEESNHIRHEVQMLTPRSDSQ

Query:  CGTSVPMQSQSERRHPKKHVPVSGFLSEDESSATECKNVYSSAKRCKKYDRRRQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKW
        CGTSVP+Q +SERRHP KHVPVSGFLSEDESSATECKNVYSSA+RCKKYDRRRQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKW
Subjt:  CGTSVPMQSQSERRHPKKHVPVSGFLSEDESSATECKNVYSSAKRCKKYDRRRQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKW

Query:  RNLLRASCVNIQNKKGIEGKQTHASRPLPKSLLQRVYELANIYPYPKERGPKSVKAITPPMDLIESNSLSFNWGRKKYE
        RNLLRASCVNIQNKKG+EGKQTHASRPLPKSLLQRVYELANIYPYPKERGPKSVKAITPPMDLIESNSLSFNWGRKKYE
Subjt:  RNLLRASCVNIQNKKGIEGKQTHASRPLPKSLLQRVYELANIYPYPKERGPKSVKAITPPMDLIESNSLSFNWGRKKYE

A0A1S3BLJ8 uncharacterized protein LOC103491166 isoform X3 | 1.9e-217 | 95.21
Query:  MDQEVHFCQKFTNMKSHWVKVEGPFLPAPLNDSNEVEDLLVESKSEHVLGNCLRVQDFSCDFGYGIQTNGGGLDSNSKQGGEHELKFGDFDQLLDDANEV
        MDQEVHFCQKFTNMKSHWVKVEGPFLPAPLNDSNEVEDLLVESKSEHVLGNCLRVQDFSCDFGYGIQTNGGGLDSNSKQGGEHELKFGDFDQLLDDANEV
Subjt:  MDQEVHFCQKFTNMKSHWVKVEGPFLPAPLNDSNEVEDLLVESKSEHVLGNCLRVQDFSCDFGYGIQTNGGGLDSNSKQGGEHELKFGDFDQLLDDANEV

Query:  GEFHATNNLPNTYAEVAENSFRQNRGFQLGNLSSESKSQGPSRSDTDAFGISELSATMVIEAEFNNTPVERGLTHELSPGLGTKGRCVTPLEGNICGTIL
        GEFHATNNLPNTYAEVAENSFR+NR FQLGN SSE+KS GPSR DTDAFGISELSATMV+EAEFNNTPVERGLTHELSPGLGTKGRCVTPLEGNICGTIL
Subjt:  GEFHATNNLPNTYAEVAENSFRQNRGFQLGNLSSESKSQGPSRSDTDAFGISELSATMVIEAEFNNTPVERGLTHELSPGLGTKGRCVTPLEGNICGTIL

Query:  DNRNIHKFNTNENYIENGDLSDENVKGDIVANELASCSRERRLRKPTRRYIEEFVDSKSENNKGRRKPPTKDKYLKVMSTEESNHIRHEVQMLTPRSDSQ
        DNRNIHKFNTNENYIENGDLSDENVKGDIVANELASCSRERRLRKPTRRYIEEF+DSKSE+NKGRRK P KDKYLKVMSTEES HIRHEVQM+ PRSDSQ
Subjt:  DNRNIHKFNTNENYIENGDLSDENVKGDIVANELASCSRERRLRKPTRRYIEEFVDSKSENNKGRRKPPTKDKYLKVMSTEESNHIRHEVQMLTPRSDSQ

Query:  CGTSVPMQSQSERRHPKKHVPVSGFLSEDESSATECKNVYSSAKRCKKYDRRRQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLR
        CGTSVP+Q +SERRHP KHVPVSGFLSEDESSATECKNVYSSA+RCKKYDRRRQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLR
Subjt:  CGTSVPMQSQSERRHPKKHVPVSGFLSEDESSATECKNVYSSAKRCKKYDRRRQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLR

A0A1S3BLK0 uncharacterized protein LOC103491166 isoform X2 | 1.3e-266 | 95.62
Query:  MDQEVHFCQKFTNMKSHWVKVEGPFLPAPLNDSNEVEDLLVESKSEHVLGNCLRVQDFSCDFGYGIQTNGGGLDSNSKQGGEHELKFGDFDQLLDDANEV
        MDQEVHFCQKFTNMKSHWVKVEGPFLPAPLNDSNEVEDLLVESKSEHVLGNCLRVQDFSCDFGYGIQTN GGLDSNSKQGGEHELKFGDFDQLLDDANEV
Subjt:  MDQEVHFCQKFTNMKSHWVKVEGPFLPAPLNDSNEVEDLLVESKSEHVLGNCLRVQDFSCDFGYGIQTNGGGLDSNSKQGGEHELKFGDFDQLLDDANEV

Query:  GEFHATNNLPNTYAEVAENSFRQNRGFQLGNLSSESKSQGPSRSDTDAFGISELSATMVIEAEFNNTPVERGLTHELSPGLGTKGRCVTPLEGNICGTIL
        GEFHATNNLPNTYAEVAENSFR+NR FQLGN SSE+KS GPSR DTDAFGISELSATMV+EAEFNNTPVERGLTHELSPGLGTKGRCVTPLEGNICGTIL
Subjt:  GEFHATNNLPNTYAEVAENSFRQNRGFQLGNLSSESKSQGPSRSDTDAFGISELSATMVIEAEFNNTPVERGLTHELSPGLGTKGRCVTPLEGNICGTIL

Query:  DNRNIHKFNTNENYIENGDLSDENVKGDIVANELASCSRERRLRKPTRRYIEEFVDSKSENNKGRRKPPTKDKYLKVMSTEESNHIRHEVQMLTPRSDSQ
        DNRNIHKFNTNENYIENGDLSDENVKGDIVANELASCSRERRLRKPTRRYIEEF+DSKSE+NKGRRK P KDKYLKVMSTEES HIRHEVQM+ PRSDSQ
Subjt:  DNRNIHKFNTNENYIENGDLSDENVKGDIVANELASCSRERRLRKPTRRYIEEFVDSKSENNKGRRKPPTKDKYLKVMSTEESNHIRHEVQMLTPRSDSQ

Query:  CGTSVPMQSQSERRHPKKHVPVSGFLSEDESSATECKNVYSSAKRCKKYDRRRQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKW
        CGTSVP+Q +SERRHP KHVPVSGFLSEDESSATECKNVYSSA+RCKKYDRRRQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKW
Subjt:  CGTSVPMQSQSERRHPKKHVPVSGFLSEDESSATECKNVYSSAKRCKKYDRRRQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKW

Query:  RNLLRASCVNIQNKKGIEGKQTHASRPLPKSLLQRVYELANIYPYPKERGPKSVKAITPPMDLIESNSLSFNWGRKKYE
        RNLLRASCVNIQNKKG+EGKQTHASRPLPKSLLQRVYELANIYPYPKERGPKSVKAITPPMDLIESNSLSFNWGRKKYE
Subjt:  RNLLRASCVNIQNKKGIEGKQTHASRPLPKSLLQRVYELANIYPYPKERGPKSVKAITPPMDLIESNSLSFNWGRKKYE

A0A6J1CRG2 uncharacterized protein LOC111013581 isoform X2 | 1.1e-228 | 83.16
Query:  MDQEVHFCQKFTNMKSHWVKVEGPFLPAPLNDSNEVEDLLVESKSEHVLGNCLRVQDFSCDFGYGIQTNGGGLDSNSKQGGEHELKFGDFDQLLDDANEV
        MDQEVHFCQKFTNMKSHWVKV+G FLPAPLN+ NEVE LLVE KS HVLG+CLR QDFSCDF YGIQTN GGLDSNSKQ GEHELKF D DQLL D NEV
Subjt:  MDQEVHFCQKFTNMKSHWVKVEGPFLPAPLNDSNEVEDLLVESKSEHVLGNCLRVQDFSCDFGYGIQTNGGGLDSNSKQGGEHELKFGDFDQLLDDANEV

Query:  GEFHATNNLPNTYAEVAENSFRQNRGFQLGNLSSESKSQGPSRSDTDAFGISELSATMVIEAEFNN-TPVERGLTHELSPGLGTKGRCVTPLEGNICGTI
         EFHATNNLPNTY EVAENSFR+NRG QLGNLSSESKSQG SR+DT+AF ISELSA MV EAE NN TPV+RGLTHEL  GL TKGRC TPL+G+IC TI
Subjt:  GEFHATNNLPNTYAEVAENSFRQNRGFQLGNLSSESKSQGPSRSDTDAFGISELSATMVIEAEFNN-TPVERGLTHELSPGLGTKGRCVTPLEGNICGTI

Query:  LDNRNIHKFNTNENYIENGDLSDENVKGDIVANELASCSRERRLRKPTRRYIEEFVDSKSENNKGRRKPPTKDKYLKVMSTEESNHIRHEVQMLTPRSDS
        LDN NIHKF+TNE  +ENG LSDENVKG+I A++LA CSR+RRLRKPTRRYIEEF DSKSE++KG+RKPPTKDKY+KV S EESNHIRH+VQMLTP  +S
Subjt:  LDNRNIHKFNTNENYIENGDLSDENVKGDIVANELASCSRERRLRKPTRRYIEEFVDSKSENNKGRRKPPTKDKYLKVMSTEESNHIRHEVQMLTPRSDS

Query:  QCGTSVPMQSQSERRHPKKHVPVSGFLSEDESSATECKNVYSSAKRCKKYDRRR-QKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRD
         CGTS+P+QS+S+RR PKKHVPVSGFLSE+ESSATECK VYSSAKRCKK+DRR+ QKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFA+SP+RTPIDLRD
Subjt:  QCGTSVPMQSQSERRHPKKHVPVSGFLSEDESSATECKNVYSSAKRCKKYDRRR-QKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRD

Query:  KWRNLLRASCVNIQNKKGIEGKQTHASRPLPKSLLQRVYELANIYPYPKERGPKSVKAITPPMDLIESNSLSFNWGRKKYE
        KWRNLLRASCVNIQN+ GIE KQ+HASRPLPKSLLQRVYELANIYPYPKER PKSVKA T PM LIESNSLSFNWGRKKY+
Subjt:  KWRNLLRASCVNIQNKKGIEGKQTHASRPLPKSLLQRVYELANIYPYPKERGPKSVKAITPPMDLIESNSLSFNWGRKKYE

A0A6J1CRQ1 uncharacterized protein LOC111013581 isoform X1 | 2.6e-227 | 82.99
Query:  MDQEVHFCQKFTNMKSHWVKVEGPFLPAPLNDSNEVEDLLVESKSEHVLGNCLRVQDFSCDFGYGIQTNGGGLDSNSKQGGEHELKFGDFDQLLDDANEV
        MDQEVHFCQKFTNMKSHWVKV+G FLPAPLN+ NEVE LLVE KS HVLG+CLR QDFSCDF YGIQTN GGLDSNSKQ GEHELKF D DQLL D NEV
Subjt:  MDQEVHFCQKFTNMKSHWVKVEGPFLPAPLNDSNEVEDLLVESKSEHVLGNCLRVQDFSCDFGYGIQTNGGGLDSNSKQGGEHELKFGDFDQLLDDANEV

Query:  GEFHATNNLPNTYAEVAENSFRQNRGFQLGNLSSESKSQGPSRSDTDAFGISELSATMVIEAEFNN-TPVERGLTHELSPGLGTKGRCVTPLEGNICGTI
         EFHATNNLPNTY EVAENSFR+NRG QLGNLSSESKSQG SR+DT+AF ISELSA MV EAE NN TPV+RGLTHEL  GL TKGRC TPL+G+IC TI
Subjt:  GEFHATNNLPNTYAEVAENSFRQNRGFQLGNLSSESKSQGPSRSDTDAFGISELSATMVIEAEFNN-TPVERGLTHELSPGLGTKGRCVTPLEGNICGTI

Query:  LDNRNIHKFNTNENYIENGDLSDENVKGDIVANELASCSRERRLRKPTRRYIEEFVDSKSENNKGRRKPPTKDKYLKVMSTEESNHIRHEVQMLTPRSDS
        LDN NIHKF+TNE  +ENG LSDENVKG+I A++LA CSR+RRLRKPTRRYIEEF DSKSE++KG+RKPPTKDKY+KV S EESNHIRH+VQMLTP  +S
Subjt:  LDNRNIHKFNTNENYIENGDLSDENVKGDIVANELASCSRERRLRKPTRRYIEEFVDSKSENNKGRRKPPTKDKYLKVMSTEESNHIRHEVQMLTPRSDS

Query:  QCGTSVPMQSQSERRHPKKHVPVSGFLSEDESSATECKNVYSSAKRCKKYDRRR-QKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLR-
         CGTS+P+QS+S+RR PKKHVPVSGFLSE+ESSATECK VYSSAKRCKK+DRR+ QKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFA+SP+RTPIDLR 
Subjt:  QCGTSVPMQSQSERRHPKKHVPVSGFLSEDESSATECKNVYSSAKRCKKYDRRR-QKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLR-

Query:  DKWRNLLRASCVNIQNKKGIEGKQTHASRPLPKSLLQRVYELANIYPYPKERGPKSVKAITPPMDLIESNSLSFNWGRKKYE
        DKWRNLLRASCVNIQN+ GIE KQ+HASRPLPKSLLQRVYELANIYPYPKER PKSVKA T PM LIESNSLSFNWGRKKY+
Subjt:  DKWRNLLRASCVNIQNKKGIEGKQTHASRPLPKSLLQRVYELANIYPYPKERGPKSVKAITPPMDLIESNSLSFNWGRKKYE

SwissProt top hits | e value | % identity | Alignment
Q9C7B1 Telomere repeat-binding protein 3 | 2.7e-11 | 34.23
Query:  LSEDESSATECKNVYSSAKRCKKYDRRRQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLLRASCVNIQNKKGIEGKQTHAS
        L E E  A     +    KR +   RR ++ +++TEV  LV  + E GTGRW  +K   F  + HRT +DL+DKW+ L+  + ++ Q ++G         
Subjt:  LSEDESSATECKNVYSSAKRCKKYDRRRQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLLRASCVNIQNKKGIEGKQTHAS

Query:  RPLPKSLLQRV
         P+P+ LL RV
Subjt:  RPLPKSLLQRV

Q9FFY9 Telomere repeat-binding protein 4 | 5.4e-12 | 35.51
Query:  ESSATECKNVYSSAKRCKKYDRRRQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLLRASCVNIQNKKGIEGKQTHASRPLP
        ES A     V    KR +   RR ++ +++TEV  LV  + E GTGRW  +K   F ++ HRT +DL+DKW+ L+  + ++ Q ++G          P+P
Subjt:  ESSATECKNVYSSAKRCKKYDRRRQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLLRASCVNIQNKKGIEGKQTHASRPLP

Query:  KSLLQRV
        + LL RV
Subjt:  KSLLQRV

Q9LL45 Telomere-binding protein 1 | 3.3e-09 | 35.42
Query:  SSAKRCKKYDRRRQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLLRASCVNIQNKKGIEGKQTHASRPLPKSLLQRV
        S +KR     RR ++ +T+ EV  LV+ +   GTGRW  +K   F +  HRT +DL+DKW+ L+  + +  Q ++G          P+P+ LL RV
Subjt:  SSAKRCKKYDRRRQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLLRASCVNIQNKKGIEGKQTHASRPLPKSLLQRV

Q9M347 Telomere repeat-binding protein 6 | 8.6e-10 | 32.46
Query:  SEDESSATECKNV----YSSAKRCKKYDRRRQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLLRASCVNIQNKKGIEGKQT
        S D + A   K+V       A + +   RR ++ +T++EV  LV  +   GTGRW  +K H F    HRT +DL+DKW+ L+  + ++ + ++G      
Subjt:  SEDESSATECKNV----YSSAKRCKKYDRRRQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLLRASCVNIQNKKGIEGKQT

Query:  HASRPLPKSLLQRV
            P+P+ LL RV
Subjt:  HASRPLPKSLLQRV

Q9SNB9 Telomere repeat-binding protein 2 | 6.6e-10 | 36.05
Query:  RRRQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLLRASCVNIQNKKGIEGKQTHASRPLPKSLLQRV
        RR ++ +++TEV  LV  + + GTGRW  +K   F  + HRT +DL+DKW+ L+  + ++ Q ++G          P+P+ LL RV
Subjt:  RRRQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLLRASCVNIQNKKGIEGKQTHASRPLPKSLLQRV

Arabidopsis top hits | e value | % identity | Alignment
AT1G17460.1 TRF-like 3 | 2.8e-16 | 43.33
Query:  RRRQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLLRASCVNIQNKKGIEGKQTHASRPLPKSLLQRVYELA
        R+  + WT++EV +LV+G+++YG G+WT IKK  F+   HRT +DL+DKWRNL +AS  N        G + H S  +P  ++ +V ELA
Subjt:  RRRQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLLRASCVNIQNKKGIEGKQTHASRPLPKSLLQRVYELA

AT1G72650.1 TRF-like 6 | 3.2e-20 | 28.04
Query:  RRLRKPTRRYIEEFVDSKSENNKGRRKPPTKDKYLKVMSTEESNHIRHEVQMLTPRSDSQCGT--SVPMQSQSERRHPKKHVPV-----SGFLSEDESSA
        +R+RKPTRRYIEE  ++  +    +   P+KD+ L   S   S  +    ++   R  S  G+   VP  S   R  P++++       S +L ED++SA
Subjt:  RRLRKPTRRYIEEFVDSKSENNKGRRKPPTKDKYLKVMSTEESNHIRHEVQMLTPRSDSQCGT--SVPMQSQSERRHPKKHVPV-----SGFLSEDESSA

Query:  TECK----------------NVYSSAKRCKKYD------------------------------------------------RRRQKMWTLTEVMRLVDGI
         E                  +V  SA R  + +                                                R+  + WTL+E+ +LV+G+
Subjt:  TECK----------------NVYSSAKRCKKYD------------------------------------------------RRRQKMWTLTEVMRLVDGI

Query:  AEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLLRASCVNIQNKKGIEGKQTHASRPLPKSLLQRVYELA
        ++YG G+W+ IKKHLF+S  +RT +DL+DKWRNLL+ S         +   + H S  +P  +L RV ELA
Subjt:  AEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLLRASCVNIQNKKGIEGKQTHASRPLPKSLLQRVYELA

AT1G72650.2 TRF-like 6 | 3.2e-20 | 28.04
Query:  RRLRKPTRRYIEEFVDSKSENNKGRRKPPTKDKYLKVMSTEESNHIRHEVQMLTPRSDSQCGT--SVPMQSQSERRHPKKHVPV-----SGFLSEDESSA
        +R+RKPTRRYIEE  ++  +    +   P+KD+ L   S   S  +    ++   R  S  G+   VP  S   R  P++++       S +L ED++SA
Subjt:  RRLRKPTRRYIEEFVDSKSENNKGRRKPPTKDKYLKVMSTEESNHIRHEVQMLTPRSDSQCGT--SVPMQSQSERRHPKKHVPV-----SGFLSEDESSA

Query:  TECK----------------NVYSSAKRCKKYD------------------------------------------------RRRQKMWTLTEVMRLVDGI
         E                  +V  SA R  + +                                                R+  + WTL+E+ +LV+G+
Subjt:  TECK----------------NVYSSAKRCKKYD------------------------------------------------RRRQKMWTLTEVMRLVDGI

Query:  AEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLLRASCVNIQNKKGIEGKQTHASRPLPKSLLQRVYELA
        ++YG G+W+ IKKHLF+S  +RT +DL+DKWRNLL+ S         +   + H S  +P  +L RV ELA
Subjt:  AEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLLRASCVNIQNKKGIEGKQTHASRPLPKSLLQRVYELA

AT2G37025.1 TRF-like 8 | 1.6e-27 | 43.06
Query:  RRHPKK---HVPVSGFLSEDESSATECKNVYSSAK--RCKKYDRRRQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLLRAS
        R+ P K   H  +    S+D+ + +E ++  S  K  R K   R+ Q++WTL EVM LVDGI+ +G G+WT IK H F  + HR P+D+RDKWRNLL+AS
Subjt:  RRHPKK---HVPVSGFLSEDESSATECKNVYSSAK--RCKKYDRRRQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLLRAS

Query:  CVNIQNKKGIEGKQTHASRPLPKSLLQRVYELANIYPYPKERGP
             N    E K+   +R +PK +L RV ELA+++PYP  + P
Subjt:  CVNIQNKKGIEGKQTHASRPLPKSLLQRVYELANIYPYPKERGP

AT2G37025.2 TRF-like 8 | 1.6e-27 | 43.06
Query:  RRHPKK---HVPVSGFLSEDESSATECKNVYSSAK--RCKKYDRRRQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLLRAS
        R+ P K   H  +    S+D+ + +E ++  S  K  R K   R+ Q++WTL EVM LVDGI+ +G G+WT IK H F  + HR P+D+RDKWRNLL+AS
Subjt:  RRHPKK---HVPVSGFLSEDESSATECKNVYSSAK--RCKKYDRRRQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLLRAS

Query:  CVNIQNKKGIEGKQTHASRPLPKSLLQRVYELANIYPYPKERGP
             N    E K+   +R +PK +L RV ELA+++PYP  + P
Subjt:  CVNIQNKKGIEGKQTHASRPLPKSLLQRVYELANIYPYPKERGP


Sequences
CDS sequence
ATGGATCAAGAAGTGCATTTCTGCCAGAAGTTCACAAATATGAAATCTCATTGGGTAAAAGTGGAGGGACCTTTTCTTCCTGCACCACTAAATGATTCAAATGAAGTTGA
GGATTTACTTGTGGAGTCCAAAAGCGAGCATGTTTTAGGAAATTGTTTGAGAGTTCAAGATTTCTCTTGCGACTTTGGCTATGGAATACAAACAAACGGTGGTGGATTGG
ATTCTAATAGCAAGCAGGGAGGCGAACATGAACTTAAATTTGGAGATTTTGACCAACTGCTGGATGATGCCAATGAAGTAGGGGAATTTCATGCAACAAACAATCTGCCA
AATACATATGCCGAAGTTGCTGAAAATTCTTTCAGACAGAATAGGGGATTTCAATTGGGAAACTTAAGTTCAGAGAGTAAATCTCAGGGACCAAGCAGGAGTGATACTGA
TGCTTTTGGAATATCAGAATTATCAGCAACAATGGTAATAGAGGCTGAATTCAATAATACACCTGTTGAGAGGGGTTTAACTCATGAGTTGTCCCCTGGTCTGGGGACCA
AAGGTAGGTGTGTAACACCACTTGAAGGAAACATCTGTGGTACAATACTTGATAATAGAAATATCCATAAGTTCAATACTAATGAAAACTATATAGAAAATGGCGATTTA
TCTGATGAAAATGTGAAGGGTGATATTGTGGCAAACGAACTTGCCAGTTGTTCAAGGGAGAGGAGATTGCGTAAGCCTACACGAAGATACATTGAAGAATTTGTAGATTC
GAAGTCTGAAAATAACAAGGGAAGGCGAAAACCTCCTACAAAAGATAAATACCTCAAAGTGATGTCTACTGAAGAATCCAATCACATTAGACATGAGGTACAAATGTTGA
CGCCTAGAAGTGATTCACAATGTGGTACGTCTGTTCCAATGCAGTCTCAATCTGAAAGAAGACATCCAAAGAAGCATGTGCCAGTTTCAGGATTTCTATCGGAAGATGAA
TCGTCTGCAACTGAGTGTAAGAATGTTTATTCATCTGCTAAAAGATGTAAAAAGTATGATAGGAGGCGCCAGAAGATGTGGACCCTTACTGAAGTAATGCGATTAGTTGA
TGGAATCGCCGAATATGGAACTGGCCGCTGGACTCATATTAAGAAGCATCTATTTGCATCTTCTCCTCATCGCACACCTATAGATCTCAGGGACAAATGGCGAAATCTTC
TGAGAGCTAGCTGTGTTAACATACAGAACAAAAAAGGGATTGAAGGGAAGCAGACACATGCCTCACGTCCACTACCAAAGTCCCTGCTCCAACGTGTTTATGAATTGGCC
AATATCTATCCATACCCAAAGGAGCGCGGTCCAAAATCAGTCAAAGCAATTACACCTCCCATGGATCTTATTGAAAGTAACTCTTTGTCATTCAATTGGGGGCGGAAGAA
GTATGAATGA
mRNA sequence
GCGGTTTTAAAAAAACCACATTTCTATTTTTCCGGTTTCGATCCTCTGAACGCCACTCTCTCCCCAAATCTTCTCTTCAACTAAGCTCGGATCTTCACCTTCCTCGCTGG
ATTGATTGAGCTTAAAAGTAAACAAATTATGGATCAAGAAGTGCATTTCTGCCAGAAGTTCACAAATATGAAATCTCATTGGGTAAAAGTGGAGGGACCTTTTCTTCCTG
CACCACTAAATGATTCAAATGAAGTTGAGGATTTACTTGTGGAGTCCAAAAGCGAGCATGTTTTAGGAAATTGTTTGAGAGTTCAAGATTTCTCTTGCGACTTTGGCTAT
GGAATACAAACAAACGGTGGTGGATTGGATTCTAATAGCAAGCAGGGAGGCGAACATGAACTTAAATTTGGAGATTTTGACCAACTGCTGGATGATGCCAATGAAGTAGG
GGAATTTCATGCAACAAACAATCTGCCAAATACATATGCCGAAGTTGCTGAAAATTCTTTCAGACAGAATAGGGGATTTCAATTGGGAAACTTAAGTTCAGAGAGTAAAT
CTCAGGGACCAAGCAGGAGTGATACTGATGCTTTTGGAATATCAGAATTATCAGCAACAATGGTAATAGAGGCTGAATTCAATAATACACCTGTTGAGAGGGGTTTAACT
CATGAGTTGTCCCCTGGTCTGGGGACCAAAGGTAGGTGTGTAACACCACTTGAAGGAAACATCTGTGGTACAATACTTGATAATAGAAATATCCATAAGTTCAATACTAA
TGAAAACTATATAGAAAATGGCGATTTATCTGATGAAAATGTGAAGGGTGATATTGTGGCAAACGAACTTGCCAGTTGTTCAAGGGAGAGGAGATTGCGTAAGCCTACAC
GAAGATACATTGAAGAATTTGTAGATTCGAAGTCTGAAAATAACAAGGGAAGGCGAAAACCTCCTACAAAAGATAAATACCTCAAAGTGATGTCTACTGAAGAATCCAAT
CACATTAGACATGAGGTACAAATGTTGACGCCTAGAAGTGATTCACAATGTGGTACGTCTGTTCCAATGCAGTCTCAATCTGAAAGAAGACATCCAAAGAAGCATGTGCC
AGTTTCAGGATTTCTATCGGAAGATGAATCGTCTGCAACTGAGTGTAAGAATGTTTATTCATCTGCTAAAAGATGTAAAAAGTATGATAGGAGGCGCCAGAAGATGTGGA
CCCTTACTGAAGTAATGCGATTAGTTGATGGAATCGCCGAATATGGAACTGGCCGCTGGACTCATATTAAGAAGCATCTATTTGCATCTTCTCCTCATCGCACACCTATA
GATCTCAGGGACAAATGGCGAAATCTTCTGAGAGCTAGCTGTGTTAACATACAGAACAAAAAAGGGATTGAAGGGAAGCAGACACATGCCTCACGTCCACTACCAAAGTC
CCTGCTCCAACGTGTTTATGAATTGGCCAATATCTATCCATACCCAAAGGAGCGCGGTCCAAAATCAGTCAAAGCAATTACACCTCCCATGGATCTTATTGAAAGTAACT
CTTTGTCATTCAATTGGGGGCGGAAGAAGTATGAATGACATCAACTTTGGAAGCAGCAGAAATTCCTTCGCAGTCAGAGGTTGAAGTCTAAGTAATACTTATAATTAGAT
GTAAAAAGATCTCTGTTTCTGTTTTGACCCTTTTGTAACGGTGATATGCACTTTGAAATTGGGAAGAAAATCTTTTATAAAAGCCACGGAGCTAATTAACTGAAAATATC
TAGTATGATCCTTTCCCTTCCTTCCTCTTTTTTTAACGTTTGATTTTTTAGGTAAGTATAAATTAGATCATATACATTTTCCTAATACCAATTGTTAATTCTCGAGAAGC
TAGAGGTTCTCAGATTTTGTTGCAGCGAGGAAAGGAAGTCATGTTCAAGTTGGAACTAATTGCTTAGATGAAGGAGAAATCAGCACACATCTTAAGTATTCATCTCCGAT
CTCCATTGTTATTAATGTTTGCCTCTACTGAAGAACACGACCACTCTTGTATCTTCTACTAATTCTTACTCACACATGGGTGAGGTAAGTATACTTTTATTATTCAATGT
TTTGCAAATTGATTATTACTGTCATTCTGGCAAAATATGTACTTTTTCCAGTCTCAATCAACATGCCAGTTGGCTTCTCTAGATTTTTGTTTACATAATGTATCATTTGG
CATCAGTGCCTTCTTATTAATAGCTCTCTCGTATACACATGCATAAACATAGACCCTTGTATACACTTAAGGGATTACCATCATTAGTCCTTTGATTTTCTAGACAGATG
GTAACGATTACAGCAGGCTGGCCCACCTCCCTCTACCTTCATAACCTGCCTACTCCTTGGAACATAGGAACATTGAAAGCTAGAACATAAAGAACTAGGATCTCTCTGGT
ACAATATATCAAATATTATATATTGGTCTTCTGAATTTGCAATAGTCACCCACTTCCGCCATTGCCATTATCATTGTCATTTCCTTGTGAGTTGTTGAAGTACTCTCGAG
GCAATGAATGATGGTTGCTAATGCCACTTTCTGGGTGCTTGAAATTGATCTTTCTGCTGTTATTTTCTTCAACTTTTTGTGCTTGTTTGAAGAAATATTGGTTTCCTTGT
CTATGAAGTCCTCCCAAAGCAATCCTTCTTGAAGTCGCTGAGCAAGTGTAGCTCAATATCATCATTGATAAGATAAGTACCAAGAACTTCATCCTGGGGTTTTGAGTATC
GTACACTTAGATTCATCAGAAATAAGG
Protein sequence
MDQEVHFCQKFTNMKSHWVKVEGPFLPAPLNDSNEVEDLLVESKSEHVLGNCLRVQDFSCDFGYGIQTNGGGLDSNSKQGGEHELKFGDFDQLLDDANEVGEFHATNNLP
NTYAEVAENSFRQNRGFQLGNLSSESKSQGPSRSDTDAFGISELSATMVIEAEFNNTPVERGLTHELSPGLGTKGRCVTPLEGNICGTILDNRNIHKFNTNENYIENGDL
SDENVKGDIVANELASCSRERRLRKPTRRYIEEFVDSKSENNKGRRKPPTKDKYLKVMSTEESNHIRHEVQMLTPRSDSQCGTSVPMQSQSERRHPKKHVPVSGFLSEDE
SSATECKNVYSSAKRCKKYDRRRQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLLRASCVNIQNKKGIEGKQTHASRPLPKSLLQRVYELA
NIYPYPKERGPKSVKAITPPMDLIESNSLSFNWGRKKYE