CuGenDBv2

ClCG04G009190 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene ID: ClCG04G009190
Organism: Citrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
Description: HTH myb-type domain-containing protein
Genome location: CG_Chr04:24036522..24046147
RNA-Seq Expression: ClCG04G009190
Synteny: ClCG04G009190
Gene Ontology terms: NA
InterPro domains:
IPR001005 - SANT/Myb domain
IPR009057 - Homeobox-like domain superfamily
IPR017930 - Myb domain
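
The genome location in this record follows a `seqid:start..end` pattern. As a minimal sketch, assuming the coordinates are 1-based and inclusive (a common convention, but not stated on this page), the span of the locus could be computed as shown here. `parse_location` and `span_length` are hypothetical helper names, not part of any CuGenDB tool.

```python
import re


def parse_location(location):
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"(.+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start, end):
    """Length of the interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end = parse_location("CG_Chr04:24036522..24046147")
print(seqid, span_length(start, end))
```

Under that assumption the locus spans 9626 bp, which is longer than the mRNA listed in the Sequences section, as expected for a gene with introns.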


Homology
GenBank top hits (e value, % identity, alignment)
XP_008449224.1 PREDICTED: uncharacterized protein LOC103491166 isoform X1 [Cucumis melo] | e value: 4.1e-232 | % identity: 78.84
Query:  MDQEVHFCQKFTNMKSHWVKVEGPFLPAPLNDSNEVEDLLVEPKSDHVLGNCLRVQDFSCDFGYGIQTNGGEFLRSIPYAFIGGLKSFRVEKLEPVEALL
        MDQEVHFCQKFTNMKSHWVKVEGPFLPAPLNDSNEVEDLLVE KS+HVLGNCLRVQDFSCDFGYGIQTNGG                             
Subjt:  MDQEVHFCQKFTNMKSHWVKVEGPFLPAPLNDSNEVEDLLVEPKSDHVLGNCLRVQDFSCDFGYGIQTNGGEFLRSIPYAFIGGLKSFRVEKLEPVEALL

Query:  WRHLSQRNKKRIPELSHFLDLDQLLDDANEVGEFHATNNLPNTYAEVAENSFRQNRGLQLGNLSSESKSQGPSRSDTDAFGISELSATMVMEDEFNNTPV
           L   +K+       F D DQLLDDANEVGEFHATNNLPNTYAEVAENSFR+NR  QLGN SSE+KS GPSR DTDAFGISELSATMVME EFNNTPV
Subjt:  WRHLSQRNKKRIPELSHFLDLDQLLDDANEVGEFHATNNLPNTYAEVAENSFRQNRGLQLGNLSSESKSQGPSRSDTDAFGISELSATMVMEDEFNNTPV

Query:  ERGLTHELSPGLRTKGRRVTPLEGNICDTILDNRNIHKFNTNENYIENGDLSDENVKGDIVANELASCSRERRLRKPTRRYIEEFADSKSESNKGRRKPP
        ERGLTHELSPGL TKGR VTPLEGNIC TILDNRNIHKFNTNENYIENGDLSDENVKGDIVANELASCSRERRLRKPTRRYIEEF DSKSE NKGRRK P
Subjt:  ERGLTHELSPGLRTKGRRVTPLEGNICDTILDNRNIHKFNTNENYIENGDLSDENVKGDIVANELASCSRERRLRKPTRRYIEEFADSKSESNKGRRKPP

Query:  TKDKYLKVTSTEQSNHIRHEVQMFTPRCESHCGTSVPVQSRSQRRHPKKHVPVSLLLSGLLPLLPANSVAYLPPSQSSGFLSEDESSATECKSVYSSAKR
         KDKYLKV STE+S HIRHEVQM  PR +S CGTSVPVQ +S+RRHP KHVPV                        SGFLSEDESSATECK+VYSSA+R
Subjt:  TKDKYLKVTSTEQSNHIRHEVQMFTPRCESHCGTSVPVQSRSQRRHPKKHVPVSLLLSGLLPLLPANSVAYLPPSQSSGFLSEDESSATECKSVYSSAKR

Query:  CKKYDRRHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLLRASCVNIQNRKGIDRKQSHASRPLPKSLLQRVYELANIYPY
        CKKYDRR QKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLLRASCVNIQN+KG++ KQ+HASRPLPKSLLQRVYELANIYPY
Subjt:  CKKYDRRHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLLRASCVNIQNRKGIDRKQSHASRPLPKSLLQRVYELANIYPY

Query:  PKERSPKSVKSTTPPMHHVESNSLSFNWGRKKYE
        PKER PKSVK+ TPPM  +ESNSLSFNWGRKKYE
Subjt:  PKERSPKSVKSTTPPMHHVESNSLSFNWGRKKYE

XP_008449225.1 PREDICTED: uncharacterized protein LOC103491166 isoform X2 [Cucumis melo] | e value: 3.1e-232 | % identity: 78.84
Query:  MDQEVHFCQKFTNMKSHWVKVEGPFLPAPLNDSNEVEDLLVEPKSDHVLGNCLRVQDFSCDFGYGIQTNGGEFLRSIPYAFIGGLKSFRVEKLEPVEALL
        MDQEVHFCQKFTNMKSHWVKVEGPFLPAPLNDSNEVEDLLVE KS+HVLGNCLRVQDFSCDFGYGIQTNGG                             
Subjt:  MDQEVHFCQKFTNMKSHWVKVEGPFLPAPLNDSNEVEDLLVEPKSDHVLGNCLRVQDFSCDFGYGIQTNGGEFLRSIPYAFIGGLKSFRVEKLEPVEALL

Query:  WRHLSQRNKKRIPELSHFLDLDQLLDDANEVGEFHATNNLPNTYAEVAENSFRQNRGLQLGNLSSESKSQGPSRSDTDAFGISELSATMVMEDEFNNTPV
           L   +K+       F D DQLLDDANEVGEFHATNNLPNTYAEVAENSFR+NR  QLGN SSE+KS GPSR DTDAFGISELSATMVME EFNNTPV
Subjt:  WRHLSQRNKKRIPELSHFLDLDQLLDDANEVGEFHATNNLPNTYAEVAENSFRQNRGLQLGNLSSESKSQGPSRSDTDAFGISELSATMVMEDEFNNTPV

Query:  ERGLTHELSPGLRTKGRRVTPLEGNICDTILDNRNIHKFNTNENYIENGDLSDENVKGDIVANELASCSRERRLRKPTRRYIEEFADSKSESNKGRRKPP
        ERGLTHELSPGL TKGR VTPLEGNIC TILDNRNIHKFNTNENYIENGDLSDENVKGDIVANELASCSRERRLRKPTRRYIEEF DSKSE NKGRRK P
Subjt:  ERGLTHELSPGLRTKGRRVTPLEGNICDTILDNRNIHKFNTNENYIENGDLSDENVKGDIVANELASCSRERRLRKPTRRYIEEFADSKSESNKGRRKPP

Query:  TKDKYLKVTSTEQSNHIRHEVQMFTPRCESHCGTSVPVQSRSQRRHPKKHVPVSLLLSGLLPLLPANSVAYLPPSQSSGFLSEDESSATECKSVYSSAKR
         KDKYLKV STE+S HIRHEVQM  PR +S CGTSVPVQ +S+RRHP KHVPV                        SGFLSEDESSATECK+VYSSA+R
Subjt:  TKDKYLKVTSTEQSNHIRHEVQMFTPRCESHCGTSVPVQSRSQRRHPKKHVPVSLLLSGLLPLLPANSVAYLPPSQSSGFLSEDESSATECKSVYSSAKR

Query:  CKKYDRRHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLLRASCVNIQNRKGIDRKQSHASRPLPKSLLQRVYELANIYPY
        CKKYDRR QKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLLRASCVNIQN+KG++ KQ+HASRPLPKSLLQRVYELANIYPY
Subjt:  CKKYDRRHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLLRASCVNIQNRKGIDRKQSHASRPLPKSLLQRVYELANIYPY

Query:  PKERSPKSVKSTTPPMHHVESNSLSFNWGRKKYE
        PKER PKSVK+ TPPM  +ESNSLSFNWGRKKYE
Subjt:  PKERSPKSVKSTTPPMHHVESNSLSFNWGRKKYE

XP_038881566.1 uncharacterized protein LOC120073047 isoform X1 [Benincasa hispida] | e value: 3.8e-246 | % identity: 83.24
Query:  MDQEVHFCQKFTNMKSHWVKVEGPFLPAPLNDSNEVEDLLVEPKSDHVLGNCLRVQDFSCDFGYGIQTNGGEFLRSIPYAFIGGLKSFRVEKLEPVEALL
        MDQEVHFCQKFTNMKSHWV+VEGPFLPAPLNDSNEVEDLLVEPKSDHVLGNCLRVQDFSCDFGYGIQTNG            GGL S             
Subjt:  MDQEVHFCQKFTNMKSHWVKVEGPFLPAPLNDSNEVEDLLVEPKSDHVLGNCLRVQDFSCDFGYGIQTNGGEFLRSIPYAFIGGLKSFRVEKLEPVEALL

Query:  WRHLSQRNKKRIPELSHFLDLDQLLDDANEVGEFHATNNLPNTYAEVAENSFRQNRGLQLGNLSSESKSQGPSRSDTDAFGISELSATMVMEDEFNNTPV
               +K+       F DLDQLLDDANEVGEFHATNNL +TYAEVAENSFRQNRGLQLGN SS SKSQGPSRSDTDAFGISELSATMVMEDEFNNTPV
Subjt:  WRHLSQRNKKRIPELSHFLDLDQLLDDANEVGEFHATNNLPNTYAEVAENSFRQNRGLQLGNLSSESKSQGPSRSDTDAFGISELSATMVMEDEFNNTPV

Query:  ERGLTHELSPGLRTKGRRV--TPLEGNICDTILDNRNIHKFNTNENYIENGDLSDENVKGDIVANELASCSRERRLRKPTRRYIEEFADSKSESNKGRRK
        ERGLTHELSPGLRTKGR V  TPLEGNICDTILDNRNIHKFNTNENYIENGDLSDENVKGDIVAN+LASCSRERRLRKPTRRYIEEFADSKSE+NKGRRK
Subjt:  ERGLTHELSPGLRTKGRRV--TPLEGNICDTILDNRNIHKFNTNENYIENGDLSDENVKGDIVANELASCSRERRLRKPTRRYIEEFADSKSESNKGRRK

Query:  PPTKDKYLKVTSTEQSNHIRHEVQMFTPRCESHCGTSVPVQSRSQRRHPKKHVPVSLLLSGLLPLLPANSVAYLPPSQSSGFLSEDESSATECKSVYSSA
        PPTKDKYLKVTSTE+SNHIRHEVQM TPR E HCGTSVPVQSRSQRRHPKKHVPV                        SGFLSEDESSATECK+VYSS 
Subjt:  PPTKDKYLKVTSTEQSNHIRHEVQMFTPRCESHCGTSVPVQSRSQRRHPKKHVPVSLLLSGLLPLLPANSVAYLPPSQSSGFLSEDESSATECKSVYSSA

Query:  KRCKKYD-RRHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLLRASCVNIQNRKGIDRKQSHASRPLPKSLLQRVYELANI
        KRCKKYD RRHQKMW+LTEVMRLVDGIAEYGTGRWT IKKHLFASSPHRTPIDLRDKWRNLLRASCVNIQNRKGI+RKQSHASRPLPKSLLQRVYELANI
Subjt:  KRCKKYD-RRHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLLRASCVNIQNRKGIDRKQSHASRPLPKSLLQRVYELANI

Query:  YPYPKERSPKSVKSTTPPMHHVESNSLSFNWGRKKYE
        YPYPKERSPKSVK+TTPPMH +ESNSLSFNWGRKKYE
Subjt:  YPYPKERSPKSVKSTTPPMHHVESNSLSFNWGRKKYE

XP_038881567.1 uncharacterized protein LOC120073047 isoform X2 [Benincasa hispida] | e value: 1.7e-233 | % identity: 82.66
Query:  VKVEGPFLPAPLNDSNEVEDLLVEPKSDHVLGNCLRVQDFSCDFGYGIQTNGGEFLRSIPYAFIGGLKSFRVEKLEPVEALLWRHLSQRNKKRIPELSHF
        V+VEGPFLPAPLNDSNEVEDLLVEPKSDHVLGNCLRVQDFSCDFGYGIQTNG            GGL S                    +K+       F
Subjt:  VKVEGPFLPAPLNDSNEVEDLLVEPKSDHVLGNCLRVQDFSCDFGYGIQTNGGEFLRSIPYAFIGGLKSFRVEKLEPVEALLWRHLSQRNKKRIPELSHF

Query:  LDLDQLLDDANEVGEFHATNNLPNTYAEVAENSFRQNRGLQLGNLSSESKSQGPSRSDTDAFGISELSATMVMEDEFNNTPVERGLTHELSPGLRTKGRR
         DLDQLLDDANEVGEFHATNNL +TYAEVAENSFRQNRGLQLGN SS SKSQGPSRSDTDAFGISELSATMVMEDEFNNTPVERGLTHELSPGLRTKGR 
Subjt:  LDLDQLLDDANEVGEFHATNNLPNTYAEVAENSFRQNRGLQLGNLSSESKSQGPSRSDTDAFGISELSATMVMEDEFNNTPVERGLTHELSPGLRTKGRR

Query:  V--TPLEGNICDTILDNRNIHKFNTNENYIENGDLSDENVKGDIVANELASCSRERRLRKPTRRYIEEFADSKSESNKGRRKPPTKDKYLKVTSTEQSNH
        V  TPLEGNICDTILDNRNIHKFNTNENYIENGDLSDENVKGDIVAN+LASCSRERRLRKPTRRYIEEFADSKSE+NKGRRKPPTKDKYLKVTSTE+SNH
Subjt:  V--TPLEGNICDTILDNRNIHKFNTNENYIENGDLSDENVKGDIVANELASCSRERRLRKPTRRYIEEFADSKSESNKGRRKPPTKDKYLKVTSTEQSNH

Query:  IRHEVQMFTPRCESHCGTSVPVQSRSQRRHPKKHVPVSLLLSGLLPLLPANSVAYLPPSQSSGFLSEDESSATECKSVYSSAKRCKKYD-RRHQKMWTLT
        IRHEVQM TPR E HCGTSVPVQSRSQRRHPKKHVPV                        SGFLSEDESSATECK+VYSS KRCKKYD RRHQKMW+LT
Subjt:  IRHEVQMFTPRCESHCGTSVPVQSRSQRRHPKKHVPVSLLLSGLLPLLPANSVAYLPPSQSSGFLSEDESSATECKSVYSSAKRCKKYD-RRHQKMWTLT

Query:  EVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLLRASCVNIQNRKGIDRKQSHASRPLPKSLLQRVYELANIYPYPKERSPKSVKSTTPP
        EVMRLVDGIAEYGTGRWT IKKHLFASSPHRTPIDLRDKWRNLLRASCVNIQNRKGI+RKQSHASRPLPKSLLQRVYELANIYPYPKERSPKSVK+TTPP
Subjt:  EVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLLRASCVNIQNRKGIDRKQSHASRPLPKSLLQRVYELANIYPYPKERSPKSVKSTTPP

Query:  MHHVESNSLSFNWGRKKYE
        MH +ESNSLSFNWGRKKYE
Subjt:  MHHVESNSLSFNWGRKKYE

XP_038881569.1 uncharacterized protein LOC120073047 isoform X3 [Benincasa hispida] | e value: 1.0e-243 | % identity: 83.05
Query:  MDQEVHFCQKFTNMKSHWVKVEGPFLPAPLNDSNEVEDLLVEPKSDHVLGNCLRVQDFSCDFGYGIQTNGGEFLRSIPYAFIGGLKSFRVEKLEPVEALL
        MDQEVHFCQKFTNMKSHWV+VEGPFLPAPLNDSNEVEDLLVEPKSDHVLGNCLRVQDFSCDFGYGIQTNG            GGL S             
Subjt:  MDQEVHFCQKFTNMKSHWVKVEGPFLPAPLNDSNEVEDLLVEPKSDHVLGNCLRVQDFSCDFGYGIQTNGGEFLRSIPYAFIGGLKSFRVEKLEPVEALL

Query:  WRHLSQRNKKRIPELSHFLDLDQLLDDANEVGEFHATNNLPNTYAEVAENSFRQNRGLQLGNLSSESKSQGPSRSDTDAFGISELSATMVMEDEFNNTPV
               +K+       F DLDQLLDDANEVGEFHATNNL N  AEVAENSFRQNRGLQLGN SS SKSQGPSRSDTDAFGISELSATMVMEDEFNNTPV
Subjt:  WRHLSQRNKKRIPELSHFLDLDQLLDDANEVGEFHATNNLPNTYAEVAENSFRQNRGLQLGNLSSESKSQGPSRSDTDAFGISELSATMVMEDEFNNTPV

Query:  ERGLTHELSPGLRTKGRRV--TPLEGNICDTILDNRNIHKFNTNENYIENGDLSDENVKGDIVANELASCSRERRLRKPTRRYIEEFADSKSESNKGRRK
        ERGLTHELSPGLRTKGR V  TPLEGNICDTILDNRNIHKFNTNENYIENGDLSDENVKGDIVAN+LASCSRERRLRKPTRRYIEEFADSKSE+NKGRRK
Subjt:  ERGLTHELSPGLRTKGRRV--TPLEGNICDTILDNRNIHKFNTNENYIENGDLSDENVKGDIVANELASCSRERRLRKPTRRYIEEFADSKSESNKGRRK

Query:  PPTKDKYLKVTSTEQSNHIRHEVQMFTPRCESHCGTSVPVQSRSQRRHPKKHVPVSLLLSGLLPLLPANSVAYLPPSQSSGFLSEDESSATECKSVYSSA
        PPTKDKYLKVTSTE+SNHIRHEVQM TPR E HCGTSVPVQSRSQRRHPKKHVPV                        SGFLSEDESSATECK+VYSS 
Subjt:  PPTKDKYLKVTSTEQSNHIRHEVQMFTPRCESHCGTSVPVQSRSQRRHPKKHVPVSLLLSGLLPLLPANSVAYLPPSQSSGFLSEDESSATECKSVYSSA

Query:  KRCKKYD-RRHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLLRASCVNIQNRKGIDRKQSHASRPLPKSLLQRVYELANI
        KRCKKYD RRHQKMW+LTEVMRLVDGIAEYGTGRWT IKKHLFASSPHRTPIDLRDKWRNLLRASCVNIQNRKGI+RKQSHASRPLPKSLLQRVYELANI
Subjt:  KRCKKYD-RRHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLLRASCVNIQNRKGIDRKQSHASRPLPKSLLQRVYELANI

Query:  YPYPKERSPKSVKSTTPPMHHVESNSLSFNWGRKKYE
        YPYPKERSPKSVK+TTPPMH +ESNSLSFNWGRKKYE
Subjt:  YPYPKERSPKSVKSTTPPMHHVESNSLSFNWGRKKYE

TrEMBL top hits (e value, % identity, alignment)
A0A1S3BKX9 uncharacterized protein LOC103491166 isoform X1 | e value: 2.0e-232 | % identity: 78.84
Query:  MDQEVHFCQKFTNMKSHWVKVEGPFLPAPLNDSNEVEDLLVEPKSDHVLGNCLRVQDFSCDFGYGIQTNGGEFLRSIPYAFIGGLKSFRVEKLEPVEALL
        MDQEVHFCQKFTNMKSHWVKVEGPFLPAPLNDSNEVEDLLVE KS+HVLGNCLRVQDFSCDFGYGIQTNGG                             
Subjt:  MDQEVHFCQKFTNMKSHWVKVEGPFLPAPLNDSNEVEDLLVEPKSDHVLGNCLRVQDFSCDFGYGIQTNGGEFLRSIPYAFIGGLKSFRVEKLEPVEALL

Query:  WRHLSQRNKKRIPELSHFLDLDQLLDDANEVGEFHATNNLPNTYAEVAENSFRQNRGLQLGNLSSESKSQGPSRSDTDAFGISELSATMVMEDEFNNTPV
           L   +K+       F D DQLLDDANEVGEFHATNNLPNTYAEVAENSFR+NR  QLGN SSE+KS GPSR DTDAFGISELSATMVME EFNNTPV
Subjt:  WRHLSQRNKKRIPELSHFLDLDQLLDDANEVGEFHATNNLPNTYAEVAENSFRQNRGLQLGNLSSESKSQGPSRSDTDAFGISELSATMVMEDEFNNTPV

Query:  ERGLTHELSPGLRTKGRRVTPLEGNICDTILDNRNIHKFNTNENYIENGDLSDENVKGDIVANELASCSRERRLRKPTRRYIEEFADSKSESNKGRRKPP
        ERGLTHELSPGL TKGR VTPLEGNIC TILDNRNIHKFNTNENYIENGDLSDENVKGDIVANELASCSRERRLRKPTRRYIEEF DSKSE NKGRRK P
Subjt:  ERGLTHELSPGLRTKGRRVTPLEGNICDTILDNRNIHKFNTNENYIENGDLSDENVKGDIVANELASCSRERRLRKPTRRYIEEFADSKSESNKGRRKPP

Query:  TKDKYLKVTSTEQSNHIRHEVQMFTPRCESHCGTSVPVQSRSQRRHPKKHVPVSLLLSGLLPLLPANSVAYLPPSQSSGFLSEDESSATECKSVYSSAKR
         KDKYLKV STE+S HIRHEVQM  PR +S CGTSVPVQ +S+RRHP KHVPV                        SGFLSEDESSATECK+VYSSA+R
Subjt:  TKDKYLKVTSTEQSNHIRHEVQMFTPRCESHCGTSVPVQSRSQRRHPKKHVPVSLLLSGLLPLLPANSVAYLPPSQSSGFLSEDESSATECKSVYSSAKR

Query:  CKKYDRRHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLLRASCVNIQNRKGIDRKQSHASRPLPKSLLQRVYELANIYPY
        CKKYDRR QKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLLRASCVNIQN+KG++ KQ+HASRPLPKSLLQRVYELANIYPY
Subjt:  CKKYDRRHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLLRASCVNIQNRKGIDRKQSHASRPLPKSLLQRVYELANIYPY

Query:  PKERSPKSVKSTTPPMHHVESNSLSFNWGRKKYE
        PKER PKSVK+ TPPM  +ESNSLSFNWGRKKYE
Subjt:  PKERSPKSVKSTTPPMHHVESNSLSFNWGRKKYE

A0A1S3BLK0 uncharacterized protein LOC103491166 isoform X2 | e value: 1.5e-232 | % identity: 78.84
Query:  MDQEVHFCQKFTNMKSHWVKVEGPFLPAPLNDSNEVEDLLVEPKSDHVLGNCLRVQDFSCDFGYGIQTNGGEFLRSIPYAFIGGLKSFRVEKLEPVEALL
        MDQEVHFCQKFTNMKSHWVKVEGPFLPAPLNDSNEVEDLLVE KS+HVLGNCLRVQDFSCDFGYGIQTNGG                             
Subjt:  MDQEVHFCQKFTNMKSHWVKVEGPFLPAPLNDSNEVEDLLVEPKSDHVLGNCLRVQDFSCDFGYGIQTNGGEFLRSIPYAFIGGLKSFRVEKLEPVEALL

Query:  WRHLSQRNKKRIPELSHFLDLDQLLDDANEVGEFHATNNLPNTYAEVAENSFRQNRGLQLGNLSSESKSQGPSRSDTDAFGISELSATMVMEDEFNNTPV
           L   +K+       F D DQLLDDANEVGEFHATNNLPNTYAEVAENSFR+NR  QLGN SSE+KS GPSR DTDAFGISELSATMVME EFNNTPV
Subjt:  WRHLSQRNKKRIPELSHFLDLDQLLDDANEVGEFHATNNLPNTYAEVAENSFRQNRGLQLGNLSSESKSQGPSRSDTDAFGISELSATMVMEDEFNNTPV

Query:  ERGLTHELSPGLRTKGRRVTPLEGNICDTILDNRNIHKFNTNENYIENGDLSDENVKGDIVANELASCSRERRLRKPTRRYIEEFADSKSESNKGRRKPP
        ERGLTHELSPGL TKGR VTPLEGNIC TILDNRNIHKFNTNENYIENGDLSDENVKGDIVANELASCSRERRLRKPTRRYIEEF DSKSE NKGRRK P
Subjt:  ERGLTHELSPGLRTKGRRVTPLEGNICDTILDNRNIHKFNTNENYIENGDLSDENVKGDIVANELASCSRERRLRKPTRRYIEEFADSKSESNKGRRKPP

Query:  TKDKYLKVTSTEQSNHIRHEVQMFTPRCESHCGTSVPVQSRSQRRHPKKHVPVSLLLSGLLPLLPANSVAYLPPSQSSGFLSEDESSATECKSVYSSAKR
         KDKYLKV STE+S HIRHEVQM  PR +S CGTSVPVQ +S+RRHP KHVPV                        SGFLSEDESSATECK+VYSSA+R
Subjt:  TKDKYLKVTSTEQSNHIRHEVQMFTPRCESHCGTSVPVQSRSQRRHPKKHVPVSLLLSGLLPLLPANSVAYLPPSQSSGFLSEDESSATECKSVYSSAKR

Query:  CKKYDRRHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLLRASCVNIQNRKGIDRKQSHASRPLPKSLLQRVYELANIYPY
        CKKYDRR QKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLLRASCVNIQN+KG++ KQ+HASRPLPKSLLQRVYELANIYPY
Subjt:  CKKYDRRHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLLRASCVNIQNRKGIDRKQSHASRPLPKSLLQRVYELANIYPY

Query:  PKERSPKSVKSTTPPMHHVESNSLSFNWGRKKYE
        PKER PKSVK+ TPPM  +ESNSLSFNWGRKKYE
Subjt:  PKERSPKSVKSTTPPMHHVESNSLSFNWGRKKYE

A0A6J1CRG2 uncharacterized protein LOC111013581 isoform X2 | e value: 6.8e-217 | % identity: 75.05
Query:  MDQEVHFCQKFTNMKSHWVKVEGPFLPAPLNDSNEVEDLLVEPKSDHVLGNCLRVQDFSCDFGYGIQTNGGEFLRSIPYAFIGGLKSFRVEKLEPVEALL
        MDQEVHFCQKFTNMKSHWVKV+G FLPAPLN+ NEVE LLVEPKS+HVLG+CLR QDFSCDF YGIQTN             GGL S             
Subjt:  MDQEVHFCQKFTNMKSHWVKVEGPFLPAPLNDSNEVEDLLVEPKSDHVLGNCLRVQDFSCDFGYGIQTNGGEFLRSIPYAFIGGLKSFRVEKLEPVEALL

Query:  WRHLSQRNKKRIPELS-HFLDLDQLLDDANEVGEFHATNNLPNTYAEVAENSFRQNRGLQLGNLSSESKSQGPSRSDTDAFGISELSATMVMEDEFNN-T
               N K+  E    F DLDQLL D NEV EFHATNNLPNTY EVAENSFR+NRGLQLGNLSSESKSQG SR+DT+AF ISELSA MV E E NN T
Subjt:  WRHLSQRNKKRIPELS-HFLDLDQLLDDANEVGEFHATNNLPNTYAEVAENSFRQNRGLQLGNLSSESKSQGPSRSDTDAFGISELSATMVMEDEFNN-T

Query:  PVERGLTHELSPGLRTKGRRVTPLEGNICDTILDNRNIHKFNTNENYIENGDLSDENVKGDIVANELASCSRERRLRKPTRRYIEEFADSKSESNKGRRK
        PV+RGLTHEL  GLRTKGR  TPL+G+IC TILDN NIHKF+TNE  +ENG LSDENVKG+I A++LA CSR+RRLRKPTRRYIEEFADSKSES+KG+RK
Subjt:  PVERGLTHELSPGLRTKGRRVTPLEGNICDTILDNRNIHKFNTNENYIENGDLSDENVKGDIVANELASCSRERRLRKPTRRYIEEFADSKSESNKGRRK

Query:  PPTKDKYLKVTSTEQSNHIRHEVQMFTPRCESHCGTSVPVQSRSQRRHPKKHVPVSLLLSGLLPLLPANSVAYLPPSQSSGFLSEDESSATECKSVYSSA
        PPTKDKY+KVTS E+SNHIRH+VQM TP  ESHCGTS+PVQSRSQRR PKKHVPV                        SGFLSE+ESSATECK VYSSA
Subjt:  PPTKDKYLKVTSTEQSNHIRHEVQMFTPRCESHCGTSVPVQSRSQRRHPKKHVPVSLLLSGLLPLLPANSVAYLPPSQSSGFLSEDESSATECKSVYSSA

Query:  KRCKKYDRR-HQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLLRASCVNIQNRKGIDRKQSHASRPLPKSLLQRVYELANI
        KRCKK+DRR HQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFA+SP+RTPIDLRDKWRNLLRASCVNIQNR GI+RKQSHASRPLPKSLLQRVYELANI
Subjt:  KRCKKYDRR-HQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLLRASCVNIQNRKGIDRKQSHASRPLPKSLLQRVYELANI

Query:  YPYPKERSPKSVKSTTPPMHHVESNSLSFNWGRKKYE
        YPYPKERSPKSVK+TT PMH +ESNSLSFNWGRKKY+
Subjt:  YPYPKERSPKSVKSTTPPMHHVESNSLSFNWGRKKYE

A0A6J1CRQ1 uncharacterized protein LOC111013581 isoform X1 | e value: 1.7e-215 | % identity: 74.91
Query:  MDQEVHFCQKFTNMKSHWVKVEGPFLPAPLNDSNEVEDLLVEPKSDHVLGNCLRVQDFSCDFGYGIQTNGGEFLRSIPYAFIGGLKSFRVEKLEPVEALL
        MDQEVHFCQKFTNMKSHWVKV+G FLPAPLN+ NEVE LLVEPKS+HVLG+CLR QDFSCDF YGIQTN             GGL S             
Subjt:  MDQEVHFCQKFTNMKSHWVKVEGPFLPAPLNDSNEVEDLLVEPKSDHVLGNCLRVQDFSCDFGYGIQTNGGEFLRSIPYAFIGGLKSFRVEKLEPVEALL

Query:  WRHLSQRNKKRIPELS-HFLDLDQLLDDANEVGEFHATNNLPNTYAEVAENSFRQNRGLQLGNLSSESKSQGPSRSDTDAFGISELSATMVMEDEFNN-T
               N K+  E    F DLDQLL D NEV EFHATNNLPNTY EVAENSFR+NRGLQLGNLSSESKSQG SR+DT+AF ISELSA MV E E NN T
Subjt:  WRHLSQRNKKRIPELS-HFLDLDQLLDDANEVGEFHATNNLPNTYAEVAENSFRQNRGLQLGNLSSESKSQGPSRSDTDAFGISELSATMVMEDEFNN-T

Query:  PVERGLTHELSPGLRTKGRRVTPLEGNICDTILDNRNIHKFNTNENYIENGDLSDENVKGDIVANELASCSRERRLRKPTRRYIEEFADSKSESNKGRRK
        PV+RGLTHEL  GLRTKGR  TPL+G+IC TILDN NIHKF+TNE  +ENG LSDENVKG+I A++LA CSR+RRLRKPTRRYIEEFADSKSES+KG+RK
Subjt:  PVERGLTHELSPGLRTKGRRVTPLEGNICDTILDNRNIHKFNTNENYIENGDLSDENVKGDIVANELASCSRERRLRKPTRRYIEEFADSKSESNKGRRK

Query:  PPTKDKYLKVTSTEQSNHIRHEVQMFTPRCESHCGTSVPVQSRSQRRHPKKHVPVSLLLSGLLPLLPANSVAYLPPSQSSGFLSEDESSATECKSVYSSA
        PPTKDKY+KVTS E+SNHIRH+VQM TP  ESHCGTS+PVQSRSQRR PKKHVPV                        SGFLSE+ESSATECK VYSSA
Subjt:  PPTKDKYLKVTSTEQSNHIRHEVQMFTPRCESHCGTSVPVQSRSQRRHPKKHVPVSLLLSGLLPLLPANSVAYLPPSQSSGFLSEDESSATECKSVYSSA

Query:  KRCKKYDRR-HQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLR-DKWRNLLRASCVNIQNRKGIDRKQSHASRPLPKSLLQRVYELAN
        KRCKK+DRR HQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFA+SP+RTPIDLR DKWRNLLRASCVNIQNR GI+RKQSHASRPLPKSLLQRVYELAN
Subjt:  KRCKKYDRR-HQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLR-DKWRNLLRASCVNIQNRKGIDRKQSHASRPLPKSLLQRVYELAN

Query:  IYPYPKERSPKSVKSTTPPMHHVESNSLSFNWGRKKYE
        IYPYPKERSPKSVK+TT PMH +ESNSLSFNWGRKKY+
Subjt:  IYPYPKERSPKSVKSTTPPMHHVESNSLSFNWGRKKYE

A0A6J1ECM4 uncharacterized protein LOC111433102 isoform X1 | e value: 8.1e-202 | % identity: 70.65
Query:  MDQEVHFCQKFTNMKSHWVKVEGPFLPAPLNDSNEVEDLLVEPKSDHVLGNCLRVQDFSCDFGYGIQTNGGEFLRSIPYAFIGGLKSFRVEKLEPVEALL
        MDQEVHFCQKFTNM  HWVK+EG FLPAPLN+SNEV+  LVEPKSDH LGNCLRVQDFS DFGY IQTNG                              
Subjt:  MDQEVHFCQKFTNMKSHWVKVEGPFLPAPLNDSNEVEDLLVEPKSDHVLGNCLRVQDFSCDFGYGIQTNGGEFLRSIPYAFIGGLKSFRVEKLEPVEALL

Query:  WRHLSQRNKKRIPELSHFLDLDQLLDDANEVGEFHATNNLPNTYAEVAENSFRQNRGLQLGNLSSESKSQGPSRSDTDAFGISELSATMVMEDEFNNTPV
                                                    AEV ENSFRQNRGLQLG  SSESKSQG SRSDTDAF ISELSATMVME EFNNTPV
Subjt:  WRHLSQRNKKRIPELSHFLDLDQLLDDANEVGEFHATNNLPNTYAEVAENSFRQNRGLQLGNLSSESKSQGPSRSDTDAFGISELSATMVMEDEFNNTPV

Query:  ERGLTHELSPGLRTKGRRVTPLEGNICDTILDNRNIHKFNTNENYIENGDLSDENVKGDIVANELASCSRERRLRKPTRRYIEEFADSKSESNKGRRKPP
        ER LT EL  GLRT+G   TP EGNICDTILDN NIHKFNTNENY+EN  +SDENVKGDIVA++LASCSRERRLRKPTRRYIEEFADSKSE+NKGRRKPP
Subjt:  ERGLTHELSPGLRTKGRRVTPLEGNICDTILDNRNIHKFNTNENYIENGDLSDENVKGDIVANELASCSRERRLRKPTRRYIEEFADSKSESNKGRRKPP

Query:  TKDKYLKVTSTEQSNHIRHEVQMFTPRCESHCGTSVPVQSRSQRRHPKKHVPVSLLLSGLLPLLPANSVAYLPPSQSSGFLSEDESSATECKSVYSSAKR
        TKDKYLKVTSTE+SNHIRH+VQM TP+ ESHCGTSVPVQSRSQRRHP+KHVPV                        SGFLSEDE SATECK+VYSSAK 
Subjt:  TKDKYLKVTSTEQSNHIRHEVQMFTPRCESHCGTSVPVQSRSQRRHPKKHVPVSLLLSGLLPLLPANSVAYLPPSQSSGFLSEDESSATECKSVYSSAKR

Query:  CKKYDRR-HQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLLRASCVNIQNRKGIDRKQSHASRPLPKSLLQRVYELANIYP
        CKKYDRR HQKMWTLTEVMRLVDGIAEYGTGRWT IK+HLFASSPHRTPIDLRDKWRNLL+ASCVNIQN KG + KQ HASRPLPKSLLQRVYELANIYP
Subjt:  CKKYDRR-HQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLLRASCVNIQNRKGIDRKQSHASRPLPKSLLQRVYELANIYP

Query:  YPKERSPKSVKSTTPPMHHVESNSLSFNWGRKKYE
        YPKERSPK V+  TPPM+ +ESNSLSFNWGRKKYE
Subjt:  YPKERSPKSVKSTTPPMHHVESNSLSFNWGRKKYE

SwissProt top hits (e value, % identity, alignment)
Q6R0E3 Telomere repeat-binding protein 5 | e value: 1.8e-09 | % identity: 24.87
Query:  PPTKDKYLKVTSTEQSN--HIRHEVQMFTPRCESHCGTSVPVQSRSQRRHPKKHVPVSLLLSGLLPLLP--ANSVAYLPPSQSSGFLSEDESSATECKSV
        PP   K L    ++ +   ++ H +    P    H   S  V+S    +    +   ++    L+P+ P  A ++  +PP ++                 
Subjt:  PPTKDKYLKVTSTEQSN--HIRHEVQMFTPRCESHCGTSVPVQSRSQRRHPKKHVPVSLLLSGLLPLLP--ANSVAYLPPSQSSGFLSEDESSATECKSV

Query:  YSSAKRCKKYDRRHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLLRASCVNIQNRKGIDRKQSHASRPLPKSLLQRV
            KR +   RR ++ +++ EV  LV  +   GTGRW  +K   F ++ HRT +DL+DKW+ L+  + ++ Q R+G          P+P+ LL RV
Subjt:  YSSAKRCKKYDRRHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLLRASCVNIQNRKGIDRKQSHASRPLPKSLLQRV

Q9C7B1 Telomere repeat-binding protein 3 | e value: 2.6e-11 | % identity: 35.14
Query:  LSEDESSATECKSVYSSAKRCKKYDRRHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLLRASCVNIQNRKGIDRKQSHAS
        L E E  A     +    KR +   RR ++ +++TEV  LV  + E GTGRW  +K   F  + HRT +DL+DKW+ L+  + ++ Q R+G         
Subjt:  LSEDESSATECKSVYSSAKRCKKYDRRHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLLRASCVNIQNRKGIDRKQSHAS

Query:  RPLPKSLLQRV
         P+P+ LL RV
Subjt:  RPLPKSLLQRV

Q9FFY9 Telomere repeat-binding protein 4 | e value: 5.2e-12 | % identity: 36.45
Query:  ESSATECKSVYSSAKRCKKYDRRHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLLRASCVNIQNRKGIDRKQSHASRPLP
        ES A     V    KR +   RR ++ +++TEV  LV  + E GTGRW  +K   F ++ HRT +DL+DKW+ L+  + ++ Q R+G          P+P
Subjt:  ESSATECKSVYSSAKRCKKYDRRHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLLRASCVNIQNRKGIDRKQSHASRPLP

Query:  KSLLQRV
        + LL RV
Subjt:  KSLLQRV

Q9M347 Telomere repeat-binding protein 6 | e value: 2.8e-10 | % identity: 34.21
Query:  SEDESSATECKSV----YSSAKRCKKYDRRHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLLRASCVNIQNRKGIDRKQS
        S D + A   KSV       A + +   RR ++ +T++EV  LV  +   GTGRW  +K H F    HRT +DL+DKW+ L+  + ++ + R+G      
Subjt:  SEDESSATECKSV----YSSAKRCKKYDRRHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLLRASCVNIQNRKGIDRKQS

Query:  HASRPLPKSLLQRV
            P+P+ LL RV
Subjt:  HASRPLPKSLLQRV

Q9SNB9 Telomere repeat-binding protein 2 | e value: 8.3e-10 | % identity: 32.76
Query:  RRHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLLRASCVNIQNRKGIDRKQSHASRPLPKSLLQRVYELANIYPYPKERS
        RR ++ +++TEV  LV  + + GTGRW  +K   F  + HRT +DL+DKW+ L+  + ++ Q R+G          P+P+ LL RV +    + Y  +  
Subjt:  RRHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLLRASCVNIQNRKGIDRKQSHASRPLPKSLLQRVYELANIYPYPKERS

Query:  PKSVKSTTPPMHHVES
           ++ T PP   VE+
Subjt:  PKSVKSTTPPMHHVES

Arabidopsis top hits (e value, % identity, alignment)
AT1G17460.1 TRF-like 3 | e value: 7.9e-16 | % identity: 27.27
Query:  RRLRKPTRRYIEEFADSKSESNKGRRKPPTKDKYLKVTSTEQSNHIRHEVQMFTPRCESHCGTSVPVQSRSQRRHPKKHVPV------------SLLLSG
        +R+RKPTRRYIEE   ++ +   G   P      ++  S+E    +   V +   R +      VP  S  +R  P++++              +    G
Subjt:  RRLRKPTRRYIEEFADSKSESNKGRRKPPTKDKYLKVTSTEQSNHIRHEVQMFTPRCESHCGTSVPVQSRSQRRHPKKHVPV------------SLLLSG

Query:  LLPLLP---ANSVAYLPPSQSSGFLSEDESSATECKSVYSSAKR-------------------------CKKYDRRHQKMWTLTEVMRLVDGIAEYGTGR
         L L P   +N V  +P  +S+    + ES     K +++   +                              R+  + WT++EV +LV+G+++YG G+
Subjt:  LLPLLP---ANSVAYLPPSQSSGFLSEDESSATECKSVYSSAKR-------------------------CKKYDRRHQKMWTLTEVMRLVDGIAEYGTGR

Query:  WTHIKKHLFASSPHRTPIDLRDKWRNLLRASCVNIQNRKGIDRKQSHASRPLPKSLLQRVYELA
        WT IKK  F+   HRT +DL+DKWRNL +AS  N +   G+   + H S  +P  ++ +V ELA
Subjt:  WTHIKKHLFASSPHRTPIDLRDKWRNLLRASCVNIQNRKGIDRKQSHASRPLPKSLLQRVYELA

AT1G72650.1 TRF-like 6 | e value: 5.1e-23 | % identity: 30
Query:  RRLRKPTRRYIEEFADSKSESNKGRRKPPTKDKYLKVTSTEQSNHIRHEVQMFTPRCESHCGT--SVPVQSRSQRRHPKKHVPVSL-LLSGLL--PLLPA
        +R+RKPTRRYIEE +++  +    +   P+KD+ L   S  +S  +    ++   R  S  G+   VP  S  +R  P++++   L   S  L      A
Subjt:  RRLRKPTRRYIEEFADSKSESNKGRRKPPTKDKYLKVTSTEQSNHIRHEVQMFTPRCESHCGT--SVPVQSRSQRRHPKKHVPVSL-LLSGLL--PLLPA

Query:  NSVAYLPPSQSSG-------------------FLSEDESSATECKSVYSSAKRCKKYD----------------------RRHQKMWTLTEVMRLVDGIA
         S   L PSQ S                    F + DE++     S        +  D                      R+H + WTL+E+ +LV+G++
Subjt:  NSVAYLPPSQSSG-------------------FLSEDESSATECKSVYSSAKRCKKYD----------------------RRHQKMWTLTEVMRLVDGIA

Query:  EYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLLRASCVNIQNRKGIDRKQSHASRPLPKSLLQRVYELA
        +YG G+W+ IKKHLF+S  +RT +DL+DKWRNLL+ S     +   +   + H S  +P  +L RV ELA
Subjt:  EYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLLRASCVNIQNRKGIDRKQSHASRPLPKSLLQRVYELA

AT1G72650.2 TRF-like 6 | e value: 5.1e-23 | % identity: 30
Query:  RRLRKPTRRYIEEFADSKSESNKGRRKPPTKDKYLKVTSTEQSNHIRHEVQMFTPRCESHCGT--SVPVQSRSQRRHPKKHVPVSL-LLSGLL--PLLPA
        +R+RKPTRRYIEE +++  +    +   P+KD+ L   S  +S  +    ++   R  S  G+   VP  S  +R  P++++   L   S  L      A
Subjt:  RRLRKPTRRYIEEFADSKSESNKGRRKPPTKDKYLKVTSTEQSNHIRHEVQMFTPRCESHCGT--SVPVQSRSQRRHPKKHVPVSL-LLSGLL--PLLPA

Query:  NSVAYLPPSQSSG-------------------FLSEDESSATECKSVYSSAKRCKKYD----------------------RRHQKMWTLTEVMRLVDGIA
         S   L PSQ S                    F + DE++     S        +  D                      R+H + WTL+E+ +LV+G++
Subjt:  NSVAYLPPSQSSG-------------------FLSEDESSATECKSVYSSAKRCKKYD----------------------RRHQKMWTLTEVMRLVDGIA

Query:  EYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLLRASCVNIQNRKGIDRKQSHASRPLPKSLLQRVYELA
        +YG G+W+ IKKHLF+S  +RT +DL+DKWRNLL+ S     +   +   + H S  +P  +L RV ELA
Subjt:  EYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLLRASCVNIQNRKGIDRKQSHASRPLPKSLLQRVYELA

AT2G37025.1 TRF-like 8 | e value: 7.6e-27 | % identity: 41.46
Query:  YLPPSQSSGFLSEDESSATECKSVYSSAK--RCKKYDRRHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLLRASCVNIQN
        Y+  S    F S+D+ + +E +   S  K  R K   R++Q++WTL EVM LVDGI+ +G G+WT IK H F  + HR P+D+RDKWRNLL+AS     N
Subjt:  YLPPSQSSGFLSEDESSATECKSVYSSAK--RCKKYDRRHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLLRASCVNIQN

Query:  RKGIDRKQSHASRPLPKSLLQRVYELANIYPYPKERSPKSVKSTTPPMHHVESNSLSFNWGRKK
            + K+   +R +PK +L RV ELA+++PYP  +SP  V        H  S S S +  +KK
Subjt:  RKGIDRKQSHASRPLPKSLLQRVYELANIYPYPKERSPKSVKSTTPPMHHVESNSLSFNWGRKK

AT2G37025.2 TRF-like 8 | e value: 7.6e-27 | % identity: 41.46
Query:  YLPPSQSSGFLSEDESSATECKSVYSSAK--RCKKYDRRHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLLRASCVNIQN
        Y+  S    F S+D+ + +E +   S  K  R K   R++Q++WTL EVM LVDGI+ +G G+WT IK H F  + HR P+D+RDKWRNLL+AS     N
Subjt:  YLPPSQSSGFLSEDESSATECKSVYSSAK--RCKKYDRRHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLLRASCVNIQN

Query:  RKGIDRKQSHASRPLPKSLLQRVYELANIYPYPKERSPKSVKSTTPPMHHVESNSLSFNWGRKK
            + K+   +R +PK +L RV ELA+++PYP  +SP  V        H  S S S +  +KK
Subjt:  RKGIDRKQSHASRPLPKSLLQRVYELANIYPYPKERSPKSVKSTTPPMHHVESNSLSFNWGRKK


Sequences
CDS sequence
ATGTCGAACGGAATGGGAACGACGCGGCAACCTTATCATGTTAAATTAAACCACAACCGTCTCAACGCCGTCAAAGCGGTAACGTTAGGTTACGTGACCAGGTCAAACAC
TGAAACCGCTAACGAAATCCACGTGTATTGGAACCGGATTTCAAGTGCATATAAATTGAGACATGTAGGATTGATTGAGCTTAAAAGTAAACAAATTATGGATCAAGAAG
TGCATTTCTGCCAGAAGTTCACAAATATGAAATCTCATTGGGTAAAAGTGGAGGGACCTTTTCTTCCTGCACCATTAAATGATTCAAATGAAGTTGAGGATTTACTTGTG
GAGCCTAAAAGCGACCATGTTTTAGGAAATTGCTTGAGAGTTCAAGATTTCTCTTGCGACTTTGGCTATGGAATACAAACAAACGGTGGTGAATTCTTAAGATCCATTCC
TTATGCTTTTATTGGTGGCTTGAAATCCTTTAGAGTGGAAAAGCTAGAACCTGTTGAGGCACTTCTTTGGAGGCATTTGTCTCAGAGGAATAAGAAAAGAATTCCAGAAC
TCTCACATTTCCTTGATCTTGATCAACTGCTCGATGATGCCAATGAAGTAGGGGAATTCCATGCAACAAACAATCTGCCAAATACATATGCCGAAGTTGCTGAAAATTCT
TTCAGACAGAATAGGGGATTACAATTGGGAAACTTAAGTTCAGAGAGTAAATCTCAGGGACCAAGCAGAAGTGATACTGATGCTTTTGGAATATCAGAACTGTCAGCAAC
AATGGTAATGGAGGATGAATTCAATAATACACCTGTTGAGAGGGGTTTAACTCATGAGTTGTCCCCTGGTCTGAGGACCAAAGGTAGGCGTGTAACACCACTTGAAGGCA
ACATCTGTGATACGATACTTGATAATAGAAATATCCATAAGTTCAATACTAATGAAAACTATATAGAAAATGGCGATTTATCTGATGAAAATGTGAAGGGTGATATTGTG
GCAAACGAACTTGCCAGTTGTTCAAGGGAGAGGAGATTGCGTAAGCCTACTCGAAGATACATTGAAGAATTTGCAGATTCAAAGTCTGAAAGTAACAAGGGAAGGAGAAA
ACCTCCTACAAAAGATAAATACCTGAAAGTGACGTCCACTGAACAATCCAATCACATTAGACATGAGGTACAAATGTTCACGCCTAGATGTGAATCACATTGTGGTACAT
CTGTTCCAGTGCAGTCTCGATCTCAAAGAAGACATCCAAAGAAGCATGTACCAGTTTCATTGTTACTTTCTGGTCTTCTGCCACTCCTCCCCGCTAATTCAGTTGCTTAT
CTTCCCCCTTCCCAATCCTCTGGATTTCTGTCTGAAGATGAATCTTCTGCAACTGAGTGTAAAAGTGTTTATTCATCTGCTAAAAGATGTAAAAAGTATGATAGGAGGCA
CCAGAAGATGTGGACCCTTACGGAAGTTATGCGATTAGTTGATGGAATTGCTGAATATGGAACTGGCCGCTGGACTCATATAAAGAAGCACCTATTTGCATCTTCTCCTC
ATCGCACACCTATAGACCTCAGGGACAAATGGCGAAATCTTCTGAGAGCTAGCTGTGTTAACATACAGAACAGAAAAGGGATCGATCGGAAGCAGTCACATGCCTCGCGT
CCACTGCCAAAGTCCCTGCTTCAACGTGTTTATGAACTGGCCAATATATATCCATACCCAAAGGAGCGCAGTCCAAAATCAGTCAAATCAACTACACCTCCTATGCATCA
TGTCGAAAGTAACTCATTGTCATTCAATTGGGGGCGGAAGAAGTATGAATGA
mRNA sequence
ATGTCGAACGGAATGGGAACGACGCGGCAACCTTATCATGTTAAATTAAACCACAACCGTCTCAACGCCGTCAAAGCGGTAACGTTAGGTTACGTGACCAGGTCAAACAC
TGAAACCGCTAACGAAATCCACGTGTATTGGAACCGGATTTCAAGTGCATATAAATTGAGACATGTAGGATTGATTGAGCTTAAAAGTAAACAAATTATGGATCAAGAAG
TGCATTTCTGCCAGAAGTTCACAAATATGAAATCTCATTGGGTAAAAGTGGAGGGACCTTTTCTTCCTGCACCATTAAATGATTCAAATGAAGTTGAGGATTTACTTGTG
GAGCCTAAAAGCGACCATGTTTTAGGAAATTGCTTGAGAGTTCAAGATTTCTCTTGCGACTTTGGCTATGGAATACAAACAAACGGTGGTGAATTCTTAAGATCCATTCC
TTATGCTTTTATTGGTGGCTTGAAATCCTTTAGAGTGGAAAAGCTAGAACCTGTTGAGGCACTTCTTTGGAGGCATTTGTCTCAGAGGAATAAGAAAAGAATTCCAGAAC
TCTCACATTTCCTTGATCTTGATCAACTGCTCGATGATGCCAATGAAGTAGGGGAATTCCATGCAACAAACAATCTGCCAAATACATATGCCGAAGTTGCTGAAAATTCT
TTCAGACAGAATAGGGGATTACAATTGGGAAACTTAAGTTCAGAGAGTAAATCTCAGGGACCAAGCAGAAGTGATACTGATGCTTTTGGAATATCAGAACTGTCAGCAAC
AATGGTAATGGAGGATGAATTCAATAATACACCTGTTGAGAGGGGTTTAACTCATGAGTTGTCCCCTGGTCTGAGGACCAAAGGTAGGCGTGTAACACCACTTGAAGGCA
ACATCTGTGATACGATACTTGATAATAGAAATATCCATAAGTTCAATACTAATGAAAACTATATAGAAAATGGCGATTTATCTGATGAAAATGTGAAGGGTGATATTGTG
GCAAACGAACTTGCCAGTTGTTCAAGGGAGAGGAGATTGCGTAAGCCTACTCGAAGATACATTGAAGAATTTGCAGATTCAAAGTCTGAAAGTAACAAGGGAAGGAGAAA
ACCTCCTACAAAAGATAAATACCTGAAAGTGACGTCCACTGAACAATCCAATCACATTAGACATGAGGTACAAATGTTCACGCCTAGATGTGAATCACATTGTGGTACAT
CTGTTCCAGTGCAGTCTCGATCTCAAAGAAGACATCCAAAGAAGCATGTACCAGTTTCATTGTTACTTTCTGGTCTTCTGCCACTCCTCCCCGCTAATTCAGTTGCTTAT
CTTCCCCCTTCCCAATCCTCTGGATTTCTGTCTGAAGATGAATCTTCTGCAACTGAGTGTAAAAGTGTTTATTCATCTGCTAAAAGATGTAAAAAGTATGATAGGAGGCA
CCAGAAGATGTGGACCCTTACGGAAGTTATGCGATTAGTTGATGGAATTGCTGAATATGGAACTGGCCGCTGGACTCATATAAAGAAGCACCTATTTGCATCTTCTCCTC
ATCGCACACCTATAGACCTCAGGGACAAATGGCGAAATCTTCTGAGAGCTAGCTGTGTTAACATACAGAACAGAAAAGGGATCGATCGGAAGCAGTCACATGCCTCGCGT
CCACTGCCAAAGTCCCTGCTTCAACGTGTTTATGAACTGGCCAATATATATCCATACCCAAAGGAGCGCAGTCCAAAATCAGTCAAATCAACTACACCTCCTATGCATCA
TGTCGAAAGTAACTCATTGTCATTCAATTGGGGGCGGAAGAAGTATGAATGACATCAACTTTGGAAGCAGCAGAAATTCCTTTGCTGTGAAGTGGAAGTCTAATGAATAC
TTACAATTAGATGTAAAAAGATCTCTGTTTCTGTTTTCAGCCTTTTGTAACGGTGATATGCACTTGAAACTGGGAAGAAAATCTTCCATTATAAAAGCCACGGAGCTAAT
TAACTGAAAATTTATACTAGAGGTTCTCAGATTCTGTTGTAGCGAGGAAAGGAAGTCATGTTCAAGTTGGAACTAATTGCTTGGATGGAGGAGAAATCAGCGTATGCCTT
GATCGAATATGGATACTAATGTTTGCCTCTACTGAAGAA
Protein sequence
MSNGMGTTRQPYHVKLNHNRLNAVKAVTLGYVTRSNTETANEIHVYWNRISSAYKLRHVGLIELKSKQIMDQEVHFCQKFTNMKSHWVKVEGPFLPAPLNDSNEVEDLLV
EPKSDHVLGNCLRVQDFSCDFGYGIQTNGGEFLRSIPYAFIGGLKSFRVEKLEPVEALLWRHLSQRNKKRIPELSHFLDLDQLLDDANEVGEFHATNNLPNTYAEVAENS
FRQNRGLQLGNLSSESKSQGPSRSDTDAFGISELSATMVMEDEFNNTPVERGLTHELSPGLRTKGRRVTPLEGNICDTILDNRNIHKFNTNENYIENGDLSDENVKGDIV
ANELASCSRERRLRKPTRRYIEEFADSKSESNKGRRKPPTKDKYLKVTSTEQSNHIRHEVQMFTPRCESHCGTSVPVQSRSQRRHPKKHVPVSLLLSGLLPLLPANSVAY
LPPSQSSGFLSEDESSATECKSVYSSAKRCKKYDRRHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLLRASCVNIQNRKGIDRKQSHASR
PLPKSLLQRVYELANIYPYPKERSPKSVKSTTPPMHHVESNSLSFNWGRKKYE
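
The CDS and protein sequences in this record can be cross-checked by translating the CDS codon by codon. The sketch below assumes the standard genetic code (appropriate for a nuclear plant gene, though the page does not state which table was used); `translate` is a hypothetical helper, not a CuGenDB function. It is shown on the first eight codons of the CDS only.

```python
# Standard genetic code, codons ordered by base T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds):
    """Translate a CDS in frame 1, stopping at the first stop codon.

    Whitespace and line breaks are ignored; a codon containing anything
    other than A, C, G, T raises KeyError.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First eight codons of the CDS sequence listed in this record.
print(translate("ATGTCGAACGGAATGGGAACGACG"))  # MSNGMGTT
```

Pasting the full CDS into `translate` and comparing the result with the protein sequence is a quick consistency check; the CDS listed here is 1812 nt, matching 603 residues plus a stop codon.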