; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0006626 (gene) of Snake gourd v1 genome

Gene IDTan0006626
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionHTH myb-type domain-containing protein
Genome locationLG03:67085402..67092887
RNA-Seq ExpressionTan0006626
SyntenyTan0006626
Gene Ontology termsNA
InterPro domainsIPR001005 - SANT/Myb domain
IPR009057 - Homeobox-like domain superfamily
IPR017930 - Myb domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022143737.1 uncharacterized protein LOC111013581 isoform X1 [Momordica charantia]2.8e-21883.06Show/hide
Query:  VKVEGSFLPAPLNESNEVEHLLVEPKSDHVLGNCLRVQDFSCDFGYGIQTNSGGLDSNSKQGGEHELKFGMTPNLTSDQPSFEEPSKFLKDLDQLLDDAN
        VKV+GSFLPAPLNE NEVEHLLVEPKS+HVLG+CLR QDFSCDF YGIQTN GGLDSNSKQ GEHELKF                    +DLDQLL D N
Subjt:  VKVEGSFLPAPLNESNEVEHLLVEPKSDHVLGNCLRVQDFSCDFGYGIQTNSGGLDSNSKQGGEHELKFGMTPNLTSDQPSFEEPSKFLKDLDQLLDDAN

Query:  EVGEFHATNNLPNTYAEVAENSFRQNRGLQLVNLSSESKSQGSSRSDTDAFGISELSATMVMEAEFNN-TPVERGFTHELCAGLRTKGRCVTPLGGSISD
        EV EFHATNNLPNTY EVAENSFR+NRGLQL NLSSESKSQGSSR+DT+AF ISELSA MV EAE NN TPV+RG THELCAGLRTKGRC TPL GSI  
Subjt:  EVGEFHATNNLPNTYAEVAENSFRQNRGLQLVNLSSESKSQGSSRSDTDAFGISELSATMVMEAEFNN-TPVERGFTHELCAGLRTKGRCVTPLGGSISD

Query:  TILDNRNIHKFNTNECYIENGDLSDENVKGDIVASKLA-SSRERRLRKPTRRYIEEFADSKSESNKGRRKPPTKDKYMKGTSIEESNHIRHKVQMLTPRS
        TILDN NIHKF+TNE  +ENG LSDENVKG+I ASKLA  SR+RRLRKPTRRYIEEFADSKSES+KG+RKPPTKDKY+K TSIEESNHIRHKVQMLTP  
Subjt:  TILDNRNIHKFNTNECYIENGDLSDENVKGDIVASKLA-SSRERRLRKPTRRYIEEFADSKSESNKGRRKPPTKDKYMKGTSIEESNHIRHKVQMLTPRS

Query:  ESHCGTSVPVQSRSQRGHPKKPVPVSGFRSEDESSATECKSVYSSAKRSKKHDRRKHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDL
        ESHCGTS+PVQSRSQR  PKK VPVSGF SE+ESSATECK VYSSAKR KKHDRRKHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFA+SP+RTPIDL
Subjt:  ESHCGTSVPVQSRSQRGHPKKPVPVSGFRSEDESSATECKSVYSSAKRSKKHDRRKHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDL

Query:  R-DKWRNLLRASCVNMQNRKGIERKQSHASRPLPKSLLQRVYELANIYPYPKERSPKSVRATTPPMCLIESNSLSFNWGRKKYE
        R DKWRNLLRASCVN+QNR GIERKQSHASRPLPKSLLQRVYELANIYPYPKERSPKSV+ATT PM LIESNSLSFNWGRKKY+
Subjt:  R-DKWRNLLRASCVNMQNRKGIERKQSHASRPLPKSLLQRVYELANIYPYPKERSPKSVRATTPPMCLIESNSLSFNWGRKKYE

XP_022143738.1 uncharacterized protein LOC111013581 isoform X2 [Momordica charantia]1.1e-21983.23Show/hide
Query:  VKVEGSFLPAPLNESNEVEHLLVEPKSDHVLGNCLRVQDFSCDFGYGIQTNSGGLDSNSKQGGEHELKFGMTPNLTSDQPSFEEPSKFLKDLDQLLDDAN
        VKV+GSFLPAPLNE NEVEHLLVEPKS+HVLG+CLR QDFSCDF YGIQTN GGLDSNSKQ GEHELKF                    +DLDQLL D N
Subjt:  VKVEGSFLPAPLNESNEVEHLLVEPKSDHVLGNCLRVQDFSCDFGYGIQTNSGGLDSNSKQGGEHELKFGMTPNLTSDQPSFEEPSKFLKDLDQLLDDAN

Query:  EVGEFHATNNLPNTYAEVAENSFRQNRGLQLVNLSSESKSQGSSRSDTDAFGISELSATMVMEAEFNN-TPVERGFTHELCAGLRTKGRCVTPLGGSISD
        EV EFHATNNLPNTY EVAENSFR+NRGLQL NLSSESKSQGSSR+DT+AF ISELSA MV EAE NN TPV+RG THELCAGLRTKGRC TPL GSI  
Subjt:  EVGEFHATNNLPNTYAEVAENSFRQNRGLQLVNLSSESKSQGSSRSDTDAFGISELSATMVMEAEFNN-TPVERGFTHELCAGLRTKGRCVTPLGGSISD

Query:  TILDNRNIHKFNTNECYIENGDLSDENVKGDIVASKLA-SSRERRLRKPTRRYIEEFADSKSESNKGRRKPPTKDKYMKGTSIEESNHIRHKVQMLTPRS
        TILDN NIHKF+TNE  +ENG LSDENVKG+I ASKLA  SR+RRLRKPTRRYIEEFADSKSES+KG+RKPPTKDKY+K TSIEESNHIRHKVQMLTP  
Subjt:  TILDNRNIHKFNTNECYIENGDLSDENVKGDIVASKLA-SSRERRLRKPTRRYIEEFADSKSESNKGRRKPPTKDKYMKGTSIEESNHIRHKVQMLTPRS

Query:  ESHCGTSVPVQSRSQRGHPKKPVPVSGFRSEDESSATECKSVYSSAKRSKKHDRRKHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDL
        ESHCGTS+PVQSRSQR  PKK VPVSGF SE+ESSATECK VYSSAKR KKHDRRKHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFA+SP+RTPIDL
Subjt:  ESHCGTSVPVQSRSQRGHPKKPVPVSGFRSEDESSATECKSVYSSAKRSKKHDRRKHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDL

Query:  RDKWRNLLRASCVNMQNRKGIERKQSHASRPLPKSLLQRVYELANIYPYPKERSPKSVRATTPPMCLIESNSLSFNWGRKKYE
        RDKWRNLLRASCVN+QNR GIERKQSHASRPLPKSLLQRVYELANIYPYPKERSPKSV+ATT PM LIESNSLSFNWGRKKY+
Subjt:  RDKWRNLLRASCVNMQNRKGIERKQSHASRPLPKSLLQRVYELANIYPYPKERSPKSVRATTPPMCLIESNSLSFNWGRKKYE

XP_038881566.1 uncharacterized protein LOC120073047 isoform X1 [Benincasa hispida]2.9e-23187.01Show/hide
Query:  VKVEGSFLPAPLNESNEVEHLLVEPKSDHVLGNCLRVQDFSCDFGYGIQTN-SGGLDSNSKQGGEHELKFGMTPNLTSDQPSFEEPSKFLKDLDQLLDDA
        V+VEG FLPAPLN+SNEVE LLVEPKSDHVLGNCLRVQDFSCDFGYGIQTN  GGLDSNSKQGGEHELKFG                    DLDQLLDDA
Subjt:  VKVEGSFLPAPLNESNEVEHLLVEPKSDHVLGNCLRVQDFSCDFGYGIQTN-SGGLDSNSKQGGEHELKFGMTPNLTSDQPSFEEPSKFLKDLDQLLDDA

Query:  NEVGEFHATNNLPNTYAEVAENSFRQNRGLQLVNLSSESKSQGSSRSDTDAFGISELSATMVMEAEFNNTPVERGFTHELCAGLRTKGRCV--TPLGGSI
        NEVGEFHATNNL +TYAEVAENSFRQNRGLQL N SS SKSQG SRSDTDAFGISELSATMVME EFNNTPVERG THEL  GLRTKGRCV  TPL G+I
Subjt:  NEVGEFHATNNLPNTYAEVAENSFRQNRGLQLVNLSSESKSQGSSRSDTDAFGISELSATMVMEAEFNNTPVERGFTHELCAGLRTKGRCV--TPLGGSI

Query:  SDTILDNRNIHKFNTNECYIENGDLSDENVKGDIVASKLAS-SRERRLRKPTRRYIEEFADSKSESNKGRRKPPTKDKYMKGTSIEESNHIRHKVQMLTP
         DTILDNRNIHKFNTNE YIENGDLSDENVKGDIVA+KLAS SRERRLRKPTRRYIEEFADSKSE+NKGRRKPPTKDKY+K TS EESNHIRH+VQMLTP
Subjt:  SDTILDNRNIHKFNTNECYIENGDLSDENVKGDIVASKLAS-SRERRLRKPTRRYIEEFADSKSESNKGRRKPPTKDKYMKGTSIEESNHIRHKVQMLTP

Query:  RSESHCGTSVPVQSRSQRGHPKKPVPVSGFRSEDESSATECKSVYSSAKRSKKHDRRKHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPI
        RSE HCGTSVPVQSRSQR HPKK VPVSGF SEDESSATECK+VYSS KR KK+DRR+HQKMW+LTEVMRLVDGIAEYGTGRWT IKKHLFASSPHRTPI
Subjt:  RSESHCGTSVPVQSRSQRGHPKKPVPVSGFRSEDESSATECKSVYSSAKRSKKHDRRKHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPI

Query:  DLRDKWRNLLRASCVNMQNRKGIERKQSHASRPLPKSLLQRVYELANIYPYPKERSPKSVRATTPPMCLIESNSLSFNWGRKKYE
        DLRDKWRNLLRASCVN+QNRKGIERKQSHASRPLPKSLLQRVYELANIYPYPKERSPKSV+ATTPPM LIESNSLSFNWGRKKYE
Subjt:  DLRDKWRNLLRASCVNMQNRKGIERKQSHASRPLPKSLLQRVYELANIYPYPKERSPKSVRATTPPMCLIESNSLSFNWGRKKYE

XP_038881567.1 uncharacterized protein LOC120073047 isoform X2 [Benincasa hispida]1.7e-23186.83Show/hide
Query:  LVKVEGSFLPAPLNESNEVEHLLVEPKSDHVLGNCLRVQDFSCDFGYGIQTN-SGGLDSNSKQGGEHELKFGMTPNLTSDQPSFEEPSKFLKDLDQLLDD
        +V+VEG FLPAPLN+SNEVE LLVEPKSDHVLGNCLRVQDFSCDFGYGIQTN  GGLDSNSKQGGEHELKFG                    DLDQLLDD
Subjt:  LVKVEGSFLPAPLNESNEVEHLLVEPKSDHVLGNCLRVQDFSCDFGYGIQTN-SGGLDSNSKQGGEHELKFGMTPNLTSDQPSFEEPSKFLKDLDQLLDD

Query:  ANEVGEFHATNNLPNTYAEVAENSFRQNRGLQLVNLSSESKSQGSSRSDTDAFGISELSATMVMEAEFNNTPVERGFTHELCAGLRTKGRCV--TPLGGS
        ANEVGEFHATNNL +TYAEVAENSFRQNRGLQL N SS SKSQG SRSDTDAFGISELSATMVME EFNNTPVERG THEL  GLRTKGRCV  TPL G+
Subjt:  ANEVGEFHATNNLPNTYAEVAENSFRQNRGLQLVNLSSESKSQGSSRSDTDAFGISELSATMVMEAEFNNTPVERGFTHELCAGLRTKGRCV--TPLGGS

Query:  ISDTILDNRNIHKFNTNECYIENGDLSDENVKGDIVASKLAS-SRERRLRKPTRRYIEEFADSKSESNKGRRKPPTKDKYMKGTSIEESNHIRHKVQMLT
        I DTILDNRNIHKFNTNE YIENGDLSDENVKGDIVA+KLAS SRERRLRKPTRRYIEEFADSKSE+NKGRRKPPTKDKY+K TS EESNHIRH+VQMLT
Subjt:  ISDTILDNRNIHKFNTNECYIENGDLSDENVKGDIVASKLAS-SRERRLRKPTRRYIEEFADSKSESNKGRRKPPTKDKYMKGTSIEESNHIRHKVQMLT

Query:  PRSESHCGTSVPVQSRSQRGHPKKPVPVSGFRSEDESSATECKSVYSSAKRSKKHDRRKHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTP
        PRSE HCGTSVPVQSRSQR HPKK VPVSGF SEDESSATECK+VYSS KR KK+DRR+HQKMW+LTEVMRLVDGIAEYGTGRWT IKKHLFASSPHRTP
Subjt:  PRSESHCGTSVPVQSRSQRGHPKKPVPVSGFRSEDESSATECKSVYSSAKRSKKHDRRKHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTP

Query:  IDLRDKWRNLLRASCVNMQNRKGIERKQSHASRPLPKSLLQRVYELANIYPYPKERSPKSVRATTPPMCLIESNSLSFNWGRKKYE
        IDLRDKWRNLLRASCVN+QNRKGIERKQSHASRPLPKSLLQRVYELANIYPYPKERSPKSV+ATTPPM LIESNSLSFNWGRKKYE
Subjt:  IDLRDKWRNLLRASCVNMQNRKGIERKQSHASRPLPKSLLQRVYELANIYPYPKERSPKSVRATTPPMCLIESNSLSFNWGRKKYE

XP_038881569.1 uncharacterized protein LOC120073047 isoform X3 [Benincasa hispida]6.0e-22986.8Show/hide
Query:  VKVEGSFLPAPLNESNEVEHLLVEPKSDHVLGNCLRVQDFSCDFGYGIQTN-SGGLDSNSKQGGEHELKFGMTPNLTSDQPSFEEPSKFLKDLDQLLDDA
        V+VEG FLPAPLN+SNEVE LLVEPKSDHVLGNCLRVQDFSCDFGYGIQTN  GGLDSNSKQGGEHELKFG                    DLDQLLDDA
Subjt:  VKVEGSFLPAPLNESNEVEHLLVEPKSDHVLGNCLRVQDFSCDFGYGIQTN-SGGLDSNSKQGGEHELKFGMTPNLTSDQPSFEEPSKFLKDLDQLLDDA

Query:  NEVGEFHATNNLPNTYAEVAENSFRQNRGLQLVNLSSESKSQGSSRSDTDAFGISELSATMVMEAEFNNTPVERGFTHELCAGLRTKGRCV--TPLGGSI
        NEVGEFHATNNL N  AEVAENSFRQNRGLQL N SS SKSQG SRSDTDAFGISELSATMVME EFNNTPVERG THEL  GLRTKGRCV  TPL G+I
Subjt:  NEVGEFHATNNLPNTYAEVAENSFRQNRGLQLVNLSSESKSQGSSRSDTDAFGISELSATMVMEAEFNNTPVERGFTHELCAGLRTKGRCV--TPLGGSI

Query:  SDTILDNRNIHKFNTNECYIENGDLSDENVKGDIVASKLAS-SRERRLRKPTRRYIEEFADSKSESNKGRRKPPTKDKYMKGTSIEESNHIRHKVQMLTP
         DTILDNRNIHKFNTNE YIENGDLSDENVKGDIVA+KLAS SRERRLRKPTRRYIEEFADSKSE+NKGRRKPPTKDKY+K TS EESNHIRH+VQMLTP
Subjt:  SDTILDNRNIHKFNTNECYIENGDLSDENVKGDIVASKLAS-SRERRLRKPTRRYIEEFADSKSESNKGRRKPPTKDKYMKGTSIEESNHIRHKVQMLTP

Query:  RSESHCGTSVPVQSRSQRGHPKKPVPVSGFRSEDESSATECKSVYSSAKRSKKHDRRKHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPI
        RSE HCGTSVPVQSRSQR HPKK VPVSGF SEDESSATECK+VYSS KR KK+DRR+HQKMW+LTEVMRLVDGIAEYGTGRWT IKKHLFASSPHRTPI
Subjt:  RSESHCGTSVPVQSRSQRGHPKKPVPVSGFRSEDESSATECKSVYSSAKRSKKHDRRKHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPI

Query:  DLRDKWRNLLRASCVNMQNRKGIERKQSHASRPLPKSLLQRVYELANIYPYPKERSPKSVRATTPPMCLIESNSLSFNWGRKKYE
        DLRDKWRNLLRASCVN+QNRKGIERKQSHASRPLPKSLLQRVYELANIYPYPKERSPKSV+ATTPPM LIESNSLSFNWGRKKYE
Subjt:  DLRDKWRNLLRASCVNMQNRKGIERKQSHASRPLPKSLLQRVYELANIYPYPKERSPKSVRATTPPMCLIESNSLSFNWGRKKYE

TrEMBL top hitse value%identityAlignment
A0A1S3BKX9 uncharacterized protein LOC103491166 isoform X11.2e-21782.57Show/hide
Query:  VKVEGSFLPAPLNESNEVEHLLVEPKSDHVLGNCLRVQDFSCDFGYGIQTNSGGLDSNSKQGGEHELKFGMTPNLTSDQPSFEEPSKFLKDLDQLLDDAN
        VKVEG FLPAPLN+SNEVE LLVE KS+HVLGNCLRVQDFSCDFGYGIQTN GGLDSNSKQGGEHELKFG                    D DQLLDDAN
Subjt:  VKVEGSFLPAPLNESNEVEHLLVEPKSDHVLGNCLRVQDFSCDFGYGIQTNSGGLDSNSKQGGEHELKFGMTPNLTSDQPSFEEPSKFLKDLDQLLDDAN

Query:  EVGEFHATNNLPNTYAEVAENSFRQNRGLQLVNLSSESKSQGSSRSDTDAFGISELSATMVMEAEFNNTPVERGFTHELCAGLRTKGRCVTPLGGSISDT
        EVGEFHATNNLPNTYAEVAENSFR+NR  QL N SSE+KS G SR DTDAFGISELSATMVMEAEFNNTPVERG THEL  GL TKGRCVTPL G+I  T
Subjt:  EVGEFHATNNLPNTYAEVAENSFRQNRGLQLVNLSSESKSQGSSRSDTDAFGISELSATMVMEAEFNNTPVERGFTHELCAGLRTKGRCVTPLGGSISDT

Query:  ILDNRNIHKFNTNECYIENGDLSDENVKGDIVASKLAS-SRERRLRKPTRRYIEEFADSKSESNKGRRKPPTKDKYMKGTSIEESNHIRHKVQMLTPRSE
        ILDNRNIHKFNTNE YIENGDLSDENVKGDIVA++LAS SRERRLRKPTRRYIEEF DSKSE NKGRRK P KDKY+K  S EES HIRH+VQM+ PRS+
Subjt:  ILDNRNIHKFNTNECYIENGDLSDENVKGDIVASKLAS-SRERRLRKPTRRYIEEFADSKSESNKGRRKPPTKDKYMKGTSIEESNHIRHKVQMLTPRSE

Query:  SHCGTSVPVQSRSQRGHPKKPVPVSGFRSEDESSATECKSVYSSAKRSKKHDRRKHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLR
        S CGTSVPVQ +S+R HP K VPVSGF SEDESSATECK+VYSSA+R KK+DRR+ QKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLR
Subjt:  SHCGTSVPVQSRSQRGHPKKPVPVSGFRSEDESSATECKSVYSSAKRSKKHDRRKHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLR

Query:  DKWRNLLRASCVNMQNRKGIERKQSHASRPLPKSLLQRVYELANIYPYPKERSPKSVRATTPPMCLIESNSLSFNWGRKKYE
        DKWRNLLRASCVN+QN+KG+E KQ+HASRPLPKSLLQRVYELANIYPYPKER PKSV+A TPPM LIESNSLSFNWGRKKYE
Subjt:  DKWRNLLRASCVNMQNRKGIERKQSHASRPLPKSLLQRVYELANIYPYPKERSPKSVRATTPPMCLIESNSLSFNWGRKKYE

A0A1S3BLK0 uncharacterized protein LOC103491166 isoform X22.8e-21682.57Show/hide
Query:  VKVEGSFLPAPLNESNEVEHLLVEPKSDHVLGNCLRVQDFSCDFGYGIQTNSGGLDSNSKQGGEHELKFGMTPNLTSDQPSFEEPSKFLKDLDQLLDDAN
        VKVEG FLPAPLN+SNEVE LLVE KS+HVLGNCLRVQDFSCDFGYGIQTN GGLDSNSKQGGEHELKFG                    D DQLLDDAN
Subjt:  VKVEGSFLPAPLNESNEVEHLLVEPKSDHVLGNCLRVQDFSCDFGYGIQTNSGGLDSNSKQGGEHELKFGMTPNLTSDQPSFEEPSKFLKDLDQLLDDAN

Query:  EVGEFHATNNLPNTYAEVAENSFRQNRGLQLVNLSSESKSQGSSRSDTDAFGISELSATMVMEAEFNNTPVERGFTHELCAGLRTKGRCVTPLGGSISDT
        EVGEFHATNNLPNTYAEVAENSFR+NR  QL N SSE+KS G SR DTDAFGISELSATMVMEAEFNNTPVERG THEL  GL TKGRCVTPL G+I  T
Subjt:  EVGEFHATNNLPNTYAEVAENSFRQNRGLQLVNLSSESKSQGSSRSDTDAFGISELSATMVMEAEFNNTPVERGFTHELCAGLRTKGRCVTPLGGSISDT

Query:  ILDNRNIHKFNTNECYIENGDLSDENVKGDIVASKLAS-SRERRLRKPTRRYIEEFADSKSESNKGRRKPPTKDKYMKGTSIEESNHIRHKVQMLTPRSE
        ILDNRNIHKFNTNE YIENGDLSDENVKGDIVA++LAS SRERRLRKPTRRYIEEF DSKSE NKGRRK P KDKY+K  S EES HIRH+VQM+ PRS+
Subjt:  ILDNRNIHKFNTNECYIENGDLSDENVKGDIVASKLAS-SRERRLRKPTRRYIEEFADSKSESNKGRRKPPTKDKYMKGTSIEESNHIRHKVQMLTPRSE

Query:  SHCGTSVPVQSRSQRGHPKKPVPVSGFRSEDESSATECKSVYSSAKRSKKHDRRKHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLR
        S CGTSVPVQ +S+R HP K VPVSGF SEDESSATECK+VYSSA+R KK+DRR+ QKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLR
Subjt:  SHCGTSVPVQSRSQRGHPKKPVPVSGFRSEDESSATECKSVYSSAKRSKKHDRRKHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLR

Query:  DKWRNLLRASCVNMQNRKGIERKQSHASRPLPKSLLQRVYELANIYPYPKERSPKSVRATTPPMCLIESNSLSFNWGRKKYE
        DKWRNLLRASCVN+QN+KG+E KQ+HASRPLPKSLLQRVYELANIYPYPKER PKSV+A TPPM LIESNSLSFNWGRKKYE
Subjt:  DKWRNLLRASCVNMQNRKGIERKQSHASRPLPKSLLQRVYELANIYPYPKERSPKSVRATTPPMCLIESNSLSFNWGRKKYE

A0A6J1CRG2 uncharacterized protein LOC111013581 isoform X25.5e-22083.23Show/hide
Query:  VKVEGSFLPAPLNESNEVEHLLVEPKSDHVLGNCLRVQDFSCDFGYGIQTNSGGLDSNSKQGGEHELKFGMTPNLTSDQPSFEEPSKFLKDLDQLLDDAN
        VKV+GSFLPAPLNE NEVEHLLVEPKS+HVLG+CLR QDFSCDF YGIQTN GGLDSNSKQ GEHELKF                    +DLDQLL D N
Subjt:  VKVEGSFLPAPLNESNEVEHLLVEPKSDHVLGNCLRVQDFSCDFGYGIQTNSGGLDSNSKQGGEHELKFGMTPNLTSDQPSFEEPSKFLKDLDQLLDDAN

Query:  EVGEFHATNNLPNTYAEVAENSFRQNRGLQLVNLSSESKSQGSSRSDTDAFGISELSATMVMEAEFNN-TPVERGFTHELCAGLRTKGRCVTPLGGSISD
        EV EFHATNNLPNTY EVAENSFR+NRGLQL NLSSESKSQGSSR+DT+AF ISELSA MV EAE NN TPV+RG THELCAGLRTKGRC TPL GSI  
Subjt:  EVGEFHATNNLPNTYAEVAENSFRQNRGLQLVNLSSESKSQGSSRSDTDAFGISELSATMVMEAEFNN-TPVERGFTHELCAGLRTKGRCVTPLGGSISD

Query:  TILDNRNIHKFNTNECYIENGDLSDENVKGDIVASKLA-SSRERRLRKPTRRYIEEFADSKSESNKGRRKPPTKDKYMKGTSIEESNHIRHKVQMLTPRS
        TILDN NIHKF+TNE  +ENG LSDENVKG+I ASKLA  SR+RRLRKPTRRYIEEFADSKSES+KG+RKPPTKDKY+K TSIEESNHIRHKVQMLTP  
Subjt:  TILDNRNIHKFNTNECYIENGDLSDENVKGDIVASKLA-SSRERRLRKPTRRYIEEFADSKSESNKGRRKPPTKDKYMKGTSIEESNHIRHKVQMLTPRS

Query:  ESHCGTSVPVQSRSQRGHPKKPVPVSGFRSEDESSATECKSVYSSAKRSKKHDRRKHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDL
        ESHCGTS+PVQSRSQR  PKK VPVSGF SE+ESSATECK VYSSAKR KKHDRRKHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFA+SP+RTPIDL
Subjt:  ESHCGTSVPVQSRSQRGHPKKPVPVSGFRSEDESSATECKSVYSSAKRSKKHDRRKHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDL

Query:  RDKWRNLLRASCVNMQNRKGIERKQSHASRPLPKSLLQRVYELANIYPYPKERSPKSVRATTPPMCLIESNSLSFNWGRKKYE
        RDKWRNLLRASCVN+QNR GIERKQSHASRPLPKSLLQRVYELANIYPYPKERSPKSV+ATT PM LIESNSLSFNWGRKKY+
Subjt:  RDKWRNLLRASCVNMQNRKGIERKQSHASRPLPKSLLQRVYELANIYPYPKERSPKSVRATTPPMCLIESNSLSFNWGRKKYE

A0A6J1CRQ1 uncharacterized protein LOC111013581 isoform X11.4e-21883.06Show/hide
Query:  VKVEGSFLPAPLNESNEVEHLLVEPKSDHVLGNCLRVQDFSCDFGYGIQTNSGGLDSNSKQGGEHELKFGMTPNLTSDQPSFEEPSKFLKDLDQLLDDAN
        VKV+GSFLPAPLNE NEVEHLLVEPKS+HVLG+CLR QDFSCDF YGIQTN GGLDSNSKQ GEHELKF                    +DLDQLL D N
Subjt:  VKVEGSFLPAPLNESNEVEHLLVEPKSDHVLGNCLRVQDFSCDFGYGIQTNSGGLDSNSKQGGEHELKFGMTPNLTSDQPSFEEPSKFLKDLDQLLDDAN

Query:  EVGEFHATNNLPNTYAEVAENSFRQNRGLQLVNLSSESKSQGSSRSDTDAFGISELSATMVMEAEFNN-TPVERGFTHELCAGLRTKGRCVTPLGGSISD
        EV EFHATNNLPNTY EVAENSFR+NRGLQL NLSSESKSQGSSR+DT+AF ISELSA MV EAE NN TPV+RG THELCAGLRTKGRC TPL GSI  
Subjt:  EVGEFHATNNLPNTYAEVAENSFRQNRGLQLVNLSSESKSQGSSRSDTDAFGISELSATMVMEAEFNN-TPVERGFTHELCAGLRTKGRCVTPLGGSISD

Query:  TILDNRNIHKFNTNECYIENGDLSDENVKGDIVASKLA-SSRERRLRKPTRRYIEEFADSKSESNKGRRKPPTKDKYMKGTSIEESNHIRHKVQMLTPRS
        TILDN NIHKF+TNE  +ENG LSDENVKG+I ASKLA  SR+RRLRKPTRRYIEEFADSKSES+KG+RKPPTKDKY+K TSIEESNHIRHKVQMLTP  
Subjt:  TILDNRNIHKFNTNECYIENGDLSDENVKGDIVASKLA-SSRERRLRKPTRRYIEEFADSKSESNKGRRKPPTKDKYMKGTSIEESNHIRHKVQMLTPRS

Query:  ESHCGTSVPVQSRSQRGHPKKPVPVSGFRSEDESSATECKSVYSSAKRSKKHDRRKHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDL
        ESHCGTS+PVQSRSQR  PKK VPVSGF SE+ESSATECK VYSSAKR KKHDRRKHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFA+SP+RTPIDL
Subjt:  ESHCGTSVPVQSRSQRGHPKKPVPVSGFRSEDESSATECKSVYSSAKRSKKHDRRKHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDL

Query:  R-DKWRNLLRASCVNMQNRKGIERKQSHASRPLPKSLLQRVYELANIYPYPKERSPKSVRATTPPMCLIESNSLSFNWGRKKYE
        R DKWRNLLRASCVN+QNR GIERKQSHASRPLPKSLLQRVYELANIYPYPKERSPKSV+ATT PM LIESNSLSFNWGRKKY+
Subjt:  R-DKWRNLLRASCVNMQNRKGIERKQSHASRPLPKSLLQRVYELANIYPYPKERSPKSVRATTPPMCLIESNSLSFNWGRKKYE

A0A6J1EGA3 uncharacterized protein LOC111433102 isoform X21.9e-19174.9Show/hide
Query:  VKVEGSFLPAPLNESNEVEHLLVEPKSDHVLGNCLRVQDFSCDFGYGIQTNSGGLDSNSKQGGEHELKFGMTPNLTSDQPSFEEPSKFLKDLDQLLDDAN
        VK+EGSFLPAPLNESNEV+H LVEPKSDH LGNCLRVQDFS DFGY IQTN                                                 
Subjt:  VKVEGSFLPAPLNESNEVEHLLVEPKSDHVLGNCLRVQDFSCDFGYGIQTNSGGLDSNSKQGGEHELKFGMTPNLTSDQPSFEEPSKFLKDLDQLLDDAN

Query:  EVGEFHATNNLPNTYAEVAENSFRQNRGLQLVNLSSESKSQGSSRSDTDAFGISELSATMVMEAEFNNTPVERGFTHELCAGLRTKGRCVTPLGGSISDT
                       AEV ENSFRQNRGLQL   SSESKSQGSSRSDTDAF ISELSATMVMEAEFNNTPVER  T EL +GLRT+G C TP  G+I DT
Subjt:  EVGEFHATNNLPNTYAEVAENSFRQNRGLQLVNLSSESKSQGSSRSDTDAFGISELSATMVMEAEFNNTPVERGFTHELCAGLRTKGRCVTPLGGSISDT

Query:  ILDNRNIHKFNTNECYIENGDLSDENVKGDIVASKLAS-SRERRLRKPTRRYIEEFADSKSESNKGRRKPPTKDKYMKGTSIEESNHIRHKVQMLTPRSE
        ILDN NIHKFNTNE Y+EN  +SDENVKGDIVASKLAS SRERRLRKPTRRYIEEFADSKSE+NKGRRKPPTKDKY+K TS EESNHIRHKVQMLTP+ E
Subjt:  ILDNRNIHKFNTNECYIENGDLSDENVKGDIVASKLAS-SRERRLRKPTRRYIEEFADSKSESNKGRRKPPTKDKYMKGTSIEESNHIRHKVQMLTPRSE

Query:  SHCGTSVPVQSRSQRGHPKKPVPVSGFRSEDESSATECKSVYSSAKRSKKHDRRKHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLR
        SHCGTSVPVQSRSQR HP+K VPVSGF SEDE SATECK+VYSSAK  KK+DRRKHQKMWTLTEVMRLVDGIAEYGTGRWT IK+HLFASSPHRTPIDLR
Subjt:  SHCGTSVPVQSRSQRGHPKKPVPVSGFRSEDESSATECKSVYSSAKRSKKHDRRKHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLR

Query:  DKWRNLLRASCVNMQNRKGIERKQSHASRPLPKSLLQRVYELANIYPYPKERSPKSVRATTPPMCLIESNSLSFNWGRKKYE
        DKWRNLL+ASCVN+QN KG E KQ HASRPLPKSLLQRVYELANIYPYPKERSPK VR  TPPM LIESNSLSFNWGRKKYE
Subjt:  DKWRNLLRASCVNMQNRKGIERKQSHASRPLPKSLLQRVYELANIYPYPKERSPKSVRATTPPMCLIESNSLSFNWGRKKYE

SwissProt top hitse value%identityAlignment
Q9C7B1 Telomere repeat-binding protein 31.4e-1036.78Show/hide
Query:  RRKHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLLRASCVNMQNRKGIERKQSHASRPLPKSLLQRV
        +R+ ++ +++TEV  LV  + E GTGRW  +K   F  + HRT +DL+DKW+ L+  + ++ Q R+G          P+P+ LL RV
Subjt:  RRKHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLLRASCVNMQNRKGIERKQSHASRPLPKSLLQRV

Q9FFY9 Telomere repeat-binding protein 41.1e-1036.11Show/hide
Query:  ESSATECKSVYSSAKRSKKHDRRKHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLLRASCVNMQNRKGIERKQSHASRPL
        ES A     V    KR++   RR  ++ +++TEV  LV  + E GTGRW  +K   F ++ HRT +DL+DKW+ L+  + ++ Q R+G          P+
Subjt:  ESSATECKSVYSSAKRSKKHDRRKHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLLRASCVNMQNRKGIERKQSHASRPL

Query:  PKSLLQRV
        P+ LL RV
Subjt:  PKSLLQRV

Q9LL45 Telomere-binding protein 12.0e-0936.84Show/hide
Query:  RSKKHD--RRKHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLLRASCVNMQNRKGIERKQSHASRPLPKSLLQRV
        RSK+ D  +R+ ++ +T+ EV  LV+ +   GTGRW  +K   F +  HRT +DL+DKW+ L+  + +  Q R+G          P+P+ LL RV
Subjt:  RSKKHD--RRKHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLLRASCVNMQNRKGIERKQSHASRPLPKSLLQRV

Q9M347 Telomere repeat-binding protein 65.9e-0932.46Show/hide
Query:  SEDESSATECKSVYSSAKR---SKKHDRRKHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLLRASCVNMQNRKGIERKQS
        S D + A   KSV     +     +  +R+ ++ +T++EV  LV  +   GTGRW  +K H F    HRT +DL+DKW+ L+  + ++ + R+G      
Subjt:  SEDESSATECKSVYSSAKR---SKKHDRRKHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLLRASCVNMQNRKGIERKQS

Query:  HASRPLPKSLLQRV
            P+P+ LL RV
Subjt:  HASRPLPKSLLQRV

Q9SNB9 Telomere repeat-binding protein 29.0e-1035.63Show/hide
Query:  RRKHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLLRASCVNMQNRKGIERKQSHASRPLPKSLLQRV
        +R+ ++ +++TEV  LV  + + GTGRW  +K   F  + HRT +DL+DKW+ L+  + ++ Q R+G          P+P+ LL RV
Subjt:  RRKHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLLRASCVNMQNRKGIERKQSHASRPLPKSLLQRV

Arabidopsis top hitse value%identityAlignment
AT1G17460.1 TRF-like 39.6e-1536.42Show/hide
Query:  RSESHCGTSVPVQSRSQRGHPKKPV--PVSGFRSEDE---SSATECKSVYSSAKRSKKHDRRKHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSP
        +S S C     VQ  S + H  KP+   V     E E   SS       +  A  ++    RK  + WT++EV +LV+G+++YG G+WT IKK  F+   
Subjt:  RSESHCGTSVPVQSRSQRGHPKKPV--PVSGFRSEDE---SSATECKSVYSSAKRSKKHDRRKHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSP

Query:  HRTPIDLRDKWRNLLRASCVNMQNRKGIERKQSHASRPLPKSLLQRVYELA
        HRT +DL+DKWRNL +AS  N +   G+++   H S  +P  ++ +V ELA
Subjt:  HRTPIDLRDKWRNLLRASCVNMQNRKGIERKQSHASRPLPKSLLQRVYELA

AT1G72650.1 TRF-like 61.8e-2129.79Show/hide
Query:  ASKLASSRER---RLRKPTRRYIEEFADSKSESNKGRRKPPTKDKYMKGTSIEESNHIRHKVQMLTPRSESHCGT--SVPVQSRSQRGHPKKPV-PVSGF
        AS+  SS +R   R+RKPTRRYIEE +++  +    +   P+KD+ +   S   S  +    ++   R  S  G+   VP  S  +R  P++ +  + G 
Subjt:  ASKLASSRER---RLRKPTRRYIEEFADSKSESNKGRRKPPTKDKYMKGTSIEESNHIRHKVQMLTPRSESHCGT--SVPVQSRSQRGHPKKPV-PVSGF

Query:  RS---EDESSATECK----------------SVYSSAKRSKKHD-----------------------------------------------RRKHQKMWT
         S   ED++SA E                  SV  SA R  +++                                               RRKH + WT
Subjt:  RS---EDESSATECK----------------SVYSSAKRSKKHD-----------------------------------------------RRKHQKMWT

Query:  LTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLLRASCVNMQNRKGIERKQSHASRPLPKSLLQRVYELA
        L+E+ +LV+G+++YG G+W+ IKKHLF+S  +RT +DL+DKWRNLL+ S     +   +   + H S  +P  +L RV ELA
Subjt:  LTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLLRASCVNMQNRKGIERKQSHASRPLPKSLLQRVYELA

AT1G72650.2 TRF-like 61.8e-2129.79Show/hide
Query:  ASKLASSRER---RLRKPTRRYIEEFADSKSESNKGRRKPPTKDKYMKGTSIEESNHIRHKVQMLTPRSESHCGT--SVPVQSRSQRGHPKKPV-PVSGF
        AS+  SS +R   R+RKPTRRYIEE +++  +    +   P+KD+ +   S   S  +    ++   R  S  G+   VP  S  +R  P++ +  + G 
Subjt:  ASKLASSRER---RLRKPTRRYIEEFADSKSESNKGRRKPPTKDKYMKGTSIEESNHIRHKVQMLTPRSESHCGT--SVPVQSRSQRGHPKKPV-PVSGF

Query:  RS---EDESSATECK----------------SVYSSAKRSKKHD-----------------------------------------------RRKHQKMWT
         S   ED++SA E                  SV  SA R  +++                                               RRKH + WT
Subjt:  RS---EDESSATECK----------------SVYSSAKRSKKHD-----------------------------------------------RRKHQKMWT

Query:  LTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLLRASCVNMQNRKGIERKQSHASRPLPKSLLQRVYELA
        L+E+ +LV+G+++YG G+W+ IKKHLF+S  +RT +DL+DKWRNLL+ S     +   +   + H S  +P  +L RV ELA
Subjt:  LTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLLRASCVNMQNRKGIERKQSHASRPLPKSLLQRVYELA

AT2G37025.1 TRF-like 81.4e-2941.28Show/hide
Query:  RSQRGHPKKPVPVSGFR--SEDESSATECKSVYSSAKRSK-KHDRRKHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLLR
        R ++G  KK +  S  +  S+D+ + +E +   S  K S+ K DRRK+Q++WTL EVM LVDGI+ +G G+WT IK H F  + HR P+D+RDKWRNLL+
Subjt:  RSQRGHPKKPVPVSGFR--SEDESSATECKSVYSSAKRSK-KHDRRKHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLLR

Query:  ASCVNMQNRKGIERKQSHASRPLPKSLLQRVYELANIYPYPKERSPKSVRATTPPMCLIESNSLSFNWGRKK
        AS     N    E K+   +R +PK +L RV ELA+++PYP  +SP          C++  +S S +  + K
Subjt:  ASCVNMQNRKGIERKQSHASRPLPKSLLQRVYELANIYPYPKERSPKSVRATTPPMCLIESNSLSFNWGRKK

AT2G37025.2 TRF-like 81.4e-2941.28Show/hide
Query:  RSQRGHPKKPVPVSGFR--SEDESSATECKSVYSSAKRSK-KHDRRKHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLLR
        R ++G  KK +  S  +  S+D+ + +E +   S  K S+ K DRRK+Q++WTL EVM LVDGI+ +G G+WT IK H F  + HR P+D+RDKWRNLL+
Subjt:  RSQRGHPKKPVPVSGFR--SEDESSATECKSVYSSAKRSK-KHDRRKHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLLR

Query:  ASCVNMQNRKGIERKQSHASRPLPKSLLQRVYELANIYPYPKERSPKSVRATTPPMCLIESNSLSFNWGRKK
        AS     N    E K+   +R +PK +L RV ELA+++PYP  +SP          C++  +S S +  + K
Subjt:  ASCVNMQNRKGIERKQSHASRPLPKSLLQRVYELANIYPYPKERSPKSVRATTPPMCLIESNSLSFNWGRKK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATTGTGAGTCTACGAGATGTGGTGTTCGACAAGCTGCAGATTTCAGTGGACTGCGAGGCACTTTTGGTCAAAGTGGAGGGATCTTTTCTTCCTGCACCATTAAATGA
ATCAAATGAAGTTGAGCATTTACTTGTGGAGCCTAAAAGCGACCATGTTTTAGGAAATTGCTTGAGAGTTCAAGATTTCTCTTGTGACTTTGGCTATGGGATACAAACAA
ACAGCGGTGGATTGGATTCTAATAGCAAGCAGGGAGGCGAACATGAACTTAAATTTGGAATGACGCCCAACCTAACTAGTGACCAACCCTCCTTCGAAGAGCCTTCCAAA
TTCTTAAAAGATCTTGATCAACTGCTGGATGATGCCAATGAAGTAGGGGAATTCCATGCAACAAACAATCTGCCAAATACATATGCAGAAGTTGCTGAAAATTCTTTCAG
ACAGAATAGGGGATTACAATTGGTAAACTTAAGTTCAGAGAGCAAGTCTCAGGGATCAAGCAGGAGTGATACTGATGCTTTTGGAATATCAGAATTGTCAGCAACAATGG
TAATGGAGGCTGAATTCAATAATACACCTGTTGAGAGGGGTTTTACTCATGAGTTGTGTGCTGGTCTGAGGACCAAAGGTAGGTGCGTAACACCACTTGGAGGCAGCATC
AGCGATACAATACTTGATAATAGAAATATCCATAAGTTTAATACTAATGAATGCTATATAGAAAATGGCGATTTATCTGATGAGAATGTGAAAGGTGATATTGTGGCAAG
CAAACTTGCCAGTTCAAGGGAGAGGAGATTGCGTAAGCCTACTCGAAGATACATTGAAGAATTTGCAGATTCGAAGTCTGAAAGTAACAAGGGAAGGAGAAAACCTCCTA
CAAAAGATAAATACATGAAAGGGACGTCTATTGAAGAATCTAATCATATTAGACATAAGGTACAAATGTTGACGCCTAGAAGTGAATCGCATTGTGGTACTTCTGTTCCT
GTGCAGTCTCGATCTCAAAGAGGACATCCAAAGAAGCCTGTACCAGTTTCAGGATTTCGATCAGAAGACGAATCTTCTGCAACTGAATGTAAAAGTGTTTATTCATCTGC
TAAAAGAAGTAAAAAGCACGATAGGAGGAAGCACCAGAAGATGTGGACCCTTACTGAAGTAATGCGATTAGTTGATGGAATTGCTGAATATGGAACTGGCCGCTGGACTC
ATATAAAGAAGCACCTATTTGCATCTTCTCCTCATCGTACACCTATAGATCTCAGGGACAAATGGCGAAATCTTCTGAGAGCTAGCTGTGTTAACATGCAGAACAGAAAA
GGGATCGAACGGAAGCAATCACATGCCTCGCGCCCACTGCCAAAGTCCCTGCTCCAACGTGTCTATGAACTGGCCAATATTTATCCATATCCAAAGGAGCGCAGTCCAAA
GTCAGTCAGAGCAACAACACCTCCCATGTGTCTTATTGAAAGTAACTCTCTGTCATTCAATTGGGGGCGGAAGAAGTATGAATGA
mRNA sequenceShow/hide mRNA sequence
ATGATTGTGAGTCTACGAGATGTGGTGTTCGACAAGCTGCAGATTTCAGTGGACTGCGAGGCACTTTTGGTCAAAGTGGAGGGATCTTTTCTTCCTGCACCATTAAATGA
ATCAAATGAAGTTGAGCATTTACTTGTGGAGCCTAAAAGCGACCATGTTTTAGGAAATTGCTTGAGAGTTCAAGATTTCTCTTGTGACTTTGGCTATGGGATACAAACAA
ACAGCGGTGGATTGGATTCTAATAGCAAGCAGGGAGGCGAACATGAACTTAAATTTGGAATGACGCCCAACCTAACTAGTGACCAACCCTCCTTCGAAGAGCCTTCCAAA
TTCTTAAAAGATCTTGATCAACTGCTGGATGATGCCAATGAAGTAGGGGAATTCCATGCAACAAACAATCTGCCAAATACATATGCAGAAGTTGCTGAAAATTCTTTCAG
ACAGAATAGGGGATTACAATTGGTAAACTTAAGTTCAGAGAGCAAGTCTCAGGGATCAAGCAGGAGTGATACTGATGCTTTTGGAATATCAGAATTGTCAGCAACAATGG
TAATGGAGGCTGAATTCAATAATACACCTGTTGAGAGGGGTTTTACTCATGAGTTGTGTGCTGGTCTGAGGACCAAAGGTAGGTGCGTAACACCACTTGGAGGCAGCATC
AGCGATACAATACTTGATAATAGAAATATCCATAAGTTTAATACTAATGAATGCTATATAGAAAATGGCGATTTATCTGATGAGAATGTGAAAGGTGATATTGTGGCAAG
CAAACTTGCCAGTTCAAGGGAGAGGAGATTGCGTAAGCCTACTCGAAGATACATTGAAGAATTTGCAGATTCGAAGTCTGAAAGTAACAAGGGAAGGAGAAAACCTCCTA
CAAAAGATAAATACATGAAAGGGACGTCTATTGAAGAATCTAATCATATTAGACATAAGGTACAAATGTTGACGCCTAGAAGTGAATCGCATTGTGGTACTTCTGTTCCT
GTGCAGTCTCGATCTCAAAGAGGACATCCAAAGAAGCCTGTACCAGTTTCAGGATTTCGATCAGAAGACGAATCTTCTGCAACTGAATGTAAAAGTGTTTATTCATCTGC
TAAAAGAAGTAAAAAGCACGATAGGAGGAAGCACCAGAAGATGTGGACCCTTACTGAAGTAATGCGATTAGTTGATGGAATTGCTGAATATGGAACTGGCCGCTGGACTC
ATATAAAGAAGCACCTATTTGCATCTTCTCCTCATCGTACACCTATAGATCTCAGGGACAAATGGCGAAATCTTCTGAGAGCTAGCTGTGTTAACATGCAGAACAGAAAA
GGGATCGAACGGAAGCAATCACATGCCTCGCGCCCACTGCCAAAGTCCCTGCTCCAACGTGTCTATGAACTGGCCAATATTTATCCATATCCAAAGGAGCGCAGTCCAAA
GTCAGTCAGAGCAACAACACCTCCCATGTGTCTTATTGAAAGTAACTCTCTGTCATTCAATTGGGGGCGGAAGAAGTATGAATGA
Protein sequenceShow/hide protein sequence
MIVSLRDVVFDKLQISVDCEALLVKVEGSFLPAPLNESNEVEHLLVEPKSDHVLGNCLRVQDFSCDFGYGIQTNSGGLDSNSKQGGEHELKFGMTPNLTSDQPSFEEPSK
FLKDLDQLLDDANEVGEFHATNNLPNTYAEVAENSFRQNRGLQLVNLSSESKSQGSSRSDTDAFGISELSATMVMEAEFNNTPVERGFTHELCAGLRTKGRCVTPLGGSI
SDTILDNRNIHKFNTNECYIENGDLSDENVKGDIVASKLASSRERRLRKPTRRYIEEFADSKSESNKGRRKPPTKDKYMKGTSIEESNHIRHKVQMLTPRSESHCGTSVP
VQSRSQRGHPKKPVPVSGFRSEDESSATECKSVYSSAKRSKKHDRRKHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLLRASCVNMQNRK
GIERKQSHASRPLPKSLLQRVYELANIYPYPKERSPKSVRATTPPMCLIESNSLSFNWGRKKYE