; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC01g0536 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC01g0536
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionHTH myb-type domain-containing protein
Genome locationMC01:11770848..11779783
RNA-Seq ExpressionMC01g0536
SyntenyMC01g0536
Gene Ontology termsNA
InterPro domainsIPR001005 - SANT/Myb domain
IPR009057 - Homeobox-like domain superfamily
IPR017930 - Myb domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022143737.1 uncharacterized protein LOC111013581 isoform X1 [Momordica charantia]0.099.79Show/hide
Query:  MDQEVHFCQKFTNMKSHWVKVDGSFLPAPLNEPNEVEHLLVEPKSNHVLGDCLRAQDFSCDFDYGIQTNRGGLDSNSKQAGEHELKFEDLDQLLADDNEV
        MDQEVHFCQKFTNMKSHWVKVDGSFLPAPLNEPNEVEHLLVEPKSNHVLGDCLRAQDFSCDFDYGIQTNRGGLDSNSKQAGEHELKFEDLDQLLADDNEV
Subjt:  MDQEVHFCQKFTNMKSHWVKVDGSFLPAPLNEPNEVEHLLVEPKSNHVLGDCLRAQDFSCDFDYGIQTNRGGLDSNSKQAGEHELKFEDLDQLLADDNEV

Query:  EEFHATNNLPNTYTEVAENSFRRNRGLQLGNLSSESKSQGSSRNDTEAFEISELSAAMVREAESNNTTPVDRGLTHELCAGLRTKGRCETPLDGSICSTI
        EEFHATNNLPNTYTEVAENSFRRNRGLQLGNLSSESKSQGSSRNDTEAFEISELSAAMVREAESNNTTPVDRGLTHELCAGLRTKGRCETPLDGSICSTI
Subjt:  EEFHATNNLPNTYTEVAENSFRRNRGLQLGNLSSESKSQGSSRNDTEAFEISELSAAMVREAESNNTTPVDRGLTHELCAGLRTKGRCETPLDGSICSTI

Query:  LDNINIHKFDTNERCLENGGLSDENVKGNIEASKLAICSRDRRLRKPTRRYIEEFADSKSESHKGKRKPPTKDKYVKVTSIEESNHIRHKVQMLTPGGES
        LDNINIHKFDTNERCLENGGLSDENVKGNIEASKLAICSRDRRLRKPTRRYIEEFADSKSESHKGKRKPPTKDKYVKVTSIEESNHIRHKVQMLTPGGES
Subjt:  LDNINIHKFDTNERCLENGGLSDENVKGNIEASKLAICSRDRRLRKPTRRYIEEFADSKSESHKGKRKPPTKDKYVKVTSIEESNHIRHKVQMLTPGGES

Query:  HCGTSIPVQSRSQRRLPKKHVPVSGFLSEEESSATECKIVYSSAKRCKKHDRRKHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFATSPYRTPIDLR-
        HCGTSIPVQSRSQRRLPKKHVPVSGFLSEEESSATECKIVYSSAKRCKKHDRRKHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFATSPYRTPIDLR 
Subjt:  HCGTSIPVQSRSQRRLPKKHVPVSGFLSEEESSATECKIVYSSAKRCKKHDRRKHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFATSPYRTPIDLR-

Query:  DKWRNLLRASCVNIQNRTGIERKQSHASRPLPKSLLQRVYELANIYPYPKERSPKSVKATTSPMHLIESNSLSFNWGRKKYD
        DKWRNLLRASCVNIQNRTGIERKQSHASRPLPKSLLQRVYELANIYPYPKERSPKSVKATTSPMHLIESNSLSFNWGRKKYD
Subjt:  DKWRNLLRASCVNIQNRTGIERKQSHASRPLPKSLLQRVYELANIYPYPKERSPKSVKATTSPMHLIESNSLSFNWGRKKYD

XP_022143738.1 uncharacterized protein LOC111013581 isoform X2 [Momordica charantia]0.0100Show/hide
Query:  MDQEVHFCQKFTNMKSHWVKVDGSFLPAPLNEPNEVEHLLVEPKSNHVLGDCLRAQDFSCDFDYGIQTNRGGLDSNSKQAGEHELKFEDLDQLLADDNEV
        MDQEVHFCQKFTNMKSHWVKVDGSFLPAPLNEPNEVEHLLVEPKSNHVLGDCLRAQDFSCDFDYGIQTNRGGLDSNSKQAGEHELKFEDLDQLLADDNEV
Subjt:  MDQEVHFCQKFTNMKSHWVKVDGSFLPAPLNEPNEVEHLLVEPKSNHVLGDCLRAQDFSCDFDYGIQTNRGGLDSNSKQAGEHELKFEDLDQLLADDNEV

Query:  EEFHATNNLPNTYTEVAENSFRRNRGLQLGNLSSESKSQGSSRNDTEAFEISELSAAMVREAESNNTTPVDRGLTHELCAGLRTKGRCETPLDGSICSTI
        EEFHATNNLPNTYTEVAENSFRRNRGLQLGNLSSESKSQGSSRNDTEAFEISELSAAMVREAESNNTTPVDRGLTHELCAGLRTKGRCETPLDGSICSTI
Subjt:  EEFHATNNLPNTYTEVAENSFRRNRGLQLGNLSSESKSQGSSRNDTEAFEISELSAAMVREAESNNTTPVDRGLTHELCAGLRTKGRCETPLDGSICSTI

Query:  LDNINIHKFDTNERCLENGGLSDENVKGNIEASKLAICSRDRRLRKPTRRYIEEFADSKSESHKGKRKPPTKDKYVKVTSIEESNHIRHKVQMLTPGGES
        LDNINIHKFDTNERCLENGGLSDENVKGNIEASKLAICSRDRRLRKPTRRYIEEFADSKSESHKGKRKPPTKDKYVKVTSIEESNHIRHKVQMLTPGGES
Subjt:  LDNINIHKFDTNERCLENGGLSDENVKGNIEASKLAICSRDRRLRKPTRRYIEEFADSKSESHKGKRKPPTKDKYVKVTSIEESNHIRHKVQMLTPGGES

Query:  HCGTSIPVQSRSQRRLPKKHVPVSGFLSEEESSATECKIVYSSAKRCKKHDRRKHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFATSPYRTPIDLRD
        HCGTSIPVQSRSQRRLPKKHVPVSGFLSEEESSATECKIVYSSAKRCKKHDRRKHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFATSPYRTPIDLRD
Subjt:  HCGTSIPVQSRSQRRLPKKHVPVSGFLSEEESSATECKIVYSSAKRCKKHDRRKHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFATSPYRTPIDLRD

Query:  KWRNLLRASCVNIQNRTGIERKQSHASRPLPKSLLQRVYELANIYPYPKERSPKSVKATTSPMHLIESNSLSFNWGRKKYD
        KWRNLLRASCVNIQNRTGIERKQSHASRPLPKSLLQRVYELANIYPYPKERSPKSVKATTSPMHLIESNSLSFNWGRKKYD
Subjt:  KWRNLLRASCVNIQNRTGIERKQSHASRPLPKSLLQRVYELANIYPYPKERSPKSVKATTSPMHLIESNSLSFNWGRKKYD

XP_022143739.1 uncharacterized protein LOC111013581 isoform X3 [Momordica charantia]3.14e-28399.75Show/hide
Query:  DLDQLLADDNEVEEFHATNNLPNTYTEVAENSFRRNRGLQLGNLSSESKSQGSSRNDTEAFEISELSAAMVREAESNNTTPVDRGLTHELCAGLRTKGRC
        DLDQLLADDNEVEEFHATNNLPNTYTEVAENSFRRNRGLQLGNLSSESKSQGSSRNDTEAFEISELSAAMVREAESNNTTPVDRGLTHELCAGLRTKGRC
Subjt:  DLDQLLADDNEVEEFHATNNLPNTYTEVAENSFRRNRGLQLGNLSSESKSQGSSRNDTEAFEISELSAAMVREAESNNTTPVDRGLTHELCAGLRTKGRC

Query:  ETPLDGSICSTILDNINIHKFDTNERCLENGGLSDENVKGNIEASKLAICSRDRRLRKPTRRYIEEFADSKSESHKGKRKPPTKDKYVKVTSIEESNHIR
        ETPLDGSICSTILDNINIHKFDTNERCLENGGLSDENVKGNIEASKLAICSRDRRLRKPTRRYIEEFADSKSESHKGKRKPPTKDKYVKVTSIEESNHIR
Subjt:  ETPLDGSICSTILDNINIHKFDTNERCLENGGLSDENVKGNIEASKLAICSRDRRLRKPTRRYIEEFADSKSESHKGKRKPPTKDKYVKVTSIEESNHIR

Query:  HKVQMLTPGGESHCGTSIPVQSRSQRRLPKKHVPVSGFLSEEESSATECKIVYSSAKRCKKHDRRKHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFA
        HKVQMLTPGGESHCGTSIPVQSRSQRRLPKKHVPVSGFLSEEESSATECKIVYSSAKRCKKHDRRKHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFA
Subjt:  HKVQMLTPGGESHCGTSIPVQSRSQRRLPKKHVPVSGFLSEEESSATECKIVYSSAKRCKKHDRRKHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFA

Query:  TSPYRTPIDLR-DKWRNLLRASCVNIQNRTGIERKQSHASRPLPKSLLQRVYELANIYPYPKERSPKSVKATTSPMHLIESNSLSFNWGRKKYD
        TSPYRTPIDLR DKWRNLLRASCVNIQNRTGIERKQSHASRPLPKSLLQRVYELANIYPYPKERSPKSVKATTSPMHLIESNSLSFNWGRKKYD
Subjt:  TSPYRTPIDLR-DKWRNLLRASCVNIQNRTGIERKQSHASRPLPKSLLQRVYELANIYPYPKERSPKSVKATTSPMHLIESNSLSFNWGRKKYD

XP_038881566.1 uncharacterized protein LOC120073047 isoform X1 [Benincasa hispida]9.08e-29484.5Show/hide
Query:  MDQEVHFCQKFTNMKSHWVKVDGSFLPAPLNEPNEVEHLLVEPKSNHVLGDCLRAQDFSCDFDYGIQTN-RGGLDSNSKQAGEHELKFEDLDQLLADDNE
        MDQEVHFCQKFTNMKSHWV+V+G FLPAPLN+ NEVE LLVEPKS+HVLG+CLR QDFSCDF YGIQTN  GGLDSNSKQ GEHELKF DLDQLL D NE
Subjt:  MDQEVHFCQKFTNMKSHWVKVDGSFLPAPLNEPNEVEHLLVEPKSNHVLGDCLRAQDFSCDFDYGIQTN-RGGLDSNSKQAGEHELKFEDLDQLLADDNE

Query:  VEEFHATNNLPNTYTEVAENSFRRNRGLQLGNLSSESKSQGSSRNDTEAFEISELSAAMVREAESNNTTPVDRGLTHELCAGLRTKGRCET--PLDGSIC
        V EFHATNNL +TY EVAENSFR+NRGLQLGN SS SKSQG SR+DT+AF ISELSA MV E E NNT PV+RGLTHEL  GLRTKGRC T  PL+G+IC
Subjt:  VEEFHATNNLPNTYTEVAENSFRRNRGLQLGNLSSESKSQGSSRNDTEAFEISELSAAMVREAESNNTTPVDRGLTHELCAGLRTKGRCET--PLDGSIC

Query:  STILDNINIHKFDTNERCLENGGLSDENVKGNIEASKLAICSRDRRLRKPTRRYIEEFADSKSESHKGKRKPPTKDKYVKVTSIEESNHIRHKVQMLTPG
         TILDN NIHKF+TNE  +ENG LSDENVKG+I A+KLA CSR+RRLRKPTRRYIEEFADSKSE++KG+RKPPTKDKY+KVTS EESNHIRH+VQMLTP 
Subjt:  STILDNINIHKFDTNERCLENGGLSDENVKGNIEASKLAICSRDRRLRKPTRRYIEEFADSKSESHKGKRKPPTKDKYVKVTSIEESNHIRHKVQMLTPG

Query:  GESHCGTSIPVQSRSQRRLPKKHVPVSGFLSEEESSATECKIVYSSAKRCKKHDRRKHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFATSPYRTPID
         E HCGTS+PVQSRSQRR PKKHVPVSGFLSE+ESSATECK VYSS KRCKK+DRR+HQKMW+LTEVMRLVDGIAEYGTGRWT IKKHLFA+SP+RTPID
Subjt:  GESHCGTSIPVQSRSQRRLPKKHVPVSGFLSEEESSATECKIVYSSAKRCKKHDRRKHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFATSPYRTPID

Query:  LRDKWRNLLRASCVNIQNRTGIERKQSHASRPLPKSLLQRVYELANIYPYPKERSPKSVKATTSPMHLIESNSLSFNWGRKKYD
        LRDKWRNLLRASCVNIQNR GIERKQSHASRPLPKSLLQRVYELANIYPYPKERSPKSVKATT PMHLIESNSLSFNWGRKKY+
Subjt:  LRDKWRNLLRASCVNIQNRTGIERKQSHASRPLPKSLLQRVYELANIYPYPKERSPKSVKATTSPMHLIESNSLSFNWGRKKYD

XP_038881569.1 uncharacterized protein LOC120073047 isoform X3 [Benincasa hispida]9.36e-29184.3Show/hide
Query:  MDQEVHFCQKFTNMKSHWVKVDGSFLPAPLNEPNEVEHLLVEPKSNHVLGDCLRAQDFSCDFDYGIQTN-RGGLDSNSKQAGEHELKFEDLDQLLADDNE
        MDQEVHFCQKFTNMKSHWV+V+G FLPAPLN+ NEVE LLVEPKS+HVLG+CLR QDFSCDF YGIQTN  GGLDSNSKQ GEHELKF DLDQLL D NE
Subjt:  MDQEVHFCQKFTNMKSHWVKVDGSFLPAPLNEPNEVEHLLVEPKSNHVLGDCLRAQDFSCDFDYGIQTN-RGGLDSNSKQAGEHELKFEDLDQLLADDNE

Query:  VEEFHATNNLPNTYTEVAENSFRRNRGLQLGNLSSESKSQGSSRNDTEAFEISELSAAMVREAESNNTTPVDRGLTHELCAGLRTKGRCET--PLDGSIC
        V EFHATNNL N   EVAENSFR+NRGLQLGN SS SKSQG SR+DT+AF ISELSA MV E E NNT PV+RGLTHEL  GLRTKGRC T  PL+G+IC
Subjt:  VEEFHATNNLPNTYTEVAENSFRRNRGLQLGNLSSESKSQGSSRNDTEAFEISELSAAMVREAESNNTTPVDRGLTHELCAGLRTKGRCET--PLDGSIC

Query:  STILDNINIHKFDTNERCLENGGLSDENVKGNIEASKLAICSRDRRLRKPTRRYIEEFADSKSESHKGKRKPPTKDKYVKVTSIEESNHIRHKVQMLTPG
         TILDN NIHKF+TNE  +ENG LSDENVKG+I A+KLA CSR+RRLRKPTRRYIEEFADSKSE++KG+RKPPTKDKY+KVTS EESNHIRH+VQMLTP 
Subjt:  STILDNINIHKFDTNERCLENGGLSDENVKGNIEASKLAICSRDRRLRKPTRRYIEEFADSKSESHKGKRKPPTKDKYVKVTSIEESNHIRHKVQMLTPG

Query:  GESHCGTSIPVQSRSQRRLPKKHVPVSGFLSEEESSATECKIVYSSAKRCKKHDRRKHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFATSPYRTPID
         E HCGTS+PVQSRSQRR PKKHVPVSGFLSE+ESSATECK VYSS KRCKK+DRR+HQKMW+LTEVMRLVDGIAEYGTGRWT IKKHLFA+SP+RTPID
Subjt:  GESHCGTSIPVQSRSQRRLPKKHVPVSGFLSEEESSATECKIVYSSAKRCKKHDRRKHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFATSPYRTPID

Query:  LRDKWRNLLRASCVNIQNRTGIERKQSHASRPLPKSLLQRVYELANIYPYPKERSPKSVKATTSPMHLIESNSLSFNWGRKKYD
        LRDKWRNLLRASCVNIQNR GIERKQSHASRPLPKSLLQRVYELANIYPYPKERSPKSVKATT PMHLIESNSLSFNWGRKKY+
Subjt:  LRDKWRNLLRASCVNIQNRTGIERKQSHASRPLPKSLLQRVYELANIYPYPKERSPKSVKATTSPMHLIESNSLSFNWGRKKYD

TrEMBL top hitse value%identityAlignment
A0A1S3BKX9 uncharacterized protein LOC103491166 isoform X12.91e-27780.87Show/hide
Query:  MDQEVHFCQKFTNMKSHWVKVDGSFLPAPLNEPNEVEHLLVEPKSNHVLGDCLRAQDFSCDFDYGIQTNRGGLDSNSKQAGEHELKFEDLDQLLADDNEV
        MDQEVHFCQKFTNMKSHWVKV+G FLPAPLN+ NEVE LLVE KS HVLG+CLR QDFSCDF YGIQTN GGLDSNSKQ GEHELKF D DQLL D NEV
Subjt:  MDQEVHFCQKFTNMKSHWVKVDGSFLPAPLNEPNEVEHLLVEPKSNHVLGDCLRAQDFSCDFDYGIQTNRGGLDSNSKQAGEHELKFEDLDQLLADDNEV

Query:  EEFHATNNLPNTYTEVAENSFRRNRGLQLGNLSSESKSQGSSRNDTEAFEISELSAAMVREAESNNTTPVDRGLTHELCAGLRTKGRCETPLDGSICSTI
         EFHATNNLPNTY EVAENSFRRNR  QLGN SSE+KS G SR DT+AF ISELSA MV EAE NNT PV+RGLTHEL  GL TKGRC TPL+G+IC TI
Subjt:  EEFHATNNLPNTYTEVAENSFRRNRGLQLGNLSSESKSQGSSRNDTEAFEISELSAAMVREAESNNTTPVDRGLTHELCAGLRTKGRCETPLDGSICSTI

Query:  LDNINIHKFDTNERCLENGGLSDENVKGNIEASKLAICSRDRRLRKPTRRYIEEFADSKSESHKGKRKPPTKDKYVKVTSIEESNHIRHKVQMLTPGGES
        LDN NIHKF+TNE  +ENG LSDENVKG+I A++LA CSR+RRLRKPTRRYIEEF DSKSE +KG+RK P KDKY+KV S EES HIRH+VQM+ P  +S
Subjt:  LDNINIHKFDTNERCLENGGLSDENVKGNIEASKLAICSRDRRLRKPTRRYIEEFADSKSESHKGKRKPPTKDKYVKVTSIEESNHIRHKVQMLTPGGES

Query:  HCGTSIPVQSRSQRRLPKKHVPVSGFLSEEESSATECKIVYSSAKRCKKHDRRKHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFATSPYRTPIDLRD
         CGTS+PVQ +S+RR P KHVPVSGFLSE+ESSATECK VYSSA+RCKK+DRR+ QKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFA+SP+RTPIDLRD
Subjt:  HCGTSIPVQSRSQRRLPKKHVPVSGFLSEEESSATECKIVYSSAKRCKKHDRRKHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFATSPYRTPIDLRD

Query:  KWRNLLRASCVNIQNRTGIERKQSHASRPLPKSLLQRVYELANIYPYPKERSPKSVKATTSPMHLIESNSLSFNWGRKKYD
        KWRNLLRASCVNIQN+ G+E KQ+HASRPLPKSLLQRVYELANIYPYPKER PKSVKA T PM LIESNSLSFNWGRKKY+
Subjt:  KWRNLLRASCVNIQNRTGIERKQSHASRPLPKSLLQRVYELANIYPYPKERSPKSVKATTSPMHLIESNSLSFNWGRKKYD

A0A1S3BLK0 uncharacterized protein LOC103491166 isoform X29.33e-27680.87Show/hide
Query:  MDQEVHFCQKFTNMKSHWVKVDGSFLPAPLNEPNEVEHLLVEPKSNHVLGDCLRAQDFSCDFDYGIQTNRGGLDSNSKQAGEHELKFEDLDQLLADDNEV
        MDQEVHFCQKFTNMKSHWVKV+G FLPAPLN+ NEVE LLVE KS HVLG+CLR QDFSCDF YGIQTN GGLDSNSKQ GEHELKF D DQLL D NEV
Subjt:  MDQEVHFCQKFTNMKSHWVKVDGSFLPAPLNEPNEVEHLLVEPKSNHVLGDCLRAQDFSCDFDYGIQTNRGGLDSNSKQAGEHELKFEDLDQLLADDNEV

Query:  EEFHATNNLPNTYTEVAENSFRRNRGLQLGNLSSESKSQGSSRNDTEAFEISELSAAMVREAESNNTTPVDRGLTHELCAGLRTKGRCETPLDGSICSTI
         EFHATNNLPNTY EVAENSFRRNR  QLGN SSE+KS G SR DT+AF ISELSA MV EAE NNT PV+RGLTHEL  GL TKGRC TPL+G+IC TI
Subjt:  EEFHATNNLPNTYTEVAENSFRRNRGLQLGNLSSESKSQGSSRNDTEAFEISELSAAMVREAESNNTTPVDRGLTHELCAGLRTKGRCETPLDGSICSTI

Query:  LDNINIHKFDTNERCLENGGLSDENVKGNIEASKLAICSRDRRLRKPTRRYIEEFADSKSESHKGKRKPPTKDKYVKVTSIEESNHIRHKVQMLTPGGES
        LDN NIHKF+TNE  +ENG LSDENVKG+I A++LA CSR+RRLRKPTRRYIEEF DSKSE +KG+RK P KDKY+KV S EES HIRH+VQM+ P  +S
Subjt:  LDNINIHKFDTNERCLENGGLSDENVKGNIEASKLAICSRDRRLRKPTRRYIEEFADSKSESHKGKRKPPTKDKYVKVTSIEESNHIRHKVQMLTPGGES

Query:  HCGTSIPVQSRSQRRLPKKHVPVSGFLSEEESSATECKIVYSSAKRCKKHDRRKHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFATSPYRTPIDLRD
         CGTS+PVQ +S+RR P KHVPVSGFLSE+ESSATECK VYSSA+RCKK+DRR+ QKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFA+SP+RTPIDLRD
Subjt:  HCGTSIPVQSRSQRRLPKKHVPVSGFLSEEESSATECKIVYSSAKRCKKHDRRKHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFATSPYRTPIDLRD

Query:  KWRNLLRASCVNIQNRTGIERKQSHASRPLPKSLLQRVYELANIYPYPKERSPKSVKATTSPMHLIESNSLSFNWGRKKYD
        KWRNLLRASCVNIQN+ G+E KQ+HASRPLPKSLLQRVYELANIYPYPKER PKSVKA T PM LIESNSLSFNWGRKKY+
Subjt:  KWRNLLRASCVNIQNRTGIERKQSHASRPLPKSLLQRVYELANIYPYPKERSPKSVKATTSPMHLIESNSLSFNWGRKKYD

A0A6J1CQ81 uncharacterized protein LOC111013581 isoform X31.52e-28399.75Show/hide
Query:  DLDQLLADDNEVEEFHATNNLPNTYTEVAENSFRRNRGLQLGNLSSESKSQGSSRNDTEAFEISELSAAMVREAESNNTTPVDRGLTHELCAGLRTKGRC
        DLDQLLADDNEVEEFHATNNLPNTYTEVAENSFRRNRGLQLGNLSSESKSQGSSRNDTEAFEISELSAAMVREAESNNTTPVDRGLTHELCAGLRTKGRC
Subjt:  DLDQLLADDNEVEEFHATNNLPNTYTEVAENSFRRNRGLQLGNLSSESKSQGSSRNDTEAFEISELSAAMVREAESNNTTPVDRGLTHELCAGLRTKGRC

Query:  ETPLDGSICSTILDNINIHKFDTNERCLENGGLSDENVKGNIEASKLAICSRDRRLRKPTRRYIEEFADSKSESHKGKRKPPTKDKYVKVTSIEESNHIR
        ETPLDGSICSTILDNINIHKFDTNERCLENGGLSDENVKGNIEASKLAICSRDRRLRKPTRRYIEEFADSKSESHKGKRKPPTKDKYVKVTSIEESNHIR
Subjt:  ETPLDGSICSTILDNINIHKFDTNERCLENGGLSDENVKGNIEASKLAICSRDRRLRKPTRRYIEEFADSKSESHKGKRKPPTKDKYVKVTSIEESNHIR

Query:  HKVQMLTPGGESHCGTSIPVQSRSQRRLPKKHVPVSGFLSEEESSATECKIVYSSAKRCKKHDRRKHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFA
        HKVQMLTPGGESHCGTSIPVQSRSQRRLPKKHVPVSGFLSEEESSATECKIVYSSAKRCKKHDRRKHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFA
Subjt:  HKVQMLTPGGESHCGTSIPVQSRSQRRLPKKHVPVSGFLSEEESSATECKIVYSSAKRCKKHDRRKHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFA

Query:  TSPYRTPIDLR-DKWRNLLRASCVNIQNRTGIERKQSHASRPLPKSLLQRVYELANIYPYPKERSPKSVKATTSPMHLIESNSLSFNWGRKKYD
        TSPYRTPIDLR DKWRNLLRASCVNIQNRTGIERKQSHASRPLPKSLLQRVYELANIYPYPKERSPKSVKATTSPMHLIESNSLSFNWGRKKYD
Subjt:  TSPYRTPIDLR-DKWRNLLRASCVNIQNRTGIERKQSHASRPLPKSLLQRVYELANIYPYPKERSPKSVKATTSPMHLIESNSLSFNWGRKKYD

A0A6J1CRG2 uncharacterized protein LOC111013581 isoform X20.0100Show/hide
Query:  MDQEVHFCQKFTNMKSHWVKVDGSFLPAPLNEPNEVEHLLVEPKSNHVLGDCLRAQDFSCDFDYGIQTNRGGLDSNSKQAGEHELKFEDLDQLLADDNEV
        MDQEVHFCQKFTNMKSHWVKVDGSFLPAPLNEPNEVEHLLVEPKSNHVLGDCLRAQDFSCDFDYGIQTNRGGLDSNSKQAGEHELKFEDLDQLLADDNEV
Subjt:  MDQEVHFCQKFTNMKSHWVKVDGSFLPAPLNEPNEVEHLLVEPKSNHVLGDCLRAQDFSCDFDYGIQTNRGGLDSNSKQAGEHELKFEDLDQLLADDNEV

Query:  EEFHATNNLPNTYTEVAENSFRRNRGLQLGNLSSESKSQGSSRNDTEAFEISELSAAMVREAESNNTTPVDRGLTHELCAGLRTKGRCETPLDGSICSTI
        EEFHATNNLPNTYTEVAENSFRRNRGLQLGNLSSESKSQGSSRNDTEAFEISELSAAMVREAESNNTTPVDRGLTHELCAGLRTKGRCETPLDGSICSTI
Subjt:  EEFHATNNLPNTYTEVAENSFRRNRGLQLGNLSSESKSQGSSRNDTEAFEISELSAAMVREAESNNTTPVDRGLTHELCAGLRTKGRCETPLDGSICSTI

Query:  LDNINIHKFDTNERCLENGGLSDENVKGNIEASKLAICSRDRRLRKPTRRYIEEFADSKSESHKGKRKPPTKDKYVKVTSIEESNHIRHKVQMLTPGGES
        LDNINIHKFDTNERCLENGGLSDENVKGNIEASKLAICSRDRRLRKPTRRYIEEFADSKSESHKGKRKPPTKDKYVKVTSIEESNHIRHKVQMLTPGGES
Subjt:  LDNINIHKFDTNERCLENGGLSDENVKGNIEASKLAICSRDRRLRKPTRRYIEEFADSKSESHKGKRKPPTKDKYVKVTSIEESNHIRHKVQMLTPGGES

Query:  HCGTSIPVQSRSQRRLPKKHVPVSGFLSEEESSATECKIVYSSAKRCKKHDRRKHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFATSPYRTPIDLRD
        HCGTSIPVQSRSQRRLPKKHVPVSGFLSEEESSATECKIVYSSAKRCKKHDRRKHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFATSPYRTPIDLRD
Subjt:  HCGTSIPVQSRSQRRLPKKHVPVSGFLSEEESSATECKIVYSSAKRCKKHDRRKHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFATSPYRTPIDLRD

Query:  KWRNLLRASCVNIQNRTGIERKQSHASRPLPKSLLQRVYELANIYPYPKERSPKSVKATTSPMHLIESNSLSFNWGRKKYD
        KWRNLLRASCVNIQNRTGIERKQSHASRPLPKSLLQRVYELANIYPYPKERSPKSVKATTSPMHLIESNSLSFNWGRKKYD
Subjt:  KWRNLLRASCVNIQNRTGIERKQSHASRPLPKSLLQRVYELANIYPYPKERSPKSVKATTSPMHLIESNSLSFNWGRKKYD

A0A6J1CRQ1 uncharacterized protein LOC111013581 isoform X10.099.79Show/hide
Query:  MDQEVHFCQKFTNMKSHWVKVDGSFLPAPLNEPNEVEHLLVEPKSNHVLGDCLRAQDFSCDFDYGIQTNRGGLDSNSKQAGEHELKFEDLDQLLADDNEV
        MDQEVHFCQKFTNMKSHWVKVDGSFLPAPLNEPNEVEHLLVEPKSNHVLGDCLRAQDFSCDFDYGIQTNRGGLDSNSKQAGEHELKFEDLDQLLADDNEV
Subjt:  MDQEVHFCQKFTNMKSHWVKVDGSFLPAPLNEPNEVEHLLVEPKSNHVLGDCLRAQDFSCDFDYGIQTNRGGLDSNSKQAGEHELKFEDLDQLLADDNEV

Query:  EEFHATNNLPNTYTEVAENSFRRNRGLQLGNLSSESKSQGSSRNDTEAFEISELSAAMVREAESNNTTPVDRGLTHELCAGLRTKGRCETPLDGSICSTI
        EEFHATNNLPNTYTEVAENSFRRNRGLQLGNLSSESKSQGSSRNDTEAFEISELSAAMVREAESNNTTPVDRGLTHELCAGLRTKGRCETPLDGSICSTI
Subjt:  EEFHATNNLPNTYTEVAENSFRRNRGLQLGNLSSESKSQGSSRNDTEAFEISELSAAMVREAESNNTTPVDRGLTHELCAGLRTKGRCETPLDGSICSTI

Query:  LDNINIHKFDTNERCLENGGLSDENVKGNIEASKLAICSRDRRLRKPTRRYIEEFADSKSESHKGKRKPPTKDKYVKVTSIEESNHIRHKVQMLTPGGES
        LDNINIHKFDTNERCLENGGLSDENVKGNIEASKLAICSRDRRLRKPTRRYIEEFADSKSESHKGKRKPPTKDKYVKVTSIEESNHIRHKVQMLTPGGES
Subjt:  LDNINIHKFDTNERCLENGGLSDENVKGNIEASKLAICSRDRRLRKPTRRYIEEFADSKSESHKGKRKPPTKDKYVKVTSIEESNHIRHKVQMLTPGGES

Query:  HCGTSIPVQSRSQRRLPKKHVPVSGFLSEEESSATECKIVYSSAKRCKKHDRRKHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFATSPYRTPIDLR-
        HCGTSIPVQSRSQRRLPKKHVPVSGFLSEEESSATECKIVYSSAKRCKKHDRRKHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFATSPYRTPIDLR 
Subjt:  HCGTSIPVQSRSQRRLPKKHVPVSGFLSEEESSATECKIVYSSAKRCKKHDRRKHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFATSPYRTPIDLR-

Query:  DKWRNLLRASCVNIQNRTGIERKQSHASRPLPKSLLQRVYELANIYPYPKERSPKSVKATTSPMHLIESNSLSFNWGRKKYD
        DKWRNLLRASCVNIQNRTGIERKQSHASRPLPKSLLQRVYELANIYPYPKERSPKSVKATTSPMHLIESNSLSFNWGRKKYD
Subjt:  DKWRNLLRASCVNIQNRTGIERKQSHASRPLPKSLLQRVYELANIYPYPKERSPKSVKATTSPMHLIESNSLSFNWGRKKYD

SwissProt top hitse value%identityAlignment
Q6R0E3 Telomere repeat-binding protein 53.6e-0833.33Show/hide
Query:  RRKHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFATSPYRTPIDLRDKWRNLLRASCVNIQNRTGIERKQSHASRPLPKSLLQRV
        +R+ ++ +++ EV  LV  +   GTGRW  +K   F  + +RT +DL+DKW+ L+  + ++ Q R G          P+P+ LL RV
Subjt:  RRKHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFATSPYRTPIDLRDKWRNLLRASCVNIQNRTGIERKQSHASRPLPKSLLQRV

Q9C7B1 Telomere repeat-binding protein 31.1e-0935.63Show/hide
Query:  RRKHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFATSPYRTPIDLRDKWRNLLRASCVNIQNRTGIERKQSHASRPLPKSLLQRV
        +R+ ++ +++TEV  LV  + E GTGRW  +K   F  + +RT +DL+DKW+ L+  + ++ Q R G          P+P+ LL RV
Subjt:  RRKHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFATSPYRTPIDLRDKWRNLLRASCVNIQNRTGIERKQSHASRPLPKSLLQRV

Q9FFY9 Telomere repeat-binding protein 48.6e-1026.64Show/hide
Query:  TNERCLENGGLSDENVKGNIEASKLAICSRDRRLRKPT-RRYIEEFADSKSESHKGKRKPPTKDKYVKVTSIEESNHIRHKVQMLTPGGESHCGTSIPVQ
        T   C EN G     ++  +E   + +CS    L  PT    + E + +      G   PP  + Y+    I   N + +  +++    +          
Subjt:  TNERCLENGGLSDENVKGNIEASKLAICSRDRRLRKPT-RRYIEEFADSKSESHKGKRKPPTKDKYVKVTSIEESNHIRHKVQMLTPGGESHCGTSIPVQ

Query:  SRSQRRLPKKHVPVSGFLSEEESSATECKIVYSSAKRCKKHDRRKHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFATSPYRTPIDLRDKWRNLLRAS
        SR+        VPV       ES A     V    KR +   RR  ++ +++TEV  LV  + E GTGRW  +K   F  + +RT +DL+DKW+ L+  +
Subjt:  SRSQRRLPKKHVPVSGFLSEEESSATECKIVYSSAKRCKKHDRRKHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFATSPYRTPIDLRDKWRNLLRAS

Query:  CVNIQNRTGIERKQSHASRPLPKSLLQRV
         ++ Q R G          P+P+ LL RV
Subjt:  CVNIQNRTGIERKQSHASRPLPKSLLQRV

Q9M347 Telomere repeat-binding protein 65.6e-0934.48Show/hide
Query:  RRKHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFATSPYRTPIDLRDKWRNLLRASCVNIQNRTGIERKQSHASRPLPKSLLQRV
        +R+ ++ +T++EV  LV  +   GTGRW  +K H F    +RT +DL+DKW+ L+  + ++ + R G          P+P+ LL RV
Subjt:  RRKHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFATSPYRTPIDLRDKWRNLLRASCVNIQNRTGIERKQSHASRPLPKSLLQRV

Q9SNB9 Telomere repeat-binding protein 27.3e-0934.48Show/hide
Query:  RRKHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFATSPYRTPIDLRDKWRNLLRASCVNIQNRTGIERKQSHASRPLPKSLLQRV
        +R+ ++ +++TEV  LV  + + GTGRW  +K   F  + +RT +DL+DKW+ L+  + ++ Q R G          P+P+ LL RV
Subjt:  RRKHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFATSPYRTPIDLRDKWRNLLRASCVNIQNRTGIERKQSHASRPLPKSLLQRV

Arabidopsis top hitse value%identityAlignment
AT1G17460.1 TRF-like 31.8e-1543.33Show/hide
Query:  RKHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFATSPYRTPIDLRDKWRNLLRASCVNIQNRTGIERKQSHASRPLPKSLLQRVYELA
        RK  + WT++EV +LV+G+++YG G+WT IKK  F+   +RT +DL+DKWRNL +AS  N +   G+++   H S  +P  ++ +V ELA
Subjt:  RKHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFATSPYRTPIDLRDKWRNLLRASCVNIQNRTGIERKQSHASRPLPKSLLQRVYELA

AT1G72650.1 TRF-like 64.5e-2227.39Show/hide
Query:  DTNERCLENGGLSDENVKGNIEASKLAICSRD--------RRLRKPTRRYIEEFADSKSESHKGKRKPPTKDKYV----KVTSIEESNHIRHKV-QMLTP
        D     +++   S ++V G+         S D        +R+RKPTRRYIEE +++  +    K   P+KD+ +    +V SI  S+  R  V +M++ 
Subjt:  DTNERCLENGGLSDENVKGNIEASKLAICSRD--------RRLRKPTRRYIEEFADSKSESHKGKRKPPTKDKYV----KVTSIEESNHIRHKV-QMLTP

Query:  GGESHCGTSIPVQSRSQRRLPKKHVPV-----SGFLSEEESSATECKI----------------VYSSAKRCKKHD------------------------
         G       +P  S  +R  P++++       S +L E+++SA E  +                V  SA R  +++                        
Subjt:  GGESHCGTSIPVQSRSQRRLPKKHVPV-----SGFLSEEESSATECKI----------------VYSSAKRCKKHD------------------------

Query:  -----------------------RRKHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFATSPYRTPIDLRDKWRNLLRASCVNIQNRTGIERKQSHASR
                               RRKH + WTL+E+ +LV+G+++YG G+W+ IKKHLF++  YRT +DL+DKWRNLL+ S     + + +   + H S 
Subjt:  -----------------------RRKHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFATSPYRTPIDLRDKWRNLLRASCVNIQNRTGIERKQSHASR

Query:  PLPKSLLQRVYELA
         +P  +L RV ELA
Subjt:  PLPKSLLQRVYELA

AT1G72650.2 TRF-like 64.5e-2227.39Show/hide
Query:  DTNERCLENGGLSDENVKGNIEASKLAICSRD--------RRLRKPTRRYIEEFADSKSESHKGKRKPPTKDKYV----KVTSIEESNHIRHKV-QMLTP
        D     +++   S ++V G+         S D        +R+RKPTRRYIEE +++  +    K   P+KD+ +    +V SI  S+  R  V +M++ 
Subjt:  DTNERCLENGGLSDENVKGNIEASKLAICSRD--------RRLRKPTRRYIEEFADSKSESHKGKRKPPTKDKYV----KVTSIEESNHIRHKV-QMLTP

Query:  GGESHCGTSIPVQSRSQRRLPKKHVPV-----SGFLSEEESSATECKI----------------VYSSAKRCKKHD------------------------
         G       +P  S  +R  P++++       S +L E+++SA E  +                V  SA R  +++                        
Subjt:  GGESHCGTSIPVQSRSQRRLPKKHVPV-----SGFLSEEESSATECKI----------------VYSSAKRCKKHD------------------------

Query:  -----------------------RRKHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFATSPYRTPIDLRDKWRNLLRASCVNIQNRTGIERKQSHASR
                               RRKH + WTL+E+ +LV+G+++YG G+W+ IKKHLF++  YRT +DL+DKWRNLL+ S     + + +   + H S 
Subjt:  -----------------------RRKHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFATSPYRTPIDLRDKWRNLLRASCVNIQNRTGIERKQSHASR

Query:  PLPKSLLQRVYELA
         +P  +L RV ELA
Subjt:  PLPKSLLQRVYELA

AT2G37025.1 TRF-like 82.2e-2927.85Show/hide
Query:  MDQEVHFCQKFTNMKSHWVKVDGSFLPAPLNEPNEVEHLLVEPKSNHV------------LGDCLRAQDFSCDFDYGIQTNRGGLDSNSKQAGEHELKFE
        M+    F   F   KS    VD +   +  ++ ++++H L++P  ++             +G   + ++FS  FD     + GG+++ S Q  E  LKFE
Subjt:  MDQEVHFCQKFTNMKSHWVKVDGSFLPAPLNEPNEVEHLLVEPKSNHV------------LGDCLRAQDFSCDFDYGIQTNRGGLDSNSKQAGEHELKFE

Query:  DLDQLLADDNEVEEFHATNNLPNTY------TEVAENSFRRNRGLQLGNLSSESKSQGSSRNDTEAFEISELSAAMVREAESNNTTPVDRGLTHELCAGL
         LD +L   +EVE+ +A++ L +        TEV +N    +    L N SSES S G S     +  ++E S   V  AES            ++ +  
Subjt:  DLDQLLADDNEVEEFHATNNLPNTY------TEVAENSFRRNRGLQLGNLSSESKSQGSSRNDTEAFEISELSAAMVREAESNNTTPVDRGLTHELCAGL

Query:  RTKGRCETPLDGSICSTILDNINIHKFDTNERCLENGGLSDENVKGNIEASKLAICSRDRRLRKPTRRYIEEFADSKS-ESHKGK-RKPPTKDKYVKVTS
        + K                       +DTNE         D+        S +   +R ++L       ++   + KS ES+  + RK P K KY+  TS
Subjt:  RTKGRCETPLDGSICSTILDNINIHKFDTNERCLENGGLSDENVKGNIEASKLAICSRDRRLRKPTRRYIEEFADSKS-ESHKGK-RKPPTKDKYVKVTS

Query:  IEESNHIRHKVQMLTPGGESHCGTSIPVQSRSQRRLPKKHVPVSGFLSEEESSATECKIVYSSAKRCK-KHDRRKHQKMWTLTEVMRLVDGIAEYGTGRW
        +E++                                           S+++ + +E +   S  K  + K DRRK+Q++WTL EVM LVDGI+ +G G+W
Subjt:  IEESNHIRHKVQMLTPGGESHCGTSIPVQSRSQRRLPKKHVPVSGFLSEEESSATECKIVYSSAKRCK-KHDRRKHQKMWTLTEVMRLVDGIAEYGTGRW

Query:  THIKKHLFATSPYRTPIDLRDKWRNLLRASCVNIQNRTGIERKQSHASRPLPKSLLQRVYELANIYPYPKERSP
        T IK H F  + +R P+D+RDKWRNLL+AS     N    E K+   +R +PK +L RV ELA+++PYP  +SP
Subjt:  THIKKHLFATSPYRTPIDLRDKWRNLLRASCVNIQNRTGIERKQSHASRPLPKSLLQRVYELANIYPYPKERSP

AT2G37025.2 TRF-like 82.2e-2927.85Show/hide
Query:  MDQEVHFCQKFTNMKSHWVKVDGSFLPAPLNEPNEVEHLLVEPKSNHV------------LGDCLRAQDFSCDFDYGIQTNRGGLDSNSKQAGEHELKFE
        M+    F   F   KS    VD +   +  ++ ++++H L++P  ++             +G   + ++FS  FD     + GG+++ S Q  E  LKFE
Subjt:  MDQEVHFCQKFTNMKSHWVKVDGSFLPAPLNEPNEVEHLLVEPKSNHV------------LGDCLRAQDFSCDFDYGIQTNRGGLDSNSKQAGEHELKFE

Query:  DLDQLLADDNEVEEFHATNNLPNTY------TEVAENSFRRNRGLQLGNLSSESKSQGSSRNDTEAFEISELSAAMVREAESNNTTPVDRGLTHELCAGL
         LD +L   +EVE+ +A++ L +        TEV +N    +    L N SSES S G S     +  ++E S   V  AES            ++ +  
Subjt:  DLDQLLADDNEVEEFHATNNLPNTY------TEVAENSFRRNRGLQLGNLSSESKSQGSSRNDTEAFEISELSAAMVREAESNNTTPVDRGLTHELCAGL

Query:  RTKGRCETPLDGSICSTILDNINIHKFDTNERCLENGGLSDENVKGNIEASKLAICSRDRRLRKPTRRYIEEFADSKS-ESHKGK-RKPPTKDKYVKVTS
        + K                       +DTNE         D+        S +   +R ++L       ++   + KS ES+  + RK P K KY+  TS
Subjt:  RTKGRCETPLDGSICSTILDNINIHKFDTNERCLENGGLSDENVKGNIEASKLAICSRDRRLRKPTRRYIEEFADSKS-ESHKGK-RKPPTKDKYVKVTS

Query:  IEESNHIRHKVQMLTPGGESHCGTSIPVQSRSQRRLPKKHVPVSGFLSEEESSATECKIVYSSAKRCK-KHDRRKHQKMWTLTEVMRLVDGIAEYGTGRW
        +E++                                           S+++ + +E +   S  K  + K DRRK+Q++WTL EVM LVDGI+ +G G+W
Subjt:  IEESNHIRHKVQMLTPGGESHCGTSIPVQSRSQRRLPKKHVPVSGFLSEEESSATECKIVYSSAKRCK-KHDRRKHQKMWTLTEVMRLVDGIAEYGTGRW

Query:  THIKKHLFATSPYRTPIDLRDKWRNLLRASCVNIQNRTGIERKQSHASRPLPKSLLQRVYELANIYPYPKERSP
        T IK H F  + +R P+D+RDKWRNLL+AS     N    E K+   +R +PK +L RV ELA+++PYP  +SP
Subjt:  THIKKHLFATSPYRTPIDLRDKWRNLLRASCVNIQNRTGIERKQSHASRPLPKSLLQRVYELANIYPYPKERSP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATCAAGAAGTGCATTTCTGCCAGAAGTTCACAAATATGAAATCTCACTGGGTAAAGGTGGATGGATCTTTTCTTCCTGCACCATTAAATGAACCAAATGAGGTTGA
GCATTTACTTGTGGAGCCCAAAAGCAACCATGTTTTAGGAGATTGCTTGAGAGCTCAAGATTTCTCCTGTGACTTTGACTATGGGATACAAACAAATCGTGGTGGATTGG
ATTCTAATAGCAAGCAGGCAGGCGAACATGAACTTAAATTTGAAGATCTTGATCAACTTCTGGCTGATGACAATGAAGTAGAGGAATTCCATGCAACAAACAATCTACCA
AACACATATACAGAAGTCGCTGAAAATTCTTTCAGACGAAATAGGGGATTACAATTGGGAAACTTAAGTTCAGAGAGTAAGTCTCAGGGATCAAGCAGGAATGATACTGA
AGCTTTTGAAATATCAGAATTATCAGCAGCAATGGTCAGAGAGGCTGAATCCAATAATACAACACCTGTTGACAGGGGTTTAACTCACGAGTTGTGCGCTGGTCTGAGGA
CCAAAGGTAGGTGTGAAACACCACTTGACGGCAGCATCTGCAGTACGATACTTGATAATATAAATATCCATAAGTTTGATACTAATGAAAGGTGTCTAGAAAATGGTGGT
TTATCCGATGAAAATGTAAAGGGTAATATTGAGGCAAGCAAGCTTGCCATTTGTTCAAGGGATAGGAGGTTGCGTAAGCCTACTCGAAGATACATTGAAGAATTTGCAGA
TTCGAAGTCTGAAAGTCACAAGGGAAAGAGAAAGCCTCCAACAAAAGATAAATACGTGAAAGTCACGTCTATTGAAGAATCTAATCATATTAGACATAAGGTACAAATGT
TGACGCCTGGAGGGGAATCACATTGTGGTACTTCTATTCCAGTGCAGTCTCGATCCCAAAGAAGACTTCCAAAGAAGCATGTACCAGTTTCAGGATTTCTATCAGAAGAG
GAATCTTCTGCAACTGAATGTAAAATTGTTTATTCATCTGCTAAAAGATGTAAAAAGCATGATAGGCGGAAGCATCAGAAGATGTGGACCCTTACTGAAGTAATGAGATT
AGTTGATGGAATTGCTGAATATGGAACTGGCCGCTGGACTCATATAAAGAAGCACCTATTTGCAACTTCTCCTTATCGCACGCCTATAGATCTCAGGGACAAATGGCGAA
ATCTTCTGAGAGCTAGCTGTGTTAACATACAGAACAGAACAGGGATCGAACGGAAGCAATCACATGCCTCGCGTCCACTGCCCAAGTCCCTGCTCCAACGTGTCTATGAA
CTGGCCAATATTTATCCATATCCAAAGGAGCGCAGTCCAAAATCAGTCAAAGCAACTACATCTCCCATGCATCTTATTGAAAGTAACTCTTTGTCATTCAATTGGGGGCG
GAAGAAGTATGAC
mRNA sequenceShow/hide mRNA sequence
AGTACAAGCGCTAACGAAATCCACCTGTACTCGCCGCGGAATTCAGACGCACATTAAGTTGAAGAATAGAAAAGAGAGCAGAAAACGTCTTGGATTCCGTCGAGCAAAAA
AAACGCCGCATAAATTCACTTCCCTTCCGAATTGTCGTCGCTTAATTTGGCGCTTTTATTTCTACTGTCCGGTTTCGGACCTCTGACGCCGCTCACTGGTTTATCCATCT
TCTCTCCAAGCTAAGCTCGGATCTTCAGCTTCCTCGCTGAGCTACTAATTCATATCAGGAAACGTAGGATTGAGCTTAAAAGTATACCTAATTATGGATCAAGAAGTGCA
TTTCTGCCAGAAGTTCACAAATATGAAATCTCACTGGGTAAAGGTGGATGGATCTTTTCTTCCTGCACCATTAAATGAACCAAATGAGGTTGAGCATTTACTTGTGGAGC
CCAAAAGCAACCATGTTTTAGGAGATTGCTTGAGAGCTCAAGATTTCTCCTGTGACTTTGACTATGGGATACAAACAAATCGTGGTGGATTGGATTCTAATAGCAAGCAG
GCAGGCGAACATGAACTTAAATTTGAAGATCTTGATCAACTTCTGGCTGATGACAATGAAGTAGAGGAATTCCATGCAACAAACAATCTACCAAACACATATACAGAAGT
CGCTGAAAATTCTTTCAGACGAAATAGGGGATTACAATTGGGAAACTTAAGTTCAGAGAGTAAGTCTCAGGGATCAAGCAGGAATGATACTGAAGCTTTTGAAATATCAG
AATTATCAGCAGCAATGGTCAGAGAGGCTGAATCCAATAATACAACACCTGTTGACAGGGGTTTAACTCACGAGTTGTGCGCTGGTCTGAGGACCAAAGGTAGGTGTGAA
ACACCACTTGACGGCAGCATCTGCAGTACGATACTTGATAATATAAATATCCATAAGTTTGATACTAATGAAAGGTGTCTAGAAAATGGTGGTTTATCCGATGAAAATGT
AAAGGGTAATATTGAGGCAAGCAAGCTTGCCATTTGTTCAAGGGATAGGAGGTTGCGTAAGCCTACTCGAAGATACATTGAAGAATTTGCAGATTCGAAGTCTGAAAGTC
ACAAGGGAAAGAGAAAGCCTCCAACAAAAGATAAATACGTGAAAGTCACGTCTATTGAAGAATCTAATCATATTAGACATAAGGTACAAATGTTGACGCCTGGAGGGGAA
TCACATTGTGGTACTTCTATTCCAGTGCAGTCTCGATCCCAAAGAAGACTTCCAAAGAAGCATGTACCAGTTTCAGGATTTCTATCAGAAGAGGAATCTTCTGCAACTGA
ATGTAAAATTGTTTATTCATCTGCTAAAAGATGTAAAAAGCATGATAGGCGGAAGCATCAGAAGATGTGGACCCTTACTGAAGTAATGAGATTAGTTGATGGAATTGCTG
AATATGGAACTGGCCGCTGGACTCATATAAAGAAGCACCTATTTGCAACTTCTCCTTATCGCACGCCTATAGATCTCAGGGACAAATGGCGAAATCTTCTGAGAGCTAGC
TGTGTTAACATACAGAACAGAACAGGGATCGAACGGAAGCAATCACATGCCTCGCGTCCACTGCCCAAGTCCCTGCTCCAACGTGTCTATGAACTGGCCAATATTTATCC
ATATCCAAAGGAGCGCAGTCCAAAATCAGTCAAAGCAACTACATCTCCCATGCATCTTATTGAAAGTAACTCTTTGTCATTCAATTGGGGGCGGAAGAAGTATGAC
Protein sequenceShow/hide protein sequence
MDQEVHFCQKFTNMKSHWVKVDGSFLPAPLNEPNEVEHLLVEPKSNHVLGDCLRAQDFSCDFDYGIQTNRGGLDSNSKQAGEHELKFEDLDQLLADDNEVEEFHATNNLP
NTYTEVAENSFRRNRGLQLGNLSSESKSQGSSRNDTEAFEISELSAAMVREAESNNTTPVDRGLTHELCAGLRTKGRCETPLDGSICSTILDNINIHKFDTNERCLENGG
LSDENVKGNIEASKLAICSRDRRLRKPTRRYIEEFADSKSESHKGKRKPPTKDKYVKVTSIEESNHIRHKVQMLTPGGESHCGTSIPVQSRSQRRLPKKHVPVSGFLSEE
ESSATECKIVYSSAKRCKKHDRRKHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFATSPYRTPIDLRDKWRNLLRASCVNIQNRTGIERKQSHASRPLPKSLLQRVYE
LANIYPYPKERSPKSVKATTSPMHLIESNSLSFNWGRKKYD