CuGenDBv2

PI0021999 (gene) of Melon (PI 482460) v1 genome

Gene ID: PI0021999
Organism: Cucumis metuliferus PI 482460 (Melon (PI 482460) v1)
Description: HTH myb-type domain-containing protein
Genome location: chr12:21496787..21504474
RNA-Seq Expression: PI0021999
Synteny: PI0021999
Gene Ontology terms: NA
InterPro domains:
IPR001005 - SANT/Myb domain
IPR009057 - Homeobox-like domain superfamily
IPR017930 - Myb domain
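
The genome location above uses a `chromosome:start..end` notation. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not state this), the span can be parsed and its length computed as follows. The function name `parse_location` is hypothetical and not part of any CuGenDB tool.

```python
import re


def parse_location(location):
    """Split a 'chrom:start..end' string into its parts.

    Assumption: coordinates are 1-based and inclusive, so the
    span length is end - start + 1.
    """
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    chrom = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    return chrom, start, end, end - start + 1


if __name__ == "__main__":
    # Location string copied from the record above.
    print(parse_location("chr12:21496787..21504474"))
```

Under that inclusive-coordinate assumption the gene spans 7688 bp on chr12, which is longer than the CDS listed below and would be consistent with the locus containing introns.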


Homology
GenBank top hits | e value | % identity | Alignment
KAA0051759.1 telomere repeat-binding protein 1-like isoform X2 [Cucumis melo var. makuwa] | 3.5e-250 | 88.95
Query:  MNAASKETMMTSNQATKMFTYEFPDVDSDLSIFPVPEDDVLGVEHFLEPDFNKCFSGSVLDFNTFHSHKCLGIEHFSFAVDFDQIKIDSESLHSSLTLEG
        MNAASKETMMTSNQATKMFTYEFPDVDSDLSIFPVPEDDVLGVEHFLEPDFNKCFSGSVLDFNTFHSHKCLGIEHFSFAVDFDQIKIDSESLHSSLTLEG
Subjt:  MNAASKETMMTSNQATKMFTYEFPDVDSDLSIFPVPEDDVLGVEHFLEPDFNKCFSGSVLDFNTFHSHKCLGIEHFSFAVDFDQIKIDSESLHSSLTLEG

Query:  ERTKQEGHDEKLRADDVCSGINKHPGAVNSRTTDDIFNLAPASSNASFLGNVDFGSYSQGFSTTDIGVDLSFAGEY------------------------
        ERTKQEGHDEKLRADDVCSGINKHPGAVN RTTDDIFNLAPASSNASFLGNVDFGSYSQGFSTTDIGVDLSFAG +                        
Subjt:  ERTKQEGHDEKLRADDVCSGINKHPGAVNSRTTDDIFNLAPASSNASFLGNVDFGSYSQGFSTTDIGVDLSFAGEY------------------------

Query:  -----------VGRQAYERKRLSSSDSSKTNFEHNESKINPFNEGNRGDTLVTEKRLRKPPRRYSEESVEQKSRSNSKKSALKASKDKSLHSESHKQQWQ
                      +  +RKRLSSSDS KTN+EHNESKINPFNEGNRGDTL+TEKRLRKPPRRYSEESVEQKSRSNSKKSALKASKDKSLHSESHKQQWQ
Subjt:  -----------VGRQAYERKRLSSSDSSKTNFEHNESKINPFNEGNRGDTLVTEKRLRKPPRRYSEESVEQKSRSNSKKSALKASKDKSLHSESHKQQWQ

Query:  KKIKAAPIVHKDKSFNGGCIQVPFGLPIEEGHSAKRE--HEPEEIKDNRILCIKDKYEVESFSAESEDENTEDECATKGNNTQKGNSRRKHHISWTLSEV
        KK+KAAPI+HKDKSFNGGCIQVPFGLPIEEGHSAK+    EPEEIKDNRILCIKDKYEVESFSAESEDENTEDEC TKGN+TQKGNSRRKHHISWTLSEV
Subjt:  KKIKAAPIVHKDKSFNGGCIQVPFGLPIEEGHSAKRE--HEPEEIKDNRILCIKDKYEVESFSAESEDENTEDECATKGNNTQKGNSRRKHHISWTLSEV

Query:  MKLVEGVSEYGVGRWTEIKRLQFASSSHRTSVDLKDKWRNLLKASDTQLQNRRKVVLGRKQASQQVPESVLCRVRELAAIYPYPRENKSKESCSAPSTSS
        MKLVEGVSEYGVGRWTEIKRLQFASSSHRTSVDLKDKWRNLLKASDTQLQNRRKVVLGRKQASQQVPESVLCRVRELAAIYPYPRENKSKESCSAPSTSS
Subjt:  MKLVEGVSEYGVGRWTEIKRLQFASSSHRTSVDLKDKWRNLLKASDTQLQNRRKVVLGRKQASQQVPESVLCRVRELAAIYPYPRENKSKESCSAPSTSS

Query:  FKSTTNNMFVSLPTVM
        FKSTTNNMFVSLPTVM
Subjt:  FKSTTNNMFVSLPTVM

XP_004139439.1 uncharacterized protein LOC101213992 isoform X2 [Cucumis sativus] | 1.7e-249 | 88.95
Query:  MNAASKETMMTSNQATKMFTYEFPDVDSDLSIFPVPEDDVLGVEHFLEPDFNKCFSGSVLDFNTFHSHKCLGIEHFSFAVDFDQIKIDSESLHSSLTLEG
        MNAASKETMMTSNQATK+FTYEFPDVDSDLSIFPVPEDDVLGVEHFLEPDFNKCFSGSVLDFNTFHSHKCLGIEHFSFAVDFDQIKIDSESLHSSLTLEG
Subjt:  MNAASKETMMTSNQATKMFTYEFPDVDSDLSIFPVPEDDVLGVEHFLEPDFNKCFSGSVLDFNTFHSHKCLGIEHFSFAVDFDQIKIDSESLHSSLTLEG

Query:  ERTKQEGHDEKLRADDVCSGINKHPGAVNSRTTDDIFNLAPASSNASFLGNVDFGSYSQGFSTTDIGVDLSFAGEY------------------------
        ERTKQEGHD+KLRADDVCSGINKHPGAVNSRTTDDIFNLAPASSNASFLGNVDFGSYSQGFSTTDIGVDLSFAG +                        
Subjt:  ERTKQEGHDEKLRADDVCSGINKHPGAVNSRTTDDIFNLAPASSNASFLGNVDFGSYSQGFSTTDIGVDLSFAGEY------------------------

Query:  -----------VGRQAYERKRLSSSDSSKTNFEHNESKINPFNEGNRGDTLVTEKRLRKPPRRYSEESVEQKSRSNSKKSALKASKDKSLHSESHKQQWQ
                      +  +RKRLSSSDS KTNFEH+ESKINPF+EGNRGDTLVTEKRLRKPPRRYSEESVEQKSRSNSKKSALKASKDKS HSESHKQQWQ
Subjt:  -----------VGRQAYERKRLSSSDSSKTNFEHNESKINPFNEGNRGDTLVTEKRLRKPPRRYSEESVEQKSRSNSKKSALKASKDKSLHSESHKQQWQ

Query:  KKIKAAPIVHKDKSFNGGCIQVPFGLPIEEGHSAKRE--HEPEEIKDNRILCIKDKYEVESFSAESEDENTEDECATKGNNTQKGNSRRKHHISWTLSEV
        KK+KAAPIVHKDKSFNGGCIQVPFGLPIEEGHSAK+    EPEEIKDNRILCIKDKYEVESFSAESEDENTEDECATKGN+TQKGNSRRKHHISWTLSEV
Subjt:  KKIKAAPIVHKDKSFNGGCIQVPFGLPIEEGHSAKRE--HEPEEIKDNRILCIKDKYEVESFSAESEDENTEDECATKGNNTQKGNSRRKHHISWTLSEV

Query:  MKLVEGVSEYGVGRWTEIKRLQFASSSHRTSVDLKDKWRNLLKASDTQLQNRRKVVLGRKQASQQVPESVLCRVRELAAIYPYPRENKSKESCSAPSTSS
        MKLVEGVSEYGVGRWTEIKRLQFASSSHRTSVDLKDKWRNLLKASDTQLQNRRKVVLGRKQASQQVPESVLCRVRELAAIYPYPRENKSKESCSAPSTSS
Subjt:  MKLVEGVSEYGVGRWTEIKRLQFASSSHRTSVDLKDKWRNLLKASDTQLQNRRKVVLGRKQASQQVPESVLCRVRELAAIYPYPRENKSKESCSAPSTSS

Query:  FKSTTNNMFVSLPTVM
        FKSTTNNMFVSLPTVM
Subjt:  FKSTTNNMFVSLPTVM

XP_008462901.1 PREDICTED: uncharacterized protein LOC103501169 isoform X1 [Cucumis melo] | 3.5e-250 | 88.95
Query:  MNAASKETMMTSNQATKMFTYEFPDVDSDLSIFPVPEDDVLGVEHFLEPDFNKCFSGSVLDFNTFHSHKCLGIEHFSFAVDFDQIKIDSESLHSSLTLEG
        MNAASKETMMTSNQATKMFTYEFPDVDSDLSIFPVPEDDVLGVEHFLEPDFNKCFSGSVLDFNTFHSHKCLGIEHFSFAVDFDQIKIDSESLHSSLTLEG
Subjt:  MNAASKETMMTSNQATKMFTYEFPDVDSDLSIFPVPEDDVLGVEHFLEPDFNKCFSGSVLDFNTFHSHKCLGIEHFSFAVDFDQIKIDSESLHSSLTLEG

Query:  ERTKQEGHDEKLRADDVCSGINKHPGAVNSRTTDDIFNLAPASSNASFLGNVDFGSYSQGFSTTDIGVDLSFAGEY------------------------
        ERTKQEGHDEKLRADDVCSGINKHPGAVN RTTDDIFNLAPASSNASFLGNVDFGSYSQGFSTTDIGVDLSFAG +                        
Subjt:  ERTKQEGHDEKLRADDVCSGINKHPGAVNSRTTDDIFNLAPASSNASFLGNVDFGSYSQGFSTTDIGVDLSFAGEY------------------------

Query:  -----------VGRQAYERKRLSSSDSSKTNFEHNESKINPFNEGNRGDTLVTEKRLRKPPRRYSEESVEQKSRSNSKKSALKASKDKSLHSESHKQQWQ
                      +  +RKRLSSSDS KTN+EHNESKINPFNEGNRGDTL+TEKRLRKPPRRYSEESVEQKSRSNSKKSALKASKDKSLHSESHKQQWQ
Subjt:  -----------VGRQAYERKRLSSSDSSKTNFEHNESKINPFNEGNRGDTLVTEKRLRKPPRRYSEESVEQKSRSNSKKSALKASKDKSLHSESHKQQWQ

Query:  KKIKAAPIVHKDKSFNGGCIQVPFGLPIEEGHSAKRE--HEPEEIKDNRILCIKDKYEVESFSAESEDENTEDECATKGNNTQKGNSRRKHHISWTLSEV
        KK+KAAPI+HKDKSFNGGCIQVPFGLPIEEGHSAK+    EPEEIKDNRILCIKDKYEVESFSAESEDENTEDEC TKGN+TQKGNSRRKHHISWTLSEV
Subjt:  KKIKAAPIVHKDKSFNGGCIQVPFGLPIEEGHSAKRE--HEPEEIKDNRILCIKDKYEVESFSAESEDENTEDECATKGNNTQKGNSRRKHHISWTLSEV

Query:  MKLVEGVSEYGVGRWTEIKRLQFASSSHRTSVDLKDKWRNLLKASDTQLQNRRKVVLGRKQASQQVPESVLCRVRELAAIYPYPRENKSKESCSAPSTSS
        MKLVEGVSEYGVGRWTEIKRLQFASSSHRTSVDLKDKWRNLLKASDTQLQNRRKVVLGRKQASQQVPESVLCRVRELAAIYPYPRENKSKESCSAPSTSS
Subjt:  MKLVEGVSEYGVGRWTEIKRLQFASSSHRTSVDLKDKWRNLLKASDTQLQNRRKVVLGRKQASQQVPESVLCRVRELAAIYPYPRENKSKESCSAPSTSS

Query:  FKSTTNNMFVSLPTVM
        FKSTTNNMFVSLPTVM
Subjt:  FKSTTNNMFVSLPTVM

XP_031741781.1 uncharacterized protein LOC101213992 isoform X1 [Cucumis sativus] | 4.2e-248 | 88.78
Query:  MNAASKETMMTSNQATKMFTYEFPDVDSDLSIFPVPEDDVLGVEHFLEPDFNKCFSGSVLDFNTFHSHKCLGIEHFSFAVDFDQIKIDSESLHSSLTLEG
        MNAASKETMMTSNQATK+FTYEFPDVDSDLSIFPVPEDDVLGVEHFLEPDFNKCFSGSVLDFNTFHSHKCLGIEHFSFAVDFDQIKIDSESLHSSLTLEG
Subjt:  MNAASKETMMTSNQATKMFTYEFPDVDSDLSIFPVPEDDVLGVEHFLEPDFNKCFSGSVLDFNTFHSHKCLGIEHFSFAVDFDQIKIDSESLHSSLTLEG

Query:  ERTKQEGHDEKLRADDVCSGINKHPGAVNSRTTDDIFNLAPASSNASFLGNVDFGSYSQGFSTTDIGVDLSFAGEY------------------------
        ERTKQEGHD+KLRADDVCSGINKHPGAVNSRTTDDIFNLAPASSNASFLGNVDFGSYSQGFSTTDIGVDLSFAG +                        
Subjt:  ERTKQEGHDEKLRADDVCSGINKHPGAVNSRTTDDIFNLAPASSNASFLGNVDFGSYSQGFSTTDIGVDLSFAGEY------------------------

Query:  -----------VGRQAYERKRLSSSDSSKTNFEHNESKINPFNEGNRGDTLVTEKRLRKPPRRYSEESVEQKSRSNSKKSALKASKDKSLHSESHKQQWQ
                      +  +RKRLSSSDS KTNFEH+ESKINPF+EGNRGDTLVTEKRLRKPPRRYSEESVEQKSRSNSKKSALKASKDKS HSESHKQQWQ
Subjt:  -----------VGRQAYERKRLSSSDSSKTNFEHNESKINPFNEGNRGDTLVTEKRLRKPPRRYSEESVEQKSRSNSKKSALKASKDKSLHSESHKQQWQ

Query:  KKIKAAPIVHKDKSFNGGCIQVPFGLPIEEGHSAKRE--HEPEEIKDNRILCIKDKYEVESFSAESEDENTEDECATKGNNTQKGNSRRKHHISWTLSEV
        KK+KAAPIVHKDKSFNGGCIQVPFGLPIEEGHSAK+    EPEEIKDNRILCIKDKYEVESFSAESEDENTEDECATKGN+TQKGNSRRKHHISWTLSEV
Subjt:  KKIKAAPIVHKDKSFNGGCIQVPFGLPIEEGHSAKRE--HEPEEIKDNRILCIKDKYEVESFSAESEDENTEDECATKGNNTQKGNSRRKHHISWTLSEV

Query:  MKLVEGVSEYGVGRWTEIKRLQFASSSHRTSVDLKDKWRNLLKASDTQLQNRRK-VVLGRKQASQQVPESVLCRVRELAAIYPYPRENKSKESCSAPSTS
        MKLVEGVSEYGVGRWTEIKRLQFASSSHRTSVDLKDKWRNLLKASDTQLQNRRK VVLGRKQASQQVPESVLCRVRELAAIYPYPRENKSKESCSAPSTS
Subjt:  MKLVEGVSEYGVGRWTEIKRLQFASSSHRTSVDLKDKWRNLLKASDTQLQNRRK-VVLGRKQASQQVPESVLCRVRELAAIYPYPRENKSKESCSAPSTS

Query:  SFKSTTNNMFVSLPTVM
        SFKSTTNNMFVSLPTVM
Subjt:  SFKSTTNNMFVSLPTVM

XP_038894300.1 uncharacterized protein LOC120082937 isoform X2 [Benincasa hispida] | 2.0e-242 | 87.02
Query:  MNAASKETMMTSNQATKMFTYEFPDVDSDLSIFPVPEDDVLGVEHFLEPDFNKCFSGSVLDFNTFHSHKCLGIEHFSFAVDFDQIKIDSESLHSSLTLEG
        MNAASKETMMTSNQATKMFTYEFPDVDSDLSIFPVPEDDVLGVEHFLEPDFNKCFSGSVLDFNTFHSHKCLGIEHFSFAVDFDQIKIDSESLHSSLTLEG
Subjt:  MNAASKETMMTSNQATKMFTYEFPDVDSDLSIFPVPEDDVLGVEHFLEPDFNKCFSGSVLDFNTFHSHKCLGIEHFSFAVDFDQIKIDSESLHSSLTLEG

Query:  ERTKQEGHDEKLRADDVCSGINKHPGAVNSRTTDDIFNLAPASSNASFLGNVDFGSYSQGFSTTDIGVDLSFAGEY------------------------
        ERTKQE HDEKLRADDVCSGINKHPGAVN RTTDDIFNLAP SSNASFLGNVDFGSYSQGF TTDIGVDLSF G +                        
Subjt:  ERTKQEGHDEKLRADDVCSGINKHPGAVNSRTTDDIFNLAPASSNASFLGNVDFGSYSQGFSTTDIGVDLSFAGEY------------------------

Query:  -----------VGRQAYERKRLSSSDSSKTNFEHNESKINPFNEGNRGDTLVTEKRLRKPPRRYSEESVEQKSRSNSKKSALKASKDKSLHSESHKQQWQ
                      +  +RKRLSS DSSKTNFEHNESKI PFNEGNRGDT+VTEKRLRKPPRRYSEESVEQKSRSNSK+S  KASKDK + SESHKQQWQ
Subjt:  -----------VGRQAYERKRLSSSDSSKTNFEHNESKINPFNEGNRGDTLVTEKRLRKPPRRYSEESVEQKSRSNSKKSALKASKDKSLHSESHKQQWQ

Query:  KKIKAAPIVHKDKSFNGGCIQVPFGLPIEEGHSAKRE--HEPEEIKDNRILCIKDKYEVESFSAESEDENTEDECATKGNNTQKGNSRRKHHISWTLSEV
        KKIKAAPIVHKDKSFNGGCIQVPFGLPIEEGHSAK+    EPEEIKDNRILCIKDKY+VESFSAESEDENTEDEC TK N+TQKGNSRRKHHISWTLSEV
Subjt:  KKIKAAPIVHKDKSFNGGCIQVPFGLPIEEGHSAKRE--HEPEEIKDNRILCIKDKYEVESFSAESEDENTEDECATKGNNTQKGNSRRKHHISWTLSEV

Query:  MKLVEGVSEYGVGRWTEIKRLQFASSSHRTSVDLKDKWRNLLKASDTQLQNRRKVVLGRKQASQQVPESVLCRVRELAAIYPYPRENKSKESCSAPSTSS
        MKLVEGVSEYGVGRWTEIKRLQFASSSHRTSVDLKDKWRNLLKASDTQLQNRRKVVLGRKQASQQVPESVLCRVRELAAIYPYPRENKSKESCSAPSTSS
Subjt:  MKLVEGVSEYGVGRWTEIKRLQFASSSHRTSVDLKDKWRNLLKASDTQLQNRRKVVLGRKQASQQVPESVLCRVRELAAIYPYPRENKSKESCSAPSTSS

Query:  FKSTTNNMFVSLPTVM
        FKSTTNNMFVSLPTVM
Subjt:  FKSTTNNMFVSLPTVM

TrEMBL top hits | e value | % identity | Alignment
A0A0A0LV10 HTH myb-type domain-containing protein | 8.3e-250 | 88.95
Query:  MNAASKETMMTSNQATKMFTYEFPDVDSDLSIFPVPEDDVLGVEHFLEPDFNKCFSGSVLDFNTFHSHKCLGIEHFSFAVDFDQIKIDSESLHSSLTLEG
        MNAASKETMMTSNQATK+FTYEFPDVDSDLSIFPVPEDDVLGVEHFLEPDFNKCFSGSVLDFNTFHSHKCLGIEHFSFAVDFDQIKIDSESLHSSLTLEG
Subjt:  MNAASKETMMTSNQATKMFTYEFPDVDSDLSIFPVPEDDVLGVEHFLEPDFNKCFSGSVLDFNTFHSHKCLGIEHFSFAVDFDQIKIDSESLHSSLTLEG

Query:  ERTKQEGHDEKLRADDVCSGINKHPGAVNSRTTDDIFNLAPASSNASFLGNVDFGSYSQGFSTTDIGVDLSFAGEY------------------------
        ERTKQEGHD+KLRADDVCSGINKHPGAVNSRTTDDIFNLAPASSNASFLGNVDFGSYSQGFSTTDIGVDLSFAG +                        
Subjt:  ERTKQEGHDEKLRADDVCSGINKHPGAVNSRTTDDIFNLAPASSNASFLGNVDFGSYSQGFSTTDIGVDLSFAGEY------------------------

Query:  -----------VGRQAYERKRLSSSDSSKTNFEHNESKINPFNEGNRGDTLVTEKRLRKPPRRYSEESVEQKSRSNSKKSALKASKDKSLHSESHKQQWQ
                      +  +RKRLSSSDS KTNFEH+ESKINPF+EGNRGDTLVTEKRLRKPPRRYSEESVEQKSRSNSKKSALKASKDKS HSESHKQQWQ
Subjt:  -----------VGRQAYERKRLSSSDSSKTNFEHNESKINPFNEGNRGDTLVTEKRLRKPPRRYSEESVEQKSRSNSKKSALKASKDKSLHSESHKQQWQ

Query:  KKIKAAPIVHKDKSFNGGCIQVPFGLPIEEGHSAKRE--HEPEEIKDNRILCIKDKYEVESFSAESEDENTEDECATKGNNTQKGNSRRKHHISWTLSEV
        KK+KAAPIVHKDKSFNGGCIQVPFGLPIEEGHSAK+    EPEEIKDNRILCIKDKYEVESFSAESEDENTEDECATKGN+TQKGNSRRKHHISWTLSEV
Subjt:  KKIKAAPIVHKDKSFNGGCIQVPFGLPIEEGHSAKRE--HEPEEIKDNRILCIKDKYEVESFSAESEDENTEDECATKGNNTQKGNSRRKHHISWTLSEV

Query:  MKLVEGVSEYGVGRWTEIKRLQFASSSHRTSVDLKDKWRNLLKASDTQLQNRRKVVLGRKQASQQVPESVLCRVRELAAIYPYPRENKSKESCSAPSTSS
        MKLVEGVSEYGVGRWTEIKRLQFASSSHRTSVDLKDKWRNLLKASDTQLQNRRKVVLGRKQASQQVPESVLCRVRELAAIYPYPRENKSKESCSAPSTSS
Subjt:  MKLVEGVSEYGVGRWTEIKRLQFASSSHRTSVDLKDKWRNLLKASDTQLQNRRKVVLGRKQASQQVPESVLCRVRELAAIYPYPRENKSKESCSAPSTSS

Query:  FKSTTNNMFVSLPTVM
        FKSTTNNMFVSLPTVM
Subjt:  FKSTTNNMFVSLPTVM

A0A1S3CHY4 uncharacterized protein LOC103501169 isoform X1 | 1.7e-250 | 88.95
Query:  MNAASKETMMTSNQATKMFTYEFPDVDSDLSIFPVPEDDVLGVEHFLEPDFNKCFSGSVLDFNTFHSHKCLGIEHFSFAVDFDQIKIDSESLHSSLTLEG
        MNAASKETMMTSNQATKMFTYEFPDVDSDLSIFPVPEDDVLGVEHFLEPDFNKCFSGSVLDFNTFHSHKCLGIEHFSFAVDFDQIKIDSESLHSSLTLEG
Subjt:  MNAASKETMMTSNQATKMFTYEFPDVDSDLSIFPVPEDDVLGVEHFLEPDFNKCFSGSVLDFNTFHSHKCLGIEHFSFAVDFDQIKIDSESLHSSLTLEG

Query:  ERTKQEGHDEKLRADDVCSGINKHPGAVNSRTTDDIFNLAPASSNASFLGNVDFGSYSQGFSTTDIGVDLSFAGEY------------------------
        ERTKQEGHDEKLRADDVCSGINKHPGAVN RTTDDIFNLAPASSNASFLGNVDFGSYSQGFSTTDIGVDLSFAG +                        
Subjt:  ERTKQEGHDEKLRADDVCSGINKHPGAVNSRTTDDIFNLAPASSNASFLGNVDFGSYSQGFSTTDIGVDLSFAGEY------------------------

Query:  -----------VGRQAYERKRLSSSDSSKTNFEHNESKINPFNEGNRGDTLVTEKRLRKPPRRYSEESVEQKSRSNSKKSALKASKDKSLHSESHKQQWQ
                      +  +RKRLSSSDS KTN+EHNESKINPFNEGNRGDTL+TEKRLRKPPRRYSEESVEQKSRSNSKKSALKASKDKSLHSESHKQQWQ
Subjt:  -----------VGRQAYERKRLSSSDSSKTNFEHNESKINPFNEGNRGDTLVTEKRLRKPPRRYSEESVEQKSRSNSKKSALKASKDKSLHSESHKQQWQ

Query:  KKIKAAPIVHKDKSFNGGCIQVPFGLPIEEGHSAKRE--HEPEEIKDNRILCIKDKYEVESFSAESEDENTEDECATKGNNTQKGNSRRKHHISWTLSEV
        KK+KAAPI+HKDKSFNGGCIQVPFGLPIEEGHSAK+    EPEEIKDNRILCIKDKYEVESFSAESEDENTEDEC TKGN+TQKGNSRRKHHISWTLSEV
Subjt:  KKIKAAPIVHKDKSFNGGCIQVPFGLPIEEGHSAKRE--HEPEEIKDNRILCIKDKYEVESFSAESEDENTEDECATKGNNTQKGNSRRKHHISWTLSEV

Query:  MKLVEGVSEYGVGRWTEIKRLQFASSSHRTSVDLKDKWRNLLKASDTQLQNRRKVVLGRKQASQQVPESVLCRVRELAAIYPYPRENKSKESCSAPSTSS
        MKLVEGVSEYGVGRWTEIKRLQFASSSHRTSVDLKDKWRNLLKASDTQLQNRRKVVLGRKQASQQVPESVLCRVRELAAIYPYPRENKSKESCSAPSTSS
Subjt:  MKLVEGVSEYGVGRWTEIKRLQFASSSHRTSVDLKDKWRNLLKASDTQLQNRRKVVLGRKQASQQVPESVLCRVRELAAIYPYPRENKSKESCSAPSTSS

Query:  FKSTTNNMFVSLPTVM
        FKSTTNNMFVSLPTVM
Subjt:  FKSTTNNMFVSLPTVM

A0A5D3DCR8 Telomere repeat-binding protein 1-like isoform X2 | 1.7e-250 | 88.95
Query:  MNAASKETMMTSNQATKMFTYEFPDVDSDLSIFPVPEDDVLGVEHFLEPDFNKCFSGSVLDFNTFHSHKCLGIEHFSFAVDFDQIKIDSESLHSSLTLEG
        MNAASKETMMTSNQATKMFTYEFPDVDSDLSIFPVPEDDVLGVEHFLEPDFNKCFSGSVLDFNTFHSHKCLGIEHFSFAVDFDQIKIDSESLHSSLTLEG
Subjt:  MNAASKETMMTSNQATKMFTYEFPDVDSDLSIFPVPEDDVLGVEHFLEPDFNKCFSGSVLDFNTFHSHKCLGIEHFSFAVDFDQIKIDSESLHSSLTLEG

Query:  ERTKQEGHDEKLRADDVCSGINKHPGAVNSRTTDDIFNLAPASSNASFLGNVDFGSYSQGFSTTDIGVDLSFAGEY------------------------
        ERTKQEGHDEKLRADDVCSGINKHPGAVN RTTDDIFNLAPASSNASFLGNVDFGSYSQGFSTTDIGVDLSFAG +                        
Subjt:  ERTKQEGHDEKLRADDVCSGINKHPGAVNSRTTDDIFNLAPASSNASFLGNVDFGSYSQGFSTTDIGVDLSFAGEY------------------------

Query:  -----------VGRQAYERKRLSSSDSSKTNFEHNESKINPFNEGNRGDTLVTEKRLRKPPRRYSEESVEQKSRSNSKKSALKASKDKSLHSESHKQQWQ
                      +  +RKRLSSSDS KTN+EHNESKINPFNEGNRGDTL+TEKRLRKPPRRYSEESVEQKSRSNSKKSALKASKDKSLHSESHKQQWQ
Subjt:  -----------VGRQAYERKRLSSSDSSKTNFEHNESKINPFNEGNRGDTLVTEKRLRKPPRRYSEESVEQKSRSNSKKSALKASKDKSLHSESHKQQWQ

Query:  KKIKAAPIVHKDKSFNGGCIQVPFGLPIEEGHSAKRE--HEPEEIKDNRILCIKDKYEVESFSAESEDENTEDECATKGNNTQKGNSRRKHHISWTLSEV
        KK+KAAPI+HKDKSFNGGCIQVPFGLPIEEGHSAK+    EPEEIKDNRILCIKDKYEVESFSAESEDENTEDEC TKGN+TQKGNSRRKHHISWTLSEV
Subjt:  KKIKAAPIVHKDKSFNGGCIQVPFGLPIEEGHSAKRE--HEPEEIKDNRILCIKDKYEVESFSAESEDENTEDECATKGNNTQKGNSRRKHHISWTLSEV

Query:  MKLVEGVSEYGVGRWTEIKRLQFASSSHRTSVDLKDKWRNLLKASDTQLQNRRKVVLGRKQASQQVPESVLCRVRELAAIYPYPRENKSKESCSAPSTSS
        MKLVEGVSEYGVGRWTEIKRLQFASSSHRTSVDLKDKWRNLLKASDTQLQNRRKVVLGRKQASQQVPESVLCRVRELAAIYPYPRENKSKESCSAPSTSS
Subjt:  MKLVEGVSEYGVGRWTEIKRLQFASSSHRTSVDLKDKWRNLLKASDTQLQNRRKVVLGRKQASQQVPESVLCRVRELAAIYPYPRENKSKESCSAPSTSS

Query:  FKSTTNNMFVSLPTVM
        FKSTTNNMFVSLPTVM
Subjt:  FKSTTNNMFVSLPTVM

A0A6J1HPN0 uncharacterized protein LOC111464895 isoform X1 | 5.1e-223 | 81.43
Query:  MNAASKETMMTSNQATKMFTYEFPDVDSDLSIFPVPEDDVLGVEHFLEPDFNKCFSGSVLDFNTFHSHKCLGIEHFSFAVDFDQIKIDSESLHSSLTLEG
        M+AASKETMMTSNQATKMFTYEFPDVDSDLSIFPVPEDDVLGVEHFLEP+ NKCFSGSVLDFNTFHSHKCLGIEHFSFAVDFD IKIDSESLHSSLTLEG
Subjt:  MNAASKETMMTSNQATKMFTYEFPDVDSDLSIFPVPEDDVLGVEHFLEPDFNKCFSGSVLDFNTFHSHKCLGIEHFSFAVDFDQIKIDSESLHSSLTLEG

Query:  ERTKQEGHDEKLR-ADDVCSGINKHPGAVNSRTTDDIFNLAPASSNASFLGNVDFGSYSQGFSTTDIGVDLSFAGEY-----------------------
        ERTKQE HD KL  ADDVCSG NK  G  NSRTTDDIFNLAP SSNASFLGNVDFGSYSQGF TTDIGVDLSF G +                       
Subjt:  ERTKQEGHDEKLR-ADDVCSGINKHPGAVNSRTTDDIFNLAPASSNASFLGNVDFGSYSQGFSTTDIGVDLSFAGEY-----------------------

Query:  ------------VGRQAYERKRLSSSDSSKTNFEHNESKINPFNEGNRGDTLVTEKRLRKPPRRYSEESVEQKSRSNSKKSALKASKDKSLHSESHKQQW
                       +  +RKRLSS  SSKTN EHNES+I  FNEGNRGDTLVTEKRLRKPPRRYSEESVEQKSRSNSKKSA +ASKDKS  SESHK QW
Subjt:  ------------VGRQAYERKRLSSSDSSKTNFEHNESKINPFNEGNRGDTLVTEKRLRKPPRRYSEESVEQKSRSNSKKSALKASKDKSLHSESHKQQW

Query:  QKKIKAAPIVHKDKSFNGGCIQVPFGLPIEEGHSAKRE--HEPEEIKDNRILCIKDKYEVESFSAESEDENTEDECATKGNNTQKGNSRRKHHISWTLSE
        QKK+KAAP+V+KDKSFNGGCIQVPFGLPIEE HSAK+    EPEEIK+NRILCIKDK +VESFSAESEDENTEDECATK  N QKGNSRRKHHISWTLSE
Subjt:  QKKIKAAPIVHKDKSFNGGCIQVPFGLPIEEGHSAKRE--HEPEEIKDNRILCIKDKYEVESFSAESEDENTEDECATKGNNTQKGNSRRKHHISWTLSE

Query:  VMKLVEGVSEYGVGRWTEIKRLQFASSSHRTSVDLKDKWRNLLKASDTQLQNRRKVVLGRKQASQQVPESVLCRVRELAAIYPYPRENKSKESCSAPSTS
        VMKL+EGVSEYGVGRWTEIKRLQF+SSSHRTSVDLKDKWRNLLKASDTQLQNRRKVV+GRKQ SQQVPESVLCRVRELAAIYPYPRENKSK  CSAPS+S
Subjt:  VMKLVEGVSEYGVGRWTEIKRLQFASSSHRTSVDLKDKWRNLLKASDTQLQNRRKVVLGRKQASQQVPESVLCRVRELAAIYPYPRENKSKESCSAPSTS

Query:  SFKSTTNNMFVSLPTVM
        SFKST NNM +SLPTVM
Subjt:  SFKSTTNNMFVSLPTVM

A0A6J1I2E4 uncharacterized protein LOC111469282 isoform X1 | 5.8e-219 | 80.85
Query:  MNAASKETMMTSNQATKMFTYEFPDVDSDLSIFPVPEDDVLGVEHFLEPDFNKCFSGSVLDFNTFHSHKCLGIEHFSFAVDFDQIKIDSESLHSSLTLEG
        MNAASKETMMTSNQATKMFTYEFPDVDSDLSIFPVPEDDVLGVEHFLEP+ NKCFSGSVLDFNTFHSHKCLGIE FSFAVDFD IKIDSESLHSSLTLEG
Subjt:  MNAASKETMMTSNQATKMFTYEFPDVDSDLSIFPVPEDDVLGVEHFLEPDFNKCFSGSVLDFNTFHSHKCLGIEHFSFAVDFDQIKIDSESLHSSLTLEG

Query:  ERTKQEGHDEKLR-ADDVCSGINKHPGAVNSRTTDDIFNLAPASSNASFLGNVDFGSYSQGFSTTDIGVDLSFAGEY-----------------------
        ERTKQE HD KL  ADDVCSG NK  G  NSRTTDDIFNLAP SSNASFLG+VDFGS SQGF TTDIGVDLSF G Y                       
Subjt:  ERTKQEGHDEKLR-ADDVCSGINKHPGAVNSRTTDDIFNLAPASSNASFLGNVDFGSYSQGFSTTDIGVDLSFAGEY-----------------------

Query:  ------------VGRQAYERKRLSSSDSSKTNFEHNESKINPFNEGNRGDTLVTEKRLRKPPRRYSEESVEQKSRSNSKKSALKASKDKSLHSESHKQQW
                       +  +RKRLSS  SSKTN EHNES+I  FNEG+RGDTLVTEKRLRKPPRRYSEESVEQKSRSNSKKSA +ASKDKS  SESHKQQW
Subjt:  ------------VGRQAYERKRLSSSDSSKTNFEHNESKINPFNEGNRGDTLVTEKRLRKPPRRYSEESVEQKSRSNSKKSALKASKDKSLHSESHKQQW

Query:  QKKIKAAPIVHKDKSFNGGCIQVPFGLPIEEGHSAKRE--HEPEEIKDNRILCIKDKYEVESFSAESEDENTEDECATKGNNTQKGNSRRKHHISWTLSE
        QKK+KAAP+V+KDKSFNGGCIQVPFGLPIEE HSAK+    EPEEIK+NRILCIKDK +VESFSAESEDENTEDECATK  N QKGNSRRKHHISWTLSE
Subjt:  QKKIKAAPIVHKDKSFNGGCIQVPFGLPIEEGHSAKRE--HEPEEIKDNRILCIKDKYEVESFSAESEDENTEDECATKGNNTQKGNSRRKHHISWTLSE

Query:  VMKLVEGVSEYGVGRWTEIKRLQFASSSHRTSVDLKDKWRNLLKASDTQLQNRRKVVLGRKQASQQVPESVLCRVRELAAIYPYPRENKSKESCSAPSTS
        VMKL+EGVSEYGVGRWTEIKRLQF+SSSHRTSVDLKDKWRNLLKASDTQLQNRRKVV+GRKQ SQQVPESVLCRVRELAAIYPYPRENKSK   SAPS+S
Subjt:  VMKLVEGVSEYGVGRWTEIKRLQFASSSHRTSVDLKDKWRNLLKASDTQLQNRRKVVLGRKQASQQVPESVLCRVRELAAIYPYPRENKSKESCSAPSTS

Query:  SFKSTTNNMFVSLPTVM
        S KST NNM +SLPTVM
Subjt:  SFKSTTNNMFVSLPTVM

SwissProt top hits | e value | % identity | Alignment
Q9C7B1 Telomere repeat-binding protein 3 | 1.8e-07 | 32.43
Query:  SRRKHHISWTLSEVMKLVEGVSEYGVGRWTEIKRLQFASSSHRTSVDLKDKWRNLLKASDTQLQNRRKVVLGRKQASQQVPESVLCRVRELAAIYPYPRE
        ++R+    ++++EV  LV+ V E G GRW ++K   F  + HRT VDLKDKW+ L+  +    Q RR          + VP+ +L RV      Y Y  +
Subjt:  SRRKHHISWTLSEVMKLVEGVSEYGVGRWTEIKRLQFASSSHRTSVDLKDKWRNLLKASDTQLQNRRKVVLGRKQASQQVPESVLCRVRELAAIYPYPRE

Query:  NKSKESCSAPS
        ++ K      S
Subjt:  NKSKESCSAPS

Q9FFY9 Telomere repeat-binding protein 4 | 1.2e-08 | 38.64
Query:  SRRKHHISWTLSEVMKLVEGVSEYGVGRWTEIKRLQFASSSHRTSVDLKDKWRNLLKASDTQLQNRRKVVLGRKQASQQVPESVLCRV
        S+R+    ++++EV  LV  V E G GRW ++K   F ++SHRT VDLKDKW+ L+  +    Q RR          + VP+ +L RV
Subjt:  SRRKHHISWTLSEVMKLVEGVSEYGVGRWTEIKRLQFASSSHRTSVDLKDKWRNLLKASDTQLQNRRKVVLGRKQASQQVPESVLCRV

Q9LL45 Telomere-binding protein 1 | 6.8e-07 | 37.93
Query:  RRKHHISWTLSEVMKLVEGVSEYGVGRWTEIKRLQFASSSHRTSVDLKDKWRNLLKASDTQLQNRRKVVLGRKQASQQVPESVLCRV
        +R+    +T++EV  LVE V   G GRW ++K   F +  HRT VDLKDKW+ L+  +    Q RR            VP+ +L RV
Subjt:  RRKHHISWTLSEVMKLVEGVSEYGVGRWTEIKRLQFASSSHRTSVDLKDKWRNLLKASDTQLQNRRKVVLGRKQASQQVPESVLCRV

Q9M347 Telomere repeat-binding protein 6 | 4.0e-07 | 36.78
Query:  RRKHHISWTLSEVMKLVEGVSEYGVGRWTEIKRLQFASSSHRTSVDLKDKWRNLLKASDTQLQNRRKVVLGRKQASQQVPESVLCRV
        +R+    +T+SEV  LV+ V   G GRW ++K   F   +HRT VDLKDKW+ L+  +    + RR          + VP+ +L RV
Subjt:  RRKHHISWTLSEVMKLVEGVSEYGVGRWTEIKRLQFASSSHRTSVDLKDKWRNLLKASDTQLQNRRKVVLGRKQASQQVPESVLCRV

Q9SNB9 Telomere repeat-binding protein 2 | 5.2e-07 | 33.68
Query:  SRRKHHISWTLSEVMKLVEGVSEYGVGRWTEIKRLQFASSSHRTSVDLKDKWRNLLKASDTQLQNRRKVVLGRKQASQQVPESVLCRVRELAAIY
        ++R+    ++++EV  LV+ V + G GRW ++K   F  + HRT VDLKDKW+ L+  +    Q RR          + VP+ +L RV +  A +
Subjt:  SRRKHHISWTLSEVMKLVEGVSEYGVGRWTEIKRLQFASSSHRTSVDLKDKWRNLLKASDTQLQNRRKVVLGRKQASQQVPESVLCRVRELAAIY

Arabidopsis top hits | e value | % identity | Alignment
AT1G17460.1 TRF-like 3 | 5.5e-20 | 33.09
Query:  KRLRKPPRRYSEESVEQKSRSNSKKSALKASKDKSLHSESHKQQWQKKIKAAPIVHKDKSFNGGCIQVPFGLPIEEGHSAKREH----------------
        KR+RKP RRY EE+ E++    S   +   S  +++ SE              +V +  S  G  IQVP+   +    S  RE+                
Subjt:  KRLRKPPRRYSEESVEQKSRSNSKKSALKASKDKSLHSESHKQQWQKKIKAAPIVHKDKSFNGGCIQVPFGLPIEEGHSAKREH----------------

Query:  ---------EPEEIKD--NRILCIK----------DKYEVESFSAESEDENTEDECATKGNNTQKGN----------SRRKHHISWTLSEVMKLVEGVSE
                  P ++ +  NR+  +K          DK  ++    + + E  E E      ++   N          S RK H +WT+SEV KLVEGVS+
Subjt:  ---------EPEEIKD--NRILCIK----------DKYEVESFSAESEDENTEDECATKGNNTQKGN----------SRRKHHISWTLSEVMKLVEGVSE

Query:  YGVGRWTEIKRLQFASSSHRTSVDLKDKWRNLLKASDTQLQNRRKVVLGRKQASQQVPESVLCRVRELA
        YGVG+WTEIK+L F+  +HRT+VDLKDKWRNL KAS +   NR +  L +K  S  +P  ++ +VRELA
Subjt:  YGVGRWTEIKRLQFASSSHRTSVDLKDKWRNLLKASDTQLQNRRKVVLGRKQASQQVPESVLCRVRELA

AT1G72650.1 TRF-like 6 | 2.2e-24 | 34.32
Query:  KRLRKPPRRYSEESVEQKSRSNSKKSALKASKDKSLHSESHKQQWQKKIKAAPIVHKDKSFNGGCIQVPFGLPIEEG------------HSAKREHEPEE
        KR+RKP RRY EE  E   +  + KS +  SKD+ L  +S  +           V +  S  G  I+VP+   +               HS+  E +   
Subjt:  KRLRKPPRRYSEESVEQKSRSNSKKSALKASKDKSLHSESHKQQWQKKIKAAPIVHKDKSFNGGCIQVPFGLPIEEG------------HSAKREHEPEE

Query:  IKDNRIL---------------------------CIKDKYEVESFSAESEDENTEDECATKGNNT----------QKGNSRRKHHISWTLSEVMKLVEGV
         + N  L                              D+  VE   +E + E   +   + GN++          Q G  RRKHH +WTLSE+ KLVEGV
Subjt:  IKDNRIL---------------------------CIKDKYEVESFSAESEDENTEDECATKGNNT----------QKGNSRRKHHISWTLSEVMKLVEGV

Query:  SEYGVGRWTEIKRLQFASSSHRTSVDLKDKWRNLLKASDTQLQNRRKVVLGRKQASQQVPESVLCRVRELA
        S+YG G+W+EIK+  F+S S+RTSVDLKDKWRNLLK S  Q  +     L +K  S  +P  +L RVRELA
Subjt:  SEYGVGRWTEIKRLQFASSSHRTSVDLKDKWRNLLKASDTQLQNRRKVVLGRKQASQQVPESVLCRVRELA

AT1G72650.2 TRF-like 6 | 2.2e-24 | 34.32
Query:  KRLRKPPRRYSEESVEQKSRSNSKKSALKASKDKSLHSESHKQQWQKKIKAAPIVHKDKSFNGGCIQVPFGLPIEEG------------HSAKREHEPEE
        KR+RKP RRY EE  E   +  + KS +  SKD+ L  +S  +           V +  S  G  I+VP+   +               HS+  E +   
Subjt:  KRLRKPPRRYSEESVEQKSRSNSKKSALKASKDKSLHSESHKQQWQKKIKAAPIVHKDKSFNGGCIQVPFGLPIEEG------------HSAKREHEPEE

Query:  IKDNRIL---------------------------CIKDKYEVESFSAESEDENTEDECATKGNNT----------QKGNSRRKHHISWTLSEVMKLVEGV
         + N  L                              D+  VE   +E + E   +   + GN++          Q G  RRKHH +WTLSE+ KLVEGV
Subjt:  IKDNRIL---------------------------CIKDKYEVESFSAESEDENTEDECATKGNNT----------QKGNSRRKHHISWTLSEVMKLVEGV

Query:  SEYGVGRWTEIKRLQFASSSHRTSVDLKDKWRNLLKASDTQLQNRRKVVLGRKQASQQVPESVLCRVRELA
        S+YG G+W+EIK+  F+S S+RTSVDLKDKWRNLLK S  Q  +     L +K  S  +P  +L RVRELA
Subjt:  SEYGVGRWTEIKRLQFASSSHRTSVDLKDKWRNLLKASDTQLQNRRKVVLGRKQASQQVPESVLCRVRELA

AT2G37025.1 TRF-like 8 | 1.1e-23 | 43.88
Query:  ENTEDECATKGNNTQKGNSRRKHHISWTLSEVMKLVEGVSEYGVGRWTEIKRLQFASSSHRTSVDLKDKWRNLLKASDTQLQNRRKVVLGRKQASQQVPE
        E+ +D   TK + T+  + RRK+   WTL EVM LV+G+S +GVG+WT+IK   F  ++HR  VD++DKWRNLLKAS  +  N  +    RK  ++ +P+
Subjt:  ENTEDECATKGNNTQKGNSRRKHHISWTLSEVMKLVEGVSEYGVGRWTEIKRLQFASSSHRTSVDLKDKWRNLLKASDTQLQNRRKVVLGRKQASQQVPE

Query:  SVLCRVRELAAIYPYPRENKSKESCSAPSTSSFKSTTNN
         +L RVRELA+++PYP    SK  C    +S  +ST+ N
Subjt:  SVLCRVRELAAIYPYPRENKSKESCSAPSTSSFKSTTNN

AT2G37025.2 TRF-like 8 | 1.1e-23 | 43.88
Query:  ENTEDECATKGNNTQKGNSRRKHHISWTLSEVMKLVEGVSEYGVGRWTEIKRLQFASSSHRTSVDLKDKWRNLLKASDTQLQNRRKVVLGRKQASQQVPE
        E+ +D   TK + T+  + RRK+   WTL EVM LV+G+S +GVG+WT+IK   F  ++HR  VD++DKWRNLLKAS  +  N  +    RK  ++ +P+
Subjt:  ENTEDECATKGNNTQKGNSRRKHHISWTLSEVMKLVEGVSEYGVGRWTEIKRLQFASSSHRTSVDLKDKWRNLLKASDTQLQNRRKVVLGRKQASQQVPE

Query:  SVLCRVRELAAIYPYPRENKSKESCSAPSTSSFKSTTNN
         +L RVRELA+++PYP    SK  C    +S  +ST+ N
Subjt:  SVLCRVRELAAIYPYPRENKSKESCSAPSTSSFKSTTNN


Sequences
CDS sequence
ATGAATGCGGCGTCCAAGGAAACTATGATGACTTCCAACCAGGCTACAAAAATGTTTACTTATGAGTTTCCTGATGTTGACTCGGATTTATCTATTTTCCCTGTTCCAGA
AGATGATGTCTTAGGGGTGGAACATTTCTTGGAACCTGACTTTAACAAATGCTTTTCTGGCAGTGTTCTAGATTTCAACACGTTCCACTCTCATAAGTGCTTAGGCATTG
AACATTTTTCGTTTGCTGTTGATTTTGATCAGATAAAGATCGATAGTGAATCATTGCATTCTAGCCTTACACTAGAAGGAGAGAGAACTAAGCAAGAGGGTCATGATGAA
AAACTGCGAGCTGATGATGTGTGTTCTGGGATTAACAAACATCCGGGAGCTGTGAACAGTCGAACTACAGATGATATCTTCAATTTGGCTCCTGCTTCATCCAATGCTTC
TTTTCTAGGCAATGTGGATTTTGGTAGCTACTCACAAGGCTTTTCTACAACAGACATTGGTGTTGATTTGTCTTTTGCTGGAGAATATGTCGGACGACAGGCCTATGAAA
GGAAGAGACTTTCAAGCAGCGACTCCTCGAAGACAAATTTTGAACACAACGAATCTAAAATCAATCCCTTTAATGAAGGGAATAGGGGAGATACACTTGTTACAGAAAAG
AGATTGCGAAAGCCTCCCAGGAGATACAGTGAAGAGTCAGTTGAACAAAAATCGAGATCTAACAGCAAGAAAAGCGCTCTTAAAGCTTCCAAGGATAAATCTCTCCATTC
TGAGTCTCACAAGCAGCAATGGCAGAAGAAAATTAAAGCAGCACCAATCGTTCATAAAGATAAATCATTCAATGGAGGTTGTATTCAAGTTCCGTTCGGTCTTCCAATAG
AAGAAGGTCATTCAGCAAAAAGAGAACATGAGCCGGAGGAGATCAAGGACAATAGAATATTGTGCATAAAAGATAAATATGAAGTAGAGTCTTTCTCAGCCGAGTCCGAA
GATGAGAACACTGAAGATGAATGTGCCACCAAAGGTAACAATACTCAAAAAGGCAACAGTCGCAGGAAGCATCATATCTCGTGGACTCTTTCTGAGGTAATGAAGTTGGT
AGAAGGTGTTTCAGAGTATGGAGTTGGAAGGTGGACAGAAATAAAGAGGCTACAGTTTGCATCATCTTCACATAGAACATCTGTGGATCTCAAGGACAAATGGAGAAATC
TATTGAAGGCAAGTGACACACAGTTGCAGAACAGAAGAAAGGTGGTGCTTGGTCGGAAGCAAGCATCTCAGCAAGTACCAGAGTCTGTTCTATGTCGTGTTCGGGAATTG
GCGGCTATTTATCCATATCCTAGGGAAAACAAATCCAAGGAATCATGCTCAGCTCCTTCAACTTCATCCTTCAAATCTACTACTAATAACATGTTTGTATCTTTGCCCAC
AGTTATGTGA
mRNA sequence
ATGAATGCGGCGTCCAAGGAAACTATGATGACTTCCAACCAGGCTACAAAAATGTTTACTTATGAGTTTCCTGATGTTGACTCGGATTTATCTATTTTCCCTGTTCCAGA
AGATGATGTCTTAGGGGTGGAACATTTCTTGGAACCTGACTTTAACAAATGCTTTTCTGGCAGTGTTCTAGATTTCAACACGTTCCACTCTCATAAGTGCTTAGGCATTG
AACATTTTTCGTTTGCTGTTGATTTTGATCAGATAAAGATCGATAGTGAATCATTGCATTCTAGCCTTACACTAGAAGGAGAGAGAACTAAGCAAGAGGGTCATGATGAA
AAACTGCGAGCTGATGATGTGTGTTCTGGGATTAACAAACATCCGGGAGCTGTGAACAGTCGAACTACAGATGATATCTTCAATTTGGCTCCTGCTTCATCCAATGCTTC
TTTTCTAGGCAATGTGGATTTTGGTAGCTACTCACAAGGCTTTTCTACAACAGACATTGGTGTTGATTTGTCTTTTGCTGGAGAATATGTCGGACGACAGGCCTATGAAA
GGAAGAGACTTTCAAGCAGCGACTCCTCGAAGACAAATTTTGAACACAACGAATCTAAAATCAATCCCTTTAATGAAGGGAATAGGGGAGATACACTTGTTACAGAAAAG
AGATTGCGAAAGCCTCCCAGGAGATACAGTGAAGAGTCAGTTGAACAAAAATCGAGATCTAACAGCAAGAAAAGCGCTCTTAAAGCTTCCAAGGATAAATCTCTCCATTC
TGAGTCTCACAAGCAGCAATGGCAGAAGAAAATTAAAGCAGCACCAATCGTTCATAAAGATAAATCATTCAATGGAGGTTGTATTCAAGTTCCGTTCGGTCTTCCAATAG
AAGAAGGTCATTCAGCAAAAAGAGAACATGAGCCGGAGGAGATCAAGGACAATAGAATATTGTGCATAAAAGATAAATATGAAGTAGAGTCTTTCTCAGCCGAGTCCGAA
GATGAGAACACTGAAGATGAATGTGCCACCAAAGGTAACAATACTCAAAAAGGCAACAGTCGCAGGAAGCATCATATCTCGTGGACTCTTTCTGAGGTAATGAAGTTGGT
AGAAGGTGTTTCAGAGTATGGAGTTGGAAGGTGGACAGAAATAAAGAGGCTACAGTTTGCATCATCTTCACATAGAACATCTGTGGATCTCAAGGACAAATGGAGAAATC
TATTGAAGGCAAGTGACACACAGTTGCAGAACAGAAGAAAGGTGGTGCTTGGTCGGAAGCAAGCATCTCAGCAAGTACCAGAGTCTGTTCTATGTCGTGTTCGGGAATTG
GCGGCTATTTATCCATATCCTAGGGAAAACAAATCCAAGGAATCATGCTCAGCTCCTTCAACTTCATCCTTCAAATCTACTACTAATAACATGTTTGTATCTTTGCCCAC
AGTTATGTGA
Protein sequence
MNAASKETMMTSNQATKMFTYEFPDVDSDLSIFPVPEDDVLGVEHFLEPDFNKCFSGSVLDFNTFHSHKCLGIEHFSFAVDFDQIKIDSESLHSSLTLEGERTKQEGHDE
KLRADDVCSGINKHPGAVNSRTTDDIFNLAPASSNASFLGNVDFGSYSQGFSTTDIGVDLSFAGEYVGRQAYERKRLSSSDSSKTNFEHNESKINPFNEGNRGDTLVTEK
RLRKPPRRYSEESVEQKSRSNSKKSALKASKDKSLHSESHKQQWQKKIKAAPIVHKDKSFNGGCIQVPFGLPIEEGHSAKREHEPEEIKDNRILCIKDKYEVESFSAESE
DENTEDECATKGNNTQKGNSRRKHHISWTLSEVMKLVEGVSEYGVGRWTEIKRLQFASSSHRTSVDLKDKWRNLLKASDTQLQNRRKVVLGRKQASQQVPESVLCRVREL
AAIYPYPRENKSKESCSAPSTSSFKSTTNNMFVSLPTVM
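
The CDS and protein sequences listed above can be cross-checked by translating the CDS. The following is a minimal sketch, assuming the standard nuclear genetic code and that the CDS starts in frame at its first base. The helper name `translate` is hypothetical, and the two short inputs in the example are the first and last codons copied from the CDS above.

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon.

    Whitespace and line breaks are removed first, so the wrapped
    sequence block can be pasted in as-is. Codons containing
    characters other than A, C, G, T become 'X'.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE.get(cds[i:i + 3], "X")
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    print(translate("ATGAATGCGGCGTCCAAGGAA"))  # first seven codons of the CDS
    print(translate("TTGCCCACAGTTATGTGA"))     # last six codons, including the stop
```

Pasting the full CDS block into `translate` and comparing the result with the protein block (after removing line breaks) is one way to confirm the two records agree.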