; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC03g0352 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC03g0352
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionHTH myb-type domain-containing protein
Genome locationMC03:10089001..10097594
RNA-Seq ExpressionMC03g0352
SyntenyMC03g0352
Gene Ontology termsNA
InterPro domainsIPR001005 - SANT/Myb domain
IPR009057 - Homeobox-like domain superfamily
IPR017930 - Myb domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6607151.1 Telomere repeat-binding protein 4, partial [Cucurbita argyrosperma subsp. sororia]9.79e-31086.81Show/hide
Query:  MNAASKETMMTSNQTTKMFTYEFPDVDSDLSIFPVPEEDVLGVEHFLEPDFNKCFSGSVLDFNTFHSHKCLSIENFSFAVEFDQIKIDSESLHSSLTLEG
        MNAASKETMMTSNQ TKMFTYEFPDVDSDLSIFP PE+DVLGVEHFL+PD+NKCF GSVLDFNTFHSHKCLSIENFSFAVEFDQIKIDSESLHSSLTLEG
Subjt:  MNAASKETMMTSNQTTKMFTYEFPDVDSDLSIFPVPEEDVLGVEHFLEPDFNKCFSGSVLDFNTFHSHKCLSIENFSFAVEFDQIKIDSESLHSSLTLEG

Query:  ERTKQEDHDEKSQGTDDVVCSGINKLPATVKSQTADDIFNLAPGSCNASFLGNVNFGSYSQGFYPTD-GVDLSFAGGHDVAFSQNEGKLMISSRTGSSGC
        ERTKQE  DEKS+GTDDV CSGINKL  TV  QT+DDIFNLAPGSCNASFLGNV+FG Y QGF  TD GVDLSFAGGH+V F Q + KLMISS TGSSGC
Subjt:  ERTKQEDHDEKSQGTDDVVCSGINKLPATVKSQTADDIFNLAPGSCNASFLGNVNFGSYSQGFYPTD-GVDLSFAGGHDVAFSQNEGKLMISSRTGSSGC

Query:  SGSSTACLDDENMSDDRPMKRKRLSSCDSSKTGSELNESKIIPFHEGNKVDTLVTEKRLRKPPRRYSEESIEQKSRSTIKKSAHKASKDKSLPSESHRQN
        SGSSTACL+DENMSDDRP+KRKRLSSCDSSK  SE NESKI+PF+EGNKV+TLVTEKRLRKPPRRYSEESIEQKSRS  KKSA KASKDKSLPS  HR  
Subjt:  SGSSTACLDDENMSDDRPMKRKRLSSCDSSKTGSELNESKIIPFHEGNKVDTLVTEKRLRKPPRRYSEESIEQKSRSTIKKSAHKASKDKSLPSESHRQN

Query:  HCQKKLKAAPIIHKDKSFNGGCIQVPFGLPIEEGQVHSAKKRTCWDSEGMKDNRILCIKDKNDVESYSAESEDENTEDECITKGNNTQKGSSRRKHHISW
          QKKLKAAPI+ KDKSFNGGCIQVPFGLPIEEG  HS KKR CW+ E +KDNRILCIKDKNDVES+SAESEDENTEDEC+TKGN TQKG+SRRKHHISW
Subjt:  HCQKKLKAAPIIHKDKSFNGGCIQVPFGLPIEEGQVHSAKKRTCWDSEGMKDNRILCIKDKNDVESYSAESEDENTEDECITKGNNTQKGSSRRKHHISW

Query:  TLSEVMKLVEGVSEYGVGRWTEIKRLQFSSSSHRTSVDLKDKWRNLLKASDTQLQNRRKVVLGRKQASQQVPESVLRRVRELAAIYPYPRENKSKESCSS
        TLSEVMKLVEGVSEYGVGRWTEIKRLQFSSSSHRTSVDLKDKWRNLLKASDTQLQNRRKVVLGRKQA QQVPESVL RVRELAAIYPYPRENKSKE+CS 
Subjt:  TLSEVMKLVEGVSEYGVGRWTEIKRLQFSSSSHRTSVDLKDKWRNLLKASDTQLQNRRKVVLGRKQASQQVPESVLRRVRELAAIYPYPRENKSKESCSS

Query:  SAPSTSSFKT-TSNMFVSLPTVM
         AP+TSSFK+ TSNM V LPTVM
Subjt:  SAPSTSSFKT-TSNMFVSLPTVM

XP_022152933.1 uncharacterized protein LOC111020545 [Momordica charantia]0.097.31Show/hide
Query:  MNAASKETMMTSNQTTKMFTYEFPDVDSDLSIFPVPEEDVLGVEHFLEPDFNKCFSGSVLDFNTFHSHKCLSIENFSFAVEFDQIKIDSESLHSSLTLEG
        MNAASKETMMTSNQTTKMFTYEFPDVDSDLSIFPVPEEDVLGVEHFLEPDFNKCFSGSVLDFNTFHSHKCLSIENFSFAVEFDQIKIDSESLHSSLTLEG
Subjt:  MNAASKETMMTSNQTTKMFTYEFPDVDSDLSIFPVPEEDVLGVEHFLEPDFNKCFSGSVLDFNTFHSHKCLSIENFSFAVEFDQIKIDSESLHSSLTLEG

Query:  ERTKQEDHDEKSQGTDDVVCSGINKLPATVKSQTADDIFNLAPGSCNASFLGNVNFGSYSQGFYPTDGVDLSFAGGHDVAFSQNEGKLMISSRTGSSGCS
        ERTKQEDHDEKSQGTDDVVCSGINKLPATVKSQTADDIFNLAPGSCNASFLGNVNFGSYSQG              HDVAFSQNEGKLMISSRTGSSGCS
Subjt:  ERTKQEDHDEKSQGTDDVVCSGINKLPATVKSQTADDIFNLAPGSCNASFLGNVNFGSYSQGFYPTDGVDLSFAGGHDVAFSQNEGKLMISSRTGSSGCS

Query:  GSSTACLDDENMSDDRPMKRKRLSSCDSSKTGSELNESKIIPFHEGNKVDTLVTEKRLRKPPRRYSEESIEQKSRSTIKKSAHKASKDKSLPSESHRQNH
        GSSTACLDDENMSDDRPMKRKRLSSCDSSKTGSELNESKIIPFHEGNKVDTLVTEKRLRKPPRRYSEESIEQKSRSTIKKSAHKASKDKSLPSESHRQNH
Subjt:  GSSTACLDDENMSDDRPMKRKRLSSCDSSKTGSELNESKIIPFHEGNKVDTLVTEKRLRKPPRRYSEESIEQKSRSTIKKSAHKASKDKSLPSESHRQNH

Query:  CQKKLKAAPIIHKDKSFNGGCIQVPFGLPIEEGQVHSAKKRTCWDSEGMKDNRILCIKDKNDVESYSAESEDENTEDECITKGNNTQKGSSRRKHHISWT
        CQKKLKAAPIIHKDKSFNGGCIQVPFGLPIEEGQVHSAKKRTCWDSEGMKDNRILCIKDKNDVESYSAESEDENTEDECITKGNNTQKGSSRRKHHISWT
Subjt:  CQKKLKAAPIIHKDKSFNGGCIQVPFGLPIEEGQVHSAKKRTCWDSEGMKDNRILCIKDKNDVESYSAESEDENTEDECITKGNNTQKGSSRRKHHISWT

Query:  LSEVMKLVEGVSEYGVGRWTEIKRLQFSSSSHRTSVDLKDKWRNLLKASDTQLQNRRKVVLGRKQASQQVPESVLRRVRELAAIYPYPRENKSKESCSSS
        LSEVMKLVEGVSEYGVGRWTEIKRLQFSSSSHRTSVDLKDKWRNLLKASDTQLQNRRKVVLGRKQASQQVPESVLRRVRELAAIYPYPRENKSKESCSSS
Subjt:  LSEVMKLVEGVSEYGVGRWTEIKRLQFSSSSHRTSVDLKDKWRNLLKASDTQLQNRRKVVLGRKQASQQVPESVLRRVRELAAIYPYPRENKSKESCSSS

Query:  APSTSSFKTTSNMFVSLPTVM
        APSTSSFKTTSNMFVSLPTVM
Subjt:  APSTSSFKTTSNMFVSLPTVM

XP_022998873.1 uncharacterized protein LOC111493398 isoform X3 [Cucurbita maxima]1.39e-30987Show/hide
Query:  MNAASKETMMTSNQTTKMFTYEFPDVDSDLSIFPVPEEDVLGVEHFLEPDFNKCFSGSVLDFNTFHSHKCLSIENFSFAVEFDQIKIDSESLHSSLTLEG
        MNAASKETMMTSNQ TKMFTYEFPDVDSDLSIFP PE+DVLGVEHFL+PD+NKCF GSVLDFNTFHSHKCLSIENFSFAVEFDQIKIDSESLHSSLTLEG
Subjt:  MNAASKETMMTSNQTTKMFTYEFPDVDSDLSIFPVPEEDVLGVEHFLEPDFNKCFSGSVLDFNTFHSHKCLSIENFSFAVEFDQIKIDSESLHSSLTLEG

Query:  ERTKQEDHDEKSQGTDDVVCSGINKLPATVKSQTADDIFNLAPGSCNASFLGNVNFGSYSQGFYPTD-GVDLSFAGGHDVAFSQNEGKLMISSRTGSSGC
        ERTKQE  DEKS+GTDDVVCSGINKL  TV  Q  DDIFNLAPGSCNASFLGNV+FG Y QGF  TD GVDLSFAGGH+VAF Q +GKLMISS TGSSGC
Subjt:  ERTKQEDHDEKSQGTDDVVCSGINKLPATVKSQTADDIFNLAPGSCNASFLGNVNFGSYSQGFYPTD-GVDLSFAGGHDVAFSQNEGKLMISSRTGSSGC

Query:  SGSSTACLDDENMSDDRPMKRKRLSSCDSSKTGSELNESKIIPFHEGNKVDTLVTEKRLRKPPRRYSEESIEQKSRSTIKKSAHKASKDKSLPSESHRQN
        SGSSTACL+DENMSDDRP+KRKRLSSCDSSK  SE NESKI+PF+EGNKV+T VTEKRLRKPPRRYSEESIEQKSRS  KKSA KASKDKSLPS  H  +
Subjt:  SGSSTACLDDENMSDDRPMKRKRLSSCDSSKTGSELNESKIIPFHEGNKVDTLVTEKRLRKPPRRYSEESIEQKSRSTIKKSAHKASKDKSLPSESHRQN

Query:  HCQKKLKAAPIIHKDKSFNGGCIQVPFGLPIEEGQVHSAKKRTCWDSEGMKDNRILCIKDKNDVESYSAESEDENTEDECITKGNNTQKGSSRRKHHISW
          QKKLKAAPI+ KDKSFNGGCIQVPFGLPIEEG  HS KKR CW+ E +KDNRILCIKDKNDVES+SAESEDENTEDEC+TKGN TQKG+SRRKHHISW
Subjt:  HCQKKLKAAPIIHKDKSFNGGCIQVPFGLPIEEGQVHSAKKRTCWDSEGMKDNRILCIKDKNDVESYSAESEDENTEDECITKGNNTQKGSSRRKHHISW

Query:  TLSEVMKLVEGVSEYGVGRWTEIKRLQFSSSSHRTSVDLKDKWRNLLKASDTQLQNRRKVVLGRKQASQQVPESVLRRVRELAAIYPYPRENKSKESCSS
        TLSEVMKLVEGVSEYGVGRWTEIKRLQFSSSSHRTSVDLKDKWRNLLKASDTQLQNRRKVVLGRKQA QQVPESVL RVRELAAIYPYPRENKSKE+CS 
Subjt:  TLSEVMKLVEGVSEYGVGRWTEIKRLQFSSSSHRTSVDLKDKWRNLLKASDTQLQNRRKVVLGRKQASQQVPESVLRRVRELAAIYPYPRENKSKESCSS

Query:  SAPSTSSFK-TTSNMFVSLPTVM
         AP+TSSFK TTSNM V LPTVM
Subjt:  SAPSTSSFK-TTSNMFVSLPTVM

XP_038894299.1 uncharacterized protein LOC120082937 isoform X1 [Benincasa hispida]3.98e-30986.64Show/hide
Query:  MNAASKETMMTSNQTTKMFTYEFPDVDSDLSIFPVPEEDVLGVEHFLEPDFNKCFSGSVLDFNTFHSHKCLSIENFSFAVEFDQIKIDSESLHSSLTLEG
        MNAASKETMMTSNQ TKMFTYEFPDVDSDLSIFPVPE+DVLGVEHFLEPDFNKCFSGSVLDFNTFHSHKCL IE+FSFAV+FDQIKIDSESLHSSLTLEG
Subjt:  MNAASKETMMTSNQTTKMFTYEFPDVDSDLSIFPVPEEDVLGVEHFLEPDFNKCFSGSVLDFNTFHSHKCLSIENFSFAVEFDQIKIDSESLHSSLTLEG

Query:  ERTKQEDHDEKSQGTDDVVCSGINKLPATVKSQTADDIFNLAPGSCNASFLGNVNFGSYSQGFYPTD-GVDLSFAGGHDVAFSQNEGKLMISSRTGSSGC
        ERTKQEDHDEK +  D  VCSGINK P  V  +T DDIFNLAPGS NASFLGNV+FGSYSQGF  TD GVDLSF GGH VAF Q EGKLMISS TGSSGC
Subjt:  ERTKQEDHDEKSQGTDDVVCSGINKLPATVKSQTADDIFNLAPGSCNASFLGNVNFGSYSQGFYPTD-GVDLSFAGGHDVAFSQNEGKLMISSRTGSSGC

Query:  SGSSTACLDDENMSDDRPMKRKRLSSCDSSKTGSELNESKIIPFHEGNKVDTLVTEKRLRKPPRRYSEESIEQKSRSTIKKSAHKASKDKSLPSESHRQN
        SGS+TACLDDENMSDDRPMKRKRLSSCDSSKT  E NESKIIPF+EGN+ DT+VTEKRLRKPPRRYSEES+EQKSRS  K+S  KASKDK +PSESH+Q 
Subjt:  SGSSTACLDDENMSDDRPMKRKRLSSCDSSKTGSELNESKIIPFHEGNKVDTLVTEKRLRKPPRRYSEESIEQKSRSTIKKSAHKASKDKSLPSESHRQN

Query:  HCQKKLKAAPIIHKDKSFNGGCIQVPFGLPIEEGQVHSAKKRTCWDSEGMKDNRILCIKDKNDVESYSAESEDENTEDECITKGNNTQKGSSRRKHHISW
          QKK+KAAPI+HKDKSFNGGCIQVPFGLPIEEG  HSAKKR CW+ E +KDNRILCIKDK DVES+SAESEDENTEDEC+TK N+TQKG+SRRKHHISW
Subjt:  HCQKKLKAAPIIHKDKSFNGGCIQVPFGLPIEEGQVHSAKKRTCWDSEGMKDNRILCIKDKNDVESYSAESEDENTEDECITKGNNTQKGSSRRKHHISW

Query:  TLSEVMKLVEGVSEYGVGRWTEIKRLQFSSSSHRTSVDLKDKWRNLLKASDTQLQNRRK-VVLGRKQASQQVPESVLRRVRELAAIYPYPRENKSKESCS
        TLSEVMKLVEGVSEYGVGRWTEIKRLQF+SSSHRTSVDLKDKWRNLLKASDTQLQNRRK VVLGRKQASQQVPESVL RVRELAAIYPYPRENKSKESCS
Subjt:  TLSEVMKLVEGVSEYGVGRWTEIKRLQFSSSSHRTSVDLKDKWRNLLKASDTQLQNRRK-VVLGRKQASQQVPESVLRRVRELAAIYPYPRENKSKESCS

Query:  SSAPSTSSFK-TTSNMFVSLPTVM
          APSTSSFK TT+NMFVSLPTVM
Subjt:  SSAPSTSSFK-TTSNMFVSLPTVM

XP_038894300.1 uncharacterized protein LOC120082937 isoform X2 [Benincasa hispida]5.71e-31186.81Show/hide
Query:  MNAASKETMMTSNQTTKMFTYEFPDVDSDLSIFPVPEEDVLGVEHFLEPDFNKCFSGSVLDFNTFHSHKCLSIENFSFAVEFDQIKIDSESLHSSLTLEG
        MNAASKETMMTSNQ TKMFTYEFPDVDSDLSIFPVPE+DVLGVEHFLEPDFNKCFSGSVLDFNTFHSHKCL IE+FSFAV+FDQIKIDSESLHSSLTLEG
Subjt:  MNAASKETMMTSNQTTKMFTYEFPDVDSDLSIFPVPEEDVLGVEHFLEPDFNKCFSGSVLDFNTFHSHKCLSIENFSFAVEFDQIKIDSESLHSSLTLEG

Query:  ERTKQEDHDEKSQGTDDVVCSGINKLPATVKSQTADDIFNLAPGSCNASFLGNVNFGSYSQGFYPTD-GVDLSFAGGHDVAFSQNEGKLMISSRTGSSGC
        ERTKQEDHDEK +  D  VCSGINK P  V  +T DDIFNLAPGS NASFLGNV+FGSYSQGF  TD GVDLSF GGH VAF Q EGKLMISS TGSSGC
Subjt:  ERTKQEDHDEKSQGTDDVVCSGINKLPATVKSQTADDIFNLAPGSCNASFLGNVNFGSYSQGFYPTD-GVDLSFAGGHDVAFSQNEGKLMISSRTGSSGC

Query:  SGSSTACLDDENMSDDRPMKRKRLSSCDSSKTGSELNESKIIPFHEGNKVDTLVTEKRLRKPPRRYSEESIEQKSRSTIKKSAHKASKDKSLPSESHRQN
        SGS+TACLDDENMSDDRPMKRKRLSSCDSSKT  E NESKIIPF+EGN+ DT+VTEKRLRKPPRRYSEES+EQKSRS  K+S  KASKDK +PSESH+Q 
Subjt:  SGSSTACLDDENMSDDRPMKRKRLSSCDSSKTGSELNESKIIPFHEGNKVDTLVTEKRLRKPPRRYSEESIEQKSRSTIKKSAHKASKDKSLPSESHRQN

Query:  HCQKKLKAAPIIHKDKSFNGGCIQVPFGLPIEEGQVHSAKKRTCWDSEGMKDNRILCIKDKNDVESYSAESEDENTEDECITKGNNTQKGSSRRKHHISW
          QKK+KAAPI+HKDKSFNGGCIQVPFGLPIEEG  HSAKKR CW+ E +KDNRILCIKDK DVES+SAESEDENTEDEC+TK N+TQKG+SRRKHHISW
Subjt:  HCQKKLKAAPIIHKDKSFNGGCIQVPFGLPIEEGQVHSAKKRTCWDSEGMKDNRILCIKDKNDVESYSAESEDENTEDECITKGNNTQKGSSRRKHHISW

Query:  TLSEVMKLVEGVSEYGVGRWTEIKRLQFSSSSHRTSVDLKDKWRNLLKASDTQLQNRRKVVLGRKQASQQVPESVLRRVRELAAIYPYPRENKSKESCSS
        TLSEVMKLVEGVSEYGVGRWTEIKRLQF+SSSHRTSVDLKDKWRNLLKASDTQLQNRRKVVLGRKQASQQVPESVL RVRELAAIYPYPRENKSKESCS 
Subjt:  TLSEVMKLVEGVSEYGVGRWTEIKRLQFSSSSHRTSVDLKDKWRNLLKASDTQLQNRRKVVLGRKQASQQVPESVLRRVRELAAIYPYPRENKSKESCSS

Query:  SAPSTSSFK-TTSNMFVSLPTVM
         APSTSSFK TT+NMFVSLPTVM
Subjt:  SAPSTSSFK-TTSNMFVSLPTVM

TrEMBL top hitse value%identityAlignment
A0A5D3DCR8 Telomere repeat-binding protein 1-like isoform X22.51e-30786.81Show/hide
Query:  MNAASKETMMTSNQTTKMFTYEFPDVDSDLSIFPVPEEDVLGVEHFLEPDFNKCFSGSVLDFNTFHSHKCLSIENFSFAVEFDQIKIDSESLHSSLTLEG
        MNAASKETMMTSNQ TKMFTYEFPDVDSDLSIFPVPE+DVLGVEHFLEPDFNKCFSGSVLDFNTFHSHKCL IE+FSFAV+FDQIKIDSESLHSSLTLEG
Subjt:  MNAASKETMMTSNQTTKMFTYEFPDVDSDLSIFPVPEEDVLGVEHFLEPDFNKCFSGSVLDFNTFHSHKCLSIENFSFAVEFDQIKIDSESLHSSLTLEG

Query:  ERTKQEDHDEKSQGTDDVVCSGINKLPATVKSQTADDIFNLAPGSCNASFLGNVNFGSYSQGFYPTD-GVDLSFAGGHDVAFSQNEGKLMISSRTGSSGC
        ERTKQE HDEK +  D  VCSGINK P  V  +T DDIFNLAP S NASFLGNV+FGSYSQGF  TD GVDLSFAGGH+VAF Q EGK MISS TGSSGC
Subjt:  ERTKQEDHDEKSQGTDDVVCSGINKLPATVKSQTADDIFNLAPGSCNASFLGNVNFGSYSQGFYPTD-GVDLSFAGGHDVAFSQNEGKLMISSRTGSSGC

Query:  SGSSTACLDDENMSDDRPMKRKRLSSCDSSKTGSELNESKIIPFHEGNKVDTLVTEKRLRKPPRRYSEESIEQKSRSTIKKSAHKASKDKSLPSESHRQN
        SGS+TACLDDENMSDDRPMKRKRLSS DS KT  E NESKI PF+EGN+ DTL+TEKRLRKPPRRYSEES+EQKSRS  KKSA KASKDKSL SESH+Q 
Subjt:  SGSSTACLDDENMSDDRPMKRKRLSSCDSSKTGSELNESKIIPFHEGNKVDTLVTEKRLRKPPRRYSEESIEQKSRSTIKKSAHKASKDKSLPSESHRQN

Query:  HCQKKLKAAPIIHKDKSFNGGCIQVPFGLPIEEGQVHSAKKRTCWDSEGMKDNRILCIKDKNDVESYSAESEDENTEDECITKGNNTQKGSSRRKHHISW
          QKK+KAAPIIHKDKSFNGGCIQVPFGLPIEEG  HSAKKRTCW+ E +KDNRILCIKDK +VES+SAESEDENTEDEC TKGN+TQKG+SRRKHHISW
Subjt:  HCQKKLKAAPIIHKDKSFNGGCIQVPFGLPIEEGQVHSAKKRTCWDSEGMKDNRILCIKDKNDVESYSAESEDENTEDECITKGNNTQKGSSRRKHHISW

Query:  TLSEVMKLVEGVSEYGVGRWTEIKRLQFSSSSHRTSVDLKDKWRNLLKASDTQLQNRRKVVLGRKQASQQVPESVLRRVRELAAIYPYPRENKSKESCSS
        TLSEVMKLVEGVSEYGVGRWTEIKRLQF+SSSHRTSVDLKDKWRNLLKASDTQLQNRRKVVLGRKQASQQVPESVL RVRELAAIYPYPRENKSKESCS 
Subjt:  TLSEVMKLVEGVSEYGVGRWTEIKRLQFSSSSHRTSVDLKDKWRNLLKASDTQLQNRRKVVLGRKQASQQVPESVLRRVRELAAIYPYPRENKSKESCSS

Query:  SAPSTSSFK-TTSNMFVSLPTVM
         APSTSSFK TT+NMFVSLPTVM
Subjt:  SAPSTSSFK-TTSNMFVSLPTVM

A0A6J1DG71 uncharacterized protein LOC1110205450.097.31Show/hide
Query:  MNAASKETMMTSNQTTKMFTYEFPDVDSDLSIFPVPEEDVLGVEHFLEPDFNKCFSGSVLDFNTFHSHKCLSIENFSFAVEFDQIKIDSESLHSSLTLEG
        MNAASKETMMTSNQTTKMFTYEFPDVDSDLSIFPVPEEDVLGVEHFLEPDFNKCFSGSVLDFNTFHSHKCLSIENFSFAVEFDQIKIDSESLHSSLTLEG
Subjt:  MNAASKETMMTSNQTTKMFTYEFPDVDSDLSIFPVPEEDVLGVEHFLEPDFNKCFSGSVLDFNTFHSHKCLSIENFSFAVEFDQIKIDSESLHSSLTLEG

Query:  ERTKQEDHDEKSQGTDDVVCSGINKLPATVKSQTADDIFNLAPGSCNASFLGNVNFGSYSQGFYPTDGVDLSFAGGHDVAFSQNEGKLMISSRTGSSGCS
        ERTKQEDHDEKSQGTDDVVCSGINKLPATVKSQTADDIFNLAPGSCNASFLGNVNFGSYSQG              HDVAFSQNEGKLMISSRTGSSGCS
Subjt:  ERTKQEDHDEKSQGTDDVVCSGINKLPATVKSQTADDIFNLAPGSCNASFLGNVNFGSYSQGFYPTDGVDLSFAGGHDVAFSQNEGKLMISSRTGSSGCS

Query:  GSSTACLDDENMSDDRPMKRKRLSSCDSSKTGSELNESKIIPFHEGNKVDTLVTEKRLRKPPRRYSEESIEQKSRSTIKKSAHKASKDKSLPSESHRQNH
        GSSTACLDDENMSDDRPMKRKRLSSCDSSKTGSELNESKIIPFHEGNKVDTLVTEKRLRKPPRRYSEESIEQKSRSTIKKSAHKASKDKSLPSESHRQNH
Subjt:  GSSTACLDDENMSDDRPMKRKRLSSCDSSKTGSELNESKIIPFHEGNKVDTLVTEKRLRKPPRRYSEESIEQKSRSTIKKSAHKASKDKSLPSESHRQNH

Query:  CQKKLKAAPIIHKDKSFNGGCIQVPFGLPIEEGQVHSAKKRTCWDSEGMKDNRILCIKDKNDVESYSAESEDENTEDECITKGNNTQKGSSRRKHHISWT
        CQKKLKAAPIIHKDKSFNGGCIQVPFGLPIEEGQVHSAKKRTCWDSEGMKDNRILCIKDKNDVESYSAESEDENTEDECITKGNNTQKGSSRRKHHISWT
Subjt:  CQKKLKAAPIIHKDKSFNGGCIQVPFGLPIEEGQVHSAKKRTCWDSEGMKDNRILCIKDKNDVESYSAESEDENTEDECITKGNNTQKGSSRRKHHISWT

Query:  LSEVMKLVEGVSEYGVGRWTEIKRLQFSSSSHRTSVDLKDKWRNLLKASDTQLQNRRKVVLGRKQASQQVPESVLRRVRELAAIYPYPRENKSKESCSSS
        LSEVMKLVEGVSEYGVGRWTEIKRLQFSSSSHRTSVDLKDKWRNLLKASDTQLQNRRKVVLGRKQASQQVPESVLRRVRELAAIYPYPRENKSKESCSSS
Subjt:  LSEVMKLVEGVSEYGVGRWTEIKRLQFSSSSHRTSVDLKDKWRNLLKASDTQLQNRRKVVLGRKQASQQVPESVLRRVRELAAIYPYPRENKSKESCSSS

Query:  APSTSSFKTTSNMFVSLPTVM
        APSTSSFKTTSNMFVSLPTVM
Subjt:  APSTSSFKTTSNMFVSLPTVM

A0A6J1GAF3 uncharacterized protein LOC111452393 isoform X36.42e-30886.42Show/hide
Query:  MNAASKETMMTSNQTTKMFTYEFPDVDSDLSIFPVPEEDVLGVEHFLEPDFNKCFSGSVLDFNTFHSHKCLSIENFSFAVEFDQIKIDSESLHSSLTLEG
        MNAASKETMMTSNQ TKMFTYEFPDVDSDLSIFP PE+DVLGVEHFL+PD+NKCF GSVLDFNTFHSHKCLSIENFSFAVEFDQIKIDSESLHSSLTLEG
Subjt:  MNAASKETMMTSNQTTKMFTYEFPDVDSDLSIFPVPEEDVLGVEHFLEPDFNKCFSGSVLDFNTFHSHKCLSIENFSFAVEFDQIKIDSESLHSSLTLEG

Query:  ERTKQEDHDEKSQGTDDVVCSGINKLPATVKSQTADDIFNLAPGSCNASFLGNVNFGSYSQGFYPTD-GVDLSFAGGHDVAFSQNEGKLMISSRTGSSGC
        ERTKQE  DEKS+GTDDV CSGINKL  TV  QT DDIFNLAPGSCNASFLGNV+FG Y QGF  TD GVDLSFAGGH+V F Q + KLMISS TGSSGC
Subjt:  ERTKQEDHDEKSQGTDDVVCSGINKLPATVKSQTADDIFNLAPGSCNASFLGNVNFGSYSQGFYPTD-GVDLSFAGGHDVAFSQNEGKLMISSRTGSSGC

Query:  SGSSTACLDDENMSDDRPMKRKRLSSCDSSKTGSELNESKIIPFHEGNKVDTLVTEKRLRKPPRRYSEESIEQKSRSTIKKSAHKASKDKSLPSESHRQN
        SGSSTACL+DENMSDDRP+KRKRLSSCDSSK  SE  ESKI+PF+EGNKV+TLVTEKRLRKPPRRYSEESIEQKSRS  KKSA KASKDKSLPS  HR  
Subjt:  SGSSTACLDDENMSDDRPMKRKRLSSCDSSKTGSELNESKIIPFHEGNKVDTLVTEKRLRKPPRRYSEESIEQKSRSTIKKSAHKASKDKSLPSESHRQN

Query:  HCQKKLKAAPIIHKDKSFNGGCIQVPFGLPIEEGQVHSAKKRTCWDSEGMKDNRILCIKDKNDVESYSAESEDENTEDECITKGNNTQKGSSRRKHHISW
          QKKLKAAPI+ KDKSFNGGCIQVPFGLPIEEG  HS KKR CW+ E +KDNRILCIKDKNDVES+SAESEDENTEDEC+TKGN TQKG+SRRKHHISW
Subjt:  HCQKKLKAAPIIHKDKSFNGGCIQVPFGLPIEEGQVHSAKKRTCWDSEGMKDNRILCIKDKNDVESYSAESEDENTEDECITKGNNTQKGSSRRKHHISW

Query:  TLSEVMKLVEGVSEYGVGRWTEIKRLQFSSSSHRTSVDLKDKWRNLLKASDTQLQNRRKVVLGRKQASQQVPESVLRRVRELAAIYPYPRENKSKESCSS
        TLSEVMKLVEGVS+YGVGRWTEIKRLQFSSSSHRTSVDLKDKWRNLLKASDTQLQNRRKVVLGRKQA QQVPESVL RVRELAAIYPYPRENKSKE+CS 
Subjt:  TLSEVMKLVEGVSEYGVGRWTEIKRLQFSSSSHRTSVDLKDKWRNLLKASDTQLQNRRKVVLGRKQASQQVPESVLRRVRELAAIYPYPRENKSKESCSS

Query:  SAPSTSSFKT-TSNMFVSLPTVM
         AP+TSSFK+ TSNM V LPTVM
Subjt:  SAPSTSSFKT-TSNMFVSLPTVM

A0A6J1K971 uncharacterized protein LOC111493398 isoform X26.92e-30886.67Show/hide
Query:  MNAASKETMMTSNQTTKMFTYEFPDVDSDLSIFPVPEEDVLGVEHFLEPDFNKCFSGSVLDFNTFHSHKCLSIENFSFAVEFDQIKIDSESLHSSLTLEG
        MNAASKETMMTSNQ TKMFTYEFPDVDSDLSIFP PE+DVLGVEHFL+PD+NKCF GSVLDFNTFHSHKCLSIENFSFAVEFDQIKIDSESLHSSLTLEG
Subjt:  MNAASKETMMTSNQTTKMFTYEFPDVDSDLSIFPVPEEDVLGVEHFLEPDFNKCFSGSVLDFNTFHSHKCLSIENFSFAVEFDQIKIDSESLHSSLTLEG

Query:  ERTKQEDHDEKSQGTDDVVCSGINKLPATVKSQTADDIFNLAPGSCNASFLGNVNFGSYSQGFYPTD-GVDLSFAG--GHDVAFSQNEGKLMISSRTGSS
        ERTKQE  DEKS+GTDDVVCSGINKL  TV  Q  DDIFNLAPGSCNASFLGNV+FG Y QGF  TD GVDLSFAG  GH+VAF Q +GKLMISS TGSS
Subjt:  ERTKQEDHDEKSQGTDDVVCSGINKLPATVKSQTADDIFNLAPGSCNASFLGNVNFGSYSQGFYPTD-GVDLSFAG--GHDVAFSQNEGKLMISSRTGSS

Query:  GCSGSSTACLDDENMSDDRPMKRKRLSSCDSSKTGSELNESKIIPFHEGNKVDTLVTEKRLRKPPRRYSEESIEQKSRSTIKKSAHKASKDKSLPSESHR
        GCSGSSTACL+DENMSDDRP+KRKRLSSCDSSK  SE NESKI+PF+EGNKV+T VTEKRLRKPPRRYSEESIEQKSRS  KKSA KASKDKSLPS  H 
Subjt:  GCSGSSTACLDDENMSDDRPMKRKRLSSCDSSKTGSELNESKIIPFHEGNKVDTLVTEKRLRKPPRRYSEESIEQKSRSTIKKSAHKASKDKSLPSESHR

Query:  QNHCQKKLKAAPIIHKDKSFNGGCIQVPFGLPIEEGQVHSAKKRTCWDSEGMKDNRILCIKDKNDVESYSAESEDENTEDECITKGNNTQKGSSRRKHHI
         +  QKKLKAAPI+ KDKSFNGGCIQVPFGLPIEEG  HS KKR CW+ E +KDNRILCIKDKNDVES+SAESEDENTEDEC+TKGN TQKG+SRRKHHI
Subjt:  QNHCQKKLKAAPIIHKDKSFNGGCIQVPFGLPIEEGQVHSAKKRTCWDSEGMKDNRILCIKDKNDVESYSAESEDENTEDECITKGNNTQKGSSRRKHHI

Query:  SWTLSEVMKLVEGVSEYGVGRWTEIKRLQFSSSSHRTSVDLKDKWRNLLKASDTQLQNRRKVVLGRKQASQQVPESVLRRVRELAAIYPYPRENKSKESC
        SWTLSEVMKLVEGVSEYGVGRWTEIKRLQFSSSSHRTSVDLKDKWRNLLKASDTQLQNRRKVVLGRKQA QQVPESVL RVRELAAIYPYPRENKSKE+C
Subjt:  SWTLSEVMKLVEGVSEYGVGRWTEIKRLQFSSSSHRTSVDLKDKWRNLLKASDTQLQNRRKVVLGRKQASQQVPESVLRRVRELAAIYPYPRENKSKESC

Query:  SSSAPSTSSFK-TTSNMFVSLPTVM
        S  AP+TSSFK TTSNM V LPTVM
Subjt:  SSSAPSTSSFK-TTSNMFVSLPTVM

A0A6J1KI08 uncharacterized protein LOC111493398 isoform X36.73e-31087Show/hide
Query:  MNAASKETMMTSNQTTKMFTYEFPDVDSDLSIFPVPEEDVLGVEHFLEPDFNKCFSGSVLDFNTFHSHKCLSIENFSFAVEFDQIKIDSESLHSSLTLEG
        MNAASKETMMTSNQ TKMFTYEFPDVDSDLSIFP PE+DVLGVEHFL+PD+NKCF GSVLDFNTFHSHKCLSIENFSFAVEFDQIKIDSESLHSSLTLEG
Subjt:  MNAASKETMMTSNQTTKMFTYEFPDVDSDLSIFPVPEEDVLGVEHFLEPDFNKCFSGSVLDFNTFHSHKCLSIENFSFAVEFDQIKIDSESLHSSLTLEG

Query:  ERTKQEDHDEKSQGTDDVVCSGINKLPATVKSQTADDIFNLAPGSCNASFLGNVNFGSYSQGFYPTD-GVDLSFAGGHDVAFSQNEGKLMISSRTGSSGC
        ERTKQE  DEKS+GTDDVVCSGINKL  TV  Q  DDIFNLAPGSCNASFLGNV+FG Y QGF  TD GVDLSFAGGH+VAF Q +GKLMISS TGSSGC
Subjt:  ERTKQEDHDEKSQGTDDVVCSGINKLPATVKSQTADDIFNLAPGSCNASFLGNVNFGSYSQGFYPTD-GVDLSFAGGHDVAFSQNEGKLMISSRTGSSGC

Query:  SGSSTACLDDENMSDDRPMKRKRLSSCDSSKTGSELNESKIIPFHEGNKVDTLVTEKRLRKPPRRYSEESIEQKSRSTIKKSAHKASKDKSLPSESHRQN
        SGSSTACL+DENMSDDRP+KRKRLSSCDSSK  SE NESKI+PF+EGNKV+T VTEKRLRKPPRRYSEESIEQKSRS  KKSA KASKDKSLPS  H  +
Subjt:  SGSSTACLDDENMSDDRPMKRKRLSSCDSSKTGSELNESKIIPFHEGNKVDTLVTEKRLRKPPRRYSEESIEQKSRSTIKKSAHKASKDKSLPSESHRQN

Query:  HCQKKLKAAPIIHKDKSFNGGCIQVPFGLPIEEGQVHSAKKRTCWDSEGMKDNRILCIKDKNDVESYSAESEDENTEDECITKGNNTQKGSSRRKHHISW
          QKKLKAAPI+ KDKSFNGGCIQVPFGLPIEEG  HS KKR CW+ E +KDNRILCIKDKNDVES+SAESEDENTEDEC+TKGN TQKG+SRRKHHISW
Subjt:  HCQKKLKAAPIIHKDKSFNGGCIQVPFGLPIEEGQVHSAKKRTCWDSEGMKDNRILCIKDKNDVESYSAESEDENTEDECITKGNNTQKGSSRRKHHISW

Query:  TLSEVMKLVEGVSEYGVGRWTEIKRLQFSSSSHRTSVDLKDKWRNLLKASDTQLQNRRKVVLGRKQASQQVPESVLRRVRELAAIYPYPRENKSKESCSS
        TLSEVMKLVEGVSEYGVGRWTEIKRLQFSSSSHRTSVDLKDKWRNLLKASDTQLQNRRKVVLGRKQA QQVPESVL RVRELAAIYPYPRENKSKE+CS 
Subjt:  TLSEVMKLVEGVSEYGVGRWTEIKRLQFSSSSHRTSVDLKDKWRNLLKASDTQLQNRRKVVLGRKQASQQVPESVLRRVRELAAIYPYPRENKSKESCSS

Query:  SAPSTSSFK-TTSNMFVSLPTVM
         AP+TSSFK TTSNM V LPTVM
Subjt:  SAPSTSSFK-TTSNMFVSLPTVM

SwissProt top hitse value%identityAlignment
Q6R0E3 Telomere repeat-binding protein 55.7e-0735.23Show/hide
Query:  SRRKHHISWTLSEVMKLVEGVSEYGVGRWTEIKRLQFSSSSHRTSVDLKDKWRNLLKASDTQLQNRRKVVLGRKQASQQVPESVLRRV
        ++R+    ++++EV  LV+ V   G GRW ++K   F ++ HRT VDLKDKW+ L+  +    Q RR          + VP+ +L RV
Subjt:  SRRKHHISWTLSEVMKLVEGVSEYGVGRWTEIKRLQFSSSSHRTSVDLKDKWRNLLKASDTQLQNRRKVVLGRKQASQQVPESVLRRV

Q9C7B1 Telomere repeat-binding protein 38.7e-0836.36Show/hide
Query:  SRRKHHISWTLSEVMKLVEGVSEYGVGRWTEIKRLQFSSSSHRTSVDLKDKWRNLLKASDTQLQNRRKVVLGRKQASQQVPESVLRRV
        ++R+    ++++EV  LV+ V E G GRW ++K   F  + HRT VDLKDKW+ L+  +    Q RR          + VP+ +L RV
Subjt:  SRRKHHISWTLSEVMKLVEGVSEYGVGRWTEIKRLQFSSSSHRTSVDLKDKWRNLLKASDTQLQNRRKVVLGRKQASQQVPESVLRRV

Q9FFY9 Telomere repeat-binding protein 41.9e-0731.71Show/hide
Query:  SRRKHHISWTLSEVMKLVEGVSEYGVGRWTEIKRLQFSSSSHRTSVDLKDKWRNLLKASDTQLQNRRKVVLGRKQASQQVPESVLRRVRELAAIYPYPRE
        S+R+    ++++EV  LV  V E G GRW ++K   F ++SHRT VDLKDKW+ L+  +    Q RR          + VP+ +L RV      + Y  +
Subjt:  SRRKHHISWTLSEVMKLVEGVSEYGVGRWTEIKRLQFSSSSHRTSVDLKDKWRNLLKASDTQLQNRRKVVLGRKQASQQVPESVLRRVRELAAIYPYPRE

Query:  NKSKESCSSSAPSTSSFKTTSNM
        ++ K++      +T   +  S+M
Subjt:  NKSKESCSSSAPSTSSFKTTSNM

Q9M347 Telomere repeat-binding protein 63.3e-0736.78Show/hide
Query:  RRKHHISWTLSEVMKLVEGVSEYGVGRWTEIKRLQFSSSSHRTSVDLKDKWRNLLKASDTQLQNRRKVVLGRKQASQQVPESVLRRV
        +R+    +T+SEV  LV+ V   G GRW ++K   F+  +HRT VDLKDKW+ L+  +    + RR          + VP+ +L RV
Subjt:  RRKHHISWTLSEVMKLVEGVSEYGVGRWTEIKRLQFSSSSHRTSVDLKDKWRNLLKASDTQLQNRRKVVLGRKQASQQVPESVLRRV

Q9SNB9 Telomere repeat-binding protein 23.3e-0733.33Show/hide
Query:  QKGSSRRKHHISWTLSEVMKLVEGVSEYGVGRWTEIKRLQFSSSSHRTSVDLKDKWRNLLKASDTQLQNRRKVVLGRKQASQQVPESVLRRVRELAAIY
        Q+ +++R+    ++++EV  LV+ V + G GRW ++K   F  + HRT VDLKDKW+ L+  +    Q RR          + VP+ +L RV +  A +
Subjt:  QKGSSRRKHHISWTLSEVMKLVEGVSEYGVGRWTEIKRLQFSSSSHRTSVDLKDKWRNLLKASDTQLQNRRKVVLGRKQASQQVPESVLRRVRELAAIY

Arabidopsis top hitse value%identityAlignment
AT1G17460.1 TRF-like 35.4e-2134.35Show/hide
Query:  KRLRKPPRRYSEE---------------------SIEQKSRSTIKKSAHKASKDKSLPSESHRQNHCQKKLKAAPIIHKDKSF-------NGGCIQVPFG
        KR+RKP RRY EE                     ++  + R  + +    A     +P  SH +    ++   A    + KS+        G     P  
Subjt:  KRLRKPPRRYSEE---------------------SIEQKSRSTIKKSAHKASKDKSLPSESHRQNHCQKKLKAAPIIHKDKSF-------NGGCIQVPFG

Query:  LPIEEGQVHSAKKRT-CWDSEGMKDNRILCIKD------KNDVESYSAESEDENTEDECITKGNNTQKGSSRRKHHISWTLSEVMKLVEGVSEYGVGRWT
        L  +  +V   K  + C   E  KD+      D      + ++   S +S D+N  D  IT      + +S RK H +WT+SEV KLVEGVS+YGVG+WT
Subjt:  LPIEEGQVHSAKKRT-CWDSEGMKDNRILCIKD------KNDVESYSAESEDENTEDECITKGNNTQKGSSRRKHHISWTLSEVMKLVEGVSEYGVGRWT

Query:  EIKRLQFSSSSHRTSVDLKDKWRNLLKASDTQLQNRRKVVLGRKQASQQVPESVLRRVRELA
        EIK+L FS  +HRT+VDLKDKWRNL KAS +   NR +  L +K  S  +P  ++ +VRELA
Subjt:  EIKRLQFSSSSHRTSVDLKDKWRNLLKASDTQLQNRRKVVLGRKQASQQVPESVLRRVRELA

AT1G72650.1 TRF-like 66.8e-2434.56Show/hide
Query:  KRLRKPPRRYSEESIEQKSRSTIKKSAHKASKDKSLPSESHRQNHCQKKLKAAPIIHKDKSFNGGCIQVPF-----------------------------
        KR+RKP RRY EE  E   +    KS    SKD+ L  +S  ++      K   +  +  S  G  I+VP+                             
Subjt:  KRLRKPPRRYSEESIEQKSRSTIKKSAHKASKDKSLPSESHRQNHCQKKLKAAPIIHKDKSFNGGCIQVPF-----------------------------

Query:  ----GLPIEEGQVHS--AKKRTCWDSEGMKDNRILCIKDKNDVESYSAESEDENTEDECITKGNNT----------QKGSSRRKHHISWTLSEVMKLVEG
             L +   Q+ S    + +   S            D+N+VE   +E + E   +   + GN++          Q G+ RRKHH +WTLSE+ KLVEG
Subjt:  ----GLPIEEGQVHS--AKKRTCWDSEGMKDNRILCIKDKNDVESYSAESEDENTEDECITKGNNT----------QKGSSRRKHHISWTLSEVMKLVEG

Query:  VSEYGVGRWTEIKRLQFSSSSHRTSVDLKDKWRNLLKASDTQLQNRRKVVLGRKQASQQVPESVLRRVRELA
        VS+YG G+W+EIK+  FSS S+RTSVDLKDKWRNLLK S  Q  +     L +K  S  +P  +L RVRELA
Subjt:  VSEYGVGRWTEIKRLQFSSSSHRTSVDLKDKWRNLLKASDTQLQNRRKVVLGRKQASQQVPESVLRRVRELA

AT1G72650.2 TRF-like 66.8e-2434.56Show/hide
Query:  KRLRKPPRRYSEESIEQKSRSTIKKSAHKASKDKSLPSESHRQNHCQKKLKAAPIIHKDKSFNGGCIQVPF-----------------------------
        KR+RKP RRY EE  E   +    KS    SKD+ L  +S  ++      K   +  +  S  G  I+VP+                             
Subjt:  KRLRKPPRRYSEESIEQKSRSTIKKSAHKASKDKSLPSESHRQNHCQKKLKAAPIIHKDKSFNGGCIQVPF-----------------------------

Query:  ----GLPIEEGQVHS--AKKRTCWDSEGMKDNRILCIKDKNDVESYSAESEDENTEDECITKGNNT----------QKGSSRRKHHISWTLSEVMKLVEG
             L +   Q+ S    + +   S            D+N+VE   +E + E   +   + GN++          Q G+ RRKHH +WTLSE+ KLVEG
Subjt:  ----GLPIEEGQVHS--AKKRTCWDSEGMKDNRILCIKDKNDVESYSAESEDENTEDECITKGNNT----------QKGSSRRKHHISWTLSEVMKLVEG

Query:  VSEYGVGRWTEIKRLQFSSSSHRTSVDLKDKWRNLLKASDTQLQNRRKVVLGRKQASQQVPESVLRRVRELA
        VS+YG G+W+EIK+  FSS S+RTSVDLKDKWRNLLK S  Q  +     L +K  S  +P  +L RVRELA
Subjt:  VSEYGVGRWTEIKRLQFSSSSHRTSVDLKDKWRNLLKASDTQLQNRRKVVLGRKQASQQVPESVLRRVRELA

AT2G37025.1 TRF-like 82.0e-2345.71Show/hide
Query:  ENTEDECITKGNNTQKGSSRRKHHISWTLSEVMKLVEGVSEYGVGRWTEIKRLQFSSSSHRTSVDLKDKWRNLLKASDTQLQNRRKVVLGRKQASQQVPE
        E+ +D  +TK + T+  S RRK+   WTL EVM LV+G+S +GVG+WT+IK   F  ++HR  VD++DKWRNLLKAS  +  N  +    RK  ++ +P+
Subjt:  ENTEDECITKGNNTQKGSSRRKHHISWTLSEVMKLVEGVSEYGVGRWTEIKRLQFSSSSHRTSVDLKDKWRNLLKASDTQLQNRRKVVLGRKQASQQVPE

Query:  SVLRRVRELAAIYPYPRENKSKESC----SSSAPSTSSFK
         +L RVRELA+++PYP    SK  C    SS + STS  K
Subjt:  SVLRRVRELAAIYPYPRENKSKESC----SSSAPSTSSFK

AT2G37025.2 TRF-like 82.0e-2345.71Show/hide
Query:  ENTEDECITKGNNTQKGSSRRKHHISWTLSEVMKLVEGVSEYGVGRWTEIKRLQFSSSSHRTSVDLKDKWRNLLKASDTQLQNRRKVVLGRKQASQQVPE
        E+ +D  +TK + T+  S RRK+   WTL EVM LV+G+S +GVG+WT+IK   F  ++HR  VD++DKWRNLLKAS  +  N  +    RK  ++ +P+
Subjt:  ENTEDECITKGNNTQKGSSRRKHHISWTLSEVMKLVEGVSEYGVGRWTEIKRLQFSSSSHRTSVDLKDKWRNLLKASDTQLQNRRKVVLGRKQASQQVPE

Query:  SVLRRVRELAAIYPYPRENKSKESC----SSSAPSTSSFK
         +L RVRELA+++PYP    SK  C    SS + STS  K
Subjt:  SVLRRVRELAAIYPYPRENKSKESC----SSSAPSTSSFK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATGCGGCATCCAAGGAAACTATGATGACTTCCAATCAGACTACGAAAATGTTTACTTATGAGTTTCCTGATGTTGACTCGGATTTATCTATTTTCCCTGTTCCAGA
AGAAGATGTCTTAGGAGTGGAACATTTCTTAGAACCTGACTTTAACAAATGCTTTTCTGGCAGTGTTCTAGATTTCAACACGTTCCACTCTCATAAGTGCTTAAGCATTG
AAAATTTTTCATTCGCTGTTGAATTTGATCAGATAAAGATTGATAGTGAATCATTGCATTCTAGCCTTACACTAGAAGGAGAGAGAACCAAGCAAGAGGACCATGATGAA
AAATCACAAGGCACAGATGATGTTGTCTGTTCTGGGATAAACAAACTTCCAGCAACGGTGAAGAGTCAAACTGCGGATGACATCTTCAATTTGGCTCCTGGTTCATGCAA
TGCTTCTTTTCTAGGCAATGTGAATTTTGGTAGCTACTCACAAGGTTTTTATCCAACAGACGGTGTTGATTTGTCTTTTGCTGGGGGGCATGATGTGGCTTTCAGCCAGA
ACGAAGGCAAACTTATGATCTCATCCAGAACTGGGTCGTCAGGCTGTTCAGGTAGCTCTACTGCATGCTTGGATGATGAGAATATGTCAGATGACAGGCCTATGAAAAGG
AAGCGACTTTCAAGTTGTGACTCCTCGAAGACAGGTTCTGAGCTTAATGAATCTAAAATCATTCCCTTCCATGAAGGAAACAAAGTAGATACACTTGTTACAGAAAAGAG
ATTGCGAAAGCCGCCTAGGAGATACAGCGAAGAGTCAATTGAACAAAAGTCAAGATCTACTATCAAGAAGAGTGCTCATAAAGCTTCCAAGGATAAGTCTCTCCCCTCTG
AATCTCACAGGCAGAATCACTGCCAAAAGAAACTTAAAGCAGCACCGATCATTCATAAAGATAAATCGTTTAATGGAGGTTGTATTCAGGTTCCATTCGGTCTTCCGATA
GAAGAAGGTCAAGTTCACTCGGCGAAAAAGAGAACATGTTGGGACTCAGAGGGGATGAAGGATAATAGAATCTTGTGCATAAAAGATAAAAACGATGTGGAGTCTTACTC
GGCCGAGTCAGAAGATGAGAATACTGAAGACGAGTGCATCACTAAAGGTAATAATACTCAAAAAGGCAGCAGTCGCAGGAAGCACCATATATCATGGACTCTTTCTGAGG
TTATGAAGTTGGTGGAAGGTGTTTCAGAGTATGGTGTCGGAAGGTGGACAGAAATTAAGAGGCTACAATTTTCATCATCTTCACATAGAACATCTGTGGATCTCAAGGAC
AAATGGAGAAATCTATTGAAGGCAAGTGACACACAGTTGCAGAACAGAAGAAAGGTCGTCCTTGGTCGAAAGCAGGCATCGCAGCAAGTACCGGAATCAGTTCTTCGCCG
AGTTCGGGAACTGGCCGCTATTTATCCATATCCAAGGGAAAACAAATCCAAGGAATCGTGCTCAAGCTCAGCCCCCTCAACTTCATCCTTCAAAACTACCAGTAACATGT
TTGTTTCTTTGCCCACAGTTATGTGA
mRNA sequenceShow/hide mRNA sequence
AAAGGAATGATTACAAAATCTAAATAAAAATTTGACTGATGAGAGAGAATGGAGGCAGAAGAACTAGAGTTGAATGGCCGGCGCGTGAGAAAGTAGAGCGCGTGTGGAAG
CTCAGCCTTCTTTTCTTTCACGCTTTACGAGTCAAAATGGACTCGTGAAGGACCGTATTGCATTGTTAAGGCGGCAAATGACGATGATTTGTCGCTTACCCCTCCACGGA
CTTCACTCTCCTCATTCAAATCCAATTCCCAAACCCATTTCAAATCCGTCACCTAAAAGTTCACCGCTTTGCTCACTGGTTCCCTCACTTGTTTTTCCCTCCTTTCGTCT
TCCTCTCAGCAAAATCCTCTCCGCTCTTCCCTTTTCAGTTTTGGGTCTTCCCGATTTTCTCTGTTGGACATTGCAATGTGGTGGATTTTTGCCTCTGTTCTCTGAAATTT
TGAGCTTCTTGCCCCGGACATGGATTTCTAGCCTTGGGTTGTTTCACAGCTCCAGCTTCGGCTTAGCATTGCCTTGAAGGAACGGTTCGGGTTGTAATTCACTACGCAAT
GAATGCGGCATCCAAGGAAACTATGATGACTTCCAATCAGACTACGAAAATGTTTACTTATGAGTTTCCTGATGTTGACTCGGATTTATCTATTTTCCCTGTTCCAGAAG
AAGATGTCTTAGGAGTGGAACATTTCTTAGAACCTGACTTTAACAAATGCTTTTCTGGCAGTGTTCTAGATTTCAACACGTTCCACTCTCATAAGTGCTTAAGCATTGAA
AATTTTTCATTCGCTGTTGAATTTGATCAGATAAAGATTGATAGTGAATCATTGCATTCTAGCCTTACACTAGAAGGAGAGAGAACCAAGCAAGAGGACCATGATGAAAA
ATCACAAGGCACAGATGATGTTGTCTGTTCTGGGATAAACAAACTTCCAGCAACGGTGAAGAGTCAAACTGCGGATGACATCTTCAATTTGGCTCCTGGTTCATGCAATG
CTTCTTTTCTAGGCAATGTGAATTTTGGTAGCTACTCACAAGGTTTTTATCCAACAGACGGTGTTGATTTGTCTTTTGCTGGGGGGCATGATGTGGCTTTCAGCCAGAAC
GAAGGCAAACTTATGATCTCATCCAGAACTGGGTCGTCAGGCTGTTCAGGTAGCTCTACTGCATGCTTGGATGATGAGAATATGTCAGATGACAGGCCTATGAAAAGGAA
GCGACTTTCAAGTTGTGACTCCTCGAAGACAGGTTCTGAGCTTAATGAATCTAAAATCATTCCCTTCCATGAAGGAAACAAAGTAGATACACTTGTTACAGAAAAGAGAT
TGCGAAAGCCGCCTAGGAGATACAGCGAAGAGTCAATTGAACAAAAGTCAAGATCTACTATCAAGAAGAGTGCTCATAAAGCTTCCAAGGATAAGTCTCTCCCCTCTGAA
TCTCACAGGCAGAATCACTGCCAAAAGAAACTTAAAGCAGCACCGATCATTCATAAAGATAAATCGTTTAATGGAGGTTGTATTCAGGTTCCATTCGGTCTTCCGATAGA
AGAAGGTCAAGTTCACTCGGCGAAAAAGAGAACATGTTGGGACTCAGAGGGGATGAAGGATAATAGAATCTTGTGCATAAAAGATAAAAACGATGTGGAGTCTTACTCGG
CCGAGTCAGAAGATGAGAATACTGAAGACGAGTGCATCACTAAAGGTAATAATACTCAAAAAGGCAGCAGTCGCAGGAAGCACCATATATCATGGACTCTTTCTGAGGTT
ATGAAGTTGGTGGAAGGTGTTTCAGAGTATGGTGTCGGAAGGTGGACAGAAATTAAGAGGCTACAATTTTCATCATCTTCACATAGAACATCTGTGGATCTCAAGGACAA
ATGGAGAAATCTATTGAAGGCAAGTGACACACAGTTGCAGAACAGAAGAAAGGTCGTCCTTGGTCGAAAGCAGGCATCGCAGCAAGTACCGGAATCAGTTCTTCGCCGAG
TTCGGGAACTGGCCGCTATTTATCCATATCCAAGGGAAAACAAATCCAAGGAATCGTGCTCAAGCTCAGCCCCCTCAACTTCATCCTTCAAAACTACCAGTAACATGTTT
GTTTCTTTGCCCACAGTTATGTGAAGGAAGTTGAATAGAGAATCAGATTGGTTTTCCTTTCCTTTTTTTTTTCCTTTTTTCTCTTTTCTTTTTATTTTTTAATTTTTAGT
AATTAGGAAGTTAGAATGTGAAAGCTTTGTACTTCACTGGCTGATTTGGCAAACAGATACAAAACCTCCATTCTGCTTTACTATTGAACCCA
Protein sequenceShow/hide protein sequence
MNAASKETMMTSNQTTKMFTYEFPDVDSDLSIFPVPEEDVLGVEHFLEPDFNKCFSGSVLDFNTFHSHKCLSIENFSFAVEFDQIKIDSESLHSSLTLEGERTKQEDHDE
KSQGTDDVVCSGINKLPATVKSQTADDIFNLAPGSCNASFLGNVNFGSYSQGFYPTDGVDLSFAGGHDVAFSQNEGKLMISSRTGSSGCSGSSTACLDDENMSDDRPMKR
KRLSSCDSSKTGSELNESKIIPFHEGNKVDTLVTEKRLRKPPRRYSEESIEQKSRSTIKKSAHKASKDKSLPSESHRQNHCQKKLKAAPIIHKDKSFNGGCIQVPFGLPI
EEGQVHSAKKRTCWDSEGMKDNRILCIKDKNDVESYSAESEDENTEDECITKGNNTQKGSSRRKHHISWTLSEVMKLVEGVSEYGVGRWTEIKRLQFSSSSHRTSVDLKD
KWRNLLKASDTQLQNRRKVVLGRKQASQQVPESVLRRVRELAAIYPYPRENKSKESCSSSAPSTSSFKTTSNMFVSLPTVM