; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc03g21790 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc03g21790
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionHTH myb-type domain-containing protein
Genome locationchr3:15000060..15006867
RNA-Seq ExpressionMoc03g21790
SyntenyMoc03g21790
Gene Ontology termsNA
InterPro domainsIPR001005 - SANT/Myb domain
IPR009057 - Homeobox-like domain superfamily
IPR017930 - Myb domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6607151.1 Telomere repeat-binding protein 4, partial [Cucurbita argyrosperma subsp. sororia]1.3e-23684.48Show/hide
Query:  MNAASKETMMTSNQTTKMFTYEFPDVDSDLSIFPVPEEDVLGVEHFLEPDFNKCFSGSVLDFNTFHSHKCLSIENFSFAVEFDQIKIDSESLHSSLTLEG
        MNAASKETMMTSNQ TKMFTYEFPDVDSDLSIFP PE+DVLGVEHFL+PD+NKCF GSVLDFNTFHSHKCLSIENFSFAVEFDQIKIDSESLHSSLTLEG
Subjt:  MNAASKETMMTSNQTTKMFTYEFPDVDSDLSIFPVPEEDVLGVEHFLEPDFNKCFSGSVLDFNTFHSHKCLSIENFSFAVEFDQIKIDSESLHSSLTLEG

Query:  ERTKQEDHDEKSQGTDDVVCSGINKLPATVKSQTADDIFNLAPGSCNASFLGNVNFG--------------SYSQGHDVAFSQNEGKLMISSRTGSSGCS
        ERTKQE  DEKS+GTDDV CSGINKL  TV  QT+DDIFNLAPGSCNASFLGNV+FG              S++ GH+V F Q + KLMISS TGSSGCS
Subjt:  ERTKQEDHDEKSQGTDDVVCSGINKLPATVKSQTADDIFNLAPGSCNASFLGNVNFG--------------SYSQGHDVAFSQNEGKLMISSRTGSSGCS

Query:  GSSTACLDDENMSDDRPMKRKRLSSCDSSKTGSELNESKIIPFHEGNKVDTLVTEKRLRKPPRRYSEESIEQKSRSTIKKSAHKASKDKSLPSESHRQNH
        GSSTACL+DENMSDDRP+KRKRLSSCDSSK  SE NESKI+PF+EGNKV+TLVTEKRLRKPPRRYSEESIEQKSRS  KKSA KASKDKSLPS  HR + 
Subjt:  GSSTACLDDENMSDDRPMKRKRLSSCDSSKTGSELNESKIIPFHEGNKVDTLVTEKRLRKPPRRYSEESIEQKSRSTIKKSAHKASKDKSLPSESHRQNH

Query:  CQKKLKAAPIIHKDKSFNGGCIQVPFGLPIEEGQVHSAKKRTCWDSEGMKDNRILCIKDKNDVESYSAESEDENTEDECITKGNNTQKGSSRRKHHISWT
         QKKLKAAPI+ KDKSFNGGCIQVPFGLPIEEG  HS KKR CW+ E +KDNRILCIKDKNDVES+SAESEDENTEDEC+TKGN TQKG+SRRKHHISWT
Subjt:  CQKKLKAAPIIHKDKSFNGGCIQVPFGLPIEEGQVHSAKKRTCWDSEGMKDNRILCIKDKNDVESYSAESEDENTEDECITKGNNTQKGSSRRKHHISWT

Query:  LSEVMKLVEGVSEYGVGRWTEIKRLQFSSSSHRTSVDLKDKWRNLLKASDTQLQNRRKVVLGRKQASQQVPESVLRRVRELAAIYPYPRENKSKESCSSS
        LSEVMKLVEGVSEYGVGRWTEIKRLQFSSSSHRTSVDLKDKWRNLLKASDTQLQNRRKVVLGRKQA QQVPESVL RVRELAAIYPYPRENKSKE+C  S
Subjt:  LSEVMKLVEGVSEYGVGRWTEIKRLQFSSSSHRTSVDLKDKWRNLLKASDTQLQNRRKVVLGRKQASQQVPESVLRRVRELAAIYPYPRENKSKESCSSS

Query:  APSTSSFK-TTSNMFVSLPTVM
        AP+TSSFK +TSNM V LPTVM
Subjt:  APSTSSFK-TTSNMFVSLPTVM

XP_022152933.1 uncharacterized protein LOC111020545 [Momordica charantia]6.6e-284100Show/hide
Query:  MNAASKETMMTSNQTTKMFTYEFPDVDSDLSIFPVPEEDVLGVEHFLEPDFNKCFSGSVLDFNTFHSHKCLSIENFSFAVEFDQIKIDSESLHSSLTLEG
        MNAASKETMMTSNQTTKMFTYEFPDVDSDLSIFPVPEEDVLGVEHFLEPDFNKCFSGSVLDFNTFHSHKCLSIENFSFAVEFDQIKIDSESLHSSLTLEG
Subjt:  MNAASKETMMTSNQTTKMFTYEFPDVDSDLSIFPVPEEDVLGVEHFLEPDFNKCFSGSVLDFNTFHSHKCLSIENFSFAVEFDQIKIDSESLHSSLTLEG

Query:  ERTKQEDHDEKSQGTDDVVCSGINKLPATVKSQTADDIFNLAPGSCNASFLGNVNFGSYSQGHDVAFSQNEGKLMISSRTGSSGCSGSSTACLDDENMSD
        ERTKQEDHDEKSQGTDDVVCSGINKLPATVKSQTADDIFNLAPGSCNASFLGNVNFGSYSQGHDVAFSQNEGKLMISSRTGSSGCSGSSTACLDDENMSD
Subjt:  ERTKQEDHDEKSQGTDDVVCSGINKLPATVKSQTADDIFNLAPGSCNASFLGNVNFGSYSQGHDVAFSQNEGKLMISSRTGSSGCSGSSTACLDDENMSD

Query:  DRPMKRKRLSSCDSSKTGSELNESKIIPFHEGNKVDTLVTEKRLRKPPRRYSEESIEQKSRSTIKKSAHKASKDKSLPSESHRQNHCQKKLKAAPIIHKD
        DRPMKRKRLSSCDSSKTGSELNESKIIPFHEGNKVDTLVTEKRLRKPPRRYSEESIEQKSRSTIKKSAHKASKDKSLPSESHRQNHCQKKLKAAPIIHKD
Subjt:  DRPMKRKRLSSCDSSKTGSELNESKIIPFHEGNKVDTLVTEKRLRKPPRRYSEESIEQKSRSTIKKSAHKASKDKSLPSESHRQNHCQKKLKAAPIIHKD

Query:  KSFNGGCIQVPFGLPIEEGQVHSAKKRTCWDSEGMKDNRILCIKDKNDVESYSAESEDENTEDECITKGNNTQKGSSRRKHHISWTLSEVMKLVEGVSEY
        KSFNGGCIQVPFGLPIEEGQVHSAKKRTCWDSEGMKDNRILCIKDKNDVESYSAESEDENTEDECITKGNNTQKGSSRRKHHISWTLSEVMKLVEGVSEY
Subjt:  KSFNGGCIQVPFGLPIEEGQVHSAKKRTCWDSEGMKDNRILCIKDKNDVESYSAESEDENTEDECITKGNNTQKGSSRRKHHISWTLSEVMKLVEGVSEY

Query:  GVGRWTEIKRLQFSSSSHRTSVDLKDKWRNLLKASDTQLQNRRKVVLGRKQASQQVPESVLRRVRELAAIYPYPRENKSKESCSSSAPSTSSFKTTSNMF
        GVGRWTEIKRLQFSSSSHRTSVDLKDKWRNLLKASDTQLQNRRKVVLGRKQASQQVPESVLRRVRELAAIYPYPRENKSKESCSSSAPSTSSFKTTSNMF
Subjt:  GVGRWTEIKRLQFSSSSHRTSVDLKDKWRNLLKASDTQLQNRRKVVLGRKQASQQVPESVLRRVRELAAIYPYPRENKSKESCSSSAPSTSSFKTTSNMF

Query:  VSLPTVM
        VSLPTVM
Subjt:  VSLPTVM

XP_022998873.1 uncharacterized protein LOC111493398 isoform X3 [Cucurbita maxima]2.3e-23684.67Show/hide
Query:  MNAASKETMMTSNQTTKMFTYEFPDVDSDLSIFPVPEEDVLGVEHFLEPDFNKCFSGSVLDFNTFHSHKCLSIENFSFAVEFDQIKIDSESLHSSLTLEG
        MNAASKETMMTSNQ TKMFTYEFPDVDSDLSIFP PE+DVLGVEHFL+PD+NKCF GSVLDFNTFHSHKCLSIENFSFAVEFDQIKIDSESLHSSLTLEG
Subjt:  MNAASKETMMTSNQTTKMFTYEFPDVDSDLSIFPVPEEDVLGVEHFLEPDFNKCFSGSVLDFNTFHSHKCLSIENFSFAVEFDQIKIDSESLHSSLTLEG

Query:  ERTKQEDHDEKSQGTDDVVCSGINKLPATVKSQTADDIFNLAPGSCNASFLGNVNFG--------------SYSQGHDVAFSQNEGKLMISSRTGSSGCS
        ERTKQE  DEKS+GTDDVVCSGINKL  TV  Q  DDIFNLAPGSCNASFLGNV+FG              S++ GH+VAF Q +GKLMISS TGSSGCS
Subjt:  ERTKQEDHDEKSQGTDDVVCSGINKLPATVKSQTADDIFNLAPGSCNASFLGNVNFG--------------SYSQGHDVAFSQNEGKLMISSRTGSSGCS

Query:  GSSTACLDDENMSDDRPMKRKRLSSCDSSKTGSELNESKIIPFHEGNKVDTLVTEKRLRKPPRRYSEESIEQKSRSTIKKSAHKASKDKSLPSESHRQNH
        GSSTACL+DENMSDDRP+KRKRLSSCDSSK  SE NESKI+PF+EGNKV+T VTEKRLRKPPRRYSEESIEQKSRS  KKSA KASKDKSLPS  H  + 
Subjt:  GSSTACLDDENMSDDRPMKRKRLSSCDSSKTGSELNESKIIPFHEGNKVDTLVTEKRLRKPPRRYSEESIEQKSRSTIKKSAHKASKDKSLPSESHRQNH

Query:  CQKKLKAAPIIHKDKSFNGGCIQVPFGLPIEEGQVHSAKKRTCWDSEGMKDNRILCIKDKNDVESYSAESEDENTEDECITKGNNTQKGSSRRKHHISWT
         QKKLKAAPI+ KDKSFNGGCIQVPFGLPIEEG  HS KKR CW+ E +KDNRILCIKDKNDVES+SAESEDENTEDEC+TKGN TQKG+SRRKHHISWT
Subjt:  CQKKLKAAPIIHKDKSFNGGCIQVPFGLPIEEGQVHSAKKRTCWDSEGMKDNRILCIKDKNDVESYSAESEDENTEDECITKGNNTQKGSSRRKHHISWT

Query:  LSEVMKLVEGVSEYGVGRWTEIKRLQFSSSSHRTSVDLKDKWRNLLKASDTQLQNRRKVVLGRKQASQQVPESVLRRVRELAAIYPYPRENKSKESCSSS
        LSEVMKLVEGVSEYGVGRWTEIKRLQFSSSSHRTSVDLKDKWRNLLKASDTQLQNRRKVVLGRKQA QQVPESVL RVRELAAIYPYPRENKSKE+C  S
Subjt:  LSEVMKLVEGVSEYGVGRWTEIKRLQFSSSSHRTSVDLKDKWRNLLKASDTQLQNRRKVVLGRKQASQQVPESVLRRVRELAAIYPYPRENKSKESCSSS

Query:  APSTSSFK-TTSNMFVSLPTVM
        AP+TSSFK TTSNM V LPTVM
Subjt:  APSTSSFK-TTSNMFVSLPTVM

XP_023524479.1 uncharacterized protein LOC111788382 isoform X3 [Cucurbita pepo subsp. pepo]5.1e-23684.29Show/hide
Query:  MNAASKETMMTSNQTTKMFTYEFPDVDSDLSIFPVPEEDVLGVEHFLEPDFNKCFSGSVLDFNTFHSHKCLSIENFSFAVEFDQIKIDSESLHSSLTLEG
        MNAASKETMMTSNQ TKMFTYEFPDVDSDLSIFP PE+DVLGVEHFL+PD+NKCF GSVLDFNTFHSHKCLSIENFSFAVEFDQIKIDSESLHSSLTLEG
Subjt:  MNAASKETMMTSNQTTKMFTYEFPDVDSDLSIFPVPEEDVLGVEHFLEPDFNKCFSGSVLDFNTFHSHKCLSIENFSFAVEFDQIKIDSESLHSSLTLEG

Query:  ERTKQEDHDEKSQGTDDVVCSGINKLPATVKSQTADDIFNLAPGSCNASFLGNVNFG--------------SYSQGHDVAFSQNEGKLMISSRTGSSGCS
        ERTKQE  DEKS+GTDDVVCSGINKL  TV  QT DDIFNLAPGSCNASFLGNV+FG              S++ GH+V F Q + KLMISS TGSSGCS
Subjt:  ERTKQEDHDEKSQGTDDVVCSGINKLPATVKSQTADDIFNLAPGSCNASFLGNVNFG--------------SYSQGHDVAFSQNEGKLMISSRTGSSGCS

Query:  GSSTACLDDENMSDDRPMKRKRLSSCDSSKTGSELNESKIIPFHEGNKVDTLVTEKRLRKPPRRYSEESIEQKSRSTIKKSAHKASKDKSLPSESHRQNH
        GSSTACL+DENMSDDRP+KRKRLSSCDSSK  SE NESKI+PF+EGNKV+TLVTEKRLRKPPRRYSEESIEQKSRS  KKSA KASKDKSLPS  HR + 
Subjt:  GSSTACLDDENMSDDRPMKRKRLSSCDSSKTGSELNESKIIPFHEGNKVDTLVTEKRLRKPPRRYSEESIEQKSRSTIKKSAHKASKDKSLPSESHRQNH

Query:  CQKKLKAAPIIHKDKSFNGGCIQVPFGLPIEEGQVHSAKKRTCWDSEGMKDNRILCIKDKNDVESYSAESEDENTEDECITKGNNTQKGSSRRKHHISWT
         QKKLKAAPI+ KDKSFNGGCIQVPFGLPIEEG  HS KKR CW+ E +KDNRILCIKDKNDVES+SAES+DENTEDEC+ KGN TQKG+SRRKHHISWT
Subjt:  CQKKLKAAPIIHKDKSFNGGCIQVPFGLPIEEGQVHSAKKRTCWDSEGMKDNRILCIKDKNDVESYSAESEDENTEDECITKGNNTQKGSSRRKHHISWT

Query:  LSEVMKLVEGVSEYGVGRWTEIKRLQFSSSSHRTSVDLKDKWRNLLKASDTQLQNRRKVVLGRKQASQQVPESVLRRVRELAAIYPYPRENKSKESCSSS
        LSEVMKLVEGVSEYGVGRWTEIKRLQFSSSSHRTSVDLKDKWRNLLKASDTQLQNRRKVVLGRKQA QQVPESVL RVRELAAIYPYPRENKSK++C  S
Subjt:  LSEVMKLVEGVSEYGVGRWTEIKRLQFSSSSHRTSVDLKDKWRNLLKASDTQLQNRRKVVLGRKQASQQVPESVLRRVRELAAIYPYPRENKSKESCSSS

Query:  APSTSSFK-TTSNMFVSLPTVM
        AP+TSSFK TTSNM V LPTVM
Subjt:  APSTSSFK-TTSNMFVSLPTVM

XP_038894300.1 uncharacterized protein LOC120082937 isoform X2 [Benincasa hispida]6.1e-23784.7Show/hide
Query:  MNAASKETMMTSNQTTKMFTYEFPDVDSDLSIFPVPEEDVLGVEHFLEPDFNKCFSGSVLDFNTFHSHKCLSIENFSFAVEFDQIKIDSESLHSSLTLEG
        MNAASKETMMTSNQ TKMFTYEFPDVDSDLSIFPVPE+DVLGVEHFLEPDFNKCFSGSVLDFNTFHSHKCL IE+FSFAV+FDQIKIDSESLHSSLTLEG
Subjt:  MNAASKETMMTSNQTTKMFTYEFPDVDSDLSIFPVPEEDVLGVEHFLEPDFNKCFSGSVLDFNTFHSHKCLSIENFSFAVEFDQIKIDSESLHSSLTLEG

Query:  ERTKQEDHDEKSQGTDDVVCSGINKLPATVKSQTADDIFNLAPGSCNASFLGNVNFGSYSQ---------------GHDVAFSQNEGKLMISSRTGSSGC
        ERTKQEDHDEK +  D  VCSGINK P  V  +T DDIFNLAPGS NASFLGNV+FGSYSQ               GH VAF Q EGKLMISS TGSSGC
Subjt:  ERTKQEDHDEKSQGTDDVVCSGINKLPATVKSQTADDIFNLAPGSCNASFLGNVNFGSYSQ---------------GHDVAFSQNEGKLMISSRTGSSGC

Query:  SGSSTACLDDENMSDDRPMKRKRLSSCDSSKTGSELNESKIIPFHEGNKVDTLVTEKRLRKPPRRYSEESIEQKSRSTIKKSAHKASKDKSLPSESHRQN
        SGS+TACLDDENMSDDRPMKRKRLSSCDSSKT  E NESKIIPF+EGN+ DT+VTEKRLRKPPRRYSEES+EQKSRS  K+S  KASKDK +PSESH+Q 
Subjt:  SGSSTACLDDENMSDDRPMKRKRLSSCDSSKTGSELNESKIIPFHEGNKVDTLVTEKRLRKPPRRYSEESIEQKSRSTIKKSAHKASKDKSLPSESHRQN

Query:  HCQKKLKAAPIIHKDKSFNGGCIQVPFGLPIEEGQVHSAKKRTCWDSEGMKDNRILCIKDKNDVESYSAESEDENTEDECITKGNNTQKGSSRRKHHISW
          QKK+KAAPI+HKDKSFNGGCIQVPFGLPIEEG  HSAKKR CW+ E +KDNRILCIKDK DVES+SAESEDENTEDEC+TK N+TQKG+SRRKHHISW
Subjt:  HCQKKLKAAPIIHKDKSFNGGCIQVPFGLPIEEGQVHSAKKRTCWDSEGMKDNRILCIKDKNDVESYSAESEDENTEDECITKGNNTQKGSSRRKHHISW

Query:  TLSEVMKLVEGVSEYGVGRWTEIKRLQFSSSSHRTSVDLKDKWRNLLKASDTQLQNRRKVVLGRKQASQQVPESVLRRVRELAAIYPYPRENKSKESCSS
        TLSEVMKLVEGVSEYGVGRWTEIKRLQF+SSSHRTSVDLKDKWRNLLKASDTQLQNRRKVVLGRKQASQQVPESVL RVRELAAIYPYPRENKSKESC  
Subjt:  TLSEVMKLVEGVSEYGVGRWTEIKRLQFSSSSHRTSVDLKDKWRNLLKASDTQLQNRRKVVLGRKQASQQVPESVLRRVRELAAIYPYPRENKSKESCSS

Query:  SAPSTSSFK-TTSNMFVSLPTVM
        SAPSTSSFK TT+NMFVSLPTVM
Subjt:  SAPSTSSFK-TTSNMFVSLPTVM

TrEMBL top hitse value%identityAlignment
A0A6J1DG71 uncharacterized protein LOC1110205453.2e-284100Show/hide
Query:  MNAASKETMMTSNQTTKMFTYEFPDVDSDLSIFPVPEEDVLGVEHFLEPDFNKCFSGSVLDFNTFHSHKCLSIENFSFAVEFDQIKIDSESLHSSLTLEG
        MNAASKETMMTSNQTTKMFTYEFPDVDSDLSIFPVPEEDVLGVEHFLEPDFNKCFSGSVLDFNTFHSHKCLSIENFSFAVEFDQIKIDSESLHSSLTLEG
Subjt:  MNAASKETMMTSNQTTKMFTYEFPDVDSDLSIFPVPEEDVLGVEHFLEPDFNKCFSGSVLDFNTFHSHKCLSIENFSFAVEFDQIKIDSESLHSSLTLEG

Query:  ERTKQEDHDEKSQGTDDVVCSGINKLPATVKSQTADDIFNLAPGSCNASFLGNVNFGSYSQGHDVAFSQNEGKLMISSRTGSSGCSGSSTACLDDENMSD
        ERTKQEDHDEKSQGTDDVVCSGINKLPATVKSQTADDIFNLAPGSCNASFLGNVNFGSYSQGHDVAFSQNEGKLMISSRTGSSGCSGSSTACLDDENMSD
Subjt:  ERTKQEDHDEKSQGTDDVVCSGINKLPATVKSQTADDIFNLAPGSCNASFLGNVNFGSYSQGHDVAFSQNEGKLMISSRTGSSGCSGSSTACLDDENMSD

Query:  DRPMKRKRLSSCDSSKTGSELNESKIIPFHEGNKVDTLVTEKRLRKPPRRYSEESIEQKSRSTIKKSAHKASKDKSLPSESHRQNHCQKKLKAAPIIHKD
        DRPMKRKRLSSCDSSKTGSELNESKIIPFHEGNKVDTLVTEKRLRKPPRRYSEESIEQKSRSTIKKSAHKASKDKSLPSESHRQNHCQKKLKAAPIIHKD
Subjt:  DRPMKRKRLSSCDSSKTGSELNESKIIPFHEGNKVDTLVTEKRLRKPPRRYSEESIEQKSRSTIKKSAHKASKDKSLPSESHRQNHCQKKLKAAPIIHKD

Query:  KSFNGGCIQVPFGLPIEEGQVHSAKKRTCWDSEGMKDNRILCIKDKNDVESYSAESEDENTEDECITKGNNTQKGSSRRKHHISWTLSEVMKLVEGVSEY
        KSFNGGCIQVPFGLPIEEGQVHSAKKRTCWDSEGMKDNRILCIKDKNDVESYSAESEDENTEDECITKGNNTQKGSSRRKHHISWTLSEVMKLVEGVSEY
Subjt:  KSFNGGCIQVPFGLPIEEGQVHSAKKRTCWDSEGMKDNRILCIKDKNDVESYSAESEDENTEDECITKGNNTQKGSSRRKHHISWTLSEVMKLVEGVSEY

Query:  GVGRWTEIKRLQFSSSSHRTSVDLKDKWRNLLKASDTQLQNRRKVVLGRKQASQQVPESVLRRVRELAAIYPYPRENKSKESCSSSAPSTSSFKTTSNMF
        GVGRWTEIKRLQFSSSSHRTSVDLKDKWRNLLKASDTQLQNRRKVVLGRKQASQQVPESVLRRVRELAAIYPYPRENKSKESCSSSAPSTSSFKTTSNMF
Subjt:  GVGRWTEIKRLQFSSSSHRTSVDLKDKWRNLLKASDTQLQNRRKVVLGRKQASQQVPESVLRRVRELAAIYPYPRENKSKESCSSSAPSTSSFKTTSNMF

Query:  VSLPTVM
        VSLPTVM
Subjt:  VSLPTVM

A0A6J1GAD7 uncharacterized protein LOC111452393 isoform X24.0e-23483.81Show/hide
Query:  MNAASKETMMTSNQTTKMFTYEFPDVDSDLSIFPVPEEDVLGVEHFLEPDFNKCFSGSVLDFNTFHSHKCLSIENFSFAVEFDQIKIDSESLHSSLTLEG
        MNAASKETMMTSNQ TKMFTYEFPDVDSDLSIFP PE+DVLGVEHFL+PD+NKCF GSVLDFNTFHSHKCLSIENFSFAVEFDQIKIDSESLHSSLTLEG
Subjt:  MNAASKETMMTSNQTTKMFTYEFPDVDSDLSIFPVPEEDVLGVEHFLEPDFNKCFSGSVLDFNTFHSHKCLSIENFSFAVEFDQIKIDSESLHSSLTLEG

Query:  ERTKQEDHDEKSQGTDDVVCSGINKLPATVKSQTADDIFNLAPGSCNASFLGNVNFGSYSQ-----------------GHDVAFSQNEGKLMISSRTGSS
        ERTKQE  DEKS+GTDDV CSGINKL  TV  QT DDIFNLAPGSCNASFLGNV+FG Y Q                 GH+V F Q + KLMISS TGSS
Subjt:  ERTKQEDHDEKSQGTDDVVCSGINKLPATVKSQTADDIFNLAPGSCNASFLGNVNFGSYSQ-----------------GHDVAFSQNEGKLMISSRTGSS

Query:  GCSGSSTACLDDENMSDDRPMKRKRLSSCDSSKTGSELNESKIIPFHEGNKVDTLVTEKRLRKPPRRYSEESIEQKSRSTIKKSAHKASKDKSLPSESHR
        GCSGSSTACL+DENMSDDRP+KRKRLSSCDSSK  SE  ESKI+PF+EGNKV+TLVTEKRLRKPPRRYSEESIEQKSRS  KKSA KASKDKSLPS  HR
Subjt:  GCSGSSTACLDDENMSDDRPMKRKRLSSCDSSKTGSELNESKIIPFHEGNKVDTLVTEKRLRKPPRRYSEESIEQKSRSTIKKSAHKASKDKSLPSESHR

Query:  QNHCQKKLKAAPIIHKDKSFNGGCIQVPFGLPIEEGQVHSAKKRTCWDSEGMKDNRILCIKDKNDVESYSAESEDENTEDECITKGNNTQKGSSRRKHHI
         +  QKKLKAAPI+ KDKSFNGGCIQVPFGLPIEEG  HS KKR CW+ E +KDNRILCIKDKNDVES+SAESEDENTEDEC+TKGN TQKG+SRRKHHI
Subjt:  QNHCQKKLKAAPIIHKDKSFNGGCIQVPFGLPIEEGQVHSAKKRTCWDSEGMKDNRILCIKDKNDVESYSAESEDENTEDECITKGNNTQKGSSRRKHHI

Query:  SWTLSEVMKLVEGVSEYGVGRWTEIKRLQFSSSSHRTSVDLKDKWRNLLKASDTQLQNRRKVVLGRKQASQQVPESVLRRVRELAAIYPYPRENKSKESC
        SWTLSEVMKLVEGVS+YGVGRWTEIKRLQFSSSSHRTSVDLKDKWRNLLKASDTQLQNRRKVVLGRKQA QQVPESVL RVRELAAIYPYPRENKSKE+C
Subjt:  SWTLSEVMKLVEGVSEYGVGRWTEIKRLQFSSSSHRTSVDLKDKWRNLLKASDTQLQNRRKVVLGRKQASQQVPESVLRRVRELAAIYPYPRENKSKESC

Query:  SSSAPSTSSFK-TTSNMFVSLPTVM
          SAP+TSSFK +TSNM V LPTVM
Subjt:  SSSAPSTSSFK-TTSNMFVSLPTVM

A0A6J1GAF3 uncharacterized protein LOC111452393 isoform X32.7e-23584.1Show/hide
Query:  MNAASKETMMTSNQTTKMFTYEFPDVDSDLSIFPVPEEDVLGVEHFLEPDFNKCFSGSVLDFNTFHSHKCLSIENFSFAVEFDQIKIDSESLHSSLTLEG
        MNAASKETMMTSNQ TKMFTYEFPDVDSDLSIFP PE+DVLGVEHFL+PD+NKCF GSVLDFNTFHSHKCLSIENFSFAVEFDQIKIDSESLHSSLTLEG
Subjt:  MNAASKETMMTSNQTTKMFTYEFPDVDSDLSIFPVPEEDVLGVEHFLEPDFNKCFSGSVLDFNTFHSHKCLSIENFSFAVEFDQIKIDSESLHSSLTLEG

Query:  ERTKQEDHDEKSQGTDDVVCSGINKLPATVKSQTADDIFNLAPGSCNASFLGNVNFG--------------SYSQGHDVAFSQNEGKLMISSRTGSSGCS
        ERTKQE  DEKS+GTDDV CSGINKL  TV  QT DDIFNLAPGSCNASFLGNV+FG              S++ GH+V F Q + KLMISS TGSSGCS
Subjt:  ERTKQEDHDEKSQGTDDVVCSGINKLPATVKSQTADDIFNLAPGSCNASFLGNVNFG--------------SYSQGHDVAFSQNEGKLMISSRTGSSGCS

Query:  GSSTACLDDENMSDDRPMKRKRLSSCDSSKTGSELNESKIIPFHEGNKVDTLVTEKRLRKPPRRYSEESIEQKSRSTIKKSAHKASKDKSLPSESHRQNH
        GSSTACL+DENMSDDRP+KRKRLSSCDSSK  SE  ESKI+PF+EGNKV+TLVTEKRLRKPPRRYSEESIEQKSRS  KKSA KASKDKSLPS  HR + 
Subjt:  GSSTACLDDENMSDDRPMKRKRLSSCDSSKTGSELNESKIIPFHEGNKVDTLVTEKRLRKPPRRYSEESIEQKSRSTIKKSAHKASKDKSLPSESHRQNH

Query:  CQKKLKAAPIIHKDKSFNGGCIQVPFGLPIEEGQVHSAKKRTCWDSEGMKDNRILCIKDKNDVESYSAESEDENTEDECITKGNNTQKGSSRRKHHISWT
         QKKLKAAPI+ KDKSFNGGCIQVPFGLPIEEG  HS KKR CW+ E +KDNRILCIKDKNDVES+SAESEDENTEDEC+TKGN TQKG+SRRKHHISWT
Subjt:  CQKKLKAAPIIHKDKSFNGGCIQVPFGLPIEEGQVHSAKKRTCWDSEGMKDNRILCIKDKNDVESYSAESEDENTEDECITKGNNTQKGSSRRKHHISWT

Query:  LSEVMKLVEGVSEYGVGRWTEIKRLQFSSSSHRTSVDLKDKWRNLLKASDTQLQNRRKVVLGRKQASQQVPESVLRRVRELAAIYPYPRENKSKESCSSS
        LSEVMKLVEGVS+YGVGRWTEIKRLQFSSSSHRTSVDLKDKWRNLLKASDTQLQNRRKVVLGRKQA QQVPESVL RVRELAAIYPYPRENKSKE+C  S
Subjt:  LSEVMKLVEGVSEYGVGRWTEIKRLQFSSSSHRTSVDLKDKWRNLLKASDTQLQNRRKVVLGRKQASQQVPESVLRRVRELAAIYPYPRENKSKESCSSS

Query:  APSTSSFK-TTSNMFVSLPTVM
        AP+TSSFK +TSNM V LPTVM
Subjt:  APSTSSFK-TTSNMFVSLPTVM

A0A6J1K971 uncharacterized protein LOC111493398 isoform X21.6e-23584.38Show/hide
Query:  MNAASKETMMTSNQTTKMFTYEFPDVDSDLSIFPVPEEDVLGVEHFLEPDFNKCFSGSVLDFNTFHSHKCLSIENFSFAVEFDQIKIDSESLHSSLTLEG
        MNAASKETMMTSNQ TKMFTYEFPDVDSDLSIFP PE+DVLGVEHFL+PD+NKCF GSVLDFNTFHSHKCLSIENFSFAVEFDQIKIDSESLHSSLTLEG
Subjt:  MNAASKETMMTSNQTTKMFTYEFPDVDSDLSIFPVPEEDVLGVEHFLEPDFNKCFSGSVLDFNTFHSHKCLSIENFSFAVEFDQIKIDSESLHSSLTLEG

Query:  ERTKQEDHDEKSQGTDDVVCSGINKLPATVKSQTADDIFNLAPGSCNASFLGNVNFGSYSQ-----------------GHDVAFSQNEGKLMISSRTGSS
        ERTKQE  DEKS+GTDDVVCSGINKL  TV  Q  DDIFNLAPGSCNASFLGNV+FG Y Q                 GH+VAF Q +GKLMISS TGSS
Subjt:  ERTKQEDHDEKSQGTDDVVCSGINKLPATVKSQTADDIFNLAPGSCNASFLGNVNFGSYSQ-----------------GHDVAFSQNEGKLMISSRTGSS

Query:  GCSGSSTACLDDENMSDDRPMKRKRLSSCDSSKTGSELNESKIIPFHEGNKVDTLVTEKRLRKPPRRYSEESIEQKSRSTIKKSAHKASKDKSLPSESHR
        GCSGSSTACL+DENMSDDRP+KRKRLSSCDSSK  SE NESKI+PF+EGNKV+T VTEKRLRKPPRRYSEESIEQKSRS  KKSA KASKDKSLPS  H 
Subjt:  GCSGSSTACLDDENMSDDRPMKRKRLSSCDSSKTGSELNESKIIPFHEGNKVDTLVTEKRLRKPPRRYSEESIEQKSRSTIKKSAHKASKDKSLPSESHR

Query:  QNHCQKKLKAAPIIHKDKSFNGGCIQVPFGLPIEEGQVHSAKKRTCWDSEGMKDNRILCIKDKNDVESYSAESEDENTEDECITKGNNTQKGSSRRKHHI
         +  QKKLKAAPI+ KDKSFNGGCIQVPFGLPIEEG  HS KKR CW+ E +KDNRILCIKDKNDVES+SAESEDENTEDEC+TKGN TQKG+SRRKHHI
Subjt:  QNHCQKKLKAAPIIHKDKSFNGGCIQVPFGLPIEEGQVHSAKKRTCWDSEGMKDNRILCIKDKNDVESYSAESEDENTEDECITKGNNTQKGSSRRKHHI

Query:  SWTLSEVMKLVEGVSEYGVGRWTEIKRLQFSSSSHRTSVDLKDKWRNLLKASDTQLQNRRKVVLGRKQASQQVPESVLRRVRELAAIYPYPRENKSKESC
        SWTLSEVMKLVEGVSEYGVGRWTEIKRLQFSSSSHRTSVDLKDKWRNLLKASDTQLQNRRKVVLGRKQA QQVPESVL RVRELAAIYPYPRENKSKE+C
Subjt:  SWTLSEVMKLVEGVSEYGVGRWTEIKRLQFSSSSHRTSVDLKDKWRNLLKASDTQLQNRRKVVLGRKQASQQVPESVLRRVRELAAIYPYPRENKSKESC

Query:  SSSAPSTSSFK-TTSNMFVSLPTVM
          SAP+TSSFK TTSNM V LPTVM
Subjt:  SSSAPSTSSFK-TTSNMFVSLPTVM

A0A6J1KI08 uncharacterized protein LOC111493398 isoform X31.1e-23684.67Show/hide
Query:  MNAASKETMMTSNQTTKMFTYEFPDVDSDLSIFPVPEEDVLGVEHFLEPDFNKCFSGSVLDFNTFHSHKCLSIENFSFAVEFDQIKIDSESLHSSLTLEG
        MNAASKETMMTSNQ TKMFTYEFPDVDSDLSIFP PE+DVLGVEHFL+PD+NKCF GSVLDFNTFHSHKCLSIENFSFAVEFDQIKIDSESLHSSLTLEG
Subjt:  MNAASKETMMTSNQTTKMFTYEFPDVDSDLSIFPVPEEDVLGVEHFLEPDFNKCFSGSVLDFNTFHSHKCLSIENFSFAVEFDQIKIDSESLHSSLTLEG

Query:  ERTKQEDHDEKSQGTDDVVCSGINKLPATVKSQTADDIFNLAPGSCNASFLGNVNFG--------------SYSQGHDVAFSQNEGKLMISSRTGSSGCS
        ERTKQE  DEKS+GTDDVVCSGINKL  TV  Q  DDIFNLAPGSCNASFLGNV+FG              S++ GH+VAF Q +GKLMISS TGSSGCS
Subjt:  ERTKQEDHDEKSQGTDDVVCSGINKLPATVKSQTADDIFNLAPGSCNASFLGNVNFG--------------SYSQGHDVAFSQNEGKLMISSRTGSSGCS

Query:  GSSTACLDDENMSDDRPMKRKRLSSCDSSKTGSELNESKIIPFHEGNKVDTLVTEKRLRKPPRRYSEESIEQKSRSTIKKSAHKASKDKSLPSESHRQNH
        GSSTACL+DENMSDDRP+KRKRLSSCDSSK  SE NESKI+PF+EGNKV+T VTEKRLRKPPRRYSEESIEQKSRS  KKSA KASKDKSLPS  H  + 
Subjt:  GSSTACLDDENMSDDRPMKRKRLSSCDSSKTGSELNESKIIPFHEGNKVDTLVTEKRLRKPPRRYSEESIEQKSRSTIKKSAHKASKDKSLPSESHRQNH

Query:  CQKKLKAAPIIHKDKSFNGGCIQVPFGLPIEEGQVHSAKKRTCWDSEGMKDNRILCIKDKNDVESYSAESEDENTEDECITKGNNTQKGSSRRKHHISWT
         QKKLKAAPI+ KDKSFNGGCIQVPFGLPIEEG  HS KKR CW+ E +KDNRILCIKDKNDVES+SAESEDENTEDEC+TKGN TQKG+SRRKHHISWT
Subjt:  CQKKLKAAPIIHKDKSFNGGCIQVPFGLPIEEGQVHSAKKRTCWDSEGMKDNRILCIKDKNDVESYSAESEDENTEDECITKGNNTQKGSSRRKHHISWT

Query:  LSEVMKLVEGVSEYGVGRWTEIKRLQFSSSSHRTSVDLKDKWRNLLKASDTQLQNRRKVVLGRKQASQQVPESVLRRVRELAAIYPYPRENKSKESCSSS
        LSEVMKLVEGVSEYGVGRWTEIKRLQFSSSSHRTSVDLKDKWRNLLKASDTQLQNRRKVVLGRKQA QQVPESVL RVRELAAIYPYPRENKSKE+C  S
Subjt:  LSEVMKLVEGVSEYGVGRWTEIKRLQFSSSSHRTSVDLKDKWRNLLKASDTQLQNRRKVVLGRKQASQQVPESVLRRVRELAAIYPYPRENKSKESCSSS

Query:  APSTSSFK-TTSNMFVSLPTVM
        AP+TSSFK TTSNM V LPTVM
Subjt:  APSTSSFK-TTSNMFVSLPTVM

SwissProt top hitse value%identityAlignment
Q6R0E3 Telomere repeat-binding protein 55.5e-0735.23Show/hide
Query:  SRRKHHISWTLSEVMKLVEGVSEYGVGRWTEIKRLQFSSSSHRTSVDLKDKWRNLLKASDTQLQNRRKVVLGRKQASQQVPESVLRRV
        ++R+    ++++EV  LV+ V   G GRW ++K   F ++ HRT VDLKDKW+ L+  +    Q RR          + VP+ +L RV
Subjt:  SRRKHHISWTLSEVMKLVEGVSEYGVGRWTEIKRLQFSSSSHRTSVDLKDKWRNLLKASDTQLQNRRKVVLGRKQASQQVPESVLRRV

Q9C7B1 Telomere repeat-binding protein 38.5e-0836.36Show/hide
Query:  SRRKHHISWTLSEVMKLVEGVSEYGVGRWTEIKRLQFSSSSHRTSVDLKDKWRNLLKASDTQLQNRRKVVLGRKQASQQVPESVLRRV
        ++R+    ++++EV  LV+ V E G GRW ++K   F  + HRT VDLKDKW+ L+  +    Q RR          + VP+ +L RV
Subjt:  SRRKHHISWTLSEVMKLVEGVSEYGVGRWTEIKRLQFSSSSHRTSVDLKDKWRNLLKASDTQLQNRRKVVLGRKQASQQVPESVLRRV

Q9FFY9 Telomere repeat-binding protein 41.9e-0731.71Show/hide
Query:  SRRKHHISWTLSEVMKLVEGVSEYGVGRWTEIKRLQFSSSSHRTSVDLKDKWRNLLKASDTQLQNRRKVVLGRKQASQQVPESVLRRVRELAAIYPYPRE
        S+R+    ++++EV  LV  V E G GRW ++K   F ++SHRT VDLKDKW+ L+  +    Q RR          + VP+ +L RV      + Y  +
Subjt:  SRRKHHISWTLSEVMKLVEGVSEYGVGRWTEIKRLQFSSSSHRTSVDLKDKWRNLLKASDTQLQNRRKVVLGRKQASQQVPESVLRRVRELAAIYPYPRE

Query:  NKSKESCSSSAPSTSSFKTTSNM
        ++ K++      +T   +  S+M
Subjt:  NKSKESCSSSAPSTSSFKTTSNM

Q9M347 Telomere repeat-binding protein 63.2e-0736.78Show/hide
Query:  RRKHHISWTLSEVMKLVEGVSEYGVGRWTEIKRLQFSSSSHRTSVDLKDKWRNLLKASDTQLQNRRKVVLGRKQASQQVPESVLRRV
        +R+    +T+SEV  LV+ V   G GRW ++K   F+  +HRT VDLKDKW+ L+  +    + RR          + VP+ +L RV
Subjt:  RRKHHISWTLSEVMKLVEGVSEYGVGRWTEIKRLQFSSSSHRTSVDLKDKWRNLLKASDTQLQNRRKVVLGRKQASQQVPESVLRRV

Q9SNB9 Telomere repeat-binding protein 23.2e-0733.33Show/hide
Query:  QKGSSRRKHHISWTLSEVMKLVEGVSEYGVGRWTEIKRLQFSSSSHRTSVDLKDKWRNLLKASDTQLQNRRKVVLGRKQASQQVPESVLRRVRELAAIY
        Q+ +++R+    ++++EV  LV+ V + G GRW ++K   F  + HRT VDLKDKW+ L+  +    Q RR          + VP+ +L RV +  A +
Subjt:  QKGSSRRKHHISWTLSEVMKLVEGVSEYGVGRWTEIKRLQFSSSSHRTSVDLKDKWRNLLKASDTQLQNRRKVVLGRKQASQQVPESVLRRVRELAAIY

Arabidopsis top hitse value%identityAlignment
AT1G17460.1 TRF-like 35.3e-2134.35Show/hide
Query:  KRLRKPPRRYSEE---------------------SIEQKSRSTIKKSAHKASKDKSLPSESHRQNHCQKKLKAAPIIHKDKSF-------NGGCIQVPFG
        KR+RKP RRY EE                     ++  + R  + +    A     +P  SH +    ++   A    + KS+        G     P  
Subjt:  KRLRKPPRRYSEE---------------------SIEQKSRSTIKKSAHKASKDKSLPSESHRQNHCQKKLKAAPIIHKDKSF-------NGGCIQVPFG

Query:  LPIEEGQVHSAKKRT-CWDSEGMKDNRILCIKD------KNDVESYSAESEDENTEDECITKGNNTQKGSSRRKHHISWTLSEVMKLVEGVSEYGVGRWT
        L  +  +V   K  + C   E  KD+      D      + ++   S +S D+N  D  IT      + +S RK H +WT+SEV KLVEGVS+YGVG+WT
Subjt:  LPIEEGQVHSAKKRT-CWDSEGMKDNRILCIKD------KNDVESYSAESEDENTEDECITKGNNTQKGSSRRKHHISWTLSEVMKLVEGVSEYGVGRWT

Query:  EIKRLQFSSSSHRTSVDLKDKWRNLLKASDTQLQNRRKVVLGRKQASQQVPESVLRRVRELA
        EIK+L FS  +HRT+VDLKDKWRNL KAS +   NR +  L +K  S  +P  ++ +VRELA
Subjt:  EIKRLQFSSSSHRTSVDLKDKWRNLLKASDTQLQNRRKVVLGRKQASQQVPESVLRRVRELA

AT1G72650.1 TRF-like 66.6e-2434.56Show/hide
Query:  KRLRKPPRRYSEESIEQKSRSTIKKSAHKASKDKSLPSESHRQNHCQKKLKAAPIIHKDKSFNGGCIQVPF-----------------------------
        KR+RKP RRY EE  E   +    KS    SKD+ L  +S  ++      K   +  +  S  G  I+VP+                             
Subjt:  KRLRKPPRRYSEESIEQKSRSTIKKSAHKASKDKSLPSESHRQNHCQKKLKAAPIIHKDKSFNGGCIQVPF-----------------------------

Query:  ----GLPIEEGQVHS--AKKRTCWDSEGMKDNRILCIKDKNDVESYSAESEDENTEDECITKGNNT----------QKGSSRRKHHISWTLSEVMKLVEG
             L +   Q+ S    + +   S            D+N+VE   +E + E   +   + GN++          Q G+ RRKHH +WTLSE+ KLVEG
Subjt:  ----GLPIEEGQVHS--AKKRTCWDSEGMKDNRILCIKDKNDVESYSAESEDENTEDECITKGNNT----------QKGSSRRKHHISWTLSEVMKLVEG

Query:  VSEYGVGRWTEIKRLQFSSSSHRTSVDLKDKWRNLLKASDTQLQNRRKVVLGRKQASQQVPESVLRRVRELA
        VS+YG G+W+EIK+  FSS S+RTSVDLKDKWRNLLK S  Q  +     L +K  S  +P  +L RVRELA
Subjt:  VSEYGVGRWTEIKRLQFSSSSHRTSVDLKDKWRNLLKASDTQLQNRRKVVLGRKQASQQVPESVLRRVRELA

AT1G72650.2 TRF-like 66.6e-2434.56Show/hide
Query:  KRLRKPPRRYSEESIEQKSRSTIKKSAHKASKDKSLPSESHRQNHCQKKLKAAPIIHKDKSFNGGCIQVPF-----------------------------
        KR+RKP RRY EE  E   +    KS    SKD+ L  +S  ++      K   +  +  S  G  I+VP+                             
Subjt:  KRLRKPPRRYSEESIEQKSRSTIKKSAHKASKDKSLPSESHRQNHCQKKLKAAPIIHKDKSFNGGCIQVPF-----------------------------

Query:  ----GLPIEEGQVHS--AKKRTCWDSEGMKDNRILCIKDKNDVESYSAESEDENTEDECITKGNNT----------QKGSSRRKHHISWTLSEVMKLVEG
             L +   Q+ S    + +   S            D+N+VE   +E + E   +   + GN++          Q G+ RRKHH +WTLSE+ KLVEG
Subjt:  ----GLPIEEGQVHS--AKKRTCWDSEGMKDNRILCIKDKNDVESYSAESEDENTEDECITKGNNT----------QKGSSRRKHHISWTLSEVMKLVEG

Query:  VSEYGVGRWTEIKRLQFSSSSHRTSVDLKDKWRNLLKASDTQLQNRRKVVLGRKQASQQVPESVLRRVRELA
        VS+YG G+W+EIK+  FSS S+RTSVDLKDKWRNLLK S  Q  +     L +K  S  +P  +L RVRELA
Subjt:  VSEYGVGRWTEIKRLQFSSSSHRTSVDLKDKWRNLLKASDTQLQNRRKVVLGRKQASQQVPESVLRRVRELA

AT2G37025.1 TRF-like 81.9e-2345.71Show/hide
Query:  ENTEDECITKGNNTQKGSSRRKHHISWTLSEVMKLVEGVSEYGVGRWTEIKRLQFSSSSHRTSVDLKDKWRNLLKASDTQLQNRRKVVLGRKQASQQVPE
        E+ +D  +TK + T+  S RRK+   WTL EVM LV+G+S +GVG+WT+IK   F  ++HR  VD++DKWRNLLKAS  +  N  +    RK  ++ +P+
Subjt:  ENTEDECITKGNNTQKGSSRRKHHISWTLSEVMKLVEGVSEYGVGRWTEIKRLQFSSSSHRTSVDLKDKWRNLLKASDTQLQNRRKVVLGRKQASQQVPE

Query:  SVLRRVRELAAIYPYPRENKSKESC----SSSAPSTSSFK
         +L RVRELA+++PYP    SK  C    SS + STS  K
Subjt:  SVLRRVRELAAIYPYPRENKSKESC----SSSAPSTSSFK

AT2G37025.2 TRF-like 81.9e-2345.71Show/hide
Query:  ENTEDECITKGNNTQKGSSRRKHHISWTLSEVMKLVEGVSEYGVGRWTEIKRLQFSSSSHRTSVDLKDKWRNLLKASDTQLQNRRKVVLGRKQASQQVPE
        E+ +D  +TK + T+  S RRK+   WTL EVM LV+G+S +GVG+WT+IK   F  ++HR  VD++DKWRNLLKAS  +  N  +    RK  ++ +P+
Subjt:  ENTEDECITKGNNTQKGSSRRKHHISWTLSEVMKLVEGVSEYGVGRWTEIKRLQFSSSSHRTSVDLKDKWRNLLKASDTQLQNRRKVVLGRKQASQQVPE

Query:  SVLRRVRELAAIYPYPRENKSKESC----SSSAPSTSSFK
         +L RVRELA+++PYP    SK  C    SS + STS  K
Subjt:  SVLRRVRELAAIYPYPRENKSKESC----SSSAPSTSSFK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATGCGGCATCCAAGGAAACTATGATGACTTCCAATCAGACTACGAAAATGTTTACTTATGAGTTTCCTGATGTTGACTCGGATTTATCTATTTTCCCTGTTCCAGA
AGAAGATGTCTTAGGAGTGGAACATTTCTTAGAACCTGACTTTAACAAATGCTTTTCTGGCAGTGTTCTAGATTTCAACACGTTCCACTCTCATAAGTGCTTAAGCATTG
AAAATTTTTCATTCGCTGTTGAATTTGATCAGATAAAGATTGATAGTGAATCATTGCATTCTAGCCTTACACTAGAAGGAGAGAGAACCAAGCAAGAGGACCATGATGAA
AAATCACAAGGCACAGATGATGTTGTCTGTTCTGGGATAAACAAACTTCCAGCAACGGTGAAGAGTCAAACTGCGGATGACATCTTCAATTTGGCTCCTGGTTCATGCAA
TGCTTCTTTTCTAGGCAATGTGAATTTTGGTAGCTACTCACAAGGGCATGATGTGGCTTTCAGCCAGAACGAAGGCAAACTTATGATCTCATCCAGAACTGGGTCGTCAG
GCTGTTCAGGTAGCTCTACTGCATGCTTGGATGATGAGAATATGTCAGATGACAGGCCTATGAAAAGGAAGCGACTTTCAAGTTGTGACTCCTCGAAGACAGGTTCTGAG
CTTAATGAATCTAAAATCATTCCCTTCCATGAAGGAAACAAAGTAGATACACTTGTTACAGAAAAGAGATTGCGAAAGCCGCCTAGGAGATACAGCGAAGAGTCAATTGA
ACAAAAGTCAAGATCTACTATCAAGAAGAGTGCTCATAAAGCTTCCAAGGATAAGTCTCTCCCCTCTGAATCTCACAGGCAGAATCACTGCCAAAAGAAACTTAAAGCAG
CACCGATCATTCATAAAGATAAATCGTTTAATGGAGGTTGTATTCAGGTTCCATTCGGTCTTCCGATAGAAGAAGGTCAAGTTCACTCGGCGAAAAAGAGAACATGTTGG
GACTCAGAGGGGATGAAGGATAATAGAATCTTGTGCATAAAAGATAAAAACGATGTGGAGTCTTACTCGGCCGAGTCAGAAGATGAGAATACTGAAGACGAGTGCATCAC
TAAAGGTAATAATACTCAAAAAGGCAGCAGTCGCAGGAAGCACCATATATCATGGACTCTTTCTGAGGTTATGAAGTTGGTGGAAGGTGTTTCAGAGTATGGTGTCGGAA
GGTGGACAGAAATTAAGAGGCTACAATTTTCATCATCTTCACATAGAACATCTGTGGATCTCAAGGACAAATGGAGAAATCTATTGAAGGCAAGTGACACACAGTTGCAG
AACAGAAGAAAGGTCGTCCTTGGTCGAAAGCAGGCATCGCAGCAAGTACCGGAATCAGTTCTTCGCCGAGTTCGGGAACTGGCCGCTATTTATCCATATCCAAGGGAAAA
CAAATCCAAGGAATCGTGCTCAAGCTCAGCCCCCTCAACTTCATCCTTCAAAACTACCAGTAACATGTTTGTTTCTTTGCCCACAGTTATGTGA
mRNA sequenceShow/hide mRNA sequence
ATGAATGCGGCATCCAAGGAAACTATGATGACTTCCAATCAGACTACGAAAATGTTTACTTATGAGTTTCCTGATGTTGACTCGGATTTATCTATTTTCCCTGTTCCAGA
AGAAGATGTCTTAGGAGTGGAACATTTCTTAGAACCTGACTTTAACAAATGCTTTTCTGGCAGTGTTCTAGATTTCAACACGTTCCACTCTCATAAGTGCTTAAGCATTG
AAAATTTTTCATTCGCTGTTGAATTTGATCAGATAAAGATTGATAGTGAATCATTGCATTCTAGCCTTACACTAGAAGGAGAGAGAACCAAGCAAGAGGACCATGATGAA
AAATCACAAGGCACAGATGATGTTGTCTGTTCTGGGATAAACAAACTTCCAGCAACGGTGAAGAGTCAAACTGCGGATGACATCTTCAATTTGGCTCCTGGTTCATGCAA
TGCTTCTTTTCTAGGCAATGTGAATTTTGGTAGCTACTCACAAGGGCATGATGTGGCTTTCAGCCAGAACGAAGGCAAACTTATGATCTCATCCAGAACTGGGTCGTCAG
GCTGTTCAGGTAGCTCTACTGCATGCTTGGATGATGAGAATATGTCAGATGACAGGCCTATGAAAAGGAAGCGACTTTCAAGTTGTGACTCCTCGAAGACAGGTTCTGAG
CTTAATGAATCTAAAATCATTCCCTTCCATGAAGGAAACAAAGTAGATACACTTGTTACAGAAAAGAGATTGCGAAAGCCGCCTAGGAGATACAGCGAAGAGTCAATTGA
ACAAAAGTCAAGATCTACTATCAAGAAGAGTGCTCATAAAGCTTCCAAGGATAAGTCTCTCCCCTCTGAATCTCACAGGCAGAATCACTGCCAAAAGAAACTTAAAGCAG
CACCGATCATTCATAAAGATAAATCGTTTAATGGAGGTTGTATTCAGGTTCCATTCGGTCTTCCGATAGAAGAAGGTCAAGTTCACTCGGCGAAAAAGAGAACATGTTGG
GACTCAGAGGGGATGAAGGATAATAGAATCTTGTGCATAAAAGATAAAAACGATGTGGAGTCTTACTCGGCCGAGTCAGAAGATGAGAATACTGAAGACGAGTGCATCAC
TAAAGGTAATAATACTCAAAAAGGCAGCAGTCGCAGGAAGCACCATATATCATGGACTCTTTCTGAGGTTATGAAGTTGGTGGAAGGTGTTTCAGAGTATGGTGTCGGAA
GGTGGACAGAAATTAAGAGGCTACAATTTTCATCATCTTCACATAGAACATCTGTGGATCTCAAGGACAAATGGAGAAATCTATTGAAGGCAAGTGACACACAGTTGCAG
AACAGAAGAAAGGTCGTCCTTGGTCGAAAGCAGGCATCGCAGCAAGTACCGGAATCAGTTCTTCGCCGAGTTCGGGAACTGGCCGCTATTTATCCATATCCAAGGGAAAA
CAAATCCAAGGAATCGTGCTCAAGCTCAGCCCCCTCAACTTCATCCTTCAAAACTACCAGTAACATGTTTGTTTCTTTGCCCACAGTTATGTGA
Protein sequenceShow/hide protein sequence
MNAASKETMMTSNQTTKMFTYEFPDVDSDLSIFPVPEEDVLGVEHFLEPDFNKCFSGSVLDFNTFHSHKCLSIENFSFAVEFDQIKIDSESLHSSLTLEGERTKQEDHDE
KSQGTDDVVCSGINKLPATVKSQTADDIFNLAPGSCNASFLGNVNFGSYSQGHDVAFSQNEGKLMISSRTGSSGCSGSSTACLDDENMSDDRPMKRKRLSSCDSSKTGSE
LNESKIIPFHEGNKVDTLVTEKRLRKPPRRYSEESIEQKSRSTIKKSAHKASKDKSLPSESHRQNHCQKKLKAAPIIHKDKSFNGGCIQVPFGLPIEEGQVHSAKKRTCW
DSEGMKDNRILCIKDKNDVESYSAESEDENTEDECITKGNNTQKGSSRRKHHISWTLSEVMKLVEGVSEYGVGRWTEIKRLQFSSSSHRTSVDLKDKWRNLLKASDTQLQ
NRRKVVLGRKQASQQVPESVLRRVRELAAIYPYPRENKSKESCSSSAPSTSSFKTTSNMFVSLPTVM