; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0037421 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0037421
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionHTH myb-type domain-containing protein
Genome locationchr2:6068411..6077148
RNA-Seq ExpressionLag0037421
SyntenyLag0037421
Gene Ontology termsNA
InterPro domainsIPR001005 - SANT/Myb domain
IPR009057 - Homeobox-like domain superfamily
IPR017930 - Myb domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6591699.1 hypothetical protein SDJN03_14045, partial [Cucurbita argyrosperma subsp. sororia]7.0e-23679.65Show/hide
Query:  MIERKEKRKKGTISKEDSSTLLERYSVRTILTLLQEVFRGSEGRIDWDKLVKNTSTGISNAREYQLLWRHLAYRHTLLENMDYVTDPVDDDSDLEFEIEP
        MIE KEK+KKGTIS EDSS +LERYSVRTI TLL+EV   SE RIDWDKLVKNTSTGISN REYQLLWRHLAYRHTLLEN+D VTDP+D DSDL+FEIEP
Subjt:  MIERKEKRKKGTISKEDSSTLLERYSVRTILTLLQEVFRGSEGRIDWDKLVKNTSTGISNAREYQLLWRHLAYRHTLLENMDYVTDPVDDDSDLEFEIEP

Query:  VPSVSSESSNEAAACVKVLIANGIPSESDIPSSSAVEAPLTIGI-SNSQSSRANLENPQSACLMQGMYVTIPISIQRQPIPTPSATEVFDVNGAAGSNAA
         PSVS+ES NEAAACVKVLIANGIPSESD+PSSS VEAPLTIGI SNS+S RA+LENPQSACLMQGMYVT+PISIQRQP+PTPSATEVFDVNGAAG NAA
Subjt:  VPSVSSESSNEAAACVKVLIANGIPSESDIPSSSAVEAPLTIGI-SNSQSSRANLENPQSACLMQGMYVTIPISIQRQPIPTPSATEVFDVNGAAGSNAA

Query:  SRKRRKPWTKAEDLELIAAVEKYGEGNWANILKEDFKGDRTASQLSQRWSIIRKRRGNLNMGANTTSTQISKAQIDAAHRALSLALDLPVNNSKT-ANSN
        SRKRRKPW+K EDLEL+AAVEKYGEGNWANILK DFKGDRTASQLSQRWSIIRKR GNLN+GANTTSTQISKAQIDAAHRALSLALDLPVNNSK+ ANSN
Subjt:  SRKRRKPWTKAEDLELIAAVEKYGEGNWANILKEDFKGDRTASQLSQRWSIIRKRRGNLNMGANTTSTQISKAQIDAAHRALSLALDLPVNNSKT-ANSN

Query:  INNSNFSSSSGVEAPFQMQNQSLQIPMPSRPVLVEPSPSVAKSGINTSKNSLTMKSTHNSDSIVRATAVAAGARIVSPSDAASLLKAAQTKNAIHIKSK-
        +N+S  SS+SG EAP Q+QNQS Q+ +PSRP+ V+P PS AKSGINT+KN+L MKSTHNSDSIV ATAVAAGARIVSPSDAASL+KAAQTKNAIHIKSK 
Subjt:  INNSNFSSSSGVEAPFQMQNQSLQIPMPSRPVLVEPSPSVAKSGINTSKNSLTMKSTHNSDSIVRATAVAAGARIVSPSDAASLLKAAQTKNAIHIKSK-

Query:  GTLIKSPVLGNAPMRSDARPSVHYISTGKTATPGSNFVGGKSTMVGNNPMKAVSPKVLHNRSTALLKNAPSDPISPATESPSKQEVQSSEECKISEPIVT
         + I+ PVLGNA    DARPSVHYISTG+TATPGSN+VGGKSTM GNN MK VSPK  +N STA+L N PS+ ISP TESP KQEV+SSEE KIS+PI+T
Subjt:  GTLIKSPVLGNAPMRSDARPSVHYISTGKTATPGSNFVGGKSTMVGNNPMKAVSPKVLHNRSTALLKNAPSDPISPATESPSKQEVQSSEECKISEPIVT

Query:  AKKESREDEAVIRDISVASQRSDGELGRLSTCIETHNTSLNMEIDENDLKAGCSKQVEMKTKANDVEIRG
         K + RE+  V+RD+  ASQ SD E G  STCIE  NTSLNM+IDEND+KA C KQ E K KANDV+I G
Subjt:  AKKESREDEAVIRDISVASQRSDGELGRLSTCIETHNTSLNMEIDENDLKAGCSKQVEMKTKANDVEIRG

KAG7024581.1 hypothetical protein SDJN02_13399, partial [Cucurbita argyrosperma subsp. argyrosperma]5.5e-24179.05Show/hide
Query:  LLLFLLLFICDFDSCSIMIERKEKRKKGTISKEDSSTLLERYSVRTILTLLQEVFRGSEGRIDWDKLVKNTSTGISNAREYQLLWRHLAYRHTLLENMDY
        LLL    F CDF + SIMIE KEK+KKGTIS EDSS +LERYSVRTI TLL+EV   SE RIDWDKLVKNTSTGISN REYQLLWRHLAYRHTLLEN+D 
Subjt:  LLLFLLLFICDFDSCSIMIERKEKRKKGTISKEDSSTLLERYSVRTILTLLQEVFRGSEGRIDWDKLVKNTSTGISNAREYQLLWRHLAYRHTLLENMDY

Query:  VTDPVDDDSDLEFEIEPVPSVSSESSNEAAACVKVLIANGIPSESDIPSSSAVEAPLTIGI-SNSQSSRANLENPQSACLMQGMYVTIPISIQRQPIPTP
        VTDP+D DSDL+FEIEP PSVS+ES NEAAACVKVLIANGIPSESD+PSSS VEAPLTIGI SNS+S RA+LENPQSACLMQGMYVT+PISIQRQP+PTP
Subjt:  VTDPVDDDSDLEFEIEPVPSVSSESSNEAAACVKVLIANGIPSESDIPSSSAVEAPLTIGI-SNSQSSRANLENPQSACLMQGMYVTIPISIQRQPIPTP

Query:  SATEVFDVNGAAGSNAASRKRRKPWTKAEDLELIAAVEKYGEGNWANILKEDFKGDRTASQLSQRWSIIRKRRGNLNMGANTTSTQISKAQIDAAHRALS
        SATEVFDVNGAAGSNAASRKRRKPW+K EDLEL+AAVEKYGEGNWANILK DFKGDRTASQLSQRWSIIRKR GNLN+GANTTSTQISKAQIDAAHRALS
Subjt:  SATEVFDVNGAAGSNAASRKRRKPWTKAEDLELIAAVEKYGEGNWANILKEDFKGDRTASQLSQRWSIIRKRRGNLNMGANTTSTQISKAQIDAAHRALS

Query:  LALDLPVNNSKT-ANSNINNSNFSSSSGVEAPFQMQNQSLQIPMPSRPVLVEPSPSVAKSGINTSKNSLTMKSTHNSDSIVRATAVAAGARIVSPSDAAS
        LALD PVNNSK+ ANSN+N+S  SS+SG EAP Q+QNQS Q+ +PSRP+ V+P PS AKSGINT+KN+L MKSTHNSDSIVRATAVAAGARIVSPSDAAS
Subjt:  LALDLPVNNSKT-ANSNINNSNFSSSSGVEAPFQMQNQSLQIPMPSRPVLVEPSPSVAKSGINTSKNSLTMKSTHNSDSIVRATAVAAGARIVSPSDAAS

Query:  LLKAAQTKNAIHIKSK-GTLIKSPVLGNAPMRSDARPSVHYISTGKTATPGSNFVGGKSTMVGNNPMKAVSPKVLHNRSTALLKNAPSDPISPATESPSK
        L+KAAQTKNAIHIKSK  + I+ P+LGNA    DARPSVHYISTG+TATPGSN+VGGKSTM GNN MK VSPK  +N STA+L N PS+ ISP TESP K
Subjt:  LLKAAQTKNAIHIKSK-GTLIKSPVLGNAPMRSDARPSVHYISTGKTATPGSNFVGGKSTMVGNNPMKAVSPKVLHNRSTALLKNAPSDPISPATESPSK

Query:  QEVQSSEECKISEPIVTAKKESREDEAVIRDISVASQRSDGELGRLSTCIETHNTSLNMEIDENDLKAGCSKQVEMKTKANDVEIRG
        QEV+SSEE KIS+PI+T K + RE+  V+RD+  ASQ SD E G  STCIE  NTSLNMEI+END+KA C KQ E K KANDV+IRG
Subjt:  QEVQSSEECKISEPIVTAKKESREDEAVIRDISVASQRSDGELGRLSTCIETHNTSLNMEIDENDLKAGCSKQVEMKTKANDVEIRG

XP_022976301.1 uncharacterized protein LOC111476736 isoform X2 [Cucurbita maxima]2.4e-22879.17Show/hide
Query:  MIERKEKRKKGTISKEDSSTLLERYSVRTILTLLQEVFRGSEGRIDWDKLVKNTSTGISNAREYQLLWRHLAYRHTLLENMDYVTDPVDDDSDLEFEIEP
        MIE KEK+KKGTIS EDS  +LERYSVRTI TLL+EV   SE RIDWDKLVKNTSTGISN REYQLLWRHLAYRHTLLEN+D VTDP+D DSDL+FEIEP
Subjt:  MIERKEKRKKGTISKEDSSTLLERYSVRTILTLLQEVFRGSEGRIDWDKLVKNTSTGISNAREYQLLWRHLAYRHTLLENMDYVTDPVDDDSDLEFEIEP

Query:  VPSVSSESSNEAAACVKVLIANGIPSESDIPSSSAVEAPLTIGI-SNSQSSRANLENPQSACLMQGMYVTIPISIQRQPIPTPSATEVFDVNGAAGSNAA
         PSVS+ES NEAAACVKVLIANGIPSESD+PSSS VEAPLTIGI SNS+S RA+LENPQSACLMQGMYVT+PISI+RQP+PTPSATEVFDVNGAAGSNAA
Subjt:  VPSVSSESSNEAAACVKVLIANGIPSESDIPSSSAVEAPLTIGI-SNSQSSRANLENPQSACLMQGMYVTIPISIQRQPIPTPSATEVFDVNGAAGSNAA

Query:  SRKRRKPWTKAEDLELIAAVEKYGEGNWANILKEDFKGDRTASQLSQRWSIIRKRRGNLNMGANTTSTQISKAQIDAAHRALSLALDLPVNNSKTANSNI
        SRKRRKPW+K +DLEL+AAVEKYGEGNWANILK DFKGDRTASQLSQRWSIIRKR GNLN+GANTTSTQISKAQIDAAHRALSLALDLPVN SK+ANSN+
Subjt:  SRKRRKPWTKAEDLELIAAVEKYGEGNWANILKEDFKGDRTASQLSQRWSIIRKRRGNLNMGANTTSTQISKAQIDAAHRALSLALDLPVNNSKTANSNI

Query:  NNSNFSSSSGVEAPFQMQNQSLQIPMPSRPVLVEPSPSVAKSGINTSKNSLTMKSTHNSDSIVRATAVAAGARIVSPSDAASLLKAAQTKNAIHIKSK-G
        N+S  SS+SG EAP Q+QNQS Q+ +P RP+ V+P P  AKSGINT KN+L MKSTHNSDSIVRATAVAAGARIVSP DAASL+K AQTKNAIHIKSK  
Subjt:  NNSNFSSSSGVEAPFQMQNQSLQIPMPSRPVLVEPSPSVAKSGINTSKNSLTMKSTHNSDSIVRATAVAAGARIVSPSDAASLLKAAQTKNAIHIKSK-G

Query:  TLIKSPVLGNAPMRSDARPSVHYISTGKTATPGSNFVGGKSTMVGNNPMKAVSPKVLHNRSTALLKNAPSDPISPATESPSKQEVQSSEECKISEPIVTA
        + I+ PVLGNA    DA+PSVHYISTG+TATPGSN+VGGKSTM GNN MK V+PK  +N STA+L N PS+ ISP TESP KQEV+SSEECKIS+PI+T+
Subjt:  TLIKSPVLGNAPMRSDARPSVHYISTGKTATPGSNFVGGKSTMVGNNPMKAVSPKVLHNRSTALLKNAPSDPISPATESPSKQEVQSSEECKISEPIVTA

Query:  KKESREDEAVIRDISVASQRSDGELGRLSTCIETHNTSLNMEIDENDLKAGC
        K +SRE+  V+RD+  AS  SD E G  STCIE  NTSLNMEIDEND+KA C
Subjt:  KKESREDEAVIRDISVASQRSDGELGRLSTCIETHNTSLNMEIDENDLKAGC

XP_023534835.1 uncharacterized protein LOC111796458 isoform X1 [Cucurbita pepo subsp. pepo]1.7e-23479.12Show/hide
Query:  MIERKEKRKKGTISKEDSSTLLERYSVRTILTLLQEVFRGSEGRIDWDKLVKNTSTGISNAREYQLLWRHLAYRHTLLENMDYVTDPVDDDSDLEFEIEP
        MIE KEK KKGTIS EDSS +LERYSVRTI TLL+EV   SE RIDWDKLVKNTSTGISN REYQLLWRHLAYRHTLLEN+D VTDP+D DSDL+FEIEP
Subjt:  MIERKEKRKKGTISKEDSSTLLERYSVRTILTLLQEVFRGSEGRIDWDKLVKNTSTGISNAREYQLLWRHLAYRHTLLENMDYVTDPVDDDSDLEFEIEP

Query:  VPSVSSESSNEAAACVKVLIANGIPSESDIPSSSAVEAPLTIGI-SNSQSSRANLENPQSACLMQGMYVTIPISIQRQPIPTPSATEVFDVNGAAGSNAA
         PSVS+ES NEAAACVKVLIANGIPSESD+PSSS VEAPLTIGI SNS+  RA+LENPQSACLMQGMYVT PIS+QRQP+PTPSATEVFDVNGAAG NAA
Subjt:  VPSVSSESSNEAAACVKVLIANGIPSESDIPSSSAVEAPLTIGI-SNSQSSRANLENPQSACLMQGMYVTIPISIQRQPIPTPSATEVFDVNGAAGSNAA

Query:  SRKRRKPWTKAEDLELIAAVEKYGEGNWANILKEDFKGDRTASQLSQRWSIIRKRRGNLNMGANTTSTQISKAQIDAAHRALSLALDLPVNNSKT-ANSN
        SRKRRKPW+K EDLEL+AAVEKYGEGNWANILK DFKGDRTASQLSQRWSIIRKRRG L++GANTTSTQ SKAQIDAAHRALSLALDLPVNNSK+ ANSN
Subjt:  SRKRRKPWTKAEDLELIAAVEKYGEGNWANILKEDFKGDRTASQLSQRWSIIRKRRGNLNMGANTTSTQISKAQIDAAHRALSLALDLPVNNSKT-ANSN

Query:  INNSNFSSSSGVEAPFQMQNQSLQIPMPSRPVLVEPSPSVAKSGINTSKNSLTMKSTHNSDSIVRATAVAAGARIVSPSDAASLLKAAQTKNAIHIKSK-
        +N+S  SS+SG EAP Q+QNQS Q+ +P RP+ V+P PS AKSGINT+KN+L MKSTHNSDSIVRATAVAAGARIVSPSDAASL+KAAQTKNAIHIKSK 
Subjt:  INNSNFSSSSGVEAPFQMQNQSLQIPMPSRPVLVEPSPSVAKSGINTSKNSLTMKSTHNSDSIVRATAVAAGARIVSPSDAASLLKAAQTKNAIHIKSK-

Query:  GTLIKSPVLGNAPMRSDARPSVHYISTGKTATPGSNFVGGKSTMVGNNPMKAVSPKVLHNRSTALLKNAPSDPISPATESPSKQEVQSSEECKISEPIVT
         + I+ PVLGNA    DARPSVHYISTG+TATPGSN+VGGKSTM G   MK VSPK  +N STA+  N PS+ ISP TESP KQEV+SSEECKIS+PI+T
Subjt:  GTLIKSPVLGNAPMRSDARPSVHYISTGKTATPGSNFVGGKSTMVGNNPMKAVSPKVLHNRSTALLKNAPSDPISPATESPSKQEVQSSEECKISEPIVT

Query:  AKKESREDEAVIRDISVASQRSDGELGRLSTCIETHNTSLNMEIDENDLKAGCSKQVEMKTKANDVEIRG
        +K + RE+  V+RD+  ASQ SD E G  STCIE  NTSLNMEIDEND+KA C KQ E KTKANDV+IRG
Subjt:  AKKESREDEAVIRDISVASQRSDGELGRLSTCIETHNTSLNMEIDENDLKAGCSKQVEMKTKANDVEIRG

XP_023534838.1 uncharacterized protein LOC111796458 isoform X2 [Cucurbita pepo subsp. pepo]7.0e-23679.26Show/hide
Query:  MIERKEKRKKGTISKEDSSTLLERYSVRTILTLLQEVFRGSEGRIDWDKLVKNTSTGISNAREYQLLWRHLAYRHTLLENMDYVTDPVDDDSDLEFEIEP
        MIE KEK KKGTIS EDSS +LERYSVRTI TLL+EV   SE RIDWDKLVKNTSTGISN REYQLLWRHLAYRHTLLEN+D VTDP+D DSDL+FEIEP
Subjt:  MIERKEKRKKGTISKEDSSTLLERYSVRTILTLLQEVFRGSEGRIDWDKLVKNTSTGISNAREYQLLWRHLAYRHTLLENMDYVTDPVDDDSDLEFEIEP

Query:  VPSVSSESSNEAAACVKVLIANGIPSESDIPSSSAVEAPLTIGI-SNSQSSRANLENPQSACLMQGMYVTIPISIQRQPIPTPSATEVFDVNGAAGSNAA
         PSVS+ES NEAAACVKVLIANGIPSESD+PSSS VEAPLTIGI SNS+  RA+LENPQSACLMQGMYVT PIS+QRQP+PTPSATEVFDVNGAAG NAA
Subjt:  VPSVSSESSNEAAACVKVLIANGIPSESDIPSSSAVEAPLTIGI-SNSQSSRANLENPQSACLMQGMYVTIPISIQRQPIPTPSATEVFDVNGAAGSNAA

Query:  SRKRRKPWTKAEDLELIAAVEKYGEGNWANILKEDFKGDRTASQLSQRWSIIRKRRGNLNMGANTTSTQISKAQIDAAHRALSLALDLPVNNSKTANSNI
        SRKRRKPW+K EDLEL+AAVEKYGEGNWANILK DFKGDRTASQLSQRWSIIRKRRG L++GANTTSTQ SKAQIDAAHRALSLALDLPVNNSK+ANSN+
Subjt:  SRKRRKPWTKAEDLELIAAVEKYGEGNWANILKEDFKGDRTASQLSQRWSIIRKRRGNLNMGANTTSTQISKAQIDAAHRALSLALDLPVNNSKTANSNI

Query:  NNSNFSSSSGVEAPFQMQNQSLQIPMPSRPVLVEPSPSVAKSGINTSKNSLTMKSTHNSDSIVRATAVAAGARIVSPSDAASLLKAAQTKNAIHIKSK-G
        N+S  SS+SG EAP Q+QNQS Q+ +P RP+ V+P PS AKSGINT+KN+L MKSTHNSDSIVRATAVAAGARIVSPSDAASL+KAAQTKNAIHIKSK  
Subjt:  NNSNFSSSSGVEAPFQMQNQSLQIPMPSRPVLVEPSPSVAKSGINTSKNSLTMKSTHNSDSIVRATAVAAGARIVSPSDAASLLKAAQTKNAIHIKSK-G

Query:  TLIKSPVLGNAPMRSDARPSVHYISTGKTATPGSNFVGGKSTMVGNNPMKAVSPKVLHNRSTALLKNAPSDPISPATESPSKQEVQSSEECKISEPIVTA
        + I+ PVLGNA    DARPSVHYISTG+TATPGSN+VGGKSTM G   MK VSPK  +N STA+  N PS+ ISP TESP KQEV+SSEECKIS+PI+T+
Subjt:  TLIKSPVLGNAPMRSDARPSVHYISTGKTATPGSNFVGGKSTMVGNNPMKAVSPKVLHNRSTALLKNAPSDPISPATESPSKQEVQSSEECKISEPIVTA

Query:  KKESREDEAVIRDISVASQRSDGELGRLSTCIETHNTSLNMEIDENDLKAGCSKQVEMKTKANDVEIRG
        K + RE+  V+RD+  ASQ SD E G  STCIE  NTSLNMEIDEND+KA C KQ E KTKANDV+IRG
Subjt:  KKESREDEAVIRDISVASQRSDGELGRLSTCIETHNTSLNMEIDENDLKAGCSKQVEMKTKANDVEIRG

TrEMBL top hitse value%identityAlignment
A0A6J1C5S4 uncharacterized protein LOC1110087031.1e-22677.87Show/hide
Query:  SIMIERKEKRKKGTISKEDS-STLLERYSVRTILTLLQEVFRGSEGRIDWDKLVKNTSTGISNAREYQLLWRHLAYRHTLLENMDYVTDPVDDDSDLEFE
        S+MIERKEK+KKG IS ED  STLLERYSVRTILTLL+EV + SE RIDWDKLVKNTSTGISNAREYQ+LWRHLAYRHTLLENMD +T P+DDDSDL+FE
Subjt:  SIMIERKEKRKKGTISKEDS-STLLERYSVRTILTLLQEVFRGSEGRIDWDKLVKNTSTGISNAREYQLLWRHLAYRHTLLENMDYVTDPVDDDSDLEFE

Query:  IEPVPSVSSESSNEAAACVKVLIANGIPSESDIPSSSAVEAPLTIGISNSQSSRANLENPQSACLMQGMYVTIPISIQRQPI-PTPS-ATEVFDVNGAAG
        IE  PSV+SES NEAAA VKVLIAN IPSESDIPSSSAVEAPLTIGISNSQSSRANLENPQS CL+Q MYV IPISIQRQPI  TP+ +TEVFDVNGAAG
Subjt:  IEPVPSVSSESSNEAAACVKVLIANGIPSESDIPSSSAVEAPLTIGISNSQSSRANLENPQSACLMQGMYVTIPISIQRQPI-PTPS-ATEVFDVNGAAG

Query:  SNAASRKRRKPWTKAEDLELIAAVEKYGEGNWANILKEDFKGDRTASQLSQRWSIIRKRRGNLNMGANTTSTQISKAQIDAAHRALSLALDLPVNNSKTA
         NAASRKRRKPW+KAED+ELIAAV+K GEGNWANILK DFKG+RTASQLSQRWSIIRKRR NLN+GAN T TQISKAQIDA HRALS ALDLPVNNSKT 
Subjt:  SNAASRKRRKPWTKAEDLELIAAVEKYGEGNWANILKEDFKGDRTASQLSQRWSIIRKRRGNLNMGANTTSTQISKAQIDAAHRALSLALDLPVNNSKTA

Query:  NSNINNSNFSSSSGVEAPFQMQNQSLQIPMPSRPVLVEPSPSVAKSGINTSKNSLTMKSTHNSDSIVRATAVAAGARIVSPSDAASLLKAAQTKNAIHIK
         SNIN+   SS+SG EAP QMQNQS QIP PSRPVLVEP PS  K GI+TSKN+L MKSTHNSDSIVRATAVAAGARIVSPSDAASLLKAAQ +NAIHIK
Subjt:  NSNINNSNFSSSSGVEAPFQMQNQSLQIPMPSRPVLVEPSPSVAKSGINTSKNSLTMKSTHNSDSIVRATAVAAGARIVSPSDAASLLKAAQTKNAIHIK

Query:  SK-GTLIKSPVLGNAPMRSDARPSVHYISTGKTATPGSNFVGGKSTMVGNNPMKAVSPKVLHNRST-ALLKNAPSDPISPATESPSKQEVQSSEECKISE
        S   + IK PV GNAP+ SD RP++HYISTGK A+PGSN+VGGK  +V NN +KA+SP VLH+RST A+L N  SD  SPATESPSK+E++SSEE K+ E
Subjt:  SK-GTLIKSPVLGNAPMRSDARPSVHYISTGKTATPGSNFVGGKSTMVGNNPMKAVSPKVLHNRST-ALLKNAPSDPISPATESPSKQEVQSSEECKISE

Query:  PIVTAKKESREDEAVIRDISVASQRSDGELGRLSTCIETHNTSLNMEIDENDLKAGCSKQVEMKTKANDVEIRG
        P+ T K+E+RE EAV R  + A++RSDGEL  LSTCIE HN S N EIDEN +KAGCS+QVE K  ANDVEIRG
Subjt:  PIVTAKKESREDEAVIRDISVASQRSDGELGRLSTCIETHNTSLNMEIDENDLKAGCSKQVEMKTKANDVEIRG

A0A6J1FAZ9 uncharacterized protein LOC111443670 isoform X22.1e-21781.82Show/hide
Query:  MIERKEKRKKGTISKEDSSTLLERYSVRTILTLLQEVFRGSEGRIDWDKLVKNTSTGISNAREYQLLWRHLAYRHTLLENMDYVTDPVDDDSDLEFEIEP
        MIE KEK+KKGTIS EDSS +LERYSVRTI TLL+EV   SE RIDWDKLVKNTSTGISN REYQLLWRHLAYRHTLLEN+D VTDP+D DSDL+FEIEP
Subjt:  MIERKEKRKKGTISKEDSSTLLERYSVRTILTLLQEVFRGSEGRIDWDKLVKNTSTGISNAREYQLLWRHLAYRHTLLENMDYVTDPVDDDSDLEFEIEP

Query:  VPSVSSESSNEAAACVKVLIANGIPSESDIPSSSAVEAPLTIGI-SNSQSSRANLENPQSACLMQGMYVTIPISIQRQPIPTPSATEVFDVNGAAGSNAA
         PSVS+ES NEAAACVKVLIANGIPSESD+PSSS VEAPLTIGI SNS+S RA+LENPQSACLMQGMYVT+PISIQRQP+PTPSATEVFDVNGAAGSNAA
Subjt:  VPSVSSESSNEAAACVKVLIANGIPSESDIPSSSAVEAPLTIGI-SNSQSSRANLENPQSACLMQGMYVTIPISIQRQPIPTPSATEVFDVNGAAGSNAA

Query:  SRKRRKPWTKAEDLELIAAVEKYGEGNWANILKEDFKGDRTASQLSQRWSIIRKRRGNLNMGANTTSTQISKAQIDAAHRALSLALDLPVNNSKTANSNI
        SRKRRKPW+K EDLEL+AAVEKYGEGNWANILK DFKGDRTASQLSQRWSIIRKR GNLN+GANTTSTQISKAQIDAAHRALSLALDLPVNNSK+ANSN+
Subjt:  SRKRRKPWTKAEDLELIAAVEKYGEGNWANILKEDFKGDRTASQLSQRWSIIRKRRGNLNMGANTTSTQISKAQIDAAHRALSLALDLPVNNSKTANSNI

Query:  NNSNFSSSSGVEAPFQMQNQSLQIPMPSRPVLVEPSPSVAKSGINTSKNSLTMKSTHNSDSIVRATAVAAGARIVSPSDAASLLKAAQTKNAIHIKSK-G
        N+S  SS+SG EAP Q+QNQS Q+ +PSRP+ V+P PS AKSGINT+KN+L MKSTHNSDSIVRATAVAAGARIVSPSDAASL+KAAQTKNAIHIKSK  
Subjt:  NNSNFSSSSGVEAPFQMQNQSLQIPMPSRPVLVEPSPSVAKSGINTSKNSLTMKSTHNSDSIVRATAVAAGARIVSPSDAASLLKAAQTKNAIHIKSK-G

Query:  TLIKSPVLGNAPMRSDARPSVHYISTGKTATPGSNFVGGKSTMVGNNPMKAVSPKVLHNRSTALLKNAPSDPISPATESPSKQEVQSSEECKISEPIVTA
        + I+ P+LGNA    DARPSVHYISTG+TATPG+N+VGGKSTM GNN MK VSPK  +N STA+L N PS+ ISP TESP KQEV+SSEE KIS+PI+T 
Subjt:  TLIKSPVLGNAPMRSDARPSVHYISTGKTATPGSNFVGGKSTMVGNNPMKAVSPKVLHNRSTALLKNAPSDPISPATESPSKQEVQSSEECKISEPIVTA

Query:  KKESRE
        K + RE
Subjt:  KKESRE

A0A6J1FGE2 uncharacterized protein LOC111443670 isoform X15.1e-21681.66Show/hide
Query:  MIERKEKRKKGTISKEDSSTLLERYSVRTILTLLQEVFRGSEGRIDWDKLVKNTSTGISNAREYQLLWRHLAYRHTLLENMDYVTDPVDDDSDLEFEIEP
        MIE KEK+KKGTIS EDSS +LERYSVRTI TLL+EV   SE RIDWDKLVKNTSTGISN REYQLLWRHLAYRHTLLEN+D VTDP+D DSDL+FEIEP
Subjt:  MIERKEKRKKGTISKEDSSTLLERYSVRTILTLLQEVFRGSEGRIDWDKLVKNTSTGISNAREYQLLWRHLAYRHTLLENMDYVTDPVDDDSDLEFEIEP

Query:  VPSVSSESSNEAAACVKVLIANGIPSESDIPSSSAVEAPLTIGI-SNSQSSRANLENPQSACLMQGMYVTIPISIQRQPIPTPSATEVFDVNGAAGSNAA
         PSVS+ES NEAAACVKVLIANGIPSESD+PSSS VEAPLTIGI SNS+S RA+LENPQSACLMQGMYVT+PISIQRQP+PTPSATEVFDVNGAAGSNAA
Subjt:  VPSVSSESSNEAAACVKVLIANGIPSESDIPSSSAVEAPLTIGI-SNSQSSRANLENPQSACLMQGMYVTIPISIQRQPIPTPSATEVFDVNGAAGSNAA

Query:  SRKRRKPWTKAEDLELIAAVEKYGEGNWANILKEDFKGDRTASQLSQRWSIIRKRRGNLNMGANTTSTQISKAQIDAAHRALSLALDLPVNNSKT-ANSN
        SRKRRKPW+K EDLEL+AAVEKYGEGNWANILK DFKGDRTASQLSQRWSIIRKR GNLN+GANTTSTQISKAQIDAAHRALSLALDLPVNNSK+ ANSN
Subjt:  SRKRRKPWTKAEDLELIAAVEKYGEGNWANILKEDFKGDRTASQLSQRWSIIRKRRGNLNMGANTTSTQISKAQIDAAHRALSLALDLPVNNSKT-ANSN

Query:  INNSNFSSSSGVEAPFQMQNQSLQIPMPSRPVLVEPSPSVAKSGINTSKNSLTMKSTHNSDSIVRATAVAAGARIVSPSDAASLLKAAQTKNAIHIKSK-
        +N+S  SS+SG EAP Q+QNQS Q+ +PSRP+ V+P PS AKSGINT+KN+L MKSTHNSDSIVRATAVAAGARIVSPSDAASL+KAAQTKNAIHIKSK 
Subjt:  INNSNFSSSSGVEAPFQMQNQSLQIPMPSRPVLVEPSPSVAKSGINTSKNSLTMKSTHNSDSIVRATAVAAGARIVSPSDAASLLKAAQTKNAIHIKSK-

Query:  GTLIKSPVLGNAPMRSDARPSVHYISTGKTATPGSNFVGGKSTMVGNNPMKAVSPKVLHNRSTALLKNAPSDPISPATESPSKQEVQSSEECKISEPIVT
         + I+ P+LGNA    DARPSVHYISTG+TATPG+N+VGGKSTM GNN MK VSPK  +N STA+L N PS+ ISP TESP KQEV+SSEE KIS+PI+T
Subjt:  GTLIKSPVLGNAPMRSDARPSVHYISTGKTATPGSNFVGGKSTMVGNNPMKAVSPKVLHNRSTALLKNAPSDPISPATESPSKQEVQSSEECKISEPIVT

Query:  AKKESRE
         K + RE
Subjt:  AKKESRE

A0A6J1IGI9 uncharacterized protein LOC111476736 isoform X21.2e-22879.17Show/hide
Query:  MIERKEKRKKGTISKEDSSTLLERYSVRTILTLLQEVFRGSEGRIDWDKLVKNTSTGISNAREYQLLWRHLAYRHTLLENMDYVTDPVDDDSDLEFEIEP
        MIE KEK+KKGTIS EDS  +LERYSVRTI TLL+EV   SE RIDWDKLVKNTSTGISN REYQLLWRHLAYRHTLLEN+D VTDP+D DSDL+FEIEP
Subjt:  MIERKEKRKKGTISKEDSSTLLERYSVRTILTLLQEVFRGSEGRIDWDKLVKNTSTGISNAREYQLLWRHLAYRHTLLENMDYVTDPVDDDSDLEFEIEP

Query:  VPSVSSESSNEAAACVKVLIANGIPSESDIPSSSAVEAPLTIGI-SNSQSSRANLENPQSACLMQGMYVTIPISIQRQPIPTPSATEVFDVNGAAGSNAA
         PSVS+ES NEAAACVKVLIANGIPSESD+PSSS VEAPLTIGI SNS+S RA+LENPQSACLMQGMYVT+PISI+RQP+PTPSATEVFDVNGAAGSNAA
Subjt:  VPSVSSESSNEAAACVKVLIANGIPSESDIPSSSAVEAPLTIGI-SNSQSSRANLENPQSACLMQGMYVTIPISIQRQPIPTPSATEVFDVNGAAGSNAA

Query:  SRKRRKPWTKAEDLELIAAVEKYGEGNWANILKEDFKGDRTASQLSQRWSIIRKRRGNLNMGANTTSTQISKAQIDAAHRALSLALDLPVNNSKTANSNI
        SRKRRKPW+K +DLEL+AAVEKYGEGNWANILK DFKGDRTASQLSQRWSIIRKR GNLN+GANTTSTQISKAQIDAAHRALSLALDLPVN SK+ANSN+
Subjt:  SRKRRKPWTKAEDLELIAAVEKYGEGNWANILKEDFKGDRTASQLSQRWSIIRKRRGNLNMGANTTSTQISKAQIDAAHRALSLALDLPVNNSKTANSNI

Query:  NNSNFSSSSGVEAPFQMQNQSLQIPMPSRPVLVEPSPSVAKSGINTSKNSLTMKSTHNSDSIVRATAVAAGARIVSPSDAASLLKAAQTKNAIHIKSK-G
        N+S  SS+SG EAP Q+QNQS Q+ +P RP+ V+P P  AKSGINT KN+L MKSTHNSDSIVRATAVAAGARIVSP DAASL+K AQTKNAIHIKSK  
Subjt:  NNSNFSSSSGVEAPFQMQNQSLQIPMPSRPVLVEPSPSVAKSGINTSKNSLTMKSTHNSDSIVRATAVAAGARIVSPSDAASLLKAAQTKNAIHIKSK-G

Query:  TLIKSPVLGNAPMRSDARPSVHYISTGKTATPGSNFVGGKSTMVGNNPMKAVSPKVLHNRSTALLKNAPSDPISPATESPSKQEVQSSEECKISEPIVTA
        + I+ PVLGNA    DA+PSVHYISTG+TATPGSN+VGGKSTM GNN MK V+PK  +N STA+L N PS+ ISP TESP KQEV+SSEECKIS+PI+T+
Subjt:  TLIKSPVLGNAPMRSDARPSVHYISTGKTATPGSNFVGGKSTMVGNNPMKAVSPKVLHNRSTALLKNAPSDPISPATESPSKQEVQSSEECKISEPIVTA

Query:  KKESREDEAVIRDISVASQRSDGELGRLSTCIETHNTSLNMEIDENDLKAGC
        K +SRE+  V+RD+  AS  SD E G  STCIE  NTSLNMEIDEND+KA C
Subjt:  KKESREDEAVIRDISVASQRSDGELGRLSTCIETHNTSLNMEIDENDLKAGC

A0A6J1IN48 uncharacterized protein LOC111476736 isoform X12.9e-22779.02Show/hide
Query:  MIERKEKRKKGTISKEDSSTLLERYSVRTILTLLQEVFRGSEGRIDWDKLVKNTSTGISNAREYQLLWRHLAYRHTLLENMDYVTDPVDDDSDLEFEIEP
        MIE KEK+KKGTIS EDS  +LERYSVRTI TLL+EV   SE RIDWDKLVKNTSTGISN REYQLLWRHLAYRHTLLEN+D VTDP+D DSDL+FEIEP
Subjt:  MIERKEKRKKGTISKEDSSTLLERYSVRTILTLLQEVFRGSEGRIDWDKLVKNTSTGISNAREYQLLWRHLAYRHTLLENMDYVTDPVDDDSDLEFEIEP

Query:  VPSVSSESSNEAAACVKVLIANGIPSESDIPSSSAVEAPLTIGI-SNSQSSRANLENPQSACLMQGMYVTIPISIQRQPIPTPSATEVFDVNGAAGSNAA
         PSVS+ES NEAAACVKVLIANGIPSESD+PSSS VEAPLTIGI SNS+S RA+LENPQSACLMQGMYVT+PISI+RQP+PTPSATEVFDVNGAAGSNAA
Subjt:  VPSVSSESSNEAAACVKVLIANGIPSESDIPSSSAVEAPLTIGI-SNSQSSRANLENPQSACLMQGMYVTIPISIQRQPIPTPSATEVFDVNGAAGSNAA

Query:  SRKRRKPWTKAEDLELIAAVEKYGEGNWANILKEDFKGDRTASQLSQRWSIIRKRRGNLNMGANTTSTQISKAQIDAAHRALSLALDLPVNNSKT-ANSN
        SRKRRKPW+K +DLEL+AAVEKYGEGNWANILK DFKGDRTASQLSQRWSIIRKR GNLN+GANTTSTQISKAQIDAAHRALSLALDLPVN SK+ ANSN
Subjt:  SRKRRKPWTKAEDLELIAAVEKYGEGNWANILKEDFKGDRTASQLSQRWSIIRKRRGNLNMGANTTSTQISKAQIDAAHRALSLALDLPVNNSKT-ANSN

Query:  INNSNFSSSSGVEAPFQMQNQSLQIPMPSRPVLVEPSPSVAKSGINTSKNSLTMKSTHNSDSIVRATAVAAGARIVSPSDAASLLKAAQTKNAIHIKSK-
        +N+S  SS+SG EAP Q+QNQS Q+ +P RP+ V+P P  AKSGINT KN+L MKSTHNSDSIVRATAVAAGARIVSP DAASL+K AQTKNAIHIKSK 
Subjt:  INNSNFSSSSGVEAPFQMQNQSLQIPMPSRPVLVEPSPSVAKSGINTSKNSLTMKSTHNSDSIVRATAVAAGARIVSPSDAASLLKAAQTKNAIHIKSK-

Query:  GTLIKSPVLGNAPMRSDARPSVHYISTGKTATPGSNFVGGKSTMVGNNPMKAVSPKVLHNRSTALLKNAPSDPISPATESPSKQEVQSSEECKISEPIVT
         + I+ PVLGNA    DA+PSVHYISTG+TATPGSN+VGGKSTM GNN MK V+PK  +N STA+L N PS+ ISP TESP KQEV+SSEECKIS+PI+T
Subjt:  GTLIKSPVLGNAPMRSDARPSVHYISTGKTATPGSNFVGGKSTMVGNNPMKAVSPKVLHNRSTALLKNAPSDPISPATESPSKQEVQSSEECKISEPIVT

Query:  AKKESREDEAVIRDISVASQRSDGELGRLSTCIETHNTSLNMEIDENDLKAGC
        +K +SRE+  V+RD+  AS  SD E G  STCIE  NTSLNMEIDEND+KA C
Subjt:  AKKESREDEAVIRDISVASQRSDGELGRLSTCIETHNTSLNMEIDENDLKAGC

SwissProt top hitse value%identityAlignment
O35144 Telomeric repeat-binding factor 25.9e-0435.29Show/hide
Query:  EVFDVNGAAGSNAASRKRRKPWTKAEDLELIAAVEKYGEGNWANILKEDFKGDRTASQLSQRWSIIRK
        ++F+V       ++S  R++ WT  E   +   V KYGEGNWA I K     +RTA  +  RW  ++K
Subjt:  EVFDVNGAAGSNAASRKRRKPWTKAEDLELIAAVEKYGEGNWANILKEDFKGDRTASQLSQRWSIIRK

P70371 Telomeric repeat-binding factor 12.7e-0440Show/hide
Query:  ASRKRRKPWTKAEDLELIAAVEKYGEGNWANILKEDFKGDRTASQLSQRWSIIRK
        + R++R+ W   ED  L   V+KYGEGNWA IL      +RT+  L  RW  +++
Subjt:  ASRKRRKPWTKAEDLELIAAVEKYGEGNWANILKEDFKGDRTASQLSQRWSIIRK

Arabidopsis top hitse value%identityAlignment
AT1G09710.1 Homeodomain-like superfamily protein5.6e-5835.57Show/hide
Query:  RKKGTISKEDSSTLLERYSVRTILTLLQEVFRGSEGRIDWDKLVKNTSTGISNAREYQLLWRHLAYRHTLLENMDYVTDPVDDDSDLEFEIEPVPSVSSE
        R+K  I++ D +TLL RY + TIL +LQE+   SE ++DW+ LVK T+TGI+NAREYQLLWRHL+YRH LL   D    P+DDDSD+E E+E  P+VS E
Subjt:  RKKGTISKEDSSTLLERYSVRTILTLLQEVFRGSEGRIDWDKLVKNTSTGISNAREYQLLWRHLAYRHTLLENMDYVTDPVDDDSDLEFEIEPVPSVSSE

Query:  SSNEAAACVKVLIANGIPSESDIPSSSAVEAPLTIGISNS--QSSRANLENPQSACLMQGMYVTIPISIQRQPIPTPSATEVFDVNGAAGSNAASRKRRK
        +S EA A VKV+ A+ + SESDI   S VEAPLTI I  +  + S+   E+P S+   +GM +  P+ +Q+      ++TE  + NG+AG + A R++RK
Subjt:  SSNEAAACVKVLIANGIPSESDIPSSSAVEAPLTIGISNS--QSSRANLENPQSACLMQGMYVTIPISIQRQPIPTPSATEVFDVNGAAGSNAASRKRRK

Query:  PWTKAEDLELIAAVEKYGEGNWANILKEDFKGDRTASQLSQRWSIIRKRRGNLNMGANTTSTQISKAQIDAAHRALSLALDLPVNNSKTANSNINNSNFS
         W+  ED EL AAV++ GEGNWA+I+K DF+G+RTASQLSQRW++IRK R + +   +    Q ++A++ A + ALSLAL     ++K A   +  ++  
Subjt:  PWTKAEDLELIAAVEKYGEGNWANILKEDFKGDRTASQLSQRWSIIRKRRGNLNMGANTTSTQISKAQIDAAHRALSLALDLPVNNSKTANSNINNSNFS

Query:  SSSGVEAPFQMQNQSLQIPMPSRPVLVEPSPSVAKSGINTSKNSLTMKSTHNSDSIVRATAVAAGARIVSPSDAASLLKAAQTK-NAIHI-KSKGTLIKS
        + +  EA     +Q  Q   P    L     S+  +     K + T  ST  SD +V A +VAA A +     AAS  K    K +A  + K+K     S
Subjt:  SSSGVEAPFQMQNQSLQIPMPSRPVLVEPSPSVAKSGINTSKNSLTMKSTHNSDSIVRATAVAAGARIVSPSDAASLLKAAQTK-NAIHI-KSKGTLIKS

Query:  PVLGNAPMRSDARPSVHYISTGKTATPGSNFVGGKSTMVGNNPMKAV-------SPKVLHNRSTALLKNAPSDPISPATESPSKQEVQS----SEECKIS
         V    P  S + P V          PG++ V      + N  +K V        P ++  RS      + S P++  +   S Q V +    + E    
Subjt:  PVLGNAPMRSDARPSVHYISTGKTATPGSNFVGGKSTMVGNNPMKAV-------SPKVLHNRSTALLKNAPSDPISPATESPSKQEVQS----SEECKIS

Query:  EPIVTAKKESREDE----AVIRDISVASQRSDGELGRLSTCIETHNTSLNMEIDENDLKAGCSKQVEMKTKAND---VEIRG
        +P    K ++R +E    A I+   + S+  +   G+ +T  ++ N  L  ++        C+   ++  K +D   V IRG
Subjt:  EPIVTAKKESREDE----AVIRDISVASQRSDGELGRLSTCIETHNTSLNMEIDENDLKAGCSKQVEMKTKAND---VEIRG

AT1G09710.2 Homeodomain-like superfamily protein2.4e-5636.05Show/hide
Query:  RKKGTISKEDSSTLLERYSVRTILTLLQEVFRGSEGRIDWDKLVKNTSTGISNAREYQLLWRHLAYRHTLLENMDYVTDPVDDDSDLEFEIEPVPSVSSE
        R+K  I++ D +TLL RY + TIL +LQE+   SE ++DW+ LVK T+TGI+NAREYQLLWRHL+YRH LL   D    P+DDDSD+E E+E  P+VS E
Subjt:  RKKGTISKEDSSTLLERYSVRTILTLLQEVFRGSEGRIDWDKLVKNTSTGISNAREYQLLWRHLAYRHTLLENMDYVTDPVDDDSDLEFEIEPVPSVSSE

Query:  SSNEAAACVKVLIANGIPSESDIPSSSAVEAPLTIGISNS--QSSRANLENPQSACLMQGMYVTIPISIQRQPIPTPSATEVFDVNGAAGSNAASRKRRK
        +S EA A VKV+ A+ + SESDI   S VEAPLTI I  +  + S+   E+P S+   +GM +  P+ +Q+      ++TE  + NG+AG + A R++RK
Subjt:  SSNEAAACVKVLIANGIPSESDIPSSSAVEAPLTIGISNS--QSSRANLENPQSACLMQGMYVTIPISIQRQPIPTPSATEVFDVNGAAGSNAASRKRRK

Query:  PWTKAEDLELIAAVEKYGEGNWANILKEDFKGDRTASQLSQRWSIIRKRRGNLNMGANTTSTQISKAQIDAAHRALSLALDLPVNNSKTANSNINNSNFS
         W+  ED EL AAV++ GEGNWA+I+K DF+G+RTASQLSQRW++IRK R + +   +    Q ++A++ A + ALSLAL     ++K A    +  +F 
Subjt:  PWTKAEDLELIAAVEKYGEGNWANILKEDFKGDRTASQLSQRWSIIRKRRGNLNMGANTTSTQISKAQIDAAHRALSLALDLPVNNSKTANSNINNSNFS

Query:  SSSGVEA---------PFQMQNQSLQIPMPSRPVLVEPSPSVAKSGINTSKNSLTMKSTHNSDSIVRATAVAAGARIVSPSDAASLLKAAQTKNAIHIKS
        ++S +           P    NQ L        +    S ++ ++  N   +S   +    S  IV+A   A  +   + S       A+ T  +  + +
Subjt:  SSSGVEA---------PFQMQNQSLQIPMPSRPVLVEPSPSVAKSGINTSKNSLTMKSTHNSDSIVRATAVAAGARIVSPSDAASLLKAAQTKNAIHIKS

Query:  KGTLIKSPVLGNAPMRSDARPSVHYISTGKT---ATPGSNFVGGKSTMVGNNPMKAVS-PKVLHNRSTAL-LKNAPSDPISPATESPSKQE
          ++  +  +G+    +  R     +  GKT     P +  V   ST+    P  ++S PKV    S A  +++  +  + P   S S  +
Subjt:  KGTLIKSPVLGNAPMRSDARPSVHYISTGKT---ATPGSNFVGGKSTMVGNNPMKAVS-PKVLHNRSTAL-LKNAPSDPISPATESPSKQE

AT1G58220.1 Homeodomain-like superfamily protein2.5e-5838.14Show/hide
Query:  KRKKGTISKEDSSTLLERYSVRTILTLLQEVFRGSEGRIDWDKLVKNTSTGISNAREYQLLWRHLAYRHTLLENMDYVTDPVDDDSDLEFEIEPVPSVSS
        K++K  IS+ D +TLL+RY   TIL LLQE+   +E +++W++LVK TSTGI++AREYQLLWRHLAYR +L+  +      +DDDSD+E E+E  P VS 
Subjt:  KRKKGTISKEDSSTLLERYSVRTILTLLQEVFRGSEGRIDWDKLVKNTSTGISNAREYQLLWRHLAYRHTLLENMDYVTDPVDDDSDLEFEIEPVPSVSS

Query:  ESSNEAAACVKVLIANGIPSESDIPSSSAVEAPLTIGISNSQSSRANLENPQSACLMQGMYVTIPISIQRQPIPTPSATEVFDVNGAAGSNAASRKRRKP
        +   EA A VKV+ A+ +PSESDIP  S VEAPLTI I  S   R   E   S    +GM +T P+ +       P A E  + NG A S+ A RKRRK 
Subjt:  ESSNEAAACVKVLIANGIPSESDIPSSSAVEAPLTIGISNSQSSRANLENPQSACLMQGMYVTIPISIQRQPIPTPSATEVFDVNGAAGSNAASRKRRKP

Query:  WTKAEDLELIAAVEKYGEGNWANILKEDFKGDRTASQLSQRWSIIRKRRGNLNMGANTTSTQISKAQIDAAHRALSLALDLPVNNSKTANSNINNSNFSS
        W+  ED ELIAAV+++GEG+WA I KE+F+G+RTASQLSQRW  IR+R    N  +  T  Q ++AQ+ AA+RALSLA+   + + K A       +  +
Subjt:  WTKAEDLELIAAVEKYGEGNWANILKEDFKGDRTASQLSQRWSIIRKRRGNLNMGANTTSTQISKAQIDAAHRALSLALDLPVNNSKTANSNINNSNFSS

Query:  SSGVEAPFQMQNQSL---QIPMPSRPVLVEPSPS--VAKSGINTSKNSLTMKSTHNSDSIVRATAVAAGARIVSPSDAASLLKAAQTKNAIH--------
          G +A       +L   Q P P    L   + S  VAKS +   K   T  ST  +D +V A +VAA A +   + A ++ K    KNA+         
Subjt:  SSGVEAPFQMQNQSL---QIPMPSRPVLVEPSPS--VAKSGINTSKNSLTMKSTHNSDSIVRATAVAAGARIVSPSDAASLLKAAQTKNAIH--------

Query:  IKSKGTL----------------IKSPVLGNAPMRSDARPSVHYISTGKTATPGSNFVGGKSTMVGNNPMKAVSPKVLHNRSTALLKN--APSDPISPAT
        +K+  T+                +K+ V  + P RS    S   +   KTA   ++ +   S M+        +PKV   ++TA + +   PS  IS   
Subjt:  IKSKGTL----------------IKSPVLGNAPMRSDARPSVHYISTGKTATPGSNFVGGKSTMVGNNPMKAVSPKVLHNRSTALLKN--APSDPISPAT

Query:  ESPSKQEVQSSEECKISEPIVTAKKES
          P K    ++   + S  I   K ES
Subjt:  ESPSKQEVQSSEECKISEPIVTAKKES


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTGAGCACCGGTCGGCTCGTCCCTTCTGCCGGCGATGCGCTCCTGGCCTTAACTGGCCGGGTCGTGCCTCCGGCGCTACTGGTCCAAGTACTCCATTATAAGAGGTC
CACAGACAGAGCTGATGAAGCAAAAGATTTGGTTGCTGTGAGCCCCACTTCTCTGCCTGCGCGTCTATCATCGAACCATGCACCCTCACTACCGTCGCCGATAGCAATAA
GCCCATTCCCGCTACAACTGCAACGCTTCGTCCTCAAGCAATATCCGAGAAAGCGATTTCAGTCTCATCGGCGACGGTCGCTGCCGTTTCACCTGCTTCTCTTGTTTTTG
CTGCTCTTCATATGCGATTTCGATTCGTGCTCTATAATGATTGAGAGGAAAGAGAAGCGAAAGAAAGGGACAATTAGTAAGGAAGATAGTTCCACTCTATTGGAAAGATA
TTCAGTTAGGACGATACTGACATTGCTTCAGGAGGTGTTCCGAGGTTCGGAAGGGCGAATTGATTGGGACAAGTTGGTGAAGAATACGTCGACTGGGATTTCTAATGCTC
GGGAATATCAGTTGTTATGGCGGCATTTGGCTTATCGTCACACGTTGCTAGAGAACATGGATTATGTTACTGATCCAGTGGATGATGATAGTGACTTAGAGTTTGAAATA
GAACCTGTTCCATCTGTCAGCAGTGAGTCCTCGAATGAAGCTGCAGCATGTGTGAAGGTATTGATTGCTAATGGTATACCGAGTGAGTCAGATATTCCAAGCAGTTCTGC
AGTTGAGGCCCCATTGACTATAGGTATATCCAATAGTCAATCATCTAGAGCCAATCTTGAAAATCCTCAATCTGCTTGTTTGATGCAAGGGATGTATGTTACAATTCCAA
TTTCCATTCAGAGACAGCCAATTCCAACACCATCAGCAACTGAAGTATTTGATGTGAATGGAGCAGCTGGTAGTAATGCAGCTTCTCGAAAAAGAAGAAAACCTTGGACA
AAGGCAGAGGATTTGGAATTGATTGCTGCTGTGGAAAAGTATGGTGAAGGTAACTGGGCGAATATCTTGAAAGAAGACTTCAAGGGGGATAGAACTGCTTCACAGCTATC
TCAGAGGTGGTCCATTATTAGGAAGCGACGTGGTAATTTGAATATGGGAGCTAACACCACAAGTACTCAGATATCCAAAGCTCAGATTGATGCTGCACACCGTGCATTGT
CCCTTGCCCTTGATTTGCCTGTGAATAACTCAAAAACAGCAAATTCAAATATAAACAATAGCAATTTCTCTTCTTCAAGTGGTGTTGAAGCTCCATTTCAAATGCAGAAT
CAGTCTCTACAGATTCCCATGCCTTCAAGGCCGGTGCTGGTAGAGCCTTCACCTTCAGTAGCAAAATCTGGAATTAACACTTCCAAGAACTCGTTGACGATGAAGTCTAC
TCACAATTCTGATTCTATAGTTAGAGCAACTGCAGTAGCTGCGGGGGCACGGATTGTTTCTCCATCAGATGCTGCATCCCTACTGAAGGCTGCACAGACAAAAAATGCCA
TCCACATAAAGTCCAAAGGCACTTTGATCAAATCACCTGTGCTTGGTAATGCACCGATGCGCTCGGACGCACGCCCCAGTGTACATTATATTTCCACAGGAAAAACAGCA
ACTCCAGGCTCAAATTTTGTGGGTGGTAAATCTACCATGGTTGGTAATAACCCAATGAAAGCTGTCTCACCAAAAGTTCTGCACAATCGTTCTACTGCTCTTTTGAAAAA
TGCACCATCAGACCCAATAAGCCCAGCAACTGAATCTCCATCGAAGCAAGAGGTTCAGAGTTCAGAAGAATGCAAAATTTCCGAGCCAATTGTTACAGCGAAAAAAGAGT
CTCGAGAAGATGAAGCTGTTATAAGAGACATCTCTGTTGCTTCACAAAGATCAGATGGGGAATTGGGAAGACTTTCAACTTGCATTGAGACTCACAATACTTCTTTGAAT
ATGGAGATAGATGAAAATGATCTTAAAGCAGGATGTTCCAAGCAGGTCGAAATGAAAACGAAGGCAAATGATGTCGAGATTAGGGGATGA
mRNA sequenceShow/hide mRNA sequence
ATGGTGAGCACCGGTCGGCTCGTCCCTTCTGCCGGCGATGCGCTCCTGGCCTTAACTGGCCGGGTCGTGCCTCCGGCGCTACTGGTCCAAGTACTCCATTATAAGAGGTC
CACAGACAGAGCTGATGAAGCAAAAGATTTGGTTGCTGTGAGCCCCACTTCTCTGCCTGCGCGTCTATCATCGAACCATGCACCCTCACTACCGTCGCCGATAGCAATAA
GCCCATTCCCGCTACAACTGCAACGCTTCGTCCTCAAGCAATATCCGAGAAAGCGATTTCAGTCTCATCGGCGACGGTCGCTGCCGTTTCACCTGCTTCTCTTGTTTTTG
CTGCTCTTCATATGCGATTTCGATTCGTGCTCTATAATGATTGAGAGGAAAGAGAAGCGAAAGAAAGGGACAATTAGTAAGGAAGATAGTTCCACTCTATTGGAAAGATA
TTCAGTTAGGACGATACTGACATTGCTTCAGGAGGTGTTCCGAGGTTCGGAAGGGCGAATTGATTGGGACAAGTTGGTGAAGAATACGTCGACTGGGATTTCTAATGCTC
GGGAATATCAGTTGTTATGGCGGCATTTGGCTTATCGTCACACGTTGCTAGAGAACATGGATTATGTTACTGATCCAGTGGATGATGATAGTGACTTAGAGTTTGAAATA
GAACCTGTTCCATCTGTCAGCAGTGAGTCCTCGAATGAAGCTGCAGCATGTGTGAAGGTATTGATTGCTAATGGTATACCGAGTGAGTCAGATATTCCAAGCAGTTCTGC
AGTTGAGGCCCCATTGACTATAGGTATATCCAATAGTCAATCATCTAGAGCCAATCTTGAAAATCCTCAATCTGCTTGTTTGATGCAAGGGATGTATGTTACAATTCCAA
TTTCCATTCAGAGACAGCCAATTCCAACACCATCAGCAACTGAAGTATTTGATGTGAATGGAGCAGCTGGTAGTAATGCAGCTTCTCGAAAAAGAAGAAAACCTTGGACA
AAGGCAGAGGATTTGGAATTGATTGCTGCTGTGGAAAAGTATGGTGAAGGTAACTGGGCGAATATCTTGAAAGAAGACTTCAAGGGGGATAGAACTGCTTCACAGCTATC
TCAGAGGTGGTCCATTATTAGGAAGCGACGTGGTAATTTGAATATGGGAGCTAACACCACAAGTACTCAGATATCCAAAGCTCAGATTGATGCTGCACACCGTGCATTGT
CCCTTGCCCTTGATTTGCCTGTGAATAACTCAAAAACAGCAAATTCAAATATAAACAATAGCAATTTCTCTTCTTCAAGTGGTGTTGAAGCTCCATTTCAAATGCAGAAT
CAGTCTCTACAGATTCCCATGCCTTCAAGGCCGGTGCTGGTAGAGCCTTCACCTTCAGTAGCAAAATCTGGAATTAACACTTCCAAGAACTCGTTGACGATGAAGTCTAC
TCACAATTCTGATTCTATAGTTAGAGCAACTGCAGTAGCTGCGGGGGCACGGATTGTTTCTCCATCAGATGCTGCATCCCTACTGAAGGCTGCACAGACAAAAAATGCCA
TCCACATAAAGTCCAAAGGCACTTTGATCAAATCACCTGTGCTTGGTAATGCACCGATGCGCTCGGACGCACGCCCCAGTGTACATTATATTTCCACAGGAAAAACAGCA
ACTCCAGGCTCAAATTTTGTGGGTGGTAAATCTACCATGGTTGGTAATAACCCAATGAAAGCTGTCTCACCAAAAGTTCTGCACAATCGTTCTACTGCTCTTTTGAAAAA
TGCACCATCAGACCCAATAAGCCCAGCAACTGAATCTCCATCGAAGCAAGAGGTTCAGAGTTCAGAAGAATGCAAAATTTCCGAGCCAATTGTTACAGCGAAAAAAGAGT
CTCGAGAAGATGAAGCTGTTATAAGAGACATCTCTGTTGCTTCACAAAGATCAGATGGGGAATTGGGAAGACTTTCAACTTGCATTGAGACTCACAATACTTCTTTGAAT
ATGGAGATAGATGAAAATGATCTTAAAGCAGGATGTTCCAAGCAGGTCGAAATGAAAACGAAGGCAAATGATGTCGAGATTAGGGGATGA
Protein sequenceShow/hide protein sequence
MVSTGRLVPSAGDALLALTGRVVPPALLVQVLHYKRSTDRADEAKDLVAVSPTSLPARLSSNHAPSLPSPIAISPFPLQLQRFVLKQYPRKRFQSHRRRSLPFHLLLLFL
LLFICDFDSCSIMIERKEKRKKGTISKEDSSTLLERYSVRTILTLLQEVFRGSEGRIDWDKLVKNTSTGISNAREYQLLWRHLAYRHTLLENMDYVTDPVDDDSDLEFEI
EPVPSVSSESSNEAAACVKVLIANGIPSESDIPSSSAVEAPLTIGISNSQSSRANLENPQSACLMQGMYVTIPISIQRQPIPTPSATEVFDVNGAAGSNAASRKRRKPWT
KAEDLELIAAVEKYGEGNWANILKEDFKGDRTASQLSQRWSIIRKRRGNLNMGANTTSTQISKAQIDAAHRALSLALDLPVNNSKTANSNINNSNFSSSSGVEAPFQMQN
QSLQIPMPSRPVLVEPSPSVAKSGINTSKNSLTMKSTHNSDSIVRATAVAAGARIVSPSDAASLLKAAQTKNAIHIKSKGTLIKSPVLGNAPMRSDARPSVHYISTGKTA
TPGSNFVGGKSTMVGNNPMKAVSPKVLHNRSTALLKNAPSDPISPATESPSKQEVQSSEECKISEPIVTAKKESREDEAVIRDISVASQRSDGELGRLSTCIETHNTSLN
MEIDENDLKAGCSKQVEMKTKANDVEIRG