; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10007131 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10007131
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Descriptioninosine-uridine preferring nucleoside hydrolase family protein
Genome locationChr10:1594592..1624130
RNA-Seq ExpressionHG10007131
SyntenyHG10007131
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
GO:0017025 - TBP-class protein binding (molecular function)
InterPro domainsIPR001910 - Inosine/uridine-preferring nucleoside hydrolase domain
IPR011665 - Brf1, TBP-binding domain
IPR013150 - Transcription factor TFIIB, cyclin-like domain
IPR013763 - Cyclin-like
IPR036452 - Ribonucleoside hydrolase-like
IPR036915 - Cyclin-like superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008450713.1 PREDICTED: uncharacterized protein LOC103492210 isoform X1 [Cucumis melo]0.0e+0086.66Show/hide
Query:  MVWLANSSS-HSPMRILVDTDVDTDDIFALFYLLKQPSSLFHLQAITINGNGWSDAGHAVNHLYDMLFMMGRDDIPVGVGGDGGISHNATIFPHVGGYLP
        MVWLANSSS +SP RILVDTD D DDIFALFYLLKQPSSLFHLQ ITINGNGWSDAGHAVNHLYDMLFMMGRDDIPVGVGGDGGIS +ATI P++GGYLP
Subjt:  MVWLANSSS-HSPMRILVDTDVDTDDIFALFYLLKQPSSLFHLQAITINGNGWSDAGHAVNHLYDMLFMMGRDDIPVGVGGDGGISHNATIFPHVGGYLP

Query:  LIDQGVSTAGQCRYRQAIPVGEKGRLYANTNFGLRKPFLPQGKRRYIPMKQPTAQQVMKDAISAGPTTVFLMGAHTNLAIFLLSNPHLKKNIKHIYAMGG
        LIDQGVSTAGQCRYRQAIPVG  GRL ANTNFGLRK FLPQGKRRYIPMKQPTAQQVMKDAISAGPT VFLMGAHTNLAIFLLSNPHLKKNIKH+YAMGG
Subjt:  LIDQGVSTAGQCRYRQAIPVGEKGRLYANTNFGLRKPFLPQGKRRYIPMKQPTAQQVMKDAISAGPTTVFLMGAHTNLAIFLLSNPHLKKNIKHIYAMGG

Query:  AIREICSESADKSHGKTCNNIGNLWPPNTNPYAEFNIFGDPFAAYTVLHSGIPVTLVPLDATSTIPVNKKVFLAFEQRQNTYEAKYCFQSLKMARDTWTG
        AIREICSES   SHGKTC+NIGNLWPPNTNPYAEFNIFGDPFAAYTVLHSGIPVTLVPLDATSTIPVNK+VFLAFEQRQNTYEAKYCFQSLKMARDTW  
Subjt:  AIREICSESADKSHGKTCNNIGNLWPPNTNPYAEFNIFGDPFAAYTVLHSGIPVTLVPLDATSTIPVNKKVFLAFEQRQNTYEAKYCFQSLKMARDTWTG

Query:  NGFFEIYSMWDSFMVGVALSQMYNLDRGG--GNNAYSKMEYLNITIVTSNKPYGISDGSNPLVDGHLLPKFGVQKNGVHSGHVQTGMLDPFCVVPTEIGK
        +GFFE+YSMWDSFMVGVALSQMYNL RGG  G NA+SKMEYLN+TIVTSN+PYGISDGSNP V+G LL  FG QKNGVHSGHVQTGMLDPFC+  T  GK
Subjt:  NGFFEIYSMWDSFMVGVALSQMYNLDRGG--GNNAYSKMEYLNITIVTSNKPYGISDGSNPLVDGHLLPKFGVQKNGVHSGHVQTGMLDPFCVVPTEIGK

Query:  CQDGYTKEADGPESVQVLVAVEAKSTIDANSSIDKAFYISFLDVLNSPRQTGRFDFRAQFPNYREVLYRPKFGKKLLGKPVIFDMDMSTGDFLTLLYLLK
        CQDGYTKEADG ESVQVLVAVEAKSTID NSSIDKAFYISFLDVLNSPRQTGRFDFRAQFP YREVLYRP FGK+LLGKPVIFDMDMSTGDFLTLLYLLK
Subjt:  CQDGYTKEADGPESVQVLVAVEAKSTIDANSSIDKAFYISFLDVLNSPRQTGRFDFRAQFPNYREVLYRPKFGKKLLGKPVIFDMDMSTGDFLTLLYLLK

Query:  TPIEIINLKGIIISPNGWATAATIDVVYDVLHMMGRDDISVGLGDVFAIGEAHPSFPPIGDCK-------------------------------YSAESS
        TPIEIINLKGIIISPNGWATAATIDVVYDVLHMMGRDDISVGLGD+FAIGE HP FPPIGDCK                               Y+AE+S
Subjt:  TPIEIINLKGIIISPNGWATAATIDVVYDVLHMMGRDDISVGLGDVFAIGEAHPSFPPIGDCK-------------------------------YSAESS

Query:  VKYGAFRDTDHPELGQMSALDVWKDVVHSLDLEAKITVLTSGPLTNLAQIIHHKAVSSRIQEVYITGGHINYSVDKGNVFTIPSNEYSEFNFFLDPTAAD
        VK+GAFRDTDHPEL QMSALDVWKDVV +LDL+AKITVLTSGPLTNLA+IIHHKA+S+RI+EVYITGGHI+Y VDKGN+FTIPSNEYSEFNFFLDP AAD
Subjt:  VKYGAFRDTDHPELGQMSALDVWKDVVHSLDLEAKITVLTSGPLTNLAQIIHHKAVSSRIQEVYITGGHINYSVDKGNVFTIPSNEYSEFNFFLDPTAAD

Query:  LVLGSGLNITLIPLNVQRRVSSFHKILKKLELGNRTPEAWFSRRLLSRLYHLKQKHHQYHHVDMFLGEVLGVVSLAGKHLNLKQTFSFKPLKVVTNGGES
        LV GSGLNITLIPLNVQRRVSSF+KILKKL+  NRTPEAWFSRRLL RLY LKQKHHQYHHVDMFLGEV+G VSLAGKHLNLKQTFSFKPLKV++NGGES
Subjt:  LVLGSGLNITLIPLNVQRRVSSFHKILKKLELGNRTPEAWFSRRLLSRLYHLKQKHHQYHHVDMFLGEVLGVVSLAGKHLNLKQTFSFKPLKVVTNGGES

Query:  KVGQTIIDDKKGKWVRVLESVEPLAFYEDLANALADEKQTAVIGSFE
        KVGQTIID KKGKWVRVLES+EPLA YEDLANALADEKQTAVI SFE
Subjt:  KVGQTIIDDKKGKWVRVLESVEPLAFYEDLANALADEKQTAVIGSFE

XP_011659920.1 uncharacterized protein LOC101212769 isoform X1 [Cucumis sativus]0.0e+0087.93Show/hide
Query:  MVWLANSSS-HSPMRILVDTDVDTDDIFALFYLLKQPSSLFHLQAITINGNGWSDAGHAVNHLYDMLFMMGRDDIPVGVGGDGGISHNATIFPHVGGYLP
        +VWLANSSS +SPMRILVDTDVDTDDIFAL YLLKQPSSLFHLQ ITINGNGWSDAGHAVNHLYDMLFMMGRDDIPVGVGGDGGIS NATI  ++GGYLP
Subjt:  MVWLANSSS-HSPMRILVDTDVDTDDIFALFYLLKQPSSLFHLQAITINGNGWSDAGHAVNHLYDMLFMMGRDDIPVGVGGDGGISHNATIFPHVGGYLP

Query:  LIDQGVSTAGQCRYRQAIPVGEKGRLYANTNFGLRKPFLPQGKRRYIPMKQPTAQQVMKDAISAGPTTVFLMGAHTNLAIFLLSNPHLKKNIKHIYAMGG
        LIDQGVSTAGQCRYRQAIPVG  GRL ANTNFGLRK FLPQGKRRYIPMKQPTAQQVMKDAISAGPT VFLMGAHTNLAIFLLSNPHLKKNIKH+YAMGG
Subjt:  LIDQGVSTAGQCRYRQAIPVGEKGRLYANTNFGLRKPFLPQGKRRYIPMKQPTAQQVMKDAISAGPTTVFLMGAHTNLAIFLLSNPHLKKNIKHIYAMGG

Query:  AIREICSESADKSHGKTCNNIGNLWPPNTNPYAEFNIFGDPFAAYTVLHSGIPVTLVPLDATSTIPVNKKVFLAFEQRQNTYEAKYCFQSLKMARDTWTG
        AIREICSESADKSHGKTCNNIGNLWPPNTNPYAEFNIFGDPFAAYTVLHSGIPVTLVPLDATSTIPVNK+VFLAFEQRQNTYEAKYCFQSLKMA DTW  
Subjt:  AIREICSESADKSHGKTCNNIGNLWPPNTNPYAEFNIFGDPFAAYTVLHSGIPVTLVPLDATSTIPVNKKVFLAFEQRQNTYEAKYCFQSLKMARDTWTG

Query:  NGFFEIYSMWDSFMVGVALSQMYNLDRGGGNNAYSKMEYLNITIVTSNKPYGISDGSNPLVDGHLLPKFGVQKNGVHSGHVQTGMLDPFCVVPTEIGKCQ
        +GFFEIYSMWDSFMVGVALSQMYNL RGGGNNA+SKMEYLNITIVTSNKPYGISDGSNPLVDGHLLP  G Q NGVHSGHVQTGMLDPFC+  T  GKCQ
Subjt:  NGFFEIYSMWDSFMVGVALSQMYNLDRGGGNNAYSKMEYLNITIVTSNKPYGISDGSNPLVDGHLLPKFGVQKNGVHSGHVQTGMLDPFCVVPTEIGKCQ

Query:  DGYTKEADGPESVQVLVAVEAKSTIDANSSIDKAFYISFLDVLNSPRQTGRFDFRAQFPNYREVLYRPKFGKKLLGKPVIFDMDMSTGDFLTLLYLLKTP
        DGYTKE+DG ESVQVLVAVEAKSTID NSSIDKAFYISFLDVLNSPRQTGRFDFRAQFPNYREVLYRPKFGK+LLGKPVIFDMDMSTGDFLTLLYLLKTP
Subjt:  DGYTKEADGPESVQVLVAVEAKSTIDANSSIDKAFYISFLDVLNSPRQTGRFDFRAQFPNYREVLYRPKFGKKLLGKPVIFDMDMSTGDFLTLLYLLKTP

Query:  IEIINLKGIIISPNGWATAATIDVVYDVLHMMGRDDISVGLGDVFAIGEAHPSFPPIGDCK-------------------------------YSAESSVK
        IEIINLKGIIISPNGWATAATIDVVYDVLHMMGRDDISVGLGDVFAIGEAHP +PPIGDCK                               Y+AE+SVK
Subjt:  IEIINLKGIIISPNGWATAATIDVVYDVLHMMGRDDISVGLGDVFAIGEAHPSFPPIGDCK-------------------------------YSAESSVK

Query:  YGAFRDTDHPELGQMSALDVWKDVVHSLDLEAKITVLTSGPLTNLAQIIHHKAVSSRIQEVYITGGHINYSVDKGNVFTIPSNEYSEFNFFLDPTAADLV
        +GAFRDTDHPEL QMS LDVWKDVV SL+L+AKITVLT+GPLTNLA+II HKA+S+RI+EVYITGGH+NY VDKGN+FTIPSNEYSEFNFFLDP AADLV
Subjt:  YGAFRDTDHPELGQMSALDVWKDVVHSLDLEAKITVLTSGPLTNLAQIIHHKAVSSRIQEVYITGGHINYSVDKGNVFTIPSNEYSEFNFFLDPTAADLV

Query:  LGSGLNITLIPLNVQRRVSSFHKILKKLELGNRTPEAWFSRRLLSRLYHLKQKHHQYHHVDMFLGEVLGVVSLAGKHLNLKQTFSFKPLKVVTNGGESKV
          SGLNITLIPLNVQRRVSSFHKIL+KL+L NRTPEAW SRRLL RLY LKQKHHQYHHVDMFLGEVLG VSLAGKHLNLKQTFSFKPLKV++NGGESKV
Subjt:  LGSGLNITLIPLNVQRRVSSFHKILKKLELGNRTPEAWFSRRLLSRLYHLKQKHHQYHHVDMFLGEVLGVVSLAGKHLNLKQTFSFKPLKVVTNGGESKV

Query:  GQTIIDDKKGKWVRVLESVEPLAFYEDLANALADEKQTAVIGSFE
        GQTIID+KKGKWVRVLES+EPLAFYED+ANALADEKQTAVI SFE
Subjt:  GQTIIDDKKGKWVRVLESVEPLAFYEDLANALADEKQTAVIGSFE

XP_016900935.1 PREDICTED: uncharacterized protein LOC103492210 isoform X2 [Cucumis melo]0.0e+0085.82Show/hide
Query:  MVWLANSSSHSPMRILVDTDVDTDDIFALFYLLKQPSSLFHLQAITINGNGWSDAGHAVNHLYDMLFMMGRDDIPVGVGGDGGISHNATIFPHVGGYLPL
        MVWL N   +S MRILVDTDVDTDD+  L YLLKQ  SLFHLQ ITINGNGWSDAGHAVNHLYDMLFMMGRDDIPVGVGGDGGIS +ATI P++GGYLPL
Subjt:  MVWLANSSSHSPMRILVDTDVDTDDIFALFYLLKQPSSLFHLQAITINGNGWSDAGHAVNHLYDMLFMMGRDDIPVGVGGDGGISHNATIFPHVGGYLPL

Query:  IDQGVSTAGQCRYRQAIPVGEKGRLYANTNFGLRKPFLPQGKRRYIPMKQPTAQQVMKDAISAGPTTVFLMGAHTNLAIFLLSNPHLKKNIKHIYAMGGA
        IDQGVSTAGQCRYRQAIPVG  GRL ANTNFGLRK FLPQGKRRYIPMKQPTAQQVMKDAISAGPT VFLMGAHTNLAIFLLSNPHLKKNIKH+YAMGGA
Subjt:  IDQGVSTAGQCRYRQAIPVGEKGRLYANTNFGLRKPFLPQGKRRYIPMKQPTAQQVMKDAISAGPTTVFLMGAHTNLAIFLLSNPHLKKNIKHIYAMGGA

Query:  IREICSESADKSHGKTCNNIGNLWPPNTNPYAEFNIFGDPFAAYTVLHSGIPVTLVPLDATSTIPVNKKVFLAFEQRQNTYEAKYCFQSLKMARDTWTGN
        IREICSES   SHGKTC+NIGNLWPPNTNPYAEFNIFGDPFAAYTVLHSGIPVTLVPLDATSTIPVNK+VFLAFEQRQNTYEAKYCFQSLKMARDTW  +
Subjt:  IREICSESADKSHGKTCNNIGNLWPPNTNPYAEFNIFGDPFAAYTVLHSGIPVTLVPLDATSTIPVNKKVFLAFEQRQNTYEAKYCFQSLKMARDTWTGN

Query:  GFFEIYSMWDSFMVGVALSQMYNLDRGG--GNNAYSKMEYLNITIVTSNKPYGISDGSNPLVDGHLLPKFGVQKNGVHSGHVQTGMLDPFCVVPTEIGKC
        GFFE+YSMWDSFMVGVALSQMYNL RGG  G NA+SKMEYLN+TIVTSN+PYGISDGSNP V+G LL  FG QKNGVHSGHVQTGMLDPFC+  T  GKC
Subjt:  GFFEIYSMWDSFMVGVALSQMYNLDRGG--GNNAYSKMEYLNITIVTSNKPYGISDGSNPLVDGHLLPKFGVQKNGVHSGHVQTGMLDPFCVVPTEIGKC

Query:  QDGYTKEADGPESVQVLVAVEAKSTIDANSSIDKAFYISFLDVLNSPRQTGRFDFRAQFPNYREVLYRPKFGKKLLGKPVIFDMDMSTGDFLTLLYLLKT
        QDGYTKEADG ESVQVLVAVEAKSTID NSSIDKAFYISFLDVLNSPRQTGRFDFRAQFP YREVLYRP FGK+LLGKPVIFDMDMSTGDFLTLLYLLKT
Subjt:  QDGYTKEADGPESVQVLVAVEAKSTIDANSSIDKAFYISFLDVLNSPRQTGRFDFRAQFPNYREVLYRPKFGKKLLGKPVIFDMDMSTGDFLTLLYLLKT

Query:  PIEIINLKGIIISPNGWATAATIDVVYDVLHMMGRDDISVGLGDVFAIGEAHPSFPPIGDCK-------------------------------YSAESSV
        PIEIINLKGIIISPNGWATAATIDVVYDVLHMMGRDDISVGLGD+FAIGE HP FPPIGDCK                               Y+AE+SV
Subjt:  PIEIINLKGIIISPNGWATAATIDVVYDVLHMMGRDDISVGLGDVFAIGEAHPSFPPIGDCK-------------------------------YSAESSV

Query:  KYGAFRDTDHPELGQMSALDVWKDVVHSLDLEAKITVLTSGPLTNLAQIIHHKAVSSRIQEVYITGGHINYSVDKGNVFTIPSNEYSEFNFFLDPTAADL
        K+GAFRDTDHPEL QMSALDVWKDVV +LDL+AKITVLTSGPLTNLA+IIHHKA+S+RI+EVYITGGHI+Y VDKGN+FTIPSNEYSEFNFFLDP AADL
Subjt:  KYGAFRDTDHPELGQMSALDVWKDVVHSLDLEAKITVLTSGPLTNLAQIIHHKAVSSRIQEVYITGGHINYSVDKGNVFTIPSNEYSEFNFFLDPTAADL

Query:  VLGSGLNITLIPLNVQRRVSSFHKILKKLELGNRTPEAWFSRRLLSRLYHLKQKHHQYHHVDMFLGEVLGVVSLAGKHLNLKQTFSFKPLKVVTNGGESK
        V GSGLNITLIPLNVQRRVSSF+KILKKL+  NRTPEAWFSRRLL RLY LKQKHHQYHHVDMFLGEV+G VSLAGKHLNLKQTFSFKPLKV++NGGESK
Subjt:  VLGSGLNITLIPLNVQRRVSSFHKILKKLELGNRTPEAWFSRRLLSRLYHLKQKHHQYHHVDMFLGEVLGVVSLAGKHLNLKQTFSFKPLKVVTNGGESK

Query:  VGQTIIDDKKGKWVRVLESVEPLAFYEDLANALADEKQTAVIGSFE
        VGQTIID KKGKWVRVLES+EPLA YEDLANALADEKQTAVI SFE
Subjt:  VGQTIIDDKKGKWVRVLESVEPLAFYEDLANALADEKQTAVIGSFE

XP_016900936.1 PREDICTED: uncharacterized protein LOC103492210 isoform X3 [Cucumis melo]0.0e+0085.82Show/hide
Query:  MVWLANSSSHSPMRILVDTDVDTDDIFALFYLLKQPSSLFHLQAITINGNGWSDAGHAVNHLYDMLFMMGRDDIPVGVGGDGGISHNATIFPHVGGYLPL
        MVWL N   +S MRILVDTDVDTDD+  L YLLKQ  SLFHLQ ITINGNGWSDAGHAVNHLYDMLFMMGRDDIPVGVGGDGGIS +ATI P++GGYLPL
Subjt:  MVWLANSSSHSPMRILVDTDVDTDDIFALFYLLKQPSSLFHLQAITINGNGWSDAGHAVNHLYDMLFMMGRDDIPVGVGGDGGISHNATIFPHVGGYLPL

Query:  IDQGVSTAGQCRYRQAIPVGEKGRLYANTNFGLRKPFLPQGKRRYIPMKQPTAQQVMKDAISAGPTTVFLMGAHTNLAIFLLSNPHLKKNIKHIYAMGGA
        IDQGVSTAGQCRYRQAIPVG  GRL ANTNFGLRK FLPQGKRRYIPMKQPTAQQVMKDAISAGPT VFLMGAHTNLAIFLLSNPHLKKNIKH+YAMGGA
Subjt:  IDQGVSTAGQCRYRQAIPVGEKGRLYANTNFGLRKPFLPQGKRRYIPMKQPTAQQVMKDAISAGPTTVFLMGAHTNLAIFLLSNPHLKKNIKHIYAMGGA

Query:  IREICSESADKSHGKTCNNIGNLWPPNTNPYAEFNIFGDPFAAYTVLHSGIPVTLVPLDATSTIPVNKKVFLAFEQRQNTYEAKYCFQSLKMARDTWTGN
        IREICSES   SHGKTC+NIGNLWPPNTNPYAEFNIFGDPFAAYTVLHSGIPVTLVPLDATSTIPVNK+VFLAFEQRQNTYEAKYCFQSLKMARDTW  +
Subjt:  IREICSESADKSHGKTCNNIGNLWPPNTNPYAEFNIFGDPFAAYTVLHSGIPVTLVPLDATSTIPVNKKVFLAFEQRQNTYEAKYCFQSLKMARDTWTGN

Query:  GFFEIYSMWDSFMVGVALSQMYNLDRGG--GNNAYSKMEYLNITIVTSNKPYGISDGSNPLVDGHLLPKFGVQKNGVHSGHVQTGMLDPFCVVPTEIGKC
        GFFE+YSMWDSFMVGVALSQMYNL RGG  G NA+SKMEYLN+TIVTSN+PYGISDGSNP V+G LL  FG QKNGVHSGHVQTGMLDPFC+  T  GKC
Subjt:  GFFEIYSMWDSFMVGVALSQMYNLDRGG--GNNAYSKMEYLNITIVTSNKPYGISDGSNPLVDGHLLPKFGVQKNGVHSGHVQTGMLDPFCVVPTEIGKC

Query:  QDGYTKEADGPESVQVLVAVEAKSTIDANSSIDKAFYISFLDVLNSPRQTGRFDFRAQFPNYREVLYRPKFGKKLLGKPVIFDMDMSTGDFLTLLYLLKT
        QDGYTKEADG ESVQVLVAVEAKSTID NSSIDKAFYISFLDVLNSPRQTGRFDFRAQFP YREVLYRP FGK+LLGKPVIFDMDMSTGDFLTLLYLLKT
Subjt:  QDGYTKEADGPESVQVLVAVEAKSTIDANSSIDKAFYISFLDVLNSPRQTGRFDFRAQFPNYREVLYRPKFGKKLLGKPVIFDMDMSTGDFLTLLYLLKT

Query:  PIEIINLKGIIISPNGWATAATIDVVYDVLHMMGRDDISVGLGDVFAIGEAHPSFPPIGDCK-------------------------------YSAESSV
        PIEIINLKGIIISPNGWATAATIDVVYDVLHMMGRDDISVGLGD+FAIGE HP FPPIGDCK                               Y+AE+SV
Subjt:  PIEIINLKGIIISPNGWATAATIDVVYDVLHMMGRDDISVGLGDVFAIGEAHPSFPPIGDCK-------------------------------YSAESSV

Query:  KYGAFRDTDHPELGQMSALDVWKDVVHSLDLEAKITVLTSGPLTNLAQIIHHKAVSSRIQEVYITGGHINYSVDKGNVFTIPSNEYSEFNFFLDPTAADL
        K+GAFRDTDHPEL QMSALDVWKDVV +LDL+AKITVLTSGPLTNLA+IIHHKA+S+RI+EVYITGGHI+Y VDKGN+FTIPSNEYSEFNFFLDP AADL
Subjt:  KYGAFRDTDHPELGQMSALDVWKDVVHSLDLEAKITVLTSGPLTNLAQIIHHKAVSSRIQEVYITGGHINYSVDKGNVFTIPSNEYSEFNFFLDPTAADL

Query:  VLGSGLNITLIPLNVQRRVSSFHKILKKLELGNRTPEAWFSRRLLSRLYHLKQKHHQYHHVDMFLGEVLGVVSLAGKHLNLKQTFSFKPLKVVTNGGESK
        V GSGLNITLIPLNVQRRVSSF+KILKKL+  NRTPEAWFSRRLL RLY LKQKHHQYHHVDMFLGEV+G VSLAGKHLNLKQTFSFKPLKV++NGGESK
Subjt:  VLGSGLNITLIPLNVQRRVSSFHKILKKLELGNRTPEAWFSRRLLSRLYHLKQKHHQYHHVDMFLGEVLGVVSLAGKHLNLKQTFSFKPLKVVTNGGESK

Query:  VGQTIIDDKKGKWVRVLESVEPLAFYEDLANALADEKQTAVIGSFE
        VGQTIID KKGKWVRVLES+EPLA YEDLANALADEKQTAVI SFE
Subjt:  VGQTIIDDKKGKWVRVLESVEPLAFYEDLANALADEKQTAVIGSFE

XP_038879905.1 uncharacterized protein LOC120071620 isoform X1 [Benincasa hispida]0.0e+0091.54Show/hide
Query:  MVWLANSSSHSPMRILVDTDVDTDDIFALFYLLKQPSSLFHLQAITINGNGWSDAGHAVNHLYDMLFMMGRDDIPVGVGGDGGISHNATIFPHVGGYLPL
        MVWLANSSS SPMRILVDTDVDTDDIFALFYLLKQPSSLFHLQAITINGNGWSDAGHAVNHLYDMLFMMGRDDIPVGVGG+GGIS NATI PHVGGYLPL
Subjt:  MVWLANSSSHSPMRILVDTDVDTDDIFALFYLLKQPSSLFHLQAITINGNGWSDAGHAVNHLYDMLFMMGRDDIPVGVGGDGGISHNATIFPHVGGYLPL

Query:  IDQGVSTAGQCRYRQAIPVGEKGRLYANTNFGLRKPFLPQGKRRYIPMKQPTAQQVMKDAISAGPTTVFLMGAHTNLAIFLLSNPHLKKNIKHIYAMGGA
        IDQGVSTAGQCRYRQAIPVGEKGRLYANTNFGLRKPFLPQGKRRYIPMKQPTAQ VMKDA+SAGPTTVFLMGAHTNLAIFLLSNPHLKKNIKHIYAMGGA
Subjt:  IDQGVSTAGQCRYRQAIPVGEKGRLYANTNFGLRKPFLPQGKRRYIPMKQPTAQQVMKDAISAGPTTVFLMGAHTNLAIFLLSNPHLKKNIKHIYAMGGA

Query:  IREICSESADKSHGKTCNNIGNLWPPNTNPYAEFNIFGDPFAAYTVLHSGIPVTLVPLDATSTIPVNKKVFLAFEQRQNTYEAKYCFQSLKMARDTWTGN
        IREICSE+ADKSHGKTCNNIGNLWPPNTNPYAEFNIFGDPFAAYTVLHSGIPVTLVPLDATSTIPVN+KVFL FEQRQNTYEAKYCFQSLKMARDTWTGN
Subjt:  IREICSESADKSHGKTCNNIGNLWPPNTNPYAEFNIFGDPFAAYTVLHSGIPVTLVPLDATSTIPVNKKVFLAFEQRQNTYEAKYCFQSLKMARDTWTGN

Query:  GFFEIYSMWDSFMVGVALSQMYNLDRGGGNNAYSKMEYLNITIVTSNKPYGISDGSNPLVDGHLLPKFGVQKNGVHSGHVQTGMLDPFCVVPTEIGKCQD
        GFFEIYSMWDSFMVGVALSQMYN DRG GNNA+SKMEYLNITIVTSNKPYGISDGSNPLVDGHLLPKFGVQKNGVHSGHVQTGMLDPFC+V TEIGKCQD
Subjt:  GFFEIYSMWDSFMVGVALSQMYNLDRGGGNNAYSKMEYLNITIVTSNKPYGISDGSNPLVDGHLLPKFGVQKNGVHSGHVQTGMLDPFCVVPTEIGKCQD

Query:  GYTKEADGPESVQVLVAVEAKSTIDANSSIDKAFYISFLDVLNSPRQTGRFDFRAQFPNYREVLYRPKFGKKLLGKPVIFDMDMSTGDFLTLLYLLKTPI
        GYTKEADGPESVQVLVAVEAKSTID NSSIDKAFY SFLDVLNSPRQTGRFDFRAQFPNYREVLYRPKFGKKLLGKPVIFDMDMSTGDFLTLLYLLKTPI
Subjt:  GYTKEADGPESVQVLVAVEAKSTIDANSSIDKAFYISFLDVLNSPRQTGRFDFRAQFPNYREVLYRPKFGKKLLGKPVIFDMDMSTGDFLTLLYLLKTPI

Query:  EIINLKGIIISPNGWATAATIDVVYDVLHMMGRDDISVGLGDVFAIGEAHPSFPPIGDCK-------------------------------YSAESSVKY
        EIINLKGIIISPNGWAT ATIDVVYDVLHMMGRDDISVGLGDVFAIGEAHPSFPPIGDCK                               Y+AESSVK+
Subjt:  EIINLKGIIISPNGWATAATIDVVYDVLHMMGRDDISVGLGDVFAIGEAHPSFPPIGDCK-------------------------------YSAESSVKY

Query:  GAFRDTDHPELGQMSALDVWKDVVHSLDLEAKITVLTSGPLTNLAQIIHHKAVSSRIQEVYITGGHINYSVDKGNVFTIPSNEYSEFNFFLDPTAADLVL
        G FRDTDHPEL QMSALDVWKDVV SLDLE KITVLT+GPLTNLAQI+HHKA+S+RIQEVYI+GG+INY VDKGNVFTIPSNE+SEFNFFLDPTAADLVL
Subjt:  GAFRDTDHPELGQMSALDVWKDVVHSLDLEAKITVLTSGPLTNLAQIIHHKAVSSRIQEVYITGGHINYSVDKGNVFTIPSNEYSEFNFFLDPTAADLVL

Query:  GSGLNITLIPLNVQRRVSSFHKILKKLELGNRTPEAWFSRRLLSRLYHLKQKHHQYHHVDMFLGEVLGVVSLAGKHLNLKQTFSFKPLKVVTNGGESKVG
        GSGLNITLIPLNVQR+VSSFHKILKKL+LGNRTPEA FSRRLLSRLYHLKQKHHQYHHVDMFLGEVLG VSLAGKH+NLKQTFS KPLKVVTNGGESKVG
Subjt:  GSGLNITLIPLNVQRRVSSFHKILKKLELGNRTPEAWFSRRLLSRLYHLKQKHHQYHHVDMFLGEVLGVVSLAGKHLNLKQTFSFKPLKVVTNGGESKVG

Query:  QTIIDDKKGKWVRVLESVEPLAFYEDLANALADEKQTAVIGSFEGQKRQWT
        QTIID+KKGKWVRVLESVEPLAFYEDLANALADEKQ+AVIGSFEGQKR W+
Subjt:  QTIIDDKKGKWVRVLESVEPLAFYEDLANALADEKQTAVIGSFEGQKRQWT

TrEMBL top hitse value%identityAlignment
A0A0A0LWJ1 Uncharacterized protein0.0e+0087.93Show/hide
Query:  MVWLANSSS-HSPMRILVDTDVDTDDIFALFYLLKQPSSLFHLQAITINGNGWSDAGHAVNHLYDMLFMMGRDDIPVGVGGDGGISHNATIFPHVGGYLP
        +VWLANSSS +SPMRILVDTDVDTDDIFAL YLLKQPSSLFHLQ ITINGNGWSDAGHAVNHLYDMLFMMGRDDIPVGVGGDGGIS NATI  ++GGYLP
Subjt:  MVWLANSSS-HSPMRILVDTDVDTDDIFALFYLLKQPSSLFHLQAITINGNGWSDAGHAVNHLYDMLFMMGRDDIPVGVGGDGGISHNATIFPHVGGYLP

Query:  LIDQGVSTAGQCRYRQAIPVGEKGRLYANTNFGLRKPFLPQGKRRYIPMKQPTAQQVMKDAISAGPTTVFLMGAHTNLAIFLLSNPHLKKNIKHIYAMGG
        LIDQGVSTAGQCRYRQAIPVG  GRL ANTNFGLRK FLPQGKRRYIPMKQPTAQQVMKDAISAGPT VFLMGAHTNLAIFLLSNPHLKKNIKH+YAMGG
Subjt:  LIDQGVSTAGQCRYRQAIPVGEKGRLYANTNFGLRKPFLPQGKRRYIPMKQPTAQQVMKDAISAGPTTVFLMGAHTNLAIFLLSNPHLKKNIKHIYAMGG

Query:  AIREICSESADKSHGKTCNNIGNLWPPNTNPYAEFNIFGDPFAAYTVLHSGIPVTLVPLDATSTIPVNKKVFLAFEQRQNTYEAKYCFQSLKMARDTWTG
        AIREICSESADKSHGKTCNNIGNLWPPNTNPYAEFNIFGDPFAAYTVLHSGIPVTLVPLDATSTIPVNK+VFLAFEQRQNTYEAKYCFQSLKMA DTW  
Subjt:  AIREICSESADKSHGKTCNNIGNLWPPNTNPYAEFNIFGDPFAAYTVLHSGIPVTLVPLDATSTIPVNKKVFLAFEQRQNTYEAKYCFQSLKMARDTWTG

Query:  NGFFEIYSMWDSFMVGVALSQMYNLDRGGGNNAYSKMEYLNITIVTSNKPYGISDGSNPLVDGHLLPKFGVQKNGVHSGHVQTGMLDPFCVVPTEIGKCQ
        +GFFEIYSMWDSFMVGVALSQMYNL RGGGNNA+SKMEYLNITIVTSNKPYGISDGSNPLVDGHLLP  G Q NGVHSGHVQTGMLDPFC+  T  GKCQ
Subjt:  NGFFEIYSMWDSFMVGVALSQMYNLDRGGGNNAYSKMEYLNITIVTSNKPYGISDGSNPLVDGHLLPKFGVQKNGVHSGHVQTGMLDPFCVVPTEIGKCQ

Query:  DGYTKEADGPESVQVLVAVEAKSTIDANSSIDKAFYISFLDVLNSPRQTGRFDFRAQFPNYREVLYRPKFGKKLLGKPVIFDMDMSTGDFLTLLYLLKTP
        DGYTKE+DG ESVQVLVAVEAKSTID NSSIDKAFYISFLDVLNSPRQTGRFDFRAQFPNYREVLYRPKFGK+LLGKPVIFDMDMSTGDFLTLLYLLKTP
Subjt:  DGYTKEADGPESVQVLVAVEAKSTIDANSSIDKAFYISFLDVLNSPRQTGRFDFRAQFPNYREVLYRPKFGKKLLGKPVIFDMDMSTGDFLTLLYLLKTP

Query:  IEIINLKGIIISPNGWATAATIDVVYDVLHMMGRDDISVGLGDVFAIGEAHPSFPPIGDCK-------------------------------YSAESSVK
        IEIINLKGIIISPNGWATAATIDVVYDVLHMMGRDDISVGLGDVFAIGEAHP +PPIGDCK                               Y+AE+SVK
Subjt:  IEIINLKGIIISPNGWATAATIDVVYDVLHMMGRDDISVGLGDVFAIGEAHPSFPPIGDCK-------------------------------YSAESSVK

Query:  YGAFRDTDHPELGQMSALDVWKDVVHSLDLEAKITVLTSGPLTNLAQIIHHKAVSSRIQEVYITGGHINYSVDKGNVFTIPSNEYSEFNFFLDPTAADLV
        +GAFRDTDHPEL QMS LDVWKDVV SL+L+AKITVLT+GPLTNLA+II HKA+S+RI+EVYITGGH+NY VDKGN+FTIPSNEYSEFNFFLDP AADLV
Subjt:  YGAFRDTDHPELGQMSALDVWKDVVHSLDLEAKITVLTSGPLTNLAQIIHHKAVSSRIQEVYITGGHINYSVDKGNVFTIPSNEYSEFNFFLDPTAADLV

Query:  LGSGLNITLIPLNVQRRVSSFHKILKKLELGNRTPEAWFSRRLLSRLYHLKQKHHQYHHVDMFLGEVLGVVSLAGKHLNLKQTFSFKPLKVVTNGGESKV
          SGLNITLIPLNVQRRVSSFHKIL+KL+L NRTPEAW SRRLL RLY LKQKHHQYHHVDMFLGEVLG VSLAGKHLNLKQTFSFKPLKV++NGGESKV
Subjt:  LGSGLNITLIPLNVQRRVSSFHKILKKLELGNRTPEAWFSRRLLSRLYHLKQKHHQYHHVDMFLGEVLGVVSLAGKHLNLKQTFSFKPLKVVTNGGESKV

Query:  GQTIIDDKKGKWVRVLESVEPLAFYEDLANALADEKQTAVIGSFE
        GQTIID+KKGKWVRVLES+EPLAFYED+ANALADEKQTAVI SFE
Subjt:  GQTIIDDKKGKWVRVLESVEPLAFYEDLANALADEKQTAVIGSFE

A0A1S3BP73 uncharacterized protein LOC103492210 isoform X10.0e+0086.66Show/hide
Query:  MVWLANSSS-HSPMRILVDTDVDTDDIFALFYLLKQPSSLFHLQAITINGNGWSDAGHAVNHLYDMLFMMGRDDIPVGVGGDGGISHNATIFPHVGGYLP
        MVWLANSSS +SP RILVDTD D DDIFALFYLLKQPSSLFHLQ ITINGNGWSDAGHAVNHLYDMLFMMGRDDIPVGVGGDGGIS +ATI P++GGYLP
Subjt:  MVWLANSSS-HSPMRILVDTDVDTDDIFALFYLLKQPSSLFHLQAITINGNGWSDAGHAVNHLYDMLFMMGRDDIPVGVGGDGGISHNATIFPHVGGYLP

Query:  LIDQGVSTAGQCRYRQAIPVGEKGRLYANTNFGLRKPFLPQGKRRYIPMKQPTAQQVMKDAISAGPTTVFLMGAHTNLAIFLLSNPHLKKNIKHIYAMGG
        LIDQGVSTAGQCRYRQAIPVG  GRL ANTNFGLRK FLPQGKRRYIPMKQPTAQQVMKDAISAGPT VFLMGAHTNLAIFLLSNPHLKKNIKH+YAMGG
Subjt:  LIDQGVSTAGQCRYRQAIPVGEKGRLYANTNFGLRKPFLPQGKRRYIPMKQPTAQQVMKDAISAGPTTVFLMGAHTNLAIFLLSNPHLKKNIKHIYAMGG

Query:  AIREICSESADKSHGKTCNNIGNLWPPNTNPYAEFNIFGDPFAAYTVLHSGIPVTLVPLDATSTIPVNKKVFLAFEQRQNTYEAKYCFQSLKMARDTWTG
        AIREICSES   SHGKTC+NIGNLWPPNTNPYAEFNIFGDPFAAYTVLHSGIPVTLVPLDATSTIPVNK+VFLAFEQRQNTYEAKYCFQSLKMARDTW  
Subjt:  AIREICSESADKSHGKTCNNIGNLWPPNTNPYAEFNIFGDPFAAYTVLHSGIPVTLVPLDATSTIPVNKKVFLAFEQRQNTYEAKYCFQSLKMARDTWTG

Query:  NGFFEIYSMWDSFMVGVALSQMYNLDRGG--GNNAYSKMEYLNITIVTSNKPYGISDGSNPLVDGHLLPKFGVQKNGVHSGHVQTGMLDPFCVVPTEIGK
        +GFFE+YSMWDSFMVGVALSQMYNL RGG  G NA+SKMEYLN+TIVTSN+PYGISDGSNP V+G LL  FG QKNGVHSGHVQTGMLDPFC+  T  GK
Subjt:  NGFFEIYSMWDSFMVGVALSQMYNLDRGG--GNNAYSKMEYLNITIVTSNKPYGISDGSNPLVDGHLLPKFGVQKNGVHSGHVQTGMLDPFCVVPTEIGK

Query:  CQDGYTKEADGPESVQVLVAVEAKSTIDANSSIDKAFYISFLDVLNSPRQTGRFDFRAQFPNYREVLYRPKFGKKLLGKPVIFDMDMSTGDFLTLLYLLK
        CQDGYTKEADG ESVQVLVAVEAKSTID NSSIDKAFYISFLDVLNSPRQTGRFDFRAQFP YREVLYRP FGK+LLGKPVIFDMDMSTGDFLTLLYLLK
Subjt:  CQDGYTKEADGPESVQVLVAVEAKSTIDANSSIDKAFYISFLDVLNSPRQTGRFDFRAQFPNYREVLYRPKFGKKLLGKPVIFDMDMSTGDFLTLLYLLK

Query:  TPIEIINLKGIIISPNGWATAATIDVVYDVLHMMGRDDISVGLGDVFAIGEAHPSFPPIGDCK-------------------------------YSAESS
        TPIEIINLKGIIISPNGWATAATIDVVYDVLHMMGRDDISVGLGD+FAIGE HP FPPIGDCK                               Y+AE+S
Subjt:  TPIEIINLKGIIISPNGWATAATIDVVYDVLHMMGRDDISVGLGDVFAIGEAHPSFPPIGDCK-------------------------------YSAESS

Query:  VKYGAFRDTDHPELGQMSALDVWKDVVHSLDLEAKITVLTSGPLTNLAQIIHHKAVSSRIQEVYITGGHINYSVDKGNVFTIPSNEYSEFNFFLDPTAAD
        VK+GAFRDTDHPEL QMSALDVWKDVV +LDL+AKITVLTSGPLTNLA+IIHHKA+S+RI+EVYITGGHI+Y VDKGN+FTIPSNEYSEFNFFLDP AAD
Subjt:  VKYGAFRDTDHPELGQMSALDVWKDVVHSLDLEAKITVLTSGPLTNLAQIIHHKAVSSRIQEVYITGGHINYSVDKGNVFTIPSNEYSEFNFFLDPTAAD

Query:  LVLGSGLNITLIPLNVQRRVSSFHKILKKLELGNRTPEAWFSRRLLSRLYHLKQKHHQYHHVDMFLGEVLGVVSLAGKHLNLKQTFSFKPLKVVTNGGES
        LV GSGLNITLIPLNVQRRVSSF+KILKKL+  NRTPEAWFSRRLL RLY LKQKHHQYHHVDMFLGEV+G VSLAGKHLNLKQTFSFKPLKV++NGGES
Subjt:  LVLGSGLNITLIPLNVQRRVSSFHKILKKLELGNRTPEAWFSRRLLSRLYHLKQKHHQYHHVDMFLGEVLGVVSLAGKHLNLKQTFSFKPLKVVTNGGES

Query:  KVGQTIIDDKKGKWVRVLESVEPLAFYEDLANALADEKQTAVIGSFE
        KVGQTIID KKGKWVRVLES+EPLA YEDLANALADEKQTAVI SFE
Subjt:  KVGQTIIDDKKGKWVRVLESVEPLAFYEDLANALADEKQTAVIGSFE

A0A1S4DY76 uncharacterized protein LOC103492210 isoform X30.0e+0085.82Show/hide
Query:  MVWLANSSSHSPMRILVDTDVDTDDIFALFYLLKQPSSLFHLQAITINGNGWSDAGHAVNHLYDMLFMMGRDDIPVGVGGDGGISHNATIFPHVGGYLPL
        MVWL N   +S MRILVDTDVDTDD+  L YLLKQ  SLFHLQ ITINGNGWSDAGHAVNHLYDMLFMMGRDDIPVGVGGDGGIS +ATI P++GGYLPL
Subjt:  MVWLANSSSHSPMRILVDTDVDTDDIFALFYLLKQPSSLFHLQAITINGNGWSDAGHAVNHLYDMLFMMGRDDIPVGVGGDGGISHNATIFPHVGGYLPL

Query:  IDQGVSTAGQCRYRQAIPVGEKGRLYANTNFGLRKPFLPQGKRRYIPMKQPTAQQVMKDAISAGPTTVFLMGAHTNLAIFLLSNPHLKKNIKHIYAMGGA
        IDQGVSTAGQCRYRQAIPVG  GRL ANTNFGLRK FLPQGKRRYIPMKQPTAQQVMKDAISAGPT VFLMGAHTNLAIFLLSNPHLKKNIKH+YAMGGA
Subjt:  IDQGVSTAGQCRYRQAIPVGEKGRLYANTNFGLRKPFLPQGKRRYIPMKQPTAQQVMKDAISAGPTTVFLMGAHTNLAIFLLSNPHLKKNIKHIYAMGGA

Query:  IREICSESADKSHGKTCNNIGNLWPPNTNPYAEFNIFGDPFAAYTVLHSGIPVTLVPLDATSTIPVNKKVFLAFEQRQNTYEAKYCFQSLKMARDTWTGN
        IREICSES   SHGKTC+NIGNLWPPNTNPYAEFNIFGDPFAAYTVLHSGIPVTLVPLDATSTIPVNK+VFLAFEQRQNTYEAKYCFQSLKMARDTW  +
Subjt:  IREICSESADKSHGKTCNNIGNLWPPNTNPYAEFNIFGDPFAAYTVLHSGIPVTLVPLDATSTIPVNKKVFLAFEQRQNTYEAKYCFQSLKMARDTWTGN

Query:  GFFEIYSMWDSFMVGVALSQMYNLDRGG--GNNAYSKMEYLNITIVTSNKPYGISDGSNPLVDGHLLPKFGVQKNGVHSGHVQTGMLDPFCVVPTEIGKC
        GFFE+YSMWDSFMVGVALSQMYNL RGG  G NA+SKMEYLN+TIVTSN+PYGISDGSNP V+G LL  FG QKNGVHSGHVQTGMLDPFC+  T  GKC
Subjt:  GFFEIYSMWDSFMVGVALSQMYNLDRGG--GNNAYSKMEYLNITIVTSNKPYGISDGSNPLVDGHLLPKFGVQKNGVHSGHVQTGMLDPFCVVPTEIGKC

Query:  QDGYTKEADGPESVQVLVAVEAKSTIDANSSIDKAFYISFLDVLNSPRQTGRFDFRAQFPNYREVLYRPKFGKKLLGKPVIFDMDMSTGDFLTLLYLLKT
        QDGYTKEADG ESVQVLVAVEAKSTID NSSIDKAFYISFLDVLNSPRQTGRFDFRAQFP YREVLYRP FGK+LLGKPVIFDMDMSTGDFLTLLYLLKT
Subjt:  QDGYTKEADGPESVQVLVAVEAKSTIDANSSIDKAFYISFLDVLNSPRQTGRFDFRAQFPNYREVLYRPKFGKKLLGKPVIFDMDMSTGDFLTLLYLLKT

Query:  PIEIINLKGIIISPNGWATAATIDVVYDVLHMMGRDDISVGLGDVFAIGEAHPSFPPIGDCK-------------------------------YSAESSV
        PIEIINLKGIIISPNGWATAATIDVVYDVLHMMGRDDISVGLGD+FAIGE HP FPPIGDCK                               Y+AE+SV
Subjt:  PIEIINLKGIIISPNGWATAATIDVVYDVLHMMGRDDISVGLGDVFAIGEAHPSFPPIGDCK-------------------------------YSAESSV

Query:  KYGAFRDTDHPELGQMSALDVWKDVVHSLDLEAKITVLTSGPLTNLAQIIHHKAVSSRIQEVYITGGHINYSVDKGNVFTIPSNEYSEFNFFLDPTAADL
        K+GAFRDTDHPEL QMSALDVWKDVV +LDL+AKITVLTSGPLTNLA+IIHHKA+S+RI+EVYITGGHI+Y VDKGN+FTIPSNEYSEFNFFLDP AADL
Subjt:  KYGAFRDTDHPELGQMSALDVWKDVVHSLDLEAKITVLTSGPLTNLAQIIHHKAVSSRIQEVYITGGHINYSVDKGNVFTIPSNEYSEFNFFLDPTAADL

Query:  VLGSGLNITLIPLNVQRRVSSFHKILKKLELGNRTPEAWFSRRLLSRLYHLKQKHHQYHHVDMFLGEVLGVVSLAGKHLNLKQTFSFKPLKVVTNGGESK
        V GSGLNITLIPLNVQRRVSSF+KILKKL+  NRTPEAWFSRRLL RLY LKQKHHQYHHVDMFLGEV+G VSLAGKHLNLKQTFSFKPLKV++NGGESK
Subjt:  VLGSGLNITLIPLNVQRRVSSFHKILKKLELGNRTPEAWFSRRLLSRLYHLKQKHHQYHHVDMFLGEVLGVVSLAGKHLNLKQTFSFKPLKVVTNGGESK

Query:  VGQTIIDDKKGKWVRVLESVEPLAFYEDLANALADEKQTAVIGSFE
        VGQTIID KKGKWVRVLES+EPLA YEDLANALADEKQTAVI SFE
Subjt:  VGQTIIDDKKGKWVRVLESVEPLAFYEDLANALADEKQTAVIGSFE

A0A1S4DYY6 uncharacterized protein LOC103492210 isoform X20.0e+0085.82Show/hide
Query:  MVWLANSSSHSPMRILVDTDVDTDDIFALFYLLKQPSSLFHLQAITINGNGWSDAGHAVNHLYDMLFMMGRDDIPVGVGGDGGISHNATIFPHVGGYLPL
        MVWL N   +S MRILVDTDVDTDD+  L YLLKQ  SLFHLQ ITINGNGWSDAGHAVNHLYDMLFMMGRDDIPVGVGGDGGIS +ATI P++GGYLPL
Subjt:  MVWLANSSSHSPMRILVDTDVDTDDIFALFYLLKQPSSLFHLQAITINGNGWSDAGHAVNHLYDMLFMMGRDDIPVGVGGDGGISHNATIFPHVGGYLPL

Query:  IDQGVSTAGQCRYRQAIPVGEKGRLYANTNFGLRKPFLPQGKRRYIPMKQPTAQQVMKDAISAGPTTVFLMGAHTNLAIFLLSNPHLKKNIKHIYAMGGA
        IDQGVSTAGQCRYRQAIPVG  GRL ANTNFGLRK FLPQGKRRYIPMKQPTAQQVMKDAISAGPT VFLMGAHTNLAIFLLSNPHLKKNIKH+YAMGGA
Subjt:  IDQGVSTAGQCRYRQAIPVGEKGRLYANTNFGLRKPFLPQGKRRYIPMKQPTAQQVMKDAISAGPTTVFLMGAHTNLAIFLLSNPHLKKNIKHIYAMGGA

Query:  IREICSESADKSHGKTCNNIGNLWPPNTNPYAEFNIFGDPFAAYTVLHSGIPVTLVPLDATSTIPVNKKVFLAFEQRQNTYEAKYCFQSLKMARDTWTGN
        IREICSES   SHGKTC+NIGNLWPPNTNPYAEFNIFGDPFAAYTVLHSGIPVTLVPLDATSTIPVNK+VFLAFEQRQNTYEAKYCFQSLKMARDTW  +
Subjt:  IREICSESADKSHGKTCNNIGNLWPPNTNPYAEFNIFGDPFAAYTVLHSGIPVTLVPLDATSTIPVNKKVFLAFEQRQNTYEAKYCFQSLKMARDTWTGN

Query:  GFFEIYSMWDSFMVGVALSQMYNLDRGG--GNNAYSKMEYLNITIVTSNKPYGISDGSNPLVDGHLLPKFGVQKNGVHSGHVQTGMLDPFCVVPTEIGKC
        GFFE+YSMWDSFMVGVALSQMYNL RGG  G NA+SKMEYLN+TIVTSN+PYGISDGSNP V+G LL  FG QKNGVHSGHVQTGMLDPFC+  T  GKC
Subjt:  GFFEIYSMWDSFMVGVALSQMYNLDRGG--GNNAYSKMEYLNITIVTSNKPYGISDGSNPLVDGHLLPKFGVQKNGVHSGHVQTGMLDPFCVVPTEIGKC

Query:  QDGYTKEADGPESVQVLVAVEAKSTIDANSSIDKAFYISFLDVLNSPRQTGRFDFRAQFPNYREVLYRPKFGKKLLGKPVIFDMDMSTGDFLTLLYLLKT
        QDGYTKEADG ESVQVLVAVEAKSTID NSSIDKAFYISFLDVLNSPRQTGRFDFRAQFP YREVLYRP FGK+LLGKPVIFDMDMSTGDFLTLLYLLKT
Subjt:  QDGYTKEADGPESVQVLVAVEAKSTIDANSSIDKAFYISFLDVLNSPRQTGRFDFRAQFPNYREVLYRPKFGKKLLGKPVIFDMDMSTGDFLTLLYLLKT

Query:  PIEIINLKGIIISPNGWATAATIDVVYDVLHMMGRDDISVGLGDVFAIGEAHPSFPPIGDCK-------------------------------YSAESSV
        PIEIINLKGIIISPNGWATAATIDVVYDVLHMMGRDDISVGLGD+FAIGE HP FPPIGDCK                               Y+AE+SV
Subjt:  PIEIINLKGIIISPNGWATAATIDVVYDVLHMMGRDDISVGLGDVFAIGEAHPSFPPIGDCK-------------------------------YSAESSV

Query:  KYGAFRDTDHPELGQMSALDVWKDVVHSLDLEAKITVLTSGPLTNLAQIIHHKAVSSRIQEVYITGGHINYSVDKGNVFTIPSNEYSEFNFFLDPTAADL
        K+GAFRDTDHPEL QMSALDVWKDVV +LDL+AKITVLTSGPLTNLA+IIHHKA+S+RI+EVYITGGHI+Y VDKGN+FTIPSNEYSEFNFFLDP AADL
Subjt:  KYGAFRDTDHPELGQMSALDVWKDVVHSLDLEAKITVLTSGPLTNLAQIIHHKAVSSRIQEVYITGGHINYSVDKGNVFTIPSNEYSEFNFFLDPTAADL

Query:  VLGSGLNITLIPLNVQRRVSSFHKILKKLELGNRTPEAWFSRRLLSRLYHLKQKHHQYHHVDMFLGEVLGVVSLAGKHLNLKQTFSFKPLKVVTNGGESK
        V GSGLNITLIPLNVQRRVSSF+KILKKL+  NRTPEAWFSRRLL RLY LKQKHHQYHHVDMFLGEV+G VSLAGKHLNLKQTFSFKPLKV++NGGESK
Subjt:  VLGSGLNITLIPLNVQRRVSSFHKILKKLELGNRTPEAWFSRRLLSRLYHLKQKHHQYHHVDMFLGEVLGVVSLAGKHLNLKQTFSFKPLKVVTNGGESK

Query:  VGQTIIDDKKGKWVRVLESVEPLAFYEDLANALADEKQTAVIGSFE
        VGQTIID KKGKWVRVLES+EPLA YEDLANALADEKQTAVI SFE
Subjt:  VGQTIIDDKKGKWVRVLESVEPLAFYEDLANALADEKQTAVIGSFE

A0A6J1DM35 uncharacterized protein LOC1110218210.0e+0083.04Show/hide
Query:  MVWLANSSSHS------PMRILVDTDVDTDDIFALFYLLKQPSSLFHLQAITINGNGWSDAGHAVNHLYDMLFMMGRDDIPVGVGGDGGISHNAT--IFP
        +VWL NSS HS        RI+VDTDVDTDD+FA+FYLLKQP+SLFHLQAITINGNGWS+AGHAVNH+YDMLFMMGRDDIPVGVGG+GGIS N T  I P
Subjt:  MVWLANSSSHS------PMRILVDTDVDTDDIFALFYLLKQPSSLFHLQAITINGNGWSDAGHAVNHLYDMLFMMGRDDIPVGVGGDGGISHNAT--IFP

Query:  H--VGGYLPLIDQGVSTAGQCRYRQAIPVGEKGRLYANTNFGLRKPFLPQGKRRYIPMKQPTAQQVMKDAISAGPTTVFLMGAHTNLAIFLLSNPHLKKN
        H  VGG+LPLIDQG+STAG CRYRQAIPVGEKGRLYANTNFGLRK FLPQG RRY P+KQPTAQQV+KDAISAGPTTVFLMG HTNLAIFL++NPHLKKN
Subjt:  H--VGGYLPLIDQGVSTAGQCRYRQAIPVGEKGRLYANTNFGLRKPFLPQGKRRYIPMKQPTAQQVMKDAISAGPTTVFLMGAHTNLAIFLLSNPHLKKN

Query:  IKHIYAMGGAIREICSESADKSHGKTCNNIGNLWPPNTNPYAEFNIFGDPFAAYTVLHSGIPVTLVPLDATSTIPVNKKVFLAFEQRQNTYEAKYCFQSL
        IKHIYAMGGAIREICS   DKSHGKTCNNIGNLWPPNTNPYAEFNIFGDPFAAYTVLHSGIPVTLVPLDATSTIPV+K VFLAFEQR NTYEA+YCFQSL
Subjt:  IKHIYAMGGAIREICSESADKSHGKTCNNIGNLWPPNTNPYAEFNIFGDPFAAYTVLHSGIPVTLVPLDATSTIPVNKKVFLAFEQRQNTYEAKYCFQSL

Query:  KMARDTWTGNGFFEIYSMWDSFMVGVALSQMYNLDRGGGNNAYSKMEYLNITIVTSNKPYGISDGSNPLVDGHLLPKFGVQKNGVHSGHVQTGMLDPFCV
        KMARDTW  NGFFEIYSMWDSFMVGV+LSQM+NLD+GGG+NAYSKMEY+NITIVTSN+PYGISDGSNPLVDGHL+PKFGVQKNGVHSGHVQTGMLDPFC+
Subjt:  KMARDTWTGNGFFEIYSMWDSFMVGVALSQMYNLDRGGGNNAYSKMEYLNITIVTSNKPYGISDGSNPLVDGHLLPKFGVQKNGVHSGHVQTGMLDPFCV

Query:  VPTEIGKCQDGYTKEADGPESVQVLVAVEAKSTIDANSSIDKAFYISFLDVLNSPRQTGRFDFRAQFPNYREVLYRPKFGKKLLGKPVIFDMDMSTGDFL
        + T  GKCQDGYTKEA+G ESVQVLVAVEAKST D NSSIDKAFYISFLDVLNSP+QTGRFDFRAQFPNY+EVLYRPKFGKKLLGKPV+FDMDMSTGDF+
Subjt:  VPTEIGKCQDGYTKEADGPESVQVLVAVEAKSTIDANSSIDKAFYISFLDVLNSPRQTGRFDFRAQFPNYREVLYRPKFGKKLLGKPVIFDMDMSTGDFL

Query:  TLLYLLKTPIEIINLKGIIISPNGWATAATIDVVYDVLHMMGRDDISVGLGDVFAIGEAHPSFPPIGDCK------------------------------
        TLLYLLKTP+EII+LKGIIISPNGWATAATIDVVYDVLHMMGRDDI VGLGD+FAIGEAHPSFPPIGDCK                              
Subjt:  TLLYLLKTPIEIINLKGIIISPNGWATAATIDVVYDVLHMMGRDDISVGLGDVFAIGEAHPSFPPIGDCK------------------------------

Query:  -YSAESSVKYGAFRDTDHPELGQMSALDVWKDVVHSLDLEAKITVLTSGPLTNLAQIIHHKAVSSRIQEVYITGGHINYSVDKGNVFTIPSNEYSEFNFF
         Y+AE+SVK+GA RDTDHPEL QMSAL+VWK +V SLD   KITVLT+GPLTNLAQI+  KA+ SRIQEVYITGGHI +  DKGNVFTIPSN Y+EFNFF
Subjt:  -YSAESSVKYGAFRDTDHPELGQMSALDVWKDVVHSLDLEAKITVLTSGPLTNLAQIIHHKAVSSRIQEVYITGGHINYSVDKGNVFTIPSNEYSEFNFF

Query:  LDPTAADLVLGSGLNITLIPLNVQRRVSSFHKILKKLELGNRTPEAWFSRRLLSRLYHLKQKHHQYHHVDMFLGEVLGVVSLAGKHLNLKQTFSFKPLKV
        LDPTAA+LVLGSGLNITLIPLNVQRRVSSFHKILK+L+L N+TPEA FS+RL SRLYHLKQ HHQYHHVDMFLGEVLG VSLAGKH+NLK+ FSFKPLKV
Subjt:  LDPTAADLVLGSGLNITLIPLNVQRRVSSFHKILKKLELGNRTPEAWFSRRLLSRLYHLKQKHHQYHHVDMFLGEVLGVVSLAGKHLNLKQTFSFKPLKV

Query:  VTNGGESKVGQTIIDDKKGKWVRVLESVEPLAFYEDLANALADEKQTAVIGSFEGQKRQWT
        VTNGGESKVGQTIID+KKGKWVRVLESVEPLAFYE LA+AL DEKQ+AV+GSFE QKR W+
Subjt:  VTNGGESKVGQTIIDDKKGKWVRVLESVEPLAFYEDLANALADEKQTAVIGSFEGQKRQWT

SwissProt top hitse value%identityAlignment
P29056 Transcription factor IIIB 70 kDa subunit8.9e-3125.47Show/hide
Query:  CSNC-----VKNVAGSRDEAGFLYCDMCGKVLDFYNFSQDPTFTKDSGGQSQLSGNFVRSIQSNFS-------ASRERTLNKAFEDMRYMRNGLNMGESD
        C NC      ++++ + ++   L C  CG V +      + TF + S G + + G+F+ + QS+ +        SRE TLN A   +R +   L++ E  
Subjt:  CSNC-----VKNVAGSRDEAGFLYCDMCGKVLDFYNFSQDPTFTKDSGGQSQLSGNFVRSIQSNFS-------ASRERTLNKAFEDMRYMRNGLNMGESD

Query:  EIIRVAGAFYRIALERNFTRGRNTEFVQAACLYIACR-------------------YVLGAVFLQLCKVLRLEEHPIVQKPVDPSLFIDKFTQCLLGGTK
         I   A  +Y++AL  NF +GR ++ V A+CLY+ACR                   Y +GA FL++ K L + E P+     DPSLFI  F + L    K
Subjt:  EIIRVAGAFYRIALERNFTRGRNTEFVQAACLYIACR-------------------YVLGAVFLQLCKVLRLEEHPIVQKPVDPSLFIDKFTQCLLGGTK

Query:  EDGMKKEVSRTALKIITSMKRDWMQTGRKPSGLCGAALYISALSNGVKCSKSDIIKIVHICDATLTKRLIEFENTESGSLTMEEF--------------I
            K +V + A+K+   M +DWM  GR+P+G+ GA + ++   N ++ + ++I+ + H+ + TL +RL EF+NT++  L++++F               
Subjt:  EDGMKKEVSRTALKIITSMKRDWMQTGRKPSGLCGAALYISALSNGVKCSKSDIIKIVHICDATLTKRLIEFENTESGSLTMEEF--------------I

Query:  VMADKV--KGSNSYTNNGLNATSDEVL--------CVHKNECKKPYALGLCKSCYD------DFVELSGGLDGGSNPPAFQSAEKERMEKAMVEEGSNDS
        V   K   K  +S     +  TS+E L         + + E      L   K   +      + ++ + G+DG                + +  EGS + 
Subjt:  VMADKV--KGSNSYTNNGLNATSDEVL--------CVHKNECKKPYALGLCKSCYD------DFVELSGGLDGGSNPPAFQSAEKERMEKAMVEEGSNDS

Query:  SAIGKFSQGLNPCNNTEKESDNVHADASKTVGSKEAEAK---GAADEQRGLDEGANKIGADGLG---------------------ATASDESDNWSDIDD
        +   K S+          ++++V  +  +T G++E   K     ++E++  + G  +   DG                       +  SD+ DN  D+DD
Subjt:  SAIGKFSQGLNPCNNTEKESDNVHADASKTVGSKEAEAK---GAADEQRGLDEGANKIGADGLG---------------------ATASDESDNWSDIDD

Query:  IEVDGYLHNEEEKHYKKIIWEEMNREYLEEQAAK
         E++ +L NEE    K+ IW  +N ++L EQ +K
Subjt:  IEVDGYLHNEEEKHYKKIIWEEMNREYLEEQAAK

P46070 Transcription factor IIIB 70 kDa subunit3.7e-3729.39Show/hide
Query:  CSNC-----VKNVAGSRDEAGFLYCDMCGKVLDFYNFSQDPTFTKDSGGQSQLSGNFVRSIQS----------NFSASRERTLNKAFEDMRYMRNGLNMG
        C NC     V++++ + +E   L C +CG V +  +   + TF + S G + + G FV + Q+          N   SRE TLN A   ++ +   LN+ 
Subjt:  CSNC-----VKNVAGSRDEAGFLYCDMCGKVLDFYNFSQDPTFTKDSGGQSQLSGNFVRSIQS----------NFSASRERTLNKAFEDMRYMRNGLNMG

Query:  ESDEIIRVAGAFYRIALERNFTRGRNTEFVQAACLYIACR-------------------YVLGAVFLQLCKVLRLEEHPIVQKPVDPSLFIDKFTQCLLG
        E   +   A  +YR+AL  NF +GR ++ V AACLYIACR                   Y +GA FL+L K L++ + P+     DPSLFI  F + L  
Subjt:  ESDEIIRVAGAFYRIALERNFTRGRNTEFVQAACLYIACR-------------------YVLGAVFLQLCKVLRLEEHPIVQKPVDPSLFIDKFTQCLLG

Query:  GTKEDGMKKEVSRTALKIITSMKRDWMQTGRKPSGLCGAALYISALSNGVKCSKSDIIKIVHICDATLTKRLIEFENTESGSLTMEEFIVMADKVKGSNS
        G K    K +V R A+K+  +M RDWM  GR+P+G+ GA L ++   N ++ + S+I+ I H+ + TL +RL EF+NT S  L+++EF       +   +
Subjt:  GTKEDGMKKEVSRTALKIITSMKRDWMQTGRKPSGLCGAALYISALSNGVKCSKSDIIKIVHICDATLTKRLIEFENTESGSLTMEEFIVMADKVKGSNS

Query:  YTNNGLNATSDEVLCVHKNECKKPYALGLCKSCYDDFVELSGGLDGGSNPP-------AFQSAEKERME--KAMVEEGSNDSSAIGKFSQGLNPCNNTEK
          N G  +   +     KN  K+       K   D    L    +  S  P       A + + KE +   K + E    + S I K + G+        
Subjt:  YTNNGLNATSDEVLCVHKNECKKPYALGLCKSCYDDFVELSGGLDGGSNPP-------AFQSAEKERME--KAMVEEGSNDSSAIGKFSQGLNPCNNTEK

Query:  ESDNVHADASKTVGSKEAEAKGAADEQRGLDE-GANKIGAD--------------GLGATASDESDNWSDIDDIEVDGYLHNEEEKHYKKIIWEEMNREY
        + +++H              K   D++R LDE     +  D               L +  SD  +N  D+DD E+D +L +EE    K+ IW ++N +Y
Subjt:  ESDNVHADASKTVGSKEAEAKGAADEQRGLDE-GANKIGAD--------------GLGATASDESDNWSDIDDIEVDGYLHNEEEKHYKKIIWEEMNREY

Query:  LEEQAAK
        L EQ +K
Subjt:  LEEQAAK

Q8CFK2 Transcription factor IIIB 90 kDa subunit2.3e-3429.09Show/hide
Query:  GFLYCDMCGKVLDFYNFSQDPTFTKDSGGQSQLSGNFVR--------SIQSNF-----SASRERTLNKAFEDMRYMRNGLNMGESDEIIRVAGAFYRIAL
        G   C  CG VL+      +  F ++SGG S   G FV         ++   F       SR +TL      + ++ + L + +    +  A  F+++A+
Subjt:  GFLYCDMCGKVLDFYNFSQDPTFTKDSGGQSQLSGNFVR--------SIQSNF-----SASRERTLNKAFEDMRYMRNGLNMGESDEIIRVAGAFYRIAL

Query:  ERNFTRGRNTEFVQAACLYIACR-------------------YVLGAVFLQLCKVLRLEEHPIVQKPVDPSLFIDKFTQCLLGGTKEDGMKKEVSRTALK
         ++ TRGR    V AACLY+ CR                   YVLG  FL L + L +         +DP L+I +F   L  G K      EVS TAL+
Subjt:  ERNFTRGRNTEFVQAACLYIACR-------------------YVLGAVFLQLCKVLRLEEHPIVQKPVDPSLFIDKFTQCLLGGTKEDGMKKEVSRTALK

Query:  IITSMKRDWMQTGRKPSGLCGAALYISALSNGVKCSKSDIIKIVHICDATLTKRLIEFENTESGSLTMEEFI-VMADKVKGSNSYT--------------
        ++  MKRDWM TGR+PSGLCGAAL ++A  +  + +  ++I +V +C++TL KRL EFE+T +  LT++EF+ +  ++     SYT              
Subjt:  IITSMKRDWMQTGRKPSGLCGAALYISALSNGVKCSKSDIIKIVHICDATLTKRLIEFENTESGSLTMEEFI-VMADKVKGSNSYT--------------

Query:  -NNGLNATSDEVLCVH-----KNECKKPYALGLCKSCYDDFVELSGGLDGGSNPPAFQSAEKERMEKAMVEEGSNDSSAIGKFSQGLNPCNNTEKESDNV
         +  L     E+         + E  +P A G   +   D      G D  S+P   +  E E +E A      +    +     G     + +  S  +
Subjt:  -NNGLNATSDEVLCVH-----KNECKKPYALGLCKSCYDDFVELSGGLDGGSNPPAFQSAEKERMEKAMVEEGSNDSSAIGKFSQGLNPCNNTEKESDNV

Query:  HADASKTVGSKEAEAKGAADEQRGLDEGANKIGADGLGATASDESDNWSDIDDIEVDGYLHNEEEKHYKKIIWEEMNREYLEEQAAKDAATAAAK
         A  S       A + G +D  R   E  +    D   ++   E D  S IDD+E+D Y+ NE E   K  +W   N EYL EQ  K+A  A  K
Subjt:  HADASKTVGSKEAEAKGAADEQRGLDEGANKIGADGLGATASDESDNWSDIDDIEVDGYLHNEEEKHYKKIIWEEMNREYLEEQAAKDAATAAAK

Q92994 Transcription factor IIIB 90 kDa subunit2.1e-3529.78Show/hide
Query:  GFLYCDMCGKVLDFYNFSQDPTFTKDSGGQSQLSGNFVR--------SIQSNF-----SASRERTLNKAFEDMRYMRNGLNMGESDEIIRVAGAFYRIAL
        G   C  CG VL+      +  F + SGG S   G FV         ++   F       SR +TL      + ++ N L + +    +  A  F+++A+
Subjt:  GFLYCDMCGKVLDFYNFSQDPTFTKDSGGQSQLSGNFVR--------SIQSNF-----SASRERTLNKAFEDMRYMRNGLNMGESDEIIRVAGAFYRIAL

Query:  ERNFTRGRNTEFVQAACLYIACR-------------------YVLGAVFLQLCKVLRLEEHPIVQKPVDPSLFIDKFTQCLLGGTKEDGMKKEVSRTALK
         R+ TRGR    V AACLY+ CR                   YVLG  FL L + L +         +DP L+I +F   L  G K      EVS TAL+
Subjt:  ERNFTRGRNTEFVQAACLYIACR-------------------YVLGAVFLQLCKVLRLEEHPIVQKPVDPSLFIDKFTQCLLGGTKEDGMKKEVSRTALK

Query:  IITSMKRDWMQTGRKPSGLCGAALYISALSNGVKCSKSDIIKIVHICDATLTKRLIEFENTESGSLTMEEFI-VMADKVKGSNSYTNNGLNATS---DEV
        ++  MKRDWM TGR+PSGLCGAAL ++A  +  + +  ++I +V +C++TL KRL EFE+T +  LT++EF+ +  ++     SYT           ++V
Subjt:  IITSMKRDWMQTGRKPSGLCGAALYISALSNGVKCSKSDIIKIVHICDATLTKRLIEFENTESGSLTMEEFI-VMADKVKGSNSYTNNGLNATS---DEV

Query:  LCVHKNECKKPYALGLCKSCYDDFVEL---------SGGL-----DGGSNPPAF-----QSAEKERMEKAMVEEGSNDSSAIGKFSQGLNPCNNTEKESD
        L     E +         S Y D +E+          GGL     DG +   A      +  E E +E A      +    +   + G +    + +   
Subjt:  LCVHKNECKKPYALGLCKSCYDDFVEL---------SGGL-----DGGSNPPAF-----QSAEKERMEKAMVEEGSNDSSAIGKFSQGLNPCNNTEKESD

Query:  NVHADASKTVGSKEAEAKGAADEQRGLDEGANKIGADGLGATASDESDNWSDIDDIEVDGYLHNEEEKHYKKIIWEEMNREYLEEQAAKDAATAAAK
           A  S       A + G +D  R   E  +   +D   A+   E D  S IDD+E+D Y+ NE E   K  +W   N EYL EQ  K+A  A  K
Subjt:  NVHADASKTVGSKEAEAKGAADEQRGLDEGANKIGADGLGATASDESDNWSDIDDIEVDGYLHNEEEKHYKKIIWEEMNREYLEEQAAKDAATAAAK

Q9P6R0 Transcription factor IIIB 60 kDa subunit2.1e-3226.42Show/hide
Query:  CSNCVKNVAGSRDEAGFLYCDMCGKVLDFYNFSQDPTFTKDSGGQSQLSGNFVRSIQSNFSA------------SRERTLNKAFEDMRYMRNGLNMGESD
        C NC      S   +G  YC  CG V++      + TF + S G + + G+ V + Q++               SRE T+      +  +   L + E  
Subjt:  CSNCVKNVAGSRDEAGFLYCDMCGKVLDFYNFSQDPTFTKDSGGQSQLSGNFVRSIQSNFSA------------SRERTLNKAFEDMRYMRNGLNMGESD

Query:  EIIRVAGAFYRIALERNFTRGRNTEFVQAACLYIACR-------------------YVLGAVFLQLCKVLRLEEHPIVQKPVDPSLFIDKFTQCLLGGTK
          I  A  ++ +A+  NF +GR +++V A+CLYI CR                   + LG+ FL+LC+VLR    P+    +DPSL+I +F   L  G +
Subjt:  EIIRVAGAFYRIALERNFTRGRNTEFVQAACLYIACR-------------------YVLGAVFLQLCKVLRLEEHPIVQKPVDPSLFIDKFTQCLLGGTK

Query:  EDGMKKEVSRTALKIITSMKRDWMQTGRKPSGLCGAALYISALSNGVKCSKSDIIKIVHICDATLTKRLIEFENTESGSLTMEEFIVMADKVKGSNSYTN
               V+  A++++  M RDWMQ GR+P+G+CGA L I+A  N  + S  +++ +V + D T+ KRL EF+ TESG L++ +F          N +  
Subjt:  EDGMKKEVSRTALKIITSMKRDWMQTGRKPSGLCGAALYISALSNGVKCSKSDIIKIVHICDATLTKRLIEFENTESGSLTMEEFIVMADKVKGSNSYTN

Query:  NGLNATSDEVLCVHKNECKKPYALGLCKSCYDDFVELSGGLDGGSNPPAFQSAEKERMEKAMVEEGSNDSSAIGKFSQGLNPCNNTE----KESDNVHAD
                                                L+G S+PP+F   +K        + G+   S I    + ++P   T      E  +    
Subjt:  NGLNATSDEVLCVHKNECKKPYALGLCKSCYDDFVELSGGLDGGSNPPAFQSAEKERMEKAMVEEGSNDSSAIGKFSQGLNPCNNTE----KESDNVHAD

Query:  ASKTVGSKEAEAKGAADEQRGLDEGANKIGADGL-------GATASDESDNWSDIDDIEVDGYLHNEEEKHYKKIIWEEMNREYLEEQAAKD
         +  V S+E      ADE+    E    +  D L           S+E     D+DD E++  L +++E   K  +W E+N++YL E+ AK+
Subjt:  ASKTVGSKEAEAKGAADEQRGLDEGANKIGADGL-------GATASDESDNWSDIDDIEVDGYLHNEEEKHYKKIIWEEMNREYLEEQAAKD

Arabidopsis top hitse value%identityAlignment
AT2G01280.1 Cyclin/Brf1-like TBP-binding protein1.5e-9744Show/hide
Query:  MVWCSNCVKNVAGSRDEAGFLYCDMCGKVLDFYNFSQDPTFTKDSGGQSQLSGNFVRSIQSNFSASRERTLNKAFEDMRYMRNGLNMG-ESDEIIRVAGA
        MVWC +C KNV   R   G L CD+CG++L+ +NFS D TF K++ GQ     N V S+ +  S+SR+R   KA +++R +++ L +G E D+++ +A  
Subjt:  MVWCSNCVKNVAGSRDEAGFLYCDMCGKVLDFYNFSQDPTFTKDSGGQSQLSGNFVRSIQSNFSASRERTLNKAFEDMRYMRNGLNMG-ESDEIIRVAGA

Query:  FYRIALERNFTRGRNTEFVQAACLYIACRYV------LGAVFLQLCKVLRLEEHPIVQKPVDPSLFIDKFTQCLLGGTKEDGMKKEVSRTALKIITSMKR
        FY  A+++NFT+GR  E VQ++CLY+AC Y+      LG+V+LQLC++L L ++   ++ VDPS+FI +FT  LL G       K+V+ TA  II+SMKR
Subjt:  FYRIALERNFTRGRNTEFVQAACLYIACRYV------LGAVFLQLCKVLRLEEHPIVQKPVDPSLFIDKFTQCLLGGTKEDGMKKEVSRTALKIITSMKR

Query:  DWMQTGRKPSGLCGAALYISALSNGVKCSKSDIIKIVHICDATLTKRLIEFENTESGSLTMEEFIVMADKVKGSNSYTNNGLNATSDEVLCVHKNECKKP
        DW+QTGRKPSG+CGAA+Y++ALS+G+  S++DI K+VH+C+AT+TKRL EF NTE+GSLT++E +  ++++    ++T    N+    V C HK+   K 
Subjt:  DWMQTGRKPSGLCGAALYISALSNGVKCSKSDIIKIVHICDATLTKRLIEFENTESGSLTMEEFIVMADKVKGSNSYTNNGLNATSDEVLCVHKNECKKP

Query:  YALGLCKSCYDDFVELSGGLDGGSNPPAFQSAEKERMEKAMVEEGSNDSSAIGKFSQGLNPCNNTEKESDNVHADASKTVGSKEAEAKGAADEQRGLDEG
        +  GLCKSC+DDF+ +SGG+ GGS+PPA+Q AEKERMEKA  EE   +   IG         N    E  NV   A K     E E  G           
Subjt:  YALGLCKSCYDDFVELSGGLDGGSNPPAFQSAEKERMEKAMVEEGSNDSSAIGKFSQGLNPCNNTEKESDNVHADASKTVGSKEAEAKGAADEQRGLDEG

Query:  ANKIGADGLGATASDESDNWSDIDDIEVDGYLHNEEEKHYKKIIWEEMNREYLEEQAAKDAATAAAKNAYEANFQNCSEDLLAAKDLADAAAAAVAKSRK
                  A  SDESD  SD DD EV+  L  E+E   K   W   N++YLEEQA K+AA  AA         NC ED   A++L +A+ AAVA SRK
Subjt:  ANKIGADGLGATASDESDNWSDIDDIEVDGYLHNEEEKHYKKIIWEEMNREYLEEQAAKDAATAAAKNAYEANFQNCSEDLLAAKDLADAAAAAVAKSRK

AT2G45100.1 Cyclin/Brf1-like TBP-binding protein1.0e-9341.15Show/hide
Query:  MVWCSNCVKNVAGSRDEAGFLYCDMCGKVLDFYNFSQDPTFTKDSGGQSQLSGNFVRSIQSNFSASRERTLNKAFEDMRYMRNGLNMGES-DEIIRVAGA
        MVWC +C KNV G R     L CD+CG++L+ +NFS + TF K++ GQSQ SGN ++S+QS  S+SRER + KA +++  +R+ L +G+  D++I +A  
Subjt:  MVWCSNCVKNVAGSRDEAGFLYCDMCGKVLDFYNFSQDPTFTKDSGGQSQLSGNFVRSIQSNFSASRERTLNKAFEDMRYMRNGLNMGES-DEIIRVAGA

Query:  FYRIALERNFTRGRNTEFVQAACLYIACR-------------------YVLGAVFLQLCKVLRLEEHPIVQKPVDPSLFIDKFTQCLLGGTKEDGMKKEV
        F+RIAL+ NFT+GR+ E V ++CLY+ CR                   Y LG+V+LQLC +L + E+   +K VDPS+FI +F+  LL G   +    ++
Subjt:  FYRIALERNFTRGRNTEFVQAACLYIACR-------------------YVLGAVFLQLCKVLRLEEHPIVQKPVDPSLFIDKFTQCLLGGTKEDGMKKEV

Query:  SRTALKIITSMKRDWMQTGRKPSGLCGAALYISALSNGVKCSKSDIIKIVHICDATLTKRLIEFENTESGSLTMEEFI-------VMADKVKGSNSYTNN
          TA  II SMKRDWMQTGRKPSG+CGAALY +ALS+G+KCSK+DI+ IVHIC+ATLTKRLIEF +TE+ SLT +E           A + K   ++   
Subjt:  SRTALKIITSMKRDWMQTGRKPSGLCGAALYISALSNGVKCSKSDIIKIVHICDATLTKRLIEFENTESGSLTMEEFI-------VMADKVKGSNSYTNN

Query:  GLNATSDEVLCVHKNECKKPYALGLCKSCYDDFVELSGGLDGGSNPPAFQSAEKERMEKAMVEEGSNDSSAIGKFSQGLNPCNNTEKESDNVHADASKTV
        G+      VLC+H+ +C KP   GLC+SCYD+F+ +SGGL+GGS+PPAFQ AEKERME                                          
Subjt:  GLNATSDEVLCVHKNECKKPYALGLCKSCYDDFVELSGGLDGGSNPPAFQSAEKERMEKAMVEEGSNDSSAIGKFSQGLNPCNNTEKESDNVHADASKTV

Query:  GSKEAEAKGAADEQRGLDEGANKIGADGLGATASDESDNWSDIDDIEVDGYLHNEEEKHYKKIIWEEMNREYLEEQAAKDAATAAAKNAYEANFQNCSED
                    E+   +E   ++  DG     SDES   SD+DD E+D Y    EE    KI ++  N  Y E++AAK AA   A N            
Subjt:  GSKEAEAKGAADEQRGLDEGANKIGADGLGATASDESDNWSDIDDIEVDGYLHNEEEKHYKKIIWEEMNREYLEEQAAKDAATAAAKNAYEANFQNCSED

Query:  LLAAKDLADAAAAAVAKSRK
           A ++ +A+ AA AKSRK
Subjt:  LLAAKDLADAAAAAVAKSRK

AT3G09360.1 Cyclin/Brf1-like TBP-binding protein1.5e-11847.67Show/hide
Query:  MVWCSNCVKNVAGSRDEAGFLYCDMCGKVLDFYNFSQDPTFTKDSGGQSQLSGNFVRSIQSNFSASRERTLNKAFEDMRYMRNGLNMG-ESDEIIRVAGA
        MVWC++CVKNV G R   G L C++CG++L+ ++FS + TF K++ GQSQ SGN VRS+QS  ++SRER    A +++  +++ L +G E D++I +A  
Subjt:  MVWCSNCVKNVAGSRDEAGFLYCDMCGKVLDFYNFSQDPTFTKDSGGQSQLSGNFVRSIQSNFSASRERTLNKAFEDMRYMRNGLNMG-ESDEIIRVAGA

Query:  FYRIALERNFTRGRNTEFVQAACLYIACR-------------------YVLGAVFLQLCKVLRLEEHPIVQKPVDPSLFIDKFTQCLLGGTKEDGMKKEV
        F+ +A+E+NFT+GR TE VQA+CLY+ CR                   Y LG+V+LQLC++L L E+   +K VDPS+F+D+F+  LL G       K+V
Subjt:  FYRIALERNFTRGRNTEFVQAACLYIACR-------------------YVLGAVFLQLCKVLRLEEHPIVQKPVDPSLFIDKFTQCLLGGTKEDGMKKEV

Query:  SRTALKIITSMKRDWMQTGRKPSGLCGAALYISALSNGVKCSKSDIIKIVHICDATLTKRLIEFENTESGSLTMEEFIVMADKVKGSNSYTNNGLNATSD
          TA  II SMKRDW+QTGRKPSG+CGAALY +ALS+G+KCSK+DI+ IVHIC+ATLTKRLIEF +T+SG+L + E   + ++     S+T     +  +
Subjt:  SRTALKIITSMKRDWMQTGRKPSGLCGAALYISALSNGVKCSKSDIIKIVHICDATLTKRLIEFENTESGSLTMEEFIVMADKVKGSNSYTNNGLNATSD

Query:  EVLCVHKNECKKPYALGLCKSCYDDFVELSGGLDGGSNPPAFQSAEKERMEKAMVEEGSNDSSAIGKFSQGLNPCNNTEKESDNVHAD-ASKTVGSKEAE
         VLC+H++   KP+  GLC+ CY DF+ +SGGL GGSNPPAFQ AEKERMEKA  EE             G++  N+ E+    +++D  S +   K+  
Subjt:  EVLCVHKNECKKPYALGLCKSCYDDFVELSGGLDGGSNPPAFQSAEKERMEKAMVEEGSNDSSAIGKFSQGLNPCNNTEKESDNVHAD-ASKTVGSKEAE

Query:  AKGAADEQRGLDEGANKIGADGLGATASDESDNWSDIDDIEVDGYLHNEEEKHYKKIIWEEMNREYLEEQAAKDAATAAAKNAYEANFQNCSEDLLAAKD
         KG  D+  G +E A+           SDESDN+SDI D EV+GY++NEEE HYK I W EMN++YLEEQAAK+AA  AA  A +A+  NC ED   A+ 
Subjt:  AKGAADEQRGLDEGANKIGADGLGATASDESDNWSDIDDIEVDGYLHNEEEKHYKKIIWEEMNREYLEEQAAKDAATAAAKNAYEANFQNCSEDLLAAKD

Query:  LADAAAAAVAKSRK
          +AA A  AKSRK
Subjt:  LADAAAAAVAKSRK

AT5G18860.1 inosine-uridine preferring nucleoside hydrolase family protein3.1e-27355.98Show/hide
Query:  SSSHSPMRILVDTDVDTDDIFALFYLLKQPSSLFHLQAITINGNGWSDAGHAVNHLYDMLFMMGRDDIPVGVGGDGGISHNATIFPHVGGYLPLIDQGVS
        SSSH   RILVDTDVDTDD+FA+ YLLK   S F L  IT++ N W++AGHAVN +YD+L MM RDDIPVGVGG+GGIS + TI   VGGY P+I+QG++
Subjt:  SSSHSPMRILVDTDVDTDDIFALFYLLKQPSSLFHLQAITINGNGWSDAGHAVNHLYDMLFMMGRDDIPVGVGGDGGISHNATIFPHVGGYLPLIDQGVS

Query:  TAGQCRYRQAIPVGEKGRLYANTNFGLRKPFLPQGKRRYIPMKQPTAQQVMKDAISAGPTTVFLMGAHTNLAIFLLSNPHLKKNIKHIYAMGGAIRE---
        T G+CRYRQAIP G  G L  ++N+G RK FLPQG RRY P++QPTAQ+V+ D IS GPTTV L+G+HTN A+FL+SNPHLK NI+HIY MGG +R    
Subjt:  TAGQCRYRQAIPVGEKGRLYANTNFGLRKPFLPQGKRRYIPMKQPTAQQVMKDAISAGPTTVFLMGAHTNLAIFLLSNPHLKKNIKHIYAMGGAIRE---

Query:  ----ICSESADKSHGKTCNNIGNLWPPNT-NPYAEFNIFGDPFAAYTVLHSGIPVTLVPLDATSTIPVNKKVFLAFEQR-QNTYEAKYCFQSLKMARDTW
              + +  +   + C N GNL+   T NPY+EFNIF DPFAAY V HSG+PVTLVPLDAT+TIP+N+K F  FE   Q TYEA+Y F SLK+ARDTW
Subjt:  ----ICSESADKSHGKTCNNIGNLWPPNT-NPYAEFNIFGDPFAAYTVLHSGIPVTLVPLDATSTIPVNKKVFLAFEQR-QNTYEAKYCFQSLKMARDTW

Query:  TGNGFFEIYSMWDSFMVGVALSQM---YNLDRGGGNNAYSKMEYLNITIVTSNKPYGISDGSNPLVDGHLLPKFGVQKNGVHSGHVQTGMLDPFCVVPTE
          + F++ Y MWDSF  GVA+S M    N +   G N +++MEY+NIT+VTSNKPYG SDGSNP  D    PKF +   GVHSGHVQTG+ DP C+  + 
Subjt:  TGNGFFEIYSMWDSFMVGVALSQM---YNLDRGGGNNAYSKMEYLNITIVTSNKPYGISDGSNPLVDGHLLPKFGVQKNGVHSGHVQTGMLDPFCVVPTE

Query:  I--GKCQDGYTKEADGPESVQVLVAVEAKSTIDANSSIDKAFYISFLDVLNSPRQTGRFDFRAQFPNYREVLYRPKFGKKLLGKPVIFDMDMSTGDFLTL
        I  GKC+DGYT+E  G +SV+VLVA  AK  I+  S +D+ FY+ FL+VLN P +TGRF+F +QFP Y+E L+RP   K   GKPV+FDMDMS GDFL+L
Subjt:  I--GKCQDGYTKEADGPESVQVLVAVEAKSTIDANSSIDKAFYISFLDVLNSPRQTGRFDFRAQFPNYREVLYRPKFGKKLLGKPVIFDMDMSTGDFLTL

Query:  LYLLKTPIEIINLKGIIISPNGWATAATIDVVYDVLHMMGRDDISVGLGDVFAIGEAHPSFPPIGDCK-------------------------------Y
         YLLK P++ I+LK II+SP GWA AATIDVVYD+LHMMGRDDI VGLGD+ A+ ++ P FPP+G CK                               Y
Subjt:  LYLLKTPIEIINLKGIIISPNGWATAATIDVVYDVLHMMGRDDISVGLGDVFAIGEAHPSFPPIGDCK-------------------------------Y

Query:  SAESSVKYGAFRDTDHPELGQMSALDVWKDVVHSLDLEAKITVLTSGPLTNLAQII-HHKAVSSRIQEVYITGGHIN-YSVDKGNVFTIPSNEYSEFNFF
        +AE+SV +GA RDTD PEL Q  A++VW+++  S +  +KITVLT+GPLTNLA+II   K  SS I+EVYI GGHIN    DKGN+FTIPSN Y+EFN F
Subjt:  SAESSVKYGAFRDTDHPELGQMSALDVWKDVVHSLDLEAKITVLTSGPLTNLAQII-HHKAVSSRIQEVYITGGHIN-YSVDKGNVFTIPSNEYSEFNFF

Query:  LDPTAADLVLGSGLNITLIPLNVQRRVSSFHKILKKLELGNRTPEAWFSRRLLSRLYHLKQKHHQYHHVDMFLGEVLGVVSLAGKHLNLKQTFSFKPLKV
        LDP AA  VL S LNITL+PL  Q ++SSF  +L +L    +TPEA F +RLL RL  L QKH +Y H+DMFLGEVLG V L G   +LK     + +KV
Subjt:  LDPTAADLVLGSGLNITLIPLNVQRRVSSFHKILKKLELGNRTPEAWFSRRLLSRLYHLKQKHHQYHHVDMFLGEVLGVVSLAGKHLNLKQTFSFKPLKV

Query:  VTNGGESKVGQTIIDDKKGKWVRVLESVEPLAFYEDLANALADEKQTAVIGSFEGQKRQWT
        +  G ES+ G+ +ID  +GK +++LE V+ ++  E  A+ L D+KQ+AVIGSFE QK+ W+
Subjt:  VTNGGESKVGQTIIDDKKGKWVRVLESVEPLAFYEDLANALADEKQTAVIGSFEGQKRQWT

AT5G18890.1 Inosine-uridine preferring nucleoside hydrolase family protein2.0e-15052.31Show/hide
Query:  NLDRGGGNNAYSKMEYLNITIVTSNKPYGISDGSNPLVDGHLLPKFGVQKNGVHSGHVQTGMLDPFCVVPTEIGKCQDGYTKEADGPESVQVLVAVEAKS
        N +   G N +++MEY+NIT+VTSN+PYG+ D SNP       PKF +   GVHSGHVQ G+ DP C+  +  G C+DGYTKE  GP+SV+VLVA  AK 
Subjt:  NLDRGGGNNAYSKMEYLNITIVTSNKPYGISDGSNPLVDGHLLPKFGVQKNGVHSGHVQTGMLDPFCVVPTEIGKCQDGYTKEADGPESVQVLVAVEAKS

Query:  TIDANSSIDKAFYISFLDVLNSPRQTGRFDFRAQFPNYREVLYRPKF-GKKLLGKPVIFDMDMSTGDFLTLLYLLKTPIEIINLKGIIISPNGWATAATI
        + + NS +D+ FY  FL+VLN P +TGRF F  QF  YRE L+  +    +L GKPV+FDMDMS GDFL+L YLLK P+EII+LK +I+SP GWA  ATI
Subjt:  TIDANSSIDKAFYISFLDVLNSPRQTGRFDFRAQFPNYREVLYRPKF-GKKLLGKPVIFDMDMSTGDFLTLLYLLKTPIEIINLKGIIISPNGWATAATI

Query:  DVVYDVLHMMGRDDISVGLGDVFAIGEAHPSFPPIGDCKYSA-----------------------------ESSVKYGAFRDTDHPELGQMSALDVWKDV
        DVVYD+LHMMGRDDI VGLGD+FAI ++ P FP  GDCKY+                              E+SV +GA  DTD PEL Q  AL+VW+++
Subjt:  DVVYDVLHMMGRDDISVGLGDVFAIGEAHPSFPPIGDCKYSA-----------------------------ESSVKYGAFRDTDHPELGQMSALDVWKDV

Query:  VHSLDLEAKITVLTSGPLTNLAQII-HHKAVSSRIQEVYITGGHINY-SVDKGNVFTIPSNEYSEFNFFLDPTAADLVLGSGLNITLIPLNVQRRVSSFH
          S+D  +KITVLT+GPLT+LA+II   K  SS I+EVYI GGHI+    DKGN+FT+PSN Y+EFN FLDP AA  VL SGLNITLIPL  QR   SF 
Subjt:  VHSLDLEAKITVLTSGPLTNLAQII-HHKAVSSRIQEVYITGGHINY-SVDKGNVFTIPSNEYSEFNFFLDPTAADLVLGSGLNITLIPLNVQRRVSSFH

Query:  KILKKLELGNRTPEAWFSRRLLSRLYHLKQKHHQYHHVDMFLGEVLGVVSLAGKHLNLKQTFSFKPLKVVTNGGESKVGQTIIDDKKGKWVRVLESVEPL
         +L +L    +TPEA F +RLL+RL  L QK  +Y H+DMFLGE+LG + L G H  LK     + +KV+  G ESK G  +ID  +GK +++LE V+  
Subjt:  KILKKLELGNRTPEAWFSRRLLSRLYHLKQKHHQYHHVDMFLGEVLGVVSLAGKHLNLKQTFSFKPLKVVTNGGESKVGQTIIDDKKGKWVRVLESVEPL

Query:  AFYEDLANALADEKQTAVIGSFEGQKRQW-TPREELPPRSR
          YE  A+ L D+KQ+AVIGSFE Q+ +W TP    P  +R
Subjt:  AFYEDLANALADEKQTAVIGSFEGQKRQW-TPREELPPRSR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTTGGTTAGCAAATTCTTCTTCTCATTCTCCTATGAGGATTCTTGTCGATACAGATGTTGATACCGACGATATCTTCGCTCTATTCTACCTTCTCAAGCAACCTTC
TTCTCTCTTTCATCTCCAGGCAATAACAATTAACGGGAATGGATGGAGCGACGCGGGACATGCGGTGAACCATTTGTACGACATGTTGTTCATGATGGGCCGAGACGACA
TCCCAGTTGGTGTCGGCGGAGACGGCGGCATTTCCCATAACGCTACTATTTTCCCTCACGTCGGTGGCTACCTACCACTCATTGATCAGGGTGTGTCGACGGCAGGGCAA
TGCAGATACAGGCAAGCCATTCCGGTGGGAGAGAAAGGACGTCTCTATGCCAATACCAATTTTGGTTTACGCAAACCCTTTCTTCCTCAGGGTAAGAGGAGATATATTCC
TATGAAGCAGCCAACTGCACAGCAGGTGATGAAAGATGCAATATCTGCAGGGCCTACTACAGTTTTTCTTATGGGAGCTCATACAAACTTGGCTATATTTCTTCTGTCAA
ATCCTCATTTGAAGAAAAATATAAAGCATATATATGCCATGGGTGGTGCTATTAGAGAAATTTGCTCAGAAAGTGCTGACAAATCTCATGGGAAGACATGCAACAATATC
GGGAACTTGTGGCCTCCCAATACAAATCCATATGCTGAGTTCAATATATTTGGAGACCCTTTTGCTGCCTATACAGTACTACATTCTGGAATTCCTGTTACACTTGTTCC
TCTGGATGCAACAAGTACCATTCCTGTGAATAAGAAAGTATTTTTGGCATTTGAACAGAGACAAAACACTTATGAGGCTAAATATTGTTTCCAGTCTTTGAAAATGGCTC
GTGATACTTGGACTGGCAATGGATTTTTCGAAATTTATTCGATGTGGGATTCTTTTATGGTTGGTGTAGCTTTGTCCCAAATGTATAATTTGGACAGAGGGGGTGGAAAT
AATGCATATTCGAAGATGGAATATCTAAATATCACTATTGTTACATCGAACAAACCTTATGGGATCTCTGATGGATCAAATCCACTTGTTGATGGACATTTGCTCCCAAA
ATTTGGTGTCCAAAAGAATGGGGTGCACAGTGGTCATGTTCAGACAGGAATGCTAGATCCATTCTGCGTTGTTCCAACTGAAATAGGAAAATGTCAGGATGGTTATACAA
AGGAGGCAGATGGCCCTGAATCAGTTCAAGTTCTAGTTGCTGTGGAAGCTAAATCAACTATTGACGCCAACAGCTCAATTGATAAAGCATTTTATATTAGCTTCCTGGAT
GTTCTTAATAGTCCACGACAGACGGGGAGGTTTGATTTTAGAGCTCAGTTTCCTAACTACAGGGAAGTATTATATAGACCAAAATTTGGGAAGAAATTACTAGGAAAACC
GGTTATTTTTGACATGGATATGAGCACAGGAGATTTTCTCACTCTTCTCTATCTCTTGAAGACACCCATTGAAATCATCAATCTAAAGGGAATAATAATTAGCCCAAATG
GATGGGCAACGGCTGCAACAATAGATGTTGTATATGATGTATTACATATGATGGGCCGTGACGACATTTCGGTTGGTCTTGGAGATGTATTCGCCATTGGAGAAGCACAT
CCATCATTTCCTCCTATTGGAGACTGCAAATATTCTGCTGAAAGTTCAGTGAAGTATGGAGCATTTAGAGATACTGATCACCCTGAACTCGGACAAATGTCTGCACTTGA
TGTTTGGAAAGATGTTGTGCACTCTCTAGATCTGGAGGCAAAAATTACCGTTTTAACCAGCGGACCTTTGACGAATCTCGCTCAAATCATACATCATAAAGCCGTAAGCT
CAAGGATTCAGGAAGTATATATTACTGGAGGACATATAAATTATAGTGTGGACAAGGGAAATGTGTTCACTATACCTTCAAATGAATATTCAGAATTCAACTTCTTTCTG
GACCCTACAGCTGCCGATTTGGTTTTGGGCTCAGGATTGAACATCACACTTATCCCGTTGAACGTGCAACGAAGAGTAAGTTCTTTTCACAAGATCCTCAAAAAGTTGGA
GCTCGGAAACAGAACTCCCGAAGCATGGTTCTCTCGACGTCTACTCTCCAGGCTATATCACCTTAAGCAGAAGCATCATCAATACCATCATGTGGACATGTTCTTAGGAG
AAGTCCTTGGTGTAGTGAGCTTAGCTGGAAAACATTTGAATCTGAAGCAAACATTCAGCTTCAAGCCTCTCAAGGTAGTCACAAATGGAGGTGAATCAAAAGTAGGGCAA
ACAATTATAGATGATAAGAAAGGGAAGTGGGTGAGAGTTTTGGAGAGTGTTGAACCACTTGCATTTTATGAAGATCTTGCAAACGCATTGGCTGATGAAAAGCAAACTGC
TGTTATTGGAAGCTTTGAGGGGCAAAAGAGGCAGTGGACCCCACGCGAAGAACTCCCTCCCCGCAGTCGTCAACTCTCAACCTCGCTCACGTCAGTCGCCGGTCGTCCGC
CGTCCCCGCCGCTGGGCTCTGTCGCCGTCCGTCGTGCTCAGACCAACGTTGCCGTTGCTGGTGCTGAGTTTCTCGTGGATCTCTTTCTCTCTCTTCCGCGCGTTCAGCTG
GTTTTTCTTTTATGTATGCCTCTTAATATCAAATGGGTGATTGTGACACATTGCAACATGGTGTGGTGCAGCAATTGTGTAAAAAATGTTGCTGGGTCTCGAGATGAAGC
CGGTTTCCTATACTGTGATATGTGTGGGAAGGTGTTGGATTTTTACAATTTCTCTCAAGATCCTACTTTTACAAAGGATTCTGGAGGGCAGAGTCAATTGTCAGGAAATT
TTGTTAGATCTATCCAGAGTAATTTTTCTGCTTCACGAGAAAGGACATTAAATAAAGCTTTTGAAGACATGAGATACATGAGGAATGGTTTAAACATGGGTGAGAGTGAT
GAAATAATTCGTGTAGCTGGGGCATTTTATAGAATTGCTTTAGAGAGAAACTTTACACGGGGACGCAATACAGAGTTTGTTCAAGCTGCTTGCCTTTACATTGCTTGTCG
CTATGTGTTAGGTGCTGTATTTTTGCAACTTTGTAAAGTGTTAAGACTTGAAGAGCACCCTATTGTTCAAAAGCCCGTTGATCCTTCCCTTTTTATTGATAAATTTACTC
AATGTTTGCTTGGAGGAACAAAGGAGGATGGAATGAAGAAGGAGGTTTCCAGAACTGCATTGAAGATCATAACCAGCATGAAGCGTGATTGGATGCAGACAGGGAGAAAG
CCTAGTGGCCTATGTGGTGCGGCTTTATATATATCTGCACTTTCTAATGGTGTCAAATGTTCCAAGTCAGACATTATAAAGATAGTGCATATTTGTGATGCAACATTGAC
AAAGCGGTTAATTGAGTTCGAGAATACAGAATCTGGCAGCTTAACGATGGAGGAATTTATTGTCATGGCAGACAAGGTCAAGGGAAGCAACTCTTATACTAATAATGGCT
TGAATGCAACGAGTGATGAAGTACTTTGCGTGCATAAGAATGAGTGCAAAAAGCCATATGCTCTCGGTCTTTGTAAAAGCTGTTATGATGATTTTGTTGAACTGTCTGGA
GGACTAGATGGGGGATCAAATCCTCCAGCATTTCAGTCTGCTGAGAAAGAAAGAATGGAAAAAGCAATGGTTGAGGAAGGTTCTAATGACTCTAGTGCTATTGGCAAATT
TTCACAAGGACTAAACCCATGCAACAATACTGAAAAAGAATCGGACAATGTACACGCAGATGCATCAAAAACTGTTGGTTCTAAAGAAGCTGAAGCGAAAGGAGCTGCAG
ATGAGCAAAGAGGACTTGATGAAGGTGCTAATAAAATTGGAGCTGATGGCCTGGGAGCCACTGCTTCAGATGAATCAGATAATTGGTCTGATATAGATGATATTGAGGTT
GACGGCTATCTTCACAATGAGGAGGAGAAGCATTACAAAAAGATAATTTGGGAAGAGATGAACCGGGAGTATCTTGAGGAACAAGCAGCTAAGGACGCAGCAACAGCTGC
TGCAAAGAATGCTTATGAGGCCAATTTCCAAAATTGTTCAGAAGATTTGTTGGCAGCAAAGGATCTTGCAGATGCTGCTGCGGCTGCTGTTGCAAAATCTAGAAAGGGTG
TCGATGGCTTCTGGGAGATCATGATTGAGATCAGAGATGGGGAAGACAATGGCTCGGGTGGGTGA
mRNA sequenceShow/hide mRNA sequence
ATGGTTTGGTTAGCAAATTCTTCTTCTCATTCTCCTATGAGGATTCTTGTCGATACAGATGTTGATACCGACGATATCTTCGCTCTATTCTACCTTCTCAAGCAACCTTC
TTCTCTCTTTCATCTCCAGGCAATAACAATTAACGGGAATGGATGGAGCGACGCGGGACATGCGGTGAACCATTTGTACGACATGTTGTTCATGATGGGCCGAGACGACA
TCCCAGTTGGTGTCGGCGGAGACGGCGGCATTTCCCATAACGCTACTATTTTCCCTCACGTCGGTGGCTACCTACCACTCATTGATCAGGGTGTGTCGACGGCAGGGCAA
TGCAGATACAGGCAAGCCATTCCGGTGGGAGAGAAAGGACGTCTCTATGCCAATACCAATTTTGGTTTACGCAAACCCTTTCTTCCTCAGGGTAAGAGGAGATATATTCC
TATGAAGCAGCCAACTGCACAGCAGGTGATGAAAGATGCAATATCTGCAGGGCCTACTACAGTTTTTCTTATGGGAGCTCATACAAACTTGGCTATATTTCTTCTGTCAA
ATCCTCATTTGAAGAAAAATATAAAGCATATATATGCCATGGGTGGTGCTATTAGAGAAATTTGCTCAGAAAGTGCTGACAAATCTCATGGGAAGACATGCAACAATATC
GGGAACTTGTGGCCTCCCAATACAAATCCATATGCTGAGTTCAATATATTTGGAGACCCTTTTGCTGCCTATACAGTACTACATTCTGGAATTCCTGTTACACTTGTTCC
TCTGGATGCAACAAGTACCATTCCTGTGAATAAGAAAGTATTTTTGGCATTTGAACAGAGACAAAACACTTATGAGGCTAAATATTGTTTCCAGTCTTTGAAAATGGCTC
GTGATACTTGGACTGGCAATGGATTTTTCGAAATTTATTCGATGTGGGATTCTTTTATGGTTGGTGTAGCTTTGTCCCAAATGTATAATTTGGACAGAGGGGGTGGAAAT
AATGCATATTCGAAGATGGAATATCTAAATATCACTATTGTTACATCGAACAAACCTTATGGGATCTCTGATGGATCAAATCCACTTGTTGATGGACATTTGCTCCCAAA
ATTTGGTGTCCAAAAGAATGGGGTGCACAGTGGTCATGTTCAGACAGGAATGCTAGATCCATTCTGCGTTGTTCCAACTGAAATAGGAAAATGTCAGGATGGTTATACAA
AGGAGGCAGATGGCCCTGAATCAGTTCAAGTTCTAGTTGCTGTGGAAGCTAAATCAACTATTGACGCCAACAGCTCAATTGATAAAGCATTTTATATTAGCTTCCTGGAT
GTTCTTAATAGTCCACGACAGACGGGGAGGTTTGATTTTAGAGCTCAGTTTCCTAACTACAGGGAAGTATTATATAGACCAAAATTTGGGAAGAAATTACTAGGAAAACC
GGTTATTTTTGACATGGATATGAGCACAGGAGATTTTCTCACTCTTCTCTATCTCTTGAAGACACCCATTGAAATCATCAATCTAAAGGGAATAATAATTAGCCCAAATG
GATGGGCAACGGCTGCAACAATAGATGTTGTATATGATGTATTACATATGATGGGCCGTGACGACATTTCGGTTGGTCTTGGAGATGTATTCGCCATTGGAGAAGCACAT
CCATCATTTCCTCCTATTGGAGACTGCAAATATTCTGCTGAAAGTTCAGTGAAGTATGGAGCATTTAGAGATACTGATCACCCTGAACTCGGACAAATGTCTGCACTTGA
TGTTTGGAAAGATGTTGTGCACTCTCTAGATCTGGAGGCAAAAATTACCGTTTTAACCAGCGGACCTTTGACGAATCTCGCTCAAATCATACATCATAAAGCCGTAAGCT
CAAGGATTCAGGAAGTATATATTACTGGAGGACATATAAATTATAGTGTGGACAAGGGAAATGTGTTCACTATACCTTCAAATGAATATTCAGAATTCAACTTCTTTCTG
GACCCTACAGCTGCCGATTTGGTTTTGGGCTCAGGATTGAACATCACACTTATCCCGTTGAACGTGCAACGAAGAGTAAGTTCTTTTCACAAGATCCTCAAAAAGTTGGA
GCTCGGAAACAGAACTCCCGAAGCATGGTTCTCTCGACGTCTACTCTCCAGGCTATATCACCTTAAGCAGAAGCATCATCAATACCATCATGTGGACATGTTCTTAGGAG
AAGTCCTTGGTGTAGTGAGCTTAGCTGGAAAACATTTGAATCTGAAGCAAACATTCAGCTTCAAGCCTCTCAAGGTAGTCACAAATGGAGGTGAATCAAAAGTAGGGCAA
ACAATTATAGATGATAAGAAAGGGAAGTGGGTGAGAGTTTTGGAGAGTGTTGAACCACTTGCATTTTATGAAGATCTTGCAAACGCATTGGCTGATGAAAAGCAAACTGC
TGTTATTGGAAGCTTTGAGGGGCAAAAGAGGCAGTGGACCCCACGCGAAGAACTCCCTCCCCGCAGTCGTCAACTCTCAACCTCGCTCACGTCAGTCGCCGGTCGTCCGC
CGTCCCCGCCGCTGGGCTCTGTCGCCGTCCGTCGTGCTCAGACCAACGTTGCCGTTGCTGGTGCTGAGTTTCTCGTGGATCTCTTTCTCTCTCTTCCGCGCGTTCAGCTG
GTTTTTCTTTTATGTATGCCTCTTAATATCAAATGGGTGATTGTGACACATTGCAACATGGTGTGGTGCAGCAATTGTGTAAAAAATGTTGCTGGGTCTCGAGATGAAGC
CGGTTTCCTATACTGTGATATGTGTGGGAAGGTGTTGGATTTTTACAATTTCTCTCAAGATCCTACTTTTACAAAGGATTCTGGAGGGCAGAGTCAATTGTCAGGAAATT
TTGTTAGATCTATCCAGAGTAATTTTTCTGCTTCACGAGAAAGGACATTAAATAAAGCTTTTGAAGACATGAGATACATGAGGAATGGTTTAAACATGGGTGAGAGTGAT
GAAATAATTCGTGTAGCTGGGGCATTTTATAGAATTGCTTTAGAGAGAAACTTTACACGGGGACGCAATACAGAGTTTGTTCAAGCTGCTTGCCTTTACATTGCTTGTCG
CTATGTGTTAGGTGCTGTATTTTTGCAACTTTGTAAAGTGTTAAGACTTGAAGAGCACCCTATTGTTCAAAAGCCCGTTGATCCTTCCCTTTTTATTGATAAATTTACTC
AATGTTTGCTTGGAGGAACAAAGGAGGATGGAATGAAGAAGGAGGTTTCCAGAACTGCATTGAAGATCATAACCAGCATGAAGCGTGATTGGATGCAGACAGGGAGAAAG
CCTAGTGGCCTATGTGGTGCGGCTTTATATATATCTGCACTTTCTAATGGTGTCAAATGTTCCAAGTCAGACATTATAAAGATAGTGCATATTTGTGATGCAACATTGAC
AAAGCGGTTAATTGAGTTCGAGAATACAGAATCTGGCAGCTTAACGATGGAGGAATTTATTGTCATGGCAGACAAGGTCAAGGGAAGCAACTCTTATACTAATAATGGCT
TGAATGCAACGAGTGATGAAGTACTTTGCGTGCATAAGAATGAGTGCAAAAAGCCATATGCTCTCGGTCTTTGTAAAAGCTGTTATGATGATTTTGTTGAACTGTCTGGA
GGACTAGATGGGGGATCAAATCCTCCAGCATTTCAGTCTGCTGAGAAAGAAAGAATGGAAAAAGCAATGGTTGAGGAAGGTTCTAATGACTCTAGTGCTATTGGCAAATT
TTCACAAGGACTAAACCCATGCAACAATACTGAAAAAGAATCGGACAATGTACACGCAGATGCATCAAAAACTGTTGGTTCTAAAGAAGCTGAAGCGAAAGGAGCTGCAG
ATGAGCAAAGAGGACTTGATGAAGGTGCTAATAAAATTGGAGCTGATGGCCTGGGAGCCACTGCTTCAGATGAATCAGATAATTGGTCTGATATAGATGATATTGAGGTT
GACGGCTATCTTCACAATGAGGAGGAGAAGCATTACAAAAAGATAATTTGGGAAGAGATGAACCGGGAGTATCTTGAGGAACAAGCAGCTAAGGACGCAGCAACAGCTGC
TGCAAAGAATGCTTATGAGGCCAATTTCCAAAATTGTTCAGAAGATTTGTTGGCAGCAAAGGATCTTGCAGATGCTGCTGCGGCTGCTGTTGCAAAATCTAGAAAGGGTG
TCGATGGCTTCTGGGAGATCATGATTGAGATCAGAGATGGGGAAGACAATGGCTCGGGTGGGTGA
Protein sequenceShow/hide protein sequence
MVWLANSSSHSPMRILVDTDVDTDDIFALFYLLKQPSSLFHLQAITINGNGWSDAGHAVNHLYDMLFMMGRDDIPVGVGGDGGISHNATIFPHVGGYLPLIDQGVSTAGQ
CRYRQAIPVGEKGRLYANTNFGLRKPFLPQGKRRYIPMKQPTAQQVMKDAISAGPTTVFLMGAHTNLAIFLLSNPHLKKNIKHIYAMGGAIREICSESADKSHGKTCNNI
GNLWPPNTNPYAEFNIFGDPFAAYTVLHSGIPVTLVPLDATSTIPVNKKVFLAFEQRQNTYEAKYCFQSLKMARDTWTGNGFFEIYSMWDSFMVGVALSQMYNLDRGGGN
NAYSKMEYLNITIVTSNKPYGISDGSNPLVDGHLLPKFGVQKNGVHSGHVQTGMLDPFCVVPTEIGKCQDGYTKEADGPESVQVLVAVEAKSTIDANSSIDKAFYISFLD
VLNSPRQTGRFDFRAQFPNYREVLYRPKFGKKLLGKPVIFDMDMSTGDFLTLLYLLKTPIEIINLKGIIISPNGWATAATIDVVYDVLHMMGRDDISVGLGDVFAIGEAH
PSFPPIGDCKYSAESSVKYGAFRDTDHPELGQMSALDVWKDVVHSLDLEAKITVLTSGPLTNLAQIIHHKAVSSRIQEVYITGGHINYSVDKGNVFTIPSNEYSEFNFFL
DPTAADLVLGSGLNITLIPLNVQRRVSSFHKILKKLELGNRTPEAWFSRRLLSRLYHLKQKHHQYHHVDMFLGEVLGVVSLAGKHLNLKQTFSFKPLKVVTNGGESKVGQ
TIIDDKKGKWVRVLESVEPLAFYEDLANALADEKQTAVIGSFEGQKRQWTPREELPPRSRQLSTSLTSVAGRPPSPPLGSVAVRRAQTNVAVAGAEFLVDLFLSLPRVQL
VFLLCMPLNIKWVIVTHCNMVWCSNCVKNVAGSRDEAGFLYCDMCGKVLDFYNFSQDPTFTKDSGGQSQLSGNFVRSIQSNFSASRERTLNKAFEDMRYMRNGLNMGESD
EIIRVAGAFYRIALERNFTRGRNTEFVQAACLYIACRYVLGAVFLQLCKVLRLEEHPIVQKPVDPSLFIDKFTQCLLGGTKEDGMKKEVSRTALKIITSMKRDWMQTGRK
PSGLCGAALYISALSNGVKCSKSDIIKIVHICDATLTKRLIEFENTESGSLTMEEFIVMADKVKGSNSYTNNGLNATSDEVLCVHKNECKKPYALGLCKSCYDDFVELSG
GLDGGSNPPAFQSAEKERMEKAMVEEGSNDSSAIGKFSQGLNPCNNTEKESDNVHADASKTVGSKEAEAKGAADEQRGLDEGANKIGADGLGATASDESDNWSDIDDIEV
DGYLHNEEEKHYKKIIWEEMNREYLEEQAAKDAATAAAKNAYEANFQNCSEDLLAAKDLADAAAAAVAKSRKGVDGFWEIMIEIRDGEDNGSGG