; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi06G001660 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi06G001660
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionPyrimidine-specific ribonucleoside hydrolase RihA
Genome locationchr06:1779706..1804599
RNA-Seq ExpressionLsi06G001660
SyntenyLsi06G001660
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR001910 - Inosine/uridine-preferring nucleoside hydrolase domain
IPR011665 - Brf1, TBP-binding domain
IPR036452 - Ribonucleoside hydrolase-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008450714.1 PREDICTED: uncharacterized protein LOC103492210 isoform X4 [Cucumis melo]0.0e+0082.45Show/hide
Query:  RYRQAIPVGEKGRLYANTNFGLRKPFLPQGKRRYIPMKQPTAQQVMKDAISAGPTTVFLMGAHTNLAIFLLSNPHLKKNIKHIYAMGGAIREICSESADK
        RYRQAIPVG  GRL ANTNFGLRK FLPQGKRRYIPMKQPTAQQVMKDAISAGPT VFLMGAHTNLAIFLLSNPHLKKNIKH+YAMGGAIREICSES   
Subjt:  RYRQAIPVGEKGRLYANTNFGLRKPFLPQGKRRYIPMKQPTAQQVMKDAISAGPTTVFLMGAHTNLAIFLLSNPHLKKNIKHIYAMGGAIREICSESADK

Query:  SHGKTCNNIGNLWPPNTNPYAEFNIFGDPFAAYTVLHSGIPVTLVPLDATSTIPVNKKVFLAFEQRQNTYEAKYCFQSLKMARDTWTGNGFFEIYSMWDS
        SHGKTC+NIGNLWPPNTNPYAEFNIFGDPFAAYTVLHSGIPVTLVPLDATSTIPVNK+VFLAFEQRQNTYEAKYCFQSLKMARDTW  +GFFE+YSMWDS
Subjt:  SHGKTCNNIGNLWPPNTNPYAEFNIFGDPFAAYTVLHSGIPVTLVPLDATSTIPVNKKVFLAFEQRQNTYEAKYCFQSLKMARDTWTGNGFFEIYSMWDS

Query:  FMVGVALSQMYNLDRGG--GNNAYSKMEYLNITIVTSNKPYGISDGSNPLVDGHLLPKFGVQKNGVHSGHVQTGMLDPFCVVPTEIGKCQDGYTKEADGP
        FMVGVALSQMYNL RGG  G NA+SKMEYLN+TIVTSN+PYGISDGSNP V+G LL  FG QKNGVHSGHVQTGMLDPFC+  T  GKCQDGYTKEADG 
Subjt:  FMVGVALSQMYNLDRGG--GNNAYSKMEYLNITIVTSNKPYGISDGSNPLVDGHLLPKFGVQKNGVHSGHVQTGMLDPFCVVPTEIGKCQDGYTKEADGP

Query:  ESVQVLVAVEAKSTIDANSSIDKAFYISFLDVLNSPRQTGRFDFRAQFPNYREVLYRPKFGKKLLGKP-----------------------------GII
        ESVQVLVAVEAKSTID NSSIDKAFYISFLDVLNSPRQTGRFDFRAQFP YREVLYRP FGK+LLGKP                             GII
Subjt:  ESVQVLVAVEAKSTIDANSSIDKAFYISFLDVLNSPRQTGRFDFRAQFPNYREVLYRPKFGKKLLGKP-----------------------------GII

Query:  ISPNGWATAATIDVVYDVLHMMGRDDISVGLGDVFAIGEAHPSFPPIGDCK-------------------------------YSAESSVKYGAFRDTDHP
        ISPNGWATAATIDVVYDVLHMMGRDDISVGLGD+FAIGE HP FPPIGDCK                               Y+AE+SVK+GAFRDTDHP
Subjt:  ISPNGWATAATIDVVYDVLHMMGRDDISVGLGDVFAIGEAHPSFPPIGDCK-------------------------------YSAESSVKYGAFRDTDHP

Query:  ELGQMSALDVWKDVVQSLDLEAKITVLTSGPLTNLAQIIHHKAVSSRIQEVYITGGHINYSVDKGNVFTIPSNEYSEFNFFLDPTAADLVLGSGLNITLI
        EL QMSALDVWKDVV++LDL+AKITVLTSGPLTNLA+IIHHKA+S+RI+EVYITGGHI+Y VDKGN+FTIPSNEYSEFNFFLDP AADLV GSGLNITLI
Subjt:  ELGQMSALDVWKDVVQSLDLEAKITVLTSGPLTNLAQIIHHKAVSSRIQEVYITGGHINYSVDKGNVFTIPSNEYSEFNFFLDPTAADLVLGSGLNITLI

Query:  PLNVQRRVSSFHKILKKLKLRNRTPEAWFSRRLLSRLYHLKQKHHQYHHVDMFLGEVLGVVSLAGKHLNLKQTFSFKPLKVVTNGGESKVGQTIIDDKKG
        PLNVQRRVSSF+KILKKLK RNRTPEAWFSRRLL RLY LKQKHHQYHHVDMFLGEV+G VSLAGKHLNLKQTFSFKPLKV++NGGESKVGQTIID KKG
Subjt:  PLNVQRRVSSFHKILKKLKLRNRTPEAWFSRRLLSRLYHLKQKHHQYHHVDMFLGEVLGVVSLAGKHLNLKQTFSFKPLKVVTNGGESKVGQTIIDDKKG

Query:  KWVRVLESVEPLAFYEDLANALADEKQTAVIGSFE
        KWVRVLES+EPLA YEDLANALADEKQTAVI SFE
Subjt:  KWVRVLESVEPLAFYEDLANALADEKQTAVIGSFE

XP_011659920.1 uncharacterized protein LOC101212769 isoform X1 [Cucumis sativus]0.0e+0083.9Show/hide
Query:  RYRQAIPVGEKGRLYANTNFGLRKPFLPQGKRRYIPMKQPTAQQVMKDAISAGPTTVFLMGAHTNLAIFLLSNPHLKKNIKHIYAMGGAIREICSESADK
        RYRQAIPVG  GRL ANTNFGLRK FLPQGKRRYIPMKQPTAQQVMKDAISAGPT VFLMGAHTNLAIFLLSNPHLKKNIKH+YAMGGAIREICSESADK
Subjt:  RYRQAIPVGEKGRLYANTNFGLRKPFLPQGKRRYIPMKQPTAQQVMKDAISAGPTTVFLMGAHTNLAIFLLSNPHLKKNIKHIYAMGGAIREICSESADK

Query:  SHGKTCNNIGNLWPPNTNPYAEFNIFGDPFAAYTVLHSGIPVTLVPLDATSTIPVNKKVFLAFEQRQNTYEAKYCFQSLKMARDTWTGNGFFEIYSMWDS
        SHGKTCNNIGNLWPPNTNPYAEFNIFGDPFAAYTVLHSGIPVTLVPLDATSTIPVNK+VFLAFEQRQNTYEAKYCFQSLKMA DTW  +GFFEIYSMWDS
Subjt:  SHGKTCNNIGNLWPPNTNPYAEFNIFGDPFAAYTVLHSGIPVTLVPLDATSTIPVNKKVFLAFEQRQNTYEAKYCFQSLKMARDTWTGNGFFEIYSMWDS

Query:  FMVGVALSQMYNLDRGGGNNAYSKMEYLNITIVTSNKPYGISDGSNPLVDGHLLPKFGVQKNGVHSGHVQTGMLDPFCVVPTEIGKCQDGYTKEADGPES
        FMVGVALSQMYNL RGGGNNA+SKMEYLNITIVTSNKPYGISDGSNPLVDGHLLP  G Q NGVHSGHVQTGMLDPFC+  T  GKCQDGYTKE+DG ES
Subjt:  FMVGVALSQMYNLDRGGGNNAYSKMEYLNITIVTSNKPYGISDGSNPLVDGHLLPKFGVQKNGVHSGHVQTGMLDPFCVVPTEIGKCQDGYTKEADGPES

Query:  VQVLVAVEAKSTIDANSSIDKAFYISFLDVLNSPRQTGRFDFRAQFPNYREVLYRPKFGKKLLGKP-----------------------------GIIIS
        VQVLVAVEAKSTID NSSIDKAFYISFLDVLNSPRQTGRFDFRAQFPNYREVLYRPKFGK+LLGKP                             GIIIS
Subjt:  VQVLVAVEAKSTIDANSSIDKAFYISFLDVLNSPRQTGRFDFRAQFPNYREVLYRPKFGKKLLGKP-----------------------------GIIIS

Query:  PNGWATAATIDVVYDVLHMMGRDDISVGLGDVFAIGEAHPSFPPIGDCK-------------------------------YSAESSVKYGAFRDTDHPEL
        PNGWATAATIDVVYDVLHMMGRDDISVGLGDVFAIGEAHP +PPIGDCK                               Y+AE+SVK+GAFRDTDHPEL
Subjt:  PNGWATAATIDVVYDVLHMMGRDDISVGLGDVFAIGEAHPSFPPIGDCK-------------------------------YSAESSVKYGAFRDTDHPEL

Query:  GQMSALDVWKDVVQSLDLEAKITVLTSGPLTNLAQIIHHKAVSSRIQEVYITGGHINYSVDKGNVFTIPSNEYSEFNFFLDPTAADLVLGSGLNITLIPL
         QMS LDVWKDVVQSL+L+AKITVLT+GPLTNLA+II HKA+S+RI+EVYITGGH+NY VDKGN+FTIPSNEYSEFNFFLDP AADLV  SGLNITLIPL
Subjt:  GQMSALDVWKDVVQSLDLEAKITVLTSGPLTNLAQIIHHKAVSSRIQEVYITGGHINYSVDKGNVFTIPSNEYSEFNFFLDPTAADLVLGSGLNITLIPL

Query:  NVQRRVSSFHKILKKLKLRNRTPEAWFSRRLLSRLYHLKQKHHQYHHVDMFLGEVLGVVSLAGKHLNLKQTFSFKPLKVVTNGGESKVGQTIIDDKKGKW
        NVQRRVSSFHKIL+KLKLRNRTPEAW SRRLL RLY LKQKHHQYHHVDMFLGEVLG VSLAGKHLNLKQTFSFKPLKV++NGGESKVGQTIID+KKGKW
Subjt:  NVQRRVSSFHKILKKLKLRNRTPEAWFSRRLLSRLYHLKQKHHQYHHVDMFLGEVLGVVSLAGKHLNLKQTFSFKPLKVVTNGGESKVGQTIIDDKKGKW

Query:  VRVLESVEPLAFYEDLANALADEKQTAVIGSFE
        VRVLES+EPLAFYED+ANALADEKQTAVI SFE
Subjt:  VRVLESVEPLAFYEDLANALADEKQTAVIGSFE

XP_022154597.1 uncharacterized protein LOC111021821 [Momordica charantia]0.0e+0080.95Show/hide
Query:  RYRQAIPVGEKGRLYANTNFGLRKPFLPQGKRRYIPMKQPTAQQVMKDAISAGPTTVFLMGAHTNLAIFLLSNPHLKKNIKHIYAMGGAIREICSESADK
        RYRQAIPVGEKGRLYANTNFGLRK FLPQG RRY P+KQPTAQQV+KDAISAGPTTVFLMG HTNLAIFL++NPHLKKNIKHIYAMGGAIREICS   DK
Subjt:  RYRQAIPVGEKGRLYANTNFGLRKPFLPQGKRRYIPMKQPTAQQVMKDAISAGPTTVFLMGAHTNLAIFLLSNPHLKKNIKHIYAMGGAIREICSESADK

Query:  SHGKTCNNIGNLWPPNTNPYAEFNIFGDPFAAYTVLHSGIPVTLVPLDATSTIPVNKKVFLAFEQRQNTYEAKYCFQSLKMARDTWTGNGFFEIYSMWDS
        SHGKTCNNIGNLWPPNTNPYAEFNIFGDPFAAYTVLHSGIPVTLVPLDATSTIPV+K VFLAFEQR NTYEA+YCFQSLKMARDTW  NGFFEIYSMWDS
Subjt:  SHGKTCNNIGNLWPPNTNPYAEFNIFGDPFAAYTVLHSGIPVTLVPLDATSTIPVNKKVFLAFEQRQNTYEAKYCFQSLKMARDTWTGNGFFEIYSMWDS

Query:  FMVGVALSQMYNLDRGGGNNAYSKMEYLNITIVTSNKPYGISDGSNPLVDGHLLPKFGVQKNGVHSGHVQTGMLDPFCVVPTEIGKCQDGYTKEADGPES
        FMVGV+LSQM+NLD+GGG+NAYSKMEY+NITIVTSN+PYGISDGSNPLVDGHL+PKFGVQKNGVHSGHVQTGMLDPFC++ T  GKCQDGYTKEA+G ES
Subjt:  FMVGVALSQMYNLDRGGGNNAYSKMEYLNITIVTSNKPYGISDGSNPLVDGHLLPKFGVQKNGVHSGHVQTGMLDPFCVVPTEIGKCQDGYTKEADGPES

Query:  VQVLVAVEAKSTIDANSSIDKAFYISFLDVLNSPRQTGRFDFRAQFPNYREVLYRPKFGKKLLGKP-----------------------------GIIIS
        VQVLVAVEAKST D NSSIDKAFYISFLDVLNSP+QTGRFDFRAQFPNY+EVLYRPKFGKKLLGKP                             GIIIS
Subjt:  VQVLVAVEAKSTIDANSSIDKAFYISFLDVLNSPRQTGRFDFRAQFPNYREVLYRPKFGKKLLGKP-----------------------------GIIIS

Query:  PNGWATAATIDVVYDVLHMMGRDDISVGLGDVFAIGEAHPSFPPIGDCK-------------------------------YSAESSVKYGAFRDTDHPEL
        PNGWATAATIDVVYDVLHMMGRDDI VGLGD+FAIGEAHPSFPPIGDCK                               Y+AE+SVK+GA RDTDHPEL
Subjt:  PNGWATAATIDVVYDVLHMMGRDDISVGLGDVFAIGEAHPSFPPIGDCK-------------------------------YSAESSVKYGAFRDTDHPEL

Query:  GQMSALDVWKDVVQSLDLEAKITVLTSGPLTNLAQIIHHKAVSSRIQEVYITGGHINYSVDKGNVFTIPSNEYSEFNFFLDPTAADLVLGSGLNITLIPL
         QMSAL+VWK +V+SLD   KITVLT+GPLTNLAQI+  KA+ SRIQEVYITGGHI +  DKGNVFTIPSN Y+EFNFFLDPTAA+LVLGSGLNITLIPL
Subjt:  GQMSALDVWKDVVQSLDLEAKITVLTSGPLTNLAQIIHHKAVSSRIQEVYITGGHINYSVDKGNVFTIPSNEYSEFNFFLDPTAADLVLGSGLNITLIPL

Query:  NVQRRVSSFHKILKKLKLRNRTPEAWFSRRLLSRLYHLKQKHHQYHHVDMFLGEVLGVVSLAGKHLNLKQTFSFKPLKVVTNGGESKVGQTIIDDKKGKW
        NVQRRVSSFHKILK+LKLRN+TPEA FS+RL SRLYHLKQ HHQYHHVDMFLGEVLG VSLAGKH+NLK+ FSFKPLKVVTNGGESKVGQTIID+KKGKW
Subjt:  NVQRRVSSFHKILKKLKLRNRTPEAWFSRRLLSRLYHLKQKHHQYHHVDMFLGEVLGVVSLAGKHLNLKQTFSFKPLKVVTNGGESKVGQTIIDDKKGKW

Query:  VRVLESVEPLAFYEDLANALADEKQTAVIGSFEGQKRQWT
        VRVLESVEPLAFYE LA+AL DEKQ+AV+GSFE QKR W+
Subjt:  VRVLESVEPLAFYEDLANALADEKQTAVIGSFEGQKRQWT

XP_038879905.1 uncharacterized protein LOC120071620 isoform X1 [Benincasa hispida]0.0e+0087.03Show/hide
Query:  RYRQAIPVGEKGRLYANTNFGLRKPFLPQGKRRYIPMKQPTAQQVMKDAISAGPTTVFLMGAHTNLAIFLLSNPHLKKNIKHIYAMGGAIREICSESADK
        RYRQAIPVGEKGRLYANTNFGLRKPFLPQGKRRYIPMKQPTAQ VMKDA+SAGPTTVFLMGAHTNLAIFLLSNPHLKKNIKHIYAMGGAIREICSE+ADK
Subjt:  RYRQAIPVGEKGRLYANTNFGLRKPFLPQGKRRYIPMKQPTAQQVMKDAISAGPTTVFLMGAHTNLAIFLLSNPHLKKNIKHIYAMGGAIREICSESADK

Query:  SHGKTCNNIGNLWPPNTNPYAEFNIFGDPFAAYTVLHSGIPVTLVPLDATSTIPVNKKVFLAFEQRQNTYEAKYCFQSLKMARDTWTGNGFFEIYSMWDS
        SHGKTCNNIGNLWPPNTNPYAEFNIFGDPFAAYTVLHSGIPVTLVPLDATSTIPVN+KVFL FEQRQNTYEAKYCFQSLKMARDTWTGNGFFEIYSMWDS
Subjt:  SHGKTCNNIGNLWPPNTNPYAEFNIFGDPFAAYTVLHSGIPVTLVPLDATSTIPVNKKVFLAFEQRQNTYEAKYCFQSLKMARDTWTGNGFFEIYSMWDS

Query:  FMVGVALSQMYNLDRGGGNNAYSKMEYLNITIVTSNKPYGISDGSNPLVDGHLLPKFGVQKNGVHSGHVQTGMLDPFCVVPTEIGKCQDGYTKEADGPES
        FMVGVALSQMYN DRG GNNA+SKMEYLNITIVTSNKPYGISDGSNPLVDGHLLPKFGVQKNGVHSGHVQTGMLDPFC+V TEIGKCQDGYTKEADGPES
Subjt:  FMVGVALSQMYNLDRGGGNNAYSKMEYLNITIVTSNKPYGISDGSNPLVDGHLLPKFGVQKNGVHSGHVQTGMLDPFCVVPTEIGKCQDGYTKEADGPES

Query:  VQVLVAVEAKSTIDANSSIDKAFYISFLDVLNSPRQTGRFDFRAQFPNYREVLYRPKFGKKLLGKP-----------------------------GIIIS
        VQVLVAVEAKSTID NSSIDKAFY SFLDVLNSPRQTGRFDFRAQFPNYREVLYRPKFGKKLLGKP                             GIIIS
Subjt:  VQVLVAVEAKSTIDANSSIDKAFYISFLDVLNSPRQTGRFDFRAQFPNYREVLYRPKFGKKLLGKP-----------------------------GIIIS

Query:  PNGWATAATIDVVYDVLHMMGRDDISVGLGDVFAIGEAHPSFPPIGDCK-------------------------------YSAESSVKYGAFRDTDHPEL
        PNGWAT ATIDVVYDVLHMMGRDDISVGLGDVFAIGEAHPSFPPIGDCK                               Y+AESSVK+G FRDTDHPEL
Subjt:  PNGWATAATIDVVYDVLHMMGRDDISVGLGDVFAIGEAHPSFPPIGDCK-------------------------------YSAESSVKYGAFRDTDHPEL

Query:  GQMSALDVWKDVVQSLDLEAKITVLTSGPLTNLAQIIHHKAVSSRIQEVYITGGHINYSVDKGNVFTIPSNEYSEFNFFLDPTAADLVLGSGLNITLIPL
         QMSALDVWKDVVQSLDLE KITVLT+GPLTNLAQI+HHKA+S+RIQEVYI+GG+INY VDKGNVFTIPSNE+SEFNFFLDPTAADLVLGSGLNITLIPL
Subjt:  GQMSALDVWKDVVQSLDLEAKITVLTSGPLTNLAQIIHHKAVSSRIQEVYITGGHINYSVDKGNVFTIPSNEYSEFNFFLDPTAADLVLGSGLNITLIPL

Query:  NVQRRVSSFHKILKKLKLRNRTPEAWFSRRLLSRLYHLKQKHHQYHHVDMFLGEVLGVVSLAGKHLNLKQTFSFKPLKVVTNGGESKVGQTIIDDKKGKW
        NVQR+VSSFHKILKKLKL NRTPEA FSRRLLSRLYHLKQKHHQYHHVDMFLGEVLG VSLAGKH+NLKQTFS KPLKVVTNGGESKVGQTIID+KKGKW
Subjt:  NVQRRVSSFHKILKKLKLRNRTPEAWFSRRLLSRLYHLKQKHHQYHHVDMFLGEVLGVVSLAGKHLNLKQTFSFKPLKVVTNGGESKVGQTIIDDKKGKW

Query:  VRVLESVEPLAFYEDLANALADEKQTAVIGSFEGQKRQWT
        VRVLESVEPLAFYEDLANALADEKQ+AVIGSFEGQKR W+
Subjt:  VRVLESVEPLAFYEDLANALADEKQTAVIGSFEGQKRQWT

XP_038879906.1 uncharacterized protein LOC120071620 isoform X2 [Benincasa hispida]0.0e+0086.36Show/hide
Query:  MKQPTAQQVMKDAISAGPTTVFLMGAHTNLAIFLLSNPHLKKNIKHIYAMGGAIREICSESADKSHGKTCNNIGNLWPPNTNPYAEFNIFGDPFAAYTVL
        MKQPTAQ VMKDA+SAGPTTVFLMGAHTNLAIFLLSNPHLKKNIKHIYAMGGAIREICSE+ADKSHGKTCNNIGNLWPPNTNPYAEFNIFGDPFAAYTVL
Subjt:  MKQPTAQQVMKDAISAGPTTVFLMGAHTNLAIFLLSNPHLKKNIKHIYAMGGAIREICSESADKSHGKTCNNIGNLWPPNTNPYAEFNIFGDPFAAYTVL

Query:  HSGIPVTLVPLDATSTIPVNKKVFLAFEQRQNTYEAKYCFQSLKMARDTWTGNGFFEIYSMWDSFMVGVALSQMYNLDRGGGNNAYSKMEYLNITIVTSN
        HSGIPVTLVPLDATSTIPVN+KVFL FEQRQNTYEAKYCFQSLKMARDTWTGNGFFEIYSMWDSFMVGVALSQMYN DRG GNNA+SKMEYLNITIVTSN
Subjt:  HSGIPVTLVPLDATSTIPVNKKVFLAFEQRQNTYEAKYCFQSLKMARDTWTGNGFFEIYSMWDSFMVGVALSQMYNLDRGGGNNAYSKMEYLNITIVTSN

Query:  KPYGISDGSNPLVDGHLLPKFGVQKNGVHSGHVQTGMLDPFCVVPTEIGKCQDGYTKEADGPESVQVLVAVEAKSTIDANSSIDKAFYISFLDVLNSPRQ
        KPYGISDGSNPLVDGHLLPKFGVQKNGVHSGHVQTGMLDPFC+V TEIGKCQDGYTKEADGPESVQVLVAVEAKSTID NSSIDKAFY SFLDVLNSPRQ
Subjt:  KPYGISDGSNPLVDGHLLPKFGVQKNGVHSGHVQTGMLDPFCVVPTEIGKCQDGYTKEADGPESVQVLVAVEAKSTIDANSSIDKAFYISFLDVLNSPRQ

Query:  TGRFDFRAQFPNYREVLYRPKFGKKLLGKP-----------------------------GIIISPNGWATAATIDVVYDVLHMMGRDDISVGLGDVFAIG
        TGRFDFRAQFPNYREVLYRPKFGKKLLGKP                             GIIISPNGWAT ATIDVVYDVLHMMGRDDISVGLGDVFAIG
Subjt:  TGRFDFRAQFPNYREVLYRPKFGKKLLGKP-----------------------------GIIISPNGWATAATIDVVYDVLHMMGRDDISVGLGDVFAIG

Query:  EAHPSFPPIGDCK-------------------------------YSAESSVKYGAFRDTDHPELGQMSALDVWKDVVQSLDLEAKITVLTSGPLTNLAQI
        EAHPSFPPIGDCK                               Y+AESSVK+G FRDTDHPEL QMSALDVWKDVVQSLDLE KITVLT+GPLTNLAQI
Subjt:  EAHPSFPPIGDCK-------------------------------YSAESSVKYGAFRDTDHPELGQMSALDVWKDVVQSLDLEAKITVLTSGPLTNLAQI

Query:  IHHKAVSSRIQEVYITGGHINYSVDKGNVFTIPSNEYSEFNFFLDPTAADLVLGSGLNITLIPLNVQRRVSSFHKILKKLKLRNRTPEAWFSRRLLSRLY
        +HHKA+S+RIQEVYI+GG+INY VDKGNVFTIPSNE+SEFNFFLDPTAADLVLGSGLNITLIPLNVQR+VSSFHKILKKLKL NRTPEA FSRRLLSRLY
Subjt:  IHHKAVSSRIQEVYITGGHINYSVDKGNVFTIPSNEYSEFNFFLDPTAADLVLGSGLNITLIPLNVQRRVSSFHKILKKLKLRNRTPEAWFSRRLLSRLY

Query:  HLKQKHHQYHHVDMFLGEVLGVVSLAGKHLNLKQTFSFKPLKVVTNGGESKVGQTIIDDKKGKWVRVLESVEPLAFYEDLANALADEKQTAVIGSFEGQK
        HLKQKHHQYHHVDMFLGEVLG VSLAGKH+NLKQTFS KPLKVVTNGGESKVGQTIID+KKGKWVRVLESVEPLAFYEDLANALADEKQ+AVIGSFEGQK
Subjt:  HLKQKHHQYHHVDMFLGEVLGVVSLAGKHLNLKQTFSFKPLKVVTNGGESKVGQTIIDDKKGKWVRVLESVEPLAFYEDLANALADEKQTAVIGSFEGQK

Query:  RQWT
        R W+
Subjt:  RQWT

TrEMBL top hitse value%identityAlignment
A0A0A0LWJ1 Uncharacterized protein0.0e+0083.9Show/hide
Query:  RYRQAIPVGEKGRLYANTNFGLRKPFLPQGKRRYIPMKQPTAQQVMKDAISAGPTTVFLMGAHTNLAIFLLSNPHLKKNIKHIYAMGGAIREICSESADK
        RYRQAIPVG  GRL ANTNFGLRK FLPQGKRRYIPMKQPTAQQVMKDAISAGPT VFLMGAHTNLAIFLLSNPHLKKNIKH+YAMGGAIREICSESADK
Subjt:  RYRQAIPVGEKGRLYANTNFGLRKPFLPQGKRRYIPMKQPTAQQVMKDAISAGPTTVFLMGAHTNLAIFLLSNPHLKKNIKHIYAMGGAIREICSESADK

Query:  SHGKTCNNIGNLWPPNTNPYAEFNIFGDPFAAYTVLHSGIPVTLVPLDATSTIPVNKKVFLAFEQRQNTYEAKYCFQSLKMARDTWTGNGFFEIYSMWDS
        SHGKTCNNIGNLWPPNTNPYAEFNIFGDPFAAYTVLHSGIPVTLVPLDATSTIPVNK+VFLAFEQRQNTYEAKYCFQSLKMA DTW  +GFFEIYSMWDS
Subjt:  SHGKTCNNIGNLWPPNTNPYAEFNIFGDPFAAYTVLHSGIPVTLVPLDATSTIPVNKKVFLAFEQRQNTYEAKYCFQSLKMARDTWTGNGFFEIYSMWDS

Query:  FMVGVALSQMYNLDRGGGNNAYSKMEYLNITIVTSNKPYGISDGSNPLVDGHLLPKFGVQKNGVHSGHVQTGMLDPFCVVPTEIGKCQDGYTKEADGPES
        FMVGVALSQMYNL RGGGNNA+SKMEYLNITIVTSNKPYGISDGSNPLVDGHLLP  G Q NGVHSGHVQTGMLDPFC+  T  GKCQDGYTKE+DG ES
Subjt:  FMVGVALSQMYNLDRGGGNNAYSKMEYLNITIVTSNKPYGISDGSNPLVDGHLLPKFGVQKNGVHSGHVQTGMLDPFCVVPTEIGKCQDGYTKEADGPES

Query:  VQVLVAVEAKSTIDANSSIDKAFYISFLDVLNSPRQTGRFDFRAQFPNYREVLYRPKFGKKLLGKP-----------------------------GIIIS
        VQVLVAVEAKSTID NSSIDKAFYISFLDVLNSPRQTGRFDFRAQFPNYREVLYRPKFGK+LLGKP                             GIIIS
Subjt:  VQVLVAVEAKSTIDANSSIDKAFYISFLDVLNSPRQTGRFDFRAQFPNYREVLYRPKFGKKLLGKP-----------------------------GIIIS

Query:  PNGWATAATIDVVYDVLHMMGRDDISVGLGDVFAIGEAHPSFPPIGDCK-------------------------------YSAESSVKYGAFRDTDHPEL
        PNGWATAATIDVVYDVLHMMGRDDISVGLGDVFAIGEAHP +PPIGDCK                               Y+AE+SVK+GAFRDTDHPEL
Subjt:  PNGWATAATIDVVYDVLHMMGRDDISVGLGDVFAIGEAHPSFPPIGDCK-------------------------------YSAESSVKYGAFRDTDHPEL

Query:  GQMSALDVWKDVVQSLDLEAKITVLTSGPLTNLAQIIHHKAVSSRIQEVYITGGHINYSVDKGNVFTIPSNEYSEFNFFLDPTAADLVLGSGLNITLIPL
         QMS LDVWKDVVQSL+L+AKITVLT+GPLTNLA+II HKA+S+RI+EVYITGGH+NY VDKGN+FTIPSNEYSEFNFFLDP AADLV  SGLNITLIPL
Subjt:  GQMSALDVWKDVVQSLDLEAKITVLTSGPLTNLAQIIHHKAVSSRIQEVYITGGHINYSVDKGNVFTIPSNEYSEFNFFLDPTAADLVLGSGLNITLIPL

Query:  NVQRRVSSFHKILKKLKLRNRTPEAWFSRRLLSRLYHLKQKHHQYHHVDMFLGEVLGVVSLAGKHLNLKQTFSFKPLKVVTNGGESKVGQTIIDDKKGKW
        NVQRRVSSFHKIL+KLKLRNRTPEAW SRRLL RLY LKQKHHQYHHVDMFLGEVLG VSLAGKHLNLKQTFSFKPLKV++NGGESKVGQTIID+KKGKW
Subjt:  NVQRRVSSFHKILKKLKLRNRTPEAWFSRRLLSRLYHLKQKHHQYHHVDMFLGEVLGVVSLAGKHLNLKQTFSFKPLKVVTNGGESKVGQTIIDDKKGKW

Query:  VRVLESVEPLAFYEDLANALADEKQTAVIGSFE
        VRVLES+EPLAFYED+ANALADEKQTAVI SFE
Subjt:  VRVLESVEPLAFYEDLANALADEKQTAVIGSFE

A0A1S3BPV7 uncharacterized protein LOC103492210 isoform X40.0e+0082.45Show/hide
Query:  RYRQAIPVGEKGRLYANTNFGLRKPFLPQGKRRYIPMKQPTAQQVMKDAISAGPTTVFLMGAHTNLAIFLLSNPHLKKNIKHIYAMGGAIREICSESADK
        RYRQAIPVG  GRL ANTNFGLRK FLPQGKRRYIPMKQPTAQQVMKDAISAGPT VFLMGAHTNLAIFLLSNPHLKKNIKH+YAMGGAIREICSES   
Subjt:  RYRQAIPVGEKGRLYANTNFGLRKPFLPQGKRRYIPMKQPTAQQVMKDAISAGPTTVFLMGAHTNLAIFLLSNPHLKKNIKHIYAMGGAIREICSESADK

Query:  SHGKTCNNIGNLWPPNTNPYAEFNIFGDPFAAYTVLHSGIPVTLVPLDATSTIPVNKKVFLAFEQRQNTYEAKYCFQSLKMARDTWTGNGFFEIYSMWDS
        SHGKTC+NIGNLWPPNTNPYAEFNIFGDPFAAYTVLHSGIPVTLVPLDATSTIPVNK+VFLAFEQRQNTYEAKYCFQSLKMARDTW  +GFFE+YSMWDS
Subjt:  SHGKTCNNIGNLWPPNTNPYAEFNIFGDPFAAYTVLHSGIPVTLVPLDATSTIPVNKKVFLAFEQRQNTYEAKYCFQSLKMARDTWTGNGFFEIYSMWDS

Query:  FMVGVALSQMYNLDRGG--GNNAYSKMEYLNITIVTSNKPYGISDGSNPLVDGHLLPKFGVQKNGVHSGHVQTGMLDPFCVVPTEIGKCQDGYTKEADGP
        FMVGVALSQMYNL RGG  G NA+SKMEYLN+TIVTSN+PYGISDGSNP V+G LL  FG QKNGVHSGHVQTGMLDPFC+  T  GKCQDGYTKEADG 
Subjt:  FMVGVALSQMYNLDRGG--GNNAYSKMEYLNITIVTSNKPYGISDGSNPLVDGHLLPKFGVQKNGVHSGHVQTGMLDPFCVVPTEIGKCQDGYTKEADGP

Query:  ESVQVLVAVEAKSTIDANSSIDKAFYISFLDVLNSPRQTGRFDFRAQFPNYREVLYRPKFGKKLLGKP-----------------------------GII
        ESVQVLVAVEAKSTID NSSIDKAFYISFLDVLNSPRQTGRFDFRAQFP YREVLYRP FGK+LLGKP                             GII
Subjt:  ESVQVLVAVEAKSTIDANSSIDKAFYISFLDVLNSPRQTGRFDFRAQFPNYREVLYRPKFGKKLLGKP-----------------------------GII

Query:  ISPNGWATAATIDVVYDVLHMMGRDDISVGLGDVFAIGEAHPSFPPIGDCK-------------------------------YSAESSVKYGAFRDTDHP
        ISPNGWATAATIDVVYDVLHMMGRDDISVGLGD+FAIGE HP FPPIGDCK                               Y+AE+SVK+GAFRDTDHP
Subjt:  ISPNGWATAATIDVVYDVLHMMGRDDISVGLGDVFAIGEAHPSFPPIGDCK-------------------------------YSAESSVKYGAFRDTDHP

Query:  ELGQMSALDVWKDVVQSLDLEAKITVLTSGPLTNLAQIIHHKAVSSRIQEVYITGGHINYSVDKGNVFTIPSNEYSEFNFFLDPTAADLVLGSGLNITLI
        EL QMSALDVWKDVV++LDL+AKITVLTSGPLTNLA+IIHHKA+S+RI+EVYITGGHI+Y VDKGN+FTIPSNEYSEFNFFLDP AADLV GSGLNITLI
Subjt:  ELGQMSALDVWKDVVQSLDLEAKITVLTSGPLTNLAQIIHHKAVSSRIQEVYITGGHINYSVDKGNVFTIPSNEYSEFNFFLDPTAADLVLGSGLNITLI

Query:  PLNVQRRVSSFHKILKKLKLRNRTPEAWFSRRLLSRLYHLKQKHHQYHHVDMFLGEVLGVVSLAGKHLNLKQTFSFKPLKVVTNGGESKVGQTIIDDKKG
        PLNVQRRVSSF+KILKKLK RNRTPEAWFSRRLL RLY LKQKHHQYHHVDMFLGEV+G VSLAGKHLNLKQTFSFKPLKV++NGGESKVGQTIID KKG
Subjt:  PLNVQRRVSSFHKILKKLKLRNRTPEAWFSRRLLSRLYHLKQKHHQYHHVDMFLGEVLGVVSLAGKHLNLKQTFSFKPLKVVTNGGESKVGQTIIDDKKG

Query:  KWVRVLESVEPLAFYEDLANALADEKQTAVIGSFE
        KWVRVLES+EPLA YEDLANALADEKQTAVI SFE
Subjt:  KWVRVLESVEPLAFYEDLANALADEKQTAVIGSFE

A0A1S4DY76 uncharacterized protein LOC103492210 isoform X30.0e+0082.45Show/hide
Query:  RYRQAIPVGEKGRLYANTNFGLRKPFLPQGKRRYIPMKQPTAQQVMKDAISAGPTTVFLMGAHTNLAIFLLSNPHLKKNIKHIYAMGGAIREICSESADK
        RYRQAIPVG  GRL ANTNFGLRK FLPQGKRRYIPMKQPTAQQVMKDAISAGPT VFLMGAHTNLAIFLLSNPHLKKNIKH+YAMGGAIREICSES   
Subjt:  RYRQAIPVGEKGRLYANTNFGLRKPFLPQGKRRYIPMKQPTAQQVMKDAISAGPTTVFLMGAHTNLAIFLLSNPHLKKNIKHIYAMGGAIREICSESADK

Query:  SHGKTCNNIGNLWPPNTNPYAEFNIFGDPFAAYTVLHSGIPVTLVPLDATSTIPVNKKVFLAFEQRQNTYEAKYCFQSLKMARDTWTGNGFFEIYSMWDS
        SHGKTC+NIGNLWPPNTNPYAEFNIFGDPFAAYTVLHSGIPVTLVPLDATSTIPVNK+VFLAFEQRQNTYEAKYCFQSLKMARDTW  +GFFE+YSMWDS
Subjt:  SHGKTCNNIGNLWPPNTNPYAEFNIFGDPFAAYTVLHSGIPVTLVPLDATSTIPVNKKVFLAFEQRQNTYEAKYCFQSLKMARDTWTGNGFFEIYSMWDS

Query:  FMVGVALSQMYNLDRGG--GNNAYSKMEYLNITIVTSNKPYGISDGSNPLVDGHLLPKFGVQKNGVHSGHVQTGMLDPFCVVPTEIGKCQDGYTKEADGP
        FMVGVALSQMYNL RGG  G NA+SKMEYLN+TIVTSN+PYGISDGSNP V+G LL  FG QKNGVHSGHVQTGMLDPFC+  T  GKCQDGYTKEADG 
Subjt:  FMVGVALSQMYNLDRGG--GNNAYSKMEYLNITIVTSNKPYGISDGSNPLVDGHLLPKFGVQKNGVHSGHVQTGMLDPFCVVPTEIGKCQDGYTKEADGP

Query:  ESVQVLVAVEAKSTIDANSSIDKAFYISFLDVLNSPRQTGRFDFRAQFPNYREVLYRPKFGKKLLGKP-----------------------------GII
        ESVQVLVAVEAKSTID NSSIDKAFYISFLDVLNSPRQTGRFDFRAQFP YREVLYRP FGK+LLGKP                             GII
Subjt:  ESVQVLVAVEAKSTIDANSSIDKAFYISFLDVLNSPRQTGRFDFRAQFPNYREVLYRPKFGKKLLGKP-----------------------------GII

Query:  ISPNGWATAATIDVVYDVLHMMGRDDISVGLGDVFAIGEAHPSFPPIGDCK-------------------------------YSAESSVKYGAFRDTDHP
        ISPNGWATAATIDVVYDVLHMMGRDDISVGLGD+FAIGE HP FPPIGDCK                               Y+AE+SVK+GAFRDTDHP
Subjt:  ISPNGWATAATIDVVYDVLHMMGRDDISVGLGDVFAIGEAHPSFPPIGDCK-------------------------------YSAESSVKYGAFRDTDHP

Query:  ELGQMSALDVWKDVVQSLDLEAKITVLTSGPLTNLAQIIHHKAVSSRIQEVYITGGHINYSVDKGNVFTIPSNEYSEFNFFLDPTAADLVLGSGLNITLI
        EL QMSALDVWKDVV++LDL+AKITVLTSGPLTNLA+IIHHKA+S+RI+EVYITGGHI+Y VDKGN+FTIPSNEYSEFNFFLDP AADLV GSGLNITLI
Subjt:  ELGQMSALDVWKDVVQSLDLEAKITVLTSGPLTNLAQIIHHKAVSSRIQEVYITGGHINYSVDKGNVFTIPSNEYSEFNFFLDPTAADLVLGSGLNITLI

Query:  PLNVQRRVSSFHKILKKLKLRNRTPEAWFSRRLLSRLYHLKQKHHQYHHVDMFLGEVLGVVSLAGKHLNLKQTFSFKPLKVVTNGGESKVGQTIIDDKKG
        PLNVQRRVSSF+KILKKLK RNRTPEAWFSRRLL RLY LKQKHHQYHHVDMFLGEV+G VSLAGKHLNLKQTFSFKPLKV++NGGESKVGQTIID KKG
Subjt:  PLNVQRRVSSFHKILKKLKLRNRTPEAWFSRRLLSRLYHLKQKHHQYHHVDMFLGEVLGVVSLAGKHLNLKQTFSFKPLKVVTNGGESKVGQTIIDDKKG

Query:  KWVRVLESVEPLAFYEDLANALADEKQTAVIGSFE
        KWVRVLES+EPLA YEDLANALADEKQTAVI SFE
Subjt:  KWVRVLESVEPLAFYEDLANALADEKQTAVIGSFE

A0A1S4DYY6 uncharacterized protein LOC103492210 isoform X20.0e+0082.45Show/hide
Query:  RYRQAIPVGEKGRLYANTNFGLRKPFLPQGKRRYIPMKQPTAQQVMKDAISAGPTTVFLMGAHTNLAIFLLSNPHLKKNIKHIYAMGGAIREICSESADK
        RYRQAIPVG  GRL ANTNFGLRK FLPQGKRRYIPMKQPTAQQVMKDAISAGPT VFLMGAHTNLAIFLLSNPHLKKNIKH+YAMGGAIREICSES   
Subjt:  RYRQAIPVGEKGRLYANTNFGLRKPFLPQGKRRYIPMKQPTAQQVMKDAISAGPTTVFLMGAHTNLAIFLLSNPHLKKNIKHIYAMGGAIREICSESADK

Query:  SHGKTCNNIGNLWPPNTNPYAEFNIFGDPFAAYTVLHSGIPVTLVPLDATSTIPVNKKVFLAFEQRQNTYEAKYCFQSLKMARDTWTGNGFFEIYSMWDS
        SHGKTC+NIGNLWPPNTNPYAEFNIFGDPFAAYTVLHSGIPVTLVPLDATSTIPVNK+VFLAFEQRQNTYEAKYCFQSLKMARDTW  +GFFE+YSMWDS
Subjt:  SHGKTCNNIGNLWPPNTNPYAEFNIFGDPFAAYTVLHSGIPVTLVPLDATSTIPVNKKVFLAFEQRQNTYEAKYCFQSLKMARDTWTGNGFFEIYSMWDS

Query:  FMVGVALSQMYNLDRGG--GNNAYSKMEYLNITIVTSNKPYGISDGSNPLVDGHLLPKFGVQKNGVHSGHVQTGMLDPFCVVPTEIGKCQDGYTKEADGP
        FMVGVALSQMYNL RGG  G NA+SKMEYLN+TIVTSN+PYGISDGSNP V+G LL  FG QKNGVHSGHVQTGMLDPFC+  T  GKCQDGYTKEADG 
Subjt:  FMVGVALSQMYNLDRGG--GNNAYSKMEYLNITIVTSNKPYGISDGSNPLVDGHLLPKFGVQKNGVHSGHVQTGMLDPFCVVPTEIGKCQDGYTKEADGP

Query:  ESVQVLVAVEAKSTIDANSSIDKAFYISFLDVLNSPRQTGRFDFRAQFPNYREVLYRPKFGKKLLGKP-----------------------------GII
        ESVQVLVAVEAKSTID NSSIDKAFYISFLDVLNSPRQTGRFDFRAQFP YREVLYRP FGK+LLGKP                             GII
Subjt:  ESVQVLVAVEAKSTIDANSSIDKAFYISFLDVLNSPRQTGRFDFRAQFPNYREVLYRPKFGKKLLGKP-----------------------------GII

Query:  ISPNGWATAATIDVVYDVLHMMGRDDISVGLGDVFAIGEAHPSFPPIGDCK-------------------------------YSAESSVKYGAFRDTDHP
        ISPNGWATAATIDVVYDVLHMMGRDDISVGLGD+FAIGE HP FPPIGDCK                               Y+AE+SVK+GAFRDTDHP
Subjt:  ISPNGWATAATIDVVYDVLHMMGRDDISVGLGDVFAIGEAHPSFPPIGDCK-------------------------------YSAESSVKYGAFRDTDHP

Query:  ELGQMSALDVWKDVVQSLDLEAKITVLTSGPLTNLAQIIHHKAVSSRIQEVYITGGHINYSVDKGNVFTIPSNEYSEFNFFLDPTAADLVLGSGLNITLI
        EL QMSALDVWKDVV++LDL+AKITVLTSGPLTNLA+IIHHKA+S+RI+EVYITGGHI+Y VDKGN+FTIPSNEYSEFNFFLDP AADLV GSGLNITLI
Subjt:  ELGQMSALDVWKDVVQSLDLEAKITVLTSGPLTNLAQIIHHKAVSSRIQEVYITGGHINYSVDKGNVFTIPSNEYSEFNFFLDPTAADLVLGSGLNITLI

Query:  PLNVQRRVSSFHKILKKLKLRNRTPEAWFSRRLLSRLYHLKQKHHQYHHVDMFLGEVLGVVSLAGKHLNLKQTFSFKPLKVVTNGGESKVGQTIIDDKKG
        PLNVQRRVSSF+KILKKLK RNRTPEAWFSRRLL RLY LKQKHHQYHHVDMFLGEV+G VSLAGKHLNLKQTFSFKPLKV++NGGESKVGQTIID KKG
Subjt:  PLNVQRRVSSFHKILKKLKLRNRTPEAWFSRRLLSRLYHLKQKHHQYHHVDMFLGEVLGVVSLAGKHLNLKQTFSFKPLKVVTNGGESKVGQTIIDDKKG

Query:  KWVRVLESVEPLAFYEDLANALADEKQTAVIGSFE
        KWVRVLES+EPLA YEDLANALADEKQTAVI SFE
Subjt:  KWVRVLESVEPLAFYEDLANALADEKQTAVIGSFE

A0A6J1DM35 uncharacterized protein LOC1110218210.0e+0080.95Show/hide
Query:  RYRQAIPVGEKGRLYANTNFGLRKPFLPQGKRRYIPMKQPTAQQVMKDAISAGPTTVFLMGAHTNLAIFLLSNPHLKKNIKHIYAMGGAIREICSESADK
        RYRQAIPVGEKGRLYANTNFGLRK FLPQG RRY P+KQPTAQQV+KDAISAGPTTVFLMG HTNLAIFL++NPHLKKNIKHIYAMGGAIREICS   DK
Subjt:  RYRQAIPVGEKGRLYANTNFGLRKPFLPQGKRRYIPMKQPTAQQVMKDAISAGPTTVFLMGAHTNLAIFLLSNPHLKKNIKHIYAMGGAIREICSESADK

Query:  SHGKTCNNIGNLWPPNTNPYAEFNIFGDPFAAYTVLHSGIPVTLVPLDATSTIPVNKKVFLAFEQRQNTYEAKYCFQSLKMARDTWTGNGFFEIYSMWDS
        SHGKTCNNIGNLWPPNTNPYAEFNIFGDPFAAYTVLHSGIPVTLVPLDATSTIPV+K VFLAFEQR NTYEA+YCFQSLKMARDTW  NGFFEIYSMWDS
Subjt:  SHGKTCNNIGNLWPPNTNPYAEFNIFGDPFAAYTVLHSGIPVTLVPLDATSTIPVNKKVFLAFEQRQNTYEAKYCFQSLKMARDTWTGNGFFEIYSMWDS

Query:  FMVGVALSQMYNLDRGGGNNAYSKMEYLNITIVTSNKPYGISDGSNPLVDGHLLPKFGVQKNGVHSGHVQTGMLDPFCVVPTEIGKCQDGYTKEADGPES
        FMVGV+LSQM+NLD+GGG+NAYSKMEY+NITIVTSN+PYGISDGSNPLVDGHL+PKFGVQKNGVHSGHVQTGMLDPFC++ T  GKCQDGYTKEA+G ES
Subjt:  FMVGVALSQMYNLDRGGGNNAYSKMEYLNITIVTSNKPYGISDGSNPLVDGHLLPKFGVQKNGVHSGHVQTGMLDPFCVVPTEIGKCQDGYTKEADGPES

Query:  VQVLVAVEAKSTIDANSSIDKAFYISFLDVLNSPRQTGRFDFRAQFPNYREVLYRPKFGKKLLGKP-----------------------------GIIIS
        VQVLVAVEAKST D NSSIDKAFYISFLDVLNSP+QTGRFDFRAQFPNY+EVLYRPKFGKKLLGKP                             GIIIS
Subjt:  VQVLVAVEAKSTIDANSSIDKAFYISFLDVLNSPRQTGRFDFRAQFPNYREVLYRPKFGKKLLGKP-----------------------------GIIIS

Query:  PNGWATAATIDVVYDVLHMMGRDDISVGLGDVFAIGEAHPSFPPIGDCK-------------------------------YSAESSVKYGAFRDTDHPEL
        PNGWATAATIDVVYDVLHMMGRDDI VGLGD+FAIGEAHPSFPPIGDCK                               Y+AE+SVK+GA RDTDHPEL
Subjt:  PNGWATAATIDVVYDVLHMMGRDDISVGLGDVFAIGEAHPSFPPIGDCK-------------------------------YSAESSVKYGAFRDTDHPEL

Query:  GQMSALDVWKDVVQSLDLEAKITVLTSGPLTNLAQIIHHKAVSSRIQEVYITGGHINYSVDKGNVFTIPSNEYSEFNFFLDPTAADLVLGSGLNITLIPL
         QMSAL+VWK +V+SLD   KITVLT+GPLTNLAQI+  KA+ SRIQEVYITGGHI +  DKGNVFTIPSN Y+EFNFFLDPTAA+LVLGSGLNITLIPL
Subjt:  GQMSALDVWKDVVQSLDLEAKITVLTSGPLTNLAQIIHHKAVSSRIQEVYITGGHINYSVDKGNVFTIPSNEYSEFNFFLDPTAADLVLGSGLNITLIPL

Query:  NVQRRVSSFHKILKKLKLRNRTPEAWFSRRLLSRLYHLKQKHHQYHHVDMFLGEVLGVVSLAGKHLNLKQTFSFKPLKVVTNGGESKVGQTIIDDKKGKW
        NVQRRVSSFHKILK+LKLRN+TPEA FS+RL SRLYHLKQ HHQYHHVDMFLGEVLG VSLAGKH+NLK+ FSFKPLKVVTNGGESKVGQTIID+KKGKW
Subjt:  NVQRRVSSFHKILKKLKLRNRTPEAWFSRRLLSRLYHLKQKHHQYHHVDMFLGEVLGVVSLAGKHLNLKQTFSFKPLKVVTNGGESKVGQTIIDDKKGKW

Query:  VRVLESVEPLAFYEDLANALADEKQTAVIGSFEGQKRQWT
        VRVLESVEPLAFYE LA+AL DEKQ+AV+GSFE QKR W+
Subjt:  VRVLESVEPLAFYEDLANALADEKQTAVIGSFEGQKRQWT

SwissProt top hitse value%identityAlignment
Q8CFK2 Transcription factor IIIB 90 kDa subunit1.2e-0926.17Show/hide
Query:  EVSRTALKIITSMKRDWMQ-----------------------------IKIVHICDATLTKRLIEFENTESGSLTMEEFI-VMADKVKGSNSYT------
        EVS TAL+++  MKRDWM                              I +V +C++TL KRL EFE+T +  LT++EF+ +  ++     SYT      
Subjt:  EVSRTALKIITSMKRDWMQ-----------------------------IKIVHICDATLTKRLIEFENTESGSLTMEEFI-VMADKVKGSNSYT------

Query:  ---------NNGLNATSDEVLCVH-----KNECKKPYALGLCKSCYDDFVELSGGLDGGSNPPAFQSAEKERMEKAMVEEGSNDSSAIGKFSQGLNPCNN
                 +  L     E+         + E  +P A G   +   D      G D  S+P   +  E E +E A      +    +     G     +
Subjt:  ---------NNGLNATSDEVLCVH-----KNECKKPYALGLCKSCYDDFVELSGGLDGGSNPPAFQSAEKERMEKAMVEEGSNDSSAIGKFSQGLNPCNN

Query:  TEKESDNVHADASKTVGSKEAEAKGAADEQRGLDEGANKIGADGLGATASDESDNWSDIDDIEVDGYLHNEEEKHYKKIIWEEMNREYLEEQAAKDAAAA
         +  S  + A  S       A + G +D  R   E  +    D   ++   E D  S IDD+E+D Y+ NE E   K  +W   N EYL EQ  K+A  A
Subjt:  TEKESDNVHADASKTVGSKEAEAKGAADEQRGLDEGANKIGADGLGATASDESDNWSDIDDIEVDGYLHNEEEKHYKKIIWEEMNREYLEEQAAKDAAAA

Query:  AAKNAYEANFQNCSEDLLAAKDLADAAAAAVAKSRKERQRKRAADAKNASPAQTAAEATRQMLNKKRLSSKINYDVLDKLFDESDG
          K                                KE + K++   +    A TA EA  +ML +K++SSKINY VL  L  +  G
Subjt:  AAKNAYEANFQNCSEDLLAAKDLADAAAAAVAKSRKERQRKRAADAKNASPAQTAAEATRQMLNKKRLSSKINYDVLDKLFDESDG

Q92994 Transcription factor IIIB 90 kDa subunit1.6e-0926.55Show/hide
Query:  EVSRTALKIITSMKRDWMQ-----------------------------IKIVHICDATLTKRLIEFENTESGSLTMEEFI-VMADKVKGSNSYTNNGLNA
        EVS TAL+++  MKRDWM                              I +V +C++TL KRL EFE+T +  LT++EF+ +  ++     SYT      
Subjt:  EVSRTALKIITSMKRDWMQ-----------------------------IKIVHICDATLTKRLIEFENTESGSLTMEEFI-VMADKVKGSNSYTNNGLNA

Query:  TS---DEVLCVHKNECKKPYALGLCKSCYDDFVEL---------SGGL-----DGGSNPPAF-----QSAEKERMEKAMVEEGSNDSSAIGKFSQGLNPC
             ++VL     E +         S Y D +E+          GGL     DG +   A      +  E E +E A      +    +   + G +  
Subjt:  TS---DEVLCVHKNECKKPYALGLCKSCYDDFVEL---------SGGL-----DGGSNPPAF-----QSAEKERMEKAMVEEGSNDSSAIGKFSQGLNPC

Query:  NNTEKESDNVHADASKTVGSKEAEAKGAADEQRGLDEGANKIGADGLGATASDESDNWSDIDDIEVDGYLHNEEEKHYKKIIWEEMNREYLEEQAAKDAA
          + +      A  S       A + G +D  R   E  +   +D   A+   E D  S IDD+E+D Y+ NE E   K  +W   N EYL EQ  K+A 
Subjt:  NNTEKESDNVHADASKTVGSKEAEAKGAADEQRGLDEGANKIGADGLGATASDESDNWSDIDDIEVDGYLHNEEEKHYKKIIWEEMNREYLEEQAAKDAA

Query:  AAAAKNAYEANFQNCSEDLLAAKDLADAAAAAVAKSRKERQRKRAADAKNASPAQTAAEATRQMLNKKRLSSKINYDVLDKLFDESDG
         A  K                                KE + K++   +    A TA EA  +ML +K++SSKINY VL  L     G
Subjt:  AAAAKNAYEANFQNCSEDLLAAKDLADAAAAAVAKSRKERQRKRAADAKNASPAQTAAEATRQMLNKKRLSSKINYDVLDKLFDESDG

Q9P6R0 Transcription factor IIIB 60 kDa subunit3.4e-0723.39Show/hide
Query:  VSRTALKIITSMKRDWMQI-----------------------------KIVHICDATLTKRLIEFENTESGSLTMEEFIVMADKVKGSNSYTNNGLNATS
        V+  A++++  M RDWMQI                              +V + D T+ KRL EF+ TESG L++ +F          N +         
Subjt:  VSRTALKIITSMKRDWMQI-----------------------------KIVHICDATLTKRLIEFENTESGSLTMEEFIVMADKVKGSNSYTNNGLNATS

Query:  DEVLCVHKNECKKPYALGLCKSCYDDFVELSGGLDGGSNPPAFQSAEKERMEKAMVEEGSNDSSAIGKFSQGLNPCNNTE----KESDNVHADASKTVGS
                                         L+G S+PP+F   +K        + G+   S I    + ++P   T      E  +     +  V S
Subjt:  DEVLCVHKNECKKPYALGLCKSCYDDFVELSGGLDGGSNPPAFQSAEKERMEKAMVEEGSNDSSAIGKFSQGLNPCNNTE----KESDNVHADASKTVGS

Query:  KEAEAKGAADEQRGLDEGANKIGADGL-------GATASDESDNWSDIDDIEVDGYLHNEEEKHYKKIIWEEMNREYLEEQAAKDAAAAAAKNAYEANFQ
        +E      ADE+    E    +  D L           S+E     D+DD E++  L +++E   K  +W E+N++YL E+ AK+               
Subjt:  KEAEAKGAADEQRGLDEGANKIGADGL-------GATASDESDNWSDIDDIEVDGYLHNEEEKHYKKIIWEEMNREYLEEQAAKDAAAAAAKNAYEANFQ

Query:  NCSEDLLAAKDLADAAAAAVAKSRKERQRKRAADAKNASPAQTAAEATRQMLNKKRLSSKINYDVLDKLFDE
           EDL             + +  ++R+R R  D+ +   A TAAE+ ++M+ ++  S KINY+ LD LFDE
Subjt:  NCSEDLLAAKDLADAAAAAVAKSRKERQRKRAADAKNASPAQTAAEATRQMLNKKRLSSKINYDVLDKLFDE

Arabidopsis top hitse value%identityAlignment
AT2G01280.1 Cyclin/Brf1-like TBP-binding protein2.7e-4439.84Show/hide
Query:  KEVSRTALKIITSMKRDWMQ-----------------------------IKIVHICDATLTKRLIEFENTESGSLTMEEFIVMADKVKGSNSYTNNGLNA
        K+V+ TA  II+SMKRDW+Q                              K+VH+C+AT+TKRL EF NTE+GSLT++E +  ++++    ++T    N+
Subjt:  KEVSRTALKIITSMKRDWMQ-----------------------------IKIVHICDATLTKRLIEFENTESGSLTMEEFIVMADKVKGSNSYTNNGLNA

Query:  TSDEVLCVHKNECKKPYALGLCKSCYDDFVELSGGLDGGSNPPAFQSAEKERMEKAMVEEGSNDSSAIGKFSQGLNPCNNTEKESDNVHADASKTVGSKE
            V C HK+   K +  GLCKSC+DDF+ +SGG+ GGS+PPA+Q AEKERMEKA  EE   +   IG         N    E  NV   A K     E
Subjt:  TSDEVLCVHKNECKKPYALGLCKSCYDDFVELSGGLDGGSNPPAFQSAEKERMEKAMVEEGSNDSSAIGKFSQGLNPCNNTEKESDNVHADASKTVGSKE

Query:  AEAKGAADEQRGLDEGANKIGADGLGATASDESDNWSDIDDIEVDGYLHNEEEKHYKKIIWEEMNREYLEEQAAKDAAAAAAKNAYEANFQNCSEDLLAA
         E  G                     A  SDESD  SD DD EV+  L  E+E   K   W   N++YLEEQA K+AA  AA         NC ED   A
Subjt:  AEAKGAADEQRGLDEGANKIGADGLGATASDESDNWSDIDDIEVDGYLHNEEEKHYKKIIWEEMNREYLEEQAAKDAAAAAAKNAYEANFQNCSEDLLAA

Query:  KDLADAAAAAVAKSRKERQRKRAADAKNASPAQTAAEATRQMLNKKRLSSKINYDVLDKLFDES
        ++L +A+ AAVA SRKE++RKRA +AKNA P+ TA EA+      +R++    + VLD+L D S
Subjt:  KDLADAAAAAVAKSRKERQRKRAADAKNASPAQTAAEATRQMLNKKRLSSKINYDVLDKLFDES

AT2G45100.1 Cyclin/Brf1-like TBP-binding protein3.0e-3534.97Show/hide
Query:  TALKIITSMKRDWMQ-----------------------------IKIVHICDATLTKRLIEFENTESGSLTMEEFI-------VMADKVKGSNSYTNNGL
        TA  II SMKRDWMQ                             + IVHIC+ATLTKRLIEF +TE+ SLT +E           A + K   ++   G+
Subjt:  TALKIITSMKRDWMQ-----------------------------IKIVHICDATLTKRLIEFENTESGSLTMEEFI-------VMADKVKGSNSYTNNGL

Query:  NATSDEVLCVHKNECKKPYALGLCKSCYDDFVELSGGLDGGSNPPAFQSAEKERMEKAMVEEGSNDSSAIGKFSQGLNPCNNTEKESDNVHADASKTVGS
              VLC+H+ +C KP   GLC+SCYD+F+ +SGGL+GGS+PPAFQ AEKERME                                            
Subjt:  NATSDEVLCVHKNECKKPYALGLCKSCYDDFVELSGGLDGGSNPPAFQSAEKERMEKAMVEEGSNDSSAIGKFSQGLNPCNNTEKESDNVHADASKTVGS

Query:  KEAEAKGAADEQRGLDEGANKIGADGLGATASDESDNWSDIDDIEVDGYLHNEEEKHYKKIIWEEMNREYLEEQAAKDAAAAAAKNAYEANFQNCSEDLL
                  E+   +E   ++  DG     SDES   SD+DD E+D Y    EE    KI ++  N  Y E++AAK AA   A N              
Subjt:  KEAEAKGAADEQRGLDEGANKIGADGLGATASDESDNWSDIDDIEVDGYLHNEEEKHYKKIIWEEMNREYLEEQAAKDAAAAAAKNAYEANFQNCSEDLL

Query:  AAKDLADAAAAAVAKSRKERQRKRAADAKNASPAQTAAEATRQMLNKKRLSSKINYDVLDKLFDES
         A ++ +A+ AA AKSRKE++++RA + KNA P  T  EA   M+ +K+    IN D L++LFD S
Subjt:  AAKDLADAAAAAVAKSRKERQRKRAADAKNASPAQTAAEATRQMLNKKRLSSKINYDVLDKLFDES

AT3G09360.1 Cyclin/Brf1-like TBP-binding protein4.3e-6641.35Show/hide
Query:  TELLQFSVQLFTDCCRYCDMCGKVLDFYNFSQDPTFTKDSGGQR--ETLHGDAIQSLFKLLAFTLLVGLLGGTKEDGM-KKEVSRTALKIITSMKRDWMQ
        TEL+Q S    T  CR  ++   ++DF ++ +   +   S   +  E L+    ++  KL+  ++ +     +   G   K+V  TA  II SMKRDW+Q
Subjt:  TELLQFSVQLFTDCCRYCDMCGKVLDFYNFSQDPTFTKDSGGQR--ETLHGDAIQSLFKLLAFTLLVGLLGGTKEDGM-KKEVSRTALKIITSMKRDWMQ

Query:  -----------------------------IKIVHICDATLTKRLIEFENTESGSLTMEEFIVMADKVKGSNSYTNNGLNATSDEVLCVHKNECKKPYALG
                                     + IVHIC+ATLTKRLIEF +T+SG+L + E   + ++     S+T     +  + VLC+H++   KP+  G
Subjt:  -----------------------------IKIVHICDATLTKRLIEFENTESGSLTMEEFIVMADKVKGSNSYTNNGLNATSDEVLCVHKNECKKPYALG

Query:  LCKSCYDDFVELSGGLDGGSNPPAFQSAEKERMEKAMVEEGSNDSSAIGKFSQGLNPCNNTEKESDNVHAD-ASKTVGSKEAEAKGAADEQRGLDEGANK
        LC+ CY DF+ +SGGL GGSNPPAFQ AEKERMEKA  EE             G++  N+ E+    +++D  S +   K+   KG  D+  G +E A+ 
Subjt:  LCKSCYDDFVELSGGLDGGSNPPAFQSAEKERMEKAMVEEGSNDSSAIGKFSQGLNPCNNTEKESDNVHAD-ASKTVGSKEAEAKGAADEQRGLDEGANK

Query:  IGADGLGATASDESDNWSDIDDIEVDGYLHNEEEKHYKKIIWEEMNREYLEEQAAKDAAAAAAKNAYEANFQNCSEDLLAAKDLADAAAAAVAKSRKERQ
                  SDESDN+SDI D EV+GY++NEEE HYK I W EMN++YLEEQAAK+AA  AA  A +A+  NC ED   A+   +AA A  AKSRKE+Q
Subjt:  IGADGLGATASDESDNWSDIDDIEVDGYLHNEEEKHYKKIIWEEMNREYLEEQAAKDAAAAAAKNAYEANFQNCSEDLLAAKDLADAAAAAVAKSRKERQ

Query:  RKRAADAKNASPAQTAAEATRQMLNKKRLSSKINYDVLDKLFDES
        +K+A +AKNA+P  TA EA R+ L+KKRLSS INYDVL+ LFD S
Subjt:  RKRAADAKNASPAQTAAEATRQMLNKKRLSSKINYDVLDKLFDES

AT5G18860.1 inosine-uridine preferring nucleoside hydrolase family protein3.6e-21452.25Show/hide
Query:  RYRQAIPVGEKGRLYANTNFGLRKPFLPQGKRRYIPMKQPTAQQVMKDAISAGPTTVFLMGAHTNLAIFLLSNPHLKKNIKHIYAMGGAIRE-------I
        RYRQAIP G  G L  ++N+G RK FLPQG RRY P++QPTAQ+V+ D IS GPTTV L+G+HTN A+FL+SNPHLK NI+HIY MGG +R         
Subjt:  RYRQAIPVGEKGRLYANTNFGLRKPFLPQGKRRYIPMKQPTAQQVMKDAISAGPTTVFLMGAHTNLAIFLLSNPHLKKNIKHIYAMGGAIRE-------I

Query:  CSESADKSHGKTCNNIGNLWPPNT-NPYAEFNIFGDPFAAYTVLHSGIPVTLVPLDATSTIPVNKKVFLAFEQR-QNTYEAKYCFQSLKMARDTWTGNGF
         + +  +   + C N GNL+   T NPY+EFNIF DPFAAY V HSG+PVTLVPLDAT+TIP+N+K F  FE   Q TYEA+Y F SLK+ARDTW  + F
Subjt:  CSESADKSHGKTCNNIGNLWPPNT-NPYAEFNIFGDPFAAYTVLHSGIPVTLVPLDATSTIPVNKKVFLAFEQR-QNTYEAKYCFQSLKMARDTWTGNGF

Query:  FEIYSMWDSFMVGVALSQM---YNLDRGGGNNAYSKMEYLNITIVTSNKPYGISDGSNPLVDGHLLPKFGVQKNGVHSGHVQTGMLDPFCVVPTEI--GK
        ++ Y MWDSF  GVA+S M    N +   G N +++MEY+NIT+VTSNKPYG SDGSNP  D    PKF +   GVHSGHVQTG+ DP C+  + I  GK
Subjt:  FEIYSMWDSFMVGVALSQM---YNLDRGGGNNAYSKMEYLNITIVTSNKPYGISDGSNPLVDGHLLPKFGVQKNGVHSGHVQTGMLDPFCVVPTEI--GK

Query:  CQDGYTKEADGPESVQVLVAVEAKSTIDANSSIDKAFYISFLDVLNSPRQTGRFDFRAQFPNYREVLYRPKFGKKLLGKP--------------------
        C+DGYT+E  G +SV+VLVA  AK  I+  S +D+ FY+ FL+VLN P +TGRF+F +QFP Y+E L+RP   K   GKP                    
Subjt:  CQDGYTKEADGPESVQVLVAVEAKSTIDANSSIDKAFYISFLDVLNSPRQTGRFDFRAQFPNYREVLYRPKFGKKLLGKP--------------------

Query:  ---------GIIISPNGWATAATIDVVYDVLHMMGRDDISVGLGDVFAIGEAHPSFPPIGDCK-------------------------------YSAESS
                  II+SP GWA AATIDVVYD+LHMMGRDDI VGLGD+ A+ ++ P FPP+G CK                               Y+AE+S
Subjt:  ---------GIIISPNGWATAATIDVVYDVLHMMGRDDISVGLGDVFAIGEAHPSFPPIGDCK-------------------------------YSAESS

Query:  VKYGAFRDTDHPELGQMSALDVWKDVVQSLDLEAKITVLTSGPLTNLAQII-HHKAVSSRIQEVYITGGHIN-YSVDKGNVFTIPSNEYSEFNFFLDPTA
        V +GA RDTD PEL Q  A++VW+++ +S +  +KITVLT+GPLTNLA+II   K  SS I+EVYI GGHIN    DKGN+FTIPSN Y+EFN FLDP A
Subjt:  VKYGAFRDTDHPELGQMSALDVWKDVVQSLDLEAKITVLTSGPLTNLAQII-HHKAVSSRIQEVYITGGHIN-YSVDKGNVFTIPSNEYSEFNFFLDPTA

Query:  ADLVLGSGLNITLIPLNVQRRVSSFHKILKKLKLRNRTPEAWFSRRLLSRLYHLKQKHHQYHHVDMFLGEVLGVVSLAGKHLNLKQTFSFKPLKVVTNGG
        A  VL S LNITL+PL  Q ++SSF  +L +L    +TPEA F +RLL RL  L QKH +Y H+DMFLGEVLG V L G   +LK     + +KV+  G 
Subjt:  ADLVLGSGLNITLIPLNVQRRVSSFHKILKKLKLRNRTPEAWFSRRLLSRLYHLKQKHHQYHHVDMFLGEVLGVVSLAGKHLNLKQTFSFKPLKVVTNGG

Query:  ESKVGQTIIDDKKGKWVRVLESVEPLAFYEDLANALADEKQTAVIGSFEGQKRQWT
        ES+ G+ +ID  +GK +++LE V+ ++  E  A+ L D+KQ+AVIGSFE QK+ W+
Subjt:  ESKVGQTIIDDKKGKWVRVLESVEPLAFYEDLANALADEKQTAVIGSFEGQKRQWT

AT5G18890.1 Inosine-uridine preferring nucleoside hydrolase family protein7.7e-13248.24Show/hide
Query:  NLDRGGGNNAYSKMEYLNITIVTSNKPYGISDGSNPLVDGHLLPKFGVQKNGVHSGHVQTGMLDPFCVVPTEIGKCQDGYTKEADGPESVQVLVAVEAKS
        N +   G N +++MEY+NIT+VTSN+PYG+ D SNP       PKF +   GVHSGHVQ G+ DP C+  +  G C+DGYTKE  GP+SV+VLVA  AK 
Subjt:  NLDRGGGNNAYSKMEYLNITIVTSNKPYGISDGSNPLVDGHLLPKFGVQKNGVHSGHVQTGMLDPFCVVPTEIGKCQDGYTKEADGPESVQVLVAVEAKS

Query:  TIDANSSIDKAFYISFLDVLNSPRQTGRFDFRAQFPNYREVLYRPKF-GKKLLGKP-----------------------------GIIISPNGWATAATI
        + + NS +D+ FY  FL+VLN P +TGRF F  QF  YRE L+  +    +L GKP                              +I+SP GWA  ATI
Subjt:  TIDANSSIDKAFYISFLDVLNSPRQTGRFDFRAQFPNYREVLYRPKF-GKKLLGKP-----------------------------GIIISPNGWATAATI

Query:  DVVYDVLHMMGRDDISVGLGDVFAIGEAHPSFPPIGDCKYSA-----------------------------ESSVKYGAFRDTDHPELGQMSALDVWKDV
        DVVYD+LHMMGRDDI VGLGD+FAI ++ P FP  GDCKY+                              E+SV +GA  DTD PEL Q  AL+VW+++
Subjt:  DVVYDVLHMMGRDDISVGLGDVFAIGEAHPSFPPIGDCKYSA-----------------------------ESSVKYGAFRDTDHPELGQMSALDVWKDV

Query:  VQSLDLEAKITVLTSGPLTNLAQII-HHKAVSSRIQEVYITGGHINY-SVDKGNVFTIPSNEYSEFNFFLDPTAADLVLGSGLNITLIPLNVQRRVSSFH
         +S+D  +KITVLT+GPLT+LA+II   K  SS I+EVYI GGHI+    DKGN+FT+PSN Y+EFN FLDP AA  VL SGLNITLIPL  QR   SF 
Subjt:  VQSLDLEAKITVLTSGPLTNLAQII-HHKAVSSRIQEVYITGGHINY-SVDKGNVFTIPSNEYSEFNFFLDPTAADLVLGSGLNITLIPLNVQRRVSSFH

Query:  KILKKLKLRNRTPEAWFSRRLLSRLYHLKQKHHQYHHVDMFLGEVLGVVSLAGKHLNLKQTFSFKPLKVVTNGGESKVGQTIIDDKKGKWVRVLESVEPL
         +L +L    +TPEA F +RLL+RL  L QK  +Y H+DMFLGE+LG + L G H  LK     + +KV+  G ESK G  +ID  +GK +++LE V+  
Subjt:  KILKKLKLRNRTPEAWFSRRLLSRLYHLKQKHHQYHHVDMFLGEVLGVVSLAGKHLNLKQTFSFKPLKVVTNGGESKVGQTIIDDKKGKWVRVLESVEPL

Query:  AFYEDLANALADEKQTAVIGSFEGQKRQW-TPREELPPRSR
          YE  A+ L D+KQ+AVIGSFE Q+ +W TP    P  +R
Subjt:  AFYEDLANALADEKQTAVIGSFEGQKRQW-TPREELPPRSR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGCGATACAGGCAAGCCATTCCGGTGGGAGAGAAAGGACGTCTCTATGCCAATACCAATTTTGGTTTACGCAAACCCTTTCTTCCTCAGGGTAAGAGGAGATATAT
TCCTATGAAGCAGCCAACTGCACAGCAGGTGATGAAAGATGCAATATCTGCAGGGCCTACTACAGTTTTTCTTATGGGAGCTCATACAAACTTGGCTATATTTCTTCTGT
CAAATCCTCATTTGAAGAAAAATATAAAGCATATATATGCCATGGGTGGTGCTATTAGAGAAATTTGCTCAGAAAGTGCTGACAAATCTCATGGGAAGACATGCAACAAT
ATCGGGAACTTGTGGCCTCCCAATACAAATCCATATGCTGAGTTCAATATATTTGGAGACCCTTTTGCTGCCTATACAGTACTACATTCTGGAATTCCTGTTACACTTGT
TCCTCTGGATGCAACAAGTACCATTCCTGTGAATAAGAAAGTATTTTTGGCATTTGAACAGAGACAAAACACTTATGAGGCTAAATATTGTTTCCAGTCTTTGAAAATGG
CTCGTGATACTTGGACTGGCAATGGATTTTTCGAAATTTATTCGATGTGGGATTCTTTTATGGTTGGTGTAGCTTTGTCCCAAATGTATAATTTGGACAGAGGGGGTGGA
AATAATGCATATTCGAAGATGGAATATCTAAATATCACTATTGTTACATCGAACAAACCTTATGGGATCTCTGATGGATCAAATCCACTTGTTGATGGACATTTGCTCCC
AAAATTTGGTGTCCAAAAGAATGGGGTGCACAGTGGTCATGTTCAGACAGGAATGCTAGATCCATTCTGCGTTGTTCCAACTGAAATAGGAAAATGTCAGGATGGTTATA
CAAAGGAGGCAGATGGCCCTGAATCAGTTCAAGTTCTAGTTGCTGTGGAAGCTAAATCAACTATTGACGCCAACAGCTCAATTGATAAAGCATTTTATATTAGCTTCCTG
GATGTTCTTAATAGTCCACGACAGACGGGGAGGTTTGATTTTAGAGCTCAGTTTCCTAACTACAGGGAAGTATTATATAGACCAAAATTTGGGAAGAAATTACTAGGAAA
ACCGGGAATAATAATTAGCCCAAATGGATGGGCAACGGCTGCAACAATAGATGTTGTATATGATGTATTACATATGATGGGCCGTGACGACATTTCGGTTGGTCTTGGAG
ATGTATTCGCCATTGGAGAAGCACATCCATCATTTCCTCCTATTGGAGACTGCAAATATTCTGCTGAAAGTTCAGTGAAGTATGGAGCATTTAGAGATACTGATCACCCT
GAACTCGGACAAATGTCTGCACTTGATGTTTGGAAAGATGTTGTGCAATCTCTAGATCTGGAGGCAAAAATTACCGTTTTAACCAGCGGACCTTTGACGAATCTCGCTCA
AATCATACATCATAAAGCCGTAAGCTCAAGGATTCAGGAAGTATATATTACTGGAGGACATATAAATTATAGTGTGGACAAGGGAAATGTGTTCACTATACCTTCAAATG
AATATTCAGAATTCAACTTCTTTCTGGACCCTACAGCTGCCGATTTGGTTTTGGGCTCAGGATTGAACATCACACTTATCCCGTTGAACGTGCAACGAAGAGTAAGTTCT
TTTCACAAGATCCTCAAAAAGTTGAAGCTCAGAAACAGAACTCCCGAAGCATGGTTCTCTCGACGTCTACTCTCCAGGCTATATCACCTTAAGCAGAAGCATCATCAATA
CCATCATGTGGACATGTTCTTAGGAGAAGTCCTTGGTGTAGTGAGCTTAGCTGGAAAACATTTGAATCTGAAGCAAACATTCAGCTTCAAGCCTCTCAAGGTAGTCACAA
ATGGAGGTGAATCAAAAGTAGGGCAAACAATTATAGATGATAAGAAAGGGAAGTGGGTGAGAGTTTTGGAGAGTGTTGAACCACTTGCATTTTATGAAGATCTTGCAAAC
GCATTGGCTGATGAAAAGCAAACTGCTGTTATTGGAAGCTTTGAGGGGCAAAAGAGGCAGTGGACCCCACGCGAAGAACTCCCTCCCCGCAGTCGTCAACTCTCAACCTC
GCTCACGTCAGTCGCCGGTCGTCCGCCGTCCCCGCCGCTGGGCTCTGTCGCCGTCCGTCGTGCTCAGACCAACGTTGCCGTTGCTGGTGCTGAGTTTCTCGTGGATCTCT
TTCTCTCTCTTCCGCGCGTTTGTATTCATATGCCAATTTTTATGTTTACTGAATTACTTCAATTTTCAGTTCAGCTGTTTACCGACTGTTGTAGATACTGTGATATGTGT
GGGAAGGTGTTGGATTTTTACAATTTCTCTCAAGATCCTACTTTTACAAAGGATTCTGGAGGGCAGAGAGAAACTTTACACGGGGACGCAATACAGAGTTTGTTCAAGCT
GCTTGCCTTTACATTGCTTGTCGGTTTGCTTGGAGGAACAAAGGAGGATGGAATGAAGAAGGAGGTTTCCAGAACTGCATTGAAGATCATAACCAGCATGAAGCGTGATT
GGATGCAGATAAAGATAGTGCATATTTGTGATGCAACATTGACAAAGCGGTTAATTGAGTTCGAGAATACAGAATCTGGCAGCTTAACGATGGAGGAATTTATTGTCATG
GCAGACAAGGTCAAGGGAAGCAACTCTTATACTAATAATGGCTTGAATGCAACGAGTGATGAAGTACTTTGCGTGCATAAGAATGAGTGCAAAAAGCCATATGCTCTCGG
TCTTTGTAAAAGCTGTTATGATGATTTTGTTGAACTGTCTGGAGGACTAGATGGGGGATCAAATCCTCCAGCATTTCAGTCTGCTGAGAAAGAAAGAATGGAAAAAGCAA
TGGTTGAGGAAGGTTCTAATGACTCTAGTGCTATTGGCAAATTTTCACAAGGACTAAACCCATGCAACAATACTGAAAAAGAATCGGACAATGTACACGCAGATGCATCA
AAAACTGTTGGTTCTAAAGAAGCTGAAGCGAAAGGAGCTGCAGATGAGCAAAGAGGACTTGATGAAGGTGCTAATAAAATTGGAGCTGATGGCCTGGGAGCCACTGCTTC
AGATGAATCAGATAATTGGTCTGATATAGATGATATTGAGGTTGACGGCTATCTTCACAATGAGGAGGAGAAGCATTACAAAAAGATAATTTGGGAAGAGATGAACCGGG
AGTATCTTGAGGAACAAGCAGCTAAGGACGCAGCAGCAGCTGCTGCAAAGAATGCTTATGAGGCCAATTTCCAAAATTGTTCAGAAGATTTGTTGGCAGCAAAGGATCTT
GCAGATGCTGCTGCGGCTGCTGTTGCAAAATCTAGAAAGGAAAGACAACGAAAACGAGCAGCAGATGCCAAAAATGCCTCTCCTGCACAAACGGCAGCAGAAGCCACCCG
ACAAATGTTAAATAAAAAGAGACTCAGCTCGAAAATCAACTATGATGTGTTGGACAAACTTTTTGACGAATCTGACGGGATGCTTGCAGCTAAAGTTGAACTGAAACTTA
GGCTGAAGATTGATGATTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCGCGATACAGGCAAGCCATTCCGGTGGGAGAGAAAGGACGTCTCTATGCCAATACCAATTTTGGTTTACGCAAACCCTTTCTTCCTCAGGGTAAGAGGAGATATAT
TCCTATGAAGCAGCCAACTGCACAGCAGGTGATGAAAGATGCAATATCTGCAGGGCCTACTACAGTTTTTCTTATGGGAGCTCATACAAACTTGGCTATATTTCTTCTGT
CAAATCCTCATTTGAAGAAAAATATAAAGCATATATATGCCATGGGTGGTGCTATTAGAGAAATTTGCTCAGAAAGTGCTGACAAATCTCATGGGAAGACATGCAACAAT
ATCGGGAACTTGTGGCCTCCCAATACAAATCCATATGCTGAGTTCAATATATTTGGAGACCCTTTTGCTGCCTATACAGTACTACATTCTGGAATTCCTGTTACACTTGT
TCCTCTGGATGCAACAAGTACCATTCCTGTGAATAAGAAAGTATTTTTGGCATTTGAACAGAGACAAAACACTTATGAGGCTAAATATTGTTTCCAGTCTTTGAAAATGG
CTCGTGATACTTGGACTGGCAATGGATTTTTCGAAATTTATTCGATGTGGGATTCTTTTATGGTTGGTGTAGCTTTGTCCCAAATGTATAATTTGGACAGAGGGGGTGGA
AATAATGCATATTCGAAGATGGAATATCTAAATATCACTATTGTTACATCGAACAAACCTTATGGGATCTCTGATGGATCAAATCCACTTGTTGATGGACATTTGCTCCC
AAAATTTGGTGTCCAAAAGAATGGGGTGCACAGTGGTCATGTTCAGACAGGAATGCTAGATCCATTCTGCGTTGTTCCAACTGAAATAGGAAAATGTCAGGATGGTTATA
CAAAGGAGGCAGATGGCCCTGAATCAGTTCAAGTTCTAGTTGCTGTGGAAGCTAAATCAACTATTGACGCCAACAGCTCAATTGATAAAGCATTTTATATTAGCTTCCTG
GATGTTCTTAATAGTCCACGACAGACGGGGAGGTTTGATTTTAGAGCTCAGTTTCCTAACTACAGGGAAGTATTATATAGACCAAAATTTGGGAAGAAATTACTAGGAAA
ACCGGGAATAATAATTAGCCCAAATGGATGGGCAACGGCTGCAACAATAGATGTTGTATATGATGTATTACATATGATGGGCCGTGACGACATTTCGGTTGGTCTTGGAG
ATGTATTCGCCATTGGAGAAGCACATCCATCATTTCCTCCTATTGGAGACTGCAAATATTCTGCTGAAAGTTCAGTGAAGTATGGAGCATTTAGAGATACTGATCACCCT
GAACTCGGACAAATGTCTGCACTTGATGTTTGGAAAGATGTTGTGCAATCTCTAGATCTGGAGGCAAAAATTACCGTTTTAACCAGCGGACCTTTGACGAATCTCGCTCA
AATCATACATCATAAAGCCGTAAGCTCAAGGATTCAGGAAGTATATATTACTGGAGGACATATAAATTATAGTGTGGACAAGGGAAATGTGTTCACTATACCTTCAAATG
AATATTCAGAATTCAACTTCTTTCTGGACCCTACAGCTGCCGATTTGGTTTTGGGCTCAGGATTGAACATCACACTTATCCCGTTGAACGTGCAACGAAGAGTAAGTTCT
TTTCACAAGATCCTCAAAAAGTTGAAGCTCAGAAACAGAACTCCCGAAGCATGGTTCTCTCGACGTCTACTCTCCAGGCTATATCACCTTAAGCAGAAGCATCATCAATA
CCATCATGTGGACATGTTCTTAGGAGAAGTCCTTGGTGTAGTGAGCTTAGCTGGAAAACATTTGAATCTGAAGCAAACATTCAGCTTCAAGCCTCTCAAGGTAGTCACAA
ATGGAGGTGAATCAAAAGTAGGGCAAACAATTATAGATGATAAGAAAGGGAAGTGGGTGAGAGTTTTGGAGAGTGTTGAACCACTTGCATTTTATGAAGATCTTGCAAAC
GCATTGGCTGATGAAAAGCAAACTGCTGTTATTGGAAGCTTTGAGGGGCAAAAGAGGCAGTGGACCCCACGCGAAGAACTCCCTCCCCGCAGTCGTCAACTCTCAACCTC
GCTCACGTCAGTCGCCGGTCGTCCGCCGTCCCCGCCGCTGGGCTCTGTCGCCGTCCGTCGTGCTCAGACCAACGTTGCCGTTGCTGGTGCTGAGTTTCTCGTGGATCTCT
TTCTCTCTCTTCCGCGCGTTTGTATTCATATGCCAATTTTTATGTTTACTGAATTACTTCAATTTTCAGTTCAGCTGTTTACCGACTGTTGTAGATACTGTGATATGTGT
GGGAAGGTGTTGGATTTTTACAATTTCTCTCAAGATCCTACTTTTACAAAGGATTCTGGAGGGCAGAGAGAAACTTTACACGGGGACGCAATACAGAGTTTGTTCAAGCT
GCTTGCCTTTACATTGCTTGTCGGTTTGCTTGGAGGAACAAAGGAGGATGGAATGAAGAAGGAGGTTTCCAGAACTGCATTGAAGATCATAACCAGCATGAAGCGTGATT
GGATGCAGATAAAGATAGTGCATATTTGTGATGCAACATTGACAAAGCGGTTAATTGAGTTCGAGAATACAGAATCTGGCAGCTTAACGATGGAGGAATTTATTGTCATG
GCAGACAAGGTCAAGGGAAGCAACTCTTATACTAATAATGGCTTGAATGCAACGAGTGATGAAGTACTTTGCGTGCATAAGAATGAGTGCAAAAAGCCATATGCTCTCGG
TCTTTGTAAAAGCTGTTATGATGATTTTGTTGAACTGTCTGGAGGACTAGATGGGGGATCAAATCCTCCAGCATTTCAGTCTGCTGAGAAAGAAAGAATGGAAAAAGCAA
TGGTTGAGGAAGGTTCTAATGACTCTAGTGCTATTGGCAAATTTTCACAAGGACTAAACCCATGCAACAATACTGAAAAAGAATCGGACAATGTACACGCAGATGCATCA
AAAACTGTTGGTTCTAAAGAAGCTGAAGCGAAAGGAGCTGCAGATGAGCAAAGAGGACTTGATGAAGGTGCTAATAAAATTGGAGCTGATGGCCTGGGAGCCACTGCTTC
AGATGAATCAGATAATTGGTCTGATATAGATGATATTGAGGTTGACGGCTATCTTCACAATGAGGAGGAGAAGCATTACAAAAAGATAATTTGGGAAGAGATGAACCGGG
AGTATCTTGAGGAACAAGCAGCTAAGGACGCAGCAGCAGCTGCTGCAAAGAATGCTTATGAGGCCAATTTCCAAAATTGTTCAGAAGATTTGTTGGCAGCAAAGGATCTT
GCAGATGCTGCTGCGGCTGCTGTTGCAAAATCTAGAAAGGAAAGACAACGAAAACGAGCAGCAGATGCCAAAAATGCCTCTCCTGCACAAACGGCAGCAGAAGCCACCCG
ACAAATGTTAAATAAAAAGAGACTCAGCTCGAAAATCAACTATGATGTGTTGGACAAACTTTTTGACGAATCTGACGGGATGCTTGCAGCTAAAGTTGAACTGAAACTTA
GGCTGAAGATTGATGATTGA
Protein sequenceShow/hide protein sequence
MARYRQAIPVGEKGRLYANTNFGLRKPFLPQGKRRYIPMKQPTAQQVMKDAISAGPTTVFLMGAHTNLAIFLLSNPHLKKNIKHIYAMGGAIREICSESADKSHGKTCNN
IGNLWPPNTNPYAEFNIFGDPFAAYTVLHSGIPVTLVPLDATSTIPVNKKVFLAFEQRQNTYEAKYCFQSLKMARDTWTGNGFFEIYSMWDSFMVGVALSQMYNLDRGGG
NNAYSKMEYLNITIVTSNKPYGISDGSNPLVDGHLLPKFGVQKNGVHSGHVQTGMLDPFCVVPTEIGKCQDGYTKEADGPESVQVLVAVEAKSTIDANSSIDKAFYISFL
DVLNSPRQTGRFDFRAQFPNYREVLYRPKFGKKLLGKPGIIISPNGWATAATIDVVYDVLHMMGRDDISVGLGDVFAIGEAHPSFPPIGDCKYSAESSVKYGAFRDTDHP
ELGQMSALDVWKDVVQSLDLEAKITVLTSGPLTNLAQIIHHKAVSSRIQEVYITGGHINYSVDKGNVFTIPSNEYSEFNFFLDPTAADLVLGSGLNITLIPLNVQRRVSS
FHKILKKLKLRNRTPEAWFSRRLLSRLYHLKQKHHQYHHVDMFLGEVLGVVSLAGKHLNLKQTFSFKPLKVVTNGGESKVGQTIIDDKKGKWVRVLESVEPLAFYEDLAN
ALADEKQTAVIGSFEGQKRQWTPREELPPRSRQLSTSLTSVAGRPPSPPLGSVAVRRAQTNVAVAGAEFLVDLFLSLPRVCIHMPIFMFTELLQFSVQLFTDCCRYCDMC
GKVLDFYNFSQDPTFTKDSGGQRETLHGDAIQSLFKLLAFTLLVGLLGGTKEDGMKKEVSRTALKIITSMKRDWMQIKIVHICDATLTKRLIEFENTESGSLTMEEFIVM
ADKVKGSNSYTNNGLNATSDEVLCVHKNECKKPYALGLCKSCYDDFVELSGGLDGGSNPPAFQSAEKERMEKAMVEEGSNDSSAIGKFSQGLNPCNNTEKESDNVHADAS
KTVGSKEAEAKGAADEQRGLDEGANKIGADGLGATASDESDNWSDIDDIEVDGYLHNEEEKHYKKIIWEEMNREYLEEQAAKDAAAAAAKNAYEANFQNCSEDLLAAKDL
ADAAAAAVAKSRKERQRKRAADAKNASPAQTAAEATRQMLNKKRLSSKINYDVLDKLFDESDGMLAAKVELKLRLKIDD