; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr011947 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr011947
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionTSC22 domain family protein 1-like isoform X1
Genome locationtig00153148:139947..145405
RNA-Seq ExpressionSgr011947
SyntenySgr011947
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0053895.1 Filamentous hemagglutinin [Cucumis melo var. makuwa]1.5e-19776.64Show/hide
Query:  MGKTEEEQPLPVGV-SSELSDRNVGSRC-GGGCCGIRRLIAVRCVFFLLLSTAVFLSAIFWLPPFLSYADRPDPGRDPQYRDHEIVASFHAWKPVSFLEN
        MGK+EEEQPLPVGV SSELSDRNV +RC GGGC  IR+LIAVRCVFFLLLS AVFLSAIFWLPPFLSY + PD   D  YRDH+IVASFHAWKPV FL+N
Subjt:  MGKTEEEQPLPVGV-SSELSDRNVGSRC-GGGCCGIRRLIAVRCVFFLLLSTAVFLSAIFWLPPFLSYADRPDPGRDPQYRDHEIVASFHAWKPVSFLEN

Query:  HILELEDSIFGEIPVPFAK---------------------------VAILALQSLGGLNVTKIVFAVDPDTKYSKIPPTSQSLIKETFETMVINQPPLSL
        HI ELED+IFGEIP+P  K                           VAIL+LQSL G NVTKIVFAVD D KYSKIPPTSQSLIKETFET+VIN+PPL L
Subjt:  HILELEDSIFGEIPVPFAK---------------------------VAILALQSLGGLNVTKIVFAVDPDTKYSKIPPTSQSLIKETFETMVINQPPLSL

Query:  AASLFGNTSSFEVLKFPGGITIIPPQSAFLLQTAQIYFNFTLNYSIYQIQLNFDDLTGQLRSGLHLSRYENLYVSLSNARGSTMDAPTIVQSSVLMAIGT
          SLFGNTS FEVLKFPGGITIIPPQSAFLLQTAQIYFNFTLNYSIYQIQ+NFDDL+ QLRSGL LS YENLYVSLSN RGSTMDAPT+VQSSVLMAIGT
Subjt:  AASLFGNTSSFEVLKFPGGITIIPPQSAFLLQTAQIYFNFTLNYSIYQIQLNFDDLTGQLRSGLHLSRYENLYVSLSNARGSTMDAPTIVQSSVLMAIGT

Query:  N--SSKQRLKQLAQTITNSHSGNLGLNNTTFGKVKQVRLSSVLNHSLSGGGGARSPSPAPLPHHHHHQHHHHHHHHHHHHHHHHHHHHHHH-----DAAY
        N  SSKQRLKQLA TITNSHSGNLGLNNT FGKVKQVRL S LNHSL GGG A SPSPAPLPH HHH HHHHHHHHHHHH HHHHHHHHHH      AAY
Subjt:  N--SSKQRLKQLAQTITNSHSGNLGLNNTTFGKVKQVRLSSVLNHSLSGGGGARSPSPAPLPHHHHHQHHHHHHHHHHHHHHHHHHHHHHH-----DAAY

Query:  SPSP------HAPQNG-ASAPKAGSSAPVESPTPKKRNYEATPPACQYGYKRSSTKVRKQSHLGPIHSPSSPPLSPDLQVGPPAPVSDSISSSSALPGVV
        SPSP      HAP+NG +SAP+AGSS P+E PT +KRNYEATPPA +YGYKRSSTK+RKQ HLGPI SPSS P SP L+VG PAPVSDSIS+SS L GVV
Subjt:  SPSP------HAPQNG-ASAPKAGSSAPVESPTPKKRNYEATPPACQYGYKRSSTKVRKQSHLGPIHSPSSPPLSPDLQVGPPAPVSDSISSSSALPGVV

Query:  LSNVQPPDKGSRHAENFERRAPSVLPPQFSSGSDG
        LSNVQPP+ GS HAENFER +PSVLPPQFS    G
Subjt:  LSNVQPPDKGSRHAENFERRAPSVLPPQFSSGSDG

XP_004136773.3 uncharacterized protein LOC101213172 isoform X1 [Cucumis sativus]1.1e-20080.79Show/hide
Query:  MGKTEEEQPLPV-GVSSELSDRNVGSRC-GGGCCGIRRLIAVRCVFFLLLSTAVFLSAIFWLPPFLSYADRPDPGRDPQYRDHEIVASFHAWKPVSFLEN
        MGK+EEEQPLPV G SSELSDRNV +RC GGGC  IRRLIAVRCVFFLLLS AVFLSAIFWLPPFLSY + PD   D  YRDH+IVASFHA KPV FL+ 
Subjt:  MGKTEEEQPLPV-GVSSELSDRNVGSRC-GGGCCGIRRLIAVRCVFFLLLSTAVFLSAIFWLPPFLSYADRPDPGRDPQYRDHEIVASFHAWKPVSFLEN

Query:  HILELEDSIFGEIPVPFAKVAILALQSLGGLNVTKIVFAVDPDTKYSKIPPTSQSLIKETFETMVINQPPLSLAASLFGNTSSFEVLKFPGGITIIPPQS
        HI ELED+IFGEIP+P  KVAIL+LQSLGG NVTKIVFAVD D KYSKIPPTSQSLIKETFET+VIN+PPL L  SLFGNTS FEVLKFPGGITIIPPQS
Subjt:  HILELEDSIFGEIPVPFAKVAILALQSLGGLNVTKIVFAVDPDTKYSKIPPTSQSLIKETFETMVINQPPLSLAASLFGNTSSFEVLKFPGGITIIPPQS

Query:  AFLLQTAQIYFNFTLNYSIYQIQLNFDDLTGQLRSGLHLSRYENLYVSLSNARGSTMDAPTIVQSSVLMAIGTN--SSKQRLKQLAQTITNSHSGNLGLN
        AFLLQTAQIYFNFTLNYSIYQIQ+NFDDL+ QLRSGL LS YENLYVSLSN RGST+DAPT+VQSSVLMAIGTN  SSKQRLKQLA TITNSHSGNLGLN
Subjt:  AFLLQTAQIYFNFTLNYSIYQIQLNFDDLTGQLRSGLHLSRYENLYVSLSNARGSTMDAPTIVQSSVLMAIGTN--SSKQRLKQLAQTITNSHSGNLGLN

Query:  NTTFGKVKQVRLSSVLNHSLSGGGGARSPSPAPLPHHHHHQHHHHHHHHHHHHHHHHHHHHHHHDAAYSPSP------HAPQNG-ASAPKAGSSAPVESP
        NT FGKVKQVRL S LNHSL GGG ARSPSPAPLPH HHH+HHHHHHHHHHHHHHHHHHHHHH DAAYSPSP      HAP+NG +SAP+AGSS P+E P
Subjt:  NTTFGKVKQVRLSSVLNHSLSGGGGARSPSPAPLPHHHHHQHHHHHHHHHHHHHHHHHHHHHHHDAAYSPSP------HAPQNG-ASAPKAGSSAPVESP

Query:  TPKKRNYEATPPACQYGYKRSSTKVRKQSHLGPIHSPSSPPLSPDLQVGPPAPVSDSISSSSALPGVVLSNVQPPDKGSRHAENFERRAPSVLPPQFSSG
        T +KRNYEATPPA +YGYKRS TK+RK  +LGPI SPSS P SP L+VG PAPVSDSIS+SS L GVVLSNVQPP+ GS HAENFER +PSVLPPQFSS 
Subjt:  TPKKRNYEATPPACQYGYKRSSTKVRKQSHLGPIHSPSSPPLSPDLQVGPPAPVSDSISSSSALPGVVLSNVQPPDKGSRHAENFERRAPSVLPPQFSSG

Query:  SDGRL
        +  R+
Subjt:  SDGRL

XP_008443610.1 PREDICTED: uncharacterized protein LOC103487165 [Cucumis melo]1.0e-20177.92Show/hide
Query:  MGKTEEEQPLPVGV-SSELSDRNVGSRC-GGGCCGIRRLIAVRCVFFLLLSTAVFLSAIFWLPPFLSYADRPDPGRDPQYRDHEIVASFHAWKPVSFLEN
        MGK+EEEQPLPVGV SSELSDRNV +RC GGGC  IR+LIAVRCVFFLLLS AVFLSAIFWLPPFLSY + PD   D  YRDH+IVASFHAWKPV FL+N
Subjt:  MGKTEEEQPLPVGV-SSELSDRNVGSRC-GGGCCGIRRLIAVRCVFFLLLSTAVFLSAIFWLPPFLSYADRPDPGRDPQYRDHEIVASFHAWKPVSFLEN

Query:  HILELEDSIFGEIPVPFAKVAILALQSLGGLNVTKIVFAVDPDTKYSKIPPTSQSLIKETFETMVINQPPLSLAASLFGNTSSFEVLKFPGGITIIPPQS
        HI ELED+IFGEIP+P  KVAIL+LQSL G NVTKIVFAVD D KYSKIPPTSQSLIKETFET+VIN+PPL L  SLFGNTS FEVLKFPGGITIIPPQS
Subjt:  HILELEDSIFGEIPVPFAKVAILALQSLGGLNVTKIVFAVDPDTKYSKIPPTSQSLIKETFETMVINQPPLSLAASLFGNTSSFEVLKFPGGITIIPPQS

Query:  AFLLQTAQIYFNFTLNYSIYQIQLNFDDLTGQLRSGLHLSRYENLYVSLSNARGSTMDAPTIVQSSVLMAIGTN--SSKQRLKQLAQTITNSHSGNLGLN
        AFLLQTAQIYFNFTLNYSIYQIQ+NFDDL+ QLRSGL LS YENLYVSLSN RGSTMDAPT+VQSSVLMAIGTN  SSKQRLKQLA TITNSHSGNLGLN
Subjt:  AFLLQTAQIYFNFTLNYSIYQIQLNFDDLTGQLRSGLHLSRYENLYVSLSNARGSTMDAPTIVQSSVLMAIGTN--SSKQRLKQLAQTITNSHSGNLGLN

Query:  NTTFGKVKQVRLSSVLNHSLSGGGGARSPSPAPLPHHHHHQHHHHHHHHHHHHHHHHHHHHHHH-------------------------DAAYSPSP---
        NT FGKVKQVRL S LNHSL GGG A SPSPAPLPH HHH HHHHHHHHHHHHHHHHHHHHHHH                          AAYSPSP   
Subjt:  NTTFGKVKQVRLSSVLNHSLSGGGGARSPSPAPLPHHHHHQHHHHHHHHHHHHHHHHHHHHHHH-------------------------DAAYSPSP---

Query:  ---HAPQNG-ASAPKAGSSAPVESPTPKKRNYEATPPACQYGYKRSSTKVRKQSHLGPIHSPSSPPLSPDLQVGPPAPVSDSISSSSALPGVVLSNVQPP
           HAP+NG +SAP+AGSS P+E PT +KRNYEATPPA +YGYKRSSTK+RKQ HLGPI SPSS P SP L+VG PAPVSDSIS+SS L GVVLSNVQPP
Subjt:  ---HAPQNG-ASAPKAGSSAPVESPTPKKRNYEATPPACQYGYKRSSTKVRKQSHLGPIHSPSSPPLSPDLQVGPPAPVSDSISSSSALPGVVLSNVQPP

Query:  DKGSRHAENFERRAPSVLPPQFSSGSDGRL
        + GS HAENFER +PSVLPPQFSS +D R+
Subjt:  DKGSRHAENFERRAPSVLPPQFSSGSDGRL

XP_022147793.1 TSC22 domain family protein 1-like isoform X1 [Momordica charantia]6.3e-21284.44Show/hide
Query:  MGKTEEEQPLPVGV-SSELSDRNVGSRCGGGCCGIRRLIAVRCVFFLLLSTAVFLSAIFWLPPFLSYADRPDPGRDPQYRDHEIVASFHAWKPVSFLENH
        MGKTEEEQPLPVG+ SSELSD NVGSRCG    GIRRLIAVRCVFFLLLS AVF+SA FWLPPFLS  D PD   D  YRDHEIVASFHAWKPVSFL+NH
Subjt:  MGKTEEEQPLPVGV-SSELSDRNVGSRCGGGCCGIRRLIAVRCVFFLLLSTAVFLSAIFWLPPFLSYADRPDPGRDPQYRDHEIVASFHAWKPVSFLENH

Query:  ILELEDSIFGEIPVPFAKVAILALQSLGGLNVTKIVFAVDPDTKYSKIPPTSQSLIKETFETMVINQPPLSLAASLFGNTSSFEVLKFPGGITIIPPQSA
        ILELED+IFGEIPVPF KV IL+LQSLGGLNVTKIVFAVDPD KYSKIPPTSQSLIKE FETM IN+PPL+L A+LFGNTS FEVLKFPGGITIIPPQSA
Subjt:  ILELEDSIFGEIPVPFAKVAILALQSLGGLNVTKIVFAVDPDTKYSKIPPTSQSLIKETFETMVINQPPLSLAASLFGNTSSFEVLKFPGGITIIPPQSA

Query:  FLLQTAQIYFNFTLNYSIYQIQLNFDDLTGQLRSGLHLSRYENLYVSLSNARGSTMDAPTIVQSSVLMAIGTNSSKQRLKQLAQTITNSHSGNLGLNNTT
        FLLQTAQIYFNFTLNYSIYQIQ+NFDDLT QLRSGLHLS YENLYVSLSNARGST+DAPTI+QSSVLMAIGTNSSK+RLKQLAQTIT+SHSGNLGLNNT 
Subjt:  FLLQTAQIYFNFTLNYSIYQIQLNFDDLTGQLRSGLHLSRYENLYVSLSNARGSTMDAPTIVQSSVLMAIGTNSSKQRLKQLAQTITNSHSGNLGLNNTT

Query:  FGKVKQVRLSSVLNHSLSGGGGARSPSPAPLPHHHHHQHHHHHHHHHHHHHHHHHHHHHHHDAAYSPSP------HAPQNGASAPKAGSSAPVESPTPKK
        FGKVKQVRLSSVLNHSLSGG  ARSP+P+PLPH H   HHHHHHHHHHHHHHHHHHHHHHHDA+YSPSP      H P NG S+  A  SAPVESPTPKK
Subjt:  FGKVKQVRLSSVLNHSLSGGGGARSPSPAPLPHHHHHQHHHHHHHHHHHHHHHHHHHHHHHDAAYSPSP------HAPQNGASAPKAGSSAPVESPTPKK

Query:  RNYEATPPACQYGYKRSSTKVRKQSHLGPIHSPSSPPLSPDLQVGPPAPVSDSISSSSALPGVVLSNVQPPDKGSRHAENFERRAPSVLPPQFSS
        RN EATPPACQYGYKRSST+VRK+ HLGPIHS SSPP SP  +VGPPAPVS SIS+SS L  VVLSNVQPPDKGSRHAENFERRAPSVLP QFSS
Subjt:  RNYEATPPACQYGYKRSSTKVRKQSHLGPIHSPSSPPLSPDLQVGPPAPVSDSISSSSALPGVVLSNVQPPDKGSRHAENFERRAPSVLPPQFSS

XP_023528289.1 uncharacterized protein LOC111791252 [Cucurbita pepo subsp. pepo]1.1e-19780.16Show/hide
Query:  MGKTEEEQPLPVGV-SSELSDRNVGSRC-GGGCCGIRRLIAVRCVFFLLLSTAVFLSAIFWLPPFLSYADRPDPGRDPQYRDHEIVASFHAWKPVSFLEN
        MGK+EEEQPLPVGV SSELSD  V SRC GGGC  IRRLIAVRCVFFLLLS AVFLSA+FWLPPFLSY D PD   D  YRDHEIVA F A KPV FL+N
Subjt:  MGKTEEEQPLPVGV-SSELSDRNVGSRC-GGGCCGIRRLIAVRCVFFLLLSTAVFLSAIFWLPPFLSYADRPDPGRDPQYRDHEIVASFHAWKPVSFLEN

Query:  HILELEDSIFGEIPVPFAKVAILALQSLGGLNVTKIVFAVDPDTKYSKIPPTSQSLIKETFETMVINQPPLSLAASLFGNTSSFEVLKFPGGITIIPPQS
        HI ELED+IFGEIPVPF KVA+L+LQSLGG NVT I+F+VDPD KYSKIPPTSQSLIKETFET+VIN PPL L ASLFGNTS FEVLKFPGGITIIPPQS
Subjt:  HILELEDSIFGEIPVPFAKVAILALQSLGGLNVTKIVFAVDPDTKYSKIPPTSQSLIKETFETMVINQPPLSLAASLFGNTSSFEVLKFPGGITIIPPQS

Query:  AFLLQTAQIYFNFTLNYSIYQIQLNFDDLTGQLRSGLHLSRYENLYVSLSNARGSTMDAPTIVQSSVLMAIGTNSSKQRLKQLAQTITNSHSGNLGLNNT
        AFLLQTAQIYFNFTLNYSI+QIQ+NFDDLT QLRSGL LSRYENLYVSLSN RGSTM APTIVQSSVLMAIGTNSS QRLKQLAQTITNSHSGNLGLNNT
Subjt:  AFLLQTAQIYFNFTLNYSIYQIQLNFDDLTGQLRSGLHLSRYENLYVSLSNARGSTMDAPTIVQSSVLMAIGTNSSKQRLKQLAQTITNSHSGNLGLNNT

Query:  TFGKVKQVRLSSVLNHSLSGGGGARSPSPAPLP--------HHHHHQHHHHHHHHHHHHHHHHHHHHHHH-----DAAYSPSP------HAPQNG-ASAP
         FGKVKQVRLSSVLNHSLS GG ARSPSPAPLP        HHHHHQHHHHHHHHHHHHHHHHHHHHHHH     DAAYSPSP      HAP+NG +SAP
Subjt:  TFGKVKQVRLSSVLNHSLSGGGGARSPSPAPLP--------HHHHHQHHHHHHHHHHHHHHHHHHHHHHH-----DAAYSPSP------HAPQNG-ASAP

Query:  KAGSSAPVESPTPKKRNYEATPPACQYGYKRSSTKVRKQSHLGPIHSPSSPPLSPDLQVGPPAPVSDSISSSSALPGVVLSNVQPPDKGSRHAENFERRA
        +AGSS PVESP  KKRNY+ATPP  +YGYK  STKVRK+SHLG I SPSSPP SP L+VG PAPV+ SIS+SS LPGV LSNVQPP+KG       +R A
Subjt:  KAGSSAPVESPTPKKRNYEATPPACQYGYKRSSTKVRKQSHLGPIHSPSSPPLSPDLQVGPPAPVSDSISSSSALPGVVLSNVQPPDKGSRHAENFERRA

Query:  PSVLPPQFS
        PSVLPPQFS
Subjt:  PSVLPPQFS

TrEMBL top hitse value%identityAlignment
A0A0A0LHD1 Uncharacterized protein6.2e-19779.21Show/hide
Query:  MGKTEEEQPLPV-GVSSELSDRNVGSRC-GGGCCGIRRLIAVRCVFFLLLSTAVFLSAIFWLPPFLSYADRPDPGRDPQYRDHEIVASFHAWKPVSFLEN
        MGK+EEEQPLPV G SSELSDRNV +RC GGGC  IRRLIAVRCVFFLLLS AVFLSAIFWLPPFLSY + PD   D  YRDH+IVASFHA KPV FL+ 
Subjt:  MGKTEEEQPLPV-GVSSELSDRNVGSRC-GGGCCGIRRLIAVRCVFFLLLSTAVFLSAIFWLPPFLSYADRPDPGRDPQYRDHEIVASFHAWKPVSFLEN

Query:  HILELEDSIFGEIPVPFAKVAILALQSLGGLNVTKIVFAVDPDTKYSKIPPTSQSLIKETFETMVINQPPLSLAASLFGNTSSFEVLKFPGGITIIPPQS
        HI ELED+IFGEIP+P  KVAIL+LQSLGG NVTKIVFAVD D KYSKIPPTSQSLIKETFET+VIN+PPL L  SLFGNTS FEVLKFPGGITIIPPQS
Subjt:  HILELEDSIFGEIPVPFAKVAILALQSLGGLNVTKIVFAVDPDTKYSKIPPTSQSLIKETFETMVINQPPLSLAASLFGNTSSFEVLKFPGGITIIPPQS

Query:  AFLLQTAQIYFNFTLNYSIYQIQLNFDDLTGQLRSGLHLSRYENLYVSLSNARGSTMDAPTIVQSSVLMAIGTN--SSKQRLKQLAQTITNSHSGNLGLN
        AFLLQTAQIYFNFTLNYSIYQIQ+NFDDL+ QLRSGL LS YENLYVSLSN RGST+DAPT+VQSSVLMAIGTN  SSKQRLKQLA TITNSHSGNLGLN
Subjt:  AFLLQTAQIYFNFTLNYSIYQIQLNFDDLTGQLRSGLHLSRYENLYVSLSNARGSTMDAPTIVQSSVLMAIGTN--SSKQRLKQLAQTITNSHSGNLGLN

Query:  NTTFGKVKQVRLSSVLNHSLSGGGGARSPSPAPLPHHHHHQHHHHHHHHHHHHHHHHHHHHHHHDAAYSPSP------HAPQNG-ASAPKAGSSAPVESP
        NT FGKVKQVRL S LNHSL GGG ARSPSPAPLPH HHH+        HHHHHHHHHHHHHH DAAYSPSP      HAP+NG +SAP+AGSS P+E P
Subjt:  NTTFGKVKQVRLSSVLNHSLSGGGGARSPSPAPLPHHHHHQHHHHHHHHHHHHHHHHHHHHHHHDAAYSPSP------HAPQNG-ASAPKAGSSAPVESP

Query:  TPKKRNYEATPPACQYGYKRSSTKVRKQSHLGPIHSPSSPPLSPDLQVGPPAPVSDSISSSSALPGVVLSNVQPPDKGSRHAENFERRAPSVLPPQFSSG
        T +KRNYEATPPA +YGYKRS TK+RK  +LGPI SPSS P SP L+VG PAPVSDSIS+SS L GVVLSNVQPP+ GS HAENFER +PSVLPPQFSS 
Subjt:  TPKKRNYEATPPACQYGYKRSSTKVRKQSHLGPIHSPSSPPLSPDLQVGPPAPVSDSISSSSALPGVVLSNVQPPDKGSRHAENFERRAPSVLPPQFSSG

Query:  SDGRL
        +  R+
Subjt:  SDGRL

A0A1S3B8E9 uncharacterized protein LOC1034871654.9e-20277.92Show/hide
Query:  MGKTEEEQPLPVGV-SSELSDRNVGSRC-GGGCCGIRRLIAVRCVFFLLLSTAVFLSAIFWLPPFLSYADRPDPGRDPQYRDHEIVASFHAWKPVSFLEN
        MGK+EEEQPLPVGV SSELSDRNV +RC GGGC  IR+LIAVRCVFFLLLS AVFLSAIFWLPPFLSY + PD   D  YRDH+IVASFHAWKPV FL+N
Subjt:  MGKTEEEQPLPVGV-SSELSDRNVGSRC-GGGCCGIRRLIAVRCVFFLLLSTAVFLSAIFWLPPFLSYADRPDPGRDPQYRDHEIVASFHAWKPVSFLEN

Query:  HILELEDSIFGEIPVPFAKVAILALQSLGGLNVTKIVFAVDPDTKYSKIPPTSQSLIKETFETMVINQPPLSLAASLFGNTSSFEVLKFPGGITIIPPQS
        HI ELED+IFGEIP+P  KVAIL+LQSL G NVTKIVFAVD D KYSKIPPTSQSLIKETFET+VIN+PPL L  SLFGNTS FEVLKFPGGITIIPPQS
Subjt:  HILELEDSIFGEIPVPFAKVAILALQSLGGLNVTKIVFAVDPDTKYSKIPPTSQSLIKETFETMVINQPPLSLAASLFGNTSSFEVLKFPGGITIIPPQS

Query:  AFLLQTAQIYFNFTLNYSIYQIQLNFDDLTGQLRSGLHLSRYENLYVSLSNARGSTMDAPTIVQSSVLMAIGTN--SSKQRLKQLAQTITNSHSGNLGLN
        AFLLQTAQIYFNFTLNYSIYQIQ+NFDDL+ QLRSGL LS YENLYVSLSN RGSTMDAPT+VQSSVLMAIGTN  SSKQRLKQLA TITNSHSGNLGLN
Subjt:  AFLLQTAQIYFNFTLNYSIYQIQLNFDDLTGQLRSGLHLSRYENLYVSLSNARGSTMDAPTIVQSSVLMAIGTN--SSKQRLKQLAQTITNSHSGNLGLN

Query:  NTTFGKVKQVRLSSVLNHSLSGGGGARSPSPAPLPHHHHHQHHHHHHHHHHHHHHHHHHHHHHH-------------------------DAAYSPSP---
        NT FGKVKQVRL S LNHSL GGG A SPSPAPLPH HHH HHHHHHHHHHHHHHHHHHHHHHH                          AAYSPSP   
Subjt:  NTTFGKVKQVRLSSVLNHSLSGGGGARSPSPAPLPHHHHHQHHHHHHHHHHHHHHHHHHHHHHH-------------------------DAAYSPSP---

Query:  ---HAPQNG-ASAPKAGSSAPVESPTPKKRNYEATPPACQYGYKRSSTKVRKQSHLGPIHSPSSPPLSPDLQVGPPAPVSDSISSSSALPGVVLSNVQPP
           HAP+NG +SAP+AGSS P+E PT +KRNYEATPPA +YGYKRSSTK+RKQ HLGPI SPSS P SP L+VG PAPVSDSIS+SS L GVVLSNVQPP
Subjt:  ---HAPQNG-ASAPKAGSSAPVESPTPKKRNYEATPPACQYGYKRSSTKVRKQSHLGPIHSPSSPPLSPDLQVGPPAPVSDSISSSSALPGVVLSNVQPP

Query:  DKGSRHAENFERRAPSVLPPQFSSGSDGRL
        + GS HAENFER +PSVLPPQFSS +D R+
Subjt:  DKGSRHAENFERRAPSVLPPQFSSGSDGRL

A0A5A7UJM2 Filamentous hemagglutinin7.3e-19876.64Show/hide
Query:  MGKTEEEQPLPVGV-SSELSDRNVGSRC-GGGCCGIRRLIAVRCVFFLLLSTAVFLSAIFWLPPFLSYADRPDPGRDPQYRDHEIVASFHAWKPVSFLEN
        MGK+EEEQPLPVGV SSELSDRNV +RC GGGC  IR+LIAVRCVFFLLLS AVFLSAIFWLPPFLSY + PD   D  YRDH+IVASFHAWKPV FL+N
Subjt:  MGKTEEEQPLPVGV-SSELSDRNVGSRC-GGGCCGIRRLIAVRCVFFLLLSTAVFLSAIFWLPPFLSYADRPDPGRDPQYRDHEIVASFHAWKPVSFLEN

Query:  HILELEDSIFGEIPVPFAK---------------------------VAILALQSLGGLNVTKIVFAVDPDTKYSKIPPTSQSLIKETFETMVINQPPLSL
        HI ELED+IFGEIP+P  K                           VAIL+LQSL G NVTKIVFAVD D KYSKIPPTSQSLIKETFET+VIN+PPL L
Subjt:  HILELEDSIFGEIPVPFAK---------------------------VAILALQSLGGLNVTKIVFAVDPDTKYSKIPPTSQSLIKETFETMVINQPPLSL

Query:  AASLFGNTSSFEVLKFPGGITIIPPQSAFLLQTAQIYFNFTLNYSIYQIQLNFDDLTGQLRSGLHLSRYENLYVSLSNARGSTMDAPTIVQSSVLMAIGT
          SLFGNTS FEVLKFPGGITIIPPQSAFLLQTAQIYFNFTLNYSIYQIQ+NFDDL+ QLRSGL LS YENLYVSLSN RGSTMDAPT+VQSSVLMAIGT
Subjt:  AASLFGNTSSFEVLKFPGGITIIPPQSAFLLQTAQIYFNFTLNYSIYQIQLNFDDLTGQLRSGLHLSRYENLYVSLSNARGSTMDAPTIVQSSVLMAIGT

Query:  N--SSKQRLKQLAQTITNSHSGNLGLNNTTFGKVKQVRLSSVLNHSLSGGGGARSPSPAPLPHHHHHQHHHHHHHHHHHHHHHHHHHHHHH-----DAAY
        N  SSKQRLKQLA TITNSHSGNLGLNNT FGKVKQVRL S LNHSL GGG A SPSPAPLPH HHH HHHHHHHHHHHH HHHHHHHHHH      AAY
Subjt:  N--SSKQRLKQLAQTITNSHSGNLGLNNTTFGKVKQVRLSSVLNHSLSGGGGARSPSPAPLPHHHHHQHHHHHHHHHHHHHHHHHHHHHHH-----DAAY

Query:  SPSP------HAPQNG-ASAPKAGSSAPVESPTPKKRNYEATPPACQYGYKRSSTKVRKQSHLGPIHSPSSPPLSPDLQVGPPAPVSDSISSSSALPGVV
        SPSP      HAP+NG +SAP+AGSS P+E PT +KRNYEATPPA +YGYKRSSTK+RKQ HLGPI SPSS P SP L+VG PAPVSDSIS+SS L GVV
Subjt:  SPSP------HAPQNG-ASAPKAGSSAPVESPTPKKRNYEATPPACQYGYKRSSTKVRKQSHLGPIHSPSSPPLSPDLQVGPPAPVSDSISSSSALPGVV

Query:  LSNVQPPDKGSRHAENFERRAPSVLPPQFSSGSDG
        LSNVQPP+ GS HAENFER +PSVLPPQFS    G
Subjt:  LSNVQPPDKGSRHAENFERRAPSVLPPQFSSGSDG

A0A6J1D2A3 TSC22 domain family protein 1-like isoform X13.0e-21284.44Show/hide
Query:  MGKTEEEQPLPVGV-SSELSDRNVGSRCGGGCCGIRRLIAVRCVFFLLLSTAVFLSAIFWLPPFLSYADRPDPGRDPQYRDHEIVASFHAWKPVSFLENH
        MGKTEEEQPLPVG+ SSELSD NVGSRCG    GIRRLIAVRCVFFLLLS AVF+SA FWLPPFLS  D PD   D  YRDHEIVASFHAWKPVSFL+NH
Subjt:  MGKTEEEQPLPVGV-SSELSDRNVGSRCGGGCCGIRRLIAVRCVFFLLLSTAVFLSAIFWLPPFLSYADRPDPGRDPQYRDHEIVASFHAWKPVSFLENH

Query:  ILELEDSIFGEIPVPFAKVAILALQSLGGLNVTKIVFAVDPDTKYSKIPPTSQSLIKETFETMVINQPPLSLAASLFGNTSSFEVLKFPGGITIIPPQSA
        ILELED+IFGEIPVPF KV IL+LQSLGGLNVTKIVFAVDPD KYSKIPPTSQSLIKE FETM IN+PPL+L A+LFGNTS FEVLKFPGGITIIPPQSA
Subjt:  ILELEDSIFGEIPVPFAKVAILALQSLGGLNVTKIVFAVDPDTKYSKIPPTSQSLIKETFETMVINQPPLSLAASLFGNTSSFEVLKFPGGITIIPPQSA

Query:  FLLQTAQIYFNFTLNYSIYQIQLNFDDLTGQLRSGLHLSRYENLYVSLSNARGSTMDAPTIVQSSVLMAIGTNSSKQRLKQLAQTITNSHSGNLGLNNTT
        FLLQTAQIYFNFTLNYSIYQIQ+NFDDLT QLRSGLHLS YENLYVSLSNARGST+DAPTI+QSSVLMAIGTNSSK+RLKQLAQTIT+SHSGNLGLNNT 
Subjt:  FLLQTAQIYFNFTLNYSIYQIQLNFDDLTGQLRSGLHLSRYENLYVSLSNARGSTMDAPTIVQSSVLMAIGTNSSKQRLKQLAQTITNSHSGNLGLNNTT

Query:  FGKVKQVRLSSVLNHSLSGGGGARSPSPAPLPHHHHHQHHHHHHHHHHHHHHHHHHHHHHHDAAYSPSP------HAPQNGASAPKAGSSAPVESPTPKK
        FGKVKQVRLSSVLNHSLSGG  ARSP+P+PLPH H   HHHHHHHHHHHHHHHHHHHHHHHDA+YSPSP      H P NG S+  A  SAPVESPTPKK
Subjt:  FGKVKQVRLSSVLNHSLSGGGGARSPSPAPLPHHHHHQHHHHHHHHHHHHHHHHHHHHHHHDAAYSPSP------HAPQNGASAPKAGSSAPVESPTPKK

Query:  RNYEATPPACQYGYKRSSTKVRKQSHLGPIHSPSSPPLSPDLQVGPPAPVSDSISSSSALPGVVLSNVQPPDKGSRHAENFERRAPSVLPPQFSS
        RN EATPPACQYGYKRSST+VRK+ HLGPIHS SSPP SP  +VGPPAPVS SIS+SS L  VVLSNVQPPDKGSRHAENFERRAPSVLP QFSS
Subjt:  RNYEATPPACQYGYKRSSTKVRKQSHLGPIHSPSSPPLSPDLQVGPPAPVSDSISSSSALPGVVLSNVQPPDKGSRHAENFERRAPSVLPPQFSS

A0A6J1J074 uncharacterized protein LOC111482272 isoform X22.8e-19779.42Show/hide
Query:  MGKTEEEQPLPVGV-SSELSDRNVGSRC-GGGCCGIRRLIAVRCVFFLLLSTAVFLSAIFWLPPFLSYADRPDPGRDPQYRDHEIVASFHAWKPVSFLEN
        MGK+EEEQPLPVGV SSELSD  V SRC GGGC  IRRLIAVRCVFFLLLS AVFLSAIFWLPPFLSY D PD   D  YRDHEIVA F A KPV FL+N
Subjt:  MGKTEEEQPLPVGV-SSELSDRNVGSRC-GGGCCGIRRLIAVRCVFFLLLSTAVFLSAIFWLPPFLSYADRPDPGRDPQYRDHEIVASFHAWKPVSFLEN

Query:  HILELEDSIFGEIPVPFAKVAILALQSLGGLNVTKIVFAVDPDTKYSKIPPTSQSLIKETFETMVINQPPLSLAASLFGNTSSFEVLKFPGGITIIPPQS
        HI ELED+IFGEIPVPF KVA+L+LQSLGG NVT I+F+VDPD KYSKIPPTSQSLIKETFET+VIN PPL L ASLFGNTS FEVLKFPGGITIIPPQS
Subjt:  HILELEDSIFGEIPVPFAKVAILALQSLGGLNVTKIVFAVDPDTKYSKIPPTSQSLIKETFETMVINQPPLSLAASLFGNTSSFEVLKFPGGITIIPPQS

Query:  AFLLQTAQIYFNFTLNYSIYQIQLNFDDLTGQLRSGLHLSRYENLYVSLSNARGSTMDAPTIVQSSVLMAIGTNSSKQRLKQLAQTITNSHSGNLGLNNT
        AFLLQTAQIYFNFTLNYSIYQIQ+NF+DLT QLRSGL LSRYENLYVSLSN RGSTM APTIVQSSVLMAIGTNSS QRLKQLAQTITNSHSGNLGLNNT
Subjt:  AFLLQTAQIYFNFTLNYSIYQIQLNFDDLTGQLRSGLHLSRYENLYVSLSNARGSTMDAPTIVQSSVLMAIGTNSSKQRLKQLAQTITNSHSGNLGLNNT

Query:  TFGKVKQVRLSSVLNHSLSGGGGARSPSPAPLP-------------HHHHHQHHHHHHHHHHHHHHHHHHHHHHH------DAAYSPSP------HAPQN
         FGKVKQVRLSSVLNHSLS GG ARSPSPAPLP             HHHHHQHHH HHHHHHHHHHHHHHHHHHH      DAAYSPSP      HAP+N
Subjt:  TFGKVKQVRLSSVLNHSLSGGGGARSPSPAPLP-------------HHHHHQHHHHHHHHHHHHHHHHHHHHHHH------DAAYSPSP------HAPQN

Query:  G-ASAPKAGSSAPVESPTPKKRNYEATPPACQYGYKRSSTKVRKQSHLGPIHSPSSPPLSPDLQVGPPAPVSDSISSSSALPGVVLSNVQPPDKGSRHAE
        G +SAP+AGSS PVESP  KKRNYEATPP  +YGYK  STKVRK+SHLG I SPSSPP SP L+VG PAPV+ SIS+SS LPGV LSNVQPP+KG     
Subjt:  G-ASAPKAGSSAPVESPTPKKRNYEATPPACQYGYKRSSTKVRKQSHLGPIHSPSSPPLSPDLQVGPPAPVSDSISSSSALPGVVLSNVQPPDKGSRHAE

Query:  NFERRAPSVLPPQFS
          +R APSVLPPQFS
Subjt:  NFERRAPSVLPPQFS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G10790.1 BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein (TAIR:AT3G56590.2)7.0e-3634.39Show/hide
Query:  ELSDRNVGSRCGGGCCGIRRLIAVRCVFFLLLSTAVFLSAIFWLPPFLSYADRPDPGRDPQYRDHEIVASFHAWKPVSFLENHILELEDSIFGEIPVP-F
        E S R+ G  C        RL+ +RC+  L+LS A+ LSAIFWL P  S ++    G      +  + ASF   KPVS +  H  ++E  I   I +   
Subjt:  ELSDRNVGSRCGGGCCGIRRLIAVRCVFFLLLSTAVFLSAIFWLPPFLSYADRPDPGRDPQYRDHEIVASFHAWKPVSFLENHILELEDSIFGEIPVP-F

Query:  AKVAILALQSLGGLNVTKIVFAVDPDTKYSKIPPTSQSLIKETFETMVINQPPLSLAASLFGNTSSFEVLKFPGGITIIPPQSAFLLQTAQIYFNFTLNY
        +KV +L+L   G  N T + FAV P     +I   S SL++ +F  +   +  L L  S FG  +SF+VLKFPGGIT+ P + A +   A + F+ T+  
Subjt:  AKVAILALQSLGGLNVTKIVFAVDPDTKYSKIPPTSQSLIKETFETMVINQPPLSLAASLFGNTSSFEVLKFPGGITIIPPQSAFLLQTAQIYFNFTLNY

Query:  SIYQIQLNFDDLTGQLRSGLHLSRYENLYVSLSNARGSTMDAPTIVQSSVLMAIGTNSSKQRLKQLAQTITNSHSGNLGLNNTTFGKVKQVRLSSVLNHS
        SI  +Q   D L G     L L  YE+++  L+N +GST+  P   Q  V   +      QRL    Q I  S + NLGL+   FG+VK +  S+ L+  
Subjt:  SIYQIQLNFDDLTGQLRSGLHLSRYENLYVSLSNARGSTMDAPTIVQSSVLMAIGTNSSKQRLKQLAQTITNSHSGNLGLNNTTFGKVKQVRLSSVLNHS

Query:  LSGGGGARSPSPAP
        +       +P+P P
Subjt:  LSGGGGARSPSPAP

AT3G10810.1 zinc finger (C3HC4-type RING finger) family protein7.9e-9649.5Show/hide
Query:  MGKTEEEQPLPVGVSSELSDRNV-GSRCGGGCC-GIRRLIAVRCVFFLLLSTAVFLSAIFWLPPFLSYADRPDPGRDPQYRDHEIVASFHAWKPVSFLEN
        MGKTE++  L V       D  V  +RC  GCC  I   +  +C+F LLLS A+FLSA+F L PF    DR D   DP++R H IVASF   +  SFL  
Subjt:  MGKTEEEQPLPVGVSSELSDRNV-GSRCGGGCC-GIRRLIAVRCVFFLLLSTAVFLSAIFWLPPFLSYADRPDPGRDPQYRDHEIVASFHAWKPVSFLEN

Query:  HILELEDSIFGEIPVPFAKVAILALQSLGGLNVTKIVFAVDPDTKYSKIPPTSQSLIKETFETMVINQPPLSLAASLFGNTSSFEVLKFPGGITIIPPQS
        + L+L++ IF E+     KV ILA++    LN+TK+VF +DPDT Y +I P S S IKE FE+++INQ  L L  SLFG T  FEVLKFPGGIT+IPPQS
Subjt:  HILELEDSIFGEIPVPFAKVAILALQSLGGLNVTKIVFAVDPDTKYSKIPPTSQSLIKETFETMVINQPPLSLAASLFGNTSSFEVLKFPGGITIIPPQS

Query:  AFLLQTAQIYFNFTLNYSIYQIQLNFDDLTGQLRSGLHLSRYENLYVSLSNARGSTMDAPTIVQSSVLMAIGTNSSKQRLKQLAQTITNSHSGNLGLNNT
        AF LQ  +I FNFTLNYSI+QIQ+NF+ L  QL++GL+L+ YENLYVSLSN+ GST+  PT V SSVL+ +GT++S  RLKQL  TIT S S NLGLNNT
Subjt:  AFLLQTAQIYFNFTLNYSIYQIQLNFDDLTGQLRSGLHLSRYENLYVSLSNARGSTMDAPTIVQSSVLMAIGTNSSKQRLKQLAQTITNSHSGNLGLNNT

Query:  TFGKVKQVRLSSVLNHSLSGGGGARSPSPAPLPHHHHHQHHHHHHHHHHHHHHHHHHHHHHHDAAYSPSPHAPQNGASAPKAGSSAPVESPTP--KKRNY
         FGKVKQVRLSS L +  S     +SPSP+P PH  HH HHHHHHHHHHHHHH+HHHHHHH+               S   A   +PV SP P   ++  
Subjt:  TFGKVKQVRLSSVLNHSLSGGGGARSPSPAPLPHHHHHQHHHHHHHHHHHHHHHHHHHHHHHDAAYSPSPHAPQNGASAPKAGSSAPVESPTP--KKRNY

Query:  EATPPACQYGYKRSSTKVRKQSHLGPIHSPSSPPLSPDLQVGPPAPVSDS----ISSSSALPGVVLSN-VQPPDKGSRHAENFERRAPSVLPPQFSSGS
         + PP C  G +    + R Q    P  +PS+   +P  Q+  PAP+S +    +  S+ LP VV ++  QPP    R     E  A  V  PQ  S S
Subjt:  EATPPACQYGYKRSSTKVRKQSHLGPIHSPSSPPLSPDLQVGPPAPVSDS----ISSSSALPGVVLSN-VQPPDKGSRHAENFERRAPSVLPPQFSSGS

AT3G56590.1 hydroxyproline-rich glycoprotein family protein3.4e-9947.79Show/hide
Query:  MGK-TEEEQPLPVGVSSELSDRNVGSRCGGGCC--GIRRLIAVRCVFFLLLSTAVFLSAIFWLPPFLSYADRPDPGRDPQYRDHEIVASFHAWKPVSFLE
        MGK T EEQ LPV   +  +  N G      CC   I    ++RCV  L  S AVFLSA+FWLPPFL +AD  D   DP+++DH IVASF   KP+SF+E
Subjt:  MGK-TEEEQPLPVGVSSELSDRNVGSRCGGGCC--GIRRLIAVRCVFFLLLSTAVFLSAIFWLPPFLSYADRPDPGRDPQYRDHEIVASFHAWKPVSFLE

Query:  NHILELEDSIFGEIPVPFAKVAILALQSLGGLNVTKIVFAVDPDTKYSKIPPTSQSLIKETFETMVINQPPLSLAASLFGNTSSFEVLKFPGGITIIPPQ
        +++++LE+ I  EI  P  KV +LAL+ LG LN T ++FA+DP+ + SKIP   +SLIK  FET+V  Q    L  SLFG    FEVLKFPGGIT+IPPQ
Subjt:  NHILELEDSIFGEIPVPFAKVAILALQSLGGLNVTKIVFAVDPDTKYSKIPPTSQSLIKETFETMVINQPPLSLAASLFGNTSSFEVLKFPGGITIIPPQ

Query:  SAFLLQTAQIYFNFTLNYSIYQIQLNFDDLTGQLRSGLHLSRYENLYVSLSNARGSTMDAPTIVQSSVLMAIGTNSSKQRLKQLAQTITNSHSGNLGLNN
          F LQ AQ+ FNFTLN+SIYQIQ NF++L  QL+ G++L+ YENLY++LSN+RGST+  PTIV SSVL+  G++S   RLKQLAQTIT+SHS NLGLN+
Subjt:  SAFLLQTAQIYFNFTLNYSIYQIQLNFDDLTGQLRSGLHLSRYENLYVSLSNARGSTMDAPTIVQSSVLMAIGTNSSKQRLKQLAQTITNSHSGNLGLNN

Query:  TTFGKVKQVRLSSVLNHSLSGGGGARSPSPAPLPHHHHHQHHHHHHHHHHHHHHHHHHHHHHHDAAYSPSPHAPQNGASAPKAGSSAPVE-SPTPKKRNY
        T FGKVKQVRLSS+L HS      + +PSP+P P  H +             HHH HHHHHHH+ A  PS   P  G     A +SAP + SP P +   
Subjt:  TTFGKVKQVRLSSVLNHSLSGGGGARSPSPAPLPHHHHHQHHHHHHHHHHHHHHHHHHHHHHHDAAYSPSPHAPQNGASAPKAGSSAPVE-SPTPKKRNY

Query:  EATPPACQYGYKRSSTKVRKQSHLGP----IHSPSSPPLSPDLQVGPPAPVSDSISSSSALPGVVLSNVQPPDKGSRHAENFERRAPSVLPPQFSSGS
            P C Y  +R         H  P     H     P +P+    P  P   +I  SS LP VV +++ PP K S  +E    ++PS  P    S S
Subjt:  EATPPACQYGYKRSSTKVRKQSHLGP----IHSPSSPPLSPDLQVGPPAPVSDSISSSSALPGVVLSNVQPPDKGSRHAENFERRAPSVLPPQFSSGS

AT3G56590.2 hydroxyproline-rich glycoprotein family protein1.5e-9947.92Show/hide
Query:  MGK-TEEEQPLPVGVSSELSDRNVGSRCGGGCC--GIRRLIAVRCVFFLLLSTAVFLSAIFWLPPFLSYADRPDPGRDPQYRDHEIVASFHAWKPVSFLE
        MGK T EEQ LPV   +  +  N G      CC   I    ++RCV  L  S AVFLSA+FWLPPFL +AD  D   DP+++DH IVASF   KP+SF+E
Subjt:  MGK-TEEEQPLPVGVSSELSDRNVGSRCGGGCC--GIRRLIAVRCVFFLLLSTAVFLSAIFWLPPFLSYADRPDPGRDPQYRDHEIVASFHAWKPVSFLE

Query:  NHILELEDSIFGEIPVPFAKVAILALQSLGGLNVTKIVFAVDPDTKYSKIPPTSQSLIKETFETMVINQPPLSLAASLFGNTSSFEVLKFPGGITIIPPQ
        +++++LE+ I  EI  P  KV +LAL+ LG LN T ++FA+DP+ + SKIP   +SLIK  FET+V  Q    L  SLFG    FEVLKFPGGIT+IPPQ
Subjt:  NHILELEDSIFGEIPVPFAKVAILALQSLGGLNVTKIVFAVDPDTKYSKIPPTSQSLIKETFETMVINQPPLSLAASLFGNTSSFEVLKFPGGITIIPPQ

Query:  SAFLLQTAQIYFNFTLNYSIYQIQLNFDDLTGQLRSGLHLSRYENLYVSLSNARGSTMDAPTIVQSSVLMAIGTNSSKQRLKQLAQTITNSHSGNLGLNN
          F LQ AQ+ FNFTLN+SIYQIQ NF++L  QL+ G++L+ YENLY++LSN+RGST+  PTIV SSVL+  G++S   RLKQLAQTIT+SHS NLGLN+
Subjt:  SAFLLQTAQIYFNFTLNYSIYQIQLNFDDLTGQLRSGLHLSRYENLYVSLSNARGSTMDAPTIVQSSVLMAIGTNSSKQRLKQLAQTITNSHSGNLGLNN

Query:  TTFGKVKQVRLSSVLNHSLSGGGGARSPSPAPLPHHHHHQHHHHHHHHHHHHHHHHHHHHHHHDAAYSPSPHAPQNGASAPKAGSSAPVE-SPTPKKRNY
        T FGKVKQVRLSS+L HS      + +PSP+P P  H +             HHH HHHHHHH+ A  PS   P  G     A +SAP + SP P +   
Subjt:  TTFGKVKQVRLSSVLNHSLSGGGGARSPSPAPLPHHHHHQHHHHHHHHHHHHHHHHHHHHHHHDAAYSPSPHAPQNGASAPKAGSSAPVE-SPTPKKRNY

Query:  EATPPACQYGYKRSSTKVRKQSHLGP----IHSPSSPPLSPDLQVGPPAPVSDSISSSSALPGVVLSNVQPPDKGSRHAENFERRAPSVLPPQFSSGSDG
            P C Y  +R         H  P     H     P +P+    P  P   +I  SS LP VV +++ PP K S  +E    ++PS  P   SS S G
Subjt:  EATPPACQYGYKRSSTKVRKQSHLGP----IHSPSSPPLSPDLQVGPPAPVSDSISSSSALPGVVLSNVQPPDKGSRHAENFERRAPSVLPPQFSSGSDG

Query:  RLVKK
           KK
Subjt:  RLVKK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGGAAAACCGAGGAAGAACAGCCGCTGCCGGTTGGGGTGAGCTCCGAGCTTTCTGACCGGAATGTGGGAAGCCGATGCGGCGGTGGGTGTTGTGGGATTCGTAGACT
GATTGCGGTGAGATGTGTCTTCTTCCTCCTACTGTCGACGGCTGTGTTTCTTTCTGCTATCTTTTGGCTGCCGCCGTTCCTTTCCTATGCAGATCGGCCGGATCCGGGTC
GTGATCCTCAGTATAGAGATCATGAAATAGTAGCAAGTTTTCATGCTTGGAAGCCAGTTTCTTTTCTGGAAAATCATATTTTGGAGCTTGAGGATAGCATTTTTGGAGAA
ATACCTGTACCTTTTGCCAAGGTGGCCATCCTCGCACTACAATCATTAGGTGGATTGAACGTAACAAAAATTGTTTTTGCTGTAGATCCTGACACCAAATATTCAAAAAT
ACCCCCAACTTCTCAAAGTTTGATCAAGGAGACCTTTGAAACAATGGTTATAAATCAACCACCTCTCAGCCTGGCTGCATCATTATTTGGAAATACGTCCTCATTTGAGG
TGTTGAAATTTCCTGGAGGAATAACGATTATCCCTCCACAGAGTGCATTTCTACTGCAGACAGCACAGATCTATTTCAACTTTACGTTAAATTATTCTATTTATCAAATT
CAATTAAATTTCGATGATCTTACCGGCCAGCTGAGGTCAGGATTACATCTATCTCGTTATGAGAATTTATATGTTAGCCTATCGAATGCAAGAGGTTCAACAATGGATGC
CCCTACTATTGTTCAGTCATCTGTTCTAATGGCAATTGGAACTAATTCATCGAAACAAAGACTAAAACAGCTGGCTCAAACCATCACAAATTCTCATTCAGGAAATCTTG
GCCTGAATAACACTACATTTGGTAAGGTCAAGCAAGTTCGTCTTTCATCTGTCCTGAACCACTCTCTTAGTGGTGGTGGAGGTGCACGGTCACCCTCGCCTGCTCCTCTG
CCTCACCATCATCACCACCAACACCATCACCATCATCACCACCACCACCATCACCACCACCACCACCACCACCATCACCACCACCACCATGATGCTGCATATTCACCGAG
TCCACATGCGCCTCAGAATGGTGCATCTGCTCCCAAAGCTGGTTCATCTGCCCCAGTGGAAAGTCCCACTCCAAAGAAAAGAAATTATGAAGCAACTCCACCTGCTTGTC
AATATGGATATAAAAGGTCTTCAACAAAAGTCAGAAAGCAATCTCATTTAGGCCCTATTCATTCACCAAGCAGTCCTCCGTTGTCGCCAGACTTACAAGTAGGCCCACCA
GCACCTGTCTCTGACTCGATTTCTTCTTCAAGTGCACTGCCAGGGGTAGTTTTATCTAATGTTCAGCCTCCAGATAAAGGAAGCAGACATGCAGAAAATTTTGAAAGAAG
AGCCCCTTCAGTTTTACCACCACAATTTTCTTCTGGAAGTGATGGGAGATTGGTGAAGAAGCCAGAGTTGTTGAAGAAGTATTGGAGATTGTTGATAAAATACAAGAGTA
GTGCTCCTGGAACAAGTGCTCAATTGCAGTTTTTTCTCAACGGAGCTGATGGTGGTTTAAGAACTGCTGCTTTTGATGGTGTTTGGAAGAGTTATGATAAGGCTATTGTG
GAAGAGTTTATTAGTTATTGGTTGTTAGACAATGGAGGTTATAAATTGTGTGGCTTAGCAGATTTGGGGAAGTTAAGATGGATCGATTCTAATCTAACTTTAACACGCCC
ATCTGCAGCTTCTGTAGGTCTCATCATTCAGATCGGAGTTGTGATAGCGAGACCAACGCGAAGGCGTGCTCCTGCTAGAGTTGAGATATAA
mRNA sequenceShow/hide mRNA sequence
ATGGGGAAAACCGAGGAAGAACAGCCGCTGCCGGTTGGGGTGAGCTCCGAGCTTTCTGACCGGAATGTGGGAAGCCGATGCGGCGGTGGGTGTTGTGGGATTCGTAGACT
GATTGCGGTGAGATGTGTCTTCTTCCTCCTACTGTCGACGGCTGTGTTTCTTTCTGCTATCTTTTGGCTGCCGCCGTTCCTTTCCTATGCAGATCGGCCGGATCCGGGTC
GTGATCCTCAGTATAGAGATCATGAAATAGTAGCAAGTTTTCATGCTTGGAAGCCAGTTTCTTTTCTGGAAAATCATATTTTGGAGCTTGAGGATAGCATTTTTGGAGAA
ATACCTGTACCTTTTGCCAAGGTGGCCATCCTCGCACTACAATCATTAGGTGGATTGAACGTAACAAAAATTGTTTTTGCTGTAGATCCTGACACCAAATATTCAAAAAT
ACCCCCAACTTCTCAAAGTTTGATCAAGGAGACCTTTGAAACAATGGTTATAAATCAACCACCTCTCAGCCTGGCTGCATCATTATTTGGAAATACGTCCTCATTTGAGG
TGTTGAAATTTCCTGGAGGAATAACGATTATCCCTCCACAGAGTGCATTTCTACTGCAGACAGCACAGATCTATTTCAACTTTACGTTAAATTATTCTATTTATCAAATT
CAATTAAATTTCGATGATCTTACCGGCCAGCTGAGGTCAGGATTACATCTATCTCGTTATGAGAATTTATATGTTAGCCTATCGAATGCAAGAGGTTCAACAATGGATGC
CCCTACTATTGTTCAGTCATCTGTTCTAATGGCAATTGGAACTAATTCATCGAAACAAAGACTAAAACAGCTGGCTCAAACCATCACAAATTCTCATTCAGGAAATCTTG
GCCTGAATAACACTACATTTGGTAAGGTCAAGCAAGTTCGTCTTTCATCTGTCCTGAACCACTCTCTTAGTGGTGGTGGAGGTGCACGGTCACCCTCGCCTGCTCCTCTG
CCTCACCATCATCACCACCAACACCATCACCATCATCACCACCACCACCATCACCACCACCACCACCACCACCATCACCACCACCACCATGATGCTGCATATTCACCGAG
TCCACATGCGCCTCAGAATGGTGCATCTGCTCCCAAAGCTGGTTCATCTGCCCCAGTGGAAAGTCCCACTCCAAAGAAAAGAAATTATGAAGCAACTCCACCTGCTTGTC
AATATGGATATAAAAGGTCTTCAACAAAAGTCAGAAAGCAATCTCATTTAGGCCCTATTCATTCACCAAGCAGTCCTCCGTTGTCGCCAGACTTACAAGTAGGCCCACCA
GCACCTGTCTCTGACTCGATTTCTTCTTCAAGTGCACTGCCAGGGGTAGTTTTATCTAATGTTCAGCCTCCAGATAAAGGAAGCAGACATGCAGAAAATTTTGAAAGAAG
AGCCCCTTCAGTTTTACCACCACAATTTTCTTCTGGAAGTGATGGGAGATTGGTGAAGAAGCCAGAGTTGTTGAAGAAGTATTGGAGATTGTTGATAAAATACAAGAGTA
GTGCTCCTGGAACAAGTGCTCAATTGCAGTTTTTTCTCAACGGAGCTGATGGTGGTTTAAGAACTGCTGCTTTTGATGGTGTTTGGAAGAGTTATGATAAGGCTATTGTG
GAAGAGTTTATTAGTTATTGGTTGTTAGACAATGGAGGTTATAAATTGTGTGGCTTAGCAGATTTGGGGAAGTTAAGATGGATCGATTCTAATCTAACTTTAACACGCCC
ATCTGCAGCTTCTGTAGGTCTCATCATTCAGATCGGAGTTGTGATAGCGAGACCAACGCGAAGGCGTGCTCCTGCTAGAGTTGAGATATAA
Protein sequenceShow/hide protein sequence
MGKTEEEQPLPVGVSSELSDRNVGSRCGGGCCGIRRLIAVRCVFFLLLSTAVFLSAIFWLPPFLSYADRPDPGRDPQYRDHEIVASFHAWKPVSFLENHILELEDSIFGE
IPVPFAKVAILALQSLGGLNVTKIVFAVDPDTKYSKIPPTSQSLIKETFETMVINQPPLSLAASLFGNTSSFEVLKFPGGITIIPPQSAFLLQTAQIYFNFTLNYSIYQI
QLNFDDLTGQLRSGLHLSRYENLYVSLSNARGSTMDAPTIVQSSVLMAIGTNSSKQRLKQLAQTITNSHSGNLGLNNTTFGKVKQVRLSSVLNHSLSGGGGARSPSPAPL
PHHHHHQHHHHHHHHHHHHHHHHHHHHHHHDAAYSPSPHAPQNGASAPKAGSSAPVESPTPKKRNYEATPPACQYGYKRSSTKVRKQSHLGPIHSPSSPPLSPDLQVGPP
APVSDSISSSSALPGVVLSNVQPPDKGSRHAENFERRAPSVLPPQFSSGSDGRLVKKPELLKKYWRLLIKYKSSAPGTSAQLQFFLNGADGGLRTAAFDGVWKSYDKAIV
EEFISYWLLDNGGYKLCGLADLGKLRWIDSNLTLTRPSAASVGLIIQIGVVIARPTRRRAPARVEI