CuGenDBv2

Sgr029696 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr029696
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: protein HLB1-like isoform X1
Genome location: tig00153449:1920573..1927807
RNA-Seq Expression: Sgr029696
Synteny: Sgr029696
Gene Ontology terms:
GO:0006887 - exocytosis (biological process)
GO:0048768 - root hair cell tip growth (biological process)
GO:0005769 - early endosome (cellular component)
GO:0005802 - trans-Golgi network (cellular component)
GO:0005515 - protein binding (molecular function)
InterPro domains: IPR011990 - Tetratricopeptide-like helical domain superfamily
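
The genome location above follows a contig:start..end pattern. As a minimal sketch, the string can be split into its parts and the locus span computed. The helper names (`Locus`, `parse_location`) are hypothetical, and the length assumes 1-based inclusive coordinates, which this page does not state.

```python
import re
from typing import NamedTuple


class Locus(NamedTuple):
    """A genomic interval on a named contig (hypothetical helper type)."""
    contig: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_location(text: str) -> Locus:
    """Parse a 'contig:start..end' string into a Locus."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    contig, start, end = match.groups()
    return Locus(contig, int(start), int(end))


locus = parse_location("tig00153449:1920573..1927807")
print(locus.contig, locus.start, locus.end, locus.length)
# prints: tig00153449 1920573 1927807 7235
```

Under that coordinate assumption the gene locus spans 7235 bases on the contig.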


Homology
GenBank top hits (e value, % identity, alignment)
XP_004146133.1 protein HLB1 isoform X1 [Cucumis sativus] (e value: 5.7e-203; % identity: 86.01)
Query:  MRELLNGLKGEDGNDSVNESEGERPEANSAYSFNQDSPHQPYSEQSRAAMELINNVTGVDEEGRSRQRILTFAAKRYASAIERNAQDYDALYNWALVLQE
        MRELLNGLKGEDG+DS+NESEGERPE NS YS NQDSPHQPYSEQSRAAMELIN+VTGVDEEGRSRQRILTFAA+RYASAIERN QDYDALYNWALVLQE
Subjt:  MRELLNGLKGEDGNDSVNESEGERPEANSAYSFNQDSPHQPYSEQSRAAMELINNVTGVDEEGRSRQRILTFAAKRYASAIERNAQDYDALYNWALVLQE

Query:  SADNVSPDSTSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWK-----------------QALNNWGLALQELSAIVPV
        SADNVSPDSTSPSKDALLEEACKKYDEAT LCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWK                 QALNNWGLALQELSAIVP 
Subjt:  SADNVSPDSTSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWK-----------------QALNNWGLALQELSAIVPV

Query:  REKQTIVRTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTANAKDVSPNELYSQSAIYIAAAHALKPNYSVYSSALRLVRSMLPLPYLKVGY
        REKQTIV+TAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGG+ N KDVSPNELYSQSAIYIAAAHALKPNYSVYSSALRLVRSMLPLPYLKVGY
Subjt:  REKQTIVRTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTANAKDVSPNELYSQSAIYIAAAHALKPNYSVYSSALRLVRSMLPLPYLKVGY

Query:  LTAPPVGRPLAPHSDWKRSQFFLNHDVLQK-------------------------RTIKVDIPDIVSVSACADLTLPPGAGLCIDTIHGPVFLVADSWDS
        LTAPPVGRPLAPHSDWKRSQFFLNHDVLQK                         RTIKV+IPDIVSVSACADLTLPPGAGLCIDTIHGP+FLVADSWD+
Subjt:  LTAPPVGRPLAPHSDWKRSQFFLNHDVLQK-------------------------RTIKVDIPDIVSVSACADLTLPPGAGLCIDTIHGPVFLVADSWDS

Query:  LDGWLDAIRLVYTIYARGKNEVLAAIITG
        LDGWLDAIRLVYTIYARGKNEVLA IITG
Subjt:  LDGWLDAIRLVYTIYARGKNEVLAAIITG
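
In BLAST-style pairwise output, the unlabelled middle line conventionally shows a letter where the two residues are identical, `+` for a conservative substitution, and a space for a mismatch or gap. A minimal sketch of tallying one alignment segment under that convention follows. The function name `summarize_midline` is hypothetical, and a per-segment figure is not the same as the overall % identity reported for the hit.

```python
def summarize_midline(query: str, midline: str) -> dict:
    """Count identical and conservative columns in one alignment segment.

    Assumes the BLAST convention: letter = identity, '+' = conservative
    substitution, space = mismatch or gap.
    """
    if len(query) != len(midline):
        raise ValueError("query and midline must have the same length")
    identical = sum(1 for m in midline if m.isalpha())
    positives = midline.count("+")
    columns = len(query)
    return {
        "columns": columns,
        "identical": identical,
        "positives": positives,
        "percent_identity": round(100 * identical / columns, 2),
    }


# Final segment of the first GenBank hit shown above.
segment = summarize_midline(
    "LDGWLDAIRLVYTIYARGKNEVLAAIITG",
    "LDGWLDAIRLVYTIYARGKNEVLA IITG",
)
print(segment)
```

For this 29-column segment the tally is 28 identical columns and one mismatch.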

XP_008448563.1 PREDICTED: uncharacterized protein LOC103490705 isoform X1 [Cucumis melo] (e value: 3.7e-202; % identity: 85.55)
Query:  MRELLNGLKGEDGNDSVNESEGERPEANSAYSFNQDSPHQPYSEQSRAAMELINNVTGVDEEGRSRQRILTFAAKRYASAIERNAQDYDALYNWALVLQE
        MRELLNGLKGEDG+D +NESEGERPE NS +S NQDSPHQPYSEQSRAAMELIN++TGVDEEGRSRQRILTFAA+RYASAIERN QDYDALYNWALVLQE
Subjt:  MRELLNGLKGEDGNDSVNESEGERPEANSAYSFNQDSPHQPYSEQSRAAMELINNVTGVDEEGRSRQRILTFAAKRYASAIERNAQDYDALYNWALVLQE

Query:  SADNVSPDSTSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWK-----------------QALNNWGLALQELSAIVPV
        SADNVSPDSTSPSKDALLEEACKKYDEAT LCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWK                 QALNNWGLALQELSAIVP 
Subjt:  SADNVSPDSTSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWK-----------------QALNNWGLALQELSAIVPV

Query:  REKQTIVRTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTANAKDVSPNELYSQSAIYIAAAHALKPNYSVYSSALRLVRSMLPLPYLKVGY
        REKQTIV+TAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGT N KDVSPNELYSQSAIYIAAAHALKPNYSVYSSALRLVRSMLPLPYLKVGY
Subjt:  REKQTIVRTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTANAKDVSPNELYSQSAIYIAAAHALKPNYSVYSSALRLVRSMLPLPYLKVGY

Query:  LTAPPVGRPLAPHSDWKRSQFFLNHDVLQK-------------------------RTIKVDIPDIVSVSACADLTLPPGAGLCIDTIHGPVFLVADSWDS
        LTAPPVGRPLAPHSDWKRSQFFLNHDVLQK                         RTIKV+IPDIVSVSACADLTLPPGAGLCIDTIHGP+FLVADSWD+
Subjt:  LTAPPVGRPLAPHSDWKRSQFFLNHDVLQK-------------------------RTIKVDIPDIVSVSACADLTLPPGAGLCIDTIHGPVFLVADSWDS

Query:  LDGWLDAIRLVYTIYARGKNEVLAAIITG
        LDGWLDAIRLVYTIYARGKNEVLA IITG
Subjt:  LDGWLDAIRLVYTIYARGKNEVLAAIITG

XP_022145328.1 protein HLB1 [Momordica charantia] (e value: 7.5e-203; % identity: 87.41)
Query:  MRELLNGLKGEDGNDSVNESEGERPEANSAYSFNQDSPHQPYSEQSRAAMELINNVTGVDEEGRSRQRILTFAAKRYASAIERNAQDYDALYNWALVLQE
        MRELLNGLKG+DGNDSVNESEGERPEANS +SF QDSPHQPYSEQSRAAMELIN+VTGVDEEGRSRQRILTFAA+RYASAIERNAQDYDALYNWALVLQE
Subjt:  MRELLNGLKGEDGNDSVNESEGERPEANSAYSFNQDSPHQPYSEQSRAAMELINNVTGVDEEGRSRQRILTFAAKRYASAIERNAQDYDALYNWALVLQE

Query:  SADNVSPDSTSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWK-----------------QALNNWGLALQELSAIVPV
        SADNV  DS+SPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAK+RGRTKEAEELWK                 QALNNWGLALQELSAIVP 
Subjt:  SADNVSPDSTSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWK-----------------QALNNWGLALQELSAIVPV

Query:  REKQTIVRTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTANAKDVSPNELYSQSAIYIAAAHALKPNYSVYSSALRLVRSMLPLPYLKVGY
        REKQTIVRTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGT NAKDVSPN+LYSQSAIYIAAAHALKPNYSVYSSALRLVRSMLPLPYLKVGY
Subjt:  REKQTIVRTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTANAKDVSPNELYSQSAIYIAAAHALKPNYSVYSSALRLVRSMLPLPYLKVGY

Query:  LTAPPVGRPLAPHSDWKRSQFFLNHDVLQK-----------------RTIKVDIPDIVSVSACADLTLPPGAGLCIDTIHGPVFLVADSWDSLDGWLDAI
        LTAPPVGRPLAPHSDWKRSQFFLNHDVLQK                 RTIKV+IPDIVSVSACADLTLPPGAGLCIDTIHGPVFLVAD+WD+LDGWLDAI
Subjt:  LTAPPVGRPLAPHSDWKRSQFFLNHDVLQK-----------------RTIKVDIPDIVSVSACADLTLPPGAGLCIDTIHGPVFLVADSWDSLDGWLDAI

Query:  RLVYTIYARGKNEVLAAIITG
        RLVYTIYARGKN+VLA II G
Subjt:  RLVYTIYARGKNEVLAAIITG

XP_022923573.1 protein HLB1-like [Cucurbita moschata] (e value: 1.3e-199; % identity: 84.85)
Query:  MRELLNGLKGEDGNDSVNESEGERPEANSAYSFNQDSPHQPYSEQSRAAMELINNVTGVDEEGRSRQRILTFAAKRYASAIERNAQDYDALYNWALVLQE
        MRELLNGLK EDGNDS+NESEGE+PEANS YS NQDSPHQPYSEQSRAAMELIN+VTGVDEEGRSRQRILTFAA+RYASAIERN QDYDALYNWALVLQE
Subjt:  MRELLNGLKGEDGNDSVNESEGERPEANSAYSFNQDSPHQPYSEQSRAAMELINNVTGVDEEGRSRQRILTFAAKRYASAIERNAQDYDALYNWALVLQE

Query:  SADNVSPDSTSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWK-----------------QALNNWGLALQELSAIVPV
        SADNVSPDSTSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWK                 QALNNWGLALQELSAIVP 
Subjt:  SADNVSPDSTSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWK-----------------QALNNWGLALQELSAIVPV

Query:  REKQTIVRTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTANAKDVSPNELYSQSAIYIAAAHALKPNYSVYSSALRLVRSMLPLPYLKVGY
        REKQTIV+TAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGT   KDVSPNELYSQSAIYIAAAHALKP+YSVYSSALRLVRSMLPLPYLKVGY
Subjt:  REKQTIVRTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTANAKDVSPNELYSQSAIYIAAAHALKPNYSVYSSALRLVRSMLPLPYLKVGY

Query:  LTAPPVGRPLAPHSDWKRSQFFLNHDVLQK-------------------------RTIKVDIPDIVSVSACADLTLPPGAGLCIDTIHGPVFLVADSWDS
        LTAPPVGRP APH DWKRSQFFLNHDVLQK                         RT+KV+IPDIVSVSACADLTLPPGAGLCIDTIHG +FLVADSWD+
Subjt:  LTAPPVGRPLAPHSDWKRSQFFLNHDVLQK-------------------------RTIKVDIPDIVSVSACADLTLPPGAGLCIDTIHGPVFLVADSWDS

Query:  LDGWLDAIRLVYTIYARGKNEVLAAIITG
        LDGWLDAIRLVYTIYARGKNEVLA II G
Subjt:  LDGWLDAIRLVYTIYARGKNEVLAAIITG

XP_038876586.1 protein HLB1 [Benincasa hispida] (e value: 2.8e-202; % identity: 86.01)
Query:  MRELLNGLKGEDGNDSVNESEGERPEANSAYSFNQDSPHQPYSEQSRAAMELINNVTGVDEEGRSRQRILTFAAKRYASAIERNAQDYDALYNWALVLQE
        MRELLNGLKGEDGNDS+NESEGERPE N  YS NQDSPHQPYSEQSRAAMELI++VTGVDEEGRSRQRILTFAA+RYASAIERN QDYDALYNWALVLQE
Subjt:  MRELLNGLKGEDGNDSVNESEGERPEANSAYSFNQDSPHQPYSEQSRAAMELINNVTGVDEEGRSRQRILTFAAKRYASAIERNAQDYDALYNWALVLQE

Query:  SADNVSPDSTSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWK-----------------QALNNWGLALQELSAIVPV
        SADNVSPDSTSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWK                 QALNNWGLALQELSAIVP 
Subjt:  SADNVSPDSTSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWK-----------------QALNNWGLALQELSAIVPV

Query:  REKQTIVRTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTANAKDVSPNELYSQSAIYIAAAHALKPNYSVYSSALRLVRSMLPLPYLKVGY
        REKQTIV+TAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGT N KDVSPNELYSQSAIYIAAAHALKPNYSVYSSALRLVRSMLPLPYLKVGY
Subjt:  REKQTIVRTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTANAKDVSPNELYSQSAIYIAAAHALKPNYSVYSSALRLVRSMLPLPYLKVGY

Query:  LTAPPVGRPLAPHSDWKRSQFFLNHDVLQKR-------------------------TIKVDIPDIVSVSACADLTLPPGAGLCIDTIHGPVFLVADSWDS
        LTAPPVGRPLAPH DWKRSQFFLNHDVLQK                          TIKV+IPDIVSVSACADLTLPPGAGLCIDTIHGPVFLVADSWD+
Subjt:  LTAPPVGRPLAPHSDWKRSQFFLNHDVLQKR-------------------------TIKVDIPDIVSVSACADLTLPPGAGLCIDTIHGPVFLVADSWDS

Query:  LDGWLDAIRLVYTIYARGKNEVLAAIITG
        LDGWLDAIRLVYTIYARGKNEVLA IITG
Subjt:  LDGWLDAIRLVYTIYARGKNEVLAAIITG

TrEMBL top hits (e value, % identity, alignment)
A0A0A0L688 Uncharacterized protein (e value: 2.8e-203; % identity: 86.01)
Query:  MRELLNGLKGEDGNDSVNESEGERPEANSAYSFNQDSPHQPYSEQSRAAMELINNVTGVDEEGRSRQRILTFAAKRYASAIERNAQDYDALYNWALVLQE
        MRELLNGLKGEDG+DS+NESEGERPE NS YS NQDSPHQPYSEQSRAAMELIN+VTGVDEEGRSRQRILTFAA+RYASAIERN QDYDALYNWALVLQE
Subjt:  MRELLNGLKGEDGNDSVNESEGERPEANSAYSFNQDSPHQPYSEQSRAAMELINNVTGVDEEGRSRQRILTFAAKRYASAIERNAQDYDALYNWALVLQE

Query:  SADNVSPDSTSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWK-----------------QALNNWGLALQELSAIVPV
        SADNVSPDSTSPSKDALLEEACKKYDEAT LCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWK                 QALNNWGLALQELSAIVP 
Subjt:  SADNVSPDSTSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWK-----------------QALNNWGLALQELSAIVPV

Query:  REKQTIVRTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTANAKDVSPNELYSQSAIYIAAAHALKPNYSVYSSALRLVRSMLPLPYLKVGY
        REKQTIV+TAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGG+ N KDVSPNELYSQSAIYIAAAHALKPNYSVYSSALRLVRSMLPLPYLKVGY
Subjt:  REKQTIVRTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTANAKDVSPNELYSQSAIYIAAAHALKPNYSVYSSALRLVRSMLPLPYLKVGY

Query:  LTAPPVGRPLAPHSDWKRSQFFLNHDVLQK-------------------------RTIKVDIPDIVSVSACADLTLPPGAGLCIDTIHGPVFLVADSWDS
        LTAPPVGRPLAPHSDWKRSQFFLNHDVLQK                         RTIKV+IPDIVSVSACADLTLPPGAGLCIDTIHGP+FLVADSWD+
Subjt:  LTAPPVGRPLAPHSDWKRSQFFLNHDVLQK-------------------------RTIKVDIPDIVSVSACADLTLPPGAGLCIDTIHGPVFLVADSWDS

Query:  LDGWLDAIRLVYTIYARGKNEVLAAIITG
        LDGWLDAIRLVYTIYARGKNEVLA IITG
Subjt:  LDGWLDAIRLVYTIYARGKNEVLAAIITG

A0A1S3BJC9 uncharacterized protein LOC103490705 isoform X1 (e value: 1.8e-202; % identity: 85.55)
Query:  MRELLNGLKGEDGNDSVNESEGERPEANSAYSFNQDSPHQPYSEQSRAAMELINNVTGVDEEGRSRQRILTFAAKRYASAIERNAQDYDALYNWALVLQE
        MRELLNGLKGEDG+D +NESEGERPE NS +S NQDSPHQPYSEQSRAAMELIN++TGVDEEGRSRQRILTFAA+RYASAIERN QDYDALYNWALVLQE
Subjt:  MRELLNGLKGEDGNDSVNESEGERPEANSAYSFNQDSPHQPYSEQSRAAMELINNVTGVDEEGRSRQRILTFAAKRYASAIERNAQDYDALYNWALVLQE

Query:  SADNVSPDSTSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWK-----------------QALNNWGLALQELSAIVPV
        SADNVSPDSTSPSKDALLEEACKKYDEAT LCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWK                 QALNNWGLALQELSAIVP 
Subjt:  SADNVSPDSTSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWK-----------------QALNNWGLALQELSAIVPV

Query:  REKQTIVRTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTANAKDVSPNELYSQSAIYIAAAHALKPNYSVYSSALRLVRSMLPLPYLKVGY
        REKQTIV+TAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGT N KDVSPNELYSQSAIYIAAAHALKPNYSVYSSALRLVRSMLPLPYLKVGY
Subjt:  REKQTIVRTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTANAKDVSPNELYSQSAIYIAAAHALKPNYSVYSSALRLVRSMLPLPYLKVGY

Query:  LTAPPVGRPLAPHSDWKRSQFFLNHDVLQK-------------------------RTIKVDIPDIVSVSACADLTLPPGAGLCIDTIHGPVFLVADSWDS
        LTAPPVGRPLAPHSDWKRSQFFLNHDVLQK                         RTIKV+IPDIVSVSACADLTLPPGAGLCIDTIHGP+FLVADSWD+
Subjt:  LTAPPVGRPLAPHSDWKRSQFFLNHDVLQK-------------------------RTIKVDIPDIVSVSACADLTLPPGAGLCIDTIHGPVFLVADSWDS

Query:  LDGWLDAIRLVYTIYARGKNEVLAAIITG
        LDGWLDAIRLVYTIYARGKNEVLA IITG
Subjt:  LDGWLDAIRLVYTIYARGKNEVLAAIITG

A0A6J1CUX3 protein HLB1 (e value: 3.6e-203; % identity: 87.41)
Query:  MRELLNGLKGEDGNDSVNESEGERPEANSAYSFNQDSPHQPYSEQSRAAMELINNVTGVDEEGRSRQRILTFAAKRYASAIERNAQDYDALYNWALVLQE
        MRELLNGLKG+DGNDSVNESEGERPEANS +SF QDSPHQPYSEQSRAAMELIN+VTGVDEEGRSRQRILTFAA+RYASAIERNAQDYDALYNWALVLQE
Subjt:  MRELLNGLKGEDGNDSVNESEGERPEANSAYSFNQDSPHQPYSEQSRAAMELINNVTGVDEEGRSRQRILTFAAKRYASAIERNAQDYDALYNWALVLQE

Query:  SADNVSPDSTSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWK-----------------QALNNWGLALQELSAIVPV
        SADNV  DS+SPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAK+RGRTKEAEELWK                 QALNNWGLALQELSAIVP 
Subjt:  SADNVSPDSTSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWK-----------------QALNNWGLALQELSAIVPV

Query:  REKQTIVRTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTANAKDVSPNELYSQSAIYIAAAHALKPNYSVYSSALRLVRSMLPLPYLKVGY
        REKQTIVRTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGT NAKDVSPN+LYSQSAIYIAAAHALKPNYSVYSSALRLVRSMLPLPYLKVGY
Subjt:  REKQTIVRTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTANAKDVSPNELYSQSAIYIAAAHALKPNYSVYSSALRLVRSMLPLPYLKVGY

Query:  LTAPPVGRPLAPHSDWKRSQFFLNHDVLQK-----------------RTIKVDIPDIVSVSACADLTLPPGAGLCIDTIHGPVFLVADSWDSLDGWLDAI
        LTAPPVGRPLAPHSDWKRSQFFLNHDVLQK                 RTIKV+IPDIVSVSACADLTLPPGAGLCIDTIHGPVFLVAD+WD+LDGWLDAI
Subjt:  LTAPPVGRPLAPHSDWKRSQFFLNHDVLQK-----------------RTIKVDIPDIVSVSACADLTLPPGAGLCIDTIHGPVFLVADSWDSLDGWLDAI

Query:  RLVYTIYARGKNEVLAAIITG
        RLVYTIYARGKN+VLA II G
Subjt:  RLVYTIYARGKNEVLAAIITG

A0A6J1EA05 protein HLB1-like (e value: 6.4e-200; % identity: 84.85)
Query:  MRELLNGLKGEDGNDSVNESEGERPEANSAYSFNQDSPHQPYSEQSRAAMELINNVTGVDEEGRSRQRILTFAAKRYASAIERNAQDYDALYNWALVLQE
        MRELLNGLK EDGNDS+NESEGE+PEANS YS NQDSPHQPYSEQSRAAMELIN+VTGVDEEGRSRQRILTFAA+RYASAIERN QDYDALYNWALVLQE
Subjt:  MRELLNGLKGEDGNDSVNESEGERPEANSAYSFNQDSPHQPYSEQSRAAMELINNVTGVDEEGRSRQRILTFAAKRYASAIERNAQDYDALYNWALVLQE

Query:  SADNVSPDSTSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWK-----------------QALNNWGLALQELSAIVPV
        SADNVSPDSTSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWK                 QALNNWGLALQELSAIVP 
Subjt:  SADNVSPDSTSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWK-----------------QALNNWGLALQELSAIVPV

Query:  REKQTIVRTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTANAKDVSPNELYSQSAIYIAAAHALKPNYSVYSSALRLVRSMLPLPYLKVGY
        REKQTIV+TAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGT   KDVSPNELYSQSAIYIAAAHALKP+YSVYSSALRLVRSMLPLPYLKVGY
Subjt:  REKQTIVRTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTANAKDVSPNELYSQSAIYIAAAHALKPNYSVYSSALRLVRSMLPLPYLKVGY

Query:  LTAPPVGRPLAPHSDWKRSQFFLNHDVLQK-------------------------RTIKVDIPDIVSVSACADLTLPPGAGLCIDTIHGPVFLVADSWDS
        LTAPPVGRP APH DWKRSQFFLNHDVLQK                         RT+KV+IPDIVSVSACADLTLPPGAGLCIDTIHG +FLVADSWD+
Subjt:  LTAPPVGRPLAPHSDWKRSQFFLNHDVLQK-------------------------RTIKVDIPDIVSVSACADLTLPPGAGLCIDTIHGPVFLVADSWDS

Query:  LDGWLDAIRLVYTIYARGKNEVLAAIITG
        LDGWLDAIRLVYTIYARGKNEVLA II G
Subjt:  LDGWLDAIRLVYTIYARGKNEVLAAIITG

A0A6J1HJU5 protein HLB1-like isoform X2 (e value: 3.2e-199; % identity: 84.62)
Query:  MRELLNGLKGEDGNDSVNESEGERPEANSAYSFNQDSPHQPYSEQSRAAMELINNVTGVDEEGRSRQRILTFAAKRYASAIERNAQDYDALYNWALVLQE
        MRELLNGLK EDGNDS+NESEGE+PEANS YS NQDSPHQPYSEQSRAAMELIN+VTGVDEEGRSRQRILTFAA+RYASAIERN QDYDALYNWALVLQE
Subjt:  MRELLNGLKGEDGNDSVNESEGERPEANSAYSFNQDSPHQPYSEQSRAAMELINNVTGVDEEGRSRQRILTFAAKRYASAIERNAQDYDALYNWALVLQE

Query:  SADNVSPDSTSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWK-----------------QALNNWGLALQELSAIVPV
        SADNVSPDSTSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWK                 QALNNWGLALQELSAIVP 
Subjt:  SADNVSPDSTSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWK-----------------QALNNWGLALQELSAIVPV

Query:  REKQTIVRTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTANAKDVSPNELYSQSAIYIAAAHALKPNYSVYSSALRLVRSMLPLPYLKVGY
        REK TIV+TAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGT   KDVSPNELYSQSAIYIAAAHALKP+YSVYSSALRLVRSMLPLPYLKVGY
Subjt:  REKQTIVRTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTANAKDVSPNELYSQSAIYIAAAHALKPNYSVYSSALRLVRSMLPLPYLKVGY

Query:  LTAPPVGRPLAPHSDWKRSQFFLNHDVLQK-------------------------RTIKVDIPDIVSVSACADLTLPPGAGLCIDTIHGPVFLVADSWDS
        LTAPPVGRP APH DWKRSQFFLNHDVLQK                         RT+KV+IPDIVSVSACADLTLPPGAGLCIDTIHG +FLVADSWD+
Subjt:  LTAPPVGRPLAPHSDWKRSQFFLNHDVLQK-------------------------RTIKVDIPDIVSVSACADLTLPPGAGLCIDTIHGPVFLVADSWDS

Query:  LDGWLDAIRLVYTIYARGKNEVLAAIITG
        LDGWLDAIRLVYTIYARGKNEVLA II G
Subjt:  LDGWLDAIRLVYTIYARGKNEVLAAIITG

SwissProt top hits (e value, % identity, alignment)
Q9FHY8 Protein HLB1 (e value: 8.7e-162; % identity: 69.3)
Query:  MRELLNGLKGEDGNDSVNESEGERPEANSAYSFNQDSPHQPYSEQSRAAMELINNVTGVDEEGRSRQRILTFAAKRYASAIERNAQDYDALYNWALVLQE
        MRELL+ LK E+G+ + +         +SA  F+++S  QP   ++  AM+LIN +   DEEGRSRQR+L FAA++YASAIERN  D+DALYNWAL+LQE
Subjt:  MRELLNGLKGEDGNDSVNESEGERPEANSAYSFNQDSPHQPYSEQSRAAMELINNVTGVDEEGRSRQRILTFAAKRYASAIERNAQDYDALYNWALVLQE

Query:  SADNVSPDSTSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWK-----------------QALNNWGLALQELSAIVPV
        SADNVSPDS SPSKD LLEEACKKYDEATRLCPTL+DA+YNWAIAISDRAK+RGRTKEAEELW+                 QALNNWGL LQELS IVP 
Subjt:  SADNVSPDSTSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWK-----------------QALNNWGLALQELSAIVPV

Query:  REKQTIVRTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTANAKDVSPNELYSQSAIYIAAAHALKPNYSVYSSALRLVRSMLPLPYLKVGY
        REK+ +VRTAISKFRAAI+LQFDFHRAIYNLGTVLYGLAEDTLRTGG+ N KD+ P ELYSQSAIYIAAAH+LKP+YSVYSSALRLVRSMLPLP+LKVGY
Subjt:  REKQTIVRTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTANAKDVSPNELYSQSAIYIAAAHALKPNYSVYSSALRLVRSMLPLPYLKVGY

Query:  LTAPPVGRPLAPHSDWKRSQFFLNHDVL--------------------------QKRTIKVDIPDIVSVSACADLTLPPGAGLCIDTIHGPVFLVADSWD
        LTAPPVG  LAPHSDWKR++F LNH+ L                          +++T+KV+I +IVSV+ CADLTLPPGAGLCIDTIHGPVFLVADSW+
Subjt:  LTAPPVGRPLAPHSDWKRSQFFLNHDVL--------------------------QKRTIKVDIPDIVSVSACADLTLPPGAGLCIDTIHGPVFLVADSWD

Query:  SLDGWLDAIRLVYTIYARGKNEVLAAIITG
        SLDGWLDAIRLVYTIYARGK++VLA IITG
Subjt:  SLDGWLDAIRLVYTIYARGKNEVLAAIITG

Arabidopsis top hits (e value, % identity, alignment)
AT5G41950.1 Tetratricopeptide repeat (TPR)-like superfamily protein (e value: 6.2e-163; % identity: 69.3)
Query:  MRELLNGLKGEDGNDSVNESEGERPEANSAYSFNQDSPHQPYSEQSRAAMELINNVTGVDEEGRSRQRILTFAAKRYASAIERNAQDYDALYNWALVLQE
        MRELL+ LK E+G+ + +         +SA  F+++S  QP   ++  AM+LIN +   DEEGRSRQR+L FAA++YASAIERN  D+DALYNWAL+LQE
Subjt:  MRELLNGLKGEDGNDSVNESEGERPEANSAYSFNQDSPHQPYSEQSRAAMELINNVTGVDEEGRSRQRILTFAAKRYASAIERNAQDYDALYNWALVLQE

Query:  SADNVSPDSTSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWK-----------------QALNNWGLALQELSAIVPV
        SADNVSPDS SPSKD LLEEACKKYDEATRLCPTL+DA+YNWAIAISDRAK+RGRTKEAEELW+                 QALNNWGL LQELS IVP 
Subjt:  SADNVSPDSTSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWK-----------------QALNNWGLALQELSAIVPV

Query:  REKQTIVRTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTANAKDVSPNELYSQSAIYIAAAHALKPNYSVYSSALRLVRSMLPLPYLKVGY
        REK+ +VRTAISKFRAAI+LQFDFHRAIYNLGTVLYGLAEDTLRTGG+ N KD+ P ELYSQSAIYIAAAH+LKP+YSVYSSALRLVRSMLPLP+LKVGY
Subjt:  REKQTIVRTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTANAKDVSPNELYSQSAIYIAAAHALKPNYSVYSSALRLVRSMLPLPYLKVGY

Query:  LTAPPVGRPLAPHSDWKRSQFFLNHDVL--------------------------QKRTIKVDIPDIVSVSACADLTLPPGAGLCIDTIHGPVFLVADSWD
        LTAPPVG  LAPHSDWKR++F LNH+ L                          +++T+KV+I +IVSV+ CADLTLPPGAGLCIDTIHGPVFLVADSW+
Subjt:  LTAPPVGRPLAPHSDWKRSQFFLNHDVL--------------------------QKRTIKVDIPDIVSVSACADLTLPPGAGLCIDTIHGPVFLVADSWD

Query:  SLDGWLDAIRLVYTIYARGKNEVLAAIITG
        SLDGWLDAIRLVYTIYARGK++VLA IITG
Subjt:  SLDGWLDAIRLVYTIYARGKNEVLAAIITG


Sequences
CDS sequence
ATGAGGGAGTTGCTGAATGGATTGAAAGGTGAAGATGGTAACGACAGCGTTAACGAATCTGAAGGCGAGAGGCCCGAGGCAAACTCCGCTTACAGTTTTAATCAA
GATAGCCCACATCAGCCTTATTCTGAACAGAGCAGAGCTGCTATGGAGTTGATCAACAATGTTACAGGTGTTGATGAGGAGGGTCGTTCTCGTCAACGGATTCTC
ACATTTGCTGCTAAGAGGTATGCTAGTGCAATTGAGAGAAATGCTCAAGACTATGATGCTCTATACAATTGGGCATTGGTCCTCCAGGAGAGTGCAGATAATGTT
AGTCCAGATTCCACTTCACCTTCTAAAGATGCATTGCTTGAGGAGGCTTGTAAAAAGTATGATGAGGCTACCCGTCTTTGCCCAACACTTCATGATGCTTTCTAT
AATTGGGCTATTGCAATCTCTGATCGGGCCAAAATGCGTGGTCGTACAAAGGAGGCTGAAGAACTATGGAAGCAGGCGCTAAATAATTGGGGACTTGCTCTACAG
GAACTCAGTGCGATTGTGCCAGTACGAGAAAAGCAGACAATTGTAAGAACAGCTATCAGTAAGTTTCGTGCTGCAATACAGTTGCAATTTGATTTTCATCGAGCA
ATCTACAATCTTGGTACTGTTCTGTATGGACTAGCTGAGGACACATTACGGACTGGTGGAACAGCAAATGCTAAGGACGTTTCCCCTAATGAGTTGTACAGCCAA
TCTGCTATTTATATCGCAGCTGCTCATGCTCTAAAACCAAATTACTCTGTTTACAGCAGTGCCTTGCGGTTGGTTCGTTCAATGCTGCCGTTACCCTATCTTAAA
GTTGGATACCTGACTGCACCTCCTGTGGGGAGACCACTTGCTCCTCATAGTGATTGGAAACGTTCACAATTTTTTCTAAATCATGATGTGTTGCAAAAGAGGACA
ATCAAAGTAGATATTCCAGATATCGTCTCTGTATCAGCATGTGCAGATCTAACTCTACCACCCGGTGCTGGACTCTGCATCGACACAATCCATGGACCGGTTTTC
TTGGTCGCTGACTCGTGGGACTCGCTCGATGGATGGCTCGATGCAATTCGATTAGTTTACACAATCTATGCCCGAGGCAAGAACGAGGTTTTGGCTGCAATCATA
ACAGGCTGA
mRNA sequence
ATGAGGGAGTTGCTGAATGGATTGAAAGGTGAAGATGGTAACGACAGCGTTAACGAATCTGAAGGCGAGAGGCCCGAGGCAAACTCCGCTTACAGTTTTAATCAA
GATAGCCCACATCAGCCTTATTCTGAACAGAGCAGAGCTGCTATGGAGTTGATCAACAATGTTACAGGTGTTGATGAGGAGGGTCGTTCTCGTCAACGGATTCTC
ACATTTGCTGCTAAGAGGTATGCTAGTGCAATTGAGAGAAATGCTCAAGACTATGATGCTCTATACAATTGGGCATTGGTCCTCCAGGAGAGTGCAGATAATGTT
AGTCCAGATTCCACTTCACCTTCTAAAGATGCATTGCTTGAGGAGGCTTGTAAAAAGTATGATGAGGCTACCCGTCTTTGCCCAACACTTCATGATGCTTTCTAT
AATTGGGCTATTGCAATCTCTGATCGGGCCAAAATGCGTGGTCGTACAAAGGAGGCTGAAGAACTATGGAAGCAGGCGCTAAATAATTGGGGACTTGCTCTACAG
GAACTCAGTGCGATTGTGCCAGTACGAGAAAAGCAGACAATTGTAAGAACAGCTATCAGTAAGTTTCGTGCTGCAATACAGTTGCAATTTGATTTTCATCGAGCA
ATCTACAATCTTGGTACTGTTCTGTATGGACTAGCTGAGGACACATTACGGACTGGTGGAACAGCAAATGCTAAGGACGTTTCCCCTAATGAGTTGTACAGCCAA
TCTGCTATTTATATCGCAGCTGCTCATGCTCTAAAACCAAATTACTCTGTTTACAGCAGTGCCTTGCGGTTGGTTCGTTCAATGCTGCCGTTACCCTATCTTAAA
GTTGGATACCTGACTGCACCTCCTGTGGGGAGACCACTTGCTCCTCATAGTGATTGGAAACGTTCACAATTTTTTCTAAATCATGATGTGTTGCAAAAGAGGACA
ATCAAAGTAGATATTCCAGATATCGTCTCTGTATCAGCATGTGCAGATCTAACTCTACCACCCGGTGCTGGACTCTGCATCGACACAATCCATGGACCGGTTTTC
TTGGTCGCTGACTCGTGGGACTCGCTCGATGGATGGCTCGATGCAATTCGATTAGTTTACACAATCTATGCCCGAGGCAAGAACGAGGTTTTGGCTGCAATCATA
ACAGGCTGA
Protein sequence
MRELLNGLKGEDGNDSVNESEGERPEANSAYSFNQDSPHQPYSEQSRAAMELINNVTGVDEEGRSRQRILTFAAKRYASAIERNAQDYDALYNWALVLQESADNV
SPDSTSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQALNNWGLALQELSAIVPVREKQTIVRTAISKFRAAIQLQFDFHRA
IYNLGTVLYGLAEDTLRTGGTANAKDVSPNELYSQSAIYIAAAHALKPNYSVYSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHDVLQKRT
IKVDIPDIVSVSACADLTLPPGAGLCIDTIHGPVFLVADSWDSLDGWLDAIRLVYTIYARGKNEVLAAIITG
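
The CDS and protein sequences listed above can be cross-checked by translating the CDS codon by codon. The sketch below assumes the standard nuclear genetic code and uses a hypothetical `translate` helper. Only the first ten and last three codons of the CDS are shown, to keep the example short.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence in frame 1; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length must be a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First 30 nt and last 9 nt of the CDS sequence shown above.
print(translate("ATGAGGGAGTTGCTGAATGGATTGAAAGGT"))  # prints: MRELLNGLKG
print(translate("ACAGGCTGA"))                       # prints: TG*
```

Both fragments agree with the start (MRELLNGLKG) and end (ITG, followed by the stop codon) of the protein sequence above.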