CuGenDBv2

MS021044 (gene) of Bitter gourd (TR) v1 genome

Gene ID: MS021044
Organism: Momordica charantia cv. TR (Bitter gourd (TR) v1)
Description: protein HLB1
Genome location: scaffold290:760601..768164
RNA-Seq Expression: MS021044
Synteny: MS021044
Gene Ontology terms:
GO:0006887 - exocytosis (biological process)
GO:0048768 - root hair cell tip growth (biological process)
GO:0005769 - early endosome (cellular component)
GO:0005802 - trans-Golgi network (cellular component)
GO:0005515 - protein binding (molecular function)
InterPro domains: IPR011990 - Tetratricopeptide-like helical domain superfamily
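
The genome location in this record uses a `scaffold:start..end` notation. The following is a minimal sketch of parsing such a string, assuming the coordinates are 1-based and inclusive (the page does not state the convention). The function name `parse_location` is hypothetical and not part of CuGenDB.

```python
import re


def parse_location(location):
    """Split a 'scaffold:start..end' string into (scaffold, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    scaffold, start, end = match.groups()
    return scaffold, int(start), int(end)


scaffold, start, end = parse_location("scaffold290:760601..768164")
# Assumption: 1-based inclusive coordinates, so the span is end - start + 1.
span = end - start + 1
print(scaffold, start, end, span)
```

Under that assumption the gene spans 7564 bp on scaffold290, which is longer than the 1689-nt CDS and would be consistent with introns in the gene model.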


Homology
GenBank top hits (e value, % identity, alignment)
XP_004146133.1 protein HLB1 isoform X1 [Cucumis sativus] (e value: 2.2e-243; % identity: 79.97)
Query:  MSPTPEEP-NLQNGNQTQPHIPSESQQTEESGSDPES-RVATIP----QQQRESESVDEEADAEPRSEPESRREQSSESIQLQVVTDATDPRSGDPEEAS
        MSPTPEEP NLQNG + QPHI SES Q  E  S PE   V +IP    Q++RESESV   ++  P SEPES R+Q SESI L VVT  TDP   + +E S
Subjt:  MSPTPEEP-NLQNGNQTQPHIPSESQQTEESGSDPES-RVATIP----QQQRESESVDEEADAEPRSEPESRREQSSESIQLQVVTDATDPRSGDPEEAS

Query:  IPSNG-ADNSHPALRKDEGSRTFTMRELLNGLKGDDGNDSVNESEGERPEANSGHRFVSSAVFFFLLYEFAQFSYQKKNLHSFTQDSPHQPYSEQSRAAM
         PSNG  +N  PALRKDEGSRTFTMRELLNGLKG+DG+DS+NESEGERPE NSG                          +S  QDSPHQPYSEQSRAAM
Subjt:  IPSNG-ADNSHPALRKDEGSRTFTMRELLNGLKGDDGNDSVNESEGERPEANSGHRFVSSAVFFFLLYEFAQFSYQKKNLHSFTQDSPHQPYSEQSRAAM

Query:  ELINSVTGVDEEGRSRQRILTFAARRYASAIERNAQDYDALYNWALVLQESADNVGADSSSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRA
        ELINSVTGVDEEGRSRQRILTFAARRYASAIERN QDYDALYNWALVLQESADNV  DS+SPSKDALLEEACKKYDEAT LCPTLHDAFYNWAIAISDRA
Subjt:  ELINSVTGVDEEGRSRQRILTFAARRYASAIERNAQDYDALYNWALVLQESADNVGADSSSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRA

Query:  KIRGRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVRTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGN
        K+RGRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIV+TAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGG+GN
Subjt:  KIRGRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVRTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGN

Query:  AKDVSPNDLYSQSAIYIAAAHALKPNYSVSLVLLSRIIS-LPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSPS-------
         KDVSPN+LYSQSAIYIAAAHALKPNYSV    L  + S LPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSPS       
Subjt:  AKDVSPNDLYSQSAIYIAAAHALKPNYSVSLVLLSRIIS-LPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSPS-------

Query:  -LNGERTIKVEIPDIVSVSACADLTLPPGAGLCIDTIHGPVFLVADTWDALDGWLDAIRLVYTIYARGKNDVLAGIIAG
         LNG+RTIKVEIPDIVSVSACADLTLPPGAGLCIDTIHGP+FLVAD+WD LDGWLDAIRLVYTIYARGKN+VLAGII G
Subjt:  -LNGERTIKVEIPDIVSVSACADLTLPPGAGLCIDTIHGPVFLVADTWDALDGWLDAIRLVYTIYARGKNDVLAGIIAG

XP_022145328.1 protein HLB1 [Momordica charantia] (e value: 4.7e-291; % identity: 93.79)
Query:  MSPTPEEPNLQNGNQTQPHIPSESQQTEESGSDPESRVATIPQQQRESESVDEEADAEPRSEPESRREQSSESIQLQVVTDATDPRSGDPEEASIPSNGA
        MSPTPEEPNLQNGNQTQPHIPSESQQTEESGSDPESRVATIPQQQRESESVDEEADAEPRSEPESRREQSSESIQLQVVTDATDPRSGDPEEASIPSNGA
Subjt:  MSPTPEEPNLQNGNQTQPHIPSESQQTEESGSDPESRVATIPQQQRESESVDEEADAEPRSEPESRREQSSESIQLQVVTDATDPRSGDPEEASIPSNGA

Query:  DNSHPALRKDEGSRTFTMRELLNGLKGDDGNDSVNESEGERPEANSGHRFVSSAVFFFLLYEFAQFSYQKKNLHSFTQDSPHQPYSEQSRAAMELINSVT
        DNSHPALRKDEGSRTFTMRELLNGLKGDDGNDSVNESEGERPEANSG                          HSFTQDSPHQPYSEQSRAAMELINSVT
Subjt:  DNSHPALRKDEGSRTFTMRELLNGLKGDDGNDSVNESEGERPEANSGHRFVSSAVFFFLLYEFAQFSYQKKNLHSFTQDSPHQPYSEQSRAAMELINSVT

Query:  GVDEEGRSRQRILTFAARRYASAIERNAQDYDALYNWALVLQESADNVGADSSSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAKIRGRTK
        GVDEEGRSRQRILTFAARRYASAIERNAQDYDALYNWALVLQESADNVGADSSSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAKIRGRTK
Subjt:  GVDEEGRSRQRILTFAARRYASAIERNAQDYDALYNWALVLQESADNVGADSSSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAKIRGRTK

Query:  EAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVRTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGNAKDVSPN
        EAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVRTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGNAKDVSPN
Subjt:  EAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVRTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGNAKDVSPN

Query:  DLYSQSAIYIAAAHALKPNYSVSLVLLSRIIS-LPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSPSLNGERTIKVEIPDI
        DLYSQSAIYIAAAHALKPNYSV    L  + S LPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSPSLNGERTIKVEIPDI
Subjt:  DLYSQSAIYIAAAHALKPNYSVSLVLLSRIIS-LPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSPSLNGERTIKVEIPDI

Query:  VSVSACADLTLPPGAGLCIDTIHGPVFLVADTWDALDGWLDAIRLVYTIYARGKNDVLAGIIAG
        VSVSACADLTLPPGAGLCIDTIHGPVFLVADTWDALDGWLDAIRLVYTIYARGKNDVLAGIIAG
Subjt:  VSVSACADLTLPPGAGLCIDTIHGPVFLVADTWDALDGWLDAIRLVYTIYARGKNDVLAGIIAG

XP_022965251.1 protein HLB1-like isoform X1 [Cucurbita maxima] (e value: 9.7e-244; % identity: 76.36)
Query:  MSPTPEEP-NLQNGNQTQPHIPSESQQTEESGSDPESRVATIP--------------------------------QQQRESESVDEEADAEPRSEPESRR
        MSP PEEP NLQNG + +PHI  ES Q  ES S+PES    +P                                QQ+RESESV+  AD+EP+SE +S R
Subjt:  MSPTPEEP-NLQNGNQTQPHIPSESQQTEESGSDPESRVATIP--------------------------------QQQRESESVDEEADAEPRSEPESRR

Query:  EQSSESIQLQVVTDATDPRSGDPEEASIPSNGADNSHPALRKDEGSRTFTMRELLNGLKGDDGNDSVNESEGERPEANSGHRFVSSAVFFFLLYEFAQFS
        +Q SESI+LQVVTD TDPR  +P+  SI SNGA+NS PALRKDEGSRTFTMRELLNGLK +DGNDS+NESEGE+PEANSG                    
Subjt:  EQSSESIQLQVVTDATDPRSGDPEEASIPSNGADNSHPALRKDEGSRTFTMRELLNGLKGDDGNDSVNESEGERPEANSGHRFVSSAVFFFLLYEFAQFS

Query:  YQKKNLHSFTQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARRYASAIERNAQDYDALYNWALVLQESADNVGADSSSPSKDALLEEACKK
              +S  QDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARRYASAIERN QDYDALYNWALVLQESADNV  DS+SPSKDALLEEACKK
Subjt:  YQKKNLHSFTQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARRYASAIERNAQDYDALYNWALVLQESADNVGADSSSPSKDALLEEACKK

Query:  YDEATRLCPTLHDAFYNWAIAISDRAKIRGRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVRTAISKFRAAIQLQFDF
        YDEATRLCPTLHDAFYNWAIAISDRAK+RGRTKEAEELWKQAT+NYEKAVQLNWNSPQALNNWGLALQELSAIVPAREK TIV+TAISKFRAAIQLQFDF
Subjt:  YDEATRLCPTLHDAFYNWAIAISDRAKIRGRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVRTAISKFRAAIQLQFDF

Query:  HRAIYNLGTVLYGLAEDTLRTGGTGNAKDVSPNDLYSQSAIYIAAAHALKPNYSVSLVLLSRIIS-LPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLN
        HRAIYNLGTVLYGLAEDTLRTGGTG  KDVSPN+LYSQSAIYIAAAHALKP+YSV    L  + S LPLPYLKVGYLTAPPVGRP APH DWKRSQFFLN
Subjt:  HRAIYNLGTVLYGLAEDTLRTGGTGNAKDVSPNDLYSQSAIYIAAAHALKPNYSVSLVLLSRIIS-LPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLN

Query:  HDVLQKLNIGGEQIQTSP--------SLNGERTIKVEIPDIVSVSACADLTLPPGAGLCIDTIHGPVFLVADTWDALDGWLDAIRLVYTIYARGKNDVLA
        HDVLQKLNIGGEQIQTSP        +LNG+RT+KVEIPDIVSVSACADLTLPPGAGLCIDTIHG +FLVAD+WDALDGWLDAIRLVYTIYARGKN+VLA
Subjt:  HDVLQKLNIGGEQIQTSP--------SLNGERTIKVEIPDIVSVSACADLTLPPGAGLCIDTIHGPVFLVADTWDALDGWLDAIRLVYTIYARGKNDVLA

Query:  GIIAG
        GIIAG
Subjt:  GIIAG

XP_022965252.1 protein HLB1-like isoform X2 [Cucurbita maxima] (e value: 4.2e-247; % identity: 80.24)
Query:  MSPTPEEP-NLQNGNQTQPHIPSESQQTEESGSDPESRVATIP----QQQRESESVDEEADAEPRSEPESRREQSSESIQLQVVTDATDPRSGDPEEASI
        MSP PEEP NLQNG + +PHI  ES Q  ES S+PES    IP    QQ+RESESV+  AD+EP+SE +S R+Q SESI+LQVVTD TDPR  +P+  SI
Subjt:  MSPTPEEP-NLQNGNQTQPHIPSESQQTEESGSDPESRVATIP----QQQRESESVDEEADAEPRSEPESRREQSSESIQLQVVTDATDPRSGDPEEASI

Query:  PSNGADNSHPALRKDEGSRTFTMRELLNGLKGDDGNDSVNESEGERPEANSGHRFVSSAVFFFLLYEFAQFSYQKKNLHSFTQDSPHQPYSEQSRAAMEL
         SNGA+NS PALRKDEGSRTFTMRELLNGLK +DGNDS+NESEGE+PEANSG                          +S  QDSPHQPYSEQSRAAMEL
Subjt:  PSNGADNSHPALRKDEGSRTFTMRELLNGLKGDDGNDSVNESEGERPEANSGHRFVSSAVFFFLLYEFAQFSYQKKNLHSFTQDSPHQPYSEQSRAAMEL

Query:  INSVTGVDEEGRSRQRILTFAARRYASAIERNAQDYDALYNWALVLQESADNVGADSSSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAKI
        INSVTGVDEEGRSRQRILTFAARRYASAIERN QDYDALYNWALVLQESADNV  DS+SPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAK+
Subjt:  INSVTGVDEEGRSRQRILTFAARRYASAIERNAQDYDALYNWALVLQESADNVGADSSSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAKI

Query:  RGRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVRTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGNAK
        RGRTKEAEELWKQAT+NYEKAVQLNWNSPQALNNWGLALQELSAIVPAREK TIV+TAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTG  K
Subjt:  RGRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVRTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGNAK

Query:  DVSPNDLYSQSAIYIAAAHALKPNYSVSLVLLSRIIS-LPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSP--------SL
        DVSPN+LYSQSAIYIAAAHALKP+YSV    L  + S LPLPYLKVGYLTAPPVGRP APH DWKRSQFFLNHDVLQKLNIGGEQIQTSP        +L
Subjt:  DVSPNDLYSQSAIYIAAAHALKPNYSVSLVLLSRIIS-LPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSP--------SL

Query:  NGERTIKVEIPDIVSVSACADLTLPPGAGLCIDTIHGPVFLVADTWDALDGWLDAIRLVYTIYARGKNDVLAGIIAG
        NG+RT+KVEIPDIVSVSACADLTLPPGAGLCIDTIHG +FLVAD+WDALDGWLDAIRLVYTIYARGKN+VLAGIIAG
Subjt:  NGERTIKVEIPDIVSVSACADLTLPPGAGLCIDTIHGPVFLVADTWDALDGWLDAIRLVYTIYARGKNDVLAGIIAG

XP_038876586.1 protein HLB1 [Benincasa hispida] (e value: 2.2e-243; % identity: 80.1)
Query:  MSPTPEEP-NLQNGNQTQPHIPSESQQTEESGSDPESRVATI----PQQQRESESVDEEADAEPRSEPESRREQSSESIQLQVVTDATDPRSGDPEEASI
        MSPTPEEP NLQNG + QPHI  ES QT E  S+PE     I      Q+RESESV+   +    SEP SRR+Q  ESI LQV TD  DPR  + +E SI
Subjt:  MSPTPEEP-NLQNGNQTQPHIPSESQQTEESGSDPESRVATI----PQQQRESESVDEEADAEPRSEPESRREQSSESIQLQVVTDATDPRSGDPEEASI

Query:  PSNG-ADNSHPALRKDEGSRTFTMRELLNGLKGDDGNDSVNESEGERPEANSGHRFVSSAVFFFLLYEFAQFSYQKKNLHSFTQDSPHQPYSEQSRAAME
        PSNG  +NS PALRKDEGSRTFTMRELLNGLKG+DGNDS+NESEGERPE N G                          +S  QDSPHQPYSEQSRAAME
Subjt:  PSNG-ADNSHPALRKDEGSRTFTMRELLNGLKGDDGNDSVNESEGERPEANSGHRFVSSAVFFFLLYEFAQFSYQKKNLHSFTQDSPHQPYSEQSRAAME

Query:  LINSVTGVDEEGRSRQRILTFAARRYASAIERNAQDYDALYNWALVLQESADNVGADSSSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAK
        LI+SVTGVDEEGRSRQRILTFAARRYASAIERN QDYDALYNWALVLQESADNV  DS+SPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAK
Subjt:  LINSVTGVDEEGRSRQRILTFAARRYASAIERNAQDYDALYNWALVLQESADNVGADSSSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAK

Query:  IRGRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVRTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGNA
        +RGRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIV+TAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGN 
Subjt:  IRGRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVRTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGNA

Query:  KDVSPNDLYSQSAIYIAAAHALKPNYSVSLVLLSRIIS-LPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSPS--------
        KDVSPN+LYSQSAIYIAAAHALKPNYSV    L  + S LPLPYLKVGYLTAPPVGRPLAPH DWKRSQFFLNHDVLQKLNIGGEQIQTSPS        
Subjt:  KDVSPNDLYSQSAIYIAAAHALKPNYSVSLVLLSRIIS-LPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSPS--------

Query:  LNGERTIKVEIPDIVSVSACADLTLPPGAGLCIDTIHGPVFLVADTWDALDGWLDAIRLVYTIYARGKNDVLAGIIAG
        LNG+ TIKVEIPDIVSVSACADLTLPPGAGLCIDTIHGPVFLVAD+WDALDGWLDAIRLVYTIYARGKN+VLAGII G
Subjt:  LNGERTIKVEIPDIVSVSACADLTLPPGAGLCIDTIHGPVFLVADTWDALDGWLDAIRLVYTIYARGKNDVLAGIIAG

TrEMBL top hits (e value, % identity, alignment)
A0A0A0L688 Uncharacterized protein (e value: 1.0e-243; % identity: 79.97)
Query:  MSPTPEEP-NLQNGNQTQPHIPSESQQTEESGSDPES-RVATIP----QQQRESESVDEEADAEPRSEPESRREQSSESIQLQVVTDATDPRSGDPEEAS
        MSPTPEEP NLQNG + QPHI SES Q  E  S PE   V +IP    Q++RESESV   ++  P SEPES R+Q SESI L VVT  TDP   + +E S
Subjt:  MSPTPEEP-NLQNGNQTQPHIPSESQQTEESGSDPES-RVATIP----QQQRESESVDEEADAEPRSEPESRREQSSESIQLQVVTDATDPRSGDPEEAS

Query:  IPSNG-ADNSHPALRKDEGSRTFTMRELLNGLKGDDGNDSVNESEGERPEANSGHRFVSSAVFFFLLYEFAQFSYQKKNLHSFTQDSPHQPYSEQSRAAM
         PSNG  +N  PALRKDEGSRTFTMRELLNGLKG+DG+DS+NESEGERPE NSG                          +S  QDSPHQPYSEQSRAAM
Subjt:  IPSNG-ADNSHPALRKDEGSRTFTMRELLNGLKGDDGNDSVNESEGERPEANSGHRFVSSAVFFFLLYEFAQFSYQKKNLHSFTQDSPHQPYSEQSRAAM

Query:  ELINSVTGVDEEGRSRQRILTFAARRYASAIERNAQDYDALYNWALVLQESADNVGADSSSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRA
        ELINSVTGVDEEGRSRQRILTFAARRYASAIERN QDYDALYNWALVLQESADNV  DS+SPSKDALLEEACKKYDEAT LCPTLHDAFYNWAIAISDRA
Subjt:  ELINSVTGVDEEGRSRQRILTFAARRYASAIERNAQDYDALYNWALVLQESADNVGADSSSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRA

Query:  KIRGRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVRTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGN
        K+RGRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIV+TAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGG+GN
Subjt:  KIRGRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVRTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGN

Query:  AKDVSPNDLYSQSAIYIAAAHALKPNYSVSLVLLSRIIS-LPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSPS-------
         KDVSPN+LYSQSAIYIAAAHALKPNYSV    L  + S LPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSPS       
Subjt:  AKDVSPNDLYSQSAIYIAAAHALKPNYSVSLVLLSRIIS-LPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSPS-------

Query:  -LNGERTIKVEIPDIVSVSACADLTLPPGAGLCIDTIHGPVFLVADTWDALDGWLDAIRLVYTIYARGKNDVLAGIIAG
         LNG+RTIKVEIPDIVSVSACADLTLPPGAGLCIDTIHGP+FLVAD+WD LDGWLDAIRLVYTIYARGKN+VLAGII G
Subjt:  -LNGERTIKVEIPDIVSVSACADLTLPPGAGLCIDTIHGPVFLVADTWDALDGWLDAIRLVYTIYARGKNDVLAGIIAG

A0A6J1CUX3 protein HLB1 (e value: 2.3e-291; % identity: 93.79)
Query:  MSPTPEEPNLQNGNQTQPHIPSESQQTEESGSDPESRVATIPQQQRESESVDEEADAEPRSEPESRREQSSESIQLQVVTDATDPRSGDPEEASIPSNGA
        MSPTPEEPNLQNGNQTQPHIPSESQQTEESGSDPESRVATIPQQQRESESVDEEADAEPRSEPESRREQSSESIQLQVVTDATDPRSGDPEEASIPSNGA
Subjt:  MSPTPEEPNLQNGNQTQPHIPSESQQTEESGSDPESRVATIPQQQRESESVDEEADAEPRSEPESRREQSSESIQLQVVTDATDPRSGDPEEASIPSNGA

Query:  DNSHPALRKDEGSRTFTMRELLNGLKGDDGNDSVNESEGERPEANSGHRFVSSAVFFFLLYEFAQFSYQKKNLHSFTQDSPHQPYSEQSRAAMELINSVT
        DNSHPALRKDEGSRTFTMRELLNGLKGDDGNDSVNESEGERPEANSG                          HSFTQDSPHQPYSEQSRAAMELINSVT
Subjt:  DNSHPALRKDEGSRTFTMRELLNGLKGDDGNDSVNESEGERPEANSGHRFVSSAVFFFLLYEFAQFSYQKKNLHSFTQDSPHQPYSEQSRAAMELINSVT

Query:  GVDEEGRSRQRILTFAARRYASAIERNAQDYDALYNWALVLQESADNVGADSSSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAKIRGRTK
        GVDEEGRSRQRILTFAARRYASAIERNAQDYDALYNWALVLQESADNVGADSSSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAKIRGRTK
Subjt:  GVDEEGRSRQRILTFAARRYASAIERNAQDYDALYNWALVLQESADNVGADSSSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAKIRGRTK

Query:  EAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVRTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGNAKDVSPN
        EAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVRTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGNAKDVSPN
Subjt:  EAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVRTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGNAKDVSPN

Query:  DLYSQSAIYIAAAHALKPNYSVSLVLLSRIIS-LPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSPSLNGERTIKVEIPDI
        DLYSQSAIYIAAAHALKPNYSV    L  + S LPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSPSLNGERTIKVEIPDI
Subjt:  DLYSQSAIYIAAAHALKPNYSVSLVLLSRIIS-LPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSPSLNGERTIKVEIPDI

Query:  VSVSACADLTLPPGAGLCIDTIHGPVFLVADTWDALDGWLDAIRLVYTIYARGKNDVLAGIIAG
        VSVSACADLTLPPGAGLCIDTIHGPVFLVADTWDALDGWLDAIRLVYTIYARGKNDVLAGIIAG
Subjt:  VSVSACADLTLPPGAGLCIDTIHGPVFLVADTWDALDGWLDAIRLVYTIYARGKNDVLAGIIAG

A0A6J1EA05 protein HLB1-like (e value: 6.8e-243; % identity: 76.2)
Query:  MSPTPEEP-NLQNGNQTQPHIPSESQQTEESGSDPESRVATIP--------------------------------QQQRESESVDEEADAEPRSEPESRR
        MSPTPEEP NLQNG + +PHI  ES Q  ES S+PES    +P                                QQ+RESESV+  AD+E +SE +S R
Subjt:  MSPTPEEP-NLQNGNQTQPHIPSESQQTEESGSDPESRVATIP--------------------------------QQQRESESVDEEADAEPRSEPESRR

Query:  EQSSESIQLQVVTDATDPRSGDPEEASIPSNGADNSHPALRKDEGSRTFTMRELLNGLKGDDGNDSVNESEGERPEANSGHRFVSSAVFFFLLYEFAQFS
        +Q SESIQLQV TD  DPR  +P+  SI SNG +NS PALRKDEGSRTFTMRELLNGLK +DGNDS+NESEGE+PEANSG                    
Subjt:  EQSSESIQLQVVTDATDPRSGDPEEASIPSNGADNSHPALRKDEGSRTFTMRELLNGLKGDDGNDSVNESEGERPEANSGHRFVSSAVFFFLLYEFAQFS

Query:  YQKKNLHSFTQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARRYASAIERNAQDYDALYNWALVLQESADNVGADSSSPSKDALLEEACKK
              +S  QDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARRYASAIERN QDYDALYNWALVLQESADNV  DS+SPSKDALLEEACKK
Subjt:  YQKKNLHSFTQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARRYASAIERNAQDYDALYNWALVLQESADNVGADSSSPSKDALLEEACKK

Query:  YDEATRLCPTLHDAFYNWAIAISDRAKIRGRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVRTAISKFRAAIQLQFDF
        YDEATRLCPTLHDAFYNWAIAISDRAK+RGRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIV+TAISKFRAAIQLQFDF
Subjt:  YDEATRLCPTLHDAFYNWAIAISDRAKIRGRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVRTAISKFRAAIQLQFDF

Query:  HRAIYNLGTVLYGLAEDTLRTGGTGNAKDVSPNDLYSQSAIYIAAAHALKPNYSVSLVLLSRIIS-LPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLN
        HRAIYNLGTVLYGLAEDTLRTGGTG  KDVSPN+LYSQSAIYIAAAHALKP+YSV    L  + S LPLPYLKVGYLTAPPVGRP APH DWKRSQFFLN
Subjt:  HRAIYNLGTVLYGLAEDTLRTGGTGNAKDVSPNDLYSQSAIYIAAAHALKPNYSVSLVLLSRIIS-LPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLN

Query:  HDVLQKLNIGGEQIQTSP--------SLNGERTIKVEIPDIVSVSACADLTLPPGAGLCIDTIHGPVFLVADTWDALDGWLDAIRLVYTIYARGKNDVLA
        HDVLQKLNIGGEQ QTSP        +LNG+RT+KVEIPDIVSVSACADLTLPPGAGLCIDTIHG +FLVAD+WDALDGWLDAIRLVYTIYARGKN+VLA
Subjt:  HDVLQKLNIGGEQIQTSP--------SLNGERTIKVEIPDIVSVSACADLTLPPGAGLCIDTIHGPVFLVADTWDALDGWLDAIRLVYTIYARGKNDVLA

Query:  GIIAG
        GIIAG
Subjt:  GIIAG

A0A6J1HJU5 protein HLB1-like isoform X2 (e value: 2.0e-247; % identity: 80.24)
Query:  MSPTPEEP-NLQNGNQTQPHIPSESQQTEESGSDPESRVATIP----QQQRESESVDEEADAEPRSEPESRREQSSESIQLQVVTDATDPRSGDPEEASI
        MSP PEEP NLQNG + +PHI  ES Q  ES S+PES    IP    QQ+RESESV+  AD+EP+SE +S R+Q SESI+LQVVTD TDPR  +P+  SI
Subjt:  MSPTPEEP-NLQNGNQTQPHIPSESQQTEESGSDPESRVATIP----QQQRESESVDEEADAEPRSEPESRREQSSESIQLQVVTDATDPRSGDPEEASI

Query:  PSNGADNSHPALRKDEGSRTFTMRELLNGLKGDDGNDSVNESEGERPEANSGHRFVSSAVFFFLLYEFAQFSYQKKNLHSFTQDSPHQPYSEQSRAAMEL
         SNGA+NS PALRKDEGSRTFTMRELLNGLK +DGNDS+NESEGE+PEANSG                          +S  QDSPHQPYSEQSRAAMEL
Subjt:  PSNGADNSHPALRKDEGSRTFTMRELLNGLKGDDGNDSVNESEGERPEANSGHRFVSSAVFFFLLYEFAQFSYQKKNLHSFTQDSPHQPYSEQSRAAMEL

Query:  INSVTGVDEEGRSRQRILTFAARRYASAIERNAQDYDALYNWALVLQESADNVGADSSSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAKI
        INSVTGVDEEGRSRQRILTFAARRYASAIERN QDYDALYNWALVLQESADNV  DS+SPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAK+
Subjt:  INSVTGVDEEGRSRQRILTFAARRYASAIERNAQDYDALYNWALVLQESADNVGADSSSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAKI

Query:  RGRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVRTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGNAK
        RGRTKEAEELWKQAT+NYEKAVQLNWNSPQALNNWGLALQELSAIVPAREK TIV+TAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTG  K
Subjt:  RGRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVRTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGNAK

Query:  DVSPNDLYSQSAIYIAAAHALKPNYSVSLVLLSRIIS-LPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSP--------SL
        DVSPN+LYSQSAIYIAAAHALKP+YSV    L  + S LPLPYLKVGYLTAPPVGRP APH DWKRSQFFLNHDVLQKLNIGGEQIQTSP        +L
Subjt:  DVSPNDLYSQSAIYIAAAHALKPNYSVSLVLLSRIIS-LPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSP--------SL

Query:  NGERTIKVEIPDIVSVSACADLTLPPGAGLCIDTIHGPVFLVADTWDALDGWLDAIRLVYTIYARGKNDVLAGIIAG
        NG+RT+KVEIPDIVSVSACADLTLPPGAGLCIDTIHG +FLVAD+WDALDGWLDAIRLVYTIYARGKN+VLAGIIAG
Subjt:  NGERTIKVEIPDIVSVSACADLTLPPGAGLCIDTIHGPVFLVADTWDALDGWLDAIRLVYTIYARGKNDVLAGIIAG

A0A6J1HL68 protein HLB1-like isoform X1 (e value: 4.7e-244; % identity: 76.36)
Query:  MSPTPEEP-NLQNGNQTQPHIPSESQQTEESGSDPESRVATIP--------------------------------QQQRESESVDEEADAEPRSEPESRR
        MSP PEEP NLQNG + +PHI  ES Q  ES S+PES    +P                                QQ+RESESV+  AD+EP+SE +S R
Subjt:  MSPTPEEP-NLQNGNQTQPHIPSESQQTEESGSDPESRVATIP--------------------------------QQQRESESVDEEADAEPRSEPESRR

Query:  EQSSESIQLQVVTDATDPRSGDPEEASIPSNGADNSHPALRKDEGSRTFTMRELLNGLKGDDGNDSVNESEGERPEANSGHRFVSSAVFFFLLYEFAQFS
        +Q SESI+LQVVTD TDPR  +P+  SI SNGA+NS PALRKDEGSRTFTMRELLNGLK +DGNDS+NESEGE+PEANSG                    
Subjt:  EQSSESIQLQVVTDATDPRSGDPEEASIPSNGADNSHPALRKDEGSRTFTMRELLNGLKGDDGNDSVNESEGERPEANSGHRFVSSAVFFFLLYEFAQFS

Query:  YQKKNLHSFTQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARRYASAIERNAQDYDALYNWALVLQESADNVGADSSSPSKDALLEEACKK
              +S  QDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARRYASAIERN QDYDALYNWALVLQESADNV  DS+SPSKDALLEEACKK
Subjt:  YQKKNLHSFTQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARRYASAIERNAQDYDALYNWALVLQESADNVGADSSSPSKDALLEEACKK

Query:  YDEATRLCPTLHDAFYNWAIAISDRAKIRGRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVRTAISKFRAAIQLQFDF
        YDEATRLCPTLHDAFYNWAIAISDRAK+RGRTKEAEELWKQAT+NYEKAVQLNWNSPQALNNWGLALQELSAIVPAREK TIV+TAISKFRAAIQLQFDF
Subjt:  YDEATRLCPTLHDAFYNWAIAISDRAKIRGRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVRTAISKFRAAIQLQFDF

Query:  HRAIYNLGTVLYGLAEDTLRTGGTGNAKDVSPNDLYSQSAIYIAAAHALKPNYSVSLVLLSRIIS-LPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLN
        HRAIYNLGTVLYGLAEDTLRTGGTG  KDVSPN+LYSQSAIYIAAAHALKP+YSV    L  + S LPLPYLKVGYLTAPPVGRP APH DWKRSQFFLN
Subjt:  HRAIYNLGTVLYGLAEDTLRTGGTGNAKDVSPNDLYSQSAIYIAAAHALKPNYSVSLVLLSRIIS-LPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLN

Query:  HDVLQKLNIGGEQIQTSP--------SLNGERTIKVEIPDIVSVSACADLTLPPGAGLCIDTIHGPVFLVADTWDALDGWLDAIRLVYTIYARGKNDVLA
        HDVLQKLNIGGEQIQTSP        +LNG+RT+KVEIPDIVSVSACADLTLPPGAGLCIDTIHG +FLVAD+WDALDGWLDAIRLVYTIYARGKN+VLA
Subjt:  HDVLQKLNIGGEQIQTSP--------SLNGERTIKVEIPDIVSVSACADLTLPPGAGLCIDTIHGPVFLVADTWDALDGWLDAIRLVYTIYARGKNDVLA

Query:  GIIAG
        GIIAG
Subjt:  GIIAG

SwissProt top hits (e value, % identity, alignment)
Q9FHY8 Protein HLB1 (e value: 3.4e-167; % identity: 58.11)
Query:  MSPTPEEPNLQNG------NQTQPHIPSESQQTEE--SGSDPESRVATIPQ--QQRESESVDEEADAEPRSE-----------PESRREQSSESIQLQVV
        M+ T EEP LQNG         Q  IP    QTE   +G  PE      P+  Q   +++  EE  +E + E            E++ E   E +Q  VV
Subjt:  MSPTPEEPNLQNG------NQTQPHIPSESQQTEE--SGSDPESRVATIPQ--QQRESESVDEEADAEPRSE-----------PESRREQSSESIQLQVV

Query:  TDA------TDPRSGDPEEASIPSNGA--DNSHPALRK-DEGSRTFTMRELLNGLKGDDGNDSVNESEGERPEANSGHRFVSSAVFFFLLYEFAQFSYQK
        TD        D   G  EE  I S     +++   L+K D+G++TFTMRELL+ LK ++G+ + + S                                 
Subjt:  TDA------TDPRSGDPEEASIPSNGA--DNSHPALRK-DEGSRTFTMRELLNGLKGDDGNDSVNESEGERPEANSGHRFVSSAVFFFLLYEFAQFSYQK

Query:  KNLHSFTQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARRYASAIERNAQDYDALYNWALVLQESADNVGADSSSPSKDALLEEACKKYDE
             F+++S  QP   ++  AM+LIN +   DEEGRSRQR+L FAAR+YASAIERN  D+DALYNWAL+LQESADNV  DS SPSKD LLEEACKKYDE
Subjt:  KNLHSFTQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARRYASAIERNAQDYDALYNWALVLQESADNVGADSSSPSKDALLEEACKKYDE

Query:  ATRLCPTLHDAFYNWAIAISDRAKIRGRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVRTAISKFRAAIQLQFDFHRA
        ATRLCPTL+DA+YNWAIAISDRAKIRGRTKEAEELW+QA  NYEKAVQLNWNS QALNNWGL LQELS IVPAREK+ +VRTAISKFRAAI+LQFDFHRA
Subjt:  ATRLCPTLHDAFYNWAIAISDRAKIRGRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVRTAISKFRAAIQLQFDFHRA

Query:  IYNLGTVLYGLAEDTLRTGGTGNAKDVSPNDLYSQSAIYIAAAHALKPNYSVSLVLLSRIIS-LPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHD-
        IYNLGTVLYGLAEDTLRTGG+GN KD+ P +LYSQSAIYIAAAH+LKP+YSV    L  + S LPLP+LKVGYLTAPPVG  LAPHSDWKR++F LNH+ 
Subjt:  IYNLGTVLYGLAEDTLRTGGTGNAKDVSPNDLYSQSAIYIAAAHALKPNYSVSLVLLSRIIS-LPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHD-

Query:  VLQKL---------NIGGEQIQTSPSLNGERTIKVEIPDIVSVSACADLTLPPGAGLCIDTIHGPVFLVADTWDALDGWLDAIRLVYTIYARGKNDVLAG
        +LQ L         N+ G+    S ++   +T+KV I +IVSV+ CADLTLPPGAGLCIDTIHGPVFLVAD+W++LDGWLDAIRLVYTIYARGK+DVLAG
Subjt:  VLQKL---------NIGGEQIQTSPSLNGERTIKVEIPDIVSVSACADLTLPPGAGLCIDTIHGPVFLVADTWDALDGWLDAIRLVYTIYARGKNDVLAG

Query:  IIAG
        II G
Subjt:  IIAG

Arabidopsis top hits (e value, % identity, alignment)
AT5G41950.1 Tetratricopeptide repeat (TPR)-like superfamily protein (e value: 2.4e-168; % identity: 58.11)
Query:  MSPTPEEPNLQNG------NQTQPHIPSESQQTEE--SGSDPESRVATIPQ--QQRESESVDEEADAEPRSE-----------PESRREQSSESIQLQVV
        M+ T EEP LQNG         Q  IP    QTE   +G  PE      P+  Q   +++  EE  +E + E            E++ E   E +Q  VV
Subjt:  MSPTPEEPNLQNG------NQTQPHIPSESQQTEE--SGSDPESRVATIPQ--QQRESESVDEEADAEPRSE-----------PESRREQSSESIQLQVV

Query:  TDA------TDPRSGDPEEASIPSNGA--DNSHPALRK-DEGSRTFTMRELLNGLKGDDGNDSVNESEGERPEANSGHRFVSSAVFFFLLYEFAQFSYQK
        TD        D   G  EE  I S     +++   L+K D+G++TFTMRELL+ LK ++G+ + + S                                 
Subjt:  TDA------TDPRSGDPEEASIPSNGA--DNSHPALRK-DEGSRTFTMRELLNGLKGDDGNDSVNESEGERPEANSGHRFVSSAVFFFLLYEFAQFSYQK

Query:  KNLHSFTQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARRYASAIERNAQDYDALYNWALVLQESADNVGADSSSPSKDALLEEACKKYDE
             F+++S  QP   ++  AM+LIN +   DEEGRSRQR+L FAAR+YASAIERN  D+DALYNWAL+LQESADNV  DS SPSKD LLEEACKKYDE
Subjt:  KNLHSFTQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARRYASAIERNAQDYDALYNWALVLQESADNVGADSSSPSKDALLEEACKKYDE

Query:  ATRLCPTLHDAFYNWAIAISDRAKIRGRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVRTAISKFRAAIQLQFDFHRA
        ATRLCPTL+DA+YNWAIAISDRAKIRGRTKEAEELW+QA  NYEKAVQLNWNS QALNNWGL LQELS IVPAREK+ +VRTAISKFRAAI+LQFDFHRA
Subjt:  ATRLCPTLHDAFYNWAIAISDRAKIRGRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVRTAISKFRAAIQLQFDFHRA

Query:  IYNLGTVLYGLAEDTLRTGGTGNAKDVSPNDLYSQSAIYIAAAHALKPNYSVSLVLLSRIIS-LPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHD-
        IYNLGTVLYGLAEDTLRTGG+GN KD+ P +LYSQSAIYIAAAH+LKP+YSV    L  + S LPLP+LKVGYLTAPPVG  LAPHSDWKR++F LNH+ 
Subjt:  IYNLGTVLYGLAEDTLRTGGTGNAKDVSPNDLYSQSAIYIAAAHALKPNYSVSLVLLSRIIS-LPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHD-

Query:  VLQKL---------NIGGEQIQTSPSLNGERTIKVEIPDIVSVSACADLTLPPGAGLCIDTIHGPVFLVADTWDALDGWLDAIRLVYTIYARGKNDVLAG
        +LQ L         N+ G+    S ++   +T+KV I +IVSV+ CADLTLPPGAGLCIDTIHGPVFLVAD+W++LDGWLDAIRLVYTIYARGK+DVLAG
Subjt:  VLQKL---------NIGGEQIQTSPSLNGERTIKVEIPDIVSVSACADLTLPPGAGLCIDTIHGPVFLVADTWDALDGWLDAIRLVYTIYARGKNDVLAG

Query:  IIAG
        II G
Subjt:  IIAG


Sequences
CDS sequence
ATGTCGCCTACTCCCGAGGAGCCTAATTTGCAGAACGGAAACCAAACCCAACCGCACATTCCCTCAGAATCACAGCAAACTGAAGAATCCGGATCGGACCCAGAATCCAG
AGTTGCCACAATTCCCCAACAGCAACGCGAATCAGAATCGGTTGATGAAGAAGCAGATGCGGAGCCTCGATCGGAGCCGGAGTCTCGGAGGGAACAGTCGTCGGAGTCCA
TCCAGTTGCAGGTGGTGACGGATGCCACAGATCCCAGGTCCGGTGATCCCGAGGAAGCCTCGATCCCGTCCAACGGCGCCGACAACTCGCATCCCGCCCTGCGGAAGGAC
GAAGGAAGCCGGACGTTCACCATGAGGGAGCTGCTGAATGGATTGAAAGGTGATGATGGTAACGACAGCGTTAATGAATCGGAAGGCGAGAGGCCCGAGGCTAACTCCGG
TCACAGGTTCGTTTCTTCAGCTGTATTCTTTTTTTTACTGTATGAATTTGCACAGTTTTCTTATCAAAAAAAGAATTTGCACAGTTTTACTCAAGATAGCCCACATCAGC
CTTATTCTGAACAGAGCAGAGCTGCTATGGAGTTGATCAATAGTGTTACAGGTGTTGATGAGGAGGGTCGTTCTCGCCAACGGATTCTCACATTTGCTGCTAGGAGGTAT
GCTAGTGCAATTGAGAGAAATGCTCAAGACTATGATGCTCTATACAATTGGGCATTAGTCCTCCAGGAGAGTGCAGATAATGTTGGTGCAGATTCCTCTTCACCTTCTAA
AGATGCGTTGCTTGAGGAGGCTTGTAAAAAGTATGATGAGGCTACCCGTCTTTGCCCAACACTTCATGATGCTTTTTATAATTGGGCTATTGCAATCTCTGATCGGGCCA
AAATTCGTGGTCGTACAAAGGAGGCTGAAGAACTATGGAAGCAGGCTACCAAAAACTATGAAAAAGCTGTTCAACTCAATTGGAACAGTCCCCAGGCGCTAAATAATTGG
GGACTCGCTCTACAGGAACTCAGTGCGATTGTGCCAGCACGAGAAAAGCAGACAATTGTAAGAACAGCTATCAGTAAGTTTCGTGCAGCAATACAGTTGCAATTTGATTT
TCATCGAGCAATCTACAATCTTGGTACTGTTCTGTATGGACTAGCTGAGGACACATTAAGGACCGGAGGAACAGGAAATGCTAAGGATGTTTCCCCTAATGACTTGTACA
GCCAATCTGCTATTTATATCGCAGCTGCTCATGCTCTAAAACCAAATTACTCTGTAAGCCTGGTCCTTTTGTCTCGAATAATTTCTCTGCCGTTGCCGTATCTAAAAGTT
GGATACCTGACTGCACCTCCTGTGGGGAGACCACTTGCTCCTCACAGTGATTGGAAACGTTCACAATTTTTTCTAAATCATGATGTATTGCAAAAGCTTAACATAGGGGG
GGAGCAAATACAAACATCACCTAGCTTGAATGGCGAGAGGACAATCAAAGTAGAAATTCCAGATATTGTCTCAGTATCAGCATGCGCAGATCTAACTCTACCACCCGGTG
CTGGACTCTGCATTGACACAATCCATGGACCAGTTTTCTTGGTCGCTGACACGTGGGACGCGCTTGATGGATGGCTTGATGCAATTAGATTAGTTTACACAATCTATGCT
CGAGGCAAGAACGACGTTTTGGCTGGCATCATAGCAGGC
mRNA sequence
ATGTCGCCTACTCCCGAGGAGCCTAATTTGCAGAACGGAAACCAAACCCAACCGCACATTCCCTCAGAATCACAGCAAACTGAAGAATCCGGATCGGACCCAGAATCCAG
AGTTGCCACAATTCCCCAACAGCAACGCGAATCAGAATCGGTTGATGAAGAAGCAGATGCGGAGCCTCGATCGGAGCCGGAGTCTCGGAGGGAACAGTCGTCGGAGTCCA
TCCAGTTGCAGGTGGTGACGGATGCCACAGATCCCAGGTCCGGTGATCCCGAGGAAGCCTCGATCCCGTCCAACGGCGCCGACAACTCGCATCCCGCCCTGCGGAAGGAC
GAAGGAAGCCGGACGTTCACCATGAGGGAGCTGCTGAATGGATTGAAAGGTGATGATGGTAACGACAGCGTTAATGAATCGGAAGGCGAGAGGCCCGAGGCTAACTCCGG
TCACAGGTTCGTTTCTTCAGCTGTATTCTTTTTTTTACTGTATGAATTTGCACAGTTTTCTTATCAAAAAAAGAATTTGCACAGTTTTACTCAAGATAGCCCACATCAGC
CTTATTCTGAACAGAGCAGAGCTGCTATGGAGTTGATCAATAGTGTTACAGGTGTTGATGAGGAGGGTCGTTCTCGCCAACGGATTCTCACATTTGCTGCTAGGAGGTAT
GCTAGTGCAATTGAGAGAAATGCTCAAGACTATGATGCTCTATACAATTGGGCATTAGTCCTCCAGGAGAGTGCAGATAATGTTGGTGCAGATTCCTCTTCACCTTCTAA
AGATGCGTTGCTTGAGGAGGCTTGTAAAAAGTATGATGAGGCTACCCGTCTTTGCCCAACACTTCATGATGCTTTTTATAATTGGGCTATTGCAATCTCTGATCGGGCCA
AAATTCGTGGTCGTACAAAGGAGGCTGAAGAACTATGGAAGCAGGCTACCAAAAACTATGAAAAAGCTGTTCAACTCAATTGGAACAGTCCCCAGGCGCTAAATAATTGG
GGACTCGCTCTACAGGAACTCAGTGCGATTGTGCCAGCACGAGAAAAGCAGACAATTGTAAGAACAGCTATCAGTAAGTTTCGTGCAGCAATACAGTTGCAATTTGATTT
TCATCGAGCAATCTACAATCTTGGTACTGTTCTGTATGGACTAGCTGAGGACACATTAAGGACCGGAGGAACAGGAAATGCTAAGGATGTTTCCCCTAATGACTTGTACA
GCCAATCTGCTATTTATATCGCAGCTGCTCATGCTCTAAAACCAAATTACTCTGTAAGCCTGGTCCTTTTGTCTCGAATAATTTCTCTGCCGTTGCCGTATCTAAAAGTT
GGATACCTGACTGCACCTCCTGTGGGGAGACCACTTGCTCCTCACAGTGATTGGAAACGTTCACAATTTTTTCTAAATCATGATGTATTGCAAAAGCTTAACATAGGGGG
GGAGCAAATACAAACATCACCTAGCTTGAATGGCGAGAGGACAATCAAAGTAGAAATTCCAGATATTGTCTCAGTATCAGCATGCGCAGATCTAACTCTACCACCCGGTG
CTGGACTCTGCATTGACACAATCCATGGACCAGTTTTCTTGGTCGCTGACACGTGGGACGCGCTTGATGGATGGCTTGATGCAATTAGATTAGTTTACACAATCTATGCT
CGAGGCAAGAACGACGTTTTGGCTGGCATCATAGCAGGC
Protein sequence
MSPTPEEPNLQNGNQTQPHIPSESQQTEESGSDPESRVATIPQQQRESESVDEEADAEPRSEPESRREQSSESIQLQVVTDATDPRSGDPEEASIPSNGADNSHPALRKD
EGSRTFTMRELLNGLKGDDGNDSVNESEGERPEANSGHRFVSSAVFFFLLYEFAQFSYQKKNLHSFTQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARRY
ASAIERNAQDYDALYNWALVLQESADNVGADSSSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAKIRGRTKEAEELWKQATKNYEKAVQLNWNSPQALNNW
GLALQELSAIVPAREKQTIVRTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGNAKDVSPNDLYSQSAIYIAAAHALKPNYSVSLVLLSRIISLPLPYLKV
GYLTAPPVGRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSPSLNGERTIKVEIPDIVSVSACADLTLPPGAGLCIDTIHGPVFLVADTWDALDGWLDAIRLVYTIYA
RGKNDVLAGIIAG
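
The CDS and protein sequences listed in this record should correspond codon by codon. The following is a minimal sketch of how that could be checked, assuming the standard genetic code (NCBI translation table 1) applies to this nuclear gene. The helper name `translate` is hypothetical and not part of CuGenDB, and the example input is only the first 24 nucleotides of the CDS.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons ordered
# by first, second, and third base over T, C, A, G. '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a coding sequence into one-letter amino acid codes."""
    # Remove line breaks so that multi-line sequence blocks can be pasted in.
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    # A codon containing a base other than A, C, G, or T raises KeyError.
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First 24 nucleotides of the CDS sequence from this record.
print(translate("ATGTCGCCTACTCCCGAGGAGCCT"))  # prints "MSPTPEEP"
```

To check the whole record, paste the full CDS block into `translate` and compare the result with the protein sequence lines joined into a single string.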