; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc03G01840 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc03G01840
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Descriptionp-hydroxybenzoic acid efflux pump subunit aaeB
Genome locationClcChr03:1617949..1623022
RNA-Seq ExpressionClc03G01840
SyntenyClc03G01840
Gene Ontology termsGO:0055085 - transmembrane transport (biological process)
GO:0005886 - plasma membrane (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0022857 - transmembrane transporter activity (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6607229.1 hypothetical protein SDJN03_00571, partial [Cucurbita argyrosperma subsp. sororia]2.0e-26367.64Show/hide
Query:  AASTTTNHDGRAMWFTRLASAFQAALACSVVACTTLYGPATLRRLVAFPAFSYLTAILIVTNAALGDAVRGCCLAVFATVQTVCPAMFLFWFIGPTKFSH
        AA + TNH  RA+WF RLASAF+ ALAC+VVA TTLYGP  LRR VAFPAFSYLTA+LIVTNA+LGD +RGC LA+FATVQTVCPAMFLFWFIGPTKFSH
Subjt:  AASTTTNHDGRAMWFTRLASAFQAALACSVVACTTLYGPATLRRLVAFPAFSYLTAILIVTNAALGDAVRGCCLAVFATVQTVCPAMFLFWFIGPTKFSH

Query:  ITTAVTVALASVVVVVQSSTHVLAKKIALGQIVLIYVVGFIGGAHTDPLMHPLHVAVTTALGAAASLFATLFPFPRLASLQVKKKSKIMVENMVERLSVM
        +T A+TVALAS+VVV+ +STHVLAKKIALGQIVLIYVVGFIGGA TDPLMHPLHVA TTALGAAAS+ ATL PFPRLASLQVK+KSK +VENM+ERL++M
Subjt:  ITTAVTVALASVVVVVQSSTHVLAKKIALGQIVLIYVVGFIGGAHTDPLMHPLHVAVTTALGAAASLFATLFPFPRLASLQVKKKSKIMVENMVERLSVM

Query:  VKAVLAEDSTVAAASISRANFLSSSATKLLHSIKLYQGSKQWEKLPFKICKMGWLSNSEKLEDLELALNGMELALSKIPSYPIQNNPQNYQTLKHDLNNL
        VKA+L  D ++AA SIS+A  LSSSA+KLLHSI+ +Q SK WE+LPFKI K+GWLSNSE+L++LE ALNGMELALS IPSYPI+   QN QTLKHDLN L
Subjt:  VKAVLAEDSTVAAASISRANFLSSSATKLLHSIKLYQGSKQWEKLPFKICKMGWLSNSEKLEDLELALNGMELALSKIPSYPIQNNPQNYQTLKHDLNNL

Query:  ERQISHSLKQANSCFPPSDSVTFPEVNVDDKTIIINTLNSIQIIPTNRQDLPHFFFIFCMKLLLIKTQIKTPTKLQEESKKKKIENSIDKEKNRTWVSPM
        + QIS +LKQAN+C   SDS+TFP   VDD     NTL SIQI+PTN+QDLPH FFIFCMKLL  K+QI+                     KN+TWVS M
Subjt:  ERQISHSLKQANSCFPPSDSVTFPEVNVDDKTIIINTLNSIQIIPTNRQDLPHFFFIFCMKLLLIKTQIKTPTKLQEESKKKKIENSIDKEKNRTWVSPM

Query:  NSQRVIPALKCAVSLGISVILGLIYSKENGFWGSLAVAVSIASDREPTFKVANFKVHGTMLGSVFGILSFVLFERFLIGRLLCLLPWFLFTSFLQHSTMY
        N+Q ++ ALK A+SLGI+V LGL+Y KENGFW SLAVAVSI S+RE TFKVAN KVHGTMLGSV+GILSFV+F+ FL+GRLLCLLPWF+FTSFLQH+ MY
Subjt:  NSQRVIPALKCAVSLGISVILGLIYSKENGFWGSLAVAVSIASDREPTFKVANFKVHGTMLGSVFGILSFVLFERFLIGRLLCLLPWFLFTSFLQHSTMY

Query:  GSAGGISAIVGALVVLGRTNYGSPREFAFERMIETFIGILISIVVDIIFQPKRASKLAKIQLIVSLQMLQKCINDSFCYESNTIMEDYLRSLRIQVIKLK
        GSAGG++A+VGALVVLGRTNYG+P EFAF R IETFIGI IS+VVDII QP RASK+AKIQL +SLQ LQKCI +S    S+  +E+  R+LRIQV +LK
Subjt:  GSAGGISAIVGALVVLGRTNYGSPREFAFERMIETFIGILISIVVDIIFQPKRASKLAKIQLIVSLQMLQKCINDSFCYESNTIMEDYLRSLRIQVIKLK

Query:  KLIDEAEVEPNFLFLQPFHGNSYVKMFNSLSKMVGLLALNGEAMNNLKENVREDLWRKVVENLEGDFEKFKEIMANGFVTFYEDLSSSLKSSRGIE---I
        KLI+EAEVEPNF FL PFH NSY K+  SLSK V  LA + +AM NLKE        +VVE+LEGD E+FKE+M    V+ Y D+ SSLK  R +E    
Subjt:  KLIDEAEVEPNFLFLQPFHGNSYVKMFNSLSKMVGLLALNGEAMNNLKENVREDLWRKVVENLEGDFEKFKEIMANGFVTFYEDLSSSLKSSRGIE---I

Query:  KDDNYDDIEMGKPQRIKILMDEIEKEKLVNSFLQHLGEVV----EIKDGKSEEILSLSAMVFCLSSLMKEIEEIGKATRELIEWE
        K +N DD+EMG+  RI+   DEIEKEKL+N  L+H  E+V    E KD +S+ ILSLSA+ FCL+SL++ IEEIG+A REL++WE
Subjt:  KDDNYDDIEMGKPQRIKILMDEIEKEKLVNSFLQHLGEVV----EIKDGKSEEILSLSAMVFCLSSLMKEIEEIGKATRELIEWE

KAG7036914.1 hypothetical protein SDJN02_00534, partial [Cucurbita argyrosperma subsp. argyrosperma]2.6e-26367.64Show/hide
Query:  AASTTTNHDGRAMWFTRLASAFQAALACSVVACTTLYGPATLRRLVAFPAFSYLTAILIVTNAALGDAVRGCCLAVFATVQTVCPAMFLFWFIGPTKFSH
        AA + TNH  RA+WF RLASAF+ ALAC+VVA TTLYGP  LRR VAFPAFSYLTA+LIVTNA+LGD +RGC LA+FATVQTVCPAMFLFWFIGPTKFSH
Subjt:  AASTTTNHDGRAMWFTRLASAFQAALACSVVACTTLYGPATLRRLVAFPAFSYLTAILIVTNAALGDAVRGCCLAVFATVQTVCPAMFLFWFIGPTKFSH

Query:  ITTAVTVALASVVVVVQSSTHVLAKKIALGQIVLIYVVGFIGGAHTDPLMHPLHVAVTTALGAAASLFATLFPFPRLASLQVKKKSKIMVENMVERLSVM
        +T A+TVALAS+VVV+ +STHVLAKKIALGQIVLIYVVGFIGGA TDPLMHPLHVA TTALGAAAS+ ATL PFPRLASLQVK+KSK +VENM+ERL++M
Subjt:  ITTAVTVALASVVVVVQSSTHVLAKKIALGQIVLIYVVGFIGGAHTDPLMHPLHVAVTTALGAAASLFATLFPFPRLASLQVKKKSKIMVENMVERLSVM

Query:  VKAVLAEDSTVAAASISRANFLSSSATKLLHSIKLYQGSKQWEKLPFKICKMGWLSNSEKLEDLELALNGMELALSKIPSYPIQNNPQNYQTLKHDLNNL
        VKA+L  D ++AA SIS+A  LSSSA+KLLHSI+ +Q SK WE+LPFKI K+GWLSNSE+L++LE ALNGMELALS IPSYPI+   QN QTLKHDLN L
Subjt:  VKAVLAEDSTVAAASISRANFLSSSATKLLHSIKLYQGSKQWEKLPFKICKMGWLSNSEKLEDLELALNGMELALSKIPSYPIQNNPQNYQTLKHDLNNL

Query:  ERQISHSLKQANSCFPPSDSVTFPEVNVDDKTIIINTLNSIQIIPTNRQDLPHFFFIFCMKLLLIKTQIKTPTKLQEESKKKKIENSIDKEKNRTWVSPM
        + QIS +LKQAN+C   SDS+TFP   VDD     NTL SIQI+PTN+QDLPH FFIFCMKLL  K+QI+                     KN+TWVS M
Subjt:  ERQISHSLKQANSCFPPSDSVTFPEVNVDDKTIIINTLNSIQIIPTNRQDLPHFFFIFCMKLLLIKTQIKTPTKLQEESKKKKIENSIDKEKNRTWVSPM

Query:  NSQRVIPALKCAVSLGISVILGLIYSKENGFWGSLAVAVSIASDREPTFKVANFKVHGTMLGSVFGILSFVLFERFLIGRLLCLLPWFLFTSFLQHSTMY
        N+Q ++ ALK A+SLGI+V LGL+Y KENGFW SLAVAVSI S+RE TFKVAN KVHGTMLGSV+GILSFV+F+ FL+GRLLCLLPWF+FTSFLQH+ MY
Subjt:  NSQRVIPALKCAVSLGISVILGLIYSKENGFWGSLAVAVSIASDREPTFKVANFKVHGTMLGSVFGILSFVLFERFLIGRLLCLLPWFLFTSFLQHSTMY

Query:  GSAGGISAIVGALVVLGRTNYGSPREFAFERMIETFIGILISIVVDIIFQPKRASKLAKIQLIVSLQMLQKCINDSFCYESNTIMEDYLRSLRIQVIKLK
        GSAGG++A+VGALVVLGRTNYG+P EFAF R IETFIGI IS+VVDII QP RASK+AKIQL +SLQ LQKCI +S    S+  +E+  R+LRIQV +LK
Subjt:  GSAGGISAIVGALVVLGRTNYGSPREFAFERMIETFIGILISIVVDIIFQPKRASKLAKIQLIVSLQMLQKCINDSFCYESNTIMEDYLRSLRIQVIKLK

Query:  KLIDEAEVEPNFLFLQPFHGNSYVKMFNSLSKMVGLLALNGEAMNNLKENVREDLWRKVVENLEGDFEKFKEIMANGFVTFYEDLSSSLKSSRGIE---I
        KLI+EAEVEPNF FL PFH NSY K+  SLSK V  LA + +AM NLKE        +VVE+LEGD E+FKE+M    V+ Y D+ SSLK  R +E    
Subjt:  KLIDEAEVEPNFLFLQPFHGNSYVKMFNSLSKMVGLLALNGEAMNNLKENVREDLWRKVVENLEGDFEKFKEIMANGFVTFYEDLSSSLKSSRGIE---I

Query:  KDDNYDDIEMGKPQRIKILMDEIEKEKLVNSFLQHLGEVV----EIKDGKSEEILSLSAMVFCLSSLMKEIEEIGKATRELIEWE
        K +N DD+EMG+  RI+   DEIEKEKL+N  L+H  E+V    E KD +S+ ILSLSA+ FCL+SL++ IEEIG+A REL++WE
Subjt:  KDDNYDDIEMGKPQRIKILMDEIEKEKLVNSFLQHLGEVV----EIKDGKSEEILSLSAMVFCLSSLMKEIEEIGKATRELIEWE

XP_011658042.2 uncharacterized protein LOC101209654 [Cucumis sativus]0.0e+0084.22Show/hide
Query:  AASTTTNHDGRAMWFTRLASAFQAALACSVVACTTLYGPATLRRLVAFPAFSYLTAILIVTNAALGDAVRGCCLAVFATVQTVCPAMFLFWFIGPTKFSH
        AA++TTN D RAMWFTRLASA +AALACS+VA TTLYGPATLRRLVAFPAFSYLTA LIVTNAALGDAVRGCCL VFAT+QTVCPAMFLFWFIGP KFSH
Subjt:  AASTTTNHDGRAMWFTRLASAFQAALACSVVACTTLYGPATLRRLVAFPAFSYLTAILIVTNAALGDAVRGCCLAVFATVQTVCPAMFLFWFIGPTKFSH

Query:  ITTAVTVALASVVVVVQSSTHVLAKKIALGQIVLIYVVGFIGGAHTDPLMHPLHVAVTTALGAAASLFATLFPFPRLASLQVKKKSKIMVENMVERLSVM
        ITTAVTVALASVVVV+ SSTH+LAKKIALGQIV+IYVVGFIGGAHTDPLMHPLHVA TTALGAAASL ATL PFPRLASLQVK+KSK +VENM ERLS+M
Subjt:  ITTAVTVALASVVVVVQSSTHVLAKKIALGQIVLIYVVGFIGGAHTDPLMHPLHVAVTTALGAAASLFATLFPFPRLASLQVKKKSKIMVENMVERLSVM

Query:  VKAVLAEDSTVAAASISRANFLSSSATKLLHSIKLYQGSKQWEKLPFKICKMGWLSNSEKLEDLELALNGMELALSKIPSYPIQNNPQNYQTLKHDLNNL
        VKA+LAED T+AAASISRA FLSSSATKLLHSIKLYQ SKQWEK P +ICKMGWLSNSEKLEDLE+ALNGMELALSKIPSYPIQNNPQNYQTLKHDLN L
Subjt:  VKAVLAEDSTVAAASISRANFLSSSATKLLHSIKLYQGSKQWEKLPFKICKMGWLSNSEKLEDLELALNGMELALSKIPSYPIQNNPQNYQTLKHDLNNL

Query:  ERQISHSLKQANSCFPPSDSVTFPEVNVDDKT-IIINTLNSIQIIPTNRQDLPHFFFIFCMKLLLIKTQIKTPTKLQEESKKKKIENSIDKEKNR-TWVS
        E QI+ SLKQAN+ FPPSDSVTFPE+NVD  T  +INTL SIQI PT+ QDLP+FFFIFCMKLL  KTQ+KTP K +EESK+K+I+NS +KEKNR TWVS
Subjt:  ERQISHSLKQANSCFPPSDSVTFPEVNVDDKT-IIINTLNSIQIIPTNRQDLPHFFFIFCMKLLLIKTQIKTPTKLQEESKKKKIENSIDKEKNR-TWVS

Query:  PMNSQRVIPALKCAVSLGISVILGLIYSKENGFWGSLAVAVSIASDREPTFKVANFKVHGTMLGSVFGILSFVLFERFLIGRLLCLLPWFLFTSFLQHST
         MN+QRVI ALKCA+SLGISVILGLIY+KENGFWGSLAVAVSIAS+REPTFKVAN KVHGTMLGS+FGILSFVLF++FLIGRLLCLLPWF+FTSFLQHST
Subjt:  PMNSQRVIPALKCAVSLGISVILGLIYSKENGFWGSLAVAVSIASDREPTFKVANFKVHGTMLGSVFGILSFVLFERFLIGRLLCLLPWFLFTSFLQHST

Query:  MYGSAGGISAIVGALVVLGRTNYGSPREFAFERMIETFIGILISIVVDIIFQPKRASKLAKIQLIVSLQMLQKCINDSFCYESNTIMEDYLRSLRIQVIK
        MYGSAGGISAIVGALVVLGRTNYGSP+EFAFERMIETFIGI IS+VVDIIFQPKRASKL KIQLI+SLQ+LQKCINDSFCYES+TIME  L+ LR QVI+
Subjt:  MYGSAGGISAIVGALVVLGRTNYGSPREFAFERMIETFIGILISIVVDIIFQPKRASKLAKIQLIVSLQMLQKCINDSFCYESNTIMEDYLRSLRIQVIK

Query:  LKKLIDEAEVEPNFLFLQPFHGNSYVKMFNSLSKMVGLLALNGEAMNNLKENVREDLWRKVVENLEGDFEKFKEIMANGFVTFYEDL-SSSLKSSRGIEI
        +KKLIDEAEVEPNFLFL PFHG+S++KMFNSLSKMVGLLALNGEAMNNLKE     LWRKV E LEGDFEKFKEIMANGFVTFYE+L SSSLKS +G E 
Subjt:  LKKLIDEAEVEPNFLFLQPFHGNSYVKMFNSLSKMVGLLALNGEAMNNLKENVREDLWRKVVENLEGDFEKFKEIMANGFVTFYEDL-SSSLKSSRGIEI

Query:  KDDNYDDIEMGKPQRIKILMDEIEKEKLVNSFLQHLGEVVEIKDGKSEE-ILSLSAMVFCLSSLMKEIEEIGKATRELIEWEKSFF
        K+DN  DIEMG+ QRI++ MDEIEKEKL+NSFLQHLGE+VE KDGKSEE ILSLSAM FCL+SLMKE+EE+G+A REL+EWEKS F
Subjt:  KDDNYDDIEMGKPQRIKILMDEIEKEKLVNSFLQHLGEVVEIKDGKSEE-ILSLSAMVFCLSSLMKEIEEIGKATRELIEWEKSFF

XP_022948311.1 uncharacterized protein LOC111452025 [Cucurbita moschata]3.1e-26467.9Show/hide
Query:  AASTTTNHDGRAMWFTRLASAFQAALACSVVACTTLYGPATLRRLVAFPAFSYLTAILIVTNAALGDAVRGCCLAVFATVQTVCPAMFLFWFIGPTKFSH
        AA + TNH  RA+WF RLASAF+ ALAC+VVA TTLYGP  LRR VAFPAFSYLTA+LIVTNA+LGD +RGC LA+FATVQTVCPAMFLFWFIGPTKFSH
Subjt:  AASTTTNHDGRAMWFTRLASAFQAALACSVVACTTLYGPATLRRLVAFPAFSYLTAILIVTNAALGDAVRGCCLAVFATVQTVCPAMFLFWFIGPTKFSH

Query:  ITTAVTVALASVVVVVQSSTHVLAKKIALGQIVLIYVVGFIGGAHTDPLMHPLHVAVTTALGAAASLFATLFPFPRLASLQVKKKSKIMVENMVERLSVM
        +T A+TVALAS+VVV+ +STHVLAKKIALGQIVLIYVVGFIGGA TDPLMHPLHVA TTALGAAAS+ ATL PFPRLASLQVK+KSK +VENM+ERL++M
Subjt:  ITTAVTVALASVVVVVQSSTHVLAKKIALGQIVLIYVVGFIGGAHTDPLMHPLHVAVTTALGAAASLFATLFPFPRLASLQVKKKSKIMVENMVERLSVM

Query:  VKAVLAEDSTVAAASISRANFLSSSATKLLHSIKLYQGSKQWEKLPFKICKMGWLSNSEKLEDLELALNGMELALSKIPSYPIQNNPQNYQTLKHDLNNL
        VKA+L  D ++AA SIS+A  LSSSA+KLLHSI+ +Q SK WE+LPFKI K+GWLSNSE+L++LE ALNGMELALS IPSYPI+   QN QTLKHDLN L
Subjt:  VKAVLAEDSTVAAASISRANFLSSSATKLLHSIKLYQGSKQWEKLPFKICKMGWLSNSEKLEDLELALNGMELALSKIPSYPIQNNPQNYQTLKHDLNNL

Query:  ERQISHSLKQANSCFPPSDSVTFPEVNVDDKTIIINTLNSIQIIPTNRQDLPHFFFIFCMKLLLIKTQIKTPTKLQEESKKKKIENSIDKEKNRTWVSPM
        + QIS +LKQAN+C   SDS+TFP   VDD     NTL SIQI+PTN+QDLPH FFIFCMKLL  K+QI+                     KN+TWVS M
Subjt:  ERQISHSLKQANSCFPPSDSVTFPEVNVDDKTIIINTLNSIQIIPTNRQDLPHFFFIFCMKLLLIKTQIKTPTKLQEESKKKKIENSIDKEKNRTWVSPM

Query:  NSQRVIPALKCAVSLGISVILGLIYSKENGFWGSLAVAVSIASDREPTFKVANFKVHGTMLGSVFGILSFVLFERFLIGRLLCLLPWFLFTSFLQHSTMY
        N+Q ++ ALK A+SLGI+V LGL+Y KENGFW SLAVAVSI S+RE TFKVAN KVHGTMLGSV+GILSFV+F+ FL+GRLLCLLPWF+FTSFLQH+ MY
Subjt:  NSQRVIPALKCAVSLGISVILGLIYSKENGFWGSLAVAVSIASDREPTFKVANFKVHGTMLGSVFGILSFVLFERFLIGRLLCLLPWFLFTSFLQHSTMY

Query:  GSAGGISAIVGALVVLGRTNYGSPREFAFERMIETFIGILISIVVDIIFQPKRASKLAKIQLIVSLQMLQKCINDSFCYESNTIMEDYLRSLRIQVIKLK
        GSAGG++A+VGALVVLGRTNYG+P EFAF R IETFIGI IS+VVDII QP RASK+AKIQL +SLQ LQKCI +S    S+  +E+  R+LRIQV +LK
Subjt:  GSAGGISAIVGALVVLGRTNYGSPREFAFERMIETFIGILISIVVDIIFQPKRASKLAKIQLIVSLQMLQKCINDSFCYESNTIMEDYLRSLRIQVIKLK

Query:  KLIDEAEVEPNFLFLQPFHGNSYVKMFNSLSKMVGLLALNGEAMNNLKENVREDLWRKVVENLEGDFEKFKEIMANGFVTFYEDLSSSLKSSRGIE---I
        KLI+EAEVEPNF FL PFH NSY K+  SLSK V  LA + +AM NLKE        +VVE+LEGD E+FKE+M    V+ Y D+ SSLK  R +E    
Subjt:  KLIDEAEVEPNFLFLQPFHGNSYVKMFNSLSKMVGLLALNGEAMNNLKENVREDLWRKVVENLEGDFEKFKEIMANGFVTFYEDLSSSLKSSRGIE---I

Query:  KDDNYDDIEMGKPQRIKILMDEIEKEKLVNSFLQHLGEVV----EIKDGKSEEILSLSAMVFCLSSLMKEIEEIGKATRELIEWE
        K +N DD+EMG+  RI+   DEIEKEKL+N  L+H  E+V    E KD +S+ ILSLSA+ FCLSSLM+ IEEIG+A REL++WE
Subjt:  KDDNYDDIEMGKPQRIKILMDEIEKEKLVNSFLQHLGEVV----EIKDGKSEEILSLSAMVFCLSSLMKEIEEIGKATRELIEWE

XP_022998765.1 uncharacterized protein LOC111493334 [Cucurbita maxima]2.6e-26368.03Show/hide
Query:  AASTTTNHDGRAMWFTRLASAFQAALACSVVACTTLYGPATLRRLVAFPAFSYLTAILIVTNAALGDAVRGCCLAVFATVQTVCPAMFLFWFIGPTKFSH
        AA + TNH  RA+WF RLASAF+ ALAC+VVA TTLYGP  LRR VAFPAFSYLTA+LIVTNA+LGD +RGC LA+FATVQTVCPAMFLFWFIGPTKFSH
Subjt:  AASTTTNHDGRAMWFTRLASAFQAALACSVVACTTLYGPATLRRLVAFPAFSYLTAILIVTNAALGDAVRGCCLAVFATVQTVCPAMFLFWFIGPTKFSH

Query:  ITTAVTVALASVVVVVQSSTHVLAKKIALGQIVLIYVVGFIGGAHTDPLMHPLHVAVTTALGAAASLFATLFPFPRLASLQVKKKSKIMVENMVERLSVM
        +T A+TVALAS+VVV+ +STHVLAKKIALGQIVLIYVVGFIGGA TDPLMHPLHVA TTALGAAAS+ ATL PFPRLASLQVK+KSK +VENMVERL++M
Subjt:  ITTAVTVALASVVVVVQSSTHVLAKKIALGQIVLIYVVGFIGGAHTDPLMHPLHVAVTTALGAAASLFATLFPFPRLASLQVKKKSKIMVENMVERLSVM

Query:  VKAVLAEDSTVAAASISRANFLSSSATKLLHSIKLYQGSKQWEKLPFKICKMGWLSNSEKLEDLELALNGMELALSKIPSYPIQNNPQNYQTLKHDLNNL
        VKA+L  D +VAA SIS+A  LSSSA+KLLHSI+ +Q SK WE LPFKI K+GWLSNSE+LE+LE AL GMELALS IPSYPI+   QN Q LKHDLN L
Subjt:  VKAVLAEDSTVAAASISRANFLSSSATKLLHSIKLYQGSKQWEKLPFKICKMGWLSNSEKLEDLELALNGMELALSKIPSYPIQNNPQNYQTLKHDLNNL

Query:  ERQISHSLKQANSCFPPSDSVTFPEVNVDDKTIIINTLNSIQIIPTNRQDLPHFFFIFCMKLLLIKTQIKTPTKLQEESKKKKIENSIDKEKNRTWVSPM
        + QIS +LKQAN+C   SDS+TFP   VDD     NTL SIQI+PTN+QDLPH FFIFCMKLL  K+QI+                     KN+TWVS M
Subjt:  ERQISHSLKQANSCFPPSDSVTFPEVNVDDKTIIINTLNSIQIIPTNRQDLPHFFFIFCMKLLLIKTQIKTPTKLQEESKKKKIENSIDKEKNRTWVSPM

Query:  NSQRVIPALKCAVSLGISVILGLIYSKENGFWGSLAVAVSIASDREPTFKVANFKVHGTMLGSVFGILSFVLFERFLIGRLLCLLPWFLFTSFLQHSTMY
        NSQ ++ ALK A+SLGI+V LGL+Y KENGFW SLAVAVSI S+RE TFKVAN KVHGTMLGSV+GILSFV+F+ FL+GRLLCLLPWF+FTSFLQH+ MY
Subjt:  NSQRVIPALKCAVSLGISVILGLIYSKENGFWGSLAVAVSIASDREPTFKVANFKVHGTMLGSVFGILSFVLFERFLIGRLLCLLPWFLFTSFLQHSTMY

Query:  GSAGGISAIVGALVVLGRTNYGSPREFAFERMIETFIGILISIVVDIIFQPKRASKLAKIQLIVSLQMLQKCINDSFCYESNTIMEDYLRSLRIQVIKLK
        GSAGG++A+VGALVVLGRTNYG+P EFAF R IETFIGI IS+VVDII QP RASK+AKIQL +SLQ LQKCI +S    S+  +E+  R+LRIQV +LK
Subjt:  GSAGGISAIVGALVVLGRTNYGSPREFAFERMIETFIGILISIVVDIIFQPKRASKLAKIQLIVSLQMLQKCINDSFCYESNTIMEDYLRSLRIQVIKLK

Query:  KLIDEAEVEPNFLFLQPFHGNSYVKMFNSLSKMVGLLALNGEAMNNLKENVREDLWRKVVENLEGDFEKFKEIMANGFVTFYEDLSSSLKSSRGIE---I
        KLI+EAE EPNF FL PFH NSY K+  SLSK V  LA + +AM NLKE        +VVE+LEGD E+FKE+M    V+ Y D+ SSLK  + +E    
Subjt:  KLIDEAEVEPNFLFLQPFHGNSYVKMFNSLSKMVGLLALNGEAMNNLKENVREDLWRKVVENLEGDFEKFKEIMANGFVTFYEDLSSSLKSSRGIE---I

Query:  KDDNYDDIEMGKPQRIKILMDEIEKEKLVNSFLQHLGEVV----EIKDGKSEEILSLSAMVFCLSSLMKEIEEIGKATRELIEWE
        K +N DD+EMG+  RI+   DEIEKEKL+N  L+H  E+V    E KD KS+ ILSLSA+ FCLSSLM+ IEEIG+A REL++WE
Subjt:  KDDNYDDIEMGKPQRIKILMDEIEKEKLVNSFLQHLGEVV----EIKDGKSEEILSLSAMVFCLSSLMKEIEEIGKATRELIEWE

TrEMBL top hitse value%identityAlignment
A0A0A0LUT1 Uncharacterized protein0.0e+0084.22Show/hide
Query:  AASTTTNHDGRAMWFTRLASAFQAALACSVVACTTLYGPATLRRLVAFPAFSYLTAILIVTNAALGDAVRGCCLAVFATVQTVCPAMFLFWFIGPTKFSH
        AA++TTN D RAMWFTRLASA +AALACS+VA TTLYGPATLRRLVAFPAFSYLTA LIVTNAALGDAVRGCCL VFAT+QTVCPAMFLFWFIGP KFSH
Subjt:  AASTTTNHDGRAMWFTRLASAFQAALACSVVACTTLYGPATLRRLVAFPAFSYLTAILIVTNAALGDAVRGCCLAVFATVQTVCPAMFLFWFIGPTKFSH

Query:  ITTAVTVALASVVVVVQSSTHVLAKKIALGQIVLIYVVGFIGGAHTDPLMHPLHVAVTTALGAAASLFATLFPFPRLASLQVKKKSKIMVENMVERLSVM
        ITTAVTVALASVVVV+ SSTH+LAKKIALGQIV+IYVVGFIGGAHTDPLMHPLHVA TTALGAAASL ATL PFPRLASLQVK+KSK +VENM ERLS+M
Subjt:  ITTAVTVALASVVVVVQSSTHVLAKKIALGQIVLIYVVGFIGGAHTDPLMHPLHVAVTTALGAAASLFATLFPFPRLASLQVKKKSKIMVENMVERLSVM

Query:  VKAVLAEDSTVAAASISRANFLSSSATKLLHSIKLYQGSKQWEKLPFKICKMGWLSNSEKLEDLELALNGMELALSKIPSYPIQNNPQNYQTLKHDLNNL
        VKA+LAED T+AAASISRA FLSSSATKLLHSIKLYQ SKQWEK P +ICKMGWLSNSEKLEDLE+ALNGMELALSKIPSYPIQNNPQNYQTLKHDLN L
Subjt:  VKAVLAEDSTVAAASISRANFLSSSATKLLHSIKLYQGSKQWEKLPFKICKMGWLSNSEKLEDLELALNGMELALSKIPSYPIQNNPQNYQTLKHDLNNL

Query:  ERQISHSLKQANSCFPPSDSVTFPEVNVDDKT-IIINTLNSIQIIPTNRQDLPHFFFIFCMKLLLIKTQIKTPTKLQEESKKKKIENSIDKEKNR-TWVS
        E QI+ SLKQAN+ FPPSDSVTFPE+NVD  T  +INTL SIQI PT+ QDLP+FFFIFCMKLL  KTQ+KTP K +EESK+K+I+NS +KEKNR TWVS
Subjt:  ERQISHSLKQANSCFPPSDSVTFPEVNVDDKT-IIINTLNSIQIIPTNRQDLPHFFFIFCMKLLLIKTQIKTPTKLQEESKKKKIENSIDKEKNR-TWVS

Query:  PMNSQRVIPALKCAVSLGISVILGLIYSKENGFWGSLAVAVSIASDREPTFKVANFKVHGTMLGSVFGILSFVLFERFLIGRLLCLLPWFLFTSFLQHST
         MN+QRVI ALKCA+SLGISVILGLIY+KENGFWGSLAVAVSIAS+REPTFKVAN KVHGTMLGS+FGILSFVLF++FLIGRLLCLLPWF+FTSFLQHST
Subjt:  PMNSQRVIPALKCAVSLGISVILGLIYSKENGFWGSLAVAVSIASDREPTFKVANFKVHGTMLGSVFGILSFVLFERFLIGRLLCLLPWFLFTSFLQHST

Query:  MYGSAGGISAIVGALVVLGRTNYGSPREFAFERMIETFIGILISIVVDIIFQPKRASKLAKIQLIVSLQMLQKCINDSFCYESNTIMEDYLRSLRIQVIK
        MYGSAGGISAIVGALVVLGRTNYGSP+EFAFERMIETFIGI IS+VVDIIFQPKRASKL KIQLI+SLQ+LQKCINDSFCYES+TIME  L+ LR QVI+
Subjt:  MYGSAGGISAIVGALVVLGRTNYGSPREFAFERMIETFIGILISIVVDIIFQPKRASKLAKIQLIVSLQMLQKCINDSFCYESNTIMEDYLRSLRIQVIK

Query:  LKKLIDEAEVEPNFLFLQPFHGNSYVKMFNSLSKMVGLLALNGEAMNNLKENVREDLWRKVVENLEGDFEKFKEIMANGFVTFYEDL-SSSLKSSRGIEI
        +KKLIDEAEVEPNFLFL PFHG+S++KMFNSLSKMVGLLALNGEAMNNLKE     LWRKV E LEGDFEKFKEIMANGFVTFYE+L SSSLKS +G E 
Subjt:  LKKLIDEAEVEPNFLFLQPFHGNSYVKMFNSLSKMVGLLALNGEAMNNLKENVREDLWRKVVENLEGDFEKFKEIMANGFVTFYEDL-SSSLKSSRGIEI

Query:  KDDNYDDIEMGKPQRIKILMDEIEKEKLVNSFLQHLGEVVEIKDGKSEE-ILSLSAMVFCLSSLMKEIEEIGKATRELIEWEKSFF
        K+DN  DIEMG+ QRI++ MDEIEKEKL+NSFLQHLGE+VE KDGKSEE ILSLSAM FCL+SLMKE+EE+G+A REL+EWEKS F
Subjt:  KDDNYDDIEMGKPQRIKILMDEIEKEKLVNSFLQHLGEVVEIKDGKSEE-ILSLSAMVFCLSSLMKEIEEIGKATRELIEWEKSFF

A0A0A0LXZ7 Uncharacterized protein2.7e-22157.69Show/hide
Query:  AMWFTRLASAFQAALACSVVACTTLYGPATLRRLVAFPAFSYLTAILIVTNAALGDAVRGCCLAVFATVQTVCPAMFLFWFIGPTKFSHITTAVTVALAS
        ++WFT  A+  + A+ACS++A  T+YGP  LRR V FPAFSY+TAILIVTNA LGD VRGC LA++AT+QTVCPAM +FWFIGPTKFS+ T A+TVALAS
Subjt:  AMWFTRLASAFQAALACSVVACTTLYGPATLRRLVAFPAFSYLTAILIVTNAALGDAVRGCCLAVFATVQTVCPAMFLFWFIGPTKFSHITTAVTVALAS

Query:  VVVVVQSSTHVLAKKIALGQIVLIYVVGFIGGAHTDPLMHPLHVAVTTALGAAASLFATLFPFPRLASLQVKKKSKIMVENMVERLSVMVKAVLAEDSTV
        +VVV+ SS+HVLAK+IALGQIV+IYVVGFIGG  T PLMHP+HVA TTA+G AAS  ATL PFPRLASL+VK+KSK MVEN+ ERL V+VKA LA++ TV
Subjt:  VVVVVQSSTHVLAKKIALGQIVLIYVVGFIGGAHTDPLMHPLHVAVTTALGAAASLFATLFPFPRLASLQVKKKSKIMVENMVERLSVMVKAVLAEDSTV

Query:  AAASISRANFLSSSATKLLHSIKLYQGSKQWEKLPFKICKMGWLSNSEKLEDLELALNGMELALSKIPSYPIQNNPQNYQTLKHDLNNLERQISHSLKQA
        A  S+S+A  LS+SATKLL  IK YQ S +WE +P K+CK+GWL NS+KL+DLE  + GMELALS IPSYPI   P   ++L++ +N+LE QI  SL Q 
Subjt:  AAASISRANFLSSSATKLLHSIKLYQGSKQWEKLPFKICKMGWLSNSEKLEDLELALNGMELALSKIPSYPIQNNPQNYQTLKHDLNNLERQISHSLKQA

Query:  NSCFPPSDSVTFPEVN-VDDKTIIINTLNSIQII-PTNRQDLPHFFFIFCMKLLLIKTQIKTPTKLQEESKKKKIENSIDKEKNRTWVSP---MNSQRVI
           + PSDS TFPE N  D+       +N+IQ+I PTN ++LP FFFIFC+KLL  K+Q       Q+  ++K+  N+        W  P   ++S++V+
Subjt:  NSCFPPSDSVTFPEVN-VDDKTIIINTLNSIQII-PTNRQDLPHFFFIFCMKLLLIKTQIKTPTKLQEESKKKKIENSIDKEKNRTWVSP---MNSQRVI

Query:  PALKCAVSLGISVILGLIYSKENGFWGSLAVAVSIASDREPTFKVANFKVHGTMLGSVFGILSFVLFERFLIGRLLCLLPWFLFTSFLQHSTMYGSAGGI
         ALK A+SLGISV LGLIYSKENGFW SL VAVSIA  RE TFK++N K+ GT++GSV+G+L FV+FE+FLIGRLLCLLP F+FTSFLQ S MYG+AGG+
Subjt:  PALKCAVSLGISVILGLIYSKENGFWGSLAVAVSIASDREPTFKVANFKVHGTMLGSVFGILSFVLFERFLIGRLLCLLPWFLFTSFLQHSTMYGSAGGI

Query:  SAIVGALVVLGRTNYGSPREFAFERMIETFIGILISIVVDIIFQPKRASKLAKIQLIVSLQMLQKCINDSFCYESNTIMEDYLRSLRIQVIKLKKLIDEA
        SAI+GA+++LGRTNYGSP+E AF R++ET IG+  SI+VDII  P RASKLAK QL  +L++L KCI DS  ++    ++  L+ L   V++LKKLIDEA
Subjt:  SAIVGALVVLGRTNYGSPREFAFERMIETFIGILISIVVDIIFQPKRASKLAKIQLIVSLQMLQKCINDSFCYESNTIMEDYLRSLRIQVIKLKKLIDEA

Query:  EVEPNFLFLQPFHGNSYVKMFNSLSKMVGLLALNGEAMNNLKEN--VRED--LWRKVVENLEGDFEKFKEIMANGFVTFYEDLSSSLKSSRGIEIK----
         VEPNF FL PF    Y K+  SL K V L A    ++  + +N  V ED   W K+ ENLE D E FKE MA+G V    D+ SSLKS + +E +    
Subjt:  EVEPNFLFLQPFHGNSYVKMFNSLSKMVGLLALNGEAMNNLKEN--VRED--LWRKVVENLEGDFEKFKEIMANGFVTFYEDLSSSLKSSRGIEIK----

Query:  ---DDNYDDIEMGKPQRIKILMDEIEKEKLVNSFLQHLGEVV----EIKDGKSEEILSLSAMVFCLSSLMKEIEEIGKATRELIEWE
           + +++D+EMG+ + + I M+E+EKEKL+ SF++H  EV+    E +DGK E +LS SA+ FCLSSLMKEIEEIGKATRELI+ E
Subjt:  ---DDNYDDIEMGKPQRIKILMDEIEKEKLVNSFLQHLGEVV----EIKDGKSEEILSLSAMVFCLSSLMKEIEEIGKATRELIEWE

A0A6J1CLZ8 uncharacterized protein LOC1110121893.7e-23160.33Show/hide
Query:  NHDGRAMWFTRLASAFQAALACSVVACTTLYGPATLRRLVAFPAFSYLTAILIVTNAALGDAVRGCCLAVFATVQTVCPAMFLFWFIGPTKFSHITTAVT
        +  GRA+W TRLASAF+ ALACS+VAC TLYGPA+LR  VAFPAFSYLTAILIVTNA LGDA+ G CLA+FAT+QTVCPAM +FWFIGPTKFS +TTA+T
Subjt:  NHDGRAMWFTRLASAFQAALACSVVACTTLYGPATLRRLVAFPAFSYLTAILIVTNAALGDAVRGCCLAVFATVQTVCPAMFLFWFIGPTKFSHITTAVT

Query:  VALASVVVVVQSSTHVLAKKIALGQIVLIYVVGFIGGAHTDPLMHPLHVAVTTALGAAASLFATLFPFPRLASLQVKKKSKIMVENMVERLSVMVKAVLA
        VALASVVVV+Q ST +LAK+IALGQIV+IYVVGFIGG HTDPLMHP+HVA TTA+GA AS+ ATL PFPRLASLQV+ K K +VEN  ERL ++VKA L 
Subjt:  VALASVVVVVQSSTHVLAKKIALGQIVLIYVVGFIGGAHTDPLMHPLHVAVTTALGAAASLFATLFPFPRLASLQVKKKSKIMVENMVERLSVMVKAVLA

Query:  EDSTVAAASISRANFLSSSATKLLHSIKLYQGSKQWEKLPFKICKMGWLSNSEKLEDLELALNGMELALSKI-----PSYPIQNNPQNYQTLKHDLNNLE
        +D  VAAASIS+A  LSSSA KLLHSIK YQ S QWE+LP KIC M W  N E L+DL + L GMELALS       P Y  QN          D+N LE
Subjt:  EDSTVAAASISRANFLSSSATKLLHSIKLYQGSKQWEKLPFKICKMGWLSNSEKLEDLELALNGMELALSKI-----PSYPIQNNPQNYQTLKHDLNNLE

Query:  RQISHSLKQANSCFPPSDSVTFPEVNVDDKTIIINTLNSIQIIPTNRQDLPHFFFIFCMKLLLIKTQIKTPTKLQEESKKKKIENSIDKEKNRTWVSPMN
        + ++ +L QAN+   PSD                + +N++QI+PT+     H FF FC+KLL  K Q+K PTKL+++                   +   
Subjt:  RQISHSLKQANSCFPPSDSVTFPEVNVDDKTIIINTLNSIQIIPTNRQDLPHFFFIFCMKLLLIKTQIKTPTKLQEESKKKKIENSIDKEKNRTWVSPMN

Query:  SQRVIPALKCAVSLGISVILGLIYSKENGFWGSLAVAVSIASDREPTFKVANFKVHGTMLGSVFGILSFVLFERFLIGRLLCLLPWFLFTSFLQHSTMYG
         + ++ ALK AVSLG +V LGL+YS ENGFW SLAVA+S  SDREPTF+ AN KVHGTMLGSV+G+LSFV+F   L GRLLCLLPWF+FT+FL+HSTMYG
Subjt:  SQRVIPALKCAVSLGISVILGLIYSKENGFWGSLAVAVSIASDREPTFKVANFKVHGTMLGSVFGILSFVLFERFLIGRLLCLLPWFLFTSFLQHSTMYG

Query:  SAGGISAIVGALVVLGRTNYGSPREFAFERMIETFIGILISIVVDIIFQPKRASKLAKIQLIVSLQMLQKCINDSFCYESNTIMEDYLRSLRIQVIKLKK
        SAGG+SA+VGALVVLGRTNYGSP EFAF RM+ETFIG+ ISI  D+IFQP RASKLAKIQL  +L+ LQ CI         + +ED LR+L IQV +LKK
Subjt:  SAGGISAIVGALVVLGRTNYGSPREFAFERMIETFIGILISIVVDIIFQPKRASKLAKIQLIVSLQMLQKCINDSFCYESNTIMEDYLRSLRIQVIKLKK

Query:  LIDEAEVEPNFLFLQPFHGNSYVKMFNSLSKMVGLLALNGEAMNNLKENV---REDLWRKVVENLEGDFEKFKEIMANGFVTFYEDLSSSLKSSRGIE--
        LIDEAE EPN  FL PF  + Y K+F+SLSK+V  LAL+ EAM+ LK+N+    ED W K+VE L+G  EKFKE++ N  VT Y D+ SSLKS R +E  
Subjt:  LIDEAEVEPNFLFLQPFHGNSYVKMFNSLSKMVGLLALNGEAMNNLKENV---REDLWRKVVENLEGDFEKFKEIMANGFVTFYEDLSSSLKSSRGIE--

Query:  -IKDDNYDDIEMGKPQRIKILMDEI-EKEKLVNSFLQHLGEVV-----EIKDGKSEEILSLSAMVFCLSSLMKEIEEIGKATRELIEWE
          K+D   D+EMG+PQ  KI  DEI EKE L+  FLQH  EV+       +DGK E ILSL A+ FCLS+LM+EI+EIG+ATRELI+WE
Subjt:  -IKDDNYDDIEMGKPQRIKILMDEI-EKEKLVNSFLQHLGEVV-----EIKDGKSEEILSLSAMVFCLSSLMKEIEEIGKATRELIEWE

A0A6J1G8X2 uncharacterized protein LOC1114520251.5e-26467.9Show/hide
Query:  AASTTTNHDGRAMWFTRLASAFQAALACSVVACTTLYGPATLRRLVAFPAFSYLTAILIVTNAALGDAVRGCCLAVFATVQTVCPAMFLFWFIGPTKFSH
        AA + TNH  RA+WF RLASAF+ ALAC+VVA TTLYGP  LRR VAFPAFSYLTA+LIVTNA+LGD +RGC LA+FATVQTVCPAMFLFWFIGPTKFSH
Subjt:  AASTTTNHDGRAMWFTRLASAFQAALACSVVACTTLYGPATLRRLVAFPAFSYLTAILIVTNAALGDAVRGCCLAVFATVQTVCPAMFLFWFIGPTKFSH

Query:  ITTAVTVALASVVVVVQSSTHVLAKKIALGQIVLIYVVGFIGGAHTDPLMHPLHVAVTTALGAAASLFATLFPFPRLASLQVKKKSKIMVENMVERLSVM
        +T A+TVALAS+VVV+ +STHVLAKKIALGQIVLIYVVGFIGGA TDPLMHPLHVA TTALGAAAS+ ATL PFPRLASLQVK+KSK +VENM+ERL++M
Subjt:  ITTAVTVALASVVVVVQSSTHVLAKKIALGQIVLIYVVGFIGGAHTDPLMHPLHVAVTTALGAAASLFATLFPFPRLASLQVKKKSKIMVENMVERLSVM

Query:  VKAVLAEDSTVAAASISRANFLSSSATKLLHSIKLYQGSKQWEKLPFKICKMGWLSNSEKLEDLELALNGMELALSKIPSYPIQNNPQNYQTLKHDLNNL
        VKA+L  D ++AA SIS+A  LSSSA+KLLHSI+ +Q SK WE+LPFKI K+GWLSNSE+L++LE ALNGMELALS IPSYPI+   QN QTLKHDLN L
Subjt:  VKAVLAEDSTVAAASISRANFLSSSATKLLHSIKLYQGSKQWEKLPFKICKMGWLSNSEKLEDLELALNGMELALSKIPSYPIQNNPQNYQTLKHDLNNL

Query:  ERQISHSLKQANSCFPPSDSVTFPEVNVDDKTIIINTLNSIQIIPTNRQDLPHFFFIFCMKLLLIKTQIKTPTKLQEESKKKKIENSIDKEKNRTWVSPM
        + QIS +LKQAN+C   SDS+TFP   VDD     NTL SIQI+PTN+QDLPH FFIFCMKLL  K+QI+                     KN+TWVS M
Subjt:  ERQISHSLKQANSCFPPSDSVTFPEVNVDDKTIIINTLNSIQIIPTNRQDLPHFFFIFCMKLLLIKTQIKTPTKLQEESKKKKIENSIDKEKNRTWVSPM

Query:  NSQRVIPALKCAVSLGISVILGLIYSKENGFWGSLAVAVSIASDREPTFKVANFKVHGTMLGSVFGILSFVLFERFLIGRLLCLLPWFLFTSFLQHSTMY
        N+Q ++ ALK A+SLGI+V LGL+Y KENGFW SLAVAVSI S+RE TFKVAN KVHGTMLGSV+GILSFV+F+ FL+GRLLCLLPWF+FTSFLQH+ MY
Subjt:  NSQRVIPALKCAVSLGISVILGLIYSKENGFWGSLAVAVSIASDREPTFKVANFKVHGTMLGSVFGILSFVLFERFLIGRLLCLLPWFLFTSFLQHSTMY

Query:  GSAGGISAIVGALVVLGRTNYGSPREFAFERMIETFIGILISIVVDIIFQPKRASKLAKIQLIVSLQMLQKCINDSFCYESNTIMEDYLRSLRIQVIKLK
        GSAGG++A+VGALVVLGRTNYG+P EFAF R IETFIGI IS+VVDII QP RASK+AKIQL +SLQ LQKCI +S    S+  +E+  R+LRIQV +LK
Subjt:  GSAGGISAIVGALVVLGRTNYGSPREFAFERMIETFIGILISIVVDIIFQPKRASKLAKIQLIVSLQMLQKCINDSFCYESNTIMEDYLRSLRIQVIKLK

Query:  KLIDEAEVEPNFLFLQPFHGNSYVKMFNSLSKMVGLLALNGEAMNNLKENVREDLWRKVVENLEGDFEKFKEIMANGFVTFYEDLSSSLKSSRGIE---I
        KLI+EAEVEPNF FL PFH NSY K+  SLSK V  LA + +AM NLKE        +VVE+LEGD E+FKE+M    V+ Y D+ SSLK  R +E    
Subjt:  KLIDEAEVEPNFLFLQPFHGNSYVKMFNSLSKMVGLLALNGEAMNNLKENVREDLWRKVVENLEGDFEKFKEIMANGFVTFYEDLSSSLKSSRGIE---I

Query:  KDDNYDDIEMGKPQRIKILMDEIEKEKLVNSFLQHLGEVV----EIKDGKSEEILSLSAMVFCLSSLMKEIEEIGKATRELIEWE
        K +N DD+EMG+  RI+   DEIEKEKL+N  L+H  E+V    E KD +S+ ILSLSA+ FCLSSLM+ IEEIG+A REL++WE
Subjt:  KDDNYDDIEMGKPQRIKILMDEIEKEKLVNSFLQHLGEVV----EIKDGKSEEILSLSAMVFCLSSLMKEIEEIGKATRELIEWE

A0A6J1KDE1 uncharacterized protein LOC1114933341.3e-26368.03Show/hide
Query:  AASTTTNHDGRAMWFTRLASAFQAALACSVVACTTLYGPATLRRLVAFPAFSYLTAILIVTNAALGDAVRGCCLAVFATVQTVCPAMFLFWFIGPTKFSH
        AA + TNH  RA+WF RLASAF+ ALAC+VVA TTLYGP  LRR VAFPAFSYLTA+LIVTNA+LGD +RGC LA+FATVQTVCPAMFLFWFIGPTKFSH
Subjt:  AASTTTNHDGRAMWFTRLASAFQAALACSVVACTTLYGPATLRRLVAFPAFSYLTAILIVTNAALGDAVRGCCLAVFATVQTVCPAMFLFWFIGPTKFSH

Query:  ITTAVTVALASVVVVVQSSTHVLAKKIALGQIVLIYVVGFIGGAHTDPLMHPLHVAVTTALGAAASLFATLFPFPRLASLQVKKKSKIMVENMVERLSVM
        +T A+TVALAS+VVV+ +STHVLAKKIALGQIVLIYVVGFIGGA TDPLMHPLHVA TTALGAAAS+ ATL PFPRLASLQVK+KSK +VENMVERL++M
Subjt:  ITTAVTVALASVVVVVQSSTHVLAKKIALGQIVLIYVVGFIGGAHTDPLMHPLHVAVTTALGAAASLFATLFPFPRLASLQVKKKSKIMVENMVERLSVM

Query:  VKAVLAEDSTVAAASISRANFLSSSATKLLHSIKLYQGSKQWEKLPFKICKMGWLSNSEKLEDLELALNGMELALSKIPSYPIQNNPQNYQTLKHDLNNL
        VKA+L  D +VAA SIS+A  LSSSA+KLLHSI+ +Q SK WE LPFKI K+GWLSNSE+LE+LE AL GMELALS IPSYPI+   QN Q LKHDLN L
Subjt:  VKAVLAEDSTVAAASISRANFLSSSATKLLHSIKLYQGSKQWEKLPFKICKMGWLSNSEKLEDLELALNGMELALSKIPSYPIQNNPQNYQTLKHDLNNL

Query:  ERQISHSLKQANSCFPPSDSVTFPEVNVDDKTIIINTLNSIQIIPTNRQDLPHFFFIFCMKLLLIKTQIKTPTKLQEESKKKKIENSIDKEKNRTWVSPM
        + QIS +LKQAN+C   SDS+TFP   VDD     NTL SIQI+PTN+QDLPH FFIFCMKLL  K+QI+                     KN+TWVS M
Subjt:  ERQISHSLKQANSCFPPSDSVTFPEVNVDDKTIIINTLNSIQIIPTNRQDLPHFFFIFCMKLLLIKTQIKTPTKLQEESKKKKIENSIDKEKNRTWVSPM

Query:  NSQRVIPALKCAVSLGISVILGLIYSKENGFWGSLAVAVSIASDREPTFKVANFKVHGTMLGSVFGILSFVLFERFLIGRLLCLLPWFLFTSFLQHSTMY
        NSQ ++ ALK A+SLGI+V LGL+Y KENGFW SLAVAVSI S+RE TFKVAN KVHGTMLGSV+GILSFV+F+ FL+GRLLCLLPWF+FTSFLQH+ MY
Subjt:  NSQRVIPALKCAVSLGISVILGLIYSKENGFWGSLAVAVSIASDREPTFKVANFKVHGTMLGSVFGILSFVLFERFLIGRLLCLLPWFLFTSFLQHSTMY

Query:  GSAGGISAIVGALVVLGRTNYGSPREFAFERMIETFIGILISIVVDIIFQPKRASKLAKIQLIVSLQMLQKCINDSFCYESNTIMEDYLRSLRIQVIKLK
        GSAGG++A+VGALVVLGRTNYG+P EFAF R IETFIGI IS+VVDII QP RASK+AKIQL +SLQ LQKCI +S    S+  +E+  R+LRIQV +LK
Subjt:  GSAGGISAIVGALVVLGRTNYGSPREFAFERMIETFIGILISIVVDIIFQPKRASKLAKIQLIVSLQMLQKCINDSFCYESNTIMEDYLRSLRIQVIKLK

Query:  KLIDEAEVEPNFLFLQPFHGNSYVKMFNSLSKMVGLLALNGEAMNNLKENVREDLWRKVVENLEGDFEKFKEIMANGFVTFYEDLSSSLKSSRGIE---I
        KLI+EAE EPNF FL PFH NSY K+  SLSK V  LA + +AM NLKE        +VVE+LEGD E+FKE+M    V+ Y D+ SSLK  + +E    
Subjt:  KLIDEAEVEPNFLFLQPFHGNSYVKMFNSLSKMVGLLALNGEAMNNLKENVREDLWRKVVENLEGDFEKFKEIMANGFVTFYEDLSSSLKSSRGIE---I

Query:  KDDNYDDIEMGKPQRIKILMDEIEKEKLVNSFLQHLGEVV----EIKDGKSEEILSLSAMVFCLSSLMKEIEEIGKATRELIEWE
        K +N DD+EMG+  RI+   DEIEKEKL+N  L+H  E+V    E KD KS+ ILSLSA+ FCLSSLM+ IEEIG+A REL++WE
Subjt:  KDDNYDDIEMGKPQRIKILMDEIEKEKLVNSFLQHLGEVV----EIKDGKSEEILSLSAMVFCLSSLMKEIEEIGKATRELIEWE

SwissProt top hitse value%identityAlignment
D5CDA0 p-hydroxybenzoic acid efflux pump subunit AaeB5.2e-0424.61Show/hide
Query:  SQRVIPALKCAVSLGISVILGLIYSKENGFWGSLAVAV-----SIASDREPTFKVANF----KVHGTMLGSVFGILSFVLFERFLIGRLLCLLPWFLFTS
        SQ +  A+K A ++ +++ +G  +  E   W  L  A+     + A+  EP      +    ++ GT +G +  +   +L  R  +  +L    W  F +
Subjt:  SQRVIPALKCAVSLGISVILGLIYSKENGFWGSLAVAV-----SIASDREPTFKVANF----KVHGTMLGSVFGILSFVLFERFLIGRLLCLLPWFLFTS

Query:  FL------QHSTMYGSAGGISAIVGALVVLGRTNYGSPREFAFERMIETFIGILISIVVDIIFQPKRASKLAKIQL----IVSLQMLQKCI
        +L      ++S  +G AG  + I+   V+  +T      +FA ER  E  IGI+ +IV D++F P+   +    +L    +   Q++Q CI
Subjt:  FL------QHSTMYGSAGGISAIVGALVVLGRTNYGSPREFAFERMIETFIGILISIVVDIIFQPKRASKLAKIQL----IVSLQMLQKCI

Arabidopsis top hitse value%identityAlignment
AT2G28780.1 unknown protein2.3e-15643.11Show/hide
Query:  GRAMWFTRLASAFQAALACSVVACTTLYGPATLRRLVAFPAFSYLTAILIVTNAALGDAVRGCCLAVFATVQTVCPAMFLFWFIGPTKFSHITTAVTVAL
        GRAMW T LASAF+ ALAC++V   TLYGP  + R VAFPAFSY+T ILI+T+A LGD +RGC LA++AT Q+V PA+     I P + +  TTA+  AL
Subjt:  GRAMWFTRLASAFQAALACSVVACTTLYGPATLRRLVAFPAFSYLTAILIVTNAALGDAVRGCCLAVFATVQTVCPAMFLFWFIGPTKFSHITTAVTVAL

Query:  ASVVVVV-QSSTHVLAKKIALGQIVLIYVVGFIGGAHTDPLMHPLHVAVTTALGAAASLFATLFPFPRLASLQVKKKSKIMVENMVERLSVMVKAVLAED
        A+ VVV+  SSTH++AK+IALGQIVLIYV+G+I GA TDP+MHPL VA +TALG  A + A L P PRLA+ +VK+  K + +N+  R+ + +KA  ++D
Subjt:  ASVVVVV-QSSTHVLAKKIALGQIVLIYVVGFIGGAHTDPLMHPLHVAVTTALGAAASLFATLFPFPRLASLQVKKKSKIMVENMVERLSVMVKAVLAED

Query:  STVAAASISRANFLSSSATKLLHSIKLYQGSKQWEKLPFKICKMGWLSNS--EKLEDLELALNGMELAL---SKIPSYPIQNNPQNYQTLKHDLNNLERQ
        S  A AS+S+A  L+ S++KL  ++K YQ S  WE+LPFKI +   ++++  EKL+ +E+AL GME+ +   S IPS  +         +K DL N++ +
Subjt:  STVAAASISRANFLSSSATKLLHSIKLYQGSKQWEKLPFKICKMGWLSNS--EKLEDLELALNGMELAL---SKIPSYPIQNNPQNYQTLKHDLNNLERQ

Query:  ISHSLKQANSCFPPSDSVTFPEVNVDDKTIIINTLNSIQIIPTNRQDLPHFFFIFCMKLLLIKTQIKTPTKLQEESKKKKIENSIDKEKNRTWVSPMNSQ
        +  S+K+ N+   PS +      N D+       L ++Q IP   QDLP +FF+FC++LL      K      EE+K K +EN   K K R+W+S  +S+
Subjt:  ISHSLKQANSCFPPSDSVTFPEVNVDDKTIIINTLNSIQIIPTNRQDLPHFFFIFCMKLLLIKTQIKTPTKLQEESKKKKIENSIDKEKNRTWVSPMNSQ

Query:  RVIPALKCAVSLGISVILGLIYSKENGFWGSLAVAVSIASDREPTFKVANFKVHGTMLGSVFGILSFVLFERFLIGRLLCLLPWFLFTSFLQHSTMYGSA
        +++PALK ++SLG++++LG ++SK NG+W  L VAVS A+ RE TFKV N K  GT++G+V+G++   +F++FL  R L LLPWFLF+SFL  S MYG A
Subjt:  RVIPALKCAVSLGISVILGLIYSKENGFWGSLAVAVSIASDREPTFKVANFKVHGTMLGSVFGILSFVLFERFLIGRLLCLLPWFLFTSFLQHSTMYGSA

Query:  GGISAIVGALVVLGRTNYGSPREFAFERMIETFIGILISIVVDIIFQPKRASKLAKIQLIVSLQMLQKCINDSFCYESNTIMEDYLRSLRIQVIKLKKLI
        GGISA +GA+++LGR N+G P EFA ER+IETFIG+  SI+V+++FQP RA+ +AK++L  S   L +C +      S   + +  + LR  + +LKK  
Subjt:  GGISAIVGALVVLGRTNYGSPREFAFERMIETFIGILISIVVDIIFQPKRASKLAKIQLIVSLQMLQKCINDSFCYESNTIMEDYLRSLRIQVIKLKKLI

Query:  DEAEVEPNFLFLQPFHGNSYVKMFNSLSKMVGLLALNGEAMNNLKE--NVREDLWRKVVENLEGDFEKFKEIMANGFVTFYE-DLSSSLKSSRGIEIKDD
         EA  EP+F F  PF+ + Y K+F SLSKM  LL  +G A+  L E    +    ++++ N++ D +   E +     +F E  L  SL +      K D
Subjt:  DEAEVEPNFLFLQPFHGNSYVKMFNSLSKMVGLLALNGEAMNNLKE--NVREDLWRKVVENLEGDFEKFKEIMANGFVTFYE-DLSSSLKSSRGIEIKDD

Query:  NYD-DIEMGKPQRIKILMDEIEKEKLVNSFLQHLGEVV------------EIKDGKSEEILSLSAMVFCLSSLMKEIEEIGKATRELIEWE
        N   DIE+GK           E EK++ ++LQH   V             E++  KSE +LSL A+ FC+  + KE  EI +  +E+++ E
Subjt:  NYD-DIEMGKPQRIKILMDEIEKEKLVNSFLQHLGEVV------------EIKDGKSEEILSLSAMVFCLSSLMKEIEEIGKATRELIEWE

AT3G09450.1 CONTAINS InterPro DOMAIN/s: Fusaric acid resistance protein, conserved region (InterPro:IPR006726)1.7e-9033.25Show/hide
Query:  WFTRLASAFQAALACSVVACTTLYGPATLRRLVAFPAFSYLTAILIVTNAA---LGDAVRGCCLAV-FATVQTVCPAMFLFWFIGPTKFSH-ITTAVTVA
        W  RL  A + A+AC +V+ TTLYGP  LR    FPAFSYLT ILI  + A    G+ ++ CCL V +AT QT+  A+     +GP    + +   V VA
Subjt:  WFTRLASAFQAALACSVVACTTLYGPATLRRLVAFPAFSYLTAILIVTNAA---LGDAVRGCCLAV-FATVQTVCPAMFLFWFIGPTKFSH-ITTAVTVA

Query:  LASVVVVVQSSTHVLAKKIALGQIVLIYVVGFI-GGAHTDPLMHPLHVAVTTALGAAASLFATLFPFPRLASLQVKKKSKIMVENMVERLSVMVKAVLAE
        LAS +V    ST +L K+IA GQIV++YV   +  G      M P+HVA +TALGA ASL A L PFPRLA  Q+ K  K+  EN +ERL++ V+ ++A 
Subjt:  LASVVVVVQSSTHVLAKKIALGQIVLIYVVGFI-GGAHTDPLMHPLHVAVTTALGAAASLFATLFPFPRLASLQVKKKSKIMVENMVERLSVMVKAVLAE

Query:  DSTVAAASISRANFLSSSATKLLHSIKLYQGSKQWEKLPFK-ICKMGWLSNSEKLEDLELALNGMELALSKIPSYPIQNNPQNYQTLKHDLNNLERQISH
        D+T A   I+RA  LS++A   L +IK++     WE+   + + +   L  +EKL   +  L G+ELAL    S+P   +      L      LE   +H
Subjt:  DSTVAAASISRANFLSSSATKLLHSIKLYQGSKQWEKLPFK-ICKMGWLSNSEKLEDLELALNGMELALSKIPSYPIQNNPQNYQTLKHDLNNLERQISH

Query:  SLKQANSCFPPSDSVTFPEVNVDDKTIIINTLNSIQIIPTNRQDLPHFFFIFCMKL-----LLIKTQIKTPTKLQEESKKKKIENSIDKEKNRTW---VS
           ++ S     DS+ +                  +    +   LP  FF +C++L     L ++   K+      E +       +   + + W     
Subjt:  SLKQANSCFPPSDSVTFPEVNVDDKTIIINTLNSIQIIPTNRQDLPHFFFIFCMKL-----LLIKTQIKTPTKLQEESKKKKIENSIDKEKNRTW---VS

Query:  PMNSQRVIPALKCAVSLGISVILGLIYSKENGFWGSLAVAVSIASDREPTFKVANFKVHGTMLGSVFGILSFVLFERFLIGRLLCLLPWFLFTSFLQHST
         M  +R + A KC++SLG++V+ G++Y+K NG+W  L VA+S+ S R+ T  VAN ++ GT +GSV+G++   +F+R    R L LLPW +   F++HS 
Subjt:  PMNSQRVIPALKCAVSLGISVILGLIYSKENGFWGSLAVAVSIASDREPTFKVANFKVHGTMLGSVFGILSFVLFERFLIGRLLCLLPWFLFTSFLQHST

Query:  MYGSAGGISAIVGALVVLGRTNYGSPREFAFERMIETFIGILISIVVDIIFQPKRASKLAKIQLIVSLQMLQKCIND-SFCYE--SNTIMEDYLRS---L
        +YG  GG++A + AL++LGR NYG+P EFA  R++E  IG+L  +  +I+  P RA+ LA+ ++   L  L  CI     C E  +  ++ D  +S   L
Subjt:  MYGSAGGISAIVGALVVLGRTNYGSPREFAFERMIETFIGILISIVVDIIFQPKRASKLAKIQLIVSLQMLQKCIND-SFCYE--SNTIMEDYLRS---L

Query:  RIQVIKLKKLIDEAEVEPNFLFLQPFHGNSYVKMFNSLSKMVGLLALNGEAMNNLK----------ENVREDLWRKVVENLEGDFEKFKEIMANGFVTFY
        +  V  L++   EA  EP   FL+  + +SY ++  S SK+  L     + + NL           +N+  +L R   E L    +  KEI         
Subjt:  RIQVIKLKKLIDEAEVEPNFLFLQPFHGNSYVKMFNSLSKMVGLLALNGEAMNNLK----------ENVREDLWRKVVENLEGDFEKFKEIMANGFVTFY

Query:  EDLSSSLKSSRGIE---IKDDNYDDIEMGKPQRIKILMDEIEKEKLVNSFLQHLGEVVE------IKDG-KSEEILSLSAMVFCLSSLMKE
        +      K    +E     +DNY  +E+G  Q         + E+   SF+  L E  +        D  KSE  L LS++ FC+S LM+E
Subjt:  EDLSSSLKSSRGIE---IKDDNYDDIEMGKPQRIKILMDEIEKEKLVNSFLQHLGEVVE------IKDG-KSEEILSLSAMVFCLSSLMKE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCGCCGCCTCCACCACCACCAACCACGACGGCCGAGCCATGTGGTTCACACGGCTGGCCTCCGCCTTCCAAGCGGCTCTTGCCTGCTCAGTAGTGGCCTGCACCAC
CTTGTACGGCCCGGCTACTCTCCGCCGCCTTGTGGCCTTTCCAGCCTTCTCCTATCTCACAGCTATTCTTATAGTCACCAACGCTGCCCTCGGCGACGCCGTCCGTGGCT
GTTGCCTAGCTGTCTTCGCCACCGTCCAGACCGTGTGCCCGGCCATGTTCCTGTTTTGGTTCATTGGTCCGACCAAATTCTCCCACATCACCACCGCCGTCACGGTGGCG
TTGGCTTCTGTGGTTGTGGTGGTCCAAAGCTCAACCCATGTGCTGGCTAAGAAGATTGCTTTGGGTCAGATTGTGCTCATTTACGTTGTGGGTTTCATCGGCGGCGCCCA
CACTGACCCACTCATGCACCCACTCCATGTCGCCGTCACCACCGCCTTGGGCGCCGCCGCTAGTCTCTTTGCCACGCTCTTCCCTTTCCCACGCCTTGCTTCTCTTCAGG
TGAAAAAGAAGAGCAAAATTATGGTGGAGAACATGGTAGAAAGGTTAAGTGTAATGGTGAAGGCAGTCCTTGCAGAAGATAGCACAGTGGCTGCTGCTTCCATATCTAGA
GCTAACTTCTTGTCTTCTTCAGCAACTAAACTTCTCCACTCCATAAAACTTTACCAAGGAAGTAAGCAATGGGAGAAGCTTCCATTCAAAATCTGCAAAATGGGATGGTT
GAGTAATAGTGAGAAATTAGAAGATTTAGAATTGGCCTTAAATGGAATGGAATTGGCTTTATCCAAAATTCCTTCATATCCAATTCAAAATAATCCTCAAAATTACCAAA
CCCTCAAACATGATCTAAATAATTTGGAGAGGCAAATTTCCCACTCTTTAAAGCAAGCCAATTCTTGTTTTCCACCGTCCGATTCGGTGACTTTTCCAGAAGTCAACGTA
GATGATAAAACAATAATAATCAACACCCTCAATTCCATTCAAATTATTCCCACAAACCGCCAAGATTTGCCCCATTTTTTCTTCATATTCTGCATGAAACTCCTCCTCAT
TAAAACCCAAATAAAAACTCCAACAAAGCTACAAGAAGAATCAAAGAAAAAAAAAATTGAAAATTCCATCGACAAAGAAAAAAACAGAACATGGGTTTCCCCGATGAACA
GTCAAAGGGTAATCCCAGCCTTGAAATGTGCAGTTTCATTGGGGATTTCAGTGATTTTGGGACTGATTTACAGTAAGGAAAATGGGTTTTGGGGGAGTTTGGCAGTGGCT
GTAAGCATTGCTTCAGACAGAGAACCAACATTTAAAGTTGCAAACTTTAAGGTTCATGGAACAATGTTGGGTTCTGTTTTTGGAATTTTGAGCTTTGTTCTTTTCGAAAG
GTTTTTAATTGGAAGGCTTCTTTGTCTTCTTCCTTGGTTTCTTTTCACCAGCTTTTTACAACATAGTACAATGTACGGTTCAGCTGGTGGAATTTCCGCCATTGTTGGAG
CTTTAGTAGTTTTAGGAAGAACAAACTATGGTTCCCCAAGAGAATTTGCTTTTGAGAGAATGATTGAAACTTTTATTGGGATTTTGATATCAATTGTAGTTGATATCATT
TTTCAACCAAAAAGAGCTTCTAAGTTGGCGAAAATTCAGCTCATTGTGAGTTTACAAATGCTACAAAAATGTATTAATGATTCATTTTGTTATGAATCAAACACAATAAT
GGAGGATTATTTGCGAAGTTTAAGAATTCAAGTTATCAAGCTGAAGAAGTTGATTGATGAGGCTGAGGTTGAACCCAATTTCTTGTTTTTGCAGCCATTTCATGGTAATA
GCTATGTGAAGATGTTTAATTCCTTGTCAAAAATGGTTGGTTTACTAGCTTTGAATGGTGAAGCAATGAATAATCTTAAAGAGAATGTGAGAGAGGATTTGTGGAGAAAG
GTTGTGGAGAACTTAGAGGGGGATTTTGAGAAGTTTAAGGAAATTATGGCCAATGGTTTTGTGACATTTTATGAGGATTTGAGTTCTTCATTGAAGTCTTCGAGAGGGAT
TGAAATTAAGGACGATAATTATGATGATATTGAGATGGGAAAGCCACAAAGGATTAAAATATTAATGGATGAAATTGAGAAGGAAAAGTTGGTTAATTCCTTTTTGCAGC
ACTTGGGAGAGGTTGTTGAAATTAAAGATGGTAAAAGTGAAGAAATTTTGAGTTTGAGTGCTATGGTGTTTTGCTTGAGTAGTTTGATGAAAGAGATAGAAGAGATTGGA
AAGGCAACTAGAGAACTCATTGAATGGGAGAAATCCTTTTTCTAG
mRNA sequenceShow/hide mRNA sequence
TAAACACCCTGCAATACTCACCCCGAAGCTCAACCTACTCCATTCATCTTCTCCCCATCTCTCTATAATGGCCGCCGCCTCCACCACCACCAACCACGACGGCCGAGCCA
TGTGGTTCACACGGCTGGCCTCCGCCTTCCAAGCGGCTCTTGCCTGCTCAGTAGTGGCCTGCACCACCTTGTACGGCCCGGCTACTCTCCGCCGCCTTGTGGCCTTTCCA
GCCTTCTCCTATCTCACAGCTATTCTTATAGTCACCAACGCTGCCCTCGGCGACGCCGTCCGTGGCTGTTGCCTAGCTGTCTTCGCCACCGTCCAGACCGTGTGCCCGGC
CATGTTCCTGTTTTGGTTCATTGGTCCGACCAAATTCTCCCACATCACCACCGCCGTCACGGTGGCGTTGGCTTCTGTGGTTGTGGTGGTCCAAAGCTCAACCCATGTGC
TGGCTAAGAAGATTGCTTTGGGTCAGATTGTGCTCATTTACGTTGTGGGTTTCATCGGCGGCGCCCACACTGACCCACTCATGCACCCACTCCATGTCGCCGTCACCACC
GCCTTGGGCGCCGCCGCTAGTCTCTTTGCCACGCTCTTCCCTTTCCCACGCCTTGCTTCTCTTCAGGTGAAAAAGAAGAGCAAAATTATGGTGGAGAACATGGTAGAAAG
GTTAAGTGTAATGGTGAAGGCAGTCCTTGCAGAAGATAGCACAGTGGCTGCTGCTTCCATATCTAGAGCTAACTTCTTGTCTTCTTCAGCAACTAAACTTCTCCACTCCA
TAAAACTTTACCAAGGAAGTAAGCAATGGGAGAAGCTTCCATTCAAAATCTGCAAAATGGGATGGTTGAGTAATAGTGAGAAATTAGAAGATTTAGAATTGGCCTTAAAT
GGAATGGAATTGGCTTTATCCAAAATTCCTTCATATCCAATTCAAAATAATCCTCAAAATTACCAAACCCTCAAACATGATCTAAATAATTTGGAGAGGCAAATTTCCCA
CTCTTTAAAGCAAGCCAATTCTTGTTTTCCACCGTCCGATTCGGTGACTTTTCCAGAAGTCAACGTAGATGATAAAACAATAATAATCAACACCCTCAATTCCATTCAAA
TTATTCCCACAAACCGCCAAGATTTGCCCCATTTTTTCTTCATATTCTGCATGAAACTCCTCCTCATTAAAACCCAAATAAAAACTCCAACAAAGCTACAAGAAGAATCA
AAGAAAAAAAAAATTGAAAATTCCATCGACAAAGAAAAAAACAGAACATGGGTTTCCCCGATGAACAGTCAAAGGGTAATCCCAGCCTTGAAATGTGCAGTTTCATTGGG
GATTTCAGTGATTTTGGGACTGATTTACAGTAAGGAAAATGGGTTTTGGGGGAGTTTGGCAGTGGCTGTAAGCATTGCTTCAGACAGAGAACCAACATTTAAAGTTGCAA
ACTTTAAGGTTCATGGAACAATGTTGGGTTCTGTTTTTGGAATTTTGAGCTTTGTTCTTTTCGAAAGGTTTTTAATTGGAAGGCTTCTTTGTCTTCTTCCTTGGTTTCTT
TTCACCAGCTTTTTACAACATAGTACAATGTACGGTTCAGCTGGTGGAATTTCCGCCATTGTTGGAGCTTTAGTAGTTTTAGGAAGAACAAACTATGGTTCCCCAAGAGA
ATTTGCTTTTGAGAGAATGATTGAAACTTTTATTGGGATTTTGATATCAATTGTAGTTGATATCATTTTTCAACCAAAAAGAGCTTCTAAGTTGGCGAAAATTCAGCTCA
TTGTGAGTTTACAAATGCTACAAAAATGTATTAATGATTCATTTTGTTATGAATCAAACACAATAATGGAGGATTATTTGCGAAGTTTAAGAATTCAAGTTATCAAGCTG
AAGAAGTTGATTGATGAGGCTGAGGTTGAACCCAATTTCTTGTTTTTGCAGCCATTTCATGGTAATAGCTATGTGAAGATGTTTAATTCCTTGTCAAAAATGGTTGGTTT
ACTAGCTTTGAATGGTGAAGCAATGAATAATCTTAAAGAGAATGTGAGAGAGGATTTGTGGAGAAAGGTTGTGGAGAACTTAGAGGGGGATTTTGAGAAGTTTAAGGAAA
TTATGGCCAATGGTTTTGTGACATTTTATGAGGATTTGAGTTCTTCATTGAAGTCTTCGAGAGGGATTGAAATTAAGGACGATAATTATGATGATATTGAGATGGGAAAG
CCACAAAGGATTAAAATATTAATGGATGAAATTGAGAAGGAAAAGTTGGTTAATTCCTTTTTGCAGCACTTGGGAGAGGTTGTTGAAATTAAAGATGGTAAAAGTGAAGA
AATTTTGAGTTTGAGTGCTATGGTGTTTTGCTTGAGTAGTTTGATGAAAGAGATAGAAGAGATTGGAAAGGCAACTAGAGAACTCATTGAATGGGAGAAATCCTTTTTCT
AGTTATGTAATTGATTTCAATGCAATTACTACATCTAACATATATTCATGTTGGTCAAAAGTAAAATCAAATGAATTGTGACATTTCCAAAATGTTTGTAACTATGTAAA
GACATTTAGTTACAAAAAAAGGAAAGAAAGAAAGAAAAGGTGGTTTTGTGAT
Protein sequenceShow/hide protein sequence
MAAASTTTNHDGRAMWFTRLASAFQAALACSVVACTTLYGPATLRRLVAFPAFSYLTAILIVTNAALGDAVRGCCLAVFATVQTVCPAMFLFWFIGPTKFSHITTAVTVA
LASVVVVVQSSTHVLAKKIALGQIVLIYVVGFIGGAHTDPLMHPLHVAVTTALGAAASLFATLFPFPRLASLQVKKKSKIMVENMVERLSVMVKAVLAEDSTVAAASISR
ANFLSSSATKLLHSIKLYQGSKQWEKLPFKICKMGWLSNSEKLEDLELALNGMELALSKIPSYPIQNNPQNYQTLKHDLNNLERQISHSLKQANSCFPPSDSVTFPEVNV
DDKTIIINTLNSIQIIPTNRQDLPHFFFIFCMKLLLIKTQIKTPTKLQEESKKKKIENSIDKEKNRTWVSPMNSQRVIPALKCAVSLGISVILGLIYSKENGFWGSLAVA
VSIASDREPTFKVANFKVHGTMLGSVFGILSFVLFERFLIGRLLCLLPWFLFTSFLQHSTMYGSAGGISAIVGALVVLGRTNYGSPREFAFERMIETFIGILISIVVDII
FQPKRASKLAKIQLIVSLQMLQKCINDSFCYESNTIMEDYLRSLRIQVIKLKKLIDEAEVEPNFLFLQPFHGNSYVKMFNSLSKMVGLLALNGEAMNNLKENVREDLWRK
VVENLEGDFEKFKEIMANGFVTFYEDLSSSLKSSRGIEIKDDNYDDIEMGKPQRIKILMDEIEKEKLVNSFLQHLGEVVEIKDGKSEEILSLSAMVFCLSSLMKEIEEIG
KATRELIEWEKSFF