CuGenDBv2

MC02g0841 (gene) of Bitter gourd (Dali-11) v1 genome

Gene ID: MC02g0841
Organism: Momordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Description: DsbD_2 domain-containing protein
Genome location: MC02:6663577..6668042
RNA-Seq Expression: MC02g0841
Synteny: MC02g0841
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: IPR039447 - Urease accessory protein UreH-like, transmembrane domain


Homology
GenBank top hits: e value, % identity, alignment
KAG6604942.1 hypothetical protein SDJN03_02259, partial [Cucurbita argyrosperma subsp. sororia] (e value: 3.14e-220; % identity: 86.95)
Query:  MERLLYSSSPTPLKPHL-RRPPLLPRLPRIDQLKLHFPSSTRSEFGGVNSFSSRSENLFLPSSSSRPPPASSNSLLSLTDSRDGSSPNPHLCQRIEIPPS
        ME+LLYSSSPT LKPHL  R P LP L RID  KLHFPSSTRSEF  +NSFSSR EN FLPSSS    P SS   L L DSRDGS P PH+ QRI   PS
Subjt:  MERLLYSSSPTPLKPHL-RRPPLLPRLPRIDQLKLHFPSSTRSEFGGVNSFSSRSENLFLPSSSSRPPPASSNSLLSLTDSRDGSSPNPHLCQRIEIPPS

Query:  AGRKVKTARTLMALCVAVLILIQPVFAPSAFASFNSAATTGGPAAATFGGRFFRSELLSSAWTGFFAGCLHTLSGPDHLAALAPLSIGRTRMESAAVGAL
          +K KT RT MALCVAVL L+QPVFAPSAFASF+ AATTGGPAAATFGGRFFRSELL SAWTGFFAGCLHTLSGPDHLAALAPLSIGRTRMESAAVGAL
Subjt:  AGRKVKTARTLMALCVAVLILIQPVFAPSAFASFNSAATTGGPAAATFGGRFFRSELLSSAWTGFFAGCLHTLSGPDHLAALAPLSIGRTRMESAAVGAL

Query:  WGCGHDAGQVIFGLIFLLLKDRLHIEIIRTWGTRVVGVTLLVIGALGIREASEVPTPCVALDNGECDVSIYEALDNPTGGKKKIGFATFATGIVHGLQPD
        WGCGHDAGQVIFGLIFLLLKDRLHIEIIRTWGTRVVG+TLLVIGALGIREASEVPTPCVALDNGECDVS+YE LDNPTGGKKKIGFATFATGIVHGLQPD
Subjt:  WGCGHDAGQVIFGLIFLLLKDRLHIEIIRTWGTRVVGVTLLVIGALGIREASEVPTPCVALDNGECDVSIYEALDNPTGGKKKIGFATFATGIVHGLQPD

Query:  ALMMILPALALPSRTAGAAFLVMFLAGTVVAMGSYTVFIGSCTQALKERVPRITEKLTWAASSIAIALGFAILISQYFGFSLY
        ALMMILPALALPSR AGAAFLVMFL GTVVAMGSYTVFIGSCTQALKERVPRITEKLTWAAS IAIALG AILISQYFGFSLY
Subjt:  ALMMILPALALPSRTAGAAFLVMFLAGTVVAMGSYTVFIGSCTQALKERVPRITEKLTWAASSIAIALGFAILISQYFGFSLY

XP_004147045.1 uncharacterized protein LOC101219983 [Cucumis sativus] (e value: 4.87e-221; % identity: 87.17)
Query:  MERLLYSSSPTPLKPHLRRPPLLPRLPRIDQLKLHFPSSTRSEFGGVNSFSSRSENLFLPSSSSRPPPASSNSLLSLTDSRDGSSPNPHLCQRIEIPPSA
        M+RLLYS SPTPLKPHL R P LPRLPRID  KLHFPSSTRSEF GV+SFSSR+EN  L SSS    P SSN+LLSL DSRDGS P P   Q+IE  PS 
Subjt:  MERLLYSSSPTPLKPHLRRPPLLPRLPRIDQLKLHFPSSTRSEFGGVNSFSSRSENLFLPSSSSRPPPASSNSLLSLTDSRDGSSPNPHLCQRIEIPPSA

Query:  GRKVKTARTLMALCVAVLILIQPVFAPSAFASFNSAATTGGPAAATFGGRFFRSELLSSAWTGFFAGCLHTLSGPDHLAALAPLSIGRTRMESAAVGALW
         RK K  +T MALCVA+L+LIQPVFAPSAFA   SAATTGGP+AATFGGRFFRSELLSSAWTGFFAGCLHTLSGPDHLAALAPLSIGRTRMESAAVGALW
Subjt:  GRKVKTARTLMALCVAVLILIQPVFAPSAFASFNSAATTGGPAAATFGGRFFRSELLSSAWTGFFAGCLHTLSGPDHLAALAPLSIGRTRMESAAVGALW

Query:  GCGHDAGQVIFGLIFLLLKDRLHIEIIRTWGTRVVGVTLLVIGALGIREASEVPTPCVALDNGECDVSIYEALDNPTGGKKKIGFATFATGIVHGLQPDA
        GCGHDAGQVIFGLIFLLLKDRLHIEIIRTWGTRVVG+TLLVIGALGIREA+EVPTPCVALDNGE DVSIYEAL+NP+ GKKKIGFATFATGIVHGLQPDA
Subjt:  GCGHDAGQVIFGLIFLLLKDRLHIEIIRTWGTRVVGVTLLVIGALGIREASEVPTPCVALDNGECDVSIYEALDNPTGGKKKIGFATFATGIVHGLQPDA

Query:  LMMILPALALPSRTAGAAFLVMFLAGTVVAMGSYTVFIGSCTQALKERVPRITEKLTWAASSIAIALGFAILISQYFGFSLY
        LMMILPALALPSR AGAAFLVMFL GTVVAMGSYTVFIGSCTQALKERVPRITEKLTWAASSIAIALGFAILISQYFGFSLY
Subjt:  LMMILPALALPSRTAGAAFLVMFLAGTVVAMGSYTVFIGSCTQALKERVPRITEKLTWAASSIAIALGFAILISQYFGFSLY

XP_008457702.1 PREDICTED: uncharacterized protein LOC103497337 [Cucumis melo] (e value: 9.81e-221; % identity: 86.65)
Query:  MERLLYSSSPTPLKPHLRRPPLLPRLPRIDQLKLHFPSSTRSEFGGVNSFSSRSENLFLPSSSSRPPPASSNSLLSLTDSRDGSSPNPHLCQRIEIPPSA
        M+RLLYS SPTPLKPHL R P LPRL RID  KLHFPSSTRSEF GV+SFSSR+EN  L SSS    P SSN+LLSL DSRDGS   PH  Q+I+  PS 
Subjt:  MERLLYSSSPTPLKPHLRRPPLLPRLPRIDQLKLHFPSSTRSEFGGVNSFSSRSENLFLPSSSSRPPPASSNSLLSLTDSRDGSSPNPHLCQRIEIPPSA

Query:  GRKVKTARTLMALCVAVLILIQPVFAPSAFASFNSAATTGGPAAATFGGRFFRSELLSSAWTGFFAGCLHTLSGPDHLAALAPLSIGRTRMESAAVGALW
         RK K  +T MALCVA+L+LIQPVFAPSAFA   S ATTGGP+AATFGGRFFRSELLSSAWTGFFAGCLHTLSGPDHLAALAPLSIGRTRMESAAVGALW
Subjt:  GRKVKTARTLMALCVAVLILIQPVFAPSAFASFNSAATTGGPAAATFGGRFFRSELLSSAWTGFFAGCLHTLSGPDHLAALAPLSIGRTRMESAAVGALW

Query:  GCGHDAGQVIFGLIFLLLKDRLHIEIIRTWGTRVVGVTLLVIGALGIREASEVPTPCVALDNGECDVSIYEALDNPTGGKKKIGFATFATGIVHGLQPDA
        GCGHDAGQVIFGLIFLLLKDRLHIEIIRTWGTRVVG+TLLVIGALGIREA+EVPTPCVALDNGECDVSIYEAL+NP+ GKKKIGFATFATGIVHGLQPDA
Subjt:  GCGHDAGQVIFGLIFLLLKDRLHIEIIRTWGTRVVGVTLLVIGALGIREASEVPTPCVALDNGECDVSIYEALDNPTGGKKKIGFATFATGIVHGLQPDA

Query:  LMMILPALALPSRTAGAAFLVMFLAGTVVAMGSYTVFIGSCTQALKERVPRITEKLTWAASSIAIALGFAILISQYFGFSLY
        LMMILPALALPSR AGAAFLVMFL GTVVAMGSYTVFIGSCTQALKERVPRITEKLTWAASSIAIALGFAILISQYFGFSLY
Subjt:  LMMILPALALPSRTAGAAFLVMFLAGTVVAMGSYTVFIGSCTQALKERVPRITEKLTWAASSIAIALGFAILISQYFGFSLY

XP_022146179.1 uncharacterized protein LOC111015458 [Momordica charantia] (e value: 9.48e-266; % identity: 100)
Query:  MERLLYSSSPTPLKPHLRRPPLLPRLPRIDQLKLHFPSSTRSEFGGVNSFSSRSENLFLPSSSSRPPPASSNSLLSLTDSRDGSSPNPHLCQRIEIPPSA
        MERLLYSSSPTPLKPHLRRPPLLPRLPRIDQLKLHFPSSTRSEFGGVNSFSSRSENLFLPSSSSRPPPASSNSLLSLTDSRDGSSPNPHLCQRIEIPPSA
Subjt:  MERLLYSSSPTPLKPHLRRPPLLPRLPRIDQLKLHFPSSTRSEFGGVNSFSSRSENLFLPSSSSRPPPASSNSLLSLTDSRDGSSPNPHLCQRIEIPPSA

Query:  GRKVKTARTLMALCVAVLILIQPVFAPSAFASFNSAATTGGPAAATFGGRFFRSELLSSAWTGFFAGCLHTLSGPDHLAALAPLSIGRTRMESAAVGALW
        GRKVKTARTLMALCVAVLILIQPVFAPSAFASFNSAATTGGPAAATFGGRFFRSELLSSAWTGFFAGCLHTLSGPDHLAALAPLSIGRTRMESAAVGALW
Subjt:  GRKVKTARTLMALCVAVLILIQPVFAPSAFASFNSAATTGGPAAATFGGRFFRSELLSSAWTGFFAGCLHTLSGPDHLAALAPLSIGRTRMESAAVGALW

Query:  GCGHDAGQVIFGLIFLLLKDRLHIEIIRTWGTRVVGVTLLVIGALGIREASEVPTPCVALDNGECDVSIYEALDNPTGGKKKIGFATFATGIVHGLQPDA
        GCGHDAGQVIFGLIFLLLKDRLHIEIIRTWGTRVVGVTLLVIGALGIREASEVPTPCVALDNGECDVSIYEALDNPTGGKKKIGFATFATGIVHGLQPDA
Subjt:  GCGHDAGQVIFGLIFLLLKDRLHIEIIRTWGTRVVGVTLLVIGALGIREASEVPTPCVALDNGECDVSIYEALDNPTGGKKKIGFATFATGIVHGLQPDA

Query:  LMMILPALALPSRTAGAAFLVMFLAGTVVAMGSYTVFIGSCTQALKERVPRITEKLTWAASSIAIALGFAILISQYFGFSLY
        LMMILPALALPSRTAGAAFLVMFLAGTVVAMGSYTVFIGSCTQALKERVPRITEKLTWAASSIAIALGFAILISQYFGFSLY
Subjt:  LMMILPALALPSRTAGAAFLVMFLAGTVVAMGSYTVFIGSCTQALKERVPRITEKLTWAASSIAIALGFAILISQYFGFSLY

XP_038901379.1 uncharacterized protein LOC120088271 [Benincasa hispida] (e value: 2.17e-228; % identity: 88.02)
Query:  MERLLYSSSPTPLKPHLR-RPPLLPRLPRIDQLKLHFPSSTRSEFGGVNSFSSRSENLFLPSSSSRPP-PASSNSLLSLTDSRDGSSPNPHLCQRIEIPP
        M+RLLYSSSPTPLKPHL+ R PLLPRL RID  K HFPSSTRSEF G+NSFSSR+EN  L SSS   P P SSN+LLSLT SRDGS+P PH  Q+IE+ P
Subjt:  MERLLYSSSPTPLKPHLR-RPPLLPRLPRIDQLKLHFPSSTRSEFGGVNSFSSRSENLFLPSSSSRPP-PASSNSLLSLTDSRDGSSPNPHLCQRIEIPP

Query:  SAGRKVKTARTLMALCVAVLILIQPVFAPSAFASFNSAATTGGPAAATFGGRFFRSELLSSAWTGFFAGCLHTLSGPDHLAALAPLSIGRTRMESAAVGA
        S  RK KT  TLMALC+A+L+LIQPVFAPSA ASF+SAATTGGP+AAT GGRFF+SELLSSAWTGFFAGCLHTLSGPDHLAALAPLSIGRTRMESA VGA
Subjt:  SAGRKVKTARTLMALCVAVLILIQPVFAPSAFASFNSAATTGGPAAATFGGRFFRSELLSSAWTGFFAGCLHTLSGPDHLAALAPLSIGRTRMESAAVGA

Query:  LWGCGHDAGQVIFGLIFLLLKDRLHIEIIRTWGTRVVGVTLLVIGALGIREASEVPTPCVALDNGECDVSIYEALDNPTGGKKKIGFATFATGIVHGLQP
        LWGCGHDAGQVIFGLIFLLLKDRLHIEIIRTWGTRVVG+TLLVIGALGIREASEVPTPCVALDNGECDVSIYEAL+NPTGGKKKIGFATFATGIVHGLQP
Subjt:  LWGCGHDAGQVIFGLIFLLLKDRLHIEIIRTWGTRVVGVTLLVIGALGIREASEVPTPCVALDNGECDVSIYEALDNPTGGKKKIGFATFATGIVHGLQP

Query:  DALMMILPALALPSRTAGAAFLVMFLAGTVVAMGSYTVFIGSCTQALKERVPRITEKLTWAASSIAIALGFAILISQYFGFSLY
        DALMMILPALALPSR AGAAFLVMFL GTVVAMGSYTVFIGSCTQALKERVPRITEKLTWAASSIAIALGFAILISQYFGFSLY
Subjt:  DALMMILPALALPSRTAGAAFLVMFLAGTVVAMGSYTVFIGSCTQALKERVPRITEKLTWAASSIAIALGFAILISQYFGFSLY

TrEMBL top hits: e value, % identity, alignment
A0A0A0LJG9 DsbD_2 domain-containing protein (e value: 2.36e-221; % identity: 87.17)
Query:  MERLLYSSSPTPLKPHLRRPPLLPRLPRIDQLKLHFPSSTRSEFGGVNSFSSRSENLFLPSSSSRPPPASSNSLLSLTDSRDGSSPNPHLCQRIEIPPSA
        M+RLLYS SPTPLKPHL R P LPRLPRID  KLHFPSSTRSEF GV+SFSSR+EN  L SSS    P SSN+LLSL DSRDGS P P   Q+IE  PS 
Subjt:  MERLLYSSSPTPLKPHLRRPPLLPRLPRIDQLKLHFPSSTRSEFGGVNSFSSRSENLFLPSSSSRPPPASSNSLLSLTDSRDGSSPNPHLCQRIEIPPSA

Query:  GRKVKTARTLMALCVAVLILIQPVFAPSAFASFNSAATTGGPAAATFGGRFFRSELLSSAWTGFFAGCLHTLSGPDHLAALAPLSIGRTRMESAAVGALW
         RK K  +T MALCVA+L+LIQPVFAPSAFA   SAATTGGP+AATFGGRFFRSELLSSAWTGFFAGCLHTLSGPDHLAALAPLSIGRTRMESAAVGALW
Subjt:  GRKVKTARTLMALCVAVLILIQPVFAPSAFASFNSAATTGGPAAATFGGRFFRSELLSSAWTGFFAGCLHTLSGPDHLAALAPLSIGRTRMESAAVGALW

Query:  GCGHDAGQVIFGLIFLLLKDRLHIEIIRTWGTRVVGVTLLVIGALGIREASEVPTPCVALDNGECDVSIYEALDNPTGGKKKIGFATFATGIVHGLQPDA
        GCGHDAGQVIFGLIFLLLKDRLHIEIIRTWGTRVVG+TLLVIGALGIREA+EVPTPCVALDNGE DVSIYEAL+NP+ GKKKIGFATFATGIVHGLQPDA
Subjt:  GCGHDAGQVIFGLIFLLLKDRLHIEIIRTWGTRVVGVTLLVIGALGIREASEVPTPCVALDNGECDVSIYEALDNPTGGKKKIGFATFATGIVHGLQPDA

Query:  LMMILPALALPSRTAGAAFLVMFLAGTVVAMGSYTVFIGSCTQALKERVPRITEKLTWAASSIAIALGFAILISQYFGFSLY
        LMMILPALALPSR AGAAFLVMFL GTVVAMGSYTVFIGSCTQALKERVPRITEKLTWAASSIAIALGFAILISQYFGFSLY
Subjt:  LMMILPALALPSRTAGAAFLVMFLAGTVVAMGSYTVFIGSCTQALKERVPRITEKLTWAASSIAIALGFAILISQYFGFSLY

A0A1S3C5P9 uncharacterized protein LOC103497337 (e value: 4.75e-221; % identity: 86.65)
Query:  MERLLYSSSPTPLKPHLRRPPLLPRLPRIDQLKLHFPSSTRSEFGGVNSFSSRSENLFLPSSSSRPPPASSNSLLSLTDSRDGSSPNPHLCQRIEIPPSA
        M+RLLYS SPTPLKPHL R P LPRL RID  KLHFPSSTRSEF GV+SFSSR+EN  L SSS    P SSN+LLSL DSRDGS   PH  Q+I+  PS 
Subjt:  MERLLYSSSPTPLKPHLRRPPLLPRLPRIDQLKLHFPSSTRSEFGGVNSFSSRSENLFLPSSSSRPPPASSNSLLSLTDSRDGSSPNPHLCQRIEIPPSA

Query:  GRKVKTARTLMALCVAVLILIQPVFAPSAFASFNSAATTGGPAAATFGGRFFRSELLSSAWTGFFAGCLHTLSGPDHLAALAPLSIGRTRMESAAVGALW
         RK K  +T MALCVA+L+LIQPVFAPSAFA   S ATTGGP+AATFGGRFFRSELLSSAWTGFFAGCLHTLSGPDHLAALAPLSIGRTRMESAAVGALW
Subjt:  GRKVKTARTLMALCVAVLILIQPVFAPSAFASFNSAATTGGPAAATFGGRFFRSELLSSAWTGFFAGCLHTLSGPDHLAALAPLSIGRTRMESAAVGALW

Query:  GCGHDAGQVIFGLIFLLLKDRLHIEIIRTWGTRVVGVTLLVIGALGIREASEVPTPCVALDNGECDVSIYEALDNPTGGKKKIGFATFATGIVHGLQPDA
        GCGHDAGQVIFGLIFLLLKDRLHIEIIRTWGTRVVG+TLLVIGALGIREA+EVPTPCVALDNGECDVSIYEAL+NP+ GKKKIGFATFATGIVHGLQPDA
Subjt:  GCGHDAGQVIFGLIFLLLKDRLHIEIIRTWGTRVVGVTLLVIGALGIREASEVPTPCVALDNGECDVSIYEALDNPTGGKKKIGFATFATGIVHGLQPDA

Query:  LMMILPALALPSRTAGAAFLVMFLAGTVVAMGSYTVFIGSCTQALKERVPRITEKLTWAASSIAIALGFAILISQYFGFSLY
        LMMILPALALPSR AGAAFLVMFL GTVVAMGSYTVFIGSCTQALKERVPRITEKLTWAASSIAIALGFAILISQYFGFSLY
Subjt:  LMMILPALALPSRTAGAAFLVMFLAGTVVAMGSYTVFIGSCTQALKERVPRITEKLTWAASSIAIALGFAILISQYFGFSLY

A0A5A7TQT9 NicO domain-containing protein (e value: 4.75e-221; % identity: 86.65)
Query:  MERLLYSSSPTPLKPHLRRPPLLPRLPRIDQLKLHFPSSTRSEFGGVNSFSSRSENLFLPSSSSRPPPASSNSLLSLTDSRDGSSPNPHLCQRIEIPPSA
        M+RLLYS SPTPLKPHL R P LPRL RID  KLHFPSSTRSEF GV+SFSSR+EN  L SSS    P SSN+LLSL DSRDGS   PH  Q+I+  PS 
Subjt:  MERLLYSSSPTPLKPHLRRPPLLPRLPRIDQLKLHFPSSTRSEFGGVNSFSSRSENLFLPSSSSRPPPASSNSLLSLTDSRDGSSPNPHLCQRIEIPPSA

Query:  GRKVKTARTLMALCVAVLILIQPVFAPSAFASFNSAATTGGPAAATFGGRFFRSELLSSAWTGFFAGCLHTLSGPDHLAALAPLSIGRTRMESAAVGALW
         RK K  +T MALCVA+L+LIQPVFAPSAFA   S ATTGGP+AATFGGRFFRSELLSSAWTGFFAGCLHTLSGPDHLAALAPLSIGRTRMESAAVGALW
Subjt:  GRKVKTARTLMALCVAVLILIQPVFAPSAFASFNSAATTGGPAAATFGGRFFRSELLSSAWTGFFAGCLHTLSGPDHLAALAPLSIGRTRMESAAVGALW

Query:  GCGHDAGQVIFGLIFLLLKDRLHIEIIRTWGTRVVGVTLLVIGALGIREASEVPTPCVALDNGECDVSIYEALDNPTGGKKKIGFATFATGIVHGLQPDA
        GCGHDAGQVIFGLIFLLLKDRLHIEIIRTWGTRVVG+TLLVIGALGIREA+EVPTPCVALDNGECDVSIYEAL+NP+ GKKKIGFATFATGIVHGLQPDA
Subjt:  GCGHDAGQVIFGLIFLLLKDRLHIEIIRTWGTRVVGVTLLVIGALGIREASEVPTPCVALDNGECDVSIYEALDNPTGGKKKIGFATFATGIVHGLQPDA

Query:  LMMILPALALPSRTAGAAFLVMFLAGTVVAMGSYTVFIGSCTQALKERVPRITEKLTWAASSIAIALGFAILISQYFGFSLY
        LMMILPALALPSR AGAAFLVMFL GTVVAMGSYTVFIGSCTQALKERVPRITEKLTWAASSIAIALGFAILISQYFGFSLY
Subjt:  LMMILPALALPSRTAGAAFLVMFLAGTVVAMGSYTVFIGSCTQALKERVPRITEKLTWAASSIAIALGFAILISQYFGFSLY

A0A6J1CXD6 uncharacterized protein LOC111015458 (e value: 4.59e-266; % identity: 100)
Query:  MERLLYSSSPTPLKPHLRRPPLLPRLPRIDQLKLHFPSSTRSEFGGVNSFSSRSENLFLPSSSSRPPPASSNSLLSLTDSRDGSSPNPHLCQRIEIPPSA
        MERLLYSSSPTPLKPHLRRPPLLPRLPRIDQLKLHFPSSTRSEFGGVNSFSSRSENLFLPSSSSRPPPASSNSLLSLTDSRDGSSPNPHLCQRIEIPPSA
Subjt:  MERLLYSSSPTPLKPHLRRPPLLPRLPRIDQLKLHFPSSTRSEFGGVNSFSSRSENLFLPSSSSRPPPASSNSLLSLTDSRDGSSPNPHLCQRIEIPPSA

Query:  GRKVKTARTLMALCVAVLILIQPVFAPSAFASFNSAATTGGPAAATFGGRFFRSELLSSAWTGFFAGCLHTLSGPDHLAALAPLSIGRTRMESAAVGALW
        GRKVKTARTLMALCVAVLILIQPVFAPSAFASFNSAATTGGPAAATFGGRFFRSELLSSAWTGFFAGCLHTLSGPDHLAALAPLSIGRTRMESAAVGALW
Subjt:  GRKVKTARTLMALCVAVLILIQPVFAPSAFASFNSAATTGGPAAATFGGRFFRSELLSSAWTGFFAGCLHTLSGPDHLAALAPLSIGRTRMESAAVGALW

Query:  GCGHDAGQVIFGLIFLLLKDRLHIEIIRTWGTRVVGVTLLVIGALGIREASEVPTPCVALDNGECDVSIYEALDNPTGGKKKIGFATFATGIVHGLQPDA
        GCGHDAGQVIFGLIFLLLKDRLHIEIIRTWGTRVVGVTLLVIGALGIREASEVPTPCVALDNGECDVSIYEALDNPTGGKKKIGFATFATGIVHGLQPDA
Subjt:  GCGHDAGQVIFGLIFLLLKDRLHIEIIRTWGTRVVGVTLLVIGALGIREASEVPTPCVALDNGECDVSIYEALDNPTGGKKKIGFATFATGIVHGLQPDA

Query:  LMMILPALALPSRTAGAAFLVMFLAGTVVAMGSYTVFIGSCTQALKERVPRITEKLTWAASSIAIALGFAILISQYFGFSLY
        LMMILPALALPSRTAGAAFLVMFLAGTVVAMGSYTVFIGSCTQALKERVPRITEKLTWAASSIAIALGFAILISQYFGFSLY
Subjt:  LMMILPALALPSRTAGAAFLVMFLAGTVVAMGSYTVFIGSCTQALKERVPRITEKLTWAASSIAIALGFAILISQYFGFSLY

A0A6J1I271 uncharacterized protein LOC111469828 (e value: 8.45e-220; % identity: 86.42)
Query:  MERLLYSSSPTPLKPHL-RRPPLLPRLPRIDQLKLHFPSSTRSEFGGVNSFSSRSENLFLPSSSSRPPPASSNSLLSLTDSRDGSSPNPHLCQRIEIPPS
        ME+LLYSSSPT LKPHL  R P LP L RID  KLHFPSS RSEF G+NSFSS  EN FLPSS  R P +S    L L DSRDGS+P PH+ QRI   PS
Subjt:  MERLLYSSSPTPLKPHL-RRPPLLPRLPRIDQLKLHFPSSTRSEFGGVNSFSSRSENLFLPSSSSRPPPASSNSLLSLTDSRDGSSPNPHLCQRIEIPPS

Query:  AGRKVKTARTLMALCVAVLILIQPVFAPSAFASFNSAATTGGPAAATFGGRFFRSELLSSAWTGFFAGCLHTLSGPDHLAALAPLSIGRTRMESAAVGAL
          +K KT RT MALCVAVL L+QPVFAPSAFASF+ AATTGGPAAATFGGRFFRSELL SAWTGFFAGCLHTLSGPDHLAALAPLSIGRTRMESAAVGAL
Subjt:  AGRKVKTARTLMALCVAVLILIQPVFAPSAFASFNSAATTGGPAAATFGGRFFRSELLSSAWTGFFAGCLHTLSGPDHLAALAPLSIGRTRMESAAVGAL

Query:  WGCGHDAGQVIFGLIFLLLKDRLHIEIIRTWGTRVVGVTLLVIGALGIREASEVPTPCVALDNGECDVSIYEALDNPTGGKKKIGFATFATGIVHGLQPD
        WGCGHDAGQVIFGLIFLLLKDRLHIEIIRTWGTRVVG+TLLVIGALGIREASEVPTPCVALDNGECDVS+YE LDNPTGGKKKIGFATFATGIVHGLQPD
Subjt:  WGCGHDAGQVIFGLIFLLLKDRLHIEIIRTWGTRVVGVTLLVIGALGIREASEVPTPCVALDNGECDVSIYEALDNPTGGKKKIGFATFATGIVHGLQPD

Query:  ALMMILPALALPSRTAGAAFLVMFLAGTVVAMGSYTVFIGSCTQALKERVPRITEKLTWAASSIAIALGFAILISQYFGFSLY
        ALMMILPALALPSR AGAAFLVMFL GTVVAMGSYTVFIGSCTQALKERVPRITEKLTWAAS IAIALG AILISQYFGFSLY
Subjt:  ALMMILPALALPSRTAGAAFLVMFLAGTVVAMGSYTVFIGSCTQALKERVPRITEKLTWAASSIAIALGFAILISQYFGFSLY

SwissProt top hits: e value, % identity, alignment
Q07404 Urease accessory protein UreH (e value: 1.5e-09; % identity: 27.37)
Query:  ELLSSAWTGFFAGCLHTLSGPDHLAALAPLSIGRTRM-ESAAVGALWGCGHDAGQVIFGLIFLLLKDRLHIEIIRTWGTR---VVGVTLLVIGALGIREA
        +LLS    GF  G  H +  PDH+ A++ +     ++  S+  G  WG GH +  +IFG+  +L+K ++  E    W      +VG+ L+  G   I   
Subjt:  ELLSSAWTGFFAGCLHTLSGPDHLAALAPLSIGRTRM-ESAAVGALWGCGHDAGQVIFGLIFLLLKDRLHIEIIRTWGTR---VVGVTLLVIGALGIREA

Query:  SEVPTPCVALDNGECDVSIYEALDNPTGGKKKIGFA-TFATGIVHGLQPDALMMILPALALPSRTAGAAFLVMFLAGTVVAMGSYTVFIG
         +         +      ++   D+P    K I +  +   GI+HGL   A M++L    +     G  +++ F AGTV+ M S+T  IG
Subjt:  SEVPTPCVALDNGECDVSIYEALDNPTGGKKKIGFA-TFATGIVHGLQPDALMMILPALALPSRTAGAAFLVMFLAGTVVAMGSYTVFIG

Arabidopsis top hits: e value, % identity, alignment
AT2G16800.1 high-affinity nickel-transport family protein (e value: 9.5e-116; % identity: 62.02)
Query:  MERLLY---SSSPTPLKPHLRRPPLLPRLPRIDQLKLH-FPSSTRSEFGGVNSFSSRSENLFLPSSSSRPPPASSNSLLSLTDSRDGSSPNPHLCQRIEI
        M+RLL    S S  P K   R  PLL  L R+    L  FPSS R E   ++S S        PS  + P    S++ L  +   D S PNP   QRI +
Subjt:  MERLLY---SSSPTPLKPHLRRPPLLPRLPRIDQLKLH-FPSSTRSEFGGVNSFSSRSENLFLPSSSSRPPPASSNSLLSLTDSRDGSSPNPHLCQRIEI

Query:  PPSAGRKVKTARTLMALCVAVLILIQPVFAPSAFASFNSAATTGGPAAATFGGRFFRSELLSSAWTGFFAGCLHTLSGPDHLAALAPLSIGRTRMESAAV
             RK  +A  ++ +     +L+ P+  P AFASF +A  +GG  AA  GG+  R+E+L+SAWTGFFAGCLHTLSGPDHLAALAPLSIGRTRMESAAV
Subjt:  PPSAGRKVKTARTLMALCVAVLILIQPVFAPSAFASFNSAATTGGPAAATFGGRFFRSELLSSAWTGFFAGCLHTLSGPDHLAALAPLSIGRTRMESAAV

Query:  GALWGCGHDAGQVIFGLIFLLLKDRLHIEIIRTWGTRVVGVTLLVIGALGIREASEVPTPC-VALDNGECDVSIYEALDNPTGGKKKIGFATFATGIVHG
        GALWGCGHDAGQ+IFGL+FLLLKDRLHIE+IRTWGTRVVG+TLLVIGA+GI+EASE+P PC V L+NGE D    +        KKKIGFATFATGIVHG
Subjt:  GALWGCGHDAGQVIFGLIFLLLKDRLHIEIIRTWGTRVVGVTLLVIGALGIREASEVPTPC-VALDNGECDVSIYEALDNPTGGKKKIGFATFATGIVHG

Query:  LQPDALMMILPALALPSRTAGAAFLVMFLAGTVVAMGSYTVFIGSCTQALKERVPRITEKLTWAASSIAIALGFAILISQYFGFSLY
        LQPDALMM+LPALALPSR AGA+FL+MFL GTV+AMGSYTVFIGSC++ALKE+VPRITEKLTWA+S +AI LG AI++SQ+FGFSLY
Subjt:  LQPDALMMILPALALPSRTAGAAFLVMFLAGTVVAMGSYTVFIGSCTQALKERVPRITEKLTWAASSIAIALGFAILISQYFGFSLY

AT4G35080.1 high-affinity nickel-transport family protein (e value: 1.5e-108; % identity: 57.88)
Query:  MERLLY-SSSPTPLKPHLRRPPLLPRLPRIDQLKLHFPSSTRSEFGGVNSFSSRSENLFLPSSSSRPPPASSNSLLSLTDSRDGSSPNPHLCQRIEIPPS
        MERLL  SSS + + P        P LPR+    L F S+ R E   V+S S  S    +PS  +     ++N+  + +   D S PNP    RI    S
Subjt:  MERLLY-SSSPTPLKPHLRRPPLLPRLPRIDQLKLHFPSSTRSEFGGVNSFSSRSENLFLPSSSSRPPPASSNSLLSLTDSRDGSSPNPHLCQRIEIPPS

Query:  AGRKVKTARTLMALCVAVLILIQPVFAPSAFASFNSAATTGGPAAATFGGRFFRSELLSSAWTGFFAGCLHTLSGPDHLAALAPLSIGRTRMESAAVGAL
          RK  +  T++ +    ++L+ P+ AP AFASF +AA +G                L+SAWTGF AGCLHTLSGPDHLAALAPLSIGR++MESAAVGAL
Subjt:  AGRKVKTARTLMALCVAVLILIQPVFAPSAFASFNSAATTGGPAAATFGGRFFRSELLSSAWTGFFAGCLHTLSGPDHLAALAPLSIGRTRMESAAVGAL

Query:  WGCGHDAGQVIFGLIFLLLKDRLHIEIIRTWGTRVVGVTLLVIGALGIREASEVPTPCVALDNGECDVSIY----EALDNPTGGKKKIGFATFATGIVHG
        WGCGHDAGQVIFGL+FLLLKDRLHIE+++TWGTR+VG+TL++IGA+GI+EASE+P PCVAL   E D+S+     EAL  P   KKKIGFATFATG+VHG
Subjt:  WGCGHDAGQVIFGLIFLLLKDRLHIEIIRTWGTRVVGVTLLVIGALGIREASEVPTPCVALDNGECDVSIY----EALDNPTGGKKKIGFATFATGIVHG

Query:  LQPDALMMILPALALPSRTAGAAFLVMFLAGTVVAMGSYTVFIGSCTQALKERVPRITEKLTWAASSIAIALGFAILISQYFGFSLY
        LQPDALM++LPALALPSR AG+AFL+MFL GTV+AMGSYT FIGSC++ALKE+VPRITEKLTW +S +AI LG  I+IS +FGFSLY
Subjt:  LQPDALMMILPALALPSRTAGAAFLVMFLAGTVVAMGSYTVFIGSCTQALKERVPRITEKLTWAASSIAIALGFAILISQYFGFSLY

AT4G35080.2 high-affinity nickel-transport family protein (e value: 6.2e-91; % identity: 52.2)
Query:  MERLLY-SSSPTPLKPHLRRPPLLPRLPRIDQLKLHFPSSTRSEFGGVNSFSSRSENLFLPSSSSRPPPASSNSLLSLTDSRDGSSPNPHLCQRIEIPPS
        MERLL  SSS + + P        P LPR+    L F S+ R E   V+S S  S    +PS  +     ++N+  + +   D S PNP    RI    S
Subjt:  MERLLY-SSSPTPLKPHLRRPPLLPRLPRIDQLKLHFPSSTRSEFGGVNSFSSRSENLFLPSSSSRPPPASSNSLLSLTDSRDGSSPNPHLCQRIEIPPS

Query:  AGRKVKTARTLMALCVAVLILIQPVFAPSAFASFNSAATTGGPAAATFGGRFFRSELLSSAWTGFFAGCLHTLSGPDHLAALAPLSIGRTRMESAAVGAL
          RK  +  T++ +    ++L+ P+ AP AFASF +AA +G                L+SAWTGF AGCLHTLSGPDHLAALAPLSIGR++MESAAVGAL
Subjt:  AGRKVKTARTLMALCVAVLILIQPVFAPSAFASFNSAATTGGPAAATFGGRFFRSELLSSAWTGFFAGCLHTLSGPDHLAALAPLSIGRTRMESAAVGAL

Query:  WGCGHDAGQVIFGLIFLLLKDRLHIEIIRTWGTRVVGVTLLVIGALGIREASEVPTPCVALDNGECDVSIY----EALDNPTGGKKKIGFATFATGIVHG
        WGCGHDAGQVIFGL+FLLLKDRLHIE+++TWGTR+VG+TL++IGA+GI+EASE+P PCVAL   E D+S+     EAL  P   KKKIGFATFATG+VHG
Subjt:  WGCGHDAGQVIFGLIFLLLKDRLHIEIIRTWGTRVVGVTLLVIGALGIREASEVPTPCVALDNGECDVSIY----EALDNPTGGKKKIGFATFATGIVHG

Query:  LQPDALMMILPALALPSRTAGAAFLVMFLAGTVVAMGSYTVFIGSCTQALKERVPRITEKLTWAASSIAIALGFAILISQYFGFSLY
        LQPDALM++LPALALPSR                             +ALKE+VPRITEKLTW +S +AI LG  I+IS +FGFSLY
Subjt:  LQPDALMMILPALALPSRTAGAAFLVMFLAGTVVAMGSYTVFIGSCTQALKERVPRITEKLTWAASSIAIALGFAILISQYFGFSLY

AT4G35080.3 high-affinity nickel-transport family protein (e value: 9.9e-105; % identity: 55.34)
Query:  MERLLY-SSSPTPLKPHLRRPPLLPRLPRIDQLKLHFPSSTRSEFGGVNSFSSRSENLFLPSSSSRPPPASSNSLLSLTDSRDGSSPNPHLCQRIEIPPS
        MERLL  SSS + + P        P LPR+    L F S+ R E   V+S S  S    +PS  +     ++N+  + +   D S PNP    RI    S
Subjt:  MERLLY-SSSPTPLKPHLRRPPLLPRLPRIDQLKLHFPSSTRSEFGGVNSFSSRSENLFLPSSSSRPPPASSNSLLSLTDSRDGSSPNPHLCQRIEIPPS

Query:  AGRK-------------------------VKTARTLMALCVAVLILIQPVFAPSAFASFNSAATTGGPAAATFGGRFFRSELLSSAWTGFFAGCLHTLSG
          RK                         + T   ++   VAVL L+ P+ AP AFASF +AA +G                L+SAWTGF AGCLHTLSG
Subjt:  AGRK-------------------------VKTARTLMALCVAVLILIQPVFAPSAFASFNSAATTGGPAAATFGGRFFRSELLSSAWTGFFAGCLHTLSG

Query:  PDHLAALAPLSIGRTRMESAAVGALWGCGHDAGQVIFGLIFLLLKDRLHIEIIRTWGTRVVGVTLLVIGALGIREASEVPTPCVALDNGECDVSIY----
        PDHLAALAPLSIGR++MESAAVGALWGCGHDAGQVIFGL+FLLLKDRLHIE+++TWGTR+VG+TL++IGA+GI+EASE+P PCVAL   E D+S+     
Subjt:  PDHLAALAPLSIGRTRMESAAVGALWGCGHDAGQVIFGLIFLLLKDRLHIEIIRTWGTRVVGVTLLVIGALGIREASEVPTPCVALDNGECDVSIY----

Query:  EALDNPTGGKKKIGFATFATGIVHGLQPDALMMILPALALPSRTAGAAFLVMFLAGTVVAMGSYTVFIGSCTQALKERVPRITEKLTWAASSIAIALGFA
        EAL  P   KKKIGFATFATG+VHGLQPDALM++LPALALPSR AG+AFL+MFL GTV+AMGSYT FIGSC++ALKE+VPRITEKLTW +S +AI LG  
Subjt:  EALDNPTGGKKKIGFATFATGIVHGLQPDALMMILPALALPSRTAGAAFLVMFLAGTVVAMGSYTVFIGSCTQALKERVPRITEKLTWAASSIAIALGFA

Query:  ILISQYFGFSLY
        I+IS +FGFSLY
Subjt:  ILISQYFGFSLY


Sequences
CDS sequence
ATGGAGAGGCTTCTGTATTCTTCTTCTCCGACCCCATTAAAACCCCATCTCAGGCGGCCTCCTCTTCTTCCTCGACTCCCACGAATTGACCAATTAAAACTCCATTTCCC
CTCATCAACCCGTTCTGAGTTCGGTGGGGTCAATTCGTTCTCTTCCAGAAGCGAAAATTTGTTTCTGCCGTCTTCTTCTTCTCGTCCTCCGCCTGCTTCTTCCAATTCTT
TGCTGTCTCTGACTGATTCTCGAGATGGGTCGTCGCCGAATCCTCATTTATGTCAGCGGATTGAAATTCCGCCCTCTGCTGGACGAAAGGTAAAAACTGCAAGGACACTT
ATGGCACTCTGTGTTGCTGTTCTGATCTTGATCCAACCAGTTTTTGCACCTTCAGCTTTTGCATCCTTCAACTCTGCAGCCACCACTGGGGGTCCTGCTGCTGCCACCTT
TGGGGGAAGATTTTTCCGGTCTGAACTATTGAGCAGTGCTTGGACTGGTTTCTTCGCTGGATGTTTGCACACGTTATCGGGACCGGACCATCTTGCTGCTTTGGCACCCC
TTTCCATAGGGCGTACTCGCATGGAAAGTGCTGCTGTTGGAGCTCTTTGGGGCTGTGGCCATGATGCAGGTCAGGTAATCTTCGGCCTCATATTTCTCCTATTGAAAGAC
CGGCTACATATCGAAATTATTCGCACTTGGGGTACAAGAGTGGTCGGCGTGACTCTGCTTGTCATCGGTGCATTAGGCATTAGGGAGGCTTCGGAGGTTCCTACTCCTTG
CGTCGCCTTAGACAACGGCGAGTGTGATGTTAGCATTTACGAAGCGTTGGACAATCCAACAGGAGGCAAGAAGAAGATAGGTTTTGCAACTTTCGCGACGGGTATCGTCC
ACGGGCTGCAACCAGATGCATTGATGATGATATTGCCTGCACTTGCACTTCCGTCGCGCACGGCCGGAGCTGCATTTCTTGTGATGTTCTTGGCAGGGACTGTGGTTGCA
ATGGGAAGTTATACAGTGTTCATAGGTTCTTGTACTCAGGCGCTGAAGGAGAGAGTGCCTAGGATTACTGAGAAGCTCACATGGGCTGCTTCCTCTATAGCAATCGCTCT
TGGATTTGCCATTCTCATCAGTCAGTATTTTGGGTTCAGCCTCTACTAG
mRNA sequence
AGGGTAACAAAGTGTTCATAATCATCGACTGTGACAAACAAAACAAAACAATGATAAGCCACTAAGCCTAAAGGAAGCAGCTTAGATAAACGCTTAAGCAGCAAACAATG
GCTGAAATTAATTTAATGCGATCGCCATCTCCAGTTTCTTCTAGTTTAACACCCTGAGCGACAACTTTATCTTCTCAGATCCTCTTCTTCTCCACTTTGCTGAATCTTTA
CCTCACACCCAGAAACCAAACTCCTTGCAGAAGCCATGGAGAGGCTTCTGTATTCTTCTTCTCCGACCCCATTAAAACCCCATCTCAGGCGGCCTCCTCTTCTTCCTCGA
CTCCCACGAATTGACCAATTAAAACTCCATTTCCCCTCATCAACCCGTTCTGAGTTCGGTGGGGTCAATTCGTTCTCTTCCAGAAGCGAAAATTTGTTTCTGCCGTCTTC
TTCTTCTCGTCCTCCGCCTGCTTCTTCCAATTCTTTGCTGTCTCTGACTGATTCTCGAGATGGGTCGTCGCCGAATCCTCATTTATGTCAGCGGATTGAAATTCCGCCCT
CTGCTGGACGAAAGGTAAAAACTGCAAGGACACTTATGGCACTCTGTGTTGCTGTTCTGATCTTGATCCAACCAGTTTTTGCACCTTCAGCTTTTGCATCCTTCAACTCT
GCAGCCACCACTGGGGGTCCTGCTGCTGCCACCTTTGGGGGAAGATTTTTCCGGTCTGAACTATTGAGCAGTGCTTGGACTGGTTTCTTCGCTGGATGTTTGCACACGTT
ATCGGGACCGGACCATCTTGCTGCTTTGGCACCCCTTTCCATAGGGCGTACTCGCATGGAAAGTGCTGCTGTTGGAGCTCTTTGGGGCTGTGGCCATGATGCAGGTCAGG
TAATCTTCGGCCTCATATTTCTCCTATTGAAAGACCGGCTACATATCGAAATTATTCGCACTTGGGGTACAAGAGTGGTCGGCGTGACTCTGCTTGTCATCGGTGCATTA
GGCATTAGGGAGGCTTCGGAGGTTCCTACTCCTTGCGTCGCCTTAGACAACGGCGAGTGTGATGTTAGCATTTACGAAGCGTTGGACAATCCAACAGGAGGCAAGAAGAA
GATAGGTTTTGCAACTTTCGCGACGGGTATCGTCCACGGGCTGCAACCAGATGCATTGATGATGATATTGCCTGCACTTGCACTTCCGTCGCGCACGGCCGGAGCTGCAT
TTCTTGTGATGTTCTTGGCAGGGACTGTGGTTGCAATGGGAAGTTATACAGTGTTCATAGGTTCTTGTACTCAGGCGCTGAAGGAGAGAGTGCCTAGGATTACTGAGAAG
CTCACATGGGCTGCTTCCTCTATAGCAATCGCTCTTGGATTTGCCATTCTCATCAGTCAGTATTTTGGGTTCAGCCTCTACTAGACTTCGACATTTAACGTTTTAATTGC
TTTCTTGTCATTGTTAGAAGAATTTGCTAGTAAAAGGTTTGTTTGTTATGTTTTGTGCAAAATTTTCTCTAGTTTTCCTTCTGTGAGCCGTACTATCGCCATATCATTTC
TCTAGTTACAGATCTGCAAAGTAAAGAAATAAATCTCTCAAATTCGTTTCTAACATTGTAAGAGAAGACACGATTTGTTTAATAGGATCTTCAGTAAACTCATGATGTAC
ATAATCCATCACGTGAAGTAATCCAAAATTAAAGTTATTATCAAATATATCAAGATTGGAATCACTGTCATTAACTTAAGAGTTATTATTGTATATTCTTTTAGTTCAAC
AAGGAGCGGGAGATTCAAGTCTCCGACTTTTAAGAAAGTAATGCATTACTAATTGAACTATACACATGTCAGTCCAAAAAATTACTAAAAAGGTAAAGTTTTGCTACCTT
TAACAGTTTCTATATGTTAGTAACGAGGGAACAATTTCCAAACAGCAAAAGGCAAATAGTTTGTAGATGACATAAAAAGAGAACAGCTGCATCAATACTTTTCAAAATAG
AAAAAAGTAAACTACAAAATATTTGACATGTGAGTAAGACAGGAATGCTTTACTGAAACAAATGTTACAAAGGATCTCATCATGGCTTCTGTTTCTTTTTGTTGATATCG
TGAAGCACACATACATCTGCCGCAGTCTTCATCTGAAACAAAACAAGAGTCGATATATTAGAAATGGGTGTTGCTACTACTTAGTACTTACTCGGCAACCCATATTCATT
CACACACTCATCAACTTAAAATTAGGCAAATTTCTCTGACTGAATCTATACGACAGACAATGCTGCCACCTTAAACTCTCATACTCATCAATCAAGTCCATCTAATGAAA
GGGAAAACCCATCAGAGGTATGCAATTCAAACTGAAAGAGCAGGAGGGGAGAGTGGGATTGTGGCAGGGAGCACGACAGTGGTGGAGAAAAGAGAGGCATGATAAATTAG
AACAGGGGTCGAGCAATACCGCAAATTGGCAAGAGTCTAGAATATAAGATTTCATAACAATAATGAGTTAAAAGTGGATCACCAACAGGCATCGTCCATTAGAAAGTGAG
GGATGATGACTTGTACTAACCAAAACCTTATTCACATAATTGGCATAGGAAAAATGCCTATAAAATTTTGGCGCTCAAAGACAGAACCACCGGAAAAATGCTGCAGACAT
CAACATAATAAAAAGTTTAAGTATACTTCTGTTAAATACTGCCTACGATTACGGGGCACCATGGATCACATGCTAAAACAAGATGTAGGATGTATAAATCAGGTAGATTC
ATTCTGAAATTTCTACGAATTTAGGCACTCATATTGAATCCAGTAAATTAAAAAGAGATAGAGGTCTTGCTACATGTATTACCTGTATGACATTAACAGCTTGCTTCACT
GCCCATCCGAACAAACTTATTACAATCAAAAGTGATAATAGTGTCTTTTGTTGTATCGAATGCAATATAACCTGAAAAAAG
Protein sequence
MERLLYSSSPTPLKPHLRRPPLLPRLPRIDQLKLHFPSSTRSEFGGVNSFSSRSENLFLPSSSSRPPPASSNSLLSLTDSRDGSSPNPHLCQRIEIPPSAGRKVKTARTL
MALCVAVLILIQPVFAPSAFASFNSAATTGGPAAATFGGRFFRSELLSSAWTGFFAGCLHTLSGPDHLAALAPLSIGRTRMESAAVGALWGCGHDAGQVIFGLIFLLLKD
RLHIEIIRTWGTRVVGVTLLVIGALGIREASEVPTPCVALDNGECDVSIYEALDNPTGGKKKIGFATFATGIVHGLQPDALMMILPALALPSRTAGAAFLVMFLAGTVVA
MGSYTVFIGSCTQALKERVPRITEKLTWAASSIAIALGFAILISQYFGFSLY