; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0002219 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0002219
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionPlant protein of unknown function (DUF247)
Genome locationchr4:40658347..40659681
RNA-Seq ExpressionLag0002219
SyntenyLag0002219
Gene Ontology termsNA
InterPro domainsIPR004158 - Protein of unknown function DUF247, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022148888.1 UPF0481 protein At3g47200-like [Momordica charantia]2.5e-11756.57Show/hide
Query:  ENQPLLPVPIIRIISRDDNEDKAAMSFDDVKNNLKEHLPCSVVGSEQPSIYKLPECIREVQPKA-FEPRFVSIGPYHHGQQHLVRMEREKLKAFHSFRTH
        E  P +P+ II    +D+NE    +   +  N L    P   V +E  SIYK+P  +R+VQPKA FEP+ VS GPYHHG+ HL RME EK KAF +F+T 
Subjt:  ENQPLLPVPIIRIISRDDNEDKAAMSFDDVKNNLKEHLPCSVVGSEQPSIYKLPECIREVQPKA-FEPRFVSIGPYHHGQQHLVRMEREKLKAFHSFRTH

Query:  HGLEVESIVDRVGSMLKDLQGSYDELEDEWK-EDPAGKFLQLMIVDGCFMLECFDDNYCPKSLSNMQAELARDMLLLENQLPMKLLEELYSMTNNNQKQK
        +GL VE IV+RV SML+D+QG YDELE EWK ED A KFLQLM++DGCFMLE    +     LSNM+ ++ RDMLLLENQLPMKLLEEL+SM N++   K
Subjt:  HGLEVESIVDRVGSMLKDLQGSYDELEDEWK-EDPAGKFLQLMIVDGCFMLECFDDNYCPKSLSNMQAELARDMLLLENQLPMKLLEELYSMTNNNQKQK

Query:  NVKSLVSDFMCFKNK--NKLDGKYLHILDMYRATLLQPPVDREDRSRRGKKSQQEEQSEESNPKIQIIPPARQLREAGIKFRESRTRSFKDV--DFEGGV
        NVKSLV DFM   +K  +KL  +YLHIL+MY  TLL P V  + R    +K + +E ++E + +IQII PA +L EAGI+FR S+++S  DV  D + GV
Subjt:  NVKSLVSDFMCFKNK--NKLDGKYLHILDMYRATLLQPPVDREDRSRRGKKSQQEEQSEESNPKIQIIPPARQLREAGIKFRESRTRSFKDV--DFEGGV

Query:  LKLPVMEVDDDTEPGLLNVMAFEKLHVGAGGEVTSFIIHMDNLIDTDGDVELLVSAGILVNALGDDQSAAGLFKLLAKGASMD--SNSHISQVHQKVHRH
        LKLP M VDDDTE   LNVMAFEKLH  AG +VT FI+ M+NLID D DV LL S  I++NALG+DQ AA LF  LA+GA++D  S+SHI+ V + V  H
Subjt:  LKLPVMEVDDDTEPGLLNVMAFEKLHVGAGGEVTSFIIHMDNLIDTDGDVELLVSAGILVNALGDDQSAAGLFKLLAKGASMD--SNSHISQVHQKVHRH

Query:  CERKWNQWCASLKHNYFQNPWAIISLIAALVGFLIILLQAVYQLLDYYR
        C+++ ++WCASLKHNYFQ+PWAI+SLIAA +GF+I++LQAVYQ+ DYYR
Subjt:  CERKWNQWCASLKHNYFQNPWAIISLIAALVGFLIILLQAVYQLLDYYR

XP_022960454.1 UPF0481 protein At3g47200-like [Cucurbita moschata]1.2e-10851.22Show/hide
Query:  ENQPLLPVPIIRIISRDDNEDKAAMSFDDVKNNLKEHLPCSVV-------GSEQPSIYKLPECIREVQPKAFEPRFVSIGPYHHGQQHLVRMEREKLKAF
        E  P L V    ++ RDDN    A     VK  L + L    +        SE  SIYK+P  + +  PKA+EP+ VS+GPY+HG+QHL  ME EKLK F
Subjt:  ENQPLLPVPIIRIISRDDNEDKAAMSFDDVKNNLKEHLPCSVV-------GSEQPSIYKLPECIREVQPKAFEPRFVSIGPYHHGQQHLVRMEREKLKAF

Query:  HSFRTHHGLEVESIVDRVGSMLKDLQGSYDELEDEWKEDPAGKFLQLMIVDGCFMLECFDDNYCPKSLSNMQAELARDMLLLENQLPMKLLEELYSMTNN
        HSF+    L+VESIV  V ++L +L  SYD+LE++WKEDP GKFLQLMIVDGCFML    +  CP SL N+  ++ +DMLLLENQLPM LLE+LYS+ + 
Subjt:  HSFRTHHGLEVESIVDRVGSMLKDLQGSYDELEDEWKEDPAGKFLQLMIVDGCFMLECFDDNYCPKSLSNMQAELARDMLLLENQLPMKLLEELYSMTNN

Query:  NQK--QKNVKSLVSDFMCFKNKNKLDGKYLHILDMYRATLLQPPVDREDRSRRGKKSQQEEQSEESNPKIQIIPPARQLREAGIKFRESRTRSFKDV--D
        N +  Q+++K LVS+++    +N++    LHIL+MY+ +LL PP+DR D S          +   S+P+ Q+IPPA +LREAGIKF+ S+T S  DV  D
Subjt:  NQK--QKNVKSLVSDFMCFKNKNKLDGKYLHILDMYRATLLQPPVDREDRSRRGKKSQQEEQSEESNPKIQIIPPARQLREAGIKFRESRTRSFKDV--D

Query:  FEGGVLKLPVMEVDDDTEPGLLNVMAFEKLHVGAGGEVTSFIIHMDNLIDTDGDVELLVSAGILVNALGDDQSAAGLFKLLAKGASMDSNSHISQVHQKV
         +GGVL LP + VDD+TE  LLNVMAFEKLH+ AG +VTSF+I M NLID + DV +L    +L NA+G+D+ AAGLF  L  GA+M  ++H++ VH+KV
Subjt:  FEGGVLKLPVMEVDDDTEPGLLNVMAFEKLHVGAGGEVTSFIIHMDNLIDTDGDVELLVSAGILVNALGDDQSAAGLFKLLAKGASMDSNSHISQVHQKV

Query:  HRHCERKWNQWCASLKHNYFQNPWAIISLIAALVGFLIILLQAVYQLLDYY
        + HC + WN+ CA+LKH YFQ+PW IISL AA+ GF+I++LQA+YQ LDYY
Subjt:  HRHCERKWNQWCASLKHNYFQNPWAIISLIAALVGFLIILLQAVYQLLDYY

XP_023513986.1 UPF0481 protein At3g47200-like isoform X1 [Cucurbita pepo subsp. pepo]2.5e-10951.44Show/hide
Query:  ENQPLLPVPIIRIISRDDNEDKAAMSFDDVKNNLKEHLPCSVV-------GSEQPSIYKLPECIREVQPKAFEPRFVSIGPYHHGQQHLVRMEREKLKAF
        E  P L V    ++ RDDN    A     VK  L + L    +        SE  SIYK+P  + +  PKA+EPR VS+GPY+HG+QHL  ME EKLK F
Subjt:  ENQPLLPVPIIRIISRDDNEDKAAMSFDDVKNNLKEHLPCSVV-------GSEQPSIYKLPECIREVQPKAFEPRFVSIGPYHHGQQHLVRMEREKLKAF

Query:  HSFRTHHGLEVESIVDRVGSMLKDLQGSYDELEDEWKEDPAGKFLQLMIVDGCFMLECFDDNYCPKSLSNMQAELARDMLLLENQLPMKLLEELYSMTNN
        +SF+T   L+VESIV  V ++L +L  SYD+LE+EW EDP GKFLQLMIVDGCFML    +  CP SL N+  ++ +DMLLLENQLPM LLE+LYS+   
Subjt:  HSFRTHHGLEVESIVDRVGSMLKDLQGSYDELEDEWKEDPAGKFLQLMIVDGCFMLECFDDNYCPKSLSNMQAELARDMLLLENQLPMKLLEELYSMTNN

Query:  NQK--QKNVKSLVSDFMCFKNKNKLDGKYLHILDMYRATLLQPPVDREDRSRRGKKSQQEEQSEESNPKIQIIPPARQLREAGIKFRESRTRSFKDV--D
        N +  Q++++ LVS+++    +N++    LHIL+MY+ +LL PP+DR D S          + + S+P+ Q+IPPA +LREAGIKF+ S+T S  DV  D
Subjt:  NQK--QKNVKSLVSDFMCFKNKNKLDGKYLHILDMYRATLLQPPVDREDRSRRGKKSQQEEQSEESNPKIQIIPPARQLREAGIKFRESRTRSFKDV--D

Query:  FEGGVLKLPVMEVDDDTEPGLLNVMAFEKLHVGAGGEVTSFIIHMDNLIDTDGDVELLVSAGILVNALGDDQSAAGLFKLLAKGASMDSNSHISQVHQKV
         +GGVL LP + VDDDTE  LLNVMAFEKLH+ AG  VTSF+I M NLID + DV +L    +L NA+G+D+ AAGLF  L  GA+M  ++H++ VH+KV
Subjt:  FEGGVLKLPVMEVDDDTEPGLLNVMAFEKLHVGAGGEVTSFIIHMDNLIDTDGDVELLVSAGILVNALGDDQSAAGLFKLLAKGASMDSNSHISQVHQKV

Query:  HRHCERKWNQWCASLKHNYFQNPWAIISLIAALVGFLIILLQAVYQLLDYY
        + HC + WN+ CA+LKH+YFQ+PW IISL AA+ GF+I++LQA+YQ LDYY
Subjt:  HRHCERKWNQWCASLKHNYFQNPWAIISLIAALVGFLIILLQAVYQLLDYY

XP_023513987.1 UPF0481 protein At3g47200-like isoform X2 [Cucurbita pepo subsp. pepo]3.2e-10951.56Show/hide
Query:  ENQPLLPVPIIRIISRDDNEDKAAMSFDDVKNNLKEHLPCSVV-------GSEQPSIYKLPECIREVQPKAFEPRFVSIGPYHHGQQHLVRMEREKLKAF
        E  P L V    ++ RDDN    A     VK  L + L    +        SE  SIYK+P  + +  PKA+EPR VS+GPY+HG+QHL  ME EKLK F
Subjt:  ENQPLLPVPIIRIISRDDNEDKAAMSFDDVKNNLKEHLPCSVV-------GSEQPSIYKLPECIREVQPKAFEPRFVSIGPYHHGQQHLVRMEREKLKAF

Query:  HSFRTHHGLEVESIVDRVGSMLKDLQGSYDELEDEWKEDPAGKFLQLMIVDGCFMLECFDDNYCPKSLSNMQAELARDMLLLENQLPMKLLEELYSMTNN
        +SF+T   L+VESIV  V ++L +L  SYD+LE+EW EDP GKFLQLMIVDGCFML    +  CP SL N+  ++ +DMLLLENQLPM LLE+LYS+   
Subjt:  HSFRTHHGLEVESIVDRVGSMLKDLQGSYDELEDEWKEDPAGKFLQLMIVDGCFMLECFDDNYCPKSLSNMQAELARDMLLLENQLPMKLLEELYSMTNN

Query:  N-QKQKNVKSLVSDFMCFKNKNKLDGKYLHILDMYRATLLQPPVDREDRSRRGKKSQQEEQSEESNPKIQIIPPARQLREAGIKFRESRTRSFKDV--DF
        N Q  ++++ LVS+++    +N++    LHIL+MY+ +LL PP+DR D S          + + S+P+ Q+IPPA +LREAGIKF+ S+T S  DV  D 
Subjt:  N-QKQKNVKSLVSDFMCFKNKNKLDGKYLHILDMYRATLLQPPVDREDRSRRGKKSQQEEQSEESNPKIQIIPPARQLREAGIKFRESRTRSFKDV--DF

Query:  EGGVLKLPVMEVDDDTEPGLLNVMAFEKLHVGAGGEVTSFIIHMDNLIDTDGDVELLVSAGILVNALGDDQSAAGLFKLLAKGASMDSNSHISQVHQKVH
        +GGVL LP + VDDDTE  LLNVMAFEKLH+ AG  VTSF+I M NLID + DV +L    +L NA+G+D+ AAGLF  L  GA+M  ++H++ VH+KV+
Subjt:  EGGVLKLPVMEVDDDTEPGLLNVMAFEKLHVGAGGEVTSFIIHMDNLIDTDGDVELLVSAGILVNALGDDQSAAGLFKLLAKGASMDSNSHISQVHQKVH

Query:  RHCERKWNQWCASLKHNYFQNPWAIISLIAALVGFLIILLQAVYQLLDYY
         HC + WN+ CA+LKH+YFQ+PW IISL AA+ GF+I++LQA+YQ LDYY
Subjt:  RHCERKWNQWCASLKHNYFQNPWAIISLIAALVGFLIILLQAVYQLLDYY

XP_038875622.1 UPF0481 protein At3g47200-like [Benincasa hispida]9.0e-12856.76Show/hide
Query:  ENQPLLPVPIIRIISRDDNEDKAAMSFDDVKNNLKEHLPCSVVG-------SEQPSIYKLPECIREVQPKAFEPRFVSIGPYHHGQQHLVRMEREKLKAF
        E QPLLP   IR I  D+   K       VKNNL + L  + V        S QPSIYK+PE +R++Q KAFEP+ VS+GPYHHG++HLV ME EK KAF
Subjt:  ENQPLLPVPIIRIISRDDNEDKAAMSFDDVKNNLKEHLPCSVVG-------SEQPSIYKLPECIREVQPKAFEPRFVSIGPYHHGQQHLVRMEREKLKAF

Query:  HSFRTHHGLEVESIVDRVGSMLKDLQGSYDELEDEWKEDPAGKFLQLMIVDGCFMLECFDDNYCPKSLSNMQAELARDMLLLENQLPMKLLEELY-SMTN
          F   +GL +ESIV+ + + L++L G+YD+L+++WK+D A KFL++MIVDGCFML+ F +  CP SLS M+ ++ RDMLLLENQLPM+LL+ELY +M N
Subjt:  HSFRTHHGLEVESIVDRVGSMLKDLQGSYDELEDEWKEDPAGKFLQLMIVDGCFMLECFDDNYCPKSLSNMQAELARDMLLLENQLPMKLLEELY-SMTN

Query:  NNQKQKNVKSLVSDFMCFKNKNKLDGKYLHILDMYRATLLQPPVDREDRSRRGKKSQQEEQSEESNPKIQIIPPARQLREAGIKFRESRTRSFKDVDF--
         NQ+ ++++SL+   MC  N+  + G+ LHILDMYRA+LL PPVDR+DRS + K + Q     +S+P+ QIIP A QL +AGIKF+ S T++  DV F  
Subjt:  NNQKQKNVKSLVSDFMCFKNKNKLDGKYLHILDMYRATLLQPPVDREDRSRRGKKSQQEEQSEESNPKIQIIPPARQLREAGIKFRESRTRSFKDVDF--

Query:  EGGVLKLPVMEVDDDTEPGLLNVMAFEKLHVGAGGEVTSFIIHMDNLIDTDGDVELLVSAGILVNALGDDQSAAGLFKLLAKGASMDSNSHISQVHQKVH
        + GVL+LP + VDDDTE  LLNVMAFEKL+V AG +VTSF+I M+NLID D DV LL S  IL NALG+D+SAA LF LL KGA+MD +SHI+ VH KV+
Subjt:  EGGVLKLPVMEVDDDTEPGLLNVMAFEKLHVGAGGEVTSFIIHMDNLIDTDGDVELLVSAGILVNALGDDQSAAGLFKLLAKGASMDSNSHISQVHQKVH

Query:  RHCERKWNQWCASLKHNYFQNPWAIISLIAALVGFLIILLQAVYQLLDYYR
        +HC   WNQWCASLKH+YFQNPWAIISL AA+ GF I+++QA+YQ++DY+R
Subjt:  RHCERKWNQWCASLKHNYFQNPWAIISLIAALVGFLIILLQAVYQLLDYYR

TrEMBL top hitse value%identityAlignment
A0A6J1D5C0 UPF0481 protein At3g47200-like1.2e-11756.57Show/hide
Query:  ENQPLLPVPIIRIISRDDNEDKAAMSFDDVKNNLKEHLPCSVVGSEQPSIYKLPECIREVQPKA-FEPRFVSIGPYHHGQQHLVRMEREKLKAFHSFRTH
        E  P +P+ II    +D+NE    +   +  N L    P   V +E  SIYK+P  +R+VQPKA FEP+ VS GPYHHG+ HL RME EK KAF +F+T 
Subjt:  ENQPLLPVPIIRIISRDDNEDKAAMSFDDVKNNLKEHLPCSVVGSEQPSIYKLPECIREVQPKA-FEPRFVSIGPYHHGQQHLVRMEREKLKAFHSFRTH

Query:  HGLEVESIVDRVGSMLKDLQGSYDELEDEWK-EDPAGKFLQLMIVDGCFMLECFDDNYCPKSLSNMQAELARDMLLLENQLPMKLLEELYSMTNNNQKQK
        +GL VE IV+RV SML+D+QG YDELE EWK ED A KFLQLM++DGCFMLE    +     LSNM+ ++ RDMLLLENQLPMKLLEEL+SM N++   K
Subjt:  HGLEVESIVDRVGSMLKDLQGSYDELEDEWK-EDPAGKFLQLMIVDGCFMLECFDDNYCPKSLSNMQAELARDMLLLENQLPMKLLEELYSMTNNNQKQK

Query:  NVKSLVSDFMCFKNK--NKLDGKYLHILDMYRATLLQPPVDREDRSRRGKKSQQEEQSEESNPKIQIIPPARQLREAGIKFRESRTRSFKDV--DFEGGV
        NVKSLV DFM   +K  +KL  +YLHIL+MY  TLL P V  + R    +K + +E ++E + +IQII PA +L EAGI+FR S+++S  DV  D + GV
Subjt:  NVKSLVSDFMCFKNK--NKLDGKYLHILDMYRATLLQPPVDREDRSRRGKKSQQEEQSEESNPKIQIIPPARQLREAGIKFRESRTRSFKDV--DFEGGV

Query:  LKLPVMEVDDDTEPGLLNVMAFEKLHVGAGGEVTSFIIHMDNLIDTDGDVELLVSAGILVNALGDDQSAAGLFKLLAKGASMD--SNSHISQVHQKVHRH
        LKLP M VDDDTE   LNVMAFEKLH  AG +VT FI+ M+NLID D DV LL S  I++NALG+DQ AA LF  LA+GA++D  S+SHI+ V + V  H
Subjt:  LKLPVMEVDDDTEPGLLNVMAFEKLHVGAGGEVTSFIIHMDNLIDTDGDVELLVSAGILVNALGDDQSAAGLFKLLAKGASMD--SNSHISQVHQKVHRH

Query:  CERKWNQWCASLKHNYFQNPWAIISLIAALVGFLIILLQAVYQLLDYYR
        C+++ ++WCASLKHNYFQ+PWAI+SLIAA +GF+I++LQAVYQ+ DYYR
Subjt:  CERKWNQWCASLKHNYFQNPWAIISLIAALVGFLIILLQAVYQLLDYYR

A0A6J1H6V9 UPF0481 protein At3g47200-like2.4e-10249.56Show/hide
Query:  ENQPLLPVPIIRIISRDDNEDKAAMSFDDVKNNLKEHLPCSVVGSEQPSIYKLPECIREVQP----KAFEPRFVSIGPYHHGQQHLVRMEREKLKAFHSF
        E QP L + +I    RDDN DK       V   L E  P     + + SIYKLP  +R+       KAF+P+ VS GPYHHG++HL  MER+K KAF++ 
Subjt:  ENQPLLPVPIIRIISRDDNEDKAAMSFDDVKNNLKEHLPCSVVGSEQPSIYKLPECIREVQP----KAFEPRFVSIGPYHHGQQHLVRMEREKLKAFHSF

Query:  RTHHGLEVESIVDRVGSMLKDLQGSYDELEDEWKEDPAGKFLQLMIVDGCFMLECFDDNYCPKSLSNMQAELARDMLLLENQLPMKLLEELYSMT-----
        +T  GL VE+IV  V ++L DL  SYD LE+EW +DP GKFL+LMIVDGCFML  F D  CP SL NM+ ++  + LLLENQLP+KLL +L+S+      
Subjt:  RTHHGLEVESIVDRVGSMLKDLQGSYDELEDEWKEDPAGKFLQLMIVDGCFMLECFDDNYCPKSLSNMQAELARDMLLLENQLPMKLLEELYSMT-----

Query:  NNNQKQKNVKSLVSDFMCFKNKNKLDGKYLHILDMYRATLLQPPVDREDRSRRGKKSQQEEQSEESNPKIQIIPPARQLREAGIKFRESRTRSFKDVDFE
        +    ++ ++ L+ +++    K+ LD ++LHILD+Y+A+LL PP+DR         ++  E+ E S  + Q+IPPA +L EAGIKF+ S+T+S KDV F+
Subjt:  NNNQKQKNVKSLVSDFMCFKNKNKLDGKYLHILDMYRATLLQPPVDREDRSRRGKKSQQEEQSEESNPKIQIIPPARQLREAGIKFRESRTRSFKDVDFE

Query:  --GGVLKLPVMEVDDDTEPGLLNVMAFEKLHV-GAGGE----VTSFIIHMDNLIDTDGDVELLVSAGILVNALGDDQSAAGLFKLLAKGASMDSNSHISQ
           GVL LP++ VDD+TE   LNVMAFEKLHV GA  E    +TSF+I M NLID + DV LL S G L NALG+D+ AA LF  L KG +M  N H+  
Subjt:  --GGVLKLPVMEVDDDTEPGLLNVMAFEKLHV-GAGGE----VTSFIIHMDNLIDTDGDVELLVSAGILVNALGDDQSAAGLFKLLAKGASMDSNSHISQ

Query:  VHQKVHRHCERKWNQWCASLKHNYFQNPWAIISLIAALVGFLIILLQAVYQLLDYYRQ
        VH+ ++ +C+  WN+ CA+LKH YFQNPW +ISL AA+ GFLI++LQA+YQLLDYY++
Subjt:  VHQKVHRHCERKWNQWCASLKHNYFQNPWAIISLIAALVGFLIILLQAVYQLLDYYRQ

A0A6J1HB25 UPF0481 protein At3g47200-like5.9e-10951.22Show/hide
Query:  ENQPLLPVPIIRIISRDDNEDKAAMSFDDVKNNLKEHLPCSVV-------GSEQPSIYKLPECIREVQPKAFEPRFVSIGPYHHGQQHLVRMEREKLKAF
        E  P L V    ++ RDDN    A     VK  L + L    +        SE  SIYK+P  + +  PKA+EP+ VS+GPY+HG+QHL  ME EKLK F
Subjt:  ENQPLLPVPIIRIISRDDNEDKAAMSFDDVKNNLKEHLPCSVV-------GSEQPSIYKLPECIREVQPKAFEPRFVSIGPYHHGQQHLVRMEREKLKAF

Query:  HSFRTHHGLEVESIVDRVGSMLKDLQGSYDELEDEWKEDPAGKFLQLMIVDGCFMLECFDDNYCPKSLSNMQAELARDMLLLENQLPMKLLEELYSMTNN
        HSF+    L+VESIV  V ++L +L  SYD+LE++WKEDP GKFLQLMIVDGCFML    +  CP SL N+  ++ +DMLLLENQLPM LLE+LYS+ + 
Subjt:  HSFRTHHGLEVESIVDRVGSMLKDLQGSYDELEDEWKEDPAGKFLQLMIVDGCFMLECFDDNYCPKSLSNMQAELARDMLLLENQLPMKLLEELYSMTNN

Query:  NQK--QKNVKSLVSDFMCFKNKNKLDGKYLHILDMYRATLLQPPVDREDRSRRGKKSQQEEQSEESNPKIQIIPPARQLREAGIKFRESRTRSFKDV--D
        N +  Q+++K LVS+++    +N++    LHIL+MY+ +LL PP+DR D S          +   S+P+ Q+IPPA +LREAGIKF+ S+T S  DV  D
Subjt:  NQK--QKNVKSLVSDFMCFKNKNKLDGKYLHILDMYRATLLQPPVDREDRSRRGKKSQQEEQSEESNPKIQIIPPARQLREAGIKFRESRTRSFKDV--D

Query:  FEGGVLKLPVMEVDDDTEPGLLNVMAFEKLHVGAGGEVTSFIIHMDNLIDTDGDVELLVSAGILVNALGDDQSAAGLFKLLAKGASMDSNSHISQVHQKV
         +GGVL LP + VDD+TE  LLNVMAFEKLH+ AG +VTSF+I M NLID + DV +L    +L NA+G+D+ AAGLF  L  GA+M  ++H++ VH+KV
Subjt:  FEGGVLKLPVMEVDDDTEPGLLNVMAFEKLHVGAGGEVTSFIIHMDNLIDTDGDVELLVSAGILVNALGDDQSAAGLFKLLAKGASMDSNSHISQVHQKV

Query:  HRHCERKWNQWCASLKHNYFQNPWAIISLIAALVGFLIILLQAVYQLLDYY
        + HC + WN+ CA+LKH YFQ+PW IISL AA+ GF+I++LQA+YQ LDYY
Subjt:  HRHCERKWNQWCASLKHNYFQNPWAIISLIAALVGFLIILLQAVYQLLDYY

A0A6J1KVQ6 UPF0481 protein At3g47200-like isoform X25.0e-10851.11Show/hide
Query:  ENQPLLPVPIIRIISRDDNEDKAAMSFDDVKNNLKEHLPCSVV-------GSEQPSIYKLPECIREVQPKAFEPRFVSIGPYHHGQQHLVRMEREKLKAF
        E QP L V    ++ RDDN    A     VK  L + L    +        SE  SIYK+P  + +  PKA+EP+ VS+GPY+HG+QHL  ME EKLK F
Subjt:  ENQPLLPVPIIRIISRDDNEDKAAMSFDDVKNNLKEHLPCSVV-------GSEQPSIYKLPECIREVQPKAFEPRFVSIGPYHHGQQHLVRMEREKLKAF

Query:  HSFRTHHGLEVESIVDRVGSMLKDLQGSYDELEDEWKEDPAGKFLQLMIVDGCFMLECFDDNYCPKSLSNMQAELARDMLLLENQLPMKLLEELYSMTNN
        HSF+T   L+VESIV  V ++L +L  SYD LE+EW +DP GKFLQLMIVDGCFML       CP SL N+  ++ +DMLLLENQLPM LLE+LYS+   
Subjt:  HSFRTHHGLEVESIVDRVGSMLKDLQGSYDELEDEWKEDPAGKFLQLMIVDGCFMLECFDDNYCPKSLSNMQAELARDMLLLENQLPMKLLEELYSMTNN

Query:  N-QKQKNVKSLVSDFMCFKNKNKLDGKYLHILDMYRATLLQPPVDREDRSRRGKKSQQEEQSEESNPKIQIIPPARQLREAGIKFRESRTRSFKDV--DF
        N Q  ++ K LV  ++    +N++    LHIL+MY+ +LL PP+DR D S          + + S+P+ Q+IPPA +L EAGIKF+ S+T S +DV  D 
Subjt:  N-QKQKNVKSLVSDFMCFKNKNKLDGKYLHILDMYRATLLQPPVDREDRSRRGKKSQQEEQSEESNPKIQIIPPARQLREAGIKFRESRTRSFKDV--DF

Query:  EGGVLKLPVMEVDDDTEPGLLNVMAFEKLHVGAGGEVTSFIIHMDNLIDTDGDVELLVSAGILVNALGDDQSAAGLFKLLAKGASMDSNSHISQVHQKVH
        + GVL LP + VDDDTE  +LNVMAFEKLH+ AG +VTSF+I M NLID + DV +L    IL NA+G+D+ AAGLF  L  GA+M  +SH++ VH+ V+
Subjt:  EGGVLKLPVMEVDDDTEPGLLNVMAFEKLHVGAGGEVTSFIIHMDNLIDTDGDVELLVSAGILVNALGDDQSAAGLFKLLAKGASMDSNSHISQVHQKVH

Query:  RHCERKWNQWCASLKHNYFQNPWAIISLIAALVGFLIILLQAVYQLLDYY
         HC + WN+ CA+LKH+YFQ+PW IISL AA+ GF+I++LQA+YQ LDYY
Subjt:  RHCERKWNQWCASLKHNYFQNPWAIISLIAALVGFLIILLQAVYQLLDYY

A0A6J1KYV8 UPF0481 protein At3g47200-like isoform X13.9e-10851Show/hide
Query:  ENQPLLPVPIIRIISRDDNEDKAAMSFDDVKNNLKEHLPCSVV-------GSEQPSIYKLPECIREVQPKAFEPRFVSIGPYHHGQQHLVRMEREKLKAF
        E QP L V    ++ RDDN    A     VK  L + L    +        SE  SIYK+P  + +  PKA+EP+ VS+GPY+HG+QHL  ME EKLK F
Subjt:  ENQPLLPVPIIRIISRDDNEDKAAMSFDDVKNNLKEHLPCSVV-------GSEQPSIYKLPECIREVQPKAFEPRFVSIGPYHHGQQHLVRMEREKLKAF

Query:  HSFRTHHGLEVESIVDRVGSMLKDLQGSYDELEDEWKEDPAGKFLQLMIVDGCFMLECFDDNYCPKSLSNMQAELARDMLLLENQLPMKLLEELYSMTNN
        HSF+T   L+VESIV  V ++L +L  SYD LE+EW +DP GKFLQLMIVDGCFML       CP SL N+  ++ +DMLLLENQLPM LLE+LYS+   
Subjt:  HSFRTHHGLEVESIVDRVGSMLKDLQGSYDELEDEWKEDPAGKFLQLMIVDGCFMLECFDDNYCPKSLSNMQAELARDMLLLENQLPMKLLEELYSMTNN

Query:  NQK--QKNVKSLVSDFMCFKNKNKLDGKYLHILDMYRATLLQPPVDREDRSRRGKKSQQEEQSEESNPKIQIIPPARQLREAGIKFRESRTRSFKDV--D
        N +  Q++ K LV  ++    +N++    LHIL+MY+ +LL PP+DR D S          + + S+P+ Q+IPPA +L EAGIKF+ S+T S +DV  D
Subjt:  NQK--QKNVKSLVSDFMCFKNKNKLDGKYLHILDMYRATLLQPPVDREDRSRRGKKSQQEEQSEESNPKIQIIPPARQLREAGIKFRESRTRSFKDV--D

Query:  FEGGVLKLPVMEVDDDTEPGLLNVMAFEKLHVGAGGEVTSFIIHMDNLIDTDGDVELLVSAGILVNALGDDQSAAGLFKLLAKGASMDSNSHISQVHQKV
         + GVL LP + VDDDTE  +LNVMAFEKLH+ AG +VTSF+I M NLID + DV +L    IL NA+G+D+ AAGLF  L  GA+M  +SH++ VH+ V
Subjt:  FEGGVLKLPVMEVDDDTEPGLLNVMAFEKLHVGAGGEVTSFIIHMDNLIDTDGDVELLVSAGILVNALGDDQSAAGLFKLLAKGASMDSNSHISQVHQKV

Query:  HRHCERKWNQWCASLKHNYFQNPWAIISLIAALVGFLIILLQAVYQLLDYY
        + HC + WN+ CA+LKH+YFQ+PW IISL AA+ GF+I++LQA+YQ LDYY
Subjt:  HRHCERKWNQWCASLKHNYFQNPWAIISLIAALVGFLIILLQAVYQLLDYY

SwissProt top hitse value%identityAlignment
P0C897 Putative UPF0481 protein At3g026453.8e-1227.06Show/hide
Query:  IPPARQLREAGIKFRESRTRSFKDVDFE--GGVLKLPVMEVDDDTEPGLLNVMAFEKLHVGAGGEVTSFIIHMDNLIDTDGDVELLVSAGILVNALGDDQ
        IP    L +AG++F+ +   +   V F+   G   LPV+ +D +TE  L N++A+E  +       T +   ++ +ID++ DV LL   G+LV+ L  DQ
Subjt:  IPPARQLREAGIKFRESRTRSFKDVDFE--GGVLKLPVMEVDDDTEPGLLNVMAFEKLHVGAGGEVTSFIIHMDNLIDTDGDVELLVSAGILVNALGDDQ

Query:  SAAGLFKLLAKGASMDSNSHISQVHQKVHRHCERKWNQWCASLKHNYFQNPWAIISLIAALVGFLIILLQ
         AA ++  ++K   +     + +  + V+R+   +W      L   Y    W I++ +AA++  +++ LQ
Subjt:  SAAGLFKLLAKGASMDSNSHISQVHQKVHRHCERKWNQWCASLKHNYFQNPWAIISLIAALVGFLIILLQ

Q9SD53 UPF0481 protein At3g472006.7e-3327.84Show/hide
Query:  GSEQPSIYKLPECIREVQPKAFEPRFVSIGPYHHGQQHLVRMEREK---LKAFHSFRTHHGLEVESIVDRVGSMLKDLQGSYDELEDEWKEDPAGKFLQ-
        G E   I+++PE    + PKA++P+ VSIGPYH+G++HL  +++ K   L+ F        +E   +V  V  +   ++ SY E      E   G  L  
Subjt:  GSEQPSIYKLPECIREVQPKAFEPRFVSIGPYHHGQQHLVRMEREK---LKAFHSFRTHHGLEVESIVDRVGSMLKDLQGSYDELEDEWKEDPAGKFLQ-

Query:  LMIVDGCFMLECF-----------DDNY-CPKSLSNMQAELARDMLLLENQLPMKLLEELYSMTNNNQKQKNVKSLVSDFMCFKNKNKLDGKY-------
        +M++DGCF+L  F           D  +  P  LS++Q+    D+LLLENQ+P  +L+ LY + +      ++  +   F  FKN    +G Y       
Subjt:  LMIVDGCFMLECF-----------DDNY-CPKSLSNMQAELARDMLLLENQLPMKLLEELYSMTNNNQKQKNVKSLVSDFMCFKNKNKLDGKY-------

Query:  --LHILDMYRATLLQPPVDREDRSRRGKKSQQEEQSEESN------PKIQIIPPARQLREAGIKFRESRTR--SFKDVDFEGGVLKLPVMEVDDDTEPGL
           H+LD+ R T L P     D++       Q  + +  N        + +I  A++LR  GIKFR  R++  S  +V  +   L++P +  D       
Subjt:  --LHILDMYRATLLQPPVDREDRSRRGKKSQQEEQSEESN------PKIQIIPPARQLREAGIKFRESRTR--SFKDVDFEGGVLKLPVMEVDDDTEPGL

Query:  LNVMAFEKLHVGAGGEVTSFIIHMDNLIDTDGDVELLVSAGILV-NALGDDQSAAGLFKLLAKGASMD-SNSHISQVHQKVHRHCERKWNQWCASLKHNY
        LN +AFE+ +  +  E+T++I+ M  L++ + DV  L +  +++ N  G +   +  FK ++K    +   S+++ V + V+ + ++ +N   A  +H +
Subjt:  LNVMAFEKLHVGAGGEVTSFIIHMDNLIDTDGDVELLVSAGILV-NALGDDQSAAGLFKLLAKGASMD-SNSHISQVHQKVHRHCERKWNQWCASLKHNY

Query:  FQNPWAIISLIAALVGFLIILLQAVYQLLDY
        F++PW  +S  A L   L+ +LQ+   +L Y
Subjt:  FQNPWAIISLIAALVGFLIILLQAVYQLLDY

Arabidopsis top hitse value%identityAlignment
AT3G50120.1 Plant protein of unknown function (DUF247)3.9e-5229.98Show/hide
Query:  NQPLLPVPIIRII--SRDDNEDKAAMSFDDVKNNLKEHLPCSVVGSEQPSIYKLPECIREVQPKAFEPRFVSIGPYHHGQQHLVRMEREKLKAFHSFRTH
        NQ     P I +I  S  D+ D   +S  D           ++ G  +  IY++P  ++E   K++ P+ VS+GPYHHG++ L  M+R K +A       
Subjt:  NQPLLPVPIIRII--SRDDNEDKAAMSFDDVKNNLKEHLPCSVVGSEQPSIYKLPECIREVQPKAFEPRFVSIGPYHHGQQHLVRMEREKLKAFHSFRTH

Query:  HGLEVESIVDRVGSMLKDLQGSYDELEDEWK---EDP----AGKFLQLMIVDGCFMLECFDD------------NYCPKSLSNMQAELARDMLLLENQLP
            V  ++ R    +K    +  ELE++ +   E P    + +F++++++DGCF+LE F              N    ++      + RDM++LENQLP
Subjt:  HGLEVESIVDRVGSMLKDLQGSYDELEDEWK---EDP----AGKFLQLMIVDGCFMLECFDD------------NYCPKSLSNMQAELARDMLLLENQLP

Query:  MKLLEELYSMTNNNQKQKNVKSLVS-----------DFMCFKNKNKLDGKY--------------LHILDMYRATLLQPPVDREDRSRRGKKSQQEEQSE
        + +L  L  +    + Q  + + ++           + +    ++KL+                 LH LD++R +LL+     E R  R + S+    ++
Subjt:  MKLLEELYSMTNNNQKQKNVKSLVS-----------DFMCFKNKNKLDGKY--------------LHILDMYRATLLQPPVDREDRSRRGKKSQQEEQSE

Query:  ESNPKIQIIPPARQLREAGIKFRESRTRSFKDVDFEGGVLKLPVMEVDDDTEPGLLNVMAFEKLHVGAGGEVTSFIIHMDNLIDTDGDVELLVSAGILVN
        +   + Q+I    +L+EAGIKFR  +T  F D+ F+ G L++P + + D T+   LN++AFE+ H+ +  ++TS+II MDNLID+  DV  L   GI+ +
Subjt:  ESNPKIQIIPPARQLREAGIKFRESRTRSFKDVDFEGGVLKLPVMEVDDDTEPGLLNVMAFEKLHVGAGGEVTSFIIHMDNLIDTDGDVELLVSAGILVN

Query:  ALGDDQSAAGLFKLLAKGASMDS-NSHISQVHQKVHRHCERKWNQWCASLKHNYFQNPWAIISLIAALVGFLIILLQAVYQLLDYYR
         LG D   A LF  L +    D+ +S++S++  +V+R+ + KWN W A+LKH YF NPWAI+S  AA++  ++   Q+ Y +  YY+
Subjt:  ALGDDQSAAGLFKLLAKGASMDS-NSHISQVHQKVHRHCERKWNQWCASLKHNYFQNPWAIISLIAALVGFLIILLQAVYQLLDYYR

AT3G50130.1 Plant protein of unknown function (DUF247)1.1e-5432.86Show/hide
Query:  IYKLPECIREVQPKAFEPRFVSIGPYHHGQQHLVRMEREKLKAFHSFRTHHGLEVESIVDRVGSMLKDLQGSYDELEDEWKEDPAGKFLQLMIVDGCFML
        IY++P+ ++E   K++ P+ VS+GP+HHG +HL+ M+R K +A +        ++E  +D +  +    +  Y+   D      + KF +++++DGCF+L
Subjt:  IYKLPECIREVQPKAFEPRFVSIGPYHHGQQHLVRMEREKLKAFHSFRTHHGLEVESIVDRVGSMLKDLQGSYDELEDEWKEDPAGKFLQLMIVDGCFML

Query:  ECF------------DDNYCPKSLSNMQAELARDMLLLENQLPMKLLEELYSMTNNNQKQKNVKS----------LVSDFMCFKNKNKL-----------
        E F            D N    ++      + RDM++LENQLP+ +L  L  +    + Q  + S          + +D    K  + L           
Subjt:  ECF------------DDNYCPKSLSNMQAELARDMLLLENQLPMKLLEELYSMTNNNQKQKNVKS----------LVSDFMCFKNKNKL-----------

Query:  -DGKYLHILDMYRATLLQPPVDREDRSRRGKKSQQEEQSEESNPKIQIIPPARQLREAGIKFRESRTRSFKDVDFEGGVLKLPVMEVDDDTEPGLLNVMA
         D   LH LD++R  LL+P  + E R  R + S +   +++     Q+I    +LREAGIKFR  +T  F D+ F+ G L++P + + D T+    N++A
Subjt:  -DGKYLHILDMYRATLLQPPVDREDRSRRGKKSQQEEQSEESNPKIQIIPPARQLREAGIKFRESRTRSFKDVDFEGGVLKLPVMEVDDDTEPGLLNVMA

Query:  FEKLHVGAGGEVTSFIIHMDNLIDTDGDVELLVSAGILVNALGDDQSAAGLFKLLAKGASMD-SNSHISQVHQKVHRHCERKWNQWCASLKHNYFQNPWA
        FE+ H+ +  ++TS+II MDNLID+  DV  L   GI+ + LG+D   A LF  L +  + D  NS++SQ+  KV R+  RKWN   A LKH YF NPWA
Subjt:  FEKLHVGAGGEVTSFIIHMDNLIDTDGDVELLVSAGILVNALGDDQSAAGLFKLLAKGASMD-SNSHISQVHQKVHRHCERKWNQWCASLKHNYFQNPWA

Query:  IISLIAALVGFLIILLQAVYQLLDYY
          S  AALV  ++ L Q+ +    Y+
Subjt:  IISLIAALVGFLIILLQAVYQLLDYY

AT3G50140.1 Plant protein of unknown function (DUF247)2.5e-5131.34Show/hide
Query:  IYKLPECIREVQPKAFEPRFVSIGPYHHGQQHLVRMEREKLKAFHSF--RTHHGLEVESIVDRVGSMLKDLQGSYDELEDEWKEDPAG----KFLQLMIV
        IY++P  +++    ++ P+ VS+GPYHHG +HL  M+  K +A +    RT  G+E          M  D     +E      E P G    KF Q++++
Subjt:  IYKLPECIREVQPKAFEPRFVSIGPYHHGQQHLVRMEREKLKAFHSF--RTHHGLEVESIVDRVGSMLKDLQGSYDELEDEWKEDPAG----KFLQLMIV

Query:  DGCFMLECF------------DDNYCPKSLSNMQAELARDMLLLENQLPMKLLEELYSMTNNNQKQKNV---------KSLVSDFMCF-------KNKNK
        DGCF+L+ F            D N    ++      + RDML+LENQLP+ +L  L  +    Q Q  +           L+  +M         +N NK
Subjt:  DGCFMLECF------------DDNYCPKSLSNMQAELARDMLLLENQLPMKLLEELYSMTNNNQKQKNV---------KSLVSDFMCF-------KNKNK

Query:  L-------DGKYLHILDMYRATLLQPPVDREDRSRRGKKSQQEEQSEESNPKIQIIPPARQLREAGIKFRESRTRSFKDVDFEGGVLKLPVMEVDDDTEP
                + + LH LD++R +LLQP +  + R  R + S++   +++     Q++    +LREAGIKF+  ++  F D+ F+ G L++P + + D T+ 
Subjt:  L-------DGKYLHILDMYRATLLQPPVDREDRSRRGKKSQQEEQSEESNPKIQIIPPARQLREAGIKFRESRTRSFKDVDFEGGVLKLPVMEVDDDTEP

Query:  GLLNVMAFEKLHVGAGGEVTSFIIHMDNLIDTDGDVELLVSAGILVNALGDDQSAAGLFKLLAKGASMD-SNSHISQVHQKVHRHCERKWNQWCASLKHN
           N++A+E+ H+ +  ++TS+II MDNLID+  D+  L    I+ + LG+D   A +F  L +  + D  N+++S++  KV R+  RKWN   A+LKH 
Subjt:  GLLNVMAFEKLHVGAGGEVTSFIIHMDNLIDTDGDVELLVSAGILVNALGDDQSAAGLFKLLAKGASMD-SNSHISQVHQKVHRHCERKWNQWCASLKHN

Query:  YFQNPWAIISLIAALVGFLIILLQAVYQLLDYYR
        YF NPWA  S  AA++  L+ L Q+ +    Y++
Subjt:  YFQNPWAIISLIAALVGFLIILLQAVYQLLDYYR

AT3G50160.1 Plant protein of unknown function (DUF247)3.1e-4930.49Show/hide
Query:  DDNEDKA----AMSFDDVKNNLKEHLPCSVVGSEQPSIYKLPECIREVQPKAFEPRFVSIGPYHHGQQHLVRMEREKLKAFHSFRTHHGLEVESIVDRVG
        D NE K      +S +D    L ++   +    +   IY++P  ++E   K++ P+ VSIGPYHHG +HL+ MER K +A +        ++E  +D + 
Subjt:  DDNEDKA----AMSFDDVKNNLKEHLPCSVVGSEQPSIYKLPECIREVQPKAFEPRFVSIGPYHHGQQHLVRMEREKLKAFHSFRTHHGLEVESIVDRVG

Query:  SMLKDLQGSYDELEDEWKEDPAGKFLQLMIVDGCFMLECF--------DDNYCPK----SLSNMQAELARDMLLLENQLPMKLLEELYSMTNNNQKQK-N
         + +  +  Y    +  + +    F++++++DG F++E F        +  Y P      +  +   + RDM++LENQLP  +L+ L  +   +   K N
Subjt:  SMLKDLQGSYDELEDEWKEDPAGKFLQLMIVDGCFMLECF--------DDNYCPK----SLSNMQAELARDMLLLENQLPMKLLEELYSMTNNNQKQK-N

Query:  VKSLVSDFMCFKNKNKL--DGKYLHILDMYRATLLQPP-VDREDRSRRGKKSQQEEQSEESNPKIQIIPPARQLREAGIKFRESRTRSFKDVDFEGGVLK
        V+     F       ++  +   LH LD+ R  LLQ      ED S   K+ Q            Q+I    +LR AG++F    T  F D++F+ G LK
Subjt:  VKSLVSDFMCFKNKNKL--DGKYLHILDMYRATLLQPP-VDREDRSRRGKKSQQEEQSEESNPKIQIIPPARQLREAGIKFRESRTRSFKDVDFEGGVLK

Query:  LPVMEVDDDTEPGLLNVMAFEKLHVGAGGEVTSFIIHMDNLIDTDGDVELLVSAGILVNALGDDQSAAGLFKLLAKGASMDSN-SHISQVHQKVHRHCER
        +P + + D T+   LN++AFE+ H+ +  ++TS+II MDNLI++  DV  L   GI+ N LG D   + LF  L K    D N  ++S +  +V+ +  R
Subjt:  LPVMEVDDDTEPGLLNVMAFEKLHVGAGGEVTSFIIHMDNLIDTDGDVELLVSAGILVNALGDDQSAAGLFKLLAKGASMDSN-SHISQVHQKVHRHCER

Query:  KWNQWCASLKHNYFQNPWAIISLIAALVGFLIILLQAVYQLLDYYR
        KWN   A+L+H YF NPWA  S IAA+   +    Q+ + +  Y++
Subjt:  KWNQWCASLKHNYFQNPWAIISLIAALVGFLIILLQAVYQLLDYYR

AT3G50170.1 Plant protein of unknown function (DUF247)9.5e-5130.8Show/hide
Query:  IYKLPECIREVQPKAFEPRFVSIGPYHHGQQHLVRMEREKLKAFHSFRTHHGLEVESIVDRVGSMLKDLQGSYDELEDEWK---EDP----AGKFLQLMI
        IY++P  ++E   K++ P+ VS+GPYHHG++ L  MER K +A           +  ++ R+   ++    +  ELE++ +   E P      +F ++++
Subjt:  IYKLPECIREVQPKAFEPRFVSIGPYHHGQQHLVRMEREKLKAFHSFRTHHGLEVESIVDRVGSMLKDLQGSYDELEDEWK---EDP----AGKFLQLMI

Query:  VDGCFMLECF--------DDNYCPK----SLSNMQAELARDMLLLENQLPMKLLEELYSMTNNNQKQKNVKSLVS-----------DFMCFKNKNKL---
        +DGCF+LE F        +  Y       ++  +   + RDM++LENQLP+ +L+ L  +    Q Q  + + V+           + +   +++KL   
Subjt:  VDGCFMLECF--------DDNYCPK----SLSNMQAELARDMLLLENQLPMKLLEELYSMTNNNQKQKNVKSLVS-----------DFMCFKNKNKL---

Query:  ---------DGKYLHILDMYRATLLQPPVDREDRSRRGKKSQQEEQSEESNPKIQIIPPARQLREAGIKFRESRTRSFKDVDFEGGVLKLPVMEVDDDTE
                 D   LH LD++R +LLQ       RS   + ++     ++     Q++    +LREAG+KFR+ +T  F D++F+ G L++P + + D T+
Subjt:  ---------DGKYLHILDMYRATLLQPPVDREDRSRRGKKSQQEEQSEESNPKIQIIPPARQLREAGIKFRESRTRSFKDVDFEGGVLKLPVMEVDDDTE

Query:  PGLLNVMAFEKLHVGAGGEVTSFIIHMDNLIDTDGDVELLVSAGILVNALGDDQSAAGLFKLLAKGASMD-SNSHISQVHQKVHRHCERKWNQWCASLKH
            N++AFE+ H+ +   +TS+II MDNLI++  DV  L   GI+ + LG D   A LF  L +    D  +SH+S++   V+R+  RKWN   A+L H
Subjt:  PGLLNVMAFEKLHVGAGGEVTSFIIHMDNLIDTDGDVELLVSAGILVNALGDDQSAAGLFKLLAKGASMD-SNSHISQVHQKVHRHCERKWNQWCASLKH

Query:  NYFQNPWAIISLIAALVGFLIILLQAVYQLLDYYR
         YF NPWA  S  AA++  L+ L Q+ Y +  YY+
Subjt:  NYFQNPWAIISLIAALVGFLIILLQAVYQLLDYYR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCGAGAACCAGCCACTGCTGCCTGTGCCCATCATCAGAATAATCAGTAGAGATGATAACGAGGATAAGGCCGCCATGTCGTTTGATGATGTCAAGAACAATCTCAA
AGAGCACTTGCCATGCTCGGTCGTGGGATCAGAACAACCTTCAATCTACAAGCTACCAGAGTGCATCAGAGAAGTCCAGCCCAAAGCTTTCGAGCCGCGGTTCGTGTCGA
TTGGGCCATACCACCATGGACAACAACATTTGGTTCGGATGGAACGAGAGAAGCTCAAAGCATTTCATAGTTTCAGAACCCATCACGGATTGGAGGTCGAATCCATCGTG
GACAGGGTGGGGAGCATGTTGAAGGATCTTCAAGGATCGTACGATGAGCTTGAGGATGAGTGGAAGGAAGATCCAGCTGGCAAGTTCTTGCAGCTCATGATCGTGGATGG
ATGTTTCATGCTGGAATGCTTCGACGATAACTACTGTCCCAAATCGTTGAGCAATATGCAAGCGGAATTAGCACGGGACATGCTGCTGCTTGAGAATCAGCTACCCATGA
AGCTTCTCGAGGAGTTGTATTCCATGACAAACAATAACCAAAAGCAAAAGAATGTGAAATCGCTGGTTTCTGATTTCATGTGCTTCAAAAATAAAAATAAGCTAGACGGA
AAATACTTGCACATTTTGGACATGTATAGGGCGACATTATTGCAACCTCCAGTAGACAGGGAGGACCGGAGTAGGAGAGGGAAAAAGTCGCAGCAGGAAGAACAATCCGA
GGAATCCAACCCAAAAATCCAGATAATTCCGCCGGCAAGGCAGCTTCGTGAAGCCGGGATAAAATTCAGGGAGAGCCGTACGAGGAGCTTCAAGGACGTGGATTTTGAAG
GAGGCGTGTTGAAGCTTCCGGTGATGGAAGTGGACGATGACACGGAACCAGGTTTGTTAAATGTGATGGCGTTTGAGAAACTCCACGTTGGGGCTGGCGGGGAAGTGACC
TCTTTCATCATCCACATGGATAATCTGATAGACACGGACGGAGATGTGGAGCTGCTGGTGTCTGCCGGAATATTGGTGAATGCCCTTGGAGACGATCAAAGTGCGGCAGG
TTTGTTCAAACTGCTGGCCAAAGGGGCGTCTATGGATTCCAACAGCCACATATCTCAGGTGCACCAGAAGGTGCATAGGCATTGCGAAAGGAAATGGAATCAATGGTGCG
CAAGTCTCAAACACAACTACTTTCAAAACCCATGGGCAATCATCTCCCTCATTGCGGCTTTGGTGGGTTTCCTCATCATACTTCTCCAGGCCGTCTACCAATTATTGGAT
TATTATCGACAATAA
mRNA sequenceShow/hide mRNA sequence
ATGGCCGAGAACCAGCCACTGCTGCCTGTGCCCATCATCAGAATAATCAGTAGAGATGATAACGAGGATAAGGCCGCCATGTCGTTTGATGATGTCAAGAACAATCTCAA
AGAGCACTTGCCATGCTCGGTCGTGGGATCAGAACAACCTTCAATCTACAAGCTACCAGAGTGCATCAGAGAAGTCCAGCCCAAAGCTTTCGAGCCGCGGTTCGTGTCGA
TTGGGCCATACCACCATGGACAACAACATTTGGTTCGGATGGAACGAGAGAAGCTCAAAGCATTTCATAGTTTCAGAACCCATCACGGATTGGAGGTCGAATCCATCGTG
GACAGGGTGGGGAGCATGTTGAAGGATCTTCAAGGATCGTACGATGAGCTTGAGGATGAGTGGAAGGAAGATCCAGCTGGCAAGTTCTTGCAGCTCATGATCGTGGATGG
ATGTTTCATGCTGGAATGCTTCGACGATAACTACTGTCCCAAATCGTTGAGCAATATGCAAGCGGAATTAGCACGGGACATGCTGCTGCTTGAGAATCAGCTACCCATGA
AGCTTCTCGAGGAGTTGTATTCCATGACAAACAATAACCAAAAGCAAAAGAATGTGAAATCGCTGGTTTCTGATTTCATGTGCTTCAAAAATAAAAATAAGCTAGACGGA
AAATACTTGCACATTTTGGACATGTATAGGGCGACATTATTGCAACCTCCAGTAGACAGGGAGGACCGGAGTAGGAGAGGGAAAAAGTCGCAGCAGGAAGAACAATCCGA
GGAATCCAACCCAAAAATCCAGATAATTCCGCCGGCAAGGCAGCTTCGTGAAGCCGGGATAAAATTCAGGGAGAGCCGTACGAGGAGCTTCAAGGACGTGGATTTTGAAG
GAGGCGTGTTGAAGCTTCCGGTGATGGAAGTGGACGATGACACGGAACCAGGTTTGTTAAATGTGATGGCGTTTGAGAAACTCCACGTTGGGGCTGGCGGGGAAGTGACC
TCTTTCATCATCCACATGGATAATCTGATAGACACGGACGGAGATGTGGAGCTGCTGGTGTCTGCCGGAATATTGGTGAATGCCCTTGGAGACGATCAAAGTGCGGCAGG
TTTGTTCAAACTGCTGGCCAAAGGGGCGTCTATGGATTCCAACAGCCACATATCTCAGGTGCACCAGAAGGTGCATAGGCATTGCGAAAGGAAATGGAATCAATGGTGCG
CAAGTCTCAAACACAACTACTTTCAAAACCCATGGGCAATCATCTCCCTCATTGCGGCTTTGGTGGGTTTCCTCATCATACTTCTCCAGGCCGTCTACCAATTATTGGAT
TATTATCGACAATAA
Protein sequenceShow/hide protein sequence
MAENQPLLPVPIIRIISRDDNEDKAAMSFDDVKNNLKEHLPCSVVGSEQPSIYKLPECIREVQPKAFEPRFVSIGPYHHGQQHLVRMEREKLKAFHSFRTHHGLEVESIV
DRVGSMLKDLQGSYDELEDEWKEDPAGKFLQLMIVDGCFMLECFDDNYCPKSLSNMQAELARDMLLLENQLPMKLLEELYSMTNNNQKQKNVKSLVSDFMCFKNKNKLDG
KYLHILDMYRATLLQPPVDREDRSRRGKKSQQEEQSEESNPKIQIIPPARQLREAGIKFRESRTRSFKDVDFEGGVLKLPVMEVDDDTEPGLLNVMAFEKLHVGAGGEVT
SFIIHMDNLIDTDGDVELLVSAGILVNALGDDQSAAGLFKLLAKGASMDSNSHISQVHQKVHRHCERKWNQWCASLKHNYFQNPWAIISLIAALVGFLIILLQAVYQLLD
YYRQ