; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi02G021280 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi02G021280
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionPlant protein of unknown function (DUF247)
Genome locationchr02:27514542..27519301
RNA-Seq ExpressionLsi02G021280
SyntenyLsi02G021280
Gene Ontology termsNA
InterPro domainsIPR004158 - Protein of unknown function DUF247, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004149334.2 UPF0481 protein At3g47200 [Cucumis sativus]3.0e-10751.1Show/hide
Query:  EMDKGEGNDVVIS-----MEVPTKSPKSFLCKVPKMFRKMNNETAYTPQFISIGPFHHGQPDLMDAEQYKDGALRKYLSRKRMKFEEAMSLAYSWDEKAR
        + ++   +DVV+S     M++P  +PK F+ +VP++ RKM N+ AYTPQ ISIGP HHGQ     AEQYK  AL++YLSR  M   EAM +A  W+E AR
Subjt:  EMDKGEGNDVVIS-----MEVPTKSPKSFLCKVPKMFRKMNNETAYTPQFISIGPFHHGQPDLMDAEQYKDGALRKYLSRKRMKFEEAMSLAYSWDEKAR

Query:  SCYGYHIDMNKHDFAKMMLLDGCFILEFTMKSTTNYRWSLFSQSE---------------VERDLTMFENQLPFFVLQQLYHKSSNPQN-KNEEDPRKQL
        +CY   IDMN+ +F KMMLLD CFILEF   +    R+  + Q+E               ++RDL+MFENQLPFF LQ+L++ S N QN  N   P    
Subjt:  SCYGYHIDMNKHDFAKMMLLDGCFILEFTMKSTTNYRWSLFSQSE---------------VERDLTMFENQLPFFVLQQLYHKSSNPQN-KNEEDPRKQL

Query:  YGDKTLMEWMEMLFTATTATYMLPSDV--SPTKVRHLVDLLTFYYNPSPETDEYYKFIENEEGYSFPYKAPTITKLCEAGIKVRRAEEAQSMMDFSFKNG
              +  M  LF ATT  Y L  +V     KVRHLVDLL FYY PSP+T+EY  +   +  Y      PTI++LCEAG++++RA++A+S++DF+FK+G
Subjt:  YGDKTLMEWMEMLFTATTATYMLPSDV--SPTKVRHLVDLLTFYYNPSPETDEYYKFIENEEGYSFPYKAPTITKLCEAGIKVRRAEEAQSMMDFSFKNG

Query:  VLKIPPFTIHTNFEIQFRNLIVSEILSDDVKGEKYLFHYLLLLDDLIKTENDVSILVKEEIIINETDDSDEQVSKIIHSLHLMAPTIRLKSYFNYMSGDL
        VLKIPPF IH  FEIQ RNLIVSEI   D    K++F Y+ LLDDLI TE DVSILVKE+I+ NE   SDE V K+I+ L L APT  L  Y++ MS DL
Subjt:  VLKIPPFTIHTNFEIQFRNLIVSEILSDDVKGEKYLFHYLLLLDDLIKTENDVSILVKEEIIINETDDSDEQVSKIIHSLHLMAPTIRLKSYFNYMSGDL

Query:  NDYCNRPMASIHRSMALLRRDYFNSPWASISCLAAIFLLILDCLQTLFSAPSFF
        N++C +     +RS+A LRRDYFN+PWASIS +AA  LLIL  LQTLFSAP+FF
Subjt:  NDYCNRPMASIHRSMALLRRDYFNSPWASISCLAAIFLLILDCLQTLFSAPSFF

XP_022131634.1 UPF0481 protein At3g47200-like [Momordica charantia]2.6e-7139.25Show/hide
Query:  NELEMDKGEGNDVVISME-------VPTKSPKSFLCKVPKMFRKMNNETAYTPQFISIGPFHH-GQPDLMDAEQYKDGALRKYLSRKRMKFEEAMSLAYS
        N  E+D+     VVIS+E        P   P+  + +VPK    MN + AYTPQ ISIGPFHH  Q +L+  +Q+K  AL  YL R +M  E  + +  +
Subjt:  NELEMDKGEGNDVVISME-------VPTKSPKSFLCKVPKMFRKMNNETAYTPQFISIGPFHH-GQPDLMDAEQYKDGALRKYLSRKRMKFEEAMSLAYS

Query:  WDEKARSCYGYHIDMNKHDFAKMMLLDGCFILEFTMKSTTNYRW------SLFSQ---SEVERDLTMFENQLPFFVLQQLYHKSSNPQNKNEEDPRKQLY
        W+ +ARSCYG  I MN   F  M+LLDGCF++ F +    NY        S F +   S++  D+TM ENQLPFFVLQ LY          +  P+    
Subjt:  WDEKARSCYGYHIDMNKHDFAKMMLLDGCFILEFTMKSTTNYRW------SLFSQ---SEVERDLTMFENQLPFFVLQQLYHKSSNPQNKNEEDPRKQLY

Query:  GDKTLMEWMEMLFTATTATYMLPSDVSPTKVRHLVDLLTFYYNPSPETDEYYKFIENEEGYSFPYKAPTITKLCEAGIKVRRAEEAQSMMDFSFKNGVLK
         + +L++ +E  F+ +   + +P  VSP  V+HLVDLL+ Y+ P  +T + +    +++ Y      P +T+LCEAG+ +++ +EA  +MD SFKNGVL+
Subjt:  GDKTLMEWMEMLFTATTATYMLPSDVSPTKVRHLVDLLTFYYNPSPETDEYYKFIENEEGYSFPYKAPTITKLCEAGIKVRRAEEAQSMMDFSFKNGVLK

Query:  IPPFTIHTNFEIQFRNLIVSEILSDDVKGEKYLFHYLLLLDDLIKTENDVSILVKEEIIINETDDSDEQVSKIIHSLHLMAPTIRLKSYFNYMSGDLNDY
        IPP  I  +FE   RNL+  E         +Y   Y L LD +I TE DV +LV+  IIIN    SD++VS++ + L           Y N+++  L+D+
Subjt:  IPPFTIHTNFEIQFRNLIVSEILSDDVKGEKYLFHYLLLLDDLIKTENDVSILVKEEIIINETDDSDEQVSKIIHSLHLMAPTIRLKSYFNYMSGDLNDY

Query:  CNRPMASIHRSMALLRRDYFNSPWASISCLAAIFLLILDCLQTLFSAPSFF
        C +      RS A L+RDYFNSPWA IS +AA +++IL  LQT+F+A S F
Subjt:  CNRPMASIHRSMALLRRDYFNSPWASISCLAAIFLLILDCLQTLFSAPSFF

XP_022132118.1 UPF0481 protein At3g47200-like [Momordica charantia]3.5e-7142.3Show/hide
Query:  VISMEVPTKSPKSFLCKVPKMFRKMNNETAYTPQFISIGPFHHGQPDLMDAEQYKDGALRKYLSRKRMKFEEAMSLAYSWDEKARSCYGYHIDMNKHDFA
        ++  + P  + +  + +VPK    M + TAYTP+ I+IGPFHHG+ DL+  +Q K    R YL R     +  +++A  W+ KAR  Y   I+M   DF 
Subjt:  VISMEVPTKSPKSFLCKVPKMFRKMNNETAYTPQFISIGPFHHGQPDLMDAEQYKDGALRKYLSRKRMKFEEAMSLAYSWDEKARSCYGYHIDMNKHDFA

Query:  KMMLLDGCFILE---------FTMKSTTNYRWSLFSQSEVERDLTMFENQLPFFVLQQLYHKSSNPQNKNEED-PRKQLYGDKTLMEWMEMLFTATTATY
        +MML+D CFI+E         F  +   +  ++     ++ ++LTM ENQLPFFVLQ L+     P +KN+ +    QL G        + L       Y
Subjt:  KMMLLDGCFILE---------FTMKSTTNYRWSLFSQSEVERDLTMFENQLPFFVLQQLYHKSSNPQNKNEED-PRKQLYGDKTLMEWMEMLFTATTATY

Query:  MLPSDVSPT-KVRHLVDLLTFYYNPSPETDEY--YKFIENEEGYSFPYKAPTITKLCEAGIKVRRAEEAQSMMDFSFKNGVLKIPPFTIHTNFEIQFRNL
         LP  VS T +V HLVDLL FYY PSP+T+EY   K I+N   +  P   PTITKLCEAG+KV++A  A+S++D SFK GVL+IPPF IH +FEI  RNL
Subjt:  MLPSDVSPT-KVRHLVDLLTFYYNPSPETDEY--YKFIENEEGYSFPYKAPTITKLCEAGIKVRRAEEAQSMMDFSFKNGVLKIPPFTIHTNFEIQFRNL

Query:  IVSE-ILSDDVKGEKYLFHYLLLLDDLIKTENDVSILVKEEIIINETDDSDEQVSKIIHSLHLMAPTIRLKSYFNYMSGDLNDYCNRPMASIHRSMALLR
        +  E     +   ++Y+ HY+  LD LI T  DV++LVKE IIIN    S+++VS++ ++L    P I  + YF   S +L+D+C +      RS A LR
Subjt:  IVSE-ILSDDVKGEKYLFHYLLLLDDLIKTENDVSILVKEEIIINETDDSDEQVSKIIHSLHLMAPTIRLKSYFNYMSGDLNDYCNRPMASIHRSMALLR

Query:  RDYFNSPWASISCLAAIFLLILDCLQTLFSAPSFF
        RDYF+SPWA IS  AA FL++L  LQT+F+A S F
Subjt:  RDYFNSPWASISCLAAIFLLILDCLQTLFSAPSFF

XP_038896405.1 UPF0481 protein At3g47200-like [Benincasa hispida]1.6e-14564.41Show/hide
Query:  MDKGEGNDVVIS-----MEVPTKSPKSFLCKVPKMFRKMNNETAYTPQFISIGPFHHGQPDLMDAEQYKDGALRKYLSRKRMKFEEAMSLAYSWDEKARS
        MDKGEG+ V IS     ME+P  +P+SFL KVP++ RKM NET YTPQFISIGPFHHG+P+LM  EQYK  AL+ YL+R  +K EEAMS+ +SW+EKARS
Subjt:  MDKGEGNDVVIS-----MEVPTKSPKSFLCKVPKMFRKMNNETAYTPQFISIGPFHHGQPDLMDAEQYKDGALRKYLSRKRMKFEEAMSLAYSWDEKARS

Query:  CYGYHIDMNKHDFAKMMLLDGCFILEFTMKSTTNYRWSLFSQ----------------SEVERDLTMFENQLPFFVLQQLYHKSSNPQNKNEEDPRKQLY
        CYG+ IDMNK DFAKMMLLDGCFILEF   ++TNY  SL S+                 +VERDL +FENQLPFFVLQQLY  S +   K+++       
Subjt:  CYGYHIDMNKHDFAKMMLLDGCFILEFTMKSTTNYRWSLFSQ----------------SEVERDLTMFENQLPFFVLQQLYHKSSNPQNKNEEDPRKQLY

Query:  GDKTLMEWMEMLFTATTATYMLPSDVSPTKVRHLVDLLTFYYNPSPETDEYYKFIENEEGYSFPYKAPTITKLCEAGIKVRRAEEAQSMMDFSFKNGVLK
             MEWM  LF +TT+ Y LP     TKVRH+VDLL FYY PSPETDEY KF +NE  YS  Y++PTITKLCE GIKV+RA EA SMMDFSFKNGVLK
Subjt:  GDKTLMEWMEMLFTATTATYMLPSDVSPTKVRHLVDLLTFYYNPSPETDEYYKFIENEEGYSFPYKAPTITKLCEAGIKVRRAEEAQSMMDFSFKNGVLK

Query:  IPPFTIHTNFEIQFRNLIVSEILSDDVKGEKYLFHYLLLLDDLIKTENDVSILVKEEIIINETDDSDEQVSKIIHSLHLMAPTIRLKSYFNYMSGDLNDY
        IPPF IHTNFEIQ RNLIVSEI  + V+G+KY+FHYLLLLDDLIKTE DVSIL+KE+IIINETD S EQV K+IH+LHL+AP+ R +SYFN MS DL++Y
Subjt:  IPPFTIHTNFEIQFRNLIVSEILSDDVKGEKYLFHYLLLLDDLIKTENDVSILVKEEIIINETDDSDEQVSKIIHSLHLMAPTIRLKSYFNYMSGDLNDY

Query:  C----NRPMASIHRSMALLRRDYFNSPWASISCLAAIFLLILDCLQTLFSAPSFFLKK
        C     R MAS+ RSMA L+RDYF++PWA IS LAA FLLIL  LQTLFSAPS F KK
Subjt:  C----NRPMASIHRSMALLRRDYFNSPWASISCLAAIFLLILDCLQTLFSAPSFFLKK

XP_038899472.1 UPF0481 protein At3g47200-like [Benincasa hispida]3.9e-12358.86Show/hide
Query:  MGENELEMDKGEGNDVVIS-----MEVPTKSPKSFLCKVPKMFRKMNNETAYTPQFISIGPFHHGQPDLMDAEQYKDGALRKYLSRKRMKFEEAMSLAYS
        M  N    D  EG+DVVIS     ME+P+ +P S L KVPK   K  NET YTP+FISIGPFHHGQP+LM AEQYK  AL+ YL R +M  +EAMSLA S
Subjt:  MGENELEMDKGEGNDVVIS-----MEVPTKSPKSFLCKVPKMFRKMNNETAYTPQFISIGPFHHGQPDLMDAEQYKDGALRKYLSRKRMKFEEAMSLAYS

Query:  WDEKARSCYGYHIDMNKHDFAKMMLLDGCFILEFTMKSTTNY-----RW----SLFSQ--SEVERDLTMFENQLPFFVLQQLYHKSSNPQNKNEEDPRKQ
        W+EKARSCYG+ IDMNK +FAKMMLLDGCFILEF   +   Y     +W    SL  +   +VERDLT++ENQLPFFVLQQLY     P     +DP   
Subjt:  WDEKARSCYGYHIDMNKHDFAKMMLLDGCFILEFTMKSTTNY-----RW----SLFSQ--SEVERDLTMFENQLPFFVLQQLYHKSSNPQNKNEEDPRKQ

Query:  LYGDKTLMEWMEMLFTATTATYMLPSDVSPTKVRHLVDLLTFYYNPSPETDEYYKFIENEEGYSFPYKAPTITKLCEAGIKVRRAEEAQSMMDFSFKNGV
               ME M  LF ATT  Y    + + T+VRHLVDLL FY+ PSP+TDEY KF E+EE      ++P +TKLCEAG+KV++A+EA+SMMDFSFKNG+
Subjt:  LYGDKTLMEWMEMLFTATTATYMLPSDVSPTKVRHLVDLLTFYYNPSPETDEYYKFIENEEGYSFPYKAPTITKLCEAGIKVRRAEEAQSMMDFSFKNGV

Query:  LKIPPFTIHTNFEIQFRNLIVSEILSDDVKGEKYLFHYLLLLDDLIKTENDVSILVKEEIIINETDDSDEQVSKIIHSLHLMAPTIRLKS-YFNYMSGDL
        LKIPP  IH  FEI  RNLIVSEI+S+D   E YLFHY+ LLDDLI TE DVSILVKE+II N +  SD+QV+++I++L L APT   K  Y+N+MS +L
Subjt:  LKIPPFTIHTNFEIQFRNLIVSEILSDDVKGEKYLFHYLLLLDDLIKTENDVSILVKEEIIINETDDSDEQVSKIIHSLHLMAPTIRLKS-YFNYMSGDL

Query:  NDYCNRPMASIHRSMALLRRDYFNSPWASISCLAAIFLLILDCLQTLFSAPSFFLKK
        NDYCN+     HRSMA LRRDYFN+PWASIS  AA FLLIL  LQTLFS P  F  K
Subjt:  NDYCNRPMASIHRSMALLRRDYFNSPWASISCLAAIFLLILDCLQTLFSAPSFFLKK

TrEMBL top hitse value%identityAlignment
A0A0A0K6U9 Uncharacterized protein1.4e-10751.1Show/hide
Query:  EMDKGEGNDVVIS-----MEVPTKSPKSFLCKVPKMFRKMNNETAYTPQFISIGPFHHGQPDLMDAEQYKDGALRKYLSRKRMKFEEAMSLAYSWDEKAR
        + ++   +DVV+S     M++P  +PK F+ +VP++ RKM N+ AYTPQ ISIGP HHGQ     AEQYK  AL++YLSR  M   EAM +A  W+E AR
Subjt:  EMDKGEGNDVVIS-----MEVPTKSPKSFLCKVPKMFRKMNNETAYTPQFISIGPFHHGQPDLMDAEQYKDGALRKYLSRKRMKFEEAMSLAYSWDEKAR

Query:  SCYGYHIDMNKHDFAKMMLLDGCFILEFTMKSTTNYRWSLFSQSE---------------VERDLTMFENQLPFFVLQQLYHKSSNPQN-KNEEDPRKQL
        +CY   IDMN+ +F KMMLLD CFILEF   +    R+  + Q+E               ++RDL+MFENQLPFF LQ+L++ S N QN  N   P    
Subjt:  SCYGYHIDMNKHDFAKMMLLDGCFILEFTMKSTTNYRWSLFSQSE---------------VERDLTMFENQLPFFVLQQLYHKSSNPQN-KNEEDPRKQL

Query:  YGDKTLMEWMEMLFTATTATYMLPSDV--SPTKVRHLVDLLTFYYNPSPETDEYYKFIENEEGYSFPYKAPTITKLCEAGIKVRRAEEAQSMMDFSFKNG
              +  M  LF ATT  Y L  +V     KVRHLVDLL FYY PSP+T+EY  +   +  Y      PTI++LCEAG++++RA++A+S++DF+FK+G
Subjt:  YGDKTLMEWMEMLFTATTATYMLPSDV--SPTKVRHLVDLLTFYYNPSPETDEYYKFIENEEGYSFPYKAPTITKLCEAGIKVRRAEEAQSMMDFSFKNG

Query:  VLKIPPFTIHTNFEIQFRNLIVSEILSDDVKGEKYLFHYLLLLDDLIKTENDVSILVKEEIIINETDDSDEQVSKIIHSLHLMAPTIRLKSYFNYMSGDL
        VLKIPPF IH  FEIQ RNLIVSEI   D    K++F Y+ LLDDLI TE DVSILVKE+I+ NE   SDE V K+I+ L L APT  L  Y++ MS DL
Subjt:  VLKIPPFTIHTNFEIQFRNLIVSEILSDDVKGEKYLFHYLLLLDDLIKTENDVSILVKEEIIINETDDSDEQVSKIIHSLHLMAPTIRLKSYFNYMSGDL

Query:  NDYCNRPMASIHRSMALLRRDYFNSPWASISCLAAIFLLILDCLQTLFSAPSFF
        N++C +     +RS+A LRRDYFN+PWASIS +AA  LLIL  LQTLFSAP+FF
Subjt:  NDYCNRPMASIHRSMALLRRDYFNSPWASISCLAAIFLLILDCLQTLFSAPSFF

A0A1S3BBL9 UPF0481 protein At3g47200-like1.1e-6538.66Show/hide
Query:  NDVVISME------VPTKSPKSFLCKVPKMFRKMNNETAYTPQFISIGPFH-HGQPDLMDAEQYKDGALRKYLSR-----------KRMKFEEAMSLAYS
        ++VVIS+E       PT   +  + +VPK  R+MN + AY PQ ISIGPFH H   +L+  EQYK      YL R           +    E+ +  A S
Subjt:  NDVVISME------VPTKSPKSFLCKVPKMFRKMNNETAYTPQFISIGPFH-HGQPDLMDAEQYKDGALRKYLSR-----------KRMKFEEAMSLAYS

Query:  WDEKARSCYGYHIDMNKHDFAKMMLLDGCFILEFTMKSTTNY---RWSLFSQSE--------------VERDLTMFENQLPFFVLQQLYHKSSNPQNKNE
        W E+AR+CY   I+MN  DF KMML+DGCFI+EF +     Y     SLF Q E              ++ DL   ENQLPFFVLQ L+     P++K+ 
Subjt:  WDEKARSCYGYHIDMNKHDFAKMMLLDGCFILEFTMKSTTNY---RWSLFSQSE--------------VERDLTMFENQLPFFVLQQLYHKSSNPQNKNE

Query:  EDPRKQLYGDKTLMEWMEMLFTATTATYMLPSDVSPTKVRHLVDLLTFYYNPSPETDEYYKFIENEE-GYSFPYKAPTITKLCEAGIKVRRAEE-AQSMM
         +  KQL  +   M W+E            PSD+   K +H +D L+FY  P  + +   K  + EE     P   P+IT++CEAG+ +++A++  + ++
Subjt:  EDPRKQLYGDKTLMEWMEMLFTATTATYMLPSDVSPTKVRHLVDLLTFYYNPSPETDEYYKFIENEE-GYSFPYKAPTITKLCEAGIKVRRAEE-AQSMM

Query:  DFSFKNGVLKIPPFTIHTNFEIQFRNLIVSEILSDDVKGEKYLFHYLLLLDDLIKTENDVSILVKEEIIINETDDSDEQVSKIIHSLHLMAPTIRLKSYF
        +  F+NG+L+IPP  I   FE   RNL+  E    +VK   Y+  YL  +D LI TE DV++LVKE+IIIN+   SD +VS++ ++L     +    +YF
Subjt:  DFSFKNGVLKIPPFTIHTNFEIQFRNLIVSEILSDDVKGEKYLFHYLLLLDDLIKTENDVSILVKEEIIINETDDSDEQVSKIIHSLHLMAPTIRLKSYF

Query:  NYMSGDLNDYCNRPMASIHRSMALLRRDYFNSPWASISCLAAIFLLILDCLQTLFSAPSFFLK
        N  S  L D+C+R     +++ A L+ +YFN+PWA+IS  AA FLL+L  LQT+FSA S F K
Subjt:  NYMSGDLNDYCNRPMASIHRSMALLRRDYFNSPWASISCLAAIFLLILDCLQTLFSAPSFFLK

A0A5A7V9C4 UPF0481 protein1.8e-6538.84Show/hide
Query:  PTKSPKSFLCKVPKMFRKMNNETAYTPQFISIGPFH-HGQPDLMDAEQYKDGALRKYLSR-----------KRMKFEEAMSLAYSWDEKARSCYGYHIDM
        PT   +  + +VPK  R+MN + AY PQ ISIGPFH H   +L+  EQYK      YL R           +    E+ +  A SW E+AR+CY   I+M
Subjt:  PTKSPKSFLCKVPKMFRKMNNETAYTPQFISIGPFH-HGQPDLMDAEQYKDGALRKYLSR-----------KRMKFEEAMSLAYSWDEKARSCYGYHIDM

Query:  NKHDFAKMMLLDGCFILEFTMKSTTNY---RWSLFSQSE--------------VERDLTMFENQLPFFVLQQLYHKSSNPQNKNEEDPRKQLYGDKTLME
        N  DF KMML+DGCFI+EF +     Y     SLF Q E              ++ DL   ENQLPFFVLQ L+     P++K+  +  KQL  +   M 
Subjt:  NKHDFAKMMLLDGCFILEFTMKSTTNY---RWSLFSQSE--------------VERDLTMFENQLPFFVLQQLYHKSSNPQNKNEEDPRKQLYGDKTLME

Query:  WMEMLFTATTATYMLPSDVSPTKVRHLVDLLTFYYNPSPETDEYYKFIENEE-GYSFPYKAPTITKLCEAGIKVRRAEE-AQSMMDFSFKNGVLKIPPFT
        W+E            PSD+   K +H +D L+FY  P  + +   K  + EE     P   P+IT++CEAG+ +++A++  + +++  F+NG+L+IPP  
Subjt:  WMEMLFTATTATYMLPSDVSPTKVRHLVDLLTFYYNPSPETDEYYKFIENEE-GYSFPYKAPTITKLCEAGIKVRRAEE-AQSMMDFSFKNGVLKIPPFT

Query:  IHTNFEIQFRNLIVSEILSDDVKGEKYLFHYLLLLDDLIKTENDVSILVKEEIIINETDDSDEQVSKIIHSLHLMAPTIRLKSYFNYMSGDLNDYCNRPM
        I   FE   RNL+  E    +VK   Y+  YL  +D LI TE DV++LVKE+IIIN+   SD +VS++ ++L     +    +YFN  S  L D+C+R  
Subjt:  IHTNFEIQFRNLIVSEILSDDVKGEKYLFHYLLLLDDLIKTENDVSILVKEEIIINETDDSDEQVSKIIHSLHLMAPTIRLKSYFNYMSGDLNDYCNRPM

Query:  ASIHRSMALLRRDYFNSPWASISCLAAIFLLILDCLQTLFSAPSFFLK
           +++ A L+ +YFN+PWA+IS  AA FLL+L  LQT+FSA S F K
Subjt:  ASIHRSMALLRRDYFNSPWASISCLAAIFLLILDCLQTLFSAPSFFLK

A0A6J1BQT6 UPF0481 protein At3g47200-like1.3e-7139.25Show/hide
Query:  NELEMDKGEGNDVVISME-------VPTKSPKSFLCKVPKMFRKMNNETAYTPQFISIGPFHH-GQPDLMDAEQYKDGALRKYLSRKRMKFEEAMSLAYS
        N  E+D+     VVIS+E        P   P+  + +VPK    MN + AYTPQ ISIGPFHH  Q +L+  +Q+K  AL  YL R +M  E  + +  +
Subjt:  NELEMDKGEGNDVVISME-------VPTKSPKSFLCKVPKMFRKMNNETAYTPQFISIGPFHH-GQPDLMDAEQYKDGALRKYLSRKRMKFEEAMSLAYS

Query:  WDEKARSCYGYHIDMNKHDFAKMMLLDGCFILEFTMKSTTNYRW------SLFSQ---SEVERDLTMFENQLPFFVLQQLYHKSSNPQNKNEEDPRKQLY
        W+ +ARSCYG  I MN   F  M+LLDGCF++ F +    NY        S F +   S++  D+TM ENQLPFFVLQ LY          +  P+    
Subjt:  WDEKARSCYGYHIDMNKHDFAKMMLLDGCFILEFTMKSTTNYRW------SLFSQ---SEVERDLTMFENQLPFFVLQQLYHKSSNPQNKNEEDPRKQLY

Query:  GDKTLMEWMEMLFTATTATYMLPSDVSPTKVRHLVDLLTFYYNPSPETDEYYKFIENEEGYSFPYKAPTITKLCEAGIKVRRAEEAQSMMDFSFKNGVLK
         + +L++ +E  F+ +   + +P  VSP  V+HLVDLL+ Y+ P  +T + +    +++ Y      P +T+LCEAG+ +++ +EA  +MD SFKNGVL+
Subjt:  GDKTLMEWMEMLFTATTATYMLPSDVSPTKVRHLVDLLTFYYNPSPETDEYYKFIENEEGYSFPYKAPTITKLCEAGIKVRRAEEAQSMMDFSFKNGVLK

Query:  IPPFTIHTNFEIQFRNLIVSEILSDDVKGEKYLFHYLLLLDDLIKTENDVSILVKEEIIINETDDSDEQVSKIIHSLHLMAPTIRLKSYFNYMSGDLNDY
        IPP  I  +FE   RNL+  E         +Y   Y L LD +I TE DV +LV+  IIIN    SD++VS++ + L           Y N+++  L+D+
Subjt:  IPPFTIHTNFEIQFRNLIVSEILSDDVKGEKYLFHYLLLLDDLIKTENDVSILVKEEIIINETDDSDEQVSKIIHSLHLMAPTIRLKSYFNYMSGDLNDY

Query:  CNRPMASIHRSMALLRRDYFNSPWASISCLAAIFLLILDCLQTLFSAPSFF
        C +      RS A L+RDYFNSPWA IS +AA +++IL  LQT+F+A S F
Subjt:  CNRPMASIHRSMALLRRDYFNSPWASISCLAAIFLLILDCLQTLFSAPSFF

A0A6J1BVD4 UPF0481 protein At3g47200-like1.7e-7142.3Show/hide
Query:  VISMEVPTKSPKSFLCKVPKMFRKMNNETAYTPQFISIGPFHHGQPDLMDAEQYKDGALRKYLSRKRMKFEEAMSLAYSWDEKARSCYGYHIDMNKHDFA
        ++  + P  + +  + +VPK    M + TAYTP+ I+IGPFHHG+ DL+  +Q K    R YL R     +  +++A  W+ KAR  Y   I+M   DF 
Subjt:  VISMEVPTKSPKSFLCKVPKMFRKMNNETAYTPQFISIGPFHHGQPDLMDAEQYKDGALRKYLSRKRMKFEEAMSLAYSWDEKARSCYGYHIDMNKHDFA

Query:  KMMLLDGCFILE---------FTMKSTTNYRWSLFSQSEVERDLTMFENQLPFFVLQQLYHKSSNPQNKNEED-PRKQLYGDKTLMEWMEMLFTATTATY
        +MML+D CFI+E         F  +   +  ++     ++ ++LTM ENQLPFFVLQ L+     P +KN+ +    QL G        + L       Y
Subjt:  KMMLLDGCFILE---------FTMKSTTNYRWSLFSQSEVERDLTMFENQLPFFVLQQLYHKSSNPQNKNEED-PRKQLYGDKTLMEWMEMLFTATTATY

Query:  MLPSDVSPT-KVRHLVDLLTFYYNPSPETDEY--YKFIENEEGYSFPYKAPTITKLCEAGIKVRRAEEAQSMMDFSFKNGVLKIPPFTIHTNFEIQFRNL
         LP  VS T +V HLVDLL FYY PSP+T+EY   K I+N   +  P   PTITKLCEAG+KV++A  A+S++D SFK GVL+IPPF IH +FEI  RNL
Subjt:  MLPSDVSPT-KVRHLVDLLTFYYNPSPETDEY--YKFIENEEGYSFPYKAPTITKLCEAGIKVRRAEEAQSMMDFSFKNGVLKIPPFTIHTNFEIQFRNL

Query:  IVSE-ILSDDVKGEKYLFHYLLLLDDLIKTENDVSILVKEEIIINETDDSDEQVSKIIHSLHLMAPTIRLKSYFNYMSGDLNDYCNRPMASIHRSMALLR
        +  E     +   ++Y+ HY+  LD LI T  DV++LVKE IIIN    S+++VS++ ++L    P I  + YF   S +L+D+C +      RS A LR
Subjt:  IVSE-ILSDDVKGEKYLFHYLLLLDDLIKTENDVSILVKEEIIINETDDSDEQVSKIIHSLHLMAPTIRLKSYFNYMSGDLNDYCNRPMASIHRSMALLR

Query:  RDYFNSPWASISCLAAIFLLILDCLQTLFSAPSFF
        RDYF+SPWA IS  AA FL++L  LQT+F+A S F
Subjt:  RDYFNSPWASISCLAAIFLLILDCLQTLFSAPSFF

SwissProt top hitse value%identityAlignment
P0C897 Putative UPF0481 protein At3g026451.1e-1128.48Show/hide
Query:  ELEMDKGEGNDVVISMEVPTKSPKSFLCKVPKMFRKMNNETAYTPQFISIGPFHHGQPDLMDAEQYKDGALRKYLSR-KRMKFEEAMSLAYSWDEKARSC
        + E+++ +  +V +S+      PK+ +C  P          +YTP  +SIGP+H  +P+L + E+YK    RK  ++    +F + +    S + K R+C
Subjt:  ELEMDKGEGNDVVISMEVPTKSPKSFLCKVPKMFRKMNNETAYTPQFISIGPFHHGQPDLMDAEQYKDGALRKYLSR-KRMKFEEAMSLAYSWDEKARSC

Query:  YGYHIDMNKHDFAKMMLLDGCFILEFTMKSTTNYRWSLFSQ---SEVERDLTMFENQLPFFVLQQ
        Y  +I  N      +M +D  F++EF    +     +L ++   +E+ RD+ M ENQ+P FVL++
Subjt:  YGYHIDMNKHDFAKMMLLDGCFILEFTMKSTTNYRWSLFSQ---SEVERDLTMFENQLPFFVLQQ

Q9SD53 UPF0481 protein At3g472001.4e-2226.79Show/hide
Query:  KVPKMFRKMNNETAYTPQFISIGPFHHGQPDLMDAEQYKDGALRKYLSRKRMKFEE---AMSLAYSWDEKARSCYGYHIDMNKHDFAKMMLLDGCFILEF
        +VP+ F  +N + AY P+ +SIGP+H+G+  L   +Q+K   L+ +L   + K  E    +      ++K R  Y   +    HD   MM+LDGCFIL  
Subjt:  KVPKMFRKMNNETAYTPQFISIGPFHHGQPDLMDAEQYKDGALRKYLSRKRMKFEE---AMSLAYSWDEKARSCYGYHIDMNKHDFAKMMLLDGCFILEF

Query:  TMKSTTNYRWS---LFS----QSEVERDLTMFENQLPFFVLQQLYHKSSNPQNKNEEDPRKQLYGDKTLMEWMEMLFTATTATYMLPSDVSPTKVRHLVD
         +  + N   S   +FS     S ++ DL + ENQ+PFFVLQ LY  S      + +  R   +  K  ++  E  +      Y         K +HL+D
Subjt:  TMKSTTNYRWS---LFS----QSEVERDLTMFENQLPFFVLQQLYHKSSNPQNKNEEDPRKQLYGDKTLMEWMEMLFTATTATYMLPSDVSPTKVRHLVD

Query:  LL--TFYYNPSPETDE------YYKFIENEEGYSFPYKAPTITKLCEA------GIKVR-RAEEAQSMMDFSFKNGVLKIPPFTIHTNFEIQFRNLIVSE
        L+  TF  N S E+D+        +  E + G      +  +  +  A      GIK R R  +  S+++   K   L+IP           F N +  E
Subjt:  LL--TFYYNPSPETDE------YYKFIENEEGYSFPYKAPTITKLCEA------GIKVR-RAEEAQSMMDFSFKNGVLKIPPFTIHTNFEIQFRNLIVSE

Query:  ILSDDVKGEKYLFHYLLLLDDLIKTENDVSILVKEEIIINETDDSDEQVSKIIHSLHLMAPTIRLKSYFNYMSGDLNDYCNRPMASIHRSMALLRRDYFN
            D   E  +  Y++ +  L+  E DV+ L  +++II     S+ +VS+   ++          SY N +   +N+Y  +    +    A  R  +F 
Subjt:  ILSDDVKGEKYLFHYLLLLDDLIKTENDVSILVKEEIIINETDDSDEQVSKIIHSLHLMAPTIRLKSYFNYMSGDLNDYCNRPMASIHRSMALLRRDYFN

Query:  SPWASISCLAAIFLLILDCLQTLFSAPSFFLKK
        SPW  +S  A +F+++L  LQ+  +  S+   K
Subjt:  SPWASISCLAAIFLLILDCLQTLFSAPSFFLKK

Arabidopsis top hitse value%identityAlignment
AT2G36430.1 Plant protein of unknown function (DUF247)3.6e-3429.09Show/hide
Query:  PKSFLCKVPKMFRKMNNETAYTPQFISIGPFHHGQPDLMDAEQYKDGALRKYLSR-KRMKFEEAMSLAYSWDEKARSCYGYHIDMNKHDFAKMMLLDGCF
        P   + +VP+     N    Y P+ +SIGP+H GQ  L   E++K   L   L+R + +  E+ M    + +E AR CY   I M+  +F +MM+LDGCF
Subjt:  PKSFLCKVPKMFRKMNNETAYTPQFISIGPFHHGQPDLMDAEQYKDGALRKYLSR-KRMKFEEAMSLAYSWDEKARSCYGYHIDMNKHDFAKMMLLDGCF

Query:  ILEFTMKSTTNYRWS----LFSQSEV----ERDLTMFENQLPFFVLQQLYHKSSNPQNKNEEDPRKQLYGDKTLMEWMEMLFTATTATYMLPSDVSPTKV
        +LE   K      +     L + + V     RD    ENQ+PFFVL+ L++ +    N+NE +   Q          M         T    +     + 
Subjt:  ILEFTMKSTTNYRWS----LFSQSEV----ERDLTMFENQLPFFVLQQLYHKSSNPQNKNEEDPRKQLYGDKTLMEWMEMLFTATTATYMLPSDVSPTKV

Query:  RHLVDLLTFYYNPSPETDEYYKFIENEEGYSFP-YKAPTITKLCEAGIKVRRAEEAQSMMDFSFKNGVLKIPPFTIHTNFEIQFRNLIVSEILSDDVKGE
        +HL+DLL   +   PE++ +     N      P +   +I+KL  AGIK+R  ++A+S +   F++G +++P  T+         N +  E     V   
Subjt:  RHLVDLLTFYYNPSPETDEYYKFIENEEGYSFP-YKAPTITKLCEAGIKVRRAEEAQSMMDFSFKNGVLKIPPFTIHTNFEIQFRNLIVSEILSDDVKGE

Query:  KYLFHYLLLLDDLIKTENDVSILVKEEIIINETDDSDEQVSKIIHSLHLMAPTIRLKSYFNYMSGDLNDYCNRPMASIHRSMALLRRDYFNSPWASISCL
         +   Y  LLD L  T  DV  L  + II N    +D +++K ++SL         + Y   +  ++N+Y     +S H   A  +  YFNSPW+ +S L
Subjt:  KYLFHYLLLLDDLIKTENDVSILVKEEIIINETDDSDEQVSKIIHSLHLMAPTIRLKSYFNYMSGDLNDYCNRPMASIHRSMALLRRDYFNSPWASISCL

Query:  AAIFLLILDCLQTLFS
        AA+ LL+L  +QT+++
Subjt:  AAIFLLILDCLQTLFS

AT3G50150.1 Plant protein of unknown function (DUF247)5.4e-3828.41Show/hide
Query:  ISMEVPTKSPKSFLCKVPKMFRKMNNETAYTPQFISIGPFHHGQPDLMDAEQYKDGALRKYLSRKRMKFEEAMSLAYSWDEKARSCYGYHIDM-NKHDFA
        +S +      K  + +VP  + + N++ +Y PQ +SIGP+HHG+  L   E++K  A+   ++R +   E  +      +E+AR+CY   IDM N ++F 
Subjt:  ISMEVPTKSPKSFLCKVPKMFRKMNNETAYTPQFISIGPFHHGQPDLMDAEQYKDGALRKYLSRKRMKFEEAMSLAYSWDEKARSCYGYHIDM-NKHDFA

Query:  KMMLLDGCFILEFTMKSTTNYRWSLFSQSE-----------VERDLTMFENQLPFFVLQQLYHKSSNPQNKNEEDPRKQLYGDKTLMEWMEMLFTATTA-
        +M++LDGCF+LE    +   ++   +++++           ++RD+ M ENQLP FVL +L    +   N+        +   KTLM   E+L  +  + 
Subjt:  KMMLLDGCFILEFTMKSTTNYRWSLFSQSE-----------VERDLTMFENQLPFFVLQQLYHKSSNPQNKNEEDPRKQLYGDKTLMEWMEMLFTATTA-

Query:  -TYMLPSDVSPTKVRHLVDLLTFYYNPSPETD------EYYKFIENEEGYSFPYKAPTITKLCEAGIKVRRAEEAQSMMDFSFKNGVLKIPPFTIHTNFE
         +     ++      H +D+       S ET       E    +E ++          +T+L  AG+   R E  Q + D  FKNG LKIP   IH   +
Subjt:  -TYMLPSDVSPTKVRHLVDLLTFYYNPSPETD------EYYKFIENEEGYSFPYKAPTITKLCEAGIKVRRAEEAQSMMDFSFKNGVLKIPPFTIHTNFE

Query:  IQFRNLIVSEILSDDVKGEKYLFHYLLLLDDLIKTENDVSILVKEEIIINETDDSDEQVSKIIHSLHLMAPTIRLKSYFNYMSGDLNDYCNRPMASIHRS
          F NLI  E      +    +  Y++ +D+LI +  DVS L   + II     SD +V+ + + L           Y + +S ++N Y +R   S+   
Subjt:  IQFRNLIVSEILSDDVKGEKYLFHYLLLLDDLIKTENDVSILVKEEIIINETDDSDEQVSKIIHSLHLMAPTIRLKSYFNYMSGDLNDYCNRPMASIHRS

Query:  MALLRRDYFNSPWASISCLAAIFLLILDCLQTLFSAPSFF
         A LR+ YFN+PWA  S  AA+ LL L   Q+ F+  +++
Subjt:  MALLRRDYFNSPWASISCLAAIFLLILDCLQTLFSAPSFF

AT3G50160.1 Plant protein of unknown function (DUF247)1.0e-3629.09Show/hide
Query:  KVPKMFRKMNNETAYTPQFISIGPFHHGQPDLMDAEQYKDGALRKYLSRKRMKFEEAMSLAYSWDEKARSCYGYHIDMNKHDFAKMMLLDGCFILEFTMK
        +VP   ++ N+  +Y PQ +SIGP+HHG   LM  E++K  A+   ++R +   E  +      +EKAR+CY   I+MN+++F +M++LDG FI+E    
Subjt:  KVPKMFRKMNNETAYTPQFISIGPFHHGQPDLMDAEQYKDGALRKYLSRKRMKFEEAMSLAYSWDEKARSCYGYHIDMNKHDFAKMMLLDGCFILEFTMK

Query:  STTNYRWSLFSQSE-----------VERDLTMFENQLPFFVLQQLYHKSSNPQNKNEEDPRKQLYGDKTLMEWMEMLFTATTATYMLPSDVSPTKVRHLV
        ++  ++   ++ ++           + RD+ M ENQLP+ VL+ L               ++    DK  ++  +  F     T  + ++       H +
Subjt:  STTNYRWSLFSQSE-----------VERDLTMFENQLPFFVLQQLYHKSSNPQNKNEEDPRKQLYGDKTLMEWMEMLFTATTATYMLPSDVSPTKVRHLV

Query:  DLLTFYYNPSPETDEYYKFIENEEGYSFPYKAPTITKLCEAGIKVRRAEEAQSMMDFSFKNGVLKIPPFTIHTNFEIQFRNLIVSEILSDDVKGEKYLFH
        D+L      S  T +    + N++     +    +T+L  AG++  R +E     D  FKNG LKIP   IH   +  F NLI  E     +K  K +  
Subjt:  DLLTFYYNPSPETDEYYKFIENEEGYSFPYKAPTITKLCEAGIKVRRAEEAQSMMDFSFKNGVLKIPPFTIHTNFEIQFRNLIVSEILSDDVKGEKYLFH

Query:  YLLLLDDLIKTENDVSILVKEEIIINETDDSDEQVSKIIHSLHLMAPTIRLKSYFNYMSGDLNDYCNRPMASIHRSMALLRRDYFNSPWASISCLAAIFL
        Y++ +D+LI +  DVS L    II N    SD +VS + + L           Y + ++G++N Y  R    +    A LR  YFN+PWA  S +AA+ L
Subjt:  YLLLLDDLIKTENDVSILVKEEIIINETDDSDEQVSKIIHSLHLMAPTIRLKSYFNYMSGDLNDYCNRPMASIHRSMALLRRDYFNSPWASISCLAAIFL

Query:  LILDCLQTLFSAPSFF
        LI    Q+ F+  ++F
Subjt:  LILDCLQTLFSAPSFF

AT3G50170.1 Plant protein of unknown function (DUF247)7.0e-3828.47Show/hide
Query:  KSFLCKVPKMFRKMNNETAYTPQFISIGPFHHGQPDLMDAEQYKDGALRKYLSRKRMKFEEAMSLAYSWDEKARSCYGYHIDMNKHDFAKMMLLDGCFIL
        K  + +VP   ++ N++ +Y PQ +S+GP+HHG+  L   E++K  AL K L R + + E   +     +EKAR+CY   I +++++F +M++LDGCF+L
Subjt:  KSFLCKVPKMFRKMNNETAYTPQFISIGPFHHGQPDLMDAEQYKDGALRKYLSRKRMKFEEAMSLAYSWDEKARSCYGYHIDMNKHDFAKMMLLDGCFIL

Query:  EFTMKSTTNYRWSLFSQSE-----------VERDLTMFENQLPFFVLQQLYHKSSNPQNKNEEDPRKQLYGDKTLMEWMEMLFTATTATYMLPSDVSPTK
        E    +   +    +++++           ++RD+ M ENQLP FVL +L       QN+        +     LM   E L     +  M   + S   
Subjt:  EFTMKSTTNYRWSLFSQSE-----------VERDLTMFENQLPFFVLQQLYHKSSNPQNKNEEDPRKQLYGDKTLMEWMEMLFTATTATYMLPSDVSPTK

Query:  V-----RHLVDLL---TFYYNPSPETDEYYKFIENEEGYSFPYK---APTITKLCEAGIKVRRAEEAQSMMDFSFKNGVLKIPPFTIHTNFEIQFRNLIV
        +      H +D+        +P+P T    K +          +      +T+L EAG+K R+  +     D  FKNG L+IP   IH   +  F NLI 
Subjt:  V-----RHLVDLL---TFYYNPSPETDEYYKFIENEEGYSFPYK---APTITKLCEAGIKVRRAEEAQSMMDFSFKNGVLKIPPFTIHTNFEIQFRNLIV

Query:  SEILSDDVKGEKYLFHYLLLLDDLIKTENDVSILVKEEIIINETDDSDEQVSKIIHSLHLMAPTIRLKSYFNYMSGDLNDYCNRPMASIHRSMALLRRDY
         E     ++   ++  Y++ +D+LI +  DVS L     II     SD +V+ + + L          S+ + +SGD+N Y NR    +    A L   Y
Subjt:  SEILSDDVKGEKYLFHYLLLLDDLIKTENDVSILVKEEIIINETDDSDEQVSKIIHSLHLMAPTIRLKSYFNYMSGDLNDYCNRPMASIHRSMALLRRDY

Query:  FNSPWASISCLAAIFLLILDCLQTLFSAPSFF
        FN+PWA  S  AA+ LL+L   Q+ ++  +++
Subjt:  FNSPWASISCLAAIFLLILDCLQTLFSAPSFF

AT4G31980.1 unknown protein2.7e-4529.91Show/hide
Query:  MDKGEGNDVVISME-----VPTKSPKSFLCKVPKMFRKMNNETAYTPQFISIGPFHHGQPDLMDAEQYKDGALRKYLSRKRMKFEEAMSLAYSWDEKARS
        M++ EG+ +V S++     + + S K  + KVP   R++N + AYTP+ +S GP H G+ +L   E  K   L  ++ R     E+ + LA +W++ ARS
Subjt:  MDKGEGNDVVISME-----VPTKSPKSFLCKVPKMFRKMNNETAYTPQFISIGPFHHGQPDLMDAEQYKDGALRKYLSRKRMKFEEAMSLAYSWDEKARS

Query:  CYGYHIDMNKHDFAKMMLLDGCFILEFTMKS----TTNYRWSLFSQS----EVERDLTMFENQLPFFVLQQLYHKSSNPQNKNEEDPRKQLYGDKTLMEW
        CY   + ++  +F +M+++DG F++E  ++S           +F  S    +V RD+ + ENQLPFFV+++++    N   +          G  ++++ 
Subjt:  CYGYHIDMNKHDFAKMMLLDGCFILEFTMKS----TTNYRWSLFSQS----EVERDLTMFENQLPFFVLQQLYHKSSNPQNKNEEDPRKQLYGDKTLMEW

Query:  MEMLFTATTATYMLPSDVSPTKVRHLVDLLTFYYNPSPETDEYYKFIENEEGYSFPYKAPTITKLCEAGIKVRRAEEAQSMMDFSFKNGVLKIPPFTIHT
         +  F+   +   +  +   T+  H VDLL   Y P       Y  ++ +        AP  T+L  AG++ + AE +  ++D SF +GVLKIP   +  
Subjt:  MEMLFTATTATYMLPSDVSPTKVRHLVDLLTFYYNPSPETDEYYKFIENEEGYSFPYKAPTITKLCEAGIKVRRAEEAQSMMDFSFKNGVLKIPPFTIHT

Query:  NFEIQFRNLIVSEILSDDVKGEKYLFHYLLLLDDLIKTENDVSILVKEEIIINETDDSDEQVSKIIHSLHLMAPTIRLKSYFNYMSGDLNDYCNRPMASI
          E  ++N+I  E         K    Y++LL   IK+  D  +L+   II+N   +S + VS + +S+       R + YF+ +S +L  YCN P    
Subjt:  NFEIQFRNLIVSEILSDDVKGEKYLFHYLLLLDDLIKTENDVSILVKEEIIINETDDSDEQVSKIIHSLHLMAPTIRLKSYFNYMSGDLNDYCNRPMASI

Query:  HRSMALLRRDYFNSPWASISCLAAIFLLILDCLQTLFS
        +R  A+LRRDYF++PWA  S  AA+ LL+L  +Q++ S
Subjt:  HRSMALLRRDYFNSPWASISCLAAIFLLILDCLQTLFS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGTGAAAATGAATTGGAAATGGATAAGGGTGAGGGTAATGATGTTGTGATATCCATGGAAGTGCCGACGAAATCTCCAAAAAGCTTCTTGTGTAAAGTTCCCAAAAT
GTTCCGGAAGATGAATAATGAAACAGCCTACACGCCTCAATTCATTTCAATTGGGCCTTTCCACCATGGCCAACCAGATTTAATGGACGCAGAACAATATAAGGATGGGG
CTCTCAGAAAATATCTTTCTCGTAAAAGAATGAAATTTGAGGAGGCCATGAGTTTAGCTTACTCATGGGATGAAAAAGCTCGTAGTTGCTATGGATATCACATAGACATG
AACAAACATGACTTTGCAAAAATGATGCTTTTAGATGGTTGTTTTATATTGGAGTTTACGATGAAGTCAACTACAAACTACAGGTGGAGTCTTTTCAGTCAAAGTGAAGT
AGAACGTGACCTAACTATGTTTGAGAATCAGCTTCCTTTCTTTGTTCTTCAACAATTATATCACAAATCATCAAACCCACAAAACAAAAATGAGGAAGATCCCAGAAAAC
AGTTGTATGGCGATAAAACATTAATGGAATGGATGGAAATGTTGTTTACAGCGACCACGGCTACATATATGCTTCCTTCTGACGTGTCGCCGACCAAAGTAAGACACTTA
GTGGATTTATTAACATTCTACTACAATCCCTCACCTGAGACAGATGAGTATTATAAATTCATTGAGAATGAAGAAGGGTACTCCTTTCCCTACAAAGCCCCAACTATAAC
TAAGCTCTGCGAGGCTGGTATCAAGGTACGGAGAGCAGAGGAAGCCCAGAGCATGATGGACTTTAGCTTTAAAAATGGGGTTCTCAAAATTCCACCTTTCACAATTCATA
CTAACTTTGAAATCCAATTTCGAAATTTGATAGTATCTGAGATTTTAAGTGATGACGTTAAAGGTGAAAAGTATTTATTCCATTATCTTTTACTCTTAGATGACTTGATT
AAGACAGAAAATGATGTGAGTATACTTGTGAAGGAGGAAATTATAATCAACGAAACTGATGATAGTGACGAACAAGTTTCGAAAATAATTCACAGTCTACATCTAATGGC
TCCAACCATTCGACTTAAATCTTACTTCAACTATATGAGCGGAGATTTAAACGACTATTGCAACCGACCTATGGCTTCAATACACCGATCTATGGCTTTACTAAGACGTG
ATTATTTCAATTCTCCATGGGCTTCTATCTCCTGTCTTGCTGCTATCTTCCTCCTTATCCTTGATTGCCTTCAAACCCTATTCTCTGCTCCATCTTTTTTCCTTAAAAAA
TAA
mRNA sequenceShow/hide mRNA sequence
ATGGGTGAAAATGAATTGGAAATGGATAAGGGTGAGGGTAATGATGTTGTGATATCCATGGAAGTGCCGACGAAATCTCCAAAAAGCTTCTTGTGTAAAGTTCCCAAAAT
GTTCCGGAAGATGAATAATGAAACAGCCTACACGCCTCAATTCATTTCAATTGGGCCTTTCCACCATGGCCAACCAGATTTAATGGACGCAGAACAATATAAGGATGGGG
CTCTCAGAAAATATCTTTCTCGTAAAAGAATGAAATTTGAGGAGGCCATGAGTTTAGCTTACTCATGGGATGAAAAAGCTCGTAGTTGCTATGGATATCACATAGACATG
AACAAACATGACTTTGCAAAAATGATGCTTTTAGATGGTTGTTTTATATTGGAGTTTACGATGAAGTCAACTACAAACTACAGGTGGAGTCTTTTCAGTCAAAGTGAAGT
AGAACGTGACCTAACTATGTTTGAGAATCAGCTTCCTTTCTTTGTTCTTCAACAATTATATCACAAATCATCAAACCCACAAAACAAAAATGAGGAAGATCCCAGAAAAC
AGTTGTATGGCGATAAAACATTAATGGAATGGATGGAAATGTTGTTTACAGCGACCACGGCTACATATATGCTTCCTTCTGACGTGTCGCCGACCAAAGTAAGACACTTA
GTGGATTTATTAACATTCTACTACAATCCCTCACCTGAGACAGATGAGTATTATAAATTCATTGAGAATGAAGAAGGGTACTCCTTTCCCTACAAAGCCCCAACTATAAC
TAAGCTCTGCGAGGCTGGTATCAAGGTACGGAGAGCAGAGGAAGCCCAGAGCATGATGGACTTTAGCTTTAAAAATGGGGTTCTCAAAATTCCACCTTTCACAATTCATA
CTAACTTTGAAATCCAATTTCGAAATTTGATAGTATCTGAGATTTTAAGTGATGACGTTAAAGGTGAAAAGTATTTATTCCATTATCTTTTACTCTTAGATGACTTGATT
AAGACAGAAAATGATGTGAGTATACTTGTGAAGGAGGAAATTATAATCAACGAAACTGATGATAGTGACGAACAAGTTTCGAAAATAATTCACAGTCTACATCTAATGGC
TCCAACCATTCGACTTAAATCTTACTTCAACTATATGAGCGGAGATTTAAACGACTATTGCAACCGACCTATGGCTTCAATACACCGATCTATGGCTTTACTAAGACGTG
ATTATTTCAATTCTCCATGGGCTTCTATCTCCTGTCTTGCTGCTATCTTCCTCCTTATCCTTGATTGCCTTCAAACCCTATTCTCTGCTCCATCTTTTTTCCTTAAAAAA
TAA
Protein sequenceShow/hide protein sequence
MGENELEMDKGEGNDVVISMEVPTKSPKSFLCKVPKMFRKMNNETAYTPQFISIGPFHHGQPDLMDAEQYKDGALRKYLSRKRMKFEEAMSLAYSWDEKARSCYGYHIDM
NKHDFAKMMLLDGCFILEFTMKSTTNYRWSLFSQSEVERDLTMFENQLPFFVLQQLYHKSSNPQNKNEEDPRKQLYGDKTLMEWMEMLFTATTATYMLPSDVSPTKVRHL
VDLLTFYYNPSPETDEYYKFIENEEGYSFPYKAPTITKLCEAGIKVRRAEEAQSMMDFSFKNGVLKIPPFTIHTNFEIQFRNLIVSEILSDDVKGEKYLFHYLLLLDDLI
KTENDVSILVKEEIIINETDDSDEQVSKIIHSLHLMAPTIRLKSYFNYMSGDLNDYCNRPMASIHRSMALLRRDYFNSPWASISCLAAIFLLILDCLQTLFSAPSFFLKK