; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0011834 (gene) of Snake gourd v1 genome

Gene IDTan0011834
OrganismTrichosanthes anguina (Snake gourd v1)
Descriptionprotein E6
Genome locationLG03:78448743..78450016
RNA-Seq ExpressionTan0011834
SyntenyTan0011834
Gene Ontology termsNA
InterPro domainsIPR040290 - Protein E6-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0052668.1 protein E6 [Cucumis melo var. makuwa]1.4e-8969.38Show/hide
Query:  MASSAKLIASLFLLTLLFIQIHARES-HFFSKVPNNG---------ETQTQIPN-KVLDPLTNSEKTTTNPQDQEPNFIPQTQDNGYGLYGHESGQLPPN
        MASS KL  S+ +LTLLFIQIHARES +FFSK+PNN            +TQIPN + +DPLTN EKTTT+PQDQ+PNFIPQTQDNGYGLYGHESGQLPPN
Subjt:  MASSAKLIASLFLLTLLFIQIHARES-HFFSKVPNNG---------ETQTQIPN-KVLDPLTNSEKTTTNPQDQEPNFIPQTQDNGYGLYGHESGQLPPN

Query:  FDAK-FSDPSA--GRPF--TTTYDNDNNYRTSNDALPAYNSESENQY-YNDN--------FQNSNSKKPYENSFYYNKDLYDNGRQSFQNTRLSRDDFTT
         D+K FSD S   GR F  TTTYDN++N +  ND +  Y SESE  Y Y DN        F+NSNS KPYENSFYYNKDLYDNGRQSFQNTRLSRDD+ T
Subjt:  FDAK-FSDPSA--GRPF--TTTYDNDNNYRTSNDALPAYNSESENQY-YNDN--------FQNSNSKKPYENSFYYNKDLYDNGRQSFQNTRLSRDDFTT

Query:  TTPSYDDNNNYNFYFNNNNNGGGD---NANNVARQGMSDTRFMENGKYFYDLNREPHHYSMSRGSFGNNHYNYNNNGNTYEYGNSMGRY--QNDEAEFQE
        TTP YD     NFY  N+NNGGGD   N NN+ RQGMSDTRFMENGKY+YDL+REPHHYS SRG FGNN  N NNN N+YEYGNSMGRY  QNDEAEFQE
Subjt:  TTPSYDDNNNYNFYFNNNNNGGGD---NANNVARQGMSDTRFMENGKYFYDLNREPHHYSMSRGSFGNNHYNYNNNGNTYEYGNSMGRY--QNDEAEFQE

Query:  ESDDFVP
        E D+FVP
Subjt:  ESDDFVP

KAG6594586.1 Protein E6, partial [Cucurbita argyrosperma subsp. sororia]1.0e-8971.38Show/hide
Query:  MASSAKLIASLFLLTLLFIQIHARESHFFSKVPNNGETQTQIPNKVLDPLTNSEKTTTNPQDQEPNFIPQTQDNGYGLYGHESGQLPPNFDAKFSDPSA-
        MASS KL+ SLFLL+L FIQIH RES+FFSKVPNN   ++QIPNKV DPLTNSEKTTT PQDQ+PNFIPQTQDNGYGLYGHESGQ  P+ DAKFSDP+A 
Subjt:  MASSAKLIASLFLLTLLFIQIHARESHFFSKVPNNGETQTQIPNKVLDPLTNSEKTTTNPQDQEPNFIPQTQDNGYGLYGHESGQLPPNFDAKFSDPSA-

Query:  --GRPF--TTTYDN-DNNYRTSNDALPAYNSESE------NQYYNDNFQNSNSKKPYENSFYYNKDLYDNGRQSFQNTRLSRDDF--TTTTPSY------
          GRPF  TTTYD+ +NNY+ ++D   +Y SESE      N Y NDNFQNSN KKPYENSFYYNKDLYDN RQSFQNTRLSR+++  TTTTPSY      
Subjt:  --GRPF--TTTYDN-DNNYRTSNDALPAYNSESE------NQYYNDNFQNSNSKKPYENSFYYNKDLYDNGRQSFQNTRLSRDDF--TTTTPSY------

Query:  DDNNNYNFYFNNNNNGGGDNANNVARQGMSDTRFMENGKYFYDLNREPHHYSMSRGSFGNNHYNYNNNGNTYEYGNSMGRYQNDEAEFQEESDDFVP
        DD++N NFY NN N    DN NNVARQGMSDTRFMENGKYFYDLNREP H S S      N++  NNN N YEYGNSMGRYQNDE EFQEESD+FVP
Subjt:  DDNNNYNFYFNNNNNGGGDNANNVARQGMSDTRFMENGKYFYDLNREPHHYSMSRGSFGNNHYNYNNNGNTYEYGNSMGRYQNDEAEFQEESDDFVP

XP_022926835.1 protein E6 [Cucurbita moschata]7.2e-9171.81Show/hide
Query:  MASSAKLIASLFLLTLLFIQIHARESHFFSKVPNNGETQTQIPNKVLDPLTNSEKTTTNPQDQEPNFIPQTQDNGYGLYGHESGQLPPNFDAKFSDPSA-
        MASS KL+ SLFLL+L FIQIH RES+FFSKVPNN   ++QIPNKV DPLTNSEKTTT PQDQ+PNFIPQTQDNGYGLYGHESGQ  P+ DAKFSDP+A 
Subjt:  MASSAKLIASLFLLTLLFIQIHARESHFFSKVPNNGETQTQIPNKVLDPLTNSEKTTTNPQDQEPNFIPQTQDNGYGLYGHESGQLPPNFDAKFSDPSA-

Query:  --GRPF--TTTYDN-DNNYRTSNDALPAYNSESE------NQYYNDNFQNSNSKKPYENSFYYNKDLYDNGRQSFQNTRLSRDDF---TTTTPSY-----
          GRPF  TTTYD+ +NNY+ ++D   +Y SESE      N Y NDNFQNSN KKPYENSFYYNKDLYDN RQSFQNTRLSR+++   TTTTPSY     
Subjt:  --GRPF--TTTYDN-DNNYRTSNDALPAYNSESE------NQYYNDNFQNSNSKKPYENSFYYNKDLYDNGRQSFQNTRLSRDDF---TTTTPSY-----

Query:  -DDNNNYNFYFNNNNNGGGDNANNVARQGMSDTRFMENGKYFYDLNREPHHYSMSRGSFGNNHYNYNNNGNTYEYGNSMGRYQNDEAEFQEESDDFVP
         DDNNN NFY NN N    DN NNVARQGMSDTRFMENGKYFYDL+REP H S S      N++  NNN NTYEYGNSMGRYQNDE EFQEESD+FVP
Subjt:  -DDNNNYNFYFNNNNNGGGDNANNVARQGMSDTRFMENGKYFYDLNREPHHYSMSRGSFGNNHYNYNNNGNTYEYGNSMGRYQNDEAEFQEESDDFVP

XP_023003783.1 protein E6-like [Cucurbita maxima]2.1e-9071.57Show/hide
Query:  MASSAKLIASLFLLTLLFIQIHARESHFFSKVPNNGETQTQIPNKVLDPLTNSEKTTTNPQDQEPNFIPQTQDNGYGLYGHESGQLPPNFDAKFSDPSA-
        MASS KL+ SLFLL+L FIQIH RES+FFSKVPNN   +TQIPN V DPLTNSEK TT+PQDQ+PNFIPQTQDNGYGLYGHES Q  P+ D+KFSDP+A 
Subjt:  MASSAKLIASLFLLTLLFIQIHARESHFFSKVPNNGETQTQIPNKVLDPLTNSEKTTTNPQDQEPNFIPQTQDNGYGLYGHESGQLPPNFDAKFSDPSA-

Query:  --GRPFTTTY---DNDNNYRTSNDALPAYNSESE------NQYYNDNFQNSNSKKPYENSFYYNKDLYDNGRQSFQNTRLSRDDF--TTTTPSY------
          GRPFTTT    D +NNY+ ++D   +Y SESE      N Y NDNFQNSN KKPYENSFYYNKDLYDN RQSFQNT LSRD++  TTTTPSY      
Subjt:  --GRPFTTTY---DNDNNYRTSNDALPAYNSESE------NQYYNDNFQNSNSKKPYENSFYYNKDLYDNGRQSFQNTRLSRDDF--TTTTPSY------

Query:  -DDNNNYNFYFNNNNNGGGDNANNVARQGMSDTRFMENGKYFYDLNREPHHYSM-SRGSFGNNHYNYNNNGNTYEYGNSMGRYQNDEAEFQEESDDFVP
         DDNNN NFY NN N    DN NNVARQGMSDTRFMENGKYFYDLNREP H S  SR +F NN+Y   NN NTYEYGNSMGRYQNDE EFQEESD+FVP
Subjt:  -DDNNNYNFYFNNNNNGGGDNANNVARQGMSDTRFMENGKYFYDLNREPHHYSM-SRGSFGNNHYNYNNNGNTYEYGNSMGRYQNDEAEFQEESDDFVP

XP_038883950.1 protein E6-like [Benincasa hispida]5.5e-9169.7Show/hide
Query:  MASSAKLIASLFLLTLLFIQIHARES-HFFSKVPNNG-----ETQTQIPNKVLDPLTNSEKTTTNPQDQEPNFIPQTQDNGYGLYGHESGQLPPNFDAKF
        MASS KL+ S+ LL+LL IQIHARES +FFSKVPNN        +TQ+PN  +DPLTN EKTTT PQDQEPNFIPQTQDN YGLYGHESGQLPPN D KF
Subjt:  MASSAKLIASLFLLTLLFIQIHARES-HFFSKVPNNG-----ETQTQIPNKVLDPLTNSEKTTTNPQDQEPNFIPQTQDNGYGLYGHESGQLPPNFDAKF

Query:  SDPSAGRPF---TTTYDNDNNYRTSNDALPAYNSESENQY------YNDNFQNSNSKKPYENSFYYNKDLYDNGRQSFQNTRLSRDDFTTTTPSYDDNNN
        S    GRPF   TTTYDN+N YR  NDA+P    ESE  Y      YN+ F+NS S KPYENSFYYNKDLYDNG+QSFQNTRLS+DD+  TTP YD    
Subjt:  SDPSAGRPF---TTTYDNDNNYRTSNDALPAYNSESENQY------YNDNFQNSNSKKPYENSFYYNKDLYDNGRQSFQNTRLSRDDFTTTTPSYDDNNN

Query:  YNFYFNNNNNG---GGDNANNVARQGMSDTRFMENGKYFYDLNREPHHYSMSRGSFGNNHYNYNNNGNTYEYGNSMGRY--QNDEAEFQEESDDFVP
         NFY NNN  G     DN NNV RQGMSDTRFMENGKY+YDLNREPHHYS SRG FGNN+   NNN NTYEYGNSMGRY  QNDEAEFQEE ++FVP
Subjt:  YNFYFNNNNNG---GGDNANNVARQGMSDTRFMENGKYFYDLNREPHHYSMSRGSFGNNHYNYNNNGNTYEYGNSMGRY--QNDEAEFQEESDDFVP

TrEMBL top hitse value%identityAlignment
A0A1S3B2M6 protein E68.6e-9069.16Show/hide
Query:  MASSAKLIASLFLLTLLFIQIHARES-HFFSKVPNNG----------ETQTQIPN-KVLDPLTNSEKTTTNPQDQEPNFIPQTQDNGYGLYGHESGQLPP
        MASS KL  S+ +LTLLFIQIHARES +FFSK+PNN             +TQIPN + +DPLTN EKTTT+PQDQ+PNFIPQTQDNGYGLYGHESGQLPP
Subjt:  MASSAKLIASLFLLTLLFIQIHARES-HFFSKVPNNG----------ETQTQIPN-KVLDPLTNSEKTTTNPQDQEPNFIPQTQDNGYGLYGHESGQLPP

Query:  NFDAK-FSDPSA--GRPF--TTTYDNDNNYRTSNDALPAYNSESENQY-YNDN--------FQNSNSKKPYENSFYYNKDLYDNGRQSFQNTRLSRDDFT
        N D+K FSD S   GR F  TTTYDN++N +  ND +  Y SESE  Y Y DN        F+NSNS KPYENSFYYNKDLYDNGRQSFQNTRLSRDD+ 
Subjt:  NFDAK-FSDPSA--GRPF--TTTYDNDNNYRTSNDALPAYNSESENQY-YNDN--------FQNSNSKKPYENSFYYNKDLYDNGRQSFQNTRLSRDDFT

Query:  TTTPSYDDNNNYNFYFNNNNNGGGD---NANNVARQGMSDTRFMENGKYFYDLNREPHHYSMSRGSFGNNHYNYNNNGNTYEYGNSMGRY--QNDEAEFQ
        TTTP YD     NFY  N+NNGGGD   N NN+ RQGMSDTRFMENGKY+YDL+REPHHYS SRG FGNN  N NNN N+YEYGNSMGRY  QNDEAEFQ
Subjt:  TTTPSYDDNNNYNFYFNNNNNGGGD---NANNVARQGMSDTRFMENGKYFYDLNREPHHYSMSRGSFGNNHYNYNNNGNTYEYGNSMGRY--QNDEAEFQ

Query:  EESDDFVP
        EE D+FVP
Subjt:  EESDDFVP

A0A5D3CMI3 Protein E66.6e-9069.38Show/hide
Query:  MASSAKLIASLFLLTLLFIQIHARES-HFFSKVPNNG---------ETQTQIPN-KVLDPLTNSEKTTTNPQDQEPNFIPQTQDNGYGLYGHESGQLPPN
        MASS KL  S+ +LTLLFIQIHARES +FFSK+PNN            +TQIPN + +DPLTN EKTTT+PQDQ+PNFIPQTQDNGYGLYGHESGQLPPN
Subjt:  MASSAKLIASLFLLTLLFIQIHARES-HFFSKVPNNG---------ETQTQIPN-KVLDPLTNSEKTTTNPQDQEPNFIPQTQDNGYGLYGHESGQLPPN

Query:  FDAK-FSDPSA--GRPF--TTTYDNDNNYRTSNDALPAYNSESENQY-YNDN--------FQNSNSKKPYENSFYYNKDLYDNGRQSFQNTRLSRDDFTT
         D+K FSD S   GR F  TTTYDN++N +  ND +  Y SESE  Y Y DN        F+NSNS KPYENSFYYNKDLYDNGRQSFQNTRLSRDD+ T
Subjt:  FDAK-FSDPSA--GRPF--TTTYDNDNNYRTSNDALPAYNSESENQY-YNDN--------FQNSNSKKPYENSFYYNKDLYDNGRQSFQNTRLSRDDFTT

Query:  TTPSYDDNNNYNFYFNNNNNGGGD---NANNVARQGMSDTRFMENGKYFYDLNREPHHYSMSRGSFGNNHYNYNNNGNTYEYGNSMGRY--QNDEAEFQE
        TTP YD     NFY  N+NNGGGD   N NN+ RQGMSDTRFMENGKY+YDL+REPHHYS SRG FGNN  N NNN N+YEYGNSMGRY  QNDEAEFQE
Subjt:  TTPSYDDNNNYNFYFNNNNNGGGD---NANNVARQGMSDTRFMENGKYFYDLNREPHHYSMSRGSFGNNHYNYNNNGNTYEYGNSMGRY--QNDEAEFQE

Query:  ESDDFVP
        E D+FVP
Subjt:  ESDDFVP

A0A6J1EM79 protein E63.5e-9171.81Show/hide
Query:  MASSAKLIASLFLLTLLFIQIHARESHFFSKVPNNGETQTQIPNKVLDPLTNSEKTTTNPQDQEPNFIPQTQDNGYGLYGHESGQLPPNFDAKFSDPSA-
        MASS KL+ SLFLL+L FIQIH RES+FFSKVPNN   ++QIPNKV DPLTNSEKTTT PQDQ+PNFIPQTQDNGYGLYGHESGQ  P+ DAKFSDP+A 
Subjt:  MASSAKLIASLFLLTLLFIQIHARESHFFSKVPNNGETQTQIPNKVLDPLTNSEKTTTNPQDQEPNFIPQTQDNGYGLYGHESGQLPPNFDAKFSDPSA-

Query:  --GRPF--TTTYDN-DNNYRTSNDALPAYNSESE------NQYYNDNFQNSNSKKPYENSFYYNKDLYDNGRQSFQNTRLSRDDF---TTTTPSY-----
          GRPF  TTTYD+ +NNY+ ++D   +Y SESE      N Y NDNFQNSN KKPYENSFYYNKDLYDN RQSFQNTRLSR+++   TTTTPSY     
Subjt:  --GRPF--TTTYDN-DNNYRTSNDALPAYNSESE------NQYYNDNFQNSNSKKPYENSFYYNKDLYDNGRQSFQNTRLSRDDF---TTTTPSY-----

Query:  -DDNNNYNFYFNNNNNGGGDNANNVARQGMSDTRFMENGKYFYDLNREPHHYSMSRGSFGNNHYNYNNNGNTYEYGNSMGRYQNDEAEFQEESDDFVP
         DDNNN NFY NN N    DN NNVARQGMSDTRFMENGKYFYDL+REP H S S      N++  NNN NTYEYGNSMGRYQNDE EFQEESD+FVP
Subjt:  -DDNNNYNFYFNNNNNGGGDNANNVARQGMSDTRFMENGKYFYDLNREPHHYSMSRGSFGNNHYNYNNNGNTYEYGNSMGRYQNDEAEFQEESDDFVP

A0A6J1IU58 protein E6-like2.6e-8668.86Show/hide
Query:  MASSAKLIASLFLLTLLFIQIHARESHFFSKVPNNGET----QTQIPNK--VLDPLTNSEKTTTNPQDQEPNFIPQTQDNGYGLYGHESGQLPPNFDAKF
        MASS KLI++L LL LLFIQ+HARESHFFSKVPNNG T    +TQIPNK    DPLTN +KT+  PQD +PNF+PQTQDN YGLYGHESGQLPPN D  F
Subjt:  MASSAKLIASLFLLTLLFIQIHARESHFFSKVPNNGET----QTQIPNK--VLDPLTNSEKTTTNPQDQEPNFIPQTQDNGYGLYGHESGQLPPNFDAKF

Query:  SDPSAGRPFTTTYDNDNNYRTSNDALPAYNSESENQYYNDNFQNSNSKKPYENSFYYNKDLYDNGRQSFQNTRLSRDDFTTT----TPSYDDNNNYNFYF
           S  RP      +DN YR  NDA+ +Y SESE  Y NDNFQN N+ KPYENSFYYNKDLYDNGRQSF+NTRLSR+D+TTT       Y D+++ NFY+
Subjt:  SDPSAGRPFTTTYDNDNNYRTSNDALPAYNSESENQYYNDNFQNSNSKKPYENSFYYNKDLYDNGRQSFQNTRLSRDDFTTT----TPSYDDNNNYNFYF

Query:  NNNNNGGGDNANNVARQGMSDTRFMENGKYFYDLNREPHHYSMSRGSFGNNHYNYNNNGNTYEYGNSMGRY--QNDEAEFQEESDDFVP
        N+NNN   +NANNV RQGMSDTRFMENGKY+YDL+REPHHYS SRG F  N+   NNNGNTY+YGNSMGRY  QNDEAEFQEE D+FVP
Subjt:  NNNNNGGGDNANNVARQGMSDTRFMENGKYFYDLNREPHHYSMSRGSFGNNHYNYNNNGNTYEYGNSMGRY--QNDEAEFQEESDDFVP

A0A6J1KXL8 protein E6-like1.0e-9071.57Show/hide
Query:  MASSAKLIASLFLLTLLFIQIHARESHFFSKVPNNGETQTQIPNKVLDPLTNSEKTTTNPQDQEPNFIPQTQDNGYGLYGHESGQLPPNFDAKFSDPSA-
        MASS KL+ SLFLL+L FIQIH RES+FFSKVPNN   +TQIPN V DPLTNSEK TT+PQDQ+PNFIPQTQDNGYGLYGHES Q  P+ D+KFSDP+A 
Subjt:  MASSAKLIASLFLLTLLFIQIHARESHFFSKVPNNGETQTQIPNKVLDPLTNSEKTTTNPQDQEPNFIPQTQDNGYGLYGHESGQLPPNFDAKFSDPSA-

Query:  --GRPFTTTY---DNDNNYRTSNDALPAYNSESE------NQYYNDNFQNSNSKKPYENSFYYNKDLYDNGRQSFQNTRLSRDDF--TTTTPSY------
          GRPFTTT    D +NNY+ ++D   +Y SESE      N Y NDNFQNSN KKPYENSFYYNKDLYDN RQSFQNT LSRD++  TTTTPSY      
Subjt:  --GRPFTTTY---DNDNNYRTSNDALPAYNSESE------NQYYNDNFQNSNSKKPYENSFYYNKDLYDNGRQSFQNTRLSRDDF--TTTTPSY------

Query:  -DDNNNYNFYFNNNNNGGGDNANNVARQGMSDTRFMENGKYFYDLNREPHHYSM-SRGSFGNNHYNYNNNGNTYEYGNSMGRYQNDEAEFQEESDDFVP
         DDNNN NFY NN N    DN NNVARQGMSDTRFMENGKYFYDLNREP H S  SR +F NN+Y   NN NTYEYGNSMGRYQNDE EFQEESD+FVP
Subjt:  -DDNNNYNFYFNNNNNGGGDNANNVARQGMSDTRFMENGKYFYDLNREPHHYSM-SRGSFGNNHYNYNNNGNTYEYGNSMGRYQNDEAEFQEESDDFVP

SwissProt top hitse value%identityAlignment
Q01197 Protein E61.1e-2537.81Show/hide
Query:  MASSAKL--IASLFLLTLLFIQIHARESHFFSKVPNNGETQTQIPNKVLDPLTNSEKTTTNPQDQEPNFIPQTQDNGYGLYGHESGQLPPNFDAKFSDPS
        MASS KL  ++ LFL  L  +QIHARE  +FSK P     + +   +     T   +TT  P++QEP FIP+TQ NGYGLYGHESG   P+F        
Subjt:  MASSAKL--IASLFLLTLLFIQIHARESHFFSKVPNNGETQTQIPNKVLDPLTNSEKTTTNPQDQEPNFIPQTQDNGYGLYGHESGQLPPNFDAKFSDPS

Query:  AGRPFTTTYDNDNNYRTSNDALPAYNSESENQYYNDNFQNSNSKKPYENSFYYNKDLYDNGRQSFQNTRLSRDDFTTTTPSYDDNNNYNFYFNNNNNGGG
              TT +    Y T     P       ++ YN   ++SN+K    +++YYNK+ Y++ +Q      L    FT    S  +N N N+Y  N NNG  
Subjt:  AGRPFTTTYDNDNNYRTSNDALPAYNSESENQYYNDNFQNSNSKKPYENSFYYNKDLYDNGRQSFQNTRLSRDDFTTTTPSYDDNNNYNFYFNNNNNGGG

Query:  DNANNVARQGMSDTRFMENGKYFYDLNRE----PHHYSMSRGSFGNNHYNYNNNGNTYEYGNSMGRYQNDEAEFQEESDDFVP
           NN  +QGMSDTR++ENGKY+YD+  E    P+ +  SRG    N +N N         N+MGRY  ++ EF+E  ++F P
Subjt:  DNANNVARQGMSDTRFMENGKYFYDLNRE----PHHYSMSRGSFGNNHYNYNNNGNTYEYGNSMGRYQNDEAEFQEESDDFVP

Arabidopsis top hitse value%identityAlignment
AT1G28400.1 unknown protein9.0e-1531.68Show/hide
Query:  LTLLFIQIHARESHFFSKVPNNGETQTQIPNKVLDPLTNSEKTTT-------NPQDQEPNFIPQTQDNGYGLYGHESGQLPPNFDAK-------------
        L LL  QIHAR+S+FF K  +    + Q PN  + PL  SEKTT          Q+Q+P F+P++  NGYGLYGHE+     N + +             
Subjt:  LTLLFIQIHARESHFFSKVPNNGETQTQIPNKVLDPLTNSEKTTT-------NPQDQEPNFIPQTQDNGYGLYGHESGQLPPNFDAK-------------

Query:  ---FSDPSAGRPFTTTYDNDNNY--RTSNDALPAYNSESENQYYNDNFQNSNSKKPYENSFY---YNKDLYDNGRQSFQNTRLSRDDFTTTTPSYDDNNN
           FS PS      +  + + NY  +T N     YN+E  N   N+N  ++N K+ + N+ Y   Y K+ ++N   +  N    + D      S+ +NN 
Subjt:  ---FSDPSAGRPFTTTYDNDNNY--RTSNDALPAYNSESENQYYNDNFQNSNSKKPYENSFY---YNKDLYDNGRQSFQNTRLSRDDFTTTTPSYDDNNN

Query:  YNFYFNNNNNGGGDN----------ANNVARQGMSDTRFMENGKYFYDLNREPHHYSMSRGSFGNNHYNY--------NNNGNTYEYGNSMGRYQNDEAE
         N     N+N  G            ++N+ RQGMSDTRFME G Y+YDL  + +H    R S   +   Y        N    +Y YGN+     N+E  
Subjt:  YNFYFNNNNNGGGDN----------ANNVARQGMSDTRFMENGKYFYDLNREPHHYSMSRGSFGNNHYNY--------NNNGNTYEYGNSMGRYQNDEAE

Query:  FQE
        F++
Subjt:  FQE

AT2G33850.1 unknown protein1.2e-1433.22Show/hide
Query:  MASSAKLIASLFLLTLLFI--QIHARESHFFSKVPNNGETQTQIPNKVLDPLTNSEKTTTNPQDQEPNFIPQTQDNGYGLYGHESG------------QL
        MA S       FLLTL+    QI AR S+ F K     + + Q PN ++   TN +K    P DQ P FIPQ+ +NGYGLYGHE+             + 
Subjt:  MASSAKLIASLFLLTLLFI--QIHARESHFFSKVPNNGETQTQIPNKVLDPLTNSEKTTTNPQDQEPNFIPQTQDNGYGLYGHESG------------QL

Query:  PPNFDAKFSDPSAGRPFTTTYDNDNNYRTSNDALPAYNSESENQYYNDNFQNSNSKKPYENSFYYNKDLYDNGRQSFQNTRLSRDDFTTTTPSYDDNNNY
          N+D  FS PS     + T     +Y+   ++ P        + Y++N   S     YENS  Y  D  DN          ++D     T  Y++ N Y
Subjt:  PPNFDAKFSDPSAGRPFTTTYDNDNNYRTSNDALPAYNSESENQYYNDNFQNSNSKKPYENSFYYNKDLYDNGRQSFQNTRLSRDDFTTTTPSYDDNNNY

Query:  NFYFNNNNNGGGDNANNVARQGMSDTRFMENGKYFYDLNREPHHYSMSRGSFGNNH-YNYNNNGNTYEYGNSMGRY---QNDEAEFQEESDDFVP
                       NNV RQGMSDTR+M NGKY+YDL+ + +H     G F  NH YNY   G   +  N    Y   Q  E   +E+ D   P
Subjt:  NFYFNNNNNGGGDNANNVARQGMSDTRFMENGKYFYDLNREPHHYSMSRGSFGNNH-YNYNNNGNTYEYGNSMGRY---QNDEAEFQEESDDFVP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTCCTCAGCCAAACTGATCGCTTCTCTTTTTCTTCTCACTCTTCTCTTCATTCAAATCCATGCCAGAGAAAGCCATTTCTTCAGCAAAGTACCCAACAATGGCGA
AACCCAAACCCAAATCCCTAACAAAGTACTCGACCCTTTAACAAACTCTGAAAAAACAACCACCAACCCACAAGACCAAGAGCCCAATTTCATCCCTCAAACCCAAGACA
ACGGCTACGGCCTCTACGGCCACGAATCTGGCCAGCTCCCTCCTAATTTCGACGCCAAATTCTCCGATCCCTCCGCCGGCCGACCATTTACCACCACCTACGACAACGAC
AACAACTACAGGACATCAAACGACGCCCTACCAGCTTACAATTCCGAGTCAGAGAATCAGTACTACAATGACAATTTCCAAAACAGCAACTCCAAAAAGCCGTACGAGAA
TTCCTTTTACTACAACAAGGATCTTTACGACAACGGACGACAAAGCTTCCAAAACACCCGCCTTTCCCGAGATGATTTCACAACAACAACTCCTTCATACGACGACAACA
ACAATTACAACTTCTACTTCAACAACAACAACAACGGCGGCGGCGACAATGCGAATAACGTGGCGCGACAGGGGATGAGCGACACGAGATTCATGGAGAACGGAAAGTAC
TTTTATGACCTCAACAGGGAGCCTCACCATTACAGCATGTCCAGGGGCAGTTTCGGAAACAACCACTACAATTACAACAACAACGGCAACACATATGAATACGGTAATTC
CATGGGAAGGTATCAGAATGATGAGGCCGAATTCCAAGAGGAATCAGACGACTTCGTCCCATAA
mRNA sequenceShow/hide mRNA sequence
CCTCTATAAAATAATGTCACCTATCCCAACTCTCCTTCACTATACACACACACACACACTTAAAACACACAATTTCTCTCTTATCTTCTTCTTCTTCTTCTTCTTCTTCA
TTTTAATCCATACCAAGACCAGAAAGAATCGGCCATTGTATTCATGGCTTCCTCAGCCAAACTGATCGCTTCTCTTTTTCTTCTCACTCTTCTCTTCATTCAAATCCATG
CCAGAGAAAGCCATTTCTTCAGCAAAGTACCCAACAATGGCGAAACCCAAACCCAAATCCCTAACAAAGTACTCGACCCTTTAACAAACTCTGAAAAAACAACCACCAAC
CCACAAGACCAAGAGCCCAATTTCATCCCTCAAACCCAAGACAACGGCTACGGCCTCTACGGCCACGAATCTGGCCAGCTCCCTCCTAATTTCGACGCCAAATTCTCCGA
TCCCTCCGCCGGCCGACCATTTACCACCACCTACGACAACGACAACAACTACAGGACATCAAACGACGCCCTACCAGCTTACAATTCCGAGTCAGAGAATCAGTACTACA
ATGACAATTTCCAAAACAGCAACTCCAAAAAGCCGTACGAGAATTCCTTTTACTACAACAAGGATCTTTACGACAACGGACGACAAAGCTTCCAAAACACCCGCCTTTCC
CGAGATGATTTCACAACAACAACTCCTTCATACGACGACAACAACAATTACAACTTCTACTTCAACAACAACAACAACGGCGGCGGCGACAATGCGAATAACGTGGCGCG
ACAGGGGATGAGCGACACGAGATTCATGGAGAACGGAAAGTACTTTTATGACCTCAACAGGGAGCCTCACCATTACAGCATGTCCAGGGGCAGTTTCGGAAACAACCACT
ACAATTACAACAACAACGGCAACACATATGAATACGGTAATTCCATGGGAAGGTATCAGAATGATGAGGCCGAATTCCAAGAGGAATCAGACGACTTCGTCCCATAAATT
TCTGTCCGCTACTAAATTATAAGAAAAATATATTTAGTTGTGCATGGGAATATGTGTTTTTTTTTTCTTTCTCCTCCTCCTAGTTTGCTTAATTGACTTCATATTTAATT
GTGCTTCTGTTTTTTTTTTTAAAAAAAGGAAAAACGTGAGTCCAGCAAAGATTAAATTATATATTTTGAAGGTGCGTTTATATATATAGATATAAATGTGGTAACTTGTT
CATCTTTTGTTTTTTGGGTTTGCAAGAGAAAGATATATATGATATAGAAATGTTTTGCTTTCCA
Protein sequenceShow/hide protein sequence
MASSAKLIASLFLLTLLFIQIHARESHFFSKVPNNGETQTQIPNKVLDPLTNSEKTTTNPQDQEPNFIPQTQDNGYGLYGHESGQLPPNFDAKFSDPSAGRPFTTTYDND
NNYRTSNDALPAYNSESENQYYNDNFQNSNSKKPYENSFYYNKDLYDNGRQSFQNTRLSRDDFTTTTPSYDDNNNYNFYFNNNNNGGGDNANNVARQGMSDTRFMENGKY
FYDLNREPHHYSMSRGSFGNNHYNYNNNGNTYEYGNSMGRYQNDEAEFQEESDDFVP