CuGenDBv2

Tan0001381 (gene) of Snake gourd v1 genome

Gene ID: Tan0001381
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Protein of unknown function (DUF506)
Genome location: LG07:14950641..14952819
RNA-Seq Expression: Tan0001381
Synteny: Tan0001381
Gene Ontology terms: NA
InterPro domains: IPR006502 - Protein of unknown function PDDEXK-like
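The genome location field packs a sequence name and two coordinates into one string. The following is a minimal sketch of how that string could be split and the locus length computed. The names `GenomeLocation` and `parse_location` are hypothetical, and the span calculation assumes 1-based, inclusive coordinates, which the page itself does not state.

```python
import re
from typing import NamedTuple


class GenomeLocation(NamedTuple):
    """Hypothetical container for a 'seqid:start..end' location string."""
    seqid: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_location(text: str) -> GenomeLocation:
    """Parse a location such as 'LG07:14950641..14952819'."""
    match = re.fullmatch(r"\s*([^:\s]+):(\d+)\.\.(\d+)\s*", text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end = match.groups()
    return GenomeLocation(seqid, int(start), int(end))


loc = parse_location("LG07:14950641..14952819")
print(loc.seqid, loc.start, loc.end, loc.span)
```

Under the inclusive-coordinate assumption, the locus on LG07 covers 2179 bases.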


Homology
GenBank top hits | e value | %identity | Alignment
KAG6605316.1 hypothetical protein SDJN03_02633, partial [Cucurbita argyrosperma subsp. sororia] | 1.3e-84 | 64.96
Query:  MGSLEEEKLVQMVDDFIESAETP---SSCSSNSLPLSSKSHYFFSLKEILG-SGRKAEREVGETVMKHLRRKRDAPKTTSLKKWLVMKLKMDGYDSSHLC
        MGS EEE+LVQM+DDFIES ETP   SS SSN LPL+S +HYFF+LK+ILG +GR AE EV E VMKH+RRK DAPKTT +KKWLVMKLKMDGY S  LC
Subjt:  MGSLEEEKLVQMVDDFIESAETP---SSCSSNSLPLSSKSHYFFSLKEILG-SGRKAEREVGETVMKHLRRKRDAPKTTSLKKWLVMKLKMDGYDSSHLC

Query:  RTSWVTSIGCPAGDYEYIEMKGEDEEGSIKRVIIDIDFKGQFEVARATEGYKQLTEALPSVFVGTEDKVVRIISILCSAAKQSLKDSGLHIPPWRTSTYM
         TSWVTS+GCP GDYEYIEMK E      KR+IIDIDFK QFEVARATE YKQLT+ALPSVFVG+E+KVV+IISILCSAAKQSLK+SGLHIPPWRTSTYM
Subjt:  RTSWVTSIGCPAGDYEYIEMKGEDEEGSIKRVIIDIDFKGQFEVARATEGYKQLTEALPSVFVGTEDKVVRIISILCSAAKQSLKDSGLHIPPWRTSTYM

Query:  QCKWLSPHQQQQINNNNIIIINFKKENEKESGHLINKLWNPPNPMPMLNQIRK-----SSALSTQFSSMSINCC
        Q KW++  QQ+                              P   P++  I K      SALSTQFS+MSINCC
Subjt:  QCKWLSPHQQQQINNNNIIIINFKKENEKESGHLINKLWNPPNPMPMLNQIRK-----SSALSTQFSSMSINCC

KAG7035272.1 hypothetical protein SDJN02_02067, partial [Cucurbita argyrosperma subsp. argyrosperma] | 9.6e-85 | 64.86
Query:  MGSLEEEKLVQMVDDFIESAETP-----SSCSSNSLPLSSKSHYFFSLKEILG-SGRKAEREVGETVMKHLRRKRDAPKTTSLKKWLVMKLKMDGYDSSH
        MGS EEE+LVQM+DDFIES ETP     SS SSN LPL+S +HYFF+LKEILG +GR AE EV E VMKH+RRK DAPKTT +KKWLVMKLKMDGY S  
Subjt:  MGSLEEEKLVQMVDDFIESAETP-----SSCSSNSLPLSSKSHYFFSLKEILG-SGRKAEREVGETVMKHLRRKRDAPKTTSLKKWLVMKLKMDGYDSSH

Query:  LCRTSWVTSIGCPAGDYEYIEMKGEDEEGSIKRVIIDIDFKGQFEVARATEGYKQLTEALPSVFVGTEDKVVRIISILCSAAKQSLKDSGLHIPPWRTST
        LC TSWVTS+GCP GDYEYIEMK E      KR+IIDIDFK QFEVARATE YKQLT+ALPSVFVG+E+KVV+IISILCSAAKQSLK+SGLHIPPWRTST
Subjt:  LCRTSWVTSIGCPAGDYEYIEMKGEDEEGSIKRVIIDIDFKGQFEVARATEGYKQLTEALPSVFVGTEDKVVRIISILCSAAKQSLKDSGLHIPPWRTST

Query:  YMQCKWLSPHQQQQINNNNIIIINFKKENEKESGHLINKLWNPPNPMPMLNQIRK-----SSALSTQFSSMSINCC
        YMQ KW++  QQ+                              P   P++  I K      SALSTQFS+MSINCC
Subjt:  YMQCKWLSPHQQQQINNNNIIIINFKKENEKESGHLINKLWNPPNPMPMLNQIRK-----SSALSTQFSSMSINCC

XP_022143594.1 uncharacterized protein LOC111013453 [Momordica charantia] | 1.4e-88 | 66.79
Query:  MGSLEEEKLVQMVDDFIESAETPSSCSS--NSLPLSSKSHYFFSLKEILGSGRKAEREVGETVMKHLRRKRDAPKTTSLKKWLVMKLKMDGYDSSHLCRT
        MGSLEEEKLVQMVDDFIES ETP+SCSS  +S  L+SK+H+ FSLKEILGSGR+AE EV E V KHLR+K ++PKTTSLKKWLVMKL+MDGYDS+ LC T
Subjt:  MGSLEEEKLVQMVDDFIESAETPSSCSS--NSLPLSSKSHYFFSLKEILGSGRKAEREVGETVMKHLRRKRDAPKTTSLKKWLVMKLKMDGYDSSHLCRT

Query:  SWVTSIGCPAGDYEYIEMKGEDEEGSIKRVIIDIDFKGQFEVARATEGYKQLTEALPSVFVGTEDKVVRIISILCSAAKQSLKDSGLHIPPWRTSTYMQC
        SWVTSIGCPAG+YEYIE K EDE G  KR+IIDI+FK QFEVAR T  YKQLTEALP+VFVGTE+ V RII+ILCSAAKQSL++SGLHIPPWRTSTYMQ 
Subjt:  SWVTSIGCPAGDYEYIEMKGEDEEGSIKRVIIDIDFKGQFEVARATEGYKQLTEALPSVFVGTEDKVVRIISILCSAAKQSLKDSGLHIPPWRTSTYMQC

Query:  KWLSPHQQQQINNNNIIIINFKKENEKESGHLINKLWNPPNPMPMLNQIRK----SSALSTQFSSMSINCC
        K+     +               E ++E G   N  W P    PM+ Q R+     SALSTQFS+MSINCC
Subjt:  KWLSPHQQQQINNNNIIIINFKKENEKESGHLINKLWNPPNPMPMLNQIRK----SSALSTQFSSMSINCC

XP_023007236.1 uncharacterized protein LOC111499781 [Cucurbita maxima] | 1.3e-84 | 65.22
Query:  MGSLEEEKLVQMVDDFIESAETP-----SSCSSNSLPLSSKSHYFFSLKEILG-SGRKAEREVGETVMKHLRRKRDAPKTTSLKKWLVMKLKMDGYDSSH
        MGSLEEE+LVQM+DDF+ES E P     SS SSN LPL+S +HYFF+LKEILG SGR AE EV E VMKH+RRK DAPKTT LKKWLVMKLKMDGY S  
Subjt:  MGSLEEEKLVQMVDDFIESAETP-----SSCSSNSLPLSSKSHYFFSLKEILG-SGRKAEREVGETVMKHLRRKRDAPKTTSLKKWLVMKLKMDGYDSSH

Query:  LCRTSWVTSIGCPAGDYEYIEMKGEDEEGSIKRVIIDIDFKGQFEVARATEGYKQLTEALPSVFVGTEDKVVRIISILCSAAKQSLKDSGLHIPPWRTST
        LC TSWVTS+GCP GDYEYIEMK E      KR+IIDIDFK QFEVARATE YKQLT+ALPSVFVG+E+KVV+IISILCSAAKQSLK+SGLHIPPWRTST
Subjt:  LCRTSWVTSIGCPAGDYEYIEMKGEDEEGSIKRVIIDIDFKGQFEVARATEGYKQLTEALPSVFVGTEDKVVRIISILCSAAKQSLKDSGLHIPPWRTST

Query:  YMQCKWLSPHQQQQINNNNIIIINFKKENEKESGHLINKLWNPPNPMPMLNQIRK-----SSALSTQFSSMSINCC
        YMQ KW++  QQ+                              P   P++  I K      SALSTQFS+MSINCC
Subjt:  YMQCKWLSPHQQQQINNNNIIIINFKKENEKESGHLINKLWNPPNPMPMLNQIRK-----SSALSTQFSSMSINCC

XP_038900827.1 uncharacterized protein LOC120087891 [Benincasa hispida] | 9.9e-98 | 73.26
Query:  MGSLEEEKLVQMVDDFIESAE-TPSSC----SSNSLPLSSKSHYFFSLKEILGSGRKAEREVGETVMKHLRR-KRDAPKTT-SLKKWLVMKLKMDGYDSS
        M SLEEEKLVQMVDDFIESA+ TPSSC    SSNSLPL+SKSHYF SLKEILGSG KAE EVGETVMKHLR  K D+PKTT SLKKWLVMKLKMDGYDSS
Subjt:  MGSLEEEKLVQMVDDFIESAE-TPSSC----SSNSLPLSSKSHYFFSLKEILGSGRKAEREVGETVMKHLRR-KRDAPKTT-SLKKWLVMKLKMDGYDSS

Query:  HLCRTSWVTSIGCPAGDYEYIEMKGEDEEGSIKRVIIDIDFKGQFEVARATEGYKQLTEALPSVFVGTEDKVVRIISILCSAAKQSLKDSGLHIPPWRTS
         LC TSWVTS+GCPAGDYEYIEMK +DE GS KRVIIDI+FK QFEVARATE YKQLTEALP+VFVG+E++V RIIS+LCSAAKQSLK+SGLHIPPWRTS
Subjt:  HLCRTSWVTSIGCPAGDYEYIEMKGEDEEGSIKRVIIDIDFKGQFEVARATEGYKQLTEALPSVFVGTEDKVVRIISILCSAAKQSLKDSGLHIPPWRTS

Query:  TYMQCKWLSPHQQQQINNNNIIIINFKKENEKESGHLINKLWNPPNPMPMLNQI-RKSSALSTQFSSMSINCC
        TYM CKWL  H+   I+NNN          E  +  + NK W PP   P+  ++   +SALSTQFS+MSINCC
Subjt:  TYMQCKWLSPHQQQQINNNNIIIINFKKENEKESGHLINKLWNPPNPMPMLNQI-RKSSALSTQFSSMSINCC

TrEMBL top hits | e value | %identity | Alignment
A0A0A0LJS3 Uncharacterized protein | 5.9e-80 | 65.08
Query:  MGSLEEEKLVQMVDDFIESA----ETPSSCSSNSLPLSSKSHYFFSLKEILGSGRKAEREVGETVMKHLRRKR---DAPKTTSLKKWLVMKLKMDGYDSS
        M SLEEEKLVQMVDDFIES     ++P+S S   L  +SKSH+FF+LKEILG+G K E EVGE+VMKHLRR +    + KT SL+KWLVMKLKMDGYDSS
Subjt:  MGSLEEEKLVQMVDDFIESA----ETPSSCSSNSLPLSSKSHYFFSLKEILGSGRKAEREVGETVMKHLRRKR---DAPKTTSLKKWLVMKLKMDGYDSS

Query:  HLCRTSWVTSIGCPAGDYEYIEMKGEDEE-GSIKRVIIDIDFKGQFEVARATEGYKQLTEALPSVFVGTEDKVVRIISILCSAAKQSLKDSGLHIPPWRT
        HLC TSWVTS+GCPAGDYEYIEM+ +D+E GS KR+IIDI+FK QFEVARATE YKQLT+ALP+VFVG+E+KV RIIS+LCSAAKQSL+ SGLHIPPWRT
Subjt:  HLCRTSWVTSIGCPAGDYEYIEMKGEDEE-GSIKRVIIDIDFKGQFEVARATEGYKQLTEALPSVFVGTEDKVVRIISILCSAAKQSLKDSGLHIPPWRT

Query:  STYMQCKWLSPHQQQQINNNN------IIIINFKKENEKESGHLINKLWNPP
        STYM  KWL  H     NN++       I IN  + N   + +     W PP
Subjt:  STYMQCKWLSPHQQQQINNNN------IIIINFKKENEKESGHLINKLWNPP

A0A1S3CDH7 uncharacterized protein LOC103499645 | 2.8e-74 | 63.67
Query:  MGSLEEEKLVQMVDDFIESAE---TPSSCSSNSLPLSSKSHYFFSLKEILGSGRKAEREVGETVMKHLRRKR-----DAPKTTSLKKWLVMKLKMDGYDS
        MGSLEEEKL QMVDDFIES +     S  SS  L  +S SHY F+LKEILG+G K E EVGE+VMKHLRR +     ++ KT SL+KWLVMKLKM+GYDS
Subjt:  MGSLEEEKLVQMVDDFIESAE---TPSSCSSNSLPLSSKSHYFFSLKEILGSGRKAEREVGETVMKHLRRKR-----DAPKTTSLKKWLVMKLKMDGYDS

Query:  SHLCRTSWVTSIGCPAGDYEYIEMKGEDEEGSIKRVIIDIDFKGQFEVARATEGYKQLTEALPSVFVGTEDKVVRIISILCSAAKQSLKDSGLHIPPWRT
        SHL  TSWVTS+GCPAGDYEYIEM+ + E     R+IIDI+FK QFEVARATE YKQLT+ALPSVFVG+E+KV RIIS+LCSAAKQSLK SGLHIPPWRT
Subjt:  SHLCRTSWVTSIGCPAGDYEYIEMKGEDEEGSIKRVIIDIDFKGQFEVARATEGYKQLTEALPSVFVGTEDKVVRIISILCSAAKQSLKDSGLHIPPWRT

Query:  STYMQCKWL-------SPHQQQQINNNNI-IIINFKKENEKESGHLINKL--WNPP
        STYM  KWL       S H    I  N+I I IN    N   + +  NK   W PP
Subjt:  STYMQCKWL-------SPHQQQQINNNNI-IIINFKKENEKESGHLINKL--WNPP

A0A6J1CPS4 uncharacterized protein LOC111013453 | 7.0e-89 | 66.79
Query:  MGSLEEEKLVQMVDDFIESAETPSSCSS--NSLPLSSKSHYFFSLKEILGSGRKAEREVGETVMKHLRRKRDAPKTTSLKKWLVMKLKMDGYDSSHLCRT
        MGSLEEEKLVQMVDDFIES ETP+SCSS  +S  L+SK+H+ FSLKEILGSGR+AE EV E V KHLR+K ++PKTTSLKKWLVMKL+MDGYDS+ LC T
Subjt:  MGSLEEEKLVQMVDDFIESAETPSSCSS--NSLPLSSKSHYFFSLKEILGSGRKAEREVGETVMKHLRRKRDAPKTTSLKKWLVMKLKMDGYDSSHLCRT

Query:  SWVTSIGCPAGDYEYIEMKGEDEEGSIKRVIIDIDFKGQFEVARATEGYKQLTEALPSVFVGTEDKVVRIISILCSAAKQSLKDSGLHIPPWRTSTYMQC
        SWVTSIGCPAG+YEYIE K EDE G  KR+IIDI+FK QFEVAR T  YKQLTEALP+VFVGTE+ V RII+ILCSAAKQSL++SGLHIPPWRTSTYMQ 
Subjt:  SWVTSIGCPAGDYEYIEMKGEDEEGSIKRVIIDIDFKGQFEVARATEGYKQLTEALPSVFVGTEDKVVRIISILCSAAKQSLKDSGLHIPPWRTSTYMQC

Query:  KWLSPHQQQQINNNNIIIINFKKENEKESGHLINKLWNPPNPMPMLNQIRK----SSALSTQFSSMSINCC
        K+     +               E ++E G   N  W P    PM+ Q R+     SALSTQFS+MSINCC
Subjt:  KWLSPHQQQQINNNNIIIINFKKENEKESGHLINKLWNPPNPMPMLNQIRK----SSALSTQFSSMSINCC

A0A6J1G7H8 uncharacterized protein LOC111451460 isoform X1 | 3.0e-84 | 64.96
Query:  MGSLEEEKLVQMVDDFIESAETP---SSCSSNSLPLSSKSHYFFSLKEILG-SGRKAEREVGETVMKHLRRKRDAPKTTSLKKWLVMKLKMDGYDSSHLC
        MGS EEE+LVQM+DDFIES ETP   SS SS+ LPL+S +HYFF+LKEILG +GR AE EV E VMKH+RRK DAPKTT LKKWLVMKLKMDGY S  LC
Subjt:  MGSLEEEKLVQMVDDFIESAETP---SSCSSNSLPLSSKSHYFFSLKEILG-SGRKAEREVGETVMKHLRRKRDAPKTTSLKKWLVMKLKMDGYDSSHLC

Query:  RTSWVTSIGCPAGDYEYIEMKGEDEEGSIKRVIIDIDFKGQFEVARATEGYKQLTEALPSVFVGTEDKVVRIISILCSAAKQSLKDSGLHIPPWRTSTYM
         +SWVTS+GCP GDYEYIEMK E      KR+IIDIDFK QFEVARATE YKQLT+ALPSVFVG+E+KVV+IISILCSAAKQSLK+SGLHIPPWRTSTYM
Subjt:  RTSWVTSIGCPAGDYEYIEMKGEDEEGSIKRVIIDIDFKGQFEVARATEGYKQLTEALPSVFVGTEDKVVRIISILCSAAKQSLKDSGLHIPPWRTSTYM

Query:  QCKWLSPHQQQQINNNNIIIINFKKENEKESGHLINKLWNPPNPMPMLNQIRK-----SSALSTQFSSMSINCC
        Q KW++  QQ+                              P   P++  I K      SALSTQFS+MSINCC
Subjt:  QCKWLSPHQQQQINNNNIIIINFKKENEKESGHLINKLWNPPNPMPMLNQIRK-----SSALSTQFSSMSINCC

A0A6J1KZZ7 uncharacterized protein LOC111499781 | 6.1e-85 | 65.22
Query:  MGSLEEEKLVQMVDDFIESAETP-----SSCSSNSLPLSSKSHYFFSLKEILG-SGRKAEREVGETVMKHLRRKRDAPKTTSLKKWLVMKLKMDGYDSSH
        MGSLEEE+LVQM+DDF+ES E P     SS SSN LPL+S +HYFF+LKEILG SGR AE EV E VMKH+RRK DAPKTT LKKWLVMKLKMDGY S  
Subjt:  MGSLEEEKLVQMVDDFIESAETP-----SSCSSNSLPLSSKSHYFFSLKEILG-SGRKAEREVGETVMKHLRRKRDAPKTTSLKKWLVMKLKMDGYDSSH

Query:  LCRTSWVTSIGCPAGDYEYIEMKGEDEEGSIKRVIIDIDFKGQFEVARATEGYKQLTEALPSVFVGTEDKVVRIISILCSAAKQSLKDSGLHIPPWRTST
        LC TSWVTS+GCP GDYEYIEMK E      KR+IIDIDFK QFEVARATE YKQLT+ALPSVFVG+E+KVV+IISILCSAAKQSLK+SGLHIPPWRTST
Subjt:  LCRTSWVTSIGCPAGDYEYIEMKGEDEEGSIKRVIIDIDFKGQFEVARATEGYKQLTEALPSVFVGTEDKVVRIISILCSAAKQSLKDSGLHIPPWRTST

Query:  YMQCKWLSPHQQQQINNNNIIIINFKKENEKESGHLINKLWNPPNPMPMLNQIRK-----SSALSTQFSSMSINCC
        YMQ KW++  QQ+                              P   P++  I K      SALSTQFS+MSINCC
Subjt:  YMQCKWLSPHQQQQINNNNIIIINFKKENEKESGHLINKLWNPPNPMPMLNQIRK-----SSALSTQFSSMSINCC

SwissProt top hits | e value | %identity | Alignment
No hits found

Arabidopsis top hits | e value | %identity | Alignment
AT1G77145.1 Protein of unknown function (DUF506) | 1.7e-34 | 41.33
Query:  MGSLEEEKLVQMVDDFIESAETPSSCSSNSLPLSSKSHYFFSLKEILGSGRKAEREVGETVMKHLRRKR-----DAPKTTSLKKWLVMKLKMDGYDSSHL
        MGSL EE L ++V D+IES  T S        + S +    +LKEIL +  + E+E+ E +   +   R     D  K   + K +V KL+ +GYD+S L
Subjt:  MGSLEEEKLVQMVDDFIESAETPSSCSSNSLPLSSKSHYFFSLKEILGSGRKAEREVGETVMKHLRRKR-----DAPKTTSLKKWLVMKLKMDGYDSSHL

Query:  CRTSWVTSI----GCP----AGDYEYIE--MKGE---DEEGSIKRVIIDIDFKGQFEVARATEGYKQLTEALPSVFVGTEDKVVRIISILCSAAKQSLKD
         +TSW +S     GC     +  YEYI+  +KG+   D    +KRVIID+DFK QFE+AR TE YK +TE LP VFV TE ++ R++S++C   K+S+K 
Subjt:  CRTSWVTSI----GCP----AGDYEYIE--MKGE---DEEGSIKRVIIDIDFKGQFEVARATEGYKQLTEALPSVFVGTEDKVVRIISILCSAAKQSLKD

Query:  SGLHIPPWRTSTYMQCKWLSPHQQQ
         G+  PPWRT+ YMQ KWL  ++++
Subjt:  SGLHIPPWRTSTYMQCKWLSPHQQQ

AT1G77160.1 Protein of unknown function (DUF506) | 4.1e-33 | 41.82
Query:  MGSLEEEKLVQMVDDFIESAETPSS-CSSNSLPLSSKSHYFFSLKEILGSGRKAEREVGETVMKHLRRKR-----DAPKTTSLKKWLVMKLKMDGYDSSH
        MGSL EE   ++V  +IES  T S   +++S  L +    F  L+EIL +    E+E+ E +  ++ R R     D  K   + K +V KL+ +GY++S 
Subjt:  MGSLEEEKLVQMVDDFIESAETPSS-CSSNSLPLSSKSHYFFSLKEILGSGRKAEREVGETVMKHLRRKR-----DAPKTTSLKKWLVMKLKMDGYDSSH

Query:  LCRTSWVTSI----GCP----AGDYEYIE---MKGEDEEG--SIKRVIIDIDFKGQFEVARATEGYKQLTEALPSVFVGTEDKVVRIISILCSAAKQSLK
        L +TSW +S     GC     +  YEYI+   +   D +G   +KRVIID+DFK QFE+AR TE YK +TE LP+VFV TE ++ R++S++C   K+S+K
Subjt:  LCRTSWVTSI----GCP----AGDYEYIE---MKGEDEEG--SIKRVIIDIDFKGQFEVARATEGYKQLTEALPSVFVGTEDKVVRIISILCSAAKQSLK

Query:  DSGLHIPPWRTSTYMQCKWL
          G+  PPWRTS YMQ KWL
Subjt:  DSGLHIPPWRTSTYMQCKWL

AT2G38820.1 Protein of unknown function (DUF506) | 2.0e-32 | 48.87
Query:  VMKLKMDGYDSSHLCRTSWVTSIGCPAGDYEYIE--MKGEDEEGSIKRVIIDIDFKGQFEVARATEGYKQLTEALPSVFVGTEDKVVRIISILCSAAKQS
        V K+    YD++ LC++ W  S  CPAG+YEY++  MKGE       R++IDIDFK +FE+ARAT+ YK + + LP +FVG  D++ +II ++C AAKQS
Subjt:  VMKLKMDGYDSSHLCRTSWVTSIGCPAGDYEYIE--MKGEDEEGSIKRVIIDIDFKGQFEVARATEGYKQLTEALPSVFVGTEDKVVRIISILCSAAKQS

Query:  LKDSGLHIPPWRTSTYMQCKWLSPHQQQQINNN
        LK  GLH+PPWR + Y++ KWLS H +   N+N
Subjt:  LKDSGLHIPPWRTSTYMQCKWLSPHQQQQINNN

AT2G38820.2 Protein of unknown function (DUF506) | 2.0e-32 | 50.79
Query:  GYDSSHLCRTSWVTSIGCPAGDYEYIE--MKGEDEEGSIKRVIIDIDFKGQFEVARATEGYKQLTEALPSVFVGTEDKVVRIISILCSAAKQSLKDSGLH
        GYD++ LC++ W  S  CPAG+YEY++  MKGE       R++IDIDFK +FE+ARAT+ YK + + LP +FVG  D++ +II ++C AAKQSLK  GLH
Subjt:  GYDSSHLCRTSWVTSIGCPAGDYEYIE--MKGEDEEGSIKRVIIDIDFKGQFEVARATEGYKQLTEALPSVFVGTEDKVVRIISILCSAAKQSLKDSGLH

Query:  IPPWRTSTYMQCKWLSPHQQQQINNN
        +PPWR + Y++ KWLS H +   N+N
Subjt:  IPPWRTSTYMQCKWLSPHQQQQINNN

AT4G14620.1 Protein of unknown function (DUF506) | 2.7e-32 | 35.19
Query:  GSLEEEKLVQMVDDFIESAETPS----------SCSSNSLPLSSKSHYFF---SLKEILGSGRKAEREVGETVMKHLRRKRDAPKTTSLKKWLVMKLKMD
        G+  E  L +MV +++E                +C + +  +S     FF   + K ++  G   E+ +     K + + +   +   L+K +V +L   
Subjt:  GSLEEEKLVQMVDDFIESAETPS----------SCSSNSLPLSSKSHYFF---SLKEILGSGRKAEREVGETVMKHLRRKRDAPKTTSLKKWLVMKLKMD

Query:  GYDSSHLCRTSWVTSIGCPAGDYEYIEMKGEDEEGSIKRVIIDIDFKGQFEVARATEGYKQLTEALPSVFVGTEDKVVRIISILCSAAKQSLKDSGLHIP
        GYDSS +C++ W  +   PAG+YEYI++    E     R+IIDIDF+ +FE+AR T GYK+L ++LP +FVG  D++ +I+SI+  A+KQSLK  G+H P
Subjt:  GYDSSHLCRTSWVTSIGCPAGDYEYIEMKGEDEEGSIKRVIIDIDFKGQFEVARATEGYKQLTEALPSVFVGTEDKVVRIISILCSAAKQSLKDSGLHIP

Query:  PWRTSTYMQCKWLSPH
        PWR + YM+ KWLS +
Subjt:  PWRTSTYMQCKWLSPH
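Each hit above reports a %identity value alongside alignment blocks whose middle line marks matching positions. As a rough illustration of how such a figure relates to the alignment text, the sketch below counts identical positions from a query line and its midline. It assumes the common BLAST-style convention (a letter for an identical residue, `+` for a conservative substitution, a space for a mismatch or gap); the page does not say how its values were computed, so a per-block result need not reproduce the reported numbers. The function name `percent_identity` is hypothetical.

```python
def percent_identity(query: str, midline: str) -> float:
    """Percent of aligned columns marked identical in a BLAST-style midline.

    Assumption: letters in the midline mark identities, while '+' and
    spaces mark non-identical columns. The aligned length is taken from
    the query line, because trailing spaces in the midline are often lost.
    """
    aligned_length = len(query)
    if aligned_length == 0:
        raise ValueError("empty alignment")
    identities = sum(1 for ch in midline[:aligned_length] if ch.isalpha())
    return round(100.0 * identities / aligned_length, 2)


# Toy example (not taken from the hits above): 6 identical columns out of 8.
print(percent_identity("MGSLEEEK", "MGS EEE+"))  # 75.0
```

To apply this to a full hit, the query and midline segments of all its blocks would first be concatenated, with the `Query:` prefix and leading padding stripped consistently from both lines.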


Sequences
CDS sequence
ATGGGAAGTTTAGAGGAGGAAAAGTTAGTCCAAATGGTTGATGATTTTATAGAATCAGCTGAAACACCAAGCTCTTGTTCTTCAAATTCTCTTCCTCTGAGTTCCAAAAG
CCATTACTTTTTCAGTTTAAAGGAGATTCTTGGGAGTGGAAGAAAAGCAGAAAGAGAGGTGGGTGAGACTGTGATGAAGCACCTGAGAAGGAAAAGAGATGCTCCCAAAA
CCACTAGCCTTAAGAAATGGCTTGTGATGAAGCTCAAAATGGACGGCTATGATTCTTCTCATCTTTGTCGCACCTCTTGGGTCACTTCCATCGGCTGCCCAGCAGGGGAT
TATGAATACATAGAGATGAAAGGGGAGGATGAGGAAGGAAGCATAAAGAGAGTGATAATAGACATAGACTTCAAGGGTCAATTTGAAGTAGCAAGGGCAACAGAAGGGTA
CAAGCAGCTCACAGAAGCACTTCCATCAGTGTTTGTTGGGACTGAAGACAAGGTTGTGAGAATAATCTCTATTTTATGTTCAGCAGCCAAACAGTCCCTTAAGGACAGTG
GACTCCACATTCCCCCTTGGAGAACTTCCACTTACATGCAGTGCAAATGGCTGTCTCCTCATCAACAACAACAAATTAATAATAATAATATTATTATTATTAATTTCAAG
AAAGAAAATGAAAAAGAAAGTGGTCATTTAATTAACAAATTATGGAACCCTCCCAATCCCATGCCCATGCTCAACCAAATTAGGAAGAGCTCTGCCTTGTCCACTCAATT
TTCTAGCATGAGTATTAATTGTTGTTGA
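The CDS above can be checked against the listed protein sequence by translating it codon by codon. The following is a minimal sketch that assumes the standard genetic code (NCBI translation table 1), which is the usual choice for nuclear plant genes but is not stated on the page. The helper name `translate` is hypothetical.

```python
from itertools import product

# Standard genetic code (NCBI table 1), with codons ordered TTT, TTC, TTA, TTG, TCT, ...
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and other whitespace
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First eight codons of the CDS above.
print(translate("ATGGGAAGTTTAGAGGAGGAAAAG"))  # MGSLEEEK
```

Passing the whole CDS (line breaks included) to `translate` yields a protein string that can be compared with the Protein sequence entry.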
mRNA sequence
CCAATGTCCATTACCTCAACTTTCTATCTCTCTCTTCCTCTCTATATAACACCTGTTCTTTTCACAGCCACCGCCATAATCATCCCATTCCTCTCTCTAAAAACTTATTA
AAACCATTCTCTTTCCTTAGCTTCTTTACACTTTACTAAGTTCTTGGCACCCTAGATAGAGAAAGTTGAGGTCAATTTTGGAGTTATCCAATTAGGAAAAGAGAAAACAA
AAAAAATGGGAAGTTTAGAGGAGGAAAAGTTAGTCCAAATGGTTGATGATTTTATAGAATCAGCTGAAACACCAAGCTCTTGTTCTTCAAATTCTCTTCCTCTGAGTTCC
AAAAGCCATTACTTTTTCAGTTTAAAGGAGATTCTTGGGAGTGGAAGAAAAGCAGAAAGAGAGGTGGGTGAGACTGTGATGAAGCACCTGAGAAGGAAAAGAGATGCTCC
CAAAACCACTAGCCTTAAGAAATGGCTTGTGATGAAGCTCAAAATGGACGGCTATGATTCTTCTCATCTTTGTCGCACCTCTTGGGTCACTTCCATCGGCTGCCCAGCAG
GGGATTATGAATACATAGAGATGAAAGGGGAGGATGAGGAAGGAAGCATAAAGAGAGTGATAATAGACATAGACTTCAAGGGTCAATTTGAAGTAGCAAGGGCAACAGAA
GGGTACAAGCAGCTCACAGAAGCACTTCCATCAGTGTTTGTTGGGACTGAAGACAAGGTTGTGAGAATAATCTCTATTTTATGTTCAGCAGCCAAACAGTCCCTTAAGGA
CAGTGGACTCCACATTCCCCCTTGGAGAACTTCCACTTACATGCAGTGCAAATGGCTGTCTCCTCATCAACAACAACAAATTAATAATAATAATATTATTATTATTAATT
TCAAGAAAGAAAATGAAAAAGAAAGTGGTCATTTAATTAACAAATTATGGAACCCTCCCAATCCCATGCCCATGCTCAACCAAATTAGGAAGAGCTCTGCCTTGTCCACT
CAATTTTCTAGCATGAGTATTAATTGTTGTTGATCTCTTTTATCCCTTTTTTTAAAAAAAAAAATCAATTTTGATCCAATCTTCATTCCAATAATCCCTCTCTCTCTCTC
TCTCTCTCCTCAAATAATTATGGCTTGAATTAGCTTTTGAGGTTTCAATTCTCTCCATTAAGAGTTGGTTTAATCTGTTGGCCAACCAAGAGTAGATCATGATTTGTTGA
AGATTGTAAGATTCCTTGTAATAAATGGAGAAGGAAGGTTCATGAAGGCAAAATATTCTTTCTTCTCTATAGTTGGTA
Protein sequence
MGSLEEEKLVQMVDDFIESAETPSSCSSNSLPLSSKSHYFFSLKEILGSGRKAEREVGETVMKHLRRKRDAPKTTSLKKWLVMKLKMDGYDSSHLCRTSWVTSIGCPAGD
YEYIEMKGEDEEGSIKRVIIDIDFKGQFEVARATEGYKQLTEALPSVFVGTEDKVVRIISILCSAAKQSLKDSGLHIPPWRTSTYMQCKWLSPHQQQQINNNNIIIINFK
KENEKESGHLINKLWNPPNPMPMLNQIRKSSALSTQFSSMSINCC