CuGenDBv2

MS016915 (gene) of Bitter gourd (TR) v1 genome

Gene ID: MS016915
Organism: Momordica charantia cv. TR (Bitter gourd (TR) v1)
Description: UPF0496 protein At1g20180
Genome location: scaffold9_1:1038861..1040997
RNA-Seq Expression: MS016915
Synteny: MS016915
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: IPR007749 - Protein of unknown function DUF677
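The genome location above can be split into its scaffold name and coordinates with a few lines of Python. This is a minimal sketch, not part of the database: the function names `parse_location` and `span_length` are hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re


def parse_location(text):
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"(.+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start, end):
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end = parse_location("scaffold9_1:1038861..1040997")
print(seqid, start, end, span_length(start, end))
```

Under that assumption the gene spans 2137 bp on scaffold9_1.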


Homology
GenBank top hits | e value | % identity | Alignment
XP_008445326.1 PREDICTED: UPF0496 protein At1g20180-like [Cucumis melo] | 2.9e-94 | 61.22
Query:  EKNKVRNNGLWRKSNVKEEYEAAFRTNSYIEITTPTDDTSSA------PCRLQPDQDLILHDMSFQSSHI-DHHLLVHYLEASFQAFETCQLLLQALHQT
        +KNK   N L  K N+ EEY+AAFRTNSY+E T  T++TSS+       C LQPDQD+ LH+++    H+ +H LL+ Y +ASF+AFETC+LLLQAL+QT
Subjt:  EKNKVRNNGLWRKSNVKEEYEAAFRTNSYIEITTPTDDTSSA------PCRLQPDQDLILHDMSFQSSHI-DHHLLVHYLEASFQAFETCQLLLQALHQT

Query:  KISHAIAKRSVVNYLTIRASTADDDDNGNNGDGVVYGE------LAYFSHLENP-FSIASH--AHFLALHDTHLELLHDLTHKRNQARRKLRLKKIWKKL
        KI+H I   + +  LT  A  A+DDDNGN+  G+VYGE      L+ FSHL++P FSI +     FLALH++H ELL  L  KRN+ R KLRLK++ K+ 
Subjt:  KISHAIAKRSVVNYLTIRASTADDDDNGNNGDGVVYGE------LAYFSHLENP-FSIASH--AHFLALHDTHLELLHDLTHKRNQARRKLRLKKIWKKL

Query:  GRACFVISNAAVLGALLVLALHSLIGVVAAPG-LISCFVSLPKK--KRDGTL-VSSELVLRQMEIVMRATYITMNDLDTLSRMAARLEGEMEHLRAIGEM
         + C +ISNAAVL ALL+LALHSL+G+VAAPG LI+CFV L KK  KRD  L  + E  L+QMEI  R TYITMNDLDTLSRMAARL+ E+EHLRA+GEM
Subjt:  GRACFVISNAAVLGALLVLALHSLIGVVAAPG-LISCFVSLPKK--KRDGTL-VSSELVLRQMEIVMRATYITMNDLDTLSRMAARLEGEMEHLRAIGEM

Query:  WMRSSSRIRSEILKEAVAEDEAIVEQMKELQQHIYLCFLTINRSRRLVMKEIIMGDRDQNG
        WMR SSR   EILKE V EDEAIVEQMKELQQHIYLCF TINRSRRLVM E IMGD DQ+G
Subjt:  WMRSSSRIRSEILKEAVAEDEAIVEQMKELQQHIYLCFLTINRSRRLVMKEIIMGDRDQNG

XP_022144310.1 UPF0496 protein At1g20180 [Momordica charantia] | 1.6e-180 | 98.54
Query:  EKNKVRNNGLWRKSNVKEEYEAAFRTNSYIEITTPTDDTSSAPCRLQPDQDLILHDMSFQSSHIDHHLLVHYLEASFQAFETCQLLLQALHQTKISHAIA
        EKNKVRNNGLWRKSNVKEEYEAAFRTNSYIEITTPTDDTSSAPCRLQPDQDLILHDMSFQSSHIDHHLLVHYLEASFQAFETCQLLLQALHQTKISHAIA
Subjt:  EKNKVRNNGLWRKSNVKEEYEAAFRTNSYIEITTPTDDTSSAPCRLQPDQDLILHDMSFQSSHIDHHLLVHYLEASFQAFETCQLLLQALHQTKISHAIA

Query:  KRSVVNYLTIRASTADDDDNGNNGDG--VVYGELAYFSHLENPFSIASHAHFLALHDTHLELLHDLTHKRNQARRKLRLKKIWKKLGRACFVISNAAVLG
        KRSVVNYLT RAST DDDDNGNNGDG  VVYGELAYFSHLENPFSIASHAHFLALHDTHLELLHDLTHKRNQARRKLRLKKIWKKLGRACFVISNAAVLG
Subjt:  KRSVVNYLTIRASTADDDDNGNNGDG--VVYGELAYFSHLENPFSIASHAHFLALHDTHLELLHDLTHKRNQARRKLRLKKIWKKLGRACFVISNAAVLG

Query:  ALLVLALHSLIGVVAAPGLISCFVSLPKKKRDGTLVSSELVLRQMEIVMRATYITMNDLDTLSRMAARLEGEMEHLRAIGEMWMRSSSRIRSEILKEAVA
        ALLVLALHSLIGVVAAPGLISCFVSLPKKKRDGT VSSELVLRQMEIVMRATYITMNDLDTLSRMAARLEGEMEHLRAIGEMWMRSSSRIRSEILKEAVA
Subjt:  ALLVLALHSLIGVVAAPGLISCFVSLPKKKRDGTLVSSELVLRQMEIVMRATYITMNDLDTLSRMAARLEGEMEHLRAIGEMWMRSSSRIRSEILKEAVA

Query:  EDEAIVEQMKELQQHIYLCFLTINRSRRLVMKEIIMGDRDQNG
        EDEAIVEQMKELQQHIYLCFLTINRSRRLVMKEIIMGDRDQNG
Subjt:  EDEAIVEQMKELQQHIYLCFLTINRSRRLVMKEIIMGDRDQNG
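The % identity column appears to follow the usual BLAST convention: in the middle row of each alignment block a letter marks an identical residue, "+" a conservative substitution, and a blank a mismatch or gap. Counting that way over the four rows of the Momordica charantia hit above gives 338 identical columns out of 343, or 98.54%, which agrees with the listed value. The sketch below shows the calculation on a short excerpt of that alignment; `percent_identity` is a hypothetical helper name, and the padding step is an assumption made to tolerate trailing blanks lost during copying.

```python
def percent_identity(query_row, match_row):
    """Return (identical, alignment_length, percent) for one alignment block.

    query_row is the 'Query:' residues (gaps as '-'); match_row is the
    middle row printed beneath it. Letters in match_row count as identities.
    """
    if len(match_row) > len(query_row):
        raise ValueError("middle row is longer than the query row")
    # Trailing blanks in the middle row are often stripped; pad them back.
    match_row = match_row.ljust(len(query_row))
    identical = sum(1 for ch in match_row if ch.isalpha())
    length = len(query_row)
    return identical, length, 100.0 * identical / length


# Excerpt from the second block of the Momordica charantia hit.
query = "KRSVVNYLTIRASTADDDDNGNNGDG--VVYGELAY"
middle = "KRSVVNYLT RAST DDDDNGNNGDG  VVYGELAY"
identical, length, pct = percent_identity(query, middle)
print(identical, length, round(pct, 2))
```

To reproduce the figure for a whole hit, sum the identical counts and the row lengths over all blocks before dividing.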

XP_023002422.1 UPF0496 protein At1g20180 isoform X1 [Cucurbita maxima] | 3.9e-86 | 60.53
Query:  EKNKVRNNGLWRKSNVKEEYEAAFRTNSYIEITTPTDD--TSSA----PCRLQPDQDLILHDM--SFQSSHIDHHLLVHYLEASFQAFETCQLLLQALHQ
        +KNK   N L RK N+ EEY+ AFRT S+IE TT   D  TSS+    PCRL+PDQ L LHDM  S    H+ H LL+ Y EASFQAF+ CQLLLQA+HQ
Subjt:  EKNKVRNNGLWRKSNVKEEYEAAFRTNSYIEITTPTDD--TSSA----PCRLQPDQDLILHDM--SFQSSHIDHHLLVHYLEASFQAFETCQLLLQALHQ

Query:  TKISHAIAKRSVVNYLTIRASTADDDDNGNNGDGVVYGELAYFSHLENPFSIASHAHFLALHDTHLELLHDLTHKRNQARRKLRLKKIWKKLGRACFVIS
        T+I+HA+ K      L   AS ADDD+  +N +G+VY EL                 FL+LHD H ELL  L+  RN+ R+KLRL+   KK+   CFVIS
Subjt:  TKISHAIAKRSVVNYLTIRASTADDDDNGNNGDGVVYGELAYFSHLENPFSIASHAHFLALHDTHLELLHDLTHKRNQARRKLRLKKIWKKLGRACFVIS

Query:  NAAVLGALLVLALHSLIGVVAAPGLISCFV-SLPKKKRDGTLVSSELVLRQMEIVMRATYITMNDLDTLSRMAARLEGEMEHLRAIGEMWMRSSSRIRSE
        NAAVL  LLVLALHSL+G+VA PGLI CFV SL KK+RD      + +LRQMEI  RATYITMNDLDTLSRMA RL+GE+EHLRA+G MWMR S R+R E
Subjt:  NAAVLGALLVLALHSLIGVVAAPGLISCFV-SLPKKKRDGTLVSSELVLRQMEIVMRATYITMNDLDTLSRMAARLEGEMEHLRAIGEMWMRSSSRIRSE

Query:  ILKEAVAEDEAIVEQMKELQQHIYLCFLTINRSRRLVMKEII
        ILKE V ED+AIVEQMKELQQHIYLCF+TINRSRRLVM EI+
Subjt:  ILKEAVAEDEAIVEQMKELQQHIYLCFLTINRSRRLVMKEII

XP_031736612.1 UPF0496 protein At1g20180 [Cucumis sativus] | 1.6e-95 | 62.01
Query:  EKNKVRNNGLWRKSNVKEEYEAAFRTNSYIEITTPTDDTSS----APCRLQPDQDLILHDMSFQSSHI-DHHLLVHYLEASFQAFETCQLLLQALHQTKI
        +KNK   N L  K N+ EEY+AAFRTNSY+E T  T++TSS      C L+PDQD++LH+++    H+ DH LL+ Y +ASF+AFETCQLLLQAL+QTKI
Subjt:  EKNKVRNNGLWRKSNVKEEYEAAFRTNSYIEITTPTDDTSS----APCRLQPDQDLILHDMSFQSSHI-DHHLLVHYLEASFQAFETCQLLLQALHQTKI

Query:  SHAIAKRSVVNYLTIRASTADDDDNGNNGDGVVYGE------LAYFSHLENP-FSIASH--AHFLALHDTHLELLHDLTHKRNQARRKLRLKKIWKKLGR
        +H I   + V  LT  A  A DDDNGN+G   VYGE       ++FS L+NP FSI +   + FLALH++H ELL  L  K+N+ RRKLRLK+I K++ +
Subjt:  SHAIAKRSVVNYLTIRASTADDDDNGNNGDGVVYGE------LAYFSHLENP-FSIASH--AHFLALHDTHLELLHDLTHKRNQARRKLRLKKIWKKLGR

Query:  ACFVISNAAVLGALLVLALHSLIGVVAAPG-LISCFVSLPKK--KRDGTL-VSSELVLRQMEIVMRATYITMNDLDTLSRMAARLEGEMEHLRAIGEMWM
         C +ISNAAVL ALL+LALHSL+G+VAAPG LI+CFV L KK  KRD  L  + E  L+QMEI  R TYITMNDLDTLSRMAARL+ E+EHLRA+GEMWM
Subjt:  ACFVISNAAVLGALLVLALHSLIGVVAAPG-LISCFVSLPKK--KRDGTL-VSSELVLRQMEIVMRATYITMNDLDTLSRMAARLEGEMEHLRAIGEMWM

Query:  RSSSRIRSEILKEAVAEDEAIVEQMKELQQHIYLCFLTINRSRRLVMKEIIMGDRDQN
        R SSR R EILKE V EDEAIVEQMKELQQHIYLCF TINRSRRLVM E  MGD DQ+
Subjt:  RSSSRIRSEILKEAVAEDEAIVEQMKELQQHIYLCFLTINRSRRLVMKEIIMGDRDQN

XP_038884832.1 UPF0496 protein At1g20180-like [Benincasa hispida] | 9.1e-96 | 62.57
Query:  EKNKVRNNGLWRKSNVKEEYEAAFRTNSYIEITTPT---DDTSSA---PCRLQPDQDLILHDMSFQSSHIDHHLLVHYLEASFQAFETCQLLLQALHQTK
        +KNK   N L  K N+ EEY+AAFRTNSY+E TT     D+TSS+    C L+PDQD+ L   +    H+ HHLL+ Y  ASF+AF++CQLLLQALHQT+
Subjt:  EKNKVRNNGLWRKSNVKEEYEAAFRTNSYIEITTPT---DDTSSA---PCRLQPDQDLILHDMSFQSSHIDHHLLVHYLEASFQAFETCQLLLQALHQTK

Query:  ISHAIAKRSVVNYLTIRASTADDDDNGNNGDGVVYGE-----LAYFSHLENP-FSIASH--AHFLALHDTHLELLHDLTHKRNQARRKLRLKKIWKKLGR
        I+HAI    V   LT  A  ADDDDN     G+VY E     L+ F  L+NP FSI +H    FL LH++H ELL  LT KRN+ RR+LRLK+IWK+  R
Subjt:  ISHAIAKRSVVNYLTIRASTADDDDNGNNGDGVVYGE-----LAYFSHLENP-FSIASH--AHFLALHDTHLELLHDLTHKRNQARRKLRLKKIWKKLGR

Query:  ACFVISNAAVLGALLVLALHSLIGVVAAPGLISCFVSLPKKK--RDGTLV-SSELVLRQMEIVMRATYITMNDLDTLSRMAARLEGEMEHLRAIGEMWMR
         CF+ISNAAVL ALL+LA HSLIG+VAAPGLI+CFV L KKK  RD  +V SSE+VLRQMEI  RATYITMNDLDTLSRMAARL+ E+EHLRA+ EM++R
Subjt:  ACFVISNAAVLGALLVLALHSLIGVVAAPGLISCFVSLPKKK--RDGTLV-SSELVLRQMEIVMRATYITMNDLDTLSRMAARLEGEMEHLRAIGEMWMR

Query:  SSSRIRSEILKEAVAEDEAIVEQMKELQQHIYLCFLTINRSRRLVMKEIIMGDRDQNG
        SS   R EILKE V ED+A+VEQMK+LQQHIYLC LTINRSRRLVM E IMGDRDQ+G
Subjt:  SSSRIRSEILKEAVAEDEAIVEQMKELQQHIYLCFLTINRSRRLVMKEIIMGDRDQNG

TrEMBL top hits | e value | % identity | Alignment
A0A0A0LN80 Uncharacterized protein | 7.6e-96 | 62.01
Query:  EKNKVRNNGLWRKSNVKEEYEAAFRTNSYIEITTPTDDTSS----APCRLQPDQDLILHDMSFQSSHI-DHHLLVHYLEASFQAFETCQLLLQALHQTKI
        +KNK   N L  K N+ EEY+AAFRTNSY+E T  T++TSS      C L+PDQD++LH+++    H+ DH LL+ Y +ASF+AFETCQLLLQAL+QTKI
Subjt:  EKNKVRNNGLWRKSNVKEEYEAAFRTNSYIEITTPTDDTSS----APCRLQPDQDLILHDMSFQSSHI-DHHLLVHYLEASFQAFETCQLLLQALHQTKI

Query:  SHAIAKRSVVNYLTIRASTADDDDNGNNGDGVVYGE------LAYFSHLENP-FSIASH--AHFLALHDTHLELLHDLTHKRNQARRKLRLKKIWKKLGR
        +H I   + V  LT  A  A DDDNGN+G   VYGE       ++FS L+NP FSI +   + FLALH++H ELL  L  K+N+ RRKLRLK+I K++ +
Subjt:  SHAIAKRSVVNYLTIRASTADDDDNGNNGDGVVYGE------LAYFSHLENP-FSIASH--AHFLALHDTHLELLHDLTHKRNQARRKLRLKKIWKKLGR

Query:  ACFVISNAAVLGALLVLALHSLIGVVAAPG-LISCFVSLPKK--KRDGTL-VSSELVLRQMEIVMRATYITMNDLDTLSRMAARLEGEMEHLRAIGEMWM
         C +ISNAAVL ALL+LALHSL+G+VAAPG LI+CFV L KK  KRD  L  + E  L+QMEI  R TYITMNDLDTLSRMAARL+ E+EHLRA+GEMWM
Subjt:  ACFVISNAAVLGALLVLALHSLIGVVAAPG-LISCFVSLPKK--KRDGTL-VSSELVLRQMEIVMRATYITMNDLDTLSRMAARLEGEMEHLRAIGEMWM

Query:  RSSSRIRSEILKEAVAEDEAIVEQMKELQQHIYLCFLTINRSRRLVMKEIIMGDRDQN
        R SSR R EILKE V EDEAIVEQMKELQQHIYLCF TINRSRRLVM E  MGD DQ+
Subjt:  RSSSRIRSEILKEAVAEDEAIVEQMKELQQHIYLCFLTINRSRRLVMKEIIMGDRDQN

A0A1S3BDA5 UPF0496 protein At1g20180-like | 1.4e-94 | 61.22
Query:  EKNKVRNNGLWRKSNVKEEYEAAFRTNSYIEITTPTDDTSSA------PCRLQPDQDLILHDMSFQSSHI-DHHLLVHYLEASFQAFETCQLLLQALHQT
        +KNK   N L  K N+ EEY+AAFRTNSY+E T  T++TSS+       C LQPDQD+ LH+++    H+ +H LL+ Y +ASF+AFETC+LLLQAL+QT
Subjt:  EKNKVRNNGLWRKSNVKEEYEAAFRTNSYIEITTPTDDTSSA------PCRLQPDQDLILHDMSFQSSHI-DHHLLVHYLEASFQAFETCQLLLQALHQT

Query:  KISHAIAKRSVVNYLTIRASTADDDDNGNNGDGVVYGE------LAYFSHLENP-FSIASH--AHFLALHDTHLELLHDLTHKRNQARRKLRLKKIWKKL
        KI+H I   + +  LT  A  A+DDDNGN+  G+VYGE      L+ FSHL++P FSI +     FLALH++H ELL  L  KRN+ R KLRLK++ K+ 
Subjt:  KISHAIAKRSVVNYLTIRASTADDDDNGNNGDGVVYGE------LAYFSHLENP-FSIASH--AHFLALHDTHLELLHDLTHKRNQARRKLRLKKIWKKL

Query:  GRACFVISNAAVLGALLVLALHSLIGVVAAPG-LISCFVSLPKK--KRDGTL-VSSELVLRQMEIVMRATYITMNDLDTLSRMAARLEGEMEHLRAIGEM
         + C +ISNAAVL ALL+LALHSL+G+VAAPG LI+CFV L KK  KRD  L  + E  L+QMEI  R TYITMNDLDTLSRMAARL+ E+EHLRA+GEM
Subjt:  GRACFVISNAAVLGALLVLALHSLIGVVAAPG-LISCFVSLPKK--KRDGTL-VSSELVLRQMEIVMRATYITMNDLDTLSRMAARLEGEMEHLRAIGEM

Query:  WMRSSSRIRSEILKEAVAEDEAIVEQMKELQQHIYLCFLTINRSRRLVMKEIIMGDRDQNG
        WMR SSR   EILKE V EDEAIVEQMKELQQHIYLCF TINRSRRLVM E IMGD DQ+G
Subjt:  WMRSSSRIRSEILKEAVAEDEAIVEQMKELQQHIYLCFLTINRSRRLVMKEIIMGDRDQNG

A0A6J1CRA5 UPF0496 protein At1g20180 | 7.6e-181 | 98.54
Query:  EKNKVRNNGLWRKSNVKEEYEAAFRTNSYIEITTPTDDTSSAPCRLQPDQDLILHDMSFQSSHIDHHLLVHYLEASFQAFETCQLLLQALHQTKISHAIA
        EKNKVRNNGLWRKSNVKEEYEAAFRTNSYIEITTPTDDTSSAPCRLQPDQDLILHDMSFQSSHIDHHLLVHYLEASFQAFETCQLLLQALHQTKISHAIA
Subjt:  EKNKVRNNGLWRKSNVKEEYEAAFRTNSYIEITTPTDDTSSAPCRLQPDQDLILHDMSFQSSHIDHHLLVHYLEASFQAFETCQLLLQALHQTKISHAIA

Query:  KRSVVNYLTIRASTADDDDNGNNGDG--VVYGELAYFSHLENPFSIASHAHFLALHDTHLELLHDLTHKRNQARRKLRLKKIWKKLGRACFVISNAAVLG
        KRSVVNYLT RAST DDDDNGNNGDG  VVYGELAYFSHLENPFSIASHAHFLALHDTHLELLHDLTHKRNQARRKLRLKKIWKKLGRACFVISNAAVLG
Subjt:  KRSVVNYLTIRASTADDDDNGNNGDG--VVYGELAYFSHLENPFSIASHAHFLALHDTHLELLHDLTHKRNQARRKLRLKKIWKKLGRACFVISNAAVLG

Query:  ALLVLALHSLIGVVAAPGLISCFVSLPKKKRDGTLVSSELVLRQMEIVMRATYITMNDLDTLSRMAARLEGEMEHLRAIGEMWMRSSSRIRSEILKEAVA
        ALLVLALHSLIGVVAAPGLISCFVSLPKKKRDGT VSSELVLRQMEIVMRATYITMNDLDTLSRMAARLEGEMEHLRAIGEMWMRSSSRIRSEILKEAVA
Subjt:  ALLVLALHSLIGVVAAPGLISCFVSLPKKKRDGTLVSSELVLRQMEIVMRATYITMNDLDTLSRMAARLEGEMEHLRAIGEMWMRSSSRIRSEILKEAVA

Query:  EDEAIVEQMKELQQHIYLCFLTINRSRRLVMKEIIMGDRDQNG
        EDEAIVEQMKELQQHIYLCFLTINRSRRLVMKEIIMGDRDQNG
Subjt:  EDEAIVEQMKELQQHIYLCFLTINRSRRLVMKEIIMGDRDQNG

A0A6J1KJG4 UPF0496 protein At1g20180 isoform X1 | 1.9e-86 | 60.53
Query:  EKNKVRNNGLWRKSNVKEEYEAAFRTNSYIEITTPTDD--TSSA----PCRLQPDQDLILHDM--SFQSSHIDHHLLVHYLEASFQAFETCQLLLQALHQ
        +KNK   N L RK N+ EEY+ AFRT S+IE TT   D  TSS+    PCRL+PDQ L LHDM  S    H+ H LL+ Y EASFQAF+ CQLLLQA+HQ
Subjt:  EKNKVRNNGLWRKSNVKEEYEAAFRTNSYIEITTPTDD--TSSA----PCRLQPDQDLILHDM--SFQSSHIDHHLLVHYLEASFQAFETCQLLLQALHQ

Query:  TKISHAIAKRSVVNYLTIRASTADDDDNGNNGDGVVYGELAYFSHLENPFSIASHAHFLALHDTHLELLHDLTHKRNQARRKLRLKKIWKKLGRACFVIS
        T+I+HA+ K      L   AS ADDD+  +N +G+VY EL                 FL+LHD H ELL  L+  RN+ R+KLRL+   KK+   CFVIS
Subjt:  TKISHAIAKRSVVNYLTIRASTADDDDNGNNGDGVVYGELAYFSHLENPFSIASHAHFLALHDTHLELLHDLTHKRNQARRKLRLKKIWKKLGRACFVIS

Query:  NAAVLGALLVLALHSLIGVVAAPGLISCFV-SLPKKKRDGTLVSSELVLRQMEIVMRATYITMNDLDTLSRMAARLEGEMEHLRAIGEMWMRSSSRIRSE
        NAAVL  LLVLALHSL+G+VA PGLI CFV SL KK+RD      + +LRQMEI  RATYITMNDLDTLSRMA RL+GE+EHLRA+G MWMR S R+R E
Subjt:  NAAVLGALLVLALHSLIGVVAAPGLISCFV-SLPKKKRDGTLVSSELVLRQMEIVMRATYITMNDLDTLSRMAARLEGEMEHLRAIGEMWMRSSSRIRSE

Query:  ILKEAVAEDEAIVEQMKELQQHIYLCFLTINRSRRLVMKEII
        ILKE V ED+AIVEQMKELQQHIYLCF+TINRSRRLVM EI+
Subjt:  ILKEAVAEDEAIVEQMKELQQHIYLCFLTINRSRRLVMKEII

A0A6J1KTI4 UPF0496 protein At1g20180 isoform X2 | 1.6e-85 | 60.9
Query:  NGLWRKSNVKEEYEAAFRTNSYIEITTPTDD--TSSA----PCRLQPDQDLILHDM--SFQSSHIDHHLLVHYLEASFQAFETCQLLLQALHQTKISHAI
        N L RK N+ EEY+ AFRT S+IE TT   D  TSS+    PCRL+PDQ L LHDM  S    H+ H LL+ Y EASFQAF+ CQLLLQA+HQT+I+HA+
Subjt:  NGLWRKSNVKEEYEAAFRTNSYIEITTPTDD--TSSA----PCRLQPDQDLILHDM--SFQSSHIDHHLLVHYLEASFQAFETCQLLLQALHQTKISHAI

Query:  AKRSVVNYLTIRASTADDDDNGNNGDGVVYGELAYFSHLENPFSIASHAHFLALHDTHLELLHDLTHKRNQARRKLRLKKIWKKLGRACFVISNAAVLGA
         K      L   AS ADDD+  +N +G+VY EL                 FL+LHD H ELL  L+  RN+ R+KLRL+   KK+   CFVISNAAVL  
Subjt:  AKRSVVNYLTIRASTADDDDNGNNGDGVVYGELAYFSHLENPFSIASHAHFLALHDTHLELLHDLTHKRNQARRKLRLKKIWKKLGRACFVISNAAVLGA

Query:  LLVLALHSLIGVVAAPGLISCFV-SLPKKKRDGTLVSSELVLRQMEIVMRATYITMNDLDTLSRMAARLEGEMEHLRAIGEMWMRSSSRIRSEILKEAVA
        LLVLALHSL+G+VA PGLI CFV SL KK+RD      + +LRQMEI  RATYITMNDLDTLSRMA RL+GE+EHLRA+G MWMR S R+R EILKE V 
Subjt:  LLVLALHSLIGVVAAPGLISCFV-SLPKKKRDGTLVSSELVLRQMEIVMRATYITMNDLDTLSRMAARLEGEMEHLRAIGEMWMRSSSRIRSEILKEAVA

Query:  EDEAIVEQMKELQQHIYLCFLTINRSRRLVMKEII
        ED+AIVEQMKELQQHIYLCF+TINRSRRLVM EI+
Subjt:  EDEAIVEQMKELQQHIYLCFLTINRSRRLVMKEII

SwissProt top hits | e value | % identity | Alignment
A2XCJ1 UPF0496 protein 3 | 3.4e-08 | 24.23
Query:  KEEYEAAFRTNSYIEITTPTDDTSSA-------------PCR---------------LQPDQDLILHDM-SFQSSHID---HHLLVHYLEASFQAFETCQ
        +EEY +AFRT SY +      D + A              C                L+PDQ  +   + S + S +      LL  Y   +  A   C 
Subjt:  KEEYEAAFRTNSYIEITTPTDDTSSA-------------PCR---------------LQPDQDLILHDM-SFQSSHID---HHLLVHYLEASFQAFETCQ

Query:  LLLQALHQTKISHAIAKRSVVNYLTIRASTADDDDNGNNGDGVVYGELAYFSHLENPFS--IASHAHFLALHDTHLELLHDLTHKRNQARRKLRLKKIWK
         LL+ +   ++ +   K       T+R   +D   +G        G+         PF+   AS      +     +LL  L   R +AR ++R     +
Subjt:  LLLQALHQTKISHAIAKRSVVNYLTIRASTADDDDNGNNGDGVVYGELAYFSHLENPFS--IASHAHFLALHDTHLELLHDLTHKRNQARRKLRLKKIWK

Query:  KLGRACFVISNAAVLGALLVLALHSLIGVVAAPGLISCFVSLPKKKRDGTLVSSELVLRQMEIVMRATYITMNDLDTLSRMAARLEGEMEHLRAIGEMWM
        +     FV + A V      + +H L    A P +   +  L ++   G      LV  Q+E   + TYI   D++T+SR+ AR+  E EH+ A+  + +
Subjt:  KLGRACFVISNAAVLGALLVLALHSLIGVVAAPGLISCFVSLPKKKRDGTLVSSELVLRQMEIVMRATYITMNDLDTLSRMAARLEGEMEHLRAIGEMWM

Query:  R--------SSSRIRSEILKEAVAEDEAIVEQMKELQQHIYLCFLTINRSRRLVM
                    R+  E+L++    +E+  +Q+ EL++H++LCF+TIN++R +VM
Subjt:  R--------SSSRIRSEILKEAVAEDEAIVEQMKELQQHIYLCFLTINRSRRLVM

A2YH25 Putative UPF0496 protein 2 | 1.1e-19 | 31.52
Query:  LLVHYLEASFQAFETCQLLLQALHQTKISHAIAKRSVVNYLTIRASTADDDDNGNNGDGVVYGELAYFSHLENPFSIASHAHFLALHDTHLELLHDLTHK
        LL+ Y + + +A E C  LL A+   +  H   +R     L +R    DDDD            LA    L+NP S  S + F  +H     L   L   
Subjt:  LLVHYLEASFQAFETCQLLLQALHQTKISHAIAKRSVVNYLTIRASTADDDDNGNNGDGVVYGELAYFSHLENPFSIASHAHFLALHDTHLELLHDLTHK

Query:  RNQARRKLRLKKIWKKLGRACFVISNAAVLGALLVLALHSLIGV---VAAPGLISCFVSLPKKKRDGTLVSSELVLR---QMEIVMRATYITMNDLDTLS
        + + RR  R  +I +    A  V + AA + A +VLA H+L+G+    AA G      +    +R    VSS    R    ++   R  YI   DLDT+S
Subjt:  RNQARRKLRLKKIWKKLGRACFVISNAAVLGALLVLALHSLIGV---VAAPGLISCFVSLPKKKRDGTLVSSELVLR---QMEIVMRATYITMNDLDTLS

Query:  RMAARLEGEMEHLRAIGEMWMRSSSR--IRSEILKEAVAEDEAIVEQMKELQQHIYLCFLTINRSRRLVMKEIIMG
        RM  R   E+EH R +  + MR      +  E+ +E    +E +  Q+ EL++H+ LC +TINR+RRLV  E+  G
Subjt:  RMAARLEGEMEHLRAIGEMWMRSSSR--IRSEILKEAVAEDEAIVEQMKELQQHIYLCFLTINRSRRLVMKEIIMG

Q5Z8N6 Putative UPF0496 protein 2 | 1.5e-19 | 31.52
Query:  LLVHYLEASFQAFETCQLLLQALHQTKISHAIAKRSVVNYLTIRASTADDDDNGNNGDGVVYGELAYFSHLENPFSIASHAHFLALHDTHLELLHDLTHK
        LL+ Y + + +A E C  LL A+   +  H   +R     L +R    DDDD            LA    L+NP S  S + F  +H     L   L   
Subjt:  LLVHYLEASFQAFETCQLLLQALHQTKISHAIAKRSVVNYLTIRASTADDDDNGNNGDGVVYGELAYFSHLENPFSIASHAHFLALHDTHLELLHDLTHK

Query:  RNQARRKLRLKKIWKKLGRACFVISNAAVLGALLVLALHSLIGV---VAAPGLISCFVSLPKKKRDGTLVSSELVLR---QMEIVMRATYITMNDLDTLS
        + + RR  R  +I +    A  V + AA + A +VLA H+L+G+    AA G      +    +R    VSS    R    ++   R  YI   DLDT+S
Subjt:  RNQARRKLRLKKIWKKLGRACFVISNAAVLGALLVLALHSLIGV---VAAPGLISCFVSLPKKKRDGTLVSSELVLR---QMEIVMRATYITMNDLDTLS

Query:  RMAARLEGEMEHLRAIGEMWMRSSSR--IRSEILKEAVAEDEAIVEQMKELQQHIYLCFLTINRSRRLVMKEIIMG
        RM  R   E+EH R +  + MR      +  E+ +E    +E +  Q+ EL++H+ LC +TINR+RRLV  E+  G
Subjt:  RMAARLEGEMEHLRAIGEMWMRSSSR--IRSEILKEAVAEDEAIVEQMKELQQHIYLCFLTINRSRRLVMKEIIMG

Q6DYE5 UPF0496 protein At1g20180 | 5.5e-43 | 36.74
Query:  LWRKSNVKEEYEAAFRTNSYIEITTPTDD----------TSSAPC----------------RLQPDQDLILHDMSFQSSHIDHHLLVHYLEASFQAFETC
        L  K +V EEY+ AFRTNSY+E  T  +D          +SS+P                  L P Q+ +  D   Q S +D +L+V + + S +A + C
Subjt:  LWRKSNVKEEYEAAFRTNSYIEITTPTDD----------TSSAPC----------------RLQPDQDLILHDMSFQSSHIDHHLLVHYLEASFQAFETC

Query:  QLLLQALHQTKISHAIAKRSVVNYLTIRASTADDDDNGNNGDGVVYGELAYFSHLENPF-SIASHAHFLALHDTHLELLHDLTHKRNQARRKLRLKKIWK
        + LLQ L Q KI+H   KR +     +       + +      +++ EL+ F+ L+NP   I + A F  +HD + +LL  LT K+ + RRK+R  K  K
Subjt:  QLLLQALHQTKISHAIAKRSVVNYLTIRASTADDDDNGNNGDGVVYGELAYFSHLENPF-SIASHAHFLALHDTHLELLHDLTHKRNQARRKLRLKKIWK

Query:  KLGRACFVISNAAVLGALLVLALHSLIGVVAAPGLIS--CFVSLPKKKRDGTLVSS------ELVLRQMEIVMRATYITMNDLDTLSRMAARLEGEMEHL
        KLG    VI+++A++  LL++ALHS++GV AAP L+    F  L KKK  G +  S      E +  Q++I  +  +I +NDLDTLSR+A RL  E+EH 
Subjt:  KLGRACFVISNAAVLGALLVLALHSLIGVVAAPGLIS--CFVSLPKKKRDGTLVSS------ELVLRQMEIVMRATYITMNDLDTLSRMAARLEGEMEHL

Query:  RAIGEMWMRSSSRIRSEILKEAVAE----DEAIVEQMKELQQHIYLCFLTINRSRRLVMKEI
        + +  M  +S    + E+LKEA+ E    +E   +Q++EL++H+YLCF TINRSRRLV+ +I
Subjt:  RAIGEMWMRSSSRIRSEILKEAVAE----DEAIVEQMKELQQHIYLCFLTINRSRRLVMKEI

Q9SMU4 UPF0496 protein At3g49070 | 8.5e-20 | 28.02
Query:  NVKEEYEAAFRTNSYIEITT------------------PTDDTSSAPCRLQP---------DQDLILHDMSFQSSHIDHH---LLVHYLEASFQAFETCQ
        +V+EEY  AFRT SY    T                  P  ++SS   RL           D DL         S +  H   LL  Y   +  AF  C 
Subjt:  NVKEEYEAAFRTNSYIEITT------------------PTDDTSSAPCRLQP---------DQDLILHDMSFQSSHIDHH---LLVHYLEASFQAFETCQ

Query:  LLLQALHQTKISHAIAKRSVVNYLTIRASTADDDDNGNNGDGVVYGELAYFSHLENPFSIASHAHFLALHDTHLELLHDLTHKRNQARRKLRLKKIWKKL
         LL+ +H  +  +   K                  + N+    +  +    S   +PF I+S +    +    L LL  L  +R++ R KL+L       
Subjt:  LLLQALHQTKISHAIAKRSVVNYLTIRASTADDDDNGNNGDGVVYGELAYFSHLENPFSIASHAHFLALHDTHLELLHDLTHKRNQARRKLRLKKIWKKL

Query:  GRACFVISNAAVLGALLVLALHSLIGVVAAPGLISCFVSLP-----KKKRDGTLVSSELVLRQMEIVMRATYITMNDLDTLSRMAARLEGEMEHLRAIGE
              I+       LLVLAL + + V  A    + F++ P     + K  G          ++++  + TYI   DLDT+SR+  R+  E+ H+RA+ E
Subjt:  GRACFVISNAAVLGALLVLALHSLIGVVAAPGLISCFVSLP-----KKKRDGTLVSSELVLRQMEIVMRATYITMNDLDTLSRMAARLEGEMEHLRAIGE

Query:  MWM-RSSSRIR--SEILKEAVAEDEAIVEQMKELQQHIYLCFLTINRSRRLVMKEIIMGDRDQN
         W+ R S R+R   E+ +E    +E+  E++ EL++HIYLCF+TINR+R L++KEI+  D   N
Subjt:  MWM-RSSSRIR--SEILKEAVAEDEAIVEQMKELQQHIYLCFLTINRSRRLVMKEIIMGDRDQN

Arabidopsis top hits | e value | % identity | Alignment
AT1G20180.1 Protein of unknown function (DUF677) | 3.9e-44 | 36.74
Query:  LWRKSNVKEEYEAAFRTNSYIEITTPTDD----------TSSAPC----------------RLQPDQDLILHDMSFQSSHIDHHLLVHYLEASFQAFETC
        L  K +V EEY+ AFRTNSY+E  T  +D          +SS+P                  L P Q+ +  D   Q S +D +L+V + + S +A + C
Subjt:  LWRKSNVKEEYEAAFRTNSYIEITTPTDD----------TSSAPC----------------RLQPDQDLILHDMSFQSSHIDHHLLVHYLEASFQAFETC

Query:  QLLLQALHQTKISHAIAKRSVVNYLTIRASTADDDDNGNNGDGVVYGELAYFSHLENPF-SIASHAHFLALHDTHLELLHDLTHKRNQARRKLRLKKIWK
        + LLQ L Q KI+H   KR +     +       + +      +++ EL+ F+ L+NP   I + A F  +HD + +LL  LT K+ + RRK+R  K  K
Subjt:  QLLLQALHQTKISHAIAKRSVVNYLTIRASTADDDDNGNNGDGVVYGELAYFSHLENPF-SIASHAHFLALHDTHLELLHDLTHKRNQARRKLRLKKIWK

Query:  KLGRACFVISNAAVLGALLVLALHSLIGVVAAPGLIS--CFVSLPKKKRDGTLVSS------ELVLRQMEIVMRATYITMNDLDTLSRMAARLEGEMEHL
        KLG    VI+++A++  LL++ALHS++GV AAP L+    F  L KKK  G +  S      E +  Q++I  +  +I +NDLDTLSR+A RL  E+EH 
Subjt:  KLGRACFVISNAAVLGALLVLALHSLIGVVAAPGLIS--CFVSLPKKKRDGTLVSS------ELVLRQMEIVMRATYITMNDLDTLSRMAARLEGEMEHL

Query:  RAIGEMWMRSSSRIRSEILKEAVAE----DEAIVEQMKELQQHIYLCFLTINRSRRLVMKEI
        + +  M  +S    + E+LKEA+ E    +E   +Q++EL++H+YLCF TINRSRRLV+ +I
Subjt:  RAIGEMWMRSSSRIRSEILKEAVAE----DEAIVEQMKELQQHIYLCFLTINRSRRLVMKEI

AT1G20180.2 Protein of unknown function (DUF677) | 5.3e-33 | 32.87
Query:  LWRKSNVKEEYEAAFRTNSYIEITTPTDD----------TSSAPC----------------RLQPDQDLILHDMSFQSSHIDHHLLVHYLEASFQAFETC
        L  K +V EEY+ AFRTNSY+E  T  +D          +SS+P                  L P Q+ +  D   Q S +D +L+V + + S +A + C
Subjt:  LWRKSNVKEEYEAAFRTNSYIEITTPTDD----------TSSAPC----------------RLQPDQDLILHDMSFQSSHIDHHLLVHYLEASFQAFETC

Query:  QLLLQALHQTKISHAIAKRSVVNYLTIRASTADDDDNGNNGDGVVYGELAYFSHLENPF-SIASHAHFLALHDTHLELLHDLTHKRNQARRKLRLKKIWK
        + LLQ L Q KI+H   KR +     +       + +      +++ EL+ F+ L+NP   I + A F  +HD + +LL  LT K+ + RRK+       
Subjt:  QLLLQALHQTKISHAIAKRSVVNYLTIRASTADDDDNGNNGDGVVYGELAYFSHLENPF-SIASHAHFLALHDTHLELLHDLTHKRNQARRKLRLKKIWK

Query:  KLGRACFVISNAAVLGALLVLALHSLIGVVAAPGLIS--CFVSLPKKKRDGTLVSS------ELVLRQMEIVMRATYITMNDLDTLSRMAARLEGEMEHL
                                S++GV AAP L+    F  L KKK  G +  S      E +  Q++I  +  +I +NDLDTLSR+A RL  E+EH 
Subjt:  KLGRACFVISNAAVLGALLVLALHSLIGVVAAPGLIS--CFVSLPKKKRDGTLVSS------ELVLRQMEIVMRATYITMNDLDTLSRMAARLEGEMEHL

Query:  RAIGEMWMRSSSRIRSEILKEAVAE----DEAIVEQMKELQQHIYLCFLTINRSRRLVMKEI
        + +  M  +S    + E+LKEA+ E    +E   +Q++EL++H+YLCF TINRSRRLV+ +I
Subjt:  RAIGEMWMRSSSRIRSEILKEAVAE----DEAIVEQMKELQQHIYLCFLTINRSRRLVMKEI

AT3G19330.1 Protein of unknown function (DUF677) | 4.1e-09 | 23.44
Query:  NVKEEYEAAFRTNSYIEITT----PTDDTSSAPCRLQPDQDLILHDMSFQSSHIDHHLLVH------------YLEASFQAFETCQLLLQALHQTKISHA
        N+  E   AF+T SY ++ +      D T      +QPD +L+L  +   +       + H            Y + S  A   C  L Q +H  +    
Subjt:  NVKEEYEAAFRTNSYIEITT----PTDDTSSAPCRLQPDQDLILHDMSFQSSHIDHHLLVH------------YLEASFQAFETCQLLLQALHQTKISHA

Query:  IAKRSVVNYLTIRASTADDDDNGNNGDGVVYGELAYFSHLENPFSIASHAHFLALHDTHLELLHDLTHKRNQARRKLRLKKIWKKLGRACFVISNAAVLG
             + N     +  A D+   +    + +         ENPFS      F        +L H+L  +  ++R ++RL          C V +  AV  
Subjt:  IAKRSVVNYLTIRASTADDDDNGNNGDGVVYGELAYFSHLENPFSIASHAHFLALHDTHLELLHDLTHKRNQARRKLRLKKIWKKLGRACFVISNAAVLG

Query:  ALLVLALHSL-IGVVAAPGLISCFVSLPKKKRDGTLVSSELVLRQMEIVMRATYITMNDLDTLSRMAARLEGEMEHLRAIGEMWMRSSSRIRS--EILKE
        + +V+A H+L I +V A  L S ++    K+++ T +       Q+    + T++   DLDT+ R+ +RL   +E+ + +  + +     + S  EILK 
Subjt:  ALLVLALHSL-IGVVAAPGLISCFVSLPKKKRDGTLVSSELVLRQMEIVMRATYITMNDLDTLSRMAARLEGEMEHLRAIGEMWMRSSSRIRS--EILKE

Query:  AVAEDEAIVEQMKELQQHIYLCFLTINRSRRLVMKEI
               +  Q+K+L+ HI L F  +N++R L++ EI
Subjt:  AVAEDEAIVEQMKELQQHIYLCFLTINRSRRLVMKEI

AT3G19330.3 Protein of unknown function (DUF677) | 2.3e-04 | 21.13
Query:  NVKEEYEAAFRTNSYIEITT----PTDDTSSAPCRLQPDQDLILHDMSFQSSHIDHHLLVH------------YLEASFQAFETCQLLLQALHQTKISHA
        N+  E   AF+T SY ++ +      D T      +QPD +L+L  +   +       + H            Y + S  A   C  L Q +H  +    
Subjt:  NVKEEYEAAFRTNSYIEITT----PTDDTSSAPCRLQPDQDLILHDMSFQSSHIDHHLLVH------------YLEASFQAFETCQLLLQALHQTKISHA

Query:  IAKRSVVNYLTIRASTADDDDNGNNGDGVVYGELAYFSHLENPFSIASHAHFLALHDTHLELLHDLTHKRNQARRKLRLKKIWKKLGRACFVISNAAVLG
             + N     +  A D+   +    + +         ENPFS      F        +L H+L  +  ++R ++R                      
Subjt:  IAKRSVVNYLTIRASTADDDDNGNNGDGVVYGELAYFSHLENPFSIASHAHFLALHDTHLELLHDLTHKRNQARRKLRLKKIWKKLGRACFVISNAAVLG

Query:  ALLVLALHSLIGVVAAPGLISCFVSLPKKKRDGTLVSSELVLRQMEIVMRATYITMNDLDTLSRMAARLEGEMEHLRAIGEMWMRSSSRIRS--EILKEA
            L  H+  G + +P L   F     K+++ T +       Q+    + T++   DLDT+ R+ +RL   +E+ + +  + +     + S  EILK  
Subjt:  ALLVLALHSLIGVVAAPGLISCFVSLPKKKRDGTLVSSELVLRQMEIVMRATYITMNDLDTLSRMAARLEGEMEHLRAIGEMWMRSSSRIRS--EILKEA

Query:  VAEDEAIVEQMKELQQHIYLCFLTINRSRRLVMKEI
              +  Q+K+L+ HI L F  +N++R L++ EI
Subjt:  VAEDEAIVEQMKELQQHIYLCFLTINRSRRLVMKEI

AT3G49070.1 Protein of unknown function (DUF677) | 6.0e-21 | 28.02
Query:  NVKEEYEAAFRTNSYIEITT------------------PTDDTSSAPCRLQP---------DQDLILHDMSFQSSHIDHH---LLVHYLEASFQAFETCQ
        +V+EEY  AFRT SY    T                  P  ++SS   RL           D DL         S +  H   LL  Y   +  AF  C 
Subjt:  NVKEEYEAAFRTNSYIEITT------------------PTDDTSSAPCRLQP---------DQDLILHDMSFQSSHIDHH---LLVHYLEASFQAFETCQ

Query:  LLLQALHQTKISHAIAKRSVVNYLTIRASTADDDDNGNNGDGVVYGELAYFSHLENPFSIASHAHFLALHDTHLELLHDLTHKRNQARRKLRLKKIWKKL
         LL+ +H  +  +   K                  + N+    +  +    S   +PF I+S +    +    L LL  L  +R++ R KL+L       
Subjt:  LLLQALHQTKISHAIAKRSVVNYLTIRASTADDDDNGNNGDGVVYGELAYFSHLENPFSIASHAHFLALHDTHLELLHDLTHKRNQARRKLRLKKIWKKL

Query:  GRACFVISNAAVLGALLVLALHSLIGVVAAPGLISCFVSLP-----KKKRDGTLVSSELVLRQMEIVMRATYITMNDLDTLSRMAARLEGEMEHLRAIGE
              I+       LLVLAL + + V  A    + F++ P     + K  G          ++++  + TYI   DLDT+SR+  R+  E+ H+RA+ E
Subjt:  GRACFVISNAAVLGALLVLALHSLIGVVAAPGLISCFVSLP-----KKKRDGTLVSSELVLRQMEIVMRATYITMNDLDTLSRMAARLEGEMEHLRAIGE

Query:  MWM-RSSSRIR--SEILKEAVAEDEAIVEQMKELQQHIYLCFLTINRSRRLVMKEIIMGDRDQN
         W+ R S R+R   E+ +E    +E+  E++ EL++HIYLCF+TINR+R L++KEI+  D   N
Subjt:  MWM-RSSSRIR--SEILKEAVAEDEAIVEQMKELQQHIYLCFLTINRSRRLVMKEIIMGDRDQN


Sequences
CDS sequence
GAAAAGAACAAGGTTAGAAACAATGGGTTATGGAGAAAGTCAAACGTTAAGGAAGAATACGAAGCAGCCTTCAGAACAAACTCTTACATTGAAATCACAACCCCCACCGA
CGACACATCGTCCGCCCCATGTCGGCTCCAACCAGACCAAGACCTAATTCTCCACGACATGTCGTTCCAGAGCTCCCATATTGATCATCATCTTCTTGTTCACTACTTGG
AAGCCAGTTTTCAAGCTTTTGAAACCTGCCAACTTCTCCTCCAAGCCCTCCACCAAACGAAGATCAGCCACGCCATTGCCAAAAGATCAGTCGTCAATTACCTCACAATA
AGGGCTTCAACGGCGGACGACGACGATAACGGCAATAACGGCGACGGTGTTGTTTATGGGGAGTTGGCTTATTTTTCCCACCTCGAAAACCCTTTCTCCATTGCCAGCCA
CGCCCATTTTCTTGCCTTACACGACACCCATTTGGAGTTGCTTCATGACCTAACTCACAAACGAAATCAAGCTCGAAGAAAGTTGAGGTTGAAAAAGATCTGGAAGAAAT
TGGGCAGAGCTTGTTTTGTGATATCAAATGCAGCGGTTTTGGGCGCTCTGTTGGTTCTAGCTCTACATAGTCTCATCGGAGTAGTGGCGGCACCAGGGTTGATCAGTTGC
TTCGTAAGCTTACCGAAGAAGAAAAGAGATGGTACGCTTGTTTCGTCGGAGCTCGTCCTTCGACAAATGGAGATTGTGATGAGAGCGACGTATATAACGATGAACGACTT
GGATACGTTGAGTCGAATGGCAGCGAGGCTCGAGGGTGAGATGGAGCATCTGAGAGCCATTGGCGAGATGTGGATGAGAAGCAGCAGCAGAATAAGAAGTGAGATATTGA
AGGAGGCTGTTGCAGAAGATGAGGCCATTGTGGAGCAAATGAAGGAGCTTCAACAACATATCTACTTGTGTTTTCTTACCATAAATAGGTCTAGAAGGCTTGTTATGAAG
GAAATTATTATGGGAGATAGGGATCAAAATGGG
mRNA sequence
GAAAAGAACAAGGTTAGAAACAATGGGTTATGGAGAAAGTCAAACGTTAAGGAAGAATACGAAGCAGCCTTCAGAACAAACTCTTACATTGAAATCACAACCCCCACCGA
CGACACATCGTCCGCCCCATGTCGGCTCCAACCAGACCAAGACCTAATTCTCCACGACATGTCGTTCCAGAGCTCCCATATTGATCATCATCTTCTTGTTCACTACTTGG
AAGCCAGTTTTCAAGCTTTTGAAACCTGCCAACTTCTCCTCCAAGCCCTCCACCAAACGAAGATCAGCCACGCCATTGCCAAAAGATCAGTCGTCAATTACCTCACAATA
AGGGCTTCAACGGCGGACGACGACGATAACGGCAATAACGGCGACGGTGTTGTTTATGGGGAGTTGGCTTATTTTTCCCACCTCGAAAACCCTTTCTCCATTGCCAGCCA
CGCCCATTTTCTTGCCTTACACGACACCCATTTGGAGTTGCTTCATGACCTAACTCACAAACGAAATCAAGCTCGAAGAAAGTTGAGGTTGAAAAAGATCTGGAAGAAAT
TGGGCAGAGCTTGTTTTGTGATATCAAATGCAGCGGTTTTGGGCGCTCTGTTGGTTCTAGCTCTACATAGTCTCATCGGAGTAGTGGCGGCACCAGGGTTGATCAGTTGC
TTCGTAAGCTTACCGAAGAAGAAAAGAGATGGTACGCTTGTTTCGTCGGAGCTCGTCCTTCGACAAATGGAGATTGTGATGAGAGCGACGTATATAACGATGAACGACTT
GGATACGTTGAGTCGAATGGCAGCGAGGCTCGAGGGTGAGATGGAGCATCTGAGAGCCATTGGCGAGATGTGGATGAGAAGCAGCAGCAGAATAAGAAGTGAGATATTGA
AGGAGGCTGTTGCAGAAGATGAGGCCATTGTGGAGCAAATGAAGGAGCTTCAACAACATATCTACTTGTGTTTTCTTACCATAAATAGGTCTAGAAGGCTTGTTATGAAG
GAAATTATTATGGGAGATAGGGATCAAAATGGG
Protein sequence
EKNKVRNNGLWRKSNVKEEYEAAFRTNSYIEITTPTDDTSSAPCRLQPDQDLILHDMSFQSSHIDHHLLVHYLEASFQAFETCQLLLQALHQTKISHAIAKRSVVNYLTI
RASTADDDDNGNNGDGVVYGELAYFSHLENPFSIASHAHFLALHDTHLELLHDLTHKRNQARRKLRLKKIWKKLGRACFVISNAAVLGALLVLALHSLIGVVAAPGLISC
FVSLPKKKRDGTLVSSELVLRQMEIVMRATYITMNDLDTLSRMAARLEGEMEHLRAIGEMWMRSSSRIRSEILKEAVAEDEAIVEQMKELQQHIYLCFLTINRSRRLVMK
EIIMGDRDQNG
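The CDS and protein sequences listed above can be checked against each other by translating the CDS with the standard genetic code. The sketch below is illustrative only: `translate` and `CODON_TABLE` are hypothetical names, the use of the standard nuclear code is an assumption, and only the first 51 nucleotides of the CDS are used as input. No start codon is required, since the listed protein begins with E rather than M.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order
# (third base varies fastest); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a DNA coding sequence in frame 1; whitespace is ignored."""
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First 17 codons of the CDS sequence listed above.
cds_prefix = "GAAAAGAACAAGGTTAGAAACAATGGGTTATGGAGAAAGTCAAACGTTAAG"
print(translate(cds_prefix))
```

The printed peptide can be compared with the start of the listed protein sequence; passing the full CDS with its line breaks removed allows the same comparison over all 341 codons.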