; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS006832 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS006832
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionProtein of unknown function, DUF547
Genome locationscaffold60:999251..1002706
RNA-Seq ExpressionMS006832
SyntenyMS006832
Gene Ontology termsGO:0016853 - isomerase activity (molecular function)
InterPro domainsIPR006869 - Domain of unknown function DUF547


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0051762.1 Topoisomerase 1-associated factor 1 [Cucumis melo var. makuwa]7.9e-19175.29Show/hide
Query:  MQVKEVLAELAMVESEIARLELQITQLQKDLKTEQQQNSKS-------------NNK---------ATSFDTKALHFISKAIKGDYYALNN----NNAQH
        MQVKE+LAELAMVESEIARLE+QITQL+KDLK EQQ  +KS             NNK          T+FDTKALHFISKAIKGD YALN+    +N+++
Subjt:  MQVKEVLAELAMVESEIARLELQITQLQKDLKTEQQQNSKS-------------NNK---------ATSFDTKALHFISKAIKGDYYALNN----NNAQH

Query:  HITHPQ-------PLHQVQ--ERLVSRKSGLLVASSPLRDPRHPSPKQRERNPLDMAGLKSVPIAIQAEENMQHWQPNKLSESIMKCLNFIYVRLLRTSR
        +   P+       PLH+V+  ER VSRKSGLLVASSPLRDPRHPSPKQRERNPLD+   KS+P+  QAEEN+Q+W PNKLSESIMKCLNFIYVRLLR SR
Subjt:  HITHPQ-------PLHQVQ--ERLVSRKSGLLVASSPLRDPRHPSPKQRERNPLDMAGLKSVPIAIQAEENMQHWQPNKLSESIMKCLNFIYVRLLRTSR

Query:  TMELEKSGPISRSLH-SSLSSRSFRVENNGLNSSLPLHKESRQQDPYAIFDNEESIPRDIGPYKNLVIFTSTSMDPKSISSPSFIPLIRKLRALMSNLQR
        TMELEKSGPISRSLH SSLSSRSFRVE NGLNSSL  HKE RQQDPY IF+NEESIPRDIGPYKNLVIFTSTSMDPKSISS +FIPL+RKLR LMSNLQ+
Subjt:  TMELEKSGPISRSLH-SSLSSRSFRVENNGLNSSLPLHKESRQQDPYAIFDNEESIPRDIGPYKNLVIFTSTSMDPKSISSPSFIPLIRKLRALMSNLQR

Query:  VDLRPLSYQQKLAFWINMYNACIMHGFLQYGVPSSPEKLAALMNKA-----SNSINAPAIEHYILKKPGSPNREDEEKERVVGKLYGLESWEANATFGLC
        VDLRPLSYQQKLAFWINMYNACIM+GFLQYGVPSSPEKLA LMNKA      N+INA AI+HYIL+KP S N ED+ KE +V KLYGLES E N TF LC
Subjt:  VDLRPLSYQQKLAFWINMYNACIMHGFLQYGVPSSPEKLAALMNKA-----SNSINAPAIEHYILKKPGSPNREDEEKERVVGKLYGLESWEANATFGLC

Query:  CGTRSSPAVRIYSGEAVAAELERSKLEYLQASVVVTSSRRVAVPELLVRSLPEFTTT-AAAAAEWVCHQLPTSGSLRKSMVECFRSGHP---PTIHTLPY
        CGTRSSPAVRIYSGE V AELERSKLEYLQASVVVTSS+RVAVPELL+RSLPEF++       EWVCHQLPTSGSLRKS+VECFR GHP   PTI TL Y
Subjt:  CGTRSSPAVRIYSGEAVAAELERSKLEYLQASVVVTSSRRVAVPELLVRSLPEFTTT-AAAAAEWVCHQLPTSGSLRKSMVECFRSGHP---PTIHTLPY

Query:  DFEFQYLLPL
        DFEFQYLLPL
Subjt:  DFEFQYLLPL

XP_004139551.1 uncharacterized protein LOC101221529 [Cucumis sativus]2.4e-19275.39Show/hide
Query:  QVKEVLAELAMVESEIARLELQITQLQKDLKTEQQQNSKS------------NNK---------ATSFDTKALHFISKAIKGDYYALNN----NNAQHHI
        +VKE+LAELAMVESEIARLE+QITQLQKDLK EQQQ +KS            NNK          T+FDTKALHFISKAIKGDY  LN+    + ++++ 
Subjt:  QVKEVLAELAMVESEIARLELQITQLQKDLKTEQQQNSKS------------NNK---------ATSFDTKALHFISKAIKGDYYALNN----NNAQHHI

Query:  THPQ-------PLHQVQ--ERLVSRKSGLLVASSPLRDPRHPSPKQRERNPLDMAGLKSVPIAIQAEENMQHWQPNKLSESIMKCLNFIYVRLLRTSRTM
          P+       PLH+V+  ER VSRKSGLLVASSPLRDPRHPSPKQRERNPLD+   KS+P+  QAEEN+Q+W PNKLSESIMKCLNFIYVRLLR SRTM
Subjt:  THPQ-------PLHQVQ--ERLVSRKSGLLVASSPLRDPRHPSPKQRERNPLDMAGLKSVPIAIQAEENMQHWQPNKLSESIMKCLNFIYVRLLRTSRTM

Query:  ELEKSGPISRSLH-SSLSSRSFRVENNGLNSSLPLHKESRQQDPYAIFDNEESIPRDIGPYKNLVIFTSTSMDPKSISSPSFIPLIRKLRALMSNLQRVD
        ELEKSGPISRSLH SSLSSRSFRVE NGLNSSL  HKE RQQDPY IF+NEES+PRDIGPYKNLVIFTSTSMDPKSISS +FIPL+RKLR LMSNLQ+VD
Subjt:  ELEKSGPISRSLH-SSLSSRSFRVENNGLNSSLPLHKESRQQDPYAIFDNEESIPRDIGPYKNLVIFTSTSMDPKSISSPSFIPLIRKLRALMSNLQRVD

Query:  LRPLSYQQKLAFWINMYNACIMHGFLQYGVPSSPEKLAALMNKA-----SNSINAPAIEHYILKKPGSPNREDEEKERVVGKLYGLESWEANATFGLCCG
        LRPLSYQQKLAFWINMYNACIM+GFLQYGVPSSPEKLA LMNKA      N+INA AI+HYIL+KP S N+ED+ KE +V KLYGLES E N TF LCCG
Subjt:  LRPLSYQQKLAFWINMYNACIMHGFLQYGVPSSPEKLAALMNKA-----SNSINAPAIEHYILKKPGSPNREDEEKERVVGKLYGLESWEANATFGLCCG

Query:  TRSSPAVRIYSGEAVAAELERSKLEYLQASVVVTSSRRVAVPELLVRSLPEFTTT-AAAAAEWVCHQLPTSGSLRKSMVECFRSGHP---PTIHTLPYDF
        TRSSPAVRIYSGE V  ELERSKLEYLQASVVVTSS+RVAVPELLVRSLPEF++       EWVCHQLPTSGSLRKSMVECFR GHP   PTI TLPYDF
Subjt:  TRSSPAVRIYSGEAVAAELERSKLEYLQASVVVTSSRRVAVPELLVRSLPEFTTT-AAAAAEWVCHQLPTSGSLRKSMVECFRSGHP---PTIHTLPYDF

Query:  EFQYLLPL
        EFQYLLPL
Subjt:  EFQYLLPL

XP_008462917.1 PREDICTED: uncharacterized protein LOC103501181 isoform X1 [Cucumis melo]6.7e-19075.05Show/hide
Query:  QVKEVLAELAMVESEIARLELQITQLQKDLKTEQQQNSKS-------------NNK---------ATSFDTKALHFISKAIKGDYYALNN----NNAQHH
        +VKE+LAELAMVESEIARLE+QITQL+KDLK EQQ  +KS             NNK          T+FDTKALHFISKAIKGD YALN+    +N++++
Subjt:  QVKEVLAELAMVESEIARLELQITQLQKDLKTEQQQNSKS-------------NNK---------ATSFDTKALHFISKAIKGDYYALNN----NNAQHH

Query:  ITHPQ-------PLHQVQ--ERLVSRKSGLLVASSPLRDPRHPSPKQRERNPLDMAGLKSVPIAIQAEENMQHWQPNKLSESIMKCLNFIYVRLLRTSRT
           P+       PLH+V+  ER VSRKSGLLVASSPLRDPRHPSPKQRERNPLD+   KS+P+  QAEEN+Q+W PNKLSESIMKCLNFIYVRLLR SRT
Subjt:  ITHPQ-------PLHQVQ--ERLVSRKSGLLVASSPLRDPRHPSPKQRERNPLDMAGLKSVPIAIQAEENMQHWQPNKLSESIMKCLNFIYVRLLRTSRT

Query:  MELEKSGPISRSLH-SSLSSRSFRVENNGLNSSLPLHKESRQQDPYAIFDNEESIPRDIGPYKNLVIFTSTSMDPKSISSPSFIPLIRKLRALMSNLQRV
        MELEKSGPISRSLH SSLSSRSFRVE NGLNSSL  HKE RQQDPY IF+NEESIPRDIGPYKNLVIFTSTSMDPKSISS +FIPL+RKLR LMSNLQ+V
Subjt:  MELEKSGPISRSLH-SSLSSRSFRVENNGLNSSLPLHKESRQQDPYAIFDNEESIPRDIGPYKNLVIFTSTSMDPKSISSPSFIPLIRKLRALMSNLQRV

Query:  DLRPLSYQQKLAFWINMYNACIMHGFLQYGVPSSPEKLAALMNKA-----SNSINAPAIEHYILKKPGSPNREDEEKERVVGKLYGLESWEANATFGLCC
        DLRPLSYQQKLAFWINMYNACIM+GFLQYGVPSSPEKLA LMNKA      N+INA AI+HYIL+KP S N ED+ KE +V KLYGLES E N TF LCC
Subjt:  DLRPLSYQQKLAFWINMYNACIMHGFLQYGVPSSPEKLAALMNKA-----SNSINAPAIEHYILKKPGSPNREDEEKERVVGKLYGLESWEANATFGLCC

Query:  GTRSSPAVRIYSGEAVAAELERSKLEYLQASVVVTSSRRVAVPELLVRSLPEFTTT-AAAAAEWVCHQLPTSGSLRKSMVECFRSGHP---PTIHTLPYD
        GTRSSPAVRIYSGE V AELERSKLEYLQASVVVTSS+RVAVPELL+RSLPEF++       EWVCHQLPTSGSLRKS+VECFR GHP   PTI TL YD
Subjt:  GTRSSPAVRIYSGEAVAAELERSKLEYLQASVVVTSSRRVAVPELLVRSLPEFTTT-AAAAAEWVCHQLPTSGSLRKSMVECFRSGHP---PTIHTLPYD

Query:  FEFQYLLPL
        FEFQYLLPL
Subjt:  FEFQYLLPL

XP_022152890.1 uncharacterized protein LOC111020513 [Momordica charantia]3.1e-25698.29Show/hide
Query:  QVKEVLAELAMVESEIARLELQITQLQKDLKTEQQQNSKSNNKATSFDTKALHFISKAIKGDYYALNNNNAQHHITHPQPLHQVQERLVSRKSGLLVASS
        +VKEVLAELAMVESEIARLELQITQLQKDLKTEQQQNSKSN+KATSFDTKALHFISKAIKGDYYALNNNNAQHHITHPQPLHQVQERLVSRKSGLLVASS
Subjt:  QVKEVLAELAMVESEIARLELQITQLQKDLKTEQQQNSKSNNKATSFDTKALHFISKAIKGDYYALNNNNAQHHITHPQPLHQVQERLVSRKSGLLVASS

Query:  PLRDPRHPSPKQRERNPLDMAGLKSVPIAIQAEENMQHWQPNKLSESIMKCLNFIYVRLLRTSRTMELEKSGPISRSLHSSLSSRSFRVENNGLNSSLPL
        PLRDPRHPSPKQRERNPLDMAGLKSVPIAIQAEENMQHWQPNKLSESIMKCLNFIYVRLLRTSRTMELEKSGPISRSLHSSLSSRSFRVENNGLNSSLPL
Subjt:  PLRDPRHPSPKQRERNPLDMAGLKSVPIAIQAEENMQHWQPNKLSESIMKCLNFIYVRLLRTSRTMELEKSGPISRSLHSSLSSRSFRVENNGLNSSLPL

Query:  HKESRQQDPYAIFDNEESIPRDIGPYKNLVIFTSTSMDPKSISSPSFIPLIRKLRALMSNLQRVDLRPLSYQQKLAFWINMYNACIMHGFLQYGVPSSPE
        HKESRQQDPYAIFDNEESIPRDIGPYKNLVIFTSTSMDPKSISSPSFIPLIRKLRALMSNLQRVDLRPLSYQQKLAFWINMYNACIMHGFLQYGVPSSPE
Subjt:  HKESRQQDPYAIFDNEESIPRDIGPYKNLVIFTSTSMDPKSISSPSFIPLIRKLRALMSNLQRVDLRPLSYQQKLAFWINMYNACIMHGFLQYGVPSSPE

Query:  KLAALMNKAS-----NSINAPAIEHYILKKPGSPNREDEEKERVVGKLYGLESWEANATFGLCCGTRSSPAVRIYSGEAVAAELERSKLEYLQASVVVTS
        KLAALMNKA+     NSINAPAIEHYILKKPGSPNREDEEKERVVGKLYGLESWEANATFGLCCGTRSSPAVRIYSGEAVAAELERSKLEYLQASVVVTS
Subjt:  KLAALMNKAS-----NSINAPAIEHYILKKPGSPNREDEEKERVVGKLYGLESWEANATFGLCCGTRSSPAVRIYSGEAVAAELERSKLEYLQASVVVTS

Query:  SRRVAVPELLVRSLPEFTTTAAAAAEWVCHQLPTSGSLRKSMVECFRSGHPPTIHTLPYDFEFQYLLPL
        SRRVAVPELLVRSLPEFTTTAAAAAEWVCHQLPTSGSLRKSMVECFRSGHPPTIHTLPYDFEFQYLLPL
Subjt:  SRRVAVPELLVRSLPEFTTTAAAAAEWVCHQLPTSGSLRKSMVECFRSGHPPTIHTLPYDFEFQYLLPL

XP_038894153.1 uncharacterized protein LOC120082868 [Benincasa hispida]7.9e-19175.59Show/hide
Query:  QVKEVLAELAMVESEIARLELQITQLQKDLKTEQQQ-------------NSKSNNK---------ATSFDTKALHFISKAIKGDYYALNN---NNAQ---
        +VKEVLAELAMVESEIARLE+QITQLQKDLKTEQQ               + +NNK           +FDTKALHFISKAIKGD YALN+   +NA+   
Subjt:  QVKEVLAELAMVESEIARLELQITQLQKDLKTEQQQ-------------NSKSNNK---------ATSFDTKALHFISKAIKGDYYALNN---NNAQ---

Query:  ---------HHITHPQPLHQVQERLVSRKSGLLVASSPLRDPRHPSPKQRERNPLDMAGLKSVPIAIQAEENMQHWQPNKLSESIMKCLNFIYVRLLRTS
                 HH+     LH+     VSRKSGLLVASSPLRDPRHPSPKQRER+ LDM   KS+P+ IQAEEN+Q+W PNKLSESIMKCLNF+YVRLLR S
Subjt:  ---------HHITHPQPLHQVQERLVSRKSGLLVASSPLRDPRHPSPKQRERNPLDMAGLKSVPIAIQAEENMQHWQPNKLSESIMKCLNFIYVRLLRTS

Query:  RTMELEKSGPISRSLH-SSLSSRSFRVENNGLNSSLPLHKESRQQDPYAIFDNEESIPRDIGPYKNLVIFTSTSMDPKSISSPSFIPLIRKLRALMSNLQ
        RTMELEKSGPISRSLH SSLSSRSFRVE NGLNSSL +HKE RQQDPY IF+NEESIPRDIGPYKNLVIFTSTSMDPKSISS +FIPLIRKLR LMSNLQ
Subjt:  RTMELEKSGPISRSLH-SSLSSRSFRVENNGLNSSLPLHKESRQQDPYAIFDNEESIPRDIGPYKNLVIFTSTSMDPKSISSPSFIPLIRKLRALMSNLQ

Query:  RVDLRPLSYQQKLAFWINMYNACIMHGFLQYGVPSSPEKLAALMNKA-----SNSINAPAIEHYILKKPGSPNREDEEKERVVGKLYGLESWEANATFGL
        +VDLRPLSYQQKLAFWINMYNACIM+GFLQYGVPSSPEKLA LMNKA      N+INA AIEHYIL+K  S N+ED+ KE VV KLYGLES E N TF L
Subjt:  RVDLRPLSYQQKLAFWINMYNACIMHGFLQYGVPSSPEKLAALMNKA-----SNSINAPAIEHYILKKPGSPNREDEEKERVVGKLYGLESWEANATFGL

Query:  CCGTRSSPAVRIYSGEAVAAELERSKLEYLQASVVVTSSRRVAVPELLVRSLPEFTTTA--AAAAEWVCHQLPTSGSLRKSMVECFRSGHP---PTIHTL
        CCGTRSSPAVRIYSGEAVAAELERSKLEYLQASVVVT+SRRVAVPELLVRSLPEF   A   A  EWVCHQLPTSGSLRKSMVECFR+ HP   PTI TL
Subjt:  CCGTRSSPAVRIYSGEAVAAELERSKLEYLQASVVVTSSRRVAVPELLVRSLPEFTTTA--AAAAEWVCHQLPTSGSLRKSMVECFRSGHP---PTIHTL

Query:  PYDFEFQYLLPL
        PYDFEFQYLLPL
Subjt:  PYDFEFQYLLPL

TrEMBL top hitse value%identityAlignment
A0A1S3CHZ3 uncharacterized protein LOC103501181 isoform X13.2e-19075.05Show/hide
Query:  QVKEVLAELAMVESEIARLELQITQLQKDLKTEQQQNSKS-------------NNK---------ATSFDTKALHFISKAIKGDYYALNN----NNAQHH
        +VKE+LAELAMVESEIARLE+QITQL+KDLK EQQ  +KS             NNK          T+FDTKALHFISKAIKGD YALN+    +N++++
Subjt:  QVKEVLAELAMVESEIARLELQITQLQKDLKTEQQQNSKS-------------NNK---------ATSFDTKALHFISKAIKGDYYALNN----NNAQHH

Query:  ITHPQ-------PLHQVQ--ERLVSRKSGLLVASSPLRDPRHPSPKQRERNPLDMAGLKSVPIAIQAEENMQHWQPNKLSESIMKCLNFIYVRLLRTSRT
           P+       PLH+V+  ER VSRKSGLLVASSPLRDPRHPSPKQRERNPLD+   KS+P+  QAEEN+Q+W PNKLSESIMKCLNFIYVRLLR SRT
Subjt:  ITHPQ-------PLHQVQ--ERLVSRKSGLLVASSPLRDPRHPSPKQRERNPLDMAGLKSVPIAIQAEENMQHWQPNKLSESIMKCLNFIYVRLLRTSRT

Query:  MELEKSGPISRSLH-SSLSSRSFRVENNGLNSSLPLHKESRQQDPYAIFDNEESIPRDIGPYKNLVIFTSTSMDPKSISSPSFIPLIRKLRALMSNLQRV
        MELEKSGPISRSLH SSLSSRSFRVE NGLNSSL  HKE RQQDPY IF+NEESIPRDIGPYKNLVIFTSTSMDPKSISS +FIPL+RKLR LMSNLQ+V
Subjt:  MELEKSGPISRSLH-SSLSSRSFRVENNGLNSSLPLHKESRQQDPYAIFDNEESIPRDIGPYKNLVIFTSTSMDPKSISSPSFIPLIRKLRALMSNLQRV

Query:  DLRPLSYQQKLAFWINMYNACIMHGFLQYGVPSSPEKLAALMNKA-----SNSINAPAIEHYILKKPGSPNREDEEKERVVGKLYGLESWEANATFGLCC
        DLRPLSYQQKLAFWINMYNACIM+GFLQYGVPSSPEKLA LMNKA      N+INA AI+HYIL+KP S N ED+ KE +V KLYGLES E N TF LCC
Subjt:  DLRPLSYQQKLAFWINMYNACIMHGFLQYGVPSSPEKLAALMNKA-----SNSINAPAIEHYILKKPGSPNREDEEKERVVGKLYGLESWEANATFGLCC

Query:  GTRSSPAVRIYSGEAVAAELERSKLEYLQASVVVTSSRRVAVPELLVRSLPEFTTT-AAAAAEWVCHQLPTSGSLRKSMVECFRSGHP---PTIHTLPYD
        GTRSSPAVRIYSGE V AELERSKLEYLQASVVVTSS+RVAVPELL+RSLPEF++       EWVCHQLPTSGSLRKS+VECFR GHP   PTI TL YD
Subjt:  GTRSSPAVRIYSGEAVAAELERSKLEYLQASVVVTSSRRVAVPELLVRSLPEFTTT-AAAAAEWVCHQLPTSGSLRKSMVECFRSGHP---PTIHTLPYD

Query:  FEFQYLLPL
        FEFQYLLPL
Subjt:  FEFQYLLPL

A0A1S3CJL7 uncharacterized protein LOC103501181 isoform X23.2e-19075.05Show/hide
Query:  QVKEVLAELAMVESEIARLELQITQLQKDLKTEQQQNSKS-------------NNK---------ATSFDTKALHFISKAIKGDYYALNN----NNAQHH
        +VKE+LAELAMVESEIARLE+QITQL+KDLK EQQ  +KS             NNK          T+FDTKALHFISKAIKGD YALN+    +N++++
Subjt:  QVKEVLAELAMVESEIARLELQITQLQKDLKTEQQQNSKS-------------NNK---------ATSFDTKALHFISKAIKGDYYALNN----NNAQHH

Query:  ITHPQ-------PLHQVQ--ERLVSRKSGLLVASSPLRDPRHPSPKQRERNPLDMAGLKSVPIAIQAEENMQHWQPNKLSESIMKCLNFIYVRLLRTSRT
           P+       PLH+V+  ER VSRKSGLLVASSPLRDPRHPSPKQRERNPLD+   KS+P+  QAEEN+Q+W PNKLSESIMKCLNFIYVRLLR SRT
Subjt:  ITHPQ-------PLHQVQ--ERLVSRKSGLLVASSPLRDPRHPSPKQRERNPLDMAGLKSVPIAIQAEENMQHWQPNKLSESIMKCLNFIYVRLLRTSRT

Query:  MELEKSGPISRSLH-SSLSSRSFRVENNGLNSSLPLHKESRQQDPYAIFDNEESIPRDIGPYKNLVIFTSTSMDPKSISSPSFIPLIRKLRALMSNLQRV
        MELEKSGPISRSLH SSLSSRSFRVE NGLNSSL  HKE RQQDPY IF+NEESIPRDIGPYKNLVIFTSTSMDPKSISS +FIPL+RKLR LMSNLQ+V
Subjt:  MELEKSGPISRSLH-SSLSSRSFRVENNGLNSSLPLHKESRQQDPYAIFDNEESIPRDIGPYKNLVIFTSTSMDPKSISSPSFIPLIRKLRALMSNLQRV

Query:  DLRPLSYQQKLAFWINMYNACIMHGFLQYGVPSSPEKLAALMNKA-----SNSINAPAIEHYILKKPGSPNREDEEKERVVGKLYGLESWEANATFGLCC
        DLRPLSYQQKLAFWINMYNACIM+GFLQYGVPSSPEKLA LMNKA      N+INA AI+HYIL+KP S N ED+ KE +V KLYGLES E N TF LCC
Subjt:  DLRPLSYQQKLAFWINMYNACIMHGFLQYGVPSSPEKLAALMNKA-----SNSINAPAIEHYILKKPGSPNREDEEKERVVGKLYGLESWEANATFGLCC

Query:  GTRSSPAVRIYSGEAVAAELERSKLEYLQASVVVTSSRRVAVPELLVRSLPEFTTT-AAAAAEWVCHQLPTSGSLRKSMVECFRSGHP---PTIHTLPYD
        GTRSSPAVRIYSGE V AELERSKLEYLQASVVVTSS+RVAVPELL+RSLPEF++       EWVCHQLPTSGSLRKS+VECFR GHP   PTI TL YD
Subjt:  GTRSSPAVRIYSGEAVAAELERSKLEYLQASVVVTSSRRVAVPELLVRSLPEFTTT-AAAAAEWVCHQLPTSGSLRKSMVECFRSGHP---PTIHTLPYD

Query:  FEFQYLLPL
        FEFQYLLPL
Subjt:  FEFQYLLPL

A0A5A7U7M8 Topoisomerase 1-associated factor 13.8e-19175.29Show/hide
Query:  MQVKEVLAELAMVESEIARLELQITQLQKDLKTEQQQNSKS-------------NNK---------ATSFDTKALHFISKAIKGDYYALNN----NNAQH
        MQVKE+LAELAMVESEIARLE+QITQL+KDLK EQQ  +KS             NNK          T+FDTKALHFISKAIKGD YALN+    +N+++
Subjt:  MQVKEVLAELAMVESEIARLELQITQLQKDLKTEQQQNSKS-------------NNK---------ATSFDTKALHFISKAIKGDYYALNN----NNAQH

Query:  HITHPQ-------PLHQVQ--ERLVSRKSGLLVASSPLRDPRHPSPKQRERNPLDMAGLKSVPIAIQAEENMQHWQPNKLSESIMKCLNFIYVRLLRTSR
        +   P+       PLH+V+  ER VSRKSGLLVASSPLRDPRHPSPKQRERNPLD+   KS+P+  QAEEN+Q+W PNKLSESIMKCLNFIYVRLLR SR
Subjt:  HITHPQ-------PLHQVQ--ERLVSRKSGLLVASSPLRDPRHPSPKQRERNPLDMAGLKSVPIAIQAEENMQHWQPNKLSESIMKCLNFIYVRLLRTSR

Query:  TMELEKSGPISRSLH-SSLSSRSFRVENNGLNSSLPLHKESRQQDPYAIFDNEESIPRDIGPYKNLVIFTSTSMDPKSISSPSFIPLIRKLRALMSNLQR
        TMELEKSGPISRSLH SSLSSRSFRVE NGLNSSL  HKE RQQDPY IF+NEESIPRDIGPYKNLVIFTSTSMDPKSISS +FIPL+RKLR LMSNLQ+
Subjt:  TMELEKSGPISRSLH-SSLSSRSFRVENNGLNSSLPLHKESRQQDPYAIFDNEESIPRDIGPYKNLVIFTSTSMDPKSISSPSFIPLIRKLRALMSNLQR

Query:  VDLRPLSYQQKLAFWINMYNACIMHGFLQYGVPSSPEKLAALMNKA-----SNSINAPAIEHYILKKPGSPNREDEEKERVVGKLYGLESWEANATFGLC
        VDLRPLSYQQKLAFWINMYNACIM+GFLQYGVPSSPEKLA LMNKA      N+INA AI+HYIL+KP S N ED+ KE +V KLYGLES E N TF LC
Subjt:  VDLRPLSYQQKLAFWINMYNACIMHGFLQYGVPSSPEKLAALMNKA-----SNSINAPAIEHYILKKPGSPNREDEEKERVVGKLYGLESWEANATFGLC

Query:  CGTRSSPAVRIYSGEAVAAELERSKLEYLQASVVVTSSRRVAVPELLVRSLPEFTTT-AAAAAEWVCHQLPTSGSLRKSMVECFRSGHP---PTIHTLPY
        CGTRSSPAVRIYSGE V AELERSKLEYLQASVVVTSS+RVAVPELL+RSLPEF++       EWVCHQLPTSGSLRKS+VECFR GHP   PTI TL Y
Subjt:  CGTRSSPAVRIYSGEAVAAELERSKLEYLQASVVVTSSRRVAVPELLVRSLPEFTTT-AAAAAEWVCHQLPTSGSLRKSMVECFRSGHP---PTIHTLPY

Query:  DFEFQYLLPL
        DFEFQYLLPL
Subjt:  DFEFQYLLPL

A0A6J1DF87 uncharacterized protein LOC1110205133.4e-25698.29Show/hide
Query:  QVKEVLAELAMVESEIARLELQITQLQKDLKTEQQQNSKSNNKATSFDTKALHFISKAIKGDYYALNNNNAQHHITHPQPLHQVQERLVSRKSGLLVASS
        +VKEVLAELAMVESEIARLELQITQLQKDLKTEQQQNSKSN+KATSFDTKALHFISKAIKGDYYALNNNNAQHHITHPQPLHQVQERLVSRKSGLLVASS
Subjt:  QVKEVLAELAMVESEIARLELQITQLQKDLKTEQQQNSKSNNKATSFDTKALHFISKAIKGDYYALNNNNAQHHITHPQPLHQVQERLVSRKSGLLVASS

Query:  PLRDPRHPSPKQRERNPLDMAGLKSVPIAIQAEENMQHWQPNKLSESIMKCLNFIYVRLLRTSRTMELEKSGPISRSLHSSLSSRSFRVENNGLNSSLPL
        PLRDPRHPSPKQRERNPLDMAGLKSVPIAIQAEENMQHWQPNKLSESIMKCLNFIYVRLLRTSRTMELEKSGPISRSLHSSLSSRSFRVENNGLNSSLPL
Subjt:  PLRDPRHPSPKQRERNPLDMAGLKSVPIAIQAEENMQHWQPNKLSESIMKCLNFIYVRLLRTSRTMELEKSGPISRSLHSSLSSRSFRVENNGLNSSLPL

Query:  HKESRQQDPYAIFDNEESIPRDIGPYKNLVIFTSTSMDPKSISSPSFIPLIRKLRALMSNLQRVDLRPLSYQQKLAFWINMYNACIMHGFLQYGVPSSPE
        HKESRQQDPYAIFDNEESIPRDIGPYKNLVIFTSTSMDPKSISSPSFIPLIRKLRALMSNLQRVDLRPLSYQQKLAFWINMYNACIMHGFLQYGVPSSPE
Subjt:  HKESRQQDPYAIFDNEESIPRDIGPYKNLVIFTSTSMDPKSISSPSFIPLIRKLRALMSNLQRVDLRPLSYQQKLAFWINMYNACIMHGFLQYGVPSSPE

Query:  KLAALMNKAS-----NSINAPAIEHYILKKPGSPNREDEEKERVVGKLYGLESWEANATFGLCCGTRSSPAVRIYSGEAVAAELERSKLEYLQASVVVTS
        KLAALMNKA+     NSINAPAIEHYILKKPGSPNREDEEKERVVGKLYGLESWEANATFGLCCGTRSSPAVRIYSGEAVAAELERSKLEYLQASVVVTS
Subjt:  KLAALMNKAS-----NSINAPAIEHYILKKPGSPNREDEEKERVVGKLYGLESWEANATFGLCCGTRSSPAVRIYSGEAVAAELERSKLEYLQASVVVTS

Query:  SRRVAVPELLVRSLPEFTTTAAAAAEWVCHQLPTSGSLRKSMVECFRSGHPPTIHTLPYDFEFQYLLPL
        SRRVAVPELLVRSLPEFTTTAAAAAEWVCHQLPTSGSLRKSMVECFRSGHPPTIHTLPYDFEFQYLLPL
Subjt:  SRRVAVPELLVRSLPEFTTTAAAAAEWVCHQLPTSGSLRKSMVECFRSGHPPTIHTLPYDFEFQYLLPL

A0A6J1G939 uncharacterized protein LOC111452055 isoform X11.2e-18172.19Show/hide
Query:  QVKEVLAELAMVESEIARLELQITQLQKDLKTEQQ-----------QNSKSNN------------KATSFDTKALHFISKAIKGDYY-------ALNNNN
        +VKE+LAELAMVESEI RLE+QIT+LQKDLK+E+Q           Q +++NN               +FDTK LHFISKAIKGDY        A N + 
Subjt:  QVKEVLAELAMVESEIARLELQITQLQKDLKTEQQ-----------QNSKSNN------------KATSFDTKALHFISKAIKGDYY-------ALNNNN

Query:  AQHHITH-PQPLHQVQERLVSRKSGLLVASSPLRDPRHPSPKQRERNPLDMAGLKSVPIAIQAEENMQHWQPNKLSESIMKCLNFIYVRLLRTSRTMELE
             TH P    ++QER V RKSGLLV  SPLR+P+HPSPK+RER+PL M   K V + IQ EEN+Q+W PNKLSESI+KCLNFIYVRLLRTSRTMELE
Subjt:  AQHHITH-PQPLHQVQERLVSRKSGLLVASSPLRDPRHPSPKQRERNPLDMAGLKSVPIAIQAEENMQHWQPNKLSESIMKCLNFIYVRLLRTSRTMELE

Query:  KSGPISRSLHSSLSSRSFRVENNGLNSSLPLHKESRQQDPYAIFDNEESIPRDIGPYKNLVIFTSTSMDPKSISSPSFIPLIRKLRALMSNLQRVDLRPL
        KSGPISRSLHSSLSSRSFRVE NGLNS L LHKE RQQDPY+IF+NEESIPRDIGPYKNLVIFTSTSMDPKSI+S +FIPLI KLR LMSNLQ VDL+PL
Subjt:  KSGPISRSLHSSLSSRSFRVENNGLNSSLPLHKESRQQDPYAIFDNEESIPRDIGPYKNLVIFTSTSMDPKSISSPSFIPLIRKLRALMSNLQRVDLRPL

Query:  SYQQKLAFWINMYNACIMHGFLQYGVPSSPEKLAALMNKA-----SNSINAPAIEHYILKKPGSPNREDEEKERVVGKLYGLESWEANATFGLCCGTRSS
        +YQQKLAFWINMYNACIM+GFL YGVPSSPEKLAAL+NKA      N+INA AIEH+IL+KP S N ED  KE VV KLYGLES + N TF LCCGTRSS
Subjt:  SYQQKLAFWINMYNACIMHGFLQYGVPSSPEKLAALMNKA-----SNSINAPAIEHYILKKPGSPNREDEEKERVVGKLYGLESWEANATFGLCCGTRSS

Query:  PAVRIYSGEAVAAELERSKLEYLQASVVVTSSRRVAVPELLVRSLPEFTTTAAAA-----AEWVCHQLPTSGSLRKSMVECFRSGH---PPTIHTLPYDF
        PAVRIYSGEAV AELERSKLEYLQAS+VVTSSRRVAVPELLVRSLPEFT   AAA      EWVC+QLPTSGSLRKSMVECFR GH   PPT+ TLPYDF
Subjt:  PAVRIYSGEAVAAELERSKLEYLQASVVVTSSRRVAVPELLVRSLPEFTTTAAAA-----AEWVCHQLPTSGSLRKSMVECFRSGH---PPTIHTLPYDF

Query:  EFQYLLP
        EFQYLLP
Subjt:  EFQYLLP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G39690.1 Protein of unknown function, DUF5471.2e-5633.05Show/hide
Query:  QVKEVLAELAMVESEIARLELQITQLQKDLKTEQQQNSKSNNKATSFDTKALHFISKAIKGDYYALNNNNAQHHITHPQPLHQVQERLVSRKSGLLVASS
        Q  E++ ELA+VE+EI  L+ +I +L+  L +EQ+Q  +   + T  + K        ++     L ++  Q  ++H      +     +        + 
Subjt:  QVKEVLAELAMVESEIARLELQITQLQKDLKTEQQQNSKSNNKATSFDTKALHFISKAIKGDYYALNNNNAQHHITHPQPLHQVQERLVSRKSGLLVASS

Query:  PLRDPRHPSPKQRERNPLDMAGLKSVPIAIQAEENMQHWQPNKLSESIMKCLNFIYVRLLRTSRTMELEKSGPISRSLHSSLSSRSFRVENNGLNSSLPL
           D         E + +    +  V   ++  E      PN++SE ++ CL  IY+ L   S     +  G +S S   S  SR         +++   
Subjt:  PLRDPRHPSPKQRERNPLDMAGLKSVPIAIQAEENMQHWQPNKLSESIMKCLNFIYVRLLRTSRTMELEKSGPISRSLHSSLSSRSFRVENNGLNSSLPL

Query:  HKESRQQDPYAIF-DNEESIPRDIGPYKNLVIFTSTSMDPKSISSPSFIPLIRKLRALMSNLQRVDLRPLSYQQKLAFWINMYNACIMHGFLQYGVPSSP
        ++ +   DPY +  D+   + RDIGPYKN +  + +S+D    +     P + +L  LM  L  VDL  L+Y+QKLAFWIN+YNACIMH FL+YG+PSS 
Subjt:  HKESRQQDPYAIF-DNEESIPRDIGPYKNLVIFTSTSMDPKSISSPSFIPLIRKLRALMSNLQRVDLRPLSYQQKLAFWINMYNACIMHGFLQYGVPSSP

Query:  EKLAALMNKASNSI-----NAPAIEHYILKKPGSPNRED-EEKERVVGKLYGLESWEANATFGLCCGTRSSPAVRIYSGEAVAAELERSKLEYLQASVVV
         +L  LMNKAS ++     NA AIEH++L+ P  P  +  +EKE ++   YGL   E N TF LC G+ SSPA+R+Y+ + V  +L R+++EYL+ASV V
Subjt:  EKLAALMNKASNSI-----NAPAIEHYILKKPGSPNRED-EEKERVVGKLYGLESWEANATFGLCCGTRSSPAVRIYSGEAVAAELERSKLEYLQASVVV

Query:  TSSRRVAVPELLVRSLPEFTTTAAAAAEWVCHQLPTSGSLRKSMVECF-RSGHPP---TIHTLPYDFEFQYLLPL
        +S +++ VP+LL   + +F     +  EW+  QLP SG+L+  ++EC  R    P    +    Y  EF+YLL L
Subjt:  TSSRRVAVPELLVRSLPEFTTTAAAAAEWVCHQLPTSGSLRKSMVECF-RSGHPP---TIHTLPYDFEFQYLLPL

AT2G39690.2 Protein of unknown function, DUF5471.2e-5639.58Show/hide
Query:  PNKLSESIMKCLNFIYVRLLRTSRTMELEKSGPISRSLHSSLSSRSFRVENNGLNSSLPLHKESRQQDPYAIF-DNEESIPRDIGPYKNLVIFTSTSMDP
        PN++SE ++ CL  IY+ L   S     +  G +S S   S  SR         +++   ++ +   DPY +  D+   + RDIGPYKN +  + +S+D 
Subjt:  PNKLSESIMKCLNFIYVRLLRTSRTMELEKSGPISRSLHSSLSSRSFRVENNGLNSSLPLHKESRQQDPYAIF-DNEESIPRDIGPYKNLVIFTSTSMDP

Query:  KSISSPSFIPLIRKLRALMSNLQRVDLRPLSYQQKLAFWINMYNACIMHGFLQYGVPSSPEKLAALMNKASNSI-----NAPAIEHYILKKPGSPNRED-
           +     P + +L  LM  L  VDL  L+Y+QKLAFWIN+YNACIMH FL+YG+PSS  +L  LMNKAS ++     NA AIEH++L+ P  P  +D 
Subjt:  KSISSPSFIPLIRKLRALMSNLQRVDLRPLSYQQKLAFWINMYNACIMHGFLQYGVPSSPEKLAALMNKASNSI-----NAPAIEHYILKKPGSPNRED-

Query:  -EEKERVVGKLYGLESWEANATFGLCCGTRSSPAVRIYSGEAVAAELERSKLEYLQASVVVTSSRRVAVPELLVRSLPEFTTTAAAAAEWVCHQLPTSGS
         +EKE ++   YGL   E N TF LC G+ SSPA+R+Y+ + V  +L R+++EYL+ASV V+S +++ VP+LL   + +F     +  EW+  QLP SG+
Subjt:  -EEKERVVGKLYGLESWEANATFGLCCGTRSSPAVRIYSGEAVAAELERSKLEYLQASVVVTSSRRVAVPELLVRSLPEFTTTAAAAAEWVCHQLPTSGS

Query:  LRKSMVECF-RSGHPP---TIHTLPYDFEFQYLLPL
        L+  ++EC  R    P    +    Y  EF+YLL L
Subjt:  LRKSMVECF-RSGHPP---TIHTLPYDFEFQYLLPL

AT3G12540.1 Protein of unknown function, DUF5473.7e-4531.19Show/hide
Query:  QVKEVLAELAMVESEIARLELQITQLQKDLKTEQQQNSKSNNKATSFDTKALHFISKAIKGDYYALNNNNAQHHITHPQPLHQVQERLVSRKSGLLVASS
        QV+E++ ELA VE+EI  LE +I  L+ D+ +E+++N +  +     + + +    + ++   +   + +        + L Q + +  S     +V   
Subjt:  QVKEVLAELAMVESEIARLELQITQLQKDLKTEQQQNSKSNNKATSFDTKALHFISKAIKGDYYALNNNNAQHHITHPQPLHQVQERLVSRKSGLLVASS

Query:  PLRDPR-HPS--PKQRERNPLDMAGLKSVPIAIQAEENMQHWQPNKLSESIMKCLNFIYVRLLRTSRTMELEKSGPISRSLHSSLSSRSFRVENNGLNSS
         +  PR H S        + +  +         Q + N+Q   PN +SE ++KCL  IY+ L R+SR  E E S  +S+   + L + SF+        S
Subjt:  PLRDPR-HPS--PKQRERNPLDMAGLKSVPIAIQAEENMQHWQPNKLSESIMKCLNFIYVRLLRTSRTMELEKSGPISRSLHSSLSSRSFRVENNGLNSS

Query:  LPLHKESRQQDPYAIFDNEESIPRDIGPYKNLVIFTSTSMDPKSISSPSFIPLIRKLRALMSNLQRVDLRPLSYQQKLAFWINMYNACIMHGFLQYGVPS
        +  H  S   DPY          RDIG YKN +  T TS+D   +S  S    +  LR L   L +VDL  L++++K+AFWIN YNAC+M+GFL++G+PS
Subjt:  LPLHKESRQQDPYAIFDNEESIPRDIGPYKNLVIFTSTSMDPKSISSPSFIPLIRKLRALMSNLQRVDLRPLSYQQKLAFWINMYNACIMHGFLQYGVPS

Query:  SPEKLAALMNKAS-----NSINAPAIEHYILKKPGSPNREDE--EKERVVGKLYGLESWEANATFGLCCGTRSSPAVRIYSGEAVAAELERSKLEYLQAS
        S EKL  ++  A+       ++A  IE  IL+ P  P       E E  +   YG    E N  F LC G  SSPA+R+Y+ E V  EL +++ EYL+AS
Subjt:  SPEKLAALMNKAS-----NSINAPAIEHYILKKPGSPNREDE--EKERVVGKLYGLESWEANATFGLCCGTRSSPAVRIYSGEAVAAELERSKLEYLQAS

Query:  VVVTSSRRVAVPELLVRSLPEFTTTAAAAAEWVCHQLPTSG---SLRKSMVECF----RSGHPPTIHTLPYDFEFQYLLPL
        + V+  +++ +P  L + L +F     +  EW+C QLP +     L+++ +E       S     I    +++EF+YLL L
Subjt:  VVVTSSRRVAVPELLVRSLPEFTTTAAAAAEWVCHQLPTSG---SLRKSMVECF----RSGHPPTIHTLPYDFEFQYLLPL

AT5G42690.2 Protein of unknown function, DUF5472.0e-4330Show/hide
Query:  VKEVLAELAMVESEIARLELQITQLQKDLKTEQQQNSKSNNKATSFDTKALHFISKAIKGDYYALNNNNAQHHITHPQPLHQVQERLVSRKSGLLVASSP
        V E+LAE+A++E E+ RLE  I   +++L  E    S S            H+ +K+      A     ++  ++       V  +    K       +P
Subjt:  VKEVLAELAMVESEIARLELQITQLQKDLKTEQQQNSKSNNKATSFDTKALHFISKAIKGDYYALNNNNAQHHITHPQPLHQVQERLVSRKSGLLVASSP

Query:  LRD---PRHPSPKQRERNPLDMAGLKSVPIAIQAEENMQHWQPNKLSESIMKCLNFIYVRLLRTSRTMELEKSGPISRSLHSSLSSRSFRVENNGLNSSL
        ++          K  E   L     +      +   +    +PNK+SE ++KCL+ I++R+    R+M  +                    EN+      
Subjt:  LRD---PRHPSPKQRERNPLDMAGLKSVPIAIQAEENMQHWQPNKLSESIMKCLNFIYVRLLRTSRTMELEKSGPISRSLHSSLSSRSFRVENNGLNSSL

Query:  PLHKESRQQDPYAIFDNEESIPRDIGPYKNLVIFTSTSMDPKSISSPSFIPLIRKLRALMSNLQRVDLRPLSYQQKLAFWINMYNACIMHGFLQYGVPSS
           K++  +DPY I  +     RDIG YKN       S++    SS S   LIR+L+ L+  L  V+++ L+ Q+KLAFWIN+YN+C+M+GFL++G+P S
Subjt:  PLHKESRQQDPYAIFDNEESIPRDIGPYKNLVIFTSTSMDPKSISSPSFIPLIRKLRALMSNLQRVDLRPLSYQQKLAFWINMYNACIMHGFLQYGVPSS

Query:  PEKLA----ALMNKASNSINAPAIEHYILKKPG-----SPNREDEEKERVVGKLYGLESWEANATFGLCCGTRSSPAVRIYSGEAVAAELERSKLEYLQA
        P+ +     A +N   + +NA  IEH+IL+ P      SP +  ++ E  V   +GLE  E   TF L CG+ SSPAVR+Y+   V  ELE +K EYL+A
Subjt:  PEKLA----ALMNKASNSINAPAIEHYILKKPG-----SPNREDEEKERVVGKLYGLESWEANATFGLCCGTRSSPAVRIYSGEAVAAELERSKLEYLQA

Query:  SVVVTSSRRVAVPELLVRSLPEFTTTAAAAAEWVCHQLPTSGSLRKSMVECFRSGHPPT-----IHTLPYDFEFQYLLPL
        SV + S  ++ +P+L+     +F     +  +W+  QLPT   L K  + C   G   +     +H +PYDF F+YL  +
Subjt:  SVVVTSSRRVAVPELLVRSLPEFTTTAAAAAEWVCHQLPTSGSLRKSMVECFRSGHPPT-----IHTLPYDFEFQYLLPL

AT5G60720.1 Protein of unknown function, DUF5471.8e-12146.65Show/hide
Query:  QVKEVLAELAMVESEIARLELQITQLQKDLKTEQQQNSK----------------------------------------------------SNNKAT---
        ++KE++ EL++VE EI+RLE+QI+ LQ +LK EQ +  K                                                    +N K+T   
Subjt:  QVKEVLAELAMVESEIARLELQITQLQKDLKTEQQQNSK----------------------------------------------------SNNKAT---

Query:  -----SFDTKALHFISKAIKGDYYALNNNNAQHHI------THPQPLHQ--VQERLVSRKSGLLVASSPLRDPRHPSPKQRERN-------PLDMAGLK-
             +F TK LHFI+KAIKGDY   +   +   +       H    H+  VQE +  +K   + + SPLR+PR+ SP +  ++        LD+     
Subjt:  -----SFDTKALHFISKAIKGDYYALNNNNAQHHI------THPQPLHQ--VQERLVSRKSGLLVASSPLRDPRHPSPKQRERN-------PLDMAGLK-

Query:  SVPIAIQAEENMQHWQPNKLSESIMKCLNFIYVRLLRTSRTMELEKSGPISRSLHSSLSSRSFRVEN--NGLNSSLPL--HKESRQQDPYAIFDNEESIP
        S  I ++  +N+Q W PNKL+E+IMKCLNFIYVRLLRT+R MELEK+GPISRS + SLSSRSFRV+N  + L+ S+ L  +KESRQQDPY IFD E S+ 
Subjt:  SVPIAIQAEENMQHWQPNKLSESIMKCLNFIYVRLLRTSRTMELEKSGPISRSLHSSLSSRSFRVEN--NGLNSSLPL--HKESRQQDPYAIFDNEESIP

Query:  RDIGPYKNLVIFTSTSMDPKSISSPSFIPLIRKLRALMSNLQRVDLRPLSYQQKLAFWINMYNACIMHGFLQYGVPSSPEKLAAL------MNKASNSIN
        RDIGPYKNLVIFTS+SMD K ISS S + LI+KLR LM+NL+ VDL+ LS+QQKLAFWINM+NAC+MHG+LQ+GVP + E+L +L      MN    +I+
Subjt:  RDIGPYKNLVIFTSTSMDPKSISSPSFIPLIRKLRALMSNLQRVDLRPLSYQQKLAFWINMYNACIMHGFLQYGVPSSPEKLAAL------MNKASNSIN

Query:  APAIEHYILKK-PGSPNREDEEKERVVGKLYGLESWEANATFGLCCGTRSSPAVRIYSGEAVAAELERSKLEYLQASVVVTSSRRVAVPELLVRSLPEFT
        A  IEH IL+K   S   +D  +E ++ KLYG+E+ + N TF L CGTRSSPAVRIY+GE V  ELE+SKLEYLQAS+VVT+++R+ +PELL++   +F 
Subjt:  APAIEHYILKK-PGSPNREDEEKERVVGKLYGLESWEANATFGLCCGTRSSPAVRIYSGEAVAAELERSKLEYLQASVVVTSSRRVAVPELLVRSLPEFT

Query:  TTAA-----------AAAEWVCHQLPTSGSLRKSMVECFR------SGHPPTIHTLPYDFEFQYLLPL
           A           +  +WVC+QLPTSGSLRKSMV+CF+      S     +  +PYDFEFQYLL +
Subjt:  TTAA-----------AAAEWVCHQLPTSGSLRKSMVECFR------SGHPPTIHTLPYDFEFQYLLPL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAGGTGAAGGAAGTACTAGCAGAGCTAGCAATGGTGGAGAGTGAAATAGCAAGGCTTGAGCTCCAAATAACACAACTCCAAAAGGACTTGAAAACTGAGCAACAACA
AAATTCCAAGTCCAATAATAAAGCAACTAGTTTTGACACCAAGGCTCTCCATTTCATTAGCAAGGCCATCAAGGGAGATTATTATGCTCTCAACAACAACAATGCTCAAC
ATCACATCACTCATCCTCAGCCTCTTCATCAGGTCCAAGAAAGACTGGTTTCCCGAAAGAGCGGCCTCCTCGTAGCTTCGTCTCCGTTGCGAGACCCCCGACATCCTTCT
CCAAAGCAACGAGAGCGGAATCCATTAGACATGGCGGGGTTAAAATCGGTCCCAATAGCAATTCAAGCAGAAGAAAACATGCAGCATTGGCAACCCAACAAGCTATCAGA
GAGCATAATGAAGTGCTTAAACTTCATATATGTGAGACTACTAAGAACCTCAAGAACAATGGAGCTAGAGAAGTCAGGTCCCATTTCAAGATCTCTGCATTCCTCCCTGA
GCTCAAGAAGCTTCCGAGTCGAGAACAACGGCCTGAACTCGAGTCTTCCGCTACACAAAGAATCGAGGCAACAAGATCCTTACGCCATCTTCGACAACGAAGAGTCGATC
CCGAGGGACATTGGCCCTTACAAAAACTTGGTTATATTCACATCAACTTCCATGGACCCAAAATCCATATCAAGTCCAAGCTTCATCCCTCTCATAAGAAAGCTAAGGGC
CCTGATGAGCAATTTGCAAAGAGTTGATTTAAGGCCATTGAGTTACCAACAGAAACTAGCATTTTGGATCAACATGTACAATGCTTGTATCATGCATGGATTTCTTCAAT
ATGGAGTGCCTTCATCCCCAGAAAAACTTGCTGCTTTGATGAACAAGGCAAGCAACTCCATAAACGCACCAGCCATAGAGCATTACATTCTAAAGAAACCAGGGTCTCCT
AACAGGGAGGATGAGGAGAAAGAAAGGGTGGTTGGGAAGCTGTACGGGCTAGAATCGTGGGAGGCGAACGCGACATTCGGGCTGTGCTGCGGGACCCGTTCTTCTCCGGC
GGTGAGAATATACAGCGGCGAGGCGGTGGCGGCGGAGCTGGAGAGATCGAAGCTGGAGTATCTGCAGGCCTCGGTGGTGGTGACCAGCTCCAGAAGGGTGGCAGTGCCGG
AGCTTCTGGTCCGGAGTCTGCCGGAGTTCACGACAACGGCGGCGGCGGCGGCGGAGTGGGTGTGCCACCAGCTGCCGACCTCCGGGAGTCTGAGGAAATCCATGGTTGAG
TGCTTCAGATCAGGCCATCCCCCCACCATCCACACTCTCCCTTATGATTTCGAGTTTCAATATCTCTTGCCTTTG
mRNA sequenceShow/hide mRNA sequence
ATGCAGGTGAAGGAAGTACTAGCAGAGCTAGCAATGGTGGAGAGTGAAATAGCAAGGCTTGAGCTCCAAATAACACAACTCCAAAAGGACTTGAAAACTGAGCAACAACA
AAATTCCAAGTCCAATAATAAAGCAACTAGTTTTGACACCAAGGCTCTCCATTTCATTAGCAAGGCCATCAAGGGAGATTATTATGCTCTCAACAACAACAATGCTCAAC
ATCACATCACTCATCCTCAGCCTCTTCATCAGGTCCAAGAAAGACTGGTTTCCCGAAAGAGCGGCCTCCTCGTAGCTTCGTCTCCGTTGCGAGACCCCCGACATCCTTCT
CCAAAGCAACGAGAGCGGAATCCATTAGACATGGCGGGGTTAAAATCGGTCCCAATAGCAATTCAAGCAGAAGAAAACATGCAGCATTGGCAACCCAACAAGCTATCAGA
GAGCATAATGAAGTGCTTAAACTTCATATATGTGAGACTACTAAGAACCTCAAGAACAATGGAGCTAGAGAAGTCAGGTCCCATTTCAAGATCTCTGCATTCCTCCCTGA
GCTCAAGAAGCTTCCGAGTCGAGAACAACGGCCTGAACTCGAGTCTTCCGCTACACAAAGAATCGAGGCAACAAGATCCTTACGCCATCTTCGACAACGAAGAGTCGATC
CCGAGGGACATTGGCCCTTACAAAAACTTGGTTATATTCACATCAACTTCCATGGACCCAAAATCCATATCAAGTCCAAGCTTCATCCCTCTCATAAGAAAGCTAAGGGC
CCTGATGAGCAATTTGCAAAGAGTTGATTTAAGGCCATTGAGTTACCAACAGAAACTAGCATTTTGGATCAACATGTACAATGCTTGTATCATGCATGGATTTCTTCAAT
ATGGAGTGCCTTCATCCCCAGAAAAACTTGCTGCTTTGATGAACAAGGCAAGCAACTCCATAAACGCACCAGCCATAGAGCATTACATTCTAAAGAAACCAGGGTCTCCT
AACAGGGAGGATGAGGAGAAAGAAAGGGTGGTTGGGAAGCTGTACGGGCTAGAATCGTGGGAGGCGAACGCGACATTCGGGCTGTGCTGCGGGACCCGTTCTTCTCCGGC
GGTGAGAATATACAGCGGCGAGGCGGTGGCGGCGGAGCTGGAGAGATCGAAGCTGGAGTATCTGCAGGCCTCGGTGGTGGTGACCAGCTCCAGAAGGGTGGCAGTGCCGG
AGCTTCTGGTCCGGAGTCTGCCGGAGTTCACGACAACGGCGGCGGCGGCGGCGGAGTGGGTGTGCCACCAGCTGCCGACCTCCGGGAGTCTGAGGAAATCCATGGTTGAG
TGCTTCAGATCAGGCCATCCCCCCACCATCCACACTCTCCCTTATGATTTCGAGTTTCAATATCTCTTGCCTTTG
Protein sequenceShow/hide protein sequence
MQVKEVLAELAMVESEIARLELQITQLQKDLKTEQQQNSKSNNKATSFDTKALHFISKAIKGDYYALNNNNAQHHITHPQPLHQVQERLVSRKSGLLVASSPLRDPRHPS
PKQRERNPLDMAGLKSVPIAIQAEENMQHWQPNKLSESIMKCLNFIYVRLLRTSRTMELEKSGPISRSLHSSLSSRSFRVENNGLNSSLPLHKESRQQDPYAIFDNEESI
PRDIGPYKNLVIFTSTSMDPKSISSPSFIPLIRKLRALMSNLQRVDLRPLSYQQKLAFWINMYNACIMHGFLQYGVPSSPEKLAALMNKASNSINAPAIEHYILKKPGSP
NREDEEKERVVGKLYGLESWEANATFGLCCGTRSSPAVRIYSGEAVAAELERSKLEYLQASVVVTSSRRVAVPELLVRSLPEFTTTAAAAAEWVCHQLPTSGSLRKSMVE
CFRSGHPPTIHTLPYDFEFQYLLPL