; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10018223 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10018223
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionProtein of unknown function, DUF547
Genome locationChr04:1873875..1879653
RNA-Seq ExpressionHG10018223
SyntenyHG10018223
Gene Ontology termsGO:0016853 - isomerase activity (molecular function)
InterPro domainsIPR006869 - Domain of unknown function DUF547


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0051762.1 Topoisomerase 1-associated factor 1 [Cucumis melo var. makuwa]2.2e-23987.94Show/hide
Query:  MQVKEVLAELAMVESEIARLEIQITQLQKDLKTEQQQTTKSKQWSSE-QPQTNNNDNNPNNKPPLNWNPISKSTFDTKALHFISKAIKGDYALN-HFKLD
        MQVKE+LAELAMVESEIARLEIQITQL+KDLK EQQ TTKSKQWSSE QPQTNN     NNKPPLNWNPISK+TFDTKALHFISKAIKGDYALN HFKLD
Subjt:  MQVKEVLAELAMVESEIARLEIQITQLQKDLKTEQQQTTKSKQWSSE-QPQTNNNDNNPNNKPPLNWNPISKSTFDTKALHFISKAIKGDYALN-HFKLD

Query:  NPKISSELDPRDTKDTHHLLHEVKLHER-VSRKSGLLVASSPLRDPRHPSPKQRERNPLDMPPPKSMSMPIQQEENIQNWHPSKLSESIMKCLNFIYVRL
        N K ++ELDPRD KD+HH LHEVKLHER VSRKSGLLVASSPLRDPRHPSPKQRERNPLD+P PKSM M  Q EENIQNWHP+KLSESIMKCLNFIYVRL
Subjt:  NPKISSELDPRDTKDTHHLLHEVKLHER-VSRKSGLLVASSPLRDPRHPSPKQRERNPLDMPPPKSMSMPIQQEENIQNWHPSKLSESIMKCLNFIYVRL

Query:  LRASRTMELEKSGPISRSLHYSSLSSRSFRVENGLNSSLSVHKELRQQDPYGIFENEESIPRDIGPYKNLVIFTSTSMDPKSISSATFIPLIRKLRVLMS
        LRASRTMELEKSGPISRSLHYSSLSSRSFRVENGLNSSLS HKELRQQDPYGIFENEESIPRDIGPYKNLVIFTSTSMDPKSISSATFIPL+RKLRVLMS
Subjt:  LRASRTMELEKSGPISRSLHYSSLSSRSFRVENGLNSSLSVHKELRQQDPYGIFENEESIPRDIGPYKNLVIFTSTSMDPKSISSATFIPLIRKLRVLMS

Query:  NLQKVDLRPLSYQQKLAFWINMYNACIMN--------------------AMVNIGGNTINAQAIEHYILRKPMSSNKEDDNKEAVVRKLYGLESSEPNVT
        NLQKVDLRPLSYQQKLAFWINMYNACIMN                    AMVNIGGNTINAQAI+HYILRKPMS N EDDNKEA+VRKLYGLESSEPNVT
Subjt:  NLQKVDLRPLSYQQKLAFWINMYNACIMN--------------------AMVNIGGNTINAQAIEHYILRKPMSSNKEDDNKEAVVRKLYGLESSEPNVT

Query:  FALCCGTRSSPAVRIYSGESVAAELERSKLEYLQASVVVTSSKRVAVPELLVRSLPEFSAAADMKAVVEWVCHQLPTSGSLRKSMVECFRGHPKTHPTID
        FALCCGTRSSPAVRIYSGE V AELERSKLEYLQASVVVTSSKRVAVPELL+RSLPEFS +ADMK VVEWVCHQLPTSGSLRKS+VECFRGHPKT PTID
Subjt:  FALCCGTRSSPAVRIYSGESVAAELERSKLEYLQASVVVTSSKRVAVPELLVRSLPEFSAAADMKAVVEWVCHQLPTSGSLRKSMVECFRGHPKTHPTID

Query:  TLPYDFEFQYLLPL
        TL YDFEFQYLLPL
Subjt:  TLPYDFEFQYLLPL

XP_004139551.1 uncharacterized protein LOC101221529 [Cucumis sativus]9.1e-23887.35Show/hide
Query:  QVKEVLAELAMVESEIARLEIQITQLQKDLKTEQQQTTKSKQWSSE-QPQTNNNDNNPNNKPPLNWNPISKSTFDTKALHFISKAIKGDYA-LN-HFKLD
        +VKE+LAELAMVESEIARLEIQITQLQKDLK EQQQTTKSKQWSSE QPQTN      NNKPPLNWNPISK+TFDTKALHFISKAIKGDYA LN HFKLD
Subjt:  QVKEVLAELAMVESEIARLEIQITQLQKDLKTEQQQTTKSKQWSSE-QPQTNNNDNNPNNKPPLNWNPISKSTFDTKALHFISKAIKGDYA-LN-HFKLD

Query:  NPKISSELDPRDTKDTHHLLHEVKLHER-VSRKSGLLVASSPLRDPRHPSPKQRERNPLDMPPPKSMSMPIQQEENIQNWHPSKLSESIMKCLNFIYVRL
          K ++ELDPRD KD+HH LHEVKLHER VSRKSGLLVASSPLRDPRHPSPKQRERNPLD+P PKS+ M  Q EENIQNWHP+KLSESIMKCLNFIYVRL
Subjt:  NPKISSELDPRDTKDTHHLLHEVKLHER-VSRKSGLLVASSPLRDPRHPSPKQRERNPLDMPPPKSMSMPIQQEENIQNWHPSKLSESIMKCLNFIYVRL

Query:  LRASRTMELEKSGPISRSLHYSSLSSRSFRVENGLNSSLSVHKELRQQDPYGIFENEESIPRDIGPYKNLVIFTSTSMDPKSISSATFIPLIRKLRVLMS
        LRASRTMELEKSGPISRSLHYSSLSSRSFRVENGLNSSLS HKELRQQDPYGIFENEES+PRDIGPYKNLVIFTSTSMDPKSISSATFIPL+RKLRVLMS
Subjt:  LRASRTMELEKSGPISRSLHYSSLSSRSFRVENGLNSSLSVHKELRQQDPYGIFENEESIPRDIGPYKNLVIFTSTSMDPKSISSATFIPLIRKLRVLMS

Query:  NLQKVDLRPLSYQQKLAFWINMYNACIMN--------------------AMVNIGGNTINAQAIEHYILRKPMSSNKEDDNKEAVVRKLYGLESSEPNVT
        NLQKVDLRPLSYQQKLAFWINMYNACIMN                    AM+N+GGNTINAQAI+HYILRKPMS NKEDDNKEA+VRKLYGLESSEPNVT
Subjt:  NLQKVDLRPLSYQQKLAFWINMYNACIMN--------------------AMVNIGGNTINAQAIEHYILRKPMSSNKEDDNKEAVVRKLYGLESSEPNVT

Query:  FALCCGTRSSPAVRIYSGESVAAELERSKLEYLQASVVVTSSKRVAVPELLVRSLPEFSAAADMKAVVEWVCHQLPTSGSLRKSMVECFRGHPKTHPTID
        FALCCGTRSSPAVRIYSGE V  ELERSKLEYLQASVVVTSSKRVAVPELLVRSLPEFS +ADMK VVEWVCHQLPTSGSLRKSMVECFRGHPKT PTID
Subjt:  FALCCGTRSSPAVRIYSGESVAAELERSKLEYLQASVVVTSSKRVAVPELLVRSLPEFSAAADMKAVVEWVCHQLPTSGSLRKSMVECFRGHPKTHPTID

Query:  TLPYDFEFQYLLPL
        TLPYDFEFQYLLPL
Subjt:  TLPYDFEFQYLLPL

XP_008462917.1 PREDICTED: uncharacterized protein LOC103501181 isoform X1 [Cucumis melo]1.8e-23887.72Show/hide
Query:  QVKEVLAELAMVESEIARLEIQITQLQKDLKTEQQQTTKSKQWSSE-QPQTNNNDNNPNNKPPLNWNPISKSTFDTKALHFISKAIKGDYALN-HFKLDN
        +VKE+LAELAMVESEIARLEIQITQL+KDLK EQQ TTKSKQWSSE QPQTNN     NNKPPLNWNPISK+TFDTKALHFISKAIKGDYALN HFKLDN
Subjt:  QVKEVLAELAMVESEIARLEIQITQLQKDLKTEQQQTTKSKQWSSE-QPQTNNNDNNPNNKPPLNWNPISKSTFDTKALHFISKAIKGDYALN-HFKLDN

Query:  PKISSELDPRDTKDTHHLLHEVKLHER-VSRKSGLLVASSPLRDPRHPSPKQRERNPLDMPPPKSMSMPIQQEENIQNWHPSKLSESIMKCLNFIYVRLL
         K ++ELDPRD KD+HH LHEVKLHER VSRKSGLLVASSPLRDPRHPSPKQRERNPLD+P PKSM M  Q EENIQNWHP+KLSESIMKCLNFIYVRLL
Subjt:  PKISSELDPRDTKDTHHLLHEVKLHER-VSRKSGLLVASSPLRDPRHPSPKQRERNPLDMPPPKSMSMPIQQEENIQNWHPSKLSESIMKCLNFIYVRLL

Query:  RASRTMELEKSGPISRSLHYSSLSSRSFRVENGLNSSLSVHKELRQQDPYGIFENEESIPRDIGPYKNLVIFTSTSMDPKSISSATFIPLIRKLRVLMSN
        RASRTMELEKSGPISRSLHYSSLSSRSFRVENGLNSSLS HKELRQQDPYGIFENEESIPRDIGPYKNLVIFTSTSMDPKSISSATFIPL+RKLRVLMSN
Subjt:  RASRTMELEKSGPISRSLHYSSLSSRSFRVENGLNSSLSVHKELRQQDPYGIFENEESIPRDIGPYKNLVIFTSTSMDPKSISSATFIPLIRKLRVLMSN

Query:  LQKVDLRPLSYQQKLAFWINMYNACIMN--------------------AMVNIGGNTINAQAIEHYILRKPMSSNKEDDNKEAVVRKLYGLESSEPNVTF
        LQKVDLRPLSYQQKLAFWINMYNACIMN                    AMVNIGGNTINAQAI+HYILRKPMS N EDDNKEA+VRKLYGLESSEPNVTF
Subjt:  LQKVDLRPLSYQQKLAFWINMYNACIMN--------------------AMVNIGGNTINAQAIEHYILRKPMSSNKEDDNKEAVVRKLYGLESSEPNVTF

Query:  ALCCGTRSSPAVRIYSGESVAAELERSKLEYLQASVVVTSSKRVAVPELLVRSLPEFSAAADMKAVVEWVCHQLPTSGSLRKSMVECFRGHPKTHPTIDT
        ALCCGTRSSPAVRIYSGE V AELERSKLEYLQASVVVTSSKRVAVPELL+RSLPEFS +ADMK VVEWVCHQLPTSGSLRKS+VECFRGHPKT PTIDT
Subjt:  ALCCGTRSSPAVRIYSGESVAAELERSKLEYLQASVVVTSSKRVAVPELLVRSLPEFSAAADMKAVVEWVCHQLPTSGSLRKSMVECFRGHPKTHPTIDT

Query:  LPYDFEFQYLLPL
        L YDFEFQYLLPL
Subjt:  LPYDFEFQYLLPL

XP_008462925.1 PREDICTED: uncharacterized protein LOC103501181 isoform X2 [Cucumis melo]1.8e-23887.72Show/hide
Query:  QVKEVLAELAMVESEIARLEIQITQLQKDLKTEQQQTTKSKQWSSE-QPQTNNNDNNPNNKPPLNWNPISKSTFDTKALHFISKAIKGDYALN-HFKLDN
        +VKE+LAELAMVESEIARLEIQITQL+KDLK EQQ TTKSKQWSSE QPQTNN     NNKPPLNWNPISK+TFDTKALHFISKAIKGDYALN HFKLDN
Subjt:  QVKEVLAELAMVESEIARLEIQITQLQKDLKTEQQQTTKSKQWSSE-QPQTNNNDNNPNNKPPLNWNPISKSTFDTKALHFISKAIKGDYALN-HFKLDN

Query:  PKISSELDPRDTKDTHHLLHEVKLHER-VSRKSGLLVASSPLRDPRHPSPKQRERNPLDMPPPKSMSMPIQQEENIQNWHPSKLSESIMKCLNFIYVRLL
         K ++ELDPRD KD+HH LHEVKLHER VSRKSGLLVASSPLRDPRHPSPKQRERNPLD+P PKSM M  Q EENIQNWHP+KLSESIMKCLNFIYVRLL
Subjt:  PKISSELDPRDTKDTHHLLHEVKLHER-VSRKSGLLVASSPLRDPRHPSPKQRERNPLDMPPPKSMSMPIQQEENIQNWHPSKLSESIMKCLNFIYVRLL

Query:  RASRTMELEKSGPISRSLHYSSLSSRSFRVENGLNSSLSVHKELRQQDPYGIFENEESIPRDIGPYKNLVIFTSTSMDPKSISSATFIPLIRKLRVLMSN
        RASRTMELEKSGPISRSLHYSSLSSRSFRVENGLNSSLS HKELRQQDPYGIFENEESIPRDIGPYKNLVIFTSTSMDPKSISSATFIPL+RKLRVLMSN
Subjt:  RASRTMELEKSGPISRSLHYSSLSSRSFRVENGLNSSLSVHKELRQQDPYGIFENEESIPRDIGPYKNLVIFTSTSMDPKSISSATFIPLIRKLRVLMSN

Query:  LQKVDLRPLSYQQKLAFWINMYNACIMN--------------------AMVNIGGNTINAQAIEHYILRKPMSSNKEDDNKEAVVRKLYGLESSEPNVTF
        LQKVDLRPLSYQQKLAFWINMYNACIMN                    AMVNIGGNTINAQAI+HYILRKPMS N EDDNKEA+VRKLYGLESSEPNVTF
Subjt:  LQKVDLRPLSYQQKLAFWINMYNACIMN--------------------AMVNIGGNTINAQAIEHYILRKPMSSNKEDDNKEAVVRKLYGLESSEPNVTF

Query:  ALCCGTRSSPAVRIYSGESVAAELERSKLEYLQASVVVTSSKRVAVPELLVRSLPEFSAAADMKAVVEWVCHQLPTSGSLRKSMVECFRGHPKTHPTIDT
        ALCCGTRSSPAVRIYSGE V AELERSKLEYLQASVVVTSSKRVAVPELL+RSLPEFS +ADMK VVEWVCHQLPTSGSLRKS+VECFRGHPKT PTIDT
Subjt:  ALCCGTRSSPAVRIYSGESVAAELERSKLEYLQASVVVTSSKRVAVPELLVRSLPEFSAAADMKAVVEWVCHQLPTSGSLRKSMVECFRGHPKTHPTIDT

Query:  LPYDFEFQYLLPL
        L YDFEFQYLLPL
Subjt:  LPYDFEFQYLLPL

XP_038894153.1 uncharacterized protein LOC120082868 [Benincasa hispida]6.9e-24689.43Show/hide
Query:  QVKEVLAELAMVESEIARLEIQITQLQKDLKTEQQQ-TTKSKQWSSEQPQTNNNDNNPNNKPPLNWNPISKSTFDTKALHFISKAIKGDYALNHFKLDNP
        +VKEVLAELAMVESEIARLEIQITQLQKDLKTEQQ  TTKSKQWS EQPQTNN     NNKPP+ WNPIS++TFDTKALHFISKAIKGDYALNHFKLDN 
Subjt:  QVKEVLAELAMVESEIARLEIQITQLQKDLKTEQQQ-TTKSKQWSSEQPQTNNNDNNPNNKPPLNWNPISKSTFDTKALHFISKAIKGDYALNHFKLDNP

Query:  KISSELDPRDTKDTHHLLHEVKLHERVSRKSGLLVASSPLRDPRHPSPKQRERNPLDMPPPKSMSMPIQQEENIQNWHPSKLSESIMKCLNFIYVRLLRA
        K +SE  P DTKD HHLL EVKLHERVSRKSGLLVASSPLRDPRHPSPKQRER+ LDMPPPKSM MPIQ EENIQNWHP+KLSESIMKCLNF+YVRLLRA
Subjt:  KISSELDPRDTKDTHHLLHEVKLHERVSRKSGLLVASSPLRDPRHPSPKQRERNPLDMPPPKSMSMPIQQEENIQNWHPSKLSESIMKCLNFIYVRLLRA

Query:  SRTMELEKSGPISRSLHYSSLSSRSFRVENGLNSSLSVHKELRQQDPYGIFENEESIPRDIGPYKNLVIFTSTSMDPKSISSATFIPLIRKLRVLMSNLQ
        SRTMELEKSGPISRSLHYSSLSSRSFRVENGLNSSLSVHKELRQQDPYGIFENEESIPRDIGPYKNLVIFTSTSMDPKSISSATFIPLIRKLRVLMSNLQ
Subjt:  SRTMELEKSGPISRSLHYSSLSSRSFRVENGLNSSLSVHKELRQQDPYGIFENEESIPRDIGPYKNLVIFTSTSMDPKSISSATFIPLIRKLRVLMSNLQ

Query:  KVDLRPLSYQQKLAFWINMYNACIMN--------------------AMVNIGGNTINAQAIEHYILRKPMSSNKEDDNKEAVVRKLYGLESSEPNVTFAL
        KVDLRPLSYQQKLAFWINMYNACIMN                    AMVNIGGNTINAQAIEHYILRK MSSNKEDDNKEAVVRKLYGLESSEPNVTFAL
Subjt:  KVDLRPLSYQQKLAFWINMYNACIMN--------------------AMVNIGGNTINAQAIEHYILRKPMSSNKEDDNKEAVVRKLYGLESSEPNVTFAL

Query:  CCGTRSSPAVRIYSGESVAAELERSKLEYLQASVVVTSSKRVAVPELLVRSLPEFSAAADMKAVVEWVCHQLPTSGSLRKSMVECFRGHPKTHPTIDTLP
        CCGTRSSPAVRIYSGE+VAAELERSKLEYLQASVVVT+S+RVAVPELLVRSLPEF+AAADMKAVVEWVCHQLPTSGSLRKSMVECFR HPKT PTIDTLP
Subjt:  CCGTRSSPAVRIYSGESVAAELERSKLEYLQASVVVTSSKRVAVPELLVRSLPEFSAAADMKAVVEWVCHQLPTSGSLRKSMVECFRGHPKTHPTIDTLP

Query:  YDFEFQYLLPL
        YDFEFQYLLPL
Subjt:  YDFEFQYLLPL

TrEMBL top hitse value%identityAlignment
A0A0A0LSP4 Uncharacterized protein8.6e-22684.93Show/hide
Query:  QVKEVLAELAMVESEIARLEIQITQLQKDLKTEQQQTTKSKQWSSE-QPQTNNNDNNPNNKPPLNWNPISKSTFDTKALHFISKAIKGDYA-LN-HFKLD
        +VKE+LAELAMVESEIARLEIQITQLQKDLK EQQQTTKSKQWSSE QPQTN      NNKPPLNWNPISK+TFDTKALHFISKAIKGDYA LN HFKLD
Subjt:  QVKEVLAELAMVESEIARLEIQITQLQKDLKTEQQQTTKSKQWSSE-QPQTNNNDNNPNNKPPLNWNPISKSTFDTKALHFISKAIKGDYA-LN-HFKLD

Query:  NPKISSELDPRDTKDTHHLLHEVKLHER-VSRKSGLLVASSPLRDPRHPSPKQRERNPLDMPPPKSMSMPIQQEENIQNWHPSKLSESIMKCLNFIYVRL
          K ++ELDPRD KD+HH LHEVKLHER VSRKSGLLVASSPLRDPRHPSPKQRERNPLD+P PKS+ M  Q EENIQNWHP+KLSESIMKCLNFIYVRL
Subjt:  NPKISSELDPRDTKDTHHLLHEVKLHER-VSRKSGLLVASSPLRDPRHPSPKQRERNPLDMPPPKSMSMPIQQEENIQNWHPSKLSESIMKCLNFIYVRL

Query:  LRASRTMELEKSGPISRSLHYSSLSSRSFRVENGLNSSLSVHKELRQQDPYGIFENEESIPRDIGPYKNLVIFTSTSMDPKSISSATFIPLIRKLRVLMS
        LRASRTMELEKSGPISRSLHYSSLSSRSFRVENGLNSSLS HKELRQQDPYGIFENEES+PRDIGPYKNLVIFTSTSMDPKSISSATFIPL+RKLRVLMS
Subjt:  LRASRTMELEKSGPISRSLHYSSLSSRSFRVENGLNSSLSVHKELRQQDPYGIFENEESIPRDIGPYKNLVIFTSTSMDPKSISSATFIPLIRKLRVLMS

Query:  NLQKVDLRPLSYQQKLAFWINMYNACIMNA-------------MVNIGG--NTINAQAIEHYIL--RKPMSSNKEDDNKEAVVRKLYGLESSEPNVTFAL
        NLQKVDLRPLSYQQKLAFWINMYNACIMN               V I    ++  ++   H+    +KPMS NKEDDNKEA+VRKLYGLESSEPNVTFAL
Subjt:  NLQKVDLRPLSYQQKLAFWINMYNACIMNA-------------MVNIGG--NTINAQAIEHYIL--RKPMSSNKEDDNKEAVVRKLYGLESSEPNVTFAL

Query:  CCGTRSSPAVRIYSGESVAAELERSKLEYLQASVVVTSSKRVAVPELLVRSLPEFSAAADMKAVVEWVCHQLPTSGSLRKSMVECFRGHPKTHPTIDTLP
        CCGTRSSPAVRIYSGE V  ELERSKLEYLQASVVVTSSKRVAVPELLVRSLPEFS +ADMK VVEWVCHQLPTSGSLRKSMVECFRGHPKT PTIDTLP
Subjt:  CCGTRSSPAVRIYSGESVAAELERSKLEYLQASVVVTSSKRVAVPELLVRSLPEFSAAADMKAVVEWVCHQLPTSGSLRKSMVECFRGHPKTHPTIDTLP

Query:  YDFEFQYLLPL
        YDFEFQYLLPL
Subjt:  YDFEFQYLLPL

A0A1S3CHZ3 uncharacterized protein LOC103501181 isoform X18.8e-23987.72Show/hide
Query:  QVKEVLAELAMVESEIARLEIQITQLQKDLKTEQQQTTKSKQWSSE-QPQTNNNDNNPNNKPPLNWNPISKSTFDTKALHFISKAIKGDYALN-HFKLDN
        +VKE+LAELAMVESEIARLEIQITQL+KDLK EQQ TTKSKQWSSE QPQTNN     NNKPPLNWNPISK+TFDTKALHFISKAIKGDYALN HFKLDN
Subjt:  QVKEVLAELAMVESEIARLEIQITQLQKDLKTEQQQTTKSKQWSSE-QPQTNNNDNNPNNKPPLNWNPISKSTFDTKALHFISKAIKGDYALN-HFKLDN

Query:  PKISSELDPRDTKDTHHLLHEVKLHER-VSRKSGLLVASSPLRDPRHPSPKQRERNPLDMPPPKSMSMPIQQEENIQNWHPSKLSESIMKCLNFIYVRLL
         K ++ELDPRD KD+HH LHEVKLHER VSRKSGLLVASSPLRDPRHPSPKQRERNPLD+P PKSM M  Q EENIQNWHP+KLSESIMKCLNFIYVRLL
Subjt:  PKISSELDPRDTKDTHHLLHEVKLHER-VSRKSGLLVASSPLRDPRHPSPKQRERNPLDMPPPKSMSMPIQQEENIQNWHPSKLSESIMKCLNFIYVRLL

Query:  RASRTMELEKSGPISRSLHYSSLSSRSFRVENGLNSSLSVHKELRQQDPYGIFENEESIPRDIGPYKNLVIFTSTSMDPKSISSATFIPLIRKLRVLMSN
        RASRTMELEKSGPISRSLHYSSLSSRSFRVENGLNSSLS HKELRQQDPYGIFENEESIPRDIGPYKNLVIFTSTSMDPKSISSATFIPL+RKLRVLMSN
Subjt:  RASRTMELEKSGPISRSLHYSSLSSRSFRVENGLNSSLSVHKELRQQDPYGIFENEESIPRDIGPYKNLVIFTSTSMDPKSISSATFIPLIRKLRVLMSN

Query:  LQKVDLRPLSYQQKLAFWINMYNACIMN--------------------AMVNIGGNTINAQAIEHYILRKPMSSNKEDDNKEAVVRKLYGLESSEPNVTF
        LQKVDLRPLSYQQKLAFWINMYNACIMN                    AMVNIGGNTINAQAI+HYILRKPMS N EDDNKEA+VRKLYGLESSEPNVTF
Subjt:  LQKVDLRPLSYQQKLAFWINMYNACIMN--------------------AMVNIGGNTINAQAIEHYILRKPMSSNKEDDNKEAVVRKLYGLESSEPNVTF

Query:  ALCCGTRSSPAVRIYSGESVAAELERSKLEYLQASVVVTSSKRVAVPELLVRSLPEFSAAADMKAVVEWVCHQLPTSGSLRKSMVECFRGHPKTHPTIDT
        ALCCGTRSSPAVRIYSGE V AELERSKLEYLQASVVVTSSKRVAVPELL+RSLPEFS +ADMK VVEWVCHQLPTSGSLRKS+VECFRGHPKT PTIDT
Subjt:  ALCCGTRSSPAVRIYSGESVAAELERSKLEYLQASVVVTSSKRVAVPELLVRSLPEFSAAADMKAVVEWVCHQLPTSGSLRKSMVECFRGHPKTHPTIDT

Query:  LPYDFEFQYLLPL
        L YDFEFQYLLPL
Subjt:  LPYDFEFQYLLPL

A0A1S3CJL7 uncharacterized protein LOC103501181 isoform X28.8e-23987.72Show/hide
Query:  QVKEVLAELAMVESEIARLEIQITQLQKDLKTEQQQTTKSKQWSSE-QPQTNNNDNNPNNKPPLNWNPISKSTFDTKALHFISKAIKGDYALN-HFKLDN
        +VKE+LAELAMVESEIARLEIQITQL+KDLK EQQ TTKSKQWSSE QPQTNN     NNKPPLNWNPISK+TFDTKALHFISKAIKGDYALN HFKLDN
Subjt:  QVKEVLAELAMVESEIARLEIQITQLQKDLKTEQQQTTKSKQWSSE-QPQTNNNDNNPNNKPPLNWNPISKSTFDTKALHFISKAIKGDYALN-HFKLDN

Query:  PKISSELDPRDTKDTHHLLHEVKLHER-VSRKSGLLVASSPLRDPRHPSPKQRERNPLDMPPPKSMSMPIQQEENIQNWHPSKLSESIMKCLNFIYVRLL
         K ++ELDPRD KD+HH LHEVKLHER VSRKSGLLVASSPLRDPRHPSPKQRERNPLD+P PKSM M  Q EENIQNWHP+KLSESIMKCLNFIYVRLL
Subjt:  PKISSELDPRDTKDTHHLLHEVKLHER-VSRKSGLLVASSPLRDPRHPSPKQRERNPLDMPPPKSMSMPIQQEENIQNWHPSKLSESIMKCLNFIYVRLL

Query:  RASRTMELEKSGPISRSLHYSSLSSRSFRVENGLNSSLSVHKELRQQDPYGIFENEESIPRDIGPYKNLVIFTSTSMDPKSISSATFIPLIRKLRVLMSN
        RASRTMELEKSGPISRSLHYSSLSSRSFRVENGLNSSLS HKELRQQDPYGIFENEESIPRDIGPYKNLVIFTSTSMDPKSISSATFIPL+RKLRVLMSN
Subjt:  RASRTMELEKSGPISRSLHYSSLSSRSFRVENGLNSSLSVHKELRQQDPYGIFENEESIPRDIGPYKNLVIFTSTSMDPKSISSATFIPLIRKLRVLMSN

Query:  LQKVDLRPLSYQQKLAFWINMYNACIMN--------------------AMVNIGGNTINAQAIEHYILRKPMSSNKEDDNKEAVVRKLYGLESSEPNVTF
        LQKVDLRPLSYQQKLAFWINMYNACIMN                    AMVNIGGNTINAQAI+HYILRKPMS N EDDNKEA+VRKLYGLESSEPNVTF
Subjt:  LQKVDLRPLSYQQKLAFWINMYNACIMN--------------------AMVNIGGNTINAQAIEHYILRKPMSSNKEDDNKEAVVRKLYGLESSEPNVTF

Query:  ALCCGTRSSPAVRIYSGESVAAELERSKLEYLQASVVVTSSKRVAVPELLVRSLPEFSAAADMKAVVEWVCHQLPTSGSLRKSMVECFRGHPKTHPTIDT
        ALCCGTRSSPAVRIYSGE V AELERSKLEYLQASVVVTSSKRVAVPELL+RSLPEFS +ADMK VVEWVCHQLPTSGSLRKS+VECFRGHPKT PTIDT
Subjt:  ALCCGTRSSPAVRIYSGESVAAELERSKLEYLQASVVVTSSKRVAVPELLVRSLPEFSAAADMKAVVEWVCHQLPTSGSLRKSMVECFRGHPKTHPTIDT

Query:  LPYDFEFQYLLPL
        L YDFEFQYLLPL
Subjt:  LPYDFEFQYLLPL

A0A5A7U7M8 Topoisomerase 1-associated factor 11.0e-23987.94Show/hide
Query:  MQVKEVLAELAMVESEIARLEIQITQLQKDLKTEQQQTTKSKQWSSE-QPQTNNNDNNPNNKPPLNWNPISKSTFDTKALHFISKAIKGDYALN-HFKLD
        MQVKE+LAELAMVESEIARLEIQITQL+KDLK EQQ TTKSKQWSSE QPQTNN     NNKPPLNWNPISK+TFDTKALHFISKAIKGDYALN HFKLD
Subjt:  MQVKEVLAELAMVESEIARLEIQITQLQKDLKTEQQQTTKSKQWSSE-QPQTNNNDNNPNNKPPLNWNPISKSTFDTKALHFISKAIKGDYALN-HFKLD

Query:  NPKISSELDPRDTKDTHHLLHEVKLHER-VSRKSGLLVASSPLRDPRHPSPKQRERNPLDMPPPKSMSMPIQQEENIQNWHPSKLSESIMKCLNFIYVRL
        N K ++ELDPRD KD+HH LHEVKLHER VSRKSGLLVASSPLRDPRHPSPKQRERNPLD+P PKSM M  Q EENIQNWHP+KLSESIMKCLNFIYVRL
Subjt:  NPKISSELDPRDTKDTHHLLHEVKLHER-VSRKSGLLVASSPLRDPRHPSPKQRERNPLDMPPPKSMSMPIQQEENIQNWHPSKLSESIMKCLNFIYVRL

Query:  LRASRTMELEKSGPISRSLHYSSLSSRSFRVENGLNSSLSVHKELRQQDPYGIFENEESIPRDIGPYKNLVIFTSTSMDPKSISSATFIPLIRKLRVLMS
        LRASRTMELEKSGPISRSLHYSSLSSRSFRVENGLNSSLS HKELRQQDPYGIFENEESIPRDIGPYKNLVIFTSTSMDPKSISSATFIPL+RKLRVLMS
Subjt:  LRASRTMELEKSGPISRSLHYSSLSSRSFRVENGLNSSLSVHKELRQQDPYGIFENEESIPRDIGPYKNLVIFTSTSMDPKSISSATFIPLIRKLRVLMS

Query:  NLQKVDLRPLSYQQKLAFWINMYNACIMN--------------------AMVNIGGNTINAQAIEHYILRKPMSSNKEDDNKEAVVRKLYGLESSEPNVT
        NLQKVDLRPLSYQQKLAFWINMYNACIMN                    AMVNIGGNTINAQAI+HYILRKPMS N EDDNKEA+VRKLYGLESSEPNVT
Subjt:  NLQKVDLRPLSYQQKLAFWINMYNACIMN--------------------AMVNIGGNTINAQAIEHYILRKPMSSNKEDDNKEAVVRKLYGLESSEPNVT

Query:  FALCCGTRSSPAVRIYSGESVAAELERSKLEYLQASVVVTSSKRVAVPELLVRSLPEFSAAADMKAVVEWVCHQLPTSGSLRKSMVECFRGHPKTHPTID
        FALCCGTRSSPAVRIYSGE V AELERSKLEYLQASVVVTSSKRVAVPELL+RSLPEFS +ADMK VVEWVCHQLPTSGSLRKS+VECFRGHPKT PTID
Subjt:  FALCCGTRSSPAVRIYSGESVAAELERSKLEYLQASVVVTSSKRVAVPELLVRSLPEFSAAADMKAVVEWVCHQLPTSGSLRKSMVECFRGHPKTHPTID

Query:  TLPYDFEFQYLLPL
        TL YDFEFQYLLPL
Subjt:  TLPYDFEFQYLLPL

A0A6J1G939 uncharacterized protein LOC111452055 isoform X13.1e-21579.69Show/hide
Query:  QVKEVLAELAMVESEIARLEIQITQLQKDLKTEQQQTTKSKQWSSEQPQTNNNDNNPNNKPPLNWNPISKSTFDTKALHFISKAIKGDYALNHFKLDNPK
        +VKE+LAELAMVESEI RLEIQIT+LQKDLK+E+Q+ T+SK WSSEQP      NN   KPPLNWNPISK TFDTK LHFISKAIKGDYALN F+LD   
Subjt:  QVKEVLAELAMVESEIARLEIQITQLQKDLKTEQQQTTKSKQWSSEQPQTNNNDNNPNNKPPLNWNPISKSTFDTKALHFISKAIKGDYALNHFKLDNPK

Query:  ISSELDPRDTKDTHHLLHEVKLHERVSRKSGLLVASSPLRDPRHPSPKQRERNPLDMPPPKSMSMPIQQEENIQNWHPSKLSESIMKCLNFIYVRLLRAS
         +   D RD K TH    E+KL ERV RKSGLLV  SPLR+P+HPSPK+RER+PL MPP K +SMPIQ EENIQNWHP+KLSESI+KCLNFIYVRLLR S
Subjt:  ISSELDPRDTKDTHHLLHEVKLHERVSRKSGLLVASSPLRDPRHPSPKQRERNPLDMPPPKSMSMPIQQEENIQNWHPSKLSESIMKCLNFIYVRLLRAS

Query:  RTMELEKSGPISRSLHYSSLSSRSFRVENGLNSSLSVHKELRQQDPYGIFENEESIPRDIGPYKNLVIFTSTSMDPKSISSATFIPLIRKLRVLMSNLQK
        RTMELEKSGPISRSLH SSLSSRSFRVENGLNS LS+HKELRQQDPY IFENEESIPRDIGPYKNLVIFTSTSMDPKSI+SATFIPLI KLRVLMSNLQ 
Subjt:  RTMELEKSGPISRSLHYSSLSSRSFRVENGLNSSLSVHKELRQQDPYGIFENEESIPRDIGPYKNLVIFTSTSMDPKSISSATFIPLIRKLRVLMSNLQK

Query:  VDLRPLSYQQKLAFWINMYNACIMN--------------------AMVNIGGNTINAQAIEHYILRKPMSSNKEDDNKEAVVRKLYGLESSEPNVTFALC
        VDL+PL+YQQKLAFWINMYNACIMN                    AM+NIGGNTINAQAIEH+ILRKP S N ED+NKEAVVRKLYGLESS+PNVTFALC
Subjt:  VDLRPLSYQQKLAFWINMYNACIMN--------------------AMVNIGGNTINAQAIEHYILRKPMSSNKEDDNKEAVVRKLYGLESSEPNVTFALC

Query:  CGTRSSPAVRIYSGESVAAELERSKLEYLQASVVVTSSKRVAVPELLVRSLPEF---SAAADMKAVVEWVCHQLPTSGSLRKSMVECFRGHPKTHPTIDT
        CGTRSSPAVRIYSGE+V AELERSKLEYLQAS+VVTSS+RVAVPELLVRSLPEF   +AAADMKAVVEWVC+QLPTSGSLRKSMVECFRGH KT PT++T
Subjt:  CGTRSSPAVRIYSGESVAAELERSKLEYLQASVVVTSSKRVAVPELLVRSLPEF---SAAADMKAVVEWVCHQLPTSGSLRKSMVECFRGHPKTHPTIDT

Query:  LPYDFEFQYLLP
        LPYDFEFQYLLP
Subjt:  LPYDFEFQYLLP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G39690.1 Protein of unknown function, DUF5474.3e-5230.93Show/hide
Query:  QVKEVLAELAMVESEIARLEIQITQLQKDLKTEQQQTTKSKQWSSEQPQTNNNDNNPNNKPPLNWNPISKSTFDTKALHFISKAIKGDYALNHFKLDNPK
        Q  E++ ELA+VE+EI  L+ +I +L+  L +EQ+QT + +   +EQ +T    ++       +  P+          H   ++     +  H +L    
Subjt:  QVKEVLAELAMVESEIARLEIQITQLQKDLKTEQQQTTKSKQWSSEQPQTNNNDNNPNNKPPLNWNPISKSTFDTKALHFISKAIKGDYALNHFKLDNPK

Query:  ISSELDPRDTKDTHHLLHEVKLHERVSRKSGLLVASSPLRDPRHPSPKQRERNPLDMPPPKSMSMPIQQEENIQNWHPSKLSESIMKCLNFIYVRLLRAS
            LD   +         V   +    + GL +  +  +D                                    P+++SE ++ CL  IY+ L   S
Subjt:  ISSELDPRDTKDTHHLLHEVKLHERVSRKSGLLVASSPLRDPRHPSPKQRERNPLDMPPPKSMSMPIQQEENIQNWHPSKLSESIMKCLNFIYVRLLRAS

Query:  RTMELEKSGPISRSLHYSSLSSRSFRVENGLNSSLSVHKELRQQDPYGIF-ENEESIPRDIGPYKNLVIFTSTSMDPKSISSATFIPLIRKLRVLMSNLQ
             +  G +S S   SS S +S        ++ S ++     DPY +  ++   + RDIGPYKN +  + +S+D    +     P + +L VLM  L 
Subjt:  RTMELEKSGPISRSLHYSSLSSRSFRVENGLNSSLSVHKELRQQDPYGIF-ENEESIPRDIGPYKNLVIFTSTSMDPKSISSATFIPLIRKLRVLMSNLQ

Query:  KVDLRPLSYQQKLAFWINMYNACIMNAM--------------------VNIGGNTINAQAIEHYILRKPMSSNKED-DNKEAVVRKLYGLESSEPNVTFA
        +VDL  L+Y+QKLAFWIN+YNACIM+A                     +N+GG  +NA AIEH++LR P     +  D KE ++R  YGL  SEPNVTFA
Subjt:  KVDLRPLSYQQKLAFWINMYNACIMNAM--------------------VNIGGNTINAQAIEHYILRKPMSSNKED-DNKEAVVRKLYGLESSEPNVTFA

Query:  LCCGTRSSPAVRIYSGESVAAELERSKLEYLQASVVVTSSKRVAVPELLVRSLPEFSAAADMKAVVEWVCHQLPTSGSLRKSMVECFRGHPKT--HPTID
        LC G+ SSPA+R+Y+ + V  +L R+++EYL+ASV V+S K++ VP+LL   + +F  A D+++++EW+  QLP SG+L+  ++EC +   K      ++
Subjt:  LCCGTRSSPAVRIYSGESVAAELERSKLEYLQASVVVTSSKRVAVPELLVRSLPEFSAAADMKAVVEWVCHQLPTSGSLRKSMVECFRGHPKT--HPTID

Query:  TLPYDFEFQYLLPL
           Y  EF+YLL L
Subjt:  TLPYDFEFQYLLPL

AT2G39690.2 Protein of unknown function, DUF5471.6e-5438.46Show/hide
Query:  PSKLSESIMKCLNFIYVRLLRASRTMELEKSGPISRSLHYSSLSSRSFRVENGLNSSLSVHKELRQQDPYGIF-ENEESIPRDIGPYKNLVIFTSTSMDP
        P+++SE ++ CL  IY+ L   S     +  G +S S   SS S +S        ++ S ++     DPY +  ++   + RDIGPYKN +  + +S+D 
Subjt:  PSKLSESIMKCLNFIYVRLLRASRTMELEKSGPISRSLHYSSLSSRSFRVENGLNSSLSVHKELRQQDPYGIF-ENEESIPRDIGPYKNLVIFTSTSMDP

Query:  KSISSATFIPLIRKLRVLMSNLQKVDLRPLSYQQKLAFWINMYNACIMNAM--------------------VNIGGNTINAQAIEHYILRKPMSSNKED-
           +     P + +L VLM  L +VDL  L+Y+QKLAFWIN+YNACIM+A                     +N+GG  +NA AIEH++LR P     +D 
Subjt:  KSISSATFIPLIRKLRVLMSNLQKVDLRPLSYQQKLAFWINMYNACIMNAM--------------------VNIGGNTINAQAIEHYILRKPMSSNKED-

Query:  -DNKEAVVRKLYGLESSEPNVTFALCCGTRSSPAVRIYSGESVAAELERSKLEYLQASVVVTSSKRVAVPELLVRSLPEFSAAADMKAVVEWVCHQLPTS
         D KE ++R  YGL  SEPNVTFALC G+ SSPA+R+Y+ + V  +L R+++EYL+ASV V+S K++ VP+LL   + +F  A D+++++EW+  QLP S
Subjt:  -DNKEAVVRKLYGLESSEPNVTFALCCGTRSSPAVRIYSGESVAAELERSKLEYLQASVVVTSSKRVAVPELLVRSLPEFSAAADMKAVVEWVCHQLPTS

Query:  GSLRKSMVECFRGHPKT--HPTIDTLPYDFEFQYLLPL
        G+L+  ++EC +   K      ++   Y  EF+YLL L
Subjt:  GSLRKSMVECFRGHPKT--HPTIDTLPYDFEFQYLLPL

AT3G12540.1 Protein of unknown function, DUF5477.3e-4434.57Show/hide
Query:  QQEENIQNWHPSKLSESIMKCLNFIYVRLLRASRTMELEKSGPISRSLHYSSLSSRSFRVENGLNSSLSVHKELRQQDPYGIFENEESIPRDIGPYKNLV
        Q++ N+Q   P+ +SE ++KCL  IY+ L R+SR  E E S  +S+ L  + L + SF+ ++  + + S        DPYG         RDIG YKN +
Subjt:  QQEENIQNWHPSKLSESIMKCLNFIYVRLLRASRTMELEKSGPISRSLHYSSLSSRSFRVENGLNSSLSVHKELRQQDPYGIFENEESIPRDIGPYKNLV

Query:  IFTSTSMDPKSISSATFIPLIRKLRVLMSNLQKVDLRPLSYQQKLAFWINMYNACIMN--------------------AMVNIGGNTINAQAIEHYILRK
          T TS+D   +S  +    +  LRVL   L KVDL  L++++K+AFWIN YNAC+MN                    A +++GG  ++A  IE  IL+ 
Subjt:  IFTSTSMDPKSISSATFIPLIRKLRVLMSNLQKVDLRPLSYQQKLAFWINMYNACIMN--------------------AMVNIGGNTINAQAIEHYILRK

Query:  PMSSNKEDDNKEAVVR--KLYGLESSEPNVTFALCCGTRSSPAVRIYSGESVAAELERSKLEYLQASVVVTSSKRVAVPELLVRSLPEFSAAADMKAVVE
        P    +     E+ VR    YG    EPN+ F LC G  SSPA+R+Y+ E V  EL +++ EYL+AS+ V+  K++ +P  L + L +F  A D  +++E
Subjt:  PMSSNKEDDNKEAVVR--KLYGLESSEPNVTFALCCGTRSSPAVRIYSGESVAAELERSKLEYLQASVVVTSSKRVAVPELLVRSLPEFSAAADMKAVVE

Query:  WVCHQLPTSG---SLRKSMVECF--RGHPKTHPTIDTLPYDFEFQYLLPL
        W+C QLP +     L+++ +E    +   +    I+   +++EF+YLL L
Subjt:  WVCHQLPTSG---SLRKSMVECF--RGHPKTHPTIDTLPYDFEFQYLLPL

AT5G42690.1 Protein of unknown function, DUF5471.3e-4029.09Show/hide
Query:  VKEVLAELAMVESEIARLEIQITQLQKDLKTEQQQTTKS-----------KQWSSEQPQTNNNDNNPNNKPPLNWNPISKSTFDTKALHFISKAIKGDYA
        V E+LAE+A++E E+ RLE  I   +++L  E   T+ S           K W ++    + +     ++ PL+  P S S       + +S A      
Subjt:  VKEVLAELAMVESEIARLEIQITQLQKDLKTEQQQTTKS-----------KQWSSEQPQTNNNDNNPNNKPPLNWNPISKSTFDTKALHFISKAIKGDYA

Query:  LNHFKLDNPKISSELDPRDTKDTHHLLHEVKLHERVSRKSGLLVASSPLRDPRHPSPKQRERNPLDMPPPKSMSMPIQQEENIQNWHPSKLSESIMKCLN
        +    + + +++  L+ +  KD+H             RK+    +S    D                                    P+K+SE ++KCL+
Subjt:  LNHFKLDNPKISSELDPRDTKDTHHLLHEVKLHERVSRKSGLLVASSPLRDPRHPSPKQRERNPLDMPPPKSMSMPIQQEENIQNWHPSKLSESIMKCLN

Query:  FIYVRLLRASRTMELEKSGPISRSLHYSSLSSRSFRVENGLNSSLSVHKELRQQDPYGIFENEESIPRDIGPYKNLVIFTSTSMDPKSISSATFIPLIRK
         I++R+    R+M                           +  S    K+   +DPYGI  +     RDIG YKN       S++    SS++   LIR+
Subjt:  FIYVRLLRASRTMELEKSGPISRSLHYSSLSSRSFRVENGLNSSLSVHKELRQQDPYGIFENEESIPRDIGPYKNLVIFTSTSMDPKSISSATFIPLIRK

Query:  LRVLMSNLQKVDLRPLSYQQKLAFWINMYNACIMN-------------------AMVNIGGNTINAQAIEHYILRKPMSSN----KEDDNKEAVVRKLYG
        L+ L+  L  V+++ L+ Q+KLAFWIN+YN+C+MN                   A +N+GG+ +NA  IEH+ILR P  S     K     E  VR  +G
Subjt:  LRVLMSNLQKVDLRPLSYQQKLAFWINMYNACIMN-------------------AMVNIGGNTINAQAIEHYILRKPMSSN----KEDDNKEAVVRKLYG

Query:  LESSEPNVTFALCCGTRSSPAVRIYSGESVAAELERSKLEYLQASVVVTSSKRVAVPELLVRSLPEFSAAADMKAVVEWVCHQLPTSGSLRKSMVECFRG
        LE SEP VTFAL CG+ SSPAVR+Y+   V  ELE +K EYL+ASV + S  ++ +P+L+     +F  A D++++++W+  QLPT   L K  + C   
Subjt:  LESSEPNVTFALCCGTRSSPAVRIYSGESVAAELERSKLEYLQASVVVTSSKRVAVPELLVRSLPEFSAAADMKAVVEWVCHQLPTSGSLRKSMVECFRG

Query:  HPKTHPT---IDTLPYDFEFQYLLPL
             P+   +  +PYDF F+YL  +
Subjt:  HPKTHPT---IDTLPYDFEFQYLLPL

AT5G60720.1 Protein of unknown function, DUF5473.1e-12748.87Show/hide
Query:  QVKEVLAELAMVESEIARLEIQITQLQKDLKTEQQQTTKSKQWSS-----EQPQTNNNDN-NPNNKPPLNWNP--------ISK----------------
        ++KE++ EL++VE EI+RLEIQI+ LQ +LK EQ +T K    SS     +  +T  +DN NP   P L   P        ++K                
Subjt:  QVKEVLAELAMVESEIARLEIQITQLQKDLKTEQQQTTKSKQWSS-----EQPQTNNNDN-NPNNKPPLNWNP--------ISK----------------

Query:  ----STFDTKALHFISKAIKGDYALNHFKLDNPKISSELDPRDTKDTHH-LLHEVKLHERVS-RKSGLLVASSPLRDPRHPSPKQRERN-------PLDM
            +TF TK LHFI+KAIKGDYA+  F+  N K+         K+ H  + HE K+ E +  +K   + + SPLR+PR+ SP +  ++        LD+
Subjt:  ----STFDTKALHFISKAIKGDYALNHFKLDNPKISSELDPRDTKDTHH-LLHEVKLHERVS-RKSGLLVASSPLRDPRHPSPKQRERN-------PLDM

Query:  PPPKSMSMPIQQE--ENIQNWHPSKLSESIMKCLNFIYVRLLRASRTMELEKSGPISRSLHYSSLSSRSFRVENGLNS-----SLSVHKELRQQDPYGIF
         PPKS+S  I  E  +NIQ WHP+KL+E+IMKCLNFIYVRLLR +R MELEK+GPISRS ++ SLSSRSFRV+N  +S     +L  +KE RQQDPYGIF
Subjt:  PPPKSMSMPIQQE--ENIQNWHPSKLSESIMKCLNFIYVRLLRASRTMELEKSGPISRSLHYSSLSSRSFRVENGLNS-----SLSVHKELRQQDPYGIF

Query:  ENEESIPRDIGPYKNLVIFTSTSMDPKSISSATFIPLIRKLRVLMSNLQKVDLRPLSYQQKLAFWINMYNACIMN---------------------AMVN
        + E S+ RDIGPYKNLVIFTS+SMD K ISS++ + LI+KLRVLM+NL+ VDL+ LS+QQKLAFWINM+NAC+M+                     A +N
Subjt:  ENEESIPRDIGPYKNLVIFTSTSMDPKSISSATFIPLIRKLRVLMSNLQKVDLRPLSYQQKLAFWINMYNACIMN---------------------AMVN

Query:  IGGNTINAQAIEHYILRKPMSSN-KEDDNKEAVVRKLYGLESSEPNVTFALCCGTRSSPAVRIYSGESVAAELERSKLEYLQASVVVTSSKRVAVPELLV
        +GG  I+A  IEH ILRK  SS   +D ++E ++RKLYG+E+++PN+TFAL CGTRSSPAVRIY+GE V  ELE+SKLEYLQAS+VVT++KR+ +PELL+
Subjt:  IGGNTINAQAIEHYILRKPMSSN-KEDDNKEAVVRKLYGLESSEPNVTFALCCGTRSSPAVRIYSGESVAAELERSKLEYLQASVVVTSSKRVAVPELLV

Query:  RSLPEF---------SAAADMKAVVEWVCHQLPTSGSLRKSMVECFRG----HPKTHPTIDTLPYDFEFQYLLPL
        +   +F              + ++V+WVC+QLPTSGSLRKSMV+CF+        +   ++ +PYDFEFQYLL +
Subjt:  RSLPEF---------SAAADMKAVVEWVCHQLPTSGSLRKSMVECFRG----HPKTHPTIDTLPYDFEFQYLLPL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAGGTAAAGGAAGTGTTGGCAGAGCTAGCAATGGTGGAAAGTGAAATAGCAAGGCTTGAGATCCAAATAACCCAACTCCAAAAGGACTTGAAAACTGAGCAACAACA
AACCACAAAGTCCAAGCAATGGAGCTCTGAACAACCTCAAACCAATAATAATGATAATAATCCCAACAATAAGCCACCATTGAATTGGAACCCAATTAGCAAATCAACTT
TTGACACTAAGGCTCTCCACTTCATTAGCAAAGCTATCAAGGGAGATTATGCTCTCAATCACTTTAAGTTGGATAATCCAAAAATTAGTAGTGAATTAGATCCTAGAGAT
ACCAAAGACACTCATCATCTTCTTCATGAGGTTAAACTCCATGAAAGAGTTTCTAGAAAGAGTGGTCTTCTCGTCGCTTCGTCTCCGTTGCGAGACCCTCGACATCCTTC
TCCAAAGCAACGAGAGAGAAATCCATTAGACATGCCACCGCCTAAATCTATGTCAATGCCAATTCAACAAGAAGAAAACATTCAAAATTGGCATCCCAGCAAGCTATCAG
AGAGTATCATGAAGTGCTTGAACTTCATATATGTGAGACTGCTGAGAGCCTCAAGAACAATGGAGCTAGAGAAATCAGGTCCCATTTCAAGATCTTTGCATTATTCTTCT
TTGAGCTCGAGAAGCTTTCGAGTCGAGAACGGTTTAAACTCGAGCCTTTCAGTACACAAAGAATTGAGGCAACAAGATCCTTATGGCATCTTCGAAAATGAAGAATCGAT
ACCGAGGGATATTGGCCCTTACAAGAACTTGGTCATATTCACATCAACTTCCATGGATCCCAAATCAATATCCAGTGCCACCTTCATCCCTCTCATAAGGAAGCTAAGGG
TCTTGATGAGCAATTTGCAAAAAGTGGATCTAAGGCCATTGAGTTACCAACAAAAACTGGCCTTTTGGATCAACATGTACAATGCTTGTATCATGAATGCAATGGTTAAT
ATCGGAGGCAACACCATAAATGCACAAGCCATAGAGCATTACATTCTAAGGAAACCAATGTCTAGTAACAAAGAGGACGACAACAAAGAAGCCGTCGTCCGGAAGCTGTA
CGGCCTAGAATCATCAGAACCGAACGTCACATTCGCCCTATGCTGTGGGACCCGGTCGTCGCCGGCGGTGAGAATATACAGTGGCGAGTCGGTGGCGGCGGAGCTTGAGA
GATCGAAGCTCGAGTATCTGCAGGCGTCGGTGGTGGTCACCAGCTCCAAAAGGGTCGCCGTGCCGGAGCTTCTAGTTCGAAGCTTGCCGGAATTTTCAGCGGCGGCGGAC
ATGAAGGCGGTGGTGGAGTGGGTGTGCCACCAGCTTCCGACGTCGGGGAGTCTGAGGAAATCAATGGTGGAATGCTTTCGAGGACACCCGAAAACGCATCCCACCATCGA
CACATTGCCTTATGACTTCGAGTTTCAATATCTTTTGCCTTTGTGA
mRNA sequenceShow/hide mRNA sequence
ATGCAGGTAAAGGAAGTGTTGGCAGAGCTAGCAATGGTGGAAAGTGAAATAGCAAGGCTTGAGATCCAAATAACCCAACTCCAAAAGGACTTGAAAACTGAGCAACAACA
AACCACAAAGTCCAAGCAATGGAGCTCTGAACAACCTCAAACCAATAATAATGATAATAATCCCAACAATAAGCCACCATTGAATTGGAACCCAATTAGCAAATCAACTT
TTGACACTAAGGCTCTCCACTTCATTAGCAAAGCTATCAAGGGAGATTATGCTCTCAATCACTTTAAGTTGGATAATCCAAAAATTAGTAGTGAATTAGATCCTAGAGAT
ACCAAAGACACTCATCATCTTCTTCATGAGGTTAAACTCCATGAAAGAGTTTCTAGAAAGAGTGGTCTTCTCGTCGCTTCGTCTCCGTTGCGAGACCCTCGACATCCTTC
TCCAAAGCAACGAGAGAGAAATCCATTAGACATGCCACCGCCTAAATCTATGTCAATGCCAATTCAACAAGAAGAAAACATTCAAAATTGGCATCCCAGCAAGCTATCAG
AGAGTATCATGAAGTGCTTGAACTTCATATATGTGAGACTGCTGAGAGCCTCAAGAACAATGGAGCTAGAGAAATCAGGTCCCATTTCAAGATCTTTGCATTATTCTTCT
TTGAGCTCGAGAAGCTTTCGAGTCGAGAACGGTTTAAACTCGAGCCTTTCAGTACACAAAGAATTGAGGCAACAAGATCCTTATGGCATCTTCGAAAATGAAGAATCGAT
ACCGAGGGATATTGGCCCTTACAAGAACTTGGTCATATTCACATCAACTTCCATGGATCCCAAATCAATATCCAGTGCCACCTTCATCCCTCTCATAAGGAAGCTAAGGG
TCTTGATGAGCAATTTGCAAAAAGTGGATCTAAGGCCATTGAGTTACCAACAAAAACTGGCCTTTTGGATCAACATGTACAATGCTTGTATCATGAATGCAATGGTTAAT
ATCGGAGGCAACACCATAAATGCACAAGCCATAGAGCATTACATTCTAAGGAAACCAATGTCTAGTAACAAAGAGGACGACAACAAAGAAGCCGTCGTCCGGAAGCTGTA
CGGCCTAGAATCATCAGAACCGAACGTCACATTCGCCCTATGCTGTGGGACCCGGTCGTCGCCGGCGGTGAGAATATACAGTGGCGAGTCGGTGGCGGCGGAGCTTGAGA
GATCGAAGCTCGAGTATCTGCAGGCGTCGGTGGTGGTCACCAGCTCCAAAAGGGTCGCCGTGCCGGAGCTTCTAGTTCGAAGCTTGCCGGAATTTTCAGCGGCGGCGGAC
ATGAAGGCGGTGGTGGAGTGGGTGTGCCACCAGCTTCCGACGTCGGGGAGTCTGAGGAAATCAATGGTGGAATGCTTTCGAGGACACCCGAAAACGCATCCCACCATCGA
CACATTGCCTTATGACTTCGAGTTTCAATATCTTTTGCCTTTGTGA
Protein sequenceShow/hide protein sequence
MQVKEVLAELAMVESEIARLEIQITQLQKDLKTEQQQTTKSKQWSSEQPQTNNNDNNPNNKPPLNWNPISKSTFDTKALHFISKAIKGDYALNHFKLDNPKISSELDPRD
TKDTHHLLHEVKLHERVSRKSGLLVASSPLRDPRHPSPKQRERNPLDMPPPKSMSMPIQQEENIQNWHPSKLSESIMKCLNFIYVRLLRASRTMELEKSGPISRSLHYSS
LSSRSFRVENGLNSSLSVHKELRQQDPYGIFENEESIPRDIGPYKNLVIFTSTSMDPKSISSATFIPLIRKLRVLMSNLQKVDLRPLSYQQKLAFWINMYNACIMNAMVN
IGGNTINAQAIEHYILRKPMSSNKEDDNKEAVVRKLYGLESSEPNVTFALCCGTRSSPAVRIYSGESVAAELERSKLEYLQASVVVTSSKRVAVPELLVRSLPEFSAAAD
MKAVVEWVCHQLPTSGSLRKSMVECFRGHPKTHPTIDTLPYDFEFQYLLPL