; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC02g1121 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC02g1121
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionSMI1_KNR4 domain-containing protein
Genome locationMC02:9562234..9563412
RNA-Seq ExpressionMC02g1121
SyntenyMC02g1121
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7035203.1 hypothetical protein SDJN02_01998, partial [Cucurbita argyrosperma subsp. argyrosperma]3.13e-24888.72Show/hide
Query:  MVDVDRRMTGMNPAHLAGLRRLSARAAAAPSAPST--RNGLLSFASLADKVLTHLTNSGVEVQPGLSDAEFARAEAEFGFAFPPDLRAVLSAGLPVGPGF
        MVDVDRRMTG+NPAHLAGLRRLSARAAA  +A S   RNGLLSF+SLAD+VLTHL NSGV+VQPGLSDAEFARAEAEF FAFPPDLRAVLSAGLPVGPGF
Subjt:  MVDVDRRMTGMNPAHLAGLRRLSARAAAAPSAPST--RNGLLSFASLADKVLTHLTNSGVEVQPGLSDAEFARAEAEFGFAFPPDLRAVLSAGLPVGPGF

Query:  PDWRAAGARLHLRASLDLPVAAISFQIARNTLWSRSWGPRPAEPEKALRVARNSLKRAPVLIPIFNHCYIPCNPPLAGNPIFFVDESRVFCCGFDLSDFF
        PDWR++GARLHLR+SLDLP+AAISFQIARNTLWSRSWGPRPAEPEKALRVARNSLKRAPVLIPIFNHCYIPCNPPLAGNPIFFVDESRV CCGFDLSDFF
Subjt:  PDWRAAGARLHLRASLDLPVAAISFQIARNTLWSRSWGPRPAEPEKALRVARNSLKRAPVLIPIFNHCYIPCNPPLAGNPIFFVDESRVFCCGFDLSDFF

Query:  ERESLFRCSVSD----PLFLKQRSVSEKSAASSTTFSRRSVDAGVAKTPRWVEFWSDATVDRRRRNSSSSSSSSPDRFFEMPPRFEIPKWVGEYVGELGS
        ERESLFRCS SD    PLF KQRS++EKS  SST FSRRS+D+GV KTPRWVEFWSDA VDRRRRNSSSSS+SSPDRFFE+P R EIPKWVGEY+GELGS
Subjt:  ERESLFRCSVSD----PLFLKQRSVSEKSAASSTTFSRRSVDAGVAKTPRWVEFWSDATVDRRRRNSSSSSSSSPDRFFEMPPRFEIPKWVGEYVGELGS

Query:  VLRGGGWSESEVAEMVEVSAAGFFDGEMVLLDNQAVFDALLLKVDRFSDSLRRAGWSSEEVSEAFGFDFRSEKERKPAKKLSAELVERIGKLAESVSRS
        VLR GGWSESEVAEMV+VSA+GFFDGEMV+LDNQAV DALLLKVDRFS SLRRAGWSSEEVSEAFGFDFR EKERKPAKKLSAELVERIGKLAESVSRS
Subjt:  VLRGGGWSESEVAEMVEVSAAGFFDGEMVLLDNQAVFDALLLKVDRFSDSLRRAGWSSEEVSEAFGFDFRSEKERKPAKKLSAELVERIGKLAESVSRS

POO02805.1 SMI1/KNR4-like domain containing protein [Trema orientale]5.25e-23281.93Show/hide
Query:  MVDVDRRMTGMNPAHLAGLRRLSARAAAAPSAPST-------RNGLLSFASLADKVLTHLTNSGVEVQPGLSDAEFARAEAEFGFAFPPDLRAVLSAGLP
        MVDVDRRMTG+NPAH+AGLRRLSARAAAAPS  +        RNGLLSF+SLA+KV+THL NSG++VQPGLSDAEFARAEAEFGFAFPPDLRAVLSAGLP
Subjt:  MVDVDRRMTGMNPAHLAGLRRLSARAAAAPSAPST-------RNGLLSFASLADKVLTHLTNSGVEVQPGLSDAEFARAEAEFGFAFPPDLRAVLSAGLP

Query:  VGPGFPDWRAAGARLHLRASLDLPVAAISFQIARNTLWSRSWGPRPAEPEKALRVARNSLKRAPVLIPIFNHCYIPCNPPLAGNPIFFVDESRVFCCGFD
        VGPGFPDWRA GARLHLRASLDLP+AAISFQIARNTLWS+SWGPRP++PE+ALRVARN+LKRAP+LIPIFNHCYIPCNP LAGNPIFFVDE+R+FCCG D
Subjt:  VGPGFPDWRAAGARLHLRASLDLPVAAISFQIARNTLWSRSWGPRPAEPEKALRVARNSLKRAPVLIPIFNHCYIPCNPPLAGNPIFFVDESRVFCCGFD

Query:  LSDFFERESLFRCSVSDPLFLK-QRSVSEKSAASSTTFSRRSVDAGV---AKTPRWVEFWSDATVDRRRRNSSSSSSSSPDRFFEMPPRFEIPKWVGEYV
        LSDFFERESLFR S SDP  LK QRSVSEKSA SS+ FSRRS+D G    A+TPRWVEFWSDA VDRRRRNSSSSSSSSP+RFF+MP R EIPKWV EYV
Subjt:  LSDFFERESLFRCSVSDPLFLK-QRSVSEKSAASSTTFSRRSVDAGV---AKTPRWVEFWSDATVDRRRRNSSSSSSSSPDRFFEMPPRFEIPKWVGEYV

Query:  GELGSVLRGGGWSESEVAEMVEVSAAGFFDGEMVLLDNQAVFDALLLKVDRFSDSLRRAGWSSEEVSEAFGFDFRSEKERKPAKKLSAELVERIGKLAES
         ++GSVLR GGWSES+V+E++EVSA+GFF+GEM+L+DNQAV DALLLK DRFSDSLR+AGWSSEEVS+A GFDFR EKE+KP KKLS ELVERIGKLAES
Subjt:  GELGSVLRGGGWSESEVAEMVEVSAAGFFDGEMVLLDNQAVFDALLLKVDRFSDSLRRAGWSSEEVSEAFGFDFRSEKERKPAKKLSAELVERIGKLAES

Query:  VSRS
        VSRS
Subjt:  VSRS

XP_004149705.1 uncharacterized protein LOC101213140 [Cucumis sativus]1.80e-23284.56Show/hide
Query:  MVDVDRRMTGMNPAHLAGLRRLSARAAAAPSAPSTRNGLLSFASLADKVLTHLTNSGVEVQPGLSDAEFARAEAEFGFAFPPDLRAVLSAGLPVGPGFPD
        MVDVDRRM+ +NPAH+AGLRRLSARAAA  SAP  RN LLSF+SLADKVLTHL NSGV+VQPGLSDAEFARAEAEF F+FPPDLRAVLSAGLPVGPGFPD
Subjt:  MVDVDRRMTGMNPAHLAGLRRLSARAAAAPSAPSTRNGLLSFASLADKVLTHLTNSGVEVQPGLSDAEFARAEAEFGFAFPPDLRAVLSAGLPVGPGFPD

Query:  WRAAGARLHLRASLDLPVAAISFQIARNTLWSRSWGPRPAEPEKALRVARNSLKRAPVLIPIFNHCYIPCNPPLAGNPIFFVDESRVFCCGFDLSDFFER
        WR+AGARLHLR+SLDLP+AAISFQIA+NTLWS SWG +PAEPEKALR+ARN LKRAPVLIPIFNHCYIPCNPPLAGNPIFFVDE+RV CCGFDLSDFFER
Subjt:  WRAAGARLHLRASLDLPVAAISFQIARNTLWSRSWGPRPAEPEKALRVARNSLKRAPVLIPIFNHCYIPCNPPLAGNPIFFVDESRVFCCGFDLSDFFER

Query:  ESLFRCSVSDP--LFLKQRSVSEKSAASSTTFSRRSVDAGVAKTPRWVEFWSDATVDRRRRNSSSSSSSSPDRFFEMPPRFEIPKWVGEYVGELGSVLRG
        ESLFRCSVSD   LF KQ S+++KS   S  FSRRS+D+GV +TPRWVEFWSDA +DRRRRNSSSSS+SSPDRFFEMP R E+PKWVG+Y+ ELGSVLR 
Subjt:  ESLFRCSVSDP--LFLKQRSVSEKSAASSTTFSRRSVDAGVAKTPRWVEFWSDATVDRRRRNSSSSSSSSPDRFFEMPPRFEIPKWVGEYVGELGSVLRG

Query:  GGWSESEVAEMVEVSAAGFFDGEMVLLDNQAVFDALLLKVDRFSDSLRRAGWSSEEVSEAFGFDFRSEKERKPAKKLSAELVERIGKLAESVSRS
        GGWSESEVAEMVEVSAAG FDGEMV+LDNQAV DALLLKVDRFS SLRR+GWSSEEVSEAFGFDFR EK +K AKKLSAELVERIGKLAESVSRS
Subjt:  GGWSESEVAEMVEVSAAGFFDGEMVLLDNQAVFDALLLKVDRFSDSLRRAGWSSEEVSEAFGFDFRSEKERKPAKKLSAELVERIGKLAESVSRS

XP_022947736.1 uncharacterized protein LOC111451511 [Cucurbita moschata]2.20e-24888.72Show/hide
Query:  MVDVDRRMTGMNPAHLAGLRRLSARAAAAPSAPST--RNGLLSFASLADKVLTHLTNSGVEVQPGLSDAEFARAEAEFGFAFPPDLRAVLSAGLPVGPGF
        MVDVDRRMTG+NPAHLAGLRRLSARAAA  +A S   RNGLLSF+SLAD+VLTHL NSGV+VQPGLSDAEFARAEAEF FAFPPDLRAVLSAGLPVGPGF
Subjt:  MVDVDRRMTGMNPAHLAGLRRLSARAAAAPSAPST--RNGLLSFASLADKVLTHLTNSGVEVQPGLSDAEFARAEAEFGFAFPPDLRAVLSAGLPVGPGF

Query:  PDWRAAGARLHLRASLDLPVAAISFQIARNTLWSRSWGPRPAEPEKALRVARNSLKRAPVLIPIFNHCYIPCNPPLAGNPIFFVDESRVFCCGFDLSDFF
        PDWR++GARLHLR+SLDLP+AAISFQIARNTLWSRSWGPRPAEPEKALRVARNSLKRAPVLIPIFNHCYIPCNPPLAGNPIFFVDESRV CCGFDLSDFF
Subjt:  PDWRAAGARLHLRASLDLPVAAISFQIARNTLWSRSWGPRPAEPEKALRVARNSLKRAPVLIPIFNHCYIPCNPPLAGNPIFFVDESRVFCCGFDLSDFF

Query:  ERESLFRCSVSD----PLFLKQRSVSEKSAASSTTFSRRSVDAGVAKTPRWVEFWSDATVDRRRRNSSSSSSSSPDRFFEMPPRFEIPKWVGEYVGELGS
        ERESLFRCS SD    PLF KQRS++EKS  SST FSRRS+D+GV KTPRWVEFWSDA VDRRRRNSSSSS+SSPDRFFE+P R EIPKWVGEY+GELGS
Subjt:  ERESLFRCSVSD----PLFLKQRSVSEKSAASSTTFSRRSVDAGVAKTPRWVEFWSDATVDRRRRNSSSSSSSSPDRFFEMPPRFEIPKWVGEYVGELGS

Query:  VLRGGGWSESEVAEMVEVSAAGFFDGEMVLLDNQAVFDALLLKVDRFSDSLRRAGWSSEEVSEAFGFDFRSEKERKPAKKLSAELVERIGKLAESVSRS
        VLR GGWSESEVAEMV+VSA+GFFDGEMV+LDNQAV DALLLKVDRFS SLRRAGWSSEEVSEAFGFDFR EKERKPAKKLSAELVERIGKLAESVSRS
Subjt:  VLRGGGWSESEVAEMVEVSAAGFFDGEMVLLDNQAVFDALLLKVDRFSDSLRRAGWSSEEVSEAFGFDFRSEKERKPAKKLSAELVERIGKLAESVSRS

XP_038902056.1 uncharacterized protein LOC120088705 [Benincasa hispida]2.12e-23685.64Show/hide
Query:  MVDVDRRMTGMNPAHLAGLRRLSARAAAAPSAPSTRNGLLSFASLADKVLTHLTNSGVEVQPGLSDAEFARAEAEFGFAFPPDLRAVLSAGLPVGPGFPD
        MVDVDRRM+ +NPAH+AGLRRLSARAAA  SAP  RNGLLSF+SLADKVLTHL NSGV+VQPGLSDAEFARAEAEF F+FPPDLRAVLSAGLPVGPGFPD
Subjt:  MVDVDRRMTGMNPAHLAGLRRLSARAAAAPSAPSTRNGLLSFASLADKVLTHLTNSGVEVQPGLSDAEFARAEAEFGFAFPPDLRAVLSAGLPVGPGFPD

Query:  WRAAGARLHLRASLDLPVAAISFQIARNTLWSRSWGPRPAEPEKALRVARNSLKRAPVLIPIFNHCYIPCNPPLAGNPIFFVDESRVFCCGFDLSDFFER
        WR+AGARLHLR+SLDLP+AAISFQIA+NTLWS SWG +PAEPEKALR+ARN LKRAPVLIPIFNHCYIPCNPPLAGNPIFFVDE+RV CCG DLSDFFER
Subjt:  WRAAGARLHLRASLDLPVAAISFQIARNTLWSRSWGPRPAEPEKALRVARNSLKRAPVLIPIFNHCYIPCNPPLAGNPIFFVDESRVFCCGFDLSDFFER

Query:  ESLFRCSVSD----PLFLKQRSVSEKSAASSTTFSRRSVDAGVAKTPRWVEFWSDATVDRRRRNSSSSSSSSPDRFFEMPPRFEIPKWVGEYVGELGSVL
        ESLFRCSVSD    PLF KQRS++EKS   S  FSRRSVD+GV +TPRWVEFWSDA +DRRRRNSSSSS+SSPDRF EMP R EIPKWVG+Y+GELGSVL
Subjt:  ESLFRCSVSD----PLFLKQRSVSEKSAASSTTFSRRSVDAGVAKTPRWVEFWSDATVDRRRRNSSSSSSSSPDRFFEMPPRFEIPKWVGEYVGELGSVL

Query:  RGGGWSESEVAEMVEVSAAGFFDGEMVLLDNQAVFDALLLKVDRFSDSLRRAGWSSEEVSEAFGFDFRSEKERKPAKKLSAELVERIGKLAESVSRS
        R GGWSESEVAEMVEVSAAGFFD EMV+LDNQAV DALLLKVDRFS SLRR+GWSSEEVSEAFGFDFR EK RK AKKLSAELVERIGKLAESVSRS
Subjt:  RGGGWSESEVAEMVEVSAAGFFDGEMVLLDNQAVFDALLLKVDRFSDSLRRAGWSSEEVSEAFGFDFRSEKERKPAKKLSAELVERIGKLAESVSRS

TrEMBL top hitse value%identityAlignment
A0A0A0LMG8 Uncharacterized protein8.71e-23384.56Show/hide
Query:  MVDVDRRMTGMNPAHLAGLRRLSARAAAAPSAPSTRNGLLSFASLADKVLTHLTNSGVEVQPGLSDAEFARAEAEFGFAFPPDLRAVLSAGLPVGPGFPD
        MVDVDRRM+ +NPAH+AGLRRLSARAAA  SAP  RN LLSF+SLADKVLTHL NSGV+VQPGLSDAEFARAEAEF F+FPPDLRAVLSAGLPVGPGFPD
Subjt:  MVDVDRRMTGMNPAHLAGLRRLSARAAAAPSAPSTRNGLLSFASLADKVLTHLTNSGVEVQPGLSDAEFARAEAEFGFAFPPDLRAVLSAGLPVGPGFPD

Query:  WRAAGARLHLRASLDLPVAAISFQIARNTLWSRSWGPRPAEPEKALRVARNSLKRAPVLIPIFNHCYIPCNPPLAGNPIFFVDESRVFCCGFDLSDFFER
        WR+AGARLHLR+SLDLP+AAISFQIA+NTLWS SWG +PAEPEKALR+ARN LKRAPVLIPIFNHCYIPCNPPLAGNPIFFVDE+RV CCGFDLSDFFER
Subjt:  WRAAGARLHLRASLDLPVAAISFQIARNTLWSRSWGPRPAEPEKALRVARNSLKRAPVLIPIFNHCYIPCNPPLAGNPIFFVDESRVFCCGFDLSDFFER

Query:  ESLFRCSVSDP--LFLKQRSVSEKSAASSTTFSRRSVDAGVAKTPRWVEFWSDATVDRRRRNSSSSSSSSPDRFFEMPPRFEIPKWVGEYVGELGSVLRG
        ESLFRCSVSD   LF KQ S+++KS   S  FSRRS+D+GV +TPRWVEFWSDA +DRRRRNSSSSS+SSPDRFFEMP R E+PKWVG+Y+ ELGSVLR 
Subjt:  ESLFRCSVSDP--LFLKQRSVSEKSAASSTTFSRRSVDAGVAKTPRWVEFWSDATVDRRRRNSSSSSSSSPDRFFEMPPRFEIPKWVGEYVGELGSVLRG

Query:  GGWSESEVAEMVEVSAAGFFDGEMVLLDNQAVFDALLLKVDRFSDSLRRAGWSSEEVSEAFGFDFRSEKERKPAKKLSAELVERIGKLAESVSRS
        GGWSESEVAEMVEVSAAG FDGEMV+LDNQAV DALLLKVDRFS SLRR+GWSSEEVSEAFGFDFR EK +K AKKLSAELVERIGKLAESVSRS
Subjt:  GGWSESEVAEMVEVSAAGFFDGEMVLLDNQAVFDALLLKVDRFSDSLRRAGWSSEEVSEAFGFDFRSEKERKPAKKLSAELVERIGKLAESVSRS

A0A2P5FYD4 SMI1/KNR4-like domain containing protein2.54e-23281.93Show/hide
Query:  MVDVDRRMTGMNPAHLAGLRRLSARAAAAPSAPST-------RNGLLSFASLADKVLTHLTNSGVEVQPGLSDAEFARAEAEFGFAFPPDLRAVLSAGLP
        MVDVDRRMTG+NPAH+AGLRRLSARAAAAPS  +        RNGLLSF+SLA+KV+THL NSG++VQPGLSDAEFARAEAEFGFAFPPDLRAVLSAGLP
Subjt:  MVDVDRRMTGMNPAHLAGLRRLSARAAAAPSAPST-------RNGLLSFASLADKVLTHLTNSGVEVQPGLSDAEFARAEAEFGFAFPPDLRAVLSAGLP

Query:  VGPGFPDWRAAGARLHLRASLDLPVAAISFQIARNTLWSRSWGPRPAEPEKALRVARNSLKRAPVLIPIFNHCYIPCNPPLAGNPIFFVDESRVFCCGFD
        VGPGFPDWRA GARLHLRASLDLP+AAISFQIARNTLWS+SWGPRP++PE+ALRVARN+LKRAP+LIPIFNHCYIPCNP LAGNPIFFVDE+R+FCCG D
Subjt:  VGPGFPDWRAAGARLHLRASLDLPVAAISFQIARNTLWSRSWGPRPAEPEKALRVARNSLKRAPVLIPIFNHCYIPCNPPLAGNPIFFVDESRVFCCGFD

Query:  LSDFFERESLFRCSVSDPLFLK-QRSVSEKSAASSTTFSRRSVDAGV---AKTPRWVEFWSDATVDRRRRNSSSSSSSSPDRFFEMPPRFEIPKWVGEYV
        LSDFFERESLFR S SDP  LK QRSVSEKSA SS+ FSRRS+D G    A+TPRWVEFWSDA VDRRRRNSSSSSSSSP+RFF+MP R EIPKWV EYV
Subjt:  LSDFFERESLFRCSVSDPLFLK-QRSVSEKSAASSTTFSRRSVDAGV---AKTPRWVEFWSDATVDRRRRNSSSSSSSSPDRFFEMPPRFEIPKWVGEYV

Query:  GELGSVLRGGGWSESEVAEMVEVSAAGFFDGEMVLLDNQAVFDALLLKVDRFSDSLRRAGWSSEEVSEAFGFDFRSEKERKPAKKLSAELVERIGKLAES
         ++GSVLR GGWSES+V+E++EVSA+GFF+GEM+L+DNQAV DALLLK DRFSDSLR+AGWSSEEVS+A GFDFR EKE+KP KKLS ELVERIGKLAES
Subjt:  GELGSVLRGGGWSESEVAEMVEVSAAGFFDGEMVLLDNQAVFDALLLKVDRFSDSLRRAGWSSEEVSEAFGFDFRSEKERKPAKKLSAELVERIGKLAES

Query:  VSRS
        VSRS
Subjt:  VSRS

A0A6J1G799 uncharacterized protein LOC1114515111.07e-24888.72Show/hide
Query:  MVDVDRRMTGMNPAHLAGLRRLSARAAAAPSAPST--RNGLLSFASLADKVLTHLTNSGVEVQPGLSDAEFARAEAEFGFAFPPDLRAVLSAGLPVGPGF
        MVDVDRRMTG+NPAHLAGLRRLSARAAA  +A S   RNGLLSF+SLAD+VLTHL NSGV+VQPGLSDAEFARAEAEF FAFPPDLRAVLSAGLPVGPGF
Subjt:  MVDVDRRMTGMNPAHLAGLRRLSARAAAAPSAPST--RNGLLSFASLADKVLTHLTNSGVEVQPGLSDAEFARAEAEFGFAFPPDLRAVLSAGLPVGPGF

Query:  PDWRAAGARLHLRASLDLPVAAISFQIARNTLWSRSWGPRPAEPEKALRVARNSLKRAPVLIPIFNHCYIPCNPPLAGNPIFFVDESRVFCCGFDLSDFF
        PDWR++GARLHLR+SLDLP+AAISFQIARNTLWSRSWGPRPAEPEKALRVARNSLKRAPVLIPIFNHCYIPCNPPLAGNPIFFVDESRV CCGFDLSDFF
Subjt:  PDWRAAGARLHLRASLDLPVAAISFQIARNTLWSRSWGPRPAEPEKALRVARNSLKRAPVLIPIFNHCYIPCNPPLAGNPIFFVDESRVFCCGFDLSDFF

Query:  ERESLFRCSVSD----PLFLKQRSVSEKSAASSTTFSRRSVDAGVAKTPRWVEFWSDATVDRRRRNSSSSSSSSPDRFFEMPPRFEIPKWVGEYVGELGS
        ERESLFRCS SD    PLF KQRS++EKS  SST FSRRS+D+GV KTPRWVEFWSDA VDRRRRNSSSSS+SSPDRFFE+P R EIPKWVGEY+GELGS
Subjt:  ERESLFRCSVSD----PLFLKQRSVSEKSAASSTTFSRRSVDAGVAKTPRWVEFWSDATVDRRRRNSSSSSSSSPDRFFEMPPRFEIPKWVGEYVGELGS

Query:  VLRGGGWSESEVAEMVEVSAAGFFDGEMVLLDNQAVFDALLLKVDRFSDSLRRAGWSSEEVSEAFGFDFRSEKERKPAKKLSAELVERIGKLAESVSRS
        VLR GGWSESEVAEMV+VSA+GFFDGEMV+LDNQAV DALLLKVDRFS SLRRAGWSSEEVSEAFGFDFR EKERKPAKKLSAELVERIGKLAESVSRS
Subjt:  VLRGGGWSESEVAEMVEVSAAGFFDGEMVLLDNQAVFDALLLKVDRFSDSLRRAGWSSEEVSEAFGFDFRSEKERKPAKKLSAELVERIGKLAESVSRS

A0A6J1L6H7 uncharacterized protein LOC1114996241.07e-24888.72Show/hide
Query:  MVDVDRRMTGMNPAHLAGLRRLSARAAAAPSAPST--RNGLLSFASLADKVLTHLTNSGVEVQPGLSDAEFARAEAEFGFAFPPDLRAVLSAGLPVGPGF
        MVDVDRRMTG+NPAHLAGLRRLSARAAA  +A S   RNGLLSF+SLAD+VLTHL NSGV+VQPGLSDAEFARAEAEF FAFPPDLRAVLSAGLPVGPGF
Subjt:  MVDVDRRMTGMNPAHLAGLRRLSARAAAAPSAPST--RNGLLSFASLADKVLTHLTNSGVEVQPGLSDAEFARAEAEFGFAFPPDLRAVLSAGLPVGPGF

Query:  PDWRAAGARLHLRASLDLPVAAISFQIARNTLWSRSWGPRPAEPEKALRVARNSLKRAPVLIPIFNHCYIPCNPPLAGNPIFFVDESRVFCCGFDLSDFF
        PDWR++GARLHLR+SLDLP+AAISFQIARNTLWSRSWGPRPAEPEKALRVARNSLKRAPVLIPIFNHCYIPCNPPLAGNPIFFVDESRV CCGFDLSDFF
Subjt:  PDWRAAGARLHLRASLDLPVAAISFQIARNTLWSRSWGPRPAEPEKALRVARNSLKRAPVLIPIFNHCYIPCNPPLAGNPIFFVDESRVFCCGFDLSDFF

Query:  ERESLFRCSVSD----PLFLKQRSVSEKSAASSTTFSRRSVDAGVAKTPRWVEFWSDATVDRRRRNSSSSSSSSPDRFFEMPPRFEIPKWVGEYVGELGS
        ERESLFRCS SD    PLF KQRS++EKS  SST FSRRS+D+GV KTPRWVEFWSDA VDRRRRNSSSSS+SSPDRFFE+P R EIPKWVGEY+GELGS
Subjt:  ERESLFRCSVSD----PLFLKQRSVSEKSAASSTTFSRRSVDAGVAKTPRWVEFWSDATVDRRRRNSSSSSSSSPDRFFEMPPRFEIPKWVGEYVGELGS

Query:  VLRGGGWSESEVAEMVEVSAAGFFDGEMVLLDNQAVFDALLLKVDRFSDSLRRAGWSSEEVSEAFGFDFRSEKERKPAKKLSAELVERIGKLAESVSRS
        VLR GGWSESEVAEMV+VSA+GFFDGEMV+LDNQAV DALLLKVDRFS SLRRAGWSSEEVSEAFGFDFR EKERKPAKKLSAELVERIGKLAESVSRS
Subjt:  VLRGGGWSESEVAEMVEVSAAGFFDGEMVLLDNQAVFDALLLKVDRFSDSLRRAGWSSEEVSEAFGFDFRSEKERKPAKKLSAELVERIGKLAESVSRS

A0A6P4AVM2 uncharacterized protein LOC1074247985.12e-23281.44Show/hide
Query:  MVDVDRRMTGMNPAHLAGLRRLSARAAAAPSAPST-------RNGLLSFASLADKVLTHLTNSGVEVQPGLSDAEFARAEAEFGFAFPPDLRAVLSAGLP
        MVDVDRRMTG+NPAH+AGLRRLSARAAAAPS   T       RNGL SF+SLA+KV++HL NSG++VQPGLSDAEFARAEAEFGFAFPPDLRAVLSAGLP
Subjt:  MVDVDRRMTGMNPAHLAGLRRLSARAAAAPSAPST-------RNGLLSFASLADKVLTHLTNSGVEVQPGLSDAEFARAEAEFGFAFPPDLRAVLSAGLP

Query:  VGPGFPDWRAAGARLHLRASLDLPVAAISFQIARNTLWSRSWGPRPAEPEKALRVARNSLKRAPVLIPIFNHCYIPCNPPLAGNPIFFVDESRVFCCGFD
        VGPGFPDWR+AGARLHLRASLDLP+AAISFQIARNTLWS+SWGPRP++PE+ALRVARN+LKRAP+LIPIFNHCYIPCNP LAGNPIFFVDE+R+FCCG D
Subjt:  VGPGFPDWRAAGARLHLRASLDLPVAAISFQIARNTLWSRSWGPRPAEPEKALRVARNSLKRAPVLIPIFNHCYIPCNPPLAGNPIFFVDESRVFCCGFD

Query:  LSDFFERESLFRCSVSDPLFLK-QRSVSEKSAASSTTFSRRSVDAGV---AKTPRWVEFWSDATVDRRRRNSSSSSSSSPDRFFEMPPRFEIPKWVGEYV
        LSDFF+RESLFR S SDP  LK QRSVSEKSA SS+ FSRRS+D+G    A+TPRWVEFWSDA VDRRRRNSSSSSSSSP+RFF+MP R EIP WV EY+
Subjt:  LSDFFERESLFRCSVSDPLFLK-QRSVSEKSAASSTTFSRRSVDAGV---AKTPRWVEFWSDATVDRRRRNSSSSSSSSPDRFFEMPPRFEIPKWVGEYV

Query:  GELGSVLRGGGWSESEVAEMVEVSAAGFFDGEMVLLDNQAVFDALLLKVDRFSDSLRRAGWSSEEVSEAFGFDFRSEKERKPAKKLSAELVERIGKLAES
        G++GSVLR GGW+ES+++E+V VSA+GFF+GEMVLLDNQAV DALLLK DRFSDSLR+AGWSSEEVS+A GFDFR EKE+KPAKKLS ELVERIGKLAES
Subjt:  GELGSVLRGGGWSESEVAEMVEVSAAGFFDGEMVLLDNQAVFDALLLKVDRFSDSLRRAGWSSEEVSEAFGFDFRSEKERKPAKKLSAELVERIGKLAES

Query:  VSRS
        VSRS
Subjt:  VSRS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G22790.1 unknown protein1.2e-2537.69Show/hide
Query:  TNSGVEVQPGLSDAEFARAEAEFGFAFPPDLRAVLSAGLPVGPGFPDWRAAGARLHLRASLDLPVAAISFQIARNTLWSRSWGPRPAEPEKALRVARNSL
        + +G  V PGL++ E +  E+  GF+FP DLR++L  GLPVG  FP+WR    R +L     LP+  +S  + RN  W  SWG RP    +AL + +  +
Subjt:  TNSGVEVQPGLSDAEFARAEAEFGFAFPPDLRAVLSAGLPVGPGFPDWRAAGARLHLRASLDLPVAAISFQIARNTLWSRSWGPRPAEPEKALRVARNSL

Query:  KRAPVLIPIFNHCYIPCNPP-LAGNPIFFVDESRVFCCGFDLSDFFERESLFRCSVSDPLFLKQRSVSEKSAASSTTFSRRSVDAGVAKTPRWVEFWSD
        + APVL+P++   Y+P   P LAGNP+F +D   V            RE    C V    FLK    SE     + T  RR       + PR VEFWSD
Subjt:  KRAPVLIPIFNHCYIPCNPP-LAGNPIFFVDESRVFCCGFDLSDFFERESLFRCSVSDPLFLKQRSVSEKSAASSTTFSRRSVDAGVAKTPRWVEFWSD

AT3G50340.1 unknown protein6.0e-16673.33Show/hide
Query:  MVDVDRRMTGMNPAHLAGLRRLSARAAAAPSAPSTRNGLLSFASLADKVLTHLTNSGVEVQPGLSDAEFARAEAEFGFAFPPDLRAVLSAGLPVGPGFPD
        MVDVDRRMTG+ PAH AGLRRLSAR AAAP+ P+ RN L+SF+SLAD+V++HL  S ++VQPGL+D+EFARAEAEF FAFPPDLRAVL+AGLPVG GFPD
Subjt:  MVDVDRRMTGMNPAHLAGLRRLSARAAAAPSAPSTRNGLLSFASLADKVLTHLTNSGVEVQPGLSDAEFARAEAEFGFAFPPDLRAVLSAGLPVGPGFPD

Query:  WRAAGARLHLRASLDLPVAAISFQIARNTLWSRSWGPRPAEPEKALRVARNSLKRAPVLIPIFNHCYIPCNPPLAGNPIFFVDESRVFCCGFDLSDFFER
        WR+ GARLHLRA +DLP+AA+SFQIARNTLWS+SWG RP++PEKALRVARN+LKRAP++IPIF+HCYIPCNP LAGNP+F++DE+R+FCCG DLSDFFER
Subjt:  WRAAGARLHLRASLDLPVAAISFQIARNTLWSRSWGPRPAEPEKALRVARNSLKRAPVLIPIFNHCYIPCNPPLAGNPIFFVDESRVFCCGFDLSDFFER

Query:  ESLFRCSVSDPLFL-KQRSVSEKSA----ASSTTFSRRSVDAG---VAKTPRWVEFWSDATVDRRRRNS----SSSSSSSPDRFFEMPPRFEIPKWVGEY
        ES+FR S + P+ L KQRSVSEKSA    +SS+ FSR S+D+G    + TPRWVEFWSDA VDRRRRNS    SSS SSSP+R+ ++ PR E PKWV +Y
Subjt:  ESLFRCSVSDPLFL-KQRSVSEKSA----ASSTTFSRRSVDAG---VAKTPRWVEFWSDATVDRRRRNS----SSSSSSSPDRFFEMPPRFEIPKWVGEY

Query:  VGELGSVLRGGGWSESEVAEMVEVSAAGFFDGEMVLLDNQAVFDALLLKVDRFSDSLRRAGWSSEEVSEAFGFDFRSEKERKPAKKLSAELVERIGKLAE
        V  +GSVLRGGGWSES+V ++V VSA+GFF+GEMV+LDNQAV DALLLK  RFS+SLR+AGWSSEEVS+A GFDFR EKE+KP KKLS ELV+RIGKLAE
Subjt:  VGELGSVLRGGGWSESEVAEMVEVSAAGFFDGEMVLLDNQAVFDALLLKVDRFSDSLRRAGWSSEEVSEAFGFDFRSEKERKPAKKLSAELVERIGKLAE

Query:  SVSRS
        SVSRS
Subjt:  SVSRS

AT5G67020.1 unknown protein1.3e-15570.5Show/hide
Query:  MVDVDRRMTGMNPAHLAGLRRLSARAAAAPSAPSTRNGLLSFASLADKVLTHLTNSGVEVQPGLSDAEFARAEAEFGFAFPPDLRAVLSAGLPVGPGFPD
        MVDVDRRMTG+ PAH AGLRRLSAR AAAPS P+ RN L SF+  ADKV+ HL NSG+++QPGLSD EFAR EAEFGF FPPDLR +LSAGL VG GFPD
Subjt:  MVDVDRRMTGMNPAHLAGLRRLSARAAAAPSAPSTRNGLLSFASLADKVLTHLTNSGVEVQPGLSDAEFARAEAEFGFAFPPDLRAVLSAGLPVGPGFPD

Query:  WRAAGARLHLRASLDLPVAAISFQIARNTLWSRSWGPRPAEPEKALRVARNSLKRAPVLIPIFNHCYIPCNPPLAGNPIFFVDESRVFCCGFDLSDFFER
        WR+ GARLHLRA +DLPVAA+SFQIA+N+LW +SWG +P +PEKALRVARN+LKRAP+LIPIF+HCYIPCNP LAGNP+FF+DE+R+FCCG DLS+FFER
Subjt:  WRAAGARLHLRASLDLPVAAISFQIARNTLWSRSWGPRPAEPEKALRVARNSLKRAPVLIPIFNHCYIPCNPPLAGNPIFFVDESRVFCCGFDLSDFFER

Query:  ESLFRCSVSDPLFL-KQRSVSEKSAASSTTFSRRSVDAGVAK---TPRWVEFWSDATVDRRRRNS---SSSSSSSPDRFFEMPPRFEIPKWVGEYVGELG
        ES FR S   P  L KQRSVSEKSA SS+ FSRRS+D G A      RWVEFWSDA VDR RRNS   SSSSSSSPD      P+ E PKWV +YV  +G
Subjt:  ESLFRCSVSDPLFL-KQRSVSEKSAASSTTFSRRSVDAGVAK---TPRWVEFWSDATVDRRRRNS---SSSSSSSPDRFFEMPPRFEIPKWVGEYVGELG

Query:  SVLRGGGWSESEVAEMVEVSAAGFFDGEMVLLDNQAVFDALLLKVDRFSDSLRRAGWSSEEVSEAFGFDFRSEKERKPAKKLSAELVERIGKLAESVSRS
        SVLR GGWSES++ E++ VSA+GFF+GEMV++DNQ V D LLLK  R S+SLR++GWSSEEVS+A GFDFR EKERKP KKLS  LVE+  KLAE VS+S
Subjt:  SVLRGGGWSESEVAEMVEVSAAGFFDGEMVLLDNQAVFDALLLKVDRFSDSLRRAGWSSEEVSEAFGFDFRSEKERKPAKKLSAELVERIGKLAESVSRS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTCGACGTCGACCGGAGGATGACTGGTATGAACCCGGCCCATCTCGCCGGCCTCCGCCGTCTCTCCGCCCGAGCCGCCGCCGCCCCCTCCGCTCCCTCCACCCGCAA
CGGCCTCCTCTCCTTCGCTTCCCTCGCCGACAAGGTCCTCACCCATCTCACGAACTCCGGCGTCGAGGTCCAGCCGGGCCTCTCCGACGCCGAGTTCGCCCGAGCCGAGG
CCGAGTTCGGATTCGCCTTTCCCCCAGATCTCCGGGCAGTTCTCTCCGCCGGTCTGCCCGTCGGCCCGGGATTCCCCGATTGGCGCGCCGCCGGTGCCCGCCTCCATCTC
CGGGCATCTCTCGACCTCCCCGTCGCCGCAATTTCCTTCCAAATCGCCAGAAATACGCTCTGGTCCCGCTCCTGGGGCCCGCGACCCGCCGAACCCGAAAAGGCCCTTCG
GGTCGCCCGAAATTCGCTGAAGAGAGCCCCGGTTTTGATCCCCATTTTCAACCATTGCTACATTCCTTGCAACCCGCCGCTGGCCGGAAACCCAATTTTCTTCGTCGACG
AGAGCCGCGTGTTCTGCTGCGGGTTCGATTTGTCGGATTTCTTCGAACGGGAGTCGCTGTTTCGGTGCTCTGTTTCGGATCCCCTGTTTCTGAAACAGAGGTCCGTCAGC
GAGAAGTCCGCCGCATCGTCGACGACTTTCTCCCGGCGGAGCGTGGACGCCGGCGTCGCAAAGACGCCGCGGTGGGTCGAGTTTTGGAGCGATGCCACCGTGGATCGGCG
GCGGAGAAACTCGTCGTCTTCGTCATCGTCGTCGCCGGATCGATTCTTCGAGATGCCGCCGAGGTTCGAAATCCCCAAGTGGGTCGGGGAATACGTCGGAGAATTGGGAT
CAGTGTTGAGAGGCGGCGGGTGGAGCGAATCGGAGGTGGCGGAGATGGTGGAGGTTTCGGCGGCCGGATTCTTCGACGGCGAGATGGTTCTGTTGGATAATCAGGCGGTC
TTCGACGCTCTGCTTCTGAAAGTGGACCGGTTTTCGGATTCGCTGCGGCGGGCAGGGTGGAGCTCCGAGGAGGTTTCGGAGGCGTTCGGGTTCGATTTCCGGTCGGAGAA
GGAGAGGAAACCGGCTAAGAAGCTATCGGCAGAACTGGTGGAGAGGATCGGGAAACTGGCCGAGTCGGTTTCCCGGTCA
mRNA sequenceShow/hide mRNA sequence
ATGGTCGACGTCGACCGGAGGATGACTGGTATGAACCCGGCCCATCTCGCCGGCCTCCGCCGTCTCTCCGCCCGAGCCGCCGCCGCCCCCTCCGCTCCCTCCACCCGCAA
CGGCCTCCTCTCCTTCGCTTCCCTCGCCGACAAGGTCCTCACCCATCTCACGAACTCCGGCGTCGAGGTCCAGCCGGGCCTCTCCGACGCCGAGTTCGCCCGAGCCGAGG
CCGAGTTCGGATTCGCCTTTCCCCCAGATCTCCGGGCAGTTCTCTCCGCCGGTCTGCCCGTCGGCCCGGGATTCCCCGATTGGCGCGCCGCCGGTGCCCGCCTCCATCTC
CGGGCATCTCTCGACCTCCCCGTCGCCGCAATTTCCTTCCAAATCGCCAGAAATACGCTCTGGTCCCGCTCCTGGGGCCCGCGACCCGCCGAACCCGAAAAGGCCCTTCG
GGTCGCCCGAAATTCGCTGAAGAGAGCCCCGGTTTTGATCCCCATTTTCAACCATTGCTACATTCCTTGCAACCCGCCGCTGGCCGGAAACCCAATTTTCTTCGTCGACG
AGAGCCGCGTGTTCTGCTGCGGGTTCGATTTGTCGGATTTCTTCGAACGGGAGTCGCTGTTTCGGTGCTCTGTTTCGGATCCCCTGTTTCTGAAACAGAGGTCCGTCAGC
GAGAAGTCCGCCGCATCGTCGACGACTTTCTCCCGGCGGAGCGTGGACGCCGGCGTCGCAAAGACGCCGCGGTGGGTCGAGTTTTGGAGCGATGCCACCGTGGATCGGCG
GCGGAGAAACTCGTCGTCTTCGTCATCGTCGTCGCCGGATCGATTCTTCGAGATGCCGCCGAGGTTCGAAATCCCCAAGTGGGTCGGGGAATACGTCGGAGAATTGGGAT
CAGTGTTGAGAGGCGGCGGGTGGAGCGAATCGGAGGTGGCGGAGATGGTGGAGGTTTCGGCGGCCGGATTCTTCGACGGCGAGATGGTTCTGTTGGATAATCAGGCGGTC
TTCGACGCTCTGCTTCTGAAAGTGGACCGGTTTTCGGATTCGCTGCGGCGGGCAGGGTGGAGCTCCGAGGAGGTTTCGGAGGCGTTCGGGTTCGATTTCCGGTCGGAGAA
GGAGAGGAAACCGGCTAAGAAGCTATCGGCAGAACTGGTGGAGAGGATCGGGAAACTGGCCGAGTCGGTTTCCCGGTCA
Protein sequenceShow/hide protein sequence
MVDVDRRMTGMNPAHLAGLRRLSARAAAAPSAPSTRNGLLSFASLADKVLTHLTNSGVEVQPGLSDAEFARAEAEFGFAFPPDLRAVLSAGLPVGPGFPDWRAAGARLHL
RASLDLPVAAISFQIARNTLWSRSWGPRPAEPEKALRVARNSLKRAPVLIPIFNHCYIPCNPPLAGNPIFFVDESRVFCCGFDLSDFFERESLFRCSVSDPLFLKQRSVS
EKSAASSTTFSRRSVDAGVAKTPRWVEFWSDATVDRRRRNSSSSSSSSPDRFFEMPPRFEIPKWVGEYVGELGSVLRGGGWSESEVAEMVEVSAAGFFDGEMVLLDNQAV
FDALLLKVDRFSDSLRRAGWSSEEVSEAFGFDFRSEKERKPAKKLSAELVERIGKLAESVSRS