; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS003628 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS003628
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionSMI1_KNR4 domain-containing protein
Genome locationscaffold963:225878..227056
RNA-Seq ExpressionMS003628
SyntenyMS003628
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7035203.1 hypothetical protein SDJN02_01998, partial [Cucurbita argyrosperma subsp. argyrosperma]7.6e-19588.97Show/hide
Query:  MVDVDRRMTGLNPAHLAGLRRLSARAAAAPSAPST--RNGLLSFASLADKVLTHLTNSGVEVQPGLSDAEFARAEAEFGFAFPPDLRAVLSAGLPVGPGF
        MVDVDRRMTGLNPAHLAGLRRLSARAAA  +A S   RNGLLSF+SLAD+VLTHL NSGV+VQPGLSDAEFARAEAEF FAFPPDLRAVLSAGLPVGPGF
Subjt:  MVDVDRRMTGLNPAHLAGLRRLSARAAAAPSAPST--RNGLLSFASLADKVLTHLTNSGVEVQPGLSDAEFARAEAEFGFAFPPDLRAVLSAGLPVGPGF

Query:  PDWRAAGARLHLRASLDLPVAAISFQIARNTLWSRSWGPRPAEPEKALRVARNSLKRAPVLIPIFNHCYIPCNPPLAGNPIFFVDESRVFCCGFDLSDFF
        PDWR++GARLHLR+SLDLP+AAISFQIARNTLWSRSWGPRPAEPEKALRVARNSLKRAPVLIPIFNHCYIPCNPPLAGNPIFFVDESRV CCGFDLSDFF
Subjt:  PDWRAAGARLHLRASLDLPVAAISFQIARNTLWSRSWGPRPAEPEKALRVARNSLKRAPVLIPIFNHCYIPCNPPLAGNPIFFVDESRVFCCGFDLSDFF

Query:  ERESLFRCSVSD----PLFLKQRSVSEKSAASSTTFSRRSVDAGVAKTPRWVEFWSDATVDRRRRNSSSSSSSSPDRFFEMPPRFEIPKWVGEYVGELGS
        ERESLFRCS SD    PLF KQRS++EKS  SST FSRRS+D+GV KTPRWVEFWSDA VDRRRRNSSSSS+SSPDRFFE+ PR EIPKWVGEY+GELGS
Subjt:  ERESLFRCSVSD----PLFLKQRSVSEKSAASSTTFSRRSVDAGVAKTPRWVEFWSDATVDRRRRNSSSSSSSSPDRFFEMPPRFEIPKWVGEYVGELGS

Query:  VLRGGGWSESEVAEMVEVSAAGFFDGEMVLLDNQAVFDALLLKVDRFSDSLRRAGWSSEEVSEAFGFDFRSEKERKPAKKLSAELVERIGKLAESVSRS
        VLR GGWSESEVAEMV+VSA+GFFDGEMV+LDNQAV DALLLKVDRFS SLRRAGWSSEEVSEAFGFDFR EKERKPAKKLSAELVERIGKLAESVSRS
Subjt:  VLRGGGWSESEVAEMVEVSAAGFFDGEMVLLDNQAVFDALLLKVDRFSDSLRRAGWSSEEVSEAFGFDFRSEKERKPAKKLSAELVERIGKLAESVSRS

POO02805.1 SMI1/KNR4-like domain containing protein [Trema orientale]1.5e-18282.67Show/hide
Query:  MVDVDRRMTGLNPAHLAGLRRLSARAAAAPS-----APS--TRNGLLSFASLADKVLTHLTNSGVEVQPGLSDAEFARAEAEFGFAFPPDLRAVLSAGLP
        MVDVDRRMTGLNPAH+AGLRRLSARAAAAPS      PS   RNGLLSF+SLA+KV+THL NSG++VQPGLSDAEFARAEAEFGFAFPPDLRAVLSAGLP
Subjt:  MVDVDRRMTGLNPAHLAGLRRLSARAAAAPS-----APS--TRNGLLSFASLADKVLTHLTNSGVEVQPGLSDAEFARAEAEFGFAFPPDLRAVLSAGLP

Query:  VGPGFPDWRAAGARLHLRASLDLPVAAISFQIARNTLWSRSWGPRPAEPEKALRVARNSLKRAPVLIPIFNHCYIPCNPPLAGNPIFFVDESRVFCCGFD
        VGPGFPDWRA GARLHLRASLDLP+AAISFQIARNTLWS+SWGPRP++PE+ALRVARN+LKRAP+LIPIFNHCYIPCNP LAGNPIFFVDE+R+FCCG D
Subjt:  VGPGFPDWRAAGARLHLRASLDLPVAAISFQIARNTLWSRSWGPRPAEPEKALRVARNSLKRAPVLIPIFNHCYIPCNPPLAGNPIFFVDESRVFCCGFD

Query:  LSDFFERESLFRCSVSDPLFL-KQRSVSEKSAASSTTFSRRSVDAGV---AKTPRWVEFWSDATVDRRRRNSSSSSSSSPDRFFEMPPRFEIPKWVGEYV
        LSDFFERESLFR S SDP  L KQRSVSEKSA SS+ FSRRS+D G    A+TPRWVEFWSDA VDRRRRNSSSSSSSSP+RFF+M PR EIPKWV EYV
Subjt:  LSDFFERESLFRCSVSDPLFL-KQRSVSEKSAASSTTFSRRSVDAGV---AKTPRWVEFWSDATVDRRRRNSSSSSSSSPDRFFEMPPRFEIPKWVGEYV

Query:  GELGSVLRGGGWSESEVAEMVEVSAAGFFDGEMVLLDNQAVFDALLLKVDRFSDSLRRAGWSSEEVSEAFGFDFRSEKERKPAKKLSAELVERIGKLAES
         ++GSVLR GGWSES+V+E++EVSA+GFF+GEM+L+DNQAV DALLLK DRFSDSLR+AGWSSEEVS+A GFDFR EKE+KP KKLS ELVERIGKLAES
Subjt:  GELGSVLRGGGWSESEVAEMVEVSAAGFFDGEMVLLDNQAVFDALLLKVDRFSDSLRRAGWSSEEVSEAFGFDFRSEKERKPAKKLSAELVERIGKLAES

Query:  VSRS
        VSRS
Subjt:  VSRS

XP_004149705.1 uncharacterized protein LOC101213140 [Cucumis sativus]6.7e-18384.81Show/hide
Query:  MVDVDRRMTGLNPAHLAGLRRLSARAAAAPSAPSTRNGLLSFASLADKVLTHLTNSGVEVQPGLSDAEFARAEAEFGFAFPPDLRAVLSAGLPVGPGFPD
        MVDVDRRM+ LNPAH+AGLRRLSARAAA  SAP  RN LLSF+SLADKVLTHL NSGV+VQPGLSDAEFARAEAEF F+FPPDLRAVLSAGLPVGPGFPD
Subjt:  MVDVDRRMTGLNPAHLAGLRRLSARAAAAPSAPSTRNGLLSFASLADKVLTHLTNSGVEVQPGLSDAEFARAEAEFGFAFPPDLRAVLSAGLPVGPGFPD

Query:  WRAAGARLHLRASLDLPVAAISFQIARNTLWSRSWGPRPAEPEKALRVARNSLKRAPVLIPIFNHCYIPCNPPLAGNPIFFVDESRVFCCGFDLSDFFER
        WR+AGARLHLR+SLDLP+AAISFQIA+NTLWS SWG +PAEPEKALR+ARN LKRAPVLIPIFNHCYIPCNPPLAGNPIFFVDE+RV CCGFDLSDFFER
Subjt:  WRAAGARLHLRASLDLPVAAISFQIARNTLWSRSWGPRPAEPEKALRVARNSLKRAPVLIPIFNHCYIPCNPPLAGNPIFFVDESRVFCCGFDLSDFFER

Query:  ESLFRCSV--SDPLFLKQRSVSEKSAASSTTFSRRSVDAGVAKTPRWVEFWSDATVDRRRRNSSSSSSSSPDRFFEMPPRFEIPKWVGEYVGELGSVLRG
        ESLFRCSV  SD LF KQ S+++KS   S  FSRRS+D+GV +TPRWVEFWSDA +DRRRRNSSSSS+SSPDRFFEM PR E+PKWVG+Y+ ELGSVLR 
Subjt:  ESLFRCSV--SDPLFLKQRSVSEKSAASSTTFSRRSVDAGVAKTPRWVEFWSDATVDRRRRNSSSSSSSSPDRFFEMPPRFEIPKWVGEYVGELGSVLRG

Query:  GGWSESEVAEMVEVSAAGFFDGEMVLLDNQAVFDALLLKVDRFSDSLRRAGWSSEEVSEAFGFDFRSEKERKPAKKLSAELVERIGKLAESVSRS
        GGWSESEVAEMVEVSAAG FDGEMV+LDNQAV DALLLKVDRFS SLRR+GWSSEEVSEAFGFDFR EK +K AKKLSAELVERIGKLAESVSRS
Subjt:  GGWSESEVAEMVEVSAAGFFDGEMVLLDNQAVFDALLLKVDRFSDSLRRAGWSSEEVSEAFGFDFRSEKERKPAKKLSAELVERIGKLAESVSRS

XP_022947736.1 uncharacterized protein LOC111451511 [Cucurbita moschata]5.8e-19588.97Show/hide
Query:  MVDVDRRMTGLNPAHLAGLRRLSARAAAAPSAPST--RNGLLSFASLADKVLTHLTNSGVEVQPGLSDAEFARAEAEFGFAFPPDLRAVLSAGLPVGPGF
        MVDVDRRMTGLNPAHLAGLRRLSARAAA  +A S   RNGLLSF+SLAD+VLTHL NSGV+VQPGLSDAEFARAEAEF FAFPPDLRAVLSAGLPVGPGF
Subjt:  MVDVDRRMTGLNPAHLAGLRRLSARAAAAPSAPST--RNGLLSFASLADKVLTHLTNSGVEVQPGLSDAEFARAEAEFGFAFPPDLRAVLSAGLPVGPGF

Query:  PDWRAAGARLHLRASLDLPVAAISFQIARNTLWSRSWGPRPAEPEKALRVARNSLKRAPVLIPIFNHCYIPCNPPLAGNPIFFVDESRVFCCGFDLSDFF
        PDWR++GARLHLR+SLDLP+AAISFQIARNTLWSRSWGPRPAEPEKALRVARNSLKRAPVLIPIFNHCYIPCNPPLAGNPIFFVDESRV CCGFDLSDFF
Subjt:  PDWRAAGARLHLRASLDLPVAAISFQIARNTLWSRSWGPRPAEPEKALRVARNSLKRAPVLIPIFNHCYIPCNPPLAGNPIFFVDESRVFCCGFDLSDFF

Query:  ERESLFRCSVSD----PLFLKQRSVSEKSAASSTTFSRRSVDAGVAKTPRWVEFWSDATVDRRRRNSSSSSSSSPDRFFEMPPRFEIPKWVGEYVGELGS
        ERESLFRCS SD    PLF KQRS++EKS  SST FSRRS+D+GV KTPRWVEFWSDA VDRRRRNSSSSS+SSPDRFFE+ PR EIPKWVGEY+GELGS
Subjt:  ERESLFRCSVSD----PLFLKQRSVSEKSAASSTTFSRRSVDAGVAKTPRWVEFWSDATVDRRRRNSSSSSSSSPDRFFEMPPRFEIPKWVGEYVGELGS

Query:  VLRGGGWSESEVAEMVEVSAAGFFDGEMVLLDNQAVFDALLLKVDRFSDSLRRAGWSSEEVSEAFGFDFRSEKERKPAKKLSAELVERIGKLAESVSRS
        VLR GGWSESEVAEMV+VSA+GFFDGEMV+LDNQAV DALLLKVDRFS SLRRAGWSSEEVSEAFGFDFR EKERKPAKKLSAELVERIGKLAESVSRS
Subjt:  VLRGGGWSESEVAEMVEVSAAGFFDGEMVLLDNQAVFDALLLKVDRFSDSLRRAGWSSEEVSEAFGFDFRSEKERKPAKKLSAELVERIGKLAESVSRS

XP_038902056.1 uncharacterized protein LOC120088705 [Benincasa hispida]6.4e-18685.89Show/hide
Query:  MVDVDRRMTGLNPAHLAGLRRLSARAAAAPSAPSTRNGLLSFASLADKVLTHLTNSGVEVQPGLSDAEFARAEAEFGFAFPPDLRAVLSAGLPVGPGFPD
        MVDVDRRM+ LNPAH+AGLRRLSARAAA  SAP  RNGLLSF+SLADKVLTHL NSGV+VQPGLSDAEFARAEAEF F+FPPDLRAVLSAGLPVGPGFPD
Subjt:  MVDVDRRMTGLNPAHLAGLRRLSARAAAAPSAPSTRNGLLSFASLADKVLTHLTNSGVEVQPGLSDAEFARAEAEFGFAFPPDLRAVLSAGLPVGPGFPD

Query:  WRAAGARLHLRASLDLPVAAISFQIARNTLWSRSWGPRPAEPEKALRVARNSLKRAPVLIPIFNHCYIPCNPPLAGNPIFFVDESRVFCCGFDLSDFFER
        WR+AGARLHLR+SLDLP+AAISFQIA+NTLWS SWG +PAEPEKALR+ARN LKRAPVLIPIFNHCYIPCNPPLAGNPIFFVDE+RV CCG DLSDFFER
Subjt:  WRAAGARLHLRASLDLPVAAISFQIARNTLWSRSWGPRPAEPEKALRVARNSLKRAPVLIPIFNHCYIPCNPPLAGNPIFFVDESRVFCCGFDLSDFFER

Query:  ESLFRCSV----SDPLFLKQRSVSEKSAASSTTFSRRSVDAGVAKTPRWVEFWSDATVDRRRRNSSSSSSSSPDRFFEMPPRFEIPKWVGEYVGELGSVL
        ESLFRCSV    SDPLF KQRS++EKS   S  FSRRSVD+GV +TPRWVEFWSDA +DRRRRNSSSSS+SSPDRF EM PR EIPKWVG+Y+GELGSVL
Subjt:  ESLFRCSV----SDPLFLKQRSVSEKSAASSTTFSRRSVDAGVAKTPRWVEFWSDATVDRRRRNSSSSSSSSPDRFFEMPPRFEIPKWVGEYVGELGSVL

Query:  RGGGWSESEVAEMVEVSAAGFFDGEMVLLDNQAVFDALLLKVDRFSDSLRRAGWSSEEVSEAFGFDFRSEKERKPAKKLSAELVERIGKLAESVSRS
        R GGWSESEVAEMVEVSAAGFFD EMV+LDNQAV DALLLKVDRFS SLRR+GWSSEEVSEAFGFDFR EK RK AKKLSAELVERIGKLAESVSRS
Subjt:  RGGGWSESEVAEMVEVSAAGFFDGEMVLLDNQAVFDALLLKVDRFSDSLRRAGWSSEEVSEAFGFDFRSEKERKPAKKLSAELVERIGKLAESVSRS

TrEMBL top hitse value%identityAlignment
A0A0A0LMG8 Uncharacterized protein3.2e-18384.81Show/hide
Query:  MVDVDRRMTGLNPAHLAGLRRLSARAAAAPSAPSTRNGLLSFASLADKVLTHLTNSGVEVQPGLSDAEFARAEAEFGFAFPPDLRAVLSAGLPVGPGFPD
        MVDVDRRM+ LNPAH+AGLRRLSARAAA  SAP  RN LLSF+SLADKVLTHL NSGV+VQPGLSDAEFARAEAEF F+FPPDLRAVLSAGLPVGPGFPD
Subjt:  MVDVDRRMTGLNPAHLAGLRRLSARAAAAPSAPSTRNGLLSFASLADKVLTHLTNSGVEVQPGLSDAEFARAEAEFGFAFPPDLRAVLSAGLPVGPGFPD

Query:  WRAAGARLHLRASLDLPVAAISFQIARNTLWSRSWGPRPAEPEKALRVARNSLKRAPVLIPIFNHCYIPCNPPLAGNPIFFVDESRVFCCGFDLSDFFER
        WR+AGARLHLR+SLDLP+AAISFQIA+NTLWS SWG +PAEPEKALR+ARN LKRAPVLIPIFNHCYIPCNPPLAGNPIFFVDE+RV CCGFDLSDFFER
Subjt:  WRAAGARLHLRASLDLPVAAISFQIARNTLWSRSWGPRPAEPEKALRVARNSLKRAPVLIPIFNHCYIPCNPPLAGNPIFFVDESRVFCCGFDLSDFFER

Query:  ESLFRCSV--SDPLFLKQRSVSEKSAASSTTFSRRSVDAGVAKTPRWVEFWSDATVDRRRRNSSSSSSSSPDRFFEMPPRFEIPKWVGEYVGELGSVLRG
        ESLFRCSV  SD LF KQ S+++KS   S  FSRRS+D+GV +TPRWVEFWSDA +DRRRRNSSSSS+SSPDRFFEM PR E+PKWVG+Y+ ELGSVLR 
Subjt:  ESLFRCSV--SDPLFLKQRSVSEKSAASSTTFSRRSVDAGVAKTPRWVEFWSDATVDRRRRNSSSSSSSSPDRFFEMPPRFEIPKWVGEYVGELGSVLRG

Query:  GGWSESEVAEMVEVSAAGFFDGEMVLLDNQAVFDALLLKVDRFSDSLRRAGWSSEEVSEAFGFDFRSEKERKPAKKLSAELVERIGKLAESVSRS
        GGWSESEVAEMVEVSAAG FDGEMV+LDNQAV DALLLKVDRFS SLRR+GWSSEEVSEAFGFDFR EK +K AKKLSAELVERIGKLAESVSRS
Subjt:  GGWSESEVAEMVEVSAAGFFDGEMVLLDNQAVFDALLLKVDRFSDSLRRAGWSSEEVSEAFGFDFRSEKERKPAKKLSAELVERIGKLAESVSRS

A0A2P5FYD4 SMI1/KNR4-like domain containing protein7.2e-18382.67Show/hide
Query:  MVDVDRRMTGLNPAHLAGLRRLSARAAAAPS-----APS--TRNGLLSFASLADKVLTHLTNSGVEVQPGLSDAEFARAEAEFGFAFPPDLRAVLSAGLP
        MVDVDRRMTGLNPAH+AGLRRLSARAAAAPS      PS   RNGLLSF+SLA+KV+THL NSG++VQPGLSDAEFARAEAEFGFAFPPDLRAVLSAGLP
Subjt:  MVDVDRRMTGLNPAHLAGLRRLSARAAAAPS-----APS--TRNGLLSFASLADKVLTHLTNSGVEVQPGLSDAEFARAEAEFGFAFPPDLRAVLSAGLP

Query:  VGPGFPDWRAAGARLHLRASLDLPVAAISFQIARNTLWSRSWGPRPAEPEKALRVARNSLKRAPVLIPIFNHCYIPCNPPLAGNPIFFVDESRVFCCGFD
        VGPGFPDWRA GARLHLRASLDLP+AAISFQIARNTLWS+SWGPRP++PE+ALRVARN+LKRAP+LIPIFNHCYIPCNP LAGNPIFFVDE+R+FCCG D
Subjt:  VGPGFPDWRAAGARLHLRASLDLPVAAISFQIARNTLWSRSWGPRPAEPEKALRVARNSLKRAPVLIPIFNHCYIPCNPPLAGNPIFFVDESRVFCCGFD

Query:  LSDFFERESLFRCSVSDPLFL-KQRSVSEKSAASSTTFSRRSVDAGV---AKTPRWVEFWSDATVDRRRRNSSSSSSSSPDRFFEMPPRFEIPKWVGEYV
        LSDFFERESLFR S SDP  L KQRSVSEKSA SS+ FSRRS+D G    A+TPRWVEFWSDA VDRRRRNSSSSSSSSP+RFF+M PR EIPKWV EYV
Subjt:  LSDFFERESLFRCSVSDPLFL-KQRSVSEKSAASSTTFSRRSVDAGV---AKTPRWVEFWSDATVDRRRRNSSSSSSSSPDRFFEMPPRFEIPKWVGEYV

Query:  GELGSVLRGGGWSESEVAEMVEVSAAGFFDGEMVLLDNQAVFDALLLKVDRFSDSLRRAGWSSEEVSEAFGFDFRSEKERKPAKKLSAELVERIGKLAES
         ++GSVLR GGWSES+V+E++EVSA+GFF+GEM+L+DNQAV DALLLK DRFSDSLR+AGWSSEEVS+A GFDFR EKE+KP KKLS ELVERIGKLAES
Subjt:  GELGSVLRGGGWSESEVAEMVEVSAAGFFDGEMVLLDNQAVFDALLLKVDRFSDSLRRAGWSSEEVSEAFGFDFRSEKERKPAKKLSAELVERIGKLAES

Query:  VSRS
        VSRS
Subjt:  VSRS

A0A6J1G799 uncharacterized protein LOC1114515112.8e-19588.97Show/hide
Query:  MVDVDRRMTGLNPAHLAGLRRLSARAAAAPSAPST--RNGLLSFASLADKVLTHLTNSGVEVQPGLSDAEFARAEAEFGFAFPPDLRAVLSAGLPVGPGF
        MVDVDRRMTGLNPAHLAGLRRLSARAAA  +A S   RNGLLSF+SLAD+VLTHL NSGV+VQPGLSDAEFARAEAEF FAFPPDLRAVLSAGLPVGPGF
Subjt:  MVDVDRRMTGLNPAHLAGLRRLSARAAAAPSAPST--RNGLLSFASLADKVLTHLTNSGVEVQPGLSDAEFARAEAEFGFAFPPDLRAVLSAGLPVGPGF

Query:  PDWRAAGARLHLRASLDLPVAAISFQIARNTLWSRSWGPRPAEPEKALRVARNSLKRAPVLIPIFNHCYIPCNPPLAGNPIFFVDESRVFCCGFDLSDFF
        PDWR++GARLHLR+SLDLP+AAISFQIARNTLWSRSWGPRPAEPEKALRVARNSLKRAPVLIPIFNHCYIPCNPPLAGNPIFFVDESRV CCGFDLSDFF
Subjt:  PDWRAAGARLHLRASLDLPVAAISFQIARNTLWSRSWGPRPAEPEKALRVARNSLKRAPVLIPIFNHCYIPCNPPLAGNPIFFVDESRVFCCGFDLSDFF

Query:  ERESLFRCSVSD----PLFLKQRSVSEKSAASSTTFSRRSVDAGVAKTPRWVEFWSDATVDRRRRNSSSSSSSSPDRFFEMPPRFEIPKWVGEYVGELGS
        ERESLFRCS SD    PLF KQRS++EKS  SST FSRRS+D+GV KTPRWVEFWSDA VDRRRRNSSSSS+SSPDRFFE+ PR EIPKWVGEY+GELGS
Subjt:  ERESLFRCSVSD----PLFLKQRSVSEKSAASSTTFSRRSVDAGVAKTPRWVEFWSDATVDRRRRNSSSSSSSSPDRFFEMPPRFEIPKWVGEYVGELGS

Query:  VLRGGGWSESEVAEMVEVSAAGFFDGEMVLLDNQAVFDALLLKVDRFSDSLRRAGWSSEEVSEAFGFDFRSEKERKPAKKLSAELVERIGKLAESVSRS
        VLR GGWSESEVAEMV+VSA+GFFDGEMV+LDNQAV DALLLKVDRFS SLRRAGWSSEEVSEAFGFDFR EKERKPAKKLSAELVERIGKLAESVSRS
Subjt:  VLRGGGWSESEVAEMVEVSAAGFFDGEMVLLDNQAVFDALLLKVDRFSDSLRRAGWSSEEVSEAFGFDFRSEKERKPAKKLSAELVERIGKLAESVSRS

A0A6J1L6H7 uncharacterized protein LOC1114996242.8e-19588.97Show/hide
Query:  MVDVDRRMTGLNPAHLAGLRRLSARAAAAPSAPST--RNGLLSFASLADKVLTHLTNSGVEVQPGLSDAEFARAEAEFGFAFPPDLRAVLSAGLPVGPGF
        MVDVDRRMTGLNPAHLAGLRRLSARAAA  +A S   RNGLLSF+SLAD+VLTHL NSGV+VQPGLSDAEFARAEAEF FAFPPDLRAVLSAGLPVGPGF
Subjt:  MVDVDRRMTGLNPAHLAGLRRLSARAAAAPSAPST--RNGLLSFASLADKVLTHLTNSGVEVQPGLSDAEFARAEAEFGFAFPPDLRAVLSAGLPVGPGF

Query:  PDWRAAGARLHLRASLDLPVAAISFQIARNTLWSRSWGPRPAEPEKALRVARNSLKRAPVLIPIFNHCYIPCNPPLAGNPIFFVDESRVFCCGFDLSDFF
        PDWR++GARLHLR+SLDLP+AAISFQIARNTLWSRSWGPRPAEPEKALRVARNSLKRAPVLIPIFNHCYIPCNPPLAGNPIFFVDESRV CCGFDLSDFF
Subjt:  PDWRAAGARLHLRASLDLPVAAISFQIARNTLWSRSWGPRPAEPEKALRVARNSLKRAPVLIPIFNHCYIPCNPPLAGNPIFFVDESRVFCCGFDLSDFF

Query:  ERESLFRCSVSD----PLFLKQRSVSEKSAASSTTFSRRSVDAGVAKTPRWVEFWSDATVDRRRRNSSSSSSSSPDRFFEMPPRFEIPKWVGEYVGELGS
        ERESLFRCS SD    PLF KQRS++EKS  SST FSRRS+D+GV KTPRWVEFWSDA VDRRRRNSSSSS+SSPDRFFE+ PR EIPKWVGEY+GELGS
Subjt:  ERESLFRCSVSD----PLFLKQRSVSEKSAASSTTFSRRSVDAGVAKTPRWVEFWSDATVDRRRRNSSSSSSSSPDRFFEMPPRFEIPKWVGEYVGELGS

Query:  VLRGGGWSESEVAEMVEVSAAGFFDGEMVLLDNQAVFDALLLKVDRFSDSLRRAGWSSEEVSEAFGFDFRSEKERKPAKKLSAELVERIGKLAESVSRS
        VLR GGWSESEVAEMV+VSA+GFFDGEMV+LDNQAV DALLLKVDRFS SLRRAGWSSEEVSEAFGFDFR EKERKPAKKLSAELVERIGKLAESVSRS
Subjt:  VLRGGGWSESEVAEMVEVSAAGFFDGEMVLLDNQAVFDALLLKVDRFSDSLRRAGWSSEEVSEAFGFDFRSEKERKPAKKLSAELVERIGKLAESVSRS

A0A6P4AVM2 uncharacterized protein LOC1074247981.2e-18281.68Show/hide
Query:  MVDVDRRMTGLNPAHLAGLRRLSARAAAAPSAPST-------RNGLLSFASLADKVLTHLTNSGVEVQPGLSDAEFARAEAEFGFAFPPDLRAVLSAGLP
        MVDVDRRMTGLNPAH+AGLRRLSARAAAAPS   T       RNGL SF+SLA+KV++HL NSG++VQPGLSDAEFARAEAEFGFAFPPDLRAVLSAGLP
Subjt:  MVDVDRRMTGLNPAHLAGLRRLSARAAAAPSAPST-------RNGLLSFASLADKVLTHLTNSGVEVQPGLSDAEFARAEAEFGFAFPPDLRAVLSAGLP

Query:  VGPGFPDWRAAGARLHLRASLDLPVAAISFQIARNTLWSRSWGPRPAEPEKALRVARNSLKRAPVLIPIFNHCYIPCNPPLAGNPIFFVDESRVFCCGFD
        VGPGFPDWR+AGARLHLRASLDLP+AAISFQIARNTLWS+SWGPRP++PE+ALRVARN+LKRAP+LIPIFNHCYIPCNP LAGNPIFFVDE+R+FCCG D
Subjt:  VGPGFPDWRAAGARLHLRASLDLPVAAISFQIARNTLWSRSWGPRPAEPEKALRVARNSLKRAPVLIPIFNHCYIPCNPPLAGNPIFFVDESRVFCCGFD

Query:  LSDFFERESLFRCSVSDP-LFLKQRSVSEKSAASSTTFSRRSVDAGV---AKTPRWVEFWSDATVDRRRRNSSSSSSSSPDRFFEMPPRFEIPKWVGEYV
        LSDFF+RESLFR S SDP L  KQRSVSEKSA SS+ FSRRS+D+G    A+TPRWVEFWSDA VDRRRRNSSSSSSSSP+RFF+M PR EIP WV EY+
Subjt:  LSDFFERESLFRCSVSDP-LFLKQRSVSEKSAASSTTFSRRSVDAGV---AKTPRWVEFWSDATVDRRRRNSSSSSSSSPDRFFEMPPRFEIPKWVGEYV

Query:  GELGSVLRGGGWSESEVAEMVEVSAAGFFDGEMVLLDNQAVFDALLLKVDRFSDSLRRAGWSSEEVSEAFGFDFRSEKERKPAKKLSAELVERIGKLAES
        G++GSVLR GGW+ES+++E+V VSA+GFF+GEMVLLDNQAV DALLLK DRFSDSLR+AGWSSEEVS+A GFDFR EKE+KPAKKLS ELVERIGKLAES
Subjt:  GELGSVLRGGGWSESEVAEMVEVSAAGFFDGEMVLLDNQAVFDALLLKVDRFSDSLRRAGWSSEEVSEAFGFDFRSEKERKPAKKLSAELVERIGKLAES

Query:  VSRS
        VSRS
Subjt:  VSRS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G22790.1 unknown protein1.2e-2537.69Show/hide
Query:  TNSGVEVQPGLSDAEFARAEAEFGFAFPPDLRAVLSAGLPVGPGFPDWRAAGARLHLRASLDLPVAAISFQIARNTLWSRSWGPRPAEPEKALRVARNSL
        + +G  V PGL++ E +  E+  GF+FP DLR++L  GLPVG  FP+WR    R +L     LP+  +S  + RN  W  SWG RP    +AL + +  +
Subjt:  TNSGVEVQPGLSDAEFARAEAEFGFAFPPDLRAVLSAGLPVGPGFPDWRAAGARLHLRASLDLPVAAISFQIARNTLWSRSWGPRPAEPEKALRVARNSL

Query:  KRAPVLIPIFNHCYIPCNPP-LAGNPIFFVDESRVFCCGFDLSDFFERESLFRCSVSDPLFLKQRSVSEKSAASSTTFSRRSVDAGVAKTPRWVEFWSD
        + APVL+P++   Y+P   P LAGNP+F +D   V            RE    C V    FLK    SE     + T  RR       + PR VEFWSD
Subjt:  KRAPVLIPIFNHCYIPCNPP-LAGNPIFFVDESRVFCCGFDLSDFFERESLFRCSVSDPLFLKQRSVSEKSAASSTTFSRRSVDAGVAKTPRWVEFWSD

AT3G50340.1 unknown protein3.5e-16673.58Show/hide
Query:  MVDVDRRMTGLNPAHLAGLRRLSARAAAAPSAPSTRNGLLSFASLADKVLTHLTNSGVEVQPGLSDAEFARAEAEFGFAFPPDLRAVLSAGLPVGPGFPD
        MVDVDRRMTGL PAH AGLRRLSAR AAAP+ P+ RN L+SF+SLAD+V++HL  S ++VQPGL+D+EFARAEAEF FAFPPDLRAVL+AGLPVG GFPD
Subjt:  MVDVDRRMTGLNPAHLAGLRRLSARAAAAPSAPSTRNGLLSFASLADKVLTHLTNSGVEVQPGLSDAEFARAEAEFGFAFPPDLRAVLSAGLPVGPGFPD

Query:  WRAAGARLHLRASLDLPVAAISFQIARNTLWSRSWGPRPAEPEKALRVARNSLKRAPVLIPIFNHCYIPCNPPLAGNPIFFVDESRVFCCGFDLSDFFER
        WR+ GARLHLRA +DLP+AA+SFQIARNTLWS+SWG RP++PEKALRVARN+LKRAP++IPIF+HCYIPCNP LAGNP+F++DE+R+FCCG DLSDFFER
Subjt:  WRAAGARLHLRASLDLPVAAISFQIARNTLWSRSWGPRPAEPEKALRVARNSLKRAPVLIPIFNHCYIPCNPPLAGNPIFFVDESRVFCCGFDLSDFFER

Query:  ESLFRCSVSDPLFL-KQRSVSEKSA----ASSTTFSRRSVDAG---VAKTPRWVEFWSDATVDRRRRNS----SSSSSSSPDRFFEMPPRFEIPKWVGEY
        ES+FR S + P+ L KQRSVSEKSA    +SS+ FSR S+D+G    + TPRWVEFWSDA VDRRRRNS    SSS SSSP+R+ ++ PR E PKWV +Y
Subjt:  ESLFRCSVSDPLFL-KQRSVSEKSA----ASSTTFSRRSVDAG---VAKTPRWVEFWSDATVDRRRRNS----SSSSSSSPDRFFEMPPRFEIPKWVGEY

Query:  VGELGSVLRGGGWSESEVAEMVEVSAAGFFDGEMVLLDNQAVFDALLLKVDRFSDSLRRAGWSSEEVSEAFGFDFRSEKERKPAKKLSAELVERIGKLAE
        V  +GSVLRGGGWSES+V ++V VSA+GFF+GEMV+LDNQAV DALLLK  RFS+SLR+AGWSSEEVS+A GFDFR EKE+KP KKLS ELV+RIGKLAE
Subjt:  VGELGSVLRGGGWSESEVAEMVEVSAAGFFDGEMVLLDNQAVFDALLLKVDRFSDSLRRAGWSSEEVSEAFGFDFRSEKERKPAKKLSAELVERIGKLAE

Query:  SVSRS
        SVSRS
Subjt:  SVSRS

AT5G67020.1 unknown protein5.7e-15670.75Show/hide
Query:  MVDVDRRMTGLNPAHLAGLRRLSARAAAAPSAPSTRNGLLSFASLADKVLTHLTNSGVEVQPGLSDAEFARAEAEFGFAFPPDLRAVLSAGLPVGPGFPD
        MVDVDRRMTGL PAH AGLRRLSAR AAAPS P+ RN L SF+  ADKV+ HL NSG+++QPGLSD EFAR EAEFGF FPPDLR +LSAGL VG GFPD
Subjt:  MVDVDRRMTGLNPAHLAGLRRLSARAAAAPSAPSTRNGLLSFASLADKVLTHLTNSGVEVQPGLSDAEFARAEAEFGFAFPPDLRAVLSAGLPVGPGFPD

Query:  WRAAGARLHLRASLDLPVAAISFQIARNTLWSRSWGPRPAEPEKALRVARNSLKRAPVLIPIFNHCYIPCNPPLAGNPIFFVDESRVFCCGFDLSDFFER
        WR+ GARLHLRA +DLPVAA+SFQIA+N+LW +SWG +P +PEKALRVARN+LKRAP+LIPIF+HCYIPCNP LAGNP+FF+DE+R+FCCG DLS+FFER
Subjt:  WRAAGARLHLRASLDLPVAAISFQIARNTLWSRSWGPRPAEPEKALRVARNSLKRAPVLIPIFNHCYIPCNPPLAGNPIFFVDESRVFCCGFDLSDFFER

Query:  ESLFRCSVSDPLFL-KQRSVSEKSAASSTTFSRRSVDAGVAK---TPRWVEFWSDATVDRRRRNS---SSSSSSSPDRFFEMPPRFEIPKWVGEYVGELG
        ES FR S   P  L KQRSVSEKSA SS+ FSRRS+D G A      RWVEFWSDA VDR RRNS   SSSSSSSPD      P+ E PKWV +YV  +G
Subjt:  ESLFRCSVSDPLFL-KQRSVSEKSAASSTTFSRRSVDAGVAK---TPRWVEFWSDATVDRRRRNS---SSSSSSSPDRFFEMPPRFEIPKWVGEYVGELG

Query:  SVLRGGGWSESEVAEMVEVSAAGFFDGEMVLLDNQAVFDALLLKVDRFSDSLRRAGWSSEEVSEAFGFDFRSEKERKPAKKLSAELVERIGKLAESVSRS
        SVLR GGWSES++ E++ VSA+GFF+GEMV++DNQ V D LLLK  R S+SLR++GWSSEEVS+A GFDFR EKERKP KKLS  LVE+  KLAE VS+S
Subjt:  SVLRGGGWSESEVAEMVEVSAAGFFDGEMVLLDNQAVFDALLLKVDRFSDSLRRAGWSSEEVSEAFGFDFRSEKERKPAKKLSAELVERIGKLAESVSRS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTCGACGTCGACCGGAGGATGACTGGTCTGAACCCGGCCCATCTCGCCGGCCTCCGCCGTCTCTCCGCCCGAGCCGCCGCCGCCCCCTCCGCTCCCTCCACCCGCAA
CGGCCTCCTCTCCTTCGCTTCCCTCGCCGACAAGGTCCTAACCCATCTCACGAACTCCGGCGTCGAGGTCCAGCCGGGCCTCTCCGACGCCGAGTTCGCCCGAGCCGAGG
CCGAGTTCGGATTCGCCTTTCCCCCAGATCTCCGGGCAGTTCTCTCCGCCGGTCTGCCCGTCGGCCCGGGATTCCCCGATTGGCGCGCCGCCGGTGCCCGCCTCCATCTC
CGGGCATCTCTCGACCTCCCCGTCGCCGCAATTTCCTTCCAAATCGCCAGAAATACGCTCTGGTCCCGCTCCTGGGGCCCGCGACCCGCCGAACCCGAAAAGGCCCTTCG
GGTCGCCCGAAATTCGCTGAAGAGAGCCCCGGTTTTGATCCCCATTTTCAACCATTGCTACATTCCTTGCAACCCGCCGCTGGCCGGAAACCCAATTTTCTTCGTCGACG
AGAGCCGCGTGTTCTGCTGCGGGTTCGATTTGTCGGATTTCTTCGAACGGGAGTCGCTGTTTCGGTGCTCTGTTTCGGATCCCCTGTTTCTGAAACAGAGGTCCGTCAGC
GAGAAGTCCGCCGCATCGTCGACGACTTTCTCCCGGCGGAGCGTGGACGCCGGCGTCGCAAAGACGCCGCGGTGGGTCGAGTTTTGGAGCGACGCCACCGTGGATCGGCG
GCGGAGAAACTCGTCGTCTTCGTCATCGTCGTCGCCGGATCGATTCTTCGAGATGCCGCCGAGGTTCGAAATCCCCAAGTGGGTCGGGGAATACGTTGGAGAATTGGGAT
CGGTGTTGAGAGGCGGCGGGTGGAGCGAATCGGAGGTGGCGGAGATGGTGGAGGTTTCGGCGGCCGGATTCTTCGACGGCGAGATGGTTCTGTTGGATAATCAGGCGGTC
TTCGACGCTCTGCTTCTGAAAGTGGACCGGTTTTCGGATTCGCTGCGGCGGGCAGGGTGGAGCTCCGAGGAGGTTTCGGAGGCGTTCGGGTTCGATTTCCGGTCGGAGAA
GGAGAGGAAACCGGCTAAGAAGCTATCGGCAGAACTGGTGGAGAGGATCGGGAAACTGGCCGAGTCGGTTTCCCGGTCA
mRNA sequenceShow/hide mRNA sequence
ATGGTCGACGTCGACCGGAGGATGACTGGTCTGAACCCGGCCCATCTCGCCGGCCTCCGCCGTCTCTCCGCCCGAGCCGCCGCCGCCCCCTCCGCTCCCTCCACCCGCAA
CGGCCTCCTCTCCTTCGCTTCCCTCGCCGACAAGGTCCTAACCCATCTCACGAACTCCGGCGTCGAGGTCCAGCCGGGCCTCTCCGACGCCGAGTTCGCCCGAGCCGAGG
CCGAGTTCGGATTCGCCTTTCCCCCAGATCTCCGGGCAGTTCTCTCCGCCGGTCTGCCCGTCGGCCCGGGATTCCCCGATTGGCGCGCCGCCGGTGCCCGCCTCCATCTC
CGGGCATCTCTCGACCTCCCCGTCGCCGCAATTTCCTTCCAAATCGCCAGAAATACGCTCTGGTCCCGCTCCTGGGGCCCGCGACCCGCCGAACCCGAAAAGGCCCTTCG
GGTCGCCCGAAATTCGCTGAAGAGAGCCCCGGTTTTGATCCCCATTTTCAACCATTGCTACATTCCTTGCAACCCGCCGCTGGCCGGAAACCCAATTTTCTTCGTCGACG
AGAGCCGCGTGTTCTGCTGCGGGTTCGATTTGTCGGATTTCTTCGAACGGGAGTCGCTGTTTCGGTGCTCTGTTTCGGATCCCCTGTTTCTGAAACAGAGGTCCGTCAGC
GAGAAGTCCGCCGCATCGTCGACGACTTTCTCCCGGCGGAGCGTGGACGCCGGCGTCGCAAAGACGCCGCGGTGGGTCGAGTTTTGGAGCGACGCCACCGTGGATCGGCG
GCGGAGAAACTCGTCGTCTTCGTCATCGTCGTCGCCGGATCGATTCTTCGAGATGCCGCCGAGGTTCGAAATCCCCAAGTGGGTCGGGGAATACGTTGGAGAATTGGGAT
CGGTGTTGAGAGGCGGCGGGTGGAGCGAATCGGAGGTGGCGGAGATGGTGGAGGTTTCGGCGGCCGGATTCTTCGACGGCGAGATGGTTCTGTTGGATAATCAGGCGGTC
TTCGACGCTCTGCTTCTGAAAGTGGACCGGTTTTCGGATTCGCTGCGGCGGGCAGGGTGGAGCTCCGAGGAGGTTTCGGAGGCGTTCGGGTTCGATTTCCGGTCGGAGAA
GGAGAGGAAACCGGCTAAGAAGCTATCGGCAGAACTGGTGGAGAGGATCGGGAAACTGGCCGAGTCGGTTTCCCGGTCA
Protein sequenceShow/hide protein sequence
MVDVDRRMTGLNPAHLAGLRRLSARAAAAPSAPSTRNGLLSFASLADKVLTHLTNSGVEVQPGLSDAEFARAEAEFGFAFPPDLRAVLSAGLPVGPGFPDWRAAGARLHL
RASLDLPVAAISFQIARNTLWSRSWGPRPAEPEKALRVARNSLKRAPVLIPIFNHCYIPCNPPLAGNPIFFVDESRVFCCGFDLSDFFERESLFRCSVSDPLFLKQRSVS
EKSAASSTTFSRRSVDAGVAKTPRWVEFWSDATVDRRRRNSSSSSSSSPDRFFEMPPRFEIPKWVGEYVGELGSVLRGGGWSESEVAEMVEVSAAGFFDGEMVLLDNQAV
FDALLLKVDRFSDSLRRAGWSSEEVSEAFGFDFRSEKERKPAKKLSAELVERIGKLAESVSRS