; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI06G14990 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI06G14990
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionBeta-galactosidase
Genome locationChr6:13098915..13100078
RNA-Seq ExpressionCSPI06G14990
SyntenyCSPI06G14990
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0025363.1 Beta-galactosidase [Cucumis melo var. makuwa]7.7e-21694.57Show/hide
Query:  MSPPPGFEAQFGQHVCKLQKSIYGLKQSPRAWFDRFTTFVKSQGYRQGHSDHTLFTKVSKTGKIAVLIVYVDDIVLTGDDQAEISQLKQRMGDEFEIKDL
        MSPPPGFEAQFGQ VCKLQKS+YGLKQSPRAWFDRFTTFVKSQGY QGHSDHTLFTK SKTGKIA+LIVYVDDIVLTGDDQ EISQLKQRMGDEFEIKDL
Subjt:  MSPPPGFEAQFGQHVCKLQKSIYGLKQSPRAWFDRFTTFVKSQGYRQGHSDHTLFTKVSKTGKIAVLIVYVDDIVLTGDDQAEISQLKQRMGDEFEIKDL

Query:  GNLKYFLGMEVARSKEGISVSQRKYILDLLTETGMLGCRPTDTPIEFNCKLGNSDDQVPVDKEQYQRLVGKLIYLSHTRPDISFAVSVVSQFMQTPNEEH
        GNLKYFLGMEVARSKEGISVSQRKY LDLLTETGMLGCRP DTPIEFNCKLGNSDDQVPVDKEQYQRLVGKLIYLSHTRPDISFAVSVVSQFMQ P E+H
Subjt:  GNLKYFLGMEVARSKEGISVSQRKYILDLLTETGMLGCRPTDTPIEFNCKLGNSDDQVPVDKEQYQRLVGKLIYLSHTRPDISFAVSVVSQFMQTPNEEH

Query:  MKAVNRILRYLKSTPGKGLMFRKTDRKTIEAYTDSDWAGSVVDRKSTSGYCTFVWGNLVTWRSKKQSVVARSSAEAEYRAMSLGICEEIWLQKVLTDLHQ
        M+AVNRILRYLK+TPGKGLMFRKT+RKTIEAYTDSDWAGSV+DRKSTSGYCTFVWGNLVTWRSKKQSVVARSSAEAEYRAMSLGICEEIWLQKVL+DLHQ
Subjt:  MKAVNRILRYLKSTPGKGLMFRKTDRKTIEAYTDSDWAGSVVDRKSTSGYCTFVWGNLVTWRSKKQSVVARSSAEAEYRAMSLGICEEIWLQKVLTDLHQ

Query:  ECETPLKLFCDNKAAISIANNPVQHDRTKHVEIDRHFIKEKLDSGSICIPYIPSSQQVADVLTKGLLRPNFDFCVSKLGLIDIYVPT
        ECETPLKLFCDNKAAISIANNPVQHDRTKHVEIDRHFIKE+LDSGSICIPYIPSSQQ+ADVLTKGLLRP+FD CVSKLGLIDIY+PT
Subjt:  ECETPLKLFCDNKAAISIANNPVQHDRTKHVEIDRHFIKEKLDSGSICIPYIPSSQQVADVLTKGLLRPNFDFCVSKLGLIDIYVPT

KAA0048203.1 Beta-galactosidase [Cucumis melo var. makuwa]7.7e-21694.57Show/hide
Query:  MSPPPGFEAQFGQHVCKLQKSIYGLKQSPRAWFDRFTTFVKSQGYRQGHSDHTLFTKVSKTGKIAVLIVYVDDIVLTGDDQAEISQLKQRMGDEFEIKDL
        MSPPPGFEAQFGQ VCKLQKS+YGLKQSPRAWFDRFTTFVKSQGY QGHSDHTLFTK SKTGKIA+LIVYVDDIVLTGDDQ EISQLKQRMGDEFEIKDL
Subjt:  MSPPPGFEAQFGQHVCKLQKSIYGLKQSPRAWFDRFTTFVKSQGYRQGHSDHTLFTKVSKTGKIAVLIVYVDDIVLTGDDQAEISQLKQRMGDEFEIKDL

Query:  GNLKYFLGMEVARSKEGISVSQRKYILDLLTETGMLGCRPTDTPIEFNCKLGNSDDQVPVDKEQYQRLVGKLIYLSHTRPDISFAVSVVSQFMQTPNEEH
        GNLKYFLGMEVARSKEGISVSQRKY LDLLTETGMLGCRP DTPIEFNCKLGNSDDQVPVDKEQYQRLVGKLIYLSHTRPDISFAVSVVSQFMQ P E+H
Subjt:  GNLKYFLGMEVARSKEGISVSQRKYILDLLTETGMLGCRPTDTPIEFNCKLGNSDDQVPVDKEQYQRLVGKLIYLSHTRPDISFAVSVVSQFMQTPNEEH

Query:  MKAVNRILRYLKSTPGKGLMFRKTDRKTIEAYTDSDWAGSVVDRKSTSGYCTFVWGNLVTWRSKKQSVVARSSAEAEYRAMSLGICEEIWLQKVLTDLHQ
        M+AVNRILRYLK+TPGKGLMFRKT+RKTIEAYTDSDWAGSV+DRKSTSGYCTFVWGNLVTWRSKKQSVVARSSAEAEYRAMSLGICEEIWLQKVL+DLHQ
Subjt:  MKAVNRILRYLKSTPGKGLMFRKTDRKTIEAYTDSDWAGSVVDRKSTSGYCTFVWGNLVTWRSKKQSVVARSSAEAEYRAMSLGICEEIWLQKVLTDLHQ

Query:  ECETPLKLFCDNKAAISIANNPVQHDRTKHVEIDRHFIKEKLDSGSICIPYIPSSQQVADVLTKGLLRPNFDFCVSKLGLIDIYVPT
        ECETPLKLFCDNKAAISIANNPVQHDRTKHVEIDRHFIKE+LDSGSICIPYIPSSQQ+ADVLTKGLLRP+FD CVSKLGLIDIY+PT
Subjt:  ECETPLKLFCDNKAAISIANNPVQHDRTKHVEIDRHFIKEKLDSGSICIPYIPSSQQVADVLTKGLLRPNFDFCVSKLGLIDIYVPT

TYJ99952.1 Beta-galactosidase [Cucumis melo var. makuwa]7.7e-21694.57Show/hide
Query:  MSPPPGFEAQFGQHVCKLQKSIYGLKQSPRAWFDRFTTFVKSQGYRQGHSDHTLFTKVSKTGKIAVLIVYVDDIVLTGDDQAEISQLKQRMGDEFEIKDL
        MSPPPGFEAQFGQ VCKLQKS+YGLKQSPRAWFDRFTTFVKSQGY QGHSDHTLFTK SKTGKIA+LIVYVDDIVLTGDDQ EISQLKQRMGDEFEIKDL
Subjt:  MSPPPGFEAQFGQHVCKLQKSIYGLKQSPRAWFDRFTTFVKSQGYRQGHSDHTLFTKVSKTGKIAVLIVYVDDIVLTGDDQAEISQLKQRMGDEFEIKDL

Query:  GNLKYFLGMEVARSKEGISVSQRKYILDLLTETGMLGCRPTDTPIEFNCKLGNSDDQVPVDKEQYQRLVGKLIYLSHTRPDISFAVSVVSQFMQTPNEEH
        GNLKYFLGMEVARSKEGISVSQRKY LDLLTETGMLGCRP DTPIEFNCKLGNSDDQVPVDKEQYQRLVGKLIYLSHTRPDISFAVSVVSQFMQ P E+H
Subjt:  GNLKYFLGMEVARSKEGISVSQRKYILDLLTETGMLGCRPTDTPIEFNCKLGNSDDQVPVDKEQYQRLVGKLIYLSHTRPDISFAVSVVSQFMQTPNEEH

Query:  MKAVNRILRYLKSTPGKGLMFRKTDRKTIEAYTDSDWAGSVVDRKSTSGYCTFVWGNLVTWRSKKQSVVARSSAEAEYRAMSLGICEEIWLQKVLTDLHQ
        M+AVNRILRYLK+TPGKGLMFRKT+RKTIEAYTDSDWAGSV+DRKSTSGYCTFVWGNLVTWRSKKQSVVARSSAEAEYRAMSLGICEEIWLQKVL+DLHQ
Subjt:  MKAVNRILRYLKSTPGKGLMFRKTDRKTIEAYTDSDWAGSVVDRKSTSGYCTFVWGNLVTWRSKKQSVVARSSAEAEYRAMSLGICEEIWLQKVLTDLHQ

Query:  ECETPLKLFCDNKAAISIANNPVQHDRTKHVEIDRHFIKEKLDSGSICIPYIPSSQQVADVLTKGLLRPNFDFCVSKLGLIDIYVPT
        ECETPLKLFCDNKAAISIANNPVQHDRTKHVEIDRHFIKE+LDSGSICIPYIPSSQQ+ADVLTKGLLRP+FD CVSKLGLIDIY+PT
Subjt:  ECETPLKLFCDNKAAISIANNPVQHDRTKHVEIDRHFIKEKLDSGSICIPYIPSSQQVADVLTKGLLRPNFDFCVSKLGLIDIYVPT

TYK08054.1 Beta-galactosidase [Cucumis melo var. makuwa]7.7e-21694.57Show/hide
Query:  MSPPPGFEAQFGQHVCKLQKSIYGLKQSPRAWFDRFTTFVKSQGYRQGHSDHTLFTKVSKTGKIAVLIVYVDDIVLTGDDQAEISQLKQRMGDEFEIKDL
        MSPPPGFEAQFGQ VCKLQKS+YGLKQSPRAWFDRFTTFVKSQGY QGHSDHTLFTK SKTGKIA+LIVYVDDIVLTGDDQ EISQLKQRMGDEFEIKDL
Subjt:  MSPPPGFEAQFGQHVCKLQKSIYGLKQSPRAWFDRFTTFVKSQGYRQGHSDHTLFTKVSKTGKIAVLIVYVDDIVLTGDDQAEISQLKQRMGDEFEIKDL

Query:  GNLKYFLGMEVARSKEGISVSQRKYILDLLTETGMLGCRPTDTPIEFNCKLGNSDDQVPVDKEQYQRLVGKLIYLSHTRPDISFAVSVVSQFMQTPNEEH
        GNLKYFLGMEVARSKEGISVSQRKY LDLLTETGMLGCRP DTPIEFNCKLGNSDDQVPVDKEQYQRLVGKLIYLSHTRPDISFAVSVVSQFMQ P E+H
Subjt:  GNLKYFLGMEVARSKEGISVSQRKYILDLLTETGMLGCRPTDTPIEFNCKLGNSDDQVPVDKEQYQRLVGKLIYLSHTRPDISFAVSVVSQFMQTPNEEH

Query:  MKAVNRILRYLKSTPGKGLMFRKTDRKTIEAYTDSDWAGSVVDRKSTSGYCTFVWGNLVTWRSKKQSVVARSSAEAEYRAMSLGICEEIWLQKVLTDLHQ
        M+AVNRILRYLK+TPGKGLMFRKT+RKTIEAYTDSDWAGSV+DRKSTSGYCTFVWGNLVTWRSKKQSVVARSSAEAEYRAMSLGICEEIWLQKVL+DLHQ
Subjt:  MKAVNRILRYLKSTPGKGLMFRKTDRKTIEAYTDSDWAGSVVDRKSTSGYCTFVWGNLVTWRSKKQSVVARSSAEAEYRAMSLGICEEIWLQKVLTDLHQ

Query:  ECETPLKLFCDNKAAISIANNPVQHDRTKHVEIDRHFIKEKLDSGSICIPYIPSSQQVADVLTKGLLRPNFDFCVSKLGLIDIYVPT
        ECETPLKLFCDNKAAISIANNPVQHDRTKHVEIDRHFIKE+LDSGSICIPYIPSSQQ+ADVLTKGLLRP+FD CVSKLGLIDIY+PT
Subjt:  ECETPLKLFCDNKAAISIANNPVQHDRTKHVEIDRHFIKEKLDSGSICIPYIPSSQQVADVLTKGLLRPNFDFCVSKLGLIDIYVPT

XP_031741968.1 uncharacterized protein LOC116403980 [Cucumis sativus]2.2e-22399.22Show/hide
Query:  MSPPPGFEAQFGQHVCKLQKSIYGLKQSPRAWFDRFTTFVKSQGYRQGHSDHTLFTKVSKTGKIAVLIVYVDDIVLTGDDQAEISQLKQRMGDEFEIKDL
        MS  PGFEAQFGQHVC+LQKSIYGLKQSPRAWFDRFTTFVKSQGYRQGHSDHTLFTKVSKTGKIAVLIVYVDDIVLTGDDQAEISQLKQRMGDEFEIKDL
Subjt:  MSPPPGFEAQFGQHVCKLQKSIYGLKQSPRAWFDRFTTFVKSQGYRQGHSDHTLFTKVSKTGKIAVLIVYVDDIVLTGDDQAEISQLKQRMGDEFEIKDL

Query:  GNLKYFLGMEVARSKEGISVSQRKYILDLLTETGMLGCRPTDTPIEFNCKLGNSDDQVPVDKEQYQRLVGKLIYLSHTRPDISFAVSVVSQFMQTPNEEH
        GNLKYFLGMEVARSKEGISVSQRKYILDLLTETGMLGCRPTDTPIEFNCKLGNSDDQVPVDKEQYQRLVGKLIYLSHTRPDISFAVSVVSQFMQTPNEEH
Subjt:  GNLKYFLGMEVARSKEGISVSQRKYILDLLTETGMLGCRPTDTPIEFNCKLGNSDDQVPVDKEQYQRLVGKLIYLSHTRPDISFAVSVVSQFMQTPNEEH

Query:  MKAVNRILRYLKSTPGKGLMFRKTDRKTIEAYTDSDWAGSVVDRKSTSGYCTFVWGNLVTWRSKKQSVVARSSAEAEYRAMSLGICEEIWLQKVLTDLHQ
        MKAVNRILRYLKSTPGKGLMFRKTDRKTIEAYTDSDWAGSVVDRKSTSGYCTFVWGNLVTWRSKKQSVVARSSAEAEYRAMSLGICEEIWLQKVLTDLHQ
Subjt:  MKAVNRILRYLKSTPGKGLMFRKTDRKTIEAYTDSDWAGSVVDRKSTSGYCTFVWGNLVTWRSKKQSVVARSSAEAEYRAMSLGICEEIWLQKVLTDLHQ

Query:  ECETPLKLFCDNKAAISIANNPVQHDRTKHVEIDRHFIKEKLDSGSICIPYIPSSQQVADVLTKGLLRPNFDFCVSKLGLIDIYVPT
        ECETPLKLFCDNKAAISIANNPVQHDRTKHVEIDRHFIKEKLDSGSICIPYIPSSQQVADVLTKGLLRPNFDFCVSKLGLIDIYVPT
Subjt:  ECETPLKLFCDNKAAISIANNPVQHDRTKHVEIDRHFIKEKLDSGSICIPYIPSSQQVADVLTKGLLRPNFDFCVSKLGLIDIYVPT

TrEMBL top hitse value%identityAlignment
A0A5A7SM64 Beta-galactosidase3.7e-21694.57Show/hide
Query:  MSPPPGFEAQFGQHVCKLQKSIYGLKQSPRAWFDRFTTFVKSQGYRQGHSDHTLFTKVSKTGKIAVLIVYVDDIVLTGDDQAEISQLKQRMGDEFEIKDL
        MSPPPGFEAQFGQ VCKLQKS+YGLKQSPRAWFDRFTTFVKSQGY QGHSDHTLFTK SKTGKIA+LIVYVDDIVLTGDDQ EISQLKQRMGDEFEIKDL
Subjt:  MSPPPGFEAQFGQHVCKLQKSIYGLKQSPRAWFDRFTTFVKSQGYRQGHSDHTLFTKVSKTGKIAVLIVYVDDIVLTGDDQAEISQLKQRMGDEFEIKDL

Query:  GNLKYFLGMEVARSKEGISVSQRKYILDLLTETGMLGCRPTDTPIEFNCKLGNSDDQVPVDKEQYQRLVGKLIYLSHTRPDISFAVSVVSQFMQTPNEEH
        GNLKYFLGMEVARSKEGISVSQRKY LDLLTETGMLGCRP DTPIEFNCKLGNSDDQVPVDKEQYQRLVGKLIYLSHTRPDISFAVSVVSQFMQ P E+H
Subjt:  GNLKYFLGMEVARSKEGISVSQRKYILDLLTETGMLGCRPTDTPIEFNCKLGNSDDQVPVDKEQYQRLVGKLIYLSHTRPDISFAVSVVSQFMQTPNEEH

Query:  MKAVNRILRYLKSTPGKGLMFRKTDRKTIEAYTDSDWAGSVVDRKSTSGYCTFVWGNLVTWRSKKQSVVARSSAEAEYRAMSLGICEEIWLQKVLTDLHQ
        M+AVNRILRYLK+TPGKGLMFRKT+RKTIEAYTDSDWAGSV+DRKSTSGYCTFVWGNLVTWRSKKQSVVARSSAEAEYRAMSLGICEEIWLQKVL+DLHQ
Subjt:  MKAVNRILRYLKSTPGKGLMFRKTDRKTIEAYTDSDWAGSVVDRKSTSGYCTFVWGNLVTWRSKKQSVVARSSAEAEYRAMSLGICEEIWLQKVLTDLHQ

Query:  ECETPLKLFCDNKAAISIANNPVQHDRTKHVEIDRHFIKEKLDSGSICIPYIPSSQQVADVLTKGLLRPNFDFCVSKLGLIDIYVPT
        ECETPLKLFCDNKAAISIANNPVQHDRTKHVEIDRHFIKE+LDSGSICIPYIPSSQQ+ADVLTKGLLRP+FD CVSKLGLIDIY+PT
Subjt:  ECETPLKLFCDNKAAISIANNPVQHDRTKHVEIDRHFIKEKLDSGSICIPYIPSSQQVADVLTKGLLRPNFDFCVSKLGLIDIYVPT

A0A5A7UGB2 Beta-galactosidase3.7e-21694.57Show/hide
Query:  MSPPPGFEAQFGQHVCKLQKSIYGLKQSPRAWFDRFTTFVKSQGYRQGHSDHTLFTKVSKTGKIAVLIVYVDDIVLTGDDQAEISQLKQRMGDEFEIKDL
        MSPPPGFEAQFGQ VCKLQKS+YGLKQSPRAWFDRFTTFVKSQGY QGHSDHTLFTK SKTGKIA+LIVYVDDIVLTGDDQ EISQLKQRMGDEFEIKDL
Subjt:  MSPPPGFEAQFGQHVCKLQKSIYGLKQSPRAWFDRFTTFVKSQGYRQGHSDHTLFTKVSKTGKIAVLIVYVDDIVLTGDDQAEISQLKQRMGDEFEIKDL

Query:  GNLKYFLGMEVARSKEGISVSQRKYILDLLTETGMLGCRPTDTPIEFNCKLGNSDDQVPVDKEQYQRLVGKLIYLSHTRPDISFAVSVVSQFMQTPNEEH
        GNLKYFLGMEVARSKEGISVSQRKY LDLLTETGMLGCRP DTPIEFNCKLGNSDDQVPVDKEQYQRLVGKLIYLSHTRPDISFAVSVVSQFMQ P E+H
Subjt:  GNLKYFLGMEVARSKEGISVSQRKYILDLLTETGMLGCRPTDTPIEFNCKLGNSDDQVPVDKEQYQRLVGKLIYLSHTRPDISFAVSVVSQFMQTPNEEH

Query:  MKAVNRILRYLKSTPGKGLMFRKTDRKTIEAYTDSDWAGSVVDRKSTSGYCTFVWGNLVTWRSKKQSVVARSSAEAEYRAMSLGICEEIWLQKVLTDLHQ
        M+AVNRILRYLK+TPGKGLMFRKT+RKTIEAYTDSDWAGSV+DRKSTSGYCTFVWGNLVTWRSKKQSVVARSSAEAEYRAMSLGICEEIWLQKVL+DLHQ
Subjt:  MKAVNRILRYLKSTPGKGLMFRKTDRKTIEAYTDSDWAGSVVDRKSTSGYCTFVWGNLVTWRSKKQSVVARSSAEAEYRAMSLGICEEIWLQKVLTDLHQ

Query:  ECETPLKLFCDNKAAISIANNPVQHDRTKHVEIDRHFIKEKLDSGSICIPYIPSSQQVADVLTKGLLRPNFDFCVSKLGLIDIYVPT
        ECETPLKLFCDNKAAISIANNPVQHDRTKHVEIDRHFIKE+LDSGSICIPYIPSSQQ+ADVLTKGLLRP+FD CVSKLGLIDIY+PT
Subjt:  ECETPLKLFCDNKAAISIANNPVQHDRTKHVEIDRHFIKEKLDSGSICIPYIPSSQQVADVLTKGLLRPNFDFCVSKLGLIDIYVPT

A0A5A7VLQ7 Beta-galactosidase3.7e-21694.57Show/hide
Query:  MSPPPGFEAQFGQHVCKLQKSIYGLKQSPRAWFDRFTTFVKSQGYRQGHSDHTLFTKVSKTGKIAVLIVYVDDIVLTGDDQAEISQLKQRMGDEFEIKDL
        MSPPPGFEAQFGQ VCKLQKS+YGLKQSPRAWFDRFTTFVKSQGY QGHSDHTLFTK SKTGKIA+LIVYVDDIVLTGDDQ EISQLKQRMGDEFEIKDL
Subjt:  MSPPPGFEAQFGQHVCKLQKSIYGLKQSPRAWFDRFTTFVKSQGYRQGHSDHTLFTKVSKTGKIAVLIVYVDDIVLTGDDQAEISQLKQRMGDEFEIKDL

Query:  GNLKYFLGMEVARSKEGISVSQRKYILDLLTETGMLGCRPTDTPIEFNCKLGNSDDQVPVDKEQYQRLVGKLIYLSHTRPDISFAVSVVSQFMQTPNEEH
        GNLKYFLGMEVARSKEGISVSQRKY LDLLTETGMLGCRP DTPIEFNCKLGNSDDQVPVDKEQYQRLVGKLIYLSHTRPDISFAVSVVSQFMQ P E+H
Subjt:  GNLKYFLGMEVARSKEGISVSQRKYILDLLTETGMLGCRPTDTPIEFNCKLGNSDDQVPVDKEQYQRLVGKLIYLSHTRPDISFAVSVVSQFMQTPNEEH

Query:  MKAVNRILRYLKSTPGKGLMFRKTDRKTIEAYTDSDWAGSVVDRKSTSGYCTFVWGNLVTWRSKKQSVVARSSAEAEYRAMSLGICEEIWLQKVLTDLHQ
        M+AVNRILRYLK+TPGKGLMFRKT+RKTIEAYTDSDWAGSV+DRKSTSGYCTFVWGNLVTWRSKKQSVVARSSAEAEYRAMSLGICEEIWLQKVL+DLHQ
Subjt:  MKAVNRILRYLKSTPGKGLMFRKTDRKTIEAYTDSDWAGSVVDRKSTSGYCTFVWGNLVTWRSKKQSVVARSSAEAEYRAMSLGICEEIWLQKVLTDLHQ

Query:  ECETPLKLFCDNKAAISIANNPVQHDRTKHVEIDRHFIKEKLDSGSICIPYIPSSQQVADVLTKGLLRPNFDFCVSKLGLIDIYVPT
        ECETPLKLFCDNKAAISIANNPVQHDRTKHVEIDRHFIKE+LDSGSICIPYIPSSQQ+ADVLTKGLLRP+FD CVSKLGLIDIY+PT
Subjt:  ECETPLKLFCDNKAAISIANNPVQHDRTKHVEIDRHFIKEKLDSGSICIPYIPSSQQVADVLTKGLLRPNFDFCVSKLGLIDIYVPT

A0A5D3BJK7 Beta-galactosidase3.7e-21694.57Show/hide
Query:  MSPPPGFEAQFGQHVCKLQKSIYGLKQSPRAWFDRFTTFVKSQGYRQGHSDHTLFTKVSKTGKIAVLIVYVDDIVLTGDDQAEISQLKQRMGDEFEIKDL
        MSPPPGFEAQFGQ VCKLQKS+YGLKQSPRAWFDRFTTFVKSQGY QGHSDHTLFTK SKTGKIA+LIVYVDDIVLTGDDQ EISQLKQRMGDEFEIKDL
Subjt:  MSPPPGFEAQFGQHVCKLQKSIYGLKQSPRAWFDRFTTFVKSQGYRQGHSDHTLFTKVSKTGKIAVLIVYVDDIVLTGDDQAEISQLKQRMGDEFEIKDL

Query:  GNLKYFLGMEVARSKEGISVSQRKYILDLLTETGMLGCRPTDTPIEFNCKLGNSDDQVPVDKEQYQRLVGKLIYLSHTRPDISFAVSVVSQFMQTPNEEH
        GNLKYFLGMEVARSKEGISVSQRKY LDLLTETGMLGCRP DTPIEFNCKLGNSDDQVPVDKEQYQRLVGKLIYLSHTRPDISFAVSVVSQFMQ P E+H
Subjt:  GNLKYFLGMEVARSKEGISVSQRKYILDLLTETGMLGCRPTDTPIEFNCKLGNSDDQVPVDKEQYQRLVGKLIYLSHTRPDISFAVSVVSQFMQTPNEEH

Query:  MKAVNRILRYLKSTPGKGLMFRKTDRKTIEAYTDSDWAGSVVDRKSTSGYCTFVWGNLVTWRSKKQSVVARSSAEAEYRAMSLGICEEIWLQKVLTDLHQ
        M+AVNRILRYLK+TPGKGLMFRKT+RKTIEAYTDSDWAGSV+DRKSTSGYCTFVWGNLVTWRSKKQSVVARSSAEAEYRAMSLGICEEIWLQKVL+DLHQ
Subjt:  MKAVNRILRYLKSTPGKGLMFRKTDRKTIEAYTDSDWAGSVVDRKSTSGYCTFVWGNLVTWRSKKQSVVARSSAEAEYRAMSLGICEEIWLQKVLTDLHQ

Query:  ECETPLKLFCDNKAAISIANNPVQHDRTKHVEIDRHFIKEKLDSGSICIPYIPSSQQVADVLTKGLLRPNFDFCVSKLGLIDIYVPT
        ECETPLKLFCDNKAAISIANNPVQHDRTKHVEIDRHFIKE+LDSGSICIPYIPSSQQ+ADVLTKGLLRP+FD CVSKLGLIDIY+PT
Subjt:  ECETPLKLFCDNKAAISIANNPVQHDRTKHVEIDRHFIKEKLDSGSICIPYIPSSQQVADVLTKGLLRPNFDFCVSKLGLIDIYVPT

A0A5D3C4T4 Beta-galactosidase3.7e-21694.57Show/hide
Query:  MSPPPGFEAQFGQHVCKLQKSIYGLKQSPRAWFDRFTTFVKSQGYRQGHSDHTLFTKVSKTGKIAVLIVYVDDIVLTGDDQAEISQLKQRMGDEFEIKDL
        MSPPPGFEAQFGQ VCKLQKS+YGLKQSPRAWFDRFTTFVKSQGY QGHSDHTLFTK SKTGKIA+LIVYVDDIVLTGDDQ EISQLKQRMGDEFEIKDL
Subjt:  MSPPPGFEAQFGQHVCKLQKSIYGLKQSPRAWFDRFTTFVKSQGYRQGHSDHTLFTKVSKTGKIAVLIVYVDDIVLTGDDQAEISQLKQRMGDEFEIKDL

Query:  GNLKYFLGMEVARSKEGISVSQRKYILDLLTETGMLGCRPTDTPIEFNCKLGNSDDQVPVDKEQYQRLVGKLIYLSHTRPDISFAVSVVSQFMQTPNEEH
        GNLKYFLGMEVARSKEGISVSQRKY LDLLTETGMLGCRP DTPIEFNCKLGNSDDQVPVDKEQYQRLVGKLIYLSHTRPDISFAVSVVSQFMQ P E+H
Subjt:  GNLKYFLGMEVARSKEGISVSQRKYILDLLTETGMLGCRPTDTPIEFNCKLGNSDDQVPVDKEQYQRLVGKLIYLSHTRPDISFAVSVVSQFMQTPNEEH

Query:  MKAVNRILRYLKSTPGKGLMFRKTDRKTIEAYTDSDWAGSVVDRKSTSGYCTFVWGNLVTWRSKKQSVVARSSAEAEYRAMSLGICEEIWLQKVLTDLHQ
        M+AVNRILRYLK+TPGKGLMFRKT+RKTIEAYTDSDWAGSV+DRKSTSGYCTFVWGNLVTWRSKKQSVVARSSAEAEYRAMSLGICEEIWLQKVL+DLHQ
Subjt:  MKAVNRILRYLKSTPGKGLMFRKTDRKTIEAYTDSDWAGSVVDRKSTSGYCTFVWGNLVTWRSKKQSVVARSSAEAEYRAMSLGICEEIWLQKVLTDLHQ

Query:  ECETPLKLFCDNKAAISIANNPVQHDRTKHVEIDRHFIKEKLDSGSICIPYIPSSQQVADVLTKGLLRPNFDFCVSKLGLIDIYVPT
        ECETPLKLFCDNKAAISIANNPVQHDRTKHVEIDRHFIKE+LDSGSICIPYIPSSQQ+ADVLTKGLLRP+FD CVSKLGLIDIY+PT
Subjt:  ECETPLKLFCDNKAAISIANNPVQHDRTKHVEIDRHFIKEKLDSGSICIPYIPSSQQVADVLTKGLLRPNFDFCVSKLGLIDIYVPT

SwissProt top hitse value%identityAlignment
P04146 Copia protein1.1e-6836.67Show/hide
Query:  MSPPPGFEAQFGQHVCKLQKSIYGLKQSPRAWFDRFTTFVKSQGYRQGHSDHTLFTKVSKTGKI---AVLIVYVDDIVLTGDDQAEISQLKQRMGDEFEI
        M  P G       +VCKL K+IYGLKQ+ R WF+ F   +K   +     D  ++  +   G I     +++YVDD+V+   D   ++  K+ + ++F +
Subjt:  MSPPPGFEAQFGQHVCKLQKSIYGLKQSPRAWFDRFTTFVKSQGYRQGHSDHTLFTKVSKTGKI---AVLIVYVDDIVLTGDDQAEISQLKQRMGDEFEI

Query:  KDLGNLKYFLGMEVARSKEGISVSQRKYILDLLTETGMLGCRPTDTPI--EFNCKLGNSDDQVPVDKEQYQRLVGKLIYLS-HTRPDISFAVSVVSQFMQ
         DL  +K+F+G+ +   ++ I +SQ  Y+  +L++  M  C    TP+  + N +L NSD+         + L+G L+Y+   TRPD++ AV+++S++  
Subjt:  KDLGNLKYFLGMEVARSKEGISVSQRKYILDLLTETGMLGCRPTDTPI--EFNCKLGNSDDQVPVDKEQYQRLVGKLIYLS-HTRPDISFAVSVVSQFMQ

Query:  TPNEEHMKAVNRILRYLKSTPGKGLMFRK--TDRKTIEAYTDSDWAGSVVDRKSTSGYCTFVWG-NLVTWRSKKQSVVARSSAEAEYRAMSLGICEEIWL
          N E  + + R+LRYLK T    L+F+K       I  Y DSDWAGS +DRKST+GY   ++  NL+ W +K+Q+ VA SS EAEY A+   + E +WL
Subjt:  TPNEEHMKAVNRILRYLKSTPGKGLMFRK--TDRKTIEAYTDSDWAGSVVDRKSTSGYCTFVWG-NLVTWRSKKQSVVARSSAEAEYRAMSLGICEEIWL

Query:  QKVLTDLHQECETPLKLFCDNKAAISIANNPVQHDRTKHVEIDRHFIKEKLDSGSICIPYIPSSQQVADVLTKGLLRPNFDFCVSKLGLI
        + +LT ++ + E P+K++ DN+  ISIANNP  H R KH++I  HF +E++ +  IC+ YIP+  Q+AD+ TK L    F     KLGL+
Subjt:  QKVLTDLHQECETPLKLFCDNKAAISIANNPVQHDRTKHVEIDRHFIKEKLDSGSICIPYIPSSQQVADVLTKGLLRPNFDFCVSKLGLI

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-941.9e-7138.01Show/hide
Query:  MSPPPGFEAQFGQH-VCKLQKSIYGLKQSPRAWFDRFTTFVKSQGYRQGHSDHTLFTKVSKTGKIAVLIVYVDDIVLTGDDQAEISQLKQRMGDEFEIKD
        M  P GFE    +H VCKL KS+YGLKQ+PR W+ +F +F+KSQ Y + +SD  ++ K        +L++YVDD+++ G D+  I++LK  +   F++KD
Subjt:  MSPPPGFEAQFGQH-VCKLQKSIYGLKQSPRAWFDRFTTFVKSQGYRQGHSDHTLFTKVSKTGKIAVLIVYVDDIVLTGDDQAEISQLKQRMGDEFEIKD

Query:  LGNLKYFLGMEVARSKEG--ISVSQRKYILDLLTETGMLGCRPTDTPIEFNCKLGNS------DDQVPVDKEQYQRLVGKLIY-LSHTRPDISFAVSVVS
        LG  +  LGM++ R +    + +SQ KYI  +L    M   +P  TP+  + KL         +++  + K  Y   VG L+Y +  TRPDI+ AV VVS
Subjt:  LGNLKYFLGMEVARSKEG--ISVSQRKYILDLLTETGMLGCRPTDTPIEFNCKLGNS------DDQVPVDKEQYQRLVGKLIY-LSHTRPDISFAVSVVS

Query:  QFMQTPNEEHMKAVNRILRYLKSTPGKGLMFRKTDRKTIEAYTDSDWAGSVVDRKSTSGYCTFVWGNLVTWRSKKQSVVARSSAEAEYRAMSLGICEEIW
        +F++ P +EH +AV  ILRYL+ T G  L F  +D   ++ YTD+D AG + +RKS++GY     G  ++W+SK Q  VA S+ EAEY A +    E IW
Subjt:  QFMQTPNEEHMKAVNRILRYLKSTPGKGLMFRKTDRKTIEAYTDSDWAGSVVDRKSTSGYCTFVWGNLVTWRSKKQSVVARSSAEAEYRAMSLGICEEIW

Query:  LQKVLTD--LHQECETPLKLFCDNKAAISIANNPVQHDRTKHVEIDRHFIKEKLDSGSICIPYIPSSQQVADVLTKGLLRPNFDFCVSKLGL
        L++ L +  LHQ+      ++CD+++AI ++ N + H RTKH+++  H+I+E +D  S+ +  I +++  AD+LTK + R  F+ C   +G+
Subjt:  LQKVLTD--LHQECETPLKLFCDNKAAISIANNPVQHDRTKHVEIDRHFIKEKLDSGSICIPYIPSSQQVADVLTKGLLRPNFDFCVSKLGL

P92519 Uncharacterized mitochondrial protein AtMg008103.5e-4640.62Show/hide
Query:  LIVYVDDIVLTGDDQAEISQLKQRMGDEFEIKDLGNLKYFLGMEVARSKEGISVSQRKYILDLLTETGMLGCRPTDTPIEFNCKLGNSDDQVPVDKEQYQ
        L++YVDDI+LTG     ++ L  ++   F +KDLG + YFLG+++     G+ +SQ KY   +L   GML C+P  TP+        S  + P D   ++
Subjt:  LIVYVDDIVLTGDDQAEISQLKQRMGDEFEIKDLGNLKYFLGMEVARSKEGISVSQRKYILDLLTETGMLGCRPTDTPIEFNCKLGNSDDQVPVDKEQYQ

Query:  RLVGKLIYLSHTRPDISFAVSVVSQFMQTPNEEHMKAVNRILRYLKSTPGKGLMFRKTDRKTIEAYTDSDWAGSVVDRKSTSGYCTFVWGNLVTWRSKKQ
         +VG L YL+ TRPDIS+AV++V Q M  P       + R+LRY+K T   GL   K  +  ++A+ DSDWAG    R+ST+G+CTF+  N+++W +K+Q
Subjt:  RLVGKLIYLSHTRPDISFAVSVVSQFMQTPNEEHMKAVNRILRYLKSTPGKGLMFRKTDRKTIEAYTDSDWAGSVVDRKSTSGYCTFVWGNLVTWRSKKQ

Query:  SVVARSSAEAEYRAMSLGICEEIW
          V+RSS E EYRA++L   E  W
Subjt:  SVVARSSAEAEYRAMSLGICEEIW

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE11.9e-8442.15Show/hide
Query:  MSPPPGF-EAQFGQHVCKLQKSIYGLKQSPRAWFDRFTTFVKSQGYRQGHSDHTLFTKVSKTGK-IAVLIVYVDDIVLTGDDQAEISQLKQRMGDEFEIK
        MS PPGF +     +VCKL+K++YGLKQ+PRAW+     ++ + G+    SD +LF  V + GK I  ++VYVDDI++TG+D   +      +   F +K
Subjt:  MSPPPGF-EAQFGQHVCKLQKSIYGLKQSPRAWFDRFTTFVKSQGYRQGHSDHTLFTKVSKTGK-IAVLIVYVDDIVLTGDDQAEISQLKQRMGDEFEIK

Query:  DLGNLKYFLGMEVARSKEGISVSQRKYILDLLTETGMLGCRPTDTPIEFNCKLGNSDDQVPVDKEQYQRLVGKLIYLSHTRPDISFAVSVVSQFMQTPNE
        D   L YFLG+E  R   G+ +SQR+YILDLL  T M+  +P  TP+  + KL         D  +Y+ +VG L YL+ TRPDIS+AV+ +SQFM  P E
Subjt:  DLGNLKYFLGMEVARSKEGISVSQRKYILDLLTETGMLGCRPTDTPIEFNCKLGNSDDQVPVDKEQYQRLVGKLIYLSHTRPDISFAVSVVSQFMQTPNE

Query:  EHMKAVNRILRYLKSTPGKGLMFRKTDRKTIEAYTDSDWAGSVVDRKSTSGYCTFVWGNLVTWRSKKQSVVARSSAEAEYRAMSLGICEEIWLQKVLTDL
        EH++A+ RILRYL  TP  G+  +K +  ++ AY+D+DWAG   D  ST+GY  ++  + ++W SKKQ  V RSS EAEYR+++    E  W+  +LT+L
Subjt:  EHMKAVNRILRYLKSTPGKGLMFRKTDRKTIEAYTDSDWAGSVVDRKSTSGYCTFVWGNLVTWRSKKQSVVARSSAEAEYRAMSLGICEEIWLQKVLTDL

Query:  HQECETPLKLFCDNKAAISIANNPVQHDRTKHVEIDRHFIKEKLDSGSICIPYIPSSQQVADVLTKGLLRPNFDFCVSKLGL
              P  ++CDN  A  +  NPV H R KH+ ID HFI+ ++ SG++ + ++ +  Q+AD LTK L R  F    SK+G+
Subjt:  HQECETPLKLFCDNKAAISIANNPVQHDRTKHVEIDRHFIKEKLDSGSICIPYIPSSQQVADVLTKGLLRPNFDFCVSKLGL

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE23.1e-8240Show/hide
Query:  MSPPPGF-EAQFGQHVCKLQKSIYGLKQSPRAWFDRFTTFVKSQGYRQGHSDHTLFTKVSKTGKIAVLIVYVDDIVLTGDDQAEISQLKQRMGDEFEIKD
        MS PPGF +     +VC+L+K+IYGLKQ+PRAW+    T++ + G+    SD +LF  + +   I  ++VYVDDI++TG+D   +      +   F +K+
Subjt:  MSPPPGF-EAQFGQHVCKLQKSIYGLKQSPRAWFDRFTTFVKSQGYRQGHSDHTLFTKVSKTGKIAVLIVYVDDIVLTGDDQAEISQLKQRMGDEFEIKD

Query:  LGNLKYFLGMEVARSKEGISVSQRKYILDLLTETGMLGCRPTDTPIEFNCKLG-NSDDQVPVDKEQYQRLVGKLIYLSHTRPDISFAVSVVSQFMQTPNE
          +L YFLG+E  R  +G+ +SQR+Y LDLL  T ML  +P  TP+  + KL  +S  ++P D  +Y+ +VG L YL+ TRPD+S+AV+ +SQ+M  P +
Subjt:  LGNLKYFLGMEVARSKEGISVSQRKYILDLLTETGMLGCRPTDTPIEFNCKLG-NSDDQVPVDKEQYQRLVGKLIYLSHTRPDISFAVSVVSQFMQTPNE

Query:  EHMKAVNRILRYLKSTPGKGLMFRKTDRKTIEAYTDSDWAGSVVDRKSTSGYCTFVWGNLVTWRSKKQSVVARSSAEAEYRAMSLGICEEIWLQKVLTDL
        +H  A+ R+LRYL  TP  G+  +K +  ++ AY+D+DWAG   D  ST+GY  ++  + ++W SKKQ  V RSS EAEYR+++    E  W+  +LT+L
Subjt:  EHMKAVNRILRYLKSTPGKGLMFRKTDRKTIEAYTDSDWAGSVVDRKSTSGYCTFVWGNLVTWRSKKQSVVARSSAEAEYRAMSLGICEEIWLQKVLTDL

Query:  HQECETPLKLFCDNKAAISIANNPVQHDRTKHVEIDRHFIKEKLDSGSICIPYIPSSQQVADVLTKGLLRPNFDFCVSKLGLIDI
          +   P  ++CDN  A  +  NPV H R KH+ +D HFI+ ++ SG++ + ++ +  Q+AD LTK L R  F     K+G+I +
Subjt:  HQECETPLKLFCDNKAAISIANNPVQHDRTKHVEIDRHFIKEKLDSGSICIPYIPSSQQVADVLTKGLLRPNFDFCVSKLGLIDI

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 81.2e-8645.66Show/hide
Query:  MSPPPGFEAQFGQH-----VCKLQKSIYGLKQSPRAWFDRFTTFVKSQGYRQGHSDHTLFTKVSKTGKIAVLIVYVDDIVLTGDDQAEISQLKQRMGDEF
        M  PPG+ A+ G       VC L+KSIYGLKQ+ R WF +F+  +   G+ Q HSDHT F K++ T  + VL VYVDDI++  ++ A + +LK ++   F
Subjt:  MSPPPGFEAQFGQH-----VCKLQKSIYGLKQSPRAWFDRFTTFVKSQGYRQGHSDHTLFTKVSKTGKIAVLIVYVDDIVLTGDDQAEISQLKQRMGDEF

Query:  EIKDLGNLKYFLGMEVARSKEGISVSQRKYILDLLTETGMLGCRPTDTPIEFNCKLGNSDDQVPVDKEQYQRLVGKLIYLSHTRPDISFAVSVVSQFMQT
        +++DLG LKYFLG+E+ARS  GI++ QRKY LDLL ETG+LGC+P+  P++ +           VD + Y+RL+G+L+YL  TR DISFAV+ +SQF + 
Subjt:  EIKDLGNLKYFLGMEVARSKEGISVSQRKYILDLLTETGMLGCRPTDTPIEFNCKLGNSDDQVPVDKEQYQRLVGKLIYLSHTRPDISFAVSVVSQFMQT

Query:  PNEEHMKAVNRILRYLKSTPGKGLMFRKTDRKTIEAYTDSDWAGSVVDRKSTSGYCTFVWGNLVTWRSKKQSVVARSSAEAEYRAMSLGICEEIWLQKVL
        P   H +AV +IL Y+K T G+GL +       ++ ++D+ +      R+ST+GYC F+  +L++W+SKKQ VV++SSAEAEYRA+S    E +WL +  
Subjt:  PNEEHMKAVNRILRYLKSTPGKGLMFRKTDRKTIEAYTDSDWAGSVVDRKSTSGYCTFVWGNLVTWRSKKQSVVARSSAEAEYRAMSLGICEEIWLQKVL

Query:  TDLHQECETPLKLFCDNKAAISIANNPVQHDRTKHVEIDRHFIKEK
         +L      P  LFCDN AAI IA N V H+RTKH+E D H ++E+
Subjt:  TDLHQECETPLKLFCDNKAAISIANNPVQHDRTKHVEIDRHFIKEK

ATMG00240.1 Gag-Pol-related retrotransposon family protein1.1e-1341.46Show/hide
Query:  IYLSHTRPDISFAVSVVSQFMQTPNEEHMKAVNRILRYLKSTPGKGLMFRKTDRKTIEAYTDSDWAGSVVDRKSTSGYCTFV
        +YL+ TRPD++FAV+ +SQF        M+AV ++L Y+K T G+GL +  T    ++A+ DSDWA     R+S +G+C+ V
Subjt:  IYLSHTRPDISFAVSVVSQFMQTPNEEHMKAVNRILRYLKSTPGKGLMFRKTDRKTIEAYTDSDWAGSVVDRKSTSGYCTFV

ATMG00810.1 DNA/RNA polymerases superfamily protein2.5e-4740.62Show/hide
Query:  LIVYVDDIVLTGDDQAEISQLKQRMGDEFEIKDLGNLKYFLGMEVARSKEGISVSQRKYILDLLTETGMLGCRPTDTPIEFNCKLGNSDDQVPVDKEQYQ
        L++YVDDI+LTG     ++ L  ++   F +KDLG + YFLG+++     G+ +SQ KY   +L   GML C+P  TP+        S  + P D   ++
Subjt:  LIVYVDDIVLTGDDQAEISQLKQRMGDEFEIKDLGNLKYFLGMEVARSKEGISVSQRKYILDLLTETGMLGCRPTDTPIEFNCKLGNSDDQVPVDKEQYQ

Query:  RLVGKLIYLSHTRPDISFAVSVVSQFMQTPNEEHMKAVNRILRYLKSTPGKGLMFRKTDRKTIEAYTDSDWAGSVVDRKSTSGYCTFVWGNLVTWRSKKQ
         +VG L YL+ TRPDIS+AV++V Q M  P       + R+LRY+K T   GL   K  +  ++A+ DSDWAG    R+ST+G+CTF+  N+++W +K+Q
Subjt:  RLVGKLIYLSHTRPDISFAVSVVSQFMQTPNEEHMKAVNRILRYLKSTPGKGLMFRKTDRKTIEAYTDSDWAGSVVDRKSTSGYCTFVWGNLVTWRSKKQ

Query:  SVVARSSAEAEYRAMSLGICEEIW
          V+RSS E EYRA++L   E  W
Subjt:  SVVARSSAEAEYRAMSLGICEEIW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGCCCTCCGCCTGGATTTGAAGCCCAGTTTGGTCAGCATGTGTGTAAACTCCAGAAATCTATATATGGTCTGAAACAGTCTCCCAGAGCATGGTTTGACAGATTCAC
TACCTTTGTCAAGTCCCAAGGGTACAGGCAGGGACACTCTGATCATACTTTATTTACAAAGGTTTCCAAAACAGGAAAGATTGCTGTTCTAATAGTTTATGTGGATGACA
TTGTTTTGACTGGAGATGATCAGGCAGAAATCAGTCAACTAAAGCAGAGAATGGGCGATGAGTTTGAAATCAAGGATTTGGGAAATTTGAAATATTTCCTTGGAATGGAG
GTGGCCAGATCTAAAGAAGGTATCTCCGTATCTCAAAGAAAATACATCCTTGATTTGTTAACCGAGACAGGTATGTTAGGATGTCGTCCCACTGACACTCCTATTGAATT
CAACTGCAAACTAGGAAACTCTGATGATCAAGTTCCAGTTGATAAAGAACAGTATCAACGCCTCGTGGGTAAATTAATTTACTTATCTCATACTCGTCCTGATATTTCCT
TTGCTGTGAGTGTTGTCAGCCAGTTTATGCAGACCCCTAATGAGGAACACATGAAAGCTGTCAACAGAATCTTGAGATACTTAAAATCAACACCTGGTAAAGGGCTGATG
TTTAGAAAAACAGACAGAAAGACCATTGAGGCATACACTGACTCGGATTGGGCAGGATCTGTTGTTGACAGAAAATCTACCTCTGGTTATTGTACCTTTGTTTGGGGCAA
TCTTGTAACTTGGAGGAGTAAGAAGCAAAGTGTTGTGGCCAGGAGCAGCGCTGAGGCTGAATATAGAGCTATGAGTTTAGGAATATGTGAGGAAATTTGGCTTCAGAAAG
TTTTGACAGATCTTCATCAGGAATGTGAGACACCATTGAAGCTTTTCTGTGATAATAAAGCCGCTATTAGTATTGCTAACAACCCTGTTCAACATGATAGAACTAAACAT
GTTGAGATTGATCGACATTTTATCAAAGAAAAACTTGACAGTGGGAGCATATGCATTCCGTACATCCCTTCGAGTCAACAGGTTGCTGATGTTCTTACCAAAGGGCTTCT
CAGACCAAACTTCGACTTCTGCGTTAGCAAGTTGGGCCTCATTGATATTTACGTCCCAACTTGA
mRNA sequenceShow/hide mRNA sequence
ATGAGCCCTCCGCCTGGATTTGAAGCCCAGTTTGGTCAGCATGTGTGTAAACTCCAGAAATCTATATATGGTCTGAAACAGTCTCCCAGAGCATGGTTTGACAGATTCAC
TACCTTTGTCAAGTCCCAAGGGTACAGGCAGGGACACTCTGATCATACTTTATTTACAAAGGTTTCCAAAACAGGAAAGATTGCTGTTCTAATAGTTTATGTGGATGACA
TTGTTTTGACTGGAGATGATCAGGCAGAAATCAGTCAACTAAAGCAGAGAATGGGCGATGAGTTTGAAATCAAGGATTTGGGAAATTTGAAATATTTCCTTGGAATGGAG
GTGGCCAGATCTAAAGAAGGTATCTCCGTATCTCAAAGAAAATACATCCTTGATTTGTTAACCGAGACAGGTATGTTAGGATGTCGTCCCACTGACACTCCTATTGAATT
CAACTGCAAACTAGGAAACTCTGATGATCAAGTTCCAGTTGATAAAGAACAGTATCAACGCCTCGTGGGTAAATTAATTTACTTATCTCATACTCGTCCTGATATTTCCT
TTGCTGTGAGTGTTGTCAGCCAGTTTATGCAGACCCCTAATGAGGAACACATGAAAGCTGTCAACAGAATCTTGAGATACTTAAAATCAACACCTGGTAAAGGGCTGATG
TTTAGAAAAACAGACAGAAAGACCATTGAGGCATACACTGACTCGGATTGGGCAGGATCTGTTGTTGACAGAAAATCTACCTCTGGTTATTGTACCTTTGTTTGGGGCAA
TCTTGTAACTTGGAGGAGTAAGAAGCAAAGTGTTGTGGCCAGGAGCAGCGCTGAGGCTGAATATAGAGCTATGAGTTTAGGAATATGTGAGGAAATTTGGCTTCAGAAAG
TTTTGACAGATCTTCATCAGGAATGTGAGACACCATTGAAGCTTTTCTGTGATAATAAAGCCGCTATTAGTATTGCTAACAACCCTGTTCAACATGATAGAACTAAACAT
GTTGAGATTGATCGACATTTTATCAAAGAAAAACTTGACAGTGGGAGCATATGCATTCCGTACATCCCTTCGAGTCAACAGGTTGCTGATGTTCTTACCAAAGGGCTTCT
CAGACCAAACTTCGACTTCTGCGTTAGCAAGTTGGGCCTCATTGATATTTACGTCCCAACTTGA
Protein sequenceShow/hide protein sequence
MSPPPGFEAQFGQHVCKLQKSIYGLKQSPRAWFDRFTTFVKSQGYRQGHSDHTLFTKVSKTGKIAVLIVYVDDIVLTGDDQAEISQLKQRMGDEFEIKDLGNLKYFLGME
VARSKEGISVSQRKYILDLLTETGMLGCRPTDTPIEFNCKLGNSDDQVPVDKEQYQRLVGKLIYLSHTRPDISFAVSVVSQFMQTPNEEHMKAVNRILRYLKSTPGKGLM
FRKTDRKTIEAYTDSDWAGSVVDRKSTSGYCTFVWGNLVTWRSKKQSVVARSSAEAEYRAMSLGICEEIWLQKVLTDLHQECETPLKLFCDNKAAISIANNPVQHDRTKH
VEIDRHFIKEKLDSGSICIPYIPSSQQVADVLTKGLLRPNFDFCVSKLGLIDIYVPT