; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc01g0010501 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc01g0010501
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionGag/pol protein
Genome locationCMiso1.1chr01:5516653..5518349
RNA-Seq ExpressionCmc01g0010501
SyntenyCmc01g0010501
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008234 - cysteine-type peptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR012337 - Ribonuclease H-like superfamily
IPR025724 - GAG-pre-integrase domain
IPR036397 - Ribonuclease H superfamily
IPR036875 - Zinc finger, CCHC-type superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0031826.1 gag/pol protein [Cucumis melo var. makuwa]3.0e-26689.19Show/hide
Query:  LTSVTLNMLAADKLNGNNYASWKNIINTVLIIDDLRFVLVEECPQVPAANATRTIREPYERWAKANEKARAYILASLSEVLAKKHESMLTAREIMDSLQE
        +TS TLNMLAADKLNGNNYASWKN INTVLIIDDLRFVLVEECPQVPAANATRT+REPYERWAKANEKARAYILASLSEVLAKKHESMLTAREIMDSLQE
Subjt:  LTSVTLNMLAADKLNGNNYASWKNIINTVLIIDDLRFVLVEECPQVPAANATRTIREPYERWAKANEKARAYILASLSEVLAKKHESMLTAREIMDSLQE

Query:  MFGQASYQIKHDPLKYIYNARMNEGASVREHVLNMMVHFNVAEMNGAVIDEASQVSFILESLLESFLQFRSNAVMNKIAYTLTTLLNELQTFESLMKIKG
        MFGQASYQIKHD LKYIYNARMNEGASVREHVLNMMVHFNVAEMNGAVIDEASQVSFILESL ESFLQFRSNAVMNKIAYTLTTLLNELQTFESLMKIKG
Subjt:  MFGQASYQIKHDPLKYIYNARMNEGASVREHVLNMMVHFNVAEMNGAVIDEASQVSFILESLLESFLQFRSNAVMNKIAYTLTTLLNELQTFESLMKIKG

Query:  QKGEANVATSTRKFHRGLTSGTKSMPSSSGNKKWKKKKGGQGNKANLAAAKTTKKAKAAKGICFHCNQEGHWKRNCPKYLAEKKKAKQGKYDLLVLETCL
        QKGEANVATSTRKFHRG TSGTKSMPSSSGNKKWKKKKGGQGNKANLAAAKTTKKAKAAKGICFHCNQEGHWKRNCPKYLAEKKKAKQGKYDLLVLETCL
Subjt:  QKGEANVATSTRKFHRGLTSGTKSMPSSSGNKKWKKKKGGQGNKANLAAAKTTKKAKAAKGICFHCNQEGHWKRNCPKYLAEKKKAKQGKYDLLVLETCL

Query:  VENDDSAWIIDS----------DDDASW------NWACRLSN-------CSGSASTLFTEIFSLLENVYVVPNLKRNLISVKCLLEQSYSLTFNVNKVFI
        VENDDSAWIIDS             +SW          R+           G       + F LLENVYVVP+LKRNLISVKCLLEQSYSLTFNVNKVFI
Subjt:  VENDDSAWIIDS----------DDDASW------NWACRLSN-------CSGSASTLFTEIFSLLENVYVVPNLKRNLISVKCLLEQSYSLTFNVNKVFI

Query:  YKNGVEICSAKLENNLYVLRTLTSKALLNTEMFKTAITQNKRLKISPKENAHLWHLRLWHINLNRIERLVKNGLLSELEENSLPVCESCLEGKMTKIPFT
        YKNGVEICSAKLENNLYVLR+LTSKALLNTEMFKTAITQNKRLKISPKENAHLWHLRL HINLNRIERLVKNGLLSELEENSLPVCESCLEGKMTK PFT
Subjt:  YKNGVEICSAKLENNLYVLRTLTSKALLNTEMFKTAITQNKRLKISPKENAHLWHLRLWHINLNRIERLVKNGLLSELEENSLPVCESCLEGKMTKIPFT

Query:  GKGRRVKEPLELVHSDLCGPMNVKARGGFEYFITFTDDYSRYGYVY
        GKG R KEPLELVHSDLCGPMNVKARGGFEYFITFTDDYSRYGYVY
Subjt:  GKGRRVKEPLELVHSDLCGPMNVKARGGFEYFITFTDDYSRYGYVY

KAA0047792.1 gag/pol protein [Cucumis melo var. makuwa]3.0e-26689.19Show/hide
Query:  LTSVTLNMLAADKLNGNNYASWKNIINTVLIIDDLRFVLVEECPQVPAANATRTIREPYERWAKANEKARAYILASLSEVLAKKHESMLTAREIMDSLQE
        +TS TLNMLAADKLNGNNYASWKN INTVLIIDDLRFVLVEECPQVPAANATRT+REPYERWAKANEKARAYILASLSEVLAKKHESMLTAREIMDSLQE
Subjt:  LTSVTLNMLAADKLNGNNYASWKNIINTVLIIDDLRFVLVEECPQVPAANATRTIREPYERWAKANEKARAYILASLSEVLAKKHESMLTAREIMDSLQE

Query:  MFGQASYQIKHDPLKYIYNARMNEGASVREHVLNMMVHFNVAEMNGAVIDEASQVSFILESLLESFLQFRSNAVMNKIAYTLTTLLNELQTFESLMKIKG
        MFGQASYQIKHD LKYIYNARMNEGASVREHVLNMMVHFNVAEMNGAVIDEASQVSFILESL ESFLQFRSNAVMNKIAYTLTTLLNELQTFESLMKIKG
Subjt:  MFGQASYQIKHDPLKYIYNARMNEGASVREHVLNMMVHFNVAEMNGAVIDEASQVSFILESLLESFLQFRSNAVMNKIAYTLTTLLNELQTFESLMKIKG

Query:  QKGEANVATSTRKFHRGLTSGTKSMPSSSGNKKWKKKKGGQGNKANLAAAKTTKKAKAAKGICFHCNQEGHWKRNCPKYLAEKKKAKQGKYDLLVLETCL
        QKGEANVATSTRKFHRG TSGTKSMPSSSGNKKWKKKKGGQGNKANLAAAKTTKKAKAAKGICFHCNQEGHWKRNCPKYLAEKKKAKQGKYDLLVLETCL
Subjt:  QKGEANVATSTRKFHRGLTSGTKSMPSSSGNKKWKKKKGGQGNKANLAAAKTTKKAKAAKGICFHCNQEGHWKRNCPKYLAEKKKAKQGKYDLLVLETCL

Query:  VENDDSAWIIDS----------DDDASW------NWACRLSN-------CSGSASTLFTEIFSLLENVYVVPNLKRNLISVKCLLEQSYSLTFNVNKVFI
        VENDDSAWIIDS             +SW          R+           G       + F LLENVYVVP+LKRNLISVKCLLEQSYSLTFNVNKVFI
Subjt:  VENDDSAWIIDS----------DDDASW------NWACRLSN-------CSGSASTLFTEIFSLLENVYVVPNLKRNLISVKCLLEQSYSLTFNVNKVFI

Query:  YKNGVEICSAKLENNLYVLRTLTSKALLNTEMFKTAITQNKRLKISPKENAHLWHLRLWHINLNRIERLVKNGLLSELEENSLPVCESCLEGKMTKIPFT
        YKNGVEICSAKLENNLYVLR+LTSKALLNTEMFKTAITQNKRLKISPKENAHLWHLRL HINLNRIERLVKNGLLSELEENSLPVCESCLEGKMTK PFT
Subjt:  YKNGVEICSAKLENNLYVLRTLTSKALLNTEMFKTAITQNKRLKISPKENAHLWHLRLWHINLNRIERLVKNGLLSELEENSLPVCESCLEGKMTKIPFT

Query:  GKGRRVKEPLELVHSDLCGPMNVKARGGFEYFITFTDDYSRYGYVY
        GKG R KEPLELVHSDLCGPMNVKARGGFEYFITFTDDYSRYGYVY
Subjt:  GKGRRVKEPLELVHSDLCGPMNVKARGGFEYFITFTDDYSRYGYVY

KAA0054490.1 gag/pol protein [Cucumis melo var. makuwa]3.0e-26689.19Show/hide
Query:  LTSVTLNMLAADKLNGNNYASWKNIINTVLIIDDLRFVLVEECPQVPAANATRTIREPYERWAKANEKARAYILASLSEVLAKKHESMLTAREIMDSLQE
        +TS TLNMLAADKLNGNNYASWKN INTVLIIDDLRFVLVEECPQVPAANATRT+REPYERWAKANEKARAYILASLSEVLAKKHESMLTAREIMDSLQE
Subjt:  LTSVTLNMLAADKLNGNNYASWKNIINTVLIIDDLRFVLVEECPQVPAANATRTIREPYERWAKANEKARAYILASLSEVLAKKHESMLTAREIMDSLQE

Query:  MFGQASYQIKHDPLKYIYNARMNEGASVREHVLNMMVHFNVAEMNGAVIDEASQVSFILESLLESFLQFRSNAVMNKIAYTLTTLLNELQTFESLMKIKG
        MFGQASYQIKHD LKYIYNARMNEGASVREHVLNMMVHFNVAEMNGAVIDEASQVSFILESL ESFLQFRSNAVMNKIAYTLTTLLNELQTFESLMKIKG
Subjt:  MFGQASYQIKHDPLKYIYNARMNEGASVREHVLNMMVHFNVAEMNGAVIDEASQVSFILESLLESFLQFRSNAVMNKIAYTLTTLLNELQTFESLMKIKG

Query:  QKGEANVATSTRKFHRGLTSGTKSMPSSSGNKKWKKKKGGQGNKANLAAAKTTKKAKAAKGICFHCNQEGHWKRNCPKYLAEKKKAKQGKYDLLVLETCL
        QKGEANVATSTRKFHRG TSGTKSMPSSSGNKKWKKKKGGQGNKANLAAAKTTKKAKAAKGICFHCNQEGHWKRNCPKYLAEKKKAKQGKYDLLVLETCL
Subjt:  QKGEANVATSTRKFHRGLTSGTKSMPSSSGNKKWKKKKGGQGNKANLAAAKTTKKAKAAKGICFHCNQEGHWKRNCPKYLAEKKKAKQGKYDLLVLETCL

Query:  VENDDSAWIIDS----------DDDASW------NWACRLSN-------CSGSASTLFTEIFSLLENVYVVPNLKRNLISVKCLLEQSYSLTFNVNKVFI
        VENDDSAWIIDS             +SW          R+           G       + F LLENVYVVP+LKRNLISVKCLLEQSYSLTFNVNKVFI
Subjt:  VENDDSAWIIDS----------DDDASW------NWACRLSN-------CSGSASTLFTEIFSLLENVYVVPNLKRNLISVKCLLEQSYSLTFNVNKVFI

Query:  YKNGVEICSAKLENNLYVLRTLTSKALLNTEMFKTAITQNKRLKISPKENAHLWHLRLWHINLNRIERLVKNGLLSELEENSLPVCESCLEGKMTKIPFT
        YKNGVEICSAKLENNLYVLR+LTSKALLNTEMFKTAITQNKRLKISPKENAHLWHLRL HINLNRIERLVKNGLLSELEENSLPVCESCLEGKMTK PFT
Subjt:  YKNGVEICSAKLENNLYVLRTLTSKALLNTEMFKTAITQNKRLKISPKENAHLWHLRLWHINLNRIERLVKNGLLSELEENSLPVCESCLEGKMTKIPFT

Query:  GKGRRVKEPLELVHSDLCGPMNVKARGGFEYFITFTDDYSRYGYVY
        GKG R KEPLELVHSDLCGPMNVKARGGFEYFITFTDDYSRYGYVY
Subjt:  GKGRRVKEPLELVHSDLCGPMNVKARGGFEYFITFTDDYSRYGYVY

KAA0062993.1 gag/pol protein [Cucumis melo var. makuwa]4.2e-26888.95Show/hide
Query:  VFSNLILTSVTLNMLAADKLNGNNYASWKNIINTVLIIDDLRFVLVEECPQVPAANATRTIREPYERWAKANEKARAYILASLSEVLAKKHESMLTAREI
        VF NLI+TS TLNMLAADKLNGNNYASWKN INTVLIIDDLRFVLV+ECPQVPAANATRT+REPYERWAKANEKARAYILASLSEVLAKKHESMLTAREI
Subjt:  VFSNLILTSVTLNMLAADKLNGNNYASWKNIINTVLIIDDLRFVLVEECPQVPAANATRTIREPYERWAKANEKARAYILASLSEVLAKKHESMLTAREI

Query:  MDSLQEMFGQASYQIKHDPLKYIYNARMNEGASVREHVLNMMVHFNVAEMNGAVIDEASQVSFILESLLESFLQFRSNAVMNKIAYTLTTLLNELQTFES
        MDSLQEMFGQASYQIKHD LKYIYNARMNEGASVREHVLNMMVHFNVAEMNGAVIDEASQVSFILESL ESFLQFRSNAVMNKIAYTLTTLLNELQTFES
Subjt:  MDSLQEMFGQASYQIKHDPLKYIYNARMNEGASVREHVLNMMVHFNVAEMNGAVIDEASQVSFILESLLESFLQFRSNAVMNKIAYTLTTLLNELQTFES

Query:  LMKIKGQKGEANVATSTRKFHRGLTSGTKSMPSSSGNKKWKKKKGGQGNKANLAAAKTTKKAKAAKGICFHCNQEGHWKRNCPKYLAEKKKAKQGKYDLL
        LMKIKGQKGEANVATSTRKFHRG TSGTKSMPSSSGNKKWKKKKGGQGNKANLAAAKTTKKAKAAKGICFHCNQEGHWKRNCPKYLAEKKKAKQGKYDLL
Subjt:  LMKIKGQKGEANVATSTRKFHRGLTSGTKSMPSSSGNKKWKKKKGGQGNKANLAAAKTTKKAKAAKGICFHCNQEGHWKRNCPKYLAEKKKAKQGKYDLL

Query:  VLETCLVENDDSAWIIDS----------DDDASW------NWACRLSN-------CSGSASTLFTEIFSLLENVYVVPNLKRNLISVKCLLEQSYSLTFN
        VLETCLVENDDSAWIIDS             +SW          R+           G       + F LLENVYVVP+LKRNLISVKCLLEQSYSLTFN
Subjt:  VLETCLVENDDSAWIIDS----------DDDASW------NWACRLSN-------CSGSASTLFTEIFSLLENVYVVPNLKRNLISVKCLLEQSYSLTFN

Query:  VNKVFIYKNGVEICSAKLENNLYVLRTLTSKALLNTEMFKTAITQNKRLKISPKENAHLWHLRLWHINLNRIERLVKNGLLSELEENSLPVCESCLEGKM
        VNKVFIYKNGVEICSAKLENNLYVLR+LTSKALLNTEMFKTAITQNKRLKISPKENAHLWHLRL HINLNRIERLVKNGLLSELEENSLPVCESCLEGKM
Subjt:  VNKVFIYKNGVEICSAKLENNLYVLRTLTSKALLNTEMFKTAITQNKRLKISPKENAHLWHLRLWHINLNRIERLVKNGLLSELEENSLPVCESCLEGKM

Query:  TKIPFTGKGRRVKEPLELVHSDLCGPMNVKARGGFEYFITFTDDYSRYGYVY
        TK PFTGKG R KEPLELVHSDLCGPMNVKARGGFEYFITFTDDYSRYGYVY
Subjt:  TKIPFTGKGRRVKEPLELVHSDLCGPMNVKARGGFEYFITFTDDYSRYGYVY

TYK14550.1 gag/pol protein [Cucumis melo var. makuwa]8.7e-26689.01Show/hide
Query:  LTSVTLNMLAADKLNGNNYASWKNIINTVLIIDDLRFVLVEECPQVPAANATRTIREPYERWAKANEKARAYILASLSEVLAKKHESMLTAREIMDSLQE
        +TS TLNMLAADKLNGNNYASWKN INTVLIIDDLRFVLVEECPQVPAANATRT+REPYERWAKANEKARAYILASLSEVLAKKHESMLTAREIMDSLQE
Subjt:  LTSVTLNMLAADKLNGNNYASWKNIINTVLIIDDLRFVLVEECPQVPAANATRTIREPYERWAKANEKARAYILASLSEVLAKKHESMLTAREIMDSLQE

Query:  MFGQASYQIKHDPLKYIYNARMNEGASVREHVLNMMVHFNVAEMNGAVIDEASQVSFILESLLESFLQFRSNAVMNKIAYTLTTLLNELQTFESLMKIKG
        MFGQASYQIKHD LKYIYNARMNEGASVREHVLNMMVHFNVAEMNGAVIDEASQVSFILESL ESFLQFRSNAVMNKIAYTLTTLLNELQTFESLMKIKG
Subjt:  MFGQASYQIKHDPLKYIYNARMNEGASVREHVLNMMVHFNVAEMNGAVIDEASQVSFILESLLESFLQFRSNAVMNKIAYTLTTLLNELQTFESLMKIKG

Query:  QKGEANVATSTRKFHRGLTSGTKSMPSSSGNKKWKKKKGGQGNKANLAAAKTTKKAKAAKGICFHCNQEGHWKRNCPKYLAEKKKAKQGKYDLLVLETCL
        QKGEANVATSTRKFHRG TSGTKSMPSSSGNKKWKKKKGGQGNKANLAAAKTTKK KAAKGICFHCNQEGHWKRNCPKYLAEKKKAKQGKYDLLVLETCL
Subjt:  QKGEANVATSTRKFHRGLTSGTKSMPSSSGNKKWKKKKGGQGNKANLAAAKTTKKAKAAKGICFHCNQEGHWKRNCPKYLAEKKKAKQGKYDLLVLETCL

Query:  VENDDSAWIIDS----------DDDASW------NWACRLSN-------CSGSASTLFTEIFSLLENVYVVPNLKRNLISVKCLLEQSYSLTFNVNKVFI
        VENDDSAWIIDS             +SW          R+           G       + F LLENVYVVP+LKRNLISVKCLLEQSYSLTFNVNKVFI
Subjt:  VENDDSAWIIDS----------DDDASW------NWACRLSN-------CSGSASTLFTEIFSLLENVYVVPNLKRNLISVKCLLEQSYSLTFNVNKVFI

Query:  YKNGVEICSAKLENNLYVLRTLTSKALLNTEMFKTAITQNKRLKISPKENAHLWHLRLWHINLNRIERLVKNGLLSELEENSLPVCESCLEGKMTKIPFT
        YKNGVEICSAKLENNLYVLR+LTSKALLNTEMFKTAITQNKRLKISPKENAHLWHLRL HINLNRIERLVKNGLLSELEENSLPVCESCLEGKMTK PFT
Subjt:  YKNGVEICSAKLENNLYVLRTLTSKALLNTEMFKTAITQNKRLKISPKENAHLWHLRLWHINLNRIERLVKNGLLSELEENSLPVCESCLEGKMTKIPFT

Query:  GKGRRVKEPLELVHSDLCGPMNVKARGGFEYFITFTDDYSRYGYVY
        GKG R KEPLELVHSDLCGPMNVKARGGFEYFITFTDDYSRYGYVY
Subjt:  GKGRRVKEPLELVHSDLCGPMNVKARGGFEYFITFTDDYSRYGYVY

TrEMBL top hitse value%identityAlignment
A0A5A7SMH8 Gag/pol protein1.4e-26689.19Show/hide
Query:  LTSVTLNMLAADKLNGNNYASWKNIINTVLIIDDLRFVLVEECPQVPAANATRTIREPYERWAKANEKARAYILASLSEVLAKKHESMLTAREIMDSLQE
        +TS TLNMLAADKLNGNNYASWKN INTVLIIDDLRFVLVEECPQVPAANATRT+REPYERWAKANEKARAYILASLSEVLAKKHESMLTAREIMDSLQE
Subjt:  LTSVTLNMLAADKLNGNNYASWKNIINTVLIIDDLRFVLVEECPQVPAANATRTIREPYERWAKANEKARAYILASLSEVLAKKHESMLTAREIMDSLQE

Query:  MFGQASYQIKHDPLKYIYNARMNEGASVREHVLNMMVHFNVAEMNGAVIDEASQVSFILESLLESFLQFRSNAVMNKIAYTLTTLLNELQTFESLMKIKG
        MFGQASYQIKHD LKYIYNARMNEGASVREHVLNMMVHFNVAEMNGAVIDEASQVSFILESL ESFLQFRSNAVMNKIAYTLTTLLNELQTFESLMKIKG
Subjt:  MFGQASYQIKHDPLKYIYNARMNEGASVREHVLNMMVHFNVAEMNGAVIDEASQVSFILESLLESFLQFRSNAVMNKIAYTLTTLLNELQTFESLMKIKG

Query:  QKGEANVATSTRKFHRGLTSGTKSMPSSSGNKKWKKKKGGQGNKANLAAAKTTKKAKAAKGICFHCNQEGHWKRNCPKYLAEKKKAKQGKYDLLVLETCL
        QKGEANVATSTRKFHRG TSGTKSMPSSSGNKKWKKKKGGQGNKANLAAAKTTKKAKAAKGICFHCNQEGHWKRNCPKYLAEKKKAKQGKYDLLVLETCL
Subjt:  QKGEANVATSTRKFHRGLTSGTKSMPSSSGNKKWKKKKGGQGNKANLAAAKTTKKAKAAKGICFHCNQEGHWKRNCPKYLAEKKKAKQGKYDLLVLETCL

Query:  VENDDSAWIIDS----------DDDASW------NWACRLSN-------CSGSASTLFTEIFSLLENVYVVPNLKRNLISVKCLLEQSYSLTFNVNKVFI
        VENDDSAWIIDS             +SW          R+           G       + F LLENVYVVP+LKRNLISVKCLLEQSYSLTFNVNKVFI
Subjt:  VENDDSAWIIDS----------DDDASW------NWACRLSN-------CSGSASTLFTEIFSLLENVYVVPNLKRNLISVKCLLEQSYSLTFNVNKVFI

Query:  YKNGVEICSAKLENNLYVLRTLTSKALLNTEMFKTAITQNKRLKISPKENAHLWHLRLWHINLNRIERLVKNGLLSELEENSLPVCESCLEGKMTKIPFT
        YKNGVEICSAKLENNLYVLR+LTSKALLNTEMFKTAITQNKRLKISPKENAHLWHLRL HINLNRIERLVKNGLLSELEENSLPVCESCLEGKMTK PFT
Subjt:  YKNGVEICSAKLENNLYVLRTLTSKALLNTEMFKTAITQNKRLKISPKENAHLWHLRLWHINLNRIERLVKNGLLSELEENSLPVCESCLEGKMTKIPFT

Query:  GKGRRVKEPLELVHSDLCGPMNVKARGGFEYFITFTDDYSRYGYVY
        GKG R KEPLELVHSDLCGPMNVKARGGFEYFITFTDDYSRYGYVY
Subjt:  GKGRRVKEPLELVHSDLCGPMNVKARGGFEYFITFTDDYSRYGYVY

A0A5A7TWB9 Gag/pol protein1.4e-26689.19Show/hide
Query:  LTSVTLNMLAADKLNGNNYASWKNIINTVLIIDDLRFVLVEECPQVPAANATRTIREPYERWAKANEKARAYILASLSEVLAKKHESMLTAREIMDSLQE
        +TS TLNMLAADKLNGNNYASWKN INTVLIIDDLRFVLVEECPQVPAANATRT+REPYERWAKANEKARAYILASLSEVLAKKHESMLTAREIMDSLQE
Subjt:  LTSVTLNMLAADKLNGNNYASWKNIINTVLIIDDLRFVLVEECPQVPAANATRTIREPYERWAKANEKARAYILASLSEVLAKKHESMLTAREIMDSLQE

Query:  MFGQASYQIKHDPLKYIYNARMNEGASVREHVLNMMVHFNVAEMNGAVIDEASQVSFILESLLESFLQFRSNAVMNKIAYTLTTLLNELQTFESLMKIKG
        MFGQASYQIKHD LKYIYNARMNEGASVREHVLNMMVHFNVAEMNGAVIDEASQVSFILESL ESFLQFRSNAVMNKIAYTLTTLLNELQTFESLMKIKG
Subjt:  MFGQASYQIKHDPLKYIYNARMNEGASVREHVLNMMVHFNVAEMNGAVIDEASQVSFILESLLESFLQFRSNAVMNKIAYTLTTLLNELQTFESLMKIKG

Query:  QKGEANVATSTRKFHRGLTSGTKSMPSSSGNKKWKKKKGGQGNKANLAAAKTTKKAKAAKGICFHCNQEGHWKRNCPKYLAEKKKAKQGKYDLLVLETCL
        QKGEANVATSTRKFHRG TSGTKSMPSSSGNKKWKKKKGGQGNKANLAAAKTTKKAKAAKGICFHCNQEGHWKRNCPKYLAEKKKAKQGKYDLLVLETCL
Subjt:  QKGEANVATSTRKFHRGLTSGTKSMPSSSGNKKWKKKKGGQGNKANLAAAKTTKKAKAAKGICFHCNQEGHWKRNCPKYLAEKKKAKQGKYDLLVLETCL

Query:  VENDDSAWIIDS----------DDDASW------NWACRLSN-------CSGSASTLFTEIFSLLENVYVVPNLKRNLISVKCLLEQSYSLTFNVNKVFI
        VENDDSAWIIDS             +SW          R+           G       + F LLENVYVVP+LKRNLISVKCLLEQSYSLTFNVNKVFI
Subjt:  VENDDSAWIIDS----------DDDASW------NWACRLSN-------CSGSASTLFTEIFSLLENVYVVPNLKRNLISVKCLLEQSYSLTFNVNKVFI

Query:  YKNGVEICSAKLENNLYVLRTLTSKALLNTEMFKTAITQNKRLKISPKENAHLWHLRLWHINLNRIERLVKNGLLSELEENSLPVCESCLEGKMTKIPFT
        YKNGVEICSAKLENNLYVLR+LTSKALLNTEMFKTAITQNKRLKISPKENAHLWHLRL HINLNRIERLVKNGLLSELEENSLPVCESCLEGKMTK PFT
Subjt:  YKNGVEICSAKLENNLYVLRTLTSKALLNTEMFKTAITQNKRLKISPKENAHLWHLRLWHINLNRIERLVKNGLLSELEENSLPVCESCLEGKMTKIPFT

Query:  GKGRRVKEPLELVHSDLCGPMNVKARGGFEYFITFTDDYSRYGYVY
        GKG R KEPLELVHSDLCGPMNVKARGGFEYFITFTDDYSRYGYVY
Subjt:  GKGRRVKEPLELVHSDLCGPMNVKARGGFEYFITFTDDYSRYGYVY

A0A5A7UGV2 Gag/pol protein1.4e-26689.19Show/hide
Query:  LTSVTLNMLAADKLNGNNYASWKNIINTVLIIDDLRFVLVEECPQVPAANATRTIREPYERWAKANEKARAYILASLSEVLAKKHESMLTAREIMDSLQE
        +TS TLNMLAADKLNGNNYASWKN INTVLIIDDLRFVLVEECPQVPAANATRT+REPYERWAKANEKARAYILASLSEVLAKKHESMLTAREIMDSLQE
Subjt:  LTSVTLNMLAADKLNGNNYASWKNIINTVLIIDDLRFVLVEECPQVPAANATRTIREPYERWAKANEKARAYILASLSEVLAKKHESMLTAREIMDSLQE

Query:  MFGQASYQIKHDPLKYIYNARMNEGASVREHVLNMMVHFNVAEMNGAVIDEASQVSFILESLLESFLQFRSNAVMNKIAYTLTTLLNELQTFESLMKIKG
        MFGQASYQIKHD LKYIYNARMNEGASVREHVLNMMVHFNVAEMNGAVIDEASQVSFILESL ESFLQFRSNAVMNKIAYTLTTLLNELQTFESLMKIKG
Subjt:  MFGQASYQIKHDPLKYIYNARMNEGASVREHVLNMMVHFNVAEMNGAVIDEASQVSFILESLLESFLQFRSNAVMNKIAYTLTTLLNELQTFESLMKIKG

Query:  QKGEANVATSTRKFHRGLTSGTKSMPSSSGNKKWKKKKGGQGNKANLAAAKTTKKAKAAKGICFHCNQEGHWKRNCPKYLAEKKKAKQGKYDLLVLETCL
        QKGEANVATSTRKFHRG TSGTKSMPSSSGNKKWKKKKGGQGNKANLAAAKTTKKAKAAKGICFHCNQEGHWKRNCPKYLAEKKKAKQGKYDLLVLETCL
Subjt:  QKGEANVATSTRKFHRGLTSGTKSMPSSSGNKKWKKKKGGQGNKANLAAAKTTKKAKAAKGICFHCNQEGHWKRNCPKYLAEKKKAKQGKYDLLVLETCL

Query:  VENDDSAWIIDS----------DDDASW------NWACRLSN-------CSGSASTLFTEIFSLLENVYVVPNLKRNLISVKCLLEQSYSLTFNVNKVFI
        VENDDSAWIIDS             +SW          R+           G       + F LLENVYVVP+LKRNLISVKCLLEQSYSLTFNVNKVFI
Subjt:  VENDDSAWIIDS----------DDDASW------NWACRLSN-------CSGSASTLFTEIFSLLENVYVVPNLKRNLISVKCLLEQSYSLTFNVNKVFI

Query:  YKNGVEICSAKLENNLYVLRTLTSKALLNTEMFKTAITQNKRLKISPKENAHLWHLRLWHINLNRIERLVKNGLLSELEENSLPVCESCLEGKMTKIPFT
        YKNGVEICSAKLENNLYVLR+LTSKALLNTEMFKTAITQNKRLKISPKENAHLWHLRL HINLNRIERLVKNGLLSELEENSLPVCESCLEGKMTK PFT
Subjt:  YKNGVEICSAKLENNLYVLRTLTSKALLNTEMFKTAITQNKRLKISPKENAHLWHLRLWHINLNRIERLVKNGLLSELEENSLPVCESCLEGKMTKIPFT

Query:  GKGRRVKEPLELVHSDLCGPMNVKARGGFEYFITFTDDYSRYGYVY
        GKG R KEPLELVHSDLCGPMNVKARGGFEYFITFTDDYSRYGYVY
Subjt:  GKGRRVKEPLELVHSDLCGPMNVKARGGFEYFITFTDDYSRYGYVY

A0A5A7V4M1 Gag/pol protein2.0e-26888.95Show/hide
Query:  VFSNLILTSVTLNMLAADKLNGNNYASWKNIINTVLIIDDLRFVLVEECPQVPAANATRTIREPYERWAKANEKARAYILASLSEVLAKKHESMLTAREI
        VF NLI+TS TLNMLAADKLNGNNYASWKN INTVLIIDDLRFVLV+ECPQVPAANATRT+REPYERWAKANEKARAYILASLSEVLAKKHESMLTAREI
Subjt:  VFSNLILTSVTLNMLAADKLNGNNYASWKNIINTVLIIDDLRFVLVEECPQVPAANATRTIREPYERWAKANEKARAYILASLSEVLAKKHESMLTAREI

Query:  MDSLQEMFGQASYQIKHDPLKYIYNARMNEGASVREHVLNMMVHFNVAEMNGAVIDEASQVSFILESLLESFLQFRSNAVMNKIAYTLTTLLNELQTFES
        MDSLQEMFGQASYQIKHD LKYIYNARMNEGASVREHVLNMMVHFNVAEMNGAVIDEASQVSFILESL ESFLQFRSNAVMNKIAYTLTTLLNELQTFES
Subjt:  MDSLQEMFGQASYQIKHDPLKYIYNARMNEGASVREHVLNMMVHFNVAEMNGAVIDEASQVSFILESLLESFLQFRSNAVMNKIAYTLTTLLNELQTFES

Query:  LMKIKGQKGEANVATSTRKFHRGLTSGTKSMPSSSGNKKWKKKKGGQGNKANLAAAKTTKKAKAAKGICFHCNQEGHWKRNCPKYLAEKKKAKQGKYDLL
        LMKIKGQKGEANVATSTRKFHRG TSGTKSMPSSSGNKKWKKKKGGQGNKANLAAAKTTKKAKAAKGICFHCNQEGHWKRNCPKYLAEKKKAKQGKYDLL
Subjt:  LMKIKGQKGEANVATSTRKFHRGLTSGTKSMPSSSGNKKWKKKKGGQGNKANLAAAKTTKKAKAAKGICFHCNQEGHWKRNCPKYLAEKKKAKQGKYDLL

Query:  VLETCLVENDDSAWIIDS----------DDDASW------NWACRLSN-------CSGSASTLFTEIFSLLENVYVVPNLKRNLISVKCLLEQSYSLTFN
        VLETCLVENDDSAWIIDS             +SW          R+           G       + F LLENVYVVP+LKRNLISVKCLLEQSYSLTFN
Subjt:  VLETCLVENDDSAWIIDS----------DDDASW------NWACRLSN-------CSGSASTLFTEIFSLLENVYVVPNLKRNLISVKCLLEQSYSLTFN

Query:  VNKVFIYKNGVEICSAKLENNLYVLRTLTSKALLNTEMFKTAITQNKRLKISPKENAHLWHLRLWHINLNRIERLVKNGLLSELEENSLPVCESCLEGKM
        VNKVFIYKNGVEICSAKLENNLYVLR+LTSKALLNTEMFKTAITQNKRLKISPKENAHLWHLRL HINLNRIERLVKNGLLSELEENSLPVCESCLEGKM
Subjt:  VNKVFIYKNGVEICSAKLENNLYVLRTLTSKALLNTEMFKTAITQNKRLKISPKENAHLWHLRLWHINLNRIERLVKNGLLSELEENSLPVCESCLEGKM

Query:  TKIPFTGKGRRVKEPLELVHSDLCGPMNVKARGGFEYFITFTDDYSRYGYVY
        TK PFTGKG R KEPLELVHSDLCGPMNVKARGGFEYFITFTDDYSRYGYVY
Subjt:  TKIPFTGKGRRVKEPLELVHSDLCGPMNVKARGGFEYFITFTDDYSRYGYVY

A0A5D3CPJ6 Gag/pol protein4.2e-26689.01Show/hide
Query:  LTSVTLNMLAADKLNGNNYASWKNIINTVLIIDDLRFVLVEECPQVPAANATRTIREPYERWAKANEKARAYILASLSEVLAKKHESMLTAREIMDSLQE
        +TS TLNMLAADKLNGNNYASWKN INTVLIIDDLRFVLVEECPQVPAANATRT+REPYERWAKANEKARAYILASLSEVLAKKHESMLTAREIMDSLQE
Subjt:  LTSVTLNMLAADKLNGNNYASWKNIINTVLIIDDLRFVLVEECPQVPAANATRTIREPYERWAKANEKARAYILASLSEVLAKKHESMLTAREIMDSLQE

Query:  MFGQASYQIKHDPLKYIYNARMNEGASVREHVLNMMVHFNVAEMNGAVIDEASQVSFILESLLESFLQFRSNAVMNKIAYTLTTLLNELQTFESLMKIKG
        MFGQASYQIKHD LKYIYNARMNEGASVREHVLNMMVHFNVAEMNGAVIDEASQVSFILESL ESFLQFRSNAVMNKIAYTLTTLLNELQTFESLMKIKG
Subjt:  MFGQASYQIKHDPLKYIYNARMNEGASVREHVLNMMVHFNVAEMNGAVIDEASQVSFILESLLESFLQFRSNAVMNKIAYTLTTLLNELQTFESLMKIKG

Query:  QKGEANVATSTRKFHRGLTSGTKSMPSSSGNKKWKKKKGGQGNKANLAAAKTTKKAKAAKGICFHCNQEGHWKRNCPKYLAEKKKAKQGKYDLLVLETCL
        QKGEANVATSTRKFHRG TSGTKSMPSSSGNKKWKKKKGGQGNKANLAAAKTTKK KAAKGICFHCNQEGHWKRNCPKYLAEKKKAKQGKYDLLVLETCL
Subjt:  QKGEANVATSTRKFHRGLTSGTKSMPSSSGNKKWKKKKGGQGNKANLAAAKTTKKAKAAKGICFHCNQEGHWKRNCPKYLAEKKKAKQGKYDLLVLETCL

Query:  VENDDSAWIIDS----------DDDASW------NWACRLSN-------CSGSASTLFTEIFSLLENVYVVPNLKRNLISVKCLLEQSYSLTFNVNKVFI
        VENDDSAWIIDS             +SW          R+           G       + F LLENVYVVP+LKRNLISVKCLLEQSYSLTFNVNKVFI
Subjt:  VENDDSAWIIDS----------DDDASW------NWACRLSN-------CSGSASTLFTEIFSLLENVYVVPNLKRNLISVKCLLEQSYSLTFNVNKVFI

Query:  YKNGVEICSAKLENNLYVLRTLTSKALLNTEMFKTAITQNKRLKISPKENAHLWHLRLWHINLNRIERLVKNGLLSELEENSLPVCESCLEGKMTKIPFT
        YKNGVEICSAKLENNLYVLR+LTSKALLNTEMFKTAITQNKRLKISPKENAHLWHLRL HINLNRIERLVKNGLLSELEENSLPVCESCLEGKMTK PFT
Subjt:  YKNGVEICSAKLENNLYVLRTLTSKALLNTEMFKTAITQNKRLKISPKENAHLWHLRLWHINLNRIERLVKNGLLSELEENSLPVCESCLEGKMTKIPFT

Query:  GKGRRVKEPLELVHSDLCGPMNVKARGGFEYFITFTDDYSRYGYVY
        GKG R KEPLELVHSDLCGPMNVKARGGFEYFITFTDDYSRYGYVY
Subjt:  GKGRRVKEPLELVHSDLCGPMNVKARGGFEYFITFTDDYSRYGYVY

SwissProt top hitse value%identityAlignment
P04146 Copia protein3.0e-1922.91Show/hide
Query:  NGNNYASWKNIINTVLIIDDLRFVLVEECPQVPAANATRTIREPYERWAKANEKARAYILASLSEVLAKKHESMLTAREIMDSLQEMFGQASYQIKHDPL
        +G  YA WK  I  +L   D+  V+    P            E  + W KA   A++ I+  LS+       S +TAR+I+++L  ++ + S   +    
Subjt:  NGNNYASWKNIINTVLIIDDLRFVLVEECPQVPAANATRTIREPYERWAKANEKARAYILASLSEVLAKKHESMLTAREIMDSLQEMFGQASYQIKHDPL

Query:  KYIYNARMNEGASVRE--HVLNMMVHFNVAEMNGAVIDEASQVSFILESLLESFLQFRSNAVMNKIAYTLTTLLNELQTFESLMKIKGQKGEANVATSTR
        K + + +++   S+    H+ + ++   +A   GA I+E  ++S +L + L S       A+       LT    + +  +  +KIK    + +      
Subjt:  KYIYNARMNEGASVRE--HVLNMMVHFNVAEMNGAVIDEASQVSFILESLLESFLQFRSNAVMNKIAYTLTTLLNELQTFESLMKIKGQKGEANVATSTR

Query:  KFHRGLTSGTKSMPSSSGNKKWKKKKGGQGNKANLAAAKTTKKAKAAKG------ICFHCNQEGHWKRNCPKYLAEKKKAKQGKYDLLVLETCLVENDDS
          H             + N  +K          NL   + TK  K  KG       C HC +EGH K++C  Y    K+    K      +     +   
Subjt:  KFHRGLTSGTKSMPSSSGNKKWKKKKGGQGNKANLAAAKTTKKAKAAKG------ICFHCNQEGHWKRNCPKYLAEKKKAKQGKYDLLVLETCLVENDDS

Query:  AWIIDSDDDASWNWACRLSNCSGSASTLFTEIFSLLENVYVVPNLK--------------------------------------RNLISVKCLLEQSYSL
        A+++   ++ S    C     SG++  L  +     ++V VVP LK                                       NL+SVK L E   S+
Subjt:  AWIIDSDDDASWNWACRLSNCSGSASTLFTEIFSLLENVYVVPNLK--------------------------------------RNLISVKCLLEQSYSL

Query:  TFNVNKVFIYKNGVEIC-SAKLENNLYVLRTLTSKALLNTEMFKTAITQNKRLKISPKENAHLWHLRLWHINLNRIERLVKNGLLSELE-----ENSLPV
         F+ + V I KNG+ +  ++ + NN+ V+                   Q   +    K N  LWH R  HI+  ++  + +  + S+       E S  +
Subjt:  TFNVNKVFIYKNGVEIC-SAKLENNLYVLRTLTSKALLNTEMFKTAITQNKRLKISPKENAHLWHLRLWHINLNRIERLVKNGLLSELE-----ENSLPV

Query:  CESCLEGKMTKIPF--TGKGRRVKEPLELVHSDLCGPMNVKARGGFEYFITFTDDYSRYGYVY
        CE CL GK  ++PF        +K PL +VHSD+CGP+         YF+ F D ++ Y   Y
Subjt:  CESCLEGKMTKIPF--TGKGRRVKEPLELVHSDLCGPMNVKARGGFEYFITFTDDYSRYGYVY

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-944.3e-2625.86Show/hide
Query:  KLNGNN-YASWKNIINTVLIIDDLRFVLVEECPQVPAANATRTIREPYERWAKANEKARAYILASLSEVLAKKHESMLTAREIMDSLQEMFGQASYQIKH
        K NG+N +++W+  +  +LI   L  VL  +  +     A        E WA  +E+A + I   LS+ +        TAR I   L+ ++   +   K 
Subjt:  KLNGNN-YASWKNIINTVLIIDDLRFVLVEECPQVPAANATRTIREPYERWAKANEKARAYILASLSEVLAKKHESMLTAREIMDSLQEMFGQASYQIKH

Query:  DPLKYIYNARMNEGASVREH--VLNMMVHFNVAEMNGAVIDEASQVSFILESLLESFLQFRSNAVMNKIAYTLTTLLNELQTFESLMKIKGQKGEANVAT
           K +Y   M+EG +   H  V N ++   +A + G  I+E  +   +L SL  S+    +  +  K    L  + + L   E + K    +G+A +  
Subjt:  DPLKYIYNARMNEGASVREH--VLNMMVHFNVAEMNGAVIDEASQVSFILESLLESFLQFRSNAVMNKIAYTLTTLLNELQTFESLMKIKGQKGEANVAT

Query:  STRKFHRGLTSGTKSMPSSSGNKKWKKKKGGQGNKANLAAAKTTKKAKAAKGICFHCNQEGHWKRNCPKYLAEKKKAKQGKYD-------------LLVL
              RG     +S   SS N       G  G     A  K+  ++K+    C++CNQ GH+KR+CP     K +    K D             +L +
Subjt:  STRKFHRGLTSGTKSMPSSSGNKKWKKKKGGQGNKANLAAAKTTKKAKAAKGICFHCNQEGHWKRNCPKYLAEKKKAKQGKYD-------------LLVL

Query:  ---ETCL-VENDDSAWIIDSDDD--------------ASWNWACRLSNCSGSASTLFTEI--------FSLLENVYVVPNLKRNLISVKCLLEQSYSLTF
           E C+ +   +S W++D+                 A      ++ N S S      +I          +L++V  VP+L+ NLIS   L    Y   F
Subjt:  ---ETCL-VENDDSAWIIDSDDD--------------ASWNWACRLSNCSGSASTLFTEI--------FSLLENVYVVPNLKRNLISVKCLLEQSYSLTF

Query:  NVNKVFIYKNGVEICSAKLENNLYVLRTLTSKALLNTEMFKTAITQNKRLKISPKENAHLWHLRLWHINLNRIERLVKNGLLSELEENSLPVCESCLEGK
           K  + K  + I        LY  RT       N E     I Q +      + +  LWH R+ H++   ++ L K  L+S  +  ++  C+ CL GK
Subjt:  NVNKVFIYKNGVEICSAKLENNLYVLRTLTSKALLNTEMFKTAITQNKRLKISPKENAHLWHLRLWHINLNRIERLVKNGLLSELEENSLPVCESCLEGK

Query:  MTKIPFTGKGRRVKEPLELVHSDLCGPMNVKARGGFEYFITFTDDYSRYGYVY
          ++ F     R    L+LV+SD+CGPM +++ GG +YF+TF DD SR  +VY
Subjt:  MTKIPFTGKGRRVKEPLELVHSDLCGPMNVKARGGFEYFITFTDDYSRYGYVY

P25384 Transposon Ty2-C Gag-Pol polyprotein9.7e-1027.81Show/hide
Query:  PNLKRNLISVKCLLEQSYSLTFNVNKVFIYKNGVEICSAKLENNLYVLRTLTSKALLNTEMFKTAITQNKRLKISPKENAHLWHLRLWHINLNRIERLVK
        PN+  +L+S+  L  Q+ +  F  N      +G  +       + Y    L+ K L+ + + K  I    + K   K    L H  L H N   I++ +K
Subjt:  PNLKRNLISVKCLLEQSYSLTFNVNKVFIYKNGVEICSAKLENNLYVLRTLTSKALLNTEMFKTAITQNKRLKISPKENAHLWHLRLWHINLNRIERLVK

Query:  NGLLSELEENSLP-------VCESCLEGKMTKIPFTGKGRRVK-----EPLELVHSDLCGPMNVKARGGFEYFITFTDDYSRYGYVY
           ++ L+E+ +         C  CL GK TK     KG R+K     EP + +H+D+ GP++   +    YFI+FTD+ +R+ +VY
Subjt:  NGLLSELEENSLP-------VCESCLEGKMTKIPFTGKGRRVK-----EPLELVHSDLCGPMNVKARGGFEYFITFTDDYSRYGYVY

Q12472 Transposon Ty2-DR1 Gag-Pol polyprotein9.7e-1027.81Show/hide
Query:  PNLKRNLISVKCLLEQSYSLTFNVNKVFIYKNGVEICSAKLENNLYVLRTLTSKALLNTEMFKTAITQNKRLKISPKENAHLWHLRLWHINLNRIERLVK
        PN+  +L+S+  L  Q+ +  F  N      +G  +       + Y    L+ K L+ + + K  I    + K   K    L H  L H N   I++ +K
Subjt:  PNLKRNLISVKCLLEQSYSLTFNVNKVFIYKNGVEICSAKLENNLYVLRTLTSKALLNTEMFKTAITQNKRLKISPKENAHLWHLRLWHINLNRIERLVK

Query:  NGLLSELEENSLP-------VCESCLEGKMTKIPFTGKGRRVK-----EPLELVHSDLCGPMNVKARGGFEYFITFTDDYSRYGYVY
           ++ L+E+ +         C  CL GK TK     KG R+K     EP + +H+D+ GP++   +    YFI+FTD+ +R+ +VY
Subjt:  NGLLSELEENSLP-------VCESCLEGKMTKIPFTGKGRRVK-----EPLELVHSDLCGPMNVKARGGFEYFITFTDDYSRYGYVY

Q12491 Transposon Ty2-B Gag-Pol polyprotein9.7e-1027.81Show/hide
Query:  PNLKRNLISVKCLLEQSYSLTFNVNKVFIYKNGVEICSAKLENNLYVLRTLTSKALLNTEMFKTAITQNKRLKISPKENAHLWHLRLWHINLNRIERLVK
        PN+  +L+S+  L  Q+ +  F  N      +G  +       + Y    L+ K L+ + + K  I    + K   K    L H  L H N   I++ +K
Subjt:  PNLKRNLISVKCLLEQSYSLTFNVNKVFIYKNGVEICSAKLENNLYVLRTLTSKALLNTEMFKTAITQNKRLKISPKENAHLWHLRLWHINLNRIERLVK

Query:  NGLLSELEENSLP-------VCESCLEGKMTKIPFTGKGRRVK-----EPLELVHSDLCGPMNVKARGGFEYFITFTDDYSRYGYVY
           ++ L+E+ +         C  CL GK TK     KG R+K     EP + +H+D+ GP++   +    YFI+FTD+ +R+ +VY
Subjt:  NGLLSELEENSLP-------VCESCLEGKMTKIPFTGKGRRVK-----EPLELVHSDLCGPMNVKARGGFEYFITFTDDYSRYGYVY

Arabidopsis top hitse value%identityAlignment
ATMG00300.1 Gag-Pol-related retrotransposon family protein6.9e-1138.37Show/hide
Query:  TQNKRLKISPKENAHLWHLRLWHINLNRIERLVKNGLLSELEENSLPVCESCLEGKMTKIPFTGKGRRVKEPLELVHSDLCGPMNV
        T    L  + K+   LWH RL H++   +E LVK G L   + +SL  CE C+ GK  ++ F+      K PL+ VHSDL G  +V
Subjt:  TQNKRLKISPKENAHLWHLRLWHINLNRIERLVKNGLLSELEENSLPVCESCLEGKMTKIPFTGKGRRVKEPLELVHSDLCGPMNV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAAATGCTAGTATAACTAATATAAATAAATTGTATGTTTTCAGTAATTTGATACTGACGAGTGTTACTTTGAATATGCTGGCTGCTGATAAACTTAATGGCAATAA
TTATGCATCTTGGAAAAATATTATCAACACTGTGCTAATCATCGATGACCTTAGATTTGTCCTAGTTGAGGAGTGTCCTCAAGTCCCAGCTGCTAATGCAACTCGAACTA
TTCGAGAACCATATGAGCGTTGGGCCAAGGCAAATGAAAAAGCCCGAGCATACATCTTGGCAAGCTTATCTGAAGTATTGGCCAAGAAACATGAATCAATGCTCACTGCT
CGTGAGATTATGGACTCCTTGCAGGAGATGTTTGGTCAGGCCTCTTATCAGATCAAGCATGATCCTCTGAAATACATTTATAATGCCCGTATGAATGAGGGAGCCTCAGT
TCGAGAACATGTTCTCAATATGATGGTTCATTTCAACGTGGCAGAAATGAATGGGGCTGTCATCGATGAAGCCAGTCAGGTTAGCTTTATTTTAGAATCTCTGTTAGAGA
GTTTCCTGCAATTTAGAAGCAATGCTGTTATGAATAAGATTGCTTATACCCTTACCACCCTTCTTAACGAGTTACAGACTTTCGAGTCTCTGATGAAAATCAAGGGACAG
AAGGGAGAGGCAAATGTTGCTACTTCCACAAGAAAGTTCCATAGGGGTTTGACCTCTGGAACTAAGTCTATGCCTTCTTCATCTGGCAATAAGAAGTGGAAGAAGAAGAA
GGGTGGCCAAGGAAATAAAGCTAACCTCGCTGCTGCTAAAACGACCAAGAAAGCCAAAGCTGCAAAGGGAATATGTTTCCATTGCAACCAAGAGGGACATTGGAAGAGAA
ACTGTCCCAAGTACTTGGCAGAAAAGAAGAAGGCCAAACAAGGTAAATATGATTTACTAGTGCTAGAGACTTGTTTAGTGGAAAATGATGATTCAGCCTGGATAATAGAT
TCAGATGACGATGCGAGTTGGAACTGGGCATGTCGTCTCAGCAATTGCAGTGGGAGCGCTTCAACTTTGTTTACAGAAATCTTTTCTTTATTAGAAAATGTATATGTTGT
TCCTAATTTAAAAAGGAACTTGATTTCTGTAAAGTGCTTACTAGAACAATCTTACTCGTTAACTTTTAATGTAAATAAAGTGTTTATTTACAAAAATGGTGTTGAGATTT
GTTCTGCAAAGTTAGAAAATAATCTTTATGTGTTAAGAACATTAACATCTAAAGCCCTTCTTAATACTGAAATGTTCAAAACTGCAATAACTCAAAATAAAAGACTTAAA
ATTTCTCCAAAAGAAAATGCTCATCTTTGGCACCTAAGATTATGGCACATAAATCTCAATAGGATTGAGAGATTAGTAAAGAATGGACTTCTAAGTGAGTTAGAAGAAAA
TTCTTTACCTGTATGTGAGTCATGCCTTGAAGGTAAGATGACCAAAATACCTTTTACTGGAAAAGGTCGTAGGGTCAAAGAACCTCTAGAACTTGTACATTCAGATCTAT
GTGGTCCTATGAATGTTAAAGCAAGAGGAGGATTTGAATATTTCATCACTTTTACTGATGATTATTCAAGATATGGGTATGTTTATTAA
mRNA sequenceShow/hide mRNA sequence
ATGGAAAATGCTAGTATAACTAATATAAATAAATTGTATGTTTTCAGTAATTTGATACTGACGAGTGTTACTTTGAATATGCTGGCTGCTGATAAACTTAATGGCAATAA
TTATGCATCTTGGAAAAATATTATCAACACTGTGCTAATCATCGATGACCTTAGATTTGTCCTAGTTGAGGAGTGTCCTCAAGTCCCAGCTGCTAATGCAACTCGAACTA
TTCGAGAACCATATGAGCGTTGGGCCAAGGCAAATGAAAAAGCCCGAGCATACATCTTGGCAAGCTTATCTGAAGTATTGGCCAAGAAACATGAATCAATGCTCACTGCT
CGTGAGATTATGGACTCCTTGCAGGAGATGTTTGGTCAGGCCTCTTATCAGATCAAGCATGATCCTCTGAAATACATTTATAATGCCCGTATGAATGAGGGAGCCTCAGT
TCGAGAACATGTTCTCAATATGATGGTTCATTTCAACGTGGCAGAAATGAATGGGGCTGTCATCGATGAAGCCAGTCAGGTTAGCTTTATTTTAGAATCTCTGTTAGAGA
GTTTCCTGCAATTTAGAAGCAATGCTGTTATGAATAAGATTGCTTATACCCTTACCACCCTTCTTAACGAGTTACAGACTTTCGAGTCTCTGATGAAAATCAAGGGACAG
AAGGGAGAGGCAAATGTTGCTACTTCCACAAGAAAGTTCCATAGGGGTTTGACCTCTGGAACTAAGTCTATGCCTTCTTCATCTGGCAATAAGAAGTGGAAGAAGAAGAA
GGGTGGCCAAGGAAATAAAGCTAACCTCGCTGCTGCTAAAACGACCAAGAAAGCCAAAGCTGCAAAGGGAATATGTTTCCATTGCAACCAAGAGGGACATTGGAAGAGAA
ACTGTCCCAAGTACTTGGCAGAAAAGAAGAAGGCCAAACAAGGTAAATATGATTTACTAGTGCTAGAGACTTGTTTAGTGGAAAATGATGATTCAGCCTGGATAATAGAT
TCAGATGACGATGCGAGTTGGAACTGGGCATGTCGTCTCAGCAATTGCAGTGGGAGCGCTTCAACTTTGTTTACAGAAATCTTTTCTTTATTAGAAAATGTATATGTTGT
TCCTAATTTAAAAAGGAACTTGATTTCTGTAAAGTGCTTACTAGAACAATCTTACTCGTTAACTTTTAATGTAAATAAAGTGTTTATTTACAAAAATGGTGTTGAGATTT
GTTCTGCAAAGTTAGAAAATAATCTTTATGTGTTAAGAACATTAACATCTAAAGCCCTTCTTAATACTGAAATGTTCAAAACTGCAATAACTCAAAATAAAAGACTTAAA
ATTTCTCCAAAAGAAAATGCTCATCTTTGGCACCTAAGATTATGGCACATAAATCTCAATAGGATTGAGAGATTAGTAAAGAATGGACTTCTAAGTGAGTTAGAAGAAAA
TTCTTTACCTGTATGTGAGTCATGCCTTGAAGGTAAGATGACCAAAATACCTTTTACTGGAAAAGGTCGTAGGGTCAAAGAACCTCTAGAACTTGTACATTCAGATCTAT
GTGGTCCTATGAATGTTAAAGCAAGAGGAGGATTTGAATATTTCATCACTTTTACTGATGATTATTCAAGATATGGGTATGTTTATTAA
Protein sequenceShow/hide protein sequence
MENASITNINKLYVFSNLILTSVTLNMLAADKLNGNNYASWKNIINTVLIIDDLRFVLVEECPQVPAANATRTIREPYERWAKANEKARAYILASLSEVLAKKHESMLTA
REIMDSLQEMFGQASYQIKHDPLKYIYNARMNEGASVREHVLNMMVHFNVAEMNGAVIDEASQVSFILESLLESFLQFRSNAVMNKIAYTLTTLLNELQTFESLMKIKGQ
KGEANVATSTRKFHRGLTSGTKSMPSSSGNKKWKKKKGGQGNKANLAAAKTTKKAKAAKGICFHCNQEGHWKRNCPKYLAEKKKAKQGKYDLLVLETCLVENDDSAWIID
SDDDASWNWACRLSNCSGSASTLFTEIFSLLENVYVVPNLKRNLISVKCLLEQSYSLTFNVNKVFIYKNGVEICSAKLENNLYVLRTLTSKALLNTEMFKTAITQNKRLK
ISPKENAHLWHLRLWHINLNRIERLVKNGLLSELEENSLPVCESCLEGKMTKIPFTGKGRRVKEPLELVHSDLCGPMNVKARGGFEYFITFTDDYSRYGYVY