; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0015838 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0015838
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionGag/pol protein
Genome locationchr12:26763775..26777695
RNA-Seq ExpressionLag0015838
SyntenyLag0015838
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR001878 - Zinc finger, CCHC-type
IPR005162 - Retrotransposon gag domain
IPR012337 - Ribonuclease H-like superfamily
IPR021109 - Aspartic peptidase domain superfamily
IPR036397 - Ribonuclease H superfamily
IPR036875 - Zinc finger, CCHC-type superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0031826.1 gag/pol protein [Cucumis melo var. makuwa]1.1e-20058.13Show/hide
Query:  DLRFVLTEECPPVPPRTTAQAVKDAHERWTKVNEKVRVYILVSLSEVLAKRYENVKTTREIMNSLQEMFGLPSYQLHHDALKNVFNAKMQEGQSVRKHVL
        DLRFVL EECP VP     + V++ +ERW K NEK R YIL SLSEVLAK++E++ T REIM+SLQEMFG  SYQ+ HDALK ++NA+M EG SVR+HVL
Subjt:  DLRFVLTEECPPVPPRTTAQAVKDAHERWTKVNEKVRVYILVSLSEVLAKRYENVKTTREIMNSLQEMFGLPSYQLHHDALKNVFNAKMQEGQSVRKHVL

Query:  DMINQFNIAEANGRAVCECSQVVFILHSLPASYMSFRTNASMNKIQFNLTTLLSELQIYESMQKSKGKNVVKGEANVAHSKKKFLKGSSSGTKSVPHASS
        +M+  FN+AE NG  + E SQV FIL SLP S++ FR+NA MNKI + LTTLL+ELQ +ES+ K KG+   KGEANVA S +KF +GS+SGTKS+P +S 
Subjt:  DMINQFNIAEANGRAVCECSQVVFILHSLPASYMSFRTNASMNKIQFNLTTLLSELQIYESMQKSKGKNVVKGEANVAHSKKKFLKGSSSGTKSVPHASS

Query:  SKQNQKRTGDKG-KAPTQAVKGKGKAKVVADKGRCFHCNADGHWKCNCPYYLAEKKREKEGKFDLLVLETCLIEHDEFAWILDSEATNHVCSSFQG-NDF
        +K+ +K+ G +G KA   A K   KAK  A KG CFHCN +GHWK NCP YLAEKK+ K+GK+DLLVLETCL+E+D+ AWI+DS ATNHVCSSFQG + +
Subjt:  SKQNQKRTGDKG-KAPTQAVKGKGKAKVVADKGRCFHCNADGHWKCNCPYYLAEKKREKEGKFDLLVLETCLIEHDEFAWILDSEATNHVCSSFQG-NDF

Query:  QQLADGEMTLKVGIGEVVSARAMGTAKLFFRNKYFILEDLYLVPRIKRNLISVSAFLEQGYTISFLLNEALISRNGTYICSAKRENNLFVLRPTDAKAIL
        +QL  GEMT++VG G VVSA A+G  +L  +  + +LE++Y+VP +KRNLISV   LEQ Y+++F +N+  I +NG  ICSAK ENNL+VLR   +KA+L
Subjt:  QQLADGEMTLKVGIGEVVSARAMGTAKLFFRNKYFILEDLYLVPRIKRNLISVSAFLEQGYTISFLLNEALISRNGTYICSAKRENNLFVLRPTDAKAIL

Query:  SHEMFKTTETQNKRQKISPLSNNSYLSHLCLGHININQIDRLVKNGLLTDLEDTSLPSYESCLEGKMTNRSFTGKGYRAKEPLELIHSDLCGLMNVKARA
        + EMFKT  TQNKR KISP   N++L HL LGHIN+N+I+RLVKNGLL++LE+ SLP  ESCLEGKMT R FTGKG+RAKEPLEL+HSDLCG MNVKAR 
Subjt:  SHEMFKTTETQNKRQKISPLSNNSYLSHLCLGHININQIDRLVKNGLLTDLEDTSLPSYESCLEGKMTNRSFTGKGYRAKEPLELIHSDLCGLMNVKARA

Query:  ------------PKTQNPYVLPQNSSCSGSHK--------------PVLRPENSGE-----------------DSRAPGTPQQNGVSERRNKTLLDMVRS
                     +    Y++   S      K                 R +  GE                    APGTPQQNGVSERRN+TLLDMVRS
Subjt:  ------------PKTQNPYVLPQNSSCSGSHK--------------PVLRPENSGE-----------------DSRAPGTPQQNGVSERRNKTLLDMVRS

Query:  MMSYAQLPSSFWGYAVETAVHILNNVPSKSFSETPFELWRGRKPSLRYFRIWGCPA---DNCPK
        MMSYA LP+SFWGYAV+TAV+ILN VPSKS SETP +LW GRK SLR+FRIWGCPA   +N PK
Subjt:  MMSYAQLPSSFWGYAVETAVHILNNVPSKSFSETPFELWRGRKPSLRYFRIWGCPA---DNCPK

KAA0047792.1 gag/pol protein [Cucumis melo var. makuwa]1.1e-20058.13Show/hide
Query:  DLRFVLTEECPPVPPRTTAQAVKDAHERWTKVNEKVRVYILVSLSEVLAKRYENVKTTREIMNSLQEMFGLPSYQLHHDALKNVFNAKMQEGQSVRKHVL
        DLRFVL EECP VP     + V++ +ERW K NEK R YIL SLSEVLAK++E++ T REIM+SLQEMFG  SYQ+ HDALK ++NA+M EG SVR+HVL
Subjt:  DLRFVLTEECPPVPPRTTAQAVKDAHERWTKVNEKVRVYILVSLSEVLAKRYENVKTTREIMNSLQEMFGLPSYQLHHDALKNVFNAKMQEGQSVRKHVL

Query:  DMINQFNIAEANGRAVCECSQVVFILHSLPASYMSFRTNASMNKIQFNLTTLLSELQIYESMQKSKGKNVVKGEANVAHSKKKFLKGSSSGTKSVPHASS
        +M+  FN+AE NG  + E SQV FIL SLP S++ FR+NA MNKI + LTTLL+ELQ +ES+ K KG+   KGEANVA S +KF +GS+SGTKS+P +S 
Subjt:  DMINQFNIAEANGRAVCECSQVVFILHSLPASYMSFRTNASMNKIQFNLTTLLSELQIYESMQKSKGKNVVKGEANVAHSKKKFLKGSSSGTKSVPHASS

Query:  SKQNQKRTGDKG-KAPTQAVKGKGKAKVVADKGRCFHCNADGHWKCNCPYYLAEKKREKEGKFDLLVLETCLIEHDEFAWILDSEATNHVCSSFQG-NDF
        +K+ +K+ G +G KA   A K   KAK  A KG CFHCN +GHWK NCP YLAEKK+ K+GK+DLLVLETCL+E+D+ AWI+DS ATNHVCSSFQG + +
Subjt:  SKQNQKRTGDKG-KAPTQAVKGKGKAKVVADKGRCFHCNADGHWKCNCPYYLAEKKREKEGKFDLLVLETCLIEHDEFAWILDSEATNHVCSSFQG-NDF

Query:  QQLADGEMTLKVGIGEVVSARAMGTAKLFFRNKYFILEDLYLVPRIKRNLISVSAFLEQGYTISFLLNEALISRNGTYICSAKRENNLFVLRPTDAKAIL
        +QL  GEMT++VG G VVSA A+G  +L  +  + +LE++Y+VP +KRNLISV   LEQ Y+++F +N+  I +NG  ICSAK ENNL+VLR   +KA+L
Subjt:  QQLADGEMTLKVGIGEVVSARAMGTAKLFFRNKYFILEDLYLVPRIKRNLISVSAFLEQGYTISFLLNEALISRNGTYICSAKRENNLFVLRPTDAKAIL

Query:  SHEMFKTTETQNKRQKISPLSNNSYLSHLCLGHININQIDRLVKNGLLTDLEDTSLPSYESCLEGKMTNRSFTGKGYRAKEPLELIHSDLCGLMNVKARA
        + EMFKT  TQNKR KISP   N++L HL LGHIN+N+I+RLVKNGLL++LE+ SLP  ESCLEGKMT R FTGKG+RAKEPLEL+HSDLCG MNVKAR 
Subjt:  SHEMFKTTETQNKRQKISPLSNNSYLSHLCLGHININQIDRLVKNGLLTDLEDTSLPSYESCLEGKMTNRSFTGKGYRAKEPLELIHSDLCGLMNVKARA

Query:  ------------PKTQNPYVLPQNSSCSGSHK--------------PVLRPENSGE-----------------DSRAPGTPQQNGVSERRNKTLLDMVRS
                     +    Y++   S      K                 R +  GE                    APGTPQQNGVSERRN+TLLDMVRS
Subjt:  ------------PKTQNPYVLPQNSSCSGSHK--------------PVLRPENSGE-----------------DSRAPGTPQQNGVSERRNKTLLDMVRS

Query:  MMSYAQLPSSFWGYAVETAVHILNNVPSKSFSETPFELWRGRKPSLRYFRIWGCPA---DNCPK
        MMSYA LP+SFWGYAV+TAV+ILN VPSKS SETP +LW GRK SLR+FRIWGCPA   +N PK
Subjt:  MMSYAQLPSSFWGYAVETAVHILNNVPSKSFSETPFELWRGRKPSLRYFRIWGCPA---DNCPK

KAA0048404.1 gag/pol protein [Cucumis melo var. makuwa]1.1e-20058.13Show/hide
Query:  DLRFVLTEECPPVPPRTTAQAVKDAHERWTKVNEKVRVYILVSLSEVLAKRYENVKTTREIMNSLQEMFGLPSYQLHHDALKNVFNAKMQEGQSVRKHVL
        DLRFVL EECP VP     + V++ +ERW K NEK R YIL SLSEVLAK++E++ T REIM+SLQEMFG  SYQ+ HDALK ++NA+M EG SVR+HVL
Subjt:  DLRFVLTEECPPVPPRTTAQAVKDAHERWTKVNEKVRVYILVSLSEVLAKRYENVKTTREIMNSLQEMFGLPSYQLHHDALKNVFNAKMQEGQSVRKHVL

Query:  DMINQFNIAEANGRAVCECSQVVFILHSLPASYMSFRTNASMNKIQFNLTTLLSELQIYESMQKSKGKNVVKGEANVAHSKKKFLKGSSSGTKSVPHASS
        +M+  FN+AE NG  + E SQV FIL SLP S++ FR+NA MNKI + LTTLL+ELQ +ES+ K KG+   KGEANVA S +KF +GS+SGTKS+P +S 
Subjt:  DMINQFNIAEANGRAVCECSQVVFILHSLPASYMSFRTNASMNKIQFNLTTLLSELQIYESMQKSKGKNVVKGEANVAHSKKKFLKGSSSGTKSVPHASS

Query:  SKQNQKRTGDKG-KAPTQAVKGKGKAKVVADKGRCFHCNADGHWKCNCPYYLAEKKREKEGKFDLLVLETCLIEHDEFAWILDSEATNHVCSSFQG-NDF
        +K+ +K+ G +G KA   A K   KAK  A KG CFHCN +GHWK NCP YLAEKK+ K+GK+DLLVLETCL+E+D+ AWI+DS ATNHVCSSFQG + +
Subjt:  SKQNQKRTGDKG-KAPTQAVKGKGKAKVVADKGRCFHCNADGHWKCNCPYYLAEKKREKEGKFDLLVLETCLIEHDEFAWILDSEATNHVCSSFQG-NDF

Query:  QQLADGEMTLKVGIGEVVSARAMGTAKLFFRNKYFILEDLYLVPRIKRNLISVSAFLEQGYTISFLLNEALISRNGTYICSAKRENNLFVLRPTDAKAIL
        +QL  GEMT++VG G VVSA A+G  +L  +  + +LE++Y+VP +KRNLISV   LEQ Y+++F +N+  I +NG  ICSAK ENNL+VLR   +KA+L
Subjt:  QQLADGEMTLKVGIGEVVSARAMGTAKLFFRNKYFILEDLYLVPRIKRNLISVSAFLEQGYTISFLLNEALISRNGTYICSAKRENNLFVLRPTDAKAIL

Query:  SHEMFKTTETQNKRQKISPLSNNSYLSHLCLGHININQIDRLVKNGLLTDLEDTSLPSYESCLEGKMTNRSFTGKGYRAKEPLELIHSDLCGLMNVKARA
        + EMFKT  TQNKR KISP   N++L HL LGHIN+N+I+RLVKNGLL++LE+ SLP  ESCLEGKMT R FTGKG+RAKEPLEL+HSDLCG MNVKAR 
Subjt:  SHEMFKTTETQNKRQKISPLSNNSYLSHLCLGHININQIDRLVKNGLLTDLEDTSLPSYESCLEGKMTNRSFTGKGYRAKEPLELIHSDLCGLMNVKARA

Query:  ------------PKTQNPYVLPQNSSCSGSHK--------------PVLRPENSGE-----------------DSRAPGTPQQNGVSERRNKTLLDMVRS
                     +    Y++   S      K                 R +  GE                    APGTPQQNGVSERRN+TLLDMVRS
Subjt:  ------------PKTQNPYVLPQNSSCSGSHK--------------PVLRPENSGE-----------------DSRAPGTPQQNGVSERRNKTLLDMVRS

Query:  MMSYAQLPSSFWGYAVETAVHILNNVPSKSFSETPFELWRGRKPSLRYFRIWGCPA---DNCPK
        MMSYA LP+SFWGYAV+TAV+ILN VPSKS SETP +LW GRK SLR+FRIWGCPA   +N PK
Subjt:  MMSYAQLPSSFWGYAVETAVHILNNVPSKSFSETPFELWRGRKPSLRYFRIWGCPA---DNCPK

KAA0054490.1 gag/pol protein [Cucumis melo var. makuwa]1.1e-20058.13Show/hide
Query:  DLRFVLTEECPPVPPRTTAQAVKDAHERWTKVNEKVRVYILVSLSEVLAKRYENVKTTREIMNSLQEMFGLPSYQLHHDALKNVFNAKMQEGQSVRKHVL
        DLRFVL EECP VP     + V++ +ERW K NEK R YIL SLSEVLAK++E++ T REIM+SLQEMFG  SYQ+ HDALK ++NA+M EG SVR+HVL
Subjt:  DLRFVLTEECPPVPPRTTAQAVKDAHERWTKVNEKVRVYILVSLSEVLAKRYENVKTTREIMNSLQEMFGLPSYQLHHDALKNVFNAKMQEGQSVRKHVL

Query:  DMINQFNIAEANGRAVCECSQVVFILHSLPASYMSFRTNASMNKIQFNLTTLLSELQIYESMQKSKGKNVVKGEANVAHSKKKFLKGSSSGTKSVPHASS
        +M+  FN+AE NG  + E SQV FIL SLP S++ FR+NA MNKI + LTTLL+ELQ +ES+ K KG+   KGEANVA S +KF +GS+SGTKS+P +S 
Subjt:  DMINQFNIAEANGRAVCECSQVVFILHSLPASYMSFRTNASMNKIQFNLTTLLSELQIYESMQKSKGKNVVKGEANVAHSKKKFLKGSSSGTKSVPHASS

Query:  SKQNQKRTGDKG-KAPTQAVKGKGKAKVVADKGRCFHCNADGHWKCNCPYYLAEKKREKEGKFDLLVLETCLIEHDEFAWILDSEATNHVCSSFQG-NDF
        +K+ +K+ G +G KA   A K   KAK  A KG CFHCN +GHWK NCP YLAEKK+ K+GK+DLLVLETCL+E+D+ AWI+DS ATNHVCSSFQG + +
Subjt:  SKQNQKRTGDKG-KAPTQAVKGKGKAKVVADKGRCFHCNADGHWKCNCPYYLAEKKREKEGKFDLLVLETCLIEHDEFAWILDSEATNHVCSSFQG-NDF

Query:  QQLADGEMTLKVGIGEVVSARAMGTAKLFFRNKYFILEDLYLVPRIKRNLISVSAFLEQGYTISFLLNEALISRNGTYICSAKRENNLFVLRPTDAKAIL
        +QL  GEMT++VG G VVSA A+G  +L  +  + +LE++Y+VP +KRNLISV   LEQ Y+++F +N+  I +NG  ICSAK ENNL+VLR   +KA+L
Subjt:  QQLADGEMTLKVGIGEVVSARAMGTAKLFFRNKYFILEDLYLVPRIKRNLISVSAFLEQGYTISFLLNEALISRNGTYICSAKRENNLFVLRPTDAKAIL

Query:  SHEMFKTTETQNKRQKISPLSNNSYLSHLCLGHININQIDRLVKNGLLTDLEDTSLPSYESCLEGKMTNRSFTGKGYRAKEPLELIHSDLCGLMNVKARA
        + EMFKT  TQNKR KISP   N++L HL LGHIN+N+I+RLVKNGLL++LE+ SLP  ESCLEGKMT R FTGKG+RAKEPLEL+HSDLCG MNVKAR 
Subjt:  SHEMFKTTETQNKRQKISPLSNNSYLSHLCLGHININQIDRLVKNGLLTDLEDTSLPSYESCLEGKMTNRSFTGKGYRAKEPLELIHSDLCGLMNVKARA

Query:  ------------PKTQNPYVLPQNSSCSGSHK--------------PVLRPENSGE-----------------DSRAPGTPQQNGVSERRNKTLLDMVRS
                     +    Y++   S      K                 R +  GE                    APGTPQQNGVSERRN+TLLDMVRS
Subjt:  ------------PKTQNPYVLPQNSSCSGSHK--------------PVLRPENSGE-----------------DSRAPGTPQQNGVSERRNKTLLDMVRS

Query:  MMSYAQLPSSFWGYAVETAVHILNNVPSKSFSETPFELWRGRKPSLRYFRIWGCPA---DNCPK
        MMSYA LP+SFWGYAV+TAV+ILN VPSKS SETP +LW GRK SLR+FRIWGCPA   +N PK
Subjt:  MMSYAQLPSSFWGYAVETAVHILNNVPSKSFSETPFELWRGRKPSLRYFRIWGCPA---DNCPK

TYK14550.1 gag/pol protein [Cucumis melo var. makuwa]3.2e-20057.98Show/hide
Query:  DLRFVLTEECPPVPPRTTAQAVKDAHERWTKVNEKVRVYILVSLSEVLAKRYENVKTTREIMNSLQEMFGLPSYQLHHDALKNVFNAKMQEGQSVRKHVL
        DLRFVL EECP VP     + V++ +ERW K NEK R YIL SLSEVLAK++E++ T REIM+SLQEMFG  SYQ+ HDALK ++NA+M EG SVR+HVL
Subjt:  DLRFVLTEECPPVPPRTTAQAVKDAHERWTKVNEKVRVYILVSLSEVLAKRYENVKTTREIMNSLQEMFGLPSYQLHHDALKNVFNAKMQEGQSVRKHVL

Query:  DMINQFNIAEANGRAVCECSQVVFILHSLPASYMSFRTNASMNKIQFNLTTLLSELQIYESMQKSKGKNVVKGEANVAHSKKKFLKGSSSGTKSVPHASS
        +M+  FN+AE NG  + E SQV FIL SLP S++ FR+NA MNKI + LTTLL+ELQ +ES+ K KG+   KGEANVA S +KF +GS+SGTKS+P +S 
Subjt:  DMINQFNIAEANGRAVCECSQVVFILHSLPASYMSFRTNASMNKIQFNLTTLLSELQIYESMQKSKGKNVVKGEANVAHSKKKFLKGSSSGTKSVPHASS

Query:  SKQNQKRTGDKG-KAPTQAVKGKGKAKVVADKGRCFHCNADGHWKCNCPYYLAEKKREKEGKFDLLVLETCLIEHDEFAWILDSEATNHVCSSFQG-NDF
        +K+ +K+ G +G KA   A K   K K  A KG CFHCN +GHWK NCP YLAEKK+ K+GK+DLLVLETCL+E+D+ AWI+DS ATNHVCSSFQG + +
Subjt:  SKQNQKRTGDKG-KAPTQAVKGKGKAKVVADKGRCFHCNADGHWKCNCPYYLAEKKREKEGKFDLLVLETCLIEHDEFAWILDSEATNHVCSSFQG-NDF

Query:  QQLADGEMTLKVGIGEVVSARAMGTAKLFFRNKYFILEDLYLVPRIKRNLISVSAFLEQGYTISFLLNEALISRNGTYICSAKRENNLFVLRPTDAKAIL
        +QL  GEMT++VG G VVSA A+G  +L  +  + +LE++Y+VP +KRNLISV   LEQ Y+++F +N+  I +NG  ICSAK ENNL+VLR   +KA+L
Subjt:  QQLADGEMTLKVGIGEVVSARAMGTAKLFFRNKYFILEDLYLVPRIKRNLISVSAFLEQGYTISFLLNEALISRNGTYICSAKRENNLFVLRPTDAKAIL

Query:  SHEMFKTTETQNKRQKISPLSNNSYLSHLCLGHININQIDRLVKNGLLTDLEDTSLPSYESCLEGKMTNRSFTGKGYRAKEPLELIHSDLCGLMNVKARA
        + EMFKT  TQNKR KISP   N++L HL LGHIN+N+I+RLVKNGLL++LE+ SLP  ESCLEGKMT R FTGKG+RAKEPLEL+HSDLCG MNVKAR 
Subjt:  SHEMFKTTETQNKRQKISPLSNNSYLSHLCLGHININQIDRLVKNGLLTDLEDTSLPSYESCLEGKMTNRSFTGKGYRAKEPLELIHSDLCGLMNVKARA

Query:  ------------PKTQNPYVLPQNSSCSGSHK--------------PVLRPENSGE-----------------DSRAPGTPQQNGVSERRNKTLLDMVRS
                     +    Y++   S      K                 R +  GE                    APGTPQQNGVSERRN+TLLDMVRS
Subjt:  ------------PKTQNPYVLPQNSSCSGSHK--------------PVLRPENSGE-----------------DSRAPGTPQQNGVSERRNKTLLDMVRS

Query:  MMSYAQLPSSFWGYAVETAVHILNNVPSKSFSETPFELWRGRKPSLRYFRIWGCPA---DNCPK
        MMSYA LP+SFWGYAV+TAV+ILN VPSKS SETP +LW GRK SLR+FRIWGCPA   +N PK
Subjt:  MMSYAQLPSSFWGYAVETAVHILNNVPSKSFSETPFELWRGRKPSLRYFRIWGCPA---DNCPK

TrEMBL top hitse value%identityAlignment
A0A5A7SMH8 Gag/pol protein5.3e-20158.13Show/hide
Query:  DLRFVLTEECPPVPPRTTAQAVKDAHERWTKVNEKVRVYILVSLSEVLAKRYENVKTTREIMNSLQEMFGLPSYQLHHDALKNVFNAKMQEGQSVRKHVL
        DLRFVL EECP VP     + V++ +ERW K NEK R YIL SLSEVLAK++E++ T REIM+SLQEMFG  SYQ+ HDALK ++NA+M EG SVR+HVL
Subjt:  DLRFVLTEECPPVPPRTTAQAVKDAHERWTKVNEKVRVYILVSLSEVLAKRYENVKTTREIMNSLQEMFGLPSYQLHHDALKNVFNAKMQEGQSVRKHVL

Query:  DMINQFNIAEANGRAVCECSQVVFILHSLPASYMSFRTNASMNKIQFNLTTLLSELQIYESMQKSKGKNVVKGEANVAHSKKKFLKGSSSGTKSVPHASS
        +M+  FN+AE NG  + E SQV FIL SLP S++ FR+NA MNKI + LTTLL+ELQ +ES+ K KG+   KGEANVA S +KF +GS+SGTKS+P +S 
Subjt:  DMINQFNIAEANGRAVCECSQVVFILHSLPASYMSFRTNASMNKIQFNLTTLLSELQIYESMQKSKGKNVVKGEANVAHSKKKFLKGSSSGTKSVPHASS

Query:  SKQNQKRTGDKG-KAPTQAVKGKGKAKVVADKGRCFHCNADGHWKCNCPYYLAEKKREKEGKFDLLVLETCLIEHDEFAWILDSEATNHVCSSFQG-NDF
        +K+ +K+ G +G KA   A K   KAK  A KG CFHCN +GHWK NCP YLAEKK+ K+GK+DLLVLETCL+E+D+ AWI+DS ATNHVCSSFQG + +
Subjt:  SKQNQKRTGDKG-KAPTQAVKGKGKAKVVADKGRCFHCNADGHWKCNCPYYLAEKKREKEGKFDLLVLETCLIEHDEFAWILDSEATNHVCSSFQG-NDF

Query:  QQLADGEMTLKVGIGEVVSARAMGTAKLFFRNKYFILEDLYLVPRIKRNLISVSAFLEQGYTISFLLNEALISRNGTYICSAKRENNLFVLRPTDAKAIL
        +QL  GEMT++VG G VVSA A+G  +L  +  + +LE++Y+VP +KRNLISV   LEQ Y+++F +N+  I +NG  ICSAK ENNL+VLR   +KA+L
Subjt:  QQLADGEMTLKVGIGEVVSARAMGTAKLFFRNKYFILEDLYLVPRIKRNLISVSAFLEQGYTISFLLNEALISRNGTYICSAKRENNLFVLRPTDAKAIL

Query:  SHEMFKTTETQNKRQKISPLSNNSYLSHLCLGHININQIDRLVKNGLLTDLEDTSLPSYESCLEGKMTNRSFTGKGYRAKEPLELIHSDLCGLMNVKARA
        + EMFKT  TQNKR KISP   N++L HL LGHIN+N+I+RLVKNGLL++LE+ SLP  ESCLEGKMT R FTGKG+RAKEPLEL+HSDLCG MNVKAR 
Subjt:  SHEMFKTTETQNKRQKISPLSNNSYLSHLCLGHININQIDRLVKNGLLTDLEDTSLPSYESCLEGKMTNRSFTGKGYRAKEPLELIHSDLCGLMNVKARA

Query:  ------------PKTQNPYVLPQNSSCSGSHK--------------PVLRPENSGE-----------------DSRAPGTPQQNGVSERRNKTLLDMVRS
                     +    Y++   S      K                 R +  GE                    APGTPQQNGVSERRN+TLLDMVRS
Subjt:  ------------PKTQNPYVLPQNSSCSGSHK--------------PVLRPENSGE-----------------DSRAPGTPQQNGVSERRNKTLLDMVRS

Query:  MMSYAQLPSSFWGYAVETAVHILNNVPSKSFSETPFELWRGRKPSLRYFRIWGCPA---DNCPK
        MMSYA LP+SFWGYAV+TAV+ILN VPSKS SETP +LW GRK SLR+FRIWGCPA   +N PK
Subjt:  MMSYAQLPSSFWGYAVETAVHILNNVPSKSFSETPFELWRGRKPSLRYFRIWGCPA---DNCPK

A0A5A7TWB9 Gag/pol protein5.3e-20158.13Show/hide
Query:  DLRFVLTEECPPVPPRTTAQAVKDAHERWTKVNEKVRVYILVSLSEVLAKRYENVKTTREIMNSLQEMFGLPSYQLHHDALKNVFNAKMQEGQSVRKHVL
        DLRFVL EECP VP     + V++ +ERW K NEK R YIL SLSEVLAK++E++ T REIM+SLQEMFG  SYQ+ HDALK ++NA+M EG SVR+HVL
Subjt:  DLRFVLTEECPPVPPRTTAQAVKDAHERWTKVNEKVRVYILVSLSEVLAKRYENVKTTREIMNSLQEMFGLPSYQLHHDALKNVFNAKMQEGQSVRKHVL

Query:  DMINQFNIAEANGRAVCECSQVVFILHSLPASYMSFRTNASMNKIQFNLTTLLSELQIYESMQKSKGKNVVKGEANVAHSKKKFLKGSSSGTKSVPHASS
        +M+  FN+AE NG  + E SQV FIL SLP S++ FR+NA MNKI + LTTLL+ELQ +ES+ K KG+   KGEANVA S +KF +GS+SGTKS+P +S 
Subjt:  DMINQFNIAEANGRAVCECSQVVFILHSLPASYMSFRTNASMNKIQFNLTTLLSELQIYESMQKSKGKNVVKGEANVAHSKKKFLKGSSSGTKSVPHASS

Query:  SKQNQKRTGDKG-KAPTQAVKGKGKAKVVADKGRCFHCNADGHWKCNCPYYLAEKKREKEGKFDLLVLETCLIEHDEFAWILDSEATNHVCSSFQG-NDF
        +K+ +K+ G +G KA   A K   KAK  A KG CFHCN +GHWK NCP YLAEKK+ K+GK+DLLVLETCL+E+D+ AWI+DS ATNHVCSSFQG + +
Subjt:  SKQNQKRTGDKG-KAPTQAVKGKGKAKVVADKGRCFHCNADGHWKCNCPYYLAEKKREKEGKFDLLVLETCLIEHDEFAWILDSEATNHVCSSFQG-NDF

Query:  QQLADGEMTLKVGIGEVVSARAMGTAKLFFRNKYFILEDLYLVPRIKRNLISVSAFLEQGYTISFLLNEALISRNGTYICSAKRENNLFVLRPTDAKAIL
        +QL  GEMT++VG G VVSA A+G  +L  +  + +LE++Y+VP +KRNLISV   LEQ Y+++F +N+  I +NG  ICSAK ENNL+VLR   +KA+L
Subjt:  QQLADGEMTLKVGIGEVVSARAMGTAKLFFRNKYFILEDLYLVPRIKRNLISVSAFLEQGYTISFLLNEALISRNGTYICSAKRENNLFVLRPTDAKAIL

Query:  SHEMFKTTETQNKRQKISPLSNNSYLSHLCLGHININQIDRLVKNGLLTDLEDTSLPSYESCLEGKMTNRSFTGKGYRAKEPLELIHSDLCGLMNVKARA
        + EMFKT  TQNKR KISP   N++L HL LGHIN+N+I+RLVKNGLL++LE+ SLP  ESCLEGKMT R FTGKG+RAKEPLEL+HSDLCG MNVKAR 
Subjt:  SHEMFKTTETQNKRQKISPLSNNSYLSHLCLGHININQIDRLVKNGLLTDLEDTSLPSYESCLEGKMTNRSFTGKGYRAKEPLELIHSDLCGLMNVKARA

Query:  ------------PKTQNPYVLPQNSSCSGSHK--------------PVLRPENSGE-----------------DSRAPGTPQQNGVSERRNKTLLDMVRS
                     +    Y++   S      K                 R +  GE                    APGTPQQNGVSERRN+TLLDMVRS
Subjt:  ------------PKTQNPYVLPQNSSCSGSHK--------------PVLRPENSGE-----------------DSRAPGTPQQNGVSERRNKTLLDMVRS

Query:  MMSYAQLPSSFWGYAVETAVHILNNVPSKSFSETPFELWRGRKPSLRYFRIWGCPA---DNCPK
        MMSYA LP+SFWGYAV+TAV+ILN VPSKS SETP +LW GRK SLR+FRIWGCPA   +N PK
Subjt:  MMSYAQLPSSFWGYAVETAVHILNNVPSKSFSETPFELWRGRKPSLRYFRIWGCPA---DNCPK

A0A5A7TZD7 Gag/pol protein5.3e-20158.13Show/hide
Query:  DLRFVLTEECPPVPPRTTAQAVKDAHERWTKVNEKVRVYILVSLSEVLAKRYENVKTTREIMNSLQEMFGLPSYQLHHDALKNVFNAKMQEGQSVRKHVL
        DLRFVL EECP VP     + V++ +ERW K NEK R YIL SLSEVLAK++E++ T REIM+SLQEMFG  SYQ+ HDALK ++NA+M EG SVR+HVL
Subjt:  DLRFVLTEECPPVPPRTTAQAVKDAHERWTKVNEKVRVYILVSLSEVLAKRYENVKTTREIMNSLQEMFGLPSYQLHHDALKNVFNAKMQEGQSVRKHVL

Query:  DMINQFNIAEANGRAVCECSQVVFILHSLPASYMSFRTNASMNKIQFNLTTLLSELQIYESMQKSKGKNVVKGEANVAHSKKKFLKGSSSGTKSVPHASS
        +M+  FN+AE NG  + E SQV FIL SLP S++ FR+NA MNKI + LTTLL+ELQ +ES+ K KG+   KGEANVA S +KF +GS+SGTKS+P +S 
Subjt:  DMINQFNIAEANGRAVCECSQVVFILHSLPASYMSFRTNASMNKIQFNLTTLLSELQIYESMQKSKGKNVVKGEANVAHSKKKFLKGSSSGTKSVPHASS

Query:  SKQNQKRTGDKG-KAPTQAVKGKGKAKVVADKGRCFHCNADGHWKCNCPYYLAEKKREKEGKFDLLVLETCLIEHDEFAWILDSEATNHVCSSFQG-NDF
        +K+ +K+ G +G KA   A K   KAK  A KG CFHCN +GHWK NCP YLAEKK+ K+GK+DLLVLETCL+E+D+ AWI+DS ATNHVCSSFQG + +
Subjt:  SKQNQKRTGDKG-KAPTQAVKGKGKAKVVADKGRCFHCNADGHWKCNCPYYLAEKKREKEGKFDLLVLETCLIEHDEFAWILDSEATNHVCSSFQG-NDF

Query:  QQLADGEMTLKVGIGEVVSARAMGTAKLFFRNKYFILEDLYLVPRIKRNLISVSAFLEQGYTISFLLNEALISRNGTYICSAKRENNLFVLRPTDAKAIL
        +QL  GEMT++VG G VVSA A+G  +L  +  + +LE++Y+VP +KRNLISV   LEQ Y+++F +N+  I +NG  ICSAK ENNL+VLR   +KA+L
Subjt:  QQLADGEMTLKVGIGEVVSARAMGTAKLFFRNKYFILEDLYLVPRIKRNLISVSAFLEQGYTISFLLNEALISRNGTYICSAKRENNLFVLRPTDAKAIL

Query:  SHEMFKTTETQNKRQKISPLSNNSYLSHLCLGHININQIDRLVKNGLLTDLEDTSLPSYESCLEGKMTNRSFTGKGYRAKEPLELIHSDLCGLMNVKARA
        + EMFKT  TQNKR KISP   N++L HL LGHIN+N+I+RLVKNGLL++LE+ SLP  ESCLEGKMT R FTGKG+RAKEPLEL+HSDLCG MNVKAR 
Subjt:  SHEMFKTTETQNKRQKISPLSNNSYLSHLCLGHININQIDRLVKNGLLTDLEDTSLPSYESCLEGKMTNRSFTGKGYRAKEPLELIHSDLCGLMNVKARA

Query:  ------------PKTQNPYVLPQNSSCSGSHK--------------PVLRPENSGE-----------------DSRAPGTPQQNGVSERRNKTLLDMVRS
                     +    Y++   S      K                 R +  GE                    APGTPQQNGVSERRN+TLLDMVRS
Subjt:  ------------PKTQNPYVLPQNSSCSGSHK--------------PVLRPENSGE-----------------DSRAPGTPQQNGVSERRNKTLLDMVRS

Query:  MMSYAQLPSSFWGYAVETAVHILNNVPSKSFSETPFELWRGRKPSLRYFRIWGCPA---DNCPK
        MMSYA LP+SFWGYAV+TAV+ILN VPSKS SETP +LW GRK SLR+FRIWGCPA   +N PK
Subjt:  MMSYAQLPSSFWGYAVETAVHILNNVPSKSFSETPFELWRGRKPSLRYFRIWGCPA---DNCPK

A0A5A7UGV2 Gag/pol protein5.3e-20158.13Show/hide
Query:  DLRFVLTEECPPVPPRTTAQAVKDAHERWTKVNEKVRVYILVSLSEVLAKRYENVKTTREIMNSLQEMFGLPSYQLHHDALKNVFNAKMQEGQSVRKHVL
        DLRFVL EECP VP     + V++ +ERW K NEK R YIL SLSEVLAK++E++ T REIM+SLQEMFG  SYQ+ HDALK ++NA+M EG SVR+HVL
Subjt:  DLRFVLTEECPPVPPRTTAQAVKDAHERWTKVNEKVRVYILVSLSEVLAKRYENVKTTREIMNSLQEMFGLPSYQLHHDALKNVFNAKMQEGQSVRKHVL

Query:  DMINQFNIAEANGRAVCECSQVVFILHSLPASYMSFRTNASMNKIQFNLTTLLSELQIYESMQKSKGKNVVKGEANVAHSKKKFLKGSSSGTKSVPHASS
        +M+  FN+AE NG  + E SQV FIL SLP S++ FR+NA MNKI + LTTLL+ELQ +ES+ K KG+   KGEANVA S +KF +GS+SGTKS+P +S 
Subjt:  DMINQFNIAEANGRAVCECSQVVFILHSLPASYMSFRTNASMNKIQFNLTTLLSELQIYESMQKSKGKNVVKGEANVAHSKKKFLKGSSSGTKSVPHASS

Query:  SKQNQKRTGDKG-KAPTQAVKGKGKAKVVADKGRCFHCNADGHWKCNCPYYLAEKKREKEGKFDLLVLETCLIEHDEFAWILDSEATNHVCSSFQG-NDF
        +K+ +K+ G +G KA   A K   KAK  A KG CFHCN +GHWK NCP YLAEKK+ K+GK+DLLVLETCL+E+D+ AWI+DS ATNHVCSSFQG + +
Subjt:  SKQNQKRTGDKG-KAPTQAVKGKGKAKVVADKGRCFHCNADGHWKCNCPYYLAEKKREKEGKFDLLVLETCLIEHDEFAWILDSEATNHVCSSFQG-NDF

Query:  QQLADGEMTLKVGIGEVVSARAMGTAKLFFRNKYFILEDLYLVPRIKRNLISVSAFLEQGYTISFLLNEALISRNGTYICSAKRENNLFVLRPTDAKAIL
        +QL  GEMT++VG G VVSA A+G  +L  +  + +LE++Y+VP +KRNLISV   LEQ Y+++F +N+  I +NG  ICSAK ENNL+VLR   +KA+L
Subjt:  QQLADGEMTLKVGIGEVVSARAMGTAKLFFRNKYFILEDLYLVPRIKRNLISVSAFLEQGYTISFLLNEALISRNGTYICSAKRENNLFVLRPTDAKAIL

Query:  SHEMFKTTETQNKRQKISPLSNNSYLSHLCLGHININQIDRLVKNGLLTDLEDTSLPSYESCLEGKMTNRSFTGKGYRAKEPLELIHSDLCGLMNVKARA
        + EMFKT  TQNKR KISP   N++L HL LGHIN+N+I+RLVKNGLL++LE+ SLP  ESCLEGKMT R FTGKG+RAKEPLEL+HSDLCG MNVKAR 
Subjt:  SHEMFKTTETQNKRQKISPLSNNSYLSHLCLGHININQIDRLVKNGLLTDLEDTSLPSYESCLEGKMTNRSFTGKGYRAKEPLELIHSDLCGLMNVKARA

Query:  ------------PKTQNPYVLPQNSSCSGSHK--------------PVLRPENSGE-----------------DSRAPGTPQQNGVSERRNKTLLDMVRS
                     +    Y++   S      K                 R +  GE                    APGTPQQNGVSERRN+TLLDMVRS
Subjt:  ------------PKTQNPYVLPQNSSCSGSHK--------------PVLRPENSGE-----------------DSRAPGTPQQNGVSERRNKTLLDMVRS

Query:  MMSYAQLPSSFWGYAVETAVHILNNVPSKSFSETPFELWRGRKPSLRYFRIWGCPA---DNCPK
        MMSYA LP+SFWGYAV+TAV+ILN VPSKS SETP +LW GRK SLR+FRIWGCPA   +N PK
Subjt:  MMSYAQLPSSFWGYAVETAVHILNNVPSKSFSETPFELWRGRKPSLRYFRIWGCPA---DNCPK

A0A5D3CPJ6 Gag/pol protein1.5e-20057.98Show/hide
Query:  DLRFVLTEECPPVPPRTTAQAVKDAHERWTKVNEKVRVYILVSLSEVLAKRYENVKTTREIMNSLQEMFGLPSYQLHHDALKNVFNAKMQEGQSVRKHVL
        DLRFVL EECP VP     + V++ +ERW K NEK R YIL SLSEVLAK++E++ T REIM+SLQEMFG  SYQ+ HDALK ++NA+M EG SVR+HVL
Subjt:  DLRFVLTEECPPVPPRTTAQAVKDAHERWTKVNEKVRVYILVSLSEVLAKRYENVKTTREIMNSLQEMFGLPSYQLHHDALKNVFNAKMQEGQSVRKHVL

Query:  DMINQFNIAEANGRAVCECSQVVFILHSLPASYMSFRTNASMNKIQFNLTTLLSELQIYESMQKSKGKNVVKGEANVAHSKKKFLKGSSSGTKSVPHASS
        +M+  FN+AE NG  + E SQV FIL SLP S++ FR+NA MNKI + LTTLL+ELQ +ES+ K KG+   KGEANVA S +KF +GS+SGTKS+P +S 
Subjt:  DMINQFNIAEANGRAVCECSQVVFILHSLPASYMSFRTNASMNKIQFNLTTLLSELQIYESMQKSKGKNVVKGEANVAHSKKKFLKGSSSGTKSVPHASS

Query:  SKQNQKRTGDKG-KAPTQAVKGKGKAKVVADKGRCFHCNADGHWKCNCPYYLAEKKREKEGKFDLLVLETCLIEHDEFAWILDSEATNHVCSSFQG-NDF
        +K+ +K+ G +G KA   A K   K K  A KG CFHCN +GHWK NCP YLAEKK+ K+GK+DLLVLETCL+E+D+ AWI+DS ATNHVCSSFQG + +
Subjt:  SKQNQKRTGDKG-KAPTQAVKGKGKAKVVADKGRCFHCNADGHWKCNCPYYLAEKKREKEGKFDLLVLETCLIEHDEFAWILDSEATNHVCSSFQG-NDF

Query:  QQLADGEMTLKVGIGEVVSARAMGTAKLFFRNKYFILEDLYLVPRIKRNLISVSAFLEQGYTISFLLNEALISRNGTYICSAKRENNLFVLRPTDAKAIL
        +QL  GEMT++VG G VVSA A+G  +L  +  + +LE++Y+VP +KRNLISV   LEQ Y+++F +N+  I +NG  ICSAK ENNL+VLR   +KA+L
Subjt:  QQLADGEMTLKVGIGEVVSARAMGTAKLFFRNKYFILEDLYLVPRIKRNLISVSAFLEQGYTISFLLNEALISRNGTYICSAKRENNLFVLRPTDAKAIL

Query:  SHEMFKTTETQNKRQKISPLSNNSYLSHLCLGHININQIDRLVKNGLLTDLEDTSLPSYESCLEGKMTNRSFTGKGYRAKEPLELIHSDLCGLMNVKARA
        + EMFKT  TQNKR KISP   N++L HL LGHIN+N+I+RLVKNGLL++LE+ SLP  ESCLEGKMT R FTGKG+RAKEPLEL+HSDLCG MNVKAR 
Subjt:  SHEMFKTTETQNKRQKISPLSNNSYLSHLCLGHININQIDRLVKNGLLTDLEDTSLPSYESCLEGKMTNRSFTGKGYRAKEPLELIHSDLCGLMNVKARA

Query:  ------------PKTQNPYVLPQNSSCSGSHK--------------PVLRPENSGE-----------------DSRAPGTPQQNGVSERRNKTLLDMVRS
                     +    Y++   S      K                 R +  GE                    APGTPQQNGVSERRN+TLLDMVRS
Subjt:  ------------PKTQNPYVLPQNSSCSGSHK--------------PVLRPENSGE-----------------DSRAPGTPQQNGVSERRNKTLLDMVRS

Query:  MMSYAQLPSSFWGYAVETAVHILNNVPSKSFSETPFELWRGRKPSLRYFRIWGCPA---DNCPK
        MMSYA LP+SFWGYAV+TAV+ILN VPSKS SETP +LW GRK SLR+FRIWGCPA   +N PK
Subjt:  MMSYAQLPSSFWGYAVETAVHILNNVPSKSFSETPFELWRGRKPSLRYFRIWGCPA---DNCPK

SwissProt top hitse value%identityAlignment
P04146 Copia protein2.2e-2623.79Show/hide
Query:  DAHERWTKVNEKVRVYILVSLSEVLAKRYENVKTTREIMNSLQEMFGLPSYQLHHDALKNVFNAKMQEGQSVRKHVLDMINQFNIAEANGRAVCECSQVV
        +  + W K     +  I+  LS+       +  T R+I+ +L  ++   S        K + + K+    S+  H        +   A G  + E  ++ 
Subjt:  DAHERWTKVNEKVRVYILVSLSEVLAKRYENVKTTREIMNSLQEMFGLPSYQLHHDALKNVFNAKMQEGQSVRKHVLDMINQFNIAEANGRAVCECSQVV

Query:  FILHSLPASYMSFRTNASMNKIQFNLTTLLSELQIYESMQKSKGKNVVKGEANVAHSKKKFLKGSSSGTKSVPHASSSKQNQKRTGDKGKAPTQAVKGKG
         +L +LP+ Y    T A     + NLT    + ++ +   K      +K + N   + KK +        ++ H +++        ++   P +  KG  
Subjt:  FILHSLPASYMSFRTNASMNKIQFNLTTLLSELQIYESMQKSKGKNVVKGEANVAHSKKKFLKGSSSGTKSVPHASSSKQNQKRTGDKGKAPTQAVKGKG

Query:  KAKVVADKGRCFHCNADGHWKCNCPYY---LAEKKREKEGKFDLL-----------VLETCLIEHDEFAWILDSEATNHVCSSFQGNDFQQLADGEMT--
        K KV     +C HC  +GH K +C +Y   L  K +E E +               V  T ++  D   ++LDS A++H+      ND     D      
Subjt:  KAKVVADKGRCFHCNADGHWKCNCPYY---LAEKKREKEGKFDLL-----------VLETCLIEHDEFAWILDSEATNHVCSSFQGNDFQQLADGEMT--

Query:  -LKVGI---GEVVSARAMGTAKLFFRNKYFI-LEDLYLVPRIKRNLISVSAFLEQGYTISFLLNEALISRNGTYIC-SAKRENNLFVLRPTDAKAILSHE
         LK+ +   GE + A   G  +L  RN + I LED+        NL+SV    E G +I F  +   IS+NG  +  ++   NN+ V+            
Subjt:  -LKVGI---GEVVSARAMGTAKLFFRNKYFI-LEDLYLVPRIKRNLISVSAFLEQGYTISFLLNEALISRNGTYIC-SAKRENNLFVLRPTDAKAILSHE

Query:  MFKTTETQNKRQKISPLSNNSYLSHLCLGHININQIDRLVKNGLLTDLE-----DTSLPSYESCLEGKMTNRSFTGKGYRA--KEPLELIHSDLCGLMNV
         F+      K +      NN  L H   GHI+  ++  + +  + +D       + S    E CL GK     F     +   K PL ++HSD+CG +  
Subjt:  MFKTTETQNKRQKISPLSNNSYLSHLCLGHININQIDRLVKNGLLTDLE-----DTSLPSYESCLEGKMTNRSFTGKGYRA--KEPLELIHSDLCGLMNV

Query:  KARAPKTQNPYVLPQ-------------------------NSSCSGSHKPVLRPENSGED------------------SRAPGTPQQNGVSERRNKTLLD
             K      + Q                          S    + K V    ++G +                     P TPQ NGVSER  +T+ +
Subjt:  KARAPKTQNPYVLPQ-------------------------NSSCSGSHKPVLRPENSGED------------------SRAPGTPQQNGVSERRNKTLLD

Query:  MVRSMMSYAQLPSSFWGYAVETAVHILNNVPSKSF---SETPFELWRGRKPSLRYFRIWG
          R+M+S A+L  SFWG AV TA +++N +PS++    S+TP+E+W  +KP L++ R++G
Subjt:  MVRSMMSYAQLPSSFWGYAVETAVHILNNVPSKSF---SETPFELWRGRKPSLRYFRIWG

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-942.0e-3525Show/hide
Query:  ERWTKVNEKVRVYILVSLSEVLAKRYENVKTTREIMNSLQEMFGLPSYQLHHDALKNVFNAKMQEGQSVRKHVLDMINQFNIAEAN-GRAVCECSQVVFI
        E W  ++E+    I + LS+ +     +  T R I   L+ ++   +        K ++   M EG +   H L++ N      AN G  + E  + + +
Subjt:  ERWTKVNEKVRVYILVSLSEVLAKRYENVKTTREIMNSLQEMFGLPSYQLHHDALKNVFNAKMQEGQSVRKHVLDMINQFNIAEAN-GRAVCECSQVVFI

Query:  LHSLPASYMSFRTNASMNKIQFNLTTLLSELQIYESMQKSKGKNVVKGEANVAHSKKKFLKGSSSGTKSVPHASSSKQNQKRTGDKGKAPTQAVKGKGKA
        L+SLP+SY +  T     K    L  + S L + E M+K K +N  +G+A +   + +  + SS+             N  R+G +GK+     K + K+
Subjt:  LHSLPASYMSFRTNASMNKIQFNLTTLLSELQIYESMQKSKGKNVVKGEANVAHSKKKFLKGSSSGTKSVPHASSSKQNQKRTGDKGKAPTQAVKGKGKA

Query:  KVVADKGRCFHCNADGHWKCNCPYYLAEKKREKEGK------------FDLLVL-----ETCL-IEHDEFAWILDSEATNH------VCSSFQGNDFQQL
        +V      C++CN  GH+K +CP    + K E  G+             D +VL     E C+ +   E  W++D+ A++H      +   +   DF  +
Subjt:  KVVADKGRCFHCNADGHWKCNCPYYLAEKKREKEGK------------FDLLVL-----ETCL-IEHDEFAWILDSEATNH------VCSSFQGNDFQQL

Query:  ADGEMTLK--VGIGEVVSARAMGTAKLFFRNKYFILEDLYLVPRIKRNLISVSAFLEQGYTISFLLNEALISRNGTYICSAKRENNLFVLRPTDAKAILS
          G  +     GIG++     +G           +L+D+  VP ++ NLIS  A    GY   F   +  +++    I        L+    T+A+    
Subjt:  ADGEMTLK--VGIGEVVSARAMGTAKLFFRNKYFILEDLYLVPRIKRNLISVSAFLEQGYTISFLLNEALISRNGTYICSAKRENNLFVLRPTDAKAILS

Query:  HEMFKTTETQNKRQKISPLSNNSYLSHLCLGHININQIDRLVKNGLLTDLEDTSLPSYESCLEGKMTNRSFTGKGYRAKEPLELIHSDLCGLMNVKAR--
               E    + +IS       L H  +GH++   +  L K  L++  + T++   + CL GK    SF     R    L+L++SD+CG M +++   
Subjt:  HEMFKTTETQNKRQKISPLSNNSYLSHLCLGHININQIDRLVKNGLLTDLEDTSLPSYESCLEGKMTNRSFTGKGYRAKEPLELIHSDLCGLMNVKAR--

Query:  ----------APKTQNPYVLPQNSSC--------------SGSHKPVLRPENSGE-----------------DSRAPGTPQQNGVSERRNKTLLDMVRSM
                  A +    Y+L                    +G     LR +N GE                 +   PGTPQ NGV+ER N+T+++ VRSM
Subjt:  ----------APKTQNPYVLPQNSSC--------------SGSHKPVLRPENSGE-----------------DSRAPGTPQQNGVSERRNKTLLDMVRSM

Query:  MSYAQLPSSFWGYAVETAVHILNNVPSKSFS-ETPFELWRGRKPSLRYFRIWGCPA
        +  A+LP SFWG AV+TA +++N  PS   + E P  +W  ++ S  + +++GC A
Subjt:  MSYAQLPSSFWGYAVETAVHILNNVPSKSFS-ETPFELWRGRKPSLRYFRIWGCPA

P92512 Uncharacterized mitochondrial protein AtMg007107.3e-0640.3Show/hide
Query:  NKTLLDMVRSMMSYAQLPSSFWGYAVETAVHILNNVPSKSFS-ETPFELWRGRKPSLRYFRIWGCPA
        N+T+++ VRSM+    LP +F   A  TAVHI+N  PS + +   P E+W    P+  Y R +GC A
Subjt:  NKTLLDMVRSMMSYAQLPSSFWGYAVETAVHILNNVPSKSFS-ETPFELWRGRKPSLRYFRIWGCPA

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE11.5e-1919.64Show/hide
Query:  VPPRT----TAQAVKDAHERWTKVNEKVRVYILVSLSEVLAKRYENVKTTREIMNSLQEMFGLPSYQLHHDALKNVFNAKMQEGQSVRKHVLDMINQFNI
        +PP T     A  V   + RW + ++ +   +L ++S  +        T  +I  +L++++  PSY  H   L+       +  +++  ++  ++ +F+ 
Subjt:  VPPRT----TAQAVKDAHERWTKVNEKVRVYILVSLSEVLAKRYENVKTTREIMNSLQEMFGLPSYQLHHDALKNVFNAKMQEGQSVRKHVLDMINQFNI

Query:  AEANGRAVCECSQVVFILHSLPASYMSFRTNASMNKIQFNLTTLLSELQIYES-MQKSKGKNVVKGEAN-VAHSKKKFLKGSSSGTKSVPHASSSKQNQK
            G+ +    QV  +L +LP  Y       +       LT +   L  +ES +       V+   AN V+H        +++G ++  + + +  N  
Subjt:  AEANGRAVCECSQVVFILHSLPASYMSFRTNASMNKIQFNLTTLLSELQIYES-MQKSKGKNVVKGEAN-VAHSKKKFLKGSSSGTKSVPHASSSKQNQK

Query:  RTGDKGKAPTQAVKGKGKAKVVADKGRCFHCNADGHWKCNCP----YYLAEKKREKEGKFDLLVLETCLIEHDEFA---WILDSEATNHVCSSFQGNDFQ
        +   +          + K  +    G+C  C   GH    C     +  +   ++    F        L     ++   W+LDS AT+H+ S F      
Subjt:  RTGDKGKAPTQAVKGKGKAKVVADKGRCFHCNADGHWKCNCP----YYLAEKKREKEGKFDLLVLETCLIEHDEFA---WILDSEATNHVCSSFQGNDFQ

Query:  QLADGEMTLKVGIGEVVSARAMGTAKLFFRNKYFILEDLYLVPRIKRNLISVSAFLE-QGYTISFLLNEALIS--RNGTYICSAKRENNLFVLRPTDAKA
        Q   G   + V  G  +     G+  L  +++   L ++  VP I +NLISV       G ++ F      +     G  +   K ++ L+         
Subjt:  QLADGEMTLKVGIGEVVSARAMGTAKLFFRNKYFILEDLYLVPRIKRNLISVSAFLE-QGYTISFLLNEALIS--RNGTYICSAKRENNLFVLRPTDAKA

Query:  ILSHEMFKTTETQNKRQKISPLSNNSYLS-HLCLGHININQIDRLVKNGLLTDLEDT-SLPSYESCLEGKMTNRSFTGKGYRAKEPLELIHSDLCG----
              +    +Q      SP S  ++ S H  LGH   + ++ ++ N  L+ L  +    S   CL  K     F+     +  PLE I+SD+      
Subjt:  ILSHEMFKTTETQNKRQKISPLSNNSYLS-HLCLGHININQIDRLVKNGLLTDLEDT-SLPSYESCLEGKMTNRSFTGKGYRAKEPLELIHSDLCG----

Query:  --------------------LMNVKARAPKTQNPYVLPQN--SSCSGSHKPVLRPENSGED---------------SRAPGTPQQNGVSERRNKTLLDMV
                            L  +K ++ + +  ++  +N   +   +       +N GE                +  P TP+ NG+SER+++ +++  
Subjt:  --------------------LMNVKARAPKTQNPYVLPQN--SSCSGSHKPVLRPENSGED---------------SRAPGTPQQNGVSERRNKTLLDMV

Query:  RSMMSYAQLPSSFWGYAVETAVHILNNVPSKSFS-ETPFELWRGRKPSLRYFRIWGCPADNCPKDYAAERLE
         +++S+A +P ++W YA   AV+++N +P+     E+PF+   G  P+    R++GC      + Y   +L+
Subjt:  RSMMSYAQLPSSFWGYAVETAVHILNNVPSKSFS-ETPFELWRGRKPSLRYFRIWGCPADNCPKDYAAERLE

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE26.2e-2120.5Show/hide
Query:  PVPPRT----TAQAVKDAHERWTKVNEKVRVYILVSLSEVLAKRYENVKTTREIMNSLQEMFGLPSYQLHHDALKNVFNAKMQEGQSVRKHV--LDMINQ
        P+PP T        V   + RW + ++ +   IL ++S  +        T  +I  +L++++  PSY                       HV  L  I +
Subjt:  PVPPRT----TAQAVKDAHERWTKVNEKVRVYILVSLSEVLAKRYENVKTTREIMNSLQEMFGLPSYQLHHDALKNVFNAKMQEGQSVRKHV--LDMINQ

Query:  FNIAEANGRAVCECSQVVFILHSLPASYMSFRTNASMNKIQFNLTTLLSELQIYESMQKSKGKNVVKGEANVAHSKKKFLKGSSSGTKSVPHASSSKQNQ
        F+     G+ +    QV  +L +LP  Y       +      +LT      +I+E +   + K +    A V       +   ++ T    +     +N 
Subjt:  FNIAEANGRAVCECSQVVFILHSLPASYMSFRTNASMNKIQFNLTTLLSELQIYESMQKSKGKNVVKGEANVAHSKKKFLKGSSSGTKSVPHASSSKQNQ

Query:  KRTGDKGKA--PTQAVKGKGKAKVVADKGRCFHCNADGHWKCNCP----YYLAEKKREKEGKFDLLVLETCLIEHDEF---AWILDSEATNHVCSSFQGN
            ++  +  P+ +       +     GRC  C+  GH    CP    +     +++    F        L  +  +    W+LDS AT+H+ S F   
Subjt:  KRTGDKGKA--PTQAVKGKGKAKVVADKGRCFHCNADGHWKCNCP----YYLAEKKREKEGKFDLLVLETCLIEHDEF---AWILDSEATNHVCSSFQGN

Query:  DFQQLADGEMTLKVGIGEVVSARAMGTAKLFFRNKYFILEDLYLVPRIKRNLISVSAFLE------QGYTISFLLNEALISRNGTYICSAKRENNLFVLR
         F Q   G   + +  G  +     G+A L   ++   L  +  VP I +NLISV           + +  SF + +      G  +   K ++ L+   
Subjt:  DFQQLADGEMTLKVGIGEVVSARAMGTAKLFFRNKYFILEDLYLVPRIKRNLISVSAFLE------QGYTISFLLNEALISRNGTYICSAKRENNLFVLR

Query:  PTDAKAILSHEMFKTTETQNKRQKISPLSNNSYLS-HLCLGHININQIDRLVKNGLLTDLEDT-SLPSYESCLEGKMTNRSFTGKGYRAKEPLELIHSDL
           ++A+    MF            SP S  ++ S H  LGH ++  ++ ++ N  L  L  +  L S   C   K     F+     + +PLE I+SD+
Subjt:  PTDAKAILSHEMFKTTETQNKRQKISPLSNNSYLS-HLCLGHININQIDRLVKNGLLTDLEDT-SLPSYESCLEGKMTNRSFTGKGYRAKEPLELIHSDL

Query:  CG------------------------LMNVKARAPKTQNPYVLPQN-------------SSCSGSHKPVLRPENS----GEDSRAPGTPQQNGVSERRNK
                                  L  +K ++ + ++ +++ ++              S +G    VLR   S       +  P TP+ NG+SER+++
Subjt:  CG------------------------LMNVKARAPKTQNPYVLPQN-------------SSCSGSHKPVLRPENS----GEDSRAPGTPQQNGVSERRNK

Query:  TLLDMVRSMMSYAQLPSSFWGYAVETAVHILNNVPSKSFS-ETPFELWRGRKPSLRYFRIWGCPADNCPKDYAAERLE
         +++M  +++S+A +P ++W YA   AV+++N +P+     ++PF+   G+ P+    +++GC      + Y   +LE
Subjt:  TLLDMVRSMMSYAQLPSSFWGYAVETAVHILNNVPSKSFS-ETPFELWRGRKPSLRYFRIWGCPADNCPKDYAAERLE

Arabidopsis top hitse value%identityAlignment
ATMG00300.1 Gag-Pol-related retrotransposon family protein1.8e-0435.21Show/hide
Query:  LSHLCLGHININQIDRLVKNGLLTDLEDTSLPSYESCLEGKMTNRSFTGKGYRAKEPLELIHSDLCGLMNV
        L H  L H++   ++ LVK G L   + +SL   E C+ GK    +F+   +  K PL+ +HSDL G  +V
Subjt:  LSHLCLGHININQIDRLVKNGLLTDLEDTSLPSYESCLEGKMTNRSFTGKGYRAKEPLELIHSDLCGLMNV

ATMG00710.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein5.2e-0740.3Show/hide
Query:  NKTLLDMVRSMMSYAQLPSSFWGYAVETAVHILNNVPSKSFS-ETPFELWRGRKPSLRYFRIWGCPA
        N+T+++ VRSM+    LP +F   A  TAVHI+N  PS + +   P E+W    P+  Y R +GC A
Subjt:  NKTLLDMVRSMMSYAQLPSSFWGYAVETAVHILNNVPSKSFS-ETPFELWRGRKPSLRYFRIWGCPA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCGAAGAAACAAGGTGGTTGATTTGTTTCCGCTAGATCTTGAGATTAACAGGACTCTTAAATCCATTCGAAGAGAAAAAAGATTAGCAGAAGCGATGGCCCACCAAGA
AGAAGCTCCCAAGGCAATTAGAGATTTTCTGCAGCCAGTTCTTCCTACCGAGAATTCTGGAATTGTCTACGCCCCTATCCAAGCTACAAATTTGGAGCTAAAAACAGGGT
TGATTCAGATGGCGCGCGATAACTCGTTCAAGGGACATCCTTCCGAGGACCCCCACTCTCATCTGCGATCATTCCTAGAAATATGTGGGACGGATAAAGCAAAAGATTGG
CTCGAATCAGTCGAGACGGGTAGCATCAGTACTTCGGACGAGCTTGCCCAGGCCTTTCTGACAAAATTTTTTCCACCTGCTAAGACTACCAAGCTCCGGACTGAAATTGG
AACATTCAAGCAACTTAACGAGGAGCAGTTGTACGAAGCGTGGGAAAGATATAAGGAAATGCTCAGGCGATGCCCCCAACACGGATATCCTGATTGGCTTCAGGTACAGT
TATTTTATAATGGATTAAACCCCTCCACAAAGACAGTCCTAGACACATCAGCAGGAGGGAGTTTTCTTTCAAAAACAGTGACAGAAGCCAAAGACCTACTTGAGGAAATG
GCGGCAACAAGTTATCAATGGCCGGCGGAGAGAGGAACAGTTACAAAGAAGGCTGGATTATATGAATTGGATGAGTCAAGTTCACTGAAAGCGCAACTGGCATCTCTGAC
CAATGCACTAAACAAATTGACTTCATATGAGGTGGTTAAGTCCATTTCCACCTTAGCAGAAGGACATTCAAAAAGGAAGCCACCCCCAGGTTTTGCATCATCCAGTGCCC
CTGAAAAGAAAAATAATCTGGAAGAGATGGTGGCTTTATTCATCAAAGAACAAAGAATACTGAATGTAAGTCTCCAGACATCAGTAAACAACCACGATGCAGCTCTAAAG
AATATGGAAGTGCAGATAGGACAGATCGCTTCAGCAGTAAATGCCCTTCAAAAGGGAAAATTTCCAAGTGATACTGAGCCTAACCCGAAAGAGCAGTGTAAGATGGTGGT
TCTGAGAAGTGGCAGGAGACTGGAGGACAGTTTAGAGAAGAAAAAGGAAGAAGAAAAGAGAAGGGTTAAAGATGAAGGGACTGAGGCACAAAAAGCCTCCTTTGAAAGGT
TCCAACATCCTCCCAAATCTATTGAATTAAAATGTGATTTTTCTAACAACTTTGCAGGTAGAAAAGAAGATGAGAGGCAGAATGACAAAAAGAAGCTGACTGAGGAAGAA
GTGGTTCCATGCAACCACCATGACAGAGGTTCGCATATTAGCCCGCCCAAGCGGAGGGGCGAATGTCCAACCTTTGATTTAAGGGAGTTACCTTTTCCTCAAAGATTTAA
AAATGTCAAATTAGATGAACAGTTTCAAAAATTTCTAGAAATGTTTAAAAAGTTGTCTGTGAATATTCCATTGGTAGAAGCCTTGTATAATATGCCAAATTATGGAAAAT
TCATGAAGGAAATGCTTTCTAAAAAGAAAAGTCTGAAAAAAGAAGTTTTTAATTTATCTGAGAGTAGCAGTACCATTATTTCTGGTAGGATACCCTCTAAGCAGAAAGAT
CCAGGGAGTTTTACTGTTCCCTGCACCATAGGAGAAGTATCCTTCGATAGGGCTTTATATGATTTAGGAGCAAGTATAAATTTGATGCCCTACTCTGTGTACAGGAAGAT
TGGTTTAACAGGTATGACAGATACCAGCGTCACTCTCCAGCTTGCTGATAGATCGATTACCCACCCGATGGGTGTTGTGGAGGACGTGTTGGTGAAAGTCAACAAATTCA
TCTTCCCTGTAGATTTCGTGGTACTGGACATGAAGGAGGACAAAGAAGTGCCAATTATCTTAGGAAGACCTTTCCTAGCCACTGGTAAGGCTGAGATTAGCGTGCATACA
GGTAAACTTACCTTGAACATTGATGATGAAAAAGTAGTGTTTAGTATTTTTGGCCAAGATGAATCTATTTGTAGTTTGCATACATGTTTTTCTGTTGGGCCTGAATACTT
AACTGATGACGATGAAGAGGTAGACTATAATCTTGGGCTAGGCTTAGGAGAAATGCTTATGGATAATGTGAATTTTGATCATGATGCATATATGGATAATCCTATGTTTG
AAAATGATTTGGATCTGCCTGACTTTGAAAATGAATTAGATTTGCCTGCTTGTGAAAATGAAAGATCTGCAGTTGATGATTTACCTTCCTTTGAAAATGAATTAGATTTG
CCTGAAATGGATAATTTTGATGATGATATTGAATTGCCTGACATTGAGCATGAACATAAAAGGCATAAAAAAGATTGCTCGATAGATGATTTTGAGTCTGACCATGATTA
CAGTGAATCTATTGAGTCTGATCTTGACATTCCTGAATGCATGAATCCTGACAATGTGAATACTTTTTTTTCATGTCCTGATGATGTGTATAGCATAGAATCTGACCTAG
AAGAACTCGAATCTGTGCATAGTATAGAATCTGATCCTGAAATTCTTGAATTCTTGAATTCTTCTGATGATGAGTCATGTGATAGCTTAGTTGAATCATTTTTTAATTCT
GAGTCTGAGGGATCTGCTGAGCATGTAGACATTTTTTCTGATGAGTGGACTGGCATGATTGATAGGCCATCTCTAGATCCTAGACCAGTAGATATTATAACGCTTGATGA
CTCTGTTAACAACTGTTTTGAGAATAGAGATTTAGAAAAGAGATTTGATACGGCTGATTATTTTCTAAAATGTTATTATGAGAATCTCTTTAGTGAAGATGGGCCAGGGG
AAAGTTTTTTTCTCCCCTGTTTGTACGTCGAGACGCTAAGTAAGGCAGCGTCGCGACGCTACCCTTTCTTGTTGCCCAAGCCGCGTGCGCAGCAGCGTCGCGACGCTGAG
CAAGAATCGAGCTCTAGACGCAACAGCGTCGGGACGCTTGGCCTATTGTGTCTCGACACTGTCGCGATTTTGTGCGGCAGTTCAGCCTTTGTGAGGAGTTGTGCTGAAAC
CCCAATGGGCGGTCTGGTTCGTGCGGTTAGGGTTGAGTTGGGACCGGTTTGGTCCGGTTCAACCCATTTCTTGGGCGGTTCGAGATTTTCAGGGCCGGTTCGAAGCTGTT
CGGGTCGTTCACGCCGTGAGATCCATGCTCAACTTCGTGTCGTTTTAGCGTGGCCTCCCTTCGGAAAGTGTTTGAATGGGATCAATACCAAGGTGAATGGGGAAAGTGAC
TATAAGGATCTTAGGTTCGTCTTAACGGAGGAATGTCCTCCTGTTCCCCCTCGCACTACCGCTCAGGCAGTAAAGGATGCCCACGAACGCTGGACAAAGGTCAATGAAAA
GGTCAGAGTCTATATACTGGTCAGCTTGTCTGAAGTTTTGGCCAAACGTTACGAGAACGTGAAGACTACCAGAGAGATTATGAATTCCCTGCAGGAGATGTTTGGACTCC
CGTCCTATCAGCTCCACCACGATGCCTTGAAGAACGTCTTCAATGCCAAGATGCAAGAGGGTCAATCTGTTCGGAAACATGTCCTGGACATGATTAACCAGTTCAATATT
GCTGAGGCAAATGGAAGGGCAGTCTGCGAGTGCAGTCAGGTTGTGTTCATCCTTCACTCGCTTCCTGCGAGCTATATGTCGTTTAGGACGAACGCGAGCATGAACAAAAT
TCAGTTCAACCTGACTACCCTCCTCTCGGAGTTACAGATTTATGAGTCCATGCAGAAAAGCAAGGGCAAGAATGTGGTGAAAGGAGAGGCCAATGTGGCCCATTCCAAAA
AGAAGTTCCTGAAGGGTTCATCCTCAGGGACTAAATCTGTACCTCATGCTTCTTCATCGAAGCAAAATCAAAAGAGGACGGGAGACAAGGGGAAGGCTCCTACGCAAGCT
GTGAAAGGTAAGGGCAAGGCCAAGGTTGTGGCCGACAAAGGTAGATGCTTCCACTGCAATGCAGATGGTCATTGGAAGTGCAACTGTCCCTATTACCTCGCTGAGAAGAA
GAGAGAAAAAGAAGGTAAATTCGATTTACTTGTGTTAGAGACTTGTCTCATCGAACATGATGAGTTTGCCTGGATATTGGATTCGGAAGCCACTAATCATGTTTGCTCTT
CTTTTCAGGGAAATGATTTCCAGCAGCTGGCTGATGGTGAAATGACTCTCAAGGTTGGAATAGGAGAGGTCGTTTCAGCTCGTGCAATGGGAACTGCAAAATTATTTTTT
AGAAATAAGTATTTCATTTTAGAGGACTTGTATTTGGTTCCTAGAATTAAAAGGAATCTTATTTCTGTTTCTGCTTTCCTTGAACAAGGTTATACTATTTCATTCTTGCT
TAATGAAGCTTTAATTTCTCGGAATGGAACTTATATTTGTTCAGCTAAACGTGAAAATAATTTATTTGTGTTAAGACCTACCGACGCTAAGGCTATTTTAAGTCATGAAA
TGTTTAAAACGACCGAAACGCAAAACAAAAGGCAAAAGATTTCTCCTCTAAGTAACAATTCGTATCTTTCGCACCTTTGTCTCGGTCATATTAACATCAATCAGATCGAT
CGTTTGGTCAAAAATGGACTTCTAACTGATTTAGAAGATACATCTTTACCATCCTATGAATCGTGTCTCGAGGGTAAAATGACCAATCGGTCTTTTACTGGAAAAGGTTA
TAGGGCCAAAGAACCACTTGAACTGATACATTCGGATCTTTGTGGTCTAATGAATGTAAAGGCTCGAGCTCCTAAGACACAAAACCCATATGTTCTTCCTCAAAATTCAT
CATGCTCGGGTTCCCACAAGCCCGTTCTAAGGCCGGAGAATAGCGGGGAAGACTCTAGAGCACCTGGAACACCTCAGCAAAATGGTGTATCAGAAAGGAGAAATAAAACC
TTGTTAGACATGGTTCGATCTATGATGAGCTATGCTCAATTGCCTAGCTCGTTTTGGGGGTATGCAGTAGAGACTGCGGTACATATTCTTAACAACGTTCCCTCAAAAAG
TTTTTCTGAAACTCCTTTTGAGTTATGGAGGGGGCGTAAACCTAGTTTACGTTACTTCCGTATCTGGGGTTGCCCTGCTGATAACTGCCCAAAAGATTATGCTGCTGAGC
GACTGGAGGGAGCAAATTCTATGCTGCAGCAAAACTGGGAACAGAAACTGCCACATCACAGCTCGTTAGCCAACTTCATGAACCGACTTCTGTTGAGTTATTTTCGTGAT
AAAGGATCAAGGAGAGCCTTACACGTGTCCTAG
mRNA sequenceShow/hide mRNA sequence
ATGCGAAGAAACAAGGTGGTTGATTTGTTTCCGCTAGATCTTGAGATTAACAGGACTCTTAAATCCATTCGAAGAGAAAAAAGATTAGCAGAAGCGATGGCCCACCAAGA
AGAAGCTCCCAAGGCAATTAGAGATTTTCTGCAGCCAGTTCTTCCTACCGAGAATTCTGGAATTGTCTACGCCCCTATCCAAGCTACAAATTTGGAGCTAAAAACAGGGT
TGATTCAGATGGCGCGCGATAACTCGTTCAAGGGACATCCTTCCGAGGACCCCCACTCTCATCTGCGATCATTCCTAGAAATATGTGGGACGGATAAAGCAAAAGATTGG
CTCGAATCAGTCGAGACGGGTAGCATCAGTACTTCGGACGAGCTTGCCCAGGCCTTTCTGACAAAATTTTTTCCACCTGCTAAGACTACCAAGCTCCGGACTGAAATTGG
AACATTCAAGCAACTTAACGAGGAGCAGTTGTACGAAGCGTGGGAAAGATATAAGGAAATGCTCAGGCGATGCCCCCAACACGGATATCCTGATTGGCTTCAGGTACAGT
TATTTTATAATGGATTAAACCCCTCCACAAAGACAGTCCTAGACACATCAGCAGGAGGGAGTTTTCTTTCAAAAACAGTGACAGAAGCCAAAGACCTACTTGAGGAAATG
GCGGCAACAAGTTATCAATGGCCGGCGGAGAGAGGAACAGTTACAAAGAAGGCTGGATTATATGAATTGGATGAGTCAAGTTCACTGAAAGCGCAACTGGCATCTCTGAC
CAATGCACTAAACAAATTGACTTCATATGAGGTGGTTAAGTCCATTTCCACCTTAGCAGAAGGACATTCAAAAAGGAAGCCACCCCCAGGTTTTGCATCATCCAGTGCCC
CTGAAAAGAAAAATAATCTGGAAGAGATGGTGGCTTTATTCATCAAAGAACAAAGAATACTGAATGTAAGTCTCCAGACATCAGTAAACAACCACGATGCAGCTCTAAAG
AATATGGAAGTGCAGATAGGACAGATCGCTTCAGCAGTAAATGCCCTTCAAAAGGGAAAATTTCCAAGTGATACTGAGCCTAACCCGAAAGAGCAGTGTAAGATGGTGGT
TCTGAGAAGTGGCAGGAGACTGGAGGACAGTTTAGAGAAGAAAAAGGAAGAAGAAAAGAGAAGGGTTAAAGATGAAGGGACTGAGGCACAAAAAGCCTCCTTTGAAAGGT
TCCAACATCCTCCCAAATCTATTGAATTAAAATGTGATTTTTCTAACAACTTTGCAGGTAGAAAAGAAGATGAGAGGCAGAATGACAAAAAGAAGCTGACTGAGGAAGAA
GTGGTTCCATGCAACCACCATGACAGAGGTTCGCATATTAGCCCGCCCAAGCGGAGGGGCGAATGTCCAACCTTTGATTTAAGGGAGTTACCTTTTCCTCAAAGATTTAA
AAATGTCAAATTAGATGAACAGTTTCAAAAATTTCTAGAAATGTTTAAAAAGTTGTCTGTGAATATTCCATTGGTAGAAGCCTTGTATAATATGCCAAATTATGGAAAAT
TCATGAAGGAAATGCTTTCTAAAAAGAAAAGTCTGAAAAAAGAAGTTTTTAATTTATCTGAGAGTAGCAGTACCATTATTTCTGGTAGGATACCCTCTAAGCAGAAAGAT
CCAGGGAGTTTTACTGTTCCCTGCACCATAGGAGAAGTATCCTTCGATAGGGCTTTATATGATTTAGGAGCAAGTATAAATTTGATGCCCTACTCTGTGTACAGGAAGAT
TGGTTTAACAGGTATGACAGATACCAGCGTCACTCTCCAGCTTGCTGATAGATCGATTACCCACCCGATGGGTGTTGTGGAGGACGTGTTGGTGAAAGTCAACAAATTCA
TCTTCCCTGTAGATTTCGTGGTACTGGACATGAAGGAGGACAAAGAAGTGCCAATTATCTTAGGAAGACCTTTCCTAGCCACTGGTAAGGCTGAGATTAGCGTGCATACA
GGTAAACTTACCTTGAACATTGATGATGAAAAAGTAGTGTTTAGTATTTTTGGCCAAGATGAATCTATTTGTAGTTTGCATACATGTTTTTCTGTTGGGCCTGAATACTT
AACTGATGACGATGAAGAGGTAGACTATAATCTTGGGCTAGGCTTAGGAGAAATGCTTATGGATAATGTGAATTTTGATCATGATGCATATATGGATAATCCTATGTTTG
AAAATGATTTGGATCTGCCTGACTTTGAAAATGAATTAGATTTGCCTGCTTGTGAAAATGAAAGATCTGCAGTTGATGATTTACCTTCCTTTGAAAATGAATTAGATTTG
CCTGAAATGGATAATTTTGATGATGATATTGAATTGCCTGACATTGAGCATGAACATAAAAGGCATAAAAAAGATTGCTCGATAGATGATTTTGAGTCTGACCATGATTA
CAGTGAATCTATTGAGTCTGATCTTGACATTCCTGAATGCATGAATCCTGACAATGTGAATACTTTTTTTTCATGTCCTGATGATGTGTATAGCATAGAATCTGACCTAG
AAGAACTCGAATCTGTGCATAGTATAGAATCTGATCCTGAAATTCTTGAATTCTTGAATTCTTCTGATGATGAGTCATGTGATAGCTTAGTTGAATCATTTTTTAATTCT
GAGTCTGAGGGATCTGCTGAGCATGTAGACATTTTTTCTGATGAGTGGACTGGCATGATTGATAGGCCATCTCTAGATCCTAGACCAGTAGATATTATAACGCTTGATGA
CTCTGTTAACAACTGTTTTGAGAATAGAGATTTAGAAAAGAGATTTGATACGGCTGATTATTTTCTAAAATGTTATTATGAGAATCTCTTTAGTGAAGATGGGCCAGGGG
AAAGTTTTTTTCTCCCCTGTTTGTACGTCGAGACGCTAAGTAAGGCAGCGTCGCGACGCTACCCTTTCTTGTTGCCCAAGCCGCGTGCGCAGCAGCGTCGCGACGCTGAG
CAAGAATCGAGCTCTAGACGCAACAGCGTCGGGACGCTTGGCCTATTGTGTCTCGACACTGTCGCGATTTTGTGCGGCAGTTCAGCCTTTGTGAGGAGTTGTGCTGAAAC
CCCAATGGGCGGTCTGGTTCGTGCGGTTAGGGTTGAGTTGGGACCGGTTTGGTCCGGTTCAACCCATTTCTTGGGCGGTTCGAGATTTTCAGGGCCGGTTCGAAGCTGTT
CGGGTCGTTCACGCCGTGAGATCCATGCTCAACTTCGTGTCGTTTTAGCGTGGCCTCCCTTCGGAAAGTGTTTGAATGGGATCAATACCAAGGTGAATGGGGAAAGTGAC
TATAAGGATCTTAGGTTCGTCTTAACGGAGGAATGTCCTCCTGTTCCCCCTCGCACTACCGCTCAGGCAGTAAAGGATGCCCACGAACGCTGGACAAAGGTCAATGAAAA
GGTCAGAGTCTATATACTGGTCAGCTTGTCTGAAGTTTTGGCCAAACGTTACGAGAACGTGAAGACTACCAGAGAGATTATGAATTCCCTGCAGGAGATGTTTGGACTCC
CGTCCTATCAGCTCCACCACGATGCCTTGAAGAACGTCTTCAATGCCAAGATGCAAGAGGGTCAATCTGTTCGGAAACATGTCCTGGACATGATTAACCAGTTCAATATT
GCTGAGGCAAATGGAAGGGCAGTCTGCGAGTGCAGTCAGGTTGTGTTCATCCTTCACTCGCTTCCTGCGAGCTATATGTCGTTTAGGACGAACGCGAGCATGAACAAAAT
TCAGTTCAACCTGACTACCCTCCTCTCGGAGTTACAGATTTATGAGTCCATGCAGAAAAGCAAGGGCAAGAATGTGGTGAAAGGAGAGGCCAATGTGGCCCATTCCAAAA
AGAAGTTCCTGAAGGGTTCATCCTCAGGGACTAAATCTGTACCTCATGCTTCTTCATCGAAGCAAAATCAAAAGAGGACGGGAGACAAGGGGAAGGCTCCTACGCAAGCT
GTGAAAGGTAAGGGCAAGGCCAAGGTTGTGGCCGACAAAGGTAGATGCTTCCACTGCAATGCAGATGGTCATTGGAAGTGCAACTGTCCCTATTACCTCGCTGAGAAGAA
GAGAGAAAAAGAAGGTAAATTCGATTTACTTGTGTTAGAGACTTGTCTCATCGAACATGATGAGTTTGCCTGGATATTGGATTCGGAAGCCACTAATCATGTTTGCTCTT
CTTTTCAGGGAAATGATTTCCAGCAGCTGGCTGATGGTGAAATGACTCTCAAGGTTGGAATAGGAGAGGTCGTTTCAGCTCGTGCAATGGGAACTGCAAAATTATTTTTT
AGAAATAAGTATTTCATTTTAGAGGACTTGTATTTGGTTCCTAGAATTAAAAGGAATCTTATTTCTGTTTCTGCTTTCCTTGAACAAGGTTATACTATTTCATTCTTGCT
TAATGAAGCTTTAATTTCTCGGAATGGAACTTATATTTGTTCAGCTAAACGTGAAAATAATTTATTTGTGTTAAGACCTACCGACGCTAAGGCTATTTTAAGTCATGAAA
TGTTTAAAACGACCGAAACGCAAAACAAAAGGCAAAAGATTTCTCCTCTAAGTAACAATTCGTATCTTTCGCACCTTTGTCTCGGTCATATTAACATCAATCAGATCGAT
CGTTTGGTCAAAAATGGACTTCTAACTGATTTAGAAGATACATCTTTACCATCCTATGAATCGTGTCTCGAGGGTAAAATGACCAATCGGTCTTTTACTGGAAAAGGTTA
TAGGGCCAAAGAACCACTTGAACTGATACATTCGGATCTTTGTGGTCTAATGAATGTAAAGGCTCGAGCTCCTAAGACACAAAACCCATATGTTCTTCCTCAAAATTCAT
CATGCTCGGGTTCCCACAAGCCCGTTCTAAGGCCGGAGAATAGCGGGGAAGACTCTAGAGCACCTGGAACACCTCAGCAAAATGGTGTATCAGAAAGGAGAAATAAAACC
TTGTTAGACATGGTTCGATCTATGATGAGCTATGCTCAATTGCCTAGCTCGTTTTGGGGGTATGCAGTAGAGACTGCGGTACATATTCTTAACAACGTTCCCTCAAAAAG
TTTTTCTGAAACTCCTTTTGAGTTATGGAGGGGGCGTAAACCTAGTTTACGTTACTTCCGTATCTGGGGTTGCCCTGCTGATAACTGCCCAAAAGATTATGCTGCTGAGC
GACTGGAGGGAGCAAATTCTATGCTGCAGCAAAACTGGGAACAGAAACTGCCACATCACAGCTCGTTAGCCAACTTCATGAACCGACTTCTGTTGAGTTATTTTCGTGAT
AAAGGATCAAGGAGAGCCTTACACGTGTCCTAG
Protein sequenceShow/hide protein sequence
MRRNKVVDLFPLDLEINRTLKSIRREKRLAEAMAHQEEAPKAIRDFLQPVLPTENSGIVYAPIQATNLELKTGLIQMARDNSFKGHPSEDPHSHLRSFLEICGTDKAKDW
LESVETGSISTSDELAQAFLTKFFPPAKTTKLRTEIGTFKQLNEEQLYEAWERYKEMLRRCPQHGYPDWLQVQLFYNGLNPSTKTVLDTSAGGSFLSKTVTEAKDLLEEM
AATSYQWPAERGTVTKKAGLYELDESSSLKAQLASLTNALNKLTSYEVVKSISTLAEGHSKRKPPPGFASSSAPEKKNNLEEMVALFIKEQRILNVSLQTSVNNHDAALK
NMEVQIGQIASAVNALQKGKFPSDTEPNPKEQCKMVVLRSGRRLEDSLEKKKEEEKRRVKDEGTEAQKASFERFQHPPKSIELKCDFSNNFAGRKEDERQNDKKKLTEEE
VVPCNHHDRGSHISPPKRRGECPTFDLRELPFPQRFKNVKLDEQFQKFLEMFKKLSVNIPLVEALYNMPNYGKFMKEMLSKKKSLKKEVFNLSESSSTIISGRIPSKQKD
PGSFTVPCTIGEVSFDRALYDLGASINLMPYSVYRKIGLTGMTDTSVTLQLADRSITHPMGVVEDVLVKVNKFIFPVDFVVLDMKEDKEVPIILGRPFLATGKAEISVHT
GKLTLNIDDEKVVFSIFGQDESICSLHTCFSVGPEYLTDDDEEVDYNLGLGLGEMLMDNVNFDHDAYMDNPMFENDLDLPDFENELDLPACENERSAVDDLPSFENELDL
PEMDNFDDDIELPDIEHEHKRHKKDCSIDDFESDHDYSESIESDLDIPECMNPDNVNTFFSCPDDVYSIESDLEELESVHSIESDPEILEFLNSSDDESCDSLVESFFNS
ESEGSAEHVDIFSDEWTGMIDRPSLDPRPVDIITLDDSVNNCFENRDLEKRFDTADYFLKCYYENLFSEDGPGESFFLPCLYVETLSKAASRRYPFLLPKPRAQQRRDAE
QESSSRRNSVGTLGLLCLDTVAILCGSSAFVRSCAETPMGGLVRAVRVELGPVWSGSTHFLGGSRFSGPVRSCSGRSRREIHAQLRVVLAWPPFGKCLNGINTKVNGESD
YKDLRFVLTEECPPVPPRTTAQAVKDAHERWTKVNEKVRVYILVSLSEVLAKRYENVKTTREIMNSLQEMFGLPSYQLHHDALKNVFNAKMQEGQSVRKHVLDMINQFNI
AEANGRAVCECSQVVFILHSLPASYMSFRTNASMNKIQFNLTTLLSELQIYESMQKSKGKNVVKGEANVAHSKKKFLKGSSSGTKSVPHASSSKQNQKRTGDKGKAPTQA
VKGKGKAKVVADKGRCFHCNADGHWKCNCPYYLAEKKREKEGKFDLLVLETCLIEHDEFAWILDSEATNHVCSSFQGNDFQQLADGEMTLKVGIGEVVSARAMGTAKLFF
RNKYFILEDLYLVPRIKRNLISVSAFLEQGYTISFLLNEALISRNGTYICSAKRENNLFVLRPTDAKAILSHEMFKTTETQNKRQKISPLSNNSYLSHLCLGHININQID
RLVKNGLLTDLEDTSLPSYESCLEGKMTNRSFTGKGYRAKEPLELIHSDLCGLMNVKARAPKTQNPYVLPQNSSCSGSHKPVLRPENSGEDSRAPGTPQQNGVSERRNKT
LLDMVRSMMSYAQLPSSFWGYAVETAVHILNNVPSKSFSETPFELWRGRKPSLRYFRIWGCPADNCPKDYAAERLEGANSMLQQNWEQKLPHHSSLANFMNRLLLSYFRD
KGSRRALHVS