CuGenDBv2

Tan0021670 (gene) of Snake gourd v1 genome

Gene ID: Tan0021670
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Gag/pol protein
Genome location: LG06:47301933..47303683
RNA-Seq Expression: Tan0021670
Synteny: Tan0021670
Gene Ontology terms:
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domains:
IPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR025724 - GAG-pre-integrase domain
IPR036397 - Ribonuclease H superfamily
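
The genome location field above packs a linkage group and a coordinate range into one string. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the record itself does not state the convention), the field can be split and the genomic span computed as follows. The function name `parse_location` is hypothetical and not part of CuGenDB.

```python
import re


def parse_location(loc: str):
    """Split a 'SEQID:START..END' string into its parts and a span length.

    Assumption: START and END are 1-based, inclusive coordinates.
    """
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", loc.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {loc!r}")
    seqid = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    if end < start:
        raise ValueError("end coordinate precedes start coordinate")
    # Inclusive range, so both endpoints count toward the length.
    return seqid, start, end, end - start + 1


seqid, start, end, span = parse_location("LG06:47301933..47303683")
print(seqid, start, end, span)
```

Under the inclusive-coordinate assumption the span covers 1751 positions on LG06.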


Homology
GenBank top hits (e value, % identity, alignment)
KAA0031826.1 gag/pol protein [Cucumis melo var. makuwa] (e value: 1.8e-159; % identity: 54.33)
Query:  MSDVLAKKHELMVTANEIMESLQEMFGQQSIQVRHDSLKYVFNAWMKEGSSVREHVLDTMTHFNLAEMNGASIDESS-----------------------
        +S+VLAKKHE M+TA EIM+SLQEMFGQ S Q++HD+LKY++NA M EG+SVREHVL+ M HFN+AEMNGA IDE+S                       
Subjt:  MSDVLAKKHELMVTANEIMESLQEMFGQQSIQVRHDSLKYVFNAWMKEGSSVREHVLDTMTHFNLAEMNGASIDESS-----------------------

Query:  -------------QNFQSLNRVKASKFEANVA--YRSYHRGSTSRTKPVAPSRPKGKKRMTRG----KTDRAVAHKGKKART------------------
                     Q F+SL ++K  K EANVA   R +HRGSTS TK +  S    K +  +G    K + A A   KKA+                   
Subjt:  -------------QNFQSLNRVKASKFEANVA--YRSYHRGSTSRTKPVAPSRPKGKKRMTRG----KTDRAVAHKGKKART------------------

Query:  ---LQRKESVSTS-----------MGADTGRGIVPNSLPR---GRIKGIDSWQPLQEGEVTLRVGSGELVSAAAIDTVKLHFGTSYILLDNLYIVQGFTR
           L  K+                +  D    I+ +          +GI SW+ L+ GE+T+RVG+G +VSA A+  ++L    S++LL+N+Y+V    R
Subjt:  ---LQRKESVSTS-----------MGADTGRGIVPNSLPR---GRIKGIDSWQPLQEGEVTLRVGSGELVSAAAIDTVKLHFGTSYILLDNLYIVQGFTR

Query:  NLGSISCLLEQCISVSFYGNKAFISRNGNLICSASLEHNLYVLKPNSVKSVLNTKLFKTAETGTKKAKVSPKENAYLWHLRLGHINLKRIEKLVKSGLLN
        NL S+ CLLEQ  S++F  NK FI +NG  ICSA LE+NLYVL+  + K++LNT++FKTA T  K+ K+SPKENA+LWHLRLGHINL RIE+LVK+GLL+
Subjt:  NLGSISCLLEQCISVSFYGNKAFISRNGNLICSASLEHNLYVLKPNSVKSVLNTKLFKTAETGTKKAKVSPKENAYLWHLRLGHINLKRIEKLVKSGLLN

Query:  ELEENFLSVCESCLLCMMTKRPFSGKGYRAKEPLKLVHSDLCCPMNVKARGGYEYFVSFIGDYSRYGYIYLMDRKSETLEKFKEYKTEVENLLGKSLKTL
        ELEEN L VCESCL   MTKRPF+GKG+RAKEPL+LVHSDLC PMNVKARGG+EYF++F  DYSRYGY+YLM  KSE LEKFKEYK EVEN L K++KT 
Subjt:  ELEENFLSVCESCLLCMMTKRPFSGKGYRAKEPLKLVHSDLCCPMNVKARGGYEYFVSFIGDYSRYGYIYLMDRKSETLEKFKEYKTEVENLLGKSLKTL

Query:  RSDRGGEYMDTEFQDYMIEHKITSQLSAPGMPQQNGVSERKNKSLLDMVRSMRSYARLPDSFWGYAVETAVYILNNVLSKSVCETPFEL
        RSDRGGEYMD +FQ+Y++E  I SQLSAPG PQQNGVSER+N++LLDMVRSM SYA LP+SFWGYAV+TAVYILN V SKSV ETP +L
Subjt:  RSDRGGEYMDTEFQDYMIEHKITSQLSAPGMPQQNGVSERKNKSLLDMVRSMRSYARLPDSFWGYAVETAVYILNNVLSKSVCETPFEL

KAA0047792.1 gag/pol protein [Cucumis melo var. makuwa] (e value: 1.8e-159; % identity: 54.33)
Query:  MSDVLAKKHELMVTANEIMESLQEMFGQQSIQVRHDSLKYVFNAWMKEGSSVREHVLDTMTHFNLAEMNGASIDESS-----------------------
        +S+VLAKKHE M+TA EIM+SLQEMFGQ S Q++HD+LKY++NA M EG+SVREHVL+ M HFN+AEMNGA IDE+S                       
Subjt:  MSDVLAKKHELMVTANEIMESLQEMFGQQSIQVRHDSLKYVFNAWMKEGSSVREHVLDTMTHFNLAEMNGASIDESS-----------------------

Query:  -------------QNFQSLNRVKASKFEANVA--YRSYHRGSTSRTKPVAPSRPKGKKRMTRG----KTDRAVAHKGKKART------------------
                     Q F+SL ++K  K EANVA   R +HRGSTS TK +  S    K +  +G    K + A A   KKA+                   
Subjt:  -------------QNFQSLNRVKASKFEANVA--YRSYHRGSTSRTKPVAPSRPKGKKRMTRG----KTDRAVAHKGKKART------------------

Query:  ---LQRKESVSTS-----------MGADTGRGIVPNSLPR---GRIKGIDSWQPLQEGEVTLRVGSGELVSAAAIDTVKLHFGTSYILLDNLYIVQGFTR
           L  K+                +  D    I+ +          +GI SW+ L+ GE+T+RVG+G +VSA A+  ++L    S++LL+N+Y+V    R
Subjt:  ---LQRKESVSTS-----------MGADTGRGIVPNSLPR---GRIKGIDSWQPLQEGEVTLRVGSGELVSAAAIDTVKLHFGTSYILLDNLYIVQGFTR

Query:  NLGSISCLLEQCISVSFYGNKAFISRNGNLICSASLEHNLYVLKPNSVKSVLNTKLFKTAETGTKKAKVSPKENAYLWHLRLGHINLKRIEKLVKSGLLN
        NL S+ CLLEQ  S++F  NK FI +NG  ICSA LE+NLYVL+  + K++LNT++FKTA T  K+ K+SPKENA+LWHLRLGHINL RIE+LVK+GLL+
Subjt:  NLGSISCLLEQCISVSFYGNKAFISRNGNLICSASLEHNLYVLKPNSVKSVLNTKLFKTAETGTKKAKVSPKENAYLWHLRLGHINLKRIEKLVKSGLLN

Query:  ELEENFLSVCESCLLCMMTKRPFSGKGYRAKEPLKLVHSDLCCPMNVKARGGYEYFVSFIGDYSRYGYIYLMDRKSETLEKFKEYKTEVENLLGKSLKTL
        ELEEN L VCESCL   MTKRPF+GKG+RAKEPL+LVHSDLC PMNVKARGG+EYF++F  DYSRYGY+YLM  KSE LEKFKEYK EVEN L K++KT 
Subjt:  ELEENFLSVCESCLLCMMTKRPFSGKGYRAKEPLKLVHSDLCCPMNVKARGGYEYFVSFIGDYSRYGYIYLMDRKSETLEKFKEYKTEVENLLGKSLKTL

Query:  RSDRGGEYMDTEFQDYMIEHKITSQLSAPGMPQQNGVSERKNKSLLDMVRSMRSYARLPDSFWGYAVETAVYILNNVLSKSVCETPFEL
        RSDRGGEYMD +FQ+Y++E  I SQLSAPG PQQNGVSER+N++LLDMVRSM SYA LP+SFWGYAV+TAVYILN V SKSV ETP +L
Subjt:  RSDRGGEYMDTEFQDYMIEHKITSQLSAPGMPQQNGVSERKNKSLLDMVRSMRSYARLPDSFWGYAVETAVYILNNVLSKSVCETPFEL

KAA0048404.1 gag/pol protein [Cucumis melo var. makuwa] (e value: 1.8e-159; % identity: 54.33)
Query:  MSDVLAKKHELMVTANEIMESLQEMFGQQSIQVRHDSLKYVFNAWMKEGSSVREHVLDTMTHFNLAEMNGASIDESS-----------------------
        +S+VLAKKHE M+TA EIM+SLQEMFGQ S Q++HD+LKY++NA M EG+SVREHVL+ M HFN+AEMNGA IDE+S                       
Subjt:  MSDVLAKKHELMVTANEIMESLQEMFGQQSIQVRHDSLKYVFNAWMKEGSSVREHVLDTMTHFNLAEMNGASIDESS-----------------------

Query:  -------------QNFQSLNRVKASKFEANVA--YRSYHRGSTSRTKPVAPSRPKGKKRMTRG----KTDRAVAHKGKKART------------------
                     Q F+SL ++K  K EANVA   R +HRGSTS TK +  S    K +  +G    K + A A   KKA+                   
Subjt:  -------------QNFQSLNRVKASKFEANVA--YRSYHRGSTSRTKPVAPSRPKGKKRMTRG----KTDRAVAHKGKKART------------------

Query:  ---LQRKESVSTS-----------MGADTGRGIVPNSLPR---GRIKGIDSWQPLQEGEVTLRVGSGELVSAAAIDTVKLHFGTSYILLDNLYIVQGFTR
           L  K+                +  D    I+ +          +GI SW+ L+ GE+T+RVG+G +VSA A+  ++L    S++LL+N+Y+V    R
Subjt:  ---LQRKESVSTS-----------MGADTGRGIVPNSLPR---GRIKGIDSWQPLQEGEVTLRVGSGELVSAAAIDTVKLHFGTSYILLDNLYIVQGFTR

Query:  NLGSISCLLEQCISVSFYGNKAFISRNGNLICSASLEHNLYVLKPNSVKSVLNTKLFKTAETGTKKAKVSPKENAYLWHLRLGHINLKRIEKLVKSGLLN
        NL S+ CLLEQ  S++F  NK FI +NG  ICSA LE+NLYVL+  + K++LNT++FKTA T  K+ K+SPKENA+LWHLRLGHINL RIE+LVK+GLL+
Subjt:  NLGSISCLLEQCISVSFYGNKAFISRNGNLICSASLEHNLYVLKPNSVKSVLNTKLFKTAETGTKKAKVSPKENAYLWHLRLGHINLKRIEKLVKSGLLN

Query:  ELEENFLSVCESCLLCMMTKRPFSGKGYRAKEPLKLVHSDLCCPMNVKARGGYEYFVSFIGDYSRYGYIYLMDRKSETLEKFKEYKTEVENLLGKSLKTL
        ELEEN L VCESCL   MTKRPF+GKG+RAKEPL+LVHSDLC PMNVKARGG+EYF++F  DYSRYGY+YLM  KSE LEKFKEYK EVEN L K++KT 
Subjt:  ELEENFLSVCESCLLCMMTKRPFSGKGYRAKEPLKLVHSDLCCPMNVKARGGYEYFVSFIGDYSRYGYIYLMDRKSETLEKFKEYKTEVENLLGKSLKTL

Query:  RSDRGGEYMDTEFQDYMIEHKITSQLSAPGMPQQNGVSERKNKSLLDMVRSMRSYARLPDSFWGYAVETAVYILNNVLSKSVCETPFEL
        RSDRGGEYMD +FQ+Y++E  I SQLSAPG PQQNGVSER+N++LLDMVRSM SYA LP+SFWGYAV+TAVYILN V SKSV ETP +L
Subjt:  RSDRGGEYMDTEFQDYMIEHKITSQLSAPGMPQQNGVSERKNKSLLDMVRSMRSYARLPDSFWGYAVETAVYILNNVLSKSVCETPFEL

KAA0054490.1 gag/pol protein [Cucumis melo var. makuwa] (e value: 1.8e-159; % identity: 54.33)
Query:  MSDVLAKKHELMVTANEIMESLQEMFGQQSIQVRHDSLKYVFNAWMKEGSSVREHVLDTMTHFNLAEMNGASIDESS-----------------------
        +S+VLAKKHE M+TA EIM+SLQEMFGQ S Q++HD+LKY++NA M EG+SVREHVL+ M HFN+AEMNGA IDE+S                       
Subjt:  MSDVLAKKHELMVTANEIMESLQEMFGQQSIQVRHDSLKYVFNAWMKEGSSVREHVLDTMTHFNLAEMNGASIDESS-----------------------

Query:  -------------QNFQSLNRVKASKFEANVA--YRSYHRGSTSRTKPVAPSRPKGKKRMTRG----KTDRAVAHKGKKART------------------
                     Q F+SL ++K  K EANVA   R +HRGSTS TK +  S    K +  +G    K + A A   KKA+                   
Subjt:  -------------QNFQSLNRVKASKFEANVA--YRSYHRGSTSRTKPVAPSRPKGKKRMTRG----KTDRAVAHKGKKART------------------

Query:  ---LQRKESVSTS-----------MGADTGRGIVPNSLPR---GRIKGIDSWQPLQEGEVTLRVGSGELVSAAAIDTVKLHFGTSYILLDNLYIVQGFTR
           L  K+                +  D    I+ +          +GI SW+ L+ GE+T+RVG+G +VSA A+  ++L    S++LL+N+Y+V    R
Subjt:  ---LQRKESVSTS-----------MGADTGRGIVPNSLPR---GRIKGIDSWQPLQEGEVTLRVGSGELVSAAAIDTVKLHFGTSYILLDNLYIVQGFTR

Query:  NLGSISCLLEQCISVSFYGNKAFISRNGNLICSASLEHNLYVLKPNSVKSVLNTKLFKTAETGTKKAKVSPKENAYLWHLRLGHINLKRIEKLVKSGLLN
        NL S+ CLLEQ  S++F  NK FI +NG  ICSA LE+NLYVL+  + K++LNT++FKTA T  K+ K+SPKENA+LWHLRLGHINL RIE+LVK+GLL+
Subjt:  NLGSISCLLEQCISVSFYGNKAFISRNGNLICSASLEHNLYVLKPNSVKSVLNTKLFKTAETGTKKAKVSPKENAYLWHLRLGHINLKRIEKLVKSGLLN

Query:  ELEENFLSVCESCLLCMMTKRPFSGKGYRAKEPLKLVHSDLCCPMNVKARGGYEYFVSFIGDYSRYGYIYLMDRKSETLEKFKEYKTEVENLLGKSLKTL
        ELEEN L VCESCL   MTKRPF+GKG+RAKEPL+LVHSDLC PMNVKARGG+EYF++F  DYSRYGY+YLM  KSE LEKFKEYK EVEN L K++KT 
Subjt:  ELEENFLSVCESCLLCMMTKRPFSGKGYRAKEPLKLVHSDLCCPMNVKARGGYEYFVSFIGDYSRYGYIYLMDRKSETLEKFKEYKTEVENLLGKSLKTL

Query:  RSDRGGEYMDTEFQDYMIEHKITSQLSAPGMPQQNGVSERKNKSLLDMVRSMRSYARLPDSFWGYAVETAVYILNNVLSKSVCETPFEL
        RSDRGGEYMD +FQ+Y++E  I SQLSAPG PQQNGVSER+N++LLDMVRSM SYA LP+SFWGYAV+TAVYILN V SKSV ETP +L
Subjt:  RSDRGGEYMDTEFQDYMIEHKITSQLSAPGMPQQNGVSERKNKSLLDMVRSMRSYARLPDSFWGYAVETAVYILNNVLSKSVCETPFEL

KAA0062993.1 gag/pol protein [Cucumis melo var. makuwa] (e value: 1.8e-159; % identity: 54.33)
Query:  MSDVLAKKHELMVTANEIMESLQEMFGQQSIQVRHDSLKYVFNAWMKEGSSVREHVLDTMTHFNLAEMNGASIDESS-----------------------
        +S+VLAKKHE M+TA EIM+SLQEMFGQ S Q++HD+LKY++NA M EG+SVREHVL+ M HFN+AEMNGA IDE+S                       
Subjt:  MSDVLAKKHELMVTANEIMESLQEMFGQQSIQVRHDSLKYVFNAWMKEGSSVREHVLDTMTHFNLAEMNGASIDESS-----------------------

Query:  -------------QNFQSLNRVKASKFEANVA--YRSYHRGSTSRTKPVAPSRPKGKKRMTRG----KTDRAVAHKGKKART------------------
                     Q F+SL ++K  K EANVA   R +HRGSTS TK +  S    K +  +G    K + A A   KKA+                   
Subjt:  -------------QNFQSLNRVKASKFEANVA--YRSYHRGSTSRTKPVAPSRPKGKKRMTRG----KTDRAVAHKGKKART------------------

Query:  ---LQRKESVSTS-----------MGADTGRGIVPNSLPR---GRIKGIDSWQPLQEGEVTLRVGSGELVSAAAIDTVKLHFGTSYILLDNLYIVQGFTR
           L  K+                +  D    I+ +          +GI SW+ L+ GE+T+RVG+G +VSA A+  ++L    S++LL+N+Y+V    R
Subjt:  ---LQRKESVSTS-----------MGADTGRGIVPNSLPR---GRIKGIDSWQPLQEGEVTLRVGSGELVSAAAIDTVKLHFGTSYILLDNLYIVQGFTR

Query:  NLGSISCLLEQCISVSFYGNKAFISRNGNLICSASLEHNLYVLKPNSVKSVLNTKLFKTAETGTKKAKVSPKENAYLWHLRLGHINLKRIEKLVKSGLLN
        NL S+ CLLEQ  S++F  NK FI +NG  ICSA LE+NLYVL+  + K++LNT++FKTA T  K+ K+SPKENA+LWHLRLGHINL RIE+LVK+GLL+
Subjt:  NLGSISCLLEQCISVSFYGNKAFISRNGNLICSASLEHNLYVLKPNSVKSVLNTKLFKTAETGTKKAKVSPKENAYLWHLRLGHINLKRIEKLVKSGLLN

Query:  ELEENFLSVCESCLLCMMTKRPFSGKGYRAKEPLKLVHSDLCCPMNVKARGGYEYFVSFIGDYSRYGYIYLMDRKSETLEKFKEYKTEVENLLGKSLKTL
        ELEEN L VCESCL   MTKRPF+GKG+RAKEPL+LVHSDLC PMNVKARGG+EYF++F  DYSRYGY+YLM  KSE LEKFKEYK EVEN L K++KT 
Subjt:  ELEENFLSVCESCLLCMMTKRPFSGKGYRAKEPLKLVHSDLCCPMNVKARGGYEYFVSFIGDYSRYGYIYLMDRKSETLEKFKEYKTEVENLLGKSLKTL

Query:  RSDRGGEYMDTEFQDYMIEHKITSQLSAPGMPQQNGVSERKNKSLLDMVRSMRSYARLPDSFWGYAVETAVYILNNVLSKSVCETPFEL
        RSDRGGEYMD +FQ+Y++E  I SQLSAPG PQQNGVSER+N++LLDMVRSM SYA LP+SFWGYAV+TAVYILN V SKSV ETP +L
Subjt:  RSDRGGEYMDTEFQDYMIEHKITSQLSAPGMPQQNGVSERKNKSLLDMVRSMRSYARLPDSFWGYAVETAVYILNNVLSKSVCETPFEL

TrEMBL top hits (e value, % identity, alignment)
A0A5A7SMH8 Gag/pol protein (e value: 8.5e-160; % identity: 54.33)
Query:  MSDVLAKKHELMVTANEIMESLQEMFGQQSIQVRHDSLKYVFNAWMKEGSSVREHVLDTMTHFNLAEMNGASIDESS-----------------------
        +S+VLAKKHE M+TA EIM+SLQEMFGQ S Q++HD+LKY++NA M EG+SVREHVL+ M HFN+AEMNGA IDE+S                       
Subjt:  MSDVLAKKHELMVTANEIMESLQEMFGQQSIQVRHDSLKYVFNAWMKEGSSVREHVLDTMTHFNLAEMNGASIDESS-----------------------

Query:  -------------QNFQSLNRVKASKFEANVA--YRSYHRGSTSRTKPVAPSRPKGKKRMTRG----KTDRAVAHKGKKART------------------
                     Q F+SL ++K  K EANVA   R +HRGSTS TK +  S    K +  +G    K + A A   KKA+                   
Subjt:  -------------QNFQSLNRVKASKFEANVA--YRSYHRGSTSRTKPVAPSRPKGKKRMTRG----KTDRAVAHKGKKART------------------

Query:  ---LQRKESVSTS-----------MGADTGRGIVPNSLPR---GRIKGIDSWQPLQEGEVTLRVGSGELVSAAAIDTVKLHFGTSYILLDNLYIVQGFTR
           L  K+                +  D    I+ +          +GI SW+ L+ GE+T+RVG+G +VSA A+  ++L    S++LL+N+Y+V    R
Subjt:  ---LQRKESVSTS-----------MGADTGRGIVPNSLPR---GRIKGIDSWQPLQEGEVTLRVGSGELVSAAAIDTVKLHFGTSYILLDNLYIVQGFTR

Query:  NLGSISCLLEQCISVSFYGNKAFISRNGNLICSASLEHNLYVLKPNSVKSVLNTKLFKTAETGTKKAKVSPKENAYLWHLRLGHINLKRIEKLVKSGLLN
        NL S+ CLLEQ  S++F  NK FI +NG  ICSA LE+NLYVL+  + K++LNT++FKTA T  K+ K+SPKENA+LWHLRLGHINL RIE+LVK+GLL+
Subjt:  NLGSISCLLEQCISVSFYGNKAFISRNGNLICSASLEHNLYVLKPNSVKSVLNTKLFKTAETGTKKAKVSPKENAYLWHLRLGHINLKRIEKLVKSGLLN

Query:  ELEENFLSVCESCLLCMMTKRPFSGKGYRAKEPLKLVHSDLCCPMNVKARGGYEYFVSFIGDYSRYGYIYLMDRKSETLEKFKEYKTEVENLLGKSLKTL
        ELEEN L VCESCL   MTKRPF+GKG+RAKEPL+LVHSDLC PMNVKARGG+EYF++F  DYSRYGY+YLM  KSE LEKFKEYK EVEN L K++KT 
Subjt:  ELEENFLSVCESCLLCMMTKRPFSGKGYRAKEPLKLVHSDLCCPMNVKARGGYEYFVSFIGDYSRYGYIYLMDRKSETLEKFKEYKTEVENLLGKSLKTL

Query:  RSDRGGEYMDTEFQDYMIEHKITSQLSAPGMPQQNGVSERKNKSLLDMVRSMRSYARLPDSFWGYAVETAVYILNNVLSKSVCETPFEL
        RSDRGGEYMD +FQ+Y++E  I SQLSAPG PQQNGVSER+N++LLDMVRSM SYA LP+SFWGYAV+TAVYILN V SKSV ETP +L
Subjt:  RSDRGGEYMDTEFQDYMIEHKITSQLSAPGMPQQNGVSERKNKSLLDMVRSMRSYARLPDSFWGYAVETAVYILNNVLSKSVCETPFEL

A0A5A7TWB9 Gag/pol protein (e value: 8.5e-160; % identity: 54.33)
Query:  MSDVLAKKHELMVTANEIMESLQEMFGQQSIQVRHDSLKYVFNAWMKEGSSVREHVLDTMTHFNLAEMNGASIDESS-----------------------
        +S+VLAKKHE M+TA EIM+SLQEMFGQ S Q++HD+LKY++NA M EG+SVREHVL+ M HFN+AEMNGA IDE+S                       
Subjt:  MSDVLAKKHELMVTANEIMESLQEMFGQQSIQVRHDSLKYVFNAWMKEGSSVREHVLDTMTHFNLAEMNGASIDESS-----------------------

Query:  -------------QNFQSLNRVKASKFEANVA--YRSYHRGSTSRTKPVAPSRPKGKKRMTRG----KTDRAVAHKGKKART------------------
                     Q F+SL ++K  K EANVA   R +HRGSTS TK +  S    K +  +G    K + A A   KKA+                   
Subjt:  -------------QNFQSLNRVKASKFEANVA--YRSYHRGSTSRTKPVAPSRPKGKKRMTRG----KTDRAVAHKGKKART------------------

Query:  ---LQRKESVSTS-----------MGADTGRGIVPNSLPR---GRIKGIDSWQPLQEGEVTLRVGSGELVSAAAIDTVKLHFGTSYILLDNLYIVQGFTR
           L  K+                +  D    I+ +          +GI SW+ L+ GE+T+RVG+G +VSA A+  ++L    S++LL+N+Y+V    R
Subjt:  ---LQRKESVSTS-----------MGADTGRGIVPNSLPR---GRIKGIDSWQPLQEGEVTLRVGSGELVSAAAIDTVKLHFGTSYILLDNLYIVQGFTR

Query:  NLGSISCLLEQCISVSFYGNKAFISRNGNLICSASLEHNLYVLKPNSVKSVLNTKLFKTAETGTKKAKVSPKENAYLWHLRLGHINLKRIEKLVKSGLLN
        NL S+ CLLEQ  S++F  NK FI +NG  ICSA LE+NLYVL+  + K++LNT++FKTA T  K+ K+SPKENA+LWHLRLGHINL RIE+LVK+GLL+
Subjt:  NLGSISCLLEQCISVSFYGNKAFISRNGNLICSASLEHNLYVLKPNSVKSVLNTKLFKTAETGTKKAKVSPKENAYLWHLRLGHINLKRIEKLVKSGLLN

Query:  ELEENFLSVCESCLLCMMTKRPFSGKGYRAKEPLKLVHSDLCCPMNVKARGGYEYFVSFIGDYSRYGYIYLMDRKSETLEKFKEYKTEVENLLGKSLKTL
        ELEEN L VCESCL   MTKRPF+GKG+RAKEPL+LVHSDLC PMNVKARGG+EYF++F  DYSRYGY+YLM  KSE LEKFKEYK EVEN L K++KT 
Subjt:  ELEENFLSVCESCLLCMMTKRPFSGKGYRAKEPLKLVHSDLCCPMNVKARGGYEYFVSFIGDYSRYGYIYLMDRKSETLEKFKEYKTEVENLLGKSLKTL

Query:  RSDRGGEYMDTEFQDYMIEHKITSQLSAPGMPQQNGVSERKNKSLLDMVRSMRSYARLPDSFWGYAVETAVYILNNVLSKSVCETPFEL
        RSDRGGEYMD +FQ+Y++E  I SQLSAPG PQQNGVSER+N++LLDMVRSM SYA LP+SFWGYAV+TAVYILN V SKSV ETP +L
Subjt:  RSDRGGEYMDTEFQDYMIEHKITSQLSAPGMPQQNGVSERKNKSLLDMVRSMRSYARLPDSFWGYAVETAVYILNNVLSKSVCETPFEL

A0A5A7TZD7 Gag/pol protein (e value: 8.5e-160; % identity: 54.33)
Query:  MSDVLAKKHELMVTANEIMESLQEMFGQQSIQVRHDSLKYVFNAWMKEGSSVREHVLDTMTHFNLAEMNGASIDESS-----------------------
        +S+VLAKKHE M+TA EIM+SLQEMFGQ S Q++HD+LKY++NA M EG+SVREHVL+ M HFN+AEMNGA IDE+S                       
Subjt:  MSDVLAKKHELMVTANEIMESLQEMFGQQSIQVRHDSLKYVFNAWMKEGSSVREHVLDTMTHFNLAEMNGASIDESS-----------------------

Query:  -------------QNFQSLNRVKASKFEANVA--YRSYHRGSTSRTKPVAPSRPKGKKRMTRG----KTDRAVAHKGKKART------------------
                     Q F+SL ++K  K EANVA   R +HRGSTS TK +  S    K +  +G    K + A A   KKA+                   
Subjt:  -------------QNFQSLNRVKASKFEANVA--YRSYHRGSTSRTKPVAPSRPKGKKRMTRG----KTDRAVAHKGKKART------------------

Query:  ---LQRKESVSTS-----------MGADTGRGIVPNSLPR---GRIKGIDSWQPLQEGEVTLRVGSGELVSAAAIDTVKLHFGTSYILLDNLYIVQGFTR
           L  K+                +  D    I+ +          +GI SW+ L+ GE+T+RVG+G +VSA A+  ++L    S++LL+N+Y+V    R
Subjt:  ---LQRKESVSTS-----------MGADTGRGIVPNSLPR---GRIKGIDSWQPLQEGEVTLRVGSGELVSAAAIDTVKLHFGTSYILLDNLYIVQGFTR

Query:  NLGSISCLLEQCISVSFYGNKAFISRNGNLICSASLEHNLYVLKPNSVKSVLNTKLFKTAETGTKKAKVSPKENAYLWHLRLGHINLKRIEKLVKSGLLN
        NL S+ CLLEQ  S++F  NK FI +NG  ICSA LE+NLYVL+  + K++LNT++FKTA T  K+ K+SPKENA+LWHLRLGHINL RIE+LVK+GLL+
Subjt:  NLGSISCLLEQCISVSFYGNKAFISRNGNLICSASLEHNLYVLKPNSVKSVLNTKLFKTAETGTKKAKVSPKENAYLWHLRLGHINLKRIEKLVKSGLLN

Query:  ELEENFLSVCESCLLCMMTKRPFSGKGYRAKEPLKLVHSDLCCPMNVKARGGYEYFVSFIGDYSRYGYIYLMDRKSETLEKFKEYKTEVENLLGKSLKTL
        ELEEN L VCESCL   MTKRPF+GKG+RAKEPL+LVHSDLC PMNVKARGG+EYF++F  DYSRYGY+YLM  KSE LEKFKEYK EVEN L K++KT 
Subjt:  ELEENFLSVCESCLLCMMTKRPFSGKGYRAKEPLKLVHSDLCCPMNVKARGGYEYFVSFIGDYSRYGYIYLMDRKSETLEKFKEYKTEVENLLGKSLKTL

Query:  RSDRGGEYMDTEFQDYMIEHKITSQLSAPGMPQQNGVSERKNKSLLDMVRSMRSYARLPDSFWGYAVETAVYILNNVLSKSVCETPFEL
        RSDRGGEYMD +FQ+Y++E  I SQLSAPG PQQNGVSER+N++LLDMVRSM SYA LP+SFWGYAV+TAVYILN V SKSV ETP +L
Subjt:  RSDRGGEYMDTEFQDYMIEHKITSQLSAPGMPQQNGVSERKNKSLLDMVRSMRSYARLPDSFWGYAVETAVYILNNVLSKSVCETPFEL

A0A5A7UGV2 Gag/pol protein (e value: 8.5e-160; % identity: 54.33)
Query:  MSDVLAKKHELMVTANEIMESLQEMFGQQSIQVRHDSLKYVFNAWMKEGSSVREHVLDTMTHFNLAEMNGASIDESS-----------------------
        +S+VLAKKHE M+TA EIM+SLQEMFGQ S Q++HD+LKY++NA M EG+SVREHVL+ M HFN+AEMNGA IDE+S                       
Subjt:  MSDVLAKKHELMVTANEIMESLQEMFGQQSIQVRHDSLKYVFNAWMKEGSSVREHVLDTMTHFNLAEMNGASIDESS-----------------------

Query:  -------------QNFQSLNRVKASKFEANVA--YRSYHRGSTSRTKPVAPSRPKGKKRMTRG----KTDRAVAHKGKKART------------------
                     Q F+SL ++K  K EANVA   R +HRGSTS TK +  S    K +  +G    K + A A   KKA+                   
Subjt:  -------------QNFQSLNRVKASKFEANVA--YRSYHRGSTSRTKPVAPSRPKGKKRMTRG----KTDRAVAHKGKKART------------------

Query:  ---LQRKESVSTS-----------MGADTGRGIVPNSLPR---GRIKGIDSWQPLQEGEVTLRVGSGELVSAAAIDTVKLHFGTSYILLDNLYIVQGFTR
           L  K+                +  D    I+ +          +GI SW+ L+ GE+T+RVG+G +VSA A+  ++L    S++LL+N+Y+V    R
Subjt:  ---LQRKESVSTS-----------MGADTGRGIVPNSLPR---GRIKGIDSWQPLQEGEVTLRVGSGELVSAAAIDTVKLHFGTSYILLDNLYIVQGFTR

Query:  NLGSISCLLEQCISVSFYGNKAFISRNGNLICSASLEHNLYVLKPNSVKSVLNTKLFKTAETGTKKAKVSPKENAYLWHLRLGHINLKRIEKLVKSGLLN
        NL S+ CLLEQ  S++F  NK FI +NG  ICSA LE+NLYVL+  + K++LNT++FKTA T  K+ K+SPKENA+LWHLRLGHINL RIE+LVK+GLL+
Subjt:  NLGSISCLLEQCISVSFYGNKAFISRNGNLICSASLEHNLYVLKPNSVKSVLNTKLFKTAETGTKKAKVSPKENAYLWHLRLGHINLKRIEKLVKSGLLN

Query:  ELEENFLSVCESCLLCMMTKRPFSGKGYRAKEPLKLVHSDLCCPMNVKARGGYEYFVSFIGDYSRYGYIYLMDRKSETLEKFKEYKTEVENLLGKSLKTL
        ELEEN L VCESCL   MTKRPF+GKG+RAKEPL+LVHSDLC PMNVKARGG+EYF++F  DYSRYGY+YLM  KSE LEKFKEYK EVEN L K++KT 
Subjt:  ELEENFLSVCESCLLCMMTKRPFSGKGYRAKEPLKLVHSDLCCPMNVKARGGYEYFVSFIGDYSRYGYIYLMDRKSETLEKFKEYKTEVENLLGKSLKTL

Query:  RSDRGGEYMDTEFQDYMIEHKITSQLSAPGMPQQNGVSERKNKSLLDMVRSMRSYARLPDSFWGYAVETAVYILNNVLSKSVCETPFEL
        RSDRGGEYMD +FQ+Y++E  I SQLSAPG PQQNGVSER+N++LLDMVRSM SYA LP+SFWGYAV+TAVYILN V SKSV ETP +L
Subjt:  RSDRGGEYMDTEFQDYMIEHKITSQLSAPGMPQQNGVSERKNKSLLDMVRSMRSYARLPDSFWGYAVETAVYILNNVLSKSVCETPFEL

A0A5A7V4M1 Gag/pol protein (e value: 8.5e-160; % identity: 54.33)
Query:  MSDVLAKKHELMVTANEIMESLQEMFGQQSIQVRHDSLKYVFNAWMKEGSSVREHVLDTMTHFNLAEMNGASIDESS-----------------------
        +S+VLAKKHE M+TA EIM+SLQEMFGQ S Q++HD+LKY++NA M EG+SVREHVL+ M HFN+AEMNGA IDE+S                       
Subjt:  MSDVLAKKHELMVTANEIMESLQEMFGQQSIQVRHDSLKYVFNAWMKEGSSVREHVLDTMTHFNLAEMNGASIDESS-----------------------

Query:  -------------QNFQSLNRVKASKFEANVA--YRSYHRGSTSRTKPVAPSRPKGKKRMTRG----KTDRAVAHKGKKART------------------
                     Q F+SL ++K  K EANVA   R +HRGSTS TK +  S    K +  +G    K + A A   KKA+                   
Subjt:  -------------QNFQSLNRVKASKFEANVA--YRSYHRGSTSRTKPVAPSRPKGKKRMTRG----KTDRAVAHKGKKART------------------

Query:  ---LQRKESVSTS-----------MGADTGRGIVPNSLPR---GRIKGIDSWQPLQEGEVTLRVGSGELVSAAAIDTVKLHFGTSYILLDNLYIVQGFTR
           L  K+                +  D    I+ +          +GI SW+ L+ GE+T+RVG+G +VSA A+  ++L    S++LL+N+Y+V    R
Subjt:  ---LQRKESVSTS-----------MGADTGRGIVPNSLPR---GRIKGIDSWQPLQEGEVTLRVGSGELVSAAAIDTVKLHFGTSYILLDNLYIVQGFTR

Query:  NLGSISCLLEQCISVSFYGNKAFISRNGNLICSASLEHNLYVLKPNSVKSVLNTKLFKTAETGTKKAKVSPKENAYLWHLRLGHINLKRIEKLVKSGLLN
        NL S+ CLLEQ  S++F  NK FI +NG  ICSA LE+NLYVL+  + K++LNT++FKTA T  K+ K+SPKENA+LWHLRLGHINL RIE+LVK+GLL+
Subjt:  NLGSISCLLEQCISVSFYGNKAFISRNGNLICSASLEHNLYVLKPNSVKSVLNTKLFKTAETGTKKAKVSPKENAYLWHLRLGHINLKRIEKLVKSGLLN

Query:  ELEENFLSVCESCLLCMMTKRPFSGKGYRAKEPLKLVHSDLCCPMNVKARGGYEYFVSFIGDYSRYGYIYLMDRKSETLEKFKEYKTEVENLLGKSLKTL
        ELEEN L VCESCL   MTKRPF+GKG+RAKEPL+LVHSDLC PMNVKARGG+EYF++F  DYSRYGY+YLM  KSE LEKFKEYK EVEN L K++KT 
Subjt:  ELEENFLSVCESCLLCMMTKRPFSGKGYRAKEPLKLVHSDLCCPMNVKARGGYEYFVSFIGDYSRYGYIYLMDRKSETLEKFKEYKTEVENLLGKSLKTL

Query:  RSDRGGEYMDTEFQDYMIEHKITSQLSAPGMPQQNGVSERKNKSLLDMVRSMRSYARLPDSFWGYAVETAVYILNNVLSKSVCETPFEL
        RSDRGGEYMD +FQ+Y++E  I SQLSAPG PQQNGVSER+N++LLDMVRSM SYA LP+SFWGYAV+TAVYILN V SKSV ETP +L
Subjt:  RSDRGGEYMDTEFQDYMIEHKITSQLSAPGMPQQNGVSERKNKSLLDMVRSMRSYARLPDSFWGYAVETAVYILNNVLSKSVCETPFEL

SwissProt top hits (e value, % identity, alignment)
P04146 Copia protein (e value: 6.5e-32; % identity: 30.24)
Query:  GELVSAAAIDTVKLHFGTSYILLDNLYIVQGFTRNLGSISCLLEQCISVSFYGNKAFISRNGNLICSASLEHNLYVLKPNSVKSVLNTKLFKTAETGTKK
        GE + A     V+L      I L+++   +    NL S+  L E  +S+ F  +   IS+NG           L V+K + + + +    F+      K 
Subjt:  GELVSAAAIDTVKLHFGTSYILLDNLYIVQGFTRNLGSISCLLEQCISVSFYGNKAFISRNGNLICSASLEHNLYVLKPNSVKSVLNTKLFKTAETGTKK

Query:  AKVSPKENAYLWHLRLGHIN------LKRIEKLVKSGLLNELEENFLSVCESCLLCMMTKRPFSGKGYRA--KEPLKLVHSDLCCPMNVKARGGYEYFVS
             K N  LWH R GHI+      +KR        LLN LE +   +CE CL     + PF     +   K PL +VHSD+C P+         YFV 
Subjt:  AKVSPKENAYLWHLRLGHIN------LKRIEKLVKSGLLNELEENFLSVCESCLLCMMTKRPFSGKGYRA--KEPLKLVHSDLCCPMNVKARGGYEYFVS

Query:  FIGDYSRYGYIYLMDRKSETLEKFKEYKTEVENLLGKSLKTLRSDRGGEYMDTEFQDYMIEHKITSQLSAPGMPQQNGVSERKNKSLLDMVRSMRSYARL
        F+  ++ Y   YL+  KS+    F+++  + E      +  L  D G EY+  E + + ++  I+  L+ P  PQ NGVSER  +++ +  R+M S A+L
Subjt:  FIGDYSRYGYIYLMDRKSETLEKFKEYKTEVENLLGKSLKTLRSDRGGEYMDTEFQDYMIEHKITSQLSAPGMPQQNGVSERKNKSLLDMVRSMRSYARL

Query:  PDSFWGYAVETAVYILNNVLSKSVCE---TPFEL
          SFWG AV TA Y++N + S+++ +   TP+E+
Subjt:  PDSFWGYAVETAVYILNNVLSKSVCE---TPFEL

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e value: 3.2e-39; % identity: 28.67)
Query:  LAEMNGASIDESSQNF-QSLNRVKASKFEANVAYRSYHRGSTSRTKPVAPSRPKGKKRMTRGKTD---RAVAHKGKKARTLQRKESVSTSMGADTGRGIV
        + E  G S   SS N+ +S  R K+     +     Y+       K   P+  KGK   +  K D    A+            +E     +       +V
Subjt:  LAEMNGASIDESSQNF-QSLNRVKASKFEANVAYRSYHRGSTSRTKPVAPSRPKGKKRMTRGKTD---RAVAHKGKKARTLQRKESVSTSMGADTGRGIV

Query:  PNSLPRGRIKGIDSWQPLQEGEV-TLRVGSGELVSAAAID--TVKLHFGTSYILLDNLYIVQGFTRNLGSISCLLEQCISVSFYGNKAFISRNGNLICSA
          +         D +     G+  T+++G+      A I    +K + G + +L D  + V     NL S    L++    S++ N+ +    G+L+ + 
Subjt:  PNSLPRGRIKGIDSWQPLQEGEV-TLRVGSGELVSAAAID--TVKLHFGTSYILLDNLYIVQGFTRNLGSISCLLEQCISVSFYGNKAFISRNGNLICSA

Query:  SLEHNLYVLKPNSVKSVLNTKLFKT-AETGTKKAKVSPKE-NAYLWHLRLGHINLKRIEKLVKSGLLNELEENFLSVCESCLLCMMTKRPFSGKGYRAKE
                      K V    L++T AE    +   +  E +  LWH R+GH++ K ++ L K  L++  +   +  C+ CL     +  F     R   
Subjt:  SLEHNLYVLKPNSVKSVLNTKLFKT-AETGTKKAKVSPKE-NAYLWHLRLGHINLKRIEKLVKSGLLNELEENFLSVCESCLLCMMTKRPFSGKGYRAKE

Query:  PLKLVHSDLCCPMNVKARGGYEYFVSFIGDYSRYGYIYLMDRKSETLEKFKEYKTEVENLLGKSLKTLRSDRGGEYMDTEFQDYMIEHKITSQLSAPGMP
         L LV+SD+C PM +++ GG +YFV+FI D SR  ++Y++  K +  + F+++   VE   G+ LK LRSD GGEY   EF++Y   H I  + + PG P
Subjt:  PLKLVHSDLCCPMNVKARGGYEYFVSFIGDYSRYGYIYLMDRKSETLEKFKEYKTEVENLLGKSLKTLRSDRGGEYMDTEFQDYMIEHKITSQLSAPGMP

Query:  QQNGVSERKNKSLLDMVRSMRSYARLPDSFWGYAVETAVYILN
        Q NGV+ER N+++++ VRSM   A+LP SFWG AV+TA Y++N
Subjt:  QQNGVSERKNKSLLDMVRSMRSYARLPDSFWGYAVETAVYILN

P25384 Transposon Ty2-C Gag-Pol polyprotein (e value: 2.8e-19; % identity: 27.21)
Query:  NLGSISCLLEQCISVSFYGNKAFISRNGNLICSASLEH-NLYVLKPNSVKSVLNTKLFKTAETGTKKAKVSPKENAYLWHLRLGHINLKRIEKLVKSGLL
        +L S+S L  Q I+  F  N   + R+   + +  ++H + Y L   S K ++ + + K       K+K   K    L H  LGH N + I+K +K   +
Subjt:  NLGSISCLLEQCISVSFYGNKAFISRNGNLICSASLEH-NLYVLKPNSVKSVLNTKLFKTAETGTKKAKVSPKENAYLWHLRLGHINLKRIEKLVKSGLL

Query:  NELEENFLS-------VCESCLLCMMTKRPFSGKGYRAK-----EPLKLVHSDLCCPMNVKARGGYEYFVSFIGDYSRYGYIY-LMDRKSET-LEKFKEY
          L+E+ +         C  CL+   TK     KG R K     EP + +H+D+  P++   +    YF+SF  + +R+ ++Y L DR+ E+ L  F   
Subjt:  NELEENFLS-------VCESCLLCMMTKRPFSGKGYRAK-----EPLKLVHSDLCCPMNVKARGGYEYFVSFIGDYSRYGYIY-LMDRKSET-LEKFKEY

Query:  KTEVENLLGKSLKTLRSDRGGEYMDTEFQDYMIEHKITSQLSAPGMPQQNGVSERKNKSLLDMVRSMRSYARLPDSFWGYAVETAVYILNNVLS
           ++N     +  ++ DRG EY +     +     IT+  +     + +GV+ER N++LL+  R++   + LP+  W  AVE +  I N+++S
Subjt:  KTEVENLLGKSLKTLRSDRGGEYMDTEFQDYMIEHKITSQLSAPGMPQQNGVSERKNKSLLDMVRSMRSYARLPDSFWGYAVETAVYILNNVLS

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 (e value: 1.1e-26; % identity: 31.11)
Query:  AKVSPKENAYLWHLRLGHINLKRIEKLVKSGLLNEL--EENFLSVCESCLLCMMTKRPFSGKGYRAKEPLKLVHSDLCCPMNVKARGGYEYFVSFIGDYS
        A  S K     WH RLGH     +  ++ +  L+ L     FLS C  CL+    K PFS     +  PL+ ++SD+     + +   Y Y+V F+  ++
Subjt:  AKVSPKENAYLWHLRLGHINLKRIEKLVKSGLLNEL--EENFLSVCESCLLCMMTKRPFSGKGYRAKEPLKLVHSDLCCPMNVKARGGYEYFVSFIGDYS

Query:  RYGYIYLMDRKSETLEKFKEYKTEVENLLGKSLKTLRSDRGGEYMDTEFQDYMIEHKITSQLSAPGMPQQNGVSERKNKSLLDMVRSMRSYARLPDSFWG
        RY ++Y + +KS+  E F  +K  +EN     + T  SD GGE++     +Y  +H I+   S P  P+ NG+SERK++ +++   ++ S+A +P ++W 
Subjt:  RYGYIYLMDRKSETLEKFKEYKTEVENLLGKSLKTLRSDRGGEYMDTEFQDYMIEHKITSQLSAPGMPQQNGVSERKNKSLLDMVRSMRSYARLPDSFWG

Query:  YAVETAVYILNNVLSKSV-CETPFE
        YA   AVY++N + +  +  E+PF+
Subjt:  YAVETAVYILNNVLSKSV-CETPFE

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 (e value: 1.3e-27; % identity: 31.46)
Query:  WHLRLGHINLKRIEKLVKSGLLNELEENF-LSVCESCLLCMMTKRPFSGKGYRAKEPLKLVHSDLCCPMNVKARGGYEYFVSFIGDYSRYGYIYLMDRKS
        WH RLGH +L  +  ++ +  L  L  +  L  C  C +    K PFS     + +PL+ ++SD+     + +   Y Y+V F+  ++RY ++Y + +KS
Subjt:  WHLRLGHINLKRIEKLVKSGLLNELEENF-LSVCESCLLCMMTKRPFSGKGYRAKEPLKLVHSDLCCPMNVKARGGYEYFVSFIGDYSRYGYIYLMDRKS

Query:  ETLEKFKEYKTEVENLLGKSLKTLRSDRGGEYMDTEFQDYMIEHKITSQLSAPGMPQQNGVSERKNKSLLDMVRSMRSYARLPDSFWGYAVETAVYILNN
        +  + F  +K+ VEN     + TL SD GGE++    +DY+ +H I+   S P  P+ NG+SERK++ +++M  ++ S+A +P ++W YA   AVY++N 
Subjt:  ETLEKFKEYKTEVENLLGKSLKTLRSDRGGEYMDTEFQDYMIEHKITSQLSAPGMPQQNGVSERKNKSLLDMVRSMRSYARLPDSFWGYAVETAVYILNN

Query:  VLSKSV-CETPFE
        + +  +  ++PF+
Subjt:  VLSKSV-CETPFE

Arabidopsis top hits (e value, % identity, alignment)
ATMG00300.1 Gag-Pol-related retrotransposon family protein (e value: 6.1e-09; % identity: 36.14)
Query:  TAETGTKKAKVSPKENAYLWHLRLGHINLKRIEKLVKSGLLNELEENFLSVCESCLLCMMTKRPFSGKGYRAKEPLKLVHSDL
        + ETG      + K+   LWH RL H++ + +E LVK G L+  + + L  CE C+     +  FS   +  K PL  VHSDL
Subjt:  TAETGTKKAKVSPKENAYLWHLRLGHINLKRIEKLVKSGLLNELEENFLSVCESCLLCMMTKRPFSGKGYRAKEPLKLVHSDL


Sequences
CDS sequence
ATGTCTGATGTTCTGGCAAAGAAGCATGAGCTGATGGTCACCGCCAATGAGATCATGGAGTCCTTGCAGGAAATGTTTGGACAACAGTCCATTCAGGTCCGGCATGACTC
GCTCAAATACGTCTTCAATGCATGGATGAAAGAGGGGTCGTCTGTCCGTGAACATGTTCTAGATACGATGACCCACTTTAATCTGGCTGAGATGAACGGGGCTTCGATCG
ATGAGTCGAGCCAAAATTTCCAGTCCTTGAACAGGGTCAAGGCATCGAAATTTGAGGCAAATGTTGCGTACAGGTCTTATCACAGGGGTTCGACCTCTAGGACGAAACCT
GTTGCTCCTTCACGCCCGAAAGGGAAGAAGAGGATGACGAGGGGTAAAACTGACCGAGCTGTCGCCCACAAGGGCAAGAAGGCAAGGACGTTGCAGAGAAAGGAAAGTGT
TTCCACTTCGATGGGGGCAGACACTGGAAGAGGAATTGTCCCAAATTCCTTACCGAGAGGAAGAATCAAGGGGATTGATTCCTGGCAGCCGCTGCAAGAGGGTGAGGTGA
CTCTACGGGTTGGATCCGGAGAGCTTGTCTCTGCTGCAGCGATCGACACGGTCAAACTACACTTTGGCACGAGCTACATTTTGTTGGACAATTTGTACATAGTTCAAGGG
TTTACTAGAAACCTAGGTTCTATTTCCTGCCTTTTAGAACAGTGTATTTCCGTTTCCTTTTATGGTAATAAAGCGTTTATTTCCAGAAATGGTAATCTTATTTGTTCTGC
TTCACTTGAGCATAATCTGTATGTTTTGAAACCTAATTCGGTCAAAAGTGTTTTGAATACTAAATTGTTTAAAACTGCAGAAACAGGAACTAAGAAAGCGAAAGTTTCTC
CTAAAGAAAATGCCTATCTTTGGCATCTACGGTTAGGCCACATTAATCTCAAGAGGATTGAGAAACTAGTGAAGAGTGGACTTCTAAACGAGTTGGAAGAAAACTTTTTG
TCGGTGTGTGAGTCATGCCTTTTGTGCATGATGACCAAACGTCCTTTTAGTGGAAAAGGATATAGAGCCAAAGAGCCTCTTAAGTTAGTACATTCTGACCTCTGTTGTCC
GATGAATGTTAAAGCTCGGGGCGGTTATGAGTACTTCGTGTCTTTCATAGGCGATTACTCGAGGTATGGATATATTTACCTAATGGACAGGAAATCTGAAACTCTTGAAA
AGTTCAAGGAGTACAAGACTGAGGTTGAGAACCTCTTAGGTAAATCACTTAAAACACTTCGATCGGATCGAGGTGGAGAGTACATGGACACTGAATTTCAGGACTATATG
ATAGAACACAAAATTACGTCGCAACTCTCAGCCCCTGGTATGCCACAGCAGAATGGTGTATCGGAGAGGAAAAACAAAAGCTTGTTGGACATGGTTCGGTCGATGAGGAG
CTATGCTCGTCTCCCTGATTCTTTTTGGGGTTACGCAGTGGAGACTGCGGTCTATATTTTGAACAATGTTCTGTCGAAGAGTGTTTGTGAAACACCTTTCGAGCTCTAG
mRNA sequence
ATGTCTGATGTTCTGGCAAAGAAGCATGAGCTGATGGTCACCGCCAATGAGATCATGGAGTCCTTGCAGGAAATGTTTGGACAACAGTCCATTCAGGTCCGGCATGACTC
GCTCAAATACGTCTTCAATGCATGGATGAAAGAGGGGTCGTCTGTCCGTGAACATGTTCTAGATACGATGACCCACTTTAATCTGGCTGAGATGAACGGGGCTTCGATCG
ATGAGTCGAGCCAAAATTTCCAGTCCTTGAACAGGGTCAAGGCATCGAAATTTGAGGCAAATGTTGCGTACAGGTCTTATCACAGGGGTTCGACCTCTAGGACGAAACCT
GTTGCTCCTTCACGCCCGAAAGGGAAGAAGAGGATGACGAGGGGTAAAACTGACCGAGCTGTCGCCCACAAGGGCAAGAAGGCAAGGACGTTGCAGAGAAAGGAAAGTGT
TTCCACTTCGATGGGGGCAGACACTGGAAGAGGAATTGTCCCAAATTCCTTACCGAGAGGAAGAATCAAGGGGATTGATTCCTGGCAGCCGCTGCAAGAGGGTGAGGTGA
CTCTACGGGTTGGATCCGGAGAGCTTGTCTCTGCTGCAGCGATCGACACGGTCAAACTACACTTTGGCACGAGCTACATTTTGTTGGACAATTTGTACATAGTTCAAGGG
TTTACTAGAAACCTAGGTTCTATTTCCTGCCTTTTAGAACAGTGTATTTCCGTTTCCTTTTATGGTAATAAAGCGTTTATTTCCAGAAATGGTAATCTTATTTGTTCTGC
TTCACTTGAGCATAATCTGTATGTTTTGAAACCTAATTCGGTCAAAAGTGTTTTGAATACTAAATTGTTTAAAACTGCAGAAACAGGAACTAAGAAAGCGAAAGTTTCTC
CTAAAGAAAATGCCTATCTTTGGCATCTACGGTTAGGCCACATTAATCTCAAGAGGATTGAGAAACTAGTGAAGAGTGGACTTCTAAACGAGTTGGAAGAAAACTTTTTG
TCGGTGTGTGAGTCATGCCTTTTGTGCATGATGACCAAACGTCCTTTTAGTGGAAAAGGATATAGAGCCAAAGAGCCTCTTAAGTTAGTACATTCTGACCTCTGTTGTCC
GATGAATGTTAAAGCTCGGGGCGGTTATGAGTACTTCGTGTCTTTCATAGGCGATTACTCGAGGTATGGATATATTTACCTAATGGACAGGAAATCTGAAACTCTTGAAA
AGTTCAAGGAGTACAAGACTGAGGTTGAGAACCTCTTAGGTAAATCACTTAAAACACTTCGATCGGATCGAGGTGGAGAGTACATGGACACTGAATTTCAGGACTATATG
ATAGAACACAAAATTACGTCGCAACTCTCAGCCCCTGGTATGCCACAGCAGAATGGTGTATCGGAGAGGAAAAACAAAAGCTTGTTGGACATGGTTCGGTCGATGAGGAG
CTATGCTCGTCTCCCTGATTCTTTTTGGGGTTACGCAGTGGAGACTGCGGTCTATATTTTGAACAATGTTCTGTCGAAGAGTGTTTGTGAAACACCTTTCGAGCTCTAG
Protein sequence
MSDVLAKKHELMVTANEIMESLQEMFGQQSIQVRHDSLKYVFNAWMKEGSSVREHVLDTMTHFNLAEMNGASIDESSQNFQSLNRVKASKFEANVAYRSYHRGSTSRTKP
VAPSRPKGKKRMTRGKTDRAVAHKGKKARTLQRKESVSTSMGADTGRGIVPNSLPRGRIKGIDSWQPLQEGEVTLRVGSGELVSAAAIDTVKLHFGTSYILLDNLYIVQG
FTRNLGSISCLLEQCISVSFYGNKAFISRNGNLICSASLEHNLYVLKPNSVKSVLNTKLFKTAETGTKKAKVSPKENAYLWHLRLGHINLKRIEKLVKSGLLNELEENFL
SVCESCLLCMMTKRPFSGKGYRAKEPLKLVHSDLCCPMNVKARGGYEYFVSFIGDYSRYGYIYLMDRKSETLEKFKEYKTEVENLLGKSLKTLRSDRGGEYMDTEFQDYM
IEHKITSQLSAPGMPQQNGVSERKNKSLLDMVRSMRSYARLPDSFWGYAVETAVYILNNVLSKSVCETPFEL
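
The CDS and protein sequences listed above can be cross-checked by translating the CDS. The following is a minimal sketch, assuming the standard nuclear genetic code applies to this gene; the `translate` helper and the `CODON_TABLE` name are illustrative and not part of CuGenDB. Only the first ten codons and the last five codons of the CDS are used here to keep the example short.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for pos in range(0, len(seq), 3):
        codon = seq[pos:pos + 3]
        if codon not in CODON_TABLE:
            raise ValueError(f"unexpected codon {codon!r} at position {pos}")
        residue = CODON_TABLE[codon]
        if residue == "*":  # stop codon ends the protein
            break
        protein.append(residue)
    return "".join(protein)


# First 30 and last 15 nucleotides of the CDS shown in this record.
cds_start = "ATGTCTGATGTTCTGGCAAAGAAGCATGAG"
cds_end = "CCTTTCGAGCTCTAG"
print(translate(cds_start))
print(translate(cds_end))
```

Pasting the full CDS into `translate` and comparing the result with the protein sequence in the record would extend the same check to the whole gene.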