; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0009406 (gene) of Snake gourd v1 genome

Gene IDTan0009406
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionGag/pol protein
Genome locationLG07:48965307..48967434
RNA-Seq ExpressionTan0009406
SyntenyTan0009406
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0025945.1 gag/pol protein [Cucumis melo var. makuwa]7.5e-14070.62Show/hide
Query:  HINLNRIEKLVKSGLLSELEENSLLVCESCLEGKMTIRPFSGKGYRANEPLELIHYDLCGPMNVKARGGYEYFISFIDDYSSYGYIYLMHEKSETFEKFK
        HINL+RI +LVK+GLL++L++ SL  CESCLEGKMT RPF+GKGYRA EPLELIH DLCGPMNVKARGG+EYFISFIDDYS YGY+YLM  KSE  EKFK
Subjt:  HINLNRIEKLVKSGLLSELEENSLLVCESCLEGKMTIRPFSGKGYRANEPLELIHYDLCGPMNVKARGGYEYFISFIDDYSSYGYIYLMHEKSETFEKFK

Query:  EFKTEVENLLGKTIKTLRSDRGGEYMDTEFQDYMIEHGITFQLSAPGMPQQNGVSKRRNRTLLDMVRSMMSYARLPDSFWGYAMETAVCILNTVLSKSVC
        E+KTEVENLL K IK LRSDRGGEYMD  FQDYMIEHGI  QLSAPG PQQNGVS+RRNRTLLDMVRSMMSYA+LP SFWGYA+ETAV ILN V SKSV 
Subjt:  EFKTEVENLLGKTIKTLRSDRGGEYMDTEFQDYMIEHGITFQLSAPGMPQQNGVSKRRNRTLLDMVRSMMSYARLPDSFWGYAMETAVCILNTVLSKSVC

Query:  ERPFELWHGRKTSLRHFRFGDVRPMCWCQTRRKRKLEPRSKLCLFVSYPKETRGGYFYDPKDNRVIVSTNVTFFEEDHIRDHLPRSKIVLNEMDGTSTRV
        E PFELW GRK SL HFR     P     T  K KLEPRS+LC FV YPKETRGG F+DP++NRV VSTN TF EEDH+R+H PRSK+VL+E    STRV
Subjt:  ERPFELWHGRKTSLRHFRFGDVRPMCWCQTRRKRKLEPRSKLCLFVSYPKETRGGYFYDPKDNRVIVSTNVTFFEEDHIRDHLPRSKIVLNEMDGTSTRV

Query:  ADGASTSTSVVDPSSSSQVR-SQELRMPRRSGRVVRQPERYMGLAETQVVTVDDECEDPLTYVRQWQTLTK
         D    S+ V + ++S Q   SQ LRMPRRSGRVV QP RY+GL ETQVV  DD  EDPL+Y +    + K
Subjt:  ADGASTSTSVVDPSSSSQVR-SQELRMPRRSGRVVRQPERYMGLAETQVVTVDDECEDPLTYVRQWQTLTK

KAA0059226.1 gag/pol protein [Cucumis melo var. makuwa]7.5e-14070.62Show/hide
Query:  HINLNRIEKLVKSGLLSELEENSLLVCESCLEGKMTIRPFSGKGYRANEPLELIHYDLCGPMNVKARGGYEYFISFIDDYSSYGYIYLMHEKSETFEKFK
        HINL+RI +LVK+GLL++L++ SL  CESCLEGKMT RPF+GKGYRA EPLELIH DLCGPMNVKARGG+EYFISFIDDYS YGY+YLM  KSE  EKFK
Subjt:  HINLNRIEKLVKSGLLSELEENSLLVCESCLEGKMTIRPFSGKGYRANEPLELIHYDLCGPMNVKARGGYEYFISFIDDYSSYGYIYLMHEKSETFEKFK

Query:  EFKTEVENLLGKTIKTLRSDRGGEYMDTEFQDYMIEHGITFQLSAPGMPQQNGVSKRRNRTLLDMVRSMMSYARLPDSFWGYAMETAVCILNTVLSKSVC
        E+KTEVENLL K IK LRSDRGGEYMD  FQDYMIEHGI  QLSAPG PQQNGVS+RRNRTLLDMVRSMMSYA+LP SFWGYA+ETAV ILN V SKSV 
Subjt:  EFKTEVENLLGKTIKTLRSDRGGEYMDTEFQDYMIEHGITFQLSAPGMPQQNGVSKRRNRTLLDMVRSMMSYARLPDSFWGYAMETAVCILNTVLSKSVC

Query:  ERPFELWHGRKTSLRHFRFGDVRPMCWCQTRRKRKLEPRSKLCLFVSYPKETRGGYFYDPKDNRVIVSTNVTFFEEDHIRDHLPRSKIVLNEMDGTSTRV
        E PFELW GRK SL HFR     P     T  K KLEPRS+LC FV YPKETRGG F+DP++NRV VSTN TF EEDH+R+H PRSK+VL+E    STRV
Subjt:  ERPFELWHGRKTSLRHFRFGDVRPMCWCQTRRKRKLEPRSKLCLFVSYPKETRGGYFYDPKDNRVIVSTNVTFFEEDHIRDHLPRSKIVLNEMDGTSTRV

Query:  ADGASTSTSVVDPSSSSQVR-SQELRMPRRSGRVVRQPERYMGLAETQVVTVDDECEDPLTYVRQWQTLTK
         D    S+ V + ++S Q   SQ LRMPRRSGRVV QP RY+GL ETQVV  DD  EDPL+Y +    + K
Subjt:  ADGASTSTSVVDPSSSSQVR-SQELRMPRRSGRVVRQPERYMGLAETQVVTVDDECEDPLTYVRQWQTLTK

KAA0060254.1 gag/pol protein [Cucumis melo var. makuwa]5.7e-14059.96Show/hide
Query:  TRGQEVEASVA--LRPFHKGSTFRIKFVDSSGPKGKKEKLK---EVKVDRAAALKGKEIKDVARKCFYCNEGEHLKRSFP-----------SHINLNRIE
        ++GQ+ EA+VA   R FH+GST   K + SS    K +K K     K + AAA   K+ K     CF+ N+  H KR+ P            HINLNRIE
Subjt:  TRGQEVEASVA--LRPFHKGSTFRIKFVDSSGPKGKKEKLK---EVKVDRAAALKGKEIKDVARKCFYCNEGEHLKRSFP-----------SHINLNRIE

Query:  KLVKSGLLSELEENSLLVCESCLEGKMTIRPFSGKGYRANEPLELIHYDLCGPMNVKARGGYEYFISFIDDYSSYGYIYLMHEKSETFEKFKEFKTEVEN
        +LVK+G+LSELEENSL +CESCLEGKMT RPF+GKG+RA EPLEL+H DLCGPMNVKARG +EYFI+F DDYS YGY+YLM  KSE  EKFKE+K EVEN
Subjt:  KLVKSGLLSELEENSLLVCESCLEGKMTIRPFSGKGYRANEPLELIHYDLCGPMNVKARGGYEYFISFIDDYSSYGYIYLMHEKSETFEKFKEFKTEVEN

Query:  LLGKTIKTLRSDRGGEYMDTEFQDYMIEHGITFQLSAPGMPQQNGVSKRRNRTLLDMVRSMMSYARLPDSFWGYAMETAVCILNTVLSKSVCERPFELWH
         L KTIKT RSDRGGEYMD +FQ+Y++E  I  QLSAPG PQQNGVS+RRNRTLLDMVRSM+SYA LP+SFWGYA++TAV ILN V SKSV E P +LW+
Subjt:  LLGKTIKTLRSDRGGEYMDTEFQDYMIEHGITFQLSAPGMPQQNGVSKRRNRTLLDMVRSMMSYARLPDSFWGYAMETAVCILNTVLSKSVCERPFELWH

Query:  GRKTSLRHFRFGDVRPMCWCQTRRKRKLEPRSKLCLFVSYPKETRGGYFYDPKDNRVIVSTNVTFFEEDHIRDHLPRSKIVLN----EMDGTSTRVADGA
        GRK SLRHFR               +KLEPRSKLCLFV YPK TRGGYFYD KDN+V V TN TF E+DHIR+H PRSKIVLN    E+   STRV + +
Subjt:  GRKTSLRHFRFGDVRPMCWCQTRRKRKLEPRSKLCLFVSYPKETRGGYFYDPKDNRVIVSTNVTFFEEDHIRDHLPRSKIVLN----EMDGTSTRVADGA

Query:  STSTSVVDPSSSSQV-RSQELRMPRRSGRVVRQPERYMGLAETQVVTVDDECEDPLTYVRQWQTLTK
        S  T VV   SS++  + Q LR PRRSGRV   P RYM L ET  V  D + EDPLT+ +  + + K
Subjt:  STSTSVVDPSSSSQV-RSQELRMPRRSGRVVRQPERYMGLAETQVVTVDDECEDPLTYVRQWQTLTK

TYJ96910.1 gag/pol protein [Cucumis melo var. makuwa]7.7e-14557.94Show/hide
Query:  TVIFDNLQTFQSLVQTRGQEVEASVA--LRPFHKGSTFRIKFVDSSGPKGKKEKLK---EVKVDRAAALKGKEIKDVARKCFYCNEGEHLKRSFPS----
        T + + LQTF+SL++ +GQ+ EA+VA   R FH+GST   K + SS    K +K K   + KV+ AAA   K+ K      F+CN+  H KR+ P     
Subjt:  TVIFDNLQTFQSLVQTRGQEVEASVA--LRPFHKGSTFRIKFVDSSGPKGKKEKLK---EVKVDRAAALKGKEIKDVARKCFYCNEGEHLKRSFPS----

Query:  -----------------------------HINLNRIEKLVKSGLLSELEENSLLVCESCLEGKMTIRPFSGKGYRANEPLELIHYDLCGPMNVKARGGYE
                                     HINLNRIE+LV++GLLSELEEN L VCESCLEGKMT RPF+GKG+RA EPLEL+H DLCGPMNVKARGG+E
Subjt:  -----------------------------HINLNRIEKLVKSGLLSELEENSLLVCESCLEGKMTIRPFSGKGYRANEPLELIHYDLCGPMNVKARGGYE

Query:  YFISFIDDYSSYGYIYLMHEKSETFEKFKEFKTEVENLLGKTIKTLRSDRGGEYMDTEFQDYMIEHGITFQLSAPGMPQQNGVSKRRNRTLLDMVRSMMS
        YFI+F DDYS YGY+YLM  KSE  EKFKE+K EVEN L KTIKT RSDRGGEYMD +FQ+Y++E GI  QLSAPG PQQNGVS+RRN+TLLDMV SMMS
Subjt:  YFISFIDDYSSYGYIYLMHEKSETFEKFKEFKTEVENLLGKTIKTLRSDRGGEYMDTEFQDYMIEHGITFQLSAPGMPQQNGVSKRRNRTLLDMVRSMMS

Query:  YARLPDSFWGYAMETAVCILNTVLSKSVCERPFELWHGRKTSLRHFRFGDVRPMCWCQTRRKRKLEPRSKLCLFVSYPKETRGGYFYDPKDNRVIVSTNV
        YA LP+SFWGYA++TAV ILN V SKSV E P +LW+GRK SL HFR     P    +   K KLEPRSKLCLFV Y K +RGGYFYDPKDN+V+VSTN 
Subjt:  YARLPDSFWGYAMETAVCILNTVLSKSVCERPFELWHGRKTSLRHFRFGDVRPMCWCQTRRKRKLEPRSKLCLFVSYPKETRGGYFYDPKDNRVIVSTNV

Query:  TFFEEDHIRDHLPRSKIVLNEMDG----TSTRVADGASTSTSVVDPSSSSQV-RSQELRMPRRSGRVVRQPERYMGLAETQVVTVDDECEDPLTYVRQWQ
        TF EEDHIR+H PRSKIVLNE+       STRV +  S  TSVV   SS++  + Q LR PRRSGRV   P RYM L ET  V  D + EDPLT+ +  +
Subjt:  TFFEEDHIRDHLPRSKIVLNEMDG----TSTRVADGASTSTSVVDPSSSSQV-RSQELRMPRRSGRVVRQPERYMGLAETQVVTVDDECEDPLTYVRQWQ

Query:  TLTK
         + K
Subjt:  TLTK

TYJ97618.1 gag/pol protein [Cucumis melo var. makuwa]1.2e-14260.6Show/hide
Query:  TRGQEVEASVA--LRPFHKGSTFRIKFVDSSGPKGKKEKLK---EVKVDRAAALKGKEIKDVARKCFYCNEGEHLKRSFP-----------SHINLNRIE
        ++GQ+ EA+VA   R FH+GST   K + SS    K +K K     K + AAA   K+ K     CF+ N+  H KR+ P            HINLNRIE
Subjt:  TRGQEVEASVA--LRPFHKGSTFRIKFVDSSGPKGKKEKLK---EVKVDRAAALKGKEIKDVARKCFYCNEGEHLKRSFP-----------SHINLNRIE

Query:  KLVKSGLLSELEENSLLVCESCLEGKMTIRPFSGKGYRANEPLELIHYDLCGPMNVKARGGYEYFISFIDDYSSYGYIYLMHEKSETFEKFKEFKTEVEN
        +LVK+G+LSELEENSL +CESCLEGKMT RPF+GKG+RA EPLEL+H DLCGPMNVKARG +EYFI+F DDYS YGY+YLM  KSE  EKFKE+K EVEN
Subjt:  KLVKSGLLSELEENSLLVCESCLEGKMTIRPFSGKGYRANEPLELIHYDLCGPMNVKARGGYEYFISFIDDYSSYGYIYLMHEKSETFEKFKEFKTEVEN

Query:  LLGKTIKTLRSDRGGEYMDTEFQDYMIEHGITFQLSAPGMPQQNGVSKRRNRTLLDMVRSMMSYARLPDSFWGYAMETAVCILNTVLSKSVCERPFELWH
         L KTIKT RSDRGGEYMD +FQ+Y++E  I  QLSAPG PQQNGVS+RRNRTLLDMVRSM+SYA LP+SFWGYA++TAV ILN V SKSV E P +LW+
Subjt:  LLGKTIKTLRSDRGGEYMDTEFQDYMIEHGITFQLSAPGMPQQNGVSKRRNRTLLDMVRSMMSYARLPDSFWGYAMETAVCILNTVLSKSVCERPFELWH

Query:  GRKTSLRHFRFGDVRPMCWCQTRRKRKLEPRSKLCLFVSYPKETRGGYFYDPKDNRVIVSTNVTFFEEDHIRDHLPRSKIVLNEMD----GTSTRVADGA
        GRK SLRHFR               +KLEPRSKLCLFV YPK TRGGYFYDPKDN+V VSTN TF EEDHIR+H PRSKIVLNE+       STRV +  
Subjt:  GRKTSLRHFRFGDVRPMCWCQTRRKRKLEPRSKLCLFVSYPKETRGGYFYDPKDNRVIVSTNVTFFEEDHIRDHLPRSKIVLNEMD----GTSTRVADGA

Query:  STSTSVVDPSSSSQV-RSQELRMPRRSGRVVRQPERYMGLAETQVVTVDDECEDPLTYVRQWQTLTK
        S  T VV   SS++  + Q LR PRRSGRV   P RYM L ET  V  D + EDPLT+ +  + + K
Subjt:  STSTSVVDPSSSSQV-RSQELRMPRRSGRVVRQPERYMGLAETQVVTVDDECEDPLTYVRQWQTLTK

TrEMBL top hitse value%identityAlignment
A0A5A7TZD0 Gag/pol protein3.6e-14070.62Show/hide
Query:  HINLNRIEKLVKSGLLSELEENSLLVCESCLEGKMTIRPFSGKGYRANEPLELIHYDLCGPMNVKARGGYEYFISFIDDYSSYGYIYLMHEKSETFEKFK
        HINL+RI +LVK+GLL++L++ SL  CESCLEGKMT RPF+GKGYRA EPLELIH DLCGPMNVKARGG+EYFISFIDDYS YGY+YLM  KSE  EKFK
Subjt:  HINLNRIEKLVKSGLLSELEENSLLVCESCLEGKMTIRPFSGKGYRANEPLELIHYDLCGPMNVKARGGYEYFISFIDDYSSYGYIYLMHEKSETFEKFK

Query:  EFKTEVENLLGKTIKTLRSDRGGEYMDTEFQDYMIEHGITFQLSAPGMPQQNGVSKRRNRTLLDMVRSMMSYARLPDSFWGYAMETAVCILNTVLSKSVC
        E+KTEVENLL K IK LRSDRGGEYMD  FQDYMIEHGI  QLSAPG PQQNGVS+RRNRTLLDMVRSMMSYA+LP SFWGYA+ETAV ILN V SKSV 
Subjt:  EFKTEVENLLGKTIKTLRSDRGGEYMDTEFQDYMIEHGITFQLSAPGMPQQNGVSKRRNRTLLDMVRSMMSYARLPDSFWGYAMETAVCILNTVLSKSVC

Query:  ERPFELWHGRKTSLRHFRFGDVRPMCWCQTRRKRKLEPRSKLCLFVSYPKETRGGYFYDPKDNRVIVSTNVTFFEEDHIRDHLPRSKIVLNEMDGTSTRV
        E PFELW GRK SL HFR     P     T  K KLEPRS+LC FV YPKETRGG F+DP++NRV VSTN TF EEDH+R+H PRSK+VL+E    STRV
Subjt:  ERPFELWHGRKTSLRHFRFGDVRPMCWCQTRRKRKLEPRSKLCLFVSYPKETRGGYFYDPKDNRVIVSTNVTFFEEDHIRDHLPRSKIVLNEMDGTSTRV

Query:  ADGASTSTSVVDPSSSSQVR-SQELRMPRRSGRVVRQPERYMGLAETQVVTVDDECEDPLTYVRQWQTLTK
         D    S+ V + ++S Q   SQ LRMPRRSGRVV QP RY+GL ETQVV  DD  EDPL+Y +    + K
Subjt:  ADGASTSTSVVDPSSSSQVR-SQELRMPRRSGRVVRQPERYMGLAETQVVTVDDECEDPLTYVRQWQTLTK

A0A5A7UYX7 Gag/pol protein2.8e-14059.96Show/hide
Query:  TRGQEVEASVA--LRPFHKGSTFRIKFVDSSGPKGKKEKLK---EVKVDRAAALKGKEIKDVARKCFYCNEGEHLKRSFP-----------SHINLNRIE
        ++GQ+ EA+VA   R FH+GST   K + SS    K +K K     K + AAA   K+ K     CF+ N+  H KR+ P            HINLNRIE
Subjt:  TRGQEVEASVA--LRPFHKGSTFRIKFVDSSGPKGKKEKLK---EVKVDRAAALKGKEIKDVARKCFYCNEGEHLKRSFP-----------SHINLNRIE

Query:  KLVKSGLLSELEENSLLVCESCLEGKMTIRPFSGKGYRANEPLELIHYDLCGPMNVKARGGYEYFISFIDDYSSYGYIYLMHEKSETFEKFKEFKTEVEN
        +LVK+G+LSELEENSL +CESCLEGKMT RPF+GKG+RA EPLEL+H DLCGPMNVKARG +EYFI+F DDYS YGY+YLM  KSE  EKFKE+K EVEN
Subjt:  KLVKSGLLSELEENSLLVCESCLEGKMTIRPFSGKGYRANEPLELIHYDLCGPMNVKARGGYEYFISFIDDYSSYGYIYLMHEKSETFEKFKEFKTEVEN

Query:  LLGKTIKTLRSDRGGEYMDTEFQDYMIEHGITFQLSAPGMPQQNGVSKRRNRTLLDMVRSMMSYARLPDSFWGYAMETAVCILNTVLSKSVCERPFELWH
         L KTIKT RSDRGGEYMD +FQ+Y++E  I  QLSAPG PQQNGVS+RRNRTLLDMVRSM+SYA LP+SFWGYA++TAV ILN V SKSV E P +LW+
Subjt:  LLGKTIKTLRSDRGGEYMDTEFQDYMIEHGITFQLSAPGMPQQNGVSKRRNRTLLDMVRSMMSYARLPDSFWGYAMETAVCILNTVLSKSVCERPFELWH

Query:  GRKTSLRHFRFGDVRPMCWCQTRRKRKLEPRSKLCLFVSYPKETRGGYFYDPKDNRVIVSTNVTFFEEDHIRDHLPRSKIVLN----EMDGTSTRVADGA
        GRK SLRHFR               +KLEPRSKLCLFV YPK TRGGYFYD KDN+V V TN TF E+DHIR+H PRSKIVLN    E+   STRV + +
Subjt:  GRKTSLRHFRFGDVRPMCWCQTRRKRKLEPRSKLCLFVSYPKETRGGYFYDPKDNRVIVSTNVTFFEEDHIRDHLPRSKIVLN----EMDGTSTRVADGA

Query:  STSTSVVDPSSSSQV-RSQELRMPRRSGRVVRQPERYMGLAETQVVTVDDECEDPLTYVRQWQTLTK
        S  T VV   SS++  + Q LR PRRSGRV   P RYM L ET  V  D + EDPLT+ +  + + K
Subjt:  STSTSVVDPSSSSQV-RSQELRMPRRSGRVVRQPERYMGLAETQVVTVDDECEDPLTYVRQWQTLTK

A0A5D3BAN6 Gag/pol protein3.7e-14557.94Show/hide
Query:  TVIFDNLQTFQSLVQTRGQEVEASVA--LRPFHKGSTFRIKFVDSSGPKGKKEKLK---EVKVDRAAALKGKEIKDVARKCFYCNEGEHLKRSFPS----
        T + + LQTF+SL++ +GQ+ EA+VA   R FH+GST   K + SS    K +K K   + KV+ AAA   K+ K      F+CN+  H KR+ P     
Subjt:  TVIFDNLQTFQSLVQTRGQEVEASVA--LRPFHKGSTFRIKFVDSSGPKGKKEKLK---EVKVDRAAALKGKEIKDVARKCFYCNEGEHLKRSFPS----

Query:  -----------------------------HINLNRIEKLVKSGLLSELEENSLLVCESCLEGKMTIRPFSGKGYRANEPLELIHYDLCGPMNVKARGGYE
                                     HINLNRIE+LV++GLLSELEEN L VCESCLEGKMT RPF+GKG+RA EPLEL+H DLCGPMNVKARGG+E
Subjt:  -----------------------------HINLNRIEKLVKSGLLSELEENSLLVCESCLEGKMTIRPFSGKGYRANEPLELIHYDLCGPMNVKARGGYE

Query:  YFISFIDDYSSYGYIYLMHEKSETFEKFKEFKTEVENLLGKTIKTLRSDRGGEYMDTEFQDYMIEHGITFQLSAPGMPQQNGVSKRRNRTLLDMVRSMMS
        YFI+F DDYS YGY+YLM  KSE  EKFKE+K EVEN L KTIKT RSDRGGEYMD +FQ+Y++E GI  QLSAPG PQQNGVS+RRN+TLLDMV SMMS
Subjt:  YFISFIDDYSSYGYIYLMHEKSETFEKFKEFKTEVENLLGKTIKTLRSDRGGEYMDTEFQDYMIEHGITFQLSAPGMPQQNGVSKRRNRTLLDMVRSMMS

Query:  YARLPDSFWGYAMETAVCILNTVLSKSVCERPFELWHGRKTSLRHFRFGDVRPMCWCQTRRKRKLEPRSKLCLFVSYPKETRGGYFYDPKDNRVIVSTNV
        YA LP+SFWGYA++TAV ILN V SKSV E P +LW+GRK SL HFR     P    +   K KLEPRSKLCLFV Y K +RGGYFYDPKDN+V+VSTN 
Subjt:  YARLPDSFWGYAMETAVCILNTVLSKSVCERPFELWHGRKTSLRHFRFGDVRPMCWCQTRRKRKLEPRSKLCLFVSYPKETRGGYFYDPKDNRVIVSTNV

Query:  TFFEEDHIRDHLPRSKIVLNEMDG----TSTRVADGASTSTSVVDPSSSSQV-RSQELRMPRRSGRVVRQPERYMGLAETQVVTVDDECEDPLTYVRQWQ
        TF EEDHIR+H PRSKIVLNE+       STRV +  S  TSVV   SS++  + Q LR PRRSGRV   P RYM L ET  V  D + EDPLT+ +  +
Subjt:  TFFEEDHIRDHLPRSKIVLNEMDG----TSTRVADGASTSTSVVDPSSSSQV-RSQELRMPRRSGRVVRQPERYMGLAETQVVTVDDECEDPLTYVRQWQ

Query:  TLTK
         + K
Subjt:  TLTK

A0A5D3BHG7 Gag/pol protein6.0e-14360.6Show/hide
Query:  TRGQEVEASVA--LRPFHKGSTFRIKFVDSSGPKGKKEKLK---EVKVDRAAALKGKEIKDVARKCFYCNEGEHLKRSFP-----------SHINLNRIE
        ++GQ+ EA+VA   R FH+GST   K + SS    K +K K     K + AAA   K+ K     CF+ N+  H KR+ P            HINLNRIE
Subjt:  TRGQEVEASVA--LRPFHKGSTFRIKFVDSSGPKGKKEKLK---EVKVDRAAALKGKEIKDVARKCFYCNEGEHLKRSFP-----------SHINLNRIE

Query:  KLVKSGLLSELEENSLLVCESCLEGKMTIRPFSGKGYRANEPLELIHYDLCGPMNVKARGGYEYFISFIDDYSSYGYIYLMHEKSETFEKFKEFKTEVEN
        +LVK+G+LSELEENSL +CESCLEGKMT RPF+GKG+RA EPLEL+H DLCGPMNVKARG +EYFI+F DDYS YGY+YLM  KSE  EKFKE+K EVEN
Subjt:  KLVKSGLLSELEENSLLVCESCLEGKMTIRPFSGKGYRANEPLELIHYDLCGPMNVKARGGYEYFISFIDDYSSYGYIYLMHEKSETFEKFKEFKTEVEN

Query:  LLGKTIKTLRSDRGGEYMDTEFQDYMIEHGITFQLSAPGMPQQNGVSKRRNRTLLDMVRSMMSYARLPDSFWGYAMETAVCILNTVLSKSVCERPFELWH
         L KTIKT RSDRGGEYMD +FQ+Y++E  I  QLSAPG PQQNGVS+RRNRTLLDMVRSM+SYA LP+SFWGYA++TAV ILN V SKSV E P +LW+
Subjt:  LLGKTIKTLRSDRGGEYMDTEFQDYMIEHGITFQLSAPGMPQQNGVSKRRNRTLLDMVRSMMSYARLPDSFWGYAMETAVCILNTVLSKSVCERPFELWH

Query:  GRKTSLRHFRFGDVRPMCWCQTRRKRKLEPRSKLCLFVSYPKETRGGYFYDPKDNRVIVSTNVTFFEEDHIRDHLPRSKIVLNEMD----GTSTRVADGA
        GRK SLRHFR               +KLEPRSKLCLFV YPK TRGGYFYDPKDN+V VSTN TF EEDHIR+H PRSKIVLNE+       STRV +  
Subjt:  GRKTSLRHFRFGDVRPMCWCQTRRKRKLEPRSKLCLFVSYPKETRGGYFYDPKDNRVIVSTNVTFFEEDHIRDHLPRSKIVLNEMD----GTSTRVADGA

Query:  STSTSVVDPSSSSQV-RSQELRMPRRSGRVVRQPERYMGLAETQVVTVDDECEDPLTYVRQWQTLTK
        S  T VV   SS++  + Q LR PRRSGRV   P RYM L ET  V  D + EDPLT+ +  + + K
Subjt:  STSTSVVDPSSSSQV-RSQELRMPRRSGRVVRQPERYMGLAETQVVTVDDECEDPLTYVRQWQTLTK

A0A5D3BUN8 Gag/pol protein3.6e-14070.62Show/hide
Query:  HINLNRIEKLVKSGLLSELEENSLLVCESCLEGKMTIRPFSGKGYRANEPLELIHYDLCGPMNVKARGGYEYFISFIDDYSSYGYIYLMHEKSETFEKFK
        HINL+RI +LVK+GLL++L++ SL  CESCLEGKMT RPF+GKGYRA EPLELIH DLCGPMNVKARGG+EYFISFIDDYS YGY+YLM  KSE  EKFK
Subjt:  HINLNRIEKLVKSGLLSELEENSLLVCESCLEGKMTIRPFSGKGYRANEPLELIHYDLCGPMNVKARGGYEYFISFIDDYSSYGYIYLMHEKSETFEKFK

Query:  EFKTEVENLLGKTIKTLRSDRGGEYMDTEFQDYMIEHGITFQLSAPGMPQQNGVSKRRNRTLLDMVRSMMSYARLPDSFWGYAMETAVCILNTVLSKSVC
        E+KTEVENLL K IK LRSDRGGEYMD  FQDYMIEHGI  QLSAPG PQQNGVS+RRNRTLLDMVRSMMSYA+LP SFWGYA+ETAV ILN V SKSV 
Subjt:  EFKTEVENLLGKTIKTLRSDRGGEYMDTEFQDYMIEHGITFQLSAPGMPQQNGVSKRRNRTLLDMVRSMMSYARLPDSFWGYAMETAVCILNTVLSKSVC

Query:  ERPFELWHGRKTSLRHFRFGDVRPMCWCQTRRKRKLEPRSKLCLFVSYPKETRGGYFYDPKDNRVIVSTNVTFFEEDHIRDHLPRSKIVLNEMDGTSTRV
        E PFELW GRK SL HFR     P     T  K KLEPRS+LC FV YPKETRGG F+DP++NRV VSTN TF EEDH+R+H PRSK+VL+E    STRV
Subjt:  ERPFELWHGRKTSLRHFRFGDVRPMCWCQTRRKRKLEPRSKLCLFVSYPKETRGGYFYDPKDNRVIVSTNVTFFEEDHIRDHLPRSKIVLNEMDGTSTRV

Query:  ADGASTSTSVVDPSSSSQVR-SQELRMPRRSGRVVRQPERYMGLAETQVVTVDDECEDPLTYVRQWQTLTK
         D    S+ V + ++S Q   SQ LRMPRRSGRVV QP RY+GL ETQVV  DD  EDPL+Y +    + K
Subjt:  ADGASTSTSVVDPSSSSQVR-SQELRMPRRSGRVVRQPERYMGLAETQVVTVDDECEDPLTYVRQWQTLTK

SwissProt top hitse value%identityAlignment
P04146 Copia protein8.9e-3527.05Show/hide
Query:  INLNRIEKLVKSGLLSELEENSLLVCESCLEGKMTIRPFSGKGYRAN--EPLELIHYDLCGPMNVKARGGYEYFISFIDDYSSYGYIYLMHEKSETFEKF
        + + R        LL+ L E S  +CE CL GK    PF     + +   PL ++H D+CGP+         YF+ F+D ++ Y   YL+  KS+ F  F
Subjt:  INLNRIEKLVKSGLLSELEENSLLVCESCLEGKMTIRPFSGKGYRAN--EPLELIHYDLCGPMNVKARGGYEYFISFIDDYSSYGYIYLMHEKSETFEKF

Query:  KEFKTEVENLLGKTIKTLRSDRGGEYMDTEFQDYMIEHGITFQLSAPGMPQQNGVSKRRNRTLLDMVRSMMSYARLPDSFWGYAMETAVCILNTVLSKSV
        ++F  + E      +  L  D G EY+  E + + ++ GI++ L+ P  PQ NGVS+R  RT+ +  R+M+S A+L  SFWG A+ TA  ++N + S+++
Subjt:  KEFKTEVENLLGKTIKTLRSDRGGEYMDTEFQDYMIEHGITFQLSAPGMPQQNGVSKRRNRTLLDMVRSMMSYARLPDSFWGYAMETAVCILNTVLSKSV

Query:  CE---RPFELWHGRKTSLRHFR-FGDVRPMCWCQTRRKR-KLEPRSKLCLFVSYPKETRGGYFYDPKDNRVIVSTNVTFFEEDHIRDHLPRSKIVLNEMD
         +    P+E+WH +K  L+H R FG      +   + K+ K + +S   +FV Y  E  G   +D  + + IV+ +V   E + +     + + V  + D
Subjt:  CE---RPFELWHGRKTSLRHFR-FGDVRPMCWCQTRRKR-KLEPRSKLCLFVSYPKETRGGYFYDPKDNRVIVSTNVTFFEEDHIRDHLPRSKIVLNEMD

Query:  GTSTRVADGASTSTSVVD---PSSSSQVRSQELRMPRRSGRVVRQPERYMGLAETQVVTVDDECED
           +   +  + S  ++    P+ S +  + +     +       P     + +T+      EC++
Subjt:  GTSTRVADGASTSTSVVD---PSSSSQVRSQELRMPRRSGRVVRQPERYMGLAETQVVTVDDECED

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-941.0e-5133.52Show/hide
Query:  HINLNRIEKLVKSGLLSELEENSLLVCESCLEGKMTIRPFSGKGYRANEPLELIHYDLCGPMNVKARGGYEYFISFIDDYSSYGYIYLMHEKSETFEKFK
        H++   ++ L K  L+S  +  ++  C+ CL GK     F     R    L+L++ D+CGPM +++ GG +YF++FIDD S   ++Y++  K + F+ F+
Subjt:  HINLNRIEKLVKSGLLSELEENSLLVCESCLEGKMTIRPFSGKGYRANEPLELIHYDLCGPMNVKARGGYEYFISFIDDYSSYGYIYLMHEKSETFEKFK

Query:  EFKTEVENLLGKTIKTLRSDRGGEYMDTEFQDYMIEHGITFQLSAPGMPQQNGVSKRRNRTLLDMVRSMMSYARLPDSFWGYAMETAVCILNTVLSKSVC
        +F   VE   G+ +K LRSD GGEY   EF++Y   HGI  + + PG PQ NGV++R NRT+++ VRSM+  A+LP SFWG A++TA  ++N   S  + 
Subjt:  EFKTEVENLLGKTIKTLRSDRGGEYMDTEFQDYMIEHGITFQLSAPGMPQQNGVSKRRNRTLLDMVRSMMSYARLPDSFWGYAMETAVCILNTVLSKSVC

Query:  -ERPFELWHGRKTSLRHFRFGDVRPMCWCQTRRKRKLEPRSKLCLFVSYPKETRGGYFYDPKDNRVIVSTNVTFFEEDHIRDHLPRSKIVLNEMDGTSTR
         E P  +W  ++ S  H +    R        ++ KL+ +S  C+F+ Y  E  G   +DP   +VI S +V  F E  +R     S+ V N   G    
Subjt:  -ERPFELWHGRKTSLRHFRFGDVRPMCWCQTRRKRKLEPRSKLCLFVSYPKETRGGYFYDPKDNRVIVSTNVTFFEEDHIRDHLPRSKIVLNEMDGTSTR

Query:  VADGASTSTSVVDPSSSSQVRSQELRMPRRSGRVVRQPERY-MGLAETQVVTVDDECEDPL
             STS +     S++   S++   P   G V+ Q E+   G+ E +  T  +E   PL
Subjt:  VADGASTSTSVVDPSSSSQVRSQELRMPRRSGRVVRQPERY-MGLAETQVVTVDDECEDPL

Q07791 Transposon Ty2-DR3 Gag-Pol polyprotein4.0e-1927.75Show/hide
Query:  HINLNRIEKLVKSGLLSELEEN-------SLLVCESCLEGKMT----IRPFSGKGYRANEPLELIHYDLCGPMNVKARGGYEYFISFIDDYSSYGYIYLM
        H N   I+K +K   ++ L+E+       S   C  CL GK T    I+    K   + EP + +H D+ GP++   +    YFISF D+ + + ++Y +
Subjt:  HINLNRIEKLVKSGLLSELEEN-------SLLVCESCLEGKMT----IRPFSGKGYRANEPLELIHYDLCGPMNVKARGGYEYFISFIDDYSSYGYIYLM

Query:  HEKSE--TFEKFKEFKTEVENLLGKTIKTLRSDRGGEYMDTEFQDYMIEHGITFQLSAPGMPQQNGVSKRRNRTLLDMVRSMMSYARLPDSFWGYAMETA
        H++ E      F      ++N     +  ++ DRG EY +     +    GIT   +     + +GV++R NRTLL+  R+++  + LP+  W  A+E +
Subjt:  HEKSE--TFEKFKEFKTEVENLLGKTIKTLRSDRGGEYMDTEFQDYMIEHGITFQLSAPGMPQQNGVSKRRNRTLLDMVRSMMSYARLPDSFWGYAMETA

Query:  VCILNTVLS
          I N+++S
Subjt:  VCILNTVLS

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE12.5e-2927.11Show/hide
Query:  HLKRSFPSHINLNRIEKLVKSGLLSELEENSLLVCESCLEGKMTIRPFSGKGYRANEPLELIHYDLCGPMNVKARGGYEYFISFIDDYSSYGYIYLMHEK
        H +   P+   LN +       +L+    +  L C  CL  K    PFS     +  PLE I+ D+     + +   Y Y++ F+D ++ Y ++Y + +K
Subjt:  HLKRSFPSHINLNRIEKLVKSGLLSELEENSLLVCESCLEGKMTIRPFSGKGYRANEPLELIHYDLCGPMNVKARGGYEYFISFIDDYSSYGYIYLMHEK

Query:  SETFEKFKEFKTEVENLLGKTIKTLRSDRGGEYMDTEFQDYMIEHGITFQLSAPGMPQQNGVSKRRNRTLLDMVRSMMSYARLPDSFWGYAMETAVCILN
        S+  E F  FK  +EN     I T  SD GGE++     +Y  +HGI+   S P  P+ NG+S+R++R +++   +++S+A +P ++W YA   AV ++N
Subjt:  SETFEKFKEFKTEVENLLGKTIKTLRSDRGGEYMDTEFQDYMIEHGITFQLSAPGMPQQNGVSKRRNRTLLDMVRSMMSYARLPDSFWGYAMETAVCILN

Query:  TVLSKSV-CERPFELWHGRKTSLRHFRFGDVRPMCWCQTRRKRKLEPRSKLCLFVSYPKETRGGYFYDPKDNRVIVSTNVTFFE
         + +  +  E PF+   G   +    R        W +   + KL+ +S+ C+F+ Y            + +R+ +S +V F E
Subjt:  TVLSKSV-CERPFELWHGRKTSLRHFRFGDVRPMCWCQTRRKRKLEPRSKLCLFVSYPKETRGGYFYDPKDNRVIVSTNVTFFE

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE21.3e-3028.18Show/hide
Query:  CNEGEHLK-RSFPSHINLNRIEKLVKSGLLSELE-ENSLLVCESCLEGKMTIRPFSGKGYRANEPLELIHYDLCGPMNVKARGGYEYFISFIDDYSSYGY
        C++  H    S   H +L  +  ++ +  L  L   + LL C  C   K    PFS     +++PLE I+ D+     + +   Y Y++ F+D ++ Y +
Subjt:  CNEGEHLK-RSFPSHINLNRIEKLVKSGLLSELE-ENSLLVCESCLEGKMTIRPFSGKGYRANEPLELIHYDLCGPMNVKARGGYEYFISFIDDYSSYGY

Query:  IYLMHEKSETFEKFKEFKTEVENLLGKTIKTLRSDRGGEYMDTEFQDYMIEHGITFQLSAPGMPQQNGVSKRRNRTLLDMVRSMMSYARLPDSFWGYAME
        +Y + +KS+  + F  FK+ VEN     I TL SD GGE++    +DY+ +HGI+   S P  P+ NG+S+R++R +++M  +++S+A +P ++W YA  
Subjt:  IYLMHEKSETFEKFKEFKTEVENLLGKTIKTLRSDRGGEYMDTEFQDYMIEHGITFQLSAPGMPQQNGVSKRRNRTLLDMVRSMMSYARLPDSFWGYAME

Query:  TAVCILNTVLSKSV-CERPFELWHGRKTSLRHFRFGDVRPMCWCQTRRKRKLEPRSKLCLFVSYPKETRGGYFYDPKDNRVIVSTNVTFFE
         AV ++N + +  +  + PF+   G+  +    +        W +   + KLE +SK C F+ Y               R+  S +V F E
Subjt:  TAVCILNTVLSKSV-CERPFELWHGRKTSLRHFRFGDVRPMCWCQTRRKRKLEPRSKLCLFVSYPKETRGGYFYDPKDNRVIVSTNVTFFE

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCAATCGCTTTTAACGCCCCTTTAGCTTCCGTTACGTTAGTCGACAGTAGTATTTCGTGGAAAAATAATATAGACACGATTTTGGTAAATGAAGACCAAGAGTTTGC
ACTCACTGAGGAGGGTCCTCAGGTGCTCGGTTCAACAGTCATATTCGACAATCTACAAACCTTTCAGTCCTTGGTGCAAACTAGGGGACAGGAAGTTGAGGCAAGTGTTG
CCTTAAGACCTTTTCACAAGGGTTCGACCTTTAGGATAAAGTTTGTAGACTCTTCAGGTCCTAAGGGGAAGAAGGAGAAGTTGAAGGAAGTTAAGGTTGACCGCGCTGCC
GCCCTAAAAGGCAAAGAAATCAAGGATGTTGCAAGAAAGTGTTTCTACTGCAATGAGGGCGAACACCTGAAAAGGAGTTTCCCAAGCCACATTAATCTCAATAGGATTGA
GAAACTAGTCAAGAGTGGACTTCTAAGCGAGTTGGAAGAAAACTCTTTACTGGTATGTGAGTCATGCCTTGAAGGCAAAATGACCATACGTCCTTTTAGTGGAAAAGGAT
ATAGAGCCAATGAGCCCCTTGAGCTTATACATTATGATCTTTGTGGTCCGATGAATGTTAAAGCACGAGGTGGTTACGAATACTTCATATCTTTCATAGATGACTATTCG
AGTTATGGGTACATTTACCTAATGCATGAGAAGTCTGAAACATTTGAAAAGTTCAAGGAGTTCAAGACTGAGGTTGAGAACCTGTTAGGTAAAACGATTAAAACACTTCG
ATCCGATCGAGGTGGAGAGTATATGGACACTGAATTCCAGGACTATATGATAGAACATGGAATTACGTTCCAACTCTCAGCACCTGGTATGCCACAGCAGAATGGTGTAT
CGAAGAGGAGAAACAGAACCTTGTTAGACATGGTTCGGTCAATGATGAGCTATGCTCGTCTCCCTGATTCCTTTTGGGGTTACGCAATGGAGACTGCGGTTTGCATTTTG
AACACAGTTCTGTCGAAAAGTGTTTGTGAAAGACCTTTCGAACTCTGGCATGGTCGTAAAACTAGTTTACGTCATTTCAGATTTGGGGATGTCCGACCCATGTGTTGGTG
TCAAACCCGAAGAAAAAGGAAGCTGGAACCCCGTTCAAAGTTGTGCCTCTTTGTAAGTTACCCAAAAGAGACTAGAGGTGGTTACTTTTATGATCCTAAGGATAATAGGG
TGATTGTGTCCACAAATGTCACTTTCTTTGAGGAAGACCACATCAGGGATCACTTACCAAGGAGTAAGATTGTGTTAAATGAAATGGACGGTACATCAACAAGAGTCGCT
GATGGTGCTAGTACGTCAACAAGTGTTGTTGATCCTAGCTCGTCTAGTCAAGTCCGTTCCCAAGAGTTGAGAATGCCTCGACGTAGTGGGAGGGTTGTGAGACAGCCTGA
ACGTTACATGGGTTTAGCTGAAACCCAAGTTGTCACTGTTGATGATGAATGTGAGGATCCATTGACCTATGTCAGGCAATGGCAGACGTTGACAAAGACAAATGGATTAC
AGCCATGGACCAGGAAATGGAGTCGATGTACTTCAATTCCGTCTAGAAGCTTGTGGACCAGTCGGATGGGGTTAAACCTAATGGTTGTAAATGGATCTACAAGCGTAAAC
GTGGCGTAG
mRNA sequenceShow/hide mRNA sequence
ATGTCAATCGCTTTTAACGCCCCTTTAGCTTCCGTTACGTTAGTCGACAGTAGTATTTCGTGGAAAAATAATATAGACACGATTTTGGTAAATGAAGACCAAGAGTTTGC
ACTCACTGAGGAGGGTCCTCAGGTGCTCGGTTCAACAGTCATATTCGACAATCTACAAACCTTTCAGTCCTTGGTGCAAACTAGGGGACAGGAAGTTGAGGCAAGTGTTG
CCTTAAGACCTTTTCACAAGGGTTCGACCTTTAGGATAAAGTTTGTAGACTCTTCAGGTCCTAAGGGGAAGAAGGAGAAGTTGAAGGAAGTTAAGGTTGACCGCGCTGCC
GCCCTAAAAGGCAAAGAAATCAAGGATGTTGCAAGAAAGTGTTTCTACTGCAATGAGGGCGAACACCTGAAAAGGAGTTTCCCAAGCCACATTAATCTCAATAGGATTGA
GAAACTAGTCAAGAGTGGACTTCTAAGCGAGTTGGAAGAAAACTCTTTACTGGTATGTGAGTCATGCCTTGAAGGCAAAATGACCATACGTCCTTTTAGTGGAAAAGGAT
ATAGAGCCAATGAGCCCCTTGAGCTTATACATTATGATCTTTGTGGTCCGATGAATGTTAAAGCACGAGGTGGTTACGAATACTTCATATCTTTCATAGATGACTATTCG
AGTTATGGGTACATTTACCTAATGCATGAGAAGTCTGAAACATTTGAAAAGTTCAAGGAGTTCAAGACTGAGGTTGAGAACCTGTTAGGTAAAACGATTAAAACACTTCG
ATCCGATCGAGGTGGAGAGTATATGGACACTGAATTCCAGGACTATATGATAGAACATGGAATTACGTTCCAACTCTCAGCACCTGGTATGCCACAGCAGAATGGTGTAT
CGAAGAGGAGAAACAGAACCTTGTTAGACATGGTTCGGTCAATGATGAGCTATGCTCGTCTCCCTGATTCCTTTTGGGGTTACGCAATGGAGACTGCGGTTTGCATTTTG
AACACAGTTCTGTCGAAAAGTGTTTGTGAAAGACCTTTCGAACTCTGGCATGGTCGTAAAACTAGTTTACGTCATTTCAGATTTGGGGATGTCCGACCCATGTGTTGGTG
TCAAACCCGAAGAAAAAGGAAGCTGGAACCCCGTTCAAAGTTGTGCCTCTTTGTAAGTTACCCAAAAGAGACTAGAGGTGGTTACTTTTATGATCCTAAGGATAATAGGG
TGATTGTGTCCACAAATGTCACTTTCTTTGAGGAAGACCACATCAGGGATCACTTACCAAGGAGTAAGATTGTGTTAAATGAAATGGACGGTACATCAACAAGAGTCGCT
GATGGTGCTAGTACGTCAACAAGTGTTGTTGATCCTAGCTCGTCTAGTCAAGTCCGTTCCCAAGAGTTGAGAATGCCTCGACGTAGTGGGAGGGTTGTGAGACAGCCTGA
ACGTTACATGGGTTTAGCTGAAACCCAAGTTGTCACTGTTGATGATGAATGTGAGGATCCATTGACCTATGTCAGGCAATGGCAGACGTTGACAAAGACAAATGGATTAC
AGCCATGGACCAGGAAATGGAGTCGATGTACTTCAATTCCGTCTAGAAGCTTGTGGACCAGTCGGATGGGGTTAAACCTAATGGTTGTAAATGGATCTACAAGCGTAAAC
GTGGCGTAG
Protein sequenceShow/hide protein sequence
MSIAFNAPLASVTLVDSSISWKNNIDTILVNEDQEFALTEEGPQVLGSTVIFDNLQTFQSLVQTRGQEVEASVALRPFHKGSTFRIKFVDSSGPKGKKEKLKEVKVDRAA
ALKGKEIKDVARKCFYCNEGEHLKRSFPSHINLNRIEKLVKSGLLSELEENSLLVCESCLEGKMTIRPFSGKGYRANEPLELIHYDLCGPMNVKARGGYEYFISFIDDYS
SYGYIYLMHEKSETFEKFKEFKTEVENLLGKTIKTLRSDRGGEYMDTEFQDYMIEHGITFQLSAPGMPQQNGVSKRRNRTLLDMVRSMMSYARLPDSFWGYAMETAVCIL
NTVLSKSVCERPFELWHGRKTSLRHFRFGDVRPMCWCQTRRKRKLEPRSKLCLFVSYPKETRGGYFYDPKDNRVIVSTNVTFFEEDHIRDHLPRSKIVLNEMDGTSTRVA
DGASTSTSVVDPSSSSQVRSQELRMPRRSGRVVRQPERYMGLAETQVVTVDDECEDPLTYVRQWQTLTKTNGLQPWTRKWSRCTSIPSRSLWTSRMGLNLMVVNGSTSVN
VA