CuGenDBv2

Tan0021760 (gene) of Snake gourd v1 genome

Gene ID: Tan0021760
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Genome location: LG07:10418935..10423528
RNA-Seq Expression: Tan0021760
Synteny: Tan0021760
Gene Ontology terms:
    GO:0015074 - DNA integration (biological process)
    GO:0003676 - nucleic acid binding (molecular function)
InterPro domains:
    IPR001584 - Integrase, catalytic core
    IPR012337 - Ribonuclease H-like superfamily
    IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
    IPR025724 - GAG-pre-integrase domain
    IPR036397 - Ribonuclease H superfamily
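
The genome location string can be split into its parts with a few lines of code. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the record itself does not state the convention); the function name `parse_location` is hypothetical and not part of CuGenDB.

```python
import re


def parse_location(location):
    """Split a 'SEQID:START..END' string into its parts.

    Assumes 1-based inclusive coordinates, so the span is end - start + 1.
    """
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    seqid = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    return seqid, start, end, end - start + 1


if __name__ == "__main__":
    print(parse_location("LG07:10418935..10423528"))
```

Under that assumption the gene spans 4594 bases on linkage group LG07.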


Homology
GenBank top hits (e value, % identity, alignment)
CAN81099.1 hypothetical protein VITISV_017741 [Vitis vinifera]    e value: 3.3e-141    % identity: 46.08
Query:  DGLYRFQMTQANLSILSSQHHQSSISAPQVHNVSLSSSLNQSVFHKCTSLEKWHNRLGHSAIPIVQHIMSMCNLDTSNK-SFHFCHACAVGKSHNLPFHD
        DGLY F  +   L    S     S+ A    +   ++SL+       ++ + WH RLGH +   +++++S CN+   NK   +FC +C +GK H  PF  
Subjt:  DGLYRFQMTQANLSILSSQHHQSSISAPQVHNVSLSSSLNQSVFHKCTSLEKWHNRLGHSAIPIVQHIMSMCNLDTSNK-SFHFCHACAVGKSHNLPFHD

Query:  STSHYDFPLQLIVVDVWGPAYESSRNGFKYYVSFVDVFSRYTLIYFLNNKSEAFSAFLLFKTQVEKMFNRSILSLQTDNGGEFRSFIHFLKTNGITHRVT
        S + Y  PL+LI +D+WGP    S +G++YY+ FVD FSR++ I+ L NKSEA   F+ FKTQVE  F+  I SLQTD GGEFR+F  +L  NGI HRV+
Subjt:  STSHYDFPLQLIVVDVWGPAYESSRNGFKYYVSFVDVFSRYTLIYFLNNKSEAFSAFLLFKTQVEKMFNRSILSLQTDNGGEFRSFIHFLKTNGITHRVT

Query:  CPYTSQQNGIVERKHRHIVEMGLTLLSYASLSIKFWDDAFATAVYLINRLPTKVHHTVSPMEKLFGHKPTYVDLRTFGCLCYPSLRAYNKHKLDPRSTPC
        CP+T QQNG+ ERKHR IVE GLTLL  ASL +KFWD++F T VYL NRLPT + H   P+E LF   P Y  L+ FGC C+P+LR YN HKL  RS  C
Subjt:  CPYTSQQNGIVERKHRHIVEMGLTLLSYASLSIKFWDDAFATAVYLINRLPTKVHHTVSPMEKLFGHKPTYVDLRTFGCLCYPSLRAYNKHKLDPRSTPC

Query:  LFLGYSSFHKGYKCLSSFGRMYISRHVTFDESTFPSFNFLYPSESKSNTLHHSVLPIVHSDPTPINFQQSNLPPSCPLSTL----PMDPLITNSIPSP--
         FLGYS  HKGYKC+SS GR+YIS  V F+E++FP    +  S    +T+  S   +  S   P+        P+ P+S+      MD +++    +P  
Subjt:  LFLGYSSFHKGYKCLSSFGRMYISRHVTFDESTFPSFNFLYPSESKSNTLHHSVLPIVHSDPTPINFQQSNLPPSCPLSTL----PMDPLITNSIPSP--

Query:  -QTLSLPTPLDSHAHPTPCLIASGSVSNILNVPPVTGTTTK--NNTHTLITRGKARVFKPKVFLANYKEVEPPNVKEALKCDHWIQAMIDEYRALMSNDT
          T   P  + S+   TP      S+++      VT T  K  +NTH +ITR K+ + KPK+F+A  +  EP +V  AL+ D W +AM+ EY AL  N+T
Subjt:  -QTLSLPTPLDSHAHPTPCLIASGSVSNILNVPPVTGTTTK--NNTHTLITRGKARVFKPKVFLANYKEVEPPNVKEALKCDHWIQAMIDEYRALMSNDT

Query:  WSLLDRPVNKKIIGCKWVFKIKRHSDGSVARYKARLVAQGFHQQADVDYTETFSPVVKPVTIRILFTLALTNGWKLRLVDINNAFLHGLLSEEVFMSQP
        WSL+  P  ++ IGCKWV+K K + DG+V +YKARLVA+GFHQQA  D+TETFSPVVKP T+R++FT+AL+  W ++ +D+NNAFL+G L EEVFM QP
Subjt:  WSLLDRPVNKKIIGCKWVFKIKRHSDGSVARYKARLVAQGFHQQADVDYTETFSPVVKPVTIRILFTLALTNGWKLRLVDINNAFLHGLLSEEVFMSQP

KAG8479334.1 hypothetical protein CXB51_029681 [Gossypium anomalum]    e value: 1.4e-144    % identity: 47.69
Query:  KTLLQG---DGLYRFQMTQANLSILSSQHHQSSISAPQVHNVSLSSSLNQSVFHKCTSLEKWHNRLGHSAIPIVQHIMSMCNLDTS-NKSFHFCHACAVG
        +TLL G   +GLYRF   ++N +                 + +L  S  Q         ++WH RLGH +  +V+ I++ CN+  S  K++  C+AC +G
Subjt:  KTLLQG---DGLYRFQMTQANLSILSSQHHQSSISAPQVHNVSLSSSLNQSVFHKCTSLEKWHNRLGHSAIPIVQHIMSMCNLDTS-NKSFHFCHACAVG

Query:  KSHNLPFHDSTSHYDFPLQLIVVDVWGPA-YESSRNGFKYYVSFVDVFSRYTLIYFLNNKSEAFSAFLLFKTQVEKMFNRSILSLQTDNGGEFRSFIHFL
        K H LPF  S   Y  PL+L+V DVWGPA Y SS  G++YY+SFVD FSR+T IYFL  KS+AFSAFL FK  VE      +  LQTD GGEFRSF  +L
Subjt:  KSHNLPFHDSTSHYDFPLQLIVVDVWGPA-YESSRNGFKYYVSFVDVFSRYTLIYFLNNKSEAFSAFLLFKTQVEKMFNRSILSLQTDNGGEFRSFIHFL

Query:  KTNGITHRVTCPYTSQQNGIVERKHRHIVEMGLTLLSYASLSIKFWDDAFATAVYLINRLPTKVHHTVSPMEKLFGHKPTYVDLRTFGCLCYPSLRAYNK
        K   I HRV+CP+TS+QNG+VE +HR IVE GL LL+ ASL I +W DAFATAVY++NRLPTK    VSP E+LFGHKP Y  LR FGCLCYP LR YN+
Subjt:  KTNGITHRVTCPYTSQQNGIVERKHRHIVEMGLTLLSYASLSIKFWDDAFATAVYLINRLPTKVHHTVSPMEKLFGHKPTYVDLRTFGCLCYPSLRAYNK

Query:  HKLDPRSTPCLFLGYSSFHKGYKCLSSFGRMYISRHVTFDESTFP----SFNFLYPSESKSNTLHHSV--LPIVHSDPTPINFQQSNLPPSCPLSTLPMD
        HKL  RS PC FLGY++ H+GYKC+  +GR+YISRHV FDE T+P    S + +   +S+S      V  LPI  + P       +N+  S P+ + P  
Subjt:  HKLDPRSTPCLFLGYSSFHKGYKCLSSFGRMYISRHVTFDESTFP----SFNFLYPSESKSNTLHHSV--LPIVHSDPTPINFQQSNLPPSCPLSTLPMD

Query:  PLITNSIPSPQTLSLPTPLDSHAHPTPCLIASGSVSNILNVPPVTGTTTKNNTHTLITRGKARVFKPKVFLANYKEVEPPNVKEALKCDHWIQAMIDEYR
             S  SP   SL  P  S++     LI     S+++            N H ++TR K  ++KPK ++A   +VEP  + EA+    W QA+ DE +
Subjt:  PLITNSIPSPQTLSLPTPLDSHAHPTPCLIASGSVSNILNVPPVTGTTTKNNTHTLITRGKARVFKPKVFLANYKEVEPPNVKEALKCDHWIQAMIDEYR

Query:  ALMSNDTWSLLDRPVNKKIIGCKWVFKIKRHSDGSVARYKARLVAQGFHQQADVDYTETFSPVVKPVTIRILFTLALTNGWKLRLVDINNAFLHGLLSEE
        AL+ N TW L+  PVN+ ++GCKW+FKIKR+SDGSVAR K RLVAQGF Q A +DY ETFS VVK  T+R++  LA++  WKLR VD+NNAFL+G L E+
Subjt:  ALMSNDTWSLLDRPVNKKIIGCKWVFKIKRHSDGSVARYKARLVAQGFHQQADVDYTETFSPVVKPVTIRILFTLALTNGWKLRLVDINNAFLHGLLSEE

Query:  VFMSQP
        ++M QP
Subjt:  VFMSQP

RVW44519.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]    e value: 7.9e-143    % identity: 46.65
Query:  LLQGD---GLYRFQMTQ------ANLSILSSQHHQSSISAPQVHNVSLS-SSLNQSVFHKCTSLEKWHNRLGHSAIPIVQHIMSMCNLDTSNKS-FHFCH
        LLQG+   GLY+F +++      + LS+ + ++  +  +A  VHN +        S FH     + WH RLGH A  IV  +++   +  S KS    C 
Subjt:  LLQGD---GLYRFQMTQ------ANLSILSSQHHQSSISAPQVHNVSLS-SSLNQSVFHKCTSLEKWHNRLGHSAIPIVQHIMSMCNLDTSNKS-FHFCH

Query:  ACAVGKSHNLPFHDSTSHYDFPLQLIVVDVWGPAYESSRNGFKYYVSFVDVFSRYTLIYFLNNKSEAFSAFLLFKTQVEKMFNRSILSLQTDNGGEFRSF
        AC +GKSHNLPF  S + Y  PLQL+V D+WGPA  +S  GF YYVSFVD +SRYT +YFL  KS+   AFL+FK Q E  F   + + QTD GGEFRS 
Subjt:  ACAVGKSHNLPFHDSTSHYDFPLQLIVVDVWGPAYESSRNGFKYYVSFVDVFSRYTLIYFLNNKSEAFSAFLLFKTQVEKMFNRSILSLQTDNGGEFRSF

Query:  IHFLKTNGITHRVTCPYTSQQNGIVERKHRHIVEMGLTLLSYASLSIKFWDDAFATAVYLINRLPTKVHHTVSPMEKLFGHKPTYVDLRTFGCLCYPSLR
          + + NGI HR++CP+TS+QNGI+ERKHRHIVE+GLTLL+ ASL +K+W DAF+TAV+LINRLPT+V     P E LF  KP Y  L+ FGCLC+P LR
Subjt:  IHFLKTNGITHRVTCPYTSQQNGIVERKHRHIVEMGLTLLSYASLSIKFWDDAFATAVYLINRLPTKVHHTVSPMEKLFGHKPTYVDLRTFGCLCYPSLR

Query:  AYNKHKLDPRSTPCLFLGYSSFHKGYKCLSSFGRMYISRHVTFDESTFPSFNFLY-PSESKS-NTLHHSVLPIVHSDPTPINFQQS-NLPPSCPLSTLPM
         YNKHKLD RS+PC FLGYSS HKGYKCL+  GRM+ISR V FDE+ FP  + L  P +  S +T+    +P+V  +  P++   S +LP S   S+  +
Subjt:  AYNKHKLDPRSTPCLFLGYSSFHKGYKCLSSFGRMYISRHVTFDESTFPSFNFLY-PSESKS-NTLHHSVLPIVHSDPTPINFQQS-NLPPSCPLSTLPM

Query:  DPLITNSIPSPQ-----------------TLSLPTPLDSHAHP--TPCLIASGSVSNILNVPPVTGTTTKNNTHTLITRGKARVFKPKVFLANYKEVEPP
        D  + + I S Q                 + S+P+  + +A P   P    S   +  +N  PV   T     H ++TR K  +FKPKV+  +    EP 
Subjt:  DPLITNSIPSPQ-----------------TLSLPTPLDSHAHP--TPCLIASGSVSNILNVPPVTGTTTKNNTHTLITRGKARVFKPKVFLANYKEVEPP

Query:  NVKEALKCDHWIQAMIDEYRALMSNDTWSLLDRPVNKKIIGCKWVFKIKRHSDGSVARYKARLVAQGFHQQADVDYTETFSPVVKPVTIRILFTLALTNG
          +EA+    W +AM +E+RALM N TWSL+  P N+  +GC+WVFK+KR+ DGSV+RYKARLVA+G+ Q    D+ ETFSPVVKP TIR++  +A++  
Subjt:  NVKEALKCDHWIQAMIDEYRALMSNDTWSLLDRPVNKKIIGCKWVFKIKRHSDGSVARYKARLVAQGFHQQADVDYTETFSPVVKPVTIRILFTLALTNG

Query:  WKLRLVDINNAFLHGLLSEEVFMSQP
        W +R +D+NNAFL+G L EEV+M QP
Subjt:  WKLRLVDINNAFLHGLLSEEVFMSQP

RVW60229.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]    e value: 7.9e-143    % identity: 46.65
Query:  LLQGD---GLYRFQMTQ------ANLSILSSQHHQSSISAPQVHNVSLS-SSLNQSVFHKCTSLEKWHNRLGHSAIPIVQHIMSMCNLDTSNKS-FHFCH
        LLQG+   GLY+F +++      + LS+ + ++  +  +A  VHN +        S FH     + WH RLGH A  IV  +++   +  S KS    C 
Subjt:  LLQGD---GLYRFQMTQ------ANLSILSSQHHQSSISAPQVHNVSLS-SSLNQSVFHKCTSLEKWHNRLGHSAIPIVQHIMSMCNLDTSNKS-FHFCH

Query:  ACAVGKSHNLPFHDSTSHYDFPLQLIVVDVWGPAYESSRNGFKYYVSFVDVFSRYTLIYFLNNKSEAFSAFLLFKTQVEKMFNRSILSLQTDNGGEFRSF
        AC +GKSHNLPF  S + Y  PLQL+V D+WGPA  +S  GF YYVSFVD +SRYT +YFL  KS+   AFL+FK Q E  F   + + QTD GGEFRS 
Subjt:  ACAVGKSHNLPFHDSTSHYDFPLQLIVVDVWGPAYESSRNGFKYYVSFVDVFSRYTLIYFLNNKSEAFSAFLLFKTQVEKMFNRSILSLQTDNGGEFRSF

Query:  IHFLKTNGITHRVTCPYTSQQNGIVERKHRHIVEMGLTLLSYASLSIKFWDDAFATAVYLINRLPTKVHHTVSPMEKLFGHKPTYVDLRTFGCLCYPSLR
          + + NGI HR++CP+TS+QNGI+ERKHRHIVE+GLTLL+ ASL +K+W DAF+TAV+LINRLPT+V     P E LF  KP Y  L+ FGCLC+P LR
Subjt:  IHFLKTNGITHRVTCPYTSQQNGIVERKHRHIVEMGLTLLSYASLSIKFWDDAFATAVYLINRLPTKVHHTVSPMEKLFGHKPTYVDLRTFGCLCYPSLR

Query:  AYNKHKLDPRSTPCLFLGYSSFHKGYKCLSSFGRMYISRHVTFDESTFPSFNFLY-PSESKS-NTLHHSVLPIVHSDPTPINFQQS-NLPPSCPLSTLPM
         YNKHKLD RS+PC FLGYSS HKGYKCL+  GRM+ISR V FDE+ FP  + L  P +  S +T+    +P+V  +  P++   S +LP S   S+  +
Subjt:  AYNKHKLDPRSTPCLFLGYSSFHKGYKCLSSFGRMYISRHVTFDESTFPSFNFLY-PSESKS-NTLHHSVLPIVHSDPTPINFQQS-NLPPSCPLSTLPM

Query:  DPLITNSIPSPQ-----------------TLSLPTPLDSHAHP--TPCLIASGSVSNILNVPPVTGTTTKNNTHTLITRGKARVFKPKVFLANYKEVEPP
        D  + + I S Q                 + S+P+  + +A P   P    S   +  +N  PV   T     H ++TR K  +FKPKV+  +    EP 
Subjt:  DPLITNSIPSPQ-----------------TLSLPTPLDSHAHP--TPCLIASGSVSNILNVPPVTGTTTKNNTHTLITRGKARVFKPKVFLANYKEVEPP

Query:  NVKEALKCDHWIQAMIDEYRALMSNDTWSLLDRPVNKKIIGCKWVFKIKRHSDGSVARYKARLVAQGFHQQADVDYTETFSPVVKPVTIRILFTLALTNG
          +EA+    W +AM +E+RALM N TWSL+  P N+  +GC+WVFK+KR+ DGSV+RYKARLVA+G+ Q    D+ ETFSPVVKP TIR++  +A++  
Subjt:  NVKEALKCDHWIQAMIDEYRALMSNDTWSLLDRPVNKKIIGCKWVFKIKRHSDGSVARYKARLVAQGFHQQADVDYTETFSPVVKPVTIRILFTLALTNG

Query:  WKLRLVDINNAFLHGLLSEEVFMSQP
        W +R +D+NNAFL+G L EEV+M QP
Subjt:  WKLRLVDINNAFLHGLLSEEVFMSQP

RVX14937.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]    e value: 1.6e-140    % identity: 46.91
Query:  DGLYRFQMTQANLSILSSQHHQSSISAPQVHNVSLSSSLNQSVFHKCTSLEKWHNRLGHSAIPIVQHIMSMCNLDTSNK-SFHFCHACAVGKSHNLPFHD
        DGLY F     + S L+ +  QS   +P V   S SS +   +    ++ + WH RLG  +   +++++S CN+   NK   +FC +C +GK H  PF  
Subjt:  DGLYRFQMTQANLSILSSQHHQSSISAPQVHNVSLSSSLNQSVFHKCTSLEKWHNRLGHSAIPIVQHIMSMCNLDTSNK-SFHFCHACAVGKSHNLPFHD

Query:  STSHYDFPLQLIVVDVWGPAYESSRNGFKYYVSFVDVFSRYTLIYFLNNKSEAFSAFLLFKTQVEKMFNRSILSLQTDNGGEFRSFIHFLKTNGITHRVT
        S + Y  PL+LI  D+WGPA   S +G++YY+ FVD FSR++ I+ L NKSEA   F+ FKTQVE  F+  I SLQTD GGEFR+F  +L  NGI HRV+
Subjt:  STSHYDFPLQLIVVDVWGPAYESSRNGFKYYVSFVDVFSRYTLIYFLNNKSEAFSAFLLFKTQVEKMFNRSILSLQTDNGGEFRSFIHFLKTNGITHRVT

Query:  CPYTSQQNGIVERKHRHIVEMGLTLLSYASLSIKFWDDAFATAVYLINRLPTKVHHTVSPMEKLFGHKPTYVDLRTFGCLCYPSLRAYNKHKLDPRSTPC
        CP+T QQNG+ ERKHR IVE GLTLL   SL +KFWD++F T VYL NRLPT V H   P+E LF   P Y  L+ FGC C+P+LR YN HKL  RS  C
Subjt:  CPYTSQQNGIVERKHRHIVEMGLTLLSYASLSIKFWDDAFATAVYLINRLPTKVHHTVSPMEKLFGHKPTYVDLRTFGCLCYPSLRAYNKHKLDPRSTPC

Query:  LFLGYSSFHKGYKCLSSFGRMYISRHVTFDESTFPSFNFLYPSESKSNTLHHSVLPIVHSDPTPINFQQSNLPPSCPLSTL----PMDPLITNSIPSP--
         FLGYS  HKGYKC+SS GR+YISR V F+E++FP    +  S    +T+  S   +  S   P+        P+ P+S+      MD +++    +P  
Subjt:  LFLGYSSFHKGYKCLSSFGRMYISRHVTFDESTFPSFNFLYPSESKSNTLHHSVLPIVHSDPTPINFQQSNLPPSCPLSTL----PMDPLITNSIPSP--

Query:  -QTLSLPTPLDSHAHPTPCLIASGSVSNILNVPPVTGTTTK--NNTHTLITRGKARVFKPKVFLANYKEVEPPNVKEALKCDHWIQAMIDEYRALMSNDT
          T   P  + S+   TP      S+++      VT T  K  +NTH +ITR K+ + KPK+F+A  +  EP +V  AL+ D W +AM+ EY AL  N+T
Subjt:  -QTLSLPTPLDSHAHPTPCLIASGSVSNILNVPPVTGTTTK--NNTHTLITRGKARVFKPKVFLANYKEVEPPNVKEALKCDHWIQAMIDEYRALMSNDT

Query:  WSLLDRPVNKKIIGCKWVFKIKRHSDGSVARYKARLVAQGFHQQADVDYTETFSPVVKPVTIRILFTLALTNGWKLRLVDINNAFLHGLLSEEVFMSQP
        WSL+  P  ++ IGCKWV+K K + DG+V +YKARLVA+GFHQQA  D+TETFSPVVKP TIR++FT+AL+  W ++ +D+NNAFL+G L EEVFM QP
Subjt:  WSLLDRPVNKKIIGCKWVFKIKRHSDGSVARYKARLVAQGFHQQADVDYTETFSPVVKPVTIRILFTLALTNGWKLRLVDINNAFLHGLLSEEVFMSQP
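
The % identity column can be related to the alignment rows with a short calculation. This is a minimal sketch, assuming the unlabeled middle row of each block prints a letter only where query and subject residues are identical (with `+` for conservative substitutions and a blank otherwise), and assuming identity is defined as identical columns divided by aligned columns. The function name `midline_identity` is hypothetical, and the values reported on this page come from the database's own full alignments, which may differ from what a displayed excerpt yields.

```python
def midline_identity(query_rows, midline_rows):
    """Percent identity from BLAST-style alignment rows.

    query_rows:   aligned query strings, one per block, label removed.
    midline_rows: the matching middle rows, label-width padding removed.
    A letter in the middle row counts as an identical column; '+' and
    blanks do not. Gap columns in the query count toward the length.
    """
    columns = sum(len(row) for row in query_rows)
    if columns == 0:
        raise ValueError("no aligned columns supplied")
    identical = sum(ch.isalpha() for row in midline_rows for ch in row)
    return round(100.0 * identical / columns, 2)


if __name__ == "__main__":
    # Toy example, not taken from a full alignment on this page.
    print(midline_identity(["DGLYRF"], ["DGLY F"]))
```

Before calling the function, strip the `Query:  ` label from each query row and the same number of leading characters from each middle row so that the columns line up.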

TrEMBL top hits (e value, % identity, alignment)
A0A438EA49 Retrovirus-related Pol polyprotein from transposon TNT 1-94    e value: 3.8e-143    % identity: 46.65
Query:  LLQGD---GLYRFQMTQ------ANLSILSSQHHQSSISAPQVHNVSLS-SSLNQSVFHKCTSLEKWHNRLGHSAIPIVQHIMSMCNLDTSNKS-FHFCH
        LLQG+   GLY+F +++      + LS+ + ++  +  +A  VHN +        S FH     + WH RLGH A  IV  +++   +  S KS    C 
Subjt:  LLQGD---GLYRFQMTQ------ANLSILSSQHHQSSISAPQVHNVSLS-SSLNQSVFHKCTSLEKWHNRLGHSAIPIVQHIMSMCNLDTSNKS-FHFCH

Query:  ACAVGKSHNLPFHDSTSHYDFPLQLIVVDVWGPAYESSRNGFKYYVSFVDVFSRYTLIYFLNNKSEAFSAFLLFKTQVEKMFNRSILSLQTDNGGEFRSF
        AC +GKSHNLPF  S + Y  PLQL+V D+WGPA  +S  GF YYVSFVD +SRYT +YFL  KS+   AFL+FK Q E  F   + + QTD GGEFRS 
Subjt:  ACAVGKSHNLPFHDSTSHYDFPLQLIVVDVWGPAYESSRNGFKYYVSFVDVFSRYTLIYFLNNKSEAFSAFLLFKTQVEKMFNRSILSLQTDNGGEFRSF

Query:  IHFLKTNGITHRVTCPYTSQQNGIVERKHRHIVEMGLTLLSYASLSIKFWDDAFATAVYLINRLPTKVHHTVSPMEKLFGHKPTYVDLRTFGCLCYPSLR
          + + NGI HR++CP+TS+QNGI+ERKHRHIVE+GLTLL+ ASL +K+W DAF+TAV+LINRLPT+V     P E LF  KP Y  L+ FGCLC+P LR
Subjt:  IHFLKTNGITHRVTCPYTSQQNGIVERKHRHIVEMGLTLLSYASLSIKFWDDAFATAVYLINRLPTKVHHTVSPMEKLFGHKPTYVDLRTFGCLCYPSLR

Query:  AYNKHKLDPRSTPCLFLGYSSFHKGYKCLSSFGRMYISRHVTFDESTFPSFNFLY-PSESKS-NTLHHSVLPIVHSDPTPINFQQS-NLPPSCPLSTLPM
         YNKHKLD RS+PC FLGYSS HKGYKCL+  GRM+ISR V FDE+ FP  + L  P +  S +T+    +P+V  +  P++   S +LP S   S+  +
Subjt:  AYNKHKLDPRSTPCLFLGYSSFHKGYKCLSSFGRMYISRHVTFDESTFPSFNFLY-PSESKS-NTLHHSVLPIVHSDPTPINFQQS-NLPPSCPLSTLPM

Query:  DPLITNSIPSPQ-----------------TLSLPTPLDSHAHP--TPCLIASGSVSNILNVPPVTGTTTKNNTHTLITRGKARVFKPKVFLANYKEVEPP
        D  + + I S Q                 + S+P+  + +A P   P    S   +  +N  PV   T     H ++TR K  +FKPKV+  +    EP 
Subjt:  DPLITNSIPSPQ-----------------TLSLPTPLDSHAHP--TPCLIASGSVSNILNVPPVTGTTTKNNTHTLITRGKARVFKPKVFLANYKEVEPP

Query:  NVKEALKCDHWIQAMIDEYRALMSNDTWSLLDRPVNKKIIGCKWVFKIKRHSDGSVARYKARLVAQGFHQQADVDYTETFSPVVKPVTIRILFTLALTNG
          +EA+    W +AM +E+RALM N TWSL+  P N+  +GC+WVFK+KR+ DGSV+RYKARLVA+G+ Q    D+ ETFSPVVKP TIR++  +A++  
Subjt:  NVKEALKCDHWIQAMIDEYRALMSNDTWSLLDRPVNKKIIGCKWVFKIKRHSDGSVARYKARLVAQGFHQQADVDYTETFSPVVKPVTIRILFTLALTNG

Query:  WKLRLVDINNAFLHGLLSEEVFMSQP
        W +R +D+NNAFL+G L EEV+M QP
Subjt:  WKLRLVDINNAFLHGLLSEEVFMSQP

A0A438FJP6 Retrovirus-related Pol polyprotein from transposon TNT 1-94    e value: 3.8e-143    % identity: 46.65
Query:  LLQGD---GLYRFQMTQ------ANLSILSSQHHQSSISAPQVHNVSLS-SSLNQSVFHKCTSLEKWHNRLGHSAIPIVQHIMSMCNLDTSNKS-FHFCH
        LLQG+   GLY+F +++      + LS+ + ++  +  +A  VHN +        S FH     + WH RLGH A  IV  +++   +  S KS    C 
Subjt:  LLQGD---GLYRFQMTQ------ANLSILSSQHHQSSISAPQVHNVSLS-SSLNQSVFHKCTSLEKWHNRLGHSAIPIVQHIMSMCNLDTSNKS-FHFCH

Query:  ACAVGKSHNLPFHDSTSHYDFPLQLIVVDVWGPAYESSRNGFKYYVSFVDVFSRYTLIYFLNNKSEAFSAFLLFKTQVEKMFNRSILSLQTDNGGEFRSF
        AC +GKSHNLPF  S + Y  PLQL+V D+WGPA  +S  GF YYVSFVD +SRYT +YFL  KS+   AFL+FK Q E  F   + + QTD GGEFRS 
Subjt:  ACAVGKSHNLPFHDSTSHYDFPLQLIVVDVWGPAYESSRNGFKYYVSFVDVFSRYTLIYFLNNKSEAFSAFLLFKTQVEKMFNRSILSLQTDNGGEFRSF

Query:  IHFLKTNGITHRVTCPYTSQQNGIVERKHRHIVEMGLTLLSYASLSIKFWDDAFATAVYLINRLPTKVHHTVSPMEKLFGHKPTYVDLRTFGCLCYPSLR
          + + NGI HR++CP+TS+QNGI+ERKHRHIVE+GLTLL+ ASL +K+W DAF+TAV+LINRLPT+V     P E LF  KP Y  L+ FGCLC+P LR
Subjt:  IHFLKTNGITHRVTCPYTSQQNGIVERKHRHIVEMGLTLLSYASLSIKFWDDAFATAVYLINRLPTKVHHTVSPMEKLFGHKPTYVDLRTFGCLCYPSLR

Query:  AYNKHKLDPRSTPCLFLGYSSFHKGYKCLSSFGRMYISRHVTFDESTFPSFNFLY-PSESKS-NTLHHSVLPIVHSDPTPINFQQS-NLPPSCPLSTLPM
         YNKHKLD RS+PC FLGYSS HKGYKCL+  GRM+ISR V FDE+ FP  + L  P +  S +T+    +P+V  +  P++   S +LP S   S+  +
Subjt:  AYNKHKLDPRSTPCLFLGYSSFHKGYKCLSSFGRMYISRHVTFDESTFPSFNFLY-PSESKS-NTLHHSVLPIVHSDPTPINFQQS-NLPPSCPLSTLPM

Query:  DPLITNSIPSPQ-----------------TLSLPTPLDSHAHP--TPCLIASGSVSNILNVPPVTGTTTKNNTHTLITRGKARVFKPKVFLANYKEVEPP
        D  + + I S Q                 + S+P+  + +A P   P    S   +  +N  PV   T     H ++TR K  +FKPKV+  +    EP 
Subjt:  DPLITNSIPSPQ-----------------TLSLPTPLDSHAHP--TPCLIASGSVSNILNVPPVTGTTTKNNTHTLITRGKARVFKPKVFLANYKEVEPP

Query:  NVKEALKCDHWIQAMIDEYRALMSNDTWSLLDRPVNKKIIGCKWVFKIKRHSDGSVARYKARLVAQGFHQQADVDYTETFSPVVKPVTIRILFTLALTNG
          +EA+    W +AM +E+RALM N TWSL+  P N+  +GC+WVFK+KR+ DGSV+RYKARLVA+G+ Q    D+ ETFSPVVKP TIR++  +A++  
Subjt:  NVKEALKCDHWIQAMIDEYRALMSNDTWSLLDRPVNKKIIGCKWVFKIKRHSDGSVARYKARLVAQGFHQQADVDYTETFSPVVKPVTIRILFTLALTNG

Query:  WKLRLVDINNAFLHGLLSEEVFMSQP
        W +R +D+NNAFL+G L EEV+M QP
Subjt:  WKLRLVDINNAFLHGLLSEEVFMSQP

A0A438J431 Retrovirus-related Pol polyprotein from transposon TNT 1-94    e value: 2.2e-138    % identity: 46.53
Query:  GLYRFQMTQANLSILSSQHHQSSISAPQVHNVSLSSSLNQSVFHKCTSLEKWHNRLGHSAIPIVQHIMSMCNLDTSNK-SFHFCHACAVGKSHNLPFHDS
        GLY F  TQ  L + S +   SS  A    + +L S          +    WHNRLGH +  IV  +++ CNL   NK     C AC +GK H  PF  S
Subjt:  GLYRFQMTQANLSILSSQHHQSSISAPQVHNVSLSSSLNQSVFHKCTSLEKWHNRLGHSAIPIVQHIMSMCNLDTSNK-SFHFCHACAVGKSHNLPFHDS

Query:  TSHYDFPLQLIVVDVWGPAYESSRNGFKYYVSFVDVFSRYTLIYFLNNKSEAFSAFLLFKTQVEKMFNRSILSLQTDNGGEFRSFIHFLKTNGITHRVTC
        TS Y  PL+LI  D+WGPA   S +G +YY+ F+D +SR+T IY L +KSEAF  FL FK+QVE      I ++Q+D GGE+RSF  +L +NGI HR++C
Subjt:  TSHYDFPLQLIVVDVWGPAYESSRNGFKYYVSFVDVFSRYTLIYFLNNKSEAFSAFLLFKTQVEKMFNRSILSLQTDNGGEFRSFIHFLKTNGITHRVTC

Query:  PYTSQQNGIVERKHRHIVEMGLTLLSYASLSIKFWDDAFATAVYLINRLPTKVHHTVSPMEKLFGHKPTYVDLRTFGCLCYPSLRAYNKHKLDPRSTPCL
        PYT +QNG+ ERKHRHIVE G+ LL+ ASL  K+WD+AF T+V+LINRLPT V    SP+E LF  KP+Y  L+ FGC+CYP+LR +N HKL  RS PC 
Subjt:  PYTSQQNGIVERKHRHIVEMGLTLLSYASLSIKFWDDAFATAVYLINRLPTKVHHTVSPMEKLFGHKPTYVDLRTFGCLCYPSLRAYNKHKLDPRSTPCL

Query:  FLGYSSFHKGYKCLSSFGRMYISRHVTFDESTFPSFNFLYPSESKSNTLHHSVLPIVHSDPTPINFQQSNLP--PSCPLSTLPMDPLITNSIPSPQTLSL
        FLGYS   KGYKCLS  G + ISR V FDE  FP   F      K  T             + ++   ++LP   S PL  LP     + S P+      
Subjt:  FLGYSSFHKGYKCLSSFGRMYISRHVTFDESTFPSFNFLYPSESKSNTLHHSVLPIVHSDPTPINFQQSNLP--PSCPLSTLPMDPLITNSIPSPQTLSL

Query:  PTPLDSHAHPTPCLIASGSVSNILNVPPVTGTTTKNNTHTLITRGKARVFKPKVFLANYKEVEPPNVKEALKCDHWIQAMIDEYRALMSNDTWSLLDRPV
                   P +  + S  N+ + PP   +     TH +ITR K  +FKPK +L +     P +V EAL+  HW QAM DEY AL+ N+TW L+  P 
Subjt:  PTPLDSHAHPTPCLIASGSVSNILNVPPVTGTTTKNNTHTLITRGKARVFKPKVFLANYKEVEPPNVKEALKCDHWIQAMIDEYRALMSNDTWSLLDRPV

Query:  NKKIIGCKWVFKIKRHSDGSVARYKARLVAQGFHQQADVDYTETFSPVVKPVTIRILFTLALTNGWKLRLVDINNAFLHGLLSEEVFMSQP
        + K+IGCKWVFK+K + DG++ +YKARLVA+GFHQ A  D+ ETFS VVKP TIRI+ T+AL   WK+R +D+NNAFL+G L E++FM QP
Subjt:  NKKIIGCKWVFKIKRHSDGSVARYKARLVAQGFHQQADVDYTETFSPVVKPVTIRILFTLALTNGWKLRLVDINNAFLHGLLSEEVFMSQP

A0A438K147 Retrovirus-related Pol polyprotein from transposon TNT 1-94    e value: 8.0e-141    % identity: 46.91
Query:  DGLYRFQMTQANLSILSSQHHQSSISAPQVHNVSLSSSLNQSVFHKCTSLEKWHNRLGHSAIPIVQHIMSMCNLDTSNK-SFHFCHACAVGKSHNLPFHD
        DGLY F     + S L+ +  QS   +P V   S SS +   +    ++ + WH RLG  +   +++++S CN+   NK   +FC +C +GK H  PF  
Subjt:  DGLYRFQMTQANLSILSSQHHQSSISAPQVHNVSLSSSLNQSVFHKCTSLEKWHNRLGHSAIPIVQHIMSMCNLDTSNK-SFHFCHACAVGKSHNLPFHD

Query:  STSHYDFPLQLIVVDVWGPAYESSRNGFKYYVSFVDVFSRYTLIYFLNNKSEAFSAFLLFKTQVEKMFNRSILSLQTDNGGEFRSFIHFLKTNGITHRVT
        S + Y  PL+LI  D+WGPA   S +G++YY+ FVD FSR++ I+ L NKSEA   F+ FKTQVE  F+  I SLQTD GGEFR+F  +L  NGI HRV+
Subjt:  STSHYDFPLQLIVVDVWGPAYESSRNGFKYYVSFVDVFSRYTLIYFLNNKSEAFSAFLLFKTQVEKMFNRSILSLQTDNGGEFRSFIHFLKTNGITHRVT

Query:  CPYTSQQNGIVERKHRHIVEMGLTLLSYASLSIKFWDDAFATAVYLINRLPTKVHHTVSPMEKLFGHKPTYVDLRTFGCLCYPSLRAYNKHKLDPRSTPC
        CP+T QQNG+ ERKHR IVE GLTLL   SL +KFWD++F T VYL NRLPT V H   P+E LF   P Y  L+ FGC C+P+LR YN HKL  RS  C
Subjt:  CPYTSQQNGIVERKHRHIVEMGLTLLSYASLSIKFWDDAFATAVYLINRLPTKVHHTVSPMEKLFGHKPTYVDLRTFGCLCYPSLRAYNKHKLDPRSTPC

Query:  LFLGYSSFHKGYKCLSSFGRMYISRHVTFDESTFPSFNFLYPSESKSNTLHHSVLPIVHSDPTPINFQQSNLPPSCPLSTL----PMDPLITNSIPSP--
         FLGYS  HKGYKC+SS GR+YISR V F+E++FP    +  S    +T+  S   +  S   P+        P+ P+S+      MD +++    +P  
Subjt:  LFLGYSSFHKGYKCLSSFGRMYISRHVTFDESTFPSFNFLYPSESKSNTLHHSVLPIVHSDPTPINFQQSNLPPSCPLSTL----PMDPLITNSIPSP--

Query:  -QTLSLPTPLDSHAHPTPCLIASGSVSNILNVPPVTGTTTK--NNTHTLITRGKARVFKPKVFLANYKEVEPPNVKEALKCDHWIQAMIDEYRALMSNDT
          T   P  + S+   TP      S+++      VT T  K  +NTH +ITR K+ + KPK+F+A  +  EP +V  AL+ D W +AM+ EY AL  N+T
Subjt:  -QTLSLPTPLDSHAHPTPCLIASGSVSNILNVPPVTGTTTK--NNTHTLITRGKARVFKPKVFLANYKEVEPPNVKEALKCDHWIQAMIDEYRALMSNDT

Query:  WSLLDRPVNKKIIGCKWVFKIKRHSDGSVARYKARLVAQGFHQQADVDYTETFSPVVKPVTIRILFTLALTNGWKLRLVDINNAFLHGLLSEEVFMSQP
        WSL+  P  ++ IGCKWV+K K + DG+V +YKARLVA+GFHQQA  D+TETFSPVVKP TIR++FT+AL+  W ++ +D+NNAFL+G L EEVFM QP
Subjt:  WSLLDRPVNKKIIGCKWVFKIKRHSDGSVARYKARLVAQGFHQQADVDYTETFSPVVKPVTIRILFTLALTNGWKLRLVDINNAFLHGLLSEEVFMSQP

A5BFT3 Integrase catalytic domain-containing protein    e value: 1.6e-141    % identity: 46.08
Query:  DGLYRFQMTQANLSILSSQHHQSSISAPQVHNVSLSSSLNQSVFHKCTSLEKWHNRLGHSAIPIVQHIMSMCNLDTSNK-SFHFCHACAVGKSHNLPFHD
        DGLY F  +   L    S     S+ A    +   ++SL+       ++ + WH RLGH +   +++++S CN+   NK   +FC +C +GK H  PF  
Subjt:  DGLYRFQMTQANLSILSSQHHQSSISAPQVHNVSLSSSLNQSVFHKCTSLEKWHNRLGHSAIPIVQHIMSMCNLDTSNK-SFHFCHACAVGKSHNLPFHD

Query:  STSHYDFPLQLIVVDVWGPAYESSRNGFKYYVSFVDVFSRYTLIYFLNNKSEAFSAFLLFKTQVEKMFNRSILSLQTDNGGEFRSFIHFLKTNGITHRVT
        S + Y  PL+LI +D+WGP    S +G++YY+ FVD FSR++ I+ L NKSEA   F+ FKTQVE  F+  I SLQTD GGEFR+F  +L  NGI HRV+
Subjt:  STSHYDFPLQLIVVDVWGPAYESSRNGFKYYVSFVDVFSRYTLIYFLNNKSEAFSAFLLFKTQVEKMFNRSILSLQTDNGGEFRSFIHFLKTNGITHRVT

Query:  CPYTSQQNGIVERKHRHIVEMGLTLLSYASLSIKFWDDAFATAVYLINRLPTKVHHTVSPMEKLFGHKPTYVDLRTFGCLCYPSLRAYNKHKLDPRSTPC
        CP+T QQNG+ ERKHR IVE GLTLL  ASL +KFWD++F T VYL NRLPT + H   P+E LF   P Y  L+ FGC C+P+LR YN HKL  RS  C
Subjt:  CPYTSQQNGIVERKHRHIVEMGLTLLSYASLSIKFWDDAFATAVYLINRLPTKVHHTVSPMEKLFGHKPTYVDLRTFGCLCYPSLRAYNKHKLDPRSTPC

Query:  LFLGYSSFHKGYKCLSSFGRMYISRHVTFDESTFPSFNFLYPSESKSNTLHHSVLPIVHSDPTPINFQQSNLPPSCPLSTL----PMDPLITNSIPSP--
         FLGYS  HKGYKC+SS GR+YIS  V F+E++FP    +  S    +T+  S   +  S   P+        P+ P+S+      MD +++    +P  
Subjt:  LFLGYSSFHKGYKCLSSFGRMYISRHVTFDESTFPSFNFLYPSESKSNTLHHSVLPIVHSDPTPINFQQSNLPPSCPLSTL----PMDPLITNSIPSP--

Query:  -QTLSLPTPLDSHAHPTPCLIASGSVSNILNVPPVTGTTTK--NNTHTLITRGKARVFKPKVFLANYKEVEPPNVKEALKCDHWIQAMIDEYRALMSNDT
          T   P  + S+   TP      S+++      VT T  K  +NTH +ITR K+ + KPK+F+A  +  EP +V  AL+ D W +AM+ EY AL  N+T
Subjt:  -QTLSLPTPLDSHAHPTPCLIASGSVSNILNVPPVTGTTTK--NNTHTLITRGKARVFKPKVFLANYKEVEPPNVKEALKCDHWIQAMIDEYRALMSNDT

Query:  WSLLDRPVNKKIIGCKWVFKIKRHSDGSVARYKARLVAQGFHQQADVDYTETFSPVVKPVTIRILFTLALTNGWKLRLVDINNAFLHGLLSEEVFMSQP
        WSL+  P  ++ IGCKWV+K K + DG+V +YKARLVA+GFHQQA  D+TETFSPVVKP T+R++FT+AL+  W ++ +D+NNAFL+G L EEVFM QP
Subjt:  WSLLDRPVNKKIIGCKWVFKIKRHSDGSVARYKARLVAQGFHQQADVDYTETFSPVVKPVTIRILFTLALTNGWKLRLVDINNAFLHGLLSEEVFMSQP

SwissProt top hits (e value, % identity, alignment)
P04146 Copia protein    e value: 5.1e-52    % identity: 28.15
Query:  WHNRLGHSA------IPIVQHIMSMCNLDTSNKSFHFCHACAVGKSHNLPFHD--STSHYDFPLQLIVVDVWGPAYESSRNGFKYYVSFVDVFSRYTLIY
        WH R GH +      I           L+    S   C  C  GK   LPF      +H   PL ++  DV GP    + +   Y+V FVD F+ Y + Y
Subjt:  WHNRLGHSA------IPIVQHIMSMCNLDTSNKSFHFCHACAVGKSHNLPFHD--STSHYDFPLQLIVVDVWGPAYESSRNGFKYYVSFVDVFSRYTLIY

Query:  FLNNKSEAFSAFLLFKTQVEKMFNRSILSLQTDNGGEFRS--FIHFLKTNGITHRVTCPYTSQQNGIVERKHRHIVEMGLTLLSYASLSIKFWDDAFATA
         +  KS+ FS F  F  + E  FN  ++ L  DNG E+ S     F    GI++ +T P+T Q NG+ ER  R I E   T++S A L   FW +A  TA
Subjt:  FLNNKSEAFSAFLLFKTQVEKMFNRSILSLQTDNGGEFRS--FIHFLKTNGITHRVTCPYTSQQNGIVERKHRHIVEMGLTLLSYASLSIKFWDDAFATA

Query:  VYLINRLPTK--VHHTVSPMEKLFGHKPTYVDLRTFGCLCYPSLRAYNKH-KLDPRSTPCLFLGYSSFHKGYKCLSSFGRMYI-SRHVTFDESTFPS---
         YLINR+P++  V  + +P E     KP    LR FG   Y  ++  NK  K D +S   +F+GY     G+K   +    +I +R V  DE+   +   
Subjt:  VYLINRLPTK--VHHTVSPMEKLFGHKPTYVDLRTFGCLCYPSLRAYNKH-KLDPRSTPCLFLGYSSFHKGYKCLSSFGRMYI-SRHVTFDESTFPS---

Query:  --FNFLYPSESK----------SNTLHHSVLPIVHSDPTPINF-------QQSNLPPSCPLSTLPMDPLITNSIPSPQTL-------------SLPTPLD
          F  ++  +SK          S  +  +  P    +   I F       +  N P           P  +    + Q L             S     D
Subjt:  --FNFLYPSESK----------SNTLHHSVLPIVHSDPTPINF-------QQSNLPPSCPLSTLPMDPLITNSIPSPQTL-------------SLPTPLD

Query:  SHAHPT-----PCLIASGSVSNILNVPPVTGTTTKNNTHTLITRGKARVFKPKVFLANYKEVEPPNVKEALKC--------------------DHWIQAM
         H + +     P        +  L    +   T  +    +  R +    KP++   +Y E +    K  L                        W +A+
Subjt:  SHAHPT-----PCLIASGSVSNILNVPPVTGTTTKNNTHTLITRGKARVFKPKVFLANYKEVEPPNVKEALKC--------------------DHWIQAM

Query:  IDEYRALMSNDTWSLLDRPVNKKIIGCKWVFKIKRHSDGSVARYKARLVAQGFHQQADVDYTETFSPVVKPVTIRILFTLALTNGWKLRLVDINNAFLHG
          E  A   N+TW++  RP NK I+  +WVF +K +  G+  RYKARLVA+GF Q+  +DY ETF+PV +  + R + +L +    K+  +D+  AFL+G
Subjt:  IDEYRALMSNDTWSLLDRPVNKKIIGCKWVFKIKRHSDGSVARYKARLVAQGFHQQADVDYTETFSPVVKPVTIRILFTLALTNGWKLRLVDINNAFLHG

Query:  LLSEEVFMSQP
         L EE++M  P
Subjt:  LLSEEVFMSQP

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94    e value: 9.6e-67    % identity: 31.1
Query:  SLEKWHNRLGHSAIPIVQHIMSMCNLD-TSNKSFHFCHACAVGKSHNLPFHDSTSHYDFPLQLIVVDVWGPAYESSRNGFKYYVSFVDVFSRYTLIYFLN
        S++ WH R+GH +   +Q +     +      +   C  C  GK H + F  S+      L L+  DV GP    S  G KY+V+F+D  SR   +Y L 
Subjt:  SLEKWHNRLGHSAIPIVQHIMSMCNLD-TSNKSFHFCHACAVGKSHNLPFHDSTSHYDFPLQLIVVDVWGPAYESSRNGFKYYVSFVDVFSRYTLIYFLN

Query:  NKSEAFSAFLLFKTQVEKMFNRSILSLQTDNGGEF--RSFIHFLKTNGITHRVTCPYTSQQNGIVERKHRHIVEMGLTLLSYASLSIKFWDDAFATAVYL
         K + F  F  F   VE+   R +  L++DNGGE+  R F  +  ++GI H  T P T Q NG+ ER +R IVE   ++L  A L   FW +A  TA YL
Subjt:  NKSEAFSAFLLFKTQVEKMFNRSILSLQTDNGGEF--RSFIHFLKTNGITHRVTCPYTSQQNGIVERKHRHIVEMGLTLLSYASLSIKFWDDAFATAVYL

Query:  INRLPTKVHHTVSPMEKLFGHKPTYVDLRTFGCLCYPSLRAYNKHKLDPRSTPCLFLGYSSFHKGYKCLSSFGRMYI-SRHVTFDESTFPSFNFLYPSES
        INR P+       P       + +Y  L+ FGC  +  +    + KLD +S PC+F+GY     GY+      +  I SR V F ES       +  +  
Subjt:  INRLPTKVHHTVSPMEKLFGHKPTYVDLRTFGCLCYPSLRAYNKHKLDPRSTPCLFLGYSSFHKGYKCLSSFGRMYI-SRHVTFDESTFPSFNFLYPSES

Query:  KSNTLHHSVLPIVHSDPTPINFQQSNLPPSCPLSTLPMDPLITNSIPSPQTLSLPTPLDSHAHPTPCLIASGSVSNILNVPPVTGTTTKNNTHTLITRG-
         S  + + ++P   + P+      SN P S   +T   D +        + +     LD                    V  V   T     H  + R  
Subjt:  KSNTLHHSVLPIVHSDPTPINFQQSNLPPSCPLSTLPMDPLITNSIPSPQTLSLPTPLDSHAHPTPCLIASGSVSNILNVPPVTGTTTKNNTHTLITRG-

Query:  ----KARVFKPKVFLANYKEVEPPNVKEAL---KCDHWIQAMIDEYRALMSNDTWSLLDRPVNKKIIGCKWVFKIKRHSDGSVARYKARLVAQGFHQQAD
            ++R +    ++    + EP ++KE L   + +  ++AM +E  +L  N T+ L++ P  K+ + CKWVFK+K+  D  + RYKARLV +GF Q+  
Subjt:  ----KARVFKPKVFLANYKEVEPPNVKEAL---KCDHWIQAMIDEYRALMSNDTWSLLDRPVNKKIIGCKWVFKIKRHSDGSVARYKARLVAQGFHQQAD

Query:  VDYTETFSPVVKPVTIRILFTLALTNGWKLRLVDINNAFLHGLLSEEVFMSQP
        +D+ E FSPVVK  +IR + +LA +   ++  +D+  AFLHG L EE++M QP
Subjt:  VDYTETFSPVVKPVTIRILFTLALTNGWKLRLVDINNAFLHGLLSEEVFMSQP

P92520 Uncharacterized mitochondrial protein AtMg00820    e value: 9.1e-25    % identity: 49.6
Query:  LITRGKARVFK--PKVFLANYKEV--EPPNVKEALKCDHWIQAMIDEYRALMSNDTWSLLDRPVNKKIIGCKWVFKIKRHSDGSVARYKARLVAQGFHQQ
        ++TR KA + K  PK  L     +  EP +V  ALK   W QAM +E  AL  N TW L+  PVN+ I+GCKWVFK K HSDG++ R KARLVA+GFHQ+
Subjt:  LITRGKARVFK--PKVFLANYKEV--EPPNVKEALKCDHWIQAMIDEYRALMSNDTWSLLDRPVNKKIIGCKWVFKIKRHSDGSVARYKARLVAQGFHQQ

Query:  ADVDYTETFSPVVKPVTIRILFTLA
          + + ET+SPVV+  TIR +  +A
Subjt:  ADVDYTETFSPVVKPVTIRILFTLA

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1    e value: 4.1e-110    % identity: 38.81
Query:  VSLSSSLNQSVFHKCTSLEKWHNRLGHSAIPIVQHIMSMCNLDTSNKSFHF--CHACAVGKSHNLPFHDSTSHYDFPLQLIVVDVWGPAYESSRNGFKYY
        VSL +S +    H       WH RLGH A  I+  ++S  +L   N S  F  C  C + KS+ +PF  ST +   PL+ I  DVW     S  N ++YY
Subjt:  VSLSSSLNQSVFHKCTSLEKWHNRLGHSAIPIVQHIMSMCNLDTSNKSFHF--CHACAVGKSHNLPFHDSTSHYDFPLQLIVVDVWGPAYESSRNGFKYY

Query:  VSFVDVFSRYTLIYFLNNKSEAFSAFLLFKTQVEKMFNRSILSLQTDNGGEFRSFIHFLKTNGITHRVTCPYTSQQNGIVERKHRHIVEMGLTLLSYASL
        V FVD F+RYT +Y L  KS+    F+ FK  +E  F   I +  +DNGGEF +   +   +GI+H  + P+T + NG+ ERKHRHIVE GLTLLS+AS+
Subjt:  VSFVDVFSRYTLIYFLNNKSEAFSAFLLFKTQVEKMFNRSILSLQTDNGGEFRSFIHFLKTNGITHRVTCPYTSQQNGIVERKHRHIVEMGLTLLSYASL

Query:  SIKFWDDAFATAVYLINRLPTKVHHTVSPMEKLFGHKPTYVDLRTFGCLCYPSLRAYNKHKLDPRSTPCLFLGYSSFHKGYKCLS-SFGRMYISRHVTFD
           +W  AFA AVYLINRLPT +    SP +KLFG  P Y  LR FGC CYP LR YN+HKLD +S  C+FLGYS     Y CL     R+YISRHV FD
Subjt:  SIKFWDDAFATAVYLINRLPTKVHHTVSPMEKLFGHKPTYVDLRTFGCLCYPSLRAYNKHKLDPRSTPCLFLGYSSFHKGYKCLS-SFGRMYISRHVTFD

Query:  ESTFPSFNFL--------YPSESKSNTLHHSVLPI---VHSDPTPINFQQSNLPPSCP--------LSTLPMDPLITNSIPS-----------PQTLSLP
        E+ FP  N+L           ES      H+ LP    V   P+  +   +  PPS P        +S+  +D   ++S PS           PQ  + P
Subjt:  ESTFPSFNFL--------YPSESKSNTLHHSVLPI---VHSDPTPINFQQSNLPPSCP--------LSTLPMDPLITNSIPS-----------PQTLSLP

Query:  TPLDSHAH--------------------------------PTPCLIASGSVSN------ILNVPPVTGTTTKN------NTHTLITRGKARVFKP----K
        T   +  H                                P+P   AS S ++      +++ PP       N      NTH++ TR KA + KP     
Subjt:  TPLDSHAH--------------------------------PTPCLIASGSVSN------ILNVPPVTGTTTKN------NTHTLITRGKARVFKP----K

Query:  VFLANYKEVEPPNVKEALKCDHWIQAMIDEYRALMSNDTWSLLDRPVNK-KIIGCKWVFKIKRHSDGSVARYKARLVAQGFHQQADVDYTETFSPVVKPV
        + ++   E EP    +ALK + W  AM  E  A + N TW L+  P +   I+GC+W+F  K +SDGS+ RYKARLVA+G++Q+  +DY ETFSPV+K  
Subjt:  VFLANYKEVEPPNVKEALKCDHWIQAMIDEYRALMSNDTWSLLDRPVNK-KIIGCKWVFKIKRHSDGSVARYKARLVAQGFHQQADVDYTETFSPVVKPV

Query:  TIRILFTLALTNGWKLRLVDINNAFLHGLLSEEVFMSQP
        +IRI+  +A+   W +R +D+NNAFL G L+++V+MSQP
Subjt:  TIRILFTLALTNGWKLRLVDINNAFLHGLLSEEVFMSQP

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2    e value: 7.7e-109    % identity: 39.65
Query:  WHNRLGHSAIPIVQHIMSMCNLDTSNKSFHF--CHACAVGKSHNLPFHDSTSHYDFPLQLIVVDVWGPAYESSRNGFKYYVSFVDVFSRYTLIYFLNNKS
        WH+RLGH ++ I+  ++S  +L   N S     C  C + KSH +PF +ST     PL+ I  DVW     S  N ++YYV FVD F+RYT +Y L  KS
Subjt:  WHNRLGHSAIPIVQHIMSMCNLDTSNKSFHF--CHACAVGKSHNLPFHDSTSHYDFPLQLIVVDVWGPAYESSRNGFKYYVSFVDVFSRYTLIYFLNNKS

Query:  EAFSAFLLFKTQVEKMFNRSILSLQTDNGGEFRSFIHFLKTNGITHRVTCPYTSQQNGIVERKHRHIVEMGLTLLSYASLSIKFWDDAFATAVYLINRLP
        +    F++FK+ VE  F   I +L +DNGGEF     +L  +GI+H  + P+T + NG+ ERKHRHIVEMGLTLLS+AS+   +W  AF+ AVYLINRLP
Subjt:  EAFSAFLLFKTQVEKMFNRSILSLQTDNGGEFRSFIHFLKTNGITHRVTCPYTSQQNGIVERKHRHIVEMGLTLLSYASLSIKFWDDAFATAVYLINRLP

Query:  TKVHHTVSPMEKLFGHKPTYVDLRTFGCLCYPSLRAYNKHKLDPRSTPCLFLGYSSFHKGYKCLS-SFGRMYISRHVTFDESTFP--------SFNFLYP
        T +    SP +KLFG  P Y  L+ FGC CYP LR YN+HKL+ +S  C F+GYS     Y CL    GR+Y SRHV FDE  FP        S +    
Subjt:  TKVHHTVSPMEKLFGHKPTYVDLRTFGCLCYPSLRAYNKHKLDPRSTPCLFLGYSSFHKGYKCLS-SFGRMYISRHVTFDESTFP--------SFNFLYP

Query:  SESKSNTLHHSVLPIV--------------------HSDPTPI---NFQQSNLP-----------PSCPLSTLPM--------------DPLITN---SI
        S+S  N   H+ LP                       S P+P+       SNLP           P+ P    P                P++ N   + 
Subjt:  SESKSNTLHHSVLPIV--------------------HSDPTPI---NFQQSNLP-----------PSCPLSTLPM--------------DPLITN---SI

Query:  PSP----QTLSLP-TPLDSHAHPTPCLI-------ASGSVSN-----ILNVPPVTGTTTKN--NTHTLITRGKARVFKPKVFLANYKEV----EPPNVKE
        PSP    Q   LP +P+ S   PTP          +S S S      +L  PP+     +   NTH++ TR K  + KP    +    +    EP    +
Subjt:  PSP----QTLSLP-TPLDSHAHPTPCLI-------ASGSVSN-----ILNVPPVTGTTTKN--NTHTLITRGKARVFKPKVFLANYKEV----EPPNVKE

Query:  ALKCDHWIQAMIDEYRALMSNDTWSLL-DRPVNKKIIGCKWVFKIKRHSDGSVARYKARLVAQGFHQQADVDYTETFSPVVKPVTIRILFTLALTNGWKL
        A+K D W QAM  E  A + N TW L+   P +  I+GC+W+F  K +SDGS+ RYKARLVA+G++Q+  +DY ETFSPV+K  +IRI+  +A+   W +
Subjt:  ALKCDHWIQAMIDEYRALMSNDTWSLL-DRPVNKKIIGCKWVFKIKRHSDGSVARYKARLVAQGFHQQADVDYTETFSPVVKPVTIRILFTLALTNGWKL

Query:  RLVDINNAFLHGLLSEEVFMSQP
        R +D+NNAFL G L++EV+MSQP
Subjt:  RLVDINNAFLHGLLSEEVFMSQP

Arabidopsis top hits (e value, % identity, alignment)
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8    e value: 9.6e-30    % identity: 46.21
Query:  KEVEPPNVKEALKCDHWIQAMIDEYRALMSNDTWSLLDRPVNKKIIGCKWVFKIKRHSDGSVARYKARLVAQGFHQQADVDYTETFSPVVKPVTIRILFT
        K  EP    EA +   W  AM DE  A+ +  TW +   P NKK IGCKWV+KIK +SDG++ RYKARLVA+G+ QQ  +D+ ETFSPV K  +++++  
Subjt:  KEVEPPNVKEALKCDHWIQAMIDEYRALMSNDTWSLLDRPVNKKIIGCKWVFKIKRHSDGSVARYKARLVAQGFHQQADVDYTETFSPVVKPVTIRILFT

Query:  LALTNGWKLRLVDINNAFLHGLLSEEVFMSQP
        ++    + L  +DI+NAFL+G L EE++M  P
Subjt:  LALTNGWKLRLVDINNAFLHGLLSEEVFMSQP

ATMG00710.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein    e value: 4.8e-05    % identity: 36.14
Query:  HRHIVEMGLTLLSYASLSIKFWDDAFATAVYLINRLPTKVHHTVSPMEKLFGHKPTYVDLRTFGCLCYPSLRAYNKHKLDPRS
        +R I+E   ++L    L   F  DA  TAV++IN+ P+   +   P E  F   PTY  LR FGC+ Y      ++ KL PR+
Subjt:  HRHIVEMGLTLLSYASLSIKFWDDAFATAVYLINRLPTKVHHTVSPMEKLFGHKPTYVDLRTFGCLCYPSLRAYNKHKLDPRS

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)    e value: 6.4e-26    % identity: 49.6
Query:  LITRGKARVFK--PKVFLANYKEV--EPPNVKEALKCDHWIQAMIDEYRALMSNDTWSLLDRPVNKKIIGCKWVFKIKRHSDGSVARYKARLVAQGFHQQ
        ++TR KA + K  PK  L     +  EP +V  ALK   W QAM +E  AL  N TW L+  PVN+ I+GCKWVFK K HSDG++ R KARLVA+GFHQ+
Subjt:  LITRGKARVFK--PKVFLANYKEV--EPPNVKEALKCDHWIQAMIDEYRALMSNDTWSLLDRPVNKKIIGCKWVFKIKRHSDGSVARYKARLVAQGFHQQ

Query:  ADVDYTETFSPVVKPVTIRILFTLA
          + + ET+SPVV+  TIR +  +A
Subjt:  ADVDYTETFSPVVKPVTIRILFTLA


Sequences
CDS sequence
ATGGGCAAAACTCTTCTCCAAGGTGATGGACTTTATAGGTTTCAAATGACACAGGCCAATCTTTCCATTCTTTCTTCACAACATCATCAGTCTTCTATCTCTGCACCACA
AGTACACAATGTCTCTCTATCCTCTTCATTAAATCAATCTGTATTCCATAAATGTACTTCACTTGAAAAATGGCACAATAGGCTAGGGCATTCCGCTATACCTATTGTTC
AACATATTATGAGTATGTGTAATCTTGATACTTCAAATAAATCCTTTCATTTCTGTCATGCTTGTGCGGTTGGGAAGTCCCATAATCTTCCATTTCATGATTCCACATCT
CACTATGATTTTCCTTTACAATTAATTGTTGTTGATGTATGGGGTCCCGCTTATGAGTCATCTAGGAATGGTTTCAAATATTATGTGAGTTTTGTTGATGTATTCTCACG
GTATACCTTGATCTATTTCCTTAATAACAAATCAGAAGCTTTTTCTGCCTTTTTATTGTTTAAAACTCAAGTAGAAAAGATGTTCAATCGTTCTATCCTTAGCTTACAAA
CTGATAATGGGGGAGAATTTCGTTCTTTCATTCATTTTCTCAAAACAAATGGTATCACTCATAGAGTCACATGCCCCTACACTTCCCAACAAAATGGTATAGTCGAGAGA
AAGCACCGCCACATAGTTGAAATGGGTCTTACACTCTTATCCTATGCTTCTCTTTCCATTAAATTCTGGGATGATGCATTTGCTACGGCTGTGTACTTAATTAATAGATT
ACCTACAAAAGTCCATCACACCGTTTCTCCCATGGAAAAACTGTTTGGTCATAAACCAACCTATGTAGATCTCAGGACTTTTGGATGTCTTTGTTATCCATCATTAAGAG
CATACAATAAACATAAACTTGACCCTCGGTCTACTCCATGTCTCTTTCTCGGTTATAGCTCTTTTCACAAGGGATATAAGTGTCTTTCCTCATTTGGAAGAATGTATATA
TCGAGGCATGTAACATTTGATGAATCAACTTTTCCTTCTTTCAATTTTCTTTATCCTTCAGAGTCTAAGTCCAATACATTGCACCACAGTGTATTACCCATTGTACATTC
AGATCCCACGCCTATCAACTTCCAACAGTCAAATCTACCACCCTCTTGTCCTTTATCTACTTTACCTATGGACCCCCTTATCACCAACTCTATCCCATCACCTCAAACTT
TATCTCTACCCACTCCCCTAGACTCACACGCCCATCCTACACCTTGCCTGATTGCTTCTGGTAGTGTATCCAATATTTTGAATGTTCCACCAGTCACTGGGACGACAACC
AAAAATAATACTCATACTTTGATTACTCGAGGGAAAGCTAGAGTTTTTAAGCCAAAGGTCTTCTTGGCTAATTATAAAGAGGTTGAACCTCCAAACGTCAAGGAAGCTCT
CAAATGTGACCATTGGATCCAAGCCATGATAGATGAATATAGAGCTTTAATGAGCAACGATACTTGGTCTTTGCTGGATAGGCCAGTTAACAAGAAGATTATCGGATGTA
AATGGGTGTTCAAGATCAAAAGGCACTCTGATGGATCGGTTGCAAGATATAAAGCACGATTAGTTGCCCAGGGATTTCACCAGCAAGCTGATGTTGATTATACAGAGACG
TTTAGTCCTGTTGTCAAGCCCGTCACTATTCGAATCCTTTTCACATTAGCACTGACGAATGGCTGGAAATTGAGGCTGGTGGATATCAATAATGCATTCCTTCATGGTTT
GCTATCTGAGGAAGTTTTTATGAGTCAACCCCTGGCATGGTAA
mRNA sequence
ACTTTGTTAAGATGGTATCAGAGCGTTGAAAAGTCTAATGACTAAGAAAGAATTTTTTTTCCGTAATGATGAGCTCGTCCACCTCAGAAGAGGACCAAATGTTGGTATCT
CTCTCGAAACTTGTTAATCCCGGAAGTACACCAAAGCTCGATGAGGAAAATTTCCTTCTATGGAAATTTCAGATTCTTATTACGTTGCGAGACCATGGACTGCAACACTA
CATTGAGGAAGGTTGTGAAGCCCCTGTGAAATACCTTCAATCAACTCACGAATCCTCTTTAGCAACATTGATTAATCCTGATTACGACAAATGGGTGCGACAAGACAATC
TTATTACGACATGGCTTCTAGGAGCAATGTCTGCATCAATTGTTGGTGAATTGCTCGACTGCAAGACTTCTCGCAAGGATCTGAATATGATCCTACGGTATCAGTATTAG
CTGGTAGTGATGAGACTCCTTCCTTACAGAAAGTTTATATAATGCTTCTTACCCAAGAGAGTAGAATACAACGGCATACCAGTGTTACTACTACATCTGATCCCACATTA
CCTTAAGTGAATCTAACACAATCAAAGGTTCAACAATCTCCACATCAGTCGACTTCGTTTGCTGATAATAGAAGATCAAAGCCTCAAGGTGGACAGAATCACGGGCAAAA
TCATGGACAAGGGAGTCAAGATTACAGGTCTAATCGTCGAATTGGAATAATCAGAAACCACAGTGCCAATTATGTGGCAAATTTGGGCACACTGCACTTCGGTGCTATTT
TCGGTTTGAAAGATCATATCAAGGTCCAAATTCATCTTCCTCTAGTTCTTCGACATATCATCAACAACAATATAATCATCCATCAAATTTCGATCCCTCTATACCACATC
AACAACAACAGGTTGCTGCCTATATTCTGAGTCATGAGATGAATAAGGAAAATCAATGGTTTCCTAATTCTGGGGCCTCGAATCATGTAACAAATGATGTGAACAATTTG
TCCTTTGAACAAAATACAATGGTGATAGCAGAGTCCATATTGGAAATGGTATAGGTTTGAACATTCAAAATATTGGTACTTCCTATCTGAAATCTTCCTCTCACAATGTC
TTTTCACTTAATAATTTTCTTCATATCTCACATATTACCAAAAATCTTATAAGTGTGAGTCAATTTGTTAAAGACAATAATGTTTTCTTGGAATTTCACCCCTTTACTTG
CTTTGTGAAGGACATTACTATGGGCAAAACTCTTCTCCAAGGTGATGGACTTTATAGGTTTCAAATGACACAGGCCAATCTTTCCATTCTTTCTTCACAACATCATCAGT
CTTCTATCTCTGCACCACAAGTACACAATGTCTCTCTATCCTCTTCATTAAATCAATCTGTATTCCATAAATGTACTTCACTTGAAAAATGGCACAATAGGCTAGGGCAT
TCCGCTATACCTATTGTTCAACATATTATGAGTATGTGTAATCTTGATACTTCAAATAAATCCTTTCATTTCTGTCATGCTTGTGCGGTTGGGAAGTCCCATAATCTTCC
ATTTCATGATTCCACATCTCACTATGATTTTCCTTTACAATTAATTGTTGTTGATGTATGGGGTCCCGCTTATGAGTCATCTAGGAATGGTTTCAAATATTATGTGAGTT
TTGTTGATGTATTCTCACGGTATACCTTGATCTATTTCCTTAATAACAAATCAGAAGCTTTTTCTGCCTTTTTATTGTTTAAAACTCAAGTAGAAAAGATGTTCAATCGT
TCTATCCTTAGCTTACAAACTGATAATGGGGGAGAATTTCGTTCTTTCATTCATTTTCTCAAAACAAATGGTATCACTCATAGAGTCACATGCCCCTACACTTCCCAACA
AAATGGTATAGTCGAGAGAAAGCACCGCCACATAGTTGAAATGGGTCTTACACTCTTATCCTATGCTTCTCTTTCCATTAAATTCTGGGATGATGCATTTGCTACGGCTG
TGTACTTAATTAATAGATTACCTACAAAAGTCCATCACACCGTTTCTCCCATGGAAAAACTGTTTGGTCATAAACCAACCTATGTAGATCTCAGGACTTTTGGATGTCTT
TGTTATCCATCATTAAGAGCATACAATAAACATAAACTTGACCCTCGGTCTACTCCATGTCTCTTTCTCGGTTATAGCTCTTTTCACAAGGGATATAAGTGTCTTTCCTC
ATTTGGAAGAATGTATATATCGAGGCATGTAACATTTGATGAATCAACTTTTCCTTCTTTCAATTTTCTTTATCCTTCAGAGTCTAAGTCCAATACATTGCACCACAGTG
TATTACCCATTGTACATTCAGATCCCACGCCTATCAACTTCCAACAGTCAAATCTACCACCCTCTTGTCCTTTATCTACTTTACCTATGGACCCCCTTATCACCAACTCT
ATCCCATCACCTCAAACTTTATCTCTACCCACTCCCCTAGACTCACACGCCCATCCTACACCTTGCCTGATTGCTTCTGGTAGTGTATCCAATATTTTGAATGTTCCACC
AGTCACTGGGACGACAACCAAAAATAATACTCATACTTTGATTACTCGAGGGAAAGCTAGAGTTTTTAAGCCAAAGGTCTTCTTGGCTAATTATAAAGAGGTTGAACCTC
CAAACGTCAAGGAAGCTCTCAAATGTGACCATTGGATCCAAGCCATGATAGATGAATATAGAGCTTTAATGAGCAACGATACTTGGTCTTTGCTGGATAGGCCAGTTAAC
AAGAAGATTATCGGATGTAAATGGGTGTTCAAGATCAAAAGGCACTCTGATGGATCGGTTGCAAGATATAAAGCACGATTAGTTGCCCAGGGATTTCACCAGCAAGCTGA
TGTTGATTATACAGAGACGTTTAGTCCTGTTGTCAAGCCCGTCACTATTCGAATCCTTTTCACATTAGCACTGACGAATGGCTGGAAATTGAGGCTGGTGGATATCAATA
ATGCATTCCTTCATGGTTTGCTATCTGAGGAAGTTTTTATGAGTCAACCCCTGGCATGGTAATATCCACCAAAGAAAAACAAGTGTGTAGACTGAAGAAAGCCTTGTATG
GCCTGAAGCAAGCATCAAGAGCATGGTACGAACGGTTAAGCTCATTTCTATGCTCCCTTGGGTTCTCTCATTCAAAAGCTGACTCATCTCTTTTTATATATCAACATCAT
CATGTTATTTGCTACATTTTAGTGTATGTTGATGATATAGTAGTTGCTGGAAACTCTGATGATTTTGTTGACAATTTGCTGAAATAATTGAACATGAAATTCTCTCTTAA
AGACCTGGGGTTATTGAGTTAGTTTCTTGGAGTTGAAGTCTCAGCAACACCAACTGGATCATTATTTCTTTCACAACTGAAGTATATTTCAGATCTTCTTCATAGAGCTA
ACATGAGTCATGCAAACCCGATTGCTACGCCTATGATAAGTGGATCAGTGTTATCTGCCTTTCAGGGAGAATCGTTTCAGGATGTTCATCTTTACAGAAGTATTGTGGGA
GCTTTACAATATGTTACCATTACAAGACCAGAGATACTTACAGTGTTAACAAAATCTATCAATTTATGCAATCTCCTACAGTGCATCATTGGCAGACAGTCAAGAGGATA
TTGTGATACTTGAAGGGCACTTTGAATCATGGCCTAGTTTTCCATAAATCGACTGAATTGGTGCTGCAAGGGTATGCCGATGCTGATTGGGCCTCTGATCCAGATGACAG
GAAGTCCACCTCAGGGTTTTGTATATATTTTGGTGGCAATCTGATACAATGATCATCGAAGAAACAAGGAATCATATCCCGATCGAGTACTGAAGCAGAATATAGAAGTT
TGGCTCACATATCTGCGGATGTGGTTTGGATACAATCCTTATTTTCTGAGTTAAACATCAGGTTAGCTTTTATGCCAAGATTATGGTGCGATAATCTAAGTGCTGTTCAT
TTAAGTCCTAATCCGATTTTACACTCAAGAACTAAGCATGTAGAGATTGATATATACTTTGTTCGAGATCTTGTATTCCAGAAACGTCTGCAGATTTCTCATCTTCCGGC
TTCAGCTCAAGTGGCTGACATCTTTACTAAACCTTTGTCTGCTTCGAAGTTTTTGGCTCTTACACACAAGCTCAATGTTTGCTCTTCAGTTGACATTGGCTTGGGGGATG
TTACGAGAGCGCATTAAAGGTTTAGTTAGAATTTTATAATTTTATCAGTTTCATAAACTTACGATAGTTATAACTACTTTTCTGAGTTGTTGTACTTAATAGAGAGCCTA
TAAATATGGCATATGTATTCTTTGAGAATGTGAAAAGGAAATCTTATCCTTTGTGATCATATTCACTTTGTTAAGAAATATTAATTACCATTATAAAAA
Protein sequence
MGKTLLQGDGLYRFQMTQANLSILSSQHHQSSISAPQVHNVSLSSSLNQSVFHKCTSLEKWHNRLGHSAIPIVQHIMSMCNLDTSNKSFHFCHACAVGKSHNLPFHDSTS
HYDFPLQLIVVDVWGPAYESSRNGFKYYVSFVDVFSRYTLIYFLNNKSEAFSAFLLFKTQVEKMFNRSILSLQTDNGGEFRSFIHFLKTNGITHRVTCPYTSQQNGIVER
KHRHIVEMGLTLLSYASLSIKFWDDAFATAVYLINRLPTKVHHTVSPMEKLFGHKPTYVDLRTFGCLCYPSLRAYNKHKLDPRSTPCLFLGYSSFHKGYKCLSSFGRMYI
SRHVTFDESTFPSFNFLYPSESKSNTLHHSVLPIVHSDPTPINFQQSNLPPSCPLSTLPMDPLITNSIPSPQTLSLPTPLDSHAHPTPCLIASGSVSNILNVPPVTGTTT
KNNTHTLITRGKARVFKPKVFLANYKEVEPPNVKEALKCDHWIQAMIDEYRALMSNDTWSLLDRPVNKKIIGCKWVFKIKRHSDGSVARYKARLVAQGFHQQADVDYTET
FSPVVKPVTIRILFTLALTNGWKLRLVDINNAFLHGLLSEEVFMSQPLAW
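
The CDS and protein sequences above can be checked against each other by translating the CDS with the standard genetic code. The following is a minimal sketch, not part of the database record: the function name `translate` and the `to_stop` parameter are illustrative choices, and it assumes the CDS is in frame, uses the standard nuclear codon table, and has had its line breaks removed before translation.

```python
# Minimal sketch: translate a CDS with the standard genetic code.
# Assumptions (not from the record): the sequence is in frame 0,
# uses only A/C/G/T, and the standard nuclear codon table applies.

BASES = "TCAG"
# Amino acids in the conventional TCAG x TCAG x TCAG codon order.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str, to_stop: bool = True) -> str:
    """Translate a coding sequence codon by codon.

    Whitespace is removed first, so a sequence pasted with line breaks works.
    If to_stop is True, translation ends at the first stop codon and the
    stop itself is not included in the result.
    """
    seq = "".join(cds.split()).upper()
    protein = []
    # Step through complete codons only; a trailing partial codon is ignored.
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*" and to_stop:
            break
        protein.append(residue)
    return "".join(protein)


if __name__ == "__main__":
    # First five codons and last five codons of the CDS listed above.
    print(translate("ATGGGCAAAACTCTT"))
    print(translate("CCCCTGGCATGGTAA"))
```

Pasting the full CDS into `translate` and comparing the result with the protein sequence (after joining its lines) gives a quick consistency check between the two listings.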