; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI04G22620 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI04G22620
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon RE1
Genome locationChr4:20881185..20885551
RNA-Seq ExpressionCSPI04G22620
SyntenyCSPI04G22620
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0003824 - catalytic activity (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR036397 - Ribonuclease H superfamily
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAN73924.1 hypothetical protein VITISV_041509 [Vitis vinifera]8.4e-23247.99Show/hide
Query:  LISQLGKSHNLPFPNSQSHAKEPFAIIYSDLWGPAPSCSNDYFRYYILFLDDYSIYTWIYPLKQKSSAVEAFQHFVIYVKNQFNNTIKVFQSDNGGEYKK
        +I  L KSH+LP+  S SHA  P A+I++DLWGPAPS S    RY+++F+DDYS +TWIY L  K  A+++F  F   V+NQ   TIK  QSDNGGE+  
Subjt:  LISQLGKSHNLPFPNSQSHAKEPFAIIYSDLWGPAPSCSNDYFRYYILFLDDYSIYTWIYPLKQKSSAVEAFQHFVIYVKNQFNNTIKVFQSDNGGEYKK

Query:  IQHLCLNLGINCRFSCPYTSAQNGRAERKHRHIVETGLTLLAQANMTLNYWWDAFLTFVILINGMHTPTLQGLSPIERLFNQKLKVENLKIFGCVCFPCL
         +      GI  +FSCP+T  QNGRAERK RH+VETGL L+AQ+ +   YW  AF T V LIN +    L   SP + LF++     +L++FGC+CFP L
Subjt:  IQHLCLNLGINCRFSCPYTSAQNGRAERKHRHIVETGLTLLAQANMTLNYWWDAFLTFVILINGMHTPTLQGLSPIERLFNQKLKVENLKIFGCVCFPCL

Query:  RPYQPTKFSYHSEKCVYLGPSPTHKGFKCLS-KTGRIFISRHVKFNENDFPFSDLFYPAQSTCSTQSASPSLAFFKSWPSPNQTQSNMAPNPQGPQLTST
        RPY   K  Y S  CV+LG +P HKG+ CL   T RI+ISR+V F+E+ FPF             QS+SP                   P+   P L S+
Subjt:  RPYQPTKFSYHSEKCVYLGPSPTHKGFKCLS-KTGRIFISRHVKFNENDFPFSDLFYPAQSTCSTQSASPSLAFFKSWPSPNQTQSNMAPNPQGPQLTST

Query:  TQITLPFPFPIPPMSSIPSSPINITPNNPPSVHSTANPTNSNPNLPHNPLSPSTTITPRENSPYSSSSPPSPLSLGPASIDHTVQPSNSHIPTHSMITRA
        T   +  P    P S   SSPI IT ++ P                  PL P    T   +SP + S PP PL+                  TH M+TRA
Subjt:  TQITLPFPFPIPPMSSIPSSPINITPNNPPSVHSTANPTNSNPNLPHNPLSPSTTITPRENSPYSSSSPPSPLSLGPASIDHTVQPSNSHIPTHSMITRA

Query:  KADIFKPKACISKSFTDWTLTEPTRIKEALITLQWKKAMDAEYCALLATNTWSLVPPSSSQNIVGSKWIFKLKRNSDGSIQRYKARLVAKGFHQHPGVDF
        K+ I K      +SF     TEP    +A     W +AM++EY ALL  NTWSLVPP SS +IVG +WI+KLK   DGSI R+KARLVA+GF Q PG+D+
Subjt:  KADIFKPKACISKSFTDWTLTEPTRIKEALITLQWKKAMDAEYCALLATNTWSLVPPSSSQNIVGSKWIFKLKRNSDGSIQRYKARLVAKGFHQHPGVDF

Query:  FETFSPVVKASTLWVVLSIGLARGWTLRQLDFNNAFLNGQLEESVYITQPPEYVGSSHSHYVCKLNKAIYGLKQAPRAWNNTLSKALLNWGFVNSKSDSS
        F+TFSPVVK  T+ ++L++ ++  W++RQLD  NAFLNG LEE V++TQP  +V  ++  YVCKL+KA+YGLKQAPRAW   L  ALL++GF +S++D+S
Subjt:  FETFSPVVKASTLWVVLSIGLARGWTLRQLDFNNAFLNGQLEESVYITQPPEYVGSSHSHYVCKLNKAIYGLKQAPRAWNNTLSKALLNWGFVNSKSDSS

Query:  LFILRCQHSIILLLAYVDDAIITGNDPSLIQSLVTSLDKQFALKDLGALSYFLSFQVKYLDCGFILSEEKYVDDLLHRLQMTDLKAAPSPSVVGKTLSAL
        LFI      I++LL YVDD ++TG++P L+   ++ L  +FAL+DLG LSYFL  Q + L     L++ KY+ DLL+R QM   K AP+P  +G+TLS  
Subjt:  LFILRCQHSIILLLAYVDDAIITGNDPSLIQSLVTSLDKQFALKDLGALSYFLSFQVKYLDCGFILSEEKYVDDLLHRLQMTDLKAAPSPSVVGKTLSAL

Query:  DSKLLDDPSLYRSTIGALQYLTNTRPDIAYMVNHLSQFLQRPTDLHWQAVKRILRYISGTKQFGLLFQHSFDLSISAFSDADWASNIDDRKFVSAYCVFI
        D   L DPS YR T+GALQY+T TRPDIA+ VN   QF+ +P+D+HW AVKRILRY+ GT   GL FQ +  + +  +SDADWAS  DDR+  S YCVF+
Subjt:  DSKLLDDPSLYRSTIGALQYLTNTRPDIAYMVNHLSQFLQRPTDLHWQAVKRILRYISGTKQFGLLFQHSFDLSISAFSDADWASNIDDRKFVSAYCVFI

Query:  GNNLVSWSSKKQTVVAHSSTESEYRALALAAPEVIWLKQLLNELDVSTSLKPVIWCDNLSAGALATNPVFHTRTKHIEIDVDFIRDQVLKGALDVRYVPS
        G+NL+SWSS KQ +V+ SS ESEYR L     E++W++ LL EL + TS  P++WCDN SA  LA NPVFH+R+KHIE+D+ FIR++VL+  L + YVPS
Subjt:  GNNLVSWSSKKQTVVAHSSTESEYRALALAAPEVIWLKQLLNELDVSTSLKPVIWCDNLSAGALATNPVFHTRTKHIEIDVDFIRDQVLKGALDVRYVPS

Query:  TDQLANCLTKPLTHSQFRNLRSKLGVVISPPTRLRGDVRDNNCD
         DQLA+  TK L  +QF NLRSKL  V  PP  LRGD  DN  D
Subjt:  TDQLANCLTKPLTHSQFRNLRSKLGVVISPPTRLRGDVRDNNCD

GAU19483.1 hypothetical protein TSUD_77270 [Trifolium subterraneum]1.4e-25049.14Show/hide
Query:  QLGKSHNLPFPNSQSHAKEPFAIIYSDLWGPAPSCSNDYFRYYILFLDDYSIYTWIYPLKQKSSAVEAFQHFVIYVKNQFNNTIKVFQSDNGGEYKKIQH
        Q GK H LPF +S SHA+EP  ++++D+WGPAP  ++  F+YY+ F+DD+S +TWIYPLKQKS  V+AF  F    +NQFN  IKV Q D GGEYK +Q 
Subjt:  QLGKSHNLPFPNSQSHAKEPFAIIYSDLWGPAPSCSNDYFRYYILFLDDYSIYTWIYPLKQKSSAVEAFQHFVIYVKNQFNNTIKVFQSDNGGEYKKIQH

Query:  LCLNLGINCRFSCPYTSAQNGRAERKHRHIVETGLTLLAQANMTLNYWWDAFLTFVILINGMHTPTLQGLSPIERLFNQKLKVENLKIFGCVCFPCLRPY
        L +  GI  R SCPYTS QNGRAERKHRHI E GLTLLAQA M L+YWW+AF T V LIN + +   Q  SP   +  ++   + LK FGC C+PCL+PY
Subjt:  LCLNLGINCRFSCPYTSAQNGRAERKHRHIVETGLTLLAQANMTLNYWWDAFLTFVILINGMHTPTLQGLSPIERLFNQKLKVENLKIFGCVCFPCLRPY

Query:  QPTKFSYHSEKCVYLGPSPTHKGFKCLSKTGRIFISRHVKFNENDFPFSDLFYPAQSTCSTQSASPSLAFFKSWPSPNQTQSNMAPNPQGPQLTSTTQIT
           K  YH+ +CV+LG S +HKG+KCL+  GRIFISRHV FNE+ FPF D F   +S   T    PS +F      P  T  N+  +   P L +     
Subjt:  QPTKFSYHSEKCVYLGPSPTHKGFKCLSKTGRIFISRHVKFNENDFPFSDLFYPAQSTCSTQSASPSLAFFKSWPSPNQTQSNMAPNPQGPQLTSTTQIT

Query:  LPFPFPIPPMSSIPSSPINITPNNPPSVHSTANPTNSNPNLPHNPLSPSTTITPRENSPYSSSSPPSPLSLGPASIDHTVQPSNSHIPTHSMITRAKADI
                       +P      +   V+S    TN+ P+  +     +  IT ++             S+G AS     Q +N+   +H++ TR+K+ I
Subjt:  LPFPFPIPPMSSIPSSPINITPNNPPSVHSTANPTNSNPNLPHNPLSPSTTITPRENSPYSSSSPPSPLSLGPASIDHTVQPSNSHIPTHSMITRAKADI

Query:  FKPK-ACISKSFTDWTLTEPTRIKEALITLQWKKAMDAEYCALLATNTWSLVPPSSSQNIVGSKWIFKLKRNSDGSIQRYKARLVAKGFHQHPGVDFFET
         KPK   I  + T     EP   KEAL    WK+AM  E+ AL++  TW LVP  + +NIV SKW+FK K   DGS++R KARLVAKGF Q  G+D+ ET
Subjt:  FKPK-ACISKSFTDWTLTEPTRIKEALITLQWKKAMDAEYCALLATNTWSLVPPSSSQNIVGSKWIFKLKRNSDGSIQRYKARLVAKGFHQHPGVDFFET

Query:  FSPVVKASTLWVVLSIGLARGWTLRQLDFNNAFLNGQLEESVYITQPPEYVGSSHSHYVCKLNKAIYGLKQAPRAWNNTLSKALLNWGFVNSKSDSSLFI
        FSPV+KAST+ ++LSI +   W +RQLD NNAFLNG L+E+V++ QP  +V S+  +++CKL+KAIYGLKQAPRAW ++L  ALLNWGF N+KSDSSLF+
Subjt:  FSPVVKASTLWVVLSIGLARGWTLRQLDFNNAFLNGQLEESVYITQPPEYVGSSHSHYVCKLNKAIYGLKQAPRAWNNTLSKALLNWGFVNSKSDSSLFI

Query:  LRCQHSIILLLAYVDDAIITGNDPSLIQSLVTSLDKQFALKDLGALSYFLSFQVKYLDCGFILSEEKYVDDLLHRLQMTDLKAAPSPSVVGKTLSALDSK
        L+ +  I  LL YVDD I+TG++   +Q+ +  L+  F+LKDLG L YFL  +V+    G  L + KY+ DLL + +M +    P+P + G+  + ++ +
Subjt:  LRCQHSIILLLAYVDDAIITGNDPSLIQSLVTSLDKQFALKDLGALSYFLSFQVKYLDCGFILSEEKYVDDLLHRLQMTDLKAAPSPSVVGKTLSALDSK

Query:  LLDDPSLYRSTIGALQYLTNTRPDIAYMVNHLSQFLQRPTDLHWQAVKRILRYISGTKQFGLLFQHSFDLSISAFSDADWASNIDDRKFVSAYCVFIGNN
         L DP+++R  IG LQYLT+T PDIA+ VN LSQ++  P+  HWQ +KRILRY+ GT  + L  + S DL I+ FSDADWA++IDDRK +S  CVF+G  
Subjt:  LLDDPSLYRSTIGALQYLTNTRPDIAYMVNHLSQFLQRPTDLHWQAVKRILRYISGTKQFGLLFQHSFDLSISAFSDADWASNIDDRKFVSAYCVFIGNN

Query:  LVSWSSKKQTVVAHSSTESEYRALALAAPEVIWLKQLLNELDVSTSLKPVIWCDNLSAGALATNPVFHTRTKHIEIDVDFIRDQVLKGALDVRYVPSTDQ
        L+SWSS+KQ VV+ SSTESEYRALA  A E+ W++ LL EL++    KP++WCDNLSA ALA+NPV H R+KHIEIDV +IRDQVL+  + V YVP+TDQ
Subjt:  LVSWSSKKQTVVAHSSTESEYRALALAAPEVIWLKQLLNELDVSTSLKPVIWCDNLSAGALATNPVFHTRTKHIEIDVDFIRDQVLKGALDVRYVPSTDQ

Query:  LANCLTKPLTHSQFRNLRSKLGVVISPP
        +A+CLTKPL+H++F  LR KLGV++SPP
Subjt:  LANCLTKPLTHSQFRNLRSKLGVVISPP

GAU51268.1 hypothetical protein TSUD_412550 [Trifolium subterraneum]4.6e-23847.67Show/hide
Query:  QLGKSHNLPFPNSQSHAKEPFAIIYSDLWGPAPSCSNDYFRYYILFLDDYSIYTWIYPLKQKSSAVEAFQHFVIYVKNQFNNTIKVFQSDNGGEYKKIQH
        Q GK H LPF  S SH +EP A+I+SD+WGPAP  S   F+YY+ F+DD+S +TWI+PLKQKS  + AF  F    +NQFN  IK+ Q D GGEYK +Q 
Subjt:  QLGKSHNLPFPNSQSHAKEPFAIIYSDLWGPAPSCSNDYFRYYILFLDDYSIYTWIYPLKQKSSAVEAFQHFVIYVKNQFNNTIKVFQSDNGGEYKKIQH

Query:  LCLNLGINCRFSCPYTSAQNGRAERKHRHIVETGLTLLAQANMTLNYWWDAFLTFVILINGMHTPTLQGLSPIERLFNQKLKVENLKIFGCVCFPCLRPY
        + +  GI  R SCPYTS QNGRAERKHRH+ E GLTLLAQA M L YWW+AF T V LIN + +      SP   +F ++     LK FGC C+PCL+PY
Subjt:  LCLNLGINCRFSCPYTSAQNGRAERKHRHIVETGLTLLAQANMTLNYWWDAFLTFVILINGMHTPTLQGLSPIERLFNQKLKVENLKIFGCVCFPCLRPY

Query:  QPTKFSYHSEKCVYLGPSPTHKGFKCLSKTGRIFISRHVKFNENDFPFSDLFYPAQSTCSTQSASPSLAFFKSWPSPNQTQSNMAPNPQGPQLTSTTQIT
           K  +H+ +CV++G S +HKG+KC++  GRIF+SRHV FNEN FPF   F   ++   T + + S+    +  +   TQ  + P+      T++ Q T
Subjt:  QPTKFSYHSEKCVYLGPSPTHKGFKCLSKTGRIFISRHVKFNENDFPFSDLFYPAQSTCSTQSASPSLAFFKSWPSPNQTQSNMAPNPQGPQLTSTTQIT

Query:  LPFPFPIPPMSSIPSSPINITPNNPPSVHSTANPTNSNPNLPHNPLSPSTTITPRENSPYSSSSPPSPLSLGPASIDHTVQPSNSHIPTHSMITRAKADI
                   SI SS  N    N   V S+    N+N        + ST     +NS  S     S ++    +I    Q  NS+  TH M TR+K  I
Subjt:  LPFPFPIPPMSSIPSSPINITPNNPPSVHSTANPTNSNPNLPHNPLSPSTTITPRENSPYSSSSPPSPLSLGPASIDHTVQPSNSHIPTHSMITRAKADI

Query:  FKPK-ACISKSFTDWTLTEPTRIKEALITLQWKKAMDAEYCALLATNTWSLVPPSSSQNIVGSKWIFKLKRNSDGSIQRYKARLVAKGFHQHPGVDFFET
         KPK   +  + TD    EP  +KEAL    WK+AMD EY AL++ +TW+LVP    +NI+ SKWIFK K  SDGSI+R KARLVAKGF Q  G+DF ET
Subjt:  FKPK-ACISKSFTDWTLTEPTRIKEALITLQWKKAMDAEYCALLATNTWSLVPPSSSQNIVGSKWIFKLKRNSDGSIQRYKARLVAKGFHQHPGVDFFET

Query:  FSPVVKASTLWVVLSIGLARGWTLRQLDFNNAFLNGQLEESVYITQPPEYVGSSHSHYVCKLNKAIYGLKQAPRAWNNTLSKALLNWGFVNSKSDSSLFI
        FSPVVK+ST+ ++L+I +   W +RQLD NNAFLNG+L+E+V++ QP  Y+ ++  +++CKL+KAIYGLKQAPRAW ++L   L+NWGF N+K+D+SLF 
Subjt:  FSPVVKASTLWVVLSIGLARGWTLRQLDFNNAFLNGQLEESVYITQPPEYVGSSHSHYVCKLNKAIYGLKQAPRAWNNTLSKALLNWGFVNSKSDSSLFI

Query:  LRCQHSIILLLAYVDDAIITGNDPSLIQSLVTSLDKQFALKDLGALSYFLSFQVKYLDCGFILSEEKYVDDLLHRLQMTDLKAAPSPSVVGKTLSALDSK
        L+       LL YVDD I+TG++   +++    L+  ++LKDLG L YFL  +V   D G  L + KY+ D+L +  M +  A P+P V G+   A + +
Subjt:  LRCQHSIILLLAYVDDAIITGNDPSLIQSLVTSLDKQFALKDLGALSYFLSFQVKYLDCGFILSEEKYVDDLLHRLQMTDLKAAPSPSVVGKTLSALDSK

Query:  LLDDPSLYRSTIGALQYLTNTRPDIAYMVNHLSQFLQRPTDLHWQAVKRILRYISGTKQFGLLFQHSFDLSISAFSDADWASNIDDRKFVSAYCVFIGNN
        L+ +P+LYR  IGALQYLTNTRPDIA+ VN LSQ++  PT  HWQ +KRILRY+ GTK   L  + S +L I+ F DADWA++ DDRK     CVF+G  
Subjt:  LLDDPSLYRSTIGALQYLTNTRPDIAYMVNHLSQFLQRPTDLHWQAVKRILRYISGTKQFGLLFQHSFDLSISAFSDADWASNIDDRKFVSAYCVFIGNN

Query:  LVSWSSKKQTVVAHSSTESEYRALALAAPEVIWLK-------------------QLLNELDVSTSLKPVIWCDNLSAGALATNPVFHTRTKHIEIDVDFI
        LVSW+S+KQ VV+ SSTESEYR+LA    EV                        LL EL +    KPV+WCDNLSA ALA+NPV H R+KHIEID+ +I
Subjt:  LVSWSSKKQTVVAHSSTESEYRALALAAPEVIWLK-------------------QLLNELDVSTSLKPVIWCDNLSAGALATNPVFHTRTKHIEIDVDFI

Query:  RDQVLKGALDVRYVPSTDQLANCLTKPLTHSQFRNLRSKLGVVISP
        RDQVL+  + + YVP+ DQ+A+CLTKPL H++F  +R KLGV +SP
Subjt:  RDQVLKGALDVRYVPSTDQLANCLTKPLTHSQFRNLRSKLGVVISP

RVW64314.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]1.6e-23047.01Show/hide
Query:  QLGKSHNLPFPNSQSHAKEPFAIIYSDLWGPAPSCSNDYFRYYILFLDDYSIYTWIYPLKQKSSAVEAFQHFVIYVKNQFNNTIKVFQSDNGGEYKKIQH
        Q  KSH LPF  S S A  P A++++DLWGPA   S    RY+ILF+DD+S ++WIYPL  K  A+  F  F   V+NQFN+ I+  +SDNGGE+K    
Subjt:  QLGKSHNLPFPNSQSHAKEPFAIIYSDLWGPAPSCSNDYFRYYILFLDDYSIYTWIYPLKQKSSAVEAFQHFVIYVKNQFNNTIKVFQSDNGGEYKKIQH

Query:  LCLNLGINCRFSCPYTSAQNGRAERKHRHIVETGLTLLAQANMTLNYWWDAFLTFVILINGMHTPTLQGLSPIERLFNQKLKVENLKIFGCVCFPCLRPY
             GI  +FSCPYT  QNGRAERK RHI+ETGL LLA A++   +W  AF T + LIN + T  L   SP + LF +       KIFGC+C+P +RPY
Subjt:  LCLNLGINCRFSCPYTSAQNGRAERKHRHIVETGLTLLAQANMTLNYWWDAFLTFVILINGMHTPTLQGLSPIERLFNQKLKVENLKIFGCVCFPCLRPY

Query:  QPTKFSYHSEKCVYLGPSPTHKGFKCLSK-TGRIFISRHVKFNENDFPFSDLFYPAQSTCSTQSASPSLAFFKSWPSPNQTQSNMAPNPQGPQLTSTTQI
           K SY S +CV+LG S  HKG+ CL+  TGR++++RHV F+E  FPF       QST                  P+Q               S++ +
Subjt:  QPTKFSYHSEKCVYLGPSPTHKGFKCLSK-TGRIFISRHVKFNENDFPFSDLFYPAQSTCSTQSASPSLAFFKSWPSPNQTQSNMAPNPQGPQLTSTTQI

Query:  TLPFPFPIPPMSSIPSSPINITPNNPPSVHSTANPTNSNPNLPHNPLSPSTTITPRENSPYSSSSPPSPLSLGPASIDHTVQPSNSHIPTHSMITRAKAD
        T+P P  +P  S              P V S  + T           +PST+  P  N P S+ S P  + +  A I  T +P  ++   H M+TRAK  
Subjt:  TLPFPFPIPPMSSIPSSPINITPNNPPSVHSTANPTNSNPNLPHNPLSPSTTITPRENSPYSSSSPPSPLSLGPASIDHTVQPSNSHIPTHSMITRAKAD

Query:  IFKPKACISKSFTDWTLTEPTRIKEALITLQWKKAMDAEYCALLATNTWSLVPPSSSQNIVGSKWIFKLKRNSDGSIQRYKARLVAKGFHQHPGVDFFET
        I K K   S       ++EPT   +A+    W  AM+ E+ AL   NTW LVPP S+ NI+G KW++KLK   DG++ RYKARLVA+GF Q  G+D+FET
Subjt:  IFKPKACISKSFTDWTLTEPTRIKEALITLQWKKAMDAEYCALLATNTWSLVPPSSSQNIVGSKWIFKLKRNSDGSIQRYKARLVAKGFHQHPGVDFFET

Query:  FSPVVKASTLWVVLSIGLARGWTLRQLDFNNAFLNGQLEESVYITQPPEYVGSSHSHYVCKLNKAIYGLKQAPRAWNNTLSKALLNWGFVNSKSDSSLFI
        FSPVVKAST+ ++L++ L+  W++ QLD  NAFL+G LEE V++ QPP ++ S +  +VCKLNKA+YGLKQAPRAW N LS +LL WGF  S++DSS+FI
Subjt:  FSPVVKASTLWVVLSIGLARGWTLRQLDFNNAFLNGQLEESVYITQPPEYVGSSHSHYVCKLNKAIYGLKQAPRAWNNTLSKALLNWGFVNSKSDSSLFI

Query:  LRCQHSIILLLAYVDDAIITGNDPSLIQSLVTSLDKQFALKDLGALSYFLSFQVKYLDCGFILSEEKYVDDLLHRLQMTDLKAAPSPSVVGKTLSALDSK
            H +++LL YVDD ++TG+  + + S +T L+  FAL+DLG ++YFL  +V      F LS+ KY  DLL R  M D K A +P ++G+TLS LD +
Subjt:  LRCQHSIILLLAYVDDAIITGNDPSLIQSLVTSLDKQFALKDLGALSYFLSFQVKYLDCGFILSEEKYVDDLLHRLQMTDLKAAPSPSVVGKTLSALDSK

Query:  LLDDPSLYRSTIGALQYLTNTRPDIAYMVNHLSQFLQRPTDLHWQAVKRILRYISGTKQFGLLFQHSFDLSISAFSDADWASNIDDRKFVSAYCVFIGNN
           D +LYRST+GALQYLT TRPDI++ VN   QF+  PT  HW AVKRILRY+ GT  +G+  Q S  L I  ++DADWAS  DDR+    Y +F+G N
Subjt:  LLDDPSLYRSTIGALQYLTNTRPDIAYMVNHLSQFLQRPTDLHWQAVKRILRYISGTKQFGLLFQHSFDLSISAFSDADWASNIDDRKFVSAYCVFIGNN

Query:  LVSWSSKKQTVVAHSSTESEYRALALAAPEVIWLKQLLNELDVSTSLKPVIWCDNLSAGALATNPVFHTRTKHIEIDVDFIRDQVLKGALDVRYVPSTDQ
        LVSWSS KQ VV+ SS ESEYRALA A  E+IW++ +L EL +S+S  P++WCDN SA  LA NPVFH RTKHIE+D+ FIRD VL+  L ++Y+PS +Q
Subjt:  LVSWSSKKQTVVAHSSTESEYRALALAAPEVIWLKQLLNELDVSTSLKPVIWCDNLSAGALATNPVFHTRTKHIEIDVDFIRDQVLKGALDVRYVPSTDQ

Query:  LANCLTKPLTHSQFRNLRSKLGVVISPPTRLRGDVR
        +A+  TK ++ SQF + R+KL VV S P  LRGD R
Subjt:  LANCLTKPLTHSQFRNLRSKLGVVISPPTRLRGDVR

RVX06084.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]2.7e-23047.01Show/hide
Query:  QLGKSHNLPFPNSQSHAKEPFAIIYSDLWGPAPSCSNDYFRYYILFLDDYSIYTWIYPLKQKSSAVEAFQHFVIYVKNQFNNTIKVFQSDNGGEYKKIQH
        Q  KSH LPF  S S A  P A++++DLWGPA   S    RY+ILF+DD+S ++WIYPL  K  A+  F  F   V+NQFN+ I+  +SDNGGE+K    
Subjt:  QLGKSHNLPFPNSQSHAKEPFAIIYSDLWGPAPSCSNDYFRYYILFLDDYSIYTWIYPLKQKSSAVEAFQHFVIYVKNQFNNTIKVFQSDNGGEYKKIQH

Query:  LCLNLGINCRFSCPYTSAQNGRAERKHRHIVETGLTLLAQANMTLNYWWDAFLTFVILINGMHTPTLQGLSPIERLFNQKLKVENLKIFGCVCFPCLRPY
             GI  +FSCPYT  QNGRAERK RHI+ETGL LLA A++   +W  AF T + LIN + T  L   SP + LF +       KIFGC+C+P +RPY
Subjt:  LCLNLGINCRFSCPYTSAQNGRAERKHRHIVETGLTLLAQANMTLNYWWDAFLTFVILINGMHTPTLQGLSPIERLFNQKLKVENLKIFGCVCFPCLRPY

Query:  QPTKFSYHSEKCVYLGPSPTHKGFKCLSK-TGRIFISRHVKFNENDFPFSDLFYPAQSTCSTQSASPSLAFFKSWPSPNQTQSNMAPNPQGPQLTSTTQI
           K SY S +CV+LG S  HKG+ CL+  TGR++++RHV F+E  FPF       QST                  P+Q               S++ +
Subjt:  QPTKFSYHSEKCVYLGPSPTHKGFKCLSK-TGRIFISRHVKFNENDFPFSDLFYPAQSTCSTQSASPSLAFFKSWPSPNQTQSNMAPNPQGPQLTSTTQI

Query:  TLPFPFPIPPMSSIPSSPINITPNNPPSVHSTANPTNSNPNLPHNPLSPSTTITPRENSPYSSSSPPSPLSLGPASIDHTVQPSNSHIPTHSMITRAKAD
        T+P P  +P  S              P V S  + T           +PST+  P  N P S+ S P  + +  A I  T +P  ++   H M+TRAK  
Subjt:  TLPFPFPIPPMSSIPSSPINITPNNPPSVHSTANPTNSNPNLPHNPLSPSTTITPRENSPYSSSSPPSPLSLGPASIDHTVQPSNSHIPTHSMITRAKAD

Query:  IFKPKACISKSFTDWTLTEPTRIKEALITLQWKKAMDAEYCALLATNTWSLVPPSSSQNIVGSKWIFKLKRNSDGSIQRYKARLVAKGFHQHPGVDFFET
        I K K   S       ++EPT   +A+    W  AM+ E+ AL   NTW LVPP S+ NI+G KW++KLK   DG++ RYKARLVA+GF Q  G+D+FET
Subjt:  IFKPKACISKSFTDWTLTEPTRIKEALITLQWKKAMDAEYCALLATNTWSLVPPSSSQNIVGSKWIFKLKRNSDGSIQRYKARLVAKGFHQHPGVDFFET

Query:  FSPVVKASTLWVVLSIGLARGWTLRQLDFNNAFLNGQLEESVYITQPPEYVGSSHSHYVCKLNKAIYGLKQAPRAWNNTLSKALLNWGFVNSKSDSSLFI
        FSPVVKAST+ ++L++ L+  W++ QLD  NAFL+G LEE V++ QPP ++ S +  +VCKLNKA+YGLKQAPRAW N LS +LL WGF  S++DSS+FI
Subjt:  FSPVVKASTLWVVLSIGLARGWTLRQLDFNNAFLNGQLEESVYITQPPEYVGSSHSHYVCKLNKAIYGLKQAPRAWNNTLSKALLNWGFVNSKSDSSLFI

Query:  LRCQHSIILLLAYVDDAIITGNDPSLIQSLVTSLDKQFALKDLGALSYFLSFQVKYLDCGFILSEEKYVDDLLHRLQMTDLKAAPSPSVVGKTLSALDSK
            H +++LL YVDD ++TG+  + + S +T L+  FAL+DLG ++YFL  +V      F LS+ KY  DLL R  M D K A +P ++G+TLS LD +
Subjt:  LRCQHSIILLLAYVDDAIITGNDPSLIQSLVTSLDKQFALKDLGALSYFLSFQVKYLDCGFILSEEKYVDDLLHRLQMTDLKAAPSPSVVGKTLSALDSK

Query:  LLDDPSLYRSTIGALQYLTNTRPDIAYMVNHLSQFLQRPTDLHWQAVKRILRYISGTKQFGLLFQHSFDLSISAFSDADWASNIDDRKFVSAYCVFIGNN
           D +LYRST+GALQYLT TRPDI++ VN   QF+  PT  HW AVKRILRY+ GT  +G+  Q S  L I  ++DADWAS  DDR+    Y +F+G N
Subjt:  LLDDPSLYRSTIGALQYLTNTRPDIAYMVNHLSQFLQRPTDLHWQAVKRILRYISGTKQFGLLFQHSFDLSISAFSDADWASNIDDRKFVSAYCVFIGNN

Query:  LVSWSSKKQTVVAHSSTESEYRALALAAPEVIWLKQLLNELDVSTSLKPVIWCDNLSAGALATNPVFHTRTKHIEIDVDFIRDQVLKGALDVRYVPSTDQ
        LVSWSS KQ VV+ SS ESEYRALA A  E+IW++ +L EL +S+S  P++WCDN SA  LA NPVFH RTKHIE+D+ FIRD VL+  L ++Y+PS +Q
Subjt:  LVSWSSKKQTVVAHSSTESEYRALALAAPEVIWLKQLLNELDVSTSLKPVIWCDNLSAGALATNPVFHTRTKHIEIDVDFIRDQVLKGALDVRYVPSTDQ

Query:  LANCLTKPLTHSQFRNLRSKLGVVISPPTRLRGDVR
        +A+  TK ++ SQF + R+KL VV S P  LRGD R
Subjt:  LANCLTKPLTHSQFRNLRSKLGVVISPPTRLRGDVR

TrEMBL top hitse value%identityAlignment
A0A2Z6MBG6 Integrase catalytic domain-containing protein6.6e-25149.14Show/hide
Query:  QLGKSHNLPFPNSQSHAKEPFAIIYSDLWGPAPSCSNDYFRYYILFLDDYSIYTWIYPLKQKSSAVEAFQHFVIYVKNQFNNTIKVFQSDNGGEYKKIQH
        Q GK H LPF +S SHA+EP  ++++D+WGPAP  ++  F+YY+ F+DD+S +TWIYPLKQKS  V+AF  F    +NQFN  IKV Q D GGEYK +Q 
Subjt:  QLGKSHNLPFPNSQSHAKEPFAIIYSDLWGPAPSCSNDYFRYYILFLDDYSIYTWIYPLKQKSSAVEAFQHFVIYVKNQFNNTIKVFQSDNGGEYKKIQH

Query:  LCLNLGINCRFSCPYTSAQNGRAERKHRHIVETGLTLLAQANMTLNYWWDAFLTFVILINGMHTPTLQGLSPIERLFNQKLKVENLKIFGCVCFPCLRPY
        L +  GI  R SCPYTS QNGRAERKHRHI E GLTLLAQA M L+YWW+AF T V LIN + +   Q  SP   +  ++   + LK FGC C+PCL+PY
Subjt:  LCLNLGINCRFSCPYTSAQNGRAERKHRHIVETGLTLLAQANMTLNYWWDAFLTFVILINGMHTPTLQGLSPIERLFNQKLKVENLKIFGCVCFPCLRPY

Query:  QPTKFSYHSEKCVYLGPSPTHKGFKCLSKTGRIFISRHVKFNENDFPFSDLFYPAQSTCSTQSASPSLAFFKSWPSPNQTQSNMAPNPQGPQLTSTTQIT
           K  YH+ +CV+LG S +HKG+KCL+  GRIFISRHV FNE+ FPF D F   +S   T    PS +F      P  T  N+  +   P L +     
Subjt:  QPTKFSYHSEKCVYLGPSPTHKGFKCLSKTGRIFISRHVKFNENDFPFSDLFYPAQSTCSTQSASPSLAFFKSWPSPNQTQSNMAPNPQGPQLTSTTQIT

Query:  LPFPFPIPPMSSIPSSPINITPNNPPSVHSTANPTNSNPNLPHNPLSPSTTITPRENSPYSSSSPPSPLSLGPASIDHTVQPSNSHIPTHSMITRAKADI
                       +P      +   V+S    TN+ P+  +     +  IT ++             S+G AS     Q +N+   +H++ TR+K+ I
Subjt:  LPFPFPIPPMSSIPSSPINITPNNPPSVHSTANPTNSNPNLPHNPLSPSTTITPRENSPYSSSSPPSPLSLGPASIDHTVQPSNSHIPTHSMITRAKADI

Query:  FKPK-ACISKSFTDWTLTEPTRIKEALITLQWKKAMDAEYCALLATNTWSLVPPSSSQNIVGSKWIFKLKRNSDGSIQRYKARLVAKGFHQHPGVDFFET
         KPK   I  + T     EP   KEAL    WK+AM  E+ AL++  TW LVP  + +NIV SKW+FK K   DGS++R KARLVAKGF Q  G+D+ ET
Subjt:  FKPK-ACISKSFTDWTLTEPTRIKEALITLQWKKAMDAEYCALLATNTWSLVPPSSSQNIVGSKWIFKLKRNSDGSIQRYKARLVAKGFHQHPGVDFFET

Query:  FSPVVKASTLWVVLSIGLARGWTLRQLDFNNAFLNGQLEESVYITQPPEYVGSSHSHYVCKLNKAIYGLKQAPRAWNNTLSKALLNWGFVNSKSDSSLFI
        FSPV+KAST+ ++LSI +   W +RQLD NNAFLNG L+E+V++ QP  +V S+  +++CKL+KAIYGLKQAPRAW ++L  ALLNWGF N+KSDSSLF+
Subjt:  FSPVVKASTLWVVLSIGLARGWTLRQLDFNNAFLNGQLEESVYITQPPEYVGSSHSHYVCKLNKAIYGLKQAPRAWNNTLSKALLNWGFVNSKSDSSLFI

Query:  LRCQHSIILLLAYVDDAIITGNDPSLIQSLVTSLDKQFALKDLGALSYFLSFQVKYLDCGFILSEEKYVDDLLHRLQMTDLKAAPSPSVVGKTLSALDSK
        L+ +  I  LL YVDD I+TG++   +Q+ +  L+  F+LKDLG L YFL  +V+    G  L + KY+ DLL + +M +    P+P + G+  + ++ +
Subjt:  LRCQHSIILLLAYVDDAIITGNDPSLIQSLVTSLDKQFALKDLGALSYFLSFQVKYLDCGFILSEEKYVDDLLHRLQMTDLKAAPSPSVVGKTLSALDSK

Query:  LLDDPSLYRSTIGALQYLTNTRPDIAYMVNHLSQFLQRPTDLHWQAVKRILRYISGTKQFGLLFQHSFDLSISAFSDADWASNIDDRKFVSAYCVFIGNN
         L DP+++R  IG LQYLT+T PDIA+ VN LSQ++  P+  HWQ +KRILRY+ GT  + L  + S DL I+ FSDADWA++IDDRK +S  CVF+G  
Subjt:  LLDDPSLYRSTIGALQYLTNTRPDIAYMVNHLSQFLQRPTDLHWQAVKRILRYISGTKQFGLLFQHSFDLSISAFSDADWASNIDDRKFVSAYCVFIGNN

Query:  LVSWSSKKQTVVAHSSTESEYRALALAAPEVIWLKQLLNELDVSTSLKPVIWCDNLSAGALATNPVFHTRTKHIEIDVDFIRDQVLKGALDVRYVPSTDQ
        L+SWSS+KQ VV+ SSTESEYRALA  A E+ W++ LL EL++    KP++WCDNLSA ALA+NPV H R+KHIEIDV +IRDQVL+  + V YVP+TDQ
Subjt:  LVSWSSKKQTVVAHSSTESEYRALALAAPEVIWLKQLLNELDVSTSLKPVIWCDNLSAGALATNPVFHTRTKHIEIDVDFIRDQVLKGALDVRYVPSTDQ

Query:  LANCLTKPLTHSQFRNLRSKLGVVISPP
        +A+CLTKPL+H++F  LR KLGV++SPP
Subjt:  LANCLTKPLTHSQFRNLRSKLGVVISPP

A0A2Z6P4D5 Integrase catalytic domain-containing protein2.2e-23847.67Show/hide
Query:  QLGKSHNLPFPNSQSHAKEPFAIIYSDLWGPAPSCSNDYFRYYILFLDDYSIYTWIYPLKQKSSAVEAFQHFVIYVKNQFNNTIKVFQSDNGGEYKKIQH
        Q GK H LPF  S SH +EP A+I+SD+WGPAP  S   F+YY+ F+DD+S +TWI+PLKQKS  + AF  F    +NQFN  IK+ Q D GGEYK +Q 
Subjt:  QLGKSHNLPFPNSQSHAKEPFAIIYSDLWGPAPSCSNDYFRYYILFLDDYSIYTWIYPLKQKSSAVEAFQHFVIYVKNQFNNTIKVFQSDNGGEYKKIQH

Query:  LCLNLGINCRFSCPYTSAQNGRAERKHRHIVETGLTLLAQANMTLNYWWDAFLTFVILINGMHTPTLQGLSPIERLFNQKLKVENLKIFGCVCFPCLRPY
        + +  GI  R SCPYTS QNGRAERKHRH+ E GLTLLAQA M L YWW+AF T V LIN + +      SP   +F ++     LK FGC C+PCL+PY
Subjt:  LCLNLGINCRFSCPYTSAQNGRAERKHRHIVETGLTLLAQANMTLNYWWDAFLTFVILINGMHTPTLQGLSPIERLFNQKLKVENLKIFGCVCFPCLRPY

Query:  QPTKFSYHSEKCVYLGPSPTHKGFKCLSKTGRIFISRHVKFNENDFPFSDLFYPAQSTCSTQSASPSLAFFKSWPSPNQTQSNMAPNPQGPQLTSTTQIT
           K  +H+ +CV++G S +HKG+KC++  GRIF+SRHV FNEN FPF   F   ++   T + + S+    +  +   TQ  + P+      T++ Q T
Subjt:  QPTKFSYHSEKCVYLGPSPTHKGFKCLSKTGRIFISRHVKFNENDFPFSDLFYPAQSTCSTQSASPSLAFFKSWPSPNQTQSNMAPNPQGPQLTSTTQIT

Query:  LPFPFPIPPMSSIPSSPINITPNNPPSVHSTANPTNSNPNLPHNPLSPSTTITPRENSPYSSSSPPSPLSLGPASIDHTVQPSNSHIPTHSMITRAKADI
                   SI SS  N    N   V S+    N+N        + ST     +NS  S     S ++    +I    Q  NS+  TH M TR+K  I
Subjt:  LPFPFPIPPMSSIPSSPINITPNNPPSVHSTANPTNSNPNLPHNPLSPSTTITPRENSPYSSSSPPSPLSLGPASIDHTVQPSNSHIPTHSMITRAKADI

Query:  FKPK-ACISKSFTDWTLTEPTRIKEALITLQWKKAMDAEYCALLATNTWSLVPPSSSQNIVGSKWIFKLKRNSDGSIQRYKARLVAKGFHQHPGVDFFET
         KPK   +  + TD    EP  +KEAL    WK+AMD EY AL++ +TW+LVP    +NI+ SKWIFK K  SDGSI+R KARLVAKGF Q  G+DF ET
Subjt:  FKPK-ACISKSFTDWTLTEPTRIKEALITLQWKKAMDAEYCALLATNTWSLVPPSSSQNIVGSKWIFKLKRNSDGSIQRYKARLVAKGFHQHPGVDFFET

Query:  FSPVVKASTLWVVLSIGLARGWTLRQLDFNNAFLNGQLEESVYITQPPEYVGSSHSHYVCKLNKAIYGLKQAPRAWNNTLSKALLNWGFVNSKSDSSLFI
        FSPVVK+ST+ ++L+I +   W +RQLD NNAFLNG+L+E+V++ QP  Y+ ++  +++CKL+KAIYGLKQAPRAW ++L   L+NWGF N+K+D+SLF 
Subjt:  FSPVVKASTLWVVLSIGLARGWTLRQLDFNNAFLNGQLEESVYITQPPEYVGSSHSHYVCKLNKAIYGLKQAPRAWNNTLSKALLNWGFVNSKSDSSLFI

Query:  LRCQHSIILLLAYVDDAIITGNDPSLIQSLVTSLDKQFALKDLGALSYFLSFQVKYLDCGFILSEEKYVDDLLHRLQMTDLKAAPSPSVVGKTLSALDSK
        L+       LL YVDD I+TG++   +++    L+  ++LKDLG L YFL  +V   D G  L + KY+ D+L +  M +  A P+P V G+   A + +
Subjt:  LRCQHSIILLLAYVDDAIITGNDPSLIQSLVTSLDKQFALKDLGALSYFLSFQVKYLDCGFILSEEKYVDDLLHRLQMTDLKAAPSPSVVGKTLSALDSK

Query:  LLDDPSLYRSTIGALQYLTNTRPDIAYMVNHLSQFLQRPTDLHWQAVKRILRYISGTKQFGLLFQHSFDLSISAFSDADWASNIDDRKFVSAYCVFIGNN
        L+ +P+LYR  IGALQYLTNTRPDIA+ VN LSQ++  PT  HWQ +KRILRY+ GTK   L  + S +L I+ F DADWA++ DDRK     CVF+G  
Subjt:  LLDDPSLYRSTIGALQYLTNTRPDIAYMVNHLSQFLQRPTDLHWQAVKRILRYISGTKQFGLLFQHSFDLSISAFSDADWASNIDDRKFVSAYCVFIGNN

Query:  LVSWSSKKQTVVAHSSTESEYRALALAAPEVIWLK-------------------QLLNELDVSTSLKPVIWCDNLSAGALATNPVFHTRTKHIEIDVDFI
        LVSW+S+KQ VV+ SSTESEYR+LA    EV                        LL EL +    KPV+WCDNLSA ALA+NPV H R+KHIEID+ +I
Subjt:  LVSWSSKKQTVVAHSSTESEYRALALAAPEVIWLK-------------------QLLNELDVSTSLKPVIWCDNLSAGALATNPVFHTRTKHIEIDVDFI

Query:  RDQVLKGALDVRYVPSTDQLANCLTKPLTHSQFRNLRSKLGVVISP
        RDQVL+  + + YVP+ DQ+A+CLTKPL H++F  +R KLGV +SP
Subjt:  RDQVLKGALDVRYVPSTDQLANCLTKPLTHSQFRNLRSKLGVVISP

A0A438FWJ3 Retrovirus-related Pol polyprotein from transposon TNT 1-947.6e-23147.01Show/hide
Query:  QLGKSHNLPFPNSQSHAKEPFAIIYSDLWGPAPSCSNDYFRYYILFLDDYSIYTWIYPLKQKSSAVEAFQHFVIYVKNQFNNTIKVFQSDNGGEYKKIQH
        Q  KSH LPF  S S A  P A++++DLWGPA   S    RY+ILF+DD+S ++WIYPL  K  A+  F  F   V+NQFN+ I+  +SDNGGE+K    
Subjt:  QLGKSHNLPFPNSQSHAKEPFAIIYSDLWGPAPSCSNDYFRYYILFLDDYSIYTWIYPLKQKSSAVEAFQHFVIYVKNQFNNTIKVFQSDNGGEYKKIQH

Query:  LCLNLGINCRFSCPYTSAQNGRAERKHRHIVETGLTLLAQANMTLNYWWDAFLTFVILINGMHTPTLQGLSPIERLFNQKLKVENLKIFGCVCFPCLRPY
             GI  +FSCPYT  QNGRAERK RHI+ETGL LLA A++   +W  AF T + LIN + T  L   SP + LF +       KIFGC+C+P +RPY
Subjt:  LCLNLGINCRFSCPYTSAQNGRAERKHRHIVETGLTLLAQANMTLNYWWDAFLTFVILINGMHTPTLQGLSPIERLFNQKLKVENLKIFGCVCFPCLRPY

Query:  QPTKFSYHSEKCVYLGPSPTHKGFKCLSK-TGRIFISRHVKFNENDFPFSDLFYPAQSTCSTQSASPSLAFFKSWPSPNQTQSNMAPNPQGPQLTSTTQI
           K SY S +CV+LG S  HKG+ CL+  TGR++++RHV F+E  FPF       QST                  P+Q               S++ +
Subjt:  QPTKFSYHSEKCVYLGPSPTHKGFKCLSK-TGRIFISRHVKFNENDFPFSDLFYPAQSTCSTQSASPSLAFFKSWPSPNQTQSNMAPNPQGPQLTSTTQI

Query:  TLPFPFPIPPMSSIPSSPINITPNNPPSVHSTANPTNSNPNLPHNPLSPSTTITPRENSPYSSSSPPSPLSLGPASIDHTVQPSNSHIPTHSMITRAKAD
        T+P P  +P  S              P V S  + T           +PST+  P  N P S+ S P  + +  A I  T +P  ++   H M+TRAK  
Subjt:  TLPFPFPIPPMSSIPSSPINITPNNPPSVHSTANPTNSNPNLPHNPLSPSTTITPRENSPYSSSSPPSPLSLGPASIDHTVQPSNSHIPTHSMITRAKAD

Query:  IFKPKACISKSFTDWTLTEPTRIKEALITLQWKKAMDAEYCALLATNTWSLVPPSSSQNIVGSKWIFKLKRNSDGSIQRYKARLVAKGFHQHPGVDFFET
        I K K   S       ++EPT   +A+    W  AM+ E+ AL   NTW LVPP S+ NI+G KW++KLK   DG++ RYKARLVA+GF Q  G+D+FET
Subjt:  IFKPKACISKSFTDWTLTEPTRIKEALITLQWKKAMDAEYCALLATNTWSLVPPSSSQNIVGSKWIFKLKRNSDGSIQRYKARLVAKGFHQHPGVDFFET

Query:  FSPVVKASTLWVVLSIGLARGWTLRQLDFNNAFLNGQLEESVYITQPPEYVGSSHSHYVCKLNKAIYGLKQAPRAWNNTLSKALLNWGFVNSKSDSSLFI
        FSPVVKAST+ ++L++ L+  W++ QLD  NAFL+G LEE V++ QPP ++ S +  +VCKLNKA+YGLKQAPRAW N LS +LL WGF  S++DSS+FI
Subjt:  FSPVVKASTLWVVLSIGLARGWTLRQLDFNNAFLNGQLEESVYITQPPEYVGSSHSHYVCKLNKAIYGLKQAPRAWNNTLSKALLNWGFVNSKSDSSLFI

Query:  LRCQHSIILLLAYVDDAIITGNDPSLIQSLVTSLDKQFALKDLGALSYFLSFQVKYLDCGFILSEEKYVDDLLHRLQMTDLKAAPSPSVVGKTLSALDSK
            H +++LL YVDD ++TG+  + + S +T L+  FAL+DLG ++YFL  +V      F LS+ KY  DLL R  M D K A +P ++G+TLS LD +
Subjt:  LRCQHSIILLLAYVDDAIITGNDPSLIQSLVTSLDKQFALKDLGALSYFLSFQVKYLDCGFILSEEKYVDDLLHRLQMTDLKAAPSPSVVGKTLSALDSK

Query:  LLDDPSLYRSTIGALQYLTNTRPDIAYMVNHLSQFLQRPTDLHWQAVKRILRYISGTKQFGLLFQHSFDLSISAFSDADWASNIDDRKFVSAYCVFIGNN
           D +LYRST+GALQYLT TRPDI++ VN   QF+  PT  HW AVKRILRY+ GT  +G+  Q S  L I  ++DADWAS  DDR+    Y +F+G N
Subjt:  LLDDPSLYRSTIGALQYLTNTRPDIAYMVNHLSQFLQRPTDLHWQAVKRILRYISGTKQFGLLFQHSFDLSISAFSDADWASNIDDRKFVSAYCVFIGNN

Query:  LVSWSSKKQTVVAHSSTESEYRALALAAPEVIWLKQLLNELDVSTSLKPVIWCDNLSAGALATNPVFHTRTKHIEIDVDFIRDQVLKGALDVRYVPSTDQ
        LVSWSS KQ VV+ SS ESEYRALA A  E+IW++ +L EL +S+S  P++WCDN SA  LA NPVFH RTKHIE+D+ FIRD VL+  L ++Y+PS +Q
Subjt:  LVSWSSKKQTVVAHSSTESEYRALALAAPEVIWLKQLLNELDVSTSLKPVIWCDNLSAGALATNPVFHTRTKHIEIDVDFIRDQVLKGALDVRYVPSTDQ

Query:  LANCLTKPLTHSQFRNLRSKLGVVISPPTRLRGDVR
        +A+  TK ++ SQF + R+KL VV S P  LRGD R
Subjt:  LANCLTKPLTHSQFRNLRSKLGVVISPPTRLRGDVR

A0A438JAU4 Retrovirus-related Pol polyprotein from transposon TNT 1-941.3e-23047.01Show/hide
Query:  QLGKSHNLPFPNSQSHAKEPFAIIYSDLWGPAPSCSNDYFRYYILFLDDYSIYTWIYPLKQKSSAVEAFQHFVIYVKNQFNNTIKVFQSDNGGEYKKIQH
        Q  KSH LPF  S S A  P A++++DLWGPA   S    RY+ILF+DD+S ++WIYPL  K  A+  F  F   V+NQFN+ I+  +SDNGGE+K    
Subjt:  QLGKSHNLPFPNSQSHAKEPFAIIYSDLWGPAPSCSNDYFRYYILFLDDYSIYTWIYPLKQKSSAVEAFQHFVIYVKNQFNNTIKVFQSDNGGEYKKIQH

Query:  LCLNLGINCRFSCPYTSAQNGRAERKHRHIVETGLTLLAQANMTLNYWWDAFLTFVILINGMHTPTLQGLSPIERLFNQKLKVENLKIFGCVCFPCLRPY
             GI  +FSCPYT  QNGRAERK RHI+ETGL LLA A++   +W  AF T + LIN + T  L   SP + LF +       KIFGC+C+P +RPY
Subjt:  LCLNLGINCRFSCPYTSAQNGRAERKHRHIVETGLTLLAQANMTLNYWWDAFLTFVILINGMHTPTLQGLSPIERLFNQKLKVENLKIFGCVCFPCLRPY

Query:  QPTKFSYHSEKCVYLGPSPTHKGFKCLSK-TGRIFISRHVKFNENDFPFSDLFYPAQSTCSTQSASPSLAFFKSWPSPNQTQSNMAPNPQGPQLTSTTQI
           K SY S +CV+LG S  HKG+ CL+  TGR++++RHV F+E  FPF       QST                  P+Q               S++ +
Subjt:  QPTKFSYHSEKCVYLGPSPTHKGFKCLSK-TGRIFISRHVKFNENDFPFSDLFYPAQSTCSTQSASPSLAFFKSWPSPNQTQSNMAPNPQGPQLTSTTQI

Query:  TLPFPFPIPPMSSIPSSPINITPNNPPSVHSTANPTNSNPNLPHNPLSPSTTITPRENSPYSSSSPPSPLSLGPASIDHTVQPSNSHIPTHSMITRAKAD
        T+P P  +P  S              P V S  + T           +PST+  P  N P S+ S P  + +  A I  T +P  ++   H M+TRAK  
Subjt:  TLPFPFPIPPMSSIPSSPINITPNNPPSVHSTANPTNSNPNLPHNPLSPSTTITPRENSPYSSSSPPSPLSLGPASIDHTVQPSNSHIPTHSMITRAKAD

Query:  IFKPKACISKSFTDWTLTEPTRIKEALITLQWKKAMDAEYCALLATNTWSLVPPSSSQNIVGSKWIFKLKRNSDGSIQRYKARLVAKGFHQHPGVDFFET
        I K K   S       ++EPT   +A+    W  AM+ E+ AL   NTW LVPP S+ NI+G KW++KLK   DG++ RYKARLVA+GF Q  G+D+FET
Subjt:  IFKPKACISKSFTDWTLTEPTRIKEALITLQWKKAMDAEYCALLATNTWSLVPPSSSQNIVGSKWIFKLKRNSDGSIQRYKARLVAKGFHQHPGVDFFET

Query:  FSPVVKASTLWVVLSIGLARGWTLRQLDFNNAFLNGQLEESVYITQPPEYVGSSHSHYVCKLNKAIYGLKQAPRAWNNTLSKALLNWGFVNSKSDSSLFI
        FSPVVKAST+ ++L++ L+  W++ QLD  NAFL+G LEE V++ QPP ++ S +  +VCKLNKA+YGLKQAPRAW N LS +LL WGF  S++DSS+FI
Subjt:  FSPVVKASTLWVVLSIGLARGWTLRQLDFNNAFLNGQLEESVYITQPPEYVGSSHSHYVCKLNKAIYGLKQAPRAWNNTLSKALLNWGFVNSKSDSSLFI

Query:  LRCQHSIILLLAYVDDAIITGNDPSLIQSLVTSLDKQFALKDLGALSYFLSFQVKYLDCGFILSEEKYVDDLLHRLQMTDLKAAPSPSVVGKTLSALDSK
            H +++LL YVDD ++TG+  + + S +T L+  FAL+DLG ++YFL  +V      F LS+ KY  DLL R  M D K A +P ++G+TLS LD +
Subjt:  LRCQHSIILLLAYVDDAIITGNDPSLIQSLVTSLDKQFALKDLGALSYFLSFQVKYLDCGFILSEEKYVDDLLHRLQMTDLKAAPSPSVVGKTLSALDSK

Query:  LLDDPSLYRSTIGALQYLTNTRPDIAYMVNHLSQFLQRPTDLHWQAVKRILRYISGTKQFGLLFQHSFDLSISAFSDADWASNIDDRKFVSAYCVFIGNN
           D +LYRST+GALQYLT TRPDI++ VN   QF+  PT  HW AVKRILRY+ GT  +G+  Q S  L I  ++DADWAS  DDR+    Y +F+G N
Subjt:  LLDDPSLYRSTIGALQYLTNTRPDIAYMVNHLSQFLQRPTDLHWQAVKRILRYISGTKQFGLLFQHSFDLSISAFSDADWASNIDDRKFVSAYCVFIGNN

Query:  LVSWSSKKQTVVAHSSTESEYRALALAAPEVIWLKQLLNELDVSTSLKPVIWCDNLSAGALATNPVFHTRTKHIEIDVDFIRDQVLKGALDVRYVPSTDQ
        LVSWSS KQ VV+ SS ESEYRALA A  E+IW++ +L EL +S+S  P++WCDN SA  LA NPVFH RTKHIE+D+ FIRD VL+  L ++Y+PS +Q
Subjt:  LVSWSSKKQTVVAHSSTESEYRALALAAPEVIWLKQLLNELDVSTSLKPVIWCDNLSAGALATNPVFHTRTKHIEIDVDFIRDQVLKGALDVRYVPSTDQ

Query:  LANCLTKPLTHSQFRNLRSKLGVVISPPTRLRGDVR
        +A+  TK ++ SQF + R+KL VV S P  LRGD R
Subjt:  LANCLTKPLTHSQFRNLRSKLGVVISPPTRLRGDVR

A5AYB0 Integrase catalytic domain-containing protein4.1e-23247.99Show/hide
Query:  LISQLGKSHNLPFPNSQSHAKEPFAIIYSDLWGPAPSCSNDYFRYYILFLDDYSIYTWIYPLKQKSSAVEAFQHFVIYVKNQFNNTIKVFQSDNGGEYKK
        +I  L KSH+LP+  S SHA  P A+I++DLWGPAPS S    RY+++F+DDYS +TWIY L  K  A+++F  F   V+NQ   TIK  QSDNGGE+  
Subjt:  LISQLGKSHNLPFPNSQSHAKEPFAIIYSDLWGPAPSCSNDYFRYYILFLDDYSIYTWIYPLKQKSSAVEAFQHFVIYVKNQFNNTIKVFQSDNGGEYKK

Query:  IQHLCLNLGINCRFSCPYTSAQNGRAERKHRHIVETGLTLLAQANMTLNYWWDAFLTFVILINGMHTPTLQGLSPIERLFNQKLKVENLKIFGCVCFPCL
         +      GI  +FSCP+T  QNGRAERK RH+VETGL L+AQ+ +   YW  AF T V LIN +    L   SP + LF++     +L++FGC+CFP L
Subjt:  IQHLCLNLGINCRFSCPYTSAQNGRAERKHRHIVETGLTLLAQANMTLNYWWDAFLTFVILINGMHTPTLQGLSPIERLFNQKLKVENLKIFGCVCFPCL

Query:  RPYQPTKFSYHSEKCVYLGPSPTHKGFKCLS-KTGRIFISRHVKFNENDFPFSDLFYPAQSTCSTQSASPSLAFFKSWPSPNQTQSNMAPNPQGPQLTST
        RPY   K  Y S  CV+LG +P HKG+ CL   T RI+ISR+V F+E+ FPF             QS+SP                   P+   P L S+
Subjt:  RPYQPTKFSYHSEKCVYLGPSPTHKGFKCLS-KTGRIFISRHVKFNENDFPFSDLFYPAQSTCSTQSASPSLAFFKSWPSPNQTQSNMAPNPQGPQLTST

Query:  TQITLPFPFPIPPMSSIPSSPINITPNNPPSVHSTANPTNSNPNLPHNPLSPSTTITPRENSPYSSSSPPSPLSLGPASIDHTVQPSNSHIPTHSMITRA
        T   +  P    P S   SSPI IT ++ P                  PL P    T   +SP + S PP PL+                  TH M+TRA
Subjt:  TQITLPFPFPIPPMSSIPSSPINITPNNPPSVHSTANPTNSNPNLPHNPLSPSTTITPRENSPYSSSSPPSPLSLGPASIDHTVQPSNSHIPTHSMITRA

Query:  KADIFKPKACISKSFTDWTLTEPTRIKEALITLQWKKAMDAEYCALLATNTWSLVPPSSSQNIVGSKWIFKLKRNSDGSIQRYKARLVAKGFHQHPGVDF
        K+ I K      +SF     TEP    +A     W +AM++EY ALL  NTWSLVPP SS +IVG +WI+KLK   DGSI R+KARLVA+GF Q PG+D+
Subjt:  KADIFKPKACISKSFTDWTLTEPTRIKEALITLQWKKAMDAEYCALLATNTWSLVPPSSSQNIVGSKWIFKLKRNSDGSIQRYKARLVAKGFHQHPGVDF

Query:  FETFSPVVKASTLWVVLSIGLARGWTLRQLDFNNAFLNGQLEESVYITQPPEYVGSSHSHYVCKLNKAIYGLKQAPRAWNNTLSKALLNWGFVNSKSDSS
        F+TFSPVVK  T+ ++L++ ++  W++RQLD  NAFLNG LEE V++TQP  +V  ++  YVCKL+KA+YGLKQAPRAW   L  ALL++GF +S++D+S
Subjt:  FETFSPVVKASTLWVVLSIGLARGWTLRQLDFNNAFLNGQLEESVYITQPPEYVGSSHSHYVCKLNKAIYGLKQAPRAWNNTLSKALLNWGFVNSKSDSS

Query:  LFILRCQHSIILLLAYVDDAIITGNDPSLIQSLVTSLDKQFALKDLGALSYFLSFQVKYLDCGFILSEEKYVDDLLHRLQMTDLKAAPSPSVVGKTLSAL
        LFI      I++LL YVDD ++TG++P L+   ++ L  +FAL+DLG LSYFL  Q + L     L++ KY+ DLL+R QM   K AP+P  +G+TLS  
Subjt:  LFILRCQHSIILLLAYVDDAIITGNDPSLIQSLVTSLDKQFALKDLGALSYFLSFQVKYLDCGFILSEEKYVDDLLHRLQMTDLKAAPSPSVVGKTLSAL

Query:  DSKLLDDPSLYRSTIGALQYLTNTRPDIAYMVNHLSQFLQRPTDLHWQAVKRILRYISGTKQFGLLFQHSFDLSISAFSDADWASNIDDRKFVSAYCVFI
        D   L DPS YR T+GALQY+T TRPDIA+ VN   QF+ +P+D+HW AVKRILRY+ GT   GL FQ +  + +  +SDADWAS  DDR+  S YCVF+
Subjt:  DSKLLDDPSLYRSTIGALQYLTNTRPDIAYMVNHLSQFLQRPTDLHWQAVKRILRYISGTKQFGLLFQHSFDLSISAFSDADWASNIDDRKFVSAYCVFI

Query:  GNNLVSWSSKKQTVVAHSSTESEYRALALAAPEVIWLKQLLNELDVSTSLKPVIWCDNLSAGALATNPVFHTRTKHIEIDVDFIRDQVLKGALDVRYVPS
        G+NL+SWSS KQ +V+ SS ESEYR L     E++W++ LL EL + TS  P++WCDN SA  LA NPVFH+R+KHIE+D+ FIR++VL+  L + YVPS
Subjt:  GNNLVSWSSKKQTVVAHSSTESEYRALALAAPEVIWLKQLLNELDVSTSLKPVIWCDNLSAGALATNPVFHTRTKHIEIDVDFIRDQVLKGALDVRYVPS

Query:  TDQLANCLTKPLTHSQFRNLRSKLGVVISPPTRLRGDVRDNNCD
         DQLA+  TK L  +QF NLRSKL  V  PP  LRGD  DN  D
Subjt:  TDQLANCLTKPLTHSQFRNLRSKLGVVISPPTRLRGDVRDNNCD

SwissProt top hitse value%identityAlignment
P04146 Copia protein5.3e-10430.79Show/hide
Query:  GKSHNLPFP--NSQSHAKEPFAIIYSDLWGPAPSCSNDYFRYYILFLDDYSIYTWIYPLKQKSSAVEAFQHFVIYVKNQFNNTIKVFQSDNGGEY--KKI
        GK   LPF     ++H K P  +++SD+ GP    + D   Y+++F+D ++ Y   Y +K KS     FQ FV   +  FN  +     DNG EY   ++
Subjt:  GKSHNLPFP--NSQSHAKEPFAIIYSDLWGPAPSCSNDYFRYYILFLDDYSIYTWIYPLKQKSSAVEAFQHFVIYVKNQFNNTIKVFQSDNGGEY--KKI

Query:  QHLCLNLGINCRFSCPYTSAQNGRAERKHRHIVETGLTLLAQANMTLNYWWDAFLTFVILINGMHTPTL--QGLSPIERLFNQKLKVENLKIFGCVCFPC
        +  C+  GI+   + P+T   NG +ER  R I E   T+++ A +  ++W +A LT   LIN + +  L     +P E   N+K  +++L++FG   +  
Subjt:  QHLCLNLGINCRFSCPYTSAQNGRAERKHRHIVETGLTLLAQANMTLNYWWDAFLTFVILINGMHTPTL--QGLSPIERLFNQKLKVENLKIFGCVCFPC

Query:  LRPYQPTKFSYHSEKCVYLGPSPTHKGFKCLSKTGRIFI-SRHVKFNENDF------PFSDLFYPAQSTCSTQSASPSLAFFKSWPSPNQTQ--------
        ++  Q  KF   S K +++G  P   GFK        FI +R V  +E +        F  +F         ++             PN+++        
Subjt:  LRPYQPTKFSYHSEKCVYLGPSPTHKGFKCLSKTGRIFI-SRHVKFNENDF------PFSDLFYPAQSTCSTQSASPSLAFFKSWPSPNQTQ--------

Query:  --SNMAPNPQGPQLTSTTQITLPFPFPIPPMSSIPSSPINITPNNPPSVHSTANPTNSNPNLPHNPLSPSTTITPRENSPYSSSSPPSPLSLGPASIDHT
          S  + N   P   S   I   F     P  S     I    ++  S     N +       H   S  +       +P  S    +   L    ID+ 
Subjt:  --SNMAPNPQGPQLTSTTQITLPFPFPIPPMSSIPSSPINITPNNPPSVHSTANPTNSNPNLPHNPLSPSTTITPRENSPYSSSSPPSPLSLGPASIDHT

Query:  VQPSNSHIPTHSMITRAKADIFKPKACISKSFTDWTLTE------------PTRIKEALI---TLQWKKAMDAEYCALLATNTWSLVPPSSSQNIVGSKW
         +  N  I     I   +++  K K  IS +  D +L +            P    E         W++A++ E  A    NTW++     ++NIV S+W
Subjt:  VQPSNSHIPTHSMITRAKADIFKPKACISKSFTDWTLTE------------PTRIKEALI---TLQWKKAMDAEYCALLATNTWSLVPPSSSQNIVGSKW

Query:  IFKLKRNSDGSIQRYKARLVAKGFHQHPGVDFFETFSPVVKASTLWVVLSIGLARGWTLRQLDFNNAFLNGQLEESVYITQPPEYVGSSHSHYVCKLNKA
        +F +K N  G+  RYKARLVA+GF Q   +D+ ETF+PV + S+   +LS+ +     + Q+D   AFLNG L+E +Y+  P     S +S  VCKLNKA
Subjt:  IFKLKRNSDGSIQRYKARLVAKGFHQHPGVDFFETFSPVVKASTLWVVLSIGLARGWTLRQLDFNNAFLNGQLEESVYITQPPEYVGSSHSHYVCKLNKA

Query:  IYGLKQAPRAWNNTLSKALLNWGFVNSKSDSSLFIL---RCQHSIILLLAYVDDAIITGNDPSLIQSLVTSLDKQFALKDLGALSYFLSFQVKYLDCGFI
        IYGLKQA R W     +AL    FVNS  D  ++IL       +I +LL YVDD +I   D + + +    L ++F + DL  + +F+  +++  +    
Subjt:  IYGLKQAPRAWNNTLSKALLNWGFVNSKSDSSLFIL---RCQHSIILLLAYVDDAIITGNDPSLIQSLVTSLDKQFALKDLGALSYFLSFQVKYLDCGFI

Query:  LSEEKYVDDLLHRLQMTDLKA--APSPSVVGKTLSALDSKLLDDPSLYRSTIGALQY-LTNTRPDIAYMVNHLSQFLQRPTDLHWQAVKRILRYISGTKQ
        LS+  YV  +L +  M +  A   P PS +   L   D    D  +  RS IG L Y +  TRPD+   VN LS++  +     WQ +KR+LRY+ GT  
Subjt:  LSEEKYVDDLLHRLQMTDLKA--APSPSVVGKTLSALDSKLLDDPSLYRSTIGALQY-LTNTRPDIAYMVNHLSQFLQRPTDLHWQAVKRILRYISGTKQ

Query:  FGLLFQH--SFDLSISAFSDADWASNIDDRKFVSAYCV-FIGNNLVSWSSKKQTVVAHSSTESEYRALALAAPEVIWLKQLLNELDVSTSLKPVIWCDNL
          L+F+   +F+  I  + D+DWA +  DRK  + Y       NL+ W++K+Q  VA SSTE+EY AL  A  E +WLK LL  +++       I+ DN 
Subjt:  FGLLFQH--SFDLSISAFSDADWASNIDDRKFVSAYCV-FIGNNLVSWSSKKQTVVAHSSTESEYRALALAAPEVIWLKQLLNELDVSTSLKPVIWCDNL

Query:  SAGALATNPVFHTRTKHIEIDVDFIRDQVLKGALDVRYVPSTDQLANCLTKPLTHSQFRNLRSKLGVV
           ++A NP  H R KHI+I   F R+QV    + + Y+P+ +QLA+  TKPL  ++F  LR KLG++
Subjt:  SAGALATNPVFHTRTKHIEIDVDFIRDQVLKGALDVRYVPSTDQLANCLTKPLTHSQFRNLRSKLGVV

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-944.8e-11331.49Show/hide
Query:  GKSHNLPFPNSQSHAKEPFAIIYSDLWGPAPSCSNDYFRYYILFLDDYSIYTWIYPLKQKSSAVEAFQHFVIYVKNQFNNTIKVFQSDNGGEY--KKIQH
        GK H + F  S         ++YSD+ GP    S    +Y++ F+DD S   W+Y LK K    + FQ F   V+ +    +K  +SDNGGEY  ++ + 
Subjt:  GKSHNLPFPNSQSHAKEPFAIIYSDLWGPAPSCSNDYFRYYILFLDDYSIYTWIYPLKQKSSAVEAFQHFVIYVKNQFNNTIKVFQSDNGGEY--KKIQH

Query:  LCLNLGINCRFSCPYTSAQNGRAERKHRHIVETGLTLLAQANMTLNYWWDAFLTFVILINGMHTPTLQGLSPIERLFNQKLKVENLKIFGCVCFPCLRPY
         C + GI    + P T   NG AER +R IVE   ++L  A +  ++W +A  T   LIN   +  L    P     N+++   +LK+FGC  F  +   
Subjt:  LCLNLGINCRFSCPYTSAQNGRAERKHRHIVETGLTLLAQANMTLNYWWDAFLTFVILINGMHTPTLQGLSPIERLFNQKLKVENLKIFGCVCFPCLRPY

Query:  QPTKFSYHSEKCVYLGPSPTHKGFKCLSKT-GRIFISRHVKFNENDFPFSDLFYPAQSTCSTQSASPSLAFFKSWPSPNQTQSNMAPNPQGPQLTSTTQI
        Q TK    S  C+++G      G++       ++  SR V F E++                                 +T ++M+   +          
Subjt:  QPTKFSYHSEKCVYLGPSPTHKGFKCLSKT-GRIFISRHVKFNENDFPFSDLFYPAQSTCSTQSASPSLAFFKSWPSPNQTQSNMAPNPQGPQLTSTTQI

Query:  TLPFPFPIPPMSSIPSSPINITPNNPPSVHSTANPTNSNPNLPHNPLSPSTTITPRENSPYSSSSPPSPLSLGPASIDHTVQPSNSHIPTHSMITRAKAD
               IP   +IPS     T NNP S  ST +  +     P   +                      L  G   ++H  Q    H P    + R++  
Subjt:  TLPFPFPIPPMSSIPSSPINITPNNPPSVHSTANPTNSNPNLPHNPLSPSTTITPRENSPYSSSSPPSPLSLGPASIDHTVQPSNSHIPTHSMITRAKAD

Query:  IFKPKACISKSFTDWTL----TEPTRIKEALI---TLQWKKAMDAEYCALLATNTWSLVPPSSSQNIVGSKWIFKLKRNSDGSIQRYKARLVAKGFHQHP
          + +   S   T++ L     EP  +KE L      Q  KAM  E  +L    T+ LV     +  +  KW+FKLK++ D  + RYKARLV KGF Q  
Subjt:  IFKPKACISKSFTDWTL----TEPTRIKEALI---TLQWKKAMDAEYCALLATNTWSLVPPSSSQNIVGSKWIFKLKRNSDGSIQRYKARLVAKGFHQHP

Query:  GVDFFETFSPVVKASTLWVVLSIGLARGWTLRQLDFNNAFLNGQLEESVYITQPPEYVGSSHSHYVCKLNKAIYGLKQAPRAWNNTLSKALLNWGFVNSK
        G+DF E FSPVVK +++  +LS+  +    + QLD   AFL+G LEE +Y+ QP  +  +   H VCKLNK++YGLKQAPR W       + +  ++ + 
Subjt:  GVDFFETFSPVVKASTLWVVLSIGLARGWTLRQLDFNNAFLNGQLEESVYITQPPEYVGSSHSHYVCKLNKAIYGLKQAPRAWNNTLSKALLNWGFVNSK

Query:  SDSSLFILR-CQHSIILLLAYVDDAIITGNDPSLIQSLVTSLDKQFALKDLGALSYFLSFQV--KYLDCGFILSEEKYVDDLLHRLQMTDLKAAPSPSVV
        SD  ++  R  +++ I+LL YVDD +I G D  LI  L   L K F +KDLG     L  ++  +       LS+EKY++ +L R  M + K   +P   
Subjt:  SDSSLFILR-CQHSIILLLAYVDDAIITGNDPSLIQSLVTSLDKQFALKDLGALSYFLSFQV--KYLDCGFILSEEKYVDDLLHRLQMTDLKAAPSPSVV

Query:  GKTLS------ALDSKLLDDPSLYRSTIGALQY-LTNTRPDIAYMVNHLSQFLQRPTDLHWQAVKRILRYISGTKQFGLLFQHSFDLSISAFSDADWASN
           LS       ++ K       Y S +G+L Y +  TRPDIA+ V  +S+FL+ P   HW+AVK ILRY+ GT    L F  S D  +  ++DAD A +
Subjt:  GKTLS------ALDSKLLDDPSLYRSTIGALQY-LTNTRPDIAYMVNHLSQFLQRPTDLHWQAVKRILRYISGTKQFGLLFQHSFDLSISAFSDADWASN

Query:  IDDRKFVSAYCVFIGNNLVSWSSKKQTVVAHSSTESEYRALALAAPEVIWLKQLLNELDVSTSLKPVIWCDNLSAGALATNPVFHTRTKHIEIDVDFIRD
        ID+RK  + Y        +SW SK Q  VA S+TE+EY A      E+IWLK+ L EL +    + V++CD+ SA  L+ N ++H RTKHI++   +IR+
Subjt:  IDDRKFVSAYCVFIGNNLVSWSSKKQTVVAHSSTESEYRALALAAPEVIWLKQLLNELDVSTSLKPVIWCDNLSAGALATNPVFHTRTKHIEIDVDFIRD

Query:  QVLKGALDVRYVPSTDQLANCLTKPLTHSQFRNLRSKLGV
         V   +L V  + + +  A+ LTK +  ++F   +  +G+
Subjt:  QVLKGALDVRYVPSTDQLANCLTKPLTHSQFRNLRSKLGV

P92519 Uncharacterized mitochondrial protein AtMg008107.8e-4743.3Show/hide
Query:  LLAYVDDAIITGNDPSLIQSLVTSLDKQFALKDLGALSYFLSFQVKYLDCGFILSEEKYVDDLLHRLQMTDLKAAPSPSVVGKTLSALDSKLLDDPSLYR
        LL YVDD ++TG+  +L+  L+  L   F++KDLG + YFL  Q+K    G  LS+ KY + +L+   M D K   +P  + K  S++ +    DPS +R
Subjt:  LLAYVDDAIITGNDPSLIQSLVTSLDKQFALKDLGALSYFLSFQVKYLDCGFILSEEKYVDDLLHRLQMTDLKAAPSPSVVGKTLSALDSKLLDDPSLYR

Query:  STIGALQYLTNTRPDIAYMVNHLSQFLQRPTDLHWQAVKRILRYISGTKQFGLLFQHSFDLSISAFSDADWASNIDDRKFVSAYCVFIGNNLVSWSSKKQ
        S +GALQYLT TRPDI+Y VN + Q +  PT   +  +KR+LRY+ GT   GL    +  L++ AF D+DWA     R+  + +C F+G N++SWS+K+Q
Subjt:  STIGALQYLTNTRPDIAYMVNHLSQFLQRPTDLHWQAVKRILRYISGTKQFGLLFQHSFDLSISAFSDADWASNIDDRKFVSAYCVFIGNNLVSWSSKKQ

Query:  TVVAHSSTESEYRALALAAPEVIW
          V+ SSTE+EYRALAL A E+ W
Subjt:  TVVAHSSTESEYRALALAAPEVIW

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE12.9e-21143.78Show/hide
Query:  LGKSHNLPFPNSQSHAKEPFAIIYSDLWGPAPSCSNDYFRYYILFLDDYSIYTWIYPLKQKSSAVEAFQHFVIYVKNQFNNTIKVFQSDNGGEYKKIQHL
        + KS+ +PF  S  ++  P   IYSD+W  +P  S+D +RYY++F+D ++ YTW+YPLKQKS   E F  F   ++N+F   I  F SDNGGE+  +   
Subjt:  LGKSHNLPFPNSQSHAKEPFAIIYSDLWGPAPSCSNDYFRYYILFLDDYSIYTWIYPLKQKSSAVEAFQHFVIYVKNQFNNTIKVFQSDNGGEYKKIQHL

Query:  CLNLGINCRFSCPYTSAQNGRAERKHRHIVETGLTLLAQANMTLNYWWDAFLTFVILINGMHTPTLQGLSPIERLFNQKLKVENLKIFGCVCFPCLRPYQ
            GI+   S P+T   NG +ERKHRHIVETGLTLL+ A++   YW  AF   V LIN + TP LQ  SP ++LF      + L++FGC C+P LRPY 
Subjt:  CLNLGINCRFSCPYTSAQNGRAERKHRHIVETGLTLLAQANMTLNYWWDAFLTFVILINGMHTPTLQGLSPIERLFNQKLKVENLKIFGCVCFPCLRPYQ

Query:  PTKFSYHSEKCVYLGPSPTHKGFKCLS-KTGRIFISRHVKFNENDFPFSDLFYPAQSTCSTQSASPSLAFFKSWPSPNQTQSNMAPNPQGPQLTSTTQIT
          K    S +CV+LG S T   + CL  +T R++ISRHV+F+EN FPFS+ +    S    Q    S  +      P +T    AP+   P   +T   +
Subjt:  PTKFSYHSEKCVYLGPSPTHKGFKCLS-KTGRIFISRHVKFNENDFPFSDLFYPAQSTCSTQSASPSLAFFKSWPSPNQTQSNMAPNPQGPQLTSTTQIT

Query:  LPFPFPIPPM----------SSIPSSPINITP--NNP----------PSVHSTA-----NPTNSNP-------NLPHNPLSPSTTITPRENSPYSSSSPP
           PF    +          SS PSSP    P  N P             HS+      NPTN +P       + P    S S + T   +S  +S +PP
Subjt:  LPFPFPIPPM----------SSIPSSPINITP--NNP----------PSVHSTA-----NPTNSNP-------NLPHNPLSPSTTITPRENSPYSSSSPP

Query:  SPLSLGPASIDHTVQPSN-SHIPTHSMITRAKADIFKPKACISKSFTDWTLTEPTRIKEALITLQWKKAMDAEYCALLATNTWSLVPPSSSQ-NIVGSKW
        S L   P  +   V  +N + + THSM TRAKA I KP    S + +    +EP    +AL   +W+ AM +E  A +  +TW LVPP  S   IVG +W
Subjt:  SPLSLGPASIDHTVQPSN-SHIPTHSMITRAKADIFKPKACISKSFTDWTLTEPTRIKEALITLQWKKAMDAEYCALLATNTWSLVPPSSSQ-NIVGSKW

Query:  IFKLKRNSDGSIQRYKARLVAKGFHQHPGVDFFETFSPVVKASTLWVVLSIGLARGWTLRQLDFNNAFLNGQLEESVYITQPPEYVGSSHSHYVCKLNKA
        IF  K NSDGS+ RYKARLVAKG++Q PG+D+ ETFSPV+K++++ +VL + + R W +RQLD NNAFL G L + VY++QPP ++     +YVCKL KA
Subjt:  IFKLKRNSDGSIQRYKARLVAKGFHQHPGVDFFETFSPVVKASTLWVVLSIGLARGWTLRQLDFNNAFLNGQLEESVYITQPPEYVGSSHSHYVCKLNKA

Query:  IYGLKQAPRAWNNTLSKALLNWGFVNSKSDSSLFILRCQHSIILLLAYVDDAIITGNDPSLIQSLVTSLDKQFALKDLGALSYFLSFQVKYLDCGFILSE
        +YGLKQAPRAW   L   LL  GFVNS SD+SLF+L+   SI+ +L YVDD +ITGNDP+L+ + + +L ++F++KD   L YFL  + K +  G  LS+
Subjt:  IYGLKQAPRAWNNTLSKALLNWGFVNSKSDSSLFILRCQHSIILLLAYVDDAIITGNDPSLIQSLVTSLDKQFALKDLGALSYFLSFQVKYLDCGFILSE

Query:  EKYVDDLLHRLQMTDLKAAPSPSVVGKTLSALDSKLLDDPSLYRSTIGALQYLTNTRPDIAYMVNHLSQFLQRPTDLHWQAVKRILRYISGTKQFGLLFQ
         +Y+ DLL R  M   K   +P      LS      L DP+ YR  +G+LQYL  TRPDI+Y VN LSQF+  PT+ H QA+KRILRY++GT   G+  +
Subjt:  EKYVDDLLHRLQMTDLKAAPSPSVVGKTLSALDSKLLDDPSLYRSTIGALQYLTNTRPDIAYMVNHLSQFLQRPTDLHWQAVKRILRYISGTKQFGLLFQ

Query:  HSFDLSISAFSDADWASNIDDRKFVSAYCVFIGNNLVSWSSKKQTVVAHSSTESEYRALALAAPEVIWLKQLLNELDVSTSLKPVIWCDNLSAGALATNP
            LS+ A+SDADWA + DD    + Y V++G++ +SWSSKKQ  V  SSTE+EYR++A  + E+ W+  LL EL +  +  PVI+CDN+ A  L  NP
Subjt:  HSFDLSISAFSDADWASNIDDRKFVSAYCVFIGNNLVSWSSKKQTVVAHSSTESEYRALALAAPEVIWLKQLLNELDVSTSLKPVIWCDNLSAGALATNP

Query:  VFHTRTKHIEIDVDFIRDQVLKGALDVRYVPSTDQLANCLTKPLTHSQFRNLRSKLGVVISPPT
        VFH+R KHI ID  FIR+QV  GAL V +V + DQLA+ LTKPL+ + F+N  SK+GV   PP+
Subjt:  VFHTRTKHIEIDVDFIRDQVLKGALDVRYVPSTDQLANCLTKPLTHSQFRNLRSKLGVVISPPT

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE21.8e-20843.3Show/hide
Query:  LGKSHNLPFPNSQSHAKEPFAIIYSDLWGPAPSCSNDYFRYYILFLDDYSIYTWIYPLKQKSSAVEAFQHFVIYVKNQFNNTIKVFQSDNGGEYKKIQHL
        + KSH +PF NS   + +P   IYSD+W  +P  S D +RYY++F+D ++ YTW+YPLKQKS   + F  F   V+N+F   I    SDNGGE+  ++  
Subjt:  LGKSHNLPFPNSQSHAKEPFAIIYSDLWGPAPSCSNDYFRYYILFLDDYSIYTWIYPLKQKSSAVEAFQHFVIYVKNQFNNTIKVFQSDNGGEYKKIQHL

Query:  CLNLGINCRFSCPYTSAQNGRAERKHRHIVETGLTLLAQANMTLNYWWDAFLTFVILINGMHTPTLQGLSPIERLFNQKLKVENLKIFGCVCFPCLRPYQ
            GI+   S P+T   NG +ERKHRHIVE GLTLL+ A++   YW  AF   V LIN + TP LQ  SP ++LF Q    E LK+FGC C+P LRPY 
Subjt:  CLNLGINCRFSCPYTSAQNGRAERKHRHIVETGLTLLAQANMTLNYWWDAFLTFVILINGMHTPTLQGLSPIERLFNQKLKVENLKIFGCVCFPCLRPYQ

Query:  PTKFSYHSEKCVYLGPSPTHKGFKCLS-KTGRIFISRHVKFNENDFPFSDLFYPAQSTCSTQSAS----PSLAFFKSWPSPNQTQSNMAPN-------PQ
          K    S++C ++G S T   + CL   TGR++ SRHV+F+E  FPFS   +   ++   +S S    PS     + P        + P+       P 
Subjt:  PTKFSYHSEKCVYLGPSPTHKGFKCLS-KTGRIFISRHVKFNENDFPFSDLFYPAQSTCSTQSAS----PSLAFFKSWPSPNQTQSNMAPN-------PQ

Query:  GPQLTSTTQIT---LPFPFPIPPMSSIPSSPINITP-------------------NNPPSVHSTANPTNSNPNLPHNPLS------PSTTIT-PRENSPY
         P    TTQ++   LP      P SS P++P +  P                   NNP     + N  N N  LP +P+S      PST+I+ P   S  
Subjt:  GPQLTSTTQIT---LPFPFPIPPMSSIPSSPINITP-------------------NNPPSVHSTANPTNSNPNLPHNPLS------PSTTIT-PRENSPY

Query:  SSSSPPSPLSL-GPASIDHTVQPSNSHIPTHSMITRAKADIFKPKACISKSFTDWTLTEPTRIKEALITLQWKKAMDAEYCALLATNTWSLV-PPSSSQN
        S+S+PP P  L  P  I    Q   + + THSM TRAK  I KP    S + +    +EP    +A+   +W++AM +E  A +  +TW LV PP  S  
Subjt:  SSSSPPSPLSL-GPASIDHTVQPSNSHIPTHSMITRAKADIFKPKACISKSFTDWTLTEPTRIKEALITLQWKKAMDAEYCALLATNTWSLV-PPSSSQN

Query:  IVGSKWIFKLKRNSDGSIQRYKARLVAKGFHQHPGVDFFETFSPVVKASTLWVVLSIGLARGWTLRQLDFNNAFLNGQLEESVYITQPPEYVGSSHSHYV
        IVG +WIF  K NSDGS+ RYKARLVAKG++Q PG+D+ ETFSPV+K++++ +VL + + R W +RQLD NNAFL G L + VY++QPP +V      YV
Subjt:  IVGSKWIFKLKRNSDGSIQRYKARLVAKGFHQHPGVDFFETFSPVVKASTLWVVLSIGLARGWTLRQLDFNNAFLNGQLEESVYITQPPEYVGSSHSHYV

Query:  CKLNKAIYGLKQAPRAWNNTLSKALLNWGFVNSKSDSSLFILRCQHSIILLLAYVDDAIITGNDPSLIQSLVTSLDKQFALKDLGALSYFLSFQVKYLDC
        C+L KAIYGLKQAPRAW   L   LL  GFVNS SD+SLF+L+   SII +L YVDD +ITGND  L++  + +L ++F++K+   L YFL  + K +  
Subjt:  CKLNKAIYGLKQAPRAWNNTLSKALLNWGFVNSKSDSSLFILRCQHSIILLLAYVDDAIITGNDPSLIQSLVTSLDKQFALKDLGALSYFLSFQVKYLDC

Query:  GFILSEEKYVDDLLHRLQMTDLKAAPSPSVVGKTLSALDSKLLDDPSLYRSTIGALQYLTNTRPDIAYMVNHLSQFLQRPTDLHWQAVKRILRYISGTKQ
        G  LS+ +Y  DLL R  M   K   +P      L+      L DP+ YR  +G+LQYL  TRPD++Y VN LSQ++  PTD HW A+KR+LRY++GT  
Subjt:  GFILSEEKYVDDLLHRLQMTDLKAAPSPSVVGKTLSALDSKLLDDPSLYRSTIGALQYLTNTRPDIAYMVNHLSQFLQRPTDLHWQAVKRILRYISGTKQ

Query:  FGLLFQHSFDLSISAFSDADWASNIDDRKFVSAYCVFIGNNLVSWSSKKQTVVAHSSTESEYRALALAAPEVIWLKQLLNELDVSTSLKPVIWCDNLSAG
         G+  +    LS+ A+SDADWA + DD    + Y V++G++ +SWSSKKQ  V  SSTE+EYR++A  + E+ W+  LL EL +  S  PVI+CDN+ A 
Subjt:  FGLLFQHSFDLSISAFSDADWASNIDDRKFVSAYCVFIGNNLVSWSSKKQTVVAHSSTESEYRALALAAPEVIWLKQLLNELDVSTSLKPVIWCDNLSAG

Query:  ALATNPVFHTRTKHIEIDVDFIRDQVLKGALDVRYVPSTDQLANCLTKPLTHSQFRNLRSKLGVVISPPT
         L  NPVFH+R KHI +D  FIR+QV  GAL V +V + DQLA+ LTKPL+   F+N   K+GV+  PP+
Subjt:  ALATNPVFHTRTKHIEIDVDFIRDQVLKGALDVRYVPSTDQLANCLTKPLTHSQFRNLRSKLGVVISPPT

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 89.3e-10442.53Show/hide
Query:  CISKSFTDWTLTEPTRIKEALITLQWKKAMDAEYCALLATNTWSLVPPSSSQNIVGSKWIFKLKRNSDGSIQRYKARLVAKGFHQHPGVDFFETFSPVVK
        CI+K+       EP+   EA   L W  AMD E  A+  T+TW +     ++  +G KW++K+K NSDG+I+RYKARLVAKG+ Q  G+DF ETFSPV K
Subjt:  CISKSFTDWTLTEPTRIKEALITLQWKKAMDAEYCALLATNTWSLVPPSSSQNIVGSKWIFKLKRNSDGSIQRYKARLVAKGFHQHPGVDFFETFSPVVK

Query:  ASTLWVVLSIGLARGWTLRQLDFNNAFLNGQLEESVYITQPPEYVG----SSHSHYVCKLNKAIYGLKQAPRAWNNTLSKALLNWGFVNSKSDSSLFILR
         +++ ++L+I     +TL QLD +NAFLNG L+E +Y+  PP Y      S   + VC L K+IYGLKQA R W    S  L+ +GFV S SD + F+  
Subjt:  ASTLWVVLSIGLARGWTLRQLDFNNAFLNGQLEESVYITQPPEYVG----SSHSHYVCKLNKAIYGLKQAPRAWNNTLSKALLNWGFVNSKSDSSLFILR

Query:  CQHSIILLLAYVDDAIITGNDPSLIQSLVTSLDKQFALKDLGALSYFLSFQVKYLDCGFILSEEKYVDDLLHRLQMTDLKAAPSPSVVGKTLSALDSKLL
             + +L YVDD II  N+ + +  L + L   F L+DLG L YFL  ++     G  + + KY  DLL    +   K +  P     T SA      
Subjt:  CQHSIILLLAYVDDAIITGNDPSLIQSLVTSLDKQFALKDLGALSYFLSFQVKYLDCGFILSEEKYVDDLLHRLQMTDLKAAPSPSVVGKTLSALDSKLL

Query:  DDPSLYRSTIGALQYLTNTRPDIAYMVNHLSQFLQRPTDLHWQAVKRILRYISGTKQFGLLFQHSFDLSISAFSDADWASNIDDRKFVSAYCVFIGNNLV
         D   YR  IG L YL  TR DI++ VN LSQF + P   H QAV +IL YI GT   GL +    ++ +  FSDA + S  D R+  + YC+F+G +L+
Subjt:  DDPSLYRSTIGALQYLTNTRPDIAYMVNHLSQFLQRPTDLHWQAVKRILRYISGTKQFGLLFQHSFDLSISAFSDADWASNIDDRKFVSAYCVFIGNNLV

Query:  SWSSKKQTVVAHSSTESEYRALALAAPEVIWLKQLLNELDVSTSLKPVIWCDNLSAGALATNPVFHTRTKHIEIDVDFIRDQ
        SW SKKQ VV+ SS E+EYRAL+ A  E++WL Q   EL +  S   +++CDN +A  +ATN VFH RTKHIE D   +R++
Subjt:  SWSSKKQTVVAHSSTESEYRALALAAPEVIWLKQLLNELDVSTSLKPVIWCDNLSAGALATNPVFHTRTKHIEIDVDFIRDQ

ATMG00240.1 Gag-Pol-related retrotransposon family protein4.4e-1343.21Show/hide
Query:  YLTNTRPDIAYMVNHLSQFLQRPTDLHWQAVKRILRYISGTKQFGLLFQHSFDLSISAFSDADWASNIDDRKFVSAYCVFI
        YLT TRPD+ + VN LSQF         QAV ++L Y+ GT   GL +  + DL + AF+D+DWAS  D R+ V+ +C  +
Subjt:  YLTNTRPDIAYMVNHLSQFLQRPTDLHWQAVKRILRYISGTKQFGLLFQHSFDLSISAFSDADWASNIDDRKFVSAYCVFI

ATMG00810.1 DNA/RNA polymerases superfamily protein5.5e-4843.3Show/hide
Query:  LLAYVDDAIITGNDPSLIQSLVTSLDKQFALKDLGALSYFLSFQVKYLDCGFILSEEKYVDDLLHRLQMTDLKAAPSPSVVGKTLSALDSKLLDDPSLYR
        LL YVDD ++TG+  +L+  L+  L   F++KDLG + YFL  Q+K    G  LS+ KY + +L+   M D K   +P  + K  S++ +    DPS +R
Subjt:  LLAYVDDAIITGNDPSLIQSLVTSLDKQFALKDLGALSYFLSFQVKYLDCGFILSEEKYVDDLLHRLQMTDLKAAPSPSVVGKTLSALDSKLLDDPSLYR

Query:  STIGALQYLTNTRPDIAYMVNHLSQFLQRPTDLHWQAVKRILRYISGTKQFGLLFQHSFDLSISAFSDADWASNIDDRKFVSAYCVFIGNNLVSWSSKKQ
        S +GALQYLT TRPDI+Y VN + Q +  PT   +  +KR+LRY+ GT   GL    +  L++ AF D+DWA     R+  + +C F+G N++SWS+K+Q
Subjt:  STIGALQYLTNTRPDIAYMVNHLSQFLQRPTDLHWQAVKRILRYISGTKQFGLLFQHSFDLSISAFSDADWASNIDDRKFVSAYCVFIGNNLVSWSSKKQ

Query:  TVVAHSSTESEYRALALAAPEVIW
          V+ SSTE+EYRALAL A E+ W
Subjt:  TVVAHSSTESEYRALALAAPEVIW

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)1.1e-2447.58Show/hide
Query:  MITRAKADIFKPKACISKSFTDWTLTEPTRIKEALITLQWKKAMDAEYCALLATNTWSLVPPSSSQNIVGSKWIFKLKRNSDGSIQRYKARLVAKGFHQH
        M+TR+KA I K     S + T     EP  +  AL    W +AM  E  AL    TW LVPP  +QNI+G KW+FK K +SDG++ R KARLVAKGFHQ 
Subjt:  MITRAKADIFKPKACISKSFTDWTLTEPTRIKEALITLQWKKAMDAEYCALLATNTWSLVPPSSSQNIVGSKWIFKLKRNSDGSIQRYKARLVAKGFHQH

Query:  PGVDFFETFSPVVKASTLWVVLSI
         G+ F ET+SPVV+ +T+  +L++
Subjt:  PGVDFFETFSPVVKASTLWVVLSI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTCATTGACCATCGCCCCTTTCTGCACCGTTCGTCGTTGATTATGTCGTCGCCCACCATCGCGAGCCAATCAGTTAGATTTATATTGAATCCTAACCCTAGCCACGC
CAACTTCTTTTGTCGTCACTGGAAACCTATGTTTTCGTCAGTAATTATGTTAGAGACAACAATTGTGACCAAGAAATCCAGTCAGCATCCTCTGCCTTGTTGGAGTCGAA
TCATCAGGGAAAATGAAAATGTGAATTTACTCTGTATAAAGAGAGTCAATGGCTCAATCAATATAGTGAAGAAAACCAAAGGGAAATTAATACCCTACAATTGTCCTTTC
TCTGAAAAGCTTTTCTTGGTATCTGAGCCACCAATGGCAAACGTCGACCTACCCATTATCTCTGTTAACACTACCCCATTTATCACCATTCCTCCTTCTTTTGCTCCGTC
CTTTTCTAACCCTCCACTCAACCAAGTTCTTAATCAACTCACTACTGTGAAACTTGATCGAAACAACTATCTAGATGGGACAAACCCAGAAGCGTCGTCTTCATCCGCTA
CTGTCGTGATAAACCCCGCCTTTGAACAATGGGTCACCATTGACCTACTGTTGCTTGGATGGCTCTATAATTCAATGACACCATACGTAGCTATCCAACTGATGGGTTTT
ACAAATGCCAAAGACCTAGCTGAAGAAGATTTTCTTCGTCAAGTGTTTCAAACCACAAGAAAAGGTAACTCTAAAATGGAGGATTATTTGCTATTAATGAAAACTCACGC
TGATAATCTAGGTCAGGCCGGAAGTCCTATTCCTACATGTGCTCTTATCTCTCAGCTTGGAAAATCTCACAATCTTCCATTTCCTAATTCTCAATCTCATGCTAAAGAAC
CCTTTGCTATTATATATTCTGACTTATGGGGTCCAGCTCCTTCTTGCTCCAATGATTACTTCAGATATTATATACTGTTTCTGGATGATTACAGCATATATACATGGATT
TATCCTCTTAAACAAAAAAGTTCTGCAGTTGAAGCTTTTCAGCACTTCGTCATATATGTCAAGAACCAGTTTAATAATACCATTAAGGTGTTTCAATCTGACAATGGAGG
AGAATACAAAAAAATACAACACTTATGTCTAAATCTAGGGATCAATTGTCGGTTCTCTTGTCCCTATACCTCAGCTCAGAATGGTAGAGCAGAAAGGAAACATCGACATA
TAGTTGAAACAGGGCTTACGCTTCTTGCTCAAGCCAACATGACCCTGAATTATTGGTGGGATGCCTTCTTAACCTTTGTCATTTTAATAAATGGAATGCACACACCTACA
CTTCAAGGACTTTCTCCAATTGAACGTCTATTTAATCAAAAATTAAAAGTTGAAAATCTGAAGATTTTTGGCTGTGTCTGCTTCCCATGTTTAAGACCGTACCAACCTAC
AAAATTCAGTTATCATTCTGAAAAATGTGTTTATCTTGGCCCAAGTCCCACTCACAAGGGATTCAAATGCTTGTCTAAAACTGGGAGAATCTTTATTTCTAGACATGTTA
AGTTCAATGAAAATGATTTCCCGTTTTCAGATTTATTTTACCCAGCCCAGTCTACTTGTTCAACTCAATCGGCCTCTCCATCTCTTGCCTTTTTCAAAAGTTGGCCTTCT
CCCAACCAAACCCAATCCAATATGGCTCCAAACCCACAAGGTCCACAATTAACTTCCACCACACAAATAACCCTACCCTTTCCTTTTCCAATACCACCCATGTCCTCAAT
ACCTTCAAGCCCAATTAACATCACGCCAAATAACCCACCTTCAGTCCACTCCACCGCCAACCCAACAAATTCTAACCCTAATTTGCCACATAATCCCTTGTCTCCTTCTA
CCACAATTACTCCACGTGAAAATTCTCCCTACTCATCATCTTCTCCTCCATCTCCCTTGTCTCTTGGTCCAGCCTCCATTGATCATACTGTCCAACCATCCAATTCTCAT
ATACCGACTCATTCCATGATTACTCGAGCCAAAGCCGACATCTTCAAACCTAAAGCATGTATATCAAAATCTTTCACTGATTGGACACTCACTGAACCAACAAGGATAAA
GGAAGCGTTGATCACTCTTCAATGGAAAAAGGCCATGGATGCAGAATATTGTGCATTGCTTGCCACGAATACTTGGAGCTTGGTACCTCCTTCCTCATCTCAGAATATAG
TCGGTAGCAAATGGATTTTCAAACTCAAGAGAAATTCTGATGGTTCAATTCAGCGGTACAAAGCCCGATTAGTGGCTAAAGGGTTTCACCAACATCCAGGTGTAGATTTC
TTCGAGACGTTTAGTCCTGTTGTCAAGGCCTCCACACTTTGGGTTGTCTTAAGCATTGGTTTAGCCCGTGGATGGACACTACGTCAATTAGATTTTAACAATGCCTTTCT
AAATGGACAATTGGAAGAAAGTGTGTACATAACGCAACCACCTGAGTATGTTGGTTCTTCTCACTCTCATTATGTCTGTAAACTTAACAAAGCAATATATGGTCTAAAGC
AAGCTCCACGTGCTTGGAACAATACTCTATCGAAGGCATTACTTAACTGGGGTTTTGTCAATTCAAAGTCTGATTCCTCTTTATTCATTCTTCGGTGTCAACACTCTATT
ATTCTCTTACTCGCTTATGTGGATGATGCAATTATTACTGGAAATGACCCTTCCTTGATACAGAGTCTGGTTACCTCTTTAGACAAACAGTTTGCGTTAAAGGACTTAGG
TGCTCTAAGTTACTTTTTGAGTTTCCAGGTGAAGTATTTAGACTGTGGATTTATCCTTTCTGAAGAAAAGTATGTTGATGACCTTCTACACAGACTTCAAATGACTGATT
TAAAAGCTGCTCCCTCACCTAGTGTAGTCGGGAAGACTTTATCGGCTCTTGACAGTAAGCTGCTTGATGATCCTTCTCTATATCGAAGCACTATTGGAGCACTTCAGTAT
CTCACAAACACAAGACCGGATATTGCGTATATGGTTAATCATTTAAGCCAGTTTCTTCAACGGCCAACTGATCTTCATTGGCAAGCTGTTAAAAGAATTCTTCGTTATAT
CAGTGGCACAAAACAGTTTGGGTTACTATTTCAACACAGCTTCGATCTATCTATTTCTGCCTTCTCTGATGCTGACTGGGCGTCCAATATTGATGATCGAAAATTTGTCT
CTGCCTACTGTGTTTTTATTGGCAACAATTTGGTTTCCTGGTCTTCCAAGAAACAAACGGTTGTTGCTCACTCTAGTACCGAATCCGAGTATCGGGCCTTAGCTTTAGCT
GCCCCTGAAGTCATCTGGCTAAAACAGTTGCTGAACGAACTTGATGTTTCTACTTCCCTCAAACCTGTCATTTGGTGTGATAATTTGAGTGCTGGAGCACTGGCCACCAA
CCCAGTTTTTCACACTCGCACAAAACATATAGAGATCGATGTTGATTTCATTCGGGATCAAGTTCTTAAGGGTGCTCTAGATGTTCGCTATGTTCCTTCCACCGATCAGT
TGGCAAACTGTCTAACCAAACCTCTCACGCACTCTCAATTTCGTAATTTACGTTCCAAACTCGGAGTTGTCATTAGTCCACCCACTCGTTTGAGGGGGGATGTTAGAGAC
AACAATTGTGATCCAGAAATCCAATCAGCATCCTCTGCCTTGTTGGAGTCGAATCATCAGGGAAAATGA
mRNA sequenceShow/hide mRNA sequence
ATGGTCATTGACCATCGCCCCTTTCTGCACCGTTCGTCGTTGATTATGTCGTCGCCCACCATCGCGAGCCAATCAGTTAGATTTATATTGAATCCTAACCCTAGCCACGC
CAACTTCTTTTGTCGTCACTGGAAACCTATGTTTTCGTCAGTAATTATGTTAGAGACAACAATTGTGACCAAGAAATCCAGTCAGCATCCTCTGCCTTGTTGGAGTCGAA
TCATCAGGGAAAATGAAAATGTGAATTTACTCTGTATAAAGAGAGTCAATGGCTCAATCAATATAGTGAAGAAAACCAAAGGGAAATTAATACCCTACAATTGTCCTTTC
TCTGAAAAGCTTTTCTTGGTATCTGAGCCACCAATGGCAAACGTCGACCTACCCATTATCTCTGTTAACACTACCCCATTTATCACCATTCCTCCTTCTTTTGCTCCGTC
CTTTTCTAACCCTCCACTCAACCAAGTTCTTAATCAACTCACTACTGTGAAACTTGATCGAAACAACTATCTAGATGGGACAAACCCAGAAGCGTCGTCTTCATCCGCTA
CTGTCGTGATAAACCCCGCCTTTGAACAATGGGTCACCATTGACCTACTGTTGCTTGGATGGCTCTATAATTCAATGACACCATACGTAGCTATCCAACTGATGGGTTTT
ACAAATGCCAAAGACCTAGCTGAAGAAGATTTTCTTCGTCAAGTGTTTCAAACCACAAGAAAAGGTAACTCTAAAATGGAGGATTATTTGCTATTAATGAAAACTCACGC
TGATAATCTAGGTCAGGCCGGAAGTCCTATTCCTACATGTGCTCTTATCTCTCAGCTTGGAAAATCTCACAATCTTCCATTTCCTAATTCTCAATCTCATGCTAAAGAAC
CCTTTGCTATTATATATTCTGACTTATGGGGTCCAGCTCCTTCTTGCTCCAATGATTACTTCAGATATTATATACTGTTTCTGGATGATTACAGCATATATACATGGATT
TATCCTCTTAAACAAAAAAGTTCTGCAGTTGAAGCTTTTCAGCACTTCGTCATATATGTCAAGAACCAGTTTAATAATACCATTAAGGTGTTTCAATCTGACAATGGAGG
AGAATACAAAAAAATACAACACTTATGTCTAAATCTAGGGATCAATTGTCGGTTCTCTTGTCCCTATACCTCAGCTCAGAATGGTAGAGCAGAAAGGAAACATCGACATA
TAGTTGAAACAGGGCTTACGCTTCTTGCTCAAGCCAACATGACCCTGAATTATTGGTGGGATGCCTTCTTAACCTTTGTCATTTTAATAAATGGAATGCACACACCTACA
CTTCAAGGACTTTCTCCAATTGAACGTCTATTTAATCAAAAATTAAAAGTTGAAAATCTGAAGATTTTTGGCTGTGTCTGCTTCCCATGTTTAAGACCGTACCAACCTAC
AAAATTCAGTTATCATTCTGAAAAATGTGTTTATCTTGGCCCAAGTCCCACTCACAAGGGATTCAAATGCTTGTCTAAAACTGGGAGAATCTTTATTTCTAGACATGTTA
AGTTCAATGAAAATGATTTCCCGTTTTCAGATTTATTTTACCCAGCCCAGTCTACTTGTTCAACTCAATCGGCCTCTCCATCTCTTGCCTTTTTCAAAAGTTGGCCTTCT
CCCAACCAAACCCAATCCAATATGGCTCCAAACCCACAAGGTCCACAATTAACTTCCACCACACAAATAACCCTACCCTTTCCTTTTCCAATACCACCCATGTCCTCAAT
ACCTTCAAGCCCAATTAACATCACGCCAAATAACCCACCTTCAGTCCACTCCACCGCCAACCCAACAAATTCTAACCCTAATTTGCCACATAATCCCTTGTCTCCTTCTA
CCACAATTACTCCACGTGAAAATTCTCCCTACTCATCATCTTCTCCTCCATCTCCCTTGTCTCTTGGTCCAGCCTCCATTGATCATACTGTCCAACCATCCAATTCTCAT
ATACCGACTCATTCCATGATTACTCGAGCCAAAGCCGACATCTTCAAACCTAAAGCATGTATATCAAAATCTTTCACTGATTGGACACTCACTGAACCAACAAGGATAAA
GGAAGCGTTGATCACTCTTCAATGGAAAAAGGCCATGGATGCAGAATATTGTGCATTGCTTGCCACGAATACTTGGAGCTTGGTACCTCCTTCCTCATCTCAGAATATAG
TCGGTAGCAAATGGATTTTCAAACTCAAGAGAAATTCTGATGGTTCAATTCAGCGGTACAAAGCCCGATTAGTGGCTAAAGGGTTTCACCAACATCCAGGTGTAGATTTC
TTCGAGACGTTTAGTCCTGTTGTCAAGGCCTCCACACTTTGGGTTGTCTTAAGCATTGGTTTAGCCCGTGGATGGACACTACGTCAATTAGATTTTAACAATGCCTTTCT
AAATGGACAATTGGAAGAAAGTGTGTACATAACGCAACCACCTGAGTATGTTGGTTCTTCTCACTCTCATTATGTCTGTAAACTTAACAAAGCAATATATGGTCTAAAGC
AAGCTCCACGTGCTTGGAACAATACTCTATCGAAGGCATTACTTAACTGGGGTTTTGTCAATTCAAAGTCTGATTCCTCTTTATTCATTCTTCGGTGTCAACACTCTATT
ATTCTCTTACTCGCTTATGTGGATGATGCAATTATTACTGGAAATGACCCTTCCTTGATACAGAGTCTGGTTACCTCTTTAGACAAACAGTTTGCGTTAAAGGACTTAGG
TGCTCTAAGTTACTTTTTGAGTTTCCAGGTGAAGTATTTAGACTGTGGATTTATCCTTTCTGAAGAAAAGTATGTTGATGACCTTCTACACAGACTTCAAATGACTGATT
TAAAAGCTGCTCCCTCACCTAGTGTAGTCGGGAAGACTTTATCGGCTCTTGACAGTAAGCTGCTTGATGATCCTTCTCTATATCGAAGCACTATTGGAGCACTTCAGTAT
CTCACAAACACAAGACCGGATATTGCGTATATGGTTAATCATTTAAGCCAGTTTCTTCAACGGCCAACTGATCTTCATTGGCAAGCTGTTAAAAGAATTCTTCGTTATAT
CAGTGGCACAAAACAGTTTGGGTTACTATTTCAACACAGCTTCGATCTATCTATTTCTGCCTTCTCTGATGCTGACTGGGCGTCCAATATTGATGATCGAAAATTTGTCT
CTGCCTACTGTGTTTTTATTGGCAACAATTTGGTTTCCTGGTCTTCCAAGAAACAAACGGTTGTTGCTCACTCTAGTACCGAATCCGAGTATCGGGCCTTAGCTTTAGCT
GCCCCTGAAGTCATCTGGCTAAAACAGTTGCTGAACGAACTTGATGTTTCTACTTCCCTCAAACCTGTCATTTGGTGTGATAATTTGAGTGCTGGAGCACTGGCCACCAA
CCCAGTTTTTCACACTCGCACAAAACATATAGAGATCGATGTTGATTTCATTCGGGATCAAGTTCTTAAGGGTGCTCTAGATGTTCGCTATGTTCCTTCCACCGATCAGT
TGGCAAACTGTCTAACCAAACCTCTCACGCACTCTCAATTTCGTAATTTACGTTCCAAACTCGGAGTTGTCATTAGTCCACCCACTCGTTTGAGGGGGGATGTTAGAGAC
AACAATTGTGATCCAGAAATCCAATCAGCATCCTCTGCCTTGTTGGAGTCGAATCATCAGGGAAAATGAAAATGCGTAGGTATATGTGAGAAATGTAAAATGTAATATGG
TAAAGAAAAGTTGTTATTTCTCTTCCGGTGAAACTCTACCATCTGTAATCTGCAATCTATTTACTCTGTATAAAGAGAGCCAATAGCTCAATCAATATAGTGAAGAAATC
GAAAGGGA
Protein sequenceShow/hide protein sequence
MVIDHRPFLHRSSLIMSSPTIASQSVRFILNPNPSHANFFCRHWKPMFSSVIMLETTIVTKKSSQHPLPCWSRIIRENENVNLLCIKRVNGSINIVKKTKGKLIPYNCPF
SEKLFLVSEPPMANVDLPIISVNTTPFITIPPSFAPSFSNPPLNQVLNQLTTVKLDRNNYLDGTNPEASSSSATVVINPAFEQWVTIDLLLLGWLYNSMTPYVAIQLMGF
TNAKDLAEEDFLRQVFQTTRKGNSKMEDYLLLMKTHADNLGQAGSPIPTCALISQLGKSHNLPFPNSQSHAKEPFAIIYSDLWGPAPSCSNDYFRYYILFLDDYSIYTWI
YPLKQKSSAVEAFQHFVIYVKNQFNNTIKVFQSDNGGEYKKIQHLCLNLGINCRFSCPYTSAQNGRAERKHRHIVETGLTLLAQANMTLNYWWDAFLTFVILINGMHTPT
LQGLSPIERLFNQKLKVENLKIFGCVCFPCLRPYQPTKFSYHSEKCVYLGPSPTHKGFKCLSKTGRIFISRHVKFNENDFPFSDLFYPAQSTCSTQSASPSLAFFKSWPS
PNQTQSNMAPNPQGPQLTSTTQITLPFPFPIPPMSSIPSSPINITPNNPPSVHSTANPTNSNPNLPHNPLSPSTTITPRENSPYSSSSPPSPLSLGPASIDHTVQPSNSH
IPTHSMITRAKADIFKPKACISKSFTDWTLTEPTRIKEALITLQWKKAMDAEYCALLATNTWSLVPPSSSQNIVGSKWIFKLKRNSDGSIQRYKARLVAKGFHQHPGVDF
FETFSPVVKASTLWVVLSIGLARGWTLRQLDFNNAFLNGQLEESVYITQPPEYVGSSHSHYVCKLNKAIYGLKQAPRAWNNTLSKALLNWGFVNSKSDSSLFILRCQHSI
ILLLAYVDDAIITGNDPSLIQSLVTSLDKQFALKDLGALSYFLSFQVKYLDCGFILSEEKYVDDLLHRLQMTDLKAAPSPSVVGKTLSALDSKLLDDPSLYRSTIGALQY
LTNTRPDIAYMVNHLSQFLQRPTDLHWQAVKRILRYISGTKQFGLLFQHSFDLSISAFSDADWASNIDDRKFVSAYCVFIGNNLVSWSSKKQTVVAHSSTESEYRALALA
APEVIWLKQLLNELDVSTSLKPVIWCDNLSAGALATNPVFHTRTKHIEIDVDFIRDQVLKGALDVRYVPSTDQLANCLTKPLTHSQFRNLRSKLGVVISPPTRLRGDVRD
NNCDPEIQSASSALLESNHQGK