; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc05G14465 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc05G14465
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationClcChr05:14770015..14773159
RNA-Seq ExpressionClc05G14465
SyntenyClc05G14465
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
GAU17915.1 hypothetical protein TSUD_330400, partial [Trifolium subterraneum]1.2e-11434.11Show/hide
Query:  KLDQEPALLVLHHRNNFLLWQNIALPILRSYKLEGHLTGKDECPEHSIIIPPSEDEPKGLTLPNQEHDIWLAADQLLVGWLYNSMIAEVAFQVTGYDTAK
        K D    + V   R+N+ LW+++ LPI+R  +L+G++ GK ECPE  I    S  +       N E + W A DQ L+GWL NSM   +A Q+   +T+ 
Subjt:  KLDQEPALLVLHHRNNFLLWQNIALPILRSYKLEGHLTGKDECPEHSIIIPPSEDEPKGLTLPNQEHDIWLAADQLLVGWLYNSMIAEVAFQVTGYDTAK

Query:  NLWDALQEYYGLQAASQQDYLKRMLQQTRKGSTKMSEYLALMKGYADNLYLAGSPVSTRDLISYVIAGLDEEYTPILSK---------------------
         LW+      G    SQ  YLK     TRKG  KM +YL  MK  AD L LAG+P+ST DLI   + GLD EY P++ K                     
Subjt:  NLWDALQEYYGLQAASQQDYLKRMLQQTRKGSTKMSEYLALMKGYADNLYLAGSPVSTRDLISYVIAGLDEEYTPILSK---------------------

Query:  ------GLVSIN-QPSVNVATSNSNQGNSSNQSNR-NSKNYR----GRGRGRGGNAPRLICQACGKAGHSAAVCYYRFEKNFNSFQNSGNSSSSHQNKNS
               L ++    + NVA  ++++GN  N +N     N+R    GRGRGR    P   CQ CG+  H A  C+YRF+K +     S ++ SS+ +K  
Subjt:  ------GLVSIN-QPSVNVATSNSNQGNSSNQSNR-NSKNYR----GRGRGRGGNAPRLICQACGKAGHSAAVCYYRFEKNFNSFQNSGNSSSSHQNKNS

Query:  APTALIATPEHLYNSAWYLDSGAPNHLTSDLANLTVKSEYTGNEMITVGSGQQLPIHYIGSTNIKGNARNLVSKSLMHVPMISKNLISISRLTMDNSGIV
        +    +A+   + +  WY DSGA NH+T         +E+ G   + VG+G++L I   G T ++G  ++               L  +S    D+S  V
Subjt:  APTALIATPEHLYNSAWYLDSGAPNHLTSDLANLTVKSEYTGNEMITVGSGQQLPIHYIGSTNIKGNARNLVSKSLMHVPMISKNLISISRLTMDNSGIV

Query:  EFYDSFCIVKDKETGKVLLEGTIKD--GLYQVKTASNKQN-----------------QEVPGSKFAFVAFFKQVENMLHRKIKEVRCDEGGEFKPLIHFA
           +S+         K +LE    D  G   + ++S  +                  ++   +  AF+ F   VEN  +++IK ++CD GGE+K +   A
Subjt:  EFYDSFCIVKDKETGKVLLEGTIKD--GLYQVKTASNKQN-----------------QEVPGSKFAFVAFFKQVENMLHRKIKEVRCDEGGEFKPLIHFA

Query:  SGNEIKIQFTCPCTSDQNGRVERKHWHIVETGLSLLAQAHMPLAYWWEPFHTVVYLINRMPTVALHGKVPFTTLYNKEPDYHVLRTFGSACFPCLRPYQS
            I+ + +CP TS QNGR ERKH HI E GL+LLAQA MPL YWWE F T VYLINR+P+   H + P++ L+ KEPDY+ L+ FG AC+PCL+PY  
Subjt:  SGNEIKIQFTCPCTSDQNGRVERKHWHIVETGLSLLAQAHMPLAYWWEPFHTVVYLINRMPTVALHGKVPFTTLYNKEPDYHVLRTFGSACFPCLRPYQS

Query:  NKFDFHTKKCVFLGYSGNHKGYRCLSPLGRTYTSKHVCFIEQDFLFSTNFLQNPSAINNESPPILSWLPILQSSTTFQHNTSDFSPTSHTELHSNSTSPC
        +K  FHT KCVFLGYS +HKGY+C++  GR + S+HV F E  F F   FL     +   +    S  P+  +  T   +T       +TE  SN  +  
Subjt:  NKFDFHTKKCVFLGYSGNHKGYRCLSPLGRTYTSKHVCFIEQDFLFSTNFLQNPSAINNESPPILSWLPILQSSTTFQHNTSDFSPTSHTELHSNSTSPC

Query:  PTPSNHLTPYETISPSVSSSTSSHVPPSFT----------------DHPTPVTP-PPQPNTHPMEWQQAMAEEFSALIKNNTWDLVPPNPQQHLVGNKWV
                   T+   V +S ++H   + +                +H     P   Q      EW++AM +EF AL+ N TW L+P   Q+ ++ ++WV
Subjt:  PTPSNHLTPYETISPSVSSSTSSHVPPSFT----------------DHPTPVTP-PPQPNTHPMEWQQAMAEEFSALIKNNTWDLVPPNPQQHLVGNKWV

Query:  FKLQRVADKSILRYKARL
        FK++  AD +I R KARL
Subjt:  FKLQRVADKSILRYKARL

GAU19483.1 hypothetical protein TSUD_77270 [Trifolium subterraneum]2.4e-12633.91Show/hide
Query:  KLDQEPALLVLHHRNNFLLWQNIALPILRSYKLEGHLTGKDECPEHSIIIPPSEDEPKGLTLPNQEHDIWLAADQLLVGWLYNSMIAEVAFQVTGYDTAK
        K D   ++ V   RNN+ LW+++ LP++R  KL+G++ G + CPE  I    S D  K     N     W A DQ L+GW+ NSM  E+A Q+   +T+K
Subjt:  KLDQEPALLVLHHRNNFLLWQNIALPILRSYKLEGHLTGKDECPEHSIIIPPSEDEPKGLTLPNQEHDIWLAADQLLVGWLYNSMIAEVAFQVTGYDTAK

Query:  NLWDALQEYYGLQAASQQDYLKRMLQQTRKGSTKMSEYLALMKGYADNLYLAGSPVSTRDLISYVIAGLDEEYTPILSK---------------------
         LWD  Q   G    SQ  YLK      RKG  KM +YL  MK   D L LAG+PVST DLI   + GLD EY P++ K                     
Subjt:  NLWDALQEYYGLQAASQQDYLKRMLQQTRKGSTKMSEYLALMKGYADNLYLAGSPVSTRDLISYVIAGLDEEYTPILSK---------------------

Query:  ------GLVSIN-QPSVNVATSNSNQGNSSNQSNR--NSKNYR-GRGRGRGGNAPRLICQACGKAGHSAAVCYYRFEKNFNSFQNSGNSSSSHQNKNSAP
               L ++    + NVA  + ++G SSN + R  NS+ +R GRGRG+ G  P   CQ CG + H A  C++RF+K +    +  N S+ H +K  + 
Subjt:  ------GLVSIN-QPSVNVATSNSNQGNSSNQSNR--NSKNYR-GRGRGRGGNAPRLICQACGKAGHSAAVCYYRFEKNFNSFQNSGNSSSSHQNKNSAP

Query:  TALIATPEHLYNSAWYLDSGAPNHLTSDLANLTVKSEYTGNEMITVGSGQQLPIHYIGSTNIKGNARNLVSKSLMHVPMISKNLISISRLTMDNSGIVEF
         A +A+   + +  WY DSGA NH+T         +E+ G   + VG+G++L I   GS+ +K    +L    +++VP I+KNL+S+S+L  DN+ +VEF
Subjt:  TALIATPEHLYNSAWYLDSGAPNHLTSDLANLTVKSEYTGNEMITVGSGQQLPIHYIGSTNIKGNARNLVSKSLMHVPMISKNLISISRLTMDNSGIVEF

Query:  YDSFCIVKDKETGKVLLEGTIKDGLYQV--------------------------------------------------------------KTASNKQNQE
         ++ C VKDK TGKV+L+G +KDGLYQ+                                                              K++S+   + 
Subjt:  YDSFCIVKDKETGKVLLEGTIKDGLYQV--------------------------------------------------------------KTASNKQNQE

Query:  V----------------PGSKF-------------------------AFVAFFKQVENMLHRKIKEVRCDEGGEFKPLIHFASGNEIKIQFTCPCTSDQN
        +                 G K+                         AF+ F    EN  +++IK ++CD GGE+KP+   A    I+ + +CP TS QN
Subjt:  V----------------PGSKF-------------------------AFVAFFKQVENMLHRKIKEVRCDEGGEFKPLIHFASGNEIKIQFTCPCTSDQN

Query:  GRVERKHWHIVETGLSLLAQAHMPLAYWWEPFHTVVYLINRMPTVALHGKVPFTTLYNKEPDYHVLRTFGSACFPCLRPYQSNKFDFHTKKCVFLGYSGN
        GR ERKH HI E GL+LLAQA MPL YWWE F T VYLINR+P+     + P++ +  KEPDY +L+TFG AC+PCL+PY  +K  +HT +CVFLGYS +
Subjt:  GRVERKHWHIVETGLSLLAQAHMPLAYWWEPFHTVVYLINRMPTVALHGKVPFTTLYNKEPDYHVLRTFGSACFPCLRPYQSNKFDFHTKKCVFLGYSGN

Query:  HKGYRCLSPLGRTYTSKHVCFIEQDFLFSTNFLQNPSAINNE-SPPILSW-------------LPILQSSTTFQHNTSDFSP-TSHTELHSNSTSPCPTP
        HKGY+CL+  GR + S+HV F E  F F   FL   S +    + P  S+             +PIL++    + NT D     S TE  +N  S   T 
Subjt:  HKGYRCLSPLGRTYTSKHVCFIEQDFLFSTNFLQNPSAINNE-SPPILSW-------------LPILQSSTTFQHNTSDFSP-TSHTELHSNSTSPCPTP

Query:  SNH---LTPYETISPSVSSSTSSH-------------------VPPSFTDHPTPVTPPPQPNTHPMEWQQAMAEEFSALIKNNTWDLVPPNPQQHLVGNK
              +T  +++  +  ++ +SH                   +  ++ D   P     +  + P+ W++AM +EF AL+ N TW LVP   Q+++V +K
Subjt:  SNH---LTPYETISPSVSSSTSSH-------------------VPPSFTDHPTPVTPPPQPNTHPMEWQQAMAEEFSALIKNNTWDLVPPNPQQHLVGNK

Query:  WVFKLQRVADKSILRYKARL
        WVFK +   D S+ R KARL
Subjt:  WVFKLQRVADKSILRYKARL

GAU51268.1 hypothetical protein TSUD_412550 [Trifolium subterraneum]5.6e-12033.15Show/hide
Query:  KLDQEPALLVLHHRNNFLLWQNIALPILRSYKLEGHLTGKDECPEHSIIIPPSEDEPKGLTLPNQEHDIWLAADQLLVGWLYNSMIAEVAFQVTGYDTAK
        K D    + V   R+N+ LW+++ L ++R  KL+G++ G  ECPE  +    S D+ K +   N +   W+A DQ L+GWL NSM  ++A Q+   +T+K
Subjt:  KLDQEPALLVLHHRNNFLLWQNIALPILRSYKLEGHLTGKDECPEHSIIIPPSEDEPKGLTLPNQEHDIWLAADQLLVGWLYNSMIAEVAFQVTGYDTAK

Query:  NLWDALQEYYGLQAASQQDYLKRMLQQTRKGSTKMSEYLALMKGYADNLYLAGSPVSTRDLISYVIAGLDEEYTPILSKGLVSIN---------------
         LWD  Q   G    S+  YLK     TRKG  KM EYL  MK  +D L LAGSP+S  DL+   + GLD EY P++ K    IN               
Subjt:  NLWDALQEYYGLQAASQQDYLKRMLQQTRKGSTKMSEYLALMKGYADNLYLAGSPVSTRDLISYVIAGLDEEYTPILSKGLVSIN---------------

Query:  -------------QPSVNVATSNSNQGNSSN-QSNRNSKNYRGRGRGRG-GNAPRLICQACGKAGHSAAVCYYRFEKNFNSFQNSGNSSSSHQNKNSAPT
                       S N A     +GN  N + N    N+RG   GRG G      CQ C   GH A  C YRF++ +     +G + S+  +K  + +
Subjt:  -------------QPSVNVATSNSNQGNSSN-QSNRNSKNYRGRGRGRG-GNAPRLICQACGKAGHSAAVCYYRFEKNFNSFQNSGNSSSSHQNKNSAPT

Query:  ALIATPEHLYNSAWYLDSGAPNHLTSDLANLTVKSEYTGNEMITVGSGQQLPIHYIGSTNIKGNARNLVSKSLMHVPMISKNLISISRLTMDNSGIVEFY
        A IA+P H  +  WY DSGA NH+T         +E+ G   + VG+G++L I   GST +     NL    +++VP I+KNL+S+S+LT DN+ +VEF 
Subjt:  ALIATPEHLYNSAWYLDSGAPNHLTSDLANLTVKSEYTGNEMITVGSGQQLPIHYIGSTNIKGNARNLVSKSLMHVPMISKNLISISRLTMDNSGIVEFY

Query:  DSFCIVKDKETGKVLLEGTIKDGLYQ------------------------------------VKTASNKQ------------------------------
         + C VKDK TG+ LL+G +KDGLYQ                                    VK + + Q                              
Subjt:  DSFCIVKDKETGKVLLEGTIKDGLYQ------------------------------------VKTASNKQ------------------------------

Query:  -NQEVPG--------------------SKF--------------AFVAFFKQVENMLHRKIKEVRCDEGGEFKPLIHFASGNEIKIQFTCPCTSDQNGRV
         + +V G                    S+F              AF+ F    EN  ++KIK ++CD GGE+K +   +    I+ + +CP TS QNGR 
Subjt:  -NQEVPG--------------------SKF--------------AFVAFFKQVENMLHRKIKEVRCDEGGEFKPLIHFASGNEIKIQFTCPCTSDQNGRV

Query:  ERKHWHIVETGLSLLAQAHMPLAYWWEPFHTVVYLINRMPTVALHGKVPFTTLYNKEPDYHVLRTFGSACFPCLRPYQSNKFDFHTKKCVFLGYSGNHKG
        ERKH H+ E GL+LLAQA MPL YWWE F T VYLINR+P+     + P++ ++ +EPDY+ L+ FG AC+PCL+PY  +K  FHT +CVF+GYS +HKG
Subjt:  ERKHWHIVETGLSLLAQAHMPLAYWWEPFHTVVYLINRMPTVALHGKVPFTTLYNKEPDYHVLRTFGSACFPCLRPYQSNKFDFHTKKCVFLGYSGNHKG

Query:  YRCLSPLGRTYTSKHVCFIEQDFLFSTNFL--QNPSAINNESPPILSWLPILQSSTTFQHNTS-DFSPTSHTELHSNSTSPCPTPSNHLTPYETISPSVS
        Y+C++  GR + S+HV F E  F F   FL  +NP     ++  IL  LP   +  T Q     D + TS    HS  +S        +   E    + +
Subjt:  YRCLSPLGRTYTSKHVCFIEQDFLFSTNFL--QNPSAINNESPPILSWLPILQSSTTFQHNTS-DFSPTSHTELHSNSTSPCPTPSNHLTPYETISPSVS

Query:  SSTSSHVPPSFTD----HPTPVTPPPQ-------PNTHPME--------------------------------------WQQAMAEEFSALIKNNTWDLV
        SST      +  D    + + +T   Q        NTH M                                       W++AM +E+ AL+ N+TW LV
Subjt:  SSTSSHVPPSFTD----HPTPVTPPPQ-------PNTHPME--------------------------------------WQQAMAEEFSALIKNNTWDLV

Query:  PPNPQQHLVGNKWVFKLQRVADKSILRYKARL
        P   Q++++ +KW+FK +  +D SI R KARL
Subjt:  PPNPQQHLVGNKWVFKLQRVADKSILRYKARL

KYP50444.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan]7.6e-11734.68Show/hide
Query:  MIAEVAFQVTGYDTAKNLWDALQEYYGLQAASQQDYLKRMLQQTRKGSTKMSEYLALMKGYADNLYLAGSPVSTRDLISYVIAGLDEEYTPIL-------
        M  EVA Q+   +T++ +W+  Q   G    S+  +LK    +TRKG  KM EYL  MK  AD+L LAGS VST DL++  +AGLD EY PI+       
Subjt:  MIAEVAFQVTGYDTAKNLWDALQEYYGLQAASQQDYLKRMLQQTRKGSTKMSEYLALMKGYADNLYLAGSPVSTRDLISYVIAGLDEEYTPIL-------

Query:  --------------SKGLVSIN-------QPSVNVAT--------SNSNQGNSSNQSNRNSKNYRGRGRGRGGNAPRLICQACGKAGHSAAVCYYRFEKN
                         L  IN        PS N++T        SN+  G    Q NR ++  RGRGR       R++CQ C K GH+A+ CY+RF KN
Subjt:  --------------SKGLVSIN-------QPSVNVAT--------SNSNQGNSSNQSNRNSKNYRGRGRGRGGNAPRLICQACGKAGHSAAVCYYRFEKN

Query:  FNSFQNSGNSSSSHQNKNSAPTALIATPEHLYNSAWYLDSGAPNHLTSDLANLTVKSEYTGNEMITVGSGQQLPIHYIGSTNIKGNARNLVSKSLMHVPM
        +    +    S   + +N    A +A+P  + +  WY DSGA NH+T D   +   +E  G   +TVG+G  L I   G +++    ++L  K +++VP 
Subjt:  FNSFQNSGNSSSSHQNKNSAPTALIATPEHLYNSAWYLDSGAPNHLTSDLANLTVKSEYTGNEMITVGSGQQLPIHYIGSTNIKGNARNLVSKSLMHVPM

Query:  ISKNLISISRLTMDNSGIVEFYDSFCIVKDKETGKVLLEGTIKDGLYQV---KTASNK------------------------------------------
        I+KNL+SIS+LT DN   VEF+D  C VKDK TG++LLEG IKDGLYQ+    T++NK                                          
Subjt:  ISKNLISISRLTMDNSGIVEFYDSFCIVKDKETGKVLLEGTIKDGLYQV---KTASNK------------------------------------------

Query:  ---------------QN----------------------QEVPGSKF-------------------------AFVAFFKQVENMLHRKIKEVRCDEGGEF
                       QN                        V G K+                         AF+ F   VEN  +++IK ++CD GGEF
Subjt:  ---------------QN----------------------QEVPGSKF-------------------------AFVAFFKQVENMLHRKIKEVRCDEGGEF

Query:  KPLIHFASGNEIKIQFTCPCTSDQNGRVERKHWHIVETGLSLLAQAHMPLAYWWEPFHTVVYLINRMPTVALHGKVPFTTLYNKEPDYHVLRTFGSACFP
        K L        I+++ +CP TS QNGR ERKH H+VE+GL+LLAQA MPL YWWE F T V+LINR+PT  +  K P+  L++K PDY  ++TFG AC+P
Subjt:  KPLIHFASGNEIKIQFTCPCTSDQNGRVERKHWHIVETGLSLLAQAHMPLAYWWEPFHTVVYLINRMPTVALHGKVPFTTLYNKEPDYHVLRTFGSACFP

Query:  CLRPYQSNKFDFHTKKCVFLGYSGNHKGYRCLSPLGRTYTSKHVCFIEQDFLFSTNFLQNPSAINNESPPILSWLPILQSSTTFQHNTSDFSPTSHTELH
        CL+PY  +K  FHT KCVFLGYSG+HKGY+CL+  GR + S+HV F E  F F   FL         + P     PI  + +   +            LH
Subjt:  CLRPYQSNKFDFHTKKCVFLGYSGNHKGYRCLSPLGRTYTSKHVCFIEQDFLFSTNFLQNPSAINNESPPILSWLPILQSSTTFQHNTSDFSPTSHTELH

Query:  SNSTSPCPTPSNHLTP----YETISPSVSSST-----------SSHVPPSFTDHPTPVTPPPQPNTHPM-------------------EWQQAMAEEFSA
        +N+ S   T S H         TI  ++S +T           S +     T     +  P +P    +                   EW++AM  EF A
Subjt:  SNSTSPCPTPSNHLTP----YETISPSVSSST-----------SSHVPPSFTDHPTPVTPPPQPNTHPM-------------------EWQQAMAEEFSA

Query:  LIKNNTWDLVPPNPQQHLVGNKWVFKLQRVADKSILRYKARL
        L+ N TW LVP   Q++++  KWVFK +  AD +I R KARL
Subjt:  LIKNNTWDLVPPNPQQHLVGNKWVFKLQRVADKSILRYKARL

PNX94503.1 putative retrotransposon Ty1-copia subclass protein, partial [Trifolium pratense]1.3e-12934.33Show/hide
Query:  KLDQEPALLVLHHRNNFLLWQNIALPILRSYKLEGHLTGKDECPEHSIIIPPSEDEPKGLTLPNQEHDIWLAADQLLVGWLYNSMIAEVAFQVTGYDTAK
        K D    + V   R+NF LW+++ LP++R  K +G++ G  +CP+  +    S D  + +   N ++  W A DQ L+GWL NSM  ++A QV   +T+K
Subjt:  KLDQEPALLVLHHRNNFLLWQNIALPILRSYKLEGHLTGKDECPEHSIIIPPSEDEPKGLTLPNQEHDIWLAADQLLVGWLYNSMIAEVAFQVTGYDTAK

Query:  NLWDALQEYYGLQAASQQDYLKRMLQQTRKGSTKMSEYLALMKGYADNLYLAGSPVSTRDLISYVIAGLDEEYTPILSK---------------------
         LWD  Q   G    S+  YLK     T K   KM +YLA MK  AD L LAGSP+S+ DL+   + GLD EY P++ K                     
Subjt:  NLWDALQEYYGLQAASQQDYLKRMLQQTRKGSTKMSEYLALMKGYADNLYLAGSPVSTRDLISYVIAGLDEEYTPILSK---------------------

Query:  ------GLVSIN-QPSVNVATSNSNQGNS----SNQSNRNSKNYR-GRGRGRGGNAPRLICQACGKAGHSAAVCYYRFEKNFNSFQNSGNSSSSHQNKNS
                 +IN   S N A+ N + GN           NS+  R GRGR R    PR ICQ CGK GH+AA CYYRF+K++    +      SH     
Subjt:  ------GLVSIN-QPSVNVATSNSNQGNS----SNQSNRNSKNYR-GRGRGRGGNAPRLICQACGKAGHSAAVCYYRFEKNFNSFQNSGNSSSSHQNKNS

Query:  APTALIATPEHLYNSAWYLDSGAPNHLTSDLANLTVKSEYTGNEMITVGSGQQLPIHYIGSTNIKGNARNLVSKSLMHVPMISKNLISISRLTMDNSGIV
          +A +A+P H  +  WY DSGA NH+T     L   +E  G   + VG+G++L I   GST +  N  NL  +++++VP I+KNL+S+S+LT+DN+ +V
Subjt:  APTALIATPEHLYNSAWYLDSGAPNHLTSDLANLTVKSEYTGNEMITVGSGQQLPIHYIGSTNIKGNARNLVSKSLMHVPMISKNLISISRLTMDNSGIV

Query:  EFYDSFCIVKDKETGKVLLEGTIKDGLYQVKT--------------------------ASNKQNQEV---------PGSKF-------------------
        EF +++C VKDK TGK LL+G +KDGLYQ+                             +NK  ++V         P  KF                   
Subjt:  EFYDSFCIVKDKETGKVLLEGTIKDGLYQVKT--------------------------ASNKQNQEV---------PGSKF-------------------

Query:  ------------------------------------------------------AFVAFFKQVENMLHRKIKEVRCDEGGEFKPLIHFASGNEIKIQFTC
                                                              AF  F   VEN  ++KIK +RCD GGE+KP+   A  + I+ Q +C
Subjt:  ------------------------------------------------------AFVAFFKQVENMLHRKIKEVRCDEGGEFKPLIHFASGNEIKIQFTC

Query:  PCTSDQNGRVERKHWHIVETGLSLLAQAHMPLAYWWEPFHTVVYLINRMPTVALHGKVPFTTLYNKEPDYHVLRTFGSACFPCLRPYQSNKFDFHTKKCV
        P TS QNGR ERKH H+ E GL+LLAQA MPL+YWWE F T VYLINR+P+     + P+T ++ KEPDY  L+ FG AC+PCL+PY  +K  FHT +CV
Subjt:  PCTSDQNGRVERKHWHIVETGLSLLAQAHMPLAYWWEPFHTVVYLINRMPTVALHGKVPFTTLYNKEPDYHVLRTFGSACFPCLRPYQSNKFDFHTKKCV

Query:  FLGYSGNHKGYRCLSPLGRTYTSKHVCFIEQDFLFSTNFL--QNP-SAINNESPPILSWLPILQSSTTFQHNTSDFSPTSHTELHSNST----SPCPTPS
        FLGYS +HKGY+C++  GR + S+HV F E  F F   FL  +NP   + N++P      P   ++      T +       EL+  +T    S      
Subjt:  FLGYSGNHKGYRCLSPLGRTYTSKHVCFIEQDFLFSTNFL--QNP-SAINNESPPILSWLPILQSSTTFQHNTSDFSPTSHTELHSNST----SPCPTPS

Query:  NHLTPYETISPSVSSSTSSHVPPSFTDHPTPVT---PPPQ---PNTHPM--------------------------------------EWQQAMAEEFSAL
         H       +     ST +    S  +   P+T   PPPQ    NTH M                                      EW  AM  E+ AL
Subjt:  NHLTPYETISPSVSSSTSSHVPPSFTDHPTPVT---PPPQ---PNTHPM--------------------------------------EWQQAMAEEFSAL

Query:  IKNNTWDLVPPNPQQHLVGNKWVFKLQRVADKSILRYKARL
        + N TW LVP   Q++++ +KW+FK +  AD +I R KARL
Subjt:  IKNNTWDLVPPNPQQHLVGNKWVFKLQRVADKSILRYKARL

TrEMBL top hitse value%identityAlignment
A0A2K3MUJ9 Putative retrotransposon Ty1-copia subclass protein (Fragment)6.5e-13034.33Show/hide
Query:  KLDQEPALLVLHHRNNFLLWQNIALPILRSYKLEGHLTGKDECPEHSIIIPPSEDEPKGLTLPNQEHDIWLAADQLLVGWLYNSMIAEVAFQVTGYDTAK
        K D    + V   R+NF LW+++ LP++R  K +G++ G  +CP+  +    S D  + +   N ++  W A DQ L+GWL NSM  ++A QV   +T+K
Subjt:  KLDQEPALLVLHHRNNFLLWQNIALPILRSYKLEGHLTGKDECPEHSIIIPPSEDEPKGLTLPNQEHDIWLAADQLLVGWLYNSMIAEVAFQVTGYDTAK

Query:  NLWDALQEYYGLQAASQQDYLKRMLQQTRKGSTKMSEYLALMKGYADNLYLAGSPVSTRDLISYVIAGLDEEYTPILSK---------------------
         LWD  Q   G    S+  YLK     T K   KM +YLA MK  AD L LAGSP+S+ DL+   + GLD EY P++ K                     
Subjt:  NLWDALQEYYGLQAASQQDYLKRMLQQTRKGSTKMSEYLALMKGYADNLYLAGSPVSTRDLISYVIAGLDEEYTPILSK---------------------

Query:  ------GLVSIN-QPSVNVATSNSNQGNS----SNQSNRNSKNYR-GRGRGRGGNAPRLICQACGKAGHSAAVCYYRFEKNFNSFQNSGNSSSSHQNKNS
                 +IN   S N A+ N + GN           NS+  R GRGR R    PR ICQ CGK GH+AA CYYRF+K++    +      SH     
Subjt:  ------GLVSIN-QPSVNVATSNSNQGNS----SNQSNRNSKNYR-GRGRGRGGNAPRLICQACGKAGHSAAVCYYRFEKNFNSFQNSGNSSSSHQNKNS

Query:  APTALIATPEHLYNSAWYLDSGAPNHLTSDLANLTVKSEYTGNEMITVGSGQQLPIHYIGSTNIKGNARNLVSKSLMHVPMISKNLISISRLTMDNSGIV
          +A +A+P H  +  WY DSGA NH+T     L   +E  G   + VG+G++L I   GST +  N  NL  +++++VP I+KNL+S+S+LT+DN+ +V
Subjt:  APTALIATPEHLYNSAWYLDSGAPNHLTSDLANLTVKSEYTGNEMITVGSGQQLPIHYIGSTNIKGNARNLVSKSLMHVPMISKNLISISRLTMDNSGIV

Query:  EFYDSFCIVKDKETGKVLLEGTIKDGLYQVKT--------------------------ASNKQNQEV---------PGSKF-------------------
        EF +++C VKDK TGK LL+G +KDGLYQ+                             +NK  ++V         P  KF                   
Subjt:  EFYDSFCIVKDKETGKVLLEGTIKDGLYQVKT--------------------------ASNKQNQEV---------PGSKF-------------------

Query:  ------------------------------------------------------AFVAFFKQVENMLHRKIKEVRCDEGGEFKPLIHFASGNEIKIQFTC
                                                              AF  F   VEN  ++KIK +RCD GGE+KP+   A  + I+ Q +C
Subjt:  ------------------------------------------------------AFVAFFKQVENMLHRKIKEVRCDEGGEFKPLIHFASGNEIKIQFTC

Query:  PCTSDQNGRVERKHWHIVETGLSLLAQAHMPLAYWWEPFHTVVYLINRMPTVALHGKVPFTTLYNKEPDYHVLRTFGSACFPCLRPYQSNKFDFHTKKCV
        P TS QNGR ERKH H+ E GL+LLAQA MPL+YWWE F T VYLINR+P+     + P+T ++ KEPDY  L+ FG AC+PCL+PY  +K  FHT +CV
Subjt:  PCTSDQNGRVERKHWHIVETGLSLLAQAHMPLAYWWEPFHTVVYLINRMPTVALHGKVPFTTLYNKEPDYHVLRTFGSACFPCLRPYQSNKFDFHTKKCV

Query:  FLGYSGNHKGYRCLSPLGRTYTSKHVCFIEQDFLFSTNFL--QNP-SAINNESPPILSWLPILQSSTTFQHNTSDFSPTSHTELHSNST----SPCPTPS
        FLGYS +HKGY+C++  GR + S+HV F E  F F   FL  +NP   + N++P      P   ++      T +       EL+  +T    S      
Subjt:  FLGYSGNHKGYRCLSPLGRTYTSKHVCFIEQDFLFSTNFL--QNP-SAINNESPPILSWLPILQSSTTFQHNTSDFSPTSHTELHSNST----SPCPTPS

Query:  NHLTPYETISPSVSSSTSSHVPPSFTDHPTPVT---PPPQ---PNTHPM--------------------------------------EWQQAMAEEFSAL
         H       +     ST +    S  +   P+T   PPPQ    NTH M                                      EW  AM  E+ AL
Subjt:  NHLTPYETISPSVSSSTSSHVPPSFTDHPTPVT---PPPQ---PNTHPM--------------------------------------EWQQAMAEEFSAL

Query:  IKNNTWDLVPPNPQQHLVGNKWVFKLQRVADKSILRYKARL
        + N TW LVP   Q++++ +KW+FK +  AD +I R KARL
Subjt:  IKNNTWDLVPPNPQQHLVGNKWVFKLQRVADKSILRYKARL

A0A2Z6MBG6 Integrase catalytic domain-containing protein1.1e-12633.91Show/hide
Query:  KLDQEPALLVLHHRNNFLLWQNIALPILRSYKLEGHLTGKDECPEHSIIIPPSEDEPKGLTLPNQEHDIWLAADQLLVGWLYNSMIAEVAFQVTGYDTAK
        K D   ++ V   RNN+ LW+++ LP++R  KL+G++ G + CPE  I    S D  K     N     W A DQ L+GW+ NSM  E+A Q+   +T+K
Subjt:  KLDQEPALLVLHHRNNFLLWQNIALPILRSYKLEGHLTGKDECPEHSIIIPPSEDEPKGLTLPNQEHDIWLAADQLLVGWLYNSMIAEVAFQVTGYDTAK

Query:  NLWDALQEYYGLQAASQQDYLKRMLQQTRKGSTKMSEYLALMKGYADNLYLAGSPVSTRDLISYVIAGLDEEYTPILSK---------------------
         LWD  Q   G    SQ  YLK      RKG  KM +YL  MK   D L LAG+PVST DLI   + GLD EY P++ K                     
Subjt:  NLWDALQEYYGLQAASQQDYLKRMLQQTRKGSTKMSEYLALMKGYADNLYLAGSPVSTRDLISYVIAGLDEEYTPILSK---------------------

Query:  ------GLVSIN-QPSVNVATSNSNQGNSSNQSNR--NSKNYR-GRGRGRGGNAPRLICQACGKAGHSAAVCYYRFEKNFNSFQNSGNSSSSHQNKNSAP
               L ++    + NVA  + ++G SSN + R  NS+ +R GRGRG+ G  P   CQ CG + H A  C++RF+K +    +  N S+ H +K  + 
Subjt:  ------GLVSIN-QPSVNVATSNSNQGNSSNQSNR--NSKNYR-GRGRGRGGNAPRLICQACGKAGHSAAVCYYRFEKNFNSFQNSGNSSSSHQNKNSAP

Query:  TALIATPEHLYNSAWYLDSGAPNHLTSDLANLTVKSEYTGNEMITVGSGQQLPIHYIGSTNIKGNARNLVSKSLMHVPMISKNLISISRLTMDNSGIVEF
         A +A+   + +  WY DSGA NH+T         +E+ G   + VG+G++L I   GS+ +K    +L    +++VP I+KNL+S+S+L  DN+ +VEF
Subjt:  TALIATPEHLYNSAWYLDSGAPNHLTSDLANLTVKSEYTGNEMITVGSGQQLPIHYIGSTNIKGNARNLVSKSLMHVPMISKNLISISRLTMDNSGIVEF

Query:  YDSFCIVKDKETGKVLLEGTIKDGLYQV--------------------------------------------------------------KTASNKQNQE
         ++ C VKDK TGKV+L+G +KDGLYQ+                                                              K++S+   + 
Subjt:  YDSFCIVKDKETGKVLLEGTIKDGLYQV--------------------------------------------------------------KTASNKQNQE

Query:  V----------------PGSKF-------------------------AFVAFFKQVENMLHRKIKEVRCDEGGEFKPLIHFASGNEIKIQFTCPCTSDQN
        +                 G K+                         AF+ F    EN  +++IK ++CD GGE+KP+   A    I+ + +CP TS QN
Subjt:  V----------------PGSKF-------------------------AFVAFFKQVENMLHRKIKEVRCDEGGEFKPLIHFASGNEIKIQFTCPCTSDQN

Query:  GRVERKHWHIVETGLSLLAQAHMPLAYWWEPFHTVVYLINRMPTVALHGKVPFTTLYNKEPDYHVLRTFGSACFPCLRPYQSNKFDFHTKKCVFLGYSGN
        GR ERKH HI E GL+LLAQA MPL YWWE F T VYLINR+P+     + P++ +  KEPDY +L+TFG AC+PCL+PY  +K  +HT +CVFLGYS +
Subjt:  GRVERKHWHIVETGLSLLAQAHMPLAYWWEPFHTVVYLINRMPTVALHGKVPFTTLYNKEPDYHVLRTFGSACFPCLRPYQSNKFDFHTKKCVFLGYSGN

Query:  HKGYRCLSPLGRTYTSKHVCFIEQDFLFSTNFLQNPSAINNE-SPPILSW-------------LPILQSSTTFQHNTSDFSP-TSHTELHSNSTSPCPTP
        HKGY+CL+  GR + S+HV F E  F F   FL   S +    + P  S+             +PIL++    + NT D     S TE  +N  S   T 
Subjt:  HKGYRCLSPLGRTYTSKHVCFIEQDFLFSTNFLQNPSAINNE-SPPILSW-------------LPILQSSTTFQHNTSDFSP-TSHTELHSNSTSPCPTP

Query:  SNH---LTPYETISPSVSSSTSSH-------------------VPPSFTDHPTPVTPPPQPNTHPMEWQQAMAEEFSALIKNNTWDLVPPNPQQHLVGNK
              +T  +++  +  ++ +SH                   +  ++ D   P     +  + P+ W++AM +EF AL+ N TW LVP   Q+++V +K
Subjt:  SNH---LTPYETISPSVSSSTSSH-------------------VPPSFTDHPTPVTPPPQPNTHPMEWQQAMAEEFSALIKNNTWDLVPPNPQQHLVGNK

Query:  WVFKLQRVADKSILRYKARL
        WVFK +   D S+ R KARL
Subjt:  WVFKLQRVADKSILRYKARL

A0A803NU85 Uncharacterized protein3.1e-14036.31Show/hide
Query:  SSGSDSTNPNTPVIKSSTDSAYVKKIQICMKLDQEPALLVLHHRNNFLLWQNIALPILRSYKLEGHLTGKDECPEHSIIIPPSEDEPKGLTLP----NQE
        ++G   T P T  ++++  +         +    +P  L L  RNNF LW+ +   I+R ++LEG+L G  + P   +   PSE    G   P    N E
Subjt:  SSGSDSTNPNTPVIKSSTDSAYVKKIQICMKLDQEPALLVLHHRNNFLLWQNIALPILRSYKLEGHLTGKDECPEHSIIIPPSEDEPKGLTLP----NQE

Query:  HDIWLAADQLLVGWLYNSMIAEVAFQVTGYDTAKNLWDALQEYYGLQAASQQDYLKRMLQQTRKGSTKMSEYLALMKGYADNLYLAGSPVSTRDLISYVI
        ++ WL  DQLL+GWL                   +LW AL+E YG  + +  D ++  +Q TRKG+  M++YL   + +AD+L LAG P   + L+S V+
Subjt:  HDIWLAADQLLVGWLYNSMIAEVAFQVTGYDTAKNLWDALQEYYGLQAASQQDYLKRMLQQTRKGSTKMSEYLALMKGYADNLYLAGSPVSTRDLISYVI

Query:  AGLDEEYTPI--------------LSKGLVS-----------------INQPSVNVA--------------------TSNSNQGNSSNQSNRNSKNYR-G
        +GLD EY  I              L   L+S                 +N PS N A                      ++N+GN +N  N    ++R G
Subjt:  AGLDEEYTPI--------------LSKGLVS-----------------INQPSVNVA--------------------TSNSNQGNSSNQSNRNSKNYR-G

Query:  RGRGRGGNAPRLICQACGKAGHSAAVCYYRFEKNFNSFQNSGNSSSSHQNKNSAPTALIATPEHLYNSAWYLDSGAPNHLTSDLANLTVKSEYTGNEMIT
        RGRG  GN  +  CQ CGK GHSAA+CY R++++F      G   ++  N +   +AL+A PE + + +WY DSGA NHLTSD   +  KSEY G E IT
Subjt:  RGRGRGGNAPRLICQACGKAGHSAAVCYYRFEKNFNSFQNSGNSSSSHQNKNSAPTALIATPEHLYNSAWYLDSGAPNHLTSDLANLTVKSEYTGNEMIT

Query:  VGSGQQLPIHYIGSTNIKGNARNLVSKSLMHVPMISKNLISISRLTMDNSGIVEFYDSFCIVKDKETGKVLLEGTIKDGLYQVKTASNKQNQEVPGSKFA
        +G G +LPI ++G+  ++     LV  +++HVP ISKNLIS+S+LT DN+  +EF+   C+VK++ TG+V+L+GT+KDGLYQ+  + +  + +      A
Subjt:  VGSGQQLPIHYIGSTNIKGNARNLVSKSLMHVPMISKNLISISRLTMDNSGIVEFYDSFCIVKDKETGKVLLEGTIKDGLYQVKTASNKQNQEVPGSKFA

Query:  FVAFFK-------------------------------------------------QVENMLHRKIKEVRCDEGGEFKPLIHFASGNEIKIQFTCPCTSDQ
        FV+                                                    Q EN    KIK++R D GGEF+   +    + I    +CP TS+Q
Subjt:  FVAFFK-------------------------------------------------QVENMLHRKIKEVRCDEGGEFKPLIHFASGNEIKIQFTCPCTSDQ

Query:  NGRVERKHWHIVETGLSLLAQAHMPLAYWWEPFHTVVYLINRMPTVALHGKVPFTTLYNKEPDYHVLRTFGSACFPCLRPYQSNKFDFHTKKCVFLGYSG
        NGR ERKH HIVE GL+L+AQA +PL YW + F T VYLINR+PT  L  + P+ TL++K+PDY  L+TFG ACFPCLR Y ++KF FH+ KCV LGYS 
Subjt:  NGRVERKHWHIVETGLSLLAQAHMPLAYWWEPFHTVVYLINRMPTVALHGKVPFTTLYNKEPDYHVLRTFGSACFPCLRPYQSNKFDFHTKKCVFLGYSG

Query:  NHKGYRCLSPLGRTYTSKHVCFIEQDFLFSTNFLQNPSAINNESPPILSWLPILQSSTTFQHNTSDFSPTSHTELHSNSTSPCPTPSNHLTPYETISPSV
        +HKGY+CLSP GR Y S+HV F E +F F   FL N +   +   P    +P L     F  + ++ S +SHT   ++   P P  ++         P V
Subjt:  NHKGYRCLSPLGRTYTSKHVCFIEQDFLFSTNFLQNPSAINNESPPILSWLPILQSSTTFQHNTSDFSPTSHTELHSNSTSPCPTPSNHLTPYETISPSV

Query:  SSSTSSHVPPSFTDHPTPVTPPPQPNTHPMEWQQAMAEEFSALIKNNTWDLVPPNPQQHLVGNKWVFKLQRVADKSILRYKARL
            ++       DH  P++   Q   +   W  AM  E  AL +N TW LVPP+P  ++VGNKWV+K++  AD S  RYKARL
Subjt:  SSSTSSHVPPSFTDHPTPVTPPPQPNTHPMEWQQAMAEEFSALIKNNTWDLVPPNPQQHLVGNKWVFKLQRVADKSILRYKARL

A0A803PYD1 Uncharacterized protein3.3e-12637.23Show/hide
Query:  PSSGSDSTNPNTPVIKSSTDSAYVKKIQICMKLDQEPALLVLHHRNNFLLWQNIALPILRSYKLEGHLTGKDECPEHSIIIPPSEDEPKGLT--LPNQEH
        P   S   +P +   + +T  A          L Q  +L +   RNN+ LW+ +   I+R ++L+G LTG+  CP  S+ +P +++E    T  LPN E+
Subjt:  PSSGSDSTNPNTPVIKSSTDSAYVKKIQICMKLDQEPALLVLHHRNNFLLWQNIALPILRSYKLEGHLTGKDECPEHSIIIPPSEDEPKGLT--LPNQEH

Query:  DIWLAADQLLVGWLYNSMIAEVAFQVTGYDTAKNLWDALQEYYGLQAASQQDYLKRMLQQTRKGSTKMSEYLALMKGYADNLYLAGSPVSTRDLISYVIA
        +  +  DQLL+GWLY SM   +  +V G  +A  LW AL+E YG Q+ +  D L+  LQ TRKG   M+EYL   +  AD L + G P   + L   +++
Subjt:  DIWLAADQLLVGWLYNSMIAEVAFQVTGYDTAKNLWDALQEYYGLQAASQQDYLKRMLQQTRKGSTKMSEYLALMKGYADNLYLAGSPVSTRDLISYVIA

Query:  GLDEEYTPI-------------------------------LSKGLVSINQPSVNVA----------------TSNSNQGNSSNQ-------SNRNSKNYR
        GLD EY  I                               LS     +N PSV +A                + NSN GN+S +       S+R     R
Subjt:  GLDEEYTPI-------------------------------LSKGLVSINQPSVNVA----------------TSNSNQGNSSNQ-------SNRNSKNYR

Query:  GRGRGRGGNAPRLICQACGKAGHSAAVCYYRFEKNFNSFQNSGNSSSSHQNKNSAPTALIATPEHLYNSAWYLDSGAPNHLTSDLANLTVKSEYTGNEMI
        GRGRG  GN+ +  CQ CGK GHSAA+C  RF++++   Q   +  +  Q+K +  + L+ATP+ L + +WY DSGA NHLT D   L  K EY G E +
Subjt:  GRGRGRGGNAPRLICQACGKAGHSAAVCYYRFEKNFNSFQNSGNSSSSHQNKNSAPTALIATPEHLYNSAWYLDSGAPNHLTSDLANLTVKSEYTGNEMI

Query:  TVGSGQQLPIHYIGSTNIK-GNARNLVSKSLMHVPMISKNLISISRLTMDNSGIVEFYDSFCIVKDKETGKVLLEGTIKDGLYQ-----VKTASNKQNQE
         VG G +L I +IGS  +   +++ L+ K L+HVP I+KNLISIS LT DN   VEF+  FC VKD+ TGKV+L+ T+KDGLYQ     V   S   N+ 
Subjt:  TVGSGQQLPIHYIGSTNIK-GNARNLVSKSLMHVPMISKNLISISRLTMDNSGIVEFYDSFCIVKDKETGKVLLEGTIKDGLYQ-----VKTASNKQNQE

Query:  VPGSKFAFVAFFKQVENMLHRKIKEVRCDEGGEFKPLIHFASGNEIKIQFTCPCTSDQNGRVERKHWHIVETGLSLLAQAHMPLAYWWEPFHTVVYLINR
          GS     +F   +++  HR++                  S   +           QN R E KH HIVE GL+LLAQA MPL YW + F T VYLINR
Subjt:  VPGSKFAFVAFFKQVENMLHRKIKEVRCDEGGEFKPLIHFASGNEIKIQFTCPCTSDQNGRVERKHWHIVETGLSLLAQAHMPLAYWWEPFHTVVYLINR

Query:  MPTVALHGKVPFTTLYNKEPDYHVLRTFGSACFPCLRPYQSNKFDFHTKKCVFLGYSGNHKGYRCLSPLGRTYTSKHVCFIEQDFLFSTNFLQN---PSA
        +PTV L GK PF  LY+K PDY  L+ FGS CFP LRPYQ++KF +H+ KC+ LGYS  HKGY+CLSP GR Y S++V F E +F   + F  N      
Subjt:  MPTVALHGKVPFTTLYNKEPDYHVLRTFGSACFPCLRPYQSNKFDFHTKKCVFLGYSGNHKGYRCLSPLGRTYTSKHVCFIEQDFLFSTNFLQN---PSA

Query:  INNESPPILSWL----PIL---QSSTTFQHNTSDFSPTSHTELHSNSTSPCPTPSNHLTPYETISPSVSSSTSSHVPPSFTDHPTPVTPPPQPNTHPM--
        I  ++P   SW     PIL    S T+     +  SPTS  +  S+ +S   +P     P+ T     S S  S   PS   HP PV     P THPM  
Subjt:  INNESPPILSWL----PIL---QSSTTFQHNTSDFSPTSHTELHSNSTSPCPTPSNHLTPYETISPSVSSSTSSHVPPSFTDHPTPVTPPPQPNTHPM--

Query:  -----------------------------------EWQQAMAEEFSALIKNNTWDLVPPNPQQHLVGNKWVFKLQRVADKSILRYKARL
                                            W  AM++EF AL +  TW LVP +   ++VG KW+F+ +  AD S  R KARL
Subjt:  -----------------------------------EWQQAMAEEFSALIKNNTWDLVPPNPQQHLVGNKWVFKLQRVADKSILRYKARL

A0A803QCY3 Uncharacterized protein2.1e-13336.65Show/hide
Query:  SSGSDSTNPNTPVIKSSTDSAYVKKIQICMKLDQEPALLVLHHRNNFLLWQNIALPILRSYKLEGHLTGKDECPEHSIIIPPSEDEPKGLTLPNQEHDIW
        ++G+ STNP   V    +      K    +KLD           NN+ LW+ +   I+R ++L+G L G + CP   +    +ED  K +   N E + W
Subjt:  SSGSDSTNPNTPVIKSSTDSAYVKKIQICMKLDQEPALLVLHHRNNFLLWQNIALPILRSYKLEGHLTGKDECPEHSIIIPPSEDEPKGLTLPNQEHDIW

Query:  LAADQLLVGWLYNSMIAEVAFQVTGYDTAKNLWDALQEYYGLQAASQQDYLKRMLQQTRKGSTKMSEYLALMKGYADNLYLAGSPVSTRDLISYVIAGLD
        +  DQLL+GWLY+SM   +A +V G  +A  LW AL++ YG  + S+ D  + ++Q T+KG T M EYL   K +AD+L LAG P     L + V++ LD
Subjt:  LAADQLLVGWLYNSMIAEVAFQVTGYDTAKNLWDALQEYYGLQAASQQDYLKRMLQQTRKGSTKMSEYLALMKGYADNLYLAGSPVSTRDLISYVIAGLD

Query:  EEYTPI--------------LSKGLVSINQPSVNVATSNSN-QGNSSNQSNRNSKNYRGRGRGRGGNAPRLICQACGKAGHSAAVCYYRFEKNF-NSFQN
          Y  +              L + L+S       +   N+N +G   N+ N      R RGRGR  N+ +  CQ CGK  HSA VCY  F+ ++  S  +
Subjt:  EEYTPI--------------LSKGLVSINQPSVNVATSNSN-QGNSSNQSNRNSKNYRGRGRGRGGNAPRLICQACGKAGHSAAVCYYRFEKNF-NSFQN

Query:  SGNSSSSHQNKNSAPTALIATPEHLYNSAWYLDSGAPNHLTSDLANLTVKSEYTGNEMITVGSGQQLPIHYIGSTNI-KGNARNLVSKSLMHVPMISKNL
        S N + + QN N+ P+A IATPE L + AW+ DSGA N++T+D + +  K EY G E +TVG+G +L I + G+  +     + L    ++ VP+I+KN 
Subjt:  SGNSSSSHQNKNSAPTALIATPEHLYNSAWYLDSGAPNHLTSDLANLTVKSEYTGNEMITVGSGQQLPIHYIGSTNI-KGNARNLVSKSLMHVPMISKNL

Query:  ISISRLTMDNSGIVEFYDSFCIVKDKETGKVLLEGTIKDGLYQVKTASNKQNQ-EVPGSKF---------------------------------------
        +S+S+LT DN  I+EF+ + C VKD  T +VLL+G +KDGLYQ++T  NK        SKF                                       
Subjt:  ISISRLTMDNSGIVEFYDSFCIVKDKETGKVLLEGTIKDGLYQVKTASNKQNQ-EVPGSKF---------------------------------------

Query:  ---------------------------------------------AFVAFFKQVENMLHRKIKEVRCDEGGEFKPLIHFASGNEIKIQFTCPCTSDQNGR
                                                     AF+AF    EN   RKIK +R D GGE++ L  F   + I    +CP TS QNGR
Subjt:  ---------------------------------------------AFVAFFKQVENMLHRKIKEVRCDEGGEFKPLIHFASGNEIKIQFTCPCTSDQNGR

Query:  VERKHWHIVETGLSLLAQAHMPLAYWWEPFHTVVYLINRMPTVALHGKVPFTTLYNKEPDYHVLRTFGSACFPCLRPYQSNKFDFHTKKCVFLGYSGNHK
         ERKH HIVE GL+LLAQ+ MPL YWW+ F T VYLINR+PT  L  K PF  L+ K PDY  L+TFG ACFPCLRPYQ++KF FH+ KCV LGYS  HK
Subjt:  VERKHWHIVETGLSLLAQAHMPLAYWWEPFHTVVYLINRMPTVALHGKVPFTTLYNKEPDYHVLRTFGSACFPCLRPYQSNKFDFHTKKCVFLGYSGNHK

Query:  GYRCLSPLGRTYTSKHVCFIEQDFLFSTNFLQNPSAINNESPPILSWLPILQSSTTFQHNTSDFSPTSHTELHSNSTSPCPTPSNHLTPYETISPSVSSS
        GY+CLSP GR Y  + V F E +F F  +FL N  + N         L I+QSST                    S  P  +P+   + + + +PS SS 
Subjt:  GYRCLSPLGRTYTSKHVCFIEQDFLFSTNFLQNPSAINNESPPILSWLPILQSSTTFQHNTSDFSPTSHTELHSNSTSPCPTPSNHLTPYETISPSVSSS

Query:  TSSHVPPSFTDHPTPVTPPPQPNTHPME---WQQAMAEEFSALIKNNTWDLVPPNPQQHLVGNKWVFKLQRVADKSILRYKARL
         +    P   +   P +  P      +    W +AM EE +AL  N T+ LVPP P Q+L+GNKWVF+ +   D ++ R KARL
Subjt:  TSSHVPPSFTDHPTPVTPPPQPNTHPME---WQQAMAEEFSALIKNNTWDLVPPNPQQHLVGNKWVFKLQRVADKSILRYKARL

SwissProt top hitse value%identityAlignment
P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-944.9e-2629.94Show/hide
Query:  FVAFFKQVENMLHRKIKEVRCDEGGEF--KPLIHFASGNEIKIQFTCPCTSDQNGRVERKHWHIVETGLSLLAQAHMPLAYWWEPFHTVVYLINRMPTVA
        F  F   VE    RK+K +R D GGE+  +    + S + I+ + T P T   NG  ER +  IVE   S+L  A +P ++W E   T  YLINR P+V 
Subjt:  FVAFFKQVENMLHRKIKEVRCDEGGEF--KPLIHFASGNEIKIQFTCPCTSDQNGRVERKHWHIVETGLSLLAQAHMPLAYWWEPFHTVVYLINRMPTVA

Query:  LHGKVPFTTLYNKEPDYHVLRTFGSACFPCLRPYQSNKFDFHTKKCVFLGYSGNHKGYRCLSPL-GRTYTSKHVCFIEQDFLFSTNF---LQNPSAINNE
        L  ++P     NKE  Y  L+ FG   F  +   Q  K D  +  C+F+GY     GYR   P+  +   S+ V F E +   + +    ++N    N  
Subjt:  LHGKVPFTTLYNKEPDYHVLRTFGSACFPCLRPYQSNKFDFHTKKCVFLGYSGNHKGYRCLSPL-GRTYTSKHVCFIEQDFLFSTNF---LQNPSAINNE

Query:  SPPILSWLPILQSSTTFQHNTSDFSP---TSHTELHSNSTSPCPTPSNHLTPYETISPSVSSSTSSHVPPS-----FTDHPTP-----VTPPPQPNTHPM
        + P  S  P    STT + +     P       E           P+     ++ +  S      S   PS      +D   P     V   P+ N    
Subjt:  SPPILSWLPILQSSTTFQHNTSDFSP---TSHTELHSNSTSPCPTPSNHLTPYETISPSVSSSTSSHVPPS-----FTDHPTP-----VTPPPQPNTHPM

Query:  EWQQAMAEEFSALIKNNTWDLVPPNPQQHLVGNKWVFKLQRVADKSILRYKARL
        +  +AM EE  +L KN T+ LV     +  +  KWVFKL++  D  ++RYKARL
Subjt:  EWQQAMAEEFSALIKNNTWDLVPPNPQQHLVGNKWVFKLQRVADKSILRYKARL

P92520 Uncharacterized mitochondrial protein AtMg008201.8e-0752.83Show/hide
Query:  WQQAMAEEFSALIKNNTWDLVPPNPQQHLVGNKWVFKLQRVADKSILRYKARL
        W QAM EE  AL +N TW LVPP   Q+++G KWVFK +  +D ++ R KARL
Subjt:  WQQAMAEEFSALIKNNTWDLVPPNPQQHLVGNKWVFKLQRVADKSILRYKARL

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE12.0e-6728.27Show/hide
Query:  NFLLWQNIALPILRSYKLEGHLTGKDECPEHSIIIPPSEDEPKGLTLPNQEHDIWLAADQLLVGWLYNSMIAEVAFQVTGYDTAKNLWDALQEYYGLQAA
        N+L+W      +   Y+L G L G       S  +PP+          N ++  W   D+L+   +  ++   V   V+   TA  +W+ L++ Y   + 
Subjt:  NFLLWQNIALPILRSYKLEGHLTGKDECPEHSIIIPPSEDEPKGLTLPNQEHDIWLAADQLLVGWLYNSMIAEVAFQVTGYDTAKNLWDALQEYYGLQAA

Query:  SQQDYLKRMLQQTRKGSTKMSEYLALMKGYADNLYLAGSPVSTRDLISYVIAGLDEEYTPILSK--------GLVSINQPSVN-----------------
             L+  L+Q  KG+  + +Y+  +    D L L G P+   + +  V+  L EEY P++ +         L  I++  +N                 
Subjt:  SQQDYLKRMLQQTRKGSTKMSEYLALMKGYADNLYLAGSPVSTRDLISYVIAGLDEEYTPILSK--------GLVSINQPSVN-----------------

Query:  ---------VATSNSNQGNSSNQ-----SNRNSKNYRGRGR----GRGGNAPRL-ICQACGKAGHSAAVCYYRFEKNFNSFQNSGNSSSSHQNKNS-APT
                   T+N+N GN +N+     +N NSK ++            + P L  CQ CG  GHSA  C          F +S NS           P 
Subjt:  ---------VATSNSNQGNSSNQ-----SNRNSKNYRGRGR----GRGGNAPRL-ICQACGKAGHSAAVCYYRFEKNFNSFQNSGNSSSSHQNKNS-APT

Query:  ALIATPEHLYNSAWYLDSGAPNHLTSDLANLTVKSEYTGNEMITVGSGQQLPIHYIGSTNIKGNARNLVSKSLMHVPMISKNLISISRLTMDNSGIVEFY
        A +A      ++ W LDSGA +H+TSD  NL++   YTG + + V  G  +PI + GST++   +R L   ++++VP I KNLIS+ RL   N   VEF+
Subjt:  ALIATPEHLYNSAWYLDSGAPNHLTSDLANLTVKSEYTGNEMITVGSGQQLPIHYIGSTNIKGNARNLVSKSLMHVPMISKNLISISRLTMDNSGIVEFY

Query:  DSFCIVKDKETGKVLLEGTIKDGLYQVKTASNK--------------------------------------------------------QNQEVPGS---
         +   VKD  TG  LL+G  KD LY+   AS++                                                        ++ +VP S   
Subjt:  DSFCIVKDKETGKVLLEGTIKDGLYQVKTASNK--------------------------------------------------------QNQEVPGS---

Query:  --------------------------------------------------KFAFVAFFKQVENMLHRKIKEVRCDEGGEFKPLIHFASGNEIKIQFTCPC
                                                          K  F+ F   +EN    +I     D GGEF  L  + S + I    + P 
Subjt:  --------------------------------------------------KFAFVAFFKQVENMLHRKIKEVRCDEGGEFKPLIHFASGNEIKIQFTCPC

Query:  TSDQNGRVERKHWHIVETGLSLLAQAHMPLAYWWEPFHTVVYLINRMPTVALHGKVPFTTLYNKEPDYHVLRTFGSACFPCLRPYQSNKFDFHTKKCVFL
        T + NG  ERKH HIVETGL+LL+ A +P  YW   F   VYLINR+PT  L  + PF  L+   P+Y  LR FG AC+P LRPY  +K D  +++CVFL
Subjt:  TSDQNGRVERKHWHIVETGLSLLAQAHMPLAYWWEPFHTVVYLINRMPTVALHGKVPFTTLYNKEPDYHVLRTFGSACFPCLRPYQSNKFDFHTKKCVFL

Query:  GYSGNHKGYRCLS-PLGRTYTSKHVCFIEQDFLFSTNFLQNPSAINNE-SPPILSWLPILQSSTTFQHNTSDFSPTSHTELHSNSTSPCPTPSNHLTPYE
        GYS     Y CL     R Y S+HV F E  F FS N+L   S +  +       W P     TT    T      S ++ H  +T P  +PS      +
Subjt:  GYSGNHKGYRCLS-PLGRTYTSKHVCFIEQDFLFSTNFLQNPSAINNE-SPPILSWLPILQSSTTFQHNTSDFSPTSHTELHSNSTSPCPTPSNHLTPYE

Query:  TISPSVSSSTSSHVPPSFTDHPTPVTP---PPQPNTHPMEWQ
          S ++ SS SS    SF   P P  P    PQP T P + Q
Subjt:  TISPSVSSSTSSHVPPSFTDHPTPVTP---PPQPNTHPMEWQ

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE26.1e-6127.3Show/hide
Query:  NFLLWQNIALPILRSYKLEGHLTGKDECPEHSIIIPPSEDEPKGLTLPNQEHDIWLAADQLLVGWLYNSMIAEVAFQVTGYDTAKNLWDALQEYYGLQAA
        N+L+W      +   Y+L G L G       S  +PP+      +   N ++  W   D+L+   +  ++   V   V+   TA  +W+ L++ Y   A 
Subjt:  NFLLWQNIALPILRSYKLEGHLTGKDECPEHSIIIPPSEDEPKGLTLPNQEHDIWLAADQLLVGWLYNSMIAEVAFQVTGYDTAKNLWDALQEYYGLQAA

Query:  SQQDYLKRMLQQTRKGSTKMSEYLALMKGYADNLYLAGSPVSTRDLISYVIAGLDEEYTPILSK----------------------GLVSINQPSV----
            ++ ++   TR                 D L L G P+   + +  V+  L ++Y P++ +                       L+++N   V    
Subjt:  SQQDYLKRMLQQTRKGSTKMSEYLALMKGYADNLYLAGSPVSTRDLISYVIAGLDEEYTPILSK----------------------GLVSINQPSV----

Query:  -NVAT-----SNSNQGNSSNQSNRNSKNYRGR-------GRGRGGNAPRLI---CQACGKAGHSAAVCYYRFEKNFNSFQNSGNSSSSHQNKNS-APTAL
         NV T     +N NQ N  +  N N+ N R         G       P+     CQ C   GHSA  C        + FQ++ N   S        P A 
Subjt:  -NVAT-----SNSNQGNSSNQSNRNSKNYRGR-------GRGRGGNAPRLI---CQACGKAGHSAAVCYYRFEKNFNSFQNSGNSSSSHQNKNS-APTAL

Query:  IATPEHLYNSAWYLDSGAPNHLTSDLANLTVKSEYTGNEMITVGSGQQLPIHYIGSTNIKGNARNLVSKSLMHVPMISKNLISISRLTMDNSGIVEFYDS
        +A       + W LDSGA +H+TSD  NL+    YTG + + +  G  +PI + GS ++  ++R+L    +++VP I KNLIS+ RL   N   VEF+ +
Subjt:  IATPEHLYNSAWYLDSGAPNHLTSDLANLTVKSEYTGNEMITVGSGQQLPIHYIGSTNIKGNARNLVSKSLMHVPMISKNLISISRLTMDNSGIVEFYDS

Query:  FCIVKDKETGKVLLEGTIKDGLYQVKTASNK--------------------------------------------------------QNQEVPGS-----
           VKD  TG  LL+G  KD LY+   AS++                                                        ++ +VP S     
Subjt:  FCIVKDKETGKVLLEGTIKDGLYQVKTASNK--------------------------------------------------------QNQEVPGS-----

Query:  ------------------------------------------------KFAFVAFFKQVENMLHRKIKEVRCDEGGEFKPLIHFASGNEIKIQFTCPCTS
                                                        K  F+ F   VEN    +I  +  D GGEF  L  + S + I    + P T 
Subjt:  ------------------------------------------------KFAFVAFFKQVENMLHRKIKEVRCDEGGEFKPLIHFASGNEIKIQFTCPCTS

Query:  DQNGRVERKHWHIVETGLSLLAQAHMPLAYWWEPFHTVVYLINRMPTVALHGKVPFTTLYNKEPDYHVLRTFGSACFPCLRPYQSNKFDFHTKKCVFLGY
        + NG  ERKH HIVE GL+LL+ A +P  YW   F   VYLINR+PT  L  + PF  L+ + P+Y  L+ FG AC+P LRPY  +K +  +K+C F+GY
Subjt:  DQNGRVERKHWHIVETGLSLLAQAHMPLAYWWEPFHTVVYLINRMPTVALHGKVPFTTLYNKEPDYHVLRTFGSACFPCLRPYQSNKFDFHTKKCVFLGY

Query:  SGNHKGYRCLS-PLGRTYTSKHVCFIEQDFLFS-TNFLQNPSAINNESPPILSWLPILQSSTTFQHNTSDFSPT--SHTELHSN----STSPCPTPSNHL
        S     Y CL  P GR YTS+HV F E+ F FS TNF                        +T Q   SD +P   SHT L +        PC  P    
Subjt:  SGNHKGYRCLS-PLGRTYTSKHVCFIEQDFLFS-TNFLQNPSAINNESPPILSWLPILQSSTTFQHNTSDFSPT--SHTELHSN----STSPCPTPSNHL

Query:  TPYETISPS------VSSS---TSSHVPPSFTDHPTPVTPPPQPNTHPMEWQQAMAEEFSALIKNNTWDLVPPN-PQQH
        +P    SPS      VSSS   +SS   PS ++   P    PQP   P + Q + +   S ++ N   +   PN P Q+
Subjt:  TPYETISPS------VSSS---TSSHVPPSFTDHPTPVTPPPQPNTHPMEWQQAMAEEFSALIKNNTWDLVPPN-PQQH

Arabidopsis top hitse value%identityAlignment
ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)1.2e-0852.83Show/hide
Query:  WQQAMAEEFSALIKNNTWDLVPPNPQQHLVGNKWVFKLQRVADKSILRYKARL
        W QAM EE  AL +N TW LVPP   Q+++G KWVFK +  +D ++ R KARL
Subjt:  WQQAMAEEFSALIKNNTWDLVPPNPQQHLVGNKWVFKLQRVADKSILRYKARL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTAACCCCATAAACGATCCCCTGAATGTTCCCTCTAGTGGTTCTGATTCCACCAATCCAAACACTCCTGTCATCAAGAGTTCCACTGATTCGGCCTATGTTAAAAA
AATCCAGATCTGTATGAAGTTAGATCAGGAACCTGCCCTGTTGGTTCTCCACCACCGCAACAATTTTCTGCTATGGCAAAACATCGCTCTTCCCATTCTTCGTAGCTACA
AACTAGAAGGCCATCTGACAGGTAAGGATGAATGTCCTGAGCACTCTATTATTATTCCTCCTTCAGAAGATGAACCCAAAGGCCTCACTCTTCCAAACCAGGAACACGAT
ATATGGCTAGCTGCTGATCAACTCTTGGTTGGCTGGTTATACAACTCCATGATAGCCGAAGTAGCCTTTCAGGTTACTGGATATGACACGGCTAAAAATTTGTGGGATGC
TCTACAAGAGTACTATGGCCTCCAAGCTGCATCTCAACAAGATTATTTGAAAAGAATGCTCCAGCAGACAAGGAAGGGGAGTACCAAGATGTCTGAATACTTGGCCTTAA
TGAAAGGATATGCTGACAATTTATACTTAGCAGGTTCGCCGGTTAGTACAAGAGATTTAATTTCGTATGTGATTGCAGGGCTTGATGAAGAATACACTCCCATTTTGTCA
AAAGGTTTGGTTTCTATCAACCAACCCTCAGTTAATGTCGCCACCTCCAACTCAAATCAAGGGAATTCTTCAAATCAATCGAATCGGAACTCTAAAAACTACAGAGGCAG
AGGAAGGGGTCGTGGAGGCAATGCTCCTCGTCTAATTTGTCAGGCATGTGGAAAGGCTGGACATTCAGCAGCTGTCTGTTATTATCGATTTGAAAAGAACTTCAATAGTT
TCCAAAATTCTGGTAATTCTTCTTCCTCCCATCAGAACAAGAATTCTGCCCCAACAGCTCTAATAGCCACCCCAGAGCACCTTTATAATTCTGCCTGGTATCTTGACAGT
GGAGCACCTAATCACCTCACTTCAGACCTAGCAAATCTCACAGTAAAATCCGAGTACACAGGTAATGAAATGATTACAGTTGGTAGTGGCCAACAACTTCCTATTCATTA
TATTGGTAGCACTAATATTAAAGGCAATGCTAGAAATCTTGTTTCGAAAAGTCTGATGCATGTTCCTATGATCAGTAAAAATCTGATTAGCATATCTCGTCTTACCATGG
ATAATTCGGGGATCGTTGAATTTTATGATTCATTTTGTATTGTTAAGGACAAGGAAACGGGGAAGGTACTTCTAGAAGGAACAATTAAGGATGGCTTATACCAGGTTAAG
ACAGCCTCAAACAAACAAAATCAAGAAGTCCCTGGATCGAAGTTTGCTTTTGTGGCTTTTTTTAAACAAGTTGAAAATATGCTACACAGGAAAATTAAAGAAGTTCGGTG
TGATGAAGGAGGTGAATTCAAGCCATTAATACACTTTGCTTCTGGAAATGAGATCAAAATACAGTTTACCTGTCCTTGTACTTCAGACCAAAATGGACGTGTTGAAAGGA
AACATTGGCATATAGTGGAAACAGGACTATCTCTTCTTGCTCAAGCTCACATGCCTCTAGCCTACTGGTGGGAACCATTTCATACGGTCGTTTACTTGATTAACCGTATG
CCCACTGTTGCTTTACATGGAAAAGTTCCATTTACCACATTGTACAACAAGGAACCTGATTATCATGTTCTTAGAACATTCGGATCAGCTTGTTTTCCTTGTCTCAGGCC
ATATCAGTCCAACAAGTTTGATTTTCATACTAAAAAATGTGTTTTTCTGGGGTATAGTGGTAACCATAAAGGCTATCGATGTTTGAGTCCCTTAGGTAGAACCTACACTT
CTAAACATGTGTGTTTCATTGAGCAAGATTTCCTTTTCTCCACCAATTTTCTCCAAAACCCATCAGCCATAAACAATGAATCACCTCCTATCTTATCATGGCTACCAATC
CTCCAATCCTCAACCACTTTCCAGCACAACACCTCAGACTTCTCCCCCACTTCGCATACTGAACTCCATTCTAACTCTACCAGTCCTTGTCCAACTCCCTCGAATCATTT
AACCCCATATGAAACCATAAGCCCTTCGGTTTCCAGCTCTACTTCCAGCCATGTCCCTCCCTCGTTTACTGACCACCCAACCCCTGTCACTCCACCACCCCAACCCAACA
CTCACCCTATGGAATGGCAGCAGGCCATGGCTGAGGAATTTTCTGCCCTCATTAAAAATAACACTTGGGACTTGGTTCCCCCTAATCCACAACAACATCTTGTTGGCAAT
AAATGGGTTTTCAAATTACAAAGAGTTGCTGACAAGTCAATTCTTCGGTACAAAGCTCGATTGTCATTTAATCGCCCATCATCTAAAGTGTATAGATGA
mRNA sequenceShow/hide mRNA sequence
ATGGCTAACCCCATAAACGATCCCCTGAATGTTCCCTCTAGTGGTTCTGATTCCACCAATCCAAACACTCCTGTCATCAAGAGTTCCACTGATTCGGCCTATGTTAAAAA
AATCCAGATCTGTATGAAGTTAGATCAGGAACCTGCCCTGTTGGTTCTCCACCACCGCAACAATTTTCTGCTATGGCAAAACATCGCTCTTCCCATTCTTCGTAGCTACA
AACTAGAAGGCCATCTGACAGGTAAGGATGAATGTCCTGAGCACTCTATTATTATTCCTCCTTCAGAAGATGAACCCAAAGGCCTCACTCTTCCAAACCAGGAACACGAT
ATATGGCTAGCTGCTGATCAACTCTTGGTTGGCTGGTTATACAACTCCATGATAGCCGAAGTAGCCTTTCAGGTTACTGGATATGACACGGCTAAAAATTTGTGGGATGC
TCTACAAGAGTACTATGGCCTCCAAGCTGCATCTCAACAAGATTATTTGAAAAGAATGCTCCAGCAGACAAGGAAGGGGAGTACCAAGATGTCTGAATACTTGGCCTTAA
TGAAAGGATATGCTGACAATTTATACTTAGCAGGTTCGCCGGTTAGTACAAGAGATTTAATTTCGTATGTGATTGCAGGGCTTGATGAAGAATACACTCCCATTTTGTCA
AAAGGTTTGGTTTCTATCAACCAACCCTCAGTTAATGTCGCCACCTCCAACTCAAATCAAGGGAATTCTTCAAATCAATCGAATCGGAACTCTAAAAACTACAGAGGCAG
AGGAAGGGGTCGTGGAGGCAATGCTCCTCGTCTAATTTGTCAGGCATGTGGAAAGGCTGGACATTCAGCAGCTGTCTGTTATTATCGATTTGAAAAGAACTTCAATAGTT
TCCAAAATTCTGGTAATTCTTCTTCCTCCCATCAGAACAAGAATTCTGCCCCAACAGCTCTAATAGCCACCCCAGAGCACCTTTATAATTCTGCCTGGTATCTTGACAGT
GGAGCACCTAATCACCTCACTTCAGACCTAGCAAATCTCACAGTAAAATCCGAGTACACAGGTAATGAAATGATTACAGTTGGTAGTGGCCAACAACTTCCTATTCATTA
TATTGGTAGCACTAATATTAAAGGCAATGCTAGAAATCTTGTTTCGAAAAGTCTGATGCATGTTCCTATGATCAGTAAAAATCTGATTAGCATATCTCGTCTTACCATGG
ATAATTCGGGGATCGTTGAATTTTATGATTCATTTTGTATTGTTAAGGACAAGGAAACGGGGAAGGTACTTCTAGAAGGAACAATTAAGGATGGCTTATACCAGGTTAAG
ACAGCCTCAAACAAACAAAATCAAGAAGTCCCTGGATCGAAGTTTGCTTTTGTGGCTTTTTTTAAACAAGTTGAAAATATGCTACACAGGAAAATTAAAGAAGTTCGGTG
TGATGAAGGAGGTGAATTCAAGCCATTAATACACTTTGCTTCTGGAAATGAGATCAAAATACAGTTTACCTGTCCTTGTACTTCAGACCAAAATGGACGTGTTGAAAGGA
AACATTGGCATATAGTGGAAACAGGACTATCTCTTCTTGCTCAAGCTCACATGCCTCTAGCCTACTGGTGGGAACCATTTCATACGGTCGTTTACTTGATTAACCGTATG
CCCACTGTTGCTTTACATGGAAAAGTTCCATTTACCACATTGTACAACAAGGAACCTGATTATCATGTTCTTAGAACATTCGGATCAGCTTGTTTTCCTTGTCTCAGGCC
ATATCAGTCCAACAAGTTTGATTTTCATACTAAAAAATGTGTTTTTCTGGGGTATAGTGGTAACCATAAAGGCTATCGATGTTTGAGTCCCTTAGGTAGAACCTACACTT
CTAAACATGTGTGTTTCATTGAGCAAGATTTCCTTTTCTCCACCAATTTTCTCCAAAACCCATCAGCCATAAACAATGAATCACCTCCTATCTTATCATGGCTACCAATC
CTCCAATCCTCAACCACTTTCCAGCACAACACCTCAGACTTCTCCCCCACTTCGCATACTGAACTCCATTCTAACTCTACCAGTCCTTGTCCAACTCCCTCGAATCATTT
AACCCCATATGAAACCATAAGCCCTTCGGTTTCCAGCTCTACTTCCAGCCATGTCCCTCCCTCGTTTACTGACCACCCAACCCCTGTCACTCCACCACCCCAACCCAACA
CTCACCCTATGGAATGGCAGCAGGCCATGGCTGAGGAATTTTCTGCCCTCATTAAAAATAACACTTGGGACTTGGTTCCCCCTAATCCACAACAACATCTTGTTGGCAAT
AAATGGGTTTTCAAATTACAAAGAGTTGCTGACAAGTCAATTCTTCGGTACAAAGCTCGATTGTCATTTAATCGCCCATCATCTAAAGTGTATAGATGA
Protein sequenceShow/hide protein sequence
MANPINDPLNVPSSGSDSTNPNTPVIKSSTDSAYVKKIQICMKLDQEPALLVLHHRNNFLLWQNIALPILRSYKLEGHLTGKDECPEHSIIIPPSEDEPKGLTLPNQEHD
IWLAADQLLVGWLYNSMIAEVAFQVTGYDTAKNLWDALQEYYGLQAASQQDYLKRMLQQTRKGSTKMSEYLALMKGYADNLYLAGSPVSTRDLISYVIAGLDEEYTPILS
KGLVSINQPSVNVATSNSNQGNSSNQSNRNSKNYRGRGRGRGGNAPRLICQACGKAGHSAAVCYYRFEKNFNSFQNSGNSSSSHQNKNSAPTALIATPEHLYNSAWYLDS
GAPNHLTSDLANLTVKSEYTGNEMITVGSGQQLPIHYIGSTNIKGNARNLVSKSLMHVPMISKNLISISRLTMDNSGIVEFYDSFCIVKDKETGKVLLEGTIKDGLYQVK
TASNKQNQEVPGSKFAFVAFFKQVENMLHRKIKEVRCDEGGEFKPLIHFASGNEIKIQFTCPCTSDQNGRVERKHWHIVETGLSLLAQAHMPLAYWWEPFHTVVYLINRM
PTVALHGKVPFTTLYNKEPDYHVLRTFGSACFPCLRPYQSNKFDFHTKKCVFLGYSGNHKGYRCLSPLGRTYTSKHVCFIEQDFLFSTNFLQNPSAINNESPPILSWLPI
LQSSTTFQHNTSDFSPTSHTELHSNSTSPCPTPSNHLTPYETISPSVSSSTSSHVPPSFTDHPTPVTPPPQPNTHPMEWQQAMAEEFSALIKNNTWDLVPPNPQQHLVGN
KWVFKLQRVADKSILRYKARLSFNRPSSKVYR