; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0011790 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0011790
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionIntegrase catalytic domain-containing protein
Genome locationchr1:32987263..32995985
RNA-Seq ExpressionLag0011790
SyntenyLag0011790
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR029472 - Retrotransposon Copia-like, N-terminal
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAA7043502.1 unnamed protein product [Microthlaspi erraticum]1.2e-16033.18Show/hide
Query:  DGTNLVLVSDLVTEFNYTSWCNAMIIGLTVKNKLGFVDGTII---------------NLTVKCGALGSFA----------------MLDLQQRYQRKNRP
        D   L L+S+++   NY +W  AM I L  KNK+ F+DG+I+               N  VK   L S +                  DL  R+   + P
Subjt:  DGTNLVLVSDLVTEFNYTSWCNAMIIGLTVKNKLGFVDGTII---------------NLTVKCGALGSFA----------------MLDLQQRYQRKNRP

Query:  RVFQIRREILNLVQDQDSVTTYFAKLKTLWNEFSSYRPVCSCGKCTCGSVKKLTEYFQTEHVMTFLMGLNDSFSQILTQLLLMEPESTIQRAFSLVAQEV
          +Q+ ++I +L Q    ++TY+ KLKTLW+E        +C  C C   K      +   V+ FL GLN+S++ I +Q+++ +    +   ++L+ Q+ 
Subjt:  RVFQIRREILNLVQDQDSVTTYFAKLKTLWNEFSSYRPVCSCGKCTCGSVKKLTEYFQTEHVMTFLMGLNDSFSQILTQLLLMEPESTIQRAFSLVAQEV

Query:  EQRALVNITHSSSVASNTITSPAALLVKNNPPSRTQSSMNKKKERPHCTHCNILGHTVDKCYKIHDYPPGYRN--QRVSSSKTDT------------PCS
         QR+++ + ++S+     I +P ++ +  N  +   +   +K+ +P C++C   GHTV+ C+KIH YPPG+++  Q+ +  KT +              S
Subjt:  EQRALVNITHSSSVASNTITSPAALLVKNNPPSRTQSSMNKKKERPHCTHCNILGHTVDKCYKIHDYPPGYRN--QRVSSSKTDT------------PCS

Query:  TVTHGSLSLADSLSSLTPEQCQGLLAILQSHL-----------------TKVSAPADTSPSTHVVGICHVP-HVSFVTSWVLDYGASIHICHLKELFTYL
          +  + +++D+L  L+ +Q QG++    S L                 T +   A +S + H VG      H+    SW++D GA+ H+CH K LF  +
Subjt:  TVTHGSLSLADSLSSLTPEQCQGLLAILQSHL-----------------TKVSAPADTSPSTHVVGICHVP-HVSFVTSWVLDYGASIHICHLKELFTYL

Query:  RPVTH------------------------------------NFNLISISALTASQPLVVKFVENSCILQDKFSWKTIGKADHWQGLYLLKVASPVTALST
           ++                                      NL+SIS LT      V F  +SC++QD      IG+ +    LY+L V+     L T
Subjt:  RPVTH------------------------------------NFNLISISALTASQPLVVKFVENSCILQDKFSWKTIGKADHWQGLYLLKVASPVTALST

Query:  TVLNKYFPCNNVT----WHDRLDHPSSKHLNALGSLLQNNSVKTFSHDPCLIYPLAKQRRLSFHSNNHISGKVFDLLHCDIWGPYKTPTHAGFRYFLTLV
        + L+  F  N V     WH+RL HPS   ++ +  +L     K     PC I PLAKQ+ LSF S N++    FDLLH DIWGPY  PT  G+RYFLT+V
Subjt:  TVLNKYFPCNNVT----WHDRLDHPSSKHLNALGSLLQNNSVKTFSHDPCLIYPLAKQRRLSFHSNNHISGKVFDLLHCDIWGPYKTPTHAGFRYFLTLV

Query:  DDHSCYTWIFLIRQKSEALQIIPRFFQLVETQYSASIKQFRSDNAPELAFTEFFASKGVIHQYSCVARPEQNSVVERKHQHLLNVTRALLFQSRVPVCFW
        DDHS   W++L+R K E L++ P F  ++E Q+   ++  RSDNAPEL F   +  KG++  +SC   PEQNSVVERKHQH+LNV R+L+FQ++VP+ +W
Subjt:  DDHSCYTWIFLIRQKSEALQIIPRFFQLVETQYSASIKQFRSDNAPELAFTEFFASKGVIHQYSCVARPEQNSVVERKHQHLLNVTRALLFQSRVPVCFW

Query:  GECVFTACYLINMTPSPLLKWETSFLRLHRKHVDYSNIKVFGCLAFVSTLQHNRSKLQPRALPTVFLGYPPGVKGFRLYDIQSKKIFVSRDVVFHEHLFP
        G+CV TA +LIN  P+PLLK +T +  L+ K +DY  ++VFGCLAF ST   NR+K QPRA P VFLGYP G KG++L D++S KI +SR+VVFHE +FP
Subjt:  GECVFTACYLINMTPSPLLKWETSFLRLHRKHVDYSNIKVFGCLAFVSTLQHNRSKLQPRALPTVFLGYPPGVKGFRLYDIQSKKIFVSRDVVFHEHLFP

Query:  FHTVAPQGDPTTIFPNLVLPMTFNYSGTSFTDQATSGFPTLADGSNFDEQTPDVLTGTTADEIPDTCARIEPEQQPQSPVVADPTDVISQPDRVVPNSEL
        F T   +              +F++  T     ++S  P++A+G       P V++G T                   PVV D                 
Subjt:  FHTVAPQGDPTTIFPNLVLPMTFNYSGTSFTDQATSGFPTLADGSNFDEQTPDVLTGTTADEIPDTCARIEPEQQPQSPVVADPTDVISQPDRVVPNSEL

Query:  AIVVDTVTTGSCNQP---RHSNRTCKVPSYLRDYHCSLLFSTDIPKANHPYPLVTYLSYDRLSPKYKSFVLSVSTHYEPDFYHQAAPFPHWRETIVLSYM
         + +D+ T      P       R+ K P+YL+DY C++       +AN PYPL  YLSY  LS  YK+++ SV+ H EP  ++QA  F  W + +    M
Subjt:  AIVVDTVTTGSCNQP---RHSNRTCKVPSYLRDYHCSLLFSTDIPKANHPYPLVTYLSYDRLSPKYKSFVLSVSTHYEPDFYHQAAPFPHWRETIVLSYM

Query:  L----------------------W----KLTVRGLLSLFQRVITPLGAGRDR---LPRNFSPVAKLVTVKILLSIVVSLNWPLLQLDVNNAFLHGELNEE
                               W    KL   G L  ++  +   G  +         FSPVAK+ TVK LL++  + +W L QLD++NAFL+G+L+EE
Subjt:  L----------------------W----KLTVRGLLSLFQRVITPLGAGRDR---LPRNFSPVAKLVTVKILLSIVVSLNWPLLQLDVNNAFLHGELNEE

Query:  VYMDVPLGY
        +YM +P GY
Subjt:  VYMDVPLGY

KZV25004.1 Cysteine-rich RLK (receptor-like protein kinase) 8 [Dorcoceras hygrometricum]1.1e-16635.67Show/hide
Query:  DGTNLVLVSDLVTEFNYTSWCNAMIIGLTVKNKLGFVDGTI---------------INLTVKCGALGSFA----------------MLDLQQRYQRKNRP
        D   L LVS+ +   NY +W  AMI+ LT KNKLGF+D +I                N  V    L S A                  DL +R+   N P
Subjt:  DGTNLVLVSDLVTEFNYTSWCNAMIIGLTVKNKLGFVDGTI---------------INLTVKCGALGSFA----------------MLDLQQRYQRKNRP

Query:  RVFQIRREILNLVQDQDSVTTYFAKLKTLWNEFSSYRPVCSCGKCTCGSVKKLTEYFQTEHVMTFLMGLNDSFSQILTQLLLMEPESTIQRAFSLVAQEV
        R++QI++ +  L Q    V++Y+ KL+TLW+E   Y+P  +   CTCGS+++   Y   E VM FLMGLNDS++Q+  Q+L++EP  TI + F+LV QE 
Subjt:  RVFQIRREILNLVQDQDSVTTYFAKLKTLWNEFSSYRPVCSCGKCTCGSVKKLTEYFQTEHVMTFLMGLNDSFSQILTQLLLMEPESTIQRAFSLVAQEV

Query:  EQRALVNITHSSSVASNTITSPAALLVKNNPPSRT-QSSMNKKKERPHCTHCNILGHTVDKCYKIHDYPPGY-----RNQRVSSSKTDTPCSTVTHGSLS
         QR++      + V  + I S            RT Q+S   + +R  C+HC+   HTVDKCYK+H YPPG+     +  + S+       S+ TH    
Subjt:  EQRALVNITHSSSVASNTITSPAALLVKNNPPSRT-QSSMNKKKERPHCTHCNILGHTVDKCYKIHDYPPGY-----RNQRVSSSKTDTPCSTVTHGSLS

Query:  LADSLSSLTPEQCQGLLAILQSHL-TKVSAPADTSPSTHV---VGIC----HVPHVSFVTSWVLDYGASIHICHLKELF--------------TYLRPVT
          D   SLT  QC+ L+  L S L T+ +   +  P T V    GIC    H+P ++    W++D GA+ HIC    +F              T   PVT
Subjt:  LADSLSSLTPEQCQGLLAILQSHL-TKVSAPADTSPSTHV---VGIC----HVPHVSFVTSWVLDYGASIHICHLKELF--------------TYLRPVT

Query:  ---------------------HNFNLISISALTASQPLVVKFVENSCILQDKFSWKTIGKADHWQGLYLLKVASPVTALSTTVLNKYFPCNNVTWHDRLD
                               FNL+S+S+LT +    V F+ +SC +QD    + IG       LY+L+   P   L + + N  F  N+  WH R+ 
Subjt:  ---------------------HNFNLISISALTASQPLVVKFVENSCILQDKFSWKTIGKADHWQGLYLLKVASPVTALSTTVLNKYFPCNNVTWHDRLD

Query:  HPSSKHLNALGSLLQNNSVKTFSHDPCLIYPLAKQRRLSFHSNNHISGKVFDLLHCDIWGPYKTPTHAGFRYFLTLVDDHSCYTWIFLIRQKSEALQIIP
        HPS   L++L ++L  N   T   + C    L+KQRRL   S N+IS ++F+LLH D WGP+   +  GFR+F T+VDDHS YTW+++++ KS+ L I P
Subjt:  HPSSKHLNALGSLLQNNSVKTFSHDPCLIYPLAKQRRLSFHSNNHISGKVFDLLHCDIWGPYKTPTHAGFRYFLTLVDDHSCYTWIFLIRQKSEALQIIP

Query:  RFFQLVETQYSASIKQFRSDNAPELAFTEFFASKGVIHQYSCVARPEQNSVVERKHQHLLNVTRALLFQSRVPVCFWGECVFTACYLINMTPSPLLKWET
         F ++V TQ+  ++K  RSDNAPEL F +FFA  G+ H +SCV RP+QNSVVERKHQH+LNV RALLFQS +P+ +W +C+ T+ YLIN TPSP+L  +T
Subjt:  RFFQLVETQYSASIKQFRSDNAPELAFTEFFASKGVIHQYSCVARPEQNSVVERKHQHLLNVTRALLFQSRVPVCFWGECVFTACYLINMTPSPLLKWET

Query:  SFLRLHRKHVDYSNIKVFGCLAFVSTLQHNRSKLQPRALPTVFLGYPPGVKGFRLYDIQSKKIFVSRDVVFHEHLFPFHTVAPQGDPTTIFPNLVLPMTF
         F  LH K   YS++KVFGCL + STL  +R K  PRA+  VF+GYPPG KG++L ++++ +IF+SRDV+FHE+ FP+   +P           +  MTF
Subjt:  SFLRLHRKHVDYSNIKVFGCLAFVSTLQHNRSKLQPRALPTVFLGYPPGVKGFRLYDIQSKKIFVSRDVVFHEHLFPFHTVAPQGDPTTIFPNLVLPMTF

Query:  NYSGTSFTDQATSGFPTLADGSNFDEQTPDVLTGTTADEIPDTCARIEPEQQPQSPVVADPTDVISQPDRVVPNSELAIVVDTVTTGSCNQPRHSNRTCK
          S +S                   + TP +                             P D                           Q   ++R   
Subjt:  NYSGTSFTDQATSGFPTLADGSNFDEQTPDVLTGTTADEIPDTCARIEPEQQPQSPVVADPTDVISQPDRVVPNSELAIVVDTVTTGSCNQPRHSNRTCK

Query:  VPSYLRDYHCSLLFSTDIPKANHP-YPLVTYLSYDRLSPKYKSFVLSVSTHYEPDFYHQAAPFPHWRETI--------------VLSY--------MLW-
         PS+LRDYHC  + +       HP +PLV   +Y +LS  +++FV ++S+  EP  + QA   P WR+ +              ++S           W 
Subjt:  VPSYLRDYHCSLLFSTDIPKANHP-YPLVTYLSYDRLSPKYKSFVLSVSTHYEPDFYHQAAPFPHWRETI--------------VLSY--------MLW-

Query:  ---KLTVRGLLSLFQRVITPLG----AGRDRLPRNFSPVAKLVTVKILLSIVVSLNWPLLQLDVNNAFLHGELNEEVYMDVPLGY
           K    G L  ++  +   G     G D L   FSPVAKLVTV+ LL++     W L+QLDVNNAFLHG+L EEVYM +P G+
Subjt:  ---KLTVRGLLSLFQRVITPLG----AGRDRLPRNFSPVAKLVTVKILLSIVVSLNWPLLQLDVNNAFLHGELNEEVYMDVPLGY

RVW21404.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]1.0e-16136.53Show/hide
Query:  GNMKRPGPTTRKIIGQQTSK-----ERKDGTNLVLVSDLVTEFNYTSWCNAMIIGLTVKNKLGFVDGTIIN---------LTVKCGALGSFAML------
        GN  R G    +I+  + S         D   L LVS+L+T  NY +W  AM++ LT KNK+GFVDGTI              +C ++ S  ++      
Subjt:  GNMKRPGPTTRKIIGQQTSK-----ERKDGTNLVLVSDLVTEFNYTSWCNAMIIGLTVKNKLGFVDGTIIN---------LTVKCGALGSFAML------

Query:  ----------------DLQQRYQRKNRPRVFQIRREILNLVQDQDSVTTYFAKLKTLWNEFSSYRPVCSCGKCTCGSVKKLTEYFQTEHVMTFLMGLNDS
                        DL  R+ + N PR+FQI++++  L Q    V +YF KLK LW+E   ++PV     C CG ++  T+Y   E+V+ FLMGLND 
Subjt:  ----------------DLQQRYQRKNRPRVFQIRREILNLVQDQDSVTTYFAKLKTLWNEFSSYRPVCSCGKCTCGSVKKLTEYFQTEHVMTFLMGLNDS

Query:  FSQILTQLLLMEPESTIQRAFSLVAQEVEQRALVNITHSSSVASNTITSPAALLVKNNPPSRTQSSMNKKKERPHCTHCNILGHTVDKCYKIHDYPPGYR
        ++QI  Q+L+M+P   I + FSLV QE E+   V  ++S S  S+ +T  +     +N P+ +      +++R  C++C   GH  DKCYK+  YPPG++
Subjt:  FSQILTQLLLMEPESTIQRAFSLVAQEVEQRALVNITHSSSVASNTITSPAALLVKNNPPSRTQSSMNKKKERPHCTHCNILGHTVDKCYKIHDYPPGYR

Query:  NQRVSSSKTDTPCSTVTHGSLSLADS---LSSLTPEQCQGLLAILQSHLTKVSAPADTSPSTHVVGICHVPHVSFVT---------SWVLDYGASIHICH
         +    + +    ++    SL+   S   +SSLT  QCQ L+ +L + L+  S+ +  + ST        P VS             W+++ GA+ H+C+
Subjt:  NQRVSSSKTDTPCSTVTHGSLSLADS---LSSLTPEQCQGLLAILQSHLTKVSAPADTSPSTHVVGICHVPHVSFVT---------SWVLDYGASIHICH

Query:  LKELF-----------TYLRPVTHNFNLISISALTASQPLV-VKFVE--NSCILQDKFSWKTIGKADHWQGLYLLKVASPVTALSTTVLNKYFPCNNV--
           LF           T    +T   + +    L+    L+ V FV      +L +    K IGK          + AS +            P +N+  
Subjt:  LKELF-----------TYLRPVTHNFNLISISALTASQPLV-VKFVE--NSCILQDKFSWKTIGKADHWQGLYLLKVASPVTALSTTVLNKYFPCNNV--

Query:  TWHDRLDHPSSKHLNALGSLLQNNSVKTFSHDPCLIYPLAKQRRLSFHSNNHISGKVFDLLHCDIWGPYKTPTHAGFRYFLTLVDDHSCYTWIFLIRQKS
         WH RL HPS   L  L S+L  +S  +F   PC + PLAKQR L + S N      FDLLH DIWGP+   +  G+++FLT+VDDHS  TW+++++ KS
Subjt:  TWHDRLDHPSSKHLNALGSLLQNNSVKTFSHDPCLIYPLAKQRRLSFHSNNHISGKVFDLLHCDIWGPYKTPTHAGFRYFLTLVDDHSCYTWIFLIRQKS

Query:  EALQIIPRFFQLVETQYSASIKQFRSDNAPELAFTEFFASKGVIHQYSCVARPEQNSVVERKHQHLLNVTRALLFQSRVPVCFWGECVFTACYLINMTPS
        E  + IP FF  V+ Q+   +K  RSDNAPEL  + F+ S GVIH  SCV  P+QNSVVERKHQH+LNV RALLFQS +P+C+W +C+ TA YLIN TPS
Subjt:  EALQIIPRFFQLVETQYSASIKQFRSDNAPELAFTEFFASKGVIHQYSCVARPEQNSVVERKHQHLLNVTRALLFQSRVPVCFWGECVFTACYLINMTPS

Query:  PLLKWETSFLRLHRKHVDYSNIKVFGCLAFVSTLQHNRSKLQPRALPTVFLGYPPGVKGFRLYDIQSKKIFVSRDVVFHEHLFPFHTVAPQGDP---TTI
        P L  +T F  LH K  DYS+++VFGCL +VSTL+ NR+K  PRA   VFLGYP G KG++L DI+++ I +SR+V+FHE +FPF    P   P   + +
Subjt:  PLLKWETSFLRLHRKHVDYSNIKVFGCLAFVSTLQHNRSKLQPRALPTVFLGYPPGVKGFRLYDIQSKKIFVSRDVVFHEHLFPFHTVAPQGDP---TTI

Query:  FPNLVLPMTFNYSGTSFTDQATSGFPTLADGSNFDEQTPDVLTGTTADEIPDTCARIEPEQQPQSPVVADPTDVISQPDRVVPNSELAIVVDTVTTGSCN
        F + VLP        +  DQ++S  P                                        VV+ P      P +V P+S               
Subjt:  FPNLVLPMTFNYSGTSFTDQATSGFPTLADGSNFDEQTPDVLTGTTADEIPDTCARIEPEQQPQSPVVADPTDVISQPDRVVPNSELAIVVDTVTTGSCN

Query:  QPRHSNRTCKVPSYLRDYHCSLL-FSTDIPKANHPYPLVTYLSYDRLSPKYKSFVLSVSTHYEPDFYHQAAPFPHWRETIVLSYMLWKLTVRGLLSLFQR
              R  K  SYL+DYHCSL+ F   +   +  +P+  +LSYD+LS  YK F LSVS   EPD + +AA  P WR     + M  +L        +  
Subjt:  QPRHSNRTCKVPSYLRDYHCSLL-FSTDIPKANHPYPLVTYLSYDRLSPKYKSFVLSVSTHYEPDFYHQAAPFPHWRETIVLSYMLWKLTVRGLLSLFQR

Query:  VITPLGAGRDRLPRNFSPVAKLVTVKILLSIVVSLNWPLLQLDVNNAFLHGELNEEVYMDVPLGY
        V  P+G+     P   + +AKLVTVK+LL+I     W L QLDVNNAFLHG+LNEEVYM +P GY
Subjt:  VITPLGAGRDRLPRNFSPVAKLVTVKILLSIVVSLNWPLLQLDVNNAFLHGELNEEVYMDVPLGY

XP_012857659.1 PREDICTED: uncharacterized protein LOC105976934 [Erythranthe guttata]2.3e-16136.48Show/hide
Query:  DGTNLVLVSDLVTEFNYTSWCNAMIIGLTVKNKLGFVDGTI----------INLTVKCGALGSFAML----------------------DLQQRYQRKNR
        DG  LVLVS L+ E NY +W  AM+I LTVKNKLGF+DG+I          +N  V+  ++    +L                      DL+ R+ + N 
Subjt:  DGTNLVLVSDLVTEFNYTSWCNAMIIGLTVKNKLGFVDGTI----------INLTVKCGALGSFAML----------------------DLQQRYQRKNR

Query:  PRVFQIRREILNLVQDQDSVTTYFAKLKTLWNEFSSYRPVCSCGKCTCGSVKKLTEYFQTEHVMTFLMGLNDSFSQILTQLLLMEPESTIQRAFSLVAQE
        PR+FQ+RRE+ NL QDQ SV  YF KLK +W+E  ++RP C+CG+C+CG V KL ++   EHVM+FLMGLNDS +    Q+LLM+P   I + F+LV+QE
Subjt:  PRVFQIRREILNLVQDQDSVTTYFAKLKTLWNEFSSYRPVCSCGKCTCGSVKKLTEYFQTEHVMTFLMGLNDSFSQILTQLLLMEPESTIQRAFSLVAQE

Query:  VEQRALVNITHSSSVASNTITSPAAL----LVKNNPPSRTQSSMNKKKERPHCTHCNILGHTVDKCYKIHDYPPGYR---------NQRVSSSKTDTPCS
           R+ V +T SS V  +   +   +     V+    ++   + +++K++ +CTHC+  GHTV+KCY++H +PPGY+         NQ   +       S
Subjt:  VEQRALVNITHSSSVASNTITSPAAL----LVKNNPPSRTQSSMNKKKERPHCTHCNILGHTVDKCYKIHDYPPGYR---------NQRVSSSKTDTPCS

Query:  TVTH-------GSLSLA-----DSLSSLTPEQCQGLLAILQSHLT---------KVSAPADTSPSTHVVGIC-----HVPHVSFVT-SWVLDYGASIHIC
         + H       GSLS +     + L ++T  QCQ LL+ + SHL          K S   DTS  + V GIC     H P  SF+   W+LD GAS HIC
Subjt:  TVTH-------GSLSLA-----DSLSSLTPEQCQGLLAILQSHLT---------KVSAPADTSPSTHVVGIC-----HVPHVSFVT-SWVLDYGASIHIC

Query:  HLKELFTYLRPVT----------------------------HN--------FNLISISALTASQPLVVKFVENSCILQDKFSWKTIGKADHWQGLYLLKV
        H K LF  ++ V+                            HN        FNL+S+SAL      VV F E S  +QD+   + IGK +  QGLY+L  
Subjt:  HLKELFTYLRPVT----------------------------HN--------FNLISISALTASQPLVVKFVENSCILQDKFSWKTIGKADHWQGLYLLKV

Query:  ASPVTALSTTVLNKYFPCNNVT---WHDRLDHPSSKHLNALGSLLQNNSVKTFSHDPCLIYPLAKQRRLSFHSNNHISGKVFDLLHCDIWGPYKTPTHAG
          PV+A       ++  CN ++   WH RL H     L  L      +  K      C + PLAKQ+RL F +++ +S  +FDL+HCDIWGP+K P+++G
Subjt:  ASPVTALSTTVLNKYFPCNNVT---WHDRLDHPSSKHLNALGSLLQNNSVKTFSHDPCLIYPLAKQRRLSFHSNNHISGKVFDLLHCDIWGPYKTPTHAG

Query:  FRYFLTLVDDHSCYTWIFLIRQKSEALQIIPRFFQLVETQYSASIKQFRSDNAPELAFTEFFASKGVIHQYSCVARPEQNSVVERKHQHLLNVTRALLFQ
        F YF+TLVDD+S +TW+ L++ KSE + ++PRF ++V  Q+  SIK FRSDNA EL F   F   GVIHQ+SCV  P+QN++VERKHQH+LNV R+L FQ
Subjt:  FRYFLTLVDDHSCYTWIFLIRQKSEALQIIPRFFQLVETQYSASIKQFRSDNAPELAFTEFFASKGVIHQYSCVARPEQNSVVERKHQHLLNVTRALLFQ

Query:  SRVPVCFWGECVFTACYLINMTPSPLLKWETSFLRLH-RKHVDYSNIKVFGCLAFVSTLQHNRSKLQPRALPTVFLGYPPGVKGFRLYDIQSKKIFVSRD
        S +P+ +W EC+ TA +LIN  P+  L   + +  L+  K  DY ++K FGCL F + +  ++SK  PRA   VFLGYP G+KG++L D+ S K+F+SRD
Subjt:  SRVPVCFWGECVFTACYLINMTPSPLLKWETSFLRLH-RKHVDYSNIKVFGCLAFVSTLQHNRSKLQPRALPTVFLGYPPGVKGFRLYDIQSKKIFVSRD

Query:  VVFHEHLFPFHTVAPQGDPTTIFPNLVLPMTFNYSGTSFTDQATSGFPTLADGSNFDEQTPDVLTGTTADEIPDTCARIEPEQQPQSPVVADPTDVISQP
        V+FHE+++PF   +   +               +  T F+    S  P+    S              ++  P     + P   P +  V  P       
Subjt:  VVFHEHLFPFHTVAPQGDPTTIFPNLVLPMTFNYSGTSFTDQATSGFPTLADGSNFDEQTPDVLTGTTADEIPDTCARIEPEQQPQSPVVADPTDVISQP

Query:  DRVVPNSELAIVVDTVTTGSCNQPRHSNRTCKVPSYLRDYHCSLLFSTDIPKANHPYPLVTYLSYDRLSPKYKSFVLSVSTHYEPDFYHQAAPFP
          +VP                   R S+R  K P+YL D+ C+ + +T  P     YP+  +    +LSP Y++F+L++S   EP  Y + + FP
Subjt:  DRVVPNSELAIVVDTVTTGSCNQPRHSNRTCKVPSYLRDYHCSLLFSTDIPKANHPYPLVTYLSYDRLSPKYKSFVLSVSTHYEPDFYHQAAPFP

XP_022154919.1 uncharacterized protein LOC111022065 [Momordica charantia]2.2e-17237.84Show/hide
Query:  DGTNLVLVSDLVTEFNYTSWCNAMIIGLTVKNKLGFVDGTII-----------------------NLTVKCGALGSFA------MLDLQQRYQRKNRPRV
        D T+LVLVSDL+T+ NYTSW  +++I LTVKNK+GFVDG+I                        +L+ K  A   F+       LDL++R+QR+NRPR+
Subjt:  DGTNLVLVSDLVTEFNYTSWCNAMIIGLTVKNKLGFVDGTII-----------------------NLTVKCGALGSFA------MLDLQQRYQRKNRPRV

Query:  FQIRREILNLVQDQDSVTTYFAKLKTLWNEFSSYRPVCSCGKCTCGSVKKLTEYFQTEHVMTFLMGLNDSFSQILTQLLLMEPESTIQRAFSLVAQEVEQ
        FQ+RRE+ NL QDQ SVT YF +LKTLW+E + YRP CSCG+C+ G VK +  ++Q E+VM FLMGLN SFSQI  QLLLMEP  TI RAF+LVAQE++Q
Subjt:  FQIRREILNLVQDQDSVTTYFAKLKTLWNEFSSYRPVCSCGKCTCGSVKKLTEYFQTEHVMTFLMGLNDSFSQILTQLLLMEPESTIQRAFSLVAQEVEQ

Query:  RALVNITHSSSVASNTITSPAALLVK---NNPPSRTQSSM---NKKKERPHCTHCNILGHTVDKCYKIHDYPPGYR--------NQRVSSSKTDTPCSTV
        R         S++  ++TSP A  V+   N+  SR  SS     K+K++  CTHC I GHTVDKCYK+H+YPPGYR        +   SS   + P  +V
Subjt:  RALVNITHSSSVASNTITSPAALLVK---NNPPSRTQSSM---NKKKERPHCTHCNILGHTVDKCYKIHDYPPGYR--------NQRVSSSKTDTPCSTV

Query:  THGSLSLADSLSSLTPEQCQGLLAILQSHLTKVSAPADTSPSTHVVGICHVPHVSFVTSWVLDYGASIHICHLKELFTYLRPVTHNFNLISISALTASQP
        +     +++SL++LT +QCQ LL +LQSHLT     +D    T                                                         
Subjt:  THGSLSLADSLSSLTPEQCQGLLAILQSHLTKVSAPADTSPSTHVVGICHVPHVSFVTSWVLDYGASIHICHLKELFTYLRPVTHNFNLISISALTASQP

Query:  LVVKFVENSCILQDKFSWKTIGKADHWQGLYLLKVASPVTALSTTVLNKYFPCNNVTWHDRLDHPSSKHLNALGSLLQNNSVKTFSHDPCLIYPLAKQRR
                                                                                             SH             
Subjt:  LVVKFVENSCILQDKFSWKTIGKADHWQGLYLLKVASPVTALSTTVLNKYFPCNNVTWHDRLDHPSSKHLNALGSLLQNNSVKTFSHDPCLIYPLAKQRR

Query:  LSFHSNNHISGKVFDLLHCDIWGPYKTPTHAGFRYFLTLVDDHSCYTWIFLIRQKSEALQIIPRFFQLVETQYSASIKQFRSDNAPELAFTEFFASKGVI
                                                                           + ETQ+  +IK+FRSD APEL+F EFF SKGV+
Subjt:  LSFHSNNHISGKVFDLLHCDIWGPYKTPTHAGFRYFLTLVDDHSCYTWIFLIRQKSEALQIIPRFFQLVETQYSASIKQFRSDNAPELAFTEFFASKGVI

Query:  HQYSCVARPEQNSVVERKHQHLLNVTRALLFQSRVPVCFWGECVFTACYLINMTPSPLLKWETSFLRLHRKHVDYSNIKVFGCLAFVSTLQHNRSKLQPR
        HQ+SCV  PEQNSVVERKHQHLLNV R+L FQSRVP  FWGECV TA YLIN TP+P+L W T + RL+    DYS++KVFGCL FVST   NRSK  PR
Subjt:  HQYSCVARPEQNSVVERKHQHLLNVTRALLFQSRVPVCFWGECVFTACYLINMTPSPLLKWETSFLRLHRKHVDYSNIKVFGCLAFVSTLQHNRSKLQPR

Query:  ALPTVFLGYPPGVKGFRLYDIQSKKIFVSRDVVFHEHLFPFHTVAPQGDPTTIFPNLVLPMTFNYSGTSFTDQATSGFPTLADGSNFDEQTPDVLTGTTA
        AL +VF+GYPPG+KG++LYDI++K+ FVSRDV+FHE +FPFHTV+        FP +V+P +++   TS    A       A GS     + D+      
Subjt:  ALPTVFLGYPPGVKGFRLYDIQSKKIFVSRDVVFHEHLFPFHTVAPQGDPTTIFPNLVLPMTFNYSGTSFTDQATSGFPTLADGSNFDEQTPDVLTGTTA

Query:  DEIPDTCARIEPEQQPQSPVVADPTDV---------ISQPDRVVPNSELAIVVDTVTTGSCNQPRHSNRTCKVPSYLRDYHCSLLFSTDIPKANHPYPLV
         +        E +      +VA+ +++         I+  D  V N + A+VV           R S+R  + PSYLRDYHC L+ +TD   ++  YPL 
Subjt:  DEIPDTCARIEPEQQPQSPVVADPTDV---------ISQPDRVVPNSELAIVVDTVTTGSCNQPRHSNRTCKVPSYLRDYHCSLLFSTDIPKANHPYPLV

Query:  TYLSYDRLSPKYKSFVLSVSTHYEPDFYHQAAPFPHWRE----------------TIVLSY------MLWKLTVR----GLLSLFQRVITPLG----AGR
         YL Y+ LS  YK FVLSVS  YEP FYHQA PF HWRE                 + L Y        W   V+    G +  ++  +   G     G 
Subjt:  TYLSYDRLSPKYKSFVLSVSTHYEPDFYHQAAPFPHWRE----------------TIVLSY------MLWKLTVR----GLLSLFQRVITPLG----AGR

Query:  DRLPRNFSPVAKLVTVKILLSIVVSLNWPLLQLDVNNAFLHGELNEEVYMDVPLGYE
        D +   FSPVAKLVTVK+LL++ VS NW L+QLDVNNAFLHG+L EEVYMD+PLGY+
Subjt:  DRLPRNFSPVAKLVTVKILLSIVVSLNWPLLQLDVNNAFLHGELNEEVYMDVPLGYE

TrEMBL top hitse value%identityAlignment
A0A2N9EDE7 Integrase catalytic domain-containing protein1.2e-17435.66Show/hide
Query:  DGTNLVLVSDLVTEFNYTSWCNAMIIGLTVKNKLGFVDGTIINLTVKCGALGSF-------------------------------AMLDLQQRYQRKNRP
        D   L LV+ ++T  N+ +W  +M + L  KNK GFV+G I  +        S+                                  DL++R+ + N P
Subjt:  DGTNLVLVSDLVTEFNYTSWCNAMIIGLTVKNKLGFVDGTIINLTVKCGALGSF-------------------------------AMLDLQQRYQRKNRP

Query:  RVFQIRREILNLVQDQDSVTTYFAKLKTLWNEFSSYRPVCSCGKCTCGSVKKLTEYFQTEHVMTFLMGLNDSFSQILTQLLLMEPESTIQRAFSLVAQEV
        RV+Q+++ I +L QDQ SV+T++ KLK LW+E  ++ P+ +   C CG++K L +Y   E+VM FL+GLNDS+  I  Q+LLMEP  +I + F+LV+QE 
Subjt:  RVFQIRREILNLVQDQDSVTTYFAKLKTLWNEFSSYRPVCSCGKCTCGSVKKLTEYFQTEHVMTFLMGLNDSFSQILTQLLLMEPESTIQRAFSLVAQEV

Query:  EQRALVNITHSSSVASNTITSPAALLVKNNPPSRTQSSMNKKKERPHCTHCNILGHTVDKCYKIHDYPPGYRNQRVSSSKTDTPCSTVTHGSLSLADSLS
         QR L    +S  +     + P A  V N  P     +   KKERP C+HC I GHTV+KCYKIH YPPGY+ +  +++   T  S    GS  L     
Subjt:  EQRALVNITHSSSVASNTITSPAALLVKNNPPSRTQSSMNKKKERPHCTHCNILGHTVDKCYKIHDYPPGYRNQRVSSSKTDTPCSTVTHGSLSLADSLS

Query:  SLTPEQCQGLLAILQSHLTKVSAPADTSPSTHVVG-------------ICHVP-HVSFVT-----------SWVLDYGASIHICHLKELFTYLRPVTH--
        S+T EQCQ LL+ L S ++  ++ ++   +T V+              + H P H  F T           +WV+D GA  H+    +LFT +    H  
Subjt:  SLTPEQCQGLLAILQSHLTKVSAPADTSPSTHVVG-------------ICHVP-HVSFVT-----------SWVLDYGASIHICHLKELFTYLRPVTH--

Query:  ----------------------------------NFNLISISALTASQPLVVKFVENSCILQDKFSWKTIGKADHWQGLYLLKVASPVTALSTTVLNK--
                                          +FNLIS+S LT+S    + F+ N C +QD   WK IG+    +GLYLL+    V   S   L+   
Subjt:  ----------------------------------NFNLISISALTASQPLVVKFVENSCILQDKFSWKTIGKADHWQGLYLLKVASPVTALSTTVLNK--

Query:  ----YFPCNNVT-----WHDRLDHPSSKHLNALGSLLQNNSVKTFSHDP---CLIYPLAKQRRLSFHSNNHISGKVFDLLHCDIWGPYKTPTHAGFRYFL
            +   NN +     WH+RL H S  +L  L + + +       +D    CL+ PLAKQ+RL F  +N +    F L+HCD+WGP    T+ GF+YFL
Subjt:  ----YFPCNNVT-----WHDRLDHPSSKHLNALGSLLQNNSVKTFSHDP---CLIYPLAKQRRLSFHSNNHISGKVFDLLHCDIWGPYKTPTHAGFRYFL

Query:  TLVDDHSCYTWIFLIRQKSEALQIIPRFFQLVETQYSASIKQFRSDNAPELAFTEFFASKGVIHQYSCVARPEQNSVVERKHQHLLNVTRALLFQSRVPV
        T+VDDH+  TW+FL++ K E    +  FF LVETQ++  IK  R+DNA E +   F+  +GV+H  SCVA P+QNSVVERKHQH+LNV RAL FQS +P+
Subjt:  TLVDDHSCYTWIFLIRQKSEALQIIPRFFQLVETQYSASIKQFRSDNAPELAFTEFFASKGVIHQYSCVARPEQNSVVERKHQHLLNVTRALLFQSRVPV

Query:  CFWGECVFTACYLINMTPSPLLKWETSFLRLHRKHVDYSNIKVFGCLAFVSTLQHNRSKLQPRALPTVFLGYPPGVKGFRLYDIQSKKIFVSRDVVFHEH
         +WG+CV TA YLIN TPS +L+ +T F  L     +YS++KVFGCL + STL  NRSK  PRA   +FLGYP GVKG+++ D+ + ++F+SRDVVFHE+
Subjt:  CFWGECVFTACYLINMTPSPLLKWETSFLRLHRKHVDYSNIKVFGCLAFVSTLQHNRSKLQPRALPTVFLGYPPGVKGFRLYDIQSKKIFVSRDVVFHEH

Query:  LFPFHTVAPQGDPTTIFPNLVLP----------------MTFNYSGTSFTDQAT------SGFPTLADGSNFDEQTP--DVLTGTTADEIPD-TCARIEP
        +FPF   +        F  LVLP                +   +  +S +D A+      +  P L   +N    +P    L   +A  + D   A +  
Subjt:  LFPFHTVAPQGDPTTIFPNLVLP----------------MTFNYSGTSFTDQAT------SGFPTLADGSNFDEQTP--DVLTGTTADEIPD-TCARIEP

Query:  EQQPQSPVVADPTDVISQPDRVVPNSELAIVVDTVTTGSCNQPRHSNRTCKVPSYLRDYHCSLLFSTD------------IPKANHPYPLVTYLSYDRLS
          Q  SP ++ P +  S P  + P      VV+T         R S R+   P YL DYHC+L  S              +      + L   LSY  LS
Subjt:  EQQPQSPVVADPTDVISQPDRVVPNSELAIVVDTVTTGSCNQPRHSNRTCKVPSYLRDYHCSLLFSTD------------IPKANHPYPLVTYLSYDRLS

Query:  PKYKSFVLSVSTHYEPDFYHQAAPFPHWRETI------VLSYMLWKLT--------------------VRGLLSLFQRVITPLG----AGRDRLPRNFSP
        PK++ F L++ST  EP FYHQA   P WR  +      + +   W LT                      G +  F+  +   G     G D     FSP
Subjt:  PKYKSFVLSVSTHYEPDFYHQAAPFPHWRETI------VLSYMLWKLT--------------------VRGLLSLFQRVITPLG----AGRDRLPRNFSP

Query:  VAKLVTVKILLSIVVSLNWPLLQLDVNNAFLHGELNEEVYMDVPLGY
        VAK  TV++LL++    +W + QLDVNNAFLHGEL+EEVYMD+P G+
Subjt:  VAKLVTVKILLSIVVSLNWPLLQLDVNNAFLHGELNEEVYMDVPLGY

A0A2N9EHN7 Integrase catalytic domain-containing protein6.3e-17335.81Show/hide
Query:  DGTNLVLVSDLVTEFNYTSWCNAMIIGLTVKNKLGFVDGTI--------------------------------INLTVKCGALGSFAMLDLQQRYQRKNR
        D   + +V D +T  NY +W  +M   L+ KNKLGFV+G+I                                I+ TV           DLQQRY + N 
Subjt:  DGTNLVLVSDLVTEFNYTSWCNAMIIGLTVKNKLGFVDGTI--------------------------------INLTVKCGALGSFAMLDLQQRYQRKNR

Query:  PRVFQIRREILNLVQDQDSVTTYFAKLKTLWNEFSSYRPV--CSCG-KCTCGSVKKLTEYFQTEHVMTFLMGLNDSFSQILTQLLLMEPESTIQRAFSLV
         RV  +++ I +L QD   V+ YF +LK LW+EF +YRP+  C+CG KC CG  + L +Y   ++V +FLMGLNDSF+ +  Q+LLMEP   I + FSL+
Subjt:  PRVFQIRREILNLVQDQDSVTTYFAKLKTLWNEFSSYRPV--CSCG-KCTCGSVKKLTEYFQTEHVMTFLMGLNDSFSQILTQLLLMEPESTIQRAFSLV

Query:  AQEVEQRA---LVNITHSSSVASNTITS------PAALLVKNNPP----SRTQSSMN----KKKERPH--CTHCNILGHTVDKCYKIHDYPPGYRNQ-RV
          + +QR    L   T   +V S  + S        AL   N  P    +RT +S       +K++P   C+HC   GHT DKCYK+H YPPG+R++ R 
Subjt:  AQEVEQRA---LVNITHSSSVASNTITS------PAALLVKNNPP----SRTQSSMN----KKKERPH--CTHCNILGHTVDKCYKIHDYPPGYRNQ-RV

Query:  SSSKTDTPCSTVTHG-SLSLADSLSSLT--PEQCQGLLAILQSHLTKVSAPAD---------------TSPSTHVVG---------------------IC
         +  +    S V H  S +   S+ +L     QCQ LL +L +   + ++ +D               T P +++ G                       
Subjt:  SSSKTDTPCSTVTHG-SLSLADSLSSLT--PEQCQGLLAILQSHLTKVSAPAD---------------TSPSTHVVG---------------------IC

Query:  HVPHVSFVTSWVLDYGASIHICHLKELFTYLR---------------PVTH---------------------NFNLISISALTASQPLVVKFVENSCILQ
          PH S    WV+D GA+ H+    + +T +                 VTH                     +FNLIS+S LT+S    + F+   C +Q
Subjt:  HVPHVSFVTSWVLDYGASIHICHLKELFTYLR---------------PVTH---------------------NFNLISISALTASQPLVVKFVENSCILQ

Query:  DKFSWKTIGKADHWQGLYLLKVASPVTALSTTVLNK-------YFPCNNV--------TWHDRLDHPSSKHLNALGSLLQNNSVKTFSHDPCLIYPLAKQ
        D   W+ IG      GLYLL  +S  T  +   L+         +  +++         WH R  HPS   ++ L S++ N S+ +     C + PLAKQ
Subjt:  DKFSWKTIGKADHWQGLYLLKVASPVTALSTTVLNK-------YFPCNNV--------TWHDRLDHPSSKHLNALGSLLQNNSVKTFSHDPCLIYPLAKQ

Query:  RRLSFHSNNHISGKVFDLLHCDIWGPYKTPTHAGFRYFLTLVDDHSCYTWIFLIRQKSEALQIIPRFFQLVETQYSASIKQFRSDNAPELAFTEFFASKG
        +RL F + NH+S   FDLLH DIWGPY  PT  G+RYFLTLVDD +  TWI+L+R KS+   ++  F  +++TQ+   IKQ RSDN  E    EF+ASKG
Subjt:  RRLSFHSNNHISGKVFDLLHCDIWGPYKTPTHAGFRYFLTLVDDHSCYTWIFLIRQKSEALQIIPRFFQLVETQYSASIKQFRSDNAPELAFTEFFASKG

Query:  VIHQYSCVARPEQNSVVERKHQHLLNVTRALLFQSRVPVCFWGECVFTACYLINMTPSPLLKWETSFLRLHRKHVDYSNIKVFGCLAFVSTLQHNRSKLQ
        +IHQ+SCV  P+QNSVVERKHQH+LNV R+L FQS +P+ +WG C+ TA YLIN  P P+L  ++ F  L  K   Y+++KVFGCL F STL  +R+K  
Subjt:  VIHQYSCVARPEQNSVVERKHQHLLNVTRALLFQSRVPVCFWGECVFTACYLINMTPSPLLKWETSFLRLHRKHVDYSNIKVFGCLAFVSTLQHNRSKLQ

Query:  PRALPTVFLGYPPGVKGFRLYDIQSKKIFVSRDVVFHEHLFPFHTVAPQGDPTTIFPNLVLPMTFNYSGTSFTDQATSGFPTLADGSNFDEQTPDVLTGT
        PRA   VFLGYP GVKG++L D+ + K+F+SRDVVFHE +FPF T  P  D TT   +   P+       S T         +AD         D+L   
Subjt:  PRALPTVFLGYPPGVKGFRLYDIQSKKIFVSRDVVFHEHLFPFHTVAPQGDPTTIFPNLVLPMTFNYSGTSFTDQATSGFPTLADGSNFDEQTPDVLTGT

Query:  TADEIPDTCARIEPEQQPQSPVVADP---TDVISQPDRVVPNSELAIVVDTVTTG-SCNQP-RHSNRTCKVPSYLRDYHCSLLF------STDIPKANHP
                C+ I P   P   +   P   +D+    D  + +S     ++  + G S + P R S R  K P+YL+DYHC L        S  I  +  P
Subjt:  TADEIPDTCARIEPEQQPQSPVVADP---TDVISQPDRVVPNSELAIVVDTVTTG-SCNQP-RHSNRTCKVPSYLRDYHCSLLF------STDIPKANHP

Query:  YPLVTYLSYDRLSPKYKSFVLSVSTHYEPDFYHQAAPFPHWRETIVLSYML----------------------W----KLTVRGLLSLFQRVITPLGAGR
        YPL T LSYD LSP +++F LSV+   EP  +HQA   PHW+E +                            W    KL   G L  ++  +   G  +
Subjt:  YPLVTYLSYDRLSPKYKSFVLSVSTHYEPDFYHQAAPFPHWRETIVLSYML----------------------W----KLTVRGLLSLFQRVITPLGAGR

Query:  DR---LPRNFSPVAKLVTVKILLSIVVSLNWPLLQLDVNNAFLHGELNEEVYMDVPLGY
                 FSPVAK  TV+ LL++  + NW L QLDVNNAFLHG+L EEVYM +PLG+
Subjt:  DR---LPRNFSPVAKLVTVKILLSIVVSLNWPLLQLDVNNAFLHGELNEEVYMDVPLGY

A0A2N9G1L6 Integrase catalytic domain-containing protein2.9e-17835.34Show/hide
Query:  SQLSRNRQASTTSTQGNMKRPGPTTRKIIGQQTSKERKDGTNLVLVSDLVTEFNYTSWCNAMIIGLTVKNKLGFVDGTI---------------------
        S++S  +  S+ ST   +  P P++   +         D ++L+LV++ +T  N+ SW  +M + LT+KNKLGFVDG+I                     
Subjt:  SQLSRNRQASTTSTQGNMKRPGPTTRKIIGQQTSKERKDGTNLVLVSDLVTEFNYTSWCNAMIIGLTVKNKLGFVDGTI---------------------

Query:  -INLTVKCGALGSFAML-----------DLQQRYQRKNRPRVFQIRREILNLVQDQDSVTTYFAKLKTLWNEFSSY--RPVCSC-GKCTCGSVKKLTEYF
         I   + C +    A +           +LQ ++ + N P++FQ+ ++I +L Q+Q+SV+ Y+  L+ LW E  +Y   PVC+C   C+CG++ K  E +
Subjt:  -INLTVKCGALGSFAML-----------DLQQRYQRKNRPRVFQIRREILNLVQDQDSVTTYFAKLKTLWNEFSSY--RPVCSC-GKCTCGSVKKLTEYF

Query:  QTEHVMTFLMGLNDSFSQILTQLLLMEPESTIQRAFSLVAQEVEQRALVNITHSSSVASNTITSPAALLVKNNPP--SRTQSSMNKKKERPHCTHCNILG
        +   VM FLMGLN+SF+ +  Q+LLM+P   I + FSL+ QE  QR++ ++  S    +       AL+ K + P     + S+  KKERP CTHC +LG
Subjt:  QTEHVMTFLMGLNDSFSQILTQLLLMEPESTIQRAFSLVAQEVEQRALVNITHSSSVASNTITSPAALLVKNNPP--SRTQSSMNKKKERPHCTHCNILG

Query:  HTVDKCYKIHDYPPGYRNQ-RVSSSKTDTPCSTVTHGSLSLADSLSSL----TPEQCQGLLAILQS---------------HLTKVSAPADTSPSTHVVG
        HTVDKCYK+H +PPGY+ + +  +    T  S     + + A+ +S L       QC+ LLA++ +                 T  +A + T P + + G
Subjt:  HTVDKCYKIHDYPPGYRNQ-RVSSSKTDTPCSTVTHGSLSLADSLSSL----TPEQCQGLLAILQS---------------HLTKVSAPADTSPSTHVVG

Query:  ------ICH-----------------VPHVSFV-TSWVLDYGASIHICHLKELFT---------------YLRPVTH---------------------NF
              IC                   PH S    SW+LD GA+ H+ H    FT                L PVTH                     +F
Subjt:  ------ICH-----------------VPHVSFV-TSWVLDYGASIHICHLKELFT---------------YLRPVTH---------------------NF

Query:  NLISISALTASQPLVVKFVENSCILQDKFSWKTIGKADHWQGLYLLKVASPVTALST----------TVLNKYFPCNNVT----WHDRLDHPSSKHLNAL
        NLIS+S L  S    + F+ + C +Q    W+ IG      GLY+L   SP   LS+          +V N     +N+T    WH RL HPS   +  L
Subjt:  NLISISALTASQPLVVKFVENSCILQDKFSWKTIGKADHWQGLYLLKVASPVTALST----------TVLNKYFPCNNVT----WHDRLDHPSSKHLNAL

Query:  GSLLQNNSVKTFSHDPCLIYPLAKQRRLSFHSNNHISGKVFDLLHCDIWGPYKTPTHAGFRYFLTLVDDHSCYTWIFLIRQKSEALQIIPRFFQLVETQY
           + +      S + C + PLAK +RL F +  H +   FDL+HCDIWGPY  PTH GF+YFLT+VDD S  TW++L+  K     ++  FF ++ETQ+
Subjt:  GSLLQNNSVKTFSHDPCLIYPLAKQRRLSFHSNNHISGKVFDLLHCDIWGPYKTPTHAGFRYFLTLVDDHSCYTWIFLIRQKSEALQIIPRFFQLVETQY

Query:  SASIKQFRSDNAPELAFTEFFASKGVIHQYSCVARPEQNSVVERKHQHLLNVTRALLFQSRVPVCFWGECVFTACYLINMTPSPLLKWETSFLRLHRKHV
           IK  RSDN  E   ++FF+SKGVIHQ SCV  P+QNSVVERKHQHLLNV RA+ FQS +P+ FWGEC+  A YLIN  P+P+L  +T +  L  K  
Subjt:  SASIKQFRSDNAPELAFTEFFASKGVIHQYSCVARPEQNSVVERKHQHLLNVTRALLFQSRVPVCFWGECVFTACYLINMTPSPLLKWETSFLRLHRKHV

Query:  DYSNIKVFGCLAFVSTLQHNRSKLQPRALPTVFLGYPPGVKGFRLYDIQSKKIFVSRDVVFHEHLFPFHTVAPQGDPTTIFPNLVLPMTFNYSGTSFTDQ
         Y+++KVFGCLA+ S L  +++K   +A P VFLGYP G KG++L D+ + + FVSRDVVFHE +FPFH      +P +   ++       +S  S  + 
Subjt:  DYSNIKVFGCLAFVSTLQHNRSKLQPRALPTVFLGYPPGVKGFRLYDIQSKKIFVSRDVVFHEHLFPFHTVAPQGDPTTIFPNLVLPMTFNYSGTSFTDQ

Query:  ATSGFPTLADGSNFDEQTPDVLTGTTADEIPDTCARIEPEQQPQSPVVA-DPTDVISQPDRVVPNSELAIVVDTVTTGSCNQPRHSNRTCKVPSYLRDYH
         +    TL   S    Q P  L                P Q+P +P  A  P  V +  D  V +S+ ++             R S+R  K PSYL+DYH
Subjt:  ATSGFPTLADGSNFDEQTPDVLTGTTADEIPDTCARIEPEQQPQSPVVA-DPTDVISQPDRVVPNSELAIVVDTVTTGSCNQPRHSNRTCKVPSYLRDYH

Query:  CSL---LFSTDIPKANHPYPLVTYLSYDRLSPKYKSFVLSVSTHYEPDFYHQAAPFPHW-------------RETIVLSYM-----------LWKLTVR-
        CSL   L S+DI  AN  YP+   LSY +LS  +K+F L++ST  EP FYH+A   PHW               T VL+ +           ++KL  + 
Subjt:  CSL---LFSTDIPKANHPYPLVTYLSYDRLSPKYKSFVLSVSTHYEPDFYHQAAPFPHW-------------RETIVLSYM-----------LWKLTVR-

Query:  -GLLSLFQRVITPLGAGRDR---LPRNFSPVAKLVTVKILLSIVVSLNWPLLQLDVNNAFLHGELNEEVYMDVPLGY
         G +  ++  +   G  +         FSPVAKLVTV+  ++I  +  WP+ QLDVNNAFLHG+L+EEV+M +P G+
Subjt:  -GLLSLFQRVITPLGAGRDR---LPRNFSPVAKLVTVKILLSIVVSLNWPLLQLDVNNAFLHGELNEEVYMDVPLGY

A0A2N9GPY9 Integrase catalytic domain-containing protein2.2e-17334.67Show/hide
Query:  SQLSRNRQASTTSTQGNMKRPGPTTRKIIGQQTSKERKDGTNLVLVSDLVTEFNYTSWCNAMIIGLTVKNKLGFVDGTI---------------------
        S++S  +  S+ ST   M  P P++   +         D ++L+LV++ +T  N+ SW  +M + LT+KNKLGF+DG+I                     
Subjt:  SQLSRNRQASTTSTQGNMKRPGPTTRKIIGQQTSKERKDGTNLVLVSDLVTEFNYTSWCNAMIIGLTVKNKLGFVDGTI---------------------

Query:  ------------INLTVKCGALGSFAMLDLQQRYQRKNRPRVFQIRREILNLVQDQDSVTTYFAKLKTLWNEFSSY--RPVCSCGK-CTCGSVKKLTEYF
                    I+ +V           +LQ+++ + N P++FQ+ +EI +L Q+Q+SV+ Y+  L+ LW E  +Y   PVCSC   C CG++ K  E +
Subjt:  ------------INLTVKCGALGSFAMLDLQQRYQRKNRPRVFQIRREILNLVQDQDSVTTYFAKLKTLWNEFSSY--RPVCSCGK-CTCGSVKKLTEYF

Query:  QTEHVMTFLMGLNDSFSQILTQLLLMEPESTIQRAFSLVAQEVEQRALVNITHSSSVASNTITSPAALLVKNNPP--SRTQSSMNKKKERPHCTHCNILG
        +   +M FLMGLN+SF  +  Q+LLM+P   I + FSL+ QE  QR++ ++  S    SN +    AL VK+  P     + S  +KKERP CTHC +LG
Subjt:  QTEHVMTFLMGLNDSFSQILTQLLLMEPESTIQRAFSLVAQEVEQRALVNITHSSSVASNTITSPAALLVKNNPP--SRTQSSMNKKKERPHCTHCNILG

Query:  HTVDKCYKIHDYPPGYRNQ-RVSSSKTDTPCSTVTHGSLSLADSLSSL----TPEQCQGLLAILQSH-----LTK------------------------V
        HT+DKCYK+H YPPGYR + +  +    T  S+    + + +D++S L       QC+  LA + S      +T                          
Subjt:  HTVDKCYKIHDYPPGYRNQ-RVSSSKTDTPCSTVTHGSLSLADSLSSL----TPEQCQGLLAILQSH-----LTK------------------------V

Query:  SAPADTSPSTHVVGICHVPHVSFV--TSWVLDYGASIHICHLKELFT---------------YLRPVTH---------------------NFNLISISAL
        S PA T   +H V   H    S +   SW+LD GA+ H+ H     T                L PVTH                     +FNLIS+S L
Subjt:  SAPADTSPSTHVVGICHVPHVSFV--TSWVLDYGASIHICHLKELFT---------------YLRPVTH---------------------NFNLISISAL

Query:  TASQPLVVKFVENSCILQDKFSWKTIGKADHWQGLYLLKVASPV---TALSTTVLNKYFPCNNVT--------------WHDRLDHPSSKHLNALGSLLQ
          S    + F+ + C +Q    W+ IG      GLYLL+  SP    T  S+ + +  F  ++V               WH RL HPS   +  L  L+ 
Subjt:  TASQPLVVKFVENSCILQDKFSWKTIGKADHWQGLYLLKVASPV---TALSTTVLNKYFPCNNVT--------------WHDRLDHPSSKHLNALGSLLQ

Query:  NNSVKTFSHDPCLIYPLAKQRRLSFHSNNHISGKVFDLLHCDIWGPYKTPTHAGFRYFLTLVDDHSCYTWIFLIRQKSEALQIIPRFFQLVETQYSASIK
        +        + C + PLAK +RL F  + H +  VFDL+HCDIWGPY   TH GF+YFLT+VDD S  TWI+L+  K++   ++  FF ++ETQ++  IK
Subjt:  NNSVKTFSHDPCLIYPLAKQRRLSFHSNNHISGKVFDLLHCDIWGPYKTPTHAGFRYFLTLVDDHSCYTWIFLIRQKSEALQIIPRFFQLVETQYSASIK

Query:  QFRSDNAPELAFTEFFASKGVIHQYSCVARPEQNSVVERKHQHLLNVTRALLFQSRVPVCFWGECVFTACYLINMTPSPLLKWETSFLRLHRKHVDYSNI
          RSDN  E   ++FF+SKGVIHQ SCV  PEQNSVVERKHQHLLNV RAL FQS VP+ FWG+ +  A YLIN  PSPLL+ +T F  L      YS++
Subjt:  QFRSDNAPELAFTEFFASKGVIHQYSCVARPEQNSVVERKHQHLLNVTRALLFQSRVPVCFWGECVFTACYLINMTPSPLLKWETSFLRLHRKHVDYSNI

Query:  KVFGCLAFVSTLQHNRSKLQPRALPTVFLGYPPGVKGFRLYDIQSKKIFVSRDVVFHEHLFPFHTVAPQGDPTTIFPNLVLPMTFNYSGTSFTDQATSG-
        KVFGCLA+ S L  +++K   RA+P VFLGYP GVKG++L+D+ +KK  VSRDVVFHE +FPF++       T++          + S ++ +  A SG 
Subjt:  KVFGCLAFVSTLQHNRSKLQPRALPTVFLGYPPGVKGFRLYDIQSKKIFVSRDVVFHEHLFPFHTVAPQGDPTTIFPNLVLPMTFNYSGTSFTDQATSG-

Query:  ------------FPTLADGSNFDEQTPDVLTGTTADEIP-------------------DTCARIEPE----QQPQSPVVADP-TDVISQP----------
                     P  +  S+     P          IP                   ++  +  PE      P+SP++  P + ++S P          
Subjt:  ------------FPTLADGSNFDEQTPDVLTGTTADEIP-------------------DTCARIEPE----QQPQSPVVADP-TDVISQP----------

Query:  ---DRVVPNSELA-IVVDTVTTGSCNQPRHSNRTCKVPSYLRDYHCSLLFSTD--IP-KANHPYPLVTYLSYDRLSPKYKSFVLSVSTHYEPDFYHQAAP
           D   P + L+  V+DT ++      R S+R  K PSYL+DYHC L  S +   P      +P+   LSY  LS  +K+F L++STH EP FYH+A  
Subjt:  ---DRVVPNSELA-IVVDTVTTGSCNQPRHSNRTCKVPSYLRDYHCSLLFSTD--IP-KANHPYPLVTYLSYDRLSPKYKSFVLSVSTHYEPDFYHQAAP

Query:  FPHWRETIVLSYMLWKLTVRGLLSLFQRVITPLGA---------GRDRLPR--------------------NFSPVAKLVTVKILLSIVVSLNWPLLQLD
         P W E +       +     +L+       P+G            D + R                     FSPVAKLV V+  ++I  +  W L QLD
Subjt:  FPHWRETIVLSYMLWKLTVRGLLSLFQRVITPLGA---------GRDRLPR--------------------NFSPVAKLVTVKILLSIVVSLNWPLLQLD

Query:  VNNAFLHGELNEEVYMDVPLGYE
        VN+AFLHGEL+EEVYM +PLGY+
Subjt:  VNNAFLHGELNEEVYMDVPLGYE

A0A2N9HYD2 Integrase catalytic domain-containing protein6.8e-17535.5Show/hide
Query:  NMKRPGPTTRKIIGQQTSKERKDGTNLVLVSDLVTEFNYTSWCNAMIIGLTVKNKLGFVDGTI----------INLTVKCGALGSFAML-----------
        N+  P    R     Q      D    +LVS  ++  NY +W  +MI+ LT KNK+GF++GTI           NL  +C  +    +L           
Subjt:  NMKRPGPTTRKIIGQQTSKERKDGTNLVLVSDLVTEFNYTSWCNAMIIGLTVKNKLGFVDGTI----------INLTVKCGALGSFAML-----------

Query:  -----------DLQQRYQRKNRPRVFQIRREILNLVQDQDSVTTYFAKLKTLWNEFSSYRPVCSCGKCTCGSVKKLTEYFQTEHVMTFLMGLNDSFSQIL
                   DL++R+ + N PR+F+I++ I +L QDQ +V+ YF KLK+LW+E ++YR   S   C+CG++K L +  Q E+VM FLMGLNDSF+ + 
Subjt:  -----------DLQQRYQRKNRPRVFQIRREILNLVQDQDSVTTYFAKLKTLWNEFSSYRPVCSCGKCTCGSVKKLTEYFQTEHVMTFLMGLNDSFSQIL

Query:  TQLLLMEPESTIQRAFSLVAQEVEQRALVNITHSSSVASNTITSPAALLVKNNPPSRTQSSMNKKKERPHCTHCNILGHTVDKCYKIHDYPPGYRNQRVS
         Q+L+MEP   I +AFSLV QE  QR+ + +T   +   +      + + +NN   R+ S    KKERP C+HC I GH VDKCYK+H +PPG++ +  S
Subjt:  TQLLLMEPESTIQRAFSLVAQEVEQRALVNITHSSSVASNTITSPAALLVKNNPPSRTQSSMNKKKERPHCTHCNILGHTVDKCYKIHDYPPGYRNQRVS

Query:  SSKTDTPCSTVTHGSLSLADSLSSLTPEQCQGLLAILQSHLTKVSA---------------------------PADTSPSTHVV---------GICH---
         +      S +   S  L      +T  QCQ LLA+L S  +  SA                            A +S + H V         GI H   
Subjt:  SSKTDTPCSTVTHGSLSLADSLSSLTPEQCQGLLAILQSHLTKVSA---------------------------PADTSPSTHVV---------GICH---

Query:  ---VPHVS------------FVTSWVLDYGASIHICHLKELFTYLRPVTH------------------------------------NFNLISISALTASQ
           +P  S            F   W++D GA+ H+ +    FT +    H                                    +FNLISI+ LT S 
Subjt:  ---VPHVS------------FVTSWVLDYGASIHICHLKELFTYLRPVTH------------------------------------NFNLISISALTASQ

Query:  PLVVKFVENSCILQDKFSWKTIGKADHWQGLYLLKVA--------SPVTALSTTVLNKYFPCNNVTWHDRLDHPSSKHLNALGSLLQNNSVKTFSHDPCL
        P  V F  + C +QD  SWK IG A    GLY+L+VA        +P TAL +          NV WH RL HPS+  LN L  ++   ++ + S   C 
Subjt:  PLVVKFVENSCILQDKFSWKTIGKADHWQGLYLLKVA--------SPVTALSTTVLNKYFPCNNVTWHDRLDHPSSKHLNALGSLLQNNSVKTFSHDPCL

Query:  IYPLAKQRRLSFHSNNHISGKVFDLLHCDIWGPYKTPTHAGFRYFLTLVDDHSCYTWIFLIRQKSEALQIIPRFFQLVETQYSASIKQFRSDNAPELAFT
        +  L+K RRL F ++ HIS   FDL+HCDIWGP+  PT    +YFLT+VDD +  TWIFL++ KSE   ++  FF LV+TQ+S+SIK  RSDN  E + T
Subjt:  IYPLAKQRRLSFHSNNHISGKVFDLLHCDIWGPYKTPTHAGFRYFLTLVDDHSCYTWIFLIRQKSEALQIIPRFFQLVETQYSASIKQFRSDNAPELAFT

Query:  EFFASKGVIHQYSCVARPEQNSVVERKHQHLLNVTRALLFQSRVPVCFWGECVFTACYLINMTPSPLLKWETSFLRLHRKHVDYSNIKVFGCLAFVSTLQ
        EF+A  G IHQ SC+A P+QNS VERKHQHLL V R+L FQ+ +P+ +WG CV TA YLIN  PSPLL  ++ +  L +    YS+++VFG L + +TL 
Subjt:  EFFASKGVIHQYSCVARPEQNSVVERKHQHLLNVTRALLFQSRVPVCFWGECVFTACYLINMTPSPLLKWETSFLRLHRKHVDYSNIKVFGCLAFVSTLQ

Query:  HNRSKLQPRALPTVFLGYPPGVKGFRLYDIQSKKIFVSRDVVFHEHLFPFHT--------------VAPQGDPTTIFPNLVLPMTFNYSGTSFTDQATSG
        HNR K  PR+   + LGYP G KG+RL D+++ ++FVSRDV+FHE +FPF                +    + +  FP  V+P +   S    TD     
Subjt:  HNRSKLQPRALPTVFLGYPPGVKGFRLYDIQSKKIFVSRDVVFHEHLFPFHT--------------VAPQGDPTTIFPNLVLPMTFNYSGTSFTDQATSG

Query:  FP---------TLADGSNFDEQT-PDVLTGTTADEIPDTCARIEPEQQPQSPVVADPTDVISQPDRVVPNSELAIVVDTVTTGSCNQPRHSNRTCKVPSY
         P         T A  ++   +T  D    T  D  P       P   P  P  + PT ++ Q   V P   L               R S R  K P+Y
Subjt:  FP---------TLADGSNFDEQT-PDVLTGTTADEIPDTCARIEPEQQPQSPVVADPTDVISQPDRVVPNSELAIVVDTVTTGSCNQPRHSNRTCKVPSY

Query:  LRDYHCSLLFSTDIP-------KANHPYPLVTYLSYDRLSPKYKSFVLSVSTHYEPDFYHQAAPFPHWRETI------VLSYMLWKLT------------
        L++YHCS   S  +P        ++  +PL   LSY  LSP YKSFVL+ ST  EP  Y++A+  PHW E +      + +   W LT            
Subjt:  LRDYHCSLLFSTDIP-------KANHPYPLVTYLSYDRLSPKYKSFVLSVSTHYEPDFYHQAAPFPHWRETI------VLSYMLWKLT------------

Query:  --------VRGLLSLFQRVITPLGAGRDR---LPRNFSPVAKLVTVKILLSIVVSLNWPLLQLDVNNAFLHGELNEEVYMDVPLGY
                  G L  ++  +   G  +         FSPVAK VTV+ LL++     W L QLDVNNAFLHG L+EEVYM +P G+
Subjt:  --------VRGLLSLFQRVITPLGAGRDR---LPRNFSPVAKLVTVKILLSIVVSLNWPLLQLDVNNAFLHGELNEEVYMDVPLGY

SwissProt top hitse value%identityAlignment
P04146 Copia protein1.6e-2722.87Show/hide
Query:  PVTALSTTVLNKYFPCNNVTWHDRLDHPSS--------KHLNALGSLLQNNSVKTFSHDPCLIYPLAKQRRLSF---HSNNHISGKVFDLLHCDIWGPYK
        PV       +N     N   WH+R  H S         K++ +  SLL N  +     +PCL     KQ RL F       HI   +F ++H D+ GP  
Subjt:  PVTALSTTVLNKYFPCNNVTWHDRLDHPSS--------KHLNALGSLLQNNSVKTFSHDPCLIYPLAKQRRLSF---HSNNHISGKVFDLLHCDIWGPYK

Query:  TPTHAGFRYFLTLVDDHSCYTWIFLIRQKSEALQIIPRFFQLVETQYSASIKQFRSDNAPELAFTE---FFASKGVIHQYSCVARPEQNSVVERKHQHLL
          T     YF+  VD  + Y   +LI+ KS+   +   F    E  ++  +     DN  E    E   F   KG+ +  +    P+ N V ER  + + 
Subjt:  TPTHAGFRYFLTLVDDHSCYTWIFLIRQKSEALQIIPRFFQLVETQYSASIKQFRSDNAPELAFTE---FFASKGVIHQYSCVARPEQNSVVERKHQHLL

Query:  NVTRALLFQSRVPVCFWGECVFTACYLINMTPSPLL--KWETSFLRLHRKHVDYSNIKVFGCLAFVSTLQHNRSKLQPRALPTVFLGYPPGVKGFRLYDI
           R ++  +++   FWGE V TA YLIN  PS  L    +T +   H K     +++VFG   +V  +++ + K   ++  ++F+GY P   GF+L+D 
Subjt:  NVTRALLFQSRVPVCFWGECVFTACYLINMTPSPLL--KWETSFLRLHRKHVDYSNIKVFGCLAFVSTLQHNRSKLQPRALPTVFLGYPPGVKGFRLYDI

Query:  QSKKIFVSRDVVFHE------HLFPFHTV---APQGDPTTIFPN---LVLPMTFNYSGTSFTDQATSGFPTLADGSNFDEQTPDVLTGTTADEIPDTCAR
         ++K  V+RDVV  E          F TV     +      FPN    ++   F        +         ++  NF   +  ++     +E  + C  
Subjt:  QSKKIFVSRDVVFHE------HLFPFHTV---APQGDPTTIFPN---LVLPMTFNYSGTSFTDQATSGFPTLADGSNFDEQTPDVLTGTTADEIPDTCAR

Query:  IE--PEQQPQSPVVADPTDVISQPDRVVPNSELAIVVDTVTTGSCNQPRHSNRTCKVPSYLRDYHCSLLFSTD----IPKANHPYPLVTYLSYDRLSPKY
        I+   + +  +    + +    + D          + ++  +G+ N+ R S    +   +L++         D    I + +        +SY+      
Subjt:  IE--PEQQPQSPVVADPTDVISQPDRVVPNSELAIVVDTVTTGSCNQPRHSNRTCKVPSYLRDYHCSLLFSTD----IPKANHPYPLVTYLSYDRLSPKY

Query:  KSFVLSVSTHYEP-----DFYHQAAPFPHWRETIVLSY------MLWKLT----------VRGLLSL-FQRVITPLGAGRDRLPR------------NFS
           VL+  T +       D          W E I            W +T           R + S+ +  +  P+      + R             F+
Subjt:  KSFVLSVSTHYEP-----DFYHQAAPFPHWRETIVLSY------MLWKLT----------VRGLLSL-FQRVITPLGAGRDRLPR------------NFS

Query:  PVAKLVTVKILLSIVVSLNWPLLQLDVNNAFLHGELNEEVYMDVPLG
        PVA++ + + +LS+V+  N  + Q+DV  AFL+G L EE+YM +P G
Subjt:  PVAKLVTVKILLSIVVSLNWPLLQLDVNNAFLHGELNEEVYMDVPLG

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-944.1e-6030.07Show/hide
Query:  WHDRLDHPSSKHLNALGSLLQNNSVKTFSHDPCLIYPLAKQRRLSFHSNNHISGKVFDLLHCDIWGPYKTPTHAGFRYFLTLVDDHSCYTWIFLIRQKSE
        WH R+ H S K L  L      +  K  +  PC      KQ R+SF +++     + DL++ D+ GP +  +  G +YF+T +DD S   W+++++ K +
Subjt:  WHDRLDHPSSKHLNALGSLLQNNSVKTFSHDPCLIYPLAKQRRLSFHSNNHISGKVFDLLHCDIWGPYKTPTHAGFRYFLTLVDDHSCYTWIFLIRQKSE

Query:  ALQIIPRFFQLVETQYSASIKQFRSDNAPELA---FTEFFASKGVIHQYSCVARPEQNSVVERKHQHLLNVTRALLFQSRVPVCFWGECVFTACYLINMT
          Q+  +F  LVE +    +K+ RSDN  E     F E+ +S G+ H+ +    P+ N V ER ++ ++   R++L  +++P  FWGE V TACYLIN +
Subjt:  ALQIIPRFFQLVETQYSASIKQFRSDNAPELA---FTEFFASKGVIHQYSCVARPEQNSVVERKHQHLLNVTRALLFQSRVPVCFWGECVFTACYLINMT

Query:  PSPLLKWETSFLRLHRKHVDYSNIKVFGCLAFVSTLQHNRSKLQPRALPTVFLGYPPGVKGFRLYDIQSKKIFVSRDVVFHEHLFPFHTVAPQGDPTT--
        PS  L +E        K V YS++KVFGC AF    +  R+KL  +++P +F+GY     G+RL+D   KK+  SRDVVF E      T A   +     
Subjt:  PSPLLKWETSFLRLHRKHVDYSNIKVFGCLAFVSTLQHNRSKLQPRALPTVFLGYPPGVKGFRLYDIQSKKIFVSRDVVFHEHLFPFHTVAPQGDPTT--

Query:  IFPNLVLPMTFNYSGTSFTDQATSGFPTLADGSNFDEQTPDVLTGTTADEIPDTCARIEPEQQPQSPVVADPTDVISQPDRVVPNSELAIVVDTVTTG-S
        I PN V            T  +TS  PT A+              +T DE+         EQ  Q      P +VI Q +++    E    V+  T G  
Subjt:  IFPNLVLPMTFNYSGTSFTDQATSGFPTLADGSNFDEQTPDVLTGTTADEIPDTCARIEPEQQPQSPVVADPTDVISQPDRVVPNSELAIVVDTVTTG-S

Query:  CNQPRHSNRTCKVPSYLRDYHCSLLFSTDIPKANHPYPLVTYLSYDRLSPKYKSFVLSVSTHYEPDFYHQAAPFPHWRETIVLSYMLWKLTVRGLLSLFQ
         +QP   +   +V S        +L S D      P  L   LS+   +   K+    + +  +   Y +    P  +  +   + ++KL   G   L +
Subjt:  CNQPRHSNRTCKVPSYLRDYHCSLLFSTDIPKANHPYPLVTYLSYDRLSPKYKSFVLSVSTHYEPDFYHQAAPFPHWRETIVLSYMLWKLTVRGLLSLFQ

Query:  RVITPLGAGRDR-----LPRNFSPVAKLVTVKILLSIVVSLNWPLLQLDVNNAFLHGELNEEVYMDVPLGYE
             +  G ++         FSPV K+ +++ +LS+  SL+  + QLDV  AFLHG+L EE+YM+ P G+E
Subjt:  RVITPLGAGRDR-----LPRNFSPVAKLVTVKILLSIVVSLNWPLLQLDVNNAFLHGELNEEVYMDVPLGYE

Q12337 Transposon Ty2-GR1 Gag-Pol polyprotein2.4e-1524.12Show/hide
Query:  FNLISISALTASQPLVVKFVENSCILQDKFSWKTIGKADHWQGLYLLK---VASPVTALSTTVLNKYFPCNNVTW---HDRLDHPSSKHL------NALG
        ++L+S+S L A+Q +   F  N+    D      I K  H    +L K   + S ++ L+   +NK    N   +   H  L H + + +      NA+ 
Subjt:  FNLISISALTASQPLVVKFVENSCILQDKFSWKTIGKADHWQGLYLLK---VASPVTALSTTVLNKYFPCNNVTW---HDRLDHPSSKHL------NALG

Query:  SLLQN----NSVKTFSHDPCLIYPLAKQR-----RLSFHSNNHISGKVFDLLHCDIWGPYKTPTHAGFRYFLTLVDDHSCYTWIFLI--RQKSEALQIIP
         L ++    ++  T+    CLI    K R     RL +      S + F  LH DI+GP      +   YF++  D+ + + W++ +  R++   L +  
Subjt:  SLLQN----NSVKTFSHDPCLIYPLAKQR-----RLSFHSNNHISGKVFDLLHCDIWGPYKTPTHAGFRYFLTLVDDHSCYTWIFLI--RQKSEALQIIP

Query:  RFFQLVETQYSASIKQFRSDNAPEL---AFTEFFASKGVIHQYSCVARPEQNSVVERKHQHLLNVTRALLFQSRVPVCFWGECVFTACYLINMTPSPLLK
             ++ Q++A +   + D   E       +FF ++G+   Y+  A    + V ER ++ LLN  R LL  S +P   W   V  +  + N   SP  +
Subjt:  RFFQLVETQYSASIKQFRSDNAPEL---AFTEFFASKGVIHQYSCVARPEQNSVVERKHQHLLNVTRALLFQSRVPVCFWGECVFTACYLINMTPSPLLK

Query:  WETSFLRLHRKHVDYSNIKVFGCLAF---VSTLQHN-RSKLQPRALPTVFLGYPPGVKGFRLYDIQSKK
              +  R+H   + + +   L F   V    HN  SK+ PR +P   L       G+ +Y    KK
Subjt:  WETSFLRLHRKHVDYSNIKVFGCLAF---VSTLQHN-RSKLQPRALPTVFLGYPPGVKGFRLYDIQSKK

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE16.8e-5526.27Show/hide
Query:  SRTQSSMNKKKERPH---CTHCNILGHTVDKCYKIHDYPPGYRNQRVSSSKTDTPCSTVTHGSLSLADSLSSLTPEQCQGLLAILQSHLTKVSAPADTSP
        S T    N  + +P+   C  C + GH+  +C ++  +        +SS  +  P               S  TP Q +  LA L S  +  +   D+  
Subjt:  SRTQSSMNKKKERPH---CTHCNILGHTVDKCYKIHDYPPGYRNQRVSSSKTDTPCSTVTHGSLSLADSLSSLTPEQCQGLLAILQSHLTKVSAPADTSP

Query:  STHVVG-----ICHVPHVSFVTSWVLDYGASIHICHL--KELFTYLRPVT-HNF--------NLISISALTASQPLVVKFVENSCILQDKFSWKTIGKAD
        + H+         H P+       V D G++I I H     L T  RP+  HN         NLIS+  L  +  + V+F   S  ++D  +   + +  
Subjt:  STHVVG-----ICHVPHVSFVTSWVLDYGASIHICHL--KELFTYLRPVT-HNF--------NLISISALTASQPLVVKFVENSCILQDKFSWKTIGKAD

Query:  HWQGLYLLKVAS--PVTALSTTVLNKYFPCNNVTWHDRLDHPSSKHLNALGS----LLQNNSVKTFSHDPCLIYPLAKQRRLSFHSNNHISGKVFDLLHC
            LY   +AS  PV+  ++          + +WH RL HP+   LN++ S     + N S K  S   CLI    K  ++ F  +   S +  + ++ 
Subjt:  HWQGLYLLKVAS--PVTALSTTVLNKYFPCNNVTWHDRLDHPSSKHLNALGS----LLQNNSVKTFSHDPCLIYPLAKQRRLSFHSNNHISGKVFDLLHC

Query:  DIWGPYKTPTHAGFRYFLTLVDDHSCYTWIFLIRQKSEALQIIPRFFQLVETQYSASIKQFRSDNAPE-LAFTEFFASKGVIHQYSCVARPEQNSVVERK
        D+W      +H  +RY++  VD  + YTW++ ++QKS+  +    F  L+E ++   I  F SDN  E +A  E+F+  G+ H  S    PE N + ERK
Subjt:  DIWGPYKTPTHAGFRYFLTLVDDHSCYTWIFLIRQKSEALQIIPRFFQLVETQYSASIKQFRSDNAPE-LAFTEFFASKGVIHQYSCVARPEQNSVVERK

Query:  HQHLLNVTRALLFQSRVPVCFWGECVFTACYLINMTPSPLLKWETSFLRLHRKHVDYSNIKVFGCLAFVSTLQHNRSKLQPRALPTVFLGYPPGVKGFRL
        H+H++     LL  + +P  +W      A YLIN  P+PLL+ E+ F +L     +Y  ++VFGC  +     +N+ KL  ++   VFLGY      +  
Subjt:  HQHLLNVTRALLFQSRVPVCFWGECVFTACYLINMTPSPLLKWETSFLRLHRKHVDYSNIKVFGCLAFVSTLQHNRSKLQPRALPTVFLGYPPGVKGFRL

Query:  YDIQSKKIFVSRDVVFHEHLFPFH----TVAP----QGDPTTIF-PNLVLP-MTFNYSGTSFTDQATSGFPTLADGSNF--DEQTPDVLTGTTADEIPDT
          +Q+ ++++SR V F E+ FPF     T++P    + + + ++ P+  LP  T      S +D   +  P  +  + F   + +   L  + +   P +
Subjt:  YDIQSKKIFVSRDVVFHEHLFPFH----TVAP----QGDPTTIF-PNLVLP-MTFNYSGTSFTDQATSGFPTLADGSNF--DEQTPDVLTGTTADEIPDT

Query:  CARIEPEQQPQSPVVADPTDVISQ-----------PDRVVPNSELAIVVDTVTTGSCNQPRHSNRTCKVPSYLRDYHCSLLFSTDIPKA-----NHPYPL
             P Q    P    PT   +Q           P    P S+LA  + T    S + P  S  T    S       S+L     P A     N+  PL
Subjt:  CARIEPEQQPQSPVVADPTDVISQ-----------PDRVVPNSELAIVVDTVTTGSCNQPRHSNRTCKVPSYLRDYHCSLLFSTDIPKA-----NHPYPL

Query:  VTYLSYDRL-------SPKYKSFVLSVSTHYEPDFYHQAAPFPHWRETI------VLSYMLWKL--------TVRGLLSLFQRVITPLGA----------
         T+    R        +PKY S  +S++   EP    QA     WR  +       +    W L        T+ G   +F +     G+          
Subjt:  VTYLSYDRL-------SPKYKSFVLSVSTHYEPDFYHQAAPFPHWRETI------VLSYMLWKL--------TVRGLLSLFQRVITPLGA----------

Query:  -GRDRLP-----RNFSPVAKLVTVKILLSIVVSLNWPLLQLDVNNAFLHGELNEEVYMDVPLGY
         G ++ P       FSPV K  +++I+L + V  +WP+ QLDVNNAFL G L ++VYM  P G+
Subjt:  -GRDRLP-----RNFSPVAKLVTVKILLSIVVSLNWPLLQLDVNNAFLHGELNEEVYMDVPLGY

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE29.1e-5225.26Show/hide
Query:  PSRTQSSMNKKKERPH---CTHCNILGHTVDKCYKIHDYPPGYRNQRVSSSKTDTPCSTVTHGSLSLADSLSSLTPEQCQGLLAILQSHLTKVSAPADTS
        PS + S  + ++ +P+   C  C++ GH+  +C ++H +      Q+                      S S  TP Q +  LA+  S     +   D+ 
Subjt:  PSRTQSSMNKKKERPH---CTHCNILGHTVDKCYKIHDYPPGYRNQRVSSSKTDTPCSTVTHGSLSLADSLSSLTPEQCQGLLAILQSHLTKVSAPADTS

Query:  PSTHVVG-----ICHVPHVSFVTSWVLDYGASIHICHL--KELFTYLRPVTHN---------FNLISISALTASQPLVVKFVENSCILQDKFSWKTIGKA
         + H+         H P+       + D G++I I H     L T  R +  N          NLIS+  L  +  + V+F   S  ++D  +   + + 
Subjt:  PSTHVVG-----ICHVPHVSFVTSWVLDYGASIHICHL--KELFTYLRPVTHN---------FNLISISALTASQPLVVKFVENSCILQDKFSWKTIGKA

Query:  DHWQGLYLLKVASPVTALSTTVLNKYFPCNNVT---WHDRLDHPSSKHLNALGS----LLQNNSVKTFSHDPCLIYPLAKQRRLSFHSNNHISGKVFDLL
             LY   +AS     S  V     PC+  T   WH RL HPS   LN++ S     + N S K  S   C I    K  ++ F ++   S K  + +
Subjt:  DHWQGLYLLKVASPVTALSTTVLNKYFPCNNVT---WHDRLDHPSSKHLNALGS----LLQNNSVKTFSHDPCLIYPLAKQRRLSFHSNNHISGKVFDLL

Query:  HCDIWGPYKTPTHAGFRYFLTLVDDHSCYTWIFLIRQKSEALQIIPRFFQLVETQYSASIKQFRSDNAPE-LAFTEFFASKGVIHQYSCVARPEQNSVVE
        + D+W      +   +RY++  VD  + YTW++ ++QKS+       F  LVE ++   I    SDN  E +   ++ +  G+ H  S    PE N + E
Subjt:  HCDIWGPYKTPTHAGFRYFLTLVDDHSCYTWIFLIRQKSEALQIIPRFFQLVETQYSASIKQFRSDNAPE-LAFTEFFASKGVIHQYSCVARPEQNSVVE

Query:  RKHQHLLNVTRALLFQSRVPVCFWGECVFTACYLINMTPSPLLKWETSFLRLHRKHVDYSNIKVFGCLAFVSTLQHNRSKLQPRALPTVFLGYPPGVKGF
        RKH+H++ +   LL  + VP  +W      A YLIN  P+PLL+ ++ F +L  +  +Y  +KVFGC  +     +NR KL+ ++    F+GY      +
Subjt:  RKHQHLLNVTRALLFQSRVPVCFWGECVFTACYLINMTPSPLLKWETSFLRLHRKHVDYSNIKVFGCLAFVSTLQHNRSKLQPRALPTVFLGYPPGVKGF

Query:  RLYDIQSKKIFVSRDVVFHEHLFPFHTV--------------APQGDPTTIFPN--LVLPMTFNYSGTSFTDQATSGFP-----TLADGSNFDEQTPDVL
            I + +++ SR V F E  FPF T               AP     T  P   LVLP          T       P     T    SN    +  + 
Subjt:  RLYDIQSKKIFVSRDVVFHEHLFPFHTV--------------APQGDPTTIFPN--LVLPMTFNYSGTSFTDQATSGFP-----TLADGSNFDEQTPDVL

Query:  TGTTADEIPDTCARIEPEQQP--------QSPVVADPTDVISQPDRVVPNSELA--------IVVDTVTTGSCNQPRHSN-RTCKVPSYLRDYHCSLLFS
        + ++++    +    +P  QP         SP++ +P      P+    NS L         I   + +    N P  S+  T  +P         +L +
Subjt:  TGTTADEIPDTCARIEPEQQP--------QSPVVADPTDVISQPDRVVPNSELA--------IVVDTVTTGSCNQPRHSN-RTCKVPSYLRDYHCSLLFS

Query:  TDIPKANHPYPLVTYLSYDRL-----SPKYK-SFVLSVSTHYEPDFYHQAAPFPHWRETI------VLSYMLWKL--------TVRGLLSLFQRVITPLG
          I + N   P+ T+    R       P  K S+  S++ + EP    QA     WR+ +       +    W L        T+ G   +F +     G
Subjt:  TDIPKANHPYPLVTYLSYDRL-----SPKYK-SFVLSVSTHYEPDFYHQAAPFPHWRETI------VLSYMLWKL--------TVRGLLSLFQRVITPLG

Query:  A-----------GRDRLP-----RNFSPVAKLVTVKILLSIVVSLNWPLLQLDVNNAFLHGELNEEVYMDVPLGY
        +           G ++ P       FSPV K  +++I+L + V  +WP+ QLDVNNAFL G L +EVYM  P G+
Subjt:  A-----------GRDRLP-----RNFSPVAKLVTVKILLSIVVSLNWPLLQLDVNNAFLHGELNEEVYMDVPLGY

Arabidopsis top hitse value%identityAlignment
AT1G21280.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Has 707 Blast hits to 705 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 703; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).7.8e-1428.34Show/hide
Query:  EFNYTSWCNAMIIGLTVKNKLGFVDGT---------IINLTVKCGALGSFAML----------------------DLQQRYQRKNRPRVFQIRREILNLV
        E NY +W       L V  K GF+DGT         +     +C A+  + ++                      DL++ +      +++Q+RR +  L 
Subjt:  EFNYTSWCNAMIIGLTVKNKLGFVDGT---------IINLTVKCGALGSFAML----------------------DLQQRYQRKNRPRVFQIRREILNLV

Query:  QDQDSVTTYFAKLKTLWNEFSSYRPV--CSCGKCTCGSVKKLTEYFQTEHVMTFLMG--LNDSFSQILTQLLLMEPESTIQRAFSLV
        Q  DSV  YF KL  +W E S Y P+  C CG C C   K+  E  + E    FLMG  LN  F  + T+++  +P  ++  AF++V
Subjt:  QDQDSVTTYFAKLKTLWNEFSSYRPV--CSCGKCTCGSVKKLTEYFQTEHVMTFLMG--LNDSFSQILTQLLLMEPESTIQRAFSLV

AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 83.7e-1627.48Show/hide
Query:  VISQPDRVVPNSELAIVVDTVTTGSCNQP--RHSNRTCKVPSYLRDYHCSLLFSTDIPKANHPYPLVTYLSYDRLSPKYKSFVLSVSTHYEPDFYHQAAP
        ++S  D    +S + I+          +P    S+R  + P+YL+DY+C  + S  I      + +  +LSY+++SP Y SF++ ++   EP  Y++A  
Subjt:  VISQPDRVVPNSELAIVVDTVTTGSCNQP--RHSNRTCKVPSYLRDYHCSLLFSTDIPKANHPYPLVTYLSYDRLSPKYKSFVLSVSTHYEPDFYHQAAP

Query:  FPHW-------------------------RETIVLSYML-WKLTVRGLLSLFQRVITPLGAGRDR---LPRNFSPVAKLVTVKILLSIVVSLNWPLLQLD
        F  W                         ++ I   ++   K    G +  ++  +   G  +         FSPV KL +VK++L+I    N+ L QLD
Subjt:  FPHW-------------------------RETIVLSYML-WKLTVRGLLSLFQRVITPLGAGRDR---LPRNFSPVAKLVTVKILLSIVVSLNWPLLQLD

Query:  VNNAFLHGELNEEVYMDVPLGY
        ++NAFL+G+L+EE+YM +P GY
Subjt:  VNNAFLHGELNEEVYMDVPLGY


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTGGCTAATTCCAACAACAATAACTGTCCAGCCACCATGCAGACCCTTCACACCAAGCAGCAAAGAGTAATATTACTTTCGTTGAAAGAGACAAAACTTAGT
GATGGATGGGATCACAATGGGCTTTGGATTGCAGTTCTTATTCATTTGTGGGTGAGTAATTTTCTCAATGGCAATTTTGGACCATCTCAGTTATCGAGGAATAGA
CAAGCCAGCACAACATCAACCCAAGGAAACATGAAAAGACCAGGTCCAACTACAAGAAAGATTATAGGACAACAGACTTCGAAAGAGAGAAAGGATGGAACTAAT
CTCGTTCTCGTTTCTGACCTCGTGACAGAATTCAATTATACTTCCTGGTGCAATGCTATGATTATTGGCTTGACAGTGAAGAACAAGTTAGGGTTCGTCGATGGT
ACTATCATCAACCTGACAGTGAAATGCGGCGCTCTTGGATCATTTGCAATGCTCGATCTTCAGCAACGGTATCAAAGAAAGAATCGCCCTCGTGTCTTTCAGATT
CGTCGAGAGATTTTGAATCTTGTCCAAGATCAAGACTCTGTCACCACATACTTTGCAAAACTCAAAACGCTCTGGAATGAATTTTCTTCCTATCGTCCTGTCTGC
AGTTGTGGGAAATGTACGTGTGGAAGTGTCAAGAAACTTACTGAGTATTTTCAGACCGAACACGTTATGACTTTTCTCATGGGATTGAATGATTCATTTAGCCAA
ATCCTCACACAATTGCTTCTTATGGAACCAGAGTCGACCATCCAACGGGCTTTTTCTTTGGTTGCGCAAGAAGTTGAACAGCGAGCTTTGGTGAATATCACTCAT
TCATCCTCTGTCGCAAGCAACACAATCACTTCGCCAGCCGCACTTCTTGTCAAGAATAATCCTCCGAGTCGAACACAATCGTCTATGAACAAGAAGAAAGAACGG
CCTCATTGCACTCACTGCAATATTTTGGGCCACACCGTGGACAAGTGTTATAAAATCCACGATTATCCACCAGGATACCGCAATCAAAGGGTCAGCTCTTCGAAG
ACAGACACTCCTTGTTCCACCGTTACTCATGGTTCCTTGTCTTTGGCTGATTCTTTATCTTCTTTAACACCTGAGCAGTGTCAAGGGTTATTGGCCATTCTCCAA
AGCCATCTCACTAAGGTTTCTGCTCCTGCTGATACTTCTCCATCAACTCATGTGGTAGGTATTTGTCATGTCCCTCATGTTTCTTTTGTTACGTCGTGGGTTCTT
GACTATGGAGCGTCTATTCATATCTGCCATTTGAAAGAATTATTTACATACTTGAGGCCTGTTACACATAATTTTAATTTGATCTCCATAAGTGCTTTAACTGCT
AGTCAACCTCTTGTTGTGAAATTTGTTGAGAATTCGTGCATTCTTCAAGACAAGTTCTCCTGGAAGACGATTGGGAAGGCTGATCATTGGCAAGGGCTTTACCTT
TTGAAAGTTGCTTCCCCCGTGACTGCTTTGAGTACTACTGTTTTGAATAAATATTTTCCTTGTAATAATGTAACATGGCATGATAGACTCGACCATCCTTCTTCT
AAGCATCTTAATGCTTTGGGTTCTTTGTTGCAAAATAACAGTGTTAAAACTTTTTCACATGATCCTTGTCTCATTTATCCTCTTGCCAAACAACGTAGGTTGTCT
TTTCATTCTAACAATCATATATCTGGCAAAGTGTTTGATCTGCTGCACTGTGATATCTGGGGCCCCTATAAGACACCAACACATGCTGGTTTTCGCTATTTTCTT
ACCTTAGTAGATGATCATTCTTGTTACACTTGGATTTTCCTCATTAGACAAAAGTCAGAGGCTCTTCAGATAATTCCTCGTTTCTTTCAACTGGTGGAAACTCAA
TACTCTGCTTCCATAAAGCAATTTAGATCAGACAACGCTCCTGAGCTTGCCTTCACAGAGTTCTTTGCTTCTAAAGGGGTGATTCATCAATACTCGTGTGTAGCT
AGACCTGAGCAAAACTCAGTTGTGGAGCGCAAACATCAGCACTTGTTGAATGTTACTCGTGCTTTATTGTTTCAATCGAGAGTCCCAGTCTGTTTTTGGGGTGAG
TGTGTCTTCACAGCATGTTACCTCATCAACATGACTCCATCTCCCTTGTTGAAATGGGAAACTTCTTTTCTTCGTCTCCACAGGAAGCATGTTGATTACTCTAAC
ATAAAAGTGTTTGGTTGTCTTGCTTTTGTCTCCACTTTGCAGCATAATAGATCTAAACTTCAACCTCGGGCTCTTCCTACTGTTTTTCTTGGTTATCCTCCAGGA
GTCAAGGGTTTTCGTTTATATGACATTCAGTCCAAGAAAATTTTTGTCTCCAGGGATGTGGTGTTCCACGAACATCTGTTTCCTTTTCATACAGTTGCTCCGCAA
GGTGATCCTACAACCATTTTTCCTAACTTAGTTTTGCCAATGACTTTCAATTATTCTGGTACAAGTTTTACTGATCAGGCTACCAGTGGTTTTCCAACACTTGCG
GATGGATCTAATTTTGATGAACAAACACCTGATGTTCTTACCGGCACTACAGCTGACGAGATTCCTGATACTTGTGCCCGCATTGAACCTGAACAACAACCTCAA
TCTCCAGTAGTTGCTGATCCTACTGATGTAATTTCACAACCTGATAGAGTTGTTCCTAACAGTGAGCTTGCTATTGTTGTTGATACTGTTACTACTGGTTCCTGC
AATCAGCCTCGACACTCTAACAGAACATGTAAGGTGCCATCTTACCTTCGGGACTACCATTGTAGTCTTTTATTCTCGACTGACATACCCAAGGCAAATCATCCG
TATCCATTGGTCACCTATCTATCTTATGATAGGCTATCTCCTAAGTACAAAAGTTTTGTTCTTAGTGTTTCAACTCACTATGAACCAGATTTTTACCATCAAGCT
GCTCCATTCCCTCATTGGCGTGAAACAATAGTGCTGAGTTACATGCTATGGAAGCTAACCGTACGTGGTCTATTGTCCCTCTTCCAAAGGGTCATCACTCCATTG
GGTGCAGGAAGGGATCGACTACCTAGAAACTTTTCACCTGTTGCAAAGTTGGTCACAGTGAAAATTCTTCTTTCCATTGTTGTTTCTCTAAACTGGCCCTTACTC
CAGCTCGACGTGAATAATGCTTTCCTCCATGGAGAGCTTAATGAGGAGGTTTATATGGATGTTCCTCTTGGTTATGAATGA
mRNA sequenceShow/hide mRNA sequence
ATGCTGGCTAATTCCAACAACAATAACTGTCCAGCCACCATGCAGACCCTTCACACCAAGCAGCAAAGAGTAATATTACTTTCGTTGAAAGAGACAAAACTTAGT
GATGGATGGGATCACAATGGGCTTTGGATTGCAGTTCTTATTCATTTGTGGGTGAGTAATTTTCTCAATGGCAATTTTGGACCATCTCAGTTATCGAGGAATAGA
CAAGCCAGCACAACATCAACCCAAGGAAACATGAAAAGACCAGGTCCAACTACAAGAAAGATTATAGGACAACAGACTTCGAAAGAGAGAAAGGATGGAACTAAT
CTCGTTCTCGTTTCTGACCTCGTGACAGAATTCAATTATACTTCCTGGTGCAATGCTATGATTATTGGCTTGACAGTGAAGAACAAGTTAGGGTTCGTCGATGGT
ACTATCATCAACCTGACAGTGAAATGCGGCGCTCTTGGATCATTTGCAATGCTCGATCTTCAGCAACGGTATCAAAGAAAGAATCGCCCTCGTGTCTTTCAGATT
CGTCGAGAGATTTTGAATCTTGTCCAAGATCAAGACTCTGTCACCACATACTTTGCAAAACTCAAAACGCTCTGGAATGAATTTTCTTCCTATCGTCCTGTCTGC
AGTTGTGGGAAATGTACGTGTGGAAGTGTCAAGAAACTTACTGAGTATTTTCAGACCGAACACGTTATGACTTTTCTCATGGGATTGAATGATTCATTTAGCCAA
ATCCTCACACAATTGCTTCTTATGGAACCAGAGTCGACCATCCAACGGGCTTTTTCTTTGGTTGCGCAAGAAGTTGAACAGCGAGCTTTGGTGAATATCACTCAT
TCATCCTCTGTCGCAAGCAACACAATCACTTCGCCAGCCGCACTTCTTGTCAAGAATAATCCTCCGAGTCGAACACAATCGTCTATGAACAAGAAGAAAGAACGG
CCTCATTGCACTCACTGCAATATTTTGGGCCACACCGTGGACAAGTGTTATAAAATCCACGATTATCCACCAGGATACCGCAATCAAAGGGTCAGCTCTTCGAAG
ACAGACACTCCTTGTTCCACCGTTACTCATGGTTCCTTGTCTTTGGCTGATTCTTTATCTTCTTTAACACCTGAGCAGTGTCAAGGGTTATTGGCCATTCTCCAA
AGCCATCTCACTAAGGTTTCTGCTCCTGCTGATACTTCTCCATCAACTCATGTGGTAGGTATTTGTCATGTCCCTCATGTTTCTTTTGTTACGTCGTGGGTTCTT
GACTATGGAGCGTCTATTCATATCTGCCATTTGAAAGAATTATTTACATACTTGAGGCCTGTTACACATAATTTTAATTTGATCTCCATAAGTGCTTTAACTGCT
AGTCAACCTCTTGTTGTGAAATTTGTTGAGAATTCGTGCATTCTTCAAGACAAGTTCTCCTGGAAGACGATTGGGAAGGCTGATCATTGGCAAGGGCTTTACCTT
TTGAAAGTTGCTTCCCCCGTGACTGCTTTGAGTACTACTGTTTTGAATAAATATTTTCCTTGTAATAATGTAACATGGCATGATAGACTCGACCATCCTTCTTCT
AAGCATCTTAATGCTTTGGGTTCTTTGTTGCAAAATAACAGTGTTAAAACTTTTTCACATGATCCTTGTCTCATTTATCCTCTTGCCAAACAACGTAGGTTGTCT
TTTCATTCTAACAATCATATATCTGGCAAAGTGTTTGATCTGCTGCACTGTGATATCTGGGGCCCCTATAAGACACCAACACATGCTGGTTTTCGCTATTTTCTT
ACCTTAGTAGATGATCATTCTTGTTACACTTGGATTTTCCTCATTAGACAAAAGTCAGAGGCTCTTCAGATAATTCCTCGTTTCTTTCAACTGGTGGAAACTCAA
TACTCTGCTTCCATAAAGCAATTTAGATCAGACAACGCTCCTGAGCTTGCCTTCACAGAGTTCTTTGCTTCTAAAGGGGTGATTCATCAATACTCGTGTGTAGCT
AGACCTGAGCAAAACTCAGTTGTGGAGCGCAAACATCAGCACTTGTTGAATGTTACTCGTGCTTTATTGTTTCAATCGAGAGTCCCAGTCTGTTTTTGGGGTGAG
TGTGTCTTCACAGCATGTTACCTCATCAACATGACTCCATCTCCCTTGTTGAAATGGGAAACTTCTTTTCTTCGTCTCCACAGGAAGCATGTTGATTACTCTAAC
ATAAAAGTGTTTGGTTGTCTTGCTTTTGTCTCCACTTTGCAGCATAATAGATCTAAACTTCAACCTCGGGCTCTTCCTACTGTTTTTCTTGGTTATCCTCCAGGA
GTCAAGGGTTTTCGTTTATATGACATTCAGTCCAAGAAAATTTTTGTCTCCAGGGATGTGGTGTTCCACGAACATCTGTTTCCTTTTCATACAGTTGCTCCGCAA
GGTGATCCTACAACCATTTTTCCTAACTTAGTTTTGCCAATGACTTTCAATTATTCTGGTACAAGTTTTACTGATCAGGCTACCAGTGGTTTTCCAACACTTGCG
GATGGATCTAATTTTGATGAACAAACACCTGATGTTCTTACCGGCACTACAGCTGACGAGATTCCTGATACTTGTGCCCGCATTGAACCTGAACAACAACCTCAA
TCTCCAGTAGTTGCTGATCCTACTGATGTAATTTCACAACCTGATAGAGTTGTTCCTAACAGTGAGCTTGCTATTGTTGTTGATACTGTTACTACTGGTTCCTGC
AATCAGCCTCGACACTCTAACAGAACATGTAAGGTGCCATCTTACCTTCGGGACTACCATTGTAGTCTTTTATTCTCGACTGACATACCCAAGGCAAATCATCCG
TATCCATTGGTCACCTATCTATCTTATGATAGGCTATCTCCTAAGTACAAAAGTTTTGTTCTTAGTGTTTCAACTCACTATGAACCAGATTTTTACCATCAAGCT
GCTCCATTCCCTCATTGGCGTGAAACAATAGTGCTGAGTTACATGCTATGGAAGCTAACCGTACGTGGTCTATTGTCCCTCTTCCAAAGGGTCATCACTCCATTG
GGTGCAGGAAGGGATCGACTACCTAGAAACTTTTCACCTGTTGCAAAGTTGGTCACAGTGAAAATTCTTCTTTCCATTGTTGTTTCTCTAAACTGGCCCTTACTC
CAGCTCGACGTGAATAATGCTTTCCTCCATGGAGAGCTTAATGAGGAGGTTTATATGGATGTTCCTCTTGGTTATGAATGA
Protein sequenceShow/hide protein sequence
MLANSNNNNCPATMQTLHTKQQRVILLSLKETKLSDGWDHNGLWIAVLIHLWVSNFLNGNFGPSQLSRNRQASTTSTQGNMKRPGPTTRKIIGQQTSKERKDGTN
LVLVSDLVTEFNYTSWCNAMIIGLTVKNKLGFVDGTIINLTVKCGALGSFAMLDLQQRYQRKNRPRVFQIRREILNLVQDQDSVTTYFAKLKTLWNEFSSYRPVC
SCGKCTCGSVKKLTEYFQTEHVMTFLMGLNDSFSQILTQLLLMEPESTIQRAFSLVAQEVEQRALVNITHSSSVASNTITSPAALLVKNNPPSRTQSSMNKKKER
PHCTHCNILGHTVDKCYKIHDYPPGYRNQRVSSSKTDTPCSTVTHGSLSLADSLSSLTPEQCQGLLAILQSHLTKVSAPADTSPSTHVVGICHVPHVSFVTSWVL
DYGASIHICHLKELFTYLRPVTHNFNLISISALTASQPLVVKFVENSCILQDKFSWKTIGKADHWQGLYLLKVASPVTALSTTVLNKYFPCNNVTWHDRLDHPSS
KHLNALGSLLQNNSVKTFSHDPCLIYPLAKQRRLSFHSNNHISGKVFDLLHCDIWGPYKTPTHAGFRYFLTLVDDHSCYTWIFLIRQKSEALQIIPRFFQLVETQ
YSASIKQFRSDNAPELAFTEFFASKGVIHQYSCVARPEQNSVVERKHQHLLNVTRALLFQSRVPVCFWGECVFTACYLINMTPSPLLKWETSFLRLHRKHVDYSN
IKVFGCLAFVSTLQHNRSKLQPRALPTVFLGYPPGVKGFRLYDIQSKKIFVSRDVVFHEHLFPFHTVAPQGDPTTIFPNLVLPMTFNYSGTSFTDQATSGFPTLA
DGSNFDEQTPDVLTGTTADEIPDTCARIEPEQQPQSPVVADPTDVISQPDRVVPNSELAIVVDTVTTGSCNQPRHSNRTCKVPSYLRDYHCSLLFSTDIPKANHP
YPLVTYLSYDRLSPKYKSFVLSVSTHYEPDFYHQAAPFPHWRETIVLSYMLWKLTVRGLLSLFQRVITPLGAGRDRLPRNFSPVAKLVTVKILLSIVVSLNWPLL
QLDVNNAFLHGELNEEVYMDVPLGYE