; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0023061 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0023061
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationchr7:43562105..43564667
RNA-Seq ExpressionLag0023061
SyntenyLag0023061
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0048297.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]2.3e-14939.61Show/hide
Query:  MESHSTEKSSSEIEVSSQAMKIINPGNKITTMKLDEDNFLLWKLQILTTLRGHGLKHHLDEDSEIPSKYLTNGDSSK-----IPNLAYDHWVRQDSLIIA
        M S S+       E SS   +I   GNKI+ +KL++D FLLWK QILT L  + L++ L+ +SE PSKYL + +SS       PN AY  W RQD LI +
Subjt:  MESHSTEKSSSEIEVSSQAMKIINPGNKITTMKLDEDNFLLWKLQILTTLRGHGLKHHLDEDSEIPSKYLTNGDSSK-----IPNLAYDHWVRQDSLIIA

Query:  WLLGSMSNSLLSEMLECETAREVWRILQNRFSSRNVARIMDLKSKLELLKKGNLKLEDYFLKVKNLVDALNAAGRKISHEDHVLKILAGLGTEYDSTVNV
        WLLGSMS  +L++ML C++A+E+W  LQ  FSSR +A+ M  K+KL  +KKG++ L++YFLK+   VDAL +  + +S +DH+L ILAGLG++Y S ++V
Subjt:  WLLGSMSNSLLSEMLECETAREVWRILQNRFSSRNVARIMDLKSKLELLKKGNLKLEDYFLKVKNLVDALNAAGRKISHEDHVLKILAGLGTEYDSTVNV

Query:  ITEKDETPPLQKVYSLLFTQESRIARNLSINPDGSIPSVNLTTQTGPK---------QQQASSGDSNNRKNSNGKGNNNRRSWNNNNKPQCQLCKRFGHT
        I+ + ++P +Q+V SLL TQES+      +  + ++PSVN+ TQT  K         Q    +  S N++   G G +NR    N NKPQCQ+C + G++
Subjt:  ITEKDETPPLQKVYSLLFTQESRIARNLSINPDGSIPSVNLTTQTGPK---------QQQASSGDSNNRKNSNGKGNNNRRSWNNNNKPQCQLCKRFGHT

Query:  VQRCYYRFERWFQGPNTNSNGQQSSSVLGPPPSGQNFQQPPMNALVVQNDLNKDNHWYPDSGASNHVTNDLSNLAIGTEYQGDNRVNVGNGA--------
          RC++R+      P +NS+G   +S      +  N   P M+A+V   DLN D++WYPDSGA+NH+T+ LSNL+IG+EY G N++   NG+        
Subjt:  VQRCYYRFERWFQGPNTNSNGQQSSSVLGPPPSGQNFQQPPMNALVVQNDLNKDNHWYPDSGASNHVTNDLSNLAIGTEYQGDNRVNVGNGA--------

Query:  ---------------------------------------------------------------GTLVDGLYQFLLEKAHP------DTTK----------
                                                                       G L DGLY+F +E +H         TK          
Subjt:  ---------------------------------------------------------------GTLVDGLYQFLLEKAHP------DTTK----------

Query:  --------------------------ISNSTS-----------------AEHTSHVLSTEAQPSKLSCTTVFDLW--------------------HNRFT
                                  I NS+                  A   SH L+    P +L      DLW                    ++R+T
Subjt:  --------------------------ISNSTS-----------------AEHTSHVLSTEAQPSKLSCTTVFDLW--------------------HNRFT

Query:  WIYFLTSKTEAFQIFKLFKTYVEKLLGRSILRLQTDNGSEFRPFTPFLKTHGIEHRFTCPYTSKQNGIVERKHRKIVEMGLTLLSHASLSLEFWDDAFST
        WIYFL SK++AF  F+ FKT VEK LG+SI  LQTD G+EF+PF PFL  HGIEHR TCPYTSKQN IVERKHR I+EMGLTLLS A+L L FWD+AFST
Subjt:  WIYFLTSKTEAFQIFKLFKTYVEKLLGRSILRLQTDNGSEFRPFTPFLKTHGIEHRFTCPYTSKQNGIVERKHRKIVEMGLTLLSHASLSLEFWDDAFST

Query:  SVYLINRLPT-VVHQLSPLEKLLGFKPDYKLLKTFGCLCYPYLRPYNHHKIEPRSLPCLFLGYSSIHKGYKCLASSGKVYISRNVLFDETKFPSSKFLDP
        SVYLINRLPT V+  +SPLEKL   KP++  L+ FGC CYPYLRPY  HK+  RS PC FLGYS+ HKGYKCLAS G+++ISR+VLFDE  FP + F   
Subjt:  SVYLINRLPT-VVHQLSPLEKLLGFKPDYKLLKTFGCLCYPYLRPYNHHKIEPRSLPCLFLGYSSIHKGYKCLASSGKVYISRNVLFDETKFPSSKFLDP

Query:  QSPVLYPTVPIMVPITYGKSLSTAVSCPTDATVTVPEISPTVPAMTLN-NHDSDHTSLSND
                             ++  S P    V  P +   +P+  +N N D  HT   +D
Subjt:  QSPVLYPTVPIMVPITYGKSLSTAVSCPTDATVTVPEISPTVPAMTLN-NHDSDHTSLSND

RVW60229.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]1.7e-11733.67Show/hide
Query:  IINPGNKITTMKLDEDNFLLWKLQILTTLRGHGLKHHLDEDSEIPSKYLTNGDSSKIPNLAYDHWVRQDSLIIAWLLGSMSNSLLSEMLECETAREVWRI
        +I+P +++ TM+L++DNFL+WK QI   +RG+GL+  L    ++P K +T+     +PN  +  + RQD L+I+WLL S+ ++ L +++ C +A EVW  
Subjt:  IINPGNKITTMKLDEDNFLLWKLQILTTLRGHGLKHHLDEDSEIPSKYLTNGDSSKIPNLAYDHWVRQDSLIIAWLLGSMSNSLLSEMLECETAREVWRI

Query:  LQNRFSSRNVARIMDLKSKLELLKKGNLKLEDYFLKVKNLVDALNAAGRKISHEDHVLKILAGLGTEYDSTVNVITEKDETPPLQKVYSLLFTQESRIAR
        +   F+S++ A++M  KS++++LKK  L + DY  K+KN  D L  AG KIS  DH+L I+ GLG EY+S + VI+ K  +P LQ V S L   E RIA 
Subjt:  LQNRFSSRNVARIMDLKSKLELLKKGNLKLEDYFLKVKNLVDALNAAGRKISHEDHVLKILAGLGTEYDSTVNVITEKDETPPLQKVYSLLFTQESRIAR

Query:  NLSINPDGSIPSVNLTTQTGPKQQQASSG-DSNNRKNSNGKGNN--NRRSWNNNN-----------KPQCQLCKRFGHTVQRCYYRFERWFQGPNTNSNG
         +S N D S+   +  +  GP     S+G  S+  +N N  G N   R S+ +N            KPQCQLC +FGHTV RC+YR++  F G N  +NG
Subjt:  NLSINPDGSIPSVNLTTQTGPKQQQASSG-DSNNRKNSNGKGNN--NRRSWNNNN-----------KPQCQLCKRFGHTVQRCYYRFERWFQGPNTNSNG

Query:  QQSSSVLGPP--------------------PSGQNFQQPPMNALVVQNDLNKDNHWYPDSGASNHVTNDLSNLAIGTEYQGDNRVNVGNGA---------
          +  VLG                       + +N     M A+V   +  ++  W+PDSGA+NHVT+DL NL  G EY G++++++GNG          
Subjt:  QQSSSVLGPP--------------------PSGQNFQQPPMNALVVQNDLNKDNHWYPDSGASNHVTNDLSNLAIGTEYQGDNRVNVGNGA---------

Query:  --------------------------------------------------------------GTLVDGLYQFLLEK------------------------
                                                                      G L  GLYQF L K                        
Subjt:  --------------------------------------------------------------GTLVDGLYQFLLEK------------------------

Query:  -AHPDTTKISNSTSAEH-------------TSHVLSTEAQPSKLSCTT----------------------------------VFDLW-------------
          H D +     T++                S +++     +K+  +T                                  V DLW             
Subjt:  -AHPDTTKISNSTSAEH-------------TSHVLSTEAQPSKLSCTT----------------------------------VFDLW-------------

Query:  -------HNRFTWIYFLTSKTEAFQIFKLFKTYVEKLLGRSILRLQTDNGSEFRPFTPFLKTHGIEHRFTCPYTSKQNGIVERKHRKIVEMGLTLLSHAS
               ++R+TW+YFL +K++  + F +FK   E   G  +   QTD G EFR    + + +GI HR +CP+TSKQNGI+ERKHR IVE+GLTLL+ AS
Subjt:  -------HNRFTWIYFLTSKTEAFQIFKLFKTYVEKLLGRSILRLQTDNGSEFRPFTPFLKTHGIEHRFTCPYTSKQNGIVERKHRKIVEMGLTLLSHAS

Query:  LSLEFWDDAFSTSVYLINRLPT-VVHQLSPLEKLLGFKPDYKLLKTFGCLCYPYLRPYNHHKIEPRSLPCLFLGYSSIHKGYKCLASSGKVYISRNVLFD
        L L++W DAFST+V+LINRLPT V+ Q  P E L   KP+Y  LK FGCLC+P+LRPYN HK++ RS PC FLGYSS HKGYKCL   G+++ISR+V+FD
Subjt:  LSLEFWDDAFSTSVYLINRLPT-VVHQLSPLEKLLGFKPDYKLLKTFGCLCYPYLRPYNHHKIEPRSLPCLFLGYSSIHKGYKCLASSGKVYISRNVLFD

Query:  ETKFPSSKFLD------PQSPVLYPTVPIMVPITYGKSLSTAVSCPTDATVTVPEISPTVPA------MTLNNHDSDHT
        ET+FP +  L         S V  P +P++  +    S+S ++S PT +  +  ++   + +        L+N DS  T
Subjt:  ETKFPSSKFLD------PQSPVLYPTVPIMVPITYGKSLSTAVSCPTDATVTVPEISPTVPA------MTLNNHDSDHT

RVW80632.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera]1.7e-11734.84Show/hide
Query:  KLDEDNFLLWKLQILTTLRGHGLKHHLDEDSEIPSKYLTNGDSSK-IPNLAYDHWVRQDSLIIAWLLGSMSNSLLSEMLECETAREVWRILQNRFSSRNV
        KLD  NFL+W+ QILTTLRGH L+H L E S +PS++L++ D ++   N  +  W +QD LI++WLL S++++LL+ M+ C+T+ +VW+ L+  F+++  
Subjt:  KLDEDNFLLWKLQILTTLRGHGLKHHLDEDSEIPSKYLTNGDSSK-IPNLAYDHWVRQDSLIIAWLLGSMSNSLLSEMLECETAREVWRILQNRFSSRNV

Query:  ARIMDLKSKLELLKKGNLKLEDYFLKVKNLVDALNAAGRKISHEDHVLKILAGLGTEYDSTVNVITEKDETPPLQKVYSLLFTQESRIARNLSINPDGSI
        A++   K++L   KKG+L + DY LK++N+VD L   G KIS +DH+  I  GL  +Y++ +  +  + +   ++++  LL  QESRI +N+ I  D S 
Subjt:  ARIMDLKSKLELLKKGNLKLEDYFLKVKNLVDALNAAGRKISHEDHVLKILAGLGTEYDSTVNVITEKDETPPLQKVYSLLFTQESRIARNLSINPDGSI

Query:  PSVN---LTTQTGPK--QQQASSGDSNNRK---------------NSNGKGNNNRRSWNNNNKPQCQLCKRFGHTVQRCYYRFERWFQGPN-TNSNGQQS
        PS+     T + G      +AS+ +SN R                   G+G + R SW  NNKPQCQLC R GH V +CYYRF++ F GP+    N  Q 
Subjt:  PSVN---LTTQTGPK--QQQASSGDSNNRK---------------NSNGKGNNNRRSWNNNNKPQCQLCKRFGHTVQRCYYRFERWFQGPN-TNSNGQQS

Query:  SSVLGPPPSGQNFQQPPMNALVVQNDLNKDNHWYPDSGASNHVTNDLSNLAIGTEYQGDNRVNVGNGAG-------------TLVDGLYQFLLEKAH-PD
        +         +NF     +      ++ +DN+WYPDSGA++H+T +L+NL   +++   + V VGNG G             + +      L +  H P+
Subjt:  SSVLGPPPSGQNFQQPPMNALVVQNDLNKDNHWYPDSGASNHVTNDLSNLAIGTEYQGDNRVNVGNGAG-------------TLVDGLYQFLLEKAH-PD

Query:  TTK--------------------------------------------ISNSTSAE---HTSHVLSTEAQPSK-----LSCTTVFDLWHN-----------
         TK                                            + ++T  +   H S   ++ A PSK      S T+ F LWHN           
Subjt:  TTK--------------------------------------------ISNSTSAE---HTSHVLSTEAQPSK-----LSCTTVFDLWHN-----------

Query:  -------------------------------------------------------------------------RFTWIYFLTSKTEAFQIFKLFKTYVEK
                                                                                 RFTWIY L  K+EAFQ+F  FK+ VE 
Subjt:  -------------------------------------------------------------------------RFTWIYFLTSKTEAFQIFKLFKTYVEK

Query:  LLGRSILRLQTDNGSEFRPFTPFLKTHGIEHRFTCPYTSKQNGIVERKHRKIVEMGLTLLSHASLSLEFWDDAFSTSVYLINRLPT-VVHQLSPLEKLLG
         LG  I  +Q+D G E+R FT +L ++GI HR +CPYT +QNG+ ERKHR IVE G+ LL+ ASL  ++WD+AF TSVYLINRLPT V+   SPLE L  
Subjt:  LLGRSILRLQTDNGSEFRPFTPFLKTHGIEHRFTCPYTSKQNGIVERKHRKIVEMGLTLLSHASLSLEFWDDAFSTSVYLINRLPT-VVHQLSPLEKLLG

Query:  FKPDYKLLKTFGCLCYPYLRPYNHHKIEPRSLPCLFLGYSSIHKGYKCLASSGKVYISRNVLFDETKFPSSKFLDPQSPVLYPTVPIMVPITYGKSLSTA
         KP Y  LK FGC+CYP LRP+NHHK++ RS+PC FLGYS  HKGYKCL+ +G + ISR+V+FDE  FP   F   QS             +   S ST+
Subjt:  FKPDYKLLKTFGCLCYPYLRPYNHHKIEPRSLPCLFLGYSSIHKGYKCLASSGKVYISRNVLFDETKFPSSKFLDPQSPVLYPTVPIMVPITYGKSLSTA

Query:  VSCPTDATVTVPEISPTVPAMTLNNHDSDHTSLSNDPS
        + C T           ++P M L +  S  TS   +PS
Subjt:  VSCPTDATVTVPEISPTVPAMTLNNHDSDHTSLSNDPS

RVX03305.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]1.8e-12236.08Show/hide
Query:  KLDEDNFLLWKLQILTTLRGHGLKHHLDEDSEIPSKYLTNGDSSK-IPNLAYDHWVRQDSLIIAWLLGSMSNSLLSEMLECETAREVWRILQNRFSSRNV
        KLD  NFL+W+ QILTTLRGH L+H L E S +PS++L++ D ++   N  +  W +QD LI++WLL S++++LL+ M+ C+T+ +VW+ L+  F+++  
Subjt:  KLDEDNFLLWKLQILTTLRGHGLKHHLDEDSEIPSKYLTNGDSSK-IPNLAYDHWVRQDSLIIAWLLGSMSNSLLSEMLECETAREVWRILQNRFSSRNV

Query:  ARIMDLKSKLELLKKGNLKLEDYFLKVKNLVDALNAAGRKISHEDHVLKILAGLGTEYDSTVNVITEKDETPPLQKVYSLLFTQESRIARNLSINPDGSI
        A++   K++L   KKG+L + DY LK++N+VD L   G KIS +DH+  I  GL  +Y++ +  +  + +   ++++ +LL  QESRI +N+ I  D S 
Subjt:  ARIMDLKSKLELLKKGNLKLEDYFLKVKNLVDALNAAGRKISHEDHVLKILAGLGTEYDSTVNVITEKDETPPLQKVYSLLFTQESRIARNLSINPDGSI

Query:  PSVN---LTTQTGPK--QQQASSGDSNNRK---------------NSNGKGNNNRRSWNNNNKPQCQLCKRFGHTVQRCYYRFERWFQGPN-TNSNGQQS
        PS+     T + G      +AS+ +SN R                   G+G + R  W  NNKPQCQLC R GH V +CYYRF++ F GP+    N  Q 
Subjt:  PSVN---LTTQTGPK--QQQASSGDSNNRK---------------NSNGKGNNNRRSWNNNNKPQCQLCKRFGHTVQRCYYRFERWFQGPN-TNSNGQQS

Query:  SSVLGPPPSGQNFQQPPMNALVVQNDLNKDNHWYPDSGASNHVTNDLSNLAIGTEYQGDNRVNVGNGAGTLVDGL------YQFLLEKA-----------
        +         +NF    ++      ++ +DN+WY DSGA++H+T +L+NL   +++   + V VGNG G  +  +        F+  K            
Subjt:  SSVLGPPPSGQNFQQPPMNALVVQNDLNKDNHWYPDSGASNHVTNDLSNLAIGTEYQGDNRVNVGNGAGTLVDGL------YQFLLEKA-----------

Query:  ---------------------HPDTTKISNSTSAE-----------------------HTSHVLSTEAQPSK-----LSCTTVFDLWHN-----------
                             HP +  + + ++                         H S   ++ A PSK      S T+ F LWHN           
Subjt:  ---------------------HPDTTKISNSTSAE-----------------------HTSHVLSTEAQPSK-----LSCTTVFDLWHN-----------

Query:  -------------------------RFTWIYFLTSKTEAFQIFKLFKTYVEKLLGRSILRLQTDNGSEFRPFTPFLKTHGIEHRFTCPYTSKQNGIVERK
                                 RFTWIY L  K+EAFQ+F  FK+ VE  LG  I  +Q+D G E+R FT +L ++GI HR +CPYT +QNG+ ERK
Subjt:  -------------------------RFTWIYFLTSKTEAFQIFKLFKTYVEKLLGRSILRLQTDNGSEFRPFTPFLKTHGIEHRFTCPYTSKQNGIVERK

Query:  HRKIVEMGLTLLSHASLSLEFWDDAFSTSVYLINRLPT-VVHQLSPLEKLLGFKPDYKLLKTFGCLCYPYLRPYNHHKIEPRSLPCLFLGYSSIHKGYKC
        HR IVE G+ LL+ ASL  ++WD+AF TSVYLINRLPT V+   SPLE L   KP Y  LK FGC+CYP LRP+NHHK++ RS+PC FLGYS  HKGYKC
Subjt:  HRKIVEMGLTLLSHASLSLEFWDDAFSTSVYLINRLPT-VVHQLSPLEKLLGFKPDYKLLKTFGCLCYPYLRPYNHHKIEPRSLPCLFLGYSSIHKGYKC

Query:  LASSGKVYISRNVLFDETKFPSSKFLDPQSPVLYPTVPIMVPITYGKSLSTAVSCPTDATVTVPEISPTVPAMTLNNHDSDHTSLSNDPS
        L+ +G + ISR+V+FDE  FP ++    +              +   S ST++ C T           ++P M L +  S  TS   +PS
Subjt:  LASSGKVYISRNVLFDETKFPSSKFLDPQSPVLYPTVPIMVPITYGKSLSTAVSCPTDATVTVPEISPTVPAMTLNNHDSDHTSLSNDPS

TYK10642.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]2.3e-14939.61Show/hide
Query:  MESHSTEKSSSEIEVSSQAMKIINPGNKITTMKLDEDNFLLWKLQILTTLRGHGLKHHLDEDSEIPSKYLTNGDSSK-----IPNLAYDHWVRQDSLIIA
        M S S+       E SS   +I   GNKI+ +KL++D FLLWK QILT L  + L++ L+ +SE PSKYL + +SS       PN AY  W RQD LI +
Subjt:  MESHSTEKSSSEIEVSSQAMKIINPGNKITTMKLDEDNFLLWKLQILTTLRGHGLKHHLDEDSEIPSKYLTNGDSSK-----IPNLAYDHWVRQDSLIIA

Query:  WLLGSMSNSLLSEMLECETAREVWRILQNRFSSRNVARIMDLKSKLELLKKGNLKLEDYFLKVKNLVDALNAAGRKISHEDHVLKILAGLGTEYDSTVNV
        WLLGSMS  +L++ML C++A+E+W  LQ  FSSR +A+ M  K+KL  +KKG++ L++YFLK+   VDAL +  + +S +DH+L ILAGLG++Y S ++V
Subjt:  WLLGSMSNSLLSEMLECETAREVWRILQNRFSSRNVARIMDLKSKLELLKKGNLKLEDYFLKVKNLVDALNAAGRKISHEDHVLKILAGLGTEYDSTVNV

Query:  ITEKDETPPLQKVYSLLFTQESRIARNLSINPDGSIPSVNLTTQTGPK---------QQQASSGDSNNRKNSNGKGNNNRRSWNNNNKPQCQLCKRFGHT
        I+ + ++P +Q+V SLL TQES+      +  + ++PSVN+ TQT  K         Q    +  S N++   G G +NR    N NKPQCQ+C + G++
Subjt:  ITEKDETPPLQKVYSLLFTQESRIARNLSINPDGSIPSVNLTTQTGPK---------QQQASSGDSNNRKNSNGKGNNNRRSWNNNNKPQCQLCKRFGHT

Query:  VQRCYYRFERWFQGPNTNSNGQQSSSVLGPPPSGQNFQQPPMNALVVQNDLNKDNHWYPDSGASNHVTNDLSNLAIGTEYQGDNRVNVGNGA--------
          RC++R+      P +NS+G   +S      +  N   P M+A+V   DLN D++WYPDSGA+NH+T+ LSNL+IG+EY G N++   NG+        
Subjt:  VQRCYYRFERWFQGPNTNSNGQQSSSVLGPPPSGQNFQQPPMNALVVQNDLNKDNHWYPDSGASNHVTNDLSNLAIGTEYQGDNRVNVGNGA--------

Query:  ---------------------------------------------------------------GTLVDGLYQFLLEKAHP------DTTK----------
                                                                       G L DGLY+F +E +H         TK          
Subjt:  ---------------------------------------------------------------GTLVDGLYQFLLEKAHP------DTTK----------

Query:  --------------------------ISNSTS-----------------AEHTSHVLSTEAQPSKLSCTTVFDLW--------------------HNRFT
                                  I NS+                  A   SH L+    P +L      DLW                    ++R+T
Subjt:  --------------------------ISNSTS-----------------AEHTSHVLSTEAQPSKLSCTTVFDLW--------------------HNRFT

Query:  WIYFLTSKTEAFQIFKLFKTYVEKLLGRSILRLQTDNGSEFRPFTPFLKTHGIEHRFTCPYTSKQNGIVERKHRKIVEMGLTLLSHASLSLEFWDDAFST
        WIYFL SK++AF  F+ FKT VEK LG+SI  LQTD G+EF+PF PFL  HGIEHR TCPYTSKQN IVERKHR I+EMGLTLLS A+L L FWD+AFST
Subjt:  WIYFLTSKTEAFQIFKLFKTYVEKLLGRSILRLQTDNGSEFRPFTPFLKTHGIEHRFTCPYTSKQNGIVERKHRKIVEMGLTLLSHASLSLEFWDDAFST

Query:  SVYLINRLPT-VVHQLSPLEKLLGFKPDYKLLKTFGCLCYPYLRPYNHHKIEPRSLPCLFLGYSSIHKGYKCLASSGKVYISRNVLFDETKFPSSKFLDP
        SVYLINRLPT V+  +SPLEKL   KP++  L+ FGC CYPYLRPY  HK+  RS PC FLGYS+ HKGYKCLAS G+++ISR+VLFDE  FP + F   
Subjt:  SVYLINRLPT-VVHQLSPLEKLLGFKPDYKLLKTFGCLCYPYLRPYNHHKIEPRSLPCLFLGYSSIHKGYKCLASSGKVYISRNVLFDETKFPSSKFLDP

Query:  QSPVLYPTVPIMVPITYGKSLSTAVSCPTDATVTVPEISPTVPAMTLN-NHDSDHTSLSND
                             ++  S P    V  P +   +P+  +N N D  HT   +D
Subjt:  QSPVLYPTVPIMVPITYGKSLSTAVSCPTDATVTVPEISPTVPAMTLN-NHDSDHTSLSND

TrEMBL top hitse value%identityAlignment
A0A438FJP6 Retrovirus-related Pol polyprotein from transposon TNT 1-948.5e-11833.67Show/hide
Query:  IINPGNKITTMKLDEDNFLLWKLQILTTLRGHGLKHHLDEDSEIPSKYLTNGDSSKIPNLAYDHWVRQDSLIIAWLLGSMSNSLLSEMLECETAREVWRI
        +I+P +++ TM+L++DNFL+WK QI   +RG+GL+  L    ++P K +T+     +PN  +  + RQD L+I+WLL S+ ++ L +++ C +A EVW  
Subjt:  IINPGNKITTMKLDEDNFLLWKLQILTTLRGHGLKHHLDEDSEIPSKYLTNGDSSKIPNLAYDHWVRQDSLIIAWLLGSMSNSLLSEMLECETAREVWRI

Query:  LQNRFSSRNVARIMDLKSKLELLKKGNLKLEDYFLKVKNLVDALNAAGRKISHEDHVLKILAGLGTEYDSTVNVITEKDETPPLQKVYSLLFTQESRIAR
        +   F+S++ A++M  KS++++LKK  L + DY  K+KN  D L  AG KIS  DH+L I+ GLG EY+S + VI+ K  +P LQ V S L   E RIA 
Subjt:  LQNRFSSRNVARIMDLKSKLELLKKGNLKLEDYFLKVKNLVDALNAAGRKISHEDHVLKILAGLGTEYDSTVNVITEKDETPPLQKVYSLLFTQESRIAR

Query:  NLSINPDGSIPSVNLTTQTGPKQQQASSG-DSNNRKNSNGKGNN--NRRSWNNNN-----------KPQCQLCKRFGHTVQRCYYRFERWFQGPNTNSNG
         +S N D S+   +  +  GP     S+G  S+  +N N  G N   R S+ +N            KPQCQLC +FGHTV RC+YR++  F G N  +NG
Subjt:  NLSINPDGSIPSVNLTTQTGPKQQQASSG-DSNNRKNSNGKGNN--NRRSWNNNN-----------KPQCQLCKRFGHTVQRCYYRFERWFQGPNTNSNG

Query:  QQSSSVLGPP--------------------PSGQNFQQPPMNALVVQNDLNKDNHWYPDSGASNHVTNDLSNLAIGTEYQGDNRVNVGNGA---------
          +  VLG                       + +N     M A+V   +  ++  W+PDSGA+NHVT+DL NL  G EY G++++++GNG          
Subjt:  QQSSSVLGPP--------------------PSGQNFQQPPMNALVVQNDLNKDNHWYPDSGASNHVTNDLSNLAIGTEYQGDNRVNVGNGA---------

Query:  --------------------------------------------------------------GTLVDGLYQFLLEK------------------------
                                                                      G L  GLYQF L K                        
Subjt:  --------------------------------------------------------------GTLVDGLYQFLLEK------------------------

Query:  -AHPDTTKISNSTSAEH-------------TSHVLSTEAQPSKLSCTT----------------------------------VFDLW-------------
          H D +     T++                S +++     +K+  +T                                  V DLW             
Subjt:  -AHPDTTKISNSTSAEH-------------TSHVLSTEAQPSKLSCTT----------------------------------VFDLW-------------

Query:  -------HNRFTWIYFLTSKTEAFQIFKLFKTYVEKLLGRSILRLQTDNGSEFRPFTPFLKTHGIEHRFTCPYTSKQNGIVERKHRKIVEMGLTLLSHAS
               ++R+TW+YFL +K++  + F +FK   E   G  +   QTD G EFR    + + +GI HR +CP+TSKQNGI+ERKHR IVE+GLTLL+ AS
Subjt:  -------HNRFTWIYFLTSKTEAFQIFKLFKTYVEKLLGRSILRLQTDNGSEFRPFTPFLKTHGIEHRFTCPYTSKQNGIVERKHRKIVEMGLTLLSHAS

Query:  LSLEFWDDAFSTSVYLINRLPT-VVHQLSPLEKLLGFKPDYKLLKTFGCLCYPYLRPYNHHKIEPRSLPCLFLGYSSIHKGYKCLASSGKVYISRNVLFD
        L L++W DAFST+V+LINRLPT V+ Q  P E L   KP+Y  LK FGCLC+P+LRPYN HK++ RS PC FLGYSS HKGYKCL   G+++ISR+V+FD
Subjt:  LSLEFWDDAFSTSVYLINRLPT-VVHQLSPLEKLLGFKPDYKLLKTFGCLCYPYLRPYNHHKIEPRSLPCLFLGYSSIHKGYKCLASSGKVYISRNVLFD

Query:  ETKFPSSKFLD------PQSPVLYPTVPIMVPITYGKSLSTAVSCPTDATVTVPEISPTVPA------MTLNNHDSDHT
        ET+FP +  L         S V  P +P++  +    S+S ++S PT +  +  ++   + +        L+N DS  T
Subjt:  ETKFPSSKFLD------PQSPVLYPTVPIMVPITYGKSLSTAVSCPTDATVTVPEISPTVPA------MTLNNHDSDHT

A0A438H844 Retrovirus-related Pol polyprotein from transposon RE18.5e-11834.84Show/hide
Query:  KLDEDNFLLWKLQILTTLRGHGLKHHLDEDSEIPSKYLTNGDSSK-IPNLAYDHWVRQDSLIIAWLLGSMSNSLLSEMLECETAREVWRILQNRFSSRNV
        KLD  NFL+W+ QILTTLRGH L+H L E S +PS++L++ D ++   N  +  W +QD LI++WLL S++++LL+ M+ C+T+ +VW+ L+  F+++  
Subjt:  KLDEDNFLLWKLQILTTLRGHGLKHHLDEDSEIPSKYLTNGDSSK-IPNLAYDHWVRQDSLIIAWLLGSMSNSLLSEMLECETAREVWRILQNRFSSRNV

Query:  ARIMDLKSKLELLKKGNLKLEDYFLKVKNLVDALNAAGRKISHEDHVLKILAGLGTEYDSTVNVITEKDETPPLQKVYSLLFTQESRIARNLSINPDGSI
        A++   K++L   KKG+L + DY LK++N+VD L   G KIS +DH+  I  GL  +Y++ +  +  + +   ++++  LL  QESRI +N+ I  D S 
Subjt:  ARIMDLKSKLELLKKGNLKLEDYFLKVKNLVDALNAAGRKISHEDHVLKILAGLGTEYDSTVNVITEKDETPPLQKVYSLLFTQESRIARNLSINPDGSI

Query:  PSVN---LTTQTGPK--QQQASSGDSNNRK---------------NSNGKGNNNRRSWNNNNKPQCQLCKRFGHTVQRCYYRFERWFQGPN-TNSNGQQS
        PS+     T + G      +AS+ +SN R                   G+G + R SW  NNKPQCQLC R GH V +CYYRF++ F GP+    N  Q 
Subjt:  PSVN---LTTQTGPK--QQQASSGDSNNRK---------------NSNGKGNNNRRSWNNNNKPQCQLCKRFGHTVQRCYYRFERWFQGPN-TNSNGQQS

Query:  SSVLGPPPSGQNFQQPPMNALVVQNDLNKDNHWYPDSGASNHVTNDLSNLAIGTEYQGDNRVNVGNGAG-------------TLVDGLYQFLLEKAH-PD
        +         +NF     +      ++ +DN+WYPDSGA++H+T +L+NL   +++   + V VGNG G             + +      L +  H P+
Subjt:  SSVLGPPPSGQNFQQPPMNALVVQNDLNKDNHWYPDSGASNHVTNDLSNLAIGTEYQGDNRVNVGNGAG-------------TLVDGLYQFLLEKAH-PD

Query:  TTK--------------------------------------------ISNSTSAE---HTSHVLSTEAQPSK-----LSCTTVFDLWHN-----------
         TK                                            + ++T  +   H S   ++ A PSK      S T+ F LWHN           
Subjt:  TTK--------------------------------------------ISNSTSAE---HTSHVLSTEAQPSK-----LSCTTVFDLWHN-----------

Query:  -------------------------------------------------------------------------RFTWIYFLTSKTEAFQIFKLFKTYVEK
                                                                                 RFTWIY L  K+EAFQ+F  FK+ VE 
Subjt:  -------------------------------------------------------------------------RFTWIYFLTSKTEAFQIFKLFKTYVEK

Query:  LLGRSILRLQTDNGSEFRPFTPFLKTHGIEHRFTCPYTSKQNGIVERKHRKIVEMGLTLLSHASLSLEFWDDAFSTSVYLINRLPT-VVHQLSPLEKLLG
         LG  I  +Q+D G E+R FT +L ++GI HR +CPYT +QNG+ ERKHR IVE G+ LL+ ASL  ++WD+AF TSVYLINRLPT V+   SPLE L  
Subjt:  LLGRSILRLQTDNGSEFRPFTPFLKTHGIEHRFTCPYTSKQNGIVERKHRKIVEMGLTLLSHASLSLEFWDDAFSTSVYLINRLPT-VVHQLSPLEKLLG

Query:  FKPDYKLLKTFGCLCYPYLRPYNHHKIEPRSLPCLFLGYSSIHKGYKCLASSGKVYISRNVLFDETKFPSSKFLDPQSPVLYPTVPIMVPITYGKSLSTA
         KP Y  LK FGC+CYP LRP+NHHK++ RS+PC FLGYS  HKGYKCL+ +G + ISR+V+FDE  FP   F   QS             +   S ST+
Subjt:  FKPDYKLLKTFGCLCYPYLRPYNHHKIEPRSLPCLFLGYSSIHKGYKCLASSGKVYISRNVLFDETKFPSSKFLDPQSPVLYPTVPIMVPITYGKSLSTA

Query:  VSCPTDATVTVPEISPTVPAMTLNNHDSDHTSLSNDPS
        + C T           ++P M L +  S  TS   +PS
Subjt:  VSCPTDATVTVPEISPTVPAMTLNNHDSDHTSLSNDPS

A0A438J300 Retrovirus-related Pol polyprotein from transposon TNT 1-948.7e-12336.08Show/hide
Query:  KLDEDNFLLWKLQILTTLRGHGLKHHLDEDSEIPSKYLTNGDSSK-IPNLAYDHWVRQDSLIIAWLLGSMSNSLLSEMLECETAREVWRILQNRFSSRNV
        KLD  NFL+W+ QILTTLRGH L+H L E S +PS++L++ D ++   N  +  W +QD LI++WLL S++++LL+ M+ C+T+ +VW+ L+  F+++  
Subjt:  KLDEDNFLLWKLQILTTLRGHGLKHHLDEDSEIPSKYLTNGDSSK-IPNLAYDHWVRQDSLIIAWLLGSMSNSLLSEMLECETAREVWRILQNRFSSRNV

Query:  ARIMDLKSKLELLKKGNLKLEDYFLKVKNLVDALNAAGRKISHEDHVLKILAGLGTEYDSTVNVITEKDETPPLQKVYSLLFTQESRIARNLSINPDGSI
        A++   K++L   KKG+L + DY LK++N+VD L   G KIS +DH+  I  GL  +Y++ +  +  + +   ++++ +LL  QESRI +N+ I  D S 
Subjt:  ARIMDLKSKLELLKKGNLKLEDYFLKVKNLVDALNAAGRKISHEDHVLKILAGLGTEYDSTVNVITEKDETPPLQKVYSLLFTQESRIARNLSINPDGSI

Query:  PSVN---LTTQTGPK--QQQASSGDSNNRK---------------NSNGKGNNNRRSWNNNNKPQCQLCKRFGHTVQRCYYRFERWFQGPN-TNSNGQQS
        PS+     T + G      +AS+ +SN R                   G+G + R  W  NNKPQCQLC R GH V +CYYRF++ F GP+    N  Q 
Subjt:  PSVN---LTTQTGPK--QQQASSGDSNNRK---------------NSNGKGNNNRRSWNNNNKPQCQLCKRFGHTVQRCYYRFERWFQGPN-TNSNGQQS

Query:  SSVLGPPPSGQNFQQPPMNALVVQNDLNKDNHWYPDSGASNHVTNDLSNLAIGTEYQGDNRVNVGNGAGTLVDGL------YQFLLEKA-----------
        +         +NF    ++      ++ +DN+WY DSGA++H+T +L+NL   +++   + V VGNG G  +  +        F+  K            
Subjt:  SSVLGPPPSGQNFQQPPMNALVVQNDLNKDNHWYPDSGASNHVTNDLSNLAIGTEYQGDNRVNVGNGAGTLVDGL------YQFLLEKA-----------

Query:  ---------------------HPDTTKISNSTSAE-----------------------HTSHVLSTEAQPSK-----LSCTTVFDLWHN-----------
                             HP +  + + ++                         H S   ++ A PSK      S T+ F LWHN           
Subjt:  ---------------------HPDTTKISNSTSAE-----------------------HTSHVLSTEAQPSK-----LSCTTVFDLWHN-----------

Query:  -------------------------RFTWIYFLTSKTEAFQIFKLFKTYVEKLLGRSILRLQTDNGSEFRPFTPFLKTHGIEHRFTCPYTSKQNGIVERK
                                 RFTWIY L  K+EAFQ+F  FK+ VE  LG  I  +Q+D G E+R FT +L ++GI HR +CPYT +QNG+ ERK
Subjt:  -------------------------RFTWIYFLTSKTEAFQIFKLFKTYVEKLLGRSILRLQTDNGSEFRPFTPFLKTHGIEHRFTCPYTSKQNGIVERK

Query:  HRKIVEMGLTLLSHASLSLEFWDDAFSTSVYLINRLPT-VVHQLSPLEKLLGFKPDYKLLKTFGCLCYPYLRPYNHHKIEPRSLPCLFLGYSSIHKGYKC
        HR IVE G+ LL+ ASL  ++WD+AF TSVYLINRLPT V+   SPLE L   KP Y  LK FGC+CYP LRP+NHHK++ RS+PC FLGYS  HKGYKC
Subjt:  HRKIVEMGLTLLSHASLSLEFWDDAFSTSVYLINRLPT-VVHQLSPLEKLLGFKPDYKLLKTFGCLCYPYLRPYNHHKIEPRSLPCLFLGYSSIHKGYKC

Query:  LASSGKVYISRNVLFDETKFPSSKFLDPQSPVLYPTVPIMVPITYGKSLSTAVSCPTDATVTVPEISPTVPAMTLNNHDSDHTSLSNDPS
        L+ +G + ISR+V+FDE  FP ++    +              +   S ST++ C T           ++P M L +  S  TS   +PS
Subjt:  LASSGKVYISRNVLFDETKFPSSKFLDPQSPVLYPTVPIMVPITYGKSLSTAVSCPTDATVTVPEISPTVPAMTLNNHDSDHTSLSNDPS

A0A5A7U233 Retrovirus-related Pol polyprotein from transposon TNT 1-941.1e-14939.61Show/hide
Query:  MESHSTEKSSSEIEVSSQAMKIINPGNKITTMKLDEDNFLLWKLQILTTLRGHGLKHHLDEDSEIPSKYLTNGDSSK-----IPNLAYDHWVRQDSLIIA
        M S S+       E SS   +I   GNKI+ +KL++D FLLWK QILT L  + L++ L+ +SE PSKYL + +SS       PN AY  W RQD LI +
Subjt:  MESHSTEKSSSEIEVSSQAMKIINPGNKITTMKLDEDNFLLWKLQILTTLRGHGLKHHLDEDSEIPSKYLTNGDSSK-----IPNLAYDHWVRQDSLIIA

Query:  WLLGSMSNSLLSEMLECETAREVWRILQNRFSSRNVARIMDLKSKLELLKKGNLKLEDYFLKVKNLVDALNAAGRKISHEDHVLKILAGLGTEYDSTVNV
        WLLGSMS  +L++ML C++A+E+W  LQ  FSSR +A+ M  K+KL  +KKG++ L++YFLK+   VDAL +  + +S +DH+L ILAGLG++Y S ++V
Subjt:  WLLGSMSNSLLSEMLECETAREVWRILQNRFSSRNVARIMDLKSKLELLKKGNLKLEDYFLKVKNLVDALNAAGRKISHEDHVLKILAGLGTEYDSTVNV

Query:  ITEKDETPPLQKVYSLLFTQESRIARNLSINPDGSIPSVNLTTQTGPK---------QQQASSGDSNNRKNSNGKGNNNRRSWNNNNKPQCQLCKRFGHT
        I+ + ++P +Q+V SLL TQES+      +  + ++PSVN+ TQT  K         Q    +  S N++   G G +NR    N NKPQCQ+C + G++
Subjt:  ITEKDETPPLQKVYSLLFTQESRIARNLSINPDGSIPSVNLTTQTGPK---------QQQASSGDSNNRKNSNGKGNNNRRSWNNNNKPQCQLCKRFGHT

Query:  VQRCYYRFERWFQGPNTNSNGQQSSSVLGPPPSGQNFQQPPMNALVVQNDLNKDNHWYPDSGASNHVTNDLSNLAIGTEYQGDNRVNVGNGA--------
          RC++R+      P +NS+G   +S      +  N   P M+A+V   DLN D++WYPDSGA+NH+T+ LSNL+IG+EY G N++   NG+        
Subjt:  VQRCYYRFERWFQGPNTNSNGQQSSSVLGPPPSGQNFQQPPMNALVVQNDLNKDNHWYPDSGASNHVTNDLSNLAIGTEYQGDNRVNVGNGA--------

Query:  ---------------------------------------------------------------GTLVDGLYQFLLEKAHP------DTTK----------
                                                                       G L DGLY+F +E +H         TK          
Subjt:  ---------------------------------------------------------------GTLVDGLYQFLLEKAHP------DTTK----------

Query:  --------------------------ISNSTS-----------------AEHTSHVLSTEAQPSKLSCTTVFDLW--------------------HNRFT
                                  I NS+                  A   SH L+    P +L      DLW                    ++R+T
Subjt:  --------------------------ISNSTS-----------------AEHTSHVLSTEAQPSKLSCTTVFDLW--------------------HNRFT

Query:  WIYFLTSKTEAFQIFKLFKTYVEKLLGRSILRLQTDNGSEFRPFTPFLKTHGIEHRFTCPYTSKQNGIVERKHRKIVEMGLTLLSHASLSLEFWDDAFST
        WIYFL SK++AF  F+ FKT VEK LG+SI  LQTD G+EF+PF PFL  HGIEHR TCPYTSKQN IVERKHR I+EMGLTLLS A+L L FWD+AFST
Subjt:  WIYFLTSKTEAFQIFKLFKTYVEKLLGRSILRLQTDNGSEFRPFTPFLKTHGIEHRFTCPYTSKQNGIVERKHRKIVEMGLTLLSHASLSLEFWDDAFST

Query:  SVYLINRLPT-VVHQLSPLEKLLGFKPDYKLLKTFGCLCYPYLRPYNHHKIEPRSLPCLFLGYSSIHKGYKCLASSGKVYISRNVLFDETKFPSSKFLDP
        SVYLINRLPT V+  +SPLEKL   KP++  L+ FGC CYPYLRPY  HK+  RS PC FLGYS+ HKGYKCLAS G+++ISR+VLFDE  FP + F   
Subjt:  SVYLINRLPT-VVHQLSPLEKLLGFKPDYKLLKTFGCLCYPYLRPYNHHKIEPRSLPCLFLGYSSIHKGYKCLASSGKVYISRNVLFDETKFPSSKFLDP

Query:  QSPVLYPTVPIMVPITYGKSLSTAVSCPTDATVTVPEISPTVPAMTLN-NHDSDHTSLSND
                             ++  S P    V  P +   +P+  +N N D  HT   +D
Subjt:  QSPVLYPTVPIMVPITYGKSLSTAVSCPTDATVTVPEISPTVPAMTLN-NHDSDHTSLSND

A0A5D3CH97 Retrovirus-related Pol polyprotein from transposon TNT 1-941.1e-14939.61Show/hide
Query:  MESHSTEKSSSEIEVSSQAMKIINPGNKITTMKLDEDNFLLWKLQILTTLRGHGLKHHLDEDSEIPSKYLTNGDSSK-----IPNLAYDHWVRQDSLIIA
        M S S+       E SS   +I   GNKI+ +KL++D FLLWK QILT L  + L++ L+ +SE PSKYL + +SS       PN AY  W RQD LI +
Subjt:  MESHSTEKSSSEIEVSSQAMKIINPGNKITTMKLDEDNFLLWKLQILTTLRGHGLKHHLDEDSEIPSKYLTNGDSSK-----IPNLAYDHWVRQDSLIIA

Query:  WLLGSMSNSLLSEMLECETAREVWRILQNRFSSRNVARIMDLKSKLELLKKGNLKLEDYFLKVKNLVDALNAAGRKISHEDHVLKILAGLGTEYDSTVNV
        WLLGSMS  +L++ML C++A+E+W  LQ  FSSR +A+ M  K+KL  +KKG++ L++YFLK+   VDAL +  + +S +DH+L ILAGLG++Y S ++V
Subjt:  WLLGSMSNSLLSEMLECETAREVWRILQNRFSSRNVARIMDLKSKLELLKKGNLKLEDYFLKVKNLVDALNAAGRKISHEDHVLKILAGLGTEYDSTVNV

Query:  ITEKDETPPLQKVYSLLFTQESRIARNLSINPDGSIPSVNLTTQTGPK---------QQQASSGDSNNRKNSNGKGNNNRRSWNNNNKPQCQLCKRFGHT
        I+ + ++P +Q+V SLL TQES+      +  + ++PSVN+ TQT  K         Q    +  S N++   G G +NR    N NKPQCQ+C + G++
Subjt:  ITEKDETPPLQKVYSLLFTQESRIARNLSINPDGSIPSVNLTTQTGPK---------QQQASSGDSNNRKNSNGKGNNNRRSWNNNNKPQCQLCKRFGHT

Query:  VQRCYYRFERWFQGPNTNSNGQQSSSVLGPPPSGQNFQQPPMNALVVQNDLNKDNHWYPDSGASNHVTNDLSNLAIGTEYQGDNRVNVGNGA--------
          RC++R+      P +NS+G   +S      +  N   P M+A+V   DLN D++WYPDSGA+NH+T+ LSNL+IG+EY G N++   NG+        
Subjt:  VQRCYYRFERWFQGPNTNSNGQQSSSVLGPPPSGQNFQQPPMNALVVQNDLNKDNHWYPDSGASNHVTNDLSNLAIGTEYQGDNRVNVGNGA--------

Query:  ---------------------------------------------------------------GTLVDGLYQFLLEKAHP------DTTK----------
                                                                       G L DGLY+F +E +H         TK          
Subjt:  ---------------------------------------------------------------GTLVDGLYQFLLEKAHP------DTTK----------

Query:  --------------------------ISNSTS-----------------AEHTSHVLSTEAQPSKLSCTTVFDLW--------------------HNRFT
                                  I NS+                  A   SH L+    P +L      DLW                    ++R+T
Subjt:  --------------------------ISNSTS-----------------AEHTSHVLSTEAQPSKLSCTTVFDLW--------------------HNRFT

Query:  WIYFLTSKTEAFQIFKLFKTYVEKLLGRSILRLQTDNGSEFRPFTPFLKTHGIEHRFTCPYTSKQNGIVERKHRKIVEMGLTLLSHASLSLEFWDDAFST
        WIYFL SK++AF  F+ FKT VEK LG+SI  LQTD G+EF+PF PFL  HGIEHR TCPYTSKQN IVERKHR I+EMGLTLLS A+L L FWD+AFST
Subjt:  WIYFLTSKTEAFQIFKLFKTYVEKLLGRSILRLQTDNGSEFRPFTPFLKTHGIEHRFTCPYTSKQNGIVERKHRKIVEMGLTLLSHASLSLEFWDDAFST

Query:  SVYLINRLPT-VVHQLSPLEKLLGFKPDYKLLKTFGCLCYPYLRPYNHHKIEPRSLPCLFLGYSSIHKGYKCLASSGKVYISRNVLFDETKFPSSKFLDP
        SVYLINRLPT V+  +SPLEKL   KP++  L+ FGC CYPYLRPY  HK+  RS PC FLGYS+ HKGYKCLAS G+++ISR+VLFDE  FP + F   
Subjt:  SVYLINRLPT-VVHQLSPLEKLLGFKPDYKLLKTFGCLCYPYLRPYNHHKIEPRSLPCLFLGYSSIHKGYKCLASSGKVYISRNVLFDETKFPSSKFLDP

Query:  QSPVLYPTVPIMVPITYGKSLSTAVSCPTDATVTVPEISPTVPAMTLN-NHDSDHTSLSND
                             ++  S P    V  P +   +P+  +N N D  HT   +D
Subjt:  QSPVLYPTVPIMVPITYGKSLSTAVSCPTDATVTVPEISPTVPAMTLN-NHDSDHTSLSND

SwissProt top hitse value%identityAlignment
P04146 Copia protein6.6e-1930.63Show/hide
Query:  PSKLSCTTVFDLWHNRFTW---IYFLTSKTEAFQIFKLFKTYVEKLLGRSILRLQTDNGSEF--RPFTPFLKTHGIEHRFTCPYTSKQNGIVERKHRKIV
        P  L     F ++ ++FT     Y +  K++ F +F+ F    E      ++ L  DNG E+       F    GI +  T P+T + NG+ ER  R I 
Subjt:  PSKLSCTTVFDLWHNRFTW---IYFLTSKTEAFQIFKLFKTYVEKLLGRSILRLQTDNGSEF--RPFTPFLKTHGIEHRFTCPYTSKQNGIVERKHRKIV

Query:  EMGLTLLSHASLSLEFWDDAFSTSVYLINRLPT---VVHQLSPLEKLLGFKPDYKLLKTFGCLCYPYLRPYNHHKIEPRSLPCLFLGYSSIHKGYKCL-A
        E   T++S A L   FW +A  T+ YLINR+P+   V    +P E     KP  K L+ FG   Y +++     K + +S   +F+GY     G+K   A
Subjt:  EMGLTLLSHASLSLEFWDDAFSTSVYLINRLPT---VVHQLSPLEKLLGFKPDYKLLKTFGCLCYPYLRPYNHHKIEPRSLPCLFLGYSSIHKGYKCL-A

Query:  SSGKVYISRNVLFDETKFPSSK
         + K  ++R+V+ DET   +S+
Subjt:  SSGKVYISRNVLFDETKFPSSK

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-941.3e-3034.15Show/hide
Query:  NRFTWIYFLTSKTEAFQIFKLFKTYVEKLLGRSILRLQTDNGSEF--RPFTPFLKTHGIEHRFTCPYTSKQNGIVERKHRKIVEMGLTLLSHASLSLEFW
        +R  W+Y L +K + FQ+F+ F   VE+  GR + RL++DNG E+  R F  +  +HGI H  T P T + NG+ ER +R IVE   ++L  A L   FW
Subjt:  NRFTWIYFLTSKTEAFQIFKLFKTYVEKLLGRSILRLQTDNGSEF--RPFTPFLKTHGIEHRFTCPYTSKQNGIVERKHRKIVEMGLTLLSHASLSLEFW

Query:  DDAFSTSVYLINRLPTVVHQLSPLEKLLGFKP-DYKLLKTFGCLCYPYLRPYNHHKIEPRSLPCLFLGYSSIHKGYKCL-ASSGKVYISRNVLFDETKFP
         +A  T+ YLINR P+V       E++   K   Y  LK FGC  + ++      K++ +S+PC+F+GY     GY+       KV  SR+V+F E++  
Subjt:  DDAFSTSVYLINRLPTVVHQLSPLEKLLGFKP-DYKLLKTFGCLCYPYLRPYNHHKIEPRSLPCLFLGYSSIHKGYKCL-ASSGKVYISRNVLFDETKFP

Query:  SSKFLDPQSPVLYPTVPIMVPITYGKSLSTAVSCPTDATVTVPEIS
        ++   D    V    +P  V      ++ +  + PT A  T  E+S
Subjt:  SSKFLDPQSPVLYPTVPIMVPITYGKSLSTAVSCPTDATVTVPEIS

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE12.2e-7527.78Show/hide
Query:  SSEIEVSSQAMKIINPGNKITTMKLDEDNFLLWKLQILTTLRGHGLKHHLDEDSEIPSKYLTNGDSSKIPNLAYDHWVRQDSLIIAWLLGSMSNSLLSEM
        + E+ +++ ++  +N  N     KL   N+L+W  Q+     G+ L   LD  + +P   +   D++   N  Y  W RQD LI + +LG++S S+   +
Subjt:  SSEIEVSSQAMKIINPGNKITTMKLDEDNFLLWKLQILTTLRGHGLKHHLDEDSEIPSKYLTNGDSSKIPNLAYDHWVRQDSLIIAWLLGSMSNSLLSEM

Query:  LECETAREVWRILQNRFSSRNVARIMDLKSKLELLKKGNLKLEDYFLKVKNLVDALNAAGRKISHEDHVLKILAGLGTEYDSTVNVITEKDETPPLQKVY
            TA ++W  L+  +++ +   +  L+++L+   KG   ++DY   +    D L   G+ + H++ V ++L  L  EY   ++ I  KD  P L +++
Subjt:  LECETAREVWRILQNRFSSRNVARIMDLKSKLELLKKGNLKLEDYFLKVKNLVDALNAAGRKISHEDHVLKILAGLGTEYDSTVNVITEKDETPPLQKVY

Query:  SLLFTQESRIARNLSINPDGSIP-SVNLTTQTGPKQQQASSGDSNNRKNSNGKGNNNRRSW----------NNNNKP---QCQLCKRFGHTVQRCYYRFE
          L   ES+I   L+++    IP + N  +         ++  + N +  N   NNN + W          NN +KP   +CQ+C   GH+ +RC     
Subjt:  SLLFTQESRIARNLSINPDGSIP-SVNLTTQTGPKQQQASSGDSNNRKNSNGKGNNNRRSW----------NNNNKP---QCQLCKRFGHTVQRCYYRFE

Query:  RWFQGPNTNSNGQQSSSVLGPPPSGQNFQQPPMNALVVQNDLNKDNHWYPDSGASNHVTNDLSNLAIGTEYQGDNRVNVGNGA-----------------
           Q   ++ N QQ       PPS     QP  N  +        N+W  DSGA++H+T+D +NL++   Y G + V V +G+                 
Subjt:  RWFQGPNTNSNGQQSSSVLGPPPSGQNFQQPPMNALVVQNDLNKDNHWYPDSGASNHVTNDLSNLAIGTEYQGDNRVNVGNGA-----------------

Query:  ---------------------------------------------------GTLVDGLYQFLLEKAHPDTTKISNSTSAEHTS-HVLSTEAQPS------
                                                           G   D LY++ +  + P +   S S+ A H+S H       PS      
Subjt:  ---------------------------------------------------GTLVDGLYQFLLEKAHPDTTKISNSTSAEHTS-HVLSTEAQPS------

Query:  -------------KLSCTTVF-------------------------DLWHN-------------------RFTWIYFLTSKTEAFQIFKLFKTYVEKLLG
                      LSC+                            D+W +                   R+TW+Y L  K++  + F  FK  +E    
Subjt:  -------------KLSCTTVF-------------------------DLWHN-------------------RFTWIYFLTSKTEAFQIFKLFKTYVEKLLG

Query:  RSILRLQTDNGSEFRPFTPFLKTHGIEHRFTCPYTSKQNGIVERKHRKIVEMGLTLLSHASLSLEFWDDAFSTSVYLINRLPTVVHQL-SPLEKLLGFKP
          I    +DNG EF     +   HGI H  + P+T + NG+ ERKHR IVE GLTLLSHAS+   +W  AF+ +VYLINRLPT + QL SP +KL G  P
Subjt:  RSILRLQTDNGSEFRPFTPFLKTHGIEHRFTCPYTSKQNGIVERKHRKIVEMGLTLLSHASLSLEFWDDAFSTSVYLINRLPTVVHQL-SPLEKLLGFKP

Query:  DYKLLKTFGCLCYPYLRPYNHHKIEPRSLPCLFLGYSSIHKGYKCL-ASSGKVYISRNVLFDETKFPSSKFLDPQSPVL------------YPTVPIMVP
        +Y  L+ FGC CYP+LRPYN HK++ +S  C+FLGYS     Y CL   + ++YISR+V FDE  FP S +L   SPV             + T+P   P
Subjt:  DYKLLKTFGCLCYPYLRPYNHHKIEPRSLPCLFLGYSSIHKGYKCL-ASSGKVYISRNVLFDETKFPSSKFLDPQSPVL------------YPTVPIMVP

Query:  ITYGKSLSTAVSCPTDATVTVPEISPTVP----AMTLNNHDSDHTS
        +        A SC        P  SP+ P     ++ +N DS  +S
Subjt:  ITYGKSLSTAVSCPTDATVTVPEISPTVP----AMTLNNHDSDHTS

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE22.0e-7128.33Show/hide
Query:  NKITTMKLDEDNFLLWKLQILTTLRGHGLKHHLDEDSEIPSKYLTNGDSSKIPNLAYDHWVRQDSLIIAWLLGSMSNSLLSEMLECETAREVWRILQNRF
        N     KL   N+L+W  Q+     G+ L   LD  + +P   +   D+    N  Y  W RQD LI + +LG++S S+   +    TA ++W  L+  +
Subjt:  NKITTMKLDEDNFLLWKLQILTTLRGHGLKHHLDEDSEIPSKYLTNGDSSKIPNLAYDHWVRQDSLIIAWLLGSMSNSLLSEMLECETAREVWRILQNRF

Query:  SSRNVARIMDLKSKLELLKKGNLKLEDYFLKVKNLVDALNAAGRKISHEDHVLKILAGLGTEYDSTVNVITEKDETPPLQKVYSLLFTQESRIARNLSIN
        ++ +   +  L+                F+      D L   G+ + H++ V ++L  L  +Y   ++ I  KD  P L +++  L  +ES++   L++N
Subjt:  SSRNVARIMDLKSKLELLKKGNLKLEDYFLKVKNLVDALNAAGRKISHEDHVLKILAGLGTEYDSTVNVITEKDETPPLQKVYSLLFTQESRIARNLSIN

Query:  PDGSIP-SVNLTT--QTGPKQQQASSGDSNNRKNSNGKGN------NNRRSWNNNNKP---QCQLCKRFGHTVQRCYYRFERWFQGPNTNSNGQQSSSVL
            +P + N+ T   T   + Q + GD+ N  N+N + N      +  RS N   KP   +CQ+C   GH+ +RC    +  FQ   + +N QQS+S  
Subjt:  PDGSIP-SVNLTT--QTGPKQQQASSGDSNNRKNSNGKGN------NNRRSWNNNNKP---QCQLCKRFGHTVQRCYYRFERWFQGPNTNSNGQQSSSVL

Query:  GPPPSGQNFQQPPMNALVVQNDLNKDNHWYPDSGASNHVTNDLSNLAIGTEYQGDNRVNVGNGA------------------------------------
         P        QP  N  V  N     N+W  DSGA++H+T+D +NL+    Y G + V + +G+                                    
Subjt:  GPPPSGQNFQQPPMNALVVQNDLNKDNHWYPDSGASNHVTNDLSNLAIGTEYQGDNRVNVGNGA------------------------------------

Query:  --------------------------------GTLVDGLYQFLLEKAHPDTTKISNSTSAEHTS--------------HVLSTEAQP------SKLSCTT
                                        G   D LY++ +  +   +   S  + A H+S               V+S  + P        LSC+ 
Subjt:  --------------------------------GTLVDGLYQFLLEKAHPDTTKISNSTSAEHTS--------------HVLSTEAQP------SKLSCTT

Query:  VF-------------------------DLWHN-------------------RFTWIYFLTSKTEAFQIFKLFKTYVEKLLGRSILRLQTDNGSEFRPFTP
         F                         D+W +                   R+TW+Y L  K++    F +FK+ VE      I  L +DNG EF     
Subjt:  VF-------------------------DLWHN-------------------RFTWIYFLTSKTEAFQIFKLFKTYVEKLLGRSILRLQTDNGSEFRPFTP

Query:  FLKTHGIEHRFTCPYTSKQNGIVERKHRKIVEMGLTLLSHASLSLEFWDDAFSTSVYLINRLPTVVHQL-SPLEKLLGFKPDYKLLKTFGCLCYPYLRPY
        +L  HGI H  + P+T + NG+ ERKHR IVEMGLTLLSHAS+   +W  AFS +VYLINRLPT + QL SP +KL G  P+Y+ LK FGC CYP+LRPY
Subjt:  FLKTHGIEHRFTCPYTSKQNGIVERKHRKIVEMGLTLLSHASLSLEFWDDAFSTSVYLINRLPTVVHQL-SPLEKLLGFKPDYKLLKTFGCLCYPYLRPY

Query:  NHHKIEPRSLPCLFLGYSSIHKGYKCL-ASSGKVYISRNVLFDETKFPSSKFLDPQSPVLYPTVPIMVPITYGKSLSTAVSCPTDATV-TVPEISPTVPA
        N HK+E +S  C F+GYS     Y CL   +G++Y SR+V FDE  FP            + T    V  +  +   +A + P+  T+ T P + P  P 
Subjt:  NHHKIEPRSLPCLFLGYSSIHKGYKCL-ASSGKVYISRNVLFDETKFPSSKFLDPQSPVLYPTVPIMVPITYGKSLSTAVSCPTDATV-TVPEISPTVPA

Query:  MTLNNHDSDHTSLSNDPSC
        +  +   S     S  P C
Subjt:  MTLNNHDSDHTSLSNDPSC

Arabidopsis top hitse value%identityAlignment
AT1G21280.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Has 707 Blast hits to 705 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 703; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).2.4e-0825.56Show/hide
Query:  ITTMKLDEDNFLLWKLQILTTLRGHGLKHHLDEDSEIPSKYLTNGDSSKIPNLAYDHWVRQDSLIIAWLLGSMSNSLLSEMLECETAREVWRILQNRFSS
        I  +  DEDN++ WK++  + LR       +D     P  +          +  Y  W + +++++ WL+ SM++ LL  ++  ETA ++W  L+  F  
Subjt:  ITTMKLDEDNFLLWKLQILTTLRGHGLKHHLDEDSEIPSKYLTNGDSSKIPNLAYDHWVRQDSLIIAWLLGSMSNSLLSEMLECETAREVWRILQNRFSS

Query:  RNVARIMDLKSKLELLKKGNLKLEDYFLKVKNL
            +I  L+ +L  L++G   +E+YF K+  +
Subjt:  RNVARIMDLKSKLELLKKGNLKLEDYFLKVKNL

AT1G34070.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)7.8e-1523.13Show/hide
Query:  MKLDEDNFLLWKLQILTTLRGHGLKHHLDEDSEIPSKYLTNGDSSKIPNLAYD-HWVRQDSLIIAWLLGSMS-NSLLSEMLECETAREVWRILQNRFSSR
        + ++E N+  W+   LT      +  H+              D + +P  A D +W ++D ++   L G+++        +   T+R++W  ++N+F + 
Subjt:  MKLDEDNFLLWKLQILTTLRGHGLKHHLDEDSEIPSKYLTNGDSSKIPNLAYD-HWVRQDSLIIAWLLGSMS-NSLLSEMLECETAREVWRILQNRFSSR

Query:  NVARIMDLKSKLELLKKGNLKLEDYFLKVKNLVDALNAAGRKISHEDHVLKILAGLGTEYDSTVNVITEKDETPPLQKVYSLLFTQESRIARNLSINP--
          AR + L S+L     G++++ DY+ K+K L D+L      ++  + V+ +L GL  ++D+ +NVI  +   P      ++L  +E R+ R +  NP  
Subjt:  NVARIMDLKSKLELLKKGNLKLEDYFLKVKNLVDALNAAGRKISHEDHVLKILAGLGTEYDSTVNVITEKDETPPLQKVYSLLFTQESRIARNLSINP--

Query:  --DGSIPSVNLTTQTGPKQQQASSGDSNNRKNSNGKGNN------------NRRSWNNNNKPQCQLCKRFGHTVQRCYYRFER-WFQGPNTNSNGQQSSS
            S  +V   ++  P      SG +       G+GNN            N  ++N+ N+P            Q  Y  +   W   P  N+NG     
Subjt:  --DGSIPSVNLTTQTGPKQQQASSGDSNNRKNSNGKGNN------------NRRSWNNNNKPQCQLCKRFGHTVQRCYYRFER-WFQGPNTNSNGQQSSS

Query:  VLGPPPS
        +LGP P+
Subjt:  VLGPPPS

AT5G48050.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)8.3e-1726.77Show/hide
Query:  TMKLDEDNFLLWKLQILTTLRGHGLKHHLDEDSEIPSKYLTNGDSSKIPNLAYDHWVRQDSLIIAWLLGSMSNSLLSEMLECE-TAREVWRILQNRFSSR
        T+ L++ N+ +W+    T     G+  H+D            G S+  P +    W  +D L+  W+ G++++SLL  +++   TAR++W  L+N F   
Subjt:  TMKLDEDNFLLWKLQILTTLRGHGLKHHLDEDSEIPSKYLTNGDSSKIPNLAYDHWVRQDSLIIAWLLGSMSNSLLSEMLECE-TAREVWRILQNRFSSR

Query:  NVARIMDLKSKLELLKKGNLKLEDYFLKVKNLVDALNAAGRKISHEDHVLKILAGLGTEYDSTVNVITEKDETPPLQKVYSLLFTQESRIA-RNLSINPD
          AR +  +++L      +L + +Y  K+K+L D L      IS    V+ +L GL  +YD  +NVI  K   P   +  S+L  +ESR++ ++ S    
Subjt:  NVARIMDLKSKLELLKKGNLKLEDYFLKVKNLVDALNAAGRKISHEDHVLKILAGLGTEYDSTVNVITEKDETPPLQKVYSLLFTQESRIA-RNLSINPD

Query:  GSIPSVNLTTQTGPKQQQASSGDSNNRKNSNGKGNNNRRS---------WNNNN
         + PS++    T P+QQ+    + +N  ++ G+G + +++         +NNNN
Subjt:  GSIPSVNLTTQTGPKQQQASSGDSNNRKNSNGKGNNNRRS---------WNNNN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGTCTCATTCAACAGAGAAATCCAGTTCTGAGATTGAAGTATCCTCTCAGGCGATGAAGATCATTAATCCAGGCAATAAAATCACGACGATGAAGCTTGACGAAGA
CAATTTTCTTCTCTGGAAGCTTCAAATTCTTACTACTTTACGAGGTCATGGGTTGAAGCACCATCTTGATGAGGATTCTGAAATTCCTTCGAAGTACCTTACGAATGGTG
ATTCTTCGAAAATCCCTAACCTTGCTTATGACCATTGGGTTCGACAGGATAGTCTTATTATCGCCTGGTTACTTGGCTCAATGTCCAACTCTCTGCTCTCAGAAATGTTG
GAATGCGAAACCGCTCGAGAAGTATGGAGAATTCTTCAAAATCGATTCTCCTCTCGCAATGTTGCACGAATCATGGACCTGAAATCGAAGCTGGAATTGCTCAAGAAAGG
TAATCTTAAGCTTGAAGATTACTTTCTGAAAGTTAAGAATCTTGTTGATGCGTTAAATGCTGCTGGAAGAAAGATTTCTCATGAGGATCATGTGTTAAAAATTCTTGCTG
GCCTAGGAACTGAATACGACTCAACTGTAAATGTAATTACTGAGAAAGATGAAACTCCCCCTTTACAGAAAGTGTATTCACTTCTTTTTACCCAAGAGAGTAGGATTGCT
AGAAACTTGTCTATCAATCCTGATGGTTCAATCCCTTCGGTAAATCTTACAACACAGACAGGGCCAAAGCAACAACAAGCCTCGTCAGGTGATTCTAACAATCGCAAGAA
TTCGAATGGAAAGGGAAATAACAATCGAAGGTCGTGGAACAACAATAACAAACCTCAGTGTCAGTTGTGTAAGAGGTTTGGGCATACTGTTCAAAGGTGTTATTATCGGT
TTGAACGCTGGTTTCAGGGACCTAATACGAATTCAAATGGCCAACAATCATCCTCTGTTCTAGGACCACCACCATCTGGGCAAAATTTTCAGCAGCCACCTATGAATGCT
CTTGTTGTCCAGAATGATCTTAACAAAGACAACCACTGGTATCCGGATTCAGGGGCCTCGAACCATGTTACCAATGATCTATCAAACTTGGCGATTGGGACTGAGTATCA
AGGTGACAACCGAGTAAACGTTGGAAATGGTGCAGGGACACTAGTTGATGGACTCTATCAATTCTTGCTGGAAAAAGCTCATCCTGATACAACTAAGATCTCCAACTCCA
CATCTGCTGAACATACTTCACATGTGCTTTCCACTGAGGCTCAACCATCCAAATTATCCTGTACTACTGTCTTTGATTTGTGGCACAATAGGTTTACATGGATATACTTT
CTCACTTCTAAAACCGAGGCTTTCCAAATCTTTAAACTTTTTAAAACTTATGTTGAAAAGCTTCTTGGTCGTTCCATTCTTCGTCTTCAAACTGACAATGGTAGTGAATT
TCGTCCATTCACACCTTTTCTTAAAACGCATGGCATTGAACATCGTTTTACCTGTCCTTATACCTCAAAACAGAATGGAATTGTGGAAAGAAAACACCGAAAAATAGTTG
AAATGGGTCTAACTCTTTTGTCTCATGCCTCTTTATCCTTAGAGTTTTGGGATGATGCTTTTTCCACATCTGTCTATCTTATCAACAGATTGCCAACGGTAGTCCATCAA
CTATCACCCTTGGAAAAGTTACTTGGCTTCAAACCTGATTACAAATTACTAAAAACCTTCGGTTGTCTCTGTTATCCCTACCTTAGGCCATATAATCACCACAAAATTGA
ACCAAGATCCCTTCCATGTCTATTTCTCGGTTACAGCAGTATTCATAAAGGATATAAGTGTCTGGCTTCTTCAGGCAAAGTGTATATCTCAAGAAATGTCCTTTTTGATG
AAACAAAATTTCCATCTTCAAAGTTTCTAGACCCTCAAAGTCCTGTTCTTTACCCGACAGTCCCTATAATGGTACCTATTACCTATGGGAAGTCTTTATCTACTGCTGTG
TCTTGTCCTACTGATGCTACTGTCACTGTCCCTGAAATTTCTCCCACTGTACCGGCTATGACTTTAAATAATCATGACAGTGACCACACATCTTTATCTAATGATCCTTC
ATGTGAAGATTGA
mRNA sequenceShow/hide mRNA sequence
ATGGAGTCTCATTCAACAGAGAAATCCAGTTCTGAGATTGAAGTATCCTCTCAGGCGATGAAGATCATTAATCCAGGCAATAAAATCACGACGATGAAGCTTGACGAAGA
CAATTTTCTTCTCTGGAAGCTTCAAATTCTTACTACTTTACGAGGTCATGGGTTGAAGCACCATCTTGATGAGGATTCTGAAATTCCTTCGAAGTACCTTACGAATGGTG
ATTCTTCGAAAATCCCTAACCTTGCTTATGACCATTGGGTTCGACAGGATAGTCTTATTATCGCCTGGTTACTTGGCTCAATGTCCAACTCTCTGCTCTCAGAAATGTTG
GAATGCGAAACCGCTCGAGAAGTATGGAGAATTCTTCAAAATCGATTCTCCTCTCGCAATGTTGCACGAATCATGGACCTGAAATCGAAGCTGGAATTGCTCAAGAAAGG
TAATCTTAAGCTTGAAGATTACTTTCTGAAAGTTAAGAATCTTGTTGATGCGTTAAATGCTGCTGGAAGAAAGATTTCTCATGAGGATCATGTGTTAAAAATTCTTGCTG
GCCTAGGAACTGAATACGACTCAACTGTAAATGTAATTACTGAGAAAGATGAAACTCCCCCTTTACAGAAAGTGTATTCACTTCTTTTTACCCAAGAGAGTAGGATTGCT
AGAAACTTGTCTATCAATCCTGATGGTTCAATCCCTTCGGTAAATCTTACAACACAGACAGGGCCAAAGCAACAACAAGCCTCGTCAGGTGATTCTAACAATCGCAAGAA
TTCGAATGGAAAGGGAAATAACAATCGAAGGTCGTGGAACAACAATAACAAACCTCAGTGTCAGTTGTGTAAGAGGTTTGGGCATACTGTTCAAAGGTGTTATTATCGGT
TTGAACGCTGGTTTCAGGGACCTAATACGAATTCAAATGGCCAACAATCATCCTCTGTTCTAGGACCACCACCATCTGGGCAAAATTTTCAGCAGCCACCTATGAATGCT
CTTGTTGTCCAGAATGATCTTAACAAAGACAACCACTGGTATCCGGATTCAGGGGCCTCGAACCATGTTACCAATGATCTATCAAACTTGGCGATTGGGACTGAGTATCA
AGGTGACAACCGAGTAAACGTTGGAAATGGTGCAGGGACACTAGTTGATGGACTCTATCAATTCTTGCTGGAAAAAGCTCATCCTGATACAACTAAGATCTCCAACTCCA
CATCTGCTGAACATACTTCACATGTGCTTTCCACTGAGGCTCAACCATCCAAATTATCCTGTACTACTGTCTTTGATTTGTGGCACAATAGGTTTACATGGATATACTTT
CTCACTTCTAAAACCGAGGCTTTCCAAATCTTTAAACTTTTTAAAACTTATGTTGAAAAGCTTCTTGGTCGTTCCATTCTTCGTCTTCAAACTGACAATGGTAGTGAATT
TCGTCCATTCACACCTTTTCTTAAAACGCATGGCATTGAACATCGTTTTACCTGTCCTTATACCTCAAAACAGAATGGAATTGTGGAAAGAAAACACCGAAAAATAGTTG
AAATGGGTCTAACTCTTTTGTCTCATGCCTCTTTATCCTTAGAGTTTTGGGATGATGCTTTTTCCACATCTGTCTATCTTATCAACAGATTGCCAACGGTAGTCCATCAA
CTATCACCCTTGGAAAAGTTACTTGGCTTCAAACCTGATTACAAATTACTAAAAACCTTCGGTTGTCTCTGTTATCCCTACCTTAGGCCATATAATCACCACAAAATTGA
ACCAAGATCCCTTCCATGTCTATTTCTCGGTTACAGCAGTATTCATAAAGGATATAAGTGTCTGGCTTCTTCAGGCAAAGTGTATATCTCAAGAAATGTCCTTTTTGATG
AAACAAAATTTCCATCTTCAAAGTTTCTAGACCCTCAAAGTCCTGTTCTTTACCCGACAGTCCCTATAATGGTACCTATTACCTATGGGAAGTCTTTATCTACTGCTGTG
TCTTGTCCTACTGATGCTACTGTCACTGTCCCTGAAATTTCTCCCACTGTACCGGCTATGACTTTAAATAATCATGACAGTGACCACACATCTTTATCTAATGATCCTTC
ATGTGAAGATTGA
Protein sequenceShow/hide protein sequence
MESHSTEKSSSEIEVSSQAMKIINPGNKITTMKLDEDNFLLWKLQILTTLRGHGLKHHLDEDSEIPSKYLTNGDSSKIPNLAYDHWVRQDSLIIAWLLGSMSNSLLSEML
ECETAREVWRILQNRFSSRNVARIMDLKSKLELLKKGNLKLEDYFLKVKNLVDALNAAGRKISHEDHVLKILAGLGTEYDSTVNVITEKDETPPLQKVYSLLFTQESRIA
RNLSINPDGSIPSVNLTTQTGPKQQQASSGDSNNRKNSNGKGNNNRRSWNNNNKPQCQLCKRFGHTVQRCYYRFERWFQGPNTNSNGQQSSSVLGPPPSGQNFQQPPMNA
LVVQNDLNKDNHWYPDSGASNHVTNDLSNLAIGTEYQGDNRVNVGNGAGTLVDGLYQFLLEKAHPDTTKISNSTSAEHTSHVLSTEAQPSKLSCTTVFDLWHNRFTWIYF
LTSKTEAFQIFKLFKTYVEKLLGRSILRLQTDNGSEFRPFTPFLKTHGIEHRFTCPYTSKQNGIVERKHRKIVEMGLTLLSHASLSLEFWDDAFSTSVYLINRLPTVVHQ
LSPLEKLLGFKPDYKLLKTFGCLCYPYLRPYNHHKIEPRSLPCLFLGYSSIHKGYKCLASSGKVYISRNVLFDETKFPSSKFLDPQSPVLYPTVPIMVPITYGKSLSTAV
SCPTDATVTVPEISPTVPAMTLNNHDSDHTSLSNDPSCED