; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0031805 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0031805
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationchr11:15116121..15117485
RNA-Seq ExpressionLag0031805
SyntenyLag0031805
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAN81099.1 hypothetical protein VITISV_017741 [Vitis vinifera]1.5e-11447.61Show/hide
Query:  MAKSHSLPYSPSTTTYTAPLQLIVSDLWGPTYIPSANGYRYYISFVDVFSRYTWIYFLKSKSDAFAAFVKFKTHIEKLLNLPILQFQSDNGGEFLAFKPY
        + K H  P+S S TTYT PL+LI  DLWGPT + S +GYRYYI FVD FSR++WI+ L++KS+A   FV FKT +E   +L I   Q+D GGEF AF+ Y
Subjt:  MAKSHSLPYSPSTTTYTAPLQLIVSDLWGPTYIPSANGYRYYISFVDVFSRYTWIYFLKSKSDAFAAFVKFKTHIEKLLNLPILQFQSDNGGEFLAFKPY

Query:  LDSHGISHRFSCPHTSQQNGIAECKHRHIVDTGLALLSHSSMPLKFWDEAFSTAVFFINRLPSEVLHGKSPLDIIFCTHPDYSFLKTFGCQCFPCLRPYK
        L  +GI HR SCPHT QQNG+AE KHR IV+ GL LL  +S+PLKFWDE+F T V+  NRLP+ +LH K P++++F + PDYSFLK FGC CFP LRPY 
Subjt:  LDSHGISHRFSCPHTSQQNGIAECKHRHIVDTGLALLSHSSMPLKFWDEAFSTAVFFINRLPSEVLHGKSPLDIIFCTHPDYSFLKTFGCQCFPCLRPYK

Query:  THKLDYRSEPCVFIGYSSSHKGYRCLSANGRVYVSRNVLFNESIFPFSQFPLQS----ITKSP------PSVSLPTLGSIL------PVSSPIPSTSSSE
        THKL YRSE C F+GYS  HKGY+C+S+NGRVY+S +V+FNE+ FP+S+    S     T SP      PS S P L   +      P+SS  P +    
Subjt:  THKLDYRSEPCVFIGYSSSHKGYRCLSANGRVYVSRNVLFNESIFPFSQFPLQS----ITKSP------PSVSLPTLGSIL------PVSSPIPSTSSSE

Query:  LSTDPAPIPN----------IHASPSPGPLVEVTQDIPSSSST-TSPSVIKNTHHMVTRGKSGIFKPKAFVSVSTSDYDNIEPPNVKEALKSSHWRKAMQ
        + +     PN          + ++P   P+  V   I  +S T T      NTH M+TR KSGI KPK F++         EP +V  AL+   W+KAM 
Subjt:  LSTDPAPIPN----------IHASPSPGPLVEVTQDIPSSSST-TSPSVIKNTHHMVTRGKSGIFKPKAFVSVSTSDYDNIEPPNVKEALKSSHWRKAMQ

Query:  DEFDALQKNETWELVPSPDHKKIVGCKWVFRIKRNSDGSISRYKARLVANTTTNLTFIDAWLFTFIDICVKKTSINVIFVL
         E+DALQ+N TW LVP P  ++ +GCKWV++ K N DG++ +YKARLVA         D +  TF  + VK +++ V+F +
Subjt:  DEFDALQKNETWELVPSPDHKKIVGCKWVFRIKRNSDGSISRYKARLVANTTTNLTFIDAWLFTFIDICVKKTSINVIFVL

KYP71160.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan]2.6e-11450.93Show/hide
Query:  MAKSHSLPYSPSTTTYTAPLQLIVSDLWGPTYIPSANGYRYYISFVDVFSRYTWIYFLKSKSDAFAAFVKFKTHIEKLLNLPILQFQSDNGGEFLAFKPY
        + KSH LP   S + Y+APL LI +DLWGP+++ S +GY YY+SF+D FS++TWIY LKSKSD  + F +FK  +E  LN  I   QSD GGE+ +F  +
Subjt:  MAKSHSLPYSPSTTTYTAPLQLIVSDLWGPTYIPSANGYRYYISFVDVFSRYTWIYFLKSKSDAFAAFVKFKTHIEKLLNLPILQFQSDNGGEFLAFKPY

Query:  LDSHGISHRFSCPHTSQQNGIAECKHRHIVDTGLALLSHSSMPLKFWDEAFSTAVFFINRLPSEVLHGKSPLDIIFCTHPDYSFLKTFGCQCFPCLRPYK
        L +HGI+HR  CPHT  QNG+ E KHRHIVD G+ LL H+S+PL FWD AFSTAV+ INRLP+  L+   P  ++F   PDY FLKTFGC CFP LRPY 
Subjt:  LDSHGISHRFSCPHTSQQNGIAECKHRHIVDTGLALLSHSSMPLKFWDEAFSTAVFFINRLPSEVLHGKSPLDIIFCTHPDYSFLKTFGCQCFPCLRPYK

Query:  THKLDYRSEPCVFIGYSSSHKGYRCLSANGRVYVSRNVLFNESIFPFSQFPLQSITKSPPSVSLPTLGSILPVSSPIPSTSSSELSTDPAPIPNIHASP-
        THKLD+RS+ CVF+GYS+SHKGY+CLSA+G++Y+S++V+FNE+ FP+      + T  P      TL   L  S P   T++S L   P    + H+   
Subjt:  THKLDYRSEPCVFIGYSSSHKGYRCLSANGRVYVSRNVLFNESIFPFSQFPLQSITKSPPSVSLPTLGSILPVSSPIPSTSSSELSTDPAPIPNIHASP-

Query:  ---SPGPLVEVTQDIPSS--SSTTSPSVIK--NTHHMVTRGKSGIFKPKAFVSVSTSDYDNIEPPNVKEALKSSHWRKAMQDEFDALQKNETWELVPSPD
           S  P V      PSS  SST + +V +  N H M TR KSGI+KPK   S+  +   + EP NVK+AL    W  AM+ E+ AL +N TW+LVP P 
Subjt:  ---SPGPLVEVTQDIPSS--SSTTSPSVIK--NTHHMVTRGKSGIFKPKAFVSVSTSDYDNIEPPNVKEALKSSHWRKAMQDEFDALQKNETWELVPSPD

Query:  HKKIVGCKWVFRIKRNSDGSISRYKARLVA
        +++ VGCKWVFR+K N+DGSI+++KARLVA
Subjt:  HKKIVGCKWVFRIKRNSDGSISRYKARLVA

RVW44519.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]2.8e-11648.28Show/hide
Query:  MAKSHSLPYSPSTTTYTAPLQLIVSDLWGPTYIPSANGYRYYISFVDVFSRYTWIYFLKSKSDAFAAFVKFKTHIEKLLNLPILQFQSDNGGEFLAFKPY
        + KSH+LP+  S T YT PLQL+VSDLWGP  I S+ G+ YY+SFVD +SRYTW+YFLK+KS    AF+ FK   E      +  FQ+D GGEF + K Y
Subjt:  MAKSHSLPYSPSTTTYTAPLQLIVSDLWGPTYIPSANGYRYYISFVDVFSRYTWIYFLKSKSDAFAAFVKFKTHIEKLLNLPILQFQSDNGGEFLAFKPY

Query:  LDSHGISHRFSCPHTSQQNGIAECKHRHIVDTGLALLSHSSMPLKFWDEAFSTAVFFINRLPSEVLHGKSPLDIIFCTHPDYSFLKTFGCQCFPCLRPYK
         + +GI HR SCPHTS+QNGI E KHRHIV+ GL LL+ +S+PLK+W +AFSTAVF INRLP+EVL  K P + +F + P+YS LK FGC CFP LRPY 
Subjt:  LDSHGISHRFSCPHTSQQNGIAECKHRHIVDTGLALLSHSSMPLKFWDEAFSTAVFFINRLPSEVLHGKSPLDIIFCTHPDYSFLKTFGCQCFPCLRPYK

Query:  THKLDYRSEPCVFIGYSSSHKGYRCLSANGRVYVSRNVLFNESIFPFS---QFPLQSITKS------------------PPSVSLPT------------L
         HKLD+RS PC F+GYSS HKGY+CL+  GR+++SR+V+F+E+ FPF+   Q P+Q ++ S                   PS+SLPT            L
Subjt:  THKLDYRSEPCVFIGYSSSHKGYRCLSANGRVYVSRNVLFNESIFPFS---QFPLQSITKS------------------PPSVSLPT------------L

Query:  GS-ILPVSSPIPSTSSSE---LSTDPAPIP---NIHASPSPGPLVEVTQDIPSSSSTTSPSVI-KNTHHMVTRGKSGIFKPKAFVSVSTSDYDNIEPPNV
        GS I  V   + +T SS    +  + A IP   N++A P   PL     D P+ S  T P    +  HHMVTR K+GIFKPK +    T D +  EP   
Subjt:  GS-ILPVSSPIPSTSSSE---LSTDPAPIP---NIHASPSPGPLVEVTQDIPSSSSTTSPSVI-KNTHHMVTRGKSGIFKPKAFVSVSTSDYDNIEPPNV

Query:  KEALKSSHWRKAMQDEFDALQKNETWELVPSPDHKKIVGCKWVFRIKRNSDGSISRYKARLVANTTTNLTFIDAWLFTFIDICVKKTSINVIFVL
        +EA+    W++AM +EF AL KN+TW LV  P ++  VGC+WVF++KRN DGS+SRYKARLVA   + +   D +  TF  + VK T+I V+  +
Subjt:  KEALKSSHWRKAMQDEFDALQKNETWELVPSPDHKKIVGCKWVFRIKRNSDGSISRYKARLVANTTTNLTFIDAWLFTFIDICVKKTSINVIFVL

RVW60229.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]2.8e-11648.28Show/hide
Query:  MAKSHSLPYSPSTTTYTAPLQLIVSDLWGPTYIPSANGYRYYISFVDVFSRYTWIYFLKSKSDAFAAFVKFKTHIEKLLNLPILQFQSDNGGEFLAFKPY
        + KSH+LP+  S T YT PLQL+VSDLWGP  I S+ G+ YY+SFVD +SRYTW+YFLK+KS    AF+ FK   E      +  FQ+D GGEF + K Y
Subjt:  MAKSHSLPYSPSTTTYTAPLQLIVSDLWGPTYIPSANGYRYYISFVDVFSRYTWIYFLKSKSDAFAAFVKFKTHIEKLLNLPILQFQSDNGGEFLAFKPY

Query:  LDSHGISHRFSCPHTSQQNGIAECKHRHIVDTGLALLSHSSMPLKFWDEAFSTAVFFINRLPSEVLHGKSPLDIIFCTHPDYSFLKTFGCQCFPCLRPYK
         + +GI HR SCPHTS+QNGI E KHRHIV+ GL LL+ +S+PLK+W +AFSTAVF INRLP+EVL  K P + +F + P+YS LK FGC CFP LRPY 
Subjt:  LDSHGISHRFSCPHTSQQNGIAECKHRHIVDTGLALLSHSSMPLKFWDEAFSTAVFFINRLPSEVLHGKSPLDIIFCTHPDYSFLKTFGCQCFPCLRPYK

Query:  THKLDYRSEPCVFIGYSSSHKGYRCLSANGRVYVSRNVLFNESIFPFS---QFPLQSITKS------------------PPSVSLPT------------L
         HKLD+RS PC F+GYSS HKGY+CL+  GR+++SR+V+F+E+ FPF+   Q P+Q ++ S                   PS+SLPT            L
Subjt:  THKLDYRSEPCVFIGYSSSHKGYRCLSANGRVYVSRNVLFNESIFPFS---QFPLQSITKS------------------PPSVSLPT------------L

Query:  GS-ILPVSSPIPSTSSSE---LSTDPAPIP---NIHASPSPGPLVEVTQDIPSSSSTTSPSVI-KNTHHMVTRGKSGIFKPKAFVSVSTSDYDNIEPPNV
        GS I  V   + +T SS    +  + A IP   N++A P   PL     D P+ S  T P    +  HHMVTR K+GIFKPK +    T D +  EP   
Subjt:  GS-ILPVSSPIPSTSSSE---LSTDPAPIP---NIHASPSPGPLVEVTQDIPSSSSTTSPSVI-KNTHHMVTRGKSGIFKPKAFVSVSTSDYDNIEPPNV

Query:  KEALKSSHWRKAMQDEFDALQKNETWELVPSPDHKKIVGCKWVFRIKRNSDGSISRYKARLVANTTTNLTFIDAWLFTFIDICVKKTSINVIFVL
        +EA+    W++AM +EF AL KN+TW LV  P ++  VGC+WVF++KRN DGS+SRYKARLVA   + +   D +  TF  + VK T+I V+  +
Subjt:  KEALKSSHWRKAMQDEFDALQKNETWELVPSPDHKKIVGCKWVFRIKRNSDGSISRYKARLVANTTTNLTFIDAWLFTFIDICVKKTSINVIFVL

RVX14937.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]6.9e-11548.23Show/hide
Query:  MAKSHSLPYSPSTTTYTAPLQLIVSDLWGPTYIPSANGYRYYISFVDVFSRYTWIYFLKSKSDAFAAFVKFKTHIEKLLNLPILQFQSDNGGEFLAFKPY
        + K H  P+S S TTYT PL+LI SDLWGP  + S +GYRYYI FVD FSR++WI+ L++KS+A   FV FKT +E   +L I   Q+D GGEF AF+ Y
Subjt:  MAKSHSLPYSPSTTTYTAPLQLIVSDLWGPTYIPSANGYRYYISFVDVFSRYTWIYFLKSKSDAFAAFVKFKTHIEKLLNLPILQFQSDNGGEFLAFKPY

Query:  LDSHGISHRFSCPHTSQQNGIAECKHRHIVDTGLALLSHSSMPLKFWDEAFSTAVFFINRLPSEVLHGKSPLDIIFCTHPDYSFLKTFGCQCFPCLRPYK
        L  +GI HR SCPHT QQNG+AE KHR IV+ GL LL   S+PLKFWDE+F T V+  NRLP+ VLH K P++++F + PDYSFLK FGC CFP LRPY 
Subjt:  LDSHGISHRFSCPHTSQQNGIAECKHRHIVDTGLALLSHSSMPLKFWDEAFSTAVFFINRLPSEVLHGKSPLDIIFCTHPDYSFLKTFGCQCFPCLRPYK

Query:  THKLDYRSEPCVFIGYSSSHKGYRCLSANGRVYVSRNVLFNESIFPFSQFPLQS----ITKSP------PSVSLPTLGSIL------PVSSPIPSTSSSE
        THKL YRSE C F+GYS  HKGY+C+S+NGRVY+SR+V+FNE+ FP+S+    S     T SP      PS S P L   +      P+SS  P +    
Subjt:  THKLDYRSEPCVFIGYSSSHKGYRCLSANGRVYVSRNVLFNESIFPFSQFPLQS----ITKSP------PSVSLPTLGSIL------PVSSPIPSTSSSE

Query:  LSTDPAPIPN----------IHASPSPGPLVEVTQDIPSSSST-TSPSVIKNTHHMVTRGKSGIFKPKAFVSVSTSDYDNIEPPNVKEALKSSHWRKAMQ
        + +     PN          + ++P   P+  V   I  +S T T      NTH M+TR KSGI KPK F++         EP +V  AL+   W+KAM 
Subjt:  LSTDPAPIPN----------IHASPSPGPLVEVTQDIPSSSST-TSPSVIKNTHHMVTRGKSGIFKPKAFVSVSTSDYDNIEPPNVKEALKSSHWRKAMQ

Query:  DEFDALQKNETWELVPSPDHKKIVGCKWVFRIKRNSDGSISRYKARLVANTTTNLTFIDAWLFTFIDICVKKTSINVIFVL
         E+DALQ+N TW LVP P  ++ +GCKWV++ K N DG++ +YKARLVA         D +  TF  + VK ++I V+F +
Subjt:  DEFDALQKNETWELVPSPDHKKIVGCKWVFRIKRNSDGSISRYKARLVANTTTNLTFIDAWLFTFIDICVKKTSINVIFVL

TrEMBL top hitse value%identityAlignment
A0A151TVT8 Retrovirus-related Pol polyprotein from transposon TNT 1-941.3e-11450.93Show/hide
Query:  MAKSHSLPYSPSTTTYTAPLQLIVSDLWGPTYIPSANGYRYYISFVDVFSRYTWIYFLKSKSDAFAAFVKFKTHIEKLLNLPILQFQSDNGGEFLAFKPY
        + KSH LP   S + Y+APL LI +DLWGP+++ S +GY YY+SF+D FS++TWIY LKSKSD  + F +FK  +E  LN  I   QSD GGE+ +F  +
Subjt:  MAKSHSLPYSPSTTTYTAPLQLIVSDLWGPTYIPSANGYRYYISFVDVFSRYTWIYFLKSKSDAFAAFVKFKTHIEKLLNLPILQFQSDNGGEFLAFKPY

Query:  LDSHGISHRFSCPHTSQQNGIAECKHRHIVDTGLALLSHSSMPLKFWDEAFSTAVFFINRLPSEVLHGKSPLDIIFCTHPDYSFLKTFGCQCFPCLRPYK
        L +HGI+HR  CPHT  QNG+ E KHRHIVD G+ LL H+S+PL FWD AFSTAV+ INRLP+  L+   P  ++F   PDY FLKTFGC CFP LRPY 
Subjt:  LDSHGISHRFSCPHTSQQNGIAECKHRHIVDTGLALLSHSSMPLKFWDEAFSTAVFFINRLPSEVLHGKSPLDIIFCTHPDYSFLKTFGCQCFPCLRPYK

Query:  THKLDYRSEPCVFIGYSSSHKGYRCLSANGRVYVSRNVLFNESIFPFSQFPLQSITKSPPSVSLPTLGSILPVSSPIPSTSSSELSTDPAPIPNIHASP-
        THKLD+RS+ CVF+GYS+SHKGY+CLSA+G++Y+S++V+FNE+ FP+      + T  P      TL   L  S P   T++S L   P    + H+   
Subjt:  THKLDYRSEPCVFIGYSSSHKGYRCLSANGRVYVSRNVLFNESIFPFSQFPLQSITKSPPSVSLPTLGSILPVSSPIPSTSSSELSTDPAPIPNIHASP-

Query:  ---SPGPLVEVTQDIPSS--SSTTSPSVIK--NTHHMVTRGKSGIFKPKAFVSVSTSDYDNIEPPNVKEALKSSHWRKAMQDEFDALQKNETWELVPSPD
           S  P V      PSS  SST + +V +  N H M TR KSGI+KPK   S+  +   + EP NVK+AL    W  AM+ E+ AL +N TW+LVP P 
Subjt:  ---SPGPLVEVTQDIPSS--SSTTSPSVIK--NTHHMVTRGKSGIFKPKAFVSVSTSDYDNIEPPNVKEALKSSHWRKAMQDEFDALQKNETWELVPSPD

Query:  HKKIVGCKWVFRIKRNSDGSISRYKARLVA
        +++ VGCKWVFR+K N+DGSI+++KARLVA
Subjt:  HKKIVGCKWVFRIKRNSDGSISRYKARLVA

A0A438EA49 Retrovirus-related Pol polyprotein from transposon TNT 1-941.4e-11648.28Show/hide
Query:  MAKSHSLPYSPSTTTYTAPLQLIVSDLWGPTYIPSANGYRYYISFVDVFSRYTWIYFLKSKSDAFAAFVKFKTHIEKLLNLPILQFQSDNGGEFLAFKPY
        + KSH+LP+  S T YT PLQL+VSDLWGP  I S+ G+ YY+SFVD +SRYTW+YFLK+KS    AF+ FK   E      +  FQ+D GGEF + K Y
Subjt:  MAKSHSLPYSPSTTTYTAPLQLIVSDLWGPTYIPSANGYRYYISFVDVFSRYTWIYFLKSKSDAFAAFVKFKTHIEKLLNLPILQFQSDNGGEFLAFKPY

Query:  LDSHGISHRFSCPHTSQQNGIAECKHRHIVDTGLALLSHSSMPLKFWDEAFSTAVFFINRLPSEVLHGKSPLDIIFCTHPDYSFLKTFGCQCFPCLRPYK
         + +GI HR SCPHTS+QNGI E KHRHIV+ GL LL+ +S+PLK+W +AFSTAVF INRLP+EVL  K P + +F + P+YS LK FGC CFP LRPY 
Subjt:  LDSHGISHRFSCPHTSQQNGIAECKHRHIVDTGLALLSHSSMPLKFWDEAFSTAVFFINRLPSEVLHGKSPLDIIFCTHPDYSFLKTFGCQCFPCLRPYK

Query:  THKLDYRSEPCVFIGYSSSHKGYRCLSANGRVYVSRNVLFNESIFPFS---QFPLQSITKS------------------PPSVSLPT------------L
         HKLD+RS PC F+GYSS HKGY+CL+  GR+++SR+V+F+E+ FPF+   Q P+Q ++ S                   PS+SLPT            L
Subjt:  THKLDYRSEPCVFIGYSSSHKGYRCLSANGRVYVSRNVLFNESIFPFS---QFPLQSITKS------------------PPSVSLPT------------L

Query:  GS-ILPVSSPIPSTSSSE---LSTDPAPIP---NIHASPSPGPLVEVTQDIPSSSSTTSPSVI-KNTHHMVTRGKSGIFKPKAFVSVSTSDYDNIEPPNV
        GS I  V   + +T SS    +  + A IP   N++A P   PL     D P+ S  T P    +  HHMVTR K+GIFKPK +    T D +  EP   
Subjt:  GS-ILPVSSPIPSTSSSE---LSTDPAPIP---NIHASPSPGPLVEVTQDIPSSSSTTSPSVI-KNTHHMVTRGKSGIFKPKAFVSVSTSDYDNIEPPNV

Query:  KEALKSSHWRKAMQDEFDALQKNETWELVPSPDHKKIVGCKWVFRIKRNSDGSISRYKARLVANTTTNLTFIDAWLFTFIDICVKKTSINVIFVL
        +EA+    W++AM +EF AL KN+TW LV  P ++  VGC+WVF++KRN DGS+SRYKARLVA   + +   D +  TF  + VK T+I V+  +
Subjt:  KEALKSSHWRKAMQDEFDALQKNETWELVPSPDHKKIVGCKWVFRIKRNSDGSISRYKARLVANTTTNLTFIDAWLFTFIDICVKKTSINVIFVL

A0A438FJP6 Retrovirus-related Pol polyprotein from transposon TNT 1-941.4e-11648.28Show/hide
Query:  MAKSHSLPYSPSTTTYTAPLQLIVSDLWGPTYIPSANGYRYYISFVDVFSRYTWIYFLKSKSDAFAAFVKFKTHIEKLLNLPILQFQSDNGGEFLAFKPY
        + KSH+LP+  S T YT PLQL+VSDLWGP  I S+ G+ YY+SFVD +SRYTW+YFLK+KS    AF+ FK   E      +  FQ+D GGEF + K Y
Subjt:  MAKSHSLPYSPSTTTYTAPLQLIVSDLWGPTYIPSANGYRYYISFVDVFSRYTWIYFLKSKSDAFAAFVKFKTHIEKLLNLPILQFQSDNGGEFLAFKPY

Query:  LDSHGISHRFSCPHTSQQNGIAECKHRHIVDTGLALLSHSSMPLKFWDEAFSTAVFFINRLPSEVLHGKSPLDIIFCTHPDYSFLKTFGCQCFPCLRPYK
         + +GI HR SCPHTS+QNGI E KHRHIV+ GL LL+ +S+PLK+W +AFSTAVF INRLP+EVL  K P + +F + P+YS LK FGC CFP LRPY 
Subjt:  LDSHGISHRFSCPHTSQQNGIAECKHRHIVDTGLALLSHSSMPLKFWDEAFSTAVFFINRLPSEVLHGKSPLDIIFCTHPDYSFLKTFGCQCFPCLRPYK

Query:  THKLDYRSEPCVFIGYSSSHKGYRCLSANGRVYVSRNVLFNESIFPFS---QFPLQSITKS------------------PPSVSLPT------------L
         HKLD+RS PC F+GYSS HKGY+CL+  GR+++SR+V+F+E+ FPF+   Q P+Q ++ S                   PS+SLPT            L
Subjt:  THKLDYRSEPCVFIGYSSSHKGYRCLSANGRVYVSRNVLFNESIFPFS---QFPLQSITKS------------------PPSVSLPT------------L

Query:  GS-ILPVSSPIPSTSSSE---LSTDPAPIP---NIHASPSPGPLVEVTQDIPSSSSTTSPSVI-KNTHHMVTRGKSGIFKPKAFVSVSTSDYDNIEPPNV
        GS I  V   + +T SS    +  + A IP   N++A P   PL     D P+ S  T P    +  HHMVTR K+GIFKPK +    T D +  EP   
Subjt:  GS-ILPVSSPIPSTSSSE---LSTDPAPIP---NIHASPSPGPLVEVTQDIPSSSSTTSPSVI-KNTHHMVTRGKSGIFKPKAFVSVSTSDYDNIEPPNV

Query:  KEALKSSHWRKAMQDEFDALQKNETWELVPSPDHKKIVGCKWVFRIKRNSDGSISRYKARLVANTTTNLTFIDAWLFTFIDICVKKTSINVIFVL
        +EA+    W++AM +EF AL KN+TW LV  P ++  VGC+WVF++KRN DGS+SRYKARLVA   + +   D +  TF  + VK T+I V+  +
Subjt:  KEALKSSHWRKAMQDEFDALQKNETWELVPSPDHKKIVGCKWVFRIKRNSDGSISRYKARLVANTTTNLTFIDAWLFTFIDICVKKTSINVIFVL

A0A438K147 Retrovirus-related Pol polyprotein from transposon TNT 1-943.3e-11548.23Show/hide
Query:  MAKSHSLPYSPSTTTYTAPLQLIVSDLWGPTYIPSANGYRYYISFVDVFSRYTWIYFLKSKSDAFAAFVKFKTHIEKLLNLPILQFQSDNGGEFLAFKPY
        + K H  P+S S TTYT PL+LI SDLWGP  + S +GYRYYI FVD FSR++WI+ L++KS+A   FV FKT +E   +L I   Q+D GGEF AF+ Y
Subjt:  MAKSHSLPYSPSTTTYTAPLQLIVSDLWGPTYIPSANGYRYYISFVDVFSRYTWIYFLKSKSDAFAAFVKFKTHIEKLLNLPILQFQSDNGGEFLAFKPY

Query:  LDSHGISHRFSCPHTSQQNGIAECKHRHIVDTGLALLSHSSMPLKFWDEAFSTAVFFINRLPSEVLHGKSPLDIIFCTHPDYSFLKTFGCQCFPCLRPYK
        L  +GI HR SCPHT QQNG+AE KHR IV+ GL LL   S+PLKFWDE+F T V+  NRLP+ VLH K P++++F + PDYSFLK FGC CFP LRPY 
Subjt:  LDSHGISHRFSCPHTSQQNGIAECKHRHIVDTGLALLSHSSMPLKFWDEAFSTAVFFINRLPSEVLHGKSPLDIIFCTHPDYSFLKTFGCQCFPCLRPYK

Query:  THKLDYRSEPCVFIGYSSSHKGYRCLSANGRVYVSRNVLFNESIFPFSQFPLQS----ITKSP------PSVSLPTLGSIL------PVSSPIPSTSSSE
        THKL YRSE C F+GYS  HKGY+C+S+NGRVY+SR+V+FNE+ FP+S+    S     T SP      PS S P L   +      P+SS  P +    
Subjt:  THKLDYRSEPCVFIGYSSSHKGYRCLSANGRVYVSRNVLFNESIFPFSQFPLQS----ITKSP------PSVSLPTLGSIL------PVSSPIPSTSSSE

Query:  LSTDPAPIPN----------IHASPSPGPLVEVTQDIPSSSST-TSPSVIKNTHHMVTRGKSGIFKPKAFVSVSTSDYDNIEPPNVKEALKSSHWRKAMQ
        + +     PN          + ++P   P+  V   I  +S T T      NTH M+TR KSGI KPK F++         EP +V  AL+   W+KAM 
Subjt:  LSTDPAPIPN----------IHASPSPGPLVEVTQDIPSSSST-TSPSVIKNTHHMVTRGKSGIFKPKAFVSVSTSDYDNIEPPNVKEALKSSHWRKAMQ

Query:  DEFDALQKNETWELVPSPDHKKIVGCKWVFRIKRNSDGSISRYKARLVANTTTNLTFIDAWLFTFIDICVKKTSINVIFVL
         E+DALQ+N TW LVP P  ++ +GCKWV++ K N DG++ +YKARLVA         D +  TF  + VK ++I V+F +
Subjt:  DEFDALQKNETWELVPSPDHKKIVGCKWVFRIKRNSDGSISRYKARLVANTTTNLTFIDAWLFTFIDICVKKTSINVIFVL

A5BFT3 Integrase catalytic domain-containing protein7.4e-11547.61Show/hide
Query:  MAKSHSLPYSPSTTTYTAPLQLIVSDLWGPTYIPSANGYRYYISFVDVFSRYTWIYFLKSKSDAFAAFVKFKTHIEKLLNLPILQFQSDNGGEFLAFKPY
        + K H  P+S S TTYT PL+LI  DLWGPT + S +GYRYYI FVD FSR++WI+ L++KS+A   FV FKT +E   +L I   Q+D GGEF AF+ Y
Subjt:  MAKSHSLPYSPSTTTYTAPLQLIVSDLWGPTYIPSANGYRYYISFVDVFSRYTWIYFLKSKSDAFAAFVKFKTHIEKLLNLPILQFQSDNGGEFLAFKPY

Query:  LDSHGISHRFSCPHTSQQNGIAECKHRHIVDTGLALLSHSSMPLKFWDEAFSTAVFFINRLPSEVLHGKSPLDIIFCTHPDYSFLKTFGCQCFPCLRPYK
        L  +GI HR SCPHT QQNG+AE KHR IV+ GL LL  +S+PLKFWDE+F T V+  NRLP+ +LH K P++++F + PDYSFLK FGC CFP LRPY 
Subjt:  LDSHGISHRFSCPHTSQQNGIAECKHRHIVDTGLALLSHSSMPLKFWDEAFSTAVFFINRLPSEVLHGKSPLDIIFCTHPDYSFLKTFGCQCFPCLRPYK

Query:  THKLDYRSEPCVFIGYSSSHKGYRCLSANGRVYVSRNVLFNESIFPFSQFPLQS----ITKSP------PSVSLPTLGSIL------PVSSPIPSTSSSE
        THKL YRSE C F+GYS  HKGY+C+S+NGRVY+S +V+FNE+ FP+S+    S     T SP      PS S P L   +      P+SS  P +    
Subjt:  THKLDYRSEPCVFIGYSSSHKGYRCLSANGRVYVSRNVLFNESIFPFSQFPLQS----ITKSP------PSVSLPTLGSIL------PVSSPIPSTSSSE

Query:  LSTDPAPIPN----------IHASPSPGPLVEVTQDIPSSSST-TSPSVIKNTHHMVTRGKSGIFKPKAFVSVSTSDYDNIEPPNVKEALKSSHWRKAMQ
        + +     PN          + ++P   P+  V   I  +S T T      NTH M+TR KSGI KPK F++         EP +V  AL+   W+KAM 
Subjt:  LSTDPAPIPN----------IHASPSPGPLVEVTQDIPSSSST-TSPSVIKNTHHMVTRGKSGIFKPKAFVSVSTSDYDNIEPPNVKEALKSSHWRKAMQ

Query:  DEFDALQKNETWELVPSPDHKKIVGCKWVFRIKRNSDGSISRYKARLVANTTTNLTFIDAWLFTFIDICVKKTSINVIFVL
         E+DALQ+N TW LVP P  ++ +GCKWV++ K N DG++ +YKARLVA         D +  TF  + VK +++ V+F +
Subjt:  DEFDALQKNETWELVPSPDHKKIVGCKWVFRIKRNSDGSISRYKARLVANTTTNLTFIDAWLFTFIDICVKKTSINVIFVL

SwissProt top hitse value%identityAlignment
P04146 Copia protein7.8e-2925.39Show/hide
Query:  KSHSLPYS--PSTTTYTAPLQLIVSDLWGPTYIPSANGYRYYISFVDVFSRYTWIYFLKSKSDAFAAFVKFKTHIEKLLNLPILQFQSDNGGEFLA--FK
        K   LP+      T    PL ++ SD+ GP    + +   Y++ FVD F+ Y   Y +K KSD F+ F  F    E   NL ++    DNG E+L+   +
Subjt:  KSHSLPYS--PSTTTYTAPLQLIVSDLWGPTYIPSANGYRYYISFVDVFSRYTWIYFLKSKSDAFAAFVKFKTHIEKLLNLPILQFQSDNGGEFLA--FK

Query:  PYLDSHGISHRFSCPHTSQQNGIAECKHRHIVDTGLALLSHSSMPLKFWDEAFSTAVFFINRLPSEVL--HGKSPLDIIFCTHPDYSFLKTFGCQCFPCL
         +    GIS+  + PHT Q NG++E   R I +    ++S + +   FW EA  TA + INR+PS  L    K+P ++     P    L+ FG   +  +
Subjt:  PYLDSHGISHRFSCPHTSQQNGIAECKHRHIVDTGLALLSHSSMPLKFWDEAFSTAVFFINRLPSEVL--HGKSPLDIIFCTHPDYSFLKTFGCQCFPCL

Query:  RPYKTHKLDYRSEPCVFIGYSSSHKGYRCLSA-NGRVYVSRNVLFNESIF----------PFSQFPLQSITKSPPSVSLPTLGSILPVSSP-------IP
        +  K  K D +S   +F+GY  +  G++   A N +  V+R+V+ +E+             F +   +S  K+ P+ S   + +  P  S        + 
Subjt:  RPYKTHKLDYRSEPCVFIGYSSSHKGYRCLSA-NGRVYVSRNVLFNESIF----------PFSQFPLQSITKSPPSVSLPTLGSILPVSSP-------IP

Query:  STSSSELSTDPAPIPNIHASPSPGPLVEV-------------------------TQDIPSSSSTTSPSVIKNTHHMVTRGKSGIFKP-------------
         +  SE    P     I  +  P    E                             +  S  + +P+  + +       + GI  P             
Subjt:  STSSSELSTDPAPIPNIHASPSPGPLVEV-------------------------TQDIPSSSSTTSPSVIKNTHHMVTRGKSGIFKP-------------

Query:  ---KAFVSVSTSDYDNI-------------EPPNVKEALK----SSHWRKAMQDEFDALQKNETWELVPSPDHKKIVGCKWVFRIKRNSDGSISRYKARL
           K    +S ++ DN              + PN  + ++     S W +A+  E +A + N TW +   P++K IV  +WVF +K N  G+  RYKARL
Subjt:  ---KAFVSVSTSDYDNI-------------EPPNVKEALK----SSHWRKAMQDEFDALQKNETWELVPSPDHKKIVGCKWVFRIKRNSDGSISRYKARL

Query:  VANTTTNLTFID
        VA   T    ID
Subjt:  VANTTTNLTFID

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-944.4e-5633.04Show/hide
Query:  KSHSLPYSPSTTTYTAPLQLIVSDLWGPTYIPSANGYRYYISFVDVFSRYTWIYFLKSKSDAFAAFVKFKTHIEKLLNLPILQFQSDNGGEFLA--FKPY
        K H + +  S+      L L+ SD+ GP  I S  G +Y+++F+D  SR  W+Y LK+K   F  F KF   +E+     + + +SDNGGE+ +  F+ Y
Subjt:  KSHSLPYSPSTTTYTAPLQLIVSDLWGPTYIPSANGYRYYISFVDVFSRYTWIYFLKSKSDAFAAFVKFKTHIEKLLNLPILQFQSDNGGEFLA--FKPY

Query:  LDSHGISHRFSCPHTSQQNGIAECKHRHIVDTGLALLSHSSMPLKFWDEAFSTAVFFINRLPSEVLHGKSPLDIIFCTHPDYSFLKTFGCQCFPCLRPYK
          SHGI H  + P T Q NG+AE  +R IV+   ++L  + +P  FW EA  TA + INR PS  L  + P  +       YS LK FGC+ F  +   +
Subjt:  LDSHGISHRFSCPHTSQQNGIAECKHRHIVDTGLALLSHSSMPLKFWDEAFSTAVFFINRLPSEVLHGKSPLDIIFCTHPDYSFLKTFGCQCFPCLRPYK

Query:  THKLDYRSEPCVFIGYSSSHKGYRCLS-ANGRVYVSRNVLFNESIFPFSQFPLQSITKSPPSVSLPTLGSILPVSSPIPSTSSSELSTDPAPIPNIHASP
          KLD +S PC+FIGY     GYR       +V  SR+V+F E           S  ++   +S      I+P    IPSTS++  S +           
Subjt:  THKLDYRSEPCVFIGYSSSHKGYRCLS-ANGRVYVSRNVLFNESIFPFSQFPLQSITKSPPSVSLPTLGSILPVSSPIPSTSSSELSTDPAPIPNIHASP

Query:  SPGPLVEVTQDIPSS-SSTTSPSVIKNTHHMVTRGKSGIFKPKAFVSVS-TSDYDNIEPPNVKEAL---KSSHWRKAMQDEFDALQKNETWELVPSPDHK
         PG ++E  + +         P+  +  H  + R +    + + + S       D+ EP ++KE L   + +   KAMQ+E ++LQKN T++LV  P  K
Subjt:  SPGPLVEVTQDIPSS-SSTTSPSVIKNTHHMVTRGKSGIFKPKAFVSVS-TSDYDNIEPPNVKEAL---KSSHWRKAMQDEFDALQKNETWELVPSPDHK

Query:  KIVGCKWVFRIKRNSDGSISRYKARLVANTTTNLTFIDAWLFTFIDICVKKTSINVIFVL
        + + CKWVF++K++ D  + RYKARLV         ID          VK TSI  I  L
Subjt:  KIVGCKWVFRIKRNSDGSISRYKARLVANTTTNLTFIDAWLFTFIDICVKKTSINVIFVL

P92520 Uncharacterized mitochondrial protein AtMg008206.2e-1850Show/hide
Query:  MVTRGKSGIFKPKAFVSVSTSDYDNIEPPNVKEALKSSHWRKAMQDEFDALQKNETWELVPSPDHKKIVGCKWVFRIKRNSDGSISRYKARLVA
        M+TR K+GI K     S++ +     EP +V  ALK   W +AMQ+E DAL +N+TW LVP P ++ I+GCKWVF+ K +SDG++ R KARLVA
Subjt:  MVTRGKSGIFKPKAFVSVSTSDYDNIEPPNVKEALKSSHWRKAMQDEFDALQKNETWELVPSPDHKKIVGCKWVFRIKRNSDGSISRYKARLVA

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE16.7e-8938.55Show/hide
Query:  KSHSLPYSPSTTTYTAPLQLIVSDLWGPTYIPSANGYRYYISFVDVFSRYTWIYFLKSKSDAFAAFVKFKTHIEKLLNLPILQFQSDNGGEFLAFKPYLD
        KS+ +P+S ST   T PL+ I SD+W  + I S + YRYY+ FVD F+RYTW+Y LK KS     F+ FK  +E      I  F SDNGGEF+A   Y  
Subjt:  KSHSLPYSPSTTTYTAPLQLIVSDLWGPTYIPSANGYRYYISFVDVFSRYTWIYFLKSKSDAFAAFVKFKTHIEKLLNLPILQFQSDNGGEFLAFKPYLD

Query:  SHGISHRFSCPHTSQQNGIAECKHRHIVDTGLALLSHSSMPLKFWDEAFSTAVFFINRLPSEVLHGKSPLDIIFCTHPDYSFLKTFGCQCFPCLRPYKTH
         HGISH  S PHT + NG++E KHRHIV+TGL LLSH+S+P  +W  AF+ AV+ INRLP+ +L  +SP   +F T P+Y  L+ FGC C+P LRPY  H
Subjt:  SHGISHRFSCPHTSQQNGIAECKHRHIVDTGLALLSHSSMPLKFWDEAFSTAVFFINRLPSEVLHGKSPLDIIFCTHPDYSFLKTFGCQCFPCLRPYKTH

Query:  KLDYRSEPCVFIGYSSSHKGYRCLS-ANGRVYVSRNVLFNESIFPFSQF-----PLQSITKS-----PPSVSLPTLGSILPV------------------
        KLD +S  CVF+GYS +   Y CL     R+Y+SR+V F+E+ FPFS +     P+Q   +       P  +LPT   +LP                   
Subjt:  KLDYRSEPCVFIGYSSSHKGYRCLS-ANGRVYVSRNVLFNESIFPFSQF-----PLQSITKS-----PPSVSLPTLGSILPV------------------

Query:  --------SSPIPSTSSSELSTDPAP---------------------------------------------IPNIHASPSPGPLVEVTQDIPSSSSTTSP
                SS + S+ SS   + P P                                              P   +S SP P    T    SS+S T P
Subjt:  --------SSPIPSTSSSELSTDPAP---------------------------------------------IPNIHASPSPGPLVEVTQDIPSSSSTTSP

Query:  SVI------------------KNTHHMVTRGKSGIFKPKAFVSVSTSDYDNIEPPNVKEALKSSHWRKAMQDEFDALQKNETWELV-PSPDHKKIVGCKW
        S++                   NTH M TR K+GI KP    S++ S     EP    +ALK   WR AM  E +A   N TW+LV P P H  IVGC+W
Subjt:  SVI------------------KNTHHMVTRGKSGIFKPKAFVSVSTSDYDNIEPPNVKEALKSSHWRKAMQDEFDALQKNETWELV-PSPDHKKIVGCKW

Query:  VFRIKRNSDGSISRYKARLVANTTTNLTFIDAWLFTFIDICVKKTSINVI
        +F  K NSDGS++RYKARLVA        +D +  TF  + +K TSI ++
Subjt:  VFRIKRNSDGSISRYKARLVANTTTNLTFIDAWLFTFIDICVKKTSINVI

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE22.2e-8738.84Show/hide
Query:  KSHSLPYSPSTTTYTAPLQLIVSDLWGPTYIPSANGYRYYISFVDVFSRYTWIYFLKSKSDAFAAFVKFKTHIEKLLNLPILQFQSDNGGEFLAFKPYLD
        KSH +P+S ST T + PL+ I SD+W  + I S + YRYY+ FVD F+RYTW+Y LK KS     F+ FK+ +E      I    SDNGGEF+  + YL 
Subjt:  KSHSLPYSPSTTTYTAPLQLIVSDLWGPTYIPSANGYRYYISFVDVFSRYTWIYFLKSKSDAFAAFVKFKTHIEKLLNLPILQFQSDNGGEFLAFKPYLD

Query:  SHGISHRFSCPHTSQQNGIAECKHRHIVDTGLALLSHSSMPLKFWDEAFSTAVFFINRLPSEVLHGKSPLDIIFCTHPDYSFLKTFGCQCFPCLRPYKTH
         HGISH  S PHT + NG++E KHRHIV+ GL LLSH+S+P  +W  AFS AV+ INRLP+ +L  +SP   +F   P+Y  LK FGC C+P LRPY  H
Subjt:  SHGISHRFSCPHTSQQNGIAECKHRHIVDTGLALLSHSSMPLKFWDEAFSTAVFFINRLPSEVLHGKSPLDIIFCTHPDYSFLKTFGCQCFPCLRPYKTH

Query:  KLDYRSEPCVFIGYSSSHKGYRCLS-ANGRVYVSRNVLFNESIFPFS----------QFPLQSITKSPPSVSLPTLGSILPV------------------
        KL+ +S+ C F+GYS +   Y CL    GR+Y SR+V F+E  FPFS          +    S    P   +LPT   +LP                   
Subjt:  KLDYRSEPCVFIGYSSSHKGYRCLS-ANGRVYVSRNVLFNESIFPFS----------QFPLQSITKSPPSVSLPTLGSILPV------------------

Query:  ---------SSPIPSTS-SSELSTDP---------------------------------APIPNIHASPSPGPLVEVTQ-------------DIPSSSST
                 SS +PS+S SS  S++P                                 +P PN     SP P   ++              + PSSSST
Subjt:  ---------SSPIPSTS-SSELSTDP---------------------------------APIPNIHASPSPGPLVEVTQ-------------DIPSSSST

Query:  T---------SPSVIK-------NTHHMVTRGKSGIFKPKAFVSVSTSDYDNIEPPNVKEALKSSHWRKAMQDEFDALQKNETWELVPSPDHK-KIVGCK
        +         +P +I+       NTH M TR K GI KP    S +TS   N EP    +A+K   WR+AM  E +A   N TW+LVP P     IVGC+
Subjt:  T---------SPSVIK-------NTHHMVTRGKSGIFKPKAFVSVSTSDYDNIEPPNVKEALKSSHWRKAMQDEFDALQKNETWELVPSPDHK-KIVGCK

Query:  WVFRIKRNSDGSISRYKARLVANTTTNLTFIDAWLFTFIDICVKKTSINVI
        W+F  K NSDGS++RYKARLVA        +D +  TF  + +K TSI ++
Subjt:  WVFRIKRNSDGSISRYKARLVANTTTNLTFIDAWLFTFIDICVKKTSINVI

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 85.6e-1433.86Show/hide
Query:  VSSPIPSTSSSELSTDPA-------PIPNIHASPSPGPLVEVTQDIPSSSSTTSPSVIKNTHHMVTRGKSGIFKPKAFVSVSTSDYDNIEPPNVKEALKS
        VS    STSSS +   P+       P P++H S          QD    S   +   I +    ++  K         V ++ +     EP    EA + 
Subjt:  VSSPIPSTSSSELSTDPA-------PIPNIHASPSPGPLVEVTQDIPSSSSTTSPSVIKNTHHMVTRGKSGIFKPKAFVSVSTSDYDNIEPPNVKEALKS

Query:  SHWRKAMQDEFDALQKNETWELVPSPDHKKIVGCKWVFRIKRNSDGSISRYKARLVANTTTNLTFIDAWLFTFIDICVKKTSINVIFVL
          W  AM DE  A++   TWE+   P +KK +GCKWV++IK NSDG+I RYKARLVA   T    ID ++ TF  +C K TS+ +I  +
Subjt:  SHWRKAMQDEFDALQKNETWELVPSPDHKKIVGCKWVFRIKRNSDGSISRYKARLVANTTTNLTFIDAWLFTFIDICVKKTSINVIFVL

ATMG00710.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein3.6e-0532.35Show/hide
Query:  HRHIVDTGLALLSHSSMPLKFWDEAFSTAVFFINRLPSEVLHGKSPLDIIFCTHPDYSFLKTFGCQCF
        +R I++   ++L    +P  F  +A +TAV  IN+ PS  ++   P ++ F + P YS+L+ FGC  +
Subjt:  HRHIVDTGLALLSHSSMPLKFWDEAFSTAVFFINRLPSEVLHGKSPLDIIFCTHPDYSFLKTFGCQCF

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)4.4e-1950Show/hide
Query:  MVTRGKSGIFKPKAFVSVSTSDYDNIEPPNVKEALKSSHWRKAMQDEFDALQKNETWELVPSPDHKKIVGCKWVFRIKRNSDGSISRYKARLVA
        M+TR K+GI K     S++ +     EP +V  ALK   W +AMQ+E DAL +N+TW LVP P ++ I+GCKWVF+ K +SDG++ R KARLVA
Subjt:  MVTRGKSGIFKPKAFVSVSTSDYDNIEPPNVKEALKSSHWRKAMQDEFDALQKNETWELVPSPDHKKIVGCKWVFRIKRNSDGSISRYKARLVA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTAAATCTCACTCTCTCCCTTACTCTCCCTCTACTACCACTTACACAGCTCCTCTTCAATTAATTGTGTCTGACTTGTGGGGTCCTACATATATTCCTTCAGCAAA
TGGTTATCGGTATTATATTAGCTTTGTGGATGTTTTTAGTCGGTACACTTGGATATATTTTTTAAAATCCAAGTCCGATGCCTTTGCTGCTTTTGTTAAATTTAAAACCC
ACATTGAAAAGCTTCTCAATTTGCCTATTCTTCAATTTCAATCTGATAATGGTGGAGAGTTTCTGGCTTTCAAACCTTATTTAGATTCTCATGGCATTTCACATCGATTT
TCTTGTCCTCACACTTCACAACAGAACGGGATTGCTGAGTGCAAGCACAGGCACATTGTCGATACTGGCCTAGCCTTGCTATCTCACTCCTCCATGCCTTTAAAATTCTG
GGATGAAGCGTTCTCAACCGCTGTATTTTTTATAAATAGGCTGCCCTCTGAAGTTCTTCATGGCAAGAGTCCCTTGGATATCATCTTCTGCACTCACCCTGATTATTCTT
TTCTTAAAACCTTCGGATGTCAGTGCTTTCCTTGTTTACGACCATATAAAACTCATAAGCTTGACTATAGGTCTGAACCTTGTGTTTTCATAGGATACAGTTCTTCTCAT
AAGGGTTATCGTTGCTTATCTGCTAATGGTAGGGTCTATGTTTCAAGGAATGTTTTATTTAATGAGTCCATTTTTCCCTTTTCCCAATTTCCATTGCAGTCTATCACCAA
GTCTCCTCCCTCTGTCTCTTTACCTACTCTTGGGTCTATTTTACCTGTCTCTTCACCTATTCCATCTACTTCATCCTCCGAGTTAAGTACGGATCCTGCTCCAATTCCCA
ATATACATGCTTCCCCATCTCCAGGTCCTTTGGTTGAGGTTACTCAGGATATTCCTTCTTCCTCTTCCACAACTTCTCCTTCTGTTATTAAAAACACTCATCATATGGTT
ACTAGAGGCAAGAGTGGCATATTCAAACCCAAAGCTTTTGTCTCTGTTTCTACTTCAGATTATGATAATATTGAACCTCCTAATGTCAAGGAGGCCTTAAAATCTTCTCA
TTGGCGAAAGGCAATGCAGGATGAATTTGATGCTTTACAAAAGAATGAAACATGGGAATTGGTTCCTTCTCCGGATCATAAGAAAATTGTCGGTTGTAAATGGGTGTTTC
GCATTAAACGAAACTCGGATGGCTCTATTTCTCGCTACAAAGCTCGTCTTGTAGCTAACACTACTACAAATTTGACCTTTATTGACGCTTGGCTTTTTACATTTATTGAC
ATTTGTGTAAAAAAAACGTCAATAAACGTAATTTTTGTACTTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCTAAATCTCACTCTCTCCCTTACTCTCCCTCTACTACCACTTACACAGCTCCTCTTCAATTAATTGTGTCTGACTTGTGGGGTCCTACATATATTCCTTCAGCAAA
TGGTTATCGGTATTATATTAGCTTTGTGGATGTTTTTAGTCGGTACACTTGGATATATTTTTTAAAATCCAAGTCCGATGCCTTTGCTGCTTTTGTTAAATTTAAAACCC
ACATTGAAAAGCTTCTCAATTTGCCTATTCTTCAATTTCAATCTGATAATGGTGGAGAGTTTCTGGCTTTCAAACCTTATTTAGATTCTCATGGCATTTCACATCGATTT
TCTTGTCCTCACACTTCACAACAGAACGGGATTGCTGAGTGCAAGCACAGGCACATTGTCGATACTGGCCTAGCCTTGCTATCTCACTCCTCCATGCCTTTAAAATTCTG
GGATGAAGCGTTCTCAACCGCTGTATTTTTTATAAATAGGCTGCCCTCTGAAGTTCTTCATGGCAAGAGTCCCTTGGATATCATCTTCTGCACTCACCCTGATTATTCTT
TTCTTAAAACCTTCGGATGTCAGTGCTTTCCTTGTTTACGACCATATAAAACTCATAAGCTTGACTATAGGTCTGAACCTTGTGTTTTCATAGGATACAGTTCTTCTCAT
AAGGGTTATCGTTGCTTATCTGCTAATGGTAGGGTCTATGTTTCAAGGAATGTTTTATTTAATGAGTCCATTTTTCCCTTTTCCCAATTTCCATTGCAGTCTATCACCAA
GTCTCCTCCCTCTGTCTCTTTACCTACTCTTGGGTCTATTTTACCTGTCTCTTCACCTATTCCATCTACTTCATCCTCCGAGTTAAGTACGGATCCTGCTCCAATTCCCA
ATATACATGCTTCCCCATCTCCAGGTCCTTTGGTTGAGGTTACTCAGGATATTCCTTCTTCCTCTTCCACAACTTCTCCTTCTGTTATTAAAAACACTCATCATATGGTT
ACTAGAGGCAAGAGTGGCATATTCAAACCCAAAGCTTTTGTCTCTGTTTCTACTTCAGATTATGATAATATTGAACCTCCTAATGTCAAGGAGGCCTTAAAATCTTCTCA
TTGGCGAAAGGCAATGCAGGATGAATTTGATGCTTTACAAAAGAATGAAACATGGGAATTGGTTCCTTCTCCGGATCATAAGAAAATTGTCGGTTGTAAATGGGTGTTTC
GCATTAAACGAAACTCGGATGGCTCTATTTCTCGCTACAAAGCTCGTCTTGTAGCTAACACTACTACAAATTTGACCTTTATTGACGCTTGGCTTTTTACATTTATTGAC
ATTTGTGTAAAAAAAACGTCAATAAACGTAATTTTTGTACTTTAA
Protein sequenceShow/hide protein sequence
MAKSHSLPYSPSTTTYTAPLQLIVSDLWGPTYIPSANGYRYYISFVDVFSRYTWIYFLKSKSDAFAAFVKFKTHIEKLLNLPILQFQSDNGGEFLAFKPYLDSHGISHRF
SCPHTSQQNGIAECKHRHIVDTGLALLSHSSMPLKFWDEAFSTAVFFINRLPSEVLHGKSPLDIIFCTHPDYSFLKTFGCQCFPCLRPYKTHKLDYRSEPCVFIGYSSSH
KGYRCLSANGRVYVSRNVLFNESIFPFSQFPLQSITKSPPSVSLPTLGSILPVSSPIPSTSSSELSTDPAPIPNIHASPSPGPLVEVTQDIPSSSSTTSPSVIKNTHHMV
TRGKSGIFKPKAFVSVSTSDYDNIEPPNVKEALKSSHWRKAMQDEFDALQKNETWELVPSPDHKKIVGCKWVFRIKRNSDGSISRYKARLVANTTTNLTFIDAWLFTFID
ICVKKTSINVIFVL