; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0028183 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0028183
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon RE1
Genome locationchr8:15048011..15055057
RNA-Seq ExpressionLag0028183
SyntenyLag0028183
Gene Ontology termsGO:0044238 - primary metabolic process (biological process)
GO:0044260 - cellular macromolecule metabolic process (biological process)
GO:0016020 - membrane (cellular component)
InterPro domainsIPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
EXC35359.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Morus notabilis]3.0e-10652.04Show/hide
Query:  NLGQCLHCHAKAPQ--NRTPIVYTRRKEATLQNQECQ------------PPEPNSSSLNSTDENITFMAPVVDDLTLPIAQRKA---------GQY----
        N+   +   + APQ  N    VY RRK+        Q            PPE N      TD       P +DD TLPIA RK          G Y    
Subjt:  NLGQCLHCHAKAPQ--NRTPIVYTRRKEATLQNQECQ------------PPEPNSSSLNSTDENITFMAPVVDDLTLPIAQRKA---------GQY----

Query:  ----------------SHSKTIHDALKDPKQKKAVDEEIRALESNGTWTLTQPPYGKNLIGCKWIFTIKYKSDGSVERFKVGLVAKGFTQSYGVDVGHAF
                            TIH+AL++ + KKAV +EI ALE NGTWT+T  P GK  +GCKWIFTIKYK+DGSVERFK  LVA+GFTQSYG       
Subjt:  ----------------SHSKTIHDALKDPKQKKAVDEEIRALESNGTWTLTQPPYGKNLIGCKWIFTIKYKSDGSVERFKVGLVAKGFTQSYGVDVGHAF

Query:  VCKIKEIELLLIQIVSTRVFYTPSIDYQEIFALVAKLNTVHVLLSLAVNQDWSLHQLDVKKTFLNGDLEEEVYMTIPPGMEDKSNSNLVCKLRKSLYGLK
                                IDYQE FA VAKLNT+ +LLSLAVNQDW L QLD+K  FLNGDLEEEVYM IPPG E     N VCKLRKSLYGLK
Subjt:  VCKIKEIELLLIQIVSTRVFYTPSIDYQEIFALVAKLNTVHVLLSLAVNQDWSLHQLDVKKTFLNGDLEEEVYMTIPPGMEDKSNSNLVCKLRKSLYGLK

Query:  QSPRAWFDIFTKTLIKNGYYQSQADHTLFVKSSN-NKTAILIVYVDDIIITRHDIKEILNLKRMLATKFEIKDLGGLRYFLGMEVARSNNGIIISQRKYI
        QSPRAWFD FTK ++K GY Q Q+DHTLFVK S+  K AILIVYVDDII++ +D+KE+  LK+ L+ +FE+KDLG L+YFLGMEVARS+ GI++SQRKYI
Subjt:  QSPRAWFDIFTKTLIKNGYYQSQADHTLFVKSSN-NKTAILIVYVDDIIITRHDIKEILNLKRMLATKFEIKDLGGLRYFLGMEVARSNNGIIISQRKYI

Query:  LDLLKETRNLGCRPAETSMDPNLTL-HQSEVIPVDKGMYQRL
        LDLLKET  LGC+P +T MD    L  + E  PVD+G YQRL
Subjt:  LDLLKETRNLGCRPAETSMDPNLTL-HQSEVIPVDKGMYQRL

RVW29719.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera]7.1e-10856.61Show/hide
Query:  QPPEPNSSSLN----STDENITFMAPVVDDLTLPIAQRKAGQYSHSKTIHDALKDPKQKKAVDEEIRALESNGTWTLTQPPYGKNLIGCKWIFTIKYKSD
        QP  P  ++ N      D +   + P +DD TLPIA RK G    S TI +ALK  + KKAV +EI ALE NGTWT+T  P GK  +GCKWIFTIKYK+D
Subjt:  QPPEPNSSSLN----STDENITFMAPVVDDLTLPIAQRKAGQYSHSKTIHDALKDPKQKKAVDEEIRALESNGTWTLTQPPYGKNLIGCKWIFTIKYKSD

Query:  GSVERFKVGLVAKGFTQSYGVDVGHAFVCKIKEIELLLIQIVSTRVFYTPSIDYQEIFALVAKLNTVHVLLSLAVNQDWSLHQLDVKKTFLNGDLEEEVY
        GSVERFK  LVA+GFTQSYG                               IDYQE FA VAKLNT+ +LLSLAVNQDW L QLD+K  FLNGDLEEEVY
Subjt:  GSVERFKVGLVAKGFTQSYGVDVGHAFVCKIKEIELLLIQIVSTRVFYTPSIDYQEIFALVAKLNTVHVLLSLAVNQDWSLHQLDVKKTFLNGDLEEEVY

Query:  MTIPPGMEDKSNSNLVCKLRKSLYGLKQSPRAWFDIFTKTLIKNGYYQSQADHTLFVKSSN-NKTAILIVYVDDIIITRHDIKEILNLKRMLATKFEIKD
        M IPPG E+    N VCKL+KSLYGLKQSPRAWFD FTK ++K GY Q QADHTLFVK S+  K AILIVYVDDII++ +D+ E+ NLK+ L+ +FE+KD
Subjt:  MTIPPGMEDKSNSNLVCKLRKSLYGLKQSPRAWFDIFTKTLIKNGYYQSQADHTLFVKSSN-NKTAILIVYVDDIIITRHDIKEILNLKRMLATKFEIKD

Query:  LGGLRYFLGMEVARSNNGIIISQRKYILDLLKETRNLGCRPAETSMDPNLTLH-QSEVIPVDKGMYQRLTIVPMFGVTWSLGEVKNKLLLQEAMQKRNTE
        LG L+YFLGMEVARS  GI++SQRKYILDLLKET  LGC+P +T MD    L  + E  PVD+G YQRL  V      ++LG++   L LQ+  + R+TE
Subjt:  LGGLRYFLGMEVARSNNGIIISQRKYILDLLKETRNLGCRPAETSMDPNLTLH-QSEVIPVDKGMYQRLTIVPMFGVTWSLGEVKNKLLLQEAMQKRNTE

Query:  L
        +
Subjt:  L

RVW60936.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera]1.9e-10556.74Show/hide
Query:  MAPVVDDLTLPIAQRKAGQYSHSKTIHDALKDPKQKKAVDEEIRALESNGTWTLTQPPYGKNLIGCKWIFTIKYKSDGSVERFKVGLVAKGFTQSYGVDV
        + P +DD TLPIA RK        TI +ALK  + KKAV +EI ALE NGTWT+T  P GK  +GCKWIFTIKYK+DGSVERFK  LVA+GFTQSYG   
Subjt:  MAPVVDDLTLPIAQRKAGQYSHSKTIHDALKDPKQKKAVDEEIRALESNGTWTLTQPPYGKNLIGCKWIFTIKYKSDGSVERFKVGLVAKGFTQSYGVDV

Query:  GHAFVCKIKEIELLLIQIVSTRVFYTPSIDYQEIFALVAKLNTVHVLLSLAVNQDWSLHQLDVKKTFLNGDLEEEVYMTIPPGMEDKSNSNLVCKLRKSL
                                    IDYQE FA VAKLNT+ +LLSLAVNQDW L QLD+K  FLN DLEEEVYM IPPG E+    N VCKL+KSL
Subjt:  GHAFVCKIKEIELLLIQIVSTRVFYTPSIDYQEIFALVAKLNTVHVLLSLAVNQDWSLHQLDVKKTFLNGDLEEEVYMTIPPGMEDKSNSNLVCKLRKSL

Query:  YGLKQSPRAWFDIFTKTLIKNGYYQSQADHTLFVKSSN-NKTAILIVYVDDIIITRHDIKEILNLKRMLATKFEIKDLGGLRYFLGMEVARSNNGIIISQ
        YGLKQSPRAWFD FTK ++K GY Q QADHTLFVK S+  K AILIVYVDDII++ +D+ E+ NLK+ L+ +FE+KDLG L+YFLGMEVARS  GI++SQ
Subjt:  YGLKQSPRAWFDIFTKTLIKNGYYQSQADHTLFVKSSN-NKTAILIVYVDDIIITRHDIKEILNLKRMLATKFEIKDLGGLRYFLGMEVARSNNGIIISQ

Query:  RKYILDLLKETRNLGCRPAETSMDPNLTLH-QSEVIPVDKGMYQRLT--------IVPMFGVTWSLGEVKNKLLLQEAMQKRNTEL
        RKYILDLLKET  LGC+P +T MD    L  + E  PVD+G YQRL           P  G   S G    K    E  + R+TE+
Subjt:  RKYILDLLKETRNLGCRPAETSMDPNLTLH-QSEVIPVDKGMYQRLT--------IVPMFGVTWSLGEVKNKLLLQEAMQKRNTEL

RVW83276.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]7.4e-10551.32Show/hide
Query:  VFFGHLQDSKMIPNLG--QCLH-CHAKAPQNRTPI-VYTRRKEATLQN---QECQPPEPNSSSLNSTDENI------TFMAPVVDDLTLPIAQRKA----
        V+    Q  +++PN     C H C  ++ Q  T + +  RRK   L++     C     ++SSL   +ENI        + P +DD TLPIA RK     
Subjt:  VFFGHLQDSKMIPNLG--QCLH-CHAKAPQNRTPI-VYTRRKEATLQN---QECQPPEPNSSSLNSTDENI------TFMAPVVDDLTLPIAQRKA----

Query:  -----GQY--------------------SHSKTIHDALKDPKQKKAVDEEIRALESNGTWTLTQPPYGKNLIGCKWIFTIKYKSDGSVERFKVGLVAKGF
             G Y                        TI +ALK  + KKAV +EI ALE NGTWT+T  P GK  +GCKWIFTIKYK+DGSVERFK  LVA+GF
Subjt:  -----GQY--------------------SHSKTIHDALKDPKQKKAVDEEIRALESNGTWTLTQPPYGKNLIGCKWIFTIKYKSDGSVERFKVGLVAKGF

Query:  TQSYGVDVGHAFVCKIKEIELLLIQIVSTRVFYTPSIDYQEIFALVAKLNTVHVLLSLAVNQDWSLHQLDVKKTFLNGDLEEEVYMTIPPGMEDKSNSNL
        TQSYG                               IDYQE FA VAKLNT+ +LLSLAVNQDW L QLD+K  FLNGDLEEEVYM IPPG E+    N 
Subjt:  TQSYGVDVGHAFVCKIKEIELLLIQIVSTRVFYTPSIDYQEIFALVAKLNTVHVLLSLAVNQDWSLHQLDVKKTFLNGDLEEEVYMTIPPGMEDKSNSNL

Query:  VCKLRKSLYGLKQSPRAWFDIFTKTLIKNGYYQSQADHTLFVKSSN-NKTAILIVYVDDIIITRHDIKEILNLKRMLATKFEIKDLGGLRYFLGMEVARS
        VCKL+KSLYGLKQSPRAWFD FTK ++K GY Q QADHTLFVK S+  K AILIVYVDDII++ +D++E+ NLK+ L+ +FE+KDLG L+YFLGMEVARS
Subjt:  VCKLRKSLYGLKQSPRAWFDIFTKTLIKNGYYQSQADHTLFVKSSN-NKTAILIVYVDDIIITRHDIKEILNLKRMLATKFEIKDLGGLRYFLGMEVARS

Query:  NNGIIISQRKYILDLLKETRNLGCRPAETSMDPNLTLH-QSEVIPVDKGMYQRL
          GI++SQRKYILDLLKET  LGC+P +T MD    L  + E  PVD+G YQRL
Subjt:  NNGIIISQRKYILDLLKETRNLGCRPAETSMDPNLTLH-QSEVIPVDKGMYQRL

RVW85288.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera]3.0e-10657Show/hide
Query:  APQ--NRTPIVY-TRRKEATLQN---QECQPPEPNSSSLNSTDENI------TFMAPVVDDLTLPIAQRKAGQYSHSKTIHDALKDPKQKKAVDEEIRAL
        APQ  N    VY  RRK   L++     C     ++SSL   +ENI        + P +DD TLPIA RK        TI +ALK  + KKAV +EI AL
Subjt:  APQ--NRTPIVY-TRRKEATLQN---QECQPPEPNSSSLNSTDENI------TFMAPVVDDLTLPIAQRKAGQYSHSKTIHDALKDPKQKKAVDEEIRAL

Query:  ESNGTWTLTQPPYGKNLIGCKWIFTIKYKSDGSVERFKVGLVAKGFTQSYGVDVGHAFVCKIKEIELLLIQIVSTRVFYTPSIDYQEIFALVAKLNTVHV
        E NGTWT+T  P GK  +GCKWIFTIKYK+DGSVERFK  LVA+GFTQSYG                               IDYQE FA VAKLNT+ +
Subjt:  ESNGTWTLTQPPYGKNLIGCKWIFTIKYKSDGSVERFKVGLVAKGFTQSYGVDVGHAFVCKIKEIELLLIQIVSTRVFYTPSIDYQEIFALVAKLNTVHV

Query:  LLSLAVNQDWSLHQLDVKKTFLNGDLEEEVYMTIPPGMEDKSNSNLVCKLRKSLYGLKQSPRAWFDIFTKTLIKNGYYQSQADHTLFVKSSN-NKTAILI
        LLSLAVNQDW L QLD+K  FLNGDLEEEVYM IPPG E+    N VCKL+KSLYGLKQSPRAWFD FTK ++K GY Q QADHTLFVK S+  K AILI
Subjt:  LLSLAVNQDWSLHQLDVKKTFLNGDLEEEVYMTIPPGMEDKSNSNLVCKLRKSLYGLKQSPRAWFDIFTKTLIKNGYYQSQADHTLFVKSSN-NKTAILI

Query:  VYVDDIIITRHDIKEILNLKRMLATKFEIKDLGGLRYFLGMEVARSNNGIIISQRKYILDLLKETRNLGCRPAETSMDPNLTLH-QSEVIPVDKGMYQRL
        VYVDDII++ +D+ E+ NLK+ L+ +FE+KDLG L+YFLGMEVARS  GI++SQRKYILDLLKET  LGC+P +T MD    L  + E  PVD+G YQRL
Subjt:  VYVDDIIITRHDIKEILNLKRMLATKFEIKDLGGLRYFLGMEVARSNNGIIISQRKYILDLLKETRNLGCRPAETSMDPNLTLH-QSEVIPVDKGMYQRL

TrEMBL top hitse value%identityAlignment
A0A438D2M3 Retrovirus-related Pol polyprotein from transposon RE13.5e-10856.61Show/hide
Query:  QPPEPNSSSLN----STDENITFMAPVVDDLTLPIAQRKAGQYSHSKTIHDALKDPKQKKAVDEEIRALESNGTWTLTQPPYGKNLIGCKWIFTIKYKSD
        QP  P  ++ N      D +   + P +DD TLPIA RK G    S TI +ALK  + KKAV +EI ALE NGTWT+T  P GK  +GCKWIFTIKYK+D
Subjt:  QPPEPNSSSLN----STDENITFMAPVVDDLTLPIAQRKAGQYSHSKTIHDALKDPKQKKAVDEEIRALESNGTWTLTQPPYGKNLIGCKWIFTIKYKSD

Query:  GSVERFKVGLVAKGFTQSYGVDVGHAFVCKIKEIELLLIQIVSTRVFYTPSIDYQEIFALVAKLNTVHVLLSLAVNQDWSLHQLDVKKTFLNGDLEEEVY
        GSVERFK  LVA+GFTQSYG                               IDYQE FA VAKLNT+ +LLSLAVNQDW L QLD+K  FLNGDLEEEVY
Subjt:  GSVERFKVGLVAKGFTQSYGVDVGHAFVCKIKEIELLLIQIVSTRVFYTPSIDYQEIFALVAKLNTVHVLLSLAVNQDWSLHQLDVKKTFLNGDLEEEVY

Query:  MTIPPGMEDKSNSNLVCKLRKSLYGLKQSPRAWFDIFTKTLIKNGYYQSQADHTLFVKSSN-NKTAILIVYVDDIIITRHDIKEILNLKRMLATKFEIKD
        M IPPG E+    N VCKL+KSLYGLKQSPRAWFD FTK ++K GY Q QADHTLFVK S+  K AILIVYVDDII++ +D+ E+ NLK+ L+ +FE+KD
Subjt:  MTIPPGMEDKSNSNLVCKLRKSLYGLKQSPRAWFDIFTKTLIKNGYYQSQADHTLFVKSSN-NKTAILIVYVDDIIITRHDIKEILNLKRMLATKFEIKD

Query:  LGGLRYFLGMEVARSNNGIIISQRKYILDLLKETRNLGCRPAETSMDPNLTLH-QSEVIPVDKGMYQRLTIVPMFGVTWSLGEVKNKLLLQEAMQKRNTE
        LG L+YFLGMEVARS  GI++SQRKYILDLLKET  LGC+P +T MD    L  + E  PVD+G YQRL  V      ++LG++   L LQ+  + R+TE
Subjt:  LGGLRYFLGMEVARSNNGIIISQRKYILDLLKETRNLGCRPAETSMDPNLTLH-QSEVIPVDKGMYQRLTIVPMFGVTWSLGEVKNKLLLQEAMQKRNTE

Query:  L
        +
Subjt:  L

A0A438FLP6 Retrovirus-related Pol polyprotein from transposon RE19.4e-10656.74Show/hide
Query:  MAPVVDDLTLPIAQRKAGQYSHSKTIHDALKDPKQKKAVDEEIRALESNGTWTLTQPPYGKNLIGCKWIFTIKYKSDGSVERFKVGLVAKGFTQSYGVDV
        + P +DD TLPIA RK        TI +ALK  + KKAV +EI ALE NGTWT+T  P GK  +GCKWIFTIKYK+DGSVERFK  LVA+GFTQSYG   
Subjt:  MAPVVDDLTLPIAQRKAGQYSHSKTIHDALKDPKQKKAVDEEIRALESNGTWTLTQPPYGKNLIGCKWIFTIKYKSDGSVERFKVGLVAKGFTQSYGVDV

Query:  GHAFVCKIKEIELLLIQIVSTRVFYTPSIDYQEIFALVAKLNTVHVLLSLAVNQDWSLHQLDVKKTFLNGDLEEEVYMTIPPGMEDKSNSNLVCKLRKSL
                                    IDYQE FA VAKLNT+ +LLSLAVNQDW L QLD+K  FLN DLEEEVYM IPPG E+    N VCKL+KSL
Subjt:  GHAFVCKIKEIELLLIQIVSTRVFYTPSIDYQEIFALVAKLNTVHVLLSLAVNQDWSLHQLDVKKTFLNGDLEEEVYMTIPPGMEDKSNSNLVCKLRKSL

Query:  YGLKQSPRAWFDIFTKTLIKNGYYQSQADHTLFVKSSN-NKTAILIVYVDDIIITRHDIKEILNLKRMLATKFEIKDLGGLRYFLGMEVARSNNGIIISQ
        YGLKQSPRAWFD FTK ++K GY Q QADHTLFVK S+  K AILIVYVDDII++ +D+ E+ NLK+ L+ +FE+KDLG L+YFLGMEVARS  GI++SQ
Subjt:  YGLKQSPRAWFDIFTKTLIKNGYYQSQADHTLFVKSSN-NKTAILIVYVDDIIITRHDIKEILNLKRMLATKFEIKDLGGLRYFLGMEVARSNNGIIISQ

Query:  RKYILDLLKETRNLGCRPAETSMDPNLTLH-QSEVIPVDKGMYQRLT--------IVPMFGVTWSLGEVKNKLLLQEAMQKRNTEL
        RKYILDLLKET  LGC+P +T MD    L  + E  PVD+G YQRL           P  G   S G    K    E  + R+TE+
Subjt:  RKYILDLLKETRNLGCRPAETSMDPNLTLH-QSEVIPVDKGMYQRLT--------IVPMFGVTWSLGEVKNKLLLQEAMQKRNTEL

A0A438HFP1 Retrovirus-related Pol polyprotein from transposon TNT 1-943.6e-10551.32Show/hide
Query:  VFFGHLQDSKMIPNLG--QCLH-CHAKAPQNRTPI-VYTRRKEATLQN---QECQPPEPNSSSLNSTDENI------TFMAPVVDDLTLPIAQRKA----
        V+    Q  +++PN     C H C  ++ Q  T + +  RRK   L++     C     ++SSL   +ENI        + P +DD TLPIA RK     
Subjt:  VFFGHLQDSKMIPNLG--QCLH-CHAKAPQNRTPI-VYTRRKEATLQN---QECQPPEPNSSSLNSTDENI------TFMAPVVDDLTLPIAQRKA----

Query:  -----GQY--------------------SHSKTIHDALKDPKQKKAVDEEIRALESNGTWTLTQPPYGKNLIGCKWIFTIKYKSDGSVERFKVGLVAKGF
             G Y                        TI +ALK  + KKAV +EI ALE NGTWT+T  P GK  +GCKWIFTIKYK+DGSVERFK  LVA+GF
Subjt:  -----GQY--------------------SHSKTIHDALKDPKQKKAVDEEIRALESNGTWTLTQPPYGKNLIGCKWIFTIKYKSDGSVERFKVGLVAKGF

Query:  TQSYGVDVGHAFVCKIKEIELLLIQIVSTRVFYTPSIDYQEIFALVAKLNTVHVLLSLAVNQDWSLHQLDVKKTFLNGDLEEEVYMTIPPGMEDKSNSNL
        TQSYG                               IDYQE FA VAKLNT+ +LLSLAVNQDW L QLD+K  FLNGDLEEEVYM IPPG E+    N 
Subjt:  TQSYGVDVGHAFVCKIKEIELLLIQIVSTRVFYTPSIDYQEIFALVAKLNTVHVLLSLAVNQDWSLHQLDVKKTFLNGDLEEEVYMTIPPGMEDKSNSNL

Query:  VCKLRKSLYGLKQSPRAWFDIFTKTLIKNGYYQSQADHTLFVKSSN-NKTAILIVYVDDIIITRHDIKEILNLKRMLATKFEIKDLGGLRYFLGMEVARS
        VCKL+KSLYGLKQSPRAWFD FTK ++K GY Q QADHTLFVK S+  K AILIVYVDDII++ +D++E+ NLK+ L+ +FE+KDLG L+YFLGMEVARS
Subjt:  VCKLRKSLYGLKQSPRAWFDIFTKTLIKNGYYQSQADHTLFVKSSN-NKTAILIVYVDDIIITRHDIKEILNLKRMLATKFEIKDLGGLRYFLGMEVARS

Query:  NNGIIISQRKYILDLLKETRNLGCRPAETSMDPNLTLH-QSEVIPVDKGMYQRL
          GI++SQRKYILDLLKET  LGC+P +T MD    L  + E  PVD+G YQRL
Subjt:  NNGIIISQRKYILDLLKETRNLGCRPAETSMDPNLTLH-QSEVIPVDKGMYQRL

A0A438HLF0 Retrovirus-related Pol polyprotein from transposon RE11.5e-10657Show/hide
Query:  APQ--NRTPIVY-TRRKEATLQN---QECQPPEPNSSSLNSTDENI------TFMAPVVDDLTLPIAQRKAGQYSHSKTIHDALKDPKQKKAVDEEIRAL
        APQ  N    VY  RRK   L++     C     ++SSL   +ENI        + P +DD TLPIA RK        TI +ALK  + KKAV +EI AL
Subjt:  APQ--NRTPIVY-TRRKEATLQN---QECQPPEPNSSSLNSTDENI------TFMAPVVDDLTLPIAQRKAGQYSHSKTIHDALKDPKQKKAVDEEIRAL

Query:  ESNGTWTLTQPPYGKNLIGCKWIFTIKYKSDGSVERFKVGLVAKGFTQSYGVDVGHAFVCKIKEIELLLIQIVSTRVFYTPSIDYQEIFALVAKLNTVHV
        E NGTWT+T  P GK  +GCKWIFTIKYK+DGSVERFK  LVA+GFTQSYG                               IDYQE FA VAKLNT+ +
Subjt:  ESNGTWTLTQPPYGKNLIGCKWIFTIKYKSDGSVERFKVGLVAKGFTQSYGVDVGHAFVCKIKEIELLLIQIVSTRVFYTPSIDYQEIFALVAKLNTVHV

Query:  LLSLAVNQDWSLHQLDVKKTFLNGDLEEEVYMTIPPGMEDKSNSNLVCKLRKSLYGLKQSPRAWFDIFTKTLIKNGYYQSQADHTLFVKSSN-NKTAILI
        LLSLAVNQDW L QLD+K  FLNGDLEEEVYM IPPG E+    N VCKL+KSLYGLKQSPRAWFD FTK ++K GY Q QADHTLFVK S+  K AILI
Subjt:  LLSLAVNQDWSLHQLDVKKTFLNGDLEEEVYMTIPPGMEDKSNSNLVCKLRKSLYGLKQSPRAWFDIFTKTLIKNGYYQSQADHTLFVKSSN-NKTAILI

Query:  VYVDDIIITRHDIKEILNLKRMLATKFEIKDLGGLRYFLGMEVARSNNGIIISQRKYILDLLKETRNLGCRPAETSMDPNLTLH-QSEVIPVDKGMYQRL
        VYVDDII++ +D+ E+ NLK+ L+ +FE+KDLG L+YFLGMEVARS  GI++SQRKYILDLLKET  LGC+P +T MD    L  + E  PVD+G YQRL
Subjt:  VYVDDIIITRHDIKEILNLKRMLATKFEIKDLGGLRYFLGMEVARSNNGIIISQRKYILDLLKETRNLGCRPAETSMDPNLTLH-QSEVIPVDKGMYQRL

W9SCZ3 Non-specific serine/threonine protein kinase1.5e-10652.04Show/hide
Query:  NLGQCLHCHAKAPQ--NRTPIVYTRRKEATLQNQECQ------------PPEPNSSSLNSTDENITFMAPVVDDLTLPIAQRKA---------GQY----
        N+   +   + APQ  N    VY RRK+        Q            PPE N      TD       P +DD TLPIA RK          G Y    
Subjt:  NLGQCLHCHAKAPQ--NRTPIVYTRRKEATLQNQECQ------------PPEPNSSSLNSTDENITFMAPVVDDLTLPIAQRKA---------GQY----

Query:  ----------------SHSKTIHDALKDPKQKKAVDEEIRALESNGTWTLTQPPYGKNLIGCKWIFTIKYKSDGSVERFKVGLVAKGFTQSYGVDVGHAF
                            TIH+AL++ + KKAV +EI ALE NGTWT+T  P GK  +GCKWIFTIKYK+DGSVERFK  LVA+GFTQSYG       
Subjt:  ----------------SHSKTIHDALKDPKQKKAVDEEIRALESNGTWTLTQPPYGKNLIGCKWIFTIKYKSDGSVERFKVGLVAKGFTQSYGVDVGHAF

Query:  VCKIKEIELLLIQIVSTRVFYTPSIDYQEIFALVAKLNTVHVLLSLAVNQDWSLHQLDVKKTFLNGDLEEEVYMTIPPGMEDKSNSNLVCKLRKSLYGLK
                                IDYQE FA VAKLNT+ +LLSLAVNQDW L QLD+K  FLNGDLEEEVYM IPPG E     N VCKLRKSLYGLK
Subjt:  VCKIKEIELLLIQIVSTRVFYTPSIDYQEIFALVAKLNTVHVLLSLAVNQDWSLHQLDVKKTFLNGDLEEEVYMTIPPGMEDKSNSNLVCKLRKSLYGLK

Query:  QSPRAWFDIFTKTLIKNGYYQSQADHTLFVKSSN-NKTAILIVYVDDIIITRHDIKEILNLKRMLATKFEIKDLGGLRYFLGMEVARSNNGIIISQRKYI
        QSPRAWFD FTK ++K GY Q Q+DHTLFVK S+  K AILIVYVDDII++ +D+KE+  LK+ L+ +FE+KDLG L+YFLGMEVARS+ GI++SQRKYI
Subjt:  QSPRAWFDIFTKTLIKNGYYQSQADHTLFVKSSN-NKTAILIVYVDDIIITRHDIKEILNLKRMLATKFEIKDLGGLRYFLGMEVARSNNGIIISQRKYI

Query:  LDLLKETRNLGCRPAETSMDPNLTL-HQSEVIPVDKGMYQRL
        LDLLKET  LGC+P +T MD    L  + E  PVD+G YQRL
Subjt:  LDLLKETRNLGCRPAETSMDPNLTL-HQSEVIPVDKGMYQRL

SwissProt top hitse value%identityAlignment
P04146 Copia protein1.3e-4833.56Show/hide
Query:  KKAVDEEIRALESNGTWTLTQPPYGKNLIGCKWIFTIKYKSDGSVERFKVGLVAKGFTQSYGVDVGHAFVCKIKEIELLLIQIVSTRVFYTPSIDYQEIF
        ++A++ E+ A + N TWT+T+ P  KN++  +W+F++KY   G+  R+K  LVA+GFTQ Y                                IDY+E F
Subjt:  KKAVDEEIRALESNGTWTLTQPPYGKNLIGCKWIFTIKYKSDGSVERFKVGLVAKGFTQSYGVDVGHAFVCKIKEIELLLIQIVSTRVFYTPSIDYQEIF

Query:  ALVAKLNTVHVLLSLAVNQDWSLHQLDVKKTFLNGDLEEEVYMTIPPGMEDKSNSNLVCKLRKSLYGLKQSPRAWFDIFTKTLIKNGYYQSQADHTLFV-
        A VA++++   +LSL +  +  +HQ+DVK  FLNG L+EE+YM +P G+    NS+ VCKL K++YGLKQ+ R WF++F + L +  +  S  D  +++ 
Subjt:  ALVAKLNTVHVLLSLAVNQDWSLHQLDVKKTFLNGDLEEEVYMTIPPGMEDKSNSNLVCKLRKSLYGLKQSPRAWFDIFTKTLIKNGYYQSQADHTLFV-

Query:  -KSSNNKTAILIVYVDDIIITRHDIKEILNLKRMLATKFEIKDLGGLRYFLGMEVARSNNGIIISQRKYILDLLKETRNLGCRPAETSM
         K + N+   +++YVDD++I   D+  + N KR L  KF + DL  +++F+G+ +    + I +SQ  Y+  +L +     C    T +
Subjt:  -KSSNNKTAILIVYVDDIIITRHDIKEILNLKRMLATKFEIKDLGGLRYFLGMEVARSNNGIIISQRKYILDLLKETRNLGCRPAETSM

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-943.3e-4735.69Show/hide
Query:  KTIHDALKDPKQK---KAVDEEIRALESNGTWTLTQPPYGKNLIGCKWIFTIKYKSDGSVERFKVGLVAKGFTQSYGVDVGHAFVCKIKEIELLLIQIVS
        +++ + L  P++    KA+ EE+ +L+ NGT+ L + P GK  + CKW+F +K   D  + R+K  LV KGF Q  G                       
Subjt:  KTIHDALKDPKQK---KAVDEEIRALESNGTWTLTQPPYGKNLIGCKWIFTIKYKSDGSVERFKVGLVAKGFTQSYGVDVGHAFVCKIKEIELLLIQIVS

Query:  TRVFYTPSIDYQEIFALVAKLNTVHVLLSLAVNQDWSLHQLDVKKTFLNGDLEEEVYMTIPPGMEDKSNSNLVCKLRKSLYGLKQSPRAWFDIFTKTLIK
                ID+ EIF+ V K+ ++  +LSLA + D  + QLDVK  FL+GDLEEE+YM  P G E     ++VCKL KSLYGLKQ+PR W+  F   +  
Subjt:  TRVFYTPSIDYQEIFALVAKLNTVHVLLSLAVNQDWSLHQLDVKKTFLNGDLEEEVYMTIPPGMEDKSNSNLVCKLRKSLYGLKQSPRAWFDIFTKTLIK

Query:  NGYYQSQADHTLFVKS-SNNKTAILIVYVDDIIITRHDIKEILNLKRMLATKFEIKDLGGLRYFLGMEVA--RSNNGIIISQRKYILDLLKETRNLGCRP
          Y ++ +D  ++ K  S N   IL++YVDD++I   D   I  LK  L+  F++KDLG  +  LGM++   R++  + +SQ KYI  +L+       +P
Subjt:  NGYYQSQADHTLFVKS-SNNKTAILIVYVDDIIITRHDIKEILNLKRMLATKFEIKDLGGLRYFLGMEVA--RSNNGIIISQRKYILDLLKETRNLGCRP

Query:  AETSMDPNLTL
          T +  +L L
Subjt:  AETSMDPNLTL

P25600 Putative transposon Ty5-1 protein YCL074W6.4e-1933.11Show/hide
Query:  LDVKKTFLNGDLEEEVYMTIPPGMEDKSNSNLVCKLRKSLYGLKQSPRAWFDIFTKTLIKNGYYQSQADHTLFVKSSNNKTAILIVYVDDIIITRHDIKE
        +DV   FLN  ++E +Y+  PPG  ++ N + V +L   +YGLKQ+P  W +    TL K G+ + + +H L+ +S+++    + VYVDD+++     K 
Subjt:  LDVKKTFLNGDLEEEVYMTIPPGMEDKSNSNLVCKLRKSLYGLKQSPRAWFDIFTKTLIKNGYYQSQADHTLFVKSSNNKTAILIVYVDDIIITRHDIKE

Query:  ILNLKRMLATKFEIKDLGGLRYFLGMEVARSNNG-IIISQRKYILDLLKET
           +K+ L   + +KDLG +  FLG+ + +S+NG I +S + YI     E+
Subjt:  ILNLKRMLATKFEIKDLGGLRYFLGMEVARSNNG-IIISQRKYILDLLKET

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE12.4e-5840.2Show/hide
Query:  SHSKTIHDALKDPKQKKAVDEEIRALESNGTWTLTQPPYGK-NLIGCKWIFTIKYKSDGSVERFKVGLVAKGFTQSYGVDVGHAFVCKIKEIELLLIQIV
        S  +T   ALKD + + A+  EI A   N TW L  PP     ++GC+WIFT KY SDGS+ R+K  LVAKG+ Q                         
Subjt:  SHSKTIHDALKDPKQKKAVDEEIRALESNGTWTLTQPPYGK-NLIGCKWIFTIKYKSDGSVERFKVGLVAKGFTQSYGVDVGHAFVCKIKEIELLLIQIV

Query:  STRVFYTPSIDYQEIFALVAKLNTVHVLLSLAVNQDWSLHQLDVKKTFLNGDLEEEVYMTIPPGMEDKSNSNLVCKLRKSLYGLKQSPRAWFDIFTKTLI
               P +DY E F+ V K  ++ ++L +AV++ W + QLDV   FL G L ++VYM+ PPG  DK   N VCKLRK+LYGLKQ+PRAW+      L+
Subjt:  STRVFYTPSIDYQEIFALVAKLNTVHVLLSLAVNQDWSLHQLDVKKTFLNGDLEEEVYMTIPPGMEDKSNSNLVCKLRKSLYGLKQSPRAWFDIFTKTLI

Query:  KNGYYQSQADHTLFVKSSNNKTAILIVYVDDIIITRHDIKEILNLKRMLATKFEIKDLGGLRYFLGMEVARSNNGIIISQRKYILDLLKETRNLGCRPAE
          G+  S +D +LFV         ++VYVDDI+IT +D   + N    L+ +F +KD   L YFLG+E  R   G+ +SQR+YILDLL  T  +  +P  
Subjt:  KNGYYQSQADHTLFVKSSNNKTAILIVYVDDIIITRHDIKEILNLKRMLATKFEIKDLGGLRYFLGMEVARSNNGIIISQRKYILDLLKETRNLGCRPAE

Query:  TSMDPN
        T M P+
Subjt:  TSMDPN

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE21.6e-5738.36Show/hide
Query:  SHSKTIHDALKDPKQKKAVDEEIRALESNGTWTLT-QPPYGKNLIGCKWIFTIKYKSDGSVERFKVGLVAKGFTQSYGVDVGHAFVCKIKEIELLLIQIV
        S  +T   A+KD + ++A+  EI A   N TW L   PP    ++GC+WIFT K+ SDGS+ R+K  LVAKG+ Q                         
Subjt:  SHSKTIHDALKDPKQKKAVDEEIRALESNGTWTLT-QPPYGKNLIGCKWIFTIKYKSDGSVERFKVGLVAKGFTQSYGVDVGHAFVCKIKEIELLLIQIV

Query:  STRVFYTPSIDYQEIFALVAKLNTVHVLLSLAVNQDWSLHQLDVKKTFLNGDLEEEVYMTIPPGMEDKSNSNLVCKLRKSLYGLKQSPRAWFDIFTKTLI
               P +DY E F+ V K  ++ ++L +AV++ W + QLDV   FL G L +EVYM+ PPG  DK   + VC+LRK++YGLKQ+PRAW+      L+
Subjt:  STRVFYTPSIDYQEIFALVAKLNTVHVLLSLAVNQDWSLHQLDVKKTFLNGDLEEEVYMTIPPGMEDKSNSNLVCKLRKSLYGLKQSPRAWFDIFTKTLI

Query:  KNGYYQSQADHTLFVKSSNNKTAILIVYVDDIIITRHDIKEILNLKRMLATKFEIKDLGGLRYFLGMEVARSNNGIIISQRKYILDLLKETRNLGCRPAE
          G+  S +D +LFV         ++VYVDDI+IT +D   + +    L+ +F +K+   L YFLG+E  R   G+ +SQR+Y LDLL  T  L  +P  
Subjt:  KNGYYQSQADHTLFVKSSNNKTAILIVYVDDIIITRHDIKEILNLKRMLATKFEIKDLGGLRYFLGMEVARSNNGIIISQRKYILDLLKETRNLGCRPAE

Query:  TSM--DPNLTLHQSEVIP
        T M   P LTLH    +P
Subjt:  TSM--DPNLTLHQSEVIP

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 81.6e-6539.37Show/hide
Query:  RKEATLQNQECQPPEPNSSSLNSTDENITF--MAPVVDDLTLPIAQRKAGQYSHSKTIHDALKDPKQKKAVDEEIRALESNGTWTLTQPPYGKNLIGCKW
        RK A LQ+  C      S +++   + +++  ++P+     + IA+ K        T ++A +      A+D+EI A+E+  TW +   P  K  IGCKW
Subjt:  RKEATLQNQECQPPEPNSSSLNSTDENITF--MAPVVDDLTLPIAQRKAGQYSHSKTIHDALKDPKQKKAVDEEIRALESNGTWTLTQPPYGKNLIGCKW

Query:  IFTIKYKSDGSVERFKVGLVAKGFTQSYGVDVGHAFVCKIKEIELLLIQIVSTRVFYTPSIDYQEIFALVAKLNTVHVLLSLAVNQDWSLHQLDVKKTFL
        ++ IKY SDG++ER+K  LVAKG+TQ  G                               ID+ E F+ V KL +V ++L+++   +++LHQLD+   FL
Subjt:  IFTIKYKSDGSVERFKVGLVAKGFTQSYGVDVGHAFVCKIKEIELLLIQIVSTRVFYTPSIDYQEIFALVAKLNTVHVLLSLAVNQDWSLHQLDVKKTFL

Query:  NGDLEEEVYMTIPPG----MEDKSNSNLVCKLRKSLYGLKQSPRAWFDIFTKTLIKNGYYQSQADHTLFVKSSNNKTAILIVYVDDIIITRHDIKEILNL
        NGDL+EE+YM +PPG      D    N VC L+KS+YGLKQ+ R WF  F+ TLI  G+ QS +DHT F+K +      ++VYVDDIII  ++   +  L
Subjt:  NGDLEEEVYMTIPPG----MEDKSNSNLVCKLRKSLYGLKQSPRAWFDIFTKTLIKNGYYQSQADHTLFVKSSNNKTAILIVYVDDIIITRHDIKEILNL

Query:  KRMLATKFEIKDLGGLRYFLGMEVARSNNGIIISQRKYILDLLKETRNLGCRPAETSMDPNLTLH-QSEVIPVDKGMYQRL
        K  L + F+++DLG L+YFLG+E+ARS  GI I QRKY LDLL ET  LGC+P+   MDP++T    S    VD   Y+RL
Subjt:  KRMLATKFEIKDLGGLRYFLGMEVARSNNGIIISQRKYILDLLKETRNLGCRPAETSMDPNLTLH-QSEVIPVDKGMYQRL

ATMG00810.1 DNA/RNA polymerases superfamily protein4.0e-0835.44Show/hide
Query:  LIVYVDDIIITRHDIKEILNLKRMLATKFEIKDLGGLRYFLGMEVARSNNGIIISQRKYILDLLKETRNLGCRPAETSM
        L++YVDDI++T      +  L   L++ F +KDLG + YFLG+++    +G+ +SQ KY   +L     L C+P  T +
Subjt:  LIVYVDDIIITRHDIKEILNLKRMLATKFEIKDLGGLRYFLGMEVARSNNGIIISQRKYILDLLKETRNLGCRPAETSM

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)8.9e-1649.33Show/hide
Query:  KTIHDALKDPKQKKAVDEEIRALESNGTWTLTQPPYGKNLIGCKWIFTIKYKSDGSVERFKVGLVAKGFTQSYGV
        K++  ALKDP   +A+ EE+ AL  N TW L  PP  +N++GCKW+F  K  SDG+++R K  LVAKGF Q  G+
Subjt:  KTIHDALKDPKQKKAVDEEIRALESNGTWTLTQPPYGKNLIGCKWIFTIKYKSDGSVERFKVGLVAKGFTQSYGV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGCCACGACTCGAATTTCGAGCTCACTCATCCGAGGGACCAGGATTCTAACTTGTATGACAACAGACAACATCACATTCGTGGAGTCCACAACTGTTTGGAGTCAAT
GGAGAGATGCATTGGCAGTCGTTGGCTTCTCTCCAATGCCAATGTTAGAATCTATGTGTTCTTCGTTGGAGCTTTCATCTACAAGAAAATTTTCAACATCTTCACGCTTC
ACATAATATTTGTCTTCTTTGGTCATTTGCAGGATTCCAAGATGATTCCAAATCTAGGCCAGTGCTTGCATTGTCACGCCAAAGCTCCACAAAATCGCACGCCTATTGTT
TATACAAGAAGAAAAGAGGCTACCTTGCAAAACCAAGAATGTCAACCACCTGAACCAAATTCATCATCGCTAAATTCCACTGATGAGAACATCACATTTATGGCACCTGT
GGTCGATGATCTAACCCTACCAATTGCTCAGAGAAAAGCTGGACAGTATTCACATTCCAAAACCATTCATGATGCACTCAAAGATCCAAAACAGAAAAAGGCAGTAGATG
AAGAAATCAGAGCTCTTGAAAGCAATGGAACGTGGACTCTTACTCAACCTCCTTATGGAAAGAATCTAATTGGTTGCAAATGGATTTTCACGATTAAATATAAATCTGAT
GGAAGTGTGGAACGATTTAAAGTCGGACTTGTTGCGAAGGGCTTCACTCAATCATATGGTGTAGATGTTGGGCATGCTTTTGTTTGCAAAATTAAAGAAATTGAGTTGCT
TTTGATCCAAATCGTTTCCACTAGAGTGTTCTACACTCCTTCAATAGATTACCAAGAAATTTTTGCCCTTGTTGCAAAATTAAATACTGTGCATGTTCTACTTTCTCTCG
CTGTGAACCAAGATTGGTCACTTCACCAACTTGATGTAAAAAAGACATTCTTGAATGGTGATCTCGAAGAAGAAGTTTATATGACAATTCCTCCTGGAATGGAAGACAAG
TCTAATAGTAACTTGGTGTGTAAGTTGAGAAAGTCTCTATATGGATTGAAACAATCTCCACGTGCTTGGTTTGATATATTCACTAAAACTCTTATTAAAAACGGCTATTA
TCAATCACAAGCTGATCATACCTTGTTTGTGAAATCCTCAAATAACAAAACTGCAATTTTGATTGTATATGTGGATGATATCATCATTACAAGGCATGATATAAAAGAGA
TTCTCAACCTAAAAAGGATGCTTGCAACTAAGTTTGAAATCAAAGATCTGGGAGGCTTAAGATATTTTCTAGGCATGGAAGTGGCACGATCCAATAATGGTATTATAATT
TCTCAGAGAAAGTATATCCTAGATCTATTAAAGGAGACGAGAAATCTTGGGTGTAGACCTGCAGAAACATCCATGGATCCAAATCTAACTTTACACCAAAGTGAAGTTAT
TCCAGTTGATAAAGGCATGTATCAAAGACTTACTATTGTTCCTATGTTTGGGGTAACTTGGTCACTTGGAGAAGTAAAAAACAAGTTGTTGTTGCAAGAAGCAATGCAGA
AGCGGAATACAGAGCTCTTGCTTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGGCCACGACTCGAATTTCGAGCTCACTCATCCGAGGGACCAGGATTCTAACTTGTATGACAACAGACAACATCACATTCGTGGAGTCCACAACTGTTTGGAGTCAAT
GGAGAGATGCATTGGCAGTCGTTGGCTTCTCTCCAATGCCAATGTTAGAATCTATGTGTTCTTCGTTGGAGCTTTCATCTACAAGAAAATTTTCAACATCTTCACGCTTC
ACATAATATTTGTCTTCTTTGGTCATTTGCAGGATTCCAAGATGATTCCAAATCTAGGCCAGTGCTTGCATTGTCACGCCAAAGCTCCACAAAATCGCACGCCTATTGTT
TATACAAGAAGAAAAGAGGCTACCTTGCAAAACCAAGAATGTCAACCACCTGAACCAAATTCATCATCGCTAAATTCCACTGATGAGAACATCACATTTATGGCACCTGT
GGTCGATGATCTAACCCTACCAATTGCTCAGAGAAAAGCTGGACAGTATTCACATTCCAAAACCATTCATGATGCACTCAAAGATCCAAAACAGAAAAAGGCAGTAGATG
AAGAAATCAGAGCTCTTGAAAGCAATGGAACGTGGACTCTTACTCAACCTCCTTATGGAAAGAATCTAATTGGTTGCAAATGGATTTTCACGATTAAATATAAATCTGAT
GGAAGTGTGGAACGATTTAAAGTCGGACTTGTTGCGAAGGGCTTCACTCAATCATATGGTGTAGATGTTGGGCATGCTTTTGTTTGCAAAATTAAAGAAATTGAGTTGCT
TTTGATCCAAATCGTTTCCACTAGAGTGTTCTACACTCCTTCAATAGATTACCAAGAAATTTTTGCCCTTGTTGCAAAATTAAATACTGTGCATGTTCTACTTTCTCTCG
CTGTGAACCAAGATTGGTCACTTCACCAACTTGATGTAAAAAAGACATTCTTGAATGGTGATCTCGAAGAAGAAGTTTATATGACAATTCCTCCTGGAATGGAAGACAAG
TCTAATAGTAACTTGGTGTGTAAGTTGAGAAAGTCTCTATATGGATTGAAACAATCTCCACGTGCTTGGTTTGATATATTCACTAAAACTCTTATTAAAAACGGCTATTA
TCAATCACAAGCTGATCATACCTTGTTTGTGAAATCCTCAAATAACAAAACTGCAATTTTGATTGTATATGTGGATGATATCATCATTACAAGGCATGATATAAAAGAGA
TTCTCAACCTAAAAAGGATGCTTGCAACTAAGTTTGAAATCAAAGATCTGGGAGGCTTAAGATATTTTCTAGGCATGGAAGTGGCACGATCCAATAATGGTATTATAATT
TCTCAGAGAAAGTATATCCTAGATCTATTAAAGGAGACGAGAAATCTTGGGTGTAGACCTGCAGAAACATCCATGGATCCAAATCTAACTTTACACCAAAGTGAAGTTAT
TCCAGTTGATAAAGGCATGTATCAAAGACTTACTATTGTTCCTATGTTTGGGGTAACTTGGTCACTTGGAGAAGTAAAAAACAAGTTGTTGTTGCAAGAAGCAATGCAGA
AGCGGAATACAGAGCTCTTGCTTTGA
Protein sequenceShow/hide protein sequence
MGHDSNFELTHPRDQDSNLYDNRQHHIRGVHNCLESMERCIGSRWLLSNANVRIYVFFVGAFIYKKIFNIFTLHIIFVFFGHLQDSKMIPNLGQCLHCHAKAPQNRTPIV
YTRRKEATLQNQECQPPEPNSSSLNSTDENITFMAPVVDDLTLPIAQRKAGQYSHSKTIHDALKDPKQKKAVDEEIRALESNGTWTLTQPPYGKNLIGCKWIFTIKYKSD
GSVERFKVGLVAKGFTQSYGVDVGHAFVCKIKEIELLLIQIVSTRVFYTPSIDYQEIFALVAKLNTVHVLLSLAVNQDWSLHQLDVKKTFLNGDLEEEVYMTIPPGMEDK
SNSNLVCKLRKSLYGLKQSPRAWFDIFTKTLIKNGYYQSQADHTLFVKSSNNKTAILIVYVDDIIITRHDIKEILNLKRMLATKFEIKDLGGLRYFLGMEVARSNNGIII
SQRKYILDLLKETRNLGCRPAETSMDPNLTLHQSEVIPVDKGMYQRLTIVPMFGVTWSLGEVKNKLLLQEAMQKRNTELLL