CuGenDBv2

CmoCh18G005090 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene ID: CmoCh18G005090
Organism: Cucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
Description: Retrotransposon protein, putative, Ty1-copia subclass
Genome location: Cmo_Chr18:3653083..3654827
RNA-Seq Expression: CmoCh18G005090
Synteny: CmoCh18G005090
Gene Ontology terms: GO:0003676 - nucleic acid binding (molecular function)
InterPro domains: IPR012337 - Ribonuclease H-like superfamily
    IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
    IPR036397 - Ribonuclease H superfamily
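
The genome location in the record above uses a `seqid:start..end` notation. As a minimal sketch, the string can be split into its parts as follows. The function names `parse_location` and `span_length` are hypothetical helpers, not part of any CuGenDB tooling, and the length calculation assumes the coordinates are 1-based and inclusive, which the page itself does not state.

```python
import re


def parse_location(location):
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start, end):
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return abs(end - start) + 1


seqid, start, end = parse_location("Cmo_Chr18:3653083..3654827")
print(seqid, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption the gene spans 1745 bp on Cmo_Chr18.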


Homology
GenBank top hits (e value, % identity, alignment)
AQA29583.1 reverse transcriptase [Zea mays]; e value: 2.7e-73; % identity: 33.5
Query:  MPKRFWREAEMTVVYLLNRSPTRSLDEKTPHEAWYNKKPAIHHQRVFGCATYMKVARPHLAMLDPRGLKVVSIGYEPGARC-------------------
        MP  FW EA  T V+LLNR+PT +L+ +TP++AWY KKPA+H  +VFGC  Y+K  RPHL+ L+ RG KVV IGY+ G++                    
Subjt:  MPKRFWREAEMTVVYLLNRSPTRSLDEKTPHEAWYNKKPAIHHQRVFGCATYMKVARPHLAMLDPRGLKVVSIGYEPGARC-------------------

Query:  ---------------------------------------------------------------------------TSSMILRGASSRVARRHLQREHLLA
                                                                                   TS+     ++ +  R H++    L+
Subjt:  ---------------------------------------------------------------------------TSSMILRGASSRVARRHLQREHLLA

Query:  VERCTGG----------MEDLVGRGEPPGPAARKLEEKVAELHAISTDKLNIFAEAERKPCWL-------------------------KAIGRS--YEAQ
         +               ++++   G  PG      +++ AELH +S ++     EA+  P W+                         +AIG    Y+ +
Subjt:  VERCTGG----------MEDLVGRGEPPGPAARKLEEKVAELHAISTDKLNIFAEAERKPCWL-------------------------KAIGRS--YEAQ

Query:  GPSGDEGLR----------PQKQVVDFEEVFAPVARLKFVRLLLEITAHHSWEVHYMDVKFAFLNRELKETIYVRQLSGFLGNDNPDKVLRLHKALH---
            D  ++           Q+  VDF+EVFAPVAR + VRLLL I AH+ W VH+MDVK AFLN EL+E +YV+Q  GF+   +  KVLRLHKAL+   
Subjt:  GPSGDEGLR----------PQKQVVDFEEVFAPVARLKFVRLLLEITAHHSWEVHYMDVKFAFLNRELKETIYVRQLSGFLGNDNPDKVLRLHKALH---

Query:  ---------------------C------------EHRLIVGVYVDDLLITRGD----------------------------MKVLGDSRG------AYAK
                             C            E RLIVGVYVDDL+IT GD                            ++V   ++G      AYA 
Subjt:  ---------------------C------------EHRLIVGVYVDDLLITRGD----------------------------MKVLGDSRG------AYAK

Query:  KLLDTAGLADSNPTRTPMEARLQLRKADTTMAVDSTNYRSIVRSLRYLVNTYPDLAYFVGYVSRFMEASREEHLVAVKRILRYVAGTRGWSVKY-CVGRG
        K+L+ AGLA  N + TPME +L+L K  TT +VD T YRS++ SLRYL N+ PDLAY VGY+SRFMEA R+EHL AVKR+LRYVAGT  W + Y   G+ 
Subjt:  KLLDTAGLADSNPTRTPMEARLQLRKADTTMAVDSTNYRSIVRSLRYLVNTYPDLAYFVGYVSRFMEASREEHLVAVKRILRYVAGTRGWSVKY-CVGRG

Query:  KEKLELVGYNDSDMAGDV
        +   +L+GY+DSD+AGDV
Subjt:  KEKLELVGYNDSDMAGDV

CAD41367.2 OSJNBa0088A01.6 [Oryza sativa Japonica Group]; e value: 5.9e-73; % identity: 36.84
Query:  MPKRFWREAEMTVVYLLNRSPTRSLDEKTPHEAWYNKKPAIHHQRVFGCATYMKVARPHLAMLDPRGLKVVSIGYEPGARCTSSMILRGASSRVARRHLQ
        MP R+W EA  T V+LLNR+PT+SL   TP+EAW+ KKPA+ + R FGC  Y+K  RPHL  L  R   +V IGY  GA+            +V+R  + 
Subjt:  MPKRFWREAEMTVVYLLNRSPTRSLDEKTPHEAWYNKKPAIHHQRVFGCATYMKVARPHLAMLDPRGLKVVSIGYEPGARCTSSMILRGASSRVARRHLQ

Query:  REHL--------------------LAVERCTGG-------------------MEDLVGRGEPPGPAARKLEEKVAELHAISTDKLNIFAEAERKPCWL--
         E                        VE  T G                   M DL+G    PG A R  EE   ELH ++ ++   F EAE++  W   
Subjt:  REHL--------------------LAVERCTGG-------------------MEDLVGRGEPPGPAARKLEEKVAELHAISTDKLNIFAEAERKPCWL--

Query:  -----------------------KAIGRSY------EAQGPSGDEGLR------PQKQVVDFEEVFAPVARLKFVRLLLEITAHHSWEVHYMDVKFAFLN
                               +AIG  +      +A+G       R       Q+  +DFEEVF PVARL+ VRLLL   A   W VH+MDVK AFLN
Subjt:  -----------------------KAIGRSY------EAQGPSGDEGLR------PQKQVVDFEEVFAPVARLKFVRLLLEITAHHSWEVHYMDVKFAFLN

Query:  RELKETIYVRQLSGFLGNDNPDKVLRLHKALH--------------------------CEH----------RLIVGVYVDDLLIT----------RGDMK
         EL E +YV+Q  GF+      +VLRL KAL+                          CEH          RL+VGVYVDDL+IT          +G+MK
Subjt:  RELKETIYVRQLSGFLGNDNPDKVLRLHKALH--------------------------CEH----------RLIVGVYVDDLLIT----------RGDMK

Query:  VLGD------------------------SRGAYAKKLLDTAGLADSNPTRTPMEARLQLRKADTTMAVDSTNYRSIVRSLRYLVNTYPDLAYFVGYVSRF
         L                          S+ AYA+++L+ AG+   NP+ TPME RL+L K     +V+ T YR IV  LRYLV+T PD+A+ VGYVSRF
Subjt:  VLGD------------------------SRGAYAKKLLDTAGLADSNPTRTPMEARLQLRKADTTMAVDSTNYRSIVRSLRYLVNTYPDLAYFVGYVSRF

Query:  MEASREEHLVAVKRILRYVAGTRGWSVKYCVGRGKEKLELVGYNDSDMAGD
        MEA   EH  AVKRILRY+AGT  +   Y   +    ++LVGY+D+DMAGD
Subjt:  MEASREEHLVAVKRILRYVAGTRGWSVKYCVGRGKEKLELVGYNDSDMAGD

CAE03692.2 OSJNBb0026E15.10 [Oryza sativa Japonica Group]; e value: 1.1e-79; % identity: 36.68
Query:  MPKRFWREAEMTVVYLLNRSPTRSLDEKTPHEAWYNKKPAIHHQRVFGCATYMKVARPHLAMLDPRGLKVVSIGYEPGARCTSSMILRGASSRVARRHLQ
        +P RFW EA  T V+LLNRSPT+SLD +TP+EAWY + PA+H  R FGC  ++K+ +P L  LD R   +V +GYE G++  +  +    S RV   H+ 
Subjt:  MPKRFWREAEMTVVYLLNRSPTRSLDEKTPHEAWYNKKPAIHHQRVFGCATYMKVARPHLAMLDPRGLKVVSIGYEPGARCTSSMILRGASSRVARRHLQ

Query:  REHLL-----------------------------------------------------------------AVERCTGGMED-------------------
        R+ +                                                                  AVE  T   +D                   
Subjt:  REHLL-----------------------------------------------------------------AVERCTGGMED-------------------

Query:  -LVGRGEPPGPAARKLEEKVAELHAISTDKLNIFAEAERKPCWL-------------------------KAIGRSYEAQGPSGDEGL------------R
         L+G   PPG A R LE+   ELH +S D+    AEAE  P W                          +AIG  +  +    ++G              
Subjt:  -LVGRGEPPGPAARKLEEKVAELHAISTDKLNIFAEAERKPCWL-------------------------KAIGRSYEAQGPSGDEGL------------R

Query:  PQKQVVDFEEVFAPVARLKFVRLLLEITAHHSWEVHYMDVKFAFLNRELKETIYVRQLSGFLGNDNPDKVLRLHKALH----------------------
         Q+Q VDF+EVFA VARL+ VRLLL + AH  W+VH+MDVK AFLN EL E +YV Q  GF+ +++ +KV RLHKAL+                      
Subjt:  PQKQVVDFEEVFAPVARLKFVRLLLEITAHHSWEVHYMDVKFAFLNRELKETIYVRQLSGFLGNDNPDKVLRLHKALH----------------------

Query:  ----CEH----------RLIVGVYVDDLLIT----------RGDM------------------KVLGDSRG------AYAKKLLDTAGLADSNPTRTPME
             EH          RL+VGVYVDDL+IT          +G+M                  +V  DS G      AYA K+L+ AGL D NP +TPME
Subjt:  ----CEH----------RLIVGVYVDDLLIT----------RGDM------------------KVLGDSRG------AYAKKLLDTAGLADSNPTRTPME

Query:  ARLQLRKADTTMAVDSTNYRSIVRSLRYLVNTYPDLAYFVGYVSRFMEASREEHLVAVKRILRYVAGTRGWSVKYCVGRGKEKLELVGYNDSDMAGD
         RL+LRK      VD+T YRS+V SLRYLVNT PDLA+ VGYVSRFME+ RE+HL AV+RILRYVAGTR W +++  G       LVGY+DSD+AGD
Subjt:  ARLQLRKADTTMAVDSTNYRSIVRSLRYLVNTYPDLAYFVGYVSRFMEASREEHLVAVKRILRYVAGTRGWSVKYCVGRGKEKLELVGYNDSDMAGD

CAH66352.1 OSIGBa0135C09.3 [Oryza sativa]; e value: 8.3e-83; % identity: 38.38
Query:  MPKRFWREAEMTVVYLLNRSPTRSLDEKTPHEAWYNKKPAIHHQRVFGCATYMKVARPHLAMLDPRGLKVVSIGYEPGARCTSSMILRGASSRVARRHLQ
        +P RFW EA  T V+LLNRSPT+SLD +TP+EAWY ++PA+H  R FGC  ++K+ +P L  LD R   +V +GYE G++  +  +    S RV   H+ 
Subjt:  MPKRFWREAEMTVVYLLNRSPTRSLDEKTPHEAWYNKKPAIHHQRVFGCATYMKVARPHLAMLDPRGLKVVSIGYEPGARCTSSMILRGASSRVARRHLQ

Query:  REHLL-----------------------------------------------------------------AVERCTGGMED-------------------
        R+ +                                                                  AVE  T   +D                   
Subjt:  REHLL-----------------------------------------------------------------AVERCTGGMED-------------------

Query:  -LVGRGEPPGPAARKLEEKVAELHAISTDKLNIFAEAERKPCWLKAIGRSYEA----------QGPSGDE--GLR---------PQKQVVDFEEVFAPVA
         L+G   PPG A R LE+   ELH +S D+    AEAE  P W  A+     A            P G    GL+          Q+Q VDF+EVFAPVA
Subjt:  -LVGRGEPPGPAARKLEEKVAELHAISTDKLNIFAEAERKPCWLKAIGRSYEA----------QGPSGDE--GLR---------PQKQVVDFEEVFAPVA

Query:  RLKFVRLLLEITAHHSWEVHYMDVKFAFLNRELKETIYVRQLSGFLGNDNPDKVLRLHKALH--------------------------CEH---------
        RL+ VRLLL I AH  W+VH+MDVK AFLN EL E +YV Q  GF+ +++ +KV RLHKAL+                           EH         
Subjt:  RLKFVRLLLEITAHHSWEVHYMDVKFAFLNRELKETIYVRQLSGFLGNDNPDKVLRLHKALH--------------------------CEH---------

Query:  -RLIVGVYVDDLLIT----------RGDM------------------KVLGDSRG------AYAKKLLDTAGLADSNPTRTPMEARLQLRKADTTMAVDS
         RL VGVYVDDL+IT          +G+M                  +V  DS G      AYA K+L+ AGL D NP +TPME RL+LRK      VD+
Subjt:  -RLIVGVYVDDLLIT----------RGDM------------------KVLGDSRG------AYAKKLLDTAGLADSNPTRTPMEARLQLRKADTTMAVDS

Query:  TNYRSIVRSLRYLVNTYPDLAYFVGYVSRFMEASREEHLVAVKRILRYVAGTRGWSVKYCVGRGKEKLELVGYNDSDMAGD
        T YRS+V SLRYLVNT PDLA+ VGYVSRFME+ RE+HL AV+RILRYVAGTR W +++  G       LVGY+DSD+AGD
Subjt:  TNYRSIVRSLRYLVNTYPDLAYFVGYVSRFMEASREEHLVAVKRILRYVAGTRGWSVKYCVGRGKEKLELVGYNDSDMAGD

XP_023521510.1 uncharacterized protein LOC111785335 [Cucurbita pepo subsp. pepo]; e value: 4.7e-86; % identity: 39.7
Query:  LNRSPTRSLDEKTPHEAWYNKKPAIHHQRVFGCATYMKVARPHLAMLDPRGLKVVSIGYEPGAR---------------------------------CTS
        +NRSPTR L  KT +EAWYNKK A+HH RVF C  YMKV  PHLA LDPRGLKVV IGYEP ++                                  TS
Subjt:  LNRSPTRSLDEKTPHEAWYNKKPAIHHQRVFGCATYMKVARPHLAMLDPRGLKVVSIGYEPGAR---------------------------------CTS

Query:  SMI----LRGASSRVARRHLQREHLLAVERCTGGMEDLVGRGEPPGPAARKLEEKVAELHAISTDKLNIFAEAERKPCWLKA-------------IGRSY
        + +    +  A+ R A   L  +H   +      M+DLVG GEPPG A R+LEE V ELHAISTD+ N FA+AER PC LK              +  + 
Subjt:  SMI----LRGASSRVARRHLQREHLLAVERCTGGMEDLVGRGEPPGPAARKLEEKVAELHAISTDKLNIFAEAERKPCWLKA-------------IGRSY

Query:  EAQGPSG-----DEGLRPQ---------------------------------------------------------------------------------
        + Q P G     + G  P                                                                                  
Subjt:  EAQGPSG-----DEGLRPQ---------------------------------------------------------------------------------

Query:  ------------------------------------------KQVVDFEEVFAPVARLKFVRLLLEITAHHSWEVHYMDVKFAFLNRELKETIYVRQLSG
                                                  KQ VDFEEVFA V RL+FVRLLL I  H SWEVH+MDVK  FLN ELKETI V+Q  G
Subjt:  ------------------------------------------KQVVDFEEVFAPVARLKFVRLLLEITAHHSWEVHYMDVKFAFLNRELKETIYVRQLSG

Query:  FLGNDNPDKVLRLHKAL------------------------------------HCEHRLIVGVYVDDLLITRGDMKVLGD--------------------
        FL NDNP KVLRLHKAL                                    H E RLI+GVYVDDL+IT GDM+VLG                     
Subjt:  FLGNDNPDKVLRLHKAL------------------------------------HCEHRLIVGVYVDDLLITRGDMKVLGD--------------------

Query:  --------------SRGAYAKKLLDTAGLADSNPTRTPMEARLQLRKADTTMAVDSTNYRSIVRSLRYLVNTYPDLAYFVGYVSRFMEASREEHLVAVKR
                       + AYAKKLLDT GLADSNPTR PMEARLQLRKA TT  VD+TNY +IV SL YLVNT PDL Y VGYVSRFMEA R+EHLVA K 
Subjt:  --------------SRGAYAKKLLDTAGLADSNPTRTPMEARLQLRKADTTMAVDSTNYRSIVRSLRYLVNTYPDLAYFVGYVSRFMEASREEHLVAVKR

Query:  ILRYVAG
        ILRYVAG
Subjt:  ILRYVAG

TrEMBL top hits (e value, % identity, alignment)
A0A1P8YYM3 Reverse transcriptase; e value: 1.3e-73; % identity: 33.5
Query:  MPKRFWREAEMTVVYLLNRSPTRSLDEKTPHEAWYNKKPAIHHQRVFGCATYMKVARPHLAMLDPRGLKVVSIGYEPGARC-------------------
        MP  FW EA  T V+LLNR+PT +L+ +TP++AWY KKPA+H  +VFGC  Y+K  RPHL+ L+ RG KVV IGY+ G++                    
Subjt:  MPKRFWREAEMTVVYLLNRSPTRSLDEKTPHEAWYNKKPAIHHQRVFGCATYMKVARPHLAMLDPRGLKVVSIGYEPGARC-------------------

Query:  ---------------------------------------------------------------------------TSSMILRGASSRVARRHLQREHLLA
                                                                                   TS+     ++ +  R H++    L+
Subjt:  ---------------------------------------------------------------------------TSSMILRGASSRVARRHLQREHLLA

Query:  VERCTGG----------MEDLVGRGEPPGPAARKLEEKVAELHAISTDKLNIFAEAERKPCWL-------------------------KAIGRS--YEAQ
         +               ++++   G  PG      +++ AELH +S ++     EA+  P W+                         +AIG    Y+ +
Subjt:  VERCTGG----------MEDLVGRGEPPGPAARKLEEKVAELHAISTDKLNIFAEAERKPCWL-------------------------KAIGRS--YEAQ

Query:  GPSGDEGLR----------PQKQVVDFEEVFAPVARLKFVRLLLEITAHHSWEVHYMDVKFAFLNRELKETIYVRQLSGFLGNDNPDKVLRLHKALH---
            D  ++           Q+  VDF+EVFAPVAR + VRLLL I AH+ W VH+MDVK AFLN EL+E +YV+Q  GF+   +  KVLRLHKAL+   
Subjt:  GPSGDEGLR----------PQKQVVDFEEVFAPVARLKFVRLLLEITAHHSWEVHYMDVKFAFLNRELKETIYVRQLSGFLGNDNPDKVLRLHKALH---

Query:  ---------------------C------------EHRLIVGVYVDDLLITRGD----------------------------MKVLGDSRG------AYAK
                             C            E RLIVGVYVDDL+IT GD                            ++V   ++G      AYA 
Subjt:  ---------------------C------------EHRLIVGVYVDDLLITRGD----------------------------MKVLGDSRG------AYAK

Query:  KLLDTAGLADSNPTRTPMEARLQLRKADTTMAVDSTNYRSIVRSLRYLVNTYPDLAYFVGYVSRFMEASREEHLVAVKRILRYVAGTRGWSVKY-CVGRG
        K+L+ AGLA  N + TPME +L+L K  TT +VD T YRS++ SLRYL N+ PDLAY VGY+SRFMEA R+EHL AVKR+LRYVAGT  W + Y   G+ 
Subjt:  KLLDTAGLADSNPTRTPMEARLQLRKADTTMAVDSTNYRSIVRSLRYLVNTYPDLAYFVGYVSRFMEASREEHLVAVKRILRYVAGTRGWSVKY-CVGRG

Query:  KEKLELVGYNDSDMAGDV
        +   +L+GY+DSD+AGDV
Subjt:  KEKLELVGYNDSDMAGDV

A0A7I8IFL9 Hypothetical protein; e value: 7.1e-72; % identity: 34.2
Query:  MPKRFWREAEMTVVYLLNRSPTRSLDEKTPHEAWYNKKPAIHHQRVFGCATYMKVARPHLAMLDPRGLKVVSIGYEPGARCTSSMILRGASSRVARRHLQ
        +P  FW EA  T VY+LNR PT+S+D  TP E W+ KKPA+HH +VFGC  Y+    PHL  L+ RG K++ IGYE G++       R       R H+ 
Subjt:  MPKRFWREAEMTVVYLLNRSPTRSLDEKTPHEAWYNKKPAIHHQRVFGCATYMKVARPHLAMLDPRGLKVVSIGYEPGARCTSSMILRGASSRVARRHLQ

Query:  R--------------EHLLAVERCTGG----------------------------------------------------------------------MED
        R              + +  +E   G                                                                       +++
Subjt:  R--------------EHLLAVERCTGG----------------------------------------------------------------------MED

Query:  LVGRGEPPGPAARKLEEKVAELHAISTDKLNIFAEAERKPCWLK-------------------------AIGRSYEAQGPSGDEGL------------RP
        +VG   P G A+R L  K  ELHA+S+D+   F EAE  P W K                         AIG  +  +    + G               
Subjt:  LVGRGEPPGPAARKLEEKVAELHAISTDKLNIFAEAERKPCWLK-------------------------AIGRSYEAQGPSGDEGL------------RP

Query:  QKQVVDFEEVFAPVARLKFVRLLLEITAHHSWEVHYMDVKFAFLNRELKETIYVRQLSGFLGNDNPDKVLRLHKALHCEH--------------------
        Q+Q +D++EVFAPVARL  VRLL+ + AH  WEVH+MDVK AFLN +LKE +YV Q +GF+   N  KV +L KAL+  H                    
Subjt:  QKQVVDFEEVFAPVARLKFVRLLLEITAHHSWEVHYMDVKFAFLNRELKETIYVRQLSGFLGNDNPDKVLRLHKALHCEH--------------------

Query:  ----------------RLIVGVYVDDLLITRGDMKVLGDSRGAYAKKLLDTAGLADSNPTR---------------TPMEARLQLRKADTTMAVDSTNYR
                        +L+VGVYVDDL+I         D    + K++ D   ++D    R                PM  RL+L K  T   VD+T YR
Subjt:  ----------------RLIVGVYVDDLLITRGDMKVLGDSRGAYAKKLLDTAGLADSNPTR---------------TPMEARLQLRKADTTMAVDSTNYR

Query:  SIVRSLRYLVNTYPDLAYFVGYVSRFMEASREEHLVAVKRILRYVAGTRGWSVKYCVGRGKEK-LELVGYNDSDMAGDV
        SIV SLRYLVNT PDLA+ VGYVSRF+E  R++HL AVK+ILRYVAGT+ W ++Y   R KEK ++L G++DSD AGDV
Subjt:  SIVRSLRYLVNTYPDLAYFVGYVSRFMEASREEHLVAVKRILRYVAGTRGWSVKYCVGRGKEK-LELVGYNDSDMAGDV

A0B9X7 OSIGBa0135C09.3 protein; e value: 4.0e-83; % identity: 38.38
Query:  MPKRFWREAEMTVVYLLNRSPTRSLDEKTPHEAWYNKKPAIHHQRVFGCATYMKVARPHLAMLDPRGLKVVSIGYEPGARCTSSMILRGASSRVARRHLQ
        +P RFW EA  T V+LLNRSPT+SLD +TP+EAWY ++PA+H  R FGC  ++K+ +P L  LD R   +V +GYE G++  +  +    S RV   H+ 
Subjt:  MPKRFWREAEMTVVYLLNRSPTRSLDEKTPHEAWYNKKPAIHHQRVFGCATYMKVARPHLAMLDPRGLKVVSIGYEPGARCTSSMILRGASSRVARRHLQ

Query:  REHLL-----------------------------------------------------------------AVERCTGGMED-------------------
        R+ +                                                                  AVE  T   +D                   
Subjt:  REHLL-----------------------------------------------------------------AVERCTGGMED-------------------

Query:  -LVGRGEPPGPAARKLEEKVAELHAISTDKLNIFAEAERKPCWLKAIGRSYEA----------QGPSGDE--GLR---------PQKQVVDFEEVFAPVA
         L+G   PPG A R LE+   ELH +S D+    AEAE  P W  A+     A            P G    GL+          Q+Q VDF+EVFAPVA
Subjt:  -LVGRGEPPGPAARKLEEKVAELHAISTDKLNIFAEAERKPCWLKAIGRSYEA----------QGPSGDE--GLR---------PQKQVVDFEEVFAPVA

Query:  RLKFVRLLLEITAHHSWEVHYMDVKFAFLNRELKETIYVRQLSGFLGNDNPDKVLRLHKALH--------------------------CEH---------
        RL+ VRLLL I AH  W+VH+MDVK AFLN EL E +YV Q  GF+ +++ +KV RLHKAL+                           EH         
Subjt:  RLKFVRLLLEITAHHSWEVHYMDVKFAFLNRELKETIYVRQLSGFLGNDNPDKVLRLHKALH--------------------------CEH---------

Query:  -RLIVGVYVDDLLIT----------RGDM------------------KVLGDSRG------AYAKKLLDTAGLADSNPTRTPMEARLQLRKADTTMAVDS
         RL VGVYVDDL+IT          +G+M                  +V  DS G      AYA K+L+ AGL D NP +TPME RL+LRK      VD+
Subjt:  -RLIVGVYVDDLLIT----------RGDM------------------KVLGDSRG------AYAKKLLDTAGLADSNPTRTPMEARLQLRKADTTMAVDS

Query:  TNYRSIVRSLRYLVNTYPDLAYFVGYVSRFMEASREEHLVAVKRILRYVAGTRGWSVKYCVGRGKEKLELVGYNDSDMAGD
        T YRS+V SLRYLVNT PDLA+ VGYVSRFME+ RE+HL AV+RILRYVAGTR W +++  G       LVGY+DSD+AGD
Subjt:  TNYRSIVRSLRYLVNTYPDLAYFVGYVSRFMEASREEHLVAVKRILRYVAGTRGWSVKYCVGRGKEKLELVGYNDSDMAGD

C7J2T2 Os05g0126900 protein; e value: 2.1e-71; % identity: 34.62
Query:  MPKRFWREAEMTVVYLLNRSPTRSLDEKTPHEAWYNKKPAIHHQRVFGCATYMKVARPHLAMLDPRGLKVVSIGYEPGA---RCTSSMILRGASSR----
        +P  FW EA  T VYLLNRS ++S+  KTP+E W    PA+HH R FGC  ++K   P++  LD R   ++ +GYEPG+   RC      R   SR    
Subjt:  MPKRFWREAEMTVVYLLNRSPTRSLDEKTPHEAWYNKKPAIHHQRVFGCATYMKVARPHLAMLDPRGLKVVSIGYEPGA---RCTSSMILRGASSR----

Query:  --------------------------------------------------------------------------VARRHLQREHLLAVERCTGGMEDLVG
                                                                                  +A   L  +H  A  R    +++++G
Subjt:  --------------------------------------------------------------------------VARRHLQREHLLAVERCTGGMEDLVG

Query:  RGEPPGPAARKLEEKVAELHAISTDKLNIFAEAERKPCWLKAI---GRSYEAQG------------PSGDEGL----------------------RPQKQ
           PPG A R+ ++   EL   S ++    AEA+++ CW +A+    +S E  G            P G + +                        Q+Q
Subjt:  RGEPPGPAARKLEEKVAELHAISTDKLNIFAEAERKPCWLKAI---GRSYEAQG------------PSGDEGL----------------------RPQKQ

Query:  VVDFEEVFAPVARLKFVRLLLEITAHHSWEVHYMDVKFAFLNRELKETIYVRQLSGFLGNDNPDKVLRLHKALH--------------------------
         +D++EVFAPVARL+ VRLLL   A   W VH+MDVK AFLN ELKE +YV Q  GF+ +    KVLRL KAL+                          
Subjt:  VVDFEEVFAPVARLKFVRLLLEITAHHSWEVHYMDVKFAFLNRELKETIYVRQLSGFLGNDNPDKVLRLHKALH--------------------------

Query:  CEH----------RLIVGVYVDDLLITRGDMKVLGD----------------------------------SRGAYAKKLLDTAGLADSNPTRTPMEARLQ
         EH          RL+VGVYVDDL+IT G+   L                                    S+GAYAKK+L+ AG+   NP++TPME RL+
Subjt:  CEH----------RLIVGVYVDDLLITRGDMKVLGD----------------------------------SRGAYAKKLLDTAGLADSNPTRTPMEARLQ

Query:  LRKADTTMAVDSTNYRSIVRSLRYLVNTYPDLAYFVGYVSRFMEASREEHLVAVKRILRYVAGT--RGWSVKYCVGRGKEKLELVGYNDSDMAGDVVT
        L K  T+  VD+T+YR IV SLRYLVN+ PDLA+ VGYVSRFME    EHL AVKR+LRYVAGT  RG S +    R K  + LVG++DSD+AGDV T
Subjt:  LRKADTTMAVDSTNYRSIVRSLRYLVNTYPDLAYFVGYVSRFMEASREEHLVAVKRILRYVAGT--RGWSVKYCVGRGKEKLELVGYNDSDMAGDVVT

Q7XPB1 OSJNBb0026E15.10 protein; e value: 5.4e-80; % identity: 36.68
Query:  MPKRFWREAEMTVVYLLNRSPTRSLDEKTPHEAWYNKKPAIHHQRVFGCATYMKVARPHLAMLDPRGLKVVSIGYEPGARCTSSMILRGASSRVARRHLQ
        +P RFW EA  T V+LLNRSPT+SLD +TP+EAWY + PA+H  R FGC  ++K+ +P L  LD R   +V +GYE G++  +  +    S RV   H+ 
Subjt:  MPKRFWREAEMTVVYLLNRSPTRSLDEKTPHEAWYNKKPAIHHQRVFGCATYMKVARPHLAMLDPRGLKVVSIGYEPGARCTSSMILRGASSRVARRHLQ

Query:  REHLL-----------------------------------------------------------------AVERCTGGMED-------------------
        R+ +                                                                  AVE  T   +D                   
Subjt:  REHLL-----------------------------------------------------------------AVERCTGGMED-------------------

Query:  -LVGRGEPPGPAARKLEEKVAELHAISTDKLNIFAEAERKPCWL-------------------------KAIGRSYEAQGPSGDEGL------------R
         L+G   PPG A R LE+   ELH +S D+    AEAE  P W                          +AIG  +  +    ++G              
Subjt:  -LVGRGEPPGPAARKLEEKVAELHAISTDKLNIFAEAERKPCWL-------------------------KAIGRSYEAQGPSGDEGL------------R

Query:  PQKQVVDFEEVFAPVARLKFVRLLLEITAHHSWEVHYMDVKFAFLNRELKETIYVRQLSGFLGNDNPDKVLRLHKALH----------------------
         Q+Q VDF+EVFA VARL+ VRLLL + AH  W+VH+MDVK AFLN EL E +YV Q  GF+ +++ +KV RLHKAL+                      
Subjt:  PQKQVVDFEEVFAPVARLKFVRLLLEITAHHSWEVHYMDVKFAFLNRELKETIYVRQLSGFLGNDNPDKVLRLHKALH----------------------

Query:  ----CEH----------RLIVGVYVDDLLIT----------RGDM------------------KVLGDSRG------AYAKKLLDTAGLADSNPTRTPME
             EH          RL+VGVYVDDL+IT          +G+M                  +V  DS G      AYA K+L+ AGL D NP +TPME
Subjt:  ----CEH----------RLIVGVYVDDLLIT----------RGDM------------------KVLGDSRG------AYAKKLLDTAGLADSNPTRTPME

Query:  ARLQLRKADTTMAVDSTNYRSIVRSLRYLVNTYPDLAYFVGYVSRFMEASREEHLVAVKRILRYVAGTRGWSVKYCVGRGKEKLELVGYNDSDMAGD
         RL+LRK      VD+T YRS+V SLRYLVNT PDLA+ VGYVSRFME+ RE+HL AV+RILRYVAGTR W +++  G       LVGY+DSD+AGD
Subjt:  ARLQLRKADTTMAVDSTNYRSIVRSLRYLVNTYPDLAYFVGYVSRFMEASREEHLVAVKRILRYVAGTRGWSVKYCVGRGKEKLELVGYNDSDMAGD

SwissProt top hits (e value, % identity, alignment)
P04146 Copia protein; e value: 2.5e-21; % identity: 28.38
Query:  QKQVVDFEEVFAPVARLKFVRLLLEITAHHSWEVHYMDVKFAFLNRELKETIYVRQLSGFLGNDNPDKVLRLHKALH-------------------CE--
        QK  +D+EE FAPVAR+   R +L +   ++ +VH MDVK AFLN  LKE IY+R   G   + N D V +L+KA++                   CE  
Subjt:  QKQVVDFEEVFAPVARLKFVRLLLEITAHHSWEVHYMDVKFAFLNRELKETIYVRQLSGFLGNDNPDKVLRLHKALH-------------------CE--

Query:  -----------------HRLIVGVYVDDLLITRGDMKVLGD----------------------------------SRGAYAKKLLDTAGLADSNPTRTPM
                           + V +YVDD++I  GDM  + +                                  S+ AY KK+L    + + N   TP+
Subjt:  -----------------HRLIVGVYVDDLLITRGDMKVLGD----------------------------------SRGAYAKKLLDTAGLADSNPTRTPM

Query:  EARL--QLRKADTTMAVDSTNYRSIVRSLRY-LVNTYPDLAYFVGYVSRFMEASREEHLVAVKRILRYVAGTRGWSVKYCVGRGKEKLELVGYNDSDMAG
         +++  +L  +D      +T  RS++  L Y ++ T PDL   V  +SR+   +  E    +KR+LRY+ GT    + +      E  +++GY DSD AG
Subjt:  EARL--QLRKADTTMAVDSTNYRSIVRSLRY-LVNTYPDLAYFVGYVSRFMEASREEHLVAVKRILRYVAGTRGWSVKYCVGRGKEKLELVGYNDSDMAG

Query:  DVV
          +
Subjt:  DVV

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94; e value: 1.1e-24; % identity: 30.94
Query:  QKQVVDFEEVFAPVARLKFVRLLLEITAHHSWEVHYMDVKFAFLNRELKETIYVRQLSGFLGNDNPDKVLRLHKALH-----------------------
        QK+ +DF+E+F+PV ++  +R +L + A    EV  +DVK AFL+ +L+E IY+ Q  GF        V +L+K+L+                       
Subjt:  QKQVVDFEEVFAPVARLKFVRLLLEITAHHSWEVHYMDVKFAFLNRELKETIYVRQLSGFLGNDNPDKVLRLHKALH-----------------------

Query:  ------C--------EHRLIVGVYVDDLLIT----------RG------DMKVLGD--------------------SRGAYAKKLLDTAGLADSNPTRTP
              C         + +I+ +YVDD+LI           +G      DMK LG                     S+  Y +++L+   + ++ P  TP
Subjt:  ------C--------EHRLIVGVYVDDLLIT----------RG------DMKVLGD--------------------SRGAYAKKLLDTAGLADSNPTRTP

Query:  MEARLQLRKADTTMAVDSTN------YRSIVRSLRY-LVNTYPDLAYFVGYVSRFMEASREEHLVAVKRILRYVAGTRGWSVKYCVGRGKEKLELVGYND
        +   L+L K      V+         Y S V SL Y +V T PD+A+ VG VSRF+E   +EH  AVK ILRY+ GT G     C+  G     L GY D
Subjt:  MEARLQLRKADTTMAVDSTN------YRSIVRSLRY-LVNTYPDLAYFVGYVSRFMEASREEHLVAVKRILRYVAGTRGWSVKYCVGRGKEKLELVGYND

Query:  SDMAGDV
        +DMAGD+
Subjt:  SDMAGDV

P25600 Putative transposon Ty5-1 protein YCL074W; e value: 5.5e-13; % identity: 28.79
Query:  MDVKFAFLNRELKETIYVRQLSGFLGNDNPDKVLRLHKALH----------------------CEHR--------------LIVGVYVDDLLITRG----
        MDV  AFLN  + E IYV+Q  GF+   NPD V  L+  ++                      C H               + + VYVDDLL+       
Subjt:  MDVKFAFLNRELKETIYVRQLSGFLGNDNPDKVLRLHKALH----------------------CEHR--------------LIVGVYVDDLLITRG----

Query:  ------------DMKVLG------------DSRGAYAKKLLDTAGLADSNP-------TRTPMEARLQLRKADTTMAVDSTNYRSIVRSLRYLVNT-YPD
                     MK LG             S G     L D    A S         T+TP+     L +  +    D T Y+SIV  L +  NT  PD
Subjt:  ------------DMKVLG------------DSRGAYAKKLLDTAGLADSNP-------TRTPMEARLQLRKADTTMAVDSTNYRSIVRSLRYLVNT-YPD

Query:  LAYFVGYVSRFMEASREEHLVAVKRILRYVAGTRGWSVKYCVGRGKEKLELVGYNDS
        ++Y V  +SRF+   R  HL + +R+LRY+  TR   +KY   R   +L L  Y D+
Subjt:  LAYFVGYVSRFMEASREEHLVAVKRILRYVAGTRGWSVKYCVGRGKEKLELVGYNDS

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1; e value: 1.5e-15; % identity: 34.86
Query:  VYVDDLLITRGDMKVLGD----------------------------------SRGAYAKKLLDTAGLADSNPTRTPMEARLQLRKADTTMAVDSTNYRSI
        VYVDD+LIT  D  +L +                                  S+  Y   LL    +  + P  TPM    +L     T   D T YR I
Subjt:  VYVDDLLITRGDMKVLGD----------------------------------SRGAYAKKLLDTAGLADSNPTRTPMEARLQLRKADTTMAVDSTNYRSI

Query:  VRSLRYLVNTYPDLAYFVGYVSRFMEASREEHLVAVKRILRYVAGTRGWSVKYCVGRGKEKLELVGYNDSDMAGD
        V SL+YL  T PD++Y V  +S+FM    EEHL A+KRILRY+AGT    +    G     L L  Y+D+D AGD
Subjt:  VRSLRYLVNTYPDLAYFVGYVSRFMEASREEHLVAVKRILRYVAGTRGWSVKYCVGRGKEKLELVGYNDSDMAGD

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1; e value: 1.5e-05; % identity: 35.06
Query:  AEMPKRFWREAEMTVVYLLNRSPTRSLDEKTPHEAWYNKKPAIHHQRVFGCATYMKVARPHLAMLDPRGLKVVSIGY
        A +PK +W  A    VYL+NR PT  L  ++P +  +   P     RVFGCA Y  +   +   LD +  + V +GY
Subjt:  AEMPKRFWREAEMTVVYLLNRSPTRSLDEKTPHEAWYNKKPAIHHQRVFGCATYMKVARPHLAMLDPRGLKVVSIGY

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2; e value: 2.5e-13; % identity: 38.21
Query:  SRGAYAKKLLDTAGLADSNPTRTPMEARLQLRKADTTMAVDSTNYRSIVRSLRYLVNTYPDLAYFVGYVSRFMEASREEHLVAVKRILRYVAGTRGWSVK
        S+  Y   LL    +  + P  TPM    +L     T   D T YR IV SL+YL  T PDL+Y V  +S++M    ++H  A+KR+LRY+AGT    + 
Subjt:  SRGAYAKKLLDTAGLADSNPTRTPMEARLQLRKADTTMAVDSTNYRSIVRSLRYLVNTYPDLAYFVGYVSRFMEASREEHLVAVKRILRYVAGTRGWSVK

Query:  YCVGRGKEKLELVGYNDSDMAGD
           G     L L  Y+D+D AGD
Subjt:  YCVGRGKEKLELVGYNDSDMAGD

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2; e value: 2.1e-04; % identity: 31.17
Query:  AEMPKRFWREAEMTVVYLLNRSPTRSLDEKTPHEAWYNKKPAIHHQRVFGCATYMKVARPHLAMLDPRGLKVVSIGY
        A +PK +W  A    VYL+NR PT  L  ++P +  + + P     +VFGCA Y  +   +   L+ +  +   +GY
Subjt:  AEMPKRFWREAEMTVVYLLNRSPTRSLDEKTPHEAWYNKKPAIHHQRVFGCATYMKVARPHLAMLDPRGLKVVSIGY

Arabidopsis top hits (e value, % identity, alignment)
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8; e value: 2.6e-18; % identity: 27.7
Query:  QKQVVDFEEVFAPVARLKFVRLLLEITAHHSWEVHYMDVKFAFLNRELKETIYVRQLSGFLGNDN----PDKVLRLHKAL--------------------
        Q++ +DF E F+PV +L  V+L+L I+A +++ +H +D+  AFLN +L E IY++   G+         P+ V  L K++                    
Subjt:  QKQVVDFEEVFAPVARLKFVRLLLEITAHHSWEVHYMDVKFAFLNRELKETIYVRQLSGFLGNDN----PDKVLRLHKAL--------------------

Query:  ------HCEHR----------LIVGVYVDDLLITRGD----------------MKVLGD---------SRGA---------YAKKLLDTAGLADSNPTRT
              H +H           L V VYVDD++I   +                ++ LG          +R A         YA  LLD  GL    P+  
Subjt:  ------HCEHR----------LIVGVYVDDLLITRGD----------------MKVLGD---------SRGA---------YAKKLLDTAGLADSNPTRT

Query:  PMEARLQLRKADTTMAVDSTNYRSIVRSLRYLVNTYPDLAYFVGYVSRFMEASREEHLVAVKRILRYVAGTRGWSVKY
        PM+  +          VD+  YR ++  L YL  T  D+++ V  +S+F EA R  H  AV +IL Y+ GT G  + Y
Subjt:  PMEARLQLRKADTTMAVDSTNYRSIVRSLRYLVNTYPDLAYFVGYVSRFMEASREEHLVAVKRILRYVAGTRGWSVKY

ATMG00240.1 Gag-Pol-related retrotransposon family protein; e value: 7.4e-05; % identity: 39.71
Query:  YLVNTYPDLAYFVGYVSRFMEASREEHLVAVKRILRYVAGTRGWSVKYCVGRGKEKLELVGYNDSDMA
        YL  T PDL + V  +S+F  ASR   + AV ++L YV GT G  + Y        L+L  + DSD A
Subjt:  YLVNTYPDLAYFVGYVSRFMEASREEHLVAVKRILRYVAGTRGWSVKYCVGRGKEKLELVGYNDSDMA

ATMG00810.1 DNA/RNA polymerases superfamily protein; e value: 9.0e-11; % identity: 34.43
Query:  SRGAYAKKLLDTAGLADSNPTRTPMEARLQLRKADTTMAVDSTNYRSIVRSLRYLVNTYPDLAYFVGYVSRFMEASREEHLVAVKRILRYVAGTRGWSVK
        S+  YA+++L+ AG+ D  P  TP+  +L      T    D +++RSIV +L+YL  T PD++Y V  V + M          +KR+LRYV GT    + 
Subjt:  SRGAYAKKLLDTAGLADSNPTRTPMEARLQLRKADTTMAVDSTNYRSIVRSLRYLVNTYPDLAYFVGYVSRFMEASREEHLVAVKRILRYVAGTRGWSVK

Query:  YCVGRGKEKLELVGYNDSDMAG
                KL +  + DSD AG
Subjt:  YCVGRGKEKLELVGYNDSDMAG


Sequences
CDS sequence
ATGACGGCCGAAATGCCTAAGAGGTTCTGGAGAGAGGCAGAAATGACGGTCGTCTACCTTCTCAATCGATCGCCGACACGAAGCCTCGACGAGAAGACGCCACAC
GAGGCCTGGTACAACAAGAAGCCAGCGATACATCATCAACGAGTGTTCGGTTGCGCCACATACATGAAGGTTGCCCGTCCCCACCTCGCCATGCTCGATCCCAGG
GGGCTGAAGGTCGTCTCCATCGGCTACGAACCAGGAGCAAGGTGTACAAGCTCTATGATCCTGCGGGGGGCGAGCTCACGTGTCGCACGACGTCATCTTCAACGA
GAGCACCTTCTGGCTGTGGAACGATGTACCGGGGGGATGGAAGACCTAGTGGGAAGAGGTGAACCACCTGGACCGGCAGCGCGCAAACTCGAAGAAAAGGTGGCC
GAACTACATGCCATTAGCACAGATAAACTGAACATCTTCGCCGAAGCAGAAAGGAAACCGTGCTGGTTGAAGGCAATAGGGAGAAGTTATGAAGCACAAGGTCCG
TCTGGTGACGAAGGGCTACGTCCCCAGAAGCAAGTAGTGGACTTCGAAGAGGTATTCGCGCCAGTGGCAAGGTTGAAATTTGTCCGCCTATTGCTGGAGATCACG
GCACATCATTCTTGGGAGGTCCACTACATGGACGTGAAGTTCGCTTTCCTCAACAGAGAGTTGAAGGAGACCATCTATGTCCGACAACTATCGGGATTTCTAGGT
AACGACAACCCCGACAAGGTACTGCGCCTACACAAGGCACTCCACTGCGAGCATCGACTGATCGTGGGAGTGTACGTCGACGACCTCCTAATCACTAGAGGTGAC
ATGAAAGTCCTTGGAGATTCAAGAGGTGCGTACGCAAAGAAGCTGTTGGACACAGCTGGGCTTGCGGACAGTAACCCTACAAGGACGCCAATGGAGGCTCGACTT
CAGTTGAGGAAGGCCGACACTACGATGGCAGTCGACTCCACCAATTACCGCAGCATTGTCAGGAGCTTGCGCTATCTGGTAAACACATACCCTGACCTTGCTTAT
TTCGTTGGATACGTGAGTAGGTTTATGGAAGCATCTAGGGAGGAGCATCTTGTGGCTGTCAAGCGCATCCTGCGCTACGTGGCCGGAACCAGAGGCTGGAGTGTG
AAATACTGTGTAGGGAGAGGAAAGGAGAAGCTTGAGCTGGTCGGCTACAATGATAGTGACATGGCTGGTGACGTTGTGACGGCAGCAACATAA
mRNA sequence
ATGACGGCCGAAATGCCTAAGAGGTTCTGGAGAGAGGCAGAAATGACGGTCGTCTACCTTCTCAATCGATCGCCGACACGAAGCCTCGACGAGAAGACGCCACAC
GAGGCCTGGTACAACAAGAAGCCAGCGATACATCATCAACGAGTGTTCGGTTGCGCCACATACATGAAGGTTGCCCGTCCCCACCTCGCCATGCTCGATCCCAGG
GGGCTGAAGGTCGTCTCCATCGGCTACGAACCAGGAGCAAGGTGTACAAGCTCTATGATCCTGCGGGGGGCGAGCTCACGTGTCGCACGACGTCATCTTCAACGA
GAGCACCTTCTGGCTGTGGAACGATGTACCGGGGGGATGGAAGACCTAGTGGGAAGAGGTGAACCACCTGGACCGGCAGCGCGCAAACTCGAAGAAAAGGTGGCC
GAACTACATGCCATTAGCACAGATAAACTGAACATCTTCGCCGAAGCAGAAAGGAAACCGTGCTGGTTGAAGGCAATAGGGAGAAGTTATGAAGCACAAGGTCCG
TCTGGTGACGAAGGGCTACGTCCCCAGAAGCAAGTAGTGGACTTCGAAGAGGTATTCGCGCCAGTGGCAAGGTTGAAATTTGTCCGCCTATTGCTGGAGATCACG
GCACATCATTCTTGGGAGGTCCACTACATGGACGTGAAGTTCGCTTTCCTCAACAGAGAGTTGAAGGAGACCATCTATGTCCGACAACTATCGGGATTTCTAGGT
AACGACAACCCCGACAAGGTACTGCGCCTACACAAGGCACTCCACTGCGAGCATCGACTGATCGTGGGAGTGTACGTCGACGACCTCCTAATCACTAGAGGTGAC
ATGAAAGTCCTTGGAGATTCAAGAGGTGCGTACGCAAAGAAGCTGTTGGACACAGCTGGGCTTGCGGACAGTAACCCTACAAGGACGCCAATGGAGGCTCGACTT
CAGTTGAGGAAGGCCGACACTACGATGGCAGTCGACTCCACCAATTACCGCAGCATTGTCAGGAGCTTGCGCTATCTGGTAAACACATACCCTGACCTTGCTTAT
TTCGTTGGATACGTGAGTAGGTTTATGGAAGCATCTAGGGAGGAGCATCTTGTGGCTGTCAAGCGCATCCTGCGCTACGTGGCCGGAACCAGAGGCTGGAGTGTG
AAATACTGTGTAGGGAGAGGAAAGGAGAAGCTTGAGCTGGTCGGCTACAATGATAGTGACATGGCTGGTGACGTTGTGACGGCAGCAACATAA
Protein sequence
MTAEMPKRFWREAEMTVVYLLNRSPTRSLDEKTPHEAWYNKKPAIHHQRVFGCATYMKVARPHLAMLDPRGLKVVSIGYEPGARCTSSMILRGASSRVARRHLQR
EHLLAVERCTGGMEDLVGRGEPPGPAARKLEEKVAELHAISTDKLNIFAEAERKPCWLKAIGRSYEAQGPSGDEGLRPQKQVVDFEEVFAPVARLKFVRLLLEIT
AHHSWEVHYMDVKFAFLNRELKETIYVRQLSGFLGNDNPDKVLRLHKALHCEHRLIVGVYVDDLLITRGDMKVLGDSRGAYAKKLLDTAGLADSNPTRTPMEARL
QLRKADTTMAVDSTNYRSIVRSLRYLVNTYPDLAYFVGYVSRFMEASREEHLVAVKRILRYVAGTRGWSVKYCVGRGKEKLELVGYNDSDMAGDVVTAAT
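
The protein sequence above should correspond to a translation of the CDS sequence. As a minimal sketch of how that relationship could be checked, the snippet below translates a nucleotide string with the standard genetic code. The `translate` function is a hypothetical helper written for illustration, and the two test strings are the first and last 24 nucleotides of the CDS as listed on this page; the use of the standard code (rather than an organelle code) is an assumption.

```python
from itertools import product

# Standard genetic code, codons enumerated with base order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, to_stop=True):
    """Translate a CDS in frame 0; unknown codons become 'X'."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*" and to_stop:
            break  # stop at the first stop codon
        protein.append(aa)
    return "".join(protein)


# First and last 24 nt of the CDS listed on this page.
print(translate("ATGACGGCCGAAATGCCTAAGAGG"))
print(translate("GACGTTGTGACGGCAGCAACATAA"))
```

Pasting the full CDS (line breaks included) into `translate` and comparing the result with the protein sequence, with its line breaks removed, gives a quick consistency check between the two fields.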