; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh18G007430 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh18G007430
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationCmo_Chr18:9131684..9133609
RNA-Seq ExpressionCmoCh18G007430
SyntenyCmoCh18G007430
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR036875 - Zinc finger, CCHC-type superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_003613757.4 uncharacterized protein LOC11413243 [Medicago truncatula]1.4e-14247.58Show/hide
Query:  MKVLNLIRDFELQKMKESESVKEYSDRLLSIANKVRLFGSVLNDSRIVEKLLVTPPEKFEATITTLENTKDLSKISLVELLNALQAQEQRRSMKQEGVIE
        M+VLNL+R+FELQKMKESE++KEYSD+LLSIANKVRL G+   DSRIVEK+LVT PE++EA++ +LENTKDL KI+L E+++ALQAQEQRR+M+QEGV+E
Subjt:  MKVLNLIRDFELQKMKESESVKEYSDRLLSIANKVRLFGSVLNDSRIVEKLLVTPPEKFEATITTLENTKDLSKISLVELLNALQAQEQRRSMKQEGVIE

Query:  DALLVKHQDSSRYKNNKNFKNQLTYGDSSANYQKAKGEGFKKSYPPCHHCEKKGHPPYKCWRRPDAFYSKCNQLGHEAVICNAKYQVKEVDAQVVDQEEE
         AL VK+QD  +    KN KN     D++ +  K K    K +YPPC HC K GHPP++CWRRP+A  SKCNQ+GHEAVIC  +++ +E DAQV DQEEE
Subjt:  DALLVKHQDSSRYKNNKNFKNQLTYGDSSANYQKAKGEGFKKSYPPCHHCEKKGHPPYKCWRRPDAFYSKCNQLGHEAVICNAKYQVKEVDAQVVDQEEE

Query:  DQ---LCMVTSSSSKESSKSWLIDSGCINHMTYDKESFEELRDTEVKRVRIGNGEHLEVKGKGTVAITSYEDTKFIPDVLFVPKIDQNLLSVEQV-----
        D+   L + T  S  +SS SWLIDSGC NHMTYDKE F+ELR ++  +VRIGNG+++ VKGKGT+AI S   TK I DVL+VP+IDQNLLSV Q+     
Subjt:  DQ---LCMVTSSSSKESSKSWLIDSGCINHMTYDKESFEELRDTEVKRVRIGNGEHLEVKGKGTVAITSYEDTKFIPDVLFVPKIDQNLLSVEQV-----

Query:  --------------------------------PL------------------------------------------------------------------
                                        PL                                                                  
Subjt:  --------------------------------PL------------------------------------------------------------------

Query:  -----------------GP------------------------------STKDLGTFI-IEARVENEGACLIQTVRSDNGKEYTSETFNRFCEEAGIEHQ
                         GP                               ++  G F+  +  VENE  C IQ +RSDNGKEYTS  FN FCEEAGI+HQ
Subjt:  -----------------GP------------------------------STKDLGTFI-IEARVENEGACLIQTVRSDNGKEYTSETFNRFCEEAGIEHQ

Query:  LTTLYTPQQNGVSVRRNRFIMEMTRCMLHEKDLPKRFWGKAANTIMCLQNRISTKVVKDPTSFEVWYGYKPSLKFVRVFGCLCFIYISQVKRDKLDKKSE
        LT  YTPQQNGVS RRNR+IMEMTRCMLHEK+LPK+FW +AANT + LQNR+ T  VK+ T FE WYGYKPSL F++VFGCLCF Y+ QVKRDKLDKK+ 
Subjt:  LTTLYTPQQNGVSVRRNRFIMEMTRCMLHEKDLPKRFWGKAANTIMCLQNRISTKVVKDPTSFEVWYGYKPSLKFVRVFGCLCFIYISQVKRDKLDKKSE

Query:  VDIFVGMLEESEGERLKESE
          IF+G    S+  ++ + E
Subjt:  VDIFVGMLEESEGERLKESE

XP_016679117.1 uncharacterized protein LOC107898077 [Gossypium hirsutum]2.0e-16057.31Show/hide
Query:  MKVLNLIRDFELQKMKESESVKEYSDRLLSIANKVRLFGSVLNDSRIVEKLLVTPPEKFEATITTLENTKDLSKISLVELLNALQAQEQRRSMKQEGVIE
        MKVLNLIRDFELQKMKESESVKEY  RLLSIANKVRL GS LNDSRIVEKLLVT  EKFEAT TTLENTKDLSKISLVELLNALQAQEQRRSM+QEGVIE
Subjt:  MKVLNLIRDFELQKMKESESVKEYSDRLLSIANKVRLFGSVLNDSRIVEKLLVTPPEKFEATITTLENTKDLSKISLVELLNALQAQEQRRSMKQEGVIE

Query:  DALLVKHQDSSRYKNNKNFKNQLTYGDSSANYQKAKGEGFKKSYPPCHHCEKKGHPPYKCWRRPDAFYSKCNQLGHEAVICNAKYQVKEVDAQVVDQEEE
         AL VKHQD     NNK                                              PDA  SKCNQLGHEAVIC  K QV+EVDAQVVDQEEE
Subjt:  DALLVKHQDSSRYKNNKNFKNQLTYGDSSANYQKAKGEGFKKSYPPCHHCEKKGHPPYKCWRRPDAFYSKCNQLGHEAVICNAKYQVKEVDAQVVDQEEE

Query:  DQLCMVTSSSSKESSKSWLIDSGCINHMTYDKESFEELRDTEVKRVRIGNGEHLEVKGKGTVAITSYEDTKFIPDVLFVPKIDQNLLSVEQV--------
        DQL ++T  S KESS+SWLIDSGC NHMTYDKE FEELR+TEVKR+RIGNGE+LEVKGKGTVAITSYE TKF+ DVLFVPKIDQNLLSV Q+        
Subjt:  DQLCMVTSSSSKESSKSWLIDSGCINHMTYDKESFEELRDTEVKRVRIGNGEHLEVKGKGTVAITSYEDTKFIPDVLFVPKIDQNLLSVEQV--------

Query:  --------------------------PLGPSTKDLGTF-----------------------------------------IIEARVENEGACLIQTVRSDN
                                     P  K+   F                                           +AR+ENE  C+IQ +RSDN
Subjt:  --------------------------PLGPSTKDLGTF-----------------------------------------IIEARVENEGACLIQTVRSDN

Query:  GKEYTSETFNRFCEEAGIEHQLTTLYTPQQNGVSVRRNRFIMEMTRCMLHEKDLPKRFWGKAANTIMCLQNRISTKVVKDPTSFEVWYGYKPSLKFVRVF
        GKEYTSETFNRFCEEAGIEHQ T  YTPQQNGVS RRN FIMEMTRCMLHEK+LPK FWG+AANT + LQNRISTK VKD T FE WYGYKPSLKF+RVF
Subjt:  GKEYTSETFNRFCEEAGIEHQLTTLYTPQQNGVSVRRNRFIMEMTRCMLHEKDLPKRFWGKAANTIMCLQNRISTKVVKDPTSFEVWYGYKPSLKFVRVF

Query:  GCLCFIYISQVKRDKLDKKSEVDIFVGMLEESE----------------------------------------------GERLKESEDEQQDALVDDAPI
        GCLCF YI QVKRDKLDKK+E  IFVG    S+                                              G  L+E EDE QD L DDAP+
Subjt:  GCLCFIYISQVKRDKLDKKSEVDIFVGMLEESE----------------------------------------------GERLKESEDEQQDALVDDAPI

Query:  RG
        RG
Subjt:  RG

XP_022927115.1 uncharacterized protein LOC111434054 [Cucurbita moschata]4.8e-17865.8Show/hide
Query:  MKVLNLIRDFELQKMKESESVKEYSDRLLSIANKVRLFGSVLNDSRIVEKLLVTPPEKFEATITTLENTKDLSKISLVELLNALQAQEQRRSMKQEGVIE
        MKVLNLIRDFELQKMKESESVKEYS+RLLSIANKVRL GSVLNDSRIVEKLLVT PEKFEATITTLENTKDLSKISL ELLNALQAQEQRRSM+QEGVIE
Subjt:  MKVLNLIRDFELQKMKESESVKEYSDRLLSIANKVRLFGSVLNDSRIVEKLLVTPPEKFEATITTLENTKDLSKISLVELLNALQAQEQRRSMKQEGVIE

Query:  DALLVKHQDSSRYKNNKNFKNQLTYGDSSANYQKAKGEGFKKSYPPCHHCEKKGHPPYKCWRRPDAFYSKCNQLGHEAVICNAKYQVKEVDAQVVDQEE-
         ALLVKHQDS RYK+NKNFKNQLTYGDS ANYQK KG GFKKSYPPC HCEKKGHPPYK WR+PDAFYSKCNQLGHEAVIC AK  VKEVDAQ++++   
Subjt:  DALLVKHQDSSRYKNNKNFKNQLTYGDSSANYQKAKGEGFKKSYPPCHHCEKKGHPPYKCWRRPDAFYSKCNQLGHEAVICNAKYQVKEVDAQVVDQEE-

Query:  ---EDQLCMVTSSSSKE-----------------------SSKSWLIDSGCINH----MTYDKESFEELR--DTEVKRVRIGNGEHLEVKGKGTVAITSY
           E++ C++  +S K+                       ++ +W    G  +H        K+  EEL   D ++   R  N      +     A    
Subjt:  ---EDQLCMVTSSSSKE-----------------------SSKSWLIDSGCINH----MTYDKESFEELR--DTEVKRVRIGNGEHLEVKGKGTVAITSY

Query:  EDTKFIPDVLF----VPKIDQNLLSVEQVP----------LGPSTKDLGTF-IIEARVENEGACLIQTVRSDNGKEYTSETFNRFCEEAGIEHQLTTLYT
        +  + +   L      P ++ NL  +  +           L   ++  G F   +A+VE E ACLIQT+RSDN KEYT ETFNRFCEE GIEHQLT  YT
Subjt:  EDTKFIPDVLF----VPKIDQNLLSVEQVP----------LGPSTKDLGTF-IIEARVENEGACLIQTVRSDNGKEYTSETFNRFCEEAGIEHQLTTLYT

Query:  PQQNGVSVRRNRFIMEMTRCMLHEKDLPKRFWGKAANTIMCLQNRISTKVVKDPTSFEVWYGYKPSLKFVRVFGCLCFIYISQVKRDKLDKKSEVDIFVG
        PQQN VS RRNRFIMEMTRCMLHEKDLPKRFWGKA NTIMCLQNRISTKVVKDPT FEVWYGYKPS KFVRVFGCLCFIYI QVKRDKLDKKSEV IFVG
Subjt:  PQQNGVSVRRNRFIMEMTRCMLHEKDLPKRFWGKAANTIMCLQNRISTKVVKDPTSFEVWYGYKPSLKFVRVFGCLCFIYISQVKRDKLDKKSEVDIFVG

Query:  MLEESEGERLKESEDEQQDALVDDAPIRGGSFSDREKQNLG
        MLE        ES+DE+QDALVDDAP+RGG+FSDREKQNLG
Subjt:  MLEESEGERLKESEDEQQDALVDDAPIRGGSFSDREKQNLG

XP_022929937.1 uncharacterized protein LOC111436393 [Cucurbita moschata]6.3e-14659.13Show/hide
Query:  MKVLNLIRDFELQKMKESESVKEYSDRLLSIANKVRLFGSVLNDSRIVEKLLVTPPEKFEATITTLENTKDLSKISLVELLNALQAQEQRRSMKQEGVIE
        MK LNLIRDFELQK+K+SESVKEYSDRLLSIANKVRL GSVLNDSRIVEKLLVT  EKFEATITTLENTKDLSKISL ELLNALQAQEQ+RSM+QEGVIE
Subjt:  MKVLNLIRDFELQKMKESESVKEYSDRLLSIANKVRLFGSVLNDSRIVEKLLVTPPEKFEATITTLENTKDLSKISLVELLNALQAQEQRRSMKQEGVIE

Query:  DALLVKHQDSSRYKNNKNFKNQLTYGDSSANYQKAKGEGFKKSYPPCHHCEKKGHPPYKCWRRPDAFYSKCNQLGHEAVICNAKYQVKEVDAQVVDQEEE
         ALLVKHQDS                                                                                          
Subjt:  DALLVKHQDSSRYKNNKNFKNQLTYGDSSANYQKAKGEGFKKSYPPCHHCEKKGHPPYKCWRRPDAFYSKCNQLGHEAVICNAKYQVKEVDAQVVDQEEE

Query:  DQLCMVTSSSSKESSKSWLIDSGCINHMTYDKESFEELRDTEVKRVRIGNGEHLEVKGKGTVAITSYEDTKF-IPDVLFVPK------------IDQNLL
                             SGC NHMTYDK+SFEELRDTEVKRVRIGNGEHLEVKGKG+VAITSYE   F +    F+P+            ID+++ 
Subjt:  DQLCMVTSSSSKESSKSWLIDSGCINHMTYDKESFEELRDTEVKRVRIGNGEHLEVKGKGTVAITSYEDTKF-IPDVLFVPK------------IDQNLL

Query:  SVEQVPLG-------------------------------PSTKDLGTF-IIEARVENEGACLIQTVRSDNGKEYTSETFNRFCEEAGIEHQLTTLYTPQQ
               G                               PS    G F   +A+VENEGACLIQTVRSDNGKEYTSETFNRFCEEAGIEH LTT YTPQQ
Subjt:  SVEQVPLG-------------------------------PSTKDLGTF-IIEARVENEGACLIQTVRSDNGKEYTSETFNRFCEEAGIEHQLTTLYTPQQ

Query:  NGVSVRRNRFIMEMTRCMLHEKDLPKRFWGKAANTIMCLQNRISTKVVKDPTSFEVWYGYKPSLKFVRVFGCLCFIYISQVKRDKLDKKSEVDIFVGMLE
        NGVSVRRNRFIMEMTRCMLHEKDLPKRFWGKAANTIMCLQNRISTKVVKDPT FEVWYGYKPSLKFVRVFGCLCFIYI QVK DKLDKKSEV IFVGMLE
Subjt:  NGVSVRRNRFIMEMTRCMLHEKDLPKRFWGKAANTIMCLQNRISTKVVKDPTSFEVWYGYKPSLKFVRVFGCLCFIYISQVKRDKLDKKSEVDIFVGMLE

Query:  ESEGERLKESEDEQQDALVDDAPIRGGSFSD
                ES+DE+QDALVDDAP+RGG+FSD
Subjt:  ESEGERLKESEDEQQDALVDDAPIRGGSFSD

XP_022959005.1 uncharacterized protein LOC111460124 [Cucurbita moschata]2.6e-20865.61Show/hide
Query:  MKVLNLIRDFELQKMKESESVKEYSDRLLSIANKVRLFGSVLNDSRIVEKLLVTPPEKFEATITTLENTKDLSKISLVELLNALQAQEQRRSMKQEGVIE
        MK LNLIRDFELQKMK+SESVKEYS+RLL+IANKVRL GS+LNDSRIVEKLLVT PEKFEATITTLENTKDLSKISL ELLNALQAQEQ+RSM+QEGVIE
Subjt:  MKVLNLIRDFELQKMKESESVKEYSDRLLSIANKVRLFGSVLNDSRIVEKLLVTPPEKFEATITTLENTKDLSKISLVELLNALQAQEQRRSMKQEGVIE

Query:  DALLVKHQDSSRYKNNKNFKNQLTYGDSSANYQKAKGEGFKKSYPPCHHCEKKGHPPYKCWRRPDAFYSKCNQLGHEAVICNAKYQVKEVDAQVVDQ-EE
         AL VKHQD+SRYKNNKNFKNQLTYGDSS NYQK KG GFKKSYP C HCEKK HPPYKCWRRPDAF SKCNQLGHEAVIC  K  VKEVDAQVVDQ EE
Subjt:  DALLVKHQDSSRYKNNKNFKNQLTYGDSSANYQKAKGEGFKKSYPPCHHCEKKGHPPYKCWRRPDAFYSKCNQLGHEAVICNAKYQVKEVDAQVVDQ-EE

Query:  EDQLCMVTSSSSKESSKSWLIDSGCINHMTYDKESFEELRDTEVKRVRIGNGEHLEVKGKGTVAITSYEDTKFIPDV--------------------LFV
        EDQLCMVTSSSSKESS+SWLIDSGC NHMTYDKESFEELRDTE+KRVRIGNGEHLEVKGK TVAITSYE+  F  ++                    L  
Subjt:  EDQLCMVTSSSSKESSKSWLIDSGCINHMTYDKESFEELRDTEVKRVRIGNGEHLEVKGKGTVAITSYEDTKFIPDV--------------------LFV

Query:  PKIDQNLLSVEQVPLG-------------------------------PS--------------TKDLGTFII-------------EARVENEGACLIQTV
          ID ++       LG                               PS              T+    F++             +A+VEN+  CLIQT+
Subjt:  PKIDQNLLSVEQVPLG-------------------------------PS--------------TKDLGTFII-------------EARVENEGACLIQTV

Query:  RSDNGKEYTSETFNRFCEEAGIEHQLTTLYTPQQNGVSVRRNRFIMEMTRCMLHEKDLPKRFWGKAANTIMCLQNRISTKVVKDPTSFEVWYGYKPSLKF
        RSDN KEYT ETFNRFCEEAGIEHQLT  YTPQQN VS RRNRFIMEMTRCMLHEKDLPKRFWGKAANT++CLQNRISTK VKD T FEVWYGYKPSLKF
Subjt:  RSDNGKEYTSETFNRFCEEAGIEHQLTTLYTPQQNGVSVRRNRFIMEMTRCMLHEKDLPKRFWGKAANTIMCLQNRISTKVVKDPTSFEVWYGYKPSLKF

Query:  VRVFGCLCFIYISQVKRDKLDKKSEVDIFVG------------------------------------------------------MLEESEGERLKESED
        VRVFGCLCF YI QVKRDKLDKKSEV IFVG                                                      MLEESE ER K+SED
Subjt:  VRVFGCLCFIYISQVKRDKLDKKSEVDIFVG------------------------------------------------------MLEESEGERLKESED

Query:  EQQDALVDDAPIRGGSFSDREKQNLGVG
        E+QDALVDDAP+RGG+F DREKQNLGVG
Subjt:  EQQDALVDDAPIRGGSFSDREKQNLGVG

TrEMBL top hitse value%identityAlignment
A0A1U8ILX5 uncharacterized protein LOC1078980779.7e-16157.31Show/hide
Query:  MKVLNLIRDFELQKMKESESVKEYSDRLLSIANKVRLFGSVLNDSRIVEKLLVTPPEKFEATITTLENTKDLSKISLVELLNALQAQEQRRSMKQEGVIE
        MKVLNLIRDFELQKMKESESVKEY  RLLSIANKVRL GS LNDSRIVEKLLVT  EKFEAT TTLENTKDLSKISLVELLNALQAQEQRRSM+QEGVIE
Subjt:  MKVLNLIRDFELQKMKESESVKEYSDRLLSIANKVRLFGSVLNDSRIVEKLLVTPPEKFEATITTLENTKDLSKISLVELLNALQAQEQRRSMKQEGVIE

Query:  DALLVKHQDSSRYKNNKNFKNQLTYGDSSANYQKAKGEGFKKSYPPCHHCEKKGHPPYKCWRRPDAFYSKCNQLGHEAVICNAKYQVKEVDAQVVDQEEE
         AL VKHQD     NNK                                              PDA  SKCNQLGHEAVIC  K QV+EVDAQVVDQEEE
Subjt:  DALLVKHQDSSRYKNNKNFKNQLTYGDSSANYQKAKGEGFKKSYPPCHHCEKKGHPPYKCWRRPDAFYSKCNQLGHEAVICNAKYQVKEVDAQVVDQEEE

Query:  DQLCMVTSSSSKESSKSWLIDSGCINHMTYDKESFEELRDTEVKRVRIGNGEHLEVKGKGTVAITSYEDTKFIPDVLFVPKIDQNLLSVEQV--------
        DQL ++T  S KESS+SWLIDSGC NHMTYDKE FEELR+TEVKR+RIGNGE+LEVKGKGTVAITSYE TKF+ DVLFVPKIDQNLLSV Q+        
Subjt:  DQLCMVTSSSSKESSKSWLIDSGCINHMTYDKESFEELRDTEVKRVRIGNGEHLEVKGKGTVAITSYEDTKFIPDVLFVPKIDQNLLSVEQV--------

Query:  --------------------------PLGPSTKDLGTF-----------------------------------------IIEARVENEGACLIQTVRSDN
                                     P  K+   F                                           +AR+ENE  C+IQ +RSDN
Subjt:  --------------------------PLGPSTKDLGTF-----------------------------------------IIEARVENEGACLIQTVRSDN

Query:  GKEYTSETFNRFCEEAGIEHQLTTLYTPQQNGVSVRRNRFIMEMTRCMLHEKDLPKRFWGKAANTIMCLQNRISTKVVKDPTSFEVWYGYKPSLKFVRVF
        GKEYTSETFNRFCEEAGIEHQ T  YTPQQNGVS RRN FIMEMTRCMLHEK+LPK FWG+AANT + LQNRISTK VKD T FE WYGYKPSLKF+RVF
Subjt:  GKEYTSETFNRFCEEAGIEHQLTTLYTPQQNGVSVRRNRFIMEMTRCMLHEKDLPKRFWGKAANTIMCLQNRISTKVVKDPTSFEVWYGYKPSLKFVRVF

Query:  GCLCFIYISQVKRDKLDKKSEVDIFVGMLEESE----------------------------------------------GERLKESEDEQQDALVDDAPI
        GCLCF YI QVKRDKLDKK+E  IFVG    S+                                              G  L+E EDE QD L DDAP+
Subjt:  GCLCFIYISQVKRDKLDKKSEVDIFVGMLEESE----------------------------------------------GERLKESEDEQQDALVDDAPI

Query:  RG
        RG
Subjt:  RG

A0A5D3DMJ1 Retrovirus-related Pol polyprotein from transposon TNT 1-942.1e-13944.26Show/hide
Query:  MKVLNLIRDFELQKMKESESVKEYSDRLLSIANKVRLFGSVLNDSRIVEKLLVTPPEKFEATITTLENTKDLSKISLVELLNALQAQEQRRSMKQEGVIE
        M+VLNLIR+FELQKMKE+ES+KEYS RLL IAN++RL GSV  DSRIVEK+LV+ PEKFEA+I+ LENTKDL++I+L E+LNALQAQEQRR+M+QEG +E
Subjt:  MKVLNLIRDFELQKMKESESVKEYSDRLLSIANKVRLFGSVLNDSRIVEKLLVTPPEKFEATITTLENTKDLSKISLVELLNALQAQEQRRSMKQEGVIE

Query:  DALLVKHQDSSRYKNNKNF--KNQLTYGDSSANYQKAKGEGFKK-SYPPCHHCEKKGHPPYKCWRRPDAFYSKCNQLGHEAVICNAKYQVKEVDAQVV--
         AL  KH ++ R    K F  KNQ++  +SS  Y KA   G KK SYPPC HC K+GHPP+KCWRRP+A  +KCNQ+GHEAVIC    Q + V+A++   
Subjt:  DALLVKHQDSSRYKNNKNF--KNQLTYGDSSANYQKAKGEGFKK-SYPPCHHCEKKGHPPYKCWRRPDAFYSKCNQLGHEAVICNAKYQVKEVDAQVV--

Query:  DQEEEDQLCMVTSSSSKESSKSWLIDSGCINHMTYDKESFEELRDTEVKRVRIGNGEHLEVKGKGTVAITSYEDTKFIPDVLFVPKIDQNLLSVEQV---
        ++EEEDQL + T     ES++SWLIDSGC NHMT+DKE F++L+ T + +VRIGNG+++ VKGKGT+AI S + TK I DVLFVP I+QNLLSV Q+   
Subjt:  DQEEEDQLCMVTSSSSKESSKSWLIDSGCINHMTYDKESFEELRDTEVKRVRIGNGEHLEVKGKGTVAITSYEDTKFIPDVLFVPKIDQNLLSVEQV---

Query:  -------------------------------PLGPSTKDLGTFII-------------------------------------------------------
                                        L P  ++   F +                                                       
Subjt:  -------------------------------PLGPSTKDLGTFII-------------------------------------------------------

Query:  ------------------------------------------------------------EARVENEGACLIQTVRSDNGKEYTSETFNRFCEEAGIEHQ
                                                                    +ARVENE  C IQ VRSDNGKEY S  F++FCE++GI+HQ
Subjt:  ------------------------------------------------------------EARVENEGACLIQTVRSDNGKEYTSETFNRFCEEAGIEHQ

Query:  LTTLYTPQQNGVSVRRNRFIMEMTRCMLHEKDLPKRFWGKAANTIMCLQNRISTKVVKDPTSFEVWYGYKPSLKFVRVFGCLCFIYISQVKRDKLDKKSE
        LT  YTPQQNGVS RRNR+IMEMTRCMLHEK LPK+FW +AANT + LQNR+ TK +K+ T FE WYGYKPSLKF++VFGCLCF ++ Q KRDKLD+++ 
Subjt:  LTTLYTPQQNGVSVRRNRFIMEMTRCMLHEKDLPKRFWGKAANTIMCLQNRISTKVVKDPTSFEVWYGYKPSLKFVRVFGCLCFIYISQVKRDKLDKKSE

Query:  VDIFVGM------------------------LEESEG-------------ERLK--------ESEDEQQDALVDDAPIRG
          +F+G                          EE E              E++K        E ED++Q+ +VDDA +RG
Subjt:  VDIFVGM------------------------LEESEG-------------ERLK--------ESEDEQQDALVDDAPIRG

A0A6J1EMZ3 uncharacterized protein LOC1114340542.3e-17865.8Show/hide
Query:  MKVLNLIRDFELQKMKESESVKEYSDRLLSIANKVRLFGSVLNDSRIVEKLLVTPPEKFEATITTLENTKDLSKISLVELLNALQAQEQRRSMKQEGVIE
        MKVLNLIRDFELQKMKESESVKEYS+RLLSIANKVRL GSVLNDSRIVEKLLVT PEKFEATITTLENTKDLSKISL ELLNALQAQEQRRSM+QEGVIE
Subjt:  MKVLNLIRDFELQKMKESESVKEYSDRLLSIANKVRLFGSVLNDSRIVEKLLVTPPEKFEATITTLENTKDLSKISLVELLNALQAQEQRRSMKQEGVIE

Query:  DALLVKHQDSSRYKNNKNFKNQLTYGDSSANYQKAKGEGFKKSYPPCHHCEKKGHPPYKCWRRPDAFYSKCNQLGHEAVICNAKYQVKEVDAQVVDQEE-
         ALLVKHQDS RYK+NKNFKNQLTYGDS ANYQK KG GFKKSYPPC HCEKKGHPPYK WR+PDAFYSKCNQLGHEAVIC AK  VKEVDAQ++++   
Subjt:  DALLVKHQDSSRYKNNKNFKNQLTYGDSSANYQKAKGEGFKKSYPPCHHCEKKGHPPYKCWRRPDAFYSKCNQLGHEAVICNAKYQVKEVDAQVVDQEE-

Query:  ---EDQLCMVTSSSSKE-----------------------SSKSWLIDSGCINH----MTYDKESFEELR--DTEVKRVRIGNGEHLEVKGKGTVAITSY
           E++ C++  +S K+                       ++ +W    G  +H        K+  EEL   D ++   R  N      +     A    
Subjt:  ---EDQLCMVTSSSSKE-----------------------SSKSWLIDSGCINH----MTYDKESFEELR--DTEVKRVRIGNGEHLEVKGKGTVAITSY

Query:  EDTKFIPDVLF----VPKIDQNLLSVEQVP----------LGPSTKDLGTF-IIEARVENEGACLIQTVRSDNGKEYTSETFNRFCEEAGIEHQLTTLYT
        +  + +   L      P ++ NL  +  +           L   ++  G F   +A+VE E ACLIQT+RSDN KEYT ETFNRFCEE GIEHQLT  YT
Subjt:  EDTKFIPDVLF----VPKIDQNLLSVEQVP----------LGPSTKDLGTF-IIEARVENEGACLIQTVRSDNGKEYTSETFNRFCEEAGIEHQLTTLYT

Query:  PQQNGVSVRRNRFIMEMTRCMLHEKDLPKRFWGKAANTIMCLQNRISTKVVKDPTSFEVWYGYKPSLKFVRVFGCLCFIYISQVKRDKLDKKSEVDIFVG
        PQQN VS RRNRFIMEMTRCMLHEKDLPKRFWGKA NTIMCLQNRISTKVVKDPT FEVWYGYKPS KFVRVFGCLCFIYI QVKRDKLDKKSEV IFVG
Subjt:  PQQNGVSVRRNRFIMEMTRCMLHEKDLPKRFWGKAANTIMCLQNRISTKVVKDPTSFEVWYGYKPSLKFVRVFGCLCFIYISQVKRDKLDKKSEVDIFVG

Query:  MLEESEGERLKESEDEQQDALVDDAPIRGGSFSDREKQNLG
        MLE        ES+DE+QDALVDDAP+RGG+FSDREKQNLG
Subjt:  MLEESEGERLKESEDEQQDALVDDAPIRGGSFSDREKQNLG

A0A6J1EP08 uncharacterized protein LOC1114363933.0e-14659.13Show/hide
Query:  MKVLNLIRDFELQKMKESESVKEYSDRLLSIANKVRLFGSVLNDSRIVEKLLVTPPEKFEATITTLENTKDLSKISLVELLNALQAQEQRRSMKQEGVIE
        MK LNLIRDFELQK+K+SESVKEYSDRLLSIANKVRL GSVLNDSRIVEKLLVT  EKFEATITTLENTKDLSKISL ELLNALQAQEQ+RSM+QEGVIE
Subjt:  MKVLNLIRDFELQKMKESESVKEYSDRLLSIANKVRLFGSVLNDSRIVEKLLVTPPEKFEATITTLENTKDLSKISLVELLNALQAQEQRRSMKQEGVIE

Query:  DALLVKHQDSSRYKNNKNFKNQLTYGDSSANYQKAKGEGFKKSYPPCHHCEKKGHPPYKCWRRPDAFYSKCNQLGHEAVICNAKYQVKEVDAQVVDQEEE
         ALLVKHQDS                                                                                          
Subjt:  DALLVKHQDSSRYKNNKNFKNQLTYGDSSANYQKAKGEGFKKSYPPCHHCEKKGHPPYKCWRRPDAFYSKCNQLGHEAVICNAKYQVKEVDAQVVDQEEE

Query:  DQLCMVTSSSSKESSKSWLIDSGCINHMTYDKESFEELRDTEVKRVRIGNGEHLEVKGKGTVAITSYEDTKF-IPDVLFVPK------------IDQNLL
                             SGC NHMTYDK+SFEELRDTEVKRVRIGNGEHLEVKGKG+VAITSYE   F +    F+P+            ID+++ 
Subjt:  DQLCMVTSSSSKESSKSWLIDSGCINHMTYDKESFEELRDTEVKRVRIGNGEHLEVKGKGTVAITSYEDTKF-IPDVLFVPK------------IDQNLL

Query:  SVEQVPLG-------------------------------PSTKDLGTF-IIEARVENEGACLIQTVRSDNGKEYTSETFNRFCEEAGIEHQLTTLYTPQQ
               G                               PS    G F   +A+VENEGACLIQTVRSDNGKEYTSETFNRFCEEAGIEH LTT YTPQQ
Subjt:  SVEQVPLG-------------------------------PSTKDLGTF-IIEARVENEGACLIQTVRSDNGKEYTSETFNRFCEEAGIEHQLTTLYTPQQ

Query:  NGVSVRRNRFIMEMTRCMLHEKDLPKRFWGKAANTIMCLQNRISTKVVKDPTSFEVWYGYKPSLKFVRVFGCLCFIYISQVKRDKLDKKSEVDIFVGMLE
        NGVSVRRNRFIMEMTRCMLHEKDLPKRFWGKAANTIMCLQNRISTKVVKDPT FEVWYGYKPSLKFVRVFGCLCFIYI QVK DKLDKKSEV IFVGMLE
Subjt:  NGVSVRRNRFIMEMTRCMLHEKDLPKRFWGKAANTIMCLQNRISTKVVKDPTSFEVWYGYKPSLKFVRVFGCLCFIYISQVKRDKLDKKSEVDIFVGMLE

Query:  ESEGERLKESEDEQQDALVDDAPIRGGSFSD
                ES+DE+QDALVDDAP+RGG+FSD
Subjt:  ESEGERLKESEDEQQDALVDDAPIRGGSFSD

A0A6J1H529 uncharacterized protein LOC1114601241.3e-20865.61Show/hide
Query:  MKVLNLIRDFELQKMKESESVKEYSDRLLSIANKVRLFGSVLNDSRIVEKLLVTPPEKFEATITTLENTKDLSKISLVELLNALQAQEQRRSMKQEGVIE
        MK LNLIRDFELQKMK+SESVKEYS+RLL+IANKVRL GS+LNDSRIVEKLLVT PEKFEATITTLENTKDLSKISL ELLNALQAQEQ+RSM+QEGVIE
Subjt:  MKVLNLIRDFELQKMKESESVKEYSDRLLSIANKVRLFGSVLNDSRIVEKLLVTPPEKFEATITTLENTKDLSKISLVELLNALQAQEQRRSMKQEGVIE

Query:  DALLVKHQDSSRYKNNKNFKNQLTYGDSSANYQKAKGEGFKKSYPPCHHCEKKGHPPYKCWRRPDAFYSKCNQLGHEAVICNAKYQVKEVDAQVVDQ-EE
         AL VKHQD+SRYKNNKNFKNQLTYGDSS NYQK KG GFKKSYP C HCEKK HPPYKCWRRPDAF SKCNQLGHEAVIC  K  VKEVDAQVVDQ EE
Subjt:  DALLVKHQDSSRYKNNKNFKNQLTYGDSSANYQKAKGEGFKKSYPPCHHCEKKGHPPYKCWRRPDAFYSKCNQLGHEAVICNAKYQVKEVDAQVVDQ-EE

Query:  EDQLCMVTSSSSKESSKSWLIDSGCINHMTYDKESFEELRDTEVKRVRIGNGEHLEVKGKGTVAITSYEDTKFIPDV--------------------LFV
        EDQLCMVTSSSSKESS+SWLIDSGC NHMTYDKESFEELRDTE+KRVRIGNGEHLEVKGK TVAITSYE+  F  ++                    L  
Subjt:  EDQLCMVTSSSSKESSKSWLIDSGCINHMTYDKESFEELRDTEVKRVRIGNGEHLEVKGKGTVAITSYEDTKFIPDV--------------------LFV

Query:  PKIDQNLLSVEQVPLG-------------------------------PS--------------TKDLGTFII-------------EARVENEGACLIQTV
          ID ++       LG                               PS              T+    F++             +A+VEN+  CLIQT+
Subjt:  PKIDQNLLSVEQVPLG-------------------------------PS--------------TKDLGTFII-------------EARVENEGACLIQTV

Query:  RSDNGKEYTSETFNRFCEEAGIEHQLTTLYTPQQNGVSVRRNRFIMEMTRCMLHEKDLPKRFWGKAANTIMCLQNRISTKVVKDPTSFEVWYGYKPSLKF
        RSDN KEYT ETFNRFCEEAGIEHQLT  YTPQQN VS RRNRFIMEMTRCMLHEKDLPKRFWGKAANT++CLQNRISTK VKD T FEVWYGYKPSLKF
Subjt:  RSDNGKEYTSETFNRFCEEAGIEHQLTTLYTPQQNGVSVRRNRFIMEMTRCMLHEKDLPKRFWGKAANTIMCLQNRISTKVVKDPTSFEVWYGYKPSLKF

Query:  VRVFGCLCFIYISQVKRDKLDKKSEVDIFVG------------------------------------------------------MLEESEGERLKESED
        VRVFGCLCF YI QVKRDKLDKKSEV IFVG                                                      MLEESE ER K+SED
Subjt:  VRVFGCLCFIYISQVKRDKLDKKSEVDIFVG------------------------------------------------------MLEESEGERLKESED

Query:  EQQDALVDDAPIRGGSFSDREKQNLGVG
        E+QDALVDDAP+RGG+F DREKQNLGVG
Subjt:  EQQDALVDDAPIRGGSFSDREKQNLGVG

SwissProt top hitse value%identityAlignment
P04146 Copia protein5.9e-2240.88Show/hide
Query:  DNGKEYTSETFNRFCEEAGIEHQLTTLYTPQQNGVSVRRNRFIMEMTRCMLHEKDLPKRFWGKAANTIMCLQNRISTKVVKD--PTSFEVWYGYKPSLKF
        DNG+EY S    +FC + GI + LT  +TPQ NGVS R  R I E  R M+    L K FWG+A  T   L NRI ++ + D   T +E+W+  KP LK 
Subjt:  DNGKEYTSETFNRFCEEAGIEHQLTTLYTPQQNGVSVRRNRFIMEMTRCMLHEKDLPKRFWGKAANTIMCLQNRISTKVVKD--PTSFEVWYGYKPSLKF

Query:  VRVFGCLCFIYISQVKRDKLDKKSEVDIFVGMLEESEGERLKESEDEQ----QDALVDD
        +RVFG   +++I   K+ K D KS   IFVG   E  G +L ++ +E+    +D +VD+
Subjt:  VRVFGCLCFIYISQVKRDKLDKKSEVDIFVGMLEESEGERLKESEDEQ----QDALVDD

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-949.8e-2536.7Show/hide
Query:  ARVENEGACLIQTVRSDNGKEYTSETFNRFCEEAGIEHQLTTLYTPQQNGVSVRRNRFIMEMTRCMLHEKDLPKRFWGKAANTIMCLQNRISTKVVKDPT
        A VE E    ++ +RSDNG EYTS  F  +C   GI H+ T   TPQ NGV+ R NR I+E  R ML    LPK FWG+A  T   L NR  +  +    
Subjt:  ARVENEGACLIQTVRSDNGKEYTSETFNRFCEEAGIEHQLTTLYTPQQNGVSVRRNRFIMEMTRCMLHEKDLPKRFWGKAANTIMCLQNRISTKVVKDPT

Query:  SFEVWYGYKPSLKFVRVFGCLCFIYISQVKRDKLDKKSEVDIFVGMLEESEGERL----KESEDEQQDALVDDAPIRGGSFSDREKQN
           VW   + S   ++VFGC  F ++ + +R KLD KS   IF+G  +E  G RL    K+     +D +  ++ +R  +    + +N
Subjt:  SFEVWYGYKPSLKFVRVFGCLCFIYISQVKRDKLDKKSEVDIFVGMLEESEGERL----KESEDEQQDALVDDAPIRGGSFSDREKQN

P92512 Uncharacterized mitochondrial protein AtMg007107.0e-0735.71Show/hide
Query:  NRFIMEMTRCMLHEKDLPKRFWGKAANTIMCLQNRISTKVVKDPTSFEVWYGYKPSLKFVRVFGCLCFIYISQVKRDKLDKKSE
        NR I+E  R ML E  LPK F   AANT + + N+  +  +      EVW+   P+  ++R FGC+ +I+  + K     KK E
Subjt:  NRFIMEMTRCMLHEKDLPKRFWGKAANTIMCLQNRISTKVVKDPTSFEVWYGYKPSLKFVRVFGCLCFIYISQVKRDKLDKKSE

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE19.8e-1731.68Show/hide
Query:  PLGPSTKDLGTFI-IEARVENEGACLIQTVRSDNGKEYTSETFNRFCEEAGIEHQLTTLYTPQQNGVSVRRNRFIMEMTRCMLHEKDLPKRFWGKAANTI
        PL   ++   TFI  +  +EN     I T  SDNG E+ +     +  + GI H  +  +TP+ NG+S R++R I+E    +L    +PK +W  A    
Subjt:  PLGPSTKDLGTFI-IEARVENEGACLIQTVRSDNGKEYTSETFNRFCEEAGIEHQLTTLYTPQQNGVSVRRNRFIMEMTRCMLHEKDLPKRFWGKAANTI

Query:  MCLQNRISTKVVKDPTSFEVWYGYKPSLKFVRVFGCLCFIYISQVKRDKLDKKSEVDIFVG
        + L NR+ T +++  + F+  +G  P+   +RVFGC C+ ++    + KLD KS   +F+G
Subjt:  MCLQNRISTKVVKDPTSFEVWYGYKPSLKFVRVFGCLCFIYISQVKRDKLDKKSEVDIFVG

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE21.0e-1832.92Show/hide
Query:  PLGPSTKDLGTFII-EARVENEGACLIQTVRSDNGKEYTSETFNRFCEEAGIEHQLTTLYTPQQNGVSVRRNRFIMEMTRCMLHEKDLPKRFWGKAANTI
        PL   ++   TFII ++ VEN     I T+ SDNG E+       +  + GI H  +  +TP+ NG+S R++R I+EM   +L    +PK +W  A +  
Subjt:  PLGPSTKDLGTFII-EARVENEGACLIQTVRSDNGKEYTSETFNRFCEEAGIEHQLTTLYTPQQNGVSVRRNRFIMEMTRCMLHEKDLPKRFWGKAANTI

Query:  MCLQNRISTKVVKDPTSFEVWYGYKPSLKFVRVFGCLCFIYISQVKRDKLDKKSEVDIFVG
        + L NR+ T +++  + F+  +G  P+ + ++VFGC C+ ++    R KL+ KS+   F+G
Subjt:  MCLQNRISTKVVKDPTSFEVWYGYKPSLKFVRVFGCLCFIYISQVKRDKLDKKSEVDIFVG

Arabidopsis top hitse value%identityAlignment
AT3G20980.1 Gag-Pol-related retrotransposon family protein2.0e-0437.5Show/hide
Query:  WLIDSGCINHMTYDKESFEELRDTEVKRVRIGNGEHLE-----VKGKGTVAITSYEDTKFIPDVLFVPKIDQNLLSVEQV
        WLI S   NHMT   + F  L  +   +V+  +G+  E     V+G G V   + E  K I +VL+VP I+ N LSV Q+
Subjt:  WLIDSGCINHMTYDKESFEELRDTEVKRVRIGNGEHLE-----VKGKGTVAITSYEDTKFIPDVLFVPKIDQNLLSVEQV

ATMG00710.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein5.0e-0835.71Show/hide
Query:  NRFIMEMTRCMLHEKDLPKRFWGKAANTIMCLQNRISTKVVKDPTSFEVWYGYKPSLKFVRVFGCLCFIYISQVKRDKLDKKSE
        NR I+E  R ML E  LPK F   AANT + + N+  +  +      EVW+   P+  ++R FGC+ +I+  + K     KK E
Subjt:  NRFIMEMTRCMLHEKDLPKRFWGKAANTIMCLQNRISTKVVKDPTSFEVWYGYKPSLKFVRVFGCLCFIYISQVKRDKLDKKSE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAAGTCCTAAATTTGATTAGGGATTTCGAGTTGCAGAAGATGAAGGAGTCGGAGTCAGTGAAAGAGTACTCTGATAGACTTCTCAGCATTGCCAACAAGGTGAGATT
GTTTGGTTCTGTTTTAAATGATTCTAGGATCGTTGAAAAACTCCTAGTCACTCCTCCAGAGAAGTTTGAAGCCACCATTACTACTCTGGAGAACACCAAAGACTTGTCAA
AGATTTCTCTTGTAGAGCTCTTGAATGCTTTACAAGCACAAGAGCAAAGGAGGTCTATGAAGCAAGAAGGGGTGATTGAAGATGCCTTACTTGTTAAGCATCAAGACAGC
AGCAGGTATAAAAACAACAAAAATTTCAAAAATCAATTGACGTATGGAGATTCATCTGCCAATTATCAGAAGGCAAAAGGAGAAGGTTTCAAAAAATCCTATCCACCTTG
CCACCATTGTGAGAAAAAAGGTCATCCACCATACAAGTGTTGGAGAAGACCTGATGCCTTCTACTCCAAATGCAATCAACTTGGACATGAAGCTGTGATCTGCAACGCCA
AATATCAGGTGAAAGAAGTAGATGCACAGGTAGTTGATCAAGAAGAAGAAGATCAATTGTGTATGGTCACTTCTTCCTCAAGCAAAGAATCAAGCAAGAGCTGGTTGATT
GACAGTGGGTGCATAAATCACATGACATATGACAAGGAGTCTTTTGAGGAATTAAGAGACACTGAAGTCAAGAGAGTGAGGATTGGCAATGGTGAACACTTGGAAGTCAA
GGGAAAAGGCACAGTAGCTATAACAAGCTATGAAGATACAAAATTTATTCCAGATGTTTTATTTGTACCTAAAATTGATCAAAATCTGTTAAGCGTTGAGCAAGTGCCAT
TGGGACCTAGCACAAAAGACTTGGGCACTTTCATTATCGAGGCCAGAGTTGAGAATGAAGGTGCATGCTTGATTCAGACCGTAAGATCAGATAATGGCAAAGAGTACACT
TCAGAAACTTTTAATAGATTTTGTGAAGAGGCTGGAATTGAACATCAATTGACAACACTATACACTCCTCAACAGAATGGCGTCAGTGTCAGAAGGAATAGATTCATAAT
GGAGATGACGAGATGCATGCTTCATGAGAAGGATCTTCCAAAACGTTTTTGGGGAAAAGCAGCAAACACTATTATGTGCTTGCAAAATCGAATTTCAACAAAAGTTGTGA
AGGACCCGACATCATTTGAAGTTTGGTATGGTTACAAACCTTCTTTAAAGTTCGTTAGAGTATTTGGGTGTCTTTGCTTCATTTACATTTCACAGGTCAAGCGTGATAAG
CTTGACAAAAAGTCAGAAGTTGACATCTTTGTTGGCATGTTAGAAGAGTCTGAAGGTGAACGACTAAAAGAGTCTGAAGATGAACAACAAGATGCTTTGGTTGATGATGC
ACCTATCAGAGGAGGATCTTTCAGTGATCGAGAAAAACAAAATCTGGGAGTTGGTTGA
mRNA sequenceShow/hide mRNA sequence
ATGAAAGTCCTAAATTTGATTAGGGATTTCGAGTTGCAGAAGATGAAGGAGTCGGAGTCAGTGAAAGAGTACTCTGATAGACTTCTCAGCATTGCCAACAAGGTGAGATT
GTTTGGTTCTGTTTTAAATGATTCTAGGATCGTTGAAAAACTCCTAGTCACTCCTCCAGAGAAGTTTGAAGCCACCATTACTACTCTGGAGAACACCAAAGACTTGTCAA
AGATTTCTCTTGTAGAGCTCTTGAATGCTTTACAAGCACAAGAGCAAAGGAGGTCTATGAAGCAAGAAGGGGTGATTGAAGATGCCTTACTTGTTAAGCATCAAGACAGC
AGCAGGTATAAAAACAACAAAAATTTCAAAAATCAATTGACGTATGGAGATTCATCTGCCAATTATCAGAAGGCAAAAGGAGAAGGTTTCAAAAAATCCTATCCACCTTG
CCACCATTGTGAGAAAAAAGGTCATCCACCATACAAGTGTTGGAGAAGACCTGATGCCTTCTACTCCAAATGCAATCAACTTGGACATGAAGCTGTGATCTGCAACGCCA
AATATCAGGTGAAAGAAGTAGATGCACAGGTAGTTGATCAAGAAGAAGAAGATCAATTGTGTATGGTCACTTCTTCCTCAAGCAAAGAATCAAGCAAGAGCTGGTTGATT
GACAGTGGGTGCATAAATCACATGACATATGACAAGGAGTCTTTTGAGGAATTAAGAGACACTGAAGTCAAGAGAGTGAGGATTGGCAATGGTGAACACTTGGAAGTCAA
GGGAAAAGGCACAGTAGCTATAACAAGCTATGAAGATACAAAATTTATTCCAGATGTTTTATTTGTACCTAAAATTGATCAAAATCTGTTAAGCGTTGAGCAAGTGCCAT
TGGGACCTAGCACAAAAGACTTGGGCACTTTCATTATCGAGGCCAGAGTTGAGAATGAAGGTGCATGCTTGATTCAGACCGTAAGATCAGATAATGGCAAAGAGTACACT
TCAGAAACTTTTAATAGATTTTGTGAAGAGGCTGGAATTGAACATCAATTGACAACACTATACACTCCTCAACAGAATGGCGTCAGTGTCAGAAGGAATAGATTCATAAT
GGAGATGACGAGATGCATGCTTCATGAGAAGGATCTTCCAAAACGTTTTTGGGGAAAAGCAGCAAACACTATTATGTGCTTGCAAAATCGAATTTCAACAAAAGTTGTGA
AGGACCCGACATCATTTGAAGTTTGGTATGGTTACAAACCTTCTTTAAAGTTCGTTAGAGTATTTGGGTGTCTTTGCTTCATTTACATTTCACAGGTCAAGCGTGATAAG
CTTGACAAAAAGTCAGAAGTTGACATCTTTGTTGGCATGTTAGAAGAGTCTGAAGGTGAACGACTAAAAGAGTCTGAAGATGAACAACAAGATGCTTTGGTTGATGATGC
ACCTATCAGAGGAGGATCTTTCAGTGATCGAGAAAAACAAAATCTGGGAGTTGGTTGA
Protein sequenceShow/hide protein sequence
MKVLNLIRDFELQKMKESESVKEYSDRLLSIANKVRLFGSVLNDSRIVEKLLVTPPEKFEATITTLENTKDLSKISLVELLNALQAQEQRRSMKQEGVIEDALLVKHQDS
SRYKNNKNFKNQLTYGDSSANYQKAKGEGFKKSYPPCHHCEKKGHPPYKCWRRPDAFYSKCNQLGHEAVICNAKYQVKEVDAQVVDQEEEDQLCMVTSSSSKESSKSWLI
DSGCINHMTYDKESFEELRDTEVKRVRIGNGEHLEVKGKGTVAITSYEDTKFIPDVLFVPKIDQNLLSVEQVPLGPSTKDLGTFIIEARVENEGACLIQTVRSDNGKEYT
SETFNRFCEEAGIEHQLTTLYTPQQNGVSVRRNRFIMEMTRCMLHEKDLPKRFWGKAANTIMCLQNRISTKVVKDPTSFEVWYGYKPSLKFVRVFGCLCFIYISQVKRDK
LDKKSEVDIFVGMLEESEGERLKESEDEQQDALVDDAPIRGGSFSDREKQNLGVG