; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh20G011160 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh20G011160
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationCmo_Chr20:10661322..10664835
RNA-Seq ExpressionCmoCh20G011160
SyntenyCmoCh20G011160
Gene Ontology termsGO:0097159 - organic cyclic compound binding (molecular function)
GO:1901363 - heterocyclic compound binding (molecular function)
InterPro domainsIPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR025724 - GAG-pre-integrase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0063435.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]2.2e-8635.05Show/hide
Query:  MVTSSSSKESSESWLIDNGYTNHITYDKESFEELRDTEDKRVRIGNGEHLEVKGKGTVAITSYEGTKFIPDVLFVPKIDQNLLSVGQLLDKGYKVLFENK
        +VT     ES+ESWLID+G TNH+T+DKE F++L+ T   +V IGNG+++ VKGKGT+AI S +GTK I DVLFVP I+QNLLSVGQL++KG+KV FEN+
Subjt:  MVTSSSSKESSESWLIDNGYTNHITYDKESFEELRDTEDKRVRIGNGEHLEVKGKGTVAITSYEGTKFIPDVLFVPKIDQNLLSVGQLLDKGYKVLFENK

Query:  QCLIKDASGKDLFNVKMEGKSFALNP-----TAFILRVSATEIWHKKLTLSSSRGNKPYVVKIEGRGTILFVSKGGEHCKLTDF------YFIPQARRTT
         CLIKDA+ +D+F VKM+GKSF+LNP     + F L+   T++WHK++     +G    +   E       +S+    CK   F       F   + R T
Subjt:  QCLIKDASGKDLFNVKMEGKSFALNP-----TAFILRVSATEIWHKKLTLSSSRGNKPYVVKIEGRGTILFVSKGGEHCKLTDF------YFIPQARRTT

Query:  NRLYILELEIDQP---------------------------VSLSAKTEEVSWRWHARYGHLNFPALEETSPPPAGAPPEPVEFATPR-TADSTLDADHDT
         +L ++  ++  P                           +   ++   V W++ AR    N   ++     P       V     R   + T    H+ 
Subjt:  NRLYILELEIDQP---------------------------VSLSAKTEEVSWRWHARYGHLNFPALEETSPPPAGAPPEPVEFATPR-TADSTLDADHDT

Query:  DLEARYRMMDDLVGGGEPPGLAARELEEVTFAEAEK-------------------NPCWRKAMQEEMTSITENQTWSLEDM-PPGHRAIGLKWVF-----
         L  ++              L  + L+E T  EA +                    P   K +        E++ W+ +D    G     +K+ F     
Subjt:  DLEARYRMMDDLVGGGEPPGLAARELEEVTFAEAEK-------------------NPCWRKAMQEEMTSITENQTWSLEDM-PPGHRAIGLKWVF-----

Query:  KLKRNEKGEVVKHKA----RLVAKGYVQ--------------KQGVDFEEAMQEEMTSITENQTWSLEDMPPGHRAIGLKWVFKLKRNEKGEVVKHKARL
        + + + + E+V   +    RL++  Y +              K+   +  AM+EE++ I +N+TW L D P   + IG+KWVF+ K N  G + KHKARL
Subjt:  KLKRNEKGEVVKHKA----RLVAKGYVQ--------------KQGVDFEEAMQEEMTSITENQTWSLEDMPPGHRAIGLKWVFKLKRNEKGEVVKHKARL

Query:  VAKGYVQKQGVDFEEVFAPAARLESVRFLLAIAAHHSWEVHHMDVKSAFLNGELKETVYVRQPPGFLDNDNPNKVLRLHKALCGLQQAPRAWNAKLDSTL
        V KGY Q  GVD+ + FAP AR++++R L AIAA   W+++ +DVKSAFLNG L+E +YV QP G       NKV  L KAL GL+QAPRAW +K+D  L
Subjt:  VAKGYVQKQGVDFEEVFAPAARLESVRFLLAIAAHHSWEVHHMDVKSAFLNGELKETVYVRQPPGFLDNDNPNKVLRLHKALCGLQQAPRAWNAKLDSTL

Query:  LSLNFKRCASEHDMYTYGHGKKRLIVGVYVDNLIITGGD
        LSL F +  +E  +Y   +G   LIV +YVD+L++TG +
Subjt:  LSLNFKRCASEHDMYTYGHGKKRLIVGVYVDNLIITGGD

RVW46097.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera]5.4e-8534.58Show/hide
Query:  TSSSSKESSESWLIDNGYTNHITYDKESFEELRDTEDKRVRIGNGEHLEVKGKGTVAITSYEGTKFIPDVLFVPKIDQNLLSVGQLLDKGYKVLFENKQC
        T  ++  +SESWL+D+G TNH+TYD+  F E+  T   +V IGNGE++ VKGKGTVAI S  G K I DVLFVP IDQNLLSVGQL++KG+KV FE+K C
Subjt:  TSSSSKESSESWLIDNGYTNHITYDKESFEELRDTEDKRVRIGNGEHLEVKGKGTVAITSYEGTKFIPDVLFVPKIDQNLLSVGQLLDKGYKVLFENKQC

Query:  LIKDASGKDLFNVKMEGKSFALN-----PTAFILRVSATEIWHKKLTLSSSRGNKPYVVK-IEGRGTILFVSKGGEHCKLTDFYFIPQARRTTNRLYILE
        +IKDA G+++FN+KM+GKSFALN       A     + T +WHK+L       +K  ++K +EGR       KG                    + + L 
Subjt:  LIKDASGKDLFNVKMEGKSFALN-----PTAFILRVSATEIWHKKLTLSSSRGNKPYVVK-IEGRGTILFVSKGGEHCKLTDFYFIPQARRTTNRLYILE

Query:  LEIDQPVSLSAKTEEVSWRWHARYGHLN---------------FPALEETSPPPAGAP-------PEP---------------VEFATPRTADSTLDADH
        +  D+ ++ +A+ E  +  WH R GH +                P LEE  P  A          P P                + + P+   S   + +
Subjt:  LEIDQPVSLSAKTEEVSWRWHARYGHLN---------------FPALEETSPPPAGAP-------PEP---------------VEFATPRTADSTLDADH

Query:  ----------------------------------DTDLEARYRMMDDLVGGGEPPGL-----AARELEEVTFAEAEKNPCWR--KAMQEEMTSITENQTW
                                          +   E R ++  D +     PG+     +  +   +   +  K    R  K ++ E  S  E    
Subjt:  ----------------------------------DTDLEARYRMMDDLVGGGEPPGL-----AARELEEVTFAEAEKNPCWR--KAMQEEMTSITENQTW

Query:  SLEDMPPGHRAIGLKWVFKLKRNEKGEVVKHKARLVAKGYVQ-KQGVDFEEAMQEEMTSITENQTWSLEDMPPGHRAIGLKWVFKLKRNEKGEVVKHKAR
         +++        G + +F + +        + A L   G+V+  +   +  AMQEE+  I +N TW L D P   + IG+KWV++ K N  G + KHKAR
Subjt:  SLEDMPPGHRAIGLKWVFKLKRNEKGEVVKHKARLVAKGYVQ-KQGVDFEEAMQEEMTSITENQTWSLEDMPPGHRAIGLKWVFKLKRNEKGEVVKHKAR

Query:  LVAKGYVQKQGVDFEEVFAPAARLESVRFLLAIAAHHSWEVHHMDVKSAFLNGELKETVYVRQPPGFLDNDNPNKVLRLHKALCGLQQAPRAWNAKLDST
        LV KGY Q  GVDF E FAP ARL+++R LLA+AA   W+++ +D+KSAFLNG L+E ++V QP GF       KV  L KAL GL+QAPRAW  ++D+ 
Subjt:  LVAKGYVQKQGVDFEEVFAPAARLESVRFLLAIAAHHSWEVHHMDVKSAFLNGELKETVYVRQPPGFLDNDNPNKVLRLHKALCGLQQAPRAWNAKLDST

Query:  LLSLNFKRCASEHDMYTYGHGKKRLIVGVYVDNLIITGGDVG
        LL+L F +  SE  +Y     +  LIV +YVD+L++TG + G
Subjt:  LLSLNFKRCASEHDMYTYGHGKKRLIVGVYVDNLIITGGDVG

RVX08961.1 Retrovirus-related Pol polyprotein from transposon RE2 [Vitis vinifera]5.4e-8536.24Show/hide
Query:  TSSSSKESSESWLIDNGYTNHITYDKESFEELRDTEDKRVRIGNGEHLEVKGKGTVAITSYEGTKFIPDVLFVPKIDQNLLSVGQLLDKGYKVLFENKQC
        T  ++  +SESWL+D+G TNH+TYD++ F E+  T   +VRIGNGE++ VKGKGTVAI S  G K I DVLFVP IDQNLLSVGQ ++KG+KV FE+K C
Subjt:  TSSSSKESSESWLIDNGYTNHITYDKESFEELRDTEDKRVRIGNGEHLEVKGKGTVAITSYEGTKFIPDVLFVPKIDQNLLSVGQLLDKGYKVLFENKQC

Query:  LIKDASGKDLFNVKMEGKSFALN-----PTAFILRVSATEIWHKKLTLSSSRGNKPYVVK----IEGRGTI---LFVSKGGEHCKLTDFYFIPQ--ARRT
        +IKDA G+++FN+KM+GKSFALN       A     + T +WHK+  L     N    +K    +EG   +   L +    ++ K T   F PQ  A ++
Subjt:  LIKDASGKDLFNVKMEGKSFALN-----PTAFILRVSATEIWHKKLTLSSSRGNKPYVVK----IEGRGTI---LFVSKGGEHCKLTDFYFIPQ--ARRT

Query:  TNRLYILELEIDQP------------VSLSAKTEEVSWRWHARYG------HLNFPALEETSPP---PAGAPPEPVEFATPRTADSTLDADHDTDLEARY
        T +L ++  ++  P            ++         W +   Y        L + A+ E                E+ + +      DA  D  L A Y
Subjt:  TNRLYILELEIDQP------------VSLSAKTEEVSWRWHARYG------HLNFPALEETSPP---PAGAPPEPVEFATPRTADSTLDADHDTDLEARY

Query:  R------MMDDLVGGGEPPGLAARELEEVTFAEAEKNPCWRKAMQEEMTSITENQTWSLEDMPPGH--------RAIGLKWVFKLKRNEKGEVVKHKARL
               +  D +     PG+        + A     P   K +        E + WS ++    +           G + +F + +        + A L
Subjt:  R------MMDDLVGGGEPPGLAARELEEVTFAEAEKNPCWRKAMQEEMTSITENQTWSLEDMPPGH--------RAIGLKWVFKLKRNEKGEVVKHKARL

Query:  VAKGYVQ-KQGVDFEEAMQEEMTSITENQTWSLEDMPPGHRAIGLKWVFKLKRNEKGEVVKHKARLVAKGYVQKQGVDFEEVFAPAARLESVRFLLAIAA
           G+V+  +   +   MQEE+  I +N TW L D P   + IG+KWV++ K N  G + KHKARLV KGY Q  GVDF E FAP ARL+++R LLA+AA
Subjt:  VAKGYVQ-KQGVDFEEAMQEEMTSITENQTWSLEDMPPGHRAIGLKWVFKLKRNEKGEVVKHKARLVAKGYVQKQGVDFEEVFAPAARLESVRFLLAIAA

Query:  HHSWEVHHMDVKSAFLNGELKETVYVRQPPGFLDNDNPNKVLRLHKALCGLQQAPRAWNAKLDSTLLSLNFKRCASEHDMYTYGHGKKRLIVGVYVDNLI
           W+++ +DVKSAFLNG L+E ++V QP GF       KV  L KAL GL+QAPRAW +++D+ LL+L F +  SE  +Y     +  LIV +YVD+L+
Subjt:  HHSWEVHHMDVKSAFLNGELKETVYVRQPPGFLDNDNPNKVLRLHKALCGLQQAPRAWNAKLDSTLLSLNFKRCASEHDMYTYGHGKKRLIVGVYVDNLI

Query:  ITGGDVG
        +TG + G
Subjt:  ITGGDVG

XP_003613757.4 uncharacterized protein LOC11413243 [Medicago truncatula]4.5e-8434.2Show/hide
Query:  MVTSSSSKESSESWLIDNGYTNHITYDKESFEELRDTEDKRVRIGNGEHLEVKGKGTVAITSYEGTKFIPDVLFVPKIDQNLLSVGQLLDKGYKVLFENK
        + T  S  +SS+SWLID+G TNH+TYDKE F+ELR ++  +VRIGNG+++ VKGKGT+AI S  GTK I DVL+VP+IDQNLLSVGQLL+KG+KV FE+K
Subjt:  MVTSSSSKESSESWLIDNGYTNHITYDKESFEELRDTEDKRVRIGNGEHLEVKGKGTVAITSYEGTKFIPDVLFVPKIDQNLLSVGQLLDKGYKVLFENK

Query:  QCLIKDASGKDLFNVKMEGKSFALNP-----TAFILRVSATEIWHKKLTLSSSRG-----NKPYVVKIEGRGTILFVSKGGEHCKLTDFYFIPQARRTTN
         CLIKDASG+++F VKM GKSF LNP     +AF ++ S TE+WHK+L     +G     +K  V  +      L   +  ++ K     F   A R T 
Subjt:  QCLIKDASGKDLFNVKMEGKSFALNP-----TAFILRVSATEIWHKKLTLSSSRG-----NKPYVVKIEGRGTILFVSKGGEHCKLTDFYFIPQARRTTN

Query:  RLYILELEIDQPVSLSAKTEEV------------SWRWHARYG------HLNFPALEE------------------------------------TSP--P
        +L ++  ++  P   S+    +             W +  ++        L F  L E                                    T+P  P
Subjt:  RLYILELEIDQPVSLSAKTEEV------------SWRWHARYG------HLNFPALEE------------------------------------TSP--P

Query:  PAGAPPE-------------------PVEFATPRTADS--------TLDADHDTDLEARYRMMDDL-------------VGGGEPPGLAARELEEVTFAE
              E                   P +F       S        T    + T  EA Y     L             V   +   L  + ++ +    
Subjt:  PAGAPPE-------------------PVEFATPRTADS--------TLDADHDTDLEARYRMMDDL-------------VGGGEPPGLAARELEEVTFAE

Query:  AEKNPCWRKAMQEEMTSI--------TENQTWSLEDMPPGHRAIG-LKWVFKLKRNE--KGEVVK----HKARLVAKGYVQ-----KQGVDFEE------
        +  +  + K  Q E  +I         EN+ W  + +   ++A   +K V +    E  K E+V        RL++  Y +      +  DF E      
Subjt:  AEKNPCWRKAMQEEMTSI--------TENQTWSLEDMPPGHRAIG-LKWVFKLKRNE--KGEVVK----HKARLVAKGYVQ-----KQGVDFEE------

Query:  ---AMQEEMTSITENQTWSLEDMPPGHRAIGLKWVFKLKRNEKGEVVKHKARLVAKGYVQKQGVDFEEVFAPAARLESVRFLLAIAAHHSWEVHHMDVKS
           AM+EE+  I +N TW L D P   + IG+KWVF+ K N  G + KHKARLV KGY Q  GVD+ + FAP ARL+++R LLA+A    W+V+ +DVKS
Subjt:  ---AMQEEMTSITENQTWSLEDMPPGHRAIGLKWVFKLKRNEKGEVVKHKARLVAKGYVQKQGVDFEEVFAPAARLESVRFLLAIAAHHSWEVHHMDVKS

Query:  AFLNGELKETVYVRQPPGFLDNDNPNKVLRLHKALCGLQQAPRAWNAKLDSTLLSLNFKRCASEHDMYTYGHGKKRLIVGVYVDNLIITGGDVGVL
         FLNG L+E +YV QP GF+     +KV  L KAL GL+QAPRAW +++D+ LLSL F +  SE  +Y        L++ +YVD+L +TG +  ++
Subjt:  AFLNGELKETVYVRQPPGFLDNDNPNKVLRLHKALCGLQQAPRAWNAKLDSTLLSLNFKRCASEHDMYTYGHGKKRLIVGVYVDNLIITGGDVGVL

XP_023521510.1 uncharacterized protein LOC111785335 [Cucurbita pepo subsp. pepo]5.7e-8752.34Show/hide
Query:  ETSPPPAGAPPEPVEFATPRTADSTLDADHDTDLEARYRMMDDLVGGGEPPGLAARELEEV------------TFAEAEKNPCWRKAMQEEMTSITEN--
        E SP  A   P+PVEFATPRTADSTLD DHD DL ARYR MDDLVGGGEPPGLA RELEEV            TFA+AE+NPC  K    ++ S   N  
Subjt:  ETSPPPAGAPPEPVEFATPRTADSTLDADHDTDLEARYRMMDDLVGGGEPPGLAARELEEV------------TFAEAEKNPCWRKAMQEEMTSITEN--

Query:  --QTWSLEDMP-------PGHRAIGLKWVFKLKRNEKGEVVKHKARLVAKGYVQKQGVDFEEAMQEEMTSITEN------QTWSLEDM-----PPGHRAI
          +T  ++  P       PG         + +    +G   +H+        V  + V+F      + T   ++      +   ++D+     PPG    
Subjt:  --QTWSLEDMP-------PGHRAIGLKWVFKLKRNEKGEVVKHKARLVAKGYVQKQGVDFEEAMQEEMTSITEN------QTWSLEDM-----PPGHRAI

Query:  GLKWVFKL------------------KRNEKGEVVKHKARLVAKGYVQKQGVDFEEVFAPAARLESVRFLLAIAAHHSWEVHHMDVKSAFLNGELKETVY
         L+ V +L                   R EKGEVVKHKA LVAKGY+ KQGVDFEEVFA   RLE VR LL IA H SWEVHHMDVKS FLNGELKET+ 
Subjt:  GLKWVFKL------------------KRNEKGEVVKHKARLVAKGYVQKQGVDFEEVFAPAARLESVRFLLAIAAHHSWEVHHMDVKSAFLNGELKETVY

Query:  VRQPPGFLDNDNPNKVLRLHKALCGLQQAPRAWNAKLDSTLLSLNFKRCASEHDMYTYGHGKKRLIVGVYVDNLIITGGDVGVL
        V+QPPGFLDNDNP+KVLRLHKAL GL+QAPRAWNAKLDS LLS+ FK CASEH MYT+ H ++RLI+GVYVD+LIITGGD+ VL
Subjt:  VRQPPGFLDNDNPNKVLRLHKALCGLQQAPRAWNAKLDSTLLSLNFKRCASEHDMYTYGHGKKRLIVGVYVDNLIITGGDVGVL

TrEMBL top hitse value%identityAlignment
A0A438CDR2 Retrovirus-related Pol polyprotein from transposon RE13.8e-8437.59Show/hide
Query:  SSSKESSESWLIDNGYTNHITYDKESFEELRDTEDKRVRIGNGEHLEVKGKGTVAITSYEGTKFIPDVLFVPKIDQNLLSVGQLLDKGYKVLFENKQCLI
        ++S  SS+SWLID+G TNH+T D+E F+EL  T   +V+IGNGE + VKGKGTVAI S  G K+I DVL+VP IDQNLLSVGQL++KG+KV+FE+K C+I
Subjt:  SSSKESSESWLIDNGYTNHITYDKESFEELRDTEDKRVRIGNGEHLEVKGKGTVAITSYEGTKFIPDVLFVPKIDQNLLSVGQLLDKGYKVLFENKQCLI

Query:  KDASGKDLFNVKMEGKSFALN-----PTAFILRVSATEIWHKKLTLSSSRG----NKPYVVKIEGRGTILFVSK-----GGEHCKLTDFYFIPQARRTTN
        KDA G+D+F VKM  KSFALN       AF   VS  E+WH++L      G     K  +VK    G  L   K       ++ K T   F   A R  +
Subjt:  KDASGKDLFNVKMEGKSFALN-----PTAFILRVSATEIWHKKLTLSSSRG----NKPYVVKIEGRGTILFVSK-----GGEHCKLTDFYFIPQARRTTN

Query:  RLYILELEIDQPVSL-SAKTEEVSWRWHARYGHLNFPALEETSPPPAGAPPEPVEFATPRTADSTLDADHDTDLEARYRMMDDLVGGGEPPGLAARELEE
        +L ++  ++  P    S    +    +   Y    +    ++    A    +   +   +++        D   E    + D      E  G+  +    
Subjt:  RLYILELEIDQPVSL-SAKTEEVSWRWHARYGHLNFPALEETSPPPAGAPPEPVEFATPRTADSTLDADHDTDLEARYRMMDDLVGGGEPPGLAARELEE

Query:  VTFAE--AEKNPCWRKAMQEEMTSITENQTWSLEDMPPGHRAIGLKWVFKLKRNEKGEVVKHKARLVAKGYVQKQGVD--FEEAMQEEMTSITENQTWSL
         T  +   +++   +KA QE++    E++ W+ E+              K++  +  +        V     ++   D  + EAM+EE+  I +N TW L
Subjt:  VTFAE--AEKNPCWRKAMQEEMTSITENQTWSLEDMPPGHRAIGLKWVFKLKRNEKGEVVKHKARLVAKGYVQKQGVD--FEEAMQEEMTSITENQTWSL

Query:  EDMPPGHRAIGLKWVFKLKRNEKGEVVKHKARLVAKGYVQKQGVDFEEVFAPAARLESVRFLLAIAAHHSWEVHHMDVKSAFLNGELKETVYVRQPPGFL
         D P   + IG+KWV++ K N  G V K+KARLV KGY Q  GVDF E FAP ARL+++R LLA+ A   W+ + +DVKSAFLNG L+E +YV QP GF 
Subjt:  EDMPPGHRAIGLKWVFKLKRNEKGEVVKHKARLVAKGYVQKQGVDFEEVFAPAARLESVRFLLAIAAHHSWEVHHMDVKSAFLNGELKETVYVRQPPGFL

Query:  DNDNPNKVLRLHKALCGLQQAPRAWNAKLDSTLLSLNFKRCASEHDMYTYGHGKKRLIVGVYVDNLIITGGD
              KV  L KAL GL+QAPRAW +++D  L SL F +  SE  +Y  G     ++V VYVD+L++TG +
Subjt:  DNDNPNKVLRLHKALCGLQQAPRAWNAKLDSTLLSLNFKRCASEHDMYTYGHGKKRLIVGVYVDNLIITGGD

A0A438DQ73 Retrovirus-related Pol polyprotein from transposon RE14.9e-8435.56Show/hide
Query:  SSSKESSESWLIDNGYTNHITYDKESFEELRDTEDKRVRIGNGEHLEVKGKGTVAITSYEGTKFIPDVLFVPKIDQNLLSVGQLLDKGYKVLFENKQCLI
        ++S  SS+SWLID+G TNH+T D+E F+EL  T   +V+IGNGE + VKGKGTVAI S  G K+I DVL+VP IDQNLLSVGQL++KG+KV+FE+K C+I
Subjt:  SSSKESSESWLIDNGYTNHITYDKESFEELRDTEDKRVRIGNGEHLEVKGKGTVAITSYEGTKFIPDVLFVPKIDQNLLSVGQLLDKGYKVLFENKQCLI

Query:  KDASGKDLFNVKMEGKSFALN-----PTAFILRVSATEIWHKKLTLSSSRG----NKPYVVKIEGRGTILFVSK-----GGEHCKLTDFYFIPQARRTTN
        KDA G+D+F VKM  K FALN       AF   VS  E+WH++L      G     K  +VK    G  L   K       ++ K T   F   A R  +
Subjt:  KDASGKDLFNVKMEGKSFALN-----PTAFILRVSATEIWHKKLTLSSSRG----NKPYVVKIEGRGTILFVSK-----GGEHCKLTDFYFIPQARRTTN

Query:  RLYILELEIDQPVS-------------------------LSAKTE--EVSWRWHARYGHLNFPALEETSPPPAGAPPEPV------------EFATPRTA
        +L ++  ++  P                           L +K+E   V W++ A   + +   +++            +            +  TP T 
Subjt:  RLYILELEIDQPVS-------------------------LSAKTE--EVSWRWHARYGHLNFPALEETSPPPAGAPPEPV------------EFATPRTA

Query:  DST------LDADHDTDLEARYRMMDDLVGGGEPPGLAARELEEVTFAEAEKNPCWRKAMQEEMTSITENQTWSLEDMP-PGHRAIGLKWVFKLKRNEKG
                 LD   +  +   Y          +P         +V F E ++   W ++++ ++  + +     ++D+P  G R++   +       E+ 
Subjt:  DST------LDADHDTDLEARYRMMDDLVGGGEPPGLAARELEEVTFAEAEKNPCWRKAMQEEMTSITENQTWSLEDMP-PGHRAIGLKWVFKLKRNEKG

Query:  EVVKHKARLVAKGYVQKQGVDFEEAMQEEMTSITENQTWSLEDMPPGHRAIGLKWVFKLKRNEKGEVVKHKARLVAKGYVQKQGVDFEEVFAPAARLESV
         V   +    A+    ++   + EAM+EE+  I +N TW L D P   + IG+KWV++ K N  G V K+KARLV KGY Q  GVDF E FAP ARL+++
Subjt:  EVVKHKARLVAKGYVQKQGVDFEEAMQEEMTSITENQTWSLEDMPPGHRAIGLKWVFKLKRNEKGEVVKHKARLVAKGYVQKQGVDFEEVFAPAARLESV

Query:  RFLLAIAAHHSWEVHHMDVKSAFLNGELKETVYVRQPPGFLDNDNPNKVLRLHKALCGLQQAPRAWNAKLDSTLLSLNFKRCASEHDMYTYGHGKKRLIV
        R LLA+ A   W+ + +DVKSAFLNG L++ +YV QP GF       KV  L KAL GL+QAPRAW +++D  L SL F +  SE  +Y  G     ++V
Subjt:  RFLLAIAAHHSWEVHHMDVKSAFLNGELKETVYVRQPPGFLDNDNPNKVLRLHKALCGLQQAPRAWNAKLDSTLLSLNFKRCASEHDMYTYGHGKKRLIV

Query:  GVYVDNLIITGGD
         VYVD+L++TG +
Subjt:  GVYVDNLIITGGD

A0A438EE23 Retrovirus-related Pol polyprotein from transposon RE12.6e-8534.58Show/hide
Query:  TSSSSKESSESWLIDNGYTNHITYDKESFEELRDTEDKRVRIGNGEHLEVKGKGTVAITSYEGTKFIPDVLFVPKIDQNLLSVGQLLDKGYKVLFENKQC
        T  ++  +SESWL+D+G TNH+TYD+  F E+  T   +V IGNGE++ VKGKGTVAI S  G K I DVLFVP IDQNLLSVGQL++KG+KV FE+K C
Subjt:  TSSSSKESSESWLIDNGYTNHITYDKESFEELRDTEDKRVRIGNGEHLEVKGKGTVAITSYEGTKFIPDVLFVPKIDQNLLSVGQLLDKGYKVLFENKQC

Query:  LIKDASGKDLFNVKMEGKSFALN-----PTAFILRVSATEIWHKKLTLSSSRGNKPYVVK-IEGRGTILFVSKGGEHCKLTDFYFIPQARRTTNRLYILE
        +IKDA G+++FN+KM+GKSFALN       A     + T +WHK+L       +K  ++K +EGR       KG                    + + L 
Subjt:  LIKDASGKDLFNVKMEGKSFALN-----PTAFILRVSATEIWHKKLTLSSSRGNKPYVVK-IEGRGTILFVSKGGEHCKLTDFYFIPQARRTTNRLYILE

Query:  LEIDQPVSLSAKTEEVSWRWHARYGHLN---------------FPALEETSPPPAGAP-------PEP---------------VEFATPRTADSTLDADH
        +  D+ ++ +A+ E  +  WH R GH +                P LEE  P  A          P P                + + P+   S   + +
Subjt:  LEIDQPVSLSAKTEEVSWRWHARYGHLN---------------FPALEETSPPPAGAP-------PEP---------------VEFATPRTADSTLDADH

Query:  ----------------------------------DTDLEARYRMMDDLVGGGEPPGL-----AARELEEVTFAEAEKNPCWR--KAMQEEMTSITENQTW
                                          +   E R ++  D +     PG+     +  +   +   +  K    R  K ++ E  S  E    
Subjt:  ----------------------------------DTDLEARYRMMDDLVGGGEPPGL-----AARELEEVTFAEAEKNPCWR--KAMQEEMTSITENQTW

Query:  SLEDMPPGHRAIGLKWVFKLKRNEKGEVVKHKARLVAKGYVQ-KQGVDFEEAMQEEMTSITENQTWSLEDMPPGHRAIGLKWVFKLKRNEKGEVVKHKAR
         +++        G + +F + +        + A L   G+V+  +   +  AMQEE+  I +N TW L D P   + IG+KWV++ K N  G + KHKAR
Subjt:  SLEDMPPGHRAIGLKWVFKLKRNEKGEVVKHKARLVAKGYVQ-KQGVDFEEAMQEEMTSITENQTWSLEDMPPGHRAIGLKWVFKLKRNEKGEVVKHKAR

Query:  LVAKGYVQKQGVDFEEVFAPAARLESVRFLLAIAAHHSWEVHHMDVKSAFLNGELKETVYVRQPPGFLDNDNPNKVLRLHKALCGLQQAPRAWNAKLDST
        LV KGY Q  GVDF E FAP ARL+++R LLA+AA   W+++ +D+KSAFLNG L+E ++V QP GF       KV  L KAL GL+QAPRAW  ++D+ 
Subjt:  LVAKGYVQKQGVDFEEVFAPAARLESVRFLLAIAAHHSWEVHHMDVKSAFLNGELKETVYVRQPPGFLDNDNPNKVLRLHKALCGLQQAPRAWNAKLDST

Query:  LLSLNFKRCASEHDMYTYGHGKKRLIVGVYVDNLIITGGDVG
        LL+L F +  SE  +Y     +  LIV +YVD+L++TG + G
Subjt:  LLSLNFKRCASEHDMYTYGHGKKRLIVGVYVDNLIITGGDVG

A0A438JJ21 Retrovirus-related Pol polyprotein from transposon RE22.6e-8536.24Show/hide
Query:  TSSSSKESSESWLIDNGYTNHITYDKESFEELRDTEDKRVRIGNGEHLEVKGKGTVAITSYEGTKFIPDVLFVPKIDQNLLSVGQLLDKGYKVLFENKQC
        T  ++  +SESWL+D+G TNH+TYD++ F E+  T   +VRIGNGE++ VKGKGTVAI S  G K I DVLFVP IDQNLLSVGQ ++KG+KV FE+K C
Subjt:  TSSSSKESSESWLIDNGYTNHITYDKESFEELRDTEDKRVRIGNGEHLEVKGKGTVAITSYEGTKFIPDVLFVPKIDQNLLSVGQLLDKGYKVLFENKQC

Query:  LIKDASGKDLFNVKMEGKSFALN-----PTAFILRVSATEIWHKKLTLSSSRGNKPYVVK----IEGRGTI---LFVSKGGEHCKLTDFYFIPQ--ARRT
        +IKDA G+++FN+KM+GKSFALN       A     + T +WHK+  L     N    +K    +EG   +   L +    ++ K T   F PQ  A ++
Subjt:  LIKDASGKDLFNVKMEGKSFALN-----PTAFILRVSATEIWHKKLTLSSSRGNKPYVVK----IEGRGTI---LFVSKGGEHCKLTDFYFIPQ--ARRT

Query:  TNRLYILELEIDQP------------VSLSAKTEEVSWRWHARYG------HLNFPALEETSPP---PAGAPPEPVEFATPRTADSTLDADHDTDLEARY
        T +L ++  ++  P            ++         W +   Y        L + A+ E                E+ + +      DA  D  L A Y
Subjt:  TNRLYILELEIDQP------------VSLSAKTEEVSWRWHARYG------HLNFPALEETSPP---PAGAPPEPVEFATPRTADSTLDADHDTDLEARY

Query:  R------MMDDLVGGGEPPGLAARELEEVTFAEAEKNPCWRKAMQEEMTSITENQTWSLEDMPPGH--------RAIGLKWVFKLKRNEKGEVVKHKARL
               +  D +     PG+        + A     P   K +        E + WS ++    +           G + +F + +        + A L
Subjt:  R------MMDDLVGGGEPPGLAARELEEVTFAEAEKNPCWRKAMQEEMTSITENQTWSLEDMPPGH--------RAIGLKWVFKLKRNEKGEVVKHKARL

Query:  VAKGYVQ-KQGVDFEEAMQEEMTSITENQTWSLEDMPPGHRAIGLKWVFKLKRNEKGEVVKHKARLVAKGYVQKQGVDFEEVFAPAARLESVRFLLAIAA
           G+V+  +   +   MQEE+  I +N TW L D P   + IG+KWV++ K N  G + KHKARLV KGY Q  GVDF E FAP ARL+++R LLA+AA
Subjt:  VAKGYVQ-KQGVDFEEAMQEEMTSITENQTWSLEDMPPGHRAIGLKWVFKLKRNEKGEVVKHKARLVAKGYVQKQGVDFEEVFAPAARLESVRFLLAIAA

Query:  HHSWEVHHMDVKSAFLNGELKETVYVRQPPGFLDNDNPNKVLRLHKALCGLQQAPRAWNAKLDSTLLSLNFKRCASEHDMYTYGHGKKRLIVGVYVDNLI
           W+++ +DVKSAFLNG L+E ++V QP GF       KV  L KAL GL+QAPRAW +++D+ LL+L F +  SE  +Y     +  LIV +YVD+L+
Subjt:  HHSWEVHHMDVKSAFLNGELKETVYVRQPPGFLDNDNPNKVLRLHKALCGLQQAPRAWNAKLDSTLLSLNFKRCASEHDMYTYGHGKKRLIVGVYVDNLI

Query:  ITGGDVG
        +TG + G
Subjt:  ITGGDVG

A0A5A7VC09 Retrovirus-related Pol polyprotein from transposon TNT 1-941.1e-8635.05Show/hide
Query:  MVTSSSSKESSESWLIDNGYTNHITYDKESFEELRDTEDKRVRIGNGEHLEVKGKGTVAITSYEGTKFIPDVLFVPKIDQNLLSVGQLLDKGYKVLFENK
        +VT     ES+ESWLID+G TNH+T+DKE F++L+ T   +V IGNG+++ VKGKGT+AI S +GTK I DVLFVP I+QNLLSVGQL++KG+KV FEN+
Subjt:  MVTSSSSKESSESWLIDNGYTNHITYDKESFEELRDTEDKRVRIGNGEHLEVKGKGTVAITSYEGTKFIPDVLFVPKIDQNLLSVGQLLDKGYKVLFENK

Query:  QCLIKDASGKDLFNVKMEGKSFALNP-----TAFILRVSATEIWHKKLTLSSSRGNKPYVVKIEGRGTILFVSKGGEHCKLTDF------YFIPQARRTT
         CLIKDA+ +D+F VKM+GKSF+LNP     + F L+   T++WHK++     +G    +   E       +S+    CK   F       F   + R T
Subjt:  QCLIKDASGKDLFNVKMEGKSFALNP-----TAFILRVSATEIWHKKLTLSSSRGNKPYVVKIEGRGTILFVSKGGEHCKLTDF------YFIPQARRTT

Query:  NRLYILELEIDQP---------------------------VSLSAKTEEVSWRWHARYGHLNFPALEETSPPPAGAPPEPVEFATPR-TADSTLDADHDT
         +L ++  ++  P                           +   ++   V W++ AR    N   ++     P       V     R   + T    H+ 
Subjt:  NRLYILELEIDQP---------------------------VSLSAKTEEVSWRWHARYGHLNFPALEETSPPPAGAPPEPVEFATPR-TADSTLDADHDT

Query:  DLEARYRMMDDLVGGGEPPGLAARELEEVTFAEAEK-------------------NPCWRKAMQEEMTSITENQTWSLEDM-PPGHRAIGLKWVF-----
         L  ++              L  + L+E T  EA +                    P   K +        E++ W+ +D    G     +K+ F     
Subjt:  DLEARYRMMDDLVGGGEPPGLAARELEEVTFAEAEK-------------------NPCWRKAMQEEMTSITENQTWSLEDM-PPGHRAIGLKWVF-----

Query:  KLKRNEKGEVVKHKA----RLVAKGYVQ--------------KQGVDFEEAMQEEMTSITENQTWSLEDMPPGHRAIGLKWVFKLKRNEKGEVVKHKARL
        + + + + E+V   +    RL++  Y +              K+   +  AM+EE++ I +N+TW L D P   + IG+KWVF+ K N  G + KHKARL
Subjt:  KLKRNEKGEVVKHKA----RLVAKGYVQ--------------KQGVDFEEAMQEEMTSITENQTWSLEDMPPGHRAIGLKWVFKLKRNEKGEVVKHKARL

Query:  VAKGYVQKQGVDFEEVFAPAARLESVRFLLAIAAHHSWEVHHMDVKSAFLNGELKETVYVRQPPGFLDNDNPNKVLRLHKALCGLQQAPRAWNAKLDSTL
        V KGY Q  GVD+ + FAP AR++++R L AIAA   W+++ +DVKSAFLNG L+E +YV QP G       NKV  L KAL GL+QAPRAW +K+D  L
Subjt:  VAKGYVQKQGVDFEEVFAPAARLESVRFLLAIAAHHSWEVHHMDVKSAFLNGELKETVYVRQPPGFLDNDNPNKVLRLHKALCGLQQAPRAWNAKLDSTL

Query:  LSLNFKRCASEHDMYTYGHGKKRLIVGVYVDNLIITGGD
        LSL F +  +E  +Y   +G   LIV +YVD+L++TG +
Subjt:  LSLNFKRCASEHDMYTYGHGKKRLIVGVYVDNLIITGGD

SwissProt top hitse value%identityAlignment
P04146 Copia protein8.2e-3638.97Show/hide
Query:  FEEAMQEEMTSITENQTWSLEDMPPGHRAIGLKWVFKLKRNEKGEVVKHKARLVAKGYVQKQGVDFEEVFAPAARLESVRFLLAIAAHHSWEVHHMDVKS
        +EEA+  E+ +   N TW++   P     +  +WVF +K NE G  +++KARLVA+G+ QK  +D+EE FAP AR+ S RF+L++   ++ +VH MDVK+
Subjt:  FEEAMQEEMTSITENQTWSLEDMPPGHRAIGLKWVFKLKRNEKGEVVKHKARLVAKGYVQKQGVDFEEVFAPAARLESVRFLLAIAAHHSWEVHHMDVKS

Query:  AFLNGELKETVYVRQPPGFLDNDNPNKVLRLHKALCGLQQAPRAWNAKLDSTLLSLNFKRCASEHDMYTYGHG--KKRLIVGVYVDNLIITGGDV
        AFLNG LKE +Y+R P G   + N + V +L+KA+ GL+QA R W    +  L    F   + +  +Y    G   + + V +YVD+++I  GD+
Subjt:  AFLNGELKETVYVRQPPGFLDNDNPNKVLRLHKALCGLQQAPRAWNAKLDSTLLSLNFKRCASEHDMYTYGHG--KKRLIVGVYVDNLIITGGDV

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-947.1e-4042.05Show/hide
Query:  EAMQEEMTSITENQTWSLEDMPPGHRAIGLKWVFKLKRNEKGEVVKHKARLVAKGYVQKQGVDFEEVFAPAARLESVRFLLAIAAHHSWEVHHMDVKSAF
        +AMQEEM S+ +N T+ L ++P G R +  KWVFKLK++   ++V++KARLV KG+ QK+G+DF+E+F+P  ++ S+R +L++AA    EV  +DVK+AF
Subjt:  EAMQEEMTSITENQTWSLEDMPPGHRAIGLKWVFKLKRNEKGEVVKHKARLVAKGYVQKQGVDFEEVFAPAARLESVRFLLAIAAHHSWEVHHMDVKSAF

Query:  LNGELKETVYVRQPPGFLDNDNPNKVLRLHKALCGLQQAPRAWNAKLDSTLLSLNFKRCASEHDMYTYGHGKKR-LIVGVYVDNLIITGGDVGVL
        L+G+L+E +Y+ QP GF      + V +L+K+L GL+QAPR W  K DS + S  + +  S+  +Y     +   +I+ +YVD+++I G D G++
Subjt:  LNGELKETVYVRQPPGFLDNDNPNKVLRLHKALCGLQQAPRAWNAKLDSTLLSLNFKRCASEHDMYTYGHGKKR-LIVGVYVDNLIITGGDVGVL

P25600 Putative transposon Ty5-1 protein YCL074W9.4e-1640.86Show/hide
Query:  MDVKSAFLNGELKETVYVRQPPGFLDNDNPNKVLRLHKALCGLQQAPRAWNAKLDSTLLSLNFKRCASEHDMYTYGHGKKRLIVGVYVDNLII
        MDV +AFLN  + E +YV+QPPGF++  NP+ V  L+  + GL+QAP  WN  +++TL  + F R   EH +Y        + + VYVD+L++
Subjt:  MDVKSAFLNGELKETVYVRQPPGFLDNDNPNKVLRLHKALCGLQQAPRAWNAKLDSTLLSLNFKRCASEHDMYTYGHGKKRLIVGVYVDNLII

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE11.5e-4242.64Show/hide
Query:  FEEAMQEEMTSITENQTWSLEDMPPGH-RAIGLKWVFKLKRNEKGEVVKHKARLVAKGYVQKQGVDFEEVFAPAARLESVRFLLAIAAHHSWEVHHMDVK
        +  AM  E+ +   N TW L   PP H   +G +W+F  K N  G + ++KARLVAKGY Q+ G+D+ E F+P  +  S+R +L +A   SW +  +DV 
Subjt:  FEEAMQEEMTSITENQTWSLEDMPPGH-RAIGLKWVFKLKRNEKGEVVKHKARLVAKGYVQKQGVDFEEVFAPAARLESVRFLLAIAAHHSWEVHHMDVK

Query:  SAFLNGELKETVYVRQPPGFLDNDNPNKVLRLHKALCGLQQAPRAWNAKLDSTLLSLNFKRCASEHDMYTYGHGKKRLIVGVYVDNLIITGGDVGVL
        +AFL G L + VY+ QPPGF+D D PN V +L KAL GL+QAPRAW  +L + LL++ F    S+  ++    GK  + + VYVD+++ITG D  +L
Subjt:  SAFLNGELKETVYVRQPPGFLDNDNPNKVLRLHKALCGLQQAPRAWNAKLDSTLLSLNFKRCASEHDMYTYGHGKKRLIVGVYVDNLIITGGDVGVL

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE21.4e-4041.12Show/hide
Query:  FEEAMQEEMTSITENQTWSL-EDMPPGHRAIGLKWVFKLKRNEKGEVVKHKARLVAKGYVQKQGVDFEEVFAPAARLESVRFLLAIAAHHSWEVHHMDVK
        + +AM  E+ +   N TW L    PP    +G +W+F  K N  G + ++KARLVAKGY Q+ G+D+ E F+P  +  S+R +L +A   SW +  +DV 
Subjt:  FEEAMQEEMTSITENQTWSL-EDMPPGHRAIGLKWVFKLKRNEKGEVVKHKARLVAKGYVQKQGVDFEEVFAPAARLESVRFLLAIAAHHSWEVHHMDVK

Query:  SAFLNGELKETVYVRQPPGFLDNDNPNKVLRLHKALCGLQQAPRAWNAKLDSTLLSLNFKRCASEHDMYTYGHGKKRLIVGVYVDNLIITGGDVGVL
        +AFL G L + VY+ QPPGF+D D P+ V RL KA+ GL+QAPRAW  +L + LL++ F    S+  ++    G+  + + VYVD+++ITG D  +L
Subjt:  SAFLNGELKETVYVRQPPGFLDNDNPNKVLRLHKALCGLQQAPRAWNAKLDSTLLSLNFKRCASEHDMYTYGHGKKRLIVGVYVDNLIITGGDVGVL

Arabidopsis top hitse value%identityAlignment
AT3G20980.1 Gag-Pol-related retrotransposon family protein3.3e-0833.9Show/hide
Query:  WLIDNGYTNHITYDKESFEELRDTEDKRVRIGNGEHLE-----VKGKGTVAITSYEGTKFIPDVLFVPKIDQNLLSVGQLLDKGYKVLFENKQ-CLIKDA
        WLI +  +NH+T   + F  L  +   +V+  +G+  E     V+G G V   + EG K I +VL+VP I+ N LSV QL   G++V  E +  C + D 
Subjt:  WLIDNGYTNHITYDKESFEELRDTEDKRVRIGNGEHLE-----VKGKGTVAITSYEGTKFIPDVLFVPKIDQNLLSVGQLLDKGYKVLFENKQ-CLIKDA

Query:  SGKDLFNVKM-EGKSFAL
        +   +F   M E + F L
Subjt:  SGKDLFNVKM-EGKSFAL

AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 82.8e-3941.27Show/hide
Query:  AMQEEMTSITENQTWSLEDMPPGHRAIGLKWVFKLKRNEKGEVVKHKARLVAKGYVQKQGVDFEEVFAPAARLESVRFLLAIAAHHSWEVHHMDVKSAFL
        AM +E+ ++    TW +  +PP  + IG KWV+K+K N  G + ++KARLVAKGY Q++G+DF E F+P  +L SV+ +LAI+A +++ +H +D+ +AFL
Subjt:  AMQEEMTSITENQTWSLEDMPPGHRAIGLKWVFKLKRNEKGEVVKHKARLVAKGYVQKQGVDFEEVFAPAARLESVRFLLAIAAHHSWEVHHMDVKSAFL

Query:  NGELKETVYVRQPPGFL----DNDNPNKVLRLHKALCGLQQAPRAWNAKLDSTLLSLNFKRCASEHDMYTYGHGKKRLIVGVYVDNLII
        NG+L E +Y++ PPG+     D+  PN V  L K++ GL+QA R W  K   TL+   F +  S+H  +        L V VYVD++II
Subjt:  NGELKETVYVRQPPGFL----DNDNPNKVLRLHKALCGLQQAPRAWNAKLDSTLLSLNFKRCASEHDMYTYGHGKKRLIVGVYVDNLII

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)6.9e-1441.67Show/hide
Query:  EAMQEEMTSITENQTWSLEDMPPGHRAIGLKWVFKLKRNEKGEVVKHKARLVAKGYVQKQGVDFEEVFAPAARLESVRFLLAIA
        +AMQEE+ +++ N+TW L   P     +G KWVFK K +  G + + KARLVAKG+ Q++G+ F E ++P  R  ++R +L +A
Subjt:  EAMQEEMTSITENQTWSLEDMPPGHRAIGLKWVFKLKRNEKGEVVKHKARLVAKGYVQKQGVDFEEVFAPAARLESVRFLLAIA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTCACTTCTTCCTCAAGCAAAGAATCAAGTGAGAGCTGGTTGATTGACAATGGGTACACAAATCACATAACATATGACAAGGAGTCTTTTGAGGAATTAAGAGACAC
TGAAGATAAGAGAGTGAGGATTGGCAATGGTGAACACTTGGAAGTCAAGGGAAAAGGCACAGTAGCTATAACAAGTTATGAAGGTACAAAATTTATTCCAGATGTTTTAT
TTGTACCTAAAATTGATCAAAATCTCTTAAGTGTTGGCCAGTTACTCGATAAAGGCTATAAAGTATTGTTTGAGAATAAGCAGTGCTTGATCAAAGATGCTAGTGGCAAA
GACTTGTTCAATGTCAAAATGGAAGGAAAAAGCTTTGCTCTAAATCCGACGGCCTTTATATTGAGAGTTAGTGCCACTGAGATTTGGCACAAAAAACTTACACTCTCATC
ATCGAGGGGCAACAAACCATATGTCGTCAAGATCGAAGGGCGCGGCACTATCCTGTTCGTCAGTAAGGGAGGCGAGCATTGCAAGCTGACTGACTTCTACTTCATCCCGC
AGGCAAGGCGCACCACAAACCGTCTTTACATCCTGGAGTTAGAGATAGACCAACCCGTTAGCCTCTCGGCCAAGACCGAAGAGGTATCTTGGAGGTGGCACGCAAGGTAC
GGACACTTAAACTTTCCTGCCCTAGAAGAGACGTCACCGCCGCCAGCAGGTGCACCACCTGAACCAGTGGAATTCGCAACACCACGGACTGCGGATTCGACGCTGGATGC
CGATCACGATACTGATCTGGAGGCTAGGTACCGGATGATGGATGACCTAGTGGGAGGAGGTGAACCACCTGGACTAGCAGCGCGCGAGCTCGAGGAAGTCACCTTCGCCG
AAGCAGAAAAGAACCCGTGCTGGCGGAAGGCAATGCAGGAGGAGATGACATCCATCACTGAGAACCAGACGTGGAGTCTGGAGGATATGCCACCAGGACACCGAGCCATA
GGGCTCAAATGGGTCTTCAAACTGAAGCGCAATGAAAAAGGAGAAGTTGTGAAGCACAAGGCTCGTCTGGTGGCGAAGGGCTACGTCCAGAAGCAAGGAGTGGACTTCGA
AGAGGCAATGCAGGAGGAGATGACATCCATCACTGAGAACCAGACGTGGAGTCTGGAGGATATGCCACCAGGACACCGAGCCATAGGGCTCAAATGGGTCTTCAAACTGA
AGCGCAATGAAAAAGGAGAAGTTGTGAAGCACAAGGCTCGTCTGGTGGCGAAGGGCTACGTCCAGAAGCAAGGAGTGGACTTCGAAGAGGTATTTGCGCCAGCAGCAAGG
TTAGAATCCGTTCGTTTCTTGCTGGCAATTGCAGCACATCACTCTTGGGAGGTTCACCATATGGACGTAAAGTCTGCTTTCCTTAACGGAGAGTTGAAGGAGACCGTCTA
TGTTCGACAACCACCTGGCTTCCTGGATAACGACAACCCTAATAAGGTACTGCGCCTGCACAAAGCACTCTGCGGGCTTCAACAAGCCCCACGAGCCTGGAACGCGAAGC
TCGACAGTACCCTACTGTCACTGAATTTCAAACGTTGTGCCTCTGAGCATGACATGTACACGTACGGCCACGGCAAAAAACGACTGATCGTGGGAGTGTACGTCGACAAC
CTCATAATCACTGGAGGCGACGTGGGAGTCCTCTGA
mRNA sequenceShow/hide mRNA sequence
ATGGTCACTTCTTCCTCAAGCAAAGAATCAAGTGAGAGCTGGTTGATTGACAATGGGTACACAAATCACATAACATATGACAAGGAGTCTTTTGAGGAATTAAGAGACAC
TGAAGATAAGAGAGTGAGGATTGGCAATGGTGAACACTTGGAAGTCAAGGGAAAAGGCACAGTAGCTATAACAAGTTATGAAGGTACAAAATTTATTCCAGATGTTTTAT
TTGTACCTAAAATTGATCAAAATCTCTTAAGTGTTGGCCAGTTACTCGATAAAGGCTATAAAGTATTGTTTGAGAATAAGCAGTGCTTGATCAAAGATGCTAGTGGCAAA
GACTTGTTCAATGTCAAAATGGAAGGAAAAAGCTTTGCTCTAAATCCGACGGCCTTTATATTGAGAGTTAGTGCCACTGAGATTTGGCACAAAAAACTTACACTCTCATC
ATCGAGGGGCAACAAACCATATGTCGTCAAGATCGAAGGGCGCGGCACTATCCTGTTCGTCAGTAAGGGAGGCGAGCATTGCAAGCTGACTGACTTCTACTTCATCCCGC
AGGCAAGGCGCACCACAAACCGTCTTTACATCCTGGAGTTAGAGATAGACCAACCCGTTAGCCTCTCGGCCAAGACCGAAGAGGTATCTTGGAGGTGGCACGCAAGGTAC
GGACACTTAAACTTTCCTGCCCTAGAAGAGACGTCACCGCCGCCAGCAGGTGCACCACCTGAACCAGTGGAATTCGCAACACCACGGACTGCGGATTCGACGCTGGATGC
CGATCACGATACTGATCTGGAGGCTAGGTACCGGATGATGGATGACCTAGTGGGAGGAGGTGAACCACCTGGACTAGCAGCGCGCGAGCTCGAGGAAGTCACCTTCGCCG
AAGCAGAAAAGAACCCGTGCTGGCGGAAGGCAATGCAGGAGGAGATGACATCCATCACTGAGAACCAGACGTGGAGTCTGGAGGATATGCCACCAGGACACCGAGCCATA
GGGCTCAAATGGGTCTTCAAACTGAAGCGCAATGAAAAAGGAGAAGTTGTGAAGCACAAGGCTCGTCTGGTGGCGAAGGGCTACGTCCAGAAGCAAGGAGTGGACTTCGA
AGAGGCAATGCAGGAGGAGATGACATCCATCACTGAGAACCAGACGTGGAGTCTGGAGGATATGCCACCAGGACACCGAGCCATAGGGCTCAAATGGGTCTTCAAACTGA
AGCGCAATGAAAAAGGAGAAGTTGTGAAGCACAAGGCTCGTCTGGTGGCGAAGGGCTACGTCCAGAAGCAAGGAGTGGACTTCGAAGAGGTATTTGCGCCAGCAGCAAGG
TTAGAATCCGTTCGTTTCTTGCTGGCAATTGCAGCACATCACTCTTGGGAGGTTCACCATATGGACGTAAAGTCTGCTTTCCTTAACGGAGAGTTGAAGGAGACCGTCTA
TGTTCGACAACCACCTGGCTTCCTGGATAACGACAACCCTAATAAGGTACTGCGCCTGCACAAAGCACTCTGCGGGCTTCAACAAGCCCCACGAGCCTGGAACGCGAAGC
TCGACAGTACCCTACTGTCACTGAATTTCAAACGTTGTGCCTCTGAGCATGACATGTACACGTACGGCCACGGCAAAAAACGACTGATCGTGGGAGTGTACGTCGACAAC
CTCATAATCACTGGAGGCGACGTGGGAGTCCTCTGA
Protein sequenceShow/hide protein sequence
MVTSSSSKESSESWLIDNGYTNHITYDKESFEELRDTEDKRVRIGNGEHLEVKGKGTVAITSYEGTKFIPDVLFVPKIDQNLLSVGQLLDKGYKVLFENKQCLIKDASGK
DLFNVKMEGKSFALNPTAFILRVSATEIWHKKLTLSSSRGNKPYVVKIEGRGTILFVSKGGEHCKLTDFYFIPQARRTTNRLYILELEIDQPVSLSAKTEEVSWRWHARY
GHLNFPALEETSPPPAGAPPEPVEFATPRTADSTLDADHDTDLEARYRMMDDLVGGGEPPGLAARELEEVTFAEAEKNPCWRKAMQEEMTSITENQTWSLEDMPPGHRAI
GLKWVFKLKRNEKGEVVKHKARLVAKGYVQKQGVDFEEAMQEEMTSITENQTWSLEDMPPGHRAIGLKWVFKLKRNEKGEVVKHKARLVAKGYVQKQGVDFEEVFAPAAR
LESVRFLLAIAAHHSWEVHHMDVKSAFLNGELKETVYVRQPPGFLDNDNPNKVLRLHKALCGLQQAPRAWNAKLDSTLLSLNFKRCASEHDMYTYGHGKKRLIVGVYVDN
LIITGGDVGVL