; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh14G010260 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh14G010260
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionRetrotran_gag_3 domain-containing protein
Genome locationCmo_Chr14:5586647..5591559
RNA-Seq ExpressionCmoCh14G010260
SyntenyCmoCh14G010260
Gene Ontology termsGO:0006810 - transport (biological process)
InterPro domainsIPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR029472 - Retrotransposon Copia-like, N-terminal


Homology Show/hide homology
GenBank top hitse value%identityAlignment
ACL54615.1 unknown [Zea mays]3.5e-6742.86Show/hide
Query:  MASTFGPIP---LHHAVTIRLTKNNFIIWRAQLIPYLRSTKLMGYLDGTVVAPAKMV---PSSTAAGADLVSNPAYKQWYDQDQQVLSGLLSSMSEEILH
        +A+ F  +P       V+++L   N+++W AQ++PYLRS  L G++DG++ AP + V   P+  + G  +  NP +  WY QDQ VLS + SS+SEE+L 
Subjt:  MASTFGPIP---LHHAVTIRLTKNNFIIWRAQLIPYLRSTKLMGYLDGTVVAPAKMV---PSSTAAGADLVSNPAYKQWYDQDQQVLSGLLSSMSEEILH

Query:  DVVAATTSKEVWDTLQRMFSSSTRARTVQTRVELATSKKRNQCAANYLHEIKE----LAAAGCALPDDDVIAYLLAVLGPNYDPFVTSMTTKSEALTLDD
         VV ATT++  W TL+RM++SS+R R +Q R++LAT +K +  AA Y  ++K     LAA G  L D++ I+YLL  L  +YD  VTS+TT+ +  T+ D
Subjt:  DVVAATTSKEVWDTLQRMFSSSTRARTVQTRVELATSKKRNQCAANYLHEIKE----LAAAGCALPDDDVIAYLLAVLGPNYDPFVTSMTTKSEALTLDD

Query:  VFAHLMTFEARQLQHQAELQLNPGSSAN-YASHGG---QRKNRGRRDRGPRRSQGYAPSHSAGDRHSPYARPS--------CQICGKVGHTVVRCWHRMD
        V+AHL++FE RQ  H A  Q++  ++AN   S GG    +  RG R RG R   G       G+   P   PS        CQICGK  H  ++CWHR D
Subjt:  VFAHLMTFEARQLQHQAELQLNPGSSAN-YASHGG---QRKNRGRRDRGPRRSQGYAPSHSAGDRHSPYARPS--------CQICGKVGHTVVRCWHRMD

Query:  EFYQDEPPSASSTVLAATSSYKIYPNWYSDTGAIDHITSDLDRLAVRERDHGGEQVQVGNGAAA
        + YQ E  S      AAT  Y + PNWY D+GA DHITSDL+RL  RER  GG+++QV NGA +
Subjt:  EFYQDEPPSASSTVLAATSSYKIYPNWYSDTGAIDHITSDLDRLAVRERDHGGEQVQVGNGAAA

ACN32036.1 unknown [Zea mays]1.2e-6743.13Show/hide
Query:  MASTFGPIP---LHHAVTIRLTKNNFIIWRAQLIPYLRSTKLMGYLDGTVVAPAKMV---PSSTAAGADLVSNPAYKQWYDQDQQVLSGLLSSMSEEILH
        +A+ F  +P       V+++L   N+++W AQ++PYLRS  L G++DG++ AP + V   P+  + G  +  NP +  WY QDQ VLS + SS+SEE+L 
Subjt:  MASTFGPIP---LHHAVTIRLTKNNFIIWRAQLIPYLRSTKLMGYLDGTVVAPAKMV---PSSTAAGADLVSNPAYKQWYDQDQQVLSGLLSSMSEEILH

Query:  DVVAATTSKEVWDTLQRMFSSSTRARTVQTRVELATSKKRNQCAANYLHEIKE----LAAAGCALPDDDVIAYLLAVLGPNYDPFVTSMTTKSEALTLDD
         VV ATT++  W TL+RM++SS+RAR +Q R++LAT +K +  AA Y  ++K     LAA G  L D++ I+YLL  L  +YD  VTS+TT+ +  T+ D
Subjt:  DVVAATTSKEVWDTLQRMFSSSTRARTVQTRVELATSKKRNQCAANYLHEIKE----LAAAGCALPDDDVIAYLLAVLGPNYDPFVTSMTTKSEALTLDD

Query:  VFAHLMTFEARQLQHQAELQLNPGSSAN-YASHGG---QRKNRGRRDRGPRRSQGYAPSHSAGDRHSPYARPS--------CQICGKVGHTVVRCWHRMD
        V+AHL++FE RQ  H A  Q++  ++AN   S GG    +  RG R RG R   G       G+   P   PS        CQICGK  H  ++CWHR D
Subjt:  VFAHLMTFEARQLQHQAELQLNPGSSAN-YASHGG---QRKNRGRRDRGPRRSQGYAPSHSAGDRHSPYARPS--------CQICGKVGHTVVRCWHRMD

Query:  EFYQDEPPSASSTVLAATSSYKIYPNWYSDTGAIDHITSDLDRLAVRERDHGGEQVQVGNGAAA
        + YQ E  S      AAT  Y + PNWY D+GA DHITSDL+RL  RER  GG+++QV NGA +
Subjt:  EFYQDEPPSASSTVLAATSSYKIYPNWYSDTGAIDHITSDLDRLAVRERDHGGEQVQVGNGAAA

KAG8084596.1 hypothetical protein GUJ93_ZPchr0010g7974 [Zizania palustris]8.5e-10660.76Show/hide
Query:  IPLHHAVTIRLTKNNFIIWRAQLIPYLRSTKLMGYLDGTVVAPAKMVPSSTAAGADLVSNPAYKQWYDQDQQVLSGLLSSMSEEILHDVVAATTSKEVWD
        +PLH AVTIRLTK N+ +WRAQL+P+LRSTKL+G+LDG+  A +K + +ST AGA  ++NPAY++WYD DQQ+LSGLLSSM+EE+L DV  ATT+KE WD
Subjt:  IPLHHAVTIRLTKNNFIIWRAQLIPYLRSTKLMGYLDGTVVAPAKMVPSSTAAGADLVSNPAYKQWYDQDQQVLSGLLSSMSEEILHDVVAATTSKEVWD

Query:  TLQRMFSSSTRARTVQTRVELATSKKRNQCAANYLHEIK----ELAAAGCALPDDDVIAYLLAVLGPNYDPFVTSMTTKSEALTLDDVFAHLMTFEARQL
         LQR F+SSTRAR VQ RVELAT KKR+  A +Y ++++    +LA AG  L DD+++AYL A L   YDPFVTSMTT     T+DDVFAHL+ FEARQL
Subjt:  TLQRMFSSSTRARTVQTRVELATSKKRNQCAANYLHEIK----ELAAAGCALPDDDVIAYLLAVLGPNYDPFVTSMTTKSEALTLDDVFAHLMTFEARQL

Query:  QHQAELQLNPGSSANYASHG-----GQRKNRGRRDRGPRRSQGYAPSHSAGDRHSPYARPSCQICGKVGHTVVRCWHRMDEFYQDEPPSASSTVLAATSS
        +HQAELQLN G+SAN+A  G     G+ +  GR    P R +G  PS   G  ++P  RP+CQIC K GHT +RCW+RMDE YQ+E PSA+   +A+TSS
Subjt:  QHQAELQLNPGSSANYASHG-----GQRKNRGRRDRGPRRSQGYAPSHSAGDRHSPYARPSCQICGKVGHTVVRCWHRMDEFYQDEPPSASSTVLAATSS

Query:  YKIYPNWYSDTGAIDHITSDLDRLAVRERDHGGEQVQVGNGAAA
        YKI  NWY DTGA DHITSDLDRLA+RER +GG+QVQVGNGA +
Subjt:  YKIYPNWYSDTGAIDHITSDLDRLAVRERDHGGEQVQVGNGAAA

RLN35346.1 uncharacterized protein C2845_PM03G10830 [Panicum miliaceum]1.3e-5840.06Show/hide
Query:  ASTFGPIPLH-HAVTIRLTKNNFIIWRAQLIPYLRSTKLMGYLDGTVVAPAKMVPSSTAAGADLVSNPAYKQWYDQDQQVLSGLLSSMSEEILHDVVAAT
        +S   P PL   A+T +LT+ N+ IW AQ++  ++   + G+L G++V P K +    A     V NPA+++W  +DQQ+LS L + +S +IL  +  + 
Subjt:  ASTFGPIPLH-HAVTIRLTKNNFIIWRAQLIPYLRSTKLMGYLDGTVVAPAKMVPSSTAAGADLVSNPAYKQWYDQDQQVLSGLLSSMSEEILHDVVAAT

Query:  TSKEVWDTLQRMFSSSTRARTVQTRVELATSKKRNQCAANYLHEIK----ELAAAGCALPDDDVIAYLLAVLGPNYDPFVTSMTTKSEALTLDDVFAHLM
        T++  W  ++ MF+S TRAR V  R+ LA +KK N  A  Y  ++K    E+AAAG ++ DD+++ Y+L  L   Y+  VTS+ T+ E++TLD+++A L+
Subjt:  TSKEVWDTLQRMFSSSTRARTVQTRVELATSKKRNQCAANYLHEIK----ELAAAGCALPDDDVIAYLLAVLGPNYDPFVTSMTTKSEALTLDDVFAHLM

Query:  TFEARQ-LQHQAELQLNPGSSANYASHGGQR-KNRGR-----RDRGP------RRSQGYAPS---HSAGDRHSPYARPSCQICGKVGHTVVRCWHRMDEF
         FE R  L H  E      +SAN A  GG R  NRGR     R RGP       R QG++ +     + + +    +P CQ+C K GHT +RCW+R DE 
Subjt:  TFEARQ-LQHQAELQLNPGSSANYASHGGQR-KNRGR-----RDRGP------RRSQGYAPS---HSAGDRHSPYARPSCQICGKVGHTVVRCWHRMDEF

Query:  YQDEPPSASSTVLAATSSYKIYPNWYSDTGAIDHITSDLDRLAVRERDHGGEQVQVGNGAAA
        Y DE   A     AA+SSY I  NWY+DTGA DHITS+L++LAVRE+ +GG+Q+   +GA +
Subjt:  YQDEPPSASSTVLAATSSYKIYPNWYSDTGAIDHITSDLDRLAVRERDHGGEQVQVGNGAAA

XP_023544061.1 uncharacterized protein LOC111803757 [Cucurbita pepo subsp. pepo]1.7e-6959.64Show/hide
Query:  STFGPIPLHHAVTIRLTKNNFIIWRAQLIPYLRSTKLMGYLDGTVVAPAKMVPSSTAAGADLVSNPAYKQWYDQDQQVLSGLLSSMSEEILHDVVAATTS
        +TFGPIPLHHAVTIRLTKNNFIIWRAQL+PYLRSTKLMGYLDGT  APAKMVPSSTAA A+L+                                     
Subjt:  STFGPIPLHHAVTIRLTKNNFIIWRAQLIPYLRSTKLMGYLDGTVVAPAKMVPSSTAAGADLVSNPAYKQWYDQDQQVLSGLLSSMSEEILHDVVAATTS

Query:  KEVWDTLQRMFSSSTRARTVQTRVELATSKKRNQCAANYLHEIK----ELAAAGCALPDDDVIAYLLAVLGPNYDPFVTSMTTKSEALTLDDVFAHLMTF
                              RVELATSKKR+Q A NY  +IK    ELAAA  AL DDDVIAYLLA LGP+YDPFVTSMTTKSEALTLDDV       
Subjt:  KEVWDTLQRMFSSSTRARTVQTRVELATSKKRNQCAANYLHEIK----ELAAAGCALPDDDVIAYLLAVLGPNYDPFVTSMTTKSEALTLDDVFAHLMTF

Query:  EARQLQHQAELQLNPGSSANYASHGGQRKNRGRRDRGPRRSQGYAPSHSAGDRHSPYARPSCQICGKVGHTVVRC
               + ELQLN GSSANYASHGGQ+KN GRRDRG  RSQGYA S   GDR  P AR SCQICGKVGHTV+RC
Subjt:  EARQLQHQAELQLNPGSSANYASHGGQRKNRGRRDRGPRRSQGYAPSHSAGDRHSPYARPSCQICGKVGHTVVRC

TrEMBL top hitse value%identityAlignment
A0A2N9G872 Uncharacterized protein5.2e-6138.48Show/hide
Query:  MASTFGP---IPLHHAVTIRLTKNNFIIWRAQLIPYLRSTKLMGYLDGTVVAPAKMVPSSTAAGADLVSNPAYKQWYDQDQQVLSGLLSSMSEEILHDVV
        ++ST+ P    P+HH +TI+LT++N+++WRAQ++PYLR   L G+LDG+ VAP   +   T        NP +  W+ QDQ +LS L+SS+SE +L  VV
Subjt:  MASTFGP---IPLHHAVTIRLTKNNFIIWRAQLIPYLRSTKLMGYLDGTVVAPAKMVPSSTAAGADLVSNPAYKQWYDQDQQVLSGLLSSMSEEILHDVV

Query:  AATTSKEVWDTLQRMFSSSTRARTVQTRVELATSKKRNQCAANYLHEI----KELAAAGCALPDDDVIAYLLAVLGPNYDPFVTSMTTKSEALTLDDVFA
          TT++EVW TL RMF+S +RART+Q   +LAT +K +   A++ H        LAA    L D +++++L+A LG  YD  VTS+ T+++ L+L++++ 
Subjt:  AATTSKEVWDTLQRMFSSSTRARTVQTRVELATSKKRNQCAANYLHEI----KELAAAGCALPDDDVIAYLLAVLGPNYDPFVTSMTTKSEALTLDDVFA

Query:  HLMTFEARQLQHQAELQLNPG-------SSANYASHGGQRKNRGRRDRGPRRSQGYAPSHSAGDRHSPYA--RPSCQICGKVGHTVVRCWHRMDEFYQDE
        HL+  E R +Q+Q  + L+         +S+     GG+  N G+  RG   +     +   G     +   RP CQ+C K GH  + C+HR D  Y  E
Subjt:  HLMTFEARQLQHQAELQLNPG-------SSANYASHGGQRKNRGRRDRGPRRSQGYAPSHSAGDRHSPYA--RPSCQICGKVGHTVVRCWHRMDEFYQDE

Query:  PPSASSTVLAATSSYKIYPNWYSDTGAIDHITSDLDRLAVR-ERDHGGEQVQVGNG
          + +     AT      PNWY+DTGA  H+TSD   L +R E  HG EQ++VGNG
Subjt:  PPSASSTVLAATSSYKIYPNWYSDTGAIDHITSDLDRLAVR-ERDHGGEQVQVGNG

A0A2N9G872 Uncharacterized protein2.3e-0851.32Show/hide
Query:  KHNIKKPKLDFTPSNADVSLFIFNKTGIQMYILIYVDDIIIIGSSSTATEKLLTQVQDDFVVKDLDILSYFLGIEV
        K + K   L FT S +D SLFI+  +   MY+LIYVDDIII  S  +A ++LL  ++ DF VKDL  L++FLGIEV
Subjt:  KHNIKKPKLDFTPSNADVSLFIFNKTGIQMYILIYVDDIIIIGSSSTATEKLLTQVQDDFVVKDLDILSYFLGIEV

A0A2N9G872 Uncharacterized protein5.2e-6138.48Show/hide
Query:  MASTFGP---IPLHHAVTIRLTKNNFIIWRAQLIPYLRSTKLMGYLDGTVVAPAKMVPSSTAAGADLVSNPAYKQWYDQDQQVLSGLLSSMSEEILHDVV
        ++ST+ P    P+HH +TI+LT++N+++WRAQ++PYLR   L G+LDG+ VAP   +   T        NP +  W+ QDQ +LS L+SS+SE +L  VV
Subjt:  MASTFGP---IPLHHAVTIRLTKNNFIIWRAQLIPYLRSTKLMGYLDGTVVAPAKMVPSSTAAGADLVSNPAYKQWYDQDQQVLSGLLSSMSEEILHDVV

Query:  AATTSKEVWDTLQRMFSSSTRARTVQTRVELATSKKRNQCAANYLHEI----KELAAAGCALPDDDVIAYLLAVLGPNYDPFVTSMTTKSEALTLDDVFA
          TT++EVW TL RMF+S +RART+Q   +LAT +K +   A++ H        LAA    L D +++++L+A LG  YD  VTS+ T+++ L+L++++ 
Subjt:  AATTSKEVWDTLQRMFSSSTRARTVQTRVELATSKKRNQCAANYLHEI----KELAAAGCALPDDDVIAYLLAVLGPNYDPFVTSMTTKSEALTLDDVFA

Query:  HLMTFEARQLQHQAELQLNPG-------SSANYASHGGQRKNRGRRDRGPRRSQGYAPSHSAGDRHSPYA--RPSCQICGKVGHTVVRCWHRMDEFYQDE
        HL+  E R +Q+Q  + L+         +S+     GG+  N G+  RG   +     +   G     +   RP CQ+C K GH  + C+HR D  Y  E
Subjt:  HLMTFEARQLQHQAELQLNPG-------SSANYASHGGQRKNRGRRDRGPRRSQGYAPSHSAGDRHSPYA--RPSCQICGKVGHTVVRCWHRMDEFYQDE

Query:  PPSASSTVLAATSSYKIYPNWYSDTGAIDHITSDLDRLAVR-ERDHGGEQVQVGNG
          + +     AT      PNWY+DTGA  H+TSD   L +R E  HG EQ++VGNG
Subjt:  PPSASSTVLAATSSYKIYPNWYSDTGAIDHITSDLDRLAVR-ERDHGGEQVQVGNG

A0A2N9HMT4 Uncharacterized protein1.4e-6139.36Show/hide
Query:  HHAVTIRLTKNNFIIWRAQLIPYLRSTKLMGYLDGTVVAPAKMVPSSTAAGADLVSNPAYKQWYDQDQQVLSGLLSSMSEEILHDVVAATTSKEVWDTLQ
        HH +TI+LT++N+++W+AQ++PYL+   L G++DG+  AP++ + S T+  A    NPA++ W+ QDQ ++S L+SS+SE IL  +V   TS+EVW TL+
Subjt:  HHAVTIRLTKNNFIIWRAQLIPYLRSTKLMGYLDGTVVAPAKMVPSSTAAGADLVSNPAYKQWYDQDQQVLSGLLSSMSEEILHDVVAATTSKEVWDTLQ

Query:  RMFSSSTRARTVQTRVELATSKKRNQCAANYLHEI----KELAAAGCALPDDDVIAYLLAVLGPNYDPFVTSMTTKSEALTLDDVFAHLMTFEARQLQHQ
        RMF+S +RART+Q   +LAT KK +   A+Y H+       LAA    L D +++++LLA LGP++D  VTS+  +++ ++L+D++ HL++ E    Q+Q
Subjt:  RMFSSSTRARTVQTRVELATSKKRNQCAANYLHEI----KELAAAGCALPDDDVIAYLLAVLGPNYDPFVTSMTTKSEALTLDDVFAHLMTFEARQLQHQ

Query:  AELQLNPGSS----ANYASHGGQ--RKNRGRRDRGPRRSQGYAPSHSAGDRHSPYARPSCQICGKVGHTVVRCWHRMDEFY-QDEPPSASSTVLAATSSY
          + L+ G++     + ++HGG   R +          SQG + +   G   S   RP CQ+CGK+GH  + C+HR D  Y +D  P   +  L AT   
Subjt:  AELQLNPGSS----ANYASHGGQ--RKNRGRRDRGPRRSQGYAPSHSAGDRHSPYARPSCQICGKVGHTVVRCWHRMDEFY-QDEPPSASSTVLAATSSY

Query:  KIYPNWYSDTGAIDHITSDLDRLAVRERDH-GGEQVQVGNGAA
        +  PNWY D+GA  H+T+DL  L VR  ++ G +Q++VGNG A
Subjt:  KIYPNWYSDTGAIDHITSDLDRLAVRERDH-GGEQVQVGNGAA

A0A2N9HTS2 Uncharacterized protein2.3e-0851.32Show/hide
Query:  KHNIKKPKLDFTPSNADVSLFIFNKTGIQMYILIYVDDIIIIGSSSTATEKLLTQVQDDFVVKDLDILSYFLGIEV
        K + K   L FT S +D SLFI+  +   MY+LIYVDDIII  S  +A ++LL  ++ DF VKDL  L++FLGIEV
Subjt:  KHNIKKPKLDFTPSNADVSLFIFNKTGIQMYILIYVDDIIIIGSSSTATEKLLTQVQDDFVVKDLDILSYFLGIEV

B8A366 Uncharacterized protein1.7e-6742.86Show/hide
Query:  MASTFGPIP---LHHAVTIRLTKNNFIIWRAQLIPYLRSTKLMGYLDGTVVAPAKMV---PSSTAAGADLVSNPAYKQWYDQDQQVLSGLLSSMSEEILH
        +A+ F  +P       V+++L   N+++W AQ++PYLRS  L G++DG++ AP + V   P+  + G  +  NP +  WY QDQ VLS + SS+SEE+L 
Subjt:  MASTFGPIP---LHHAVTIRLTKNNFIIWRAQLIPYLRSTKLMGYLDGTVVAPAKMV---PSSTAAGADLVSNPAYKQWYDQDQQVLSGLLSSMSEEILH

Query:  DVVAATTSKEVWDTLQRMFSSSTRARTVQTRVELATSKKRNQCAANYLHEIKE----LAAAGCALPDDDVIAYLLAVLGPNYDPFVTSMTTKSEALTLDD
         VV ATT++  W TL+RM++SS+R R +Q R++LAT +K +  AA Y  ++K     LAA G  L D++ I+YLL  L  +YD  VTS+TT+ +  T+ D
Subjt:  DVVAATTSKEVWDTLQRMFSSSTRARTVQTRVELATSKKRNQCAANYLHEIKE----LAAAGCALPDDDVIAYLLAVLGPNYDPFVTSMTTKSEALTLDD

Query:  VFAHLMTFEARQLQHQAELQLNPGSSAN-YASHGG---QRKNRGRRDRGPRRSQGYAPSHSAGDRHSPYARPS--------CQICGKVGHTVVRCWHRMD
        V+AHL++FE RQ  H A  Q++  ++AN   S GG    +  RG R RG R   G       G+   P   PS        CQICGK  H  ++CWHR D
Subjt:  VFAHLMTFEARQLQHQAELQLNPGSSAN-YASHGG---QRKNRGRRDRGPRRSQGYAPSHSAGDRHSPYARPS--------CQICGKVGHTVVRCWHRMD

Query:  EFYQDEPPSASSTVLAATSSYKIYPNWYSDTGAIDHITSDLDRLAVRERDHGGEQVQVGNGAAA
        + YQ E  S      AAT  Y + PNWY D+GA DHITSDL+RL  RER  GG+++QV NGA +
Subjt:  EFYQDEPPSASSTVLAATSSYKIYPNWYSDTGAIDHITSDLDRLAVRERDHGGEQVQVGNGAAA

C0PCZ1 Uncharacterized protein5.8e-6843.13Show/hide
Query:  MASTFGPIP---LHHAVTIRLTKNNFIIWRAQLIPYLRSTKLMGYLDGTVVAPAKMV---PSSTAAGADLVSNPAYKQWYDQDQQVLSGLLSSMSEEILH
        +A+ F  +P       V+++L   N+++W AQ++PYLRS  L G++DG++ AP + V   P+  + G  +  NP +  WY QDQ VLS + SS+SEE+L 
Subjt:  MASTFGPIP---LHHAVTIRLTKNNFIIWRAQLIPYLRSTKLMGYLDGTVVAPAKMV---PSSTAAGADLVSNPAYKQWYDQDQQVLSGLLSSMSEEILH

Query:  DVVAATTSKEVWDTLQRMFSSSTRARTVQTRVELATSKKRNQCAANYLHEIKE----LAAAGCALPDDDVIAYLLAVLGPNYDPFVTSMTTKSEALTLDD
         VV ATT++  W TL+RM++SS+RAR +Q R++LAT +K +  AA Y  ++K     LAA G  L D++ I+YLL  L  +YD  VTS+TT+ +  T+ D
Subjt:  DVVAATTSKEVWDTLQRMFSSSTRARTVQTRVELATSKKRNQCAANYLHEIKE----LAAAGCALPDDDVIAYLLAVLGPNYDPFVTSMTTKSEALTLDD

Query:  VFAHLMTFEARQLQHQAELQLNPGSSAN-YASHGG---QRKNRGRRDRGPRRSQGYAPSHSAGDRHSPYARPS--------CQICGKVGHTVVRCWHRMD
        V+AHL++FE RQ  H A  Q++  ++AN   S GG    +  RG R RG R   G       G+   P   PS        CQICGK  H  ++CWHR D
Subjt:  VFAHLMTFEARQLQHQAELQLNPGSSAN-YASHGG---QRKNRGRRDRGPRRSQGYAPSHSAGDRHSPYARPS--------CQICGKVGHTVVRCWHRMD

Query:  EFYQDEPPSASSTVLAATSSYKIYPNWYSDTGAIDHITSDLDRLAVRERDHGGEQVQVGNGAAA
        + YQ E  S      AAT  Y + PNWY D+GA DHITSDL+RL  RER  GG+++QV NGA +
Subjt:  EFYQDEPPSASSTVLAATSSYKIYPNWYSDTGAIDHITSDLDRLAVRERDHGGEQVQVGNGAAA

SwissProt top hitse value%identityAlignment
P92519 Uncharacterized mitochondrial protein AtMg008103.6e-0649.09Show/hide
Query:  MYILIYVDDIIIIGSSSTATEKLLTQVQDDFVVKDLDILSYFLGIEVR-HTSDLF
        MY+L+YVDDI++ GSS+T    L+ Q+   F +KDL  + YFLGI+++ H S LF
Subjt:  MYILIYVDDIIIIGSSSTATEKLLTQVQDDFVVKDLDILSYFLGIEVR-HTSDLF

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE11.5e-2526.86Show/hide
Query:  RLTKNNFIIWRAQLIPYLRSTKLMGYLDGTVVAPAKMVPSSTAAGADLVSNPAYKQWYDQDQQVLSGLLSSMSEEILHDVVAATTSKEVWDTLQRMFSSS
        +LT  N+++W  Q+       +L G+LDG+   P    P++    A    NP Y +W  QD+ + S +L ++S  +   V  ATT+ ++W+TL++++++ 
Subjt:  RLTKNNFIIWRAQLIPYLRSTKLMGYLDGTVVAPAKMVPSSTAAGADLVSNPAYKQWYDQDQQVLSGLLSSMSEEILHDVVAATTSKEVWDTLQRMFSSS

Query:  TRARTVQTRVELATSKKRNQCAANYLH----EIKELAAAGCALPDDDVIAYLLAVLGPNYDPFVTSMTTKSEALTLDDVFAHLMTFEARQLQHQAELQLN
        +     Q R +L    K  +   +Y+        +LA  G  +  D+ +  +L  L   Y P +  +  K    TL ++   L+  E++ L   +   + 
Subjt:  TRARTVQTRVELATSKKRNQCAANYLH----EIKELAAAGCALPDDDVIAYLLAVLGPNYDPFVTSMTTKSEALTLDDVFAHLMTFEARQLQHQAELQLN

Query:  PGSSANYASH----------GGQRKNRGRRDRGPRRSQGYAPS----HSAGDRHSPYARPSCQICGKVGHTVVRC---WHRMDEFYQDEPPSASS-----
           +AN  SH           G R NR         S+ +  S    H   ++  PY    CQICG  GH+  RC    H +      +PPS  +     
Subjt:  PGSSANYASH----------GGQRKNRGRRDRGPRRSQGYAPS----HSAGDRHSPYARPSCQICGKVGHTVVRC---WHRMDEFYQDEPPSASS-----

Query:  TVLAATSSYKIYPNWYSDTGAIDHITSDLDRLAVRERDHGGEQVQVGNGA
          LA  S Y    NW  D+GA  HITSD + L++ +   GG+ V V +G+
Subjt:  TVLAATSSYKIYPNWYSDTGAIDHITSDLDRLAVRERDHGGEQVQVGNGA

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE13.6e-0640.58Show/hide
Query:  LDFTPSNADVSLFIFNKTGIQMYILIYVDDIIIIGSSSTATEKLLTQVQDDFVVKDLDILSYFLGIEVR
        + F  S +D SLF+  +    +Y+L+YVDDI+I G+  T     L  +   F VKD + L YFLGIE +
Subjt:  LDFTPSNADVSLFIFNKTGIQMYILIYVDDIIIIGSSSTATEKLLTQVQDDFVVKDLDILSYFLGIEVR

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE22.5e-2325.58Show/hide
Query:  RLTKNNFIIWRAQLIPYLRSTKLMGYLDGTVVAPAKMVPSSTAAGADLVSNPAYKQWYDQDQQVLSGLLSSMSEEILHDVVAATTSKEVWDTLQRMFSSS
        +LT  N+++W  Q+       +L G+LDG+   P    P++    A    NP Y +W  QD+ + S +L ++S  +   V  ATT+ ++W+TL++++++ 
Subjt:  RLTKNNFIIWRAQLIPYLRSTKLMGYLDGTVVAPAKMVPSSTAAGADLVSNPAYKQWYDQDQQVLSGLLSSMSEEILHDVVAATTSKEVWDTLQRMFSSS

Query:  TRARTVQTRVELATSKKRNQCAANYLHEIKELAAAGCALPDDDVIAYLLAVLGPNYDPFVTSMTTKSEALTLDDVFAHLMTFEARQLQ-HQAELQLNPGS
        +     Q R               ++    +LA  G  +  D+ +  +L  L  +Y P +  +  K    +L ++   L+  E++ L  + AE+      
Subjt:  TRARTVQTRVELATSKKRNQCAANYLHEIKELAAAGCALPDDDVIAYLLAVLGPNYDPFVTSMTTKSEALTLDDVFAHLMTFEARQLQ-HQAELQLNPGS

Query:  SANYASHGGQRKNRGRRDRGP--------RRSQGYAPSHSAGDRHSPYARP---SCQICGKVGHTVVRC--WHRMDEFYQDEPPSASSTV------LAAT
        +AN  +H     NR + +RG          RS  + PS S     +   +P    CQIC   GH+  RC   H+       +  ++  T       LA  
Subjt:  SANYASHGGQRKNRGRRDRGP--------RRSQGYAPSHSAGDRHSPYARP---SCQICGKVGHTVVRC--WHRMDEFYQDEPPSASSTV------LAAT

Query:  SSYKIYPNWYSDTGAIDHITSDLDRLAVRERDHGGEQVQVGNGA
        S Y    NW  D+GA  HITSD + L+  +   GG+ V + +G+
Subjt:  SSYKIYPNWYSDTGAIDHITSDLDRLAVRERDHGGEQVQVGNGA

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE25.1e-0538.81Show/hide
Query:  FTPSNADVSLFIFNKTGIQMYILIYVDDIIIIGSSSTATEKLLTQVQDDFVVKDLDILSYFLGIEVR
        F  S +D SLF+  +    +Y+L+YVDDI+I G+ +   +  L  +   F VK+ + L YFLGIE +
Subjt:  FTPSNADVSLFIFNKTGIQMYILIYVDDIIIIGSSSTATEKLLTQVQDDFVVKDLDILSYFLGIEVR

Arabidopsis top hitse value%identityAlignment
AT1G21280.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Has 707 Blast hits to 705 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 703; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).1.6e-0924.49Show/hide
Query:  PIPLHHAVTIRLTK-----NNFIIWRAQLIPYLRSTKLMGYLDGTVVAPAKMVPSSTAAGADLVSNPAYKQWYDQDQQVLSGLLSSMSEEILHDVVAATT
        P  +HH     + K     +N++ W+ +   +LR TK  G++DGT+  P                +P Y+ W   +  V+  L++SM++++L  V+ A T
Subjt:  PIPLHHAVTIRLTK-----NNFIIWRAQLIPYLRSTKLMGYLDGTVVAPAKMVPSSTAAGADLVSNPAYKQWYDQDQQVLSGLLSSMSEEILHDVVAATT

Query:  SKEVWDTLQRMFSSSTRARTVQTRVELATSKKRNQCAANYLHEIKEL
        + ++W+ L+R+F      +  Q R  LAT ++       Y  ++ ++
Subjt:  SKEVWDTLQRMFSSSTRARTVQTRVELATSKKRNQCAANYLHEIKEL

AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 89.6e-0732.1Show/hide
Query:  LKHNIKKPKLDFTPSNADVSLFIFNKTGIQMYILIYVDDIIIIGSSSTATEKLLTQVQDDFVVKDLDILSYFLGIEVRHTS
        LK ++      F  S++D + F+     + + +L+YVDDIII  ++  A ++L +Q++  F ++DL  L YFLG+E+  ++
Subjt:  LKHNIKKPKLDFTPSNADVSLFIFNKTGIQMYILIYVDDIIIIGSSSTATEKLLTQVQDDFVVKDLDILSYFLGIEVRHTS

ATMG00810.1 DNA/RNA polymerases superfamily protein2.5e-0749.09Show/hide
Query:  MYILIYVDDIIIIGSSSTATEKLLTQVQDDFVVKDLDILSYFLGIEVR-HTSDLF
        MY+L+YVDDI++ GSS+T    L+ Q+   F +KDL  + YFLGI+++ H S LF
Subjt:  MYILIYVDDIIIIGSSSTATEKLLTQVQDDFVVKDLDILSYFLGIEVR-HTSDLF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGTCGACCTTTGGCCCAATCCCTCTTCACCATGCTGTCACAATCCGTCTTACAAAAAACAACTTCATCATATGGCGAGCCCAGCTCATCCCCTACCTACGGAGTAC
GAAGCTTATGGGCTACCTCGATGGCACCGTTGTCGCACCTGCCAAGATGGTCCCTTCCTCAACCGCTGCTGGTGCTGACTTGGTCTCTAATCCAGCCTATAAGCAGTGGT
ATGATCAGGATCAACAGGTCCTTAGTGGCCTTCTCTCCTCTATGTCTGAGGAGATTCTTCACGATGTGGTTGCCGCTACTACGTCCAAGGAGGTGTGGGATACCCTGCAG
CGGATGTTTTCGTCGTCAACTCGTGCTCGCACTGTTCAGACCCGTGTTGAGCTCGCCACGTCCAAGAAACGCAATCAGTGTGCTGCAAATTATTTACACGAGATCAAAGA
GTTGGCTGCCGCTGGCTGTGCCTTACCAGATGATGATGTGATCGCGTATCTTCTCGCTGTACTTGGCCCTAACTATGATCCCTTCGTCACCTCAATGACTACCAAGAGTG
AAGCTCTCACGCTTGATGATGTGTTTGCACATCTAATGACATTTGAAGCTCGCCAACTACAACACCAGGCTGAACTTCAGTTAAATCCTGGGTCTTCTGCCAATTATGCT
AGTCATGGTGGTCAACGGAAGAATCGTGGGCGTAGGGATCGTGGTCCTCGTCGTTCTCAAGGTTATGCGCCCTCTCATTCTGCTGGTGATCGTCATAGCCCTTATGCTCG
TCCTTCCTGCCAGATCTGCGGCAAAGTAGGGCATACTGTTGTTCGCTGCTGGCATAGGATGGATGAGTTCTATCAAGATGAACCTCCTTCTGCTTCTTCTACGGTACTGG
CAGCTACTTCCTCTTACAAGATTTATCCAAATTGGTACAGCGACACAGGCGCCATTGATCATATCACCAGTGACCTGGATCGTCTCGCTGTGCGTGAACGCGATCATGGA
GGTGAACAAGTTCAAGTCGGCAATGGAGCAGCCGCAATCCTCTGCGTCACTGCCGAGCGAATCAACATTGGTTGTTCCGCCAATGTTGGGGCCTTGGATCCTCCGCCAGC
AGATGATATTGCGCAATGCTCGGTCGAATCCTCGGTCGCTGGTCGACCGACTACTGTAGCATCGGACATAAAACCCAACACGGTTGCTCCCTTCGCAACGGCTGATATGA
CTGTCCCCTCAGATGTGGATCCTACACCTACTGCTCATCCGTATGGTACTCGATTGAAGCACAATATCAAGAAACCCAAGCTAGATTTTACACCTTCAAACGCTGATGTC
TCTCTTTTCATTTTTAACAAGACGGGCATTCAGATGTATATCCTCATCTATGTTGATGACATTATTATCATCGGCTCATCTTCTACGGCTACTGAGAAACTTCTTACACA
GGTTCAGGATGACTTTGTCGTCAAGGATCTTGACATTTTGAGTTATTTTCTTGGGATTGAGGTCCGCCATACTTCCGACTTATTCTGA
mRNA sequenceShow/hide mRNA sequence
ATAATCTAGTGATTTACTGGTTTGTTACCAAATCTATGATTTCATAATCTAGTGATTTATCGGTTTGTTACTAAATCTATGATTTACAATGTAATCTTTATTATACACAT
CTATAAATACGTGAGATTGTGACCCAAGTGCCATTCAACTTAACATGCTATTAAAGTCAAGGTTTATTCTTTAACCTTGTCTACCATATCTTTTTCCTCCTCCCCATCCA
CCATCTCCAATACCTCCTCAACTACCATGGCGTCGACCTTTGGCCCAATCCCTCTTCACCATGCTGTCACAATCCGTCTTACAAAAAACAACTTCATCATATGGCGAGCC
CAGCTCATCCCCTACCTACGGAGTACGAAGCTTATGGGCTACCTCGATGGCACCGTTGTCGCACCTGCCAAGATGGTCCCTTCCTCAACCGCTGCTGGTGCTGACTTGGT
CTCTAATCCAGCCTATAAGCAGTGGTATGATCAGGATCAACAGGTCCTTAGTGGCCTTCTCTCCTCTATGTCTGAGGAGATTCTTCACGATGTGGTTGCCGCTACTACGT
CCAAGGAGGTGTGGGATACCCTGCAGCGGATGTTTTCGTCGTCAACTCGTGCTCGCACTGTTCAGACCCGTGTTGAGCTCGCCACGTCCAAGAAACGCAATCAGTGTGCT
GCAAATTATTTACACGAGATCAAAGAGTTGGCTGCCGCTGGCTGTGCCTTACCAGATGATGATGTGATCGCGTATCTTCTCGCTGTACTTGGCCCTAACTATGATCCCTT
CGTCACCTCAATGACTACCAAGAGTGAAGCTCTCACGCTTGATGATGTGTTTGCACATCTAATGACATTTGAAGCTCGCCAACTACAACACCAGGCTGAACTTCAGTTAA
ATCCTGGGTCTTCTGCCAATTATGCTAGTCATGGTGGTCAACGGAAGAATCGTGGGCGTAGGGATCGTGGTCCTCGTCGTTCTCAAGGTTATGCGCCCTCTCATTCTGCT
GGTGATCGTCATAGCCCTTATGCTCGTCCTTCCTGCCAGATCTGCGGCAAAGTAGGGCATACTGTTGTTCGCTGCTGGCATAGGATGGATGAGTTCTATCAAGATGAACC
TCCTTCTGCTTCTTCTACGGTACTGGCAGCTACTTCCTCTTACAAGATTTATCCAAATTGGTACAGCGACACAGGCGCCATTGATCATATCACCAGTGACCTGGATCGTC
TCGCTGTGCGTGAACGCGATCATGGAGGTGAACAAGTTCAAGTCGGCAATGGAGCAGCCGCAATCCTCTGCGTCACTGCCGAGCGAATCAACATTGGTTGTTCCGCCAAT
GTTGGGGCCTTGGATCCTCCGCCAGCAGATGATATTGCGCAATGCTCGGTCGAATCCTCGGTCGCTGGTCGACCGACTACTGTAGCATCGGACATAAAACCCAACACGGT
TGCTCCCTTCGCAACGGCTGATATGACTGTCCCCTCAGATGTGGATCCTACACCTACTGCTCATCCGTATGGTACTCGATTGAAGCACAATATCAAGAAACCCAAGCTAG
ATTTTACACCTTCAAACGCTGATGTCTCTCTTTTCATTTTTAACAAGACGGGCATTCAGATGTATATCCTCATCTATGTTGATGACATTATTATCATCGGCTCATCTTCT
ACGGCTACTGAGAAACTTCTTACACAGGTTCAGGATGACTTTGTCGTCAAGGATCTTGACATTTTGAGTTATTTTCTTGGGATTGAGGTCCGCCATACTTCCGACTTATT
CTGA
Protein sequenceShow/hide protein sequence
MASTFGPIPLHHAVTIRLTKNNFIIWRAQLIPYLRSTKLMGYLDGTVVAPAKMVPSSTAAGADLVSNPAYKQWYDQDQQVLSGLLSSMSEEILHDVVAATTSKEVWDTLQ
RMFSSSTRARTVQTRVELATSKKRNQCAANYLHEIKELAAAGCALPDDDVIAYLLAVLGPNYDPFVTSMTTKSEALTLDDVFAHLMTFEARQLQHQAELQLNPGSSANYA
SHGGQRKNRGRRDRGPRRSQGYAPSHSAGDRHSPYARPSCQICGKVGHTVVRCWHRMDEFYQDEPPSASSTVLAATSSYKIYPNWYSDTGAIDHITSDLDRLAVRERDHG
GEQVQVGNGAAAILCVTAERINIGCSANVGALDPPPADDIAQCSVESSVAGRPTTVASDIKPNTVAPFATADMTVPSDVDPTPTAHPYGTRLKHNIKKPKLDFTPSNADV
SLFIFNKTGIQMYILIYVDDIIIIGSSSTATEKLLTQVQDDFVVKDLDILSYFLGIEVRHTSDLF