CuGenDBv2

ClCG06G008385 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene ID: ClCG06G008385
Organism: Citrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
Description: Retrovirus-related Pol polyprotein from transposon RE1
Genome location: CG_Chr06:11973005..11978414
RNA-Seq Expression: ClCG06G008385
Synteny: ClCG06G008385
Gene Ontology terms: GO:0003676 - nucleic acid binding (molecular function)
                     GO:0008270 - zinc ion binding (molecular function)
InterPro domains: IPR036875 - Zinc finger, CCHC-type superfamily


Homology
GenBank top hits (e value, % identity, alignment)
KAA0065480.1 Cysteine-rich RLK (receptor-like protein kinase) 8 [Cucumis melo var. makuwa] (e value: 8.5e-41; % identity: 32.08)
Query:  MSVEVYYTKLITIWQELSEHRPTQECTCGGIKSFLDHLDSEFVMIFLMGLNEIYTILRAQILVMSPMPSITKTFPLVIQEEHQRSVRNNVQVTDAMALMA
        +++E YYTKL TIWQ L+E+R T +CTCGG+K F+DHL+SE++M FLMGLN+ Y  +RAQIL+M P+PSI   F L+IQEE QRS        D +AL  
Subjt:  MSVEVYYTKLITIWQELSEHRPTQECTCGGIKSFLDHLDSEFVMIFLMGLNEIYTILRAQILVMSPMPSITKTFPLVIQEEHQRSVRNNVQVTDAMALMA

Query:  TTENAKRTNQSRKKDSQRPICTNCGIKGHVIDKCYKLHGYPPGYKSRINGNAENSPQ----NNSQSTPTANYQPKPS-----PSQQQQQQLTL-------
         +  A  T+++RKK  +RP C+ CGIKGH+ DKCYK HGYPPGYK R + +   +P     NN  +T +A     P       S+Q  Q +TL       
Subjt:  TTENAKRTNQSRKKDSQRPICTNCGIKGHVIDKCYKLHGYPPGYKSRINGNAENSPQ----NNSQSTPTANYQPKPS-----PSQQQQQQLTL-------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------------QDRLSLKMIGKANNKHELYLLNFVDSSNHHTAAALSCAISIETWHHRLGHLSPKCLSLLKDTXXXXXXXNDLSDTLANHV
                            QD     MIGKA+ ++ LY+LN   ++N   A     AIS++TWH RLGHLSPKCLS L  T       N      + HV
Subjt:  --------------------QDRLSLKMIGKANNKHELYLLNFVDSSNHHTAAALSCAISIETWHHRLGHLSPKCLSLLKDTXXXXXXXNDLSDTLANHV

Query:  LPLPIQGTLQQNEENHTSPNTSDM
         PL  Q  L  +  N+ + +  D+
Subjt:  LPLPIQGTLQQNEENHTSPNTSDM

XP_008463248.1 PREDICTED: uncharacterized protein LOC103501452 [Cucumis melo] (e value: 4.3e-37; % identity: 30.5)
Query:  MGLNEIYTILRAQILVMSPMPSITKTFPLVIQEEHQRSVRNNVQVTDAMALMATTEN-----AKRTNQSRKKDSQRPICTNCGIKGHVIDKCYKLHGYPP
        MGLN+ Y  +RAQIL+M P+PSI   F L+IQEE QRS        D  A + TT+N     A  T+++RKK+   P C  CGIKGH+ DKCY  HGYP 
Subjt:  MGLNEIYTILRAQILVMSPMPSITKTFPLVIQEEHQRSVRNNVQVTDAMALMATTEN-----AKRTNQSRKKDSQRPICTNCGIKGHVIDKCYKLHGYPP

Query:  GYKSRINGNAENSPQNN------SQSTPTANYQP---KPSPSQQQQQQLT--------------------------------------------------
        GYK R +     +P  +      + ++  AN+ P       S+Q  Q +T                                                  
Subjt:  GYKSRINGNAENSPQNN------SQSTPTANYQP---KPSPSQQQQQQLT--------------------------------------------------

Query:  ----------------------LQDRLSLKMIGKANNKHELYLLNFVDSSNHHTAAALSCAISIETWHHRLGHLSPKCLSLLKDTXXXXXXXNDLSDTLA
                              + D     MIGKA+ ++ LY+LN   ++N   A     AIS++TWH RLGHLSPKCLS L  T          S   A
Subjt:  ----------------------LQDRLSLKMIGKANNKHELYLLNFVDSSNHHTAAALSCAISIETWHHRLGHLSPKCLSLLKDTXXXXXXXNDLSDTLA

Query:  N-----HVLPLPIQGTLQQNEENHTSPNTSDM--------------FNPGNTPHTAEEQVTFNEEIMQNPSTIEAIEPNNTVESNEATNLPHNITVSPRR
        N       L + I+    +  ++  S N  ++              F+    P         ++ ++ + + ++ I PN T  + E T +P N+ + PR+
Subjt:  N-----HVLPLPIQGTLQQNEENHTSPNTSDM--------------FNPGNTPHTAEEQVTFNEEIMQNPSTIEAIEPNNTVESNEATNLPHNITVSPRR

Query:  SIRRRHPPIFLQDYHCNLLQGQA----------------------------LNTATPYSIT-----VKLDSWRMAMDEEINAMERTRTWSIVPLPDGHRA
        S R+  PP  L+DYHC+LL   +                             N +T Y  +     VK  SWR AMD EI AMERT T SIVPLP GH  
Subjt:  SIRRRHPPIFLQDYHCNLLQGQA----------------------------LNTATPYSIT-----VKLDSWRMAMDEEINAMERTRTWSIVPLPDGHRA

Query:  IGCKW
        +GCKW
Subjt:  IGCKW

XP_019071858.1 PREDICTED: uncharacterized protein LOC100853407 [Vitis vinifera] (e value: 3.3e-37; % identity: 30.17)
Query:  VEVYYTKLITIWQELSEHRPTQECTCGGIKSFLDHLDSEFVMIFLMGLNEIYTILRAQILVMSPMPSITKTFPLVIQEEHQRSV--RNNVQVTDAMA--L
        V  YYT+L ++W EL E +    C CGG++ +++    E VM FL+GLNE +  +RAQIL+M P P + K F LV+QEE QRS+   N+   T  ++   
Subjt:  VEVYYTKLITIWQELSEHRPTQECTCGGIKSFLDHLDSEFVMIFLMGLNEIYTILRAQILVMSPMPSITKTFPLVIQEEHQRSV--RNNVQVTDAMA--L

Query:  MATTENAKRTNQSRKKDSQRPICTNCGIKGHVIDKCYKLHGYPPGYKSRINGNAENSPQNNSQSTPTANYQPKPSPSQQQQQQLTLQDRLSLKMIGKANN
         A +  +  TN SR + + RP+CT+C I GH +D+CYK+HGYPPG+++R N     S  N            +  P+     QLTL D       G   +
Subjt:  MATTENAKRTNQSRKKDSQRPICTNCGIKGHVIDKCYKLHGYPPGYKSRINGNAENSPQNNSQSTPTANYQPKPSPSQQQQQQLTLQDRLSLKMIGKANN

Query:  KHELYLLNFVDSSNHHTAAALSCAISIETWHHRLGHLSPKCLSLLKDTXXXXXXXNDLSDTLANHVLPLPIQGTLQQNEENHTS-PNTSDMFNPGNTPHT
             L +      H+   AL   +S+        H S    +   D+       ++ +D  +  VL +    T   + +N TS PN+ D  +   +PHT
Subjt:  KHELYLLNFVDSSNHHTAAALSCAISIETWHHRLGHLSPKCLSLLKDTXXXXXXXNDLSDTLANHVLPLPIQGTLQQNEENHTS-PNTSDMFNPGNTPHT

Query:  AEEQVTFNEEIMQNPSTIEAIEPNNTVESNEATNLPH----NITVSPRRSIRRRHPPIFLQDYHCNLLQGQALNTATPYSITVKLDSWRMAMDEEINAME
        +    T +  + Q P  +      +      A++ PH    N T  P   +   +       +    +    +   T Y+  V +  W+ AM  E+ A+E
Subjt:  AEEQVTFNEEIMQNPSTIEAIEPNNTVESNEATNLPH----NITVSPRRSIRRRHPPIFLQDYHCNLLQGQALNTATPYSITVKLDSWRMAMDEEINAME

Query:  RTRTWSIVPLPDGHRAIGCKW
           TWS+  LP G  AIGCKW
Subjt:  RTRTWSIVPLPDGHRAIGCKW

XP_022145891.1 uncharacterized protein LOC111015239 [Momordica charantia] (e value: 1.3e-36; % identity: 52.07)
Query:  MSVEVYYTKLITIWQELSEHRPTQECTCGGIKSFLDHLDSEFVMIFLMGLNEIYTILRAQILVMSPMPSITKTFPLVIQEEHQRSVRNNVQVTDAMALMA
        +S+E YYTKL T+WQEL+++RPT +CTC G+KS  +   SE+VM FLMGLNE Y  +RAQIL+M P+P + K F L+IQEE QR++        +MA MA
Subjt:  MSVEVYYTKLITIWQELSEHRPTQECTCGGIKSFLDHLDSEFVMIFLMGLNEIYTILRAQILVMSPMPSITKTFPLVIQEEHQRSVRNNVQVTDAMALMA

Query:  TTENAKRTN--QSRKKDSQRPICTNCGIKGHVIDKCYKLHGYPPGYKSRINGNAENSPQNNSQSTPTAN
          E +KR +  Q R+KD+ R  CT+CG++GHVIDKCYKLHGYPPGY++  N  A     +N   T  +N
Subjt:  TTENAKRTN--QSRKKDSQRPICTNCGIKGHVIDKCYKLHGYPPGYKSRINGNAENSPQNNSQSTPTAN

XP_022155284.1 uncharacterized protein LOC111022420 [Momordica charantia] (e value: 7.2e-40; % identity: 52.54)
Query:  MSVEVYYTKLITIWQELSEHRPTQECTCGGIKSFLDHLDSEFVMIFLMGLNEIYTILRAQILVMSPMPSITKTFPLVIQEEHQRSVRNNVQVTDAMALMA
        +++E YYTKL T+WQELSE+  +  CTCGG+K  L+H +SE+VM+FLMGLNE Y  +RAQIL M PMP I K F L+IQEE+ RSVRN +   D++AL A
Subjt:  MSVEVYYTKLITIWQELSEHRPTQECTCGGIKSFLDHLDSEFVMIFLMGLNEIYTILRAQILVMSPMPSITKTFPLVIQEEHQRSVRNNVQVTDAMALMA

Query:  TTENAKR--TNQSRKKDSQRPICTNCGIKGHVIDKCYKLHGYPPGYKSRINGNAENSPQNNSQSTPTANYQPKPSPS
          E +KR   ++ RKK+ QRP CT+CGIKGH+I+ CYKLHGYPP Y+ R   +   +  N+  ++P  +   K  PS
Subjt:  TTENAKR--TNQSRKKDSQRPICTNCGIKGHVIDKCYKLHGYPPGYKSRINGNAENSPQNNSQSTPTANYQPKPSPS

TrEMBL top hits (e value, % identity, alignment)
A0A2N9EL72 Reverse transcriptase Ty1/copia-type domain-containing protein (e value: 3.2e-38; % identity: 28.4)
Query:  SVEVYYTKLITIWQELSEHRPTQECTCGGIKSFLDHLDSEFVMIFLMGLNEIYTILRAQILVMSPMPSITKTFPLVIQEEHQRSVRNNVQV----TDAMA
        SV  YYTKL   W EL  +RP   CTCG +K+ +D+  SE+VM FL+GL++ Y  +R QIL+M PMP+I K F LV QEE QR + +   +    + A A
Subjt:  SVEVYYTKLITIWQELSEHRPTQECTCGGIKSFLDHLDSEFVMIFLMGLNEIYTILRAQILVMSPMPSITKTFPLVIQEEHQRSVRNNVQV----TDAMA

Query:  LMAT-TENAKRTNQSRKKDSQRPICTNCGIKGHVIDKCYKLHGYPPGYKSRINGNAEN--SPQNNSQSTPTANYQPKP-SPSQQQQ-------QQ-----
        L      NA   N  RK   +RP+C++CGI GH ++KCY+LHG+PPGYK R   +A    +   ++     AN    P SP Q QQ       QQ     
Subjt:  LMAT-TENAKRTNQSRKKDSQRPICTNCGIKGHVIDKCYKLHGYPPGYKSRINGNAEN--SPQNNSQSTPTANYQPKP-SPSQQQQ-------QQ-----

Query:  --------------------------------------------------------LTLQDRLSL-------KMIGKANNKHELYLLNFVDS--------
                                                                + L D L+L       KMIG    +  LY L   DS        
Subjt:  --------------------------------------------------------LTLQDRLSL-------KMIGKANNKHELYLLNFVDS--------

Query:  -SNHHTA---AALSCAISIETWHHRLGHLSPKCLSLLKDTXXXXXXXNDLSDT---------------LANHVLPLPIQGTLQQNEENHTSPNTSDMFNP
          + HT     +++ +  ++ WH RLGH S   L  +KD             T               L +H+L L     L         P+ + + N 
Subjt:  -SNHHTA---AALSCAISIETWHHRLGHLSPKCLSLLKDTXXXXXXXNDLSDT---------------LANHVLPLPIQGTLQQNEENHTSPNTSDMFNP

Query:  GNTPHTAEEQVTFNEEIMQNPSTIEAIEPNNTVESNEATNLPHNITVSPRRSIRRRHPPIFLQDYHCNLL------------QGQALN-TATPYSIT---
           P T       N   +  P  +   EP   V S              R+S+R   PP +L DYHCN++            Q  A N + TPY ++   
Subjt:  GNTPHTAEEQVTFNEEIMQNPSTIEAIEPNNTVESNEATNLPHNITVSPRRSIRRRHPPIFLQDYHCNLL------------QGQALN-TATPYSIT---

Query:  ----------------------------VKLDSWRMAMDEEINAMERTRTWSIVPLPDGHRAIGCKW
                                    V+   WR AM  E+ A+E   TWS+  LP G ++IGCKW
Subjt:  ----------------------------VKLDSWRMAMDEEINAMERTRTWSIVPLPDGHRAIGCKW

A0A2N9H9X1 Uncharacterized protein (e value: 6.5e-39; % identity: 30.56)
Query:  SVEVYYTKLITIWQELSEHRPTQECTCGGIKSFLDHLDSEFVMIFLMGLNEIYTILRAQILVMSPMPSITKTFPLVIQEEHQR--SVRNNVQVTDAMALM
        +V  Y+TKL ++W EL+ +R    C+CG +K  +D+   E VM FLMGLN+ +  +RAQIL+M P+P+I K F LV+QEE QR   V +   ++D+MAL 
Subjt:  SVEVYYTKLITIWQELSEHRPTQECTCGGIKSFLDHLDSEFVMIFLMGLNEIYTILRAQILVMSPMPSITKTFPLVIQEEHQR--SVRNNVQVTDAMALM

Query:  ATTE----NAKRTNQSRKKDSQRPICTNCGIKGHVIDKCYKLHGYPPGYKSRINGNAEN---------------------SPQNNSQSTP----------
           E    N      S KKD  RP+C++CGI  H++DKCYKLHG+PP +K R    A N                     + + +S STP          
Subjt:  ATTE----NAKRTNQSRKKDSQRPICTNCGIKGHVIDKCYKLHGYPPGYKSRINGNAEN---------------------SPQNNSQSTP----------

Query:  --------TANYQPKPS---------PSQQQQQQLT------LQDRLSLKMIGKANNKHELYLLNFVDSS---NHHTAAALSCAISIET-----WHHRLG
                + ++ PK S         PS  Q + +         D +S K IG A  KH LY+L   DS    +    A LS   +  T     WHHRLG
Subjt:  --------TANYQPKPS---------PSQQQQQQLT------LQDRLSLKMIGKANNKHELYLLNFVDSS---NHHTAAALSCAISIET-----WHHRLG

Query:  HLSPKCLSLLKDTXXXXXXXNDLSDTLANHVLPLPIQGTLQQNEENHTSPNTSDMFNPGNTPHTAEEQVTFNEEIMQNPSTIEAIEPNNTVESNEATNLP
        H S   L+LL                 +N+ L   +     Q    H    TS +  P             N  + +    +  +  +   + N +TN P
Subjt:  HLSPKCLSLLKDTXXXXXXXNDLSDTLANHVLPLPIQGTLQQNEENHTSPNTSDMFNPGNTPHTAEEQVTFNEEIMQNPSTIEAIEPNNTVESNEATNLP

Query:  HNITVSPRRSIRRRHPPIFLQDYHCNLLQGQALNTATPYSITVKLDSWRMAMDEEINAMERTRTWSIVPLPDGHRAIGCKW
            +S   S     P     +Y   +     +   + Y+   K   W  AM  EI+A+E  +TWS+  LP G   IGCKW
Subjt:  HNITVSPRRSIRRRHPPIFLQDYHCNLLQGQALNTATPYSITVKLDSWRMAMDEEINAMERTRTWSIVPLPDGHRAIGCKW

A0A2N9HNI5 Uncharacterized protein (e value: 1.6e-40; % identity: 30.85)
Query:  SVEVYYTKLITIWQELSEHRPTQECTCGGIKSFLDHLDSEFVMIFLMGLNEIYTILRAQILVMSPMPSITKTFPLVIQEEHQRSVRNNVQV----TDAMA
        SV  YYTKL   W EL  +RP   CTCG +K+ +D+  SE+VM FL+GL++ Y  +R QIL+M PMP+I K F LV QEE QR + +   +    + A A
Subjt:  SVEVYYTKLITIWQELSEHRPTQECTCGGIKSFLDHLDSEFVMIFLMGLNEIYTILRAQILVMSPMPSITKTFPLVIQEEHQRSVRNNVQV----TDAMA

Query:  LMAT-TENAKRTNQSRKKDSQRPICTNCGIKGHVIDKCYKLHGYPPGYKSRINGNAENSPQNNSQST------------------PTANYQPKPSPSQQQ
        L      NA   N  RK   +RP+C++CGI GH ++KCY+LHG+PPGYK R   +A      +S +T                   T       S     
Subjt:  LMAT-TENAKRTNQSRKKDSQRPICTNCGIKGHVIDKCYKLHGYPPGYKSRINGNAENSPQNNSQST------------------PTANYQPKPSPSQQQ

Query:  QQQLTLQDRLSL-------KMIGKANNKHELYLL--------NFV-DSSNHHTA---AALSCAISIETWHHRLGHLSPKCLSLLKD--TXXXXXXXNDLS
           + L D L+L       KMIG    +  LY L        +F+ ++ + HT     +++ +  ++ WH RLGH S   L  +KD  +       +   
Subjt:  QQQLTLQDRLSL-------KMIGKANNKHELYLL--------NFV-DSSNHHTA---AALSCAISIETWHHRLGHLSPKCLSLLKD--TXXXXXXXNDLS

Query:  DTLANHVLPLPIQGTLQQNEENHTSPNTSDMFNPGNTPHTAEEQVTFNEEIMQNPSTIEAIEPNNTVESNEATNLPHNITVSPRRSIRRRHPPIFLQDYH
        +T   H    PI+  +  + +  + P+ + + N  + P T       N      P  +  +EP   V S              R+S+R   PP +L DY+
Subjt:  DTLANHVLPLPIQGTLQQNEENHTSPNTSDMFNPGNTPHTAEEQVTFNEEIMQNPSTIEAIEPNNTVESNEATNLPHNITVSPRRSIRRRHPPIFLQDYH

Query:  CNLL------------QGQALNTATP--YSITVKLDSWRMAMDEEINAMERTRTWSI
        CN++            Q  A N + P  Y   V+   WR AM  E+ A+E   TW +
Subjt:  CNLL------------QGQALNTATP--YSITVKLDSWRMAMDEEINAMERTRTWSI

A0A5A7VE66 Cysteine-rich RLK (Receptor-like protein kinase) 8 (e value: 4.1e-41; % identity: 32.08)
Query:  MSVEVYYTKLITIWQELSEHRPTQECTCGGIKSFLDHLDSEFVMIFLMGLNEIYTILRAQILVMSPMPSITKTFPLVIQEEHQRSVRNNVQVTDAMALMA
        +++E YYTKL TIWQ L+E+R T +CTCGG+K F+DHL+SE++M FLMGLN+ Y  +RAQIL+M P+PSI   F L+IQEE QRS        D +AL  
Subjt:  MSVEVYYTKLITIWQELSEHRPTQECTCGGIKSFLDHLDSEFVMIFLMGLNEIYTILRAQILVMSPMPSITKTFPLVIQEEHQRSVRNNVQVTDAMALMA

Query:  TTENAKRTNQSRKKDSQRPICTNCGIKGHVIDKCYKLHGYPPGYKSRINGNAENSPQ----NNSQSTPTANYQPKPS-----PSQQQQQQLTL-------
         +  A  T+++RKK  +RP C+ CGIKGH+ DKCYK HGYPPGYK R + +   +P     NN  +T +A     P       S+Q  Q +TL       
Subjt:  TTENAKRTNQSRKKDSQRPICTNCGIKGHVIDKCYKLHGYPPGYKSRINGNAENSPQ----NNSQSTPTANYQPKPS-----PSQQQQQQLTL-------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------------QDRLSLKMIGKANNKHELYLLNFVDSSNHHTAAALSCAISIETWHHRLGHLSPKCLSLLKDTXXXXXXXNDLSDTLANHV
                            QD     MIGKA+ ++ LY+LN   ++N   A     AIS++TWH RLGHLSPKCLS L  T       N      + HV
Subjt:  --------------------QDRLSLKMIGKANNKHELYLLNFVDSSNHHTAAALSCAISIETWHHRLGHLSPKCLSLLKDTXXXXXXXNDLSDTLANHV

Query:  LPLPIQGTLQQNEENHTSPNTSDM
         PL  Q  L  +  N+ + +  D+
Subjt:  LPLPIQGTLQQNEENHTSPNTSDM

A0A6J1DPT8 uncharacterized protein LOC111022420 (e value: 3.5e-40; % identity: 52.54)
Query:  MSVEVYYTKLITIWQELSEHRPTQECTCGGIKSFLDHLDSEFVMIFLMGLNEIYTILRAQILVMSPMPSITKTFPLVIQEEHQRSVRNNVQVTDAMALMA
        +++E YYTKL T+WQELSE+  +  CTCGG+K  L+H +SE+VM+FLMGLNE Y  +RAQIL M PMP I K F L+IQEE+ RSVRN +   D++AL A
Subjt:  MSVEVYYTKLITIWQELSEHRPTQECTCGGIKSFLDHLDSEFVMIFLMGLNEIYTILRAQILVMSPMPSITKTFPLVIQEEHQRSVRNNVQVTDAMALMA

Query:  TTENAKR--TNQSRKKDSQRPICTNCGIKGHVIDKCYKLHGYPPGYKSRINGNAENSPQNNSQSTPTANYQPKPSPS
          E +KR   ++ RKK+ QRP CT+CGIKGH+I+ CYKLHGYPP Y+ R   +   +  N+  ++P  +   K  PS
Subjt:  TTENAKR--TNQSRKKDSQRPICTNCGIKGHVIDKCYKLHGYPPGYKSRINGNAENSPQNNSQSTPTANYQPKPSPS

SwissProt top hits (e value, % identity, alignment)
P92520 Uncharacterized mitochondrial protein AtMg00820 (e value: 8.6e-04; % identity: 43.59)
Query:  VKLDSWRMAMDEEINAMERTRTWSIVPLPDGHRAIGCKW
        +K   W  AM EE++A+ R +TW +VP P     +GCKW
Subjt:  VKLDSWRMAMDEEINAMERTRTWSIVPLPDGHRAIGCKW

Arabidopsis top hits (e value, % identity, alignment)
AT1G21280.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Has 707 Blast hits to 705 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 703; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). (e value: 1.9e-06; % identity: 36.78)
Query:  SVEVYYTKLITIWQELSEHRPTQECTCGG-----IKSFLDHLDSEFVMIFLMG--LNEIYTILRAQILVMSPMPSITKTFPLVIQEE
        SVE Y+ KL  +W ELSE+ P  EC CGG      K   +  + E    FLMG  LN+ +  +  +I+   P PS+ + F +V   E
Subjt:  SVEVYYTKLITIWQELSEHRPTQECTCGG-----IKSFLDHLDSEFVMIFLMG--LNEIYTILRAQILVMSPMPSITKTFPLVIQEE

AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8 (e value: 2.2e-07; % identity: 26.36)
Query:  STIEAIEPNNTVESNEATNLPHNI-TVSPRRSIRRRHPPIFLQDYHCNLLQGQALNTATPYSITVKLDS--------------------------WRMAM
        S  +A   +++++   + N+ +++   S   S RR   P +LQDY+C+ +    ++  + +    K+                            W  AM
Subjt:  STIEAIEPNNTVESNEATNLPHNI-TVSPRRSIRRRHPPIFLQDYHCNLLQGQALNTATPYSITVKLDS--------------------------WRMAM

Query:  DEEINAMERTRTWSIVPLPDGHRAIGCKW
        D+EI AME T TW I  LP   + IGCKW
Subjt:  DEEINAMERTRTWSIVPLPDGHRAIGCKW

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase) (e value: 6.1e-05; % identity: 43.59)
Query:  VKLDSWRMAMDEEINAMERTRTWSIVPLPDGHRAIGCKW
        +K   W  AM EE++A+ R +TW +VP P     +GCKW
Subjt:  VKLDSWRMAMDEEINAMERTRTWSIVPLPDGHRAIGCKW


Sequences
CDS sequence
ATGAGCGTCGAAGTCTACTATACAAAACTCATCACAATTTGGCAAGAACTATCTGAACATCGGCCTACACAAGAATGTACCTGTGGAGGAATAAAATCCTTTCTTGATCA
CCTTGATTCTGAATTCGTCATGATTTTCTTAATGGGACTAAATGAGATCTATACCATTCTACGTGCTCAAATCTTGGTTATGAGTCCAATGCCTTCAATCACCAAAACCT
TCCCATTGGTAATTCAAGAGGAGCATCAACGATCTGTTCGAAACAATGTCCAAGTAACTGATGCAATGGCTTTAATGGCAACTACAGAGAATGCTAAGAGGACAAATCAA
TCACGAAAGAAAGATTCTCAACGACCTATTTGTACAAATTGTGGCATCAAAGGTCATGTCATTGACAAATGCTACAAACTTCATGGTTATCCCCCGGGTTATAAATCAAG
AATCAATGGAAATGCTGAAAATTCTCCACAAAATAATTCCCAATCTACACCTACTGCAAATTATCAACCAAAACCGAGCCCATCACAGCAGCAACAACAACAACTCACAC
TGCAGGACAGACTTTCATTGAAGATGATTGGCAAGGCTAACAACAAACATGAACTCTATTTGCTCAATTTTGTTGACAGCTCCAATCATCATACTGCTGCTGCTCTTTCT
TGCGCCATCTCAATTGAAACTTGGCATCATCGCTTGGGCCATTTATCTCCCAAATGTTTATCATTGCTAAAAGATACTTTNNNNNNNNNNNNNNNNNACAATGATTTATC
TGACACATTGGCAAACCATGTTTTGCCGCTTCCTATTCAAGGAACATTACAACAAAATGAAGAGAATCACACAAGTCCTAACACTTCTGATATGTTTAATCCTGGAAATA
CTCCTCATACAGCAGAAGAACAAGTAACATTTAATGAAGAAATTATGCAAAATCCTTCTACCATTGAAGCTATTGAGCCCAACAATACGGTTGAATCTAATGAAGCTACA
AATCTTCCACATAACATCACTGTTAGTCCACGAAGATCAATAAGAAGACGTCATCCACCTATTTTTCTTCAAGATTACCACTGTAATTTGCTTCAAGGCCAAGCTTTAAA
CACCGCAACTCCATACTCCATCACTGTCAAATTAGACTCTTGGAGAATGGCTATGGATGAAGAAATTAATGCCATGGAAAGAACAAGAACTTGGAGTATTGTTCCTCTAC
CTGACGGTCATCGTGCAATTGGTTGCAAATGGAAAGATCACGACGGAGAGGGTTTCGAGTTGAGGTGCGATCAGGTAGGACAAATCAACGAGAAGAGGAAGAAGCTAGAG
AAGACGTCGAAGAAGAAGAAGATGTAG
mRNA sequence
ATGAGCGTCGAAGTCTACTATACAAAACTCATCACAATTTGGCAAGAACTATCTGAACATCGGCCTACACAAGAATGTACCTGTGGAGGAATAAAATCCTTTCTTGATCA
CCTTGATTCTGAATTCGTCATGATTTTCTTAATGGGACTAAATGAGATCTATACCATTCTACGTGCTCAAATCTTGGTTATGAGTCCAATGCCTTCAATCACCAAAACCT
TCCCATTGGTAATTCAAGAGGAGCATCAACGATCTGTTCGAAACAATGTCCAAGTAACTGATGCAATGGCTTTAATGGCAACTACAGAGAATGCTAAGAGGACAAATCAA
TCACGAAAGAAAGATTCTCAACGACCTATTTGTACAAATTGTGGCATCAAAGGTCATGTCATTGACAAATGCTACAAACTTCATGGTTATCCCCCGGGTTATAAATCAAG
AATCAATGGAAATGCTGAAAATTCTCCACAAAATAATTCCCAATCTACACCTACTGCAAATTATCAACCAAAACCGAGCCCATCACAGCAGCAACAACAACAACTCACAC
TGCAGGACAGACTTTCATTGAAGATGATTGGCAAGGCTAACAACAAACATGAACTCTATTTGCTCAATTTTGTTGACAGCTCCAATCATCATACTGCTGCTGCTCTTTCT
TGCGCCATCTCAATTGAAACTTGGCATCATCGCTTGGGCCATTTATCTCCCAAATGTTTATCATTGCTAAAAGATACTTTNNNNNNNNNNNNNNNNNACAATGATTTATC
TGACACATTGGCAAACCATGTTTTGCCGCTTCCTATTCAAGGAACATTACAACAAAATGAAGAGAATCACACAAGTCCTAACACTTCTGATATGTTTAATCCTGGAAATA
CTCCTCATACAGCAGAAGAACAAGTAACATTTAATGAAGAAATTATGCAAAATCCTTCTACCATTGAAGCTATTGAGCCCAACAATACGGTTGAATCTAATGAAGCTACA
AATCTTCCACATAACATCACTGTTAGTCCACGAAGATCAATAAGAAGACGTCATCCACCTATTTTTCTTCAAGATTACCACTGTAATTTGCTTCAAGGCCAAGCTTTAAA
CACCGCAACTCCATACTCCATCACTGTCAAATTAGACTCTTGGAGAATGGCTATGGATGAAGAAATTAATGCCATGGAAAGAACAAGAACTTGGAGTATTGTTCCTCTAC
CTGACGGTCATCGTGCAATTGGTTGCAAATGGAAAGATCACGACGGAGAGGGTTTCGAGTTGAGGTGCGATCAGGTAGGACAAATCAACGAGAAGAGGAAGAAGCTAGAG
AAGACGTCGAAGAAGAAGAAGATGTAG
Protein sequence
MSVEVYYTKLITIWQELSEHRPTQECTCGGIKSFLDHLDSEFVMIFLMGLNEIYTILRAQILVMSPMPSITKTFPLVIQEEHQRSVRNNVQVTDAMALMATTENAKRTNQ
SRKKDSQRPICTNCGIKGHVIDKCYKLHGYPPGYKSRINGNAENSPQNNSQSTPTANYQPKPSPSQQQQQQLTLQDRLSLKMIGKANNKHELYLLNFVDSSNHHTAAALS
CAISIETWHHRLGHLSPKCLSLLKDTXXXXXXXNDLSDTLANHVLPLPIQGTLQQNEENHTSPNTSDMFNPGNTPHTAEEQVTFNEEIMQNPSTIEAIEPNNTVESNEAT
NLPHNITVSPRRSIRRRHPPIFLQDYHCNLLQGQALNTATPYSITVKLDSWRMAMDEEINAMERTRTWSIVPLPDGHRAIGCKWKDHDGEGFELRCDQVGQINEKRKKLE
KTSKKKKM