CuGenDBv2

CcUC11G217040 (gene) of Watermelon (PI 537277) v1 genome

Gene ID: CcUC11G217040
Organism: Citrullus colocynthis (Watermelon (PI 537277) v1)
Description: Reverse transcriptase
Genome location: CicolChr11:11636030..11637742
RNA-Seq Expression: CcUC11G217040
Synteny: CcUC11G217040
Gene Ontology terms:
    GO:0015074 - DNA integration (biological process)
    GO:0003676 - nucleic acid binding (molecular function)
InterPro domains:
    IPR001584 - Integrase, catalytic core
    IPR012337 - Ribonuclease H-like superfamily
    IPR036397 - Ribonuclease H superfamily
    IPR041373 - Reverse transcriptase, RNase H-like domain
    IPR043502 - DNA/RNA polymerase superfamily
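The genome location listed above follows a `sequence:start..end` pattern. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state the convention), the string can be split into its parts and the span length computed. The function name `parse_location` is hypothetical and is not part of CuGenDB.

```python
import re

def parse_location(location):
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    seqid, start, end = match.groups()
    return seqid, int(start), int(end)

seqid, start, end = parse_location("CicolChr11:11636030..11637742")
# Assumption: 1-based inclusive coordinates, so the span covers end - start + 1 positions.
span_length = end - start + 1
print(seqid, start, end, span_length)
```

Under that assumption the gene spans 1,713 positions on CicolChr11.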


Homology
GenBank top hits
WP_217833156.1 hypothetical protein, partial [Synechococcus sp. PCC 7002]
e value: 2.8e-198; % identity: 71.73
Query:  MCDASDYALGAVLGQRKDNMFRAIYYASRTLDNTQQKYTTTEKELLAVVFALDKFRPYLLDSKIVVHTDHAALKYLFVKKDSKPRLMRWILLLQEFDLEI
        MCDASDYALGAVLGQR+DNMFRAIYYASRTLDNTQQKYTTTEKELLAVVFALDKFR YLL SKIVVHTDHAALKYLFVKKDSKPRLMRWILLLQEFDLEI
Subjt:  MCDASDYALGAVLGQRKDNMFRAIYYASRTLDNTQQKYTTTEKELLAVVFALDKFRPYLLDSKIVVHTDHAALKYLFVKKDSKPRLMRWILLLQEFDLEI

Query:  KDRKGCENVVANHLSRIENGDATSWPPIVETFPDEQLYQVKDSLPWFTNIVNYLAGGHLPPDMNSQQKKRFLHNVKSYHWEDPLLYKVCADNMIRKCVPQ
        KDRKGCENVVA+HLSRIEN DA SWPPIVE FPDEQLYQVKDSLPWF +IVNYLAGGHLPPDMN QQKKRFLHNVKSYHWEDPLLYKVCADNMIRKCVPQ
Subjt:  KDRKGCENVVANHLSRIENGDATSWPPIVETFPDEQLYQVKDSLPWFTNIVNYLAGGHLPPDMNSQQKKRFLHNVKSYHWEDPLLYKVCADNMIRKCVPQ

Query:  EEVVSILNSCHASPYGGHFGPTRTASKILVAIDYVSKWVEAIATRTNDARTVLKFLHK-----NIFTRFGTPRATISDE------GSHFCNKLFESMMQK
        EEVVSILNSCHASPYGGHFGPTRTA+K+L +  Y   W         D  T +K   +     NI  +   P   I +       G  F      S    
Subjt:  EEVVSILNSCHASPYGGHFGPTRTASKILVAIDYVSKWVEAIATRTNDARTVLKFLHK-----NIFTRFGTPRATISDE------GSHFCNKLFESMMQK

Query:  YKDNHKIATAYHPQTNGLAELSNREIKQILEKTVKTNRKDWALKLDDALWAYRTAFKNPIDTSPYRLVFGKACHLPVELEHRAYWAIKKLNMDFEKA---
        Y  NHKIATAYHPQTNGLAELSNREIKQ+LEK VKTNRKDWALKLDDALWAYRTAFK PI TS YRLVFGKACHLPVELEHRAYWAIKKLNMDFEKA   
Subjt:  YKDNHKIATAYHPQTNGLAELSNREIKQILEKTVKTNRKDWALKLDDALWAYRTAFKNPIDTSPYRLVFGKACHLPVELEHRAYWAIKKLNMDFEKA---

Query:  --------------------------------------------------------GKLRTRWSRPFIIVKVSPHGAVELQNNDGTTFKVNGQRFKHYIG
                                                                GKLRTRWS PFIIVKVSPHGAVELQ N+GTTFKVNG R KHYIG
Subjt:  --------------------------------------------------------GKLRTRWSRPFIIVKVSPHGAVELQNNDGTTFKVNGQRFKHYIG

Query:  DEERQFENLAFIA
        DEER  ENLAF A
Subjt:  DEERQFENLAFIA

XP_012833687.1 PREDICTED: uncharacterized protein LOC105954563 [Erythranthe guttata]
e value: 2.8e-161; % identity: 52.17
Query:  MCDASDYALGAVLGQRKDNMFRAIYYASRTLDNTQQKYTTTEKELLAVVFALDKFRPYLLDSKIVVHTDHAALKYLFVKKDSKPRLMRWILLLQEFDLEI
        MCDASDYA+GAVLGQR+D +F+AIYY+SRTLD  Q+ Y+TTEKE+LAVV+A+DKFRPY+L S+++++TDHAA++YLF KKD+KPRL+RW+LLLQEFDLEI
Subjt:  MCDASDYALGAVLGQRKDNMFRAIYYASRTLDNTQQKYTTTEKELLAVVFALDKFRPYLLDSKIVVHTDHAALKYLFVKKDSKPRLMRWILLLQEFDLEI

Query:  KDRKGCENVVANHLSRIENGDATSWPPIVETFPDEQLYQVKDSLPWFTNIVNYLAGGHLPPDMNSQQKKRFLHNVKSYHWEDPLLYKVCADNMIRKCVPQ
        +D+KG ENVVA+HLSR+   +  +   I E+FPDEQL  +    PW+ ++ N+LA G +P D++  QKK+FLH+ + Y W++PLL++   D +IR+CVP+
Subjt:  KDRKGCENVVANHLSRIENGDATSWPPIVETFPDEQLYQVKDSLPWFTNIVNYLAGGHLPPDMNSQQKKRFLHNVKSYHWEDPLLYKVCADNMIRKCVPQ

Query:  EEVVSILNSCHASPYGGHFGPTRTASK-------------------------------------------------------------------ILVAID
         EV  IL  CH+SP GGH G +RTA+K                                                                   IL+A+D
Subjt:  EEVVSILNSCHASPYGGHFGPTRTASK-------------------------------------------------------------------ILVAID

Query:  YVSKWVEAIATRTNDARTVLKFLHKNIFTRFGTPRATISDEGSHFCNKLFESMMQKYKDNHKIATAYHPQTNGLAELSNREIKQILEKTVKTNRKDWALK
        YVSKWVEAIAT TNDARTVLKF HKNIF+RFGTPRA ISDEGSHFCNKL  ++  K    HKIA AYHPQTNGLAELSNREIKQILEKTV TNRKDWALK
Subjt:  YVSKWVEAIATRTNDARTVLKFLHKNIFTRFGTPRATISDEGSHFCNKLFESMMQKYKDNHKIATAYHPQTNGLAELSNREIKQILEKTVKTNRKDWALK

Query:  LDDALWAYRTAFKNPIDTSPYRLVFGKACHLPVELEHRAYWAIKKLNMD----------------------FEKA-------------------------
        LDDALWAYRTAFK PI  SPY+LV+GKACHLPVELEHRAYWA+KKLN D                      +E A                         
Subjt:  LDDALWAYRTAFKNPIDTSPYRLVFGKACHLPVELEHRAYWAIKKLNMD----------------------FEKA-------------------------

Query:  ------------GKLRTRWSRPFIIVKVSPHGAVELQNNDGTTFKVNGQRFKHY
                    GKL++RWS PF+++  +P G +E++  DG +FKVNGQR KHY
Subjt:  ------------GKLRTRWSRPFIIVKVSPHGAVELQNNDGTTFKVNGQRFKHY

XP_012846413.1 PREDICTED: uncharacterized protein LOC105966405 [Erythranthe guttata]
e value: 2.1e-161; % identity: 52.17
Query:  MCDASDYALGAVLGQRKDNMFRAIYYASRTLDNTQQKYTTTEKELLAVVFALDKFRPYLLDSKIVVHTDHAALKYLFVKKDSKPRLMRWILLLQEFDLEI
        MCDASDYA+GAVLGQR+D +F+AIYY+SRTLD  Q+ Y+TTEKE+LAVV+A+DKFRPY+L S+++++TDHAA++YLF KKD+KPRL+RW+LLLQEFDLEI
Subjt:  MCDASDYALGAVLGQRKDNMFRAIYYASRTLDNTQQKYTTTEKELLAVVFALDKFRPYLLDSKIVVHTDHAALKYLFVKKDSKPRLMRWILLLQEFDLEI

Query:  KDRKGCENVVANHLSRIENGDATSWPPIVETFPDEQLYQVKDSLPWFTNIVNYLAGGHLPPDMNSQQKKRFLHNVKSYHWEDPLLYKVCADNMIRKCVPQ
        +D+KG ENVVA+HLSR+   +  +   I E+FPDEQL  +    PW+ ++ N+LA G +P D++  QKK+FLH+ + Y W++PLL++   D +IR+CVP+
Subjt:  KDRKGCENVVANHLSRIENGDATSWPPIVETFPDEQLYQVKDSLPWFTNIVNYLAGGHLPPDMNSQQKKRFLHNVKSYHWEDPLLYKVCADNMIRKCVPQ

Query:  EEVVSILNSCHASPYGGHFGPTRTASK-------------------------------------------------------------------ILVAID
         EV  IL  CH+SP GGH G +RTA+K                                                                   IL+A+D
Subjt:  EEVVSILNSCHASPYGGHFGPTRTASK-------------------------------------------------------------------ILVAID

Query:  YVSKWVEAIATRTNDARTVLKFLHKNIFTRFGTPRATISDEGSHFCNKLFESMMQKYKDNHKIATAYHPQTNGLAELSNREIKQILEKTVKTNRKDWALK
        YVSKWVEAIAT TNDARTVLKF HKNIF+RFGTPRA ISDEGSHFCNKL  ++  K    HKIA AYHPQTNGLAELSNREIKQILEKTV TNRKDWALK
Subjt:  YVSKWVEAIATRTNDARTVLKFLHKNIFTRFGTPRATISDEGSHFCNKLFESMMQKYKDNHKIATAYHPQTNGLAELSNREIKQILEKTVKTNRKDWALK

Query:  LDDALWAYRTAFKNPIDTSPYRLVFGKACHLPVELEHRAYWAIKKLNMD----------------------FEKA-------------------------
        LDDALWAYRTAFK PI  SPY+LV+GKACHLPVELEHRAYWA+KKLN D                      +E A                         
Subjt:  LDDALWAYRTAFKNPIDTSPYRLVFGKACHLPVELEHRAYWAIKKLNMD----------------------FEKA-------------------------

Query:  ------------GKLRTRWSRPFIIVKVSPHGAVELQNNDGTTFKVNGQRFKHY
                    GKL++RWS PF+++  +P G +E++  DG +FKVNGQR KHY
Subjt:  ------------GKLRTRWSRPFIIVKVSPHGAVELQNNDGTTFKVNGQRFKHY

XP_012853783.1 PREDICTED: uncharacterized protein LOC105973307 [Erythranthe guttata]
e value: 5.6e-162; % identity: 52.35
Query:  MCDASDYALGAVLGQRKDNMFRAIYYASRTLDNTQQKYTTTEKELLAVVFALDKFRPYLLDSKIVVHTDHAALKYLFVKKDSKPRLMRWILLLQEFDLEI
        MCDASDYA+GAVLGQR+D +F+AIYY SRTLD  Q+ Y+TTEKE+LAVV+A+DKFRPY+L S+++++TDHAA++YLF KKD+KPRL+RW+LLLQEFDLEI
Subjt:  MCDASDYALGAVLGQRKDNMFRAIYYASRTLDNTQQKYTTTEKELLAVVFALDKFRPYLLDSKIVVHTDHAALKYLFVKKDSKPRLMRWILLLQEFDLEI

Query:  KDRKGCENVVANHLSRIENGDATSWPPIVETFPDEQLYQVKDSLPWFTNIVNYLAGGHLPPDMNSQQKKRFLHNVKSYHWEDPLLYKVCADNMIRKCVPQ
        +D+KG ENVVA+HLSR+  G+  +   I E+FPDEQL  +    PW+ ++ N+LA G +P D++  QKK+FLH+ + Y W++PLL++   D +IR+CVP+
Subjt:  KDRKGCENVVANHLSRIENGDATSWPPIVETFPDEQLYQVKDSLPWFTNIVNYLAGGHLPPDMNSQQKKRFLHNVKSYHWEDPLLYKVCADNMIRKCVPQ

Query:  EEVVSILNSCHASPYGGHFGPTRTASK-------------------------------------------------------------------ILVAID
         EV  IL  CH+SP GGH G +RTA+K                                                                   IL+A+D
Subjt:  EEVVSILNSCHASPYGGHFGPTRTASK-------------------------------------------------------------------ILVAID

Query:  YVSKWVEAIATRTNDARTVLKFLHKNIFTRFGTPRATISDEGSHFCNKLFESMMQKYKDNHKIATAYHPQTNGLAELSNREIKQILEKTVKTNRKDWALK
        YVSKWVEAIAT TNDARTVLKF HKNIF+RFGTPRA ISDEGSHFCNKL  ++  K    HKIA AYHPQTNGLAELSNREIKQILEKTV TNRKDWALK
Subjt:  YVSKWVEAIATRTNDARTVLKFLHKNIFTRFGTPRATISDEGSHFCNKLFESMMQKYKDNHKIATAYHPQTNGLAELSNREIKQILEKTVKTNRKDWALK

Query:  LDDALWAYRTAFKNPIDTSPYRLVFGKACHLPVELEHRAYWAIKKLNMD----------------------FEKA-------------------------
        LDDALWAYRTAFK PI  SPY+LV+GKACHLPVELEHRAYWA+KKLN D                      +E A                         
Subjt:  LDDALWAYRTAFKNPIDTSPYRLVFGKACHLPVELEHRAYWAIKKLNMD----------------------FEKA-------------------------

Query:  ------------GKLRTRWSRPFIIVKVSPHGAVELQNNDGTTFKVNGQRFKHY
                    GKL++RWS PF+++  +P G +E++  DG +FKVNGQR KHY
Subjt:  ------------GKLRTRWSRPFIIVKVSPHGAVELQNNDGTTFKVNGQRFKHY

XP_023874613.1 uncharacterized protein LOC111987139 [Quercus suber]
e value: 1.4e-165; % identity: 53.52
Query:  MCDASDYALGAVLGQRKDNMFRAIYYASRTLDNTQQKYTTTEKELLAVVFALDKFRPYLLDSKIVVHTDHAALKYLFVKKDSKPRLMRWILLLQEFDLEI
        MCDASD+ALGAVLGQR+D +FRAIYYASRTL+  Q  YTTTEKE+LAVVFA DKFR YL+ +K++V TDHAAL+YLF KKD+KPRL+RWILLLQEFDLE+
Subjt:  MCDASDYALGAVLGQRKDNMFRAIYYASRTLDNTQQKYTTTEKELLAVVFALDKFRPYLLDSKIVVHTDHAALKYLFVKKDSKPRLMRWILLLQEFDLEI

Query:  KDRKGCENVVANHLSRIENGDATSWPPIVETFPDEQLYQVKDSLPWFTNIVNYLAGGHLPPDMNSQQKKRFLHNVKSYHWEDPLLYKVCADNMIRKCVPQ
        +D+KG EN VA+HLSR+E  +      I E FPDEQL+  +  LPW+ +IVN+LA   LPPD+   Q+K+FLH+VK Y W++PLL+K C D +IR+CVP+
Subjt:  KDRKGCENVVANHLSRIENGDATSWPPIVETFPDEQLYQVKDSLPWFTNIVNYLAGGHLPPDMNSQQKKRFLHNVKSYHWEDPLLYKVCADNMIRKCVPQ

Query:  EEVVSILNSCHASPYGGHFGPTRTASK-------------------------------------------------------------------ILVAID
        EE+ +IL+ CH+S YGGHFG TRTA+K                                                                   IL+A+D
Subjt:  EEVVSILNSCHASPYGGHFGPTRTASK-------------------------------------------------------------------ILVAID

Query:  YVSKWVEAIATRTNDARTVLKFLHKNIFTRFGTPRATISDEGSHFCNKLFESMMQKYKDNHKIATAYHPQTNGLAELSNREIKQILEKTVKTNRKDWALK
        YVSKWVEAIAT TNDA+ VLKFLHKNIFTRFGTPRA ISDEG+HFCNKLF++++ KY   HKIA AYHPQTNG AE+SNREIK ILEKTV TNRKDWA K
Subjt:  YVSKWVEAIATRTNDARTVLKFLHKNIFTRFGTPRATISDEGSHFCNKLFESMMQKYKDNHKIATAYHPQTNGLAELSNREIKQILEKTVKTNRKDWALK

Query:  LDDALWAYRTAFKNPIDTSPYRLVFGKACHLPVELEHRAYWAIKKLNMDFEKA-----------------------------------------------
        LDDALWAYRTAFK PI  SPYRLVFGKACHLPVELEH+AYWA+KK N+D + A                                               
Subjt:  LDDALWAYRTAFKNPIDTSPYRLVFGKACHLPVELEHRAYWAIKKLNMDFEKA-----------------------------------------------

Query:  ------------GKLRTRWSRPFIIVKVSPHGAVELQNNDGTTFKVNGQRFKHYIGDEERQFENLAFI
                    GKLR+RW+ P+ I KVS  GA++L++  G  F+VNGQR KHY G++  +  N AFI
Subjt:  ------------GKLRTRWSRPFIIVKVSPHGAVELQNNDGTTFKVNGQRFKHYIGDEERQFENLAFI

TrEMBL top hits
A0A540MQU7 Integrase catalytic domain-containing protein
e value: 6.9e-150; % identity: 49.1
Query:  MCDASDYALGAVLGQRKDNMFRAIYYASRTLDNTQQKYTTTEKELLAVVFALDKFRPYLLDSKIVVHTDHAALKYLFVKKDSKPRLMRWILLLQEFDLEI
        MCDASDYA+GAVLGQRK+ +   IYYASRTL++ Q  YTTTEKE+LAV+FAL+KFR YL+ SK++V+TDH ALKYL  KKD+KPRL+RW+LLLQEFDL+I
Subjt:  MCDASDYALGAVLGQRKDNMFRAIYYASRTLDNTQQKYTTTEKELLAVVFALDKFRPYLLDSKIVVHTDHAALKYLFVKKDSKPRLMRWILLLQEFDLEI

Query:  KDRKGCENVVANHLSRIEN--GDATSWPPIVETFPDEQLYQVKDSLPWFTNIVNYLAGGHLPPDMNSQQKKRFLHNVKSYHWEDPLLYKVCADNMIRKCV
        +D+KG ENVVA+HLSR+ +   +     P+ E+FPDEQL+ + D +PW+ +I NYL  G LPPD+++Q +K+FL  VK Y W+DP LYK C+D +IR+CV
Subjt:  KDRKGCENVVANHLSRIEN--GDATSWPPIVETFPDEQLYQVKDSLPWFTNIVNYLAGGHLPPDMNSQQKKRFLHNVKSYHWEDPLLYKVCADNMIRKCV

Query:  PQEEVVSILNSCHASPYGGHFGPTRTASK-------------------------------------------------------------------ILVA
        P  E  SIL  CH+   GGHFGP++TA+K                                                                   ILVA
Subjt:  PQEEVVSILNSCHASPYGGHFGPTRTASK-------------------------------------------------------------------ILVA

Query:  IDYVSKWVEAIATRTNDARTVLKFLHKNIFTRFGTPRATISDEGSHFCNKLFESMMQKYKDNHKIATAYHPQTNGLAELSNREIKQILEKTVKTNRKDWA
        +DYVSKWVEAIAT TND + VL+FL   IF RFGTPR  ISD G HF NK F ++M KY  NH++AT YHPQT+G  E+SNREIK+ILE TV  +RKDW+
Subjt:  IDYVSKWVEAIATRTNDARTVLKFLHKNIFTRFGTPRATISDEGSHFCNKLFESMMQKYKDNHKIATAYHPQTNGLAELSNREIKQILEKTVKTNRKDWA

Query:  LKLDDALWAYRTAFKNPIDTSPYRLVFGKACHLPVELEHRAYWAIKKLNMDFEKA---------------------------------------------
        LKL DALWAYRTA+K PI  SP+RLV+GKACH PVELEHRAYWAIK+LN +++ A                                             
Subjt:  LKLDDALWAYRTAFKNPIDTSPYRLVFGKACHLPVELEHRAYWAIKKLNMDFEKA---------------------------------------------

Query:  --------------GKLRTRWSRPFIIVKVSPHGAVELQN-NDGTTFKVNGQRFKHYI
                      GKLR++W  PF + +V PHGA+E++N   G  FKVNGQR KHY+
Subjt:  --------------GKLRTRWSRPFIIVKVSPHGAVELQN-NDGTTFKVNGQRFKHYI

A0A540NGH5 Integrase catalytic domain-containing protein
e value: 4.1e-150; % identity: 48.25
Query:  MCDASDYALGAVLGQRKDNMFRAIYYASRTLDNTQQKYTTTEKELLAVVFALDKFRPYLLDSKIVVHTDHAALKYLFVKKDSKPRLMRWILLLQEFDLEI
        MCDASDYA+GAVLGQRK+ +   IYYASRTL++ Q  YTTT+KE+LAV+FAL+KFR YL+ SK++V+TDH ALKYL  KKD+KPRL+RW+LLLQEFDL+I
Subjt:  MCDASDYALGAVLGQRKDNMFRAIYYASRTLDNTQQKYTTTEKELLAVVFALDKFRPYLLDSKIVVHTDHAALKYLFVKKDSKPRLMRWILLLQEFDLEI

Query:  KDRKGCENVVANHLSRIEN--GDATSWPPIVETFPDEQLYQVKDSLPWFTNIVNYLAGGHLPPDMNSQQKKRFLHNVKSYHWEDPLLYKVCADNMIRKCV
        +D+KG ENVVA+HLSR+ +   +     P+ E+FPDEQL+ + D +PW+ +I NYL  G LPPD+++Q +K+FL  VK Y W+DP LYK C+D +IR+CV
Subjt:  KDRKGCENVVANHLSRIEN--GDATSWPPIVETFPDEQLYQVKDSLPWFTNIVNYLAGGHLPPDMNSQQKKRFLHNVKSYHWEDPLLYKVCADNMIRKCV

Query:  PQEEVVSILNSCHASPYGGHFGPTRTASK-------------------------------------------------------------------ILVA
        P  E  SIL  CH+   GGHFGP++TA+K                                                                   ILVA
Subjt:  PQEEVVSILNSCHASPYGGHFGPTRTASK-------------------------------------------------------------------ILVA

Query:  IDYVSKWVEAIATRTNDARTVLKFLHKNIFTRFGTPRATISDEGSHFCNKLFESMMQKYKDNHKIATAYHPQTNGLAELSNREIKQILEKTVKTNRKDWA
        +DYVSKWVEAIAT TND + VL+FL   IF RFGTPR  ISD G HF NK F ++M KY  NH++AT YHPQT+G  E+SNREIK+ILE TV  +RKDW+
Subjt:  IDYVSKWVEAIATRTNDARTVLKFLHKNIFTRFGTPRATISDEGSHFCNKLFESMMQKYKDNHKIATAYHPQTNGLAELSNREIKQILEKTVKTNRKDWA

Query:  LKLDDALWAYRTAFKNPIDTSPYRLVFGKACHLPVELEHRAYWAIKKLNMDFEKA---------------------------------------------
        LKL DALWAYRTA+K PI  SP+RLV+GKACHLPVELEHRAYWAIK+LN +++ A                                             
Subjt:  LKLDDALWAYRTAFKNPIDTSPYRLVFGKACHLPVELEHRAYWAIKKLNMDFEKA---------------------------------------------

Query:  --------------GKLRTRWSRPFIIVKVSPHGAVELQN-NDGTTFKVNGQRFKHYIGDEERQFENLAF
                      GKLR++W  PF + +V PHGA+E++N   G  FKVNGQR KHY+     + + +A+
Subjt:  --------------GKLRTRWSRPFIIVKVSPHGAVELQN-NDGTTFKVNGQRFKHYIGDEERQFENLAF

A0A6P8CB75 LOW QUALITY PROTEIN: uncharacterized protein LOC116193746
e value: 1.9e-152; % identity: 50.9
Query:  MCDASDYALGAVLGQRKDNMFRAIYYASRTLDNTQQKYTTTEKELLAVVFALDKFRPYLLDSKIVVHTDHAALKYLFVKKDSKPRLMRWILLLQEFDLEI
        MCDASDYA+GAVLG R+  +F AIYYASRTL+  Q+ Y TTEK+LLAV+FA DKFRPYL+ SKI+V+TDHAALKYLF K D+KPRL+RWILLLQEFDLEI
Subjt:  MCDASDYALGAVLGQRKDNMFRAIYYASRTLDNTQQKYTTTEKELLAVVFALDKFRPYLLDSKIVVHTDHAALKYLFVKKDSKPRLMRWILLLQEFDLEI

Query:  KDRKGCENVVANHLSRIENGDATSWPPIVETFPDEQLYQVK-DSLPWFTNIVNYLAGGHLPPDMNSQQKKRFLHNVKSYHWEDPLLYKVCADNMIRKCVP
        +D KG ENVVA+HLSR+E+    S  PI E FPDEQL+  +   LPW+ +IVNY+     P  ++SQQKK+FLH+VK Y W++P L+K CAD +IR+CVP
Subjt:  KDRKGCENVVANHLSRIENGDATSWPPIVETFPDEQLYQVK-DSLPWFTNIVNYLAGGHLPPDMNSQQKKRFLHNVKSYHWEDPLLYKVCADNMIRKCVP

Query:  QEEVVSILNSCHASPYGGHFGPTRTASKI-------------------------------------------------------------------LVAI
        + E +SI+  CH+   GGHFG  RTA+KI                                                                   LVA+
Subjt:  QEEVVSILNSCHASPYGGHFGPTRTASKI-------------------------------------------------------------------LVAI

Query:  DYVSKWVEAIATRTNDARTVLKFLHKNIFTRFGTPRATISDEGSHFCNKLFESMMQKYKDNHKIATAYHPQTNGLAELSNREIKQILEKTVKTNRKDWAL
        DYVSKWVEA+A ++NDAR V++FL KNIF+RFG PRA ISD GSHFCN  FE ++ KY   HKIAT YHPQT G  E+SNR+IK+ILEKTV  +RKDW+L
Subjt:  DYVSKWVEAIATRTNDARTVLKFLHKNIFTRFGTPRATISDEGSHFCNKLFESMMQKYKDNHKIATAYHPQTNGLAELSNREIKQILEKTVKTNRKDWAL

Query:  KLDDALWAYRTAFKNPIDTSPYRLVFGKACHLPVELEHRAYWAIKKLNMDFEKA----------------------------------------------
        KLDDALWAYRTAFK PI  SPY++V+GK+CHLPVELEH+AYWAIK LN D + A                                              
Subjt:  KLDDALWAYRTAFKNPIDTSPYRLVFGKACHLPVELEHRAYWAIKKLNMDFEKA----------------------------------------------

Query:  ------------GKLRTRWSRPFIIVKVSPHGAVELQNNDGTTFKVNGQRFKHYIGDE
                    GKL++RWS PF+I  V P+GAVEL++ D  TFKVNG   KHY   E
Subjt:  ------------GKLRTRWSRPFIIVKVSPHGAVELQNNDGTTFKVNGQRFKHYIGDE

A0A6P8CBX2 Reverse transcriptase
e value: 2.1e-154; % identity: 51.34
Query:  MCDASDYALGAVLGQRKDNMFRAIYYASRTLDNTQQKYTTTEKELLAVVFALDKFRPYLLDSKIVVHTDHAALKYLFVKKDSKPRLMRWILLLQEFDLEI
        MCDASDYA+GAVLGQR+  +F AIYYASRTL+  Q+ Y TTEKELLAV+FA DKFRPYL+ SKI+V+TDHAALKYLF K D+KPRL+RWILLLQEFDLEI
Subjt:  MCDASDYALGAVLGQRKDNMFRAIYYASRTLDNTQQKYTTTEKELLAVVFALDKFRPYLLDSKIVVHTDHAALKYLFVKKDSKPRLMRWILLLQEFDLEI

Query:  KDRKGCENVVANHLSRIENGDATSWPPIVETFPDEQLYQVK-DSLPWFTNIVNYLAGGHLPPDMNSQQKKRFLHNVKSYHWEDPLLYKVCADNMIRKCVP
        +D KG ENVVA+HLSR+E+    S  PI E FPDEQL+  +   LPW+ +IVNY+     P  ++SQQKK+FLH+VK Y W++P L+K CAD +IR+CVP
Subjt:  KDRKGCENVVANHLSRIENGDATSWPPIVETFPDEQLYQVK-DSLPWFTNIVNYLAGGHLPPDMNSQQKKRFLHNVKSYHWEDPLLYKVCADNMIRKCVP

Query:  QEEVVSILNSCHASPYGGHFGPTRTASK-------------------------------------------------------------------ILVAI
        + E +SI+  CH+   GGHFG  RTA+K                                                                   ILVA+
Subjt:  QEEVVSILNSCHASPYGGHFGPTRTASK-------------------------------------------------------------------ILVAI

Query:  DYVSKWVEAIATRTNDARTVLKFLHKNIFTRFGTPRATISDEGSHFCNKLFESMMQKYKDNHKIATAYHPQTNGLAELSNREIKQILEKTVKTNRKDWAL
        DYVSKWVEA+A ++NDAR V++FL KNIF+RFG PRA ISD GSHFCN+ FE ++ KY   HKIAT YHPQT G  E+SNREIK+ILEKTV  +RKDW+L
Subjt:  DYVSKWVEAIATRTNDARTVLKFLHKNIFTRFGTPRATISDEGSHFCNKLFESMMQKYKDNHKIATAYHPQTNGLAELSNREIKQILEKTVKTNRKDWAL

Query:  KLDDALWAYRTAFKNPIDTSPYRLVFGKACHLPVELEHRAYWAIKKLNMDFEKA----------------------------------------------
        KLDDALWAYRTAFK PI  SPY++V+GK+CHLPVELEH+AYWAIK LN D + A                                              
Subjt:  KLDDALWAYRTAFKNPIDTSPYRLVFGKACHLPVELEHRAYWAIKKLNMDFEKA----------------------------------------------

Query:  -------------GKLRTRWSRPFIIVKVSPHGAVELQNNDGTTFKVNGQRFKHYIGDE
                     GKL++RWS PF+I  V P+GAVEL++ D  TFKVNG   KHY   E
Subjt:  -------------GKLRTRWSRPFIIVKVSPHGAVELQNNDGTTFKVNGQRFKHYIGDE

A0A6P8DLJ8 Reverse transcriptase
e value: 8.7e-153; % identity: 50.98
Query:  MCDASDYALGAVLGQRKDNMFRAIYYASRTLDNTQQKYTTTEKELLAVVFALDKFRPYLLDSKIVVHTDHAALKYLFVKKDSKPRLMRWILLLQEFDLEI
        MC ASDYA+GAVLGQR+  +F AIYYASRTL+  Q+ Y TTEKELLAV+FA DKFRPYL+ SKI+V+TDHAALKYLF K D+KPRL+RWILLLQEFDLEI
Subjt:  MCDASDYALGAVLGQRKDNMFRAIYYASRTLDNTQQKYTTTEKELLAVVFALDKFRPYLLDSKIVVHTDHAALKYLFVKKDSKPRLMRWILLLQEFDLEI

Query:  KDRKGCENVVANHLSRIENGDATSWPPIVETFPDEQLYQVK-DSLPWFTNIVNYLAGGHLPPDMNSQQKKRFLHNVKSYHWEDPLLYKVCADNMIRKCVP
        +D KG ENVVA+HLSR+E+    S  PI E FPDEQL+  +   LPW+ +IVNY+     P  ++SQQKK+FLH+VK Y W++P L+K CAD +IR+CVP
Subjt:  KDRKGCENVVANHLSRIENGDATSWPPIVETFPDEQLYQVK-DSLPWFTNIVNYLAGGHLPPDMNSQQKKRFLHNVKSYHWEDPLLYKVCADNMIRKCVP

Query:  QEEVVSILNSCHASPYGGHFGPTRTASK-------------------------------------------------------------------ILVAI
        + E +SI+  CH+   GGHFG  RTA+K                                                                   ILVA+
Subjt:  QEEVVSILNSCHASPYGGHFGPTRTASK-------------------------------------------------------------------ILVAI

Query:  DYVSKWVEAIATRTNDARTVLKFLHKNIFTRFGTPRATISDEGSHFCNKLFESMMQKYKDNHKIATAYHPQTNGLAELSNREIKQILEKTVKTNRKDWAL
        DYVSKWVEA+A ++NDAR V++FL KNIF+R G PRA ISD GSHFCN+ FE ++ KY   HKIAT YHPQT G  E+SNREIK+ILEKTV  +RKDW+L
Subjt:  DYVSKWVEAIATRTNDARTVLKFLHKNIFTRFGTPRATISDEGSHFCNKLFESMMQKYKDNHKIATAYHPQTNGLAELSNREIKQILEKTVKTNRKDWAL

Query:  KLDDALWAYRTAFKNPIDTSPYRLVFGKACHLPVELEHRAYWAIKKLNMDFEKA----------------------------------------------
        KLDDALWAYRTAFK PI  SPY++V+GK+CHLPVELEH+AYWAIK LN D + A                                              
Subjt:  KLDDALWAYRTAFKNPIDTSPYRLVFGKACHLPVELEHRAYWAIKKLNMDFEKA----------------------------------------------

Query:  -------------GKLRTRWSRPFIIVKVSPHGAVELQNNDGTTFKVNGQRFKHYIGDE
                     GKL++RWS PF+I  V P+GAVEL++ D  TFKVNG   KHY   E
Subjt:  -------------GKLRTRWSRPFIIVKVSPHGAVELQNNDGTTFKVNGQRFKHYIGDE

SwissProt top hits
P0CT34 Transposon Tf2-1 polyprotein
e value: 8.2e-23; % identity: 24.29
Query:  DASDYALGAVLGQR-KDNMFRAIYYASRTLDNTQQKYTTTEKELLAVVFALDKFRPYLLDS--KIVVHTDHAALKYLFVKKDSKP---RLMRWILLLQEF
        DASD A+GAVL Q+  D+ +  + Y S  +   Q  Y+ ++KE+LA++ +L  +R YL  +     + TDH  L    +  +S+P   RL RW L LQ+F
Subjt:  DASDYALGAVLGQR-KDNMFRAIYYASRTLDNTQQKYTTTEKELLAVVFALDKFRPYLLDS--KIVVHTDHAALKYLFVKKDSKP---RLMRWILLLQEF

Query:  DLEIKDRKGCENVVANHLSRIENGDATSWPPIVE----------TFPDEQLYQVKDSLPWFTNIVNYLAGGHLPPDMNSQQKKRFLHN------------
        + EI  R G  N +A+ LSRI + +    P   E          +  D+   QV       T ++N L       + N Q K   L N            
Subjt:  DLEIKDRKGCENVVANHLSRIENGDATSWPPIVE----------TFPDEQLYQVKDSLPWFTNIVNYLAGGHLPPDMNSQQKKRFLHN------------

Query:  -----VKSYHWEDPLLY---KVCADNMIRKCVPQ------EEVVSILNSCHASPYGGH--FGPTR-----------------TA-------SKILVAIDY
             +K YH E  L++   ++  + ++R+   +      +E V   ++C  +    H  +GP +                 TA       + + V +D 
Subjt:  -----VKSYHWEDPLLY---KVCADNMIRKCVPQ------EEVVSILNSCHASPYGGH--FGPTR-----------------TA-------SKILVAIDY

Query:  VSKWVEAI-ATRTNDARTVLKFLHKNIFTRFGTPRATISDEGSHFCNKLFESMMQKYKDNHKIATAYHPQTNGLAELSNREIKQILEKTVKTNRKDWALK
         SK    +  T++  A    +   + +   FG P+  I+D    F ++ ++    KY    K +  Y PQT+G  E +N+ ++++L     T+   W   
Subjt:  VSKWVEAI-ATRTNDARTVLKFLHKNIFTRFGTPRATISDEGSHFCNKLFESMMQKYKDNHKIATAYHPQTNGLAELSNREIKQILEKTVKTNRKDWALK

Query:  LDDALWAYRTAFKNPIDTSPYRLV
        +     +Y  A  +    +P+ +V
Subjt:  LDDALWAYRTAFKNPIDTSPYRLV

P0CT35 Transposon Tf2-2 polyprotein
e value: 8.2e-23; % identity: 24.29
Query:  DASDYALGAVLGQR-KDNMFRAIYYASRTLDNTQQKYTTTEKELLAVVFALDKFRPYLLDS--KIVVHTDHAALKYLFVKKDSKP---RLMRWILLLQEF
        DASD A+GAVL Q+  D+ +  + Y S  +   Q  Y+ ++KE+LA++ +L  +R YL  +     + TDH  L    +  +S+P   RL RW L LQ+F
Subjt:  DASDYALGAVLGQR-KDNMFRAIYYASRTLDNTQQKYTTTEKELLAVVFALDKFRPYLLDS--KIVVHTDHAALKYLFVKKDSKP---RLMRWILLLQEF

Query:  DLEIKDRKGCENVVANHLSRIENGDATSWPPIVE----------TFPDEQLYQVKDSLPWFTNIVNYLAGGHLPPDMNSQQKKRFLHN------------
        + EI  R G  N +A+ LSRI + +    P   E          +  D+   QV       T ++N L       + N Q K   L N            
Subjt:  DLEIKDRKGCENVVANHLSRIENGDATSWPPIVE----------TFPDEQLYQVKDSLPWFTNIVNYLAGGHLPPDMNSQQKKRFLHN------------

Query:  -----VKSYHWEDPLLY---KVCADNMIRKCVPQ------EEVVSILNSCHASPYGGH--FGPTR-----------------TA-------SKILVAIDY
             +K YH E  L++   ++  + ++R+   +      +E V   ++C  +    H  +GP +                 TA       + + V +D 
Subjt:  -----VKSYHWEDPLLY---KVCADNMIRKCVPQ------EEVVSILNSCHASPYGGH--FGPTR-----------------TA-------SKILVAIDY

Query:  VSKWVEAI-ATRTNDARTVLKFLHKNIFTRFGTPRATISDEGSHFCNKLFESMMQKYKDNHKIATAYHPQTNGLAELSNREIKQILEKTVKTNRKDWALK
         SK    +  T++  A    +   + +   FG P+  I+D    F ++ ++    KY    K +  Y PQT+G  E +N+ ++++L     T+   W   
Subjt:  VSKWVEAI-ATRTNDARTVLKFLHKNIFTRFGTPRATISDEGSHFCNKLFESMMQKYKDNHKIATAYHPQTNGLAELSNREIKQILEKTVKTNRKDWALK

Query:  LDDALWAYRTAFKNPIDTSPYRLV
        +     +Y  A  +    +P+ +V
Subjt:  LDDALWAYRTAFKNPIDTSPYRLV

P0CT36 Transposon Tf2-3 polyprotein
e value: 8.2e-23; % identity: 24.29
Query:  DASDYALGAVLGQR-KDNMFRAIYYASRTLDNTQQKYTTTEKELLAVVFALDKFRPYLLDS--KIVVHTDHAALKYLFVKKDSKP---RLMRWILLLQEF
        DASD A+GAVL Q+  D+ +  + Y S  +   Q  Y+ ++KE+LA++ +L  +R YL  +     + TDH  L    +  +S+P   RL RW L LQ+F
Subjt:  DASDYALGAVLGQR-KDNMFRAIYYASRTLDNTQQKYTTTEKELLAVVFALDKFRPYLLDS--KIVVHTDHAALKYLFVKKDSKP---RLMRWILLLQEF

Query:  DLEIKDRKGCENVVANHLSRIENGDATSWPPIVE----------TFPDEQLYQVKDSLPWFTNIVNYLAGGHLPPDMNSQQKKRFLHN------------
        + EI  R G  N +A+ LSRI + +    P   E          +  D+   QV       T ++N L       + N Q K   L N            
Subjt:  DLEIKDRKGCENVVANHLSRIENGDATSWPPIVE----------TFPDEQLYQVKDSLPWFTNIVNYLAGGHLPPDMNSQQKKRFLHN------------

Query:  -----VKSYHWEDPLLY---KVCADNMIRKCVPQ------EEVVSILNSCHASPYGGH--FGPTR-----------------TA-------SKILVAIDY
             +K YH E  L++   ++  + ++R+   +      +E V   ++C  +    H  +GP +                 TA       + + V +D 
Subjt:  -----VKSYHWEDPLLY---KVCADNMIRKCVPQ------EEVVSILNSCHASPYGGH--FGPTR-----------------TA-------SKILVAIDY

Query:  VSKWVEAI-ATRTNDARTVLKFLHKNIFTRFGTPRATISDEGSHFCNKLFESMMQKYKDNHKIATAYHPQTNGLAELSNREIKQILEKTVKTNRKDWALK
         SK    +  T++  A    +   + +   FG P+  I+D    F ++ ++    KY    K +  Y PQT+G  E +N+ ++++L     T+   W   
Subjt:  VSKWVEAI-ATRTNDARTVLKFLHKNIFTRFGTPRATISDEGSHFCNKLFESMMQKYKDNHKIATAYHPQTNGLAELSNREIKQILEKTVKTNRKDWALK

Query:  LDDALWAYRTAFKNPIDTSPYRLV
        +     +Y  A  +    +P+ +V
Subjt:  LDDALWAYRTAFKNPIDTSPYRLV

P0CT41 Transposon Tf2-12 polyprotein
e value: 8.2e-23; % identity: 24.29
Query:  DASDYALGAVLGQR-KDNMFRAIYYASRTLDNTQQKYTTTEKELLAVVFALDKFRPYLLDS--KIVVHTDHAALKYLFVKKDSKP---RLMRWILLLQEF
        DASD A+GAVL Q+  D+ +  + Y S  +   Q  Y+ ++KE+LA++ +L  +R YL  +     + TDH  L    +  +S+P   RL RW L LQ+F
Subjt:  DASDYALGAVLGQR-KDNMFRAIYYASRTLDNTQQKYTTTEKELLAVVFALDKFRPYLLDS--KIVVHTDHAALKYLFVKKDSKP---RLMRWILLLQEF

Query:  DLEIKDRKGCENVVANHLSRIENGDATSWPPIVE----------TFPDEQLYQVKDSLPWFTNIVNYLAGGHLPPDMNSQQKKRFLHN------------
        + EI  R G  N +A+ LSRI + +    P   E          +  D+   QV       T ++N L       + N Q K   L N            
Subjt:  DLEIKDRKGCENVVANHLSRIENGDATSWPPIVE----------TFPDEQLYQVKDSLPWFTNIVNYLAGGHLPPDMNSQQKKRFLHN------------

Query:  -----VKSYHWEDPLLY---KVCADNMIRKCVPQ------EEVVSILNSCHASPYGGH--FGPTR-----------------TA-------SKILVAIDY
             +K YH E  L++   ++  + ++R+   +      +E V   ++C  +    H  +GP +                 TA       + + V +D 
Subjt:  -----VKSYHWEDPLLY---KVCADNMIRKCVPQ------EEVVSILNSCHASPYGGH--FGPTR-----------------TA-------SKILVAIDY

Query:  VSKWVEAI-ATRTNDARTVLKFLHKNIFTRFGTPRATISDEGSHFCNKLFESMMQKYKDNHKIATAYHPQTNGLAELSNREIKQILEKTVKTNRKDWALK
         SK    +  T++  A    +   + +   FG P+  I+D    F ++ ++    KY    K +  Y PQT+G  E +N+ ++++L     T+   W   
Subjt:  VSKWVEAI-ATRTNDARTVLKFLHKNIFTRFGTPRATISDEGSHFCNKLFESMMQKYKDNHKIATAYHPQTNGLAELSNREIKQILEKTVKTNRKDWALK

Query:  LDDALWAYRTAFKNPIDTSPYRLV
        +     +Y  A  +    +P+ +V
Subjt:  LDDALWAYRTAFKNPIDTSPYRLV

Q9UR07 Transposon Tf2-11 polyprotein
e value: 8.2e-23; % identity: 24.29
Query:  DASDYALGAVLGQR-KDNMFRAIYYASRTLDNTQQKYTTTEKELLAVVFALDKFRPYLLDS--KIVVHTDHAALKYLFVKKDSKP---RLMRWILLLQEF
        DASD A+GAVL Q+  D+ +  + Y S  +   Q  Y+ ++KE+LA++ +L  +R YL  +     + TDH  L    +  +S+P   RL RW L LQ+F
Subjt:  DASDYALGAVLGQR-KDNMFRAIYYASRTLDNTQQKYTTTEKELLAVVFALDKFRPYLLDS--KIVVHTDHAALKYLFVKKDSKP---RLMRWILLLQEF

Query:  DLEIKDRKGCENVVANHLSRIENGDATSWPPIVE----------TFPDEQLYQVKDSLPWFTNIVNYLAGGHLPPDMNSQQKKRFLHN------------
        + EI  R G  N +A+ LSRI + +    P   E          +  D+   QV       T ++N L       + N Q K   L N            
Subjt:  DLEIKDRKGCENVVANHLSRIENGDATSWPPIVE----------TFPDEQLYQVKDSLPWFTNIVNYLAGGHLPPDMNSQQKKRFLHN------------

Query:  -----VKSYHWEDPLLY---KVCADNMIRKCVPQ------EEVVSILNSCHASPYGGH--FGPTR-----------------TA-------SKILVAIDY
             +K YH E  L++   ++  + ++R+   +      +E V   ++C  +    H  +GP +                 TA       + + V +D 
Subjt:  -----VKSYHWEDPLLY---KVCADNMIRKCVPQ------EEVVSILNSCHASPYGGH--FGPTR-----------------TA-------SKILVAIDY

Query:  VSKWVEAI-ATRTNDARTVLKFLHKNIFTRFGTPRATISDEGSHFCNKLFESMMQKYKDNHKIATAYHPQTNGLAELSNREIKQILEKTVKTNRKDWALK
         SK    +  T++  A    +   + +   FG P+  I+D    F ++ ++    KY    K +  Y PQT+G  E +N+ ++++L     T+   W   
Subjt:  VSKWVEAI-ATRTNDARTVLKFLHKNIFTRFGTPRATISDEGSHFCNKLFESMMQKYKDNHKIATAYHPQTNGLAELSNREIKQILEKTVKTNRKDWALK

Query:  LDDALWAYRTAFKNPIDTSPYRLV
        +     +Y  A  +    +P+ +V
Subjt:  LDDALWAYRTAFKNPIDTSPYRLV

Arabidopsis top hits
No hits found

Sequences
CDS sequence
ATGTGTGATGCTAGTGACTATGCTTTAGGAGCTGTTTTAGGCCAACGTAAAGATAACATGTTTAGGGCAATTTACTATGCTAGTAGGACTCTTGATAATACTCAACAGAA
ATACACAACTACTGAAAAAGAACTACTAGCTGTTGTGTTTGCCCTTGATAAATTTAGACCATATTTGCTTGACTCTAAAATAGTGGTGCATACTGACCATGCTGCTTTAA
AGTATTTGTTTGTTAAGAAAGATTCCAAACCCAGGCTAATGAGGTGGATATTATTATTGCAGGAATTTGACTTAGAAATCAAAGACAGGAAAGGATGTGAAAATGTGGTT
GCAAACCACTTGTCTAGAATTGAGAATGGGGATGCTACATCATGGCCCCCAATTGTTGAGACGTTCCCAGATGAACAACTGTATCAAGTAAAAGATAGTTTGCCTTGGTT
TACTAACATAGTTAACTATCTTGCAGGAGGACATTTGCCACCTGACATGAATTCTCAACAAAAGAAGAGATTCCTGCACAATGTTAAGTCTTACCATTGGGAGGACCCCC
TTCTCTACAAGGTTTGTGCTGACAACATGATACGGAAGTGCGTGCCTCAAGAGGAAGTGGTAAGTATTCTAAATTCATGTCATGCTTCCCCCTATGGAGGTCACTTTGGA
CCCACTAGAACTGCATCCAAGATTTTAGTTGCAATAGATTATGTATCTAAGTGGGTAGAAGCCATAGCCACTAGGACCAATGATGCTCGCACTGTTTTAAAATTCCTGCA
TAAAAACATTTTCACACGTTTTGGTACACCTAGAGCTACTATTAGTGATGAAGGTTCTCACTTTTGCAATAAATTGTTTGAATCCATGATGCAAAAATATAAGGATAATC
ATAAAATTGCTACAGCTTATCATCCTCAAACTAATGGCCTTGCTGAGTTATCTAATAGGGAAATCAAGCAAATTTTGGAAAAGACAGTTAAGACCAATAGGAAGGATTGG
GCCCTGAAGCTCGATGATGCACTATGGGCCTACCGCACAGCGTTCAAAAATCCAATTGACACTTCACCATACAGGTTGGTGTTTGGAAAGGCTTGTCACTTACCGGTAGA
GCTCGAGCATAGAGCTTATTGGGCCATCAAGAAGCTGAACATGGATTTTGAGAAGGCCGGAAAGCTTAGGACACGATGGTCGAGACCCTTTATCATTGTCAAGGTATCAC
CACATGGGGCCGTGGAATTGCAAAACAATGATGGAACAACCTTCAAAGTGAATGGTCAACGATTTAAGCACTACATCGGTGATGAAGAACGTCAATTTGAGAACCTGGCC
TTCATTGCATGA
mRNA sequence
ATGTGTGATGCTAGTGACTATGCTTTAGGAGCTGTTTTAGGCCAACGTAAAGATAACATGTTTAGGGCAATTTACTATGCTAGTAGGACTCTTGATAATACTCAACAGAA
ATACACAACTACTGAAAAAGAACTACTAGCTGTTGTGTTTGCCCTTGATAAATTTAGACCATATTTGCTTGACTCTAAAATAGTGGTGCATACTGACCATGCTGCTTTAA
AGTATTTGTTTGTTAAGAAAGATTCCAAACCCAGGCTAATGAGGTGGATATTATTATTGCAGGAATTTGACTTAGAAATCAAAGACAGGAAAGGATGTGAAAATGTGGTT
GCAAACCACTTGTCTAGAATTGAGAATGGGGATGCTACATCATGGCCCCCAATTGTTGAGACGTTCCCAGATGAACAACTGTATCAAGTAAAAGATAGTTTGCCTTGGTT
TACTAACATAGTTAACTATCTTGCAGGAGGACATTTGCCACCTGACATGAATTCTCAACAAAAGAAGAGATTCCTGCACAATGTTAAGTCTTACCATTGGGAGGACCCCC
TTCTCTACAAGGTTTGTGCTGACAACATGATACGGAAGTGCGTGCCTCAAGAGGAAGTGGTAAGTATTCTAAATTCATGTCATGCTTCCCCCTATGGAGGTCACTTTGGA
CCCACTAGAACTGCATCCAAGATTTTAGTTGCAATAGATTATGTATCTAAGTGGGTAGAAGCCATAGCCACTAGGACCAATGATGCTCGCACTGTTTTAAAATTCCTGCA
TAAAAACATTTTCACACGTTTTGGTACACCTAGAGCTACTATTAGTGATGAAGGTTCTCACTTTTGCAATAAATTGTTTGAATCCATGATGCAAAAATATAAGGATAATC
ATAAAATTGCTACAGCTTATCATCCTCAAACTAATGGCCTTGCTGAGTTATCTAATAGGGAAATCAAGCAAATTTTGGAAAAGACAGTTAAGACCAATAGGAAGGATTGG
GCCCTGAAGCTCGATGATGCACTATGGGCCTACCGCACAGCGTTCAAAAATCCAATTGACACTTCACCATACAGGTTGGTGTTTGGAAAGGCTTGTCACTTACCGGTAGA
GCTCGAGCATAGAGCTTATTGGGCCATCAAGAAGCTGAACATGGATTTTGAGAAGGCCGGAAAGCTTAGGACACGATGGTCGAGACCCTTTATCATTGTCAAGGTATCAC
CACATGGGGCCGTGGAATTGCAAAACAATGATGGAACAACCTTCAAAGTGAATGGTCAACGATTTAAGCACTACATCGGTGATGAAGAACGTCAATTTGAGAACCTGGCC
TTCATTGCATGA
Protein sequence
MCDASDYALGAVLGQRKDNMFRAIYYASRTLDNTQQKYTTTEKELLAVVFALDKFRPYLLDSKIVVHTDHAALKYLFVKKDSKPRLMRWILLLQEFDLEIKDRKGCENVV
ANHLSRIENGDATSWPPIVETFPDEQLYQVKDSLPWFTNIVNYLAGGHLPPDMNSQQKKRFLHNVKSYHWEDPLLYKVCADNMIRKCVPQEEVVSILNSCHASPYGGHFG
PTRTASKILVAIDYVSKWVEAIATRTNDARTVLKFLHKNIFTRFGTPRATISDEGSHFCNKLFESMMQKYKDNHKIATAYHPQTNGLAELSNREIKQILEKTVKTNRKDW
ALKLDDALWAYRTAFKNPIDTSPYRLVFGKACHLPVELEHRAYWAIKKLNMDFEKAGKLRTRWSRPFIIVKVSPHGAVELQNNDGTTFKVNGQRFKHYIGDEERQFENLA
FIA
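
The protein sequence above is presumably the conceptual translation of the CDS. A minimal sketch of how that relationship could be checked, assuming the standard genetic code (NCBI translation table 1) and reading frame 1; the helper `translate` is hypothetical, and only the first 18 nucleotides of the CDS are used here for brevity.

```python
from itertools import product

BASES = "TCAG"
# Amino acids of the standard genetic code, in codon order TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends the translation
            break
        protein.append(aa)
    return "".join(protein)

cds_start = "ATGTGTGATGCTAGTGAC"  # first 18 nt of the CDS listed above
print(translate(cds_start))  # prints MCDASD
```

Passing the full CDS string with line breaks removed should give a sequence that can be compared against the listed protein in the same way.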