; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lcy06g015960 (gene) of Sponge gourd (P93075) v1 genome

Gene IDLcy06g015960
OrganismLuffa cylindrica cv. P93075 (Sponge gourd (P93075) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationChr06:35801691..35803719
RNA-Seq ExpressionLcy06g015960
SyntenyLcy06g015960
Gene Ontology termsNA
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_023878301.1 uncharacterized protein LOC111990748 [Quercus suber]2.5e-13953.83Show/hide
Query:  INRTYIVLIPKIQDPKEMKDFRSISLCTVLYKIIAKVIANRLKGVLNTIISPSQSAFVPGRLITDNAIIGFECIHAVRSKRKGKNGTVALELDMSKTYDR
        +N+T I LIPK  +PK M DFR ISLC V+YK+I+K++ANRLK +L  IIS +QSAF   RLITDN ++ FE +H +  K  GK G +A++LDMSK +DR
Subjt:  INRTYIVLIPKIQDPKEMKDFRSISLCTVLYKIIAKVIANRLKGVLNTIISPSQSAFVPGRLITDNAIIGFECIHAVRSKRKGKNGTVALELDMSKTYDR

Query:  VEWIFIRKVMEKMGFCSRWINLIMQCVESVSFQVLLNGVPGTEFKPNQGLRQGDPLSPYLFLICAEGLSGLLNQAAIRKEVIGLSINKYCPIITHLFYAD
        VEW FI KVME+MGFC+RW +L+MQC+ SVS+ +L+NGV      P++GLRQGDPLSP LFL+CAEGLS L+NQAA  K + G+SIN+ CP +THLF+AD
Subjt:  VEWIFIRKVMEKMGFCSRWINLIMQCVESVSFQVLLNGVPGTEFKPNQGLRQGDPLSPYLFLICAEGLSGLLNQAAIRKEVIGLSINKYCPIITHLFYAD

Query:  DCLLFFKASNKDCRYIKHLLGVYERASGQTINFEKSNFMVSPNTNDDLVREIQTLLQVKHSLNLGQYLGLPSQNARSKKEIFNNIKDRVWKVLQGWKGRF
        D +LF KA+ ++C  ++ +LG YE ASGQ IN +KS+   SPNT  +   EI  +L    +    +YLGLPS   RSK ++F  +K++V   L GWKG+ 
Subjt:  DCLLFFKASNKDCRYIKHLLGVYERASGQTINFEKSNFMVSPNTNDDLVREIQTLLQVKHSLNLGQYLGLPSQNARSKKEIFNNIKDRVWKVLQGWKGRF

Query:  VSAAGRETLVKSVAQAIPNYAMSCFKFPISLCNDLKYICARFWWGSGDQGRKIHWQSWKRLCINKAHGGMRFRDITIFNQAMLAKQSWRIICNPDSLVAK
        +S  G+E L+K+VAQAIP Y MSCF  P  LC+D++ +   FWWG  +Q  K+ W SWKR+C +KA GG+ FR++  FN AMLAKQ+WRI+ NP+SLV +
Subjt:  VSAAGRETLVKSVAQAIPNYAMSCFKFPISLCNDLKYICARFWWGSGDQGRKIHWQSWKRLCINKAHGGMRFRDITIFNQAMLAKQSWRIICNPDSLVAK

Query:  VLRGRYFKTGQFLKAGLGANPSYTWWSIVWGRELFIKGYRWRIG
        VL+ RYF TG  L A LG++PSY+W SI    E+  +G RWR+G
Subjt:  VLRGRYFKTGQFLKAGLGANPSYTWWSIVWGRELFIKGYRWRIG

XP_023881891.1 uncharacterized protein LOC111994244 [Quercus suber]1.1e-13652.8Show/hide
Query:  MGHINRTYIVLIPKIQDPKEMKDFRSISLCTVLYKIIAKVIANRLKGVLNTIISPSQSAFVPGRLITDNAIIGFECIHAVRSKRKGKNGTVALELDMSKT
        M  IN+T I L+PKI++P +M DFR ISLC V+YK+I+KV+ANRLK +L  IIS +QSAF+ GRLITDN ++ FE +H +  K++GK G  A++LDMSK 
Subjt:  MGHINRTYIVLIPKIQDPKEMKDFRSISLCTVLYKIIAKVIANRLKGVLNTIISPSQSAFVPGRLITDNAIIGFECIHAVRSKRKGKNGTVALELDMSKT

Query:  YDRVEWIFIRKVMEKMGFCSRWINLIMQCVESVSFQVLLNGVPGTEFKPNQGLRQGDPLSPYLFLICAEGLSGLLNQAAIRKEVIGLSINKYCPIITHLF
        YDRVEW FI++VMEKMGF  +WI L+M C+ SVS+ +L+NG       P +GLRQGDP+SPY+FL+CA+G S LLN  A +  + G+SI + CP ITHLF
Subjt:  YDRVEWIFIRKVMEKMGFCSRWINLIMQCVESVSFQVLLNGVPGTEFKPNQGLRQGDPLSPYLFLICAEGLSGLLNQAAIRKEVIGLSINKYCPIITHLF

Query:  YADDCLLFFKASNKDCRYIKHLLGVYERASGQTINFEKSNFMVSPNTNDDLVREIQTLLQVKHSLNLGQYLGLPSQNARSKKEIFNNIKDRVWKVLQGWK
        +ADD LLF KA++++C+ +  +L +YE ASGQ IN +KS+   S NT D+   E+  +L         +YLGLPS   +SK EIF  +K+RV + L GWK
Subjt:  YADDCLLFFKASNKDCRYIKHLLGVYERASGQTINFEKSNFMVSPNTNDDLVREIQTLLQVKHSLNLGQYLGLPSQNARSKKEIFNNIKDRVWKVLQGWK

Query:  GRFVSAAGRETLVKSVAQAIPNYAMSCFKFPISLCNDLKYICARFWWGSGDQGRKIHWQSWKRLCINKAHGGMRFRDITIFNQAMLAKQSWRIICNPDSL
         + +S  GRE L+K+VAQAIP Y MSCF+ P +LC +++ +  RFWWG   Q  KI W SWK+LC  K +GGM FR++  FN AMLAKQ WR+I NP+SL
Subjt:  GRFVSAAGRETLVKSVAQAIPNYAMSCFKFPISLCNDLKYICARFWWGSGDQGRKIHWQSWKRLCINKAHGGMRFRDITIFNQAMLAKQSWRIICNPDSL

Query:  VAKVLRGRYFKTGQFLKAGLGANPSYTWWSIVWGRELFIKGYRWRIG
        VA++ + RY+  G   +A LGA+PSYTW SI  G E+  +G RWR+G
Subjt:  VAKVLRGRYFKTGQFLKAGLGANPSYTWWSIVWGRELFIKGYRWRIG

XP_023901742.1 uncharacterized protein LOC112013579 [Quercus suber]4.8e-13551.9Show/hide
Query:  MGHINRTYIVLIPKIQDPKEMKDFRSISLCTVLYKIIAKVIANRLKGVLNTIISPSQSAFVPGRLITDNAIIGFECIHAVRSKRKGKNGTVALELDMSKT
        M  IN+T I LIPK   P  M +FR ISLC   YKII+KV+ANR K +L  IIS +QSAF P RLITDN ++ FE +H +  K +GK   ++++LDMSK 
Subjt:  MGHINRTYIVLIPKIQDPKEMKDFRSISLCTVLYKIIAKVIANRLKGVLNTIISPSQSAFVPGRLITDNAIIGFECIHAVRSKRKGKNGTVALELDMSKT

Query:  YDRVEWIFIRKVMEKMGFCSRWINLIMQCVESVSFQVLLNGVPGTEFKPNQGLRQGDPLSPYLFLICAEGLSGLLNQAAIRKEVIGLSINKYCPIITHLF
        +DRVEW FI+ VMEK+GF  +WI+LIM CV SVS+ VL+NG       P++G+RQGDPLSP LFL+CAEGLS L+++AA  +++ G+SI + CP+ITHLF
Subjt:  YDRVEWIFIRKVMEKMGFCSRWINLIMQCVESVSFQVLLNGVPGTEFKPNQGLRQGDPLSPYLFLICAEGLSGLLNQAAIRKEVIGLSINKYCPIITHLF

Query:  YADDCLLFFKASNKDCRYIKHLLGVYERASGQTINFEKSNFMVSPNTNDDLVREIQTLLQVKHSLNLGQYLGLPSQNARSKKEIFNNIKDRVWKVLQGWK
        +ADD LLF KA  ++C  +  +L  YE ASGQ IN +KS+   SPNT+ +L   I  +L         +YLGLPS   +SK ++F  +KDRV K L GWK
Subjt:  YADDCLLFFKASNKDCRYIKHLLGVYERASGQTINFEKSNFMVSPNTNDDLVREIQTLLQVKHSLNLGQYLGLPSQNARSKKEIFNNIKDRVWKVLQGWK

Query:  GRFVSAAGRETLVKSVAQAIPNYAMSCFKFPISLCNDLKYICARFWWGSGDQGRKIHWQSWKRLCINKAHGGMRFRDITIFNQAMLAKQSWRIICNPDSL
        G+ +S  GRE L+K+VAQA+P Y MSCF+ P +LC DL+ +   FWWG  D+  KI W SW+++C +K HGGM FR+I  FN AMLAKQ WRI+ NP+SL
Subjt:  GRFVSAAGRETLVKSVAQAIPNYAMSCFKFPISLCNDLKYICARFWWGSGDQGRKIHWQSWKRLCINKAHGGMRFRDITIFNQAMLAKQSWRIICNPDSL

Query:  VAKVLRGRYFKTGQFLKAGLGANPSYTWWSIVWGRELFIKGYRWRIG
        +A+V + +YF     L +  G+NPSY W SI    ++  KG RWR+G
Subjt:  VAKVLRGRYFKTGQFLKAGLGANPSYTWWSIVWGRELFIKGYRWRIG

XP_030939975.1 uncharacterized protein LOC115964883 [Quercus lobata]3.3e-13652.48Show/hide
Query:  INRTYIVLIPKIQDPKEMKDFRSISLCTVLYKIIAKVIANRLKGVLNTIISPSQSAFVPGRLITDNAIIGFECIHAVRSKRKGKNGTVALELDMSKTYDR
        +N TYI LIPKI+ P++  DFR ISLC VLYKI++K IANRLK +L  ++S SQSAF+  RLI+DN ++ FE +H +++K KGK+G +A++LDMSK YDR
Subjt:  INRTYIVLIPKIQDPKEMKDFRSISLCTVLYKIIAKVIANRLKGVLNTIISPSQSAFVPGRLITDNAIIGFECIHAVRSKRKGKNGTVALELDMSKTYDR

Query:  VEWIFIRKVMEKMGFCSRWINLIMQCVESVSFQVLLNGVPGTEFKPNQGLRQGDPLSPYLFLICAEGLSGLLNQAAIRKEVIGLSINKYCPIITHLFYAD
        VEW F+ KVMEK+GF +RWI L+  C+ SVSF VL+NG P   F PN+GLRQGDPLSPYLFL+CAEGL  L+ QA I   + G+S+    P ++HLF+AD
Subjt:  VEWIFIRKVMEKMGFCSRWINLIMQCVESVSFQVLLNGVPGTEFKPNQGLRQGDPLSPYLFLICAEGLSGLLNQAAIRKEVIGLSINKYCPIITHLFYAD

Query:  DCLLFFKASNKDCRYIKHLLGVYERASGQTINFEKSNFMVSPNTNDDLVREIQTLLQVKHSLNLGQYLGLPSQNARSKKEIFNNIKDRVWKVLQGWKGRF
        D LLF +A++++   I  +L  YE ASGQ IN EK+    SPNT+  +  EI+TLL V  + N  +YLGLPS   R KK+ F  I++R+W  +QGWK R 
Subjt:  DCLLFFKASNKDCRYIKHLLGVYERASGQTINFEKSNFMVSPNTNDDLVREIQTLLQVKHSLNLGQYLGLPSQNARSKKEIFNNIKDRVWKVLQGWKGRF

Query:  VSAAGRETLVKSVAQAIPNYAMSCFKFPISLCNDLKYICARFWWGSGDQGRKIHWQSWKRLCINKAHGGMRFRDITIFNQAMLAKQSWRIICNPDSLVAK
        +S  GRE L+K+V QA+P + M CFK P SLC D++ +  +FWWG   + RKIHW  WK+LC +K+HGG+ F+DI +FN AML KQ WR+I N DSL  K
Subjt:  VSAAGRETLVKSVAQAIPNYAMSCFKFPISLCNDLKYICARFWWGSGDQGRKIHWQSWKRLCINKAHGGMRFRDITIFNQAMLAKQSWRIICNPDSLVAK

Query:  VLRGRYFKTGQFLKAGLGANPSYTWWSIVWGRELFIKGYRWRIG
        V + ++F     L  G+  N SY W SI+  R +   G +WRIG
Subjt:  VLRGRYFKTGQFLKAGLGANPSYTWWSIVWGRELFIKGYRWRIG

XP_030946812.1 uncharacterized protein LOC115971195 [Quercus lobata]4.8e-13552.25Show/hide
Query:  INRTYIVLIPKIQDPKEMKDFRSISLCTVLYKIIAKVIANRLKGVLNTIISPSQSAFVPGRLITDNAIIGFECIHAVRSKRKGKNGTVALELDMSKTYDR
        +N T+I LIPKI+ P++  DFR ISLC VLYKI++K IANRLK +L  ++S SQSAF+  RLI+DN ++ FE +H +++K KGK G +A++LDMSK YDR
Subjt:  INRTYIVLIPKIQDPKEMKDFRSISLCTVLYKIIAKVIANRLKGVLNTIISPSQSAFVPGRLITDNAIIGFECIHAVRSKRKGKNGTVALELDMSKTYDR

Query:  VEWIFIRKVMEKMGFCSRWINLIMQCVESVSFQVLLNGVPGTEFKPNQGLRQGDPLSPYLFLICAEGLSGLLNQAAIRKEVIGLSINKYCPIITHLFYAD
        VEW F+ KVMEK+GF +RWI L+  C+ SVSF VL+NG P   F PN+GLRQGDPLSPYLFL+CAEGL  L+ Q  I   + G+S+    P ++HLF+AD
Subjt:  VEWIFIRKVMEKMGFCSRWINLIMQCVESVSFQVLLNGVPGTEFKPNQGLRQGDPLSPYLFLICAEGLSGLLNQAAIRKEVIGLSINKYCPIITHLFYAD

Query:  DCLLFFKASNKDCRYIKHLLGVYERASGQTINFEKSNFMVSPNTNDDLVREIQTLLQVKHSLNLGQYLGLPSQNARSKKEIFNNIKDRVWKVLQGWKGRF
        D LLF +A++++   I  +L  YE ASGQ IN EK+    SPNT+  +  EI+TLL V  + N  +YLGLPS   R KK+ F  I++RVW+ +QGWK R 
Subjt:  DCLLFFKASNKDCRYIKHLLGVYERASGQTINFEKSNFMVSPNTNDDLVREIQTLLQVKHSLNLGQYLGLPSQNARSKKEIFNNIKDRVWKVLQGWKGRF

Query:  VSAAGRETLVKSVAQAIPNYAMSCFKFPISLCNDLKYICARFWWGSGDQGRKIHWQSWKRLCINKAHGGMRFRDITIFNQAMLAKQSWRIICNPDSLVAK
        +S  GRE L+K+V QA+P + M CFK P SLC D++ +  +FWWG   + RKIHW  WK+LC +K+ GG+ F+DI +FN AML KQ WR+I N DSL  K
Subjt:  VSAAGRETLVKSVAQAIPNYAMSCFKFPISLCNDLKYICARFWWGSGDQGRKIHWQSWKRLCINKAHGGMRFRDITIFNQAMLAKQSWRIICNPDSLVAK

Query:  VLRGRYFKTGQFLKAGLGANPSYTWWSIVWGRELFIKGYRWRIG
        V + +YF     L  G+  N SY W SI+  R +   G +WRIG
Subjt:  VLRGRYFKTGQFLKAGLGANPSYTWWSIVWGRELFIKGYRWRIG

TrEMBL top hitse value%identityAlignment
A0A2N9F086 Reverse transcriptase domain-containing protein2.8e-13650Show/hide
Query:  INRTYIVLIPKIQDPKEMKDFRSISLCTVLYKIIAKVIANRLKGVLNTIISPSQSAFVPGRLITDNAIIGFECIHAVRSKRKGKNGTVALELDMSKTYDR
        +N T+IVLIPK+++P +M D+R ISLC VLYK+++K +ANRLK +L  ++S SQSAFVPGRLITDN I+ +E +H ++ KR G+ G++A++LDMSK YDR
Subjt:  INRTYIVLIPKIQDPKEMKDFRSISLCTVLYKIIAKVIANRLKGVLNTIISPSQSAFVPGRLITDNAIIGFECIHAVRSKRKGKNGTVALELDMSKTYDR

Query:  VEWIFIRKVMEKMGFCSRWINLIMQCVESVSFQVLLNGVPGTEFKPNQGLRQGDPLSPYLFLICAEGLSGLLNQAAIRKEVIGLSINKYCPIITHLFYAD
        VEW F+ ++M+K+GF  RWINL+M+C+ + S+ VL+NG P     P++G+RQGDPLSPY+FL+CAEG S L+ +A I K++ G+SI++  P ++HL +AD
Subjt:  VEWIFIRKVMEKMGFCSRWINLIMQCVESVSFQVLLNGVPGTEFKPNQGLRQGDPLSPYLFLICAEGLSGLLNQAAIRKEVIGLSINKYCPIITHLFYAD

Query:  DCLLFFKASNKDCRYIKHLLGVYERASGQTINFEKSNFMVSPNTNDDLVREIQTLLQVKHSLNLGQYLGLPSQNARSKKEIFNNIKDRVWKVLQGWKGRF
        D LLF +A  ++CR +  +L +YE++SGQ IN +K+    S NT ++  REIQ +   +  L   +YLG+P+   RSK+  F+ +KDR+ K LQGW  +F
Subjt:  DCLLFFKASNKDCRYIKHLLGVYERASGQTINFEKSNFMVSPNTNDDLVREIQTLLQVKHSLNLGQYLGLPSQNARSKKEIFNNIKDRVWKVLQGWKGRF

Query:  VSAAGRETLVKSVAQAIPNYAMSCFKFPISLCNDLKYICARFWWGSGDQGRKIHWQSWKRLCINKAHGGMRFRDITIFNQAMLAKQSWRIICNPDSLVAK
        +S AGRE L+K+VAQ+IP + MSCFK PI  C D+  + A+FWWGS    RKIHW+ W++LC  K  GG+ FRD+  FN A+L KQ WR I NP SLV +
Subjt:  VSAAGRETLVKSVAQAIPNYAMSCFKFPISLCNDLKYICARFWWGSGDQGRKIHWQSWKRLCINKAHGGMRFRDITIFNQAMLAKQSWRIICNPDSLVAK

Query:  VLRGRYFKTGQFLKAGLGANPSYTWWSIVWGRELFIKGYRWRIG
        V + +YF    F++  LG NPS  W SI+  RE    G  W++G
Subjt:  VLRGRYFKTGQFLKAGLGANPSYTWWSIVWGRELFIKGYRWRIG

A0A2N9FNH6 Reverse transcriptase domain-containing protein3.9e-13853.6Show/hide
Query:  INRTYIVLIPKIQDPKEMKDFRSISLCTVLYKIIAKVIANRLKGVLNTIISPSQSAFVPGRLITDNAIIGFECIHAVRSKRKGKNGTVALELDMSKTYDR
        +N T+I LIPKI  P+ M  FR ISLC VLYKII+KV+ANRLK VL+ IIS +QSAFVPGRLITDN ++ FE +H +++KRKG++  +A++LDMSK YDR
Subjt:  INRTYIVLIPKIQDPKEMKDFRSISLCTVLYKIIAKVIANRLKGVLNTIISPSQSAFVPGRLITDNAIIGFECIHAVRSKRKGKNGTVALELDMSKTYDR

Query:  VEWIFIRKVMEKMGFCSRWINLIMQCVESVSFQVLLNGVPGTEFKPNQGLRQGDPLSPYLFLICAEGLSGLLNQAAIRKEVIGLSINKYCPIITHLFYAD
        VEW F+  +M K+GF  RW+NLIMQC+ SVS+ V+LNG P    KP +G+RQGDPLSPYLFLICAEGL+ LL QA     V GLSI +  P I+HLF+AD
Subjt:  VEWIFIRKVMEKMGFCSRWINLIMQCVESVSFQVLLNGVPGTEFKPNQGLRQGDPLSPYLFLICAEGLSGLLNQAAIRKEVIGLSINKYCPIITHLFYAD

Query:  DCLLFFKASNKDCRYIKHLLGVYERASGQTINFEKSNFMVSPNTNDDLVREIQTLLQVKHSLNLGQYLGLPSQNARSKKEIFNNIKDRVWKVLQGWKGRF
        D LLF +A+  +C+ +  +L  YE+ASGQ +N EK++   S NT+ DL   I TLL+   + +LG+YLGLP    R KK+ F  IK ++ K L GWKG+ 
Subjt:  DCLLFFKASNKDCRYIKHLLGVYERASGQTINFEKSNFMVSPNTNDDLVREIQTLLQVKHSLNLGQYLGLPSQNARSKKEIFNNIKDRVWKVLQGWKGRF

Query:  VSAAGRETLVKSVAQAIPNYAMSCFKFPISLCNDLKYICARFWWGSGDQGRKIHWQSWKRLCINKAHGGMRFRDITIFNQAMLAKQSWRIICNPDSLVAK
        +S AGRE L+KSVAQAIP Y MSCF+ P +LC+++  + ++FWWG   + +KIHWQ W  +C  K+ GGM FRD+T+FNQA+LAKQ WR++ +P++L+ +
Subjt:  VSAAGRETLVKSVAQAIPNYAMSCFKFPISLCNDLKYICARFWWGSGDQGRKIHWQSWKRLCINKAHGGMRFRDITIFNQAMLAKQSWRIICNPDSLVAK

Query:  VLRGRYFKTGQFLKAGLGANPSYTWWSIVWGRELFIKGYRWRIG
        +L+ +YF    F++A +  + S+ W SI   R +  KG RWRIG
Subjt:  VLRGRYFKTGQFLKAGLGANPSYTWWSIVWGRELFIKGYRWRIG

A0A2N9H567 Reverse transcriptase domain-containing protein4.3e-13753.6Show/hide
Query:  INRTYIVLIPKIQDPKEMKDFRSISLCTVLYKIIAKVIANRLKGVLNTIISPSQSAFVPGRLITDNAIIGFECIHAVRSKRKGKNGTVALELDMSKTYDR
        +N T+I LIPK + P  M  FR ISLC VLYKII+KV+ANRLK VLN +IS +QSAFVPGRLITDN ++ FE +H +++KR+GK   +A++LDMSK YDR
Subjt:  INRTYIVLIPKIQDPKEMKDFRSISLCTVLYKIIAKVIANRLKGVLNTIISPSQSAFVPGRLITDNAIIGFECIHAVRSKRKGKNGTVALELDMSKTYDR

Query:  VEWIFIRKVMEKMGFCSRWINLIMQCVESVSFQVLLNGVPGTEFKPNQGLRQGDPLSPYLFLICAEGLSGLLNQAAIRKEVIGLSINKYCPIITHLFYAD
        VEW FIR +M KMGF S+W++LIMQC++SVS+ +++NG P    KP++G+RQGDPLSPYLFLICAEGL+ LL  A    ++ GLSI +  PII HLF+AD
Subjt:  VEWIFIRKVMEKMGFCSRWINLIMQCVESVSFQVLLNGVPGTEFKPNQGLRQGDPLSPYLFLICAEGLSGLLNQAAIRKEVIGLSINKYCPIITHLFYAD

Query:  DCLLFFKASNKDCRYIKHLLGVYERASGQTINFEKSNFMVSPNTNDDLVREIQTLLQVKHSLNLGQYLGLPSQNARSKKEIFNNIKDRVWKVLQGWKGRF
        D LLF++A+  + + +  +L +YE+ASGQ +N+EK++   S NT+ D+   I T L+   + +LG+YLGLP    R KK+ F  IK +V K LQGWKG+ 
Subjt:  DCLLFFKASNKDCRYIKHLLGVYERASGQTINFEKSNFMVSPNTNDDLVREIQTLLQVKHSLNLGQYLGLPSQNARSKKEIFNNIKDRVWKVLQGWKGRF

Query:  VSAAGRETLVKSVAQAIPNYAMSCFKFPISLCNDLKYICARFWWGSGDQGRKIHWQSWKRLCINKAHGGMRFRDITIFNQAMLAKQSWRIICNPDSLVAK
        +S  GRE L+KSVAQAIP + MSCF+ P SLC ++  +  RFWWG  +  RKIHWQ W  LC  K  GG+ FRD+  FNQA+LAKQ WRI+ N  +L+ K
Subjt:  VSAAGRETLVKSVAQAIPNYAMSCFKFPISLCNDLKYICARFWWGSGDQGRKIHWQSWKRLCINKAHGGMRFRDITIFNQAMLAKQSWRIICNPDSLVAK

Query:  VLRGRYFKTGQFLKAGLGANPSYTWWSIVWGRELFIKGYRWRIG
        VL+ +YF    FL+A + ++ S+TW S+   R +   G RWRIG
Subjt:  VLRGRYFKTGQFLKAGLGANPSYTWWSIVWGRELFIKGYRWRIG

A0A2N9HTH6 Reverse transcriptase domain-containing protein1.1e-13750.43Show/hide
Query:  INRTYIVLIPKIQDPKEMKDFRSISLCTVLYKIIAKVIANRLKGVLNTIISPSQSAFVPGRLITDNAIIGFECIHAVRSKRKGKNGTVALELDMSKTYDR
        IN T+IVLIPKI++P ++ D+R I+LC V+YKI++K++ANRLK VL  +IS +QSAFVPGRLITDN ++ FE +H++  K +GK G +AL+LDMSK YDR
Subjt:  INRTYIVLIPKIQDPKEMKDFRSISLCTVLYKIIAKVIANRLKGVLNTIISPSQSAFVPGRLITDNAIIGFECIHAVRSKRKGKNGTVALELDMSKTYDR

Query:  VEWIFIRKVMEKMGFCSRWINLIMQCVESVSFQVLLNGVPGTEFKPNQGLRQGDPLSPYLFLICAEGLSGLLNQAAIRKEVIGLSINKYCPIITHLFYAD
        VEW F+  VM ++GF   WI LIM C+ +VS+ +LLNGV    F  ++G+RQGDPLSPY+FL+CAEGLS LL +    +++ G++ ++  P +THLF+A+
Subjt:  VEWIFIRKVMEKMGFCSRWINLIMQCVESVSFQVLLNGVPGTEFKPNQGLRQGDPLSPYLFLICAEGLSGLLNQAAIRKEVIGLSINKYCPIITHLFYAD

Query:  DCLLFFKASNKDCRYIKHLLGVYERASGQTINFEKSNFMVSPNTNDDLVREIQTLLQVKHSLNLGQYLGLPSQNARSKKEIFNNIKDRVWKVLQGWKGRF
        D +LF +AS+++CR +  +L +YE ASGQ +N  K++   + NT+  + + I+ L  V    +  +YLGLPS   RSKK  FN IKDRVW+ + GWK + 
Subjt:  DCLLFFKASNKDCRYIKHLLGVYERASGQTINFEKSNFMVSPNTNDDLVREIQTLLQVKHSLNLGQYLGLPSQNARSKKEIFNNIKDRVWKVLQGWKGRF

Query:  VSAAGRETLVKSVAQAIPNYAMSCFKFPISLCNDLKYICARFWWGSGDQGRKIHWQSWKRLCINKAHGGMRFRDITIFNQAMLAKQSWRIICNPDSLVAK
        +S AGRE L+K+VAQ+IP Y MSCFK P SLCN+L  + + FWWG    GR +HW  W++LC++K  GG+ FRD+  FN A+LAKQ WRI+  P SLVA+
Subjt:  VSAAGRETLVKSVAQAIPNYAMSCFKFPISLCNDLKYICARFWWGSGDQGRKIHWQSWKRLCINKAHGGMRFRDITIFNQAMLAKQSWRIICNPDSLVAK

Query:  VLRGRYFKTGQFLKAGLGANPSYTWWSIVWGRELFIKGYRWRIGRDGL-------PWTIVRG
        V + +YF T  F+ A LG  PSY W SI   RE+   G RW IG DG        PW   +G
Subjt:  VLRGRYFKTGQFLKAGLGANPSYTWWSIVWGRELFIKGYRWRIGRDGL-------PWTIVRG

A0A2N9J3U0 Reverse transcriptase domain-containing protein3.9e-13853.6Show/hide
Query:  INRTYIVLIPKIQDPKEMKDFRSISLCTVLYKIIAKVIANRLKGVLNTIISPSQSAFVPGRLITDNAIIGFECIHAVRSKRKGKNGTVALELDMSKTYDR
        +N T+I LIPKI  P+ M  FR ISLC VLYKII+KV+ANRLK VL+ IIS +QSAFVPGRLITDN ++ FE +H +++KRKG++  +A++LDMSK YDR
Subjt:  INRTYIVLIPKIQDPKEMKDFRSISLCTVLYKIIAKVIANRLKGVLNTIISPSQSAFVPGRLITDNAIIGFECIHAVRSKRKGKNGTVALELDMSKTYDR

Query:  VEWIFIRKVMEKMGFCSRWINLIMQCVESVSFQVLLNGVPGTEFKPNQGLRQGDPLSPYLFLICAEGLSGLLNQAAIRKEVIGLSINKYCPIITHLFYAD
        VEW F+  +M K+GF  RW+NLIMQC+ SVS+ V+LNG P    KP +G+RQGDPLSPYLFLICAEGL+ LL QA     V GLSI +  P I+HLF+AD
Subjt:  VEWIFIRKVMEKMGFCSRWINLIMQCVESVSFQVLLNGVPGTEFKPNQGLRQGDPLSPYLFLICAEGLSGLLNQAAIRKEVIGLSINKYCPIITHLFYAD

Query:  DCLLFFKASNKDCRYIKHLLGVYERASGQTINFEKSNFMVSPNTNDDLVREIQTLLQVKHSLNLGQYLGLPSQNARSKKEIFNNIKDRVWKVLQGWKGRF
        D LLF +A+  +C+ +  +L  YE+ASGQ +N EK++   S NT+ DL   I TLL+   + +LG+YLGLP    R KK+ F  IK ++ K L GWKG+ 
Subjt:  DCLLFFKASNKDCRYIKHLLGVYERASGQTINFEKSNFMVSPNTNDDLVREIQTLLQVKHSLNLGQYLGLPSQNARSKKEIFNNIKDRVWKVLQGWKGRF

Query:  VSAAGRETLVKSVAQAIPNYAMSCFKFPISLCNDLKYICARFWWGSGDQGRKIHWQSWKRLCINKAHGGMRFRDITIFNQAMLAKQSWRIICNPDSLVAK
        +S AGRE L+KSVAQAIP Y MSCF+ P +LC+++  + ++FWWG   + +KIHWQ W  +C  K+ GGM FRD+T+FNQA+LAKQ WR++ +P++L+ +
Subjt:  VSAAGRETLVKSVAQAIPNYAMSCFKFPISLCNDLKYICARFWWGSGDQGRKIHWQSWKRLCINKAHGGMRFRDITIFNQAMLAKQSWRIICNPDSLVAK

Query:  VLRGRYFKTGQFLKAGLGANPSYTWWSIVWGRELFIKGYRWRIG
        +L+ +YF    F++A +  + S+ W SI   R +  KG RWRIG
Subjt:  VLRGRYFKTGQFLKAGLGANPSYTWWSIVWGRELFIKGYRWRIG

SwissProt top hitse value%identityAlignment
P0C2F6 Putative ribonuclease H protein At1g657501.0e-1831.64Show/hide
Query:  LPSQNARSKKEIFNNIKDRVWKVLQGWKGRFVSAAGRETLVKSVAQAIPNYAMSCFKFPISLCNDLKYICARFWWGSGDQGRKIHWQSWKRLCINKAHGG
        +P    R  K+ F  I +RV   + GW+ + +S AGR TL K+V  ++P ++MS    P S+ N L  +   F WGS  + +K H   W ++C  K  GG
Subjt:  LPSQNARSKKEIFNNIKDRVWKVLQGWKGRFVSAAGRETLVKSVAQAIPNYAMSCFKFPISLCNDLKYICARFWWGSGDQGRKIHWQSWKRLCINKAHGG

Query:  MRFRDITIFNQAMLAKQSWRIICNPDSLVAKVLRGRY----FKTGQFLKAGLGANPSYTWWSIVWG-RELFIKGYRW
        +  R     N+A+++K  WR++   +SL   VL+ +Y     +  ++L      + S TW SI  G R++   G  W
Subjt:  MRFRDITIFNQAMLAKQSWRIICNPDSLVAKVLRGRY----FKTGQFLKAGLGANPSYTWWSIVWG-RELFIKGYRW

P11369 LINE-1 retrotransposable element ORF2 protein1.6e-3229.05Show/hide
Query:  IVLIPKIQ-DPKEMKDFRSISLCTVLYKIIAKVIANRLKGVLNTIISPSQSAFVPGRLITDNAIIGFECIHAVRSKRKGKNGTVALELDMSKTYDRVEWI
        I LIPK Q DP ++++FR ISL  +  KI+ K++ANR++  +  II P Q  F+PG     N       IH + +K K KN  + + LD  K +D+++  
Subjt:  IVLIPKIQ-DPKEMKDFRSISLCTVLYKIIAKVIANRLKGVLNTIISPSQSAFVPGRLITDNAIIGFECIHAVRSKRKGKNGTVALELDMSKTYDRVEWI

Query:  FIRKVMEKMGFCSRWINLIMQCVESVSFQVLLNGVPGTEFKPNQGLRQGDPLSPYLFLICAEGLSGLLNQAAIRKEVIGLSINKYCPIITHLFYADDCLL
        F+ KV+E+ G    ++N+I          + +NG          G RQG PLSPYLF I  E L+  + Q   +KE+ G+ I K    I+ L  ADD ++
Subjt:  FIRKVMEKMGFCSRWINLIMQCVESVSFQVLLNGVPGTEFKPNQGLRQGDPLSPYLFLICAEGLSGLLNQAAIRKEVIGLSINKYCPIITHLFYADDCLL

Query:  FFKASNKDCRYIKHLLGVYERASGQTINFEKS-NFMVSPNTNDDLVREIQTLLQVKHSLNLGQYLG--LPSQNARSKKEIFNNIKDRVWKVLQGWKGRFV
        +        R + +L+  +    G  IN  KS  F+ + N   +  +EI+         N  +YLG  L  +      + F ++K  + + L+ WK    
Subjt:  FFKASNKDCRYIKHLLGVYERASGQTINFEKS-NFMVSPNTNDDLVREIQTLLQVKHSLNLGQYLG--LPSQNARSKKEIFNNIKDRVWKVLQGWKGRFV

Query:  SAAGRETLVKS--VAQAIPNYAMSCFKFPISLCNDLKYICARFWWGSGDQGRKIHWQSWKRLCINKAHGGMRFRDITIFNQAMLAKQSW
        S  GR  +VK   + +AI  +     K P    N+L+    +F W +     K    +   L   +  GG+   D+ ++ +A++ K +W
Subjt:  SAAGRETLVKS--VAQAIPNYAMSCFKFPISLCNDLKYICARFWWGSGDQGRKIHWQSWKRLCINKAHGGMRFRDITIFNQAMLAKQSW

P14381 Transposon TX1 uncharacterized 149 kDa protein2.0e-2228Show/hide
Query:  RTYIVLIPKIQDPKEMKDFRSISLCTVLYKIIAKVIANRLKGVLNTIISPSQSAFVPGRLITDNAIIGFECIHAVRSKRKGKNGTVALELDMSKTYDRVE
        R  + L+PK  D + +K++R +SL +  YKI+AK I+ RLK VL  +I P QS  VPGR I DN  +  + +H  R  R G +    L LD  K +DRV+
Subjt:  RTYIVLIPKIQDPKEMKDFRSISLCTVLYKIIAKVIANRLKGVLNTIISPSQSAFVPGRLITDNAIIGFECIHAVRSKRKGKNGTVALELDMSKTYDRVE

Query:  WIFIRKVMEKMGFCSRWINLIMQCVESVSFQVLLNGVPGTEFKPNQGLRQGDPLSPYLFLICAEGLSGLLNQAAIRKEVIGLSINKYCPIITHLFYADDC
          ++   ++   F  +++  +     S    V +N          +G+RQG PLS  L+ +  E    LL     RK + GL + +    +    YADD 
Subjt:  WIFIRKVMEKMGFCSRWINLIMQCVESVSFQVLLNGVPGTEFKPNQGLRQGDPLSPYLFLICAEGLSGLLNQAAIRKEVIGLSINKYCPIITHLFYADDC

Query:  LLFFKASNKDCRYIKHLLGVYERASGQTINFEKSNFMVSPNTNDDLVREIQTLLQVKHSLNLGQYLGL-PSQNARSKKEIFNNIKDRVWKVLQGWKG--R
        +L  +    D    +    VY  AS   IN+ KS+ ++  +   D +        +     + +YLG+  S       + F  +++ V   L  WKG  +
Subjt:  LLFFKASNKDCRYIKHLLGVYERASGQTINFEKSNFMVSPNTNDDLVREIQTLLQVKHSLNLGQYLGL-PSQNARSKKEIFNNIKDRVWKVLQGWKG--R

Query:  FVSAAGRETLVKSVAQAIPNYAMSC
         +S  GR  ++  +  +   Y + C
Subjt:  FVSAAGRETLVKSVAQAIPNYAMSC

P16423 Retrovirus-related Pol polyprotein from type-2 retrotransposable element R2DM4.1e-1229.15Show/hide
Query:  VLIPKIQDPKEMKDFRSISLCTVLYKIIAKVIANRLKGVLNTIISPSQSAFVPGRLITDNAIIGFECIHAVRSKRKGKNGTVALELDMSKTYDRVEWIFI
        V IPK    K  +DFR IS+ +VL + +  ++A RL   +N    P Q  F+P     DNA I       +R   K         LD+SK +D +    I
Subjt:  VLIPKIQDPKEMKDFRSISLCTVLYKIIAKVIANRLKGVLNTIISPSQSAFVPGRLITDNAIIGFECIHAVRSKRKGKNGTVALELDMSKTYDRVEWIFI

Query:  RKVMEKMGFCSRWINLIMQCVESVSFQVLLNGVPGTEFKPNQGLRQGDPLSPYLFLICAEGLSGLLNQAAIRKEVIGLSINKYCPIITHLFYADDCLLF
           +   G    +++ +    E     +  +G    EF P +G++QGDPLSP LF +  + L   L         IG  +     I     +ADD +LF
Subjt:  RKVMEKMGFCSRWINLIMQCVESVSFQVLLNGVPGTEFKPNQGLRQGDPLSPYLFLICAEGLSGLLNQAAIRKEVIGLSINKYCPIITHLFYADDCLLF

P93295 Uncharacterized mitochondrial protein AtMg003103.7e-2947.01Show/hide
Query:  AIPNYAMSCFKFPISLCNDLKYICARFWWGSGDQGRKIHWQSWKRLCINKA-HGGMRFRDITIFNQAMLAKQSWRIICNPDSLVAKVLRGRYFKTGQFLK
        A+P YAMSCF+    LC  L      FWW S +  RKI W +W++LC +K   GG+ FRD+  FNQA+LAKQS+RII  P +L++++LR RYF     ++
Subjt:  AIPNYAMSCFKFPISLCNDLKYICARFWWGSGDQGRKIHWQSWKRLCINKA-HGGMRFRDITIFNQAMLAKQSWRIICNPDSLVAKVLRGRYFKTGQFLK

Query:  AGLGANPSYTWWSIVWGRELFIKGYRWRIGRDGL
          +G  PSY W SI+ GREL  +G    IG DG+
Subjt:  AGLGANPSYTWWSIVWGRELFIKGYRWRIGRDGL

Arabidopsis top hitse value%identityAlignment
AT4G20520.1 RNA binding;RNA-directed DNA polymerases8.5e-1336.14Show/hide
Query:  IANRLKGVLNTIISPSQSAFVPGRLITDNAIIGFECIHAVRSKRKGKNGTVALELDMSKTYDRVEWIFIRKVMEKMGFCSRWI
        +  RLK ++  +I P+Q++F+PGR+ TDN +   E +H++R ++KG  G + L+LD+ K YDR+ W ++   +   GF   W+
Subjt:  IANRLKGVLNTIISPSQSAFVPGRLITDNAIIGFECIHAVRSKRKGKNGTVALELDMSKTYDRVEWIFIRKVMEKMGFCSRWI

AT4G29090.1 Ribonuclease H-like superfamily protein3.2e-2841.09Show/hide
Query:  AIPNYAMSCFKFPISLCNDLKYICARFWWGSGDQGRKIHWQSWKRLCINKAHGGMRFRDITIFNQAMLAKQSWRIICNPDSLVAKVLRGRYFKTGQFLKA
        A+P Y M+CF  P ++C  +  + A FWW +  + + +HW++W  L   KA GG+ F+DI  FN A+L KQ WR++  P+SL+AKV + RYF     L A
Subjt:  AIPNYAMSCFKFPISLCNDLKYICARFWWGSGDQGRKIHWQSWKRLCINKAHGGMRFRDITIFNQAMLAKQSWRIICNPDSLVAKVLRGRYFKTGQFLKA

Query:  GLGANPSYTWWSIVWGRELFIKGYRWRIG
         LG+ PS+ W SI   +E+  +G R  +G
Subjt:  GLGANPSYTWWSIVWGRELFIKGYRWRIG

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein2.6e-3047.01Show/hide
Query:  AIPNYAMSCFKFPISLCNDLKYICARFWWGSGDQGRKIHWQSWKRLCINKA-HGGMRFRDITIFNQAMLAKQSWRIICNPDSLVAKVLRGRYFKTGQFLK
        A+P YAMSCF+    LC  L      FWW S +  RKI W +W++LC +K   GG+ FRD+  FNQA+LAKQS+RII  P +L++++LR RYF     ++
Subjt:  AIPNYAMSCFKFPISLCNDLKYICARFWWGSGDQGRKIHWQSWKRLCINKA-HGGMRFRDITIFNQAMLAKQSWRIICNPDSLVAKVLRGRYFKTGQFLK

Query:  AGLGANPSYTWWSIVWGRELFIKGYRWRIGRDGL
          +G  PSY W SI+ GREL  +G    IG DG+
Subjt:  AGLGANPSYTWWSIVWGRELFIKGYRWRIGRDGL

ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase)3.8e-1347.76Show/hide
Query:  LLNGVPGTEFKPNQGLRQGDPLSPYLFLICAEGLSGLLNQAAIRKEVIGLSINKYCPIITHLFYADD
        ++NG P     P++GLRQGDPLSPYLF++C E LSGL  +A  +  + G+ ++   P I HL +ADD
Subjt:  LLNGVPGTEFKPNQGLRQGDPLSPYLFLICAEGLSGLLNQAAIRKEVIGLSINKYCPIITHLFYADD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGGCACATAAACCGGACTTATATTGTATTAATTCCAAAGATACAAGATCCAAAAGAAATGAAGGATTTTAGGTCGATTAGCTTATGCACGGTTCTTTACAAAATAAT
TGCCAAAGTTATAGCTAATAGGCTTAAAGGGGTGTTGAATACCATCATTTCCCCGAGTCAATCCGCTTTTGTTCCTGGAAGACTTATAACAGATAATGCAATCATTGGGT
TCGAATGCATTCATGCTGTCAGAAGCAAGAGGAAAGGAAAAAATGGGACGGTTGCTCTAGAATTAGATATGAGCAAGACATACGATAGGGTTGAATGGATTTTCATTAGG
AAAGTTATGGAAAAGATGGGCTTTTGCAGTAGATGGATCAATCTGATTATGCAATGTGTGGAATCAGTTAGTTTCCAAGTTTTGTTAAATGGGGTTCCTGGAACGGAATT
CAAACCTAATCAAGGCCTGAGACAAGGCGACCCTCTATCCCCATATCTGTTTCTGATCTGTGCTGAAGGATTATCCGGCCTCTTGAATCAAGCTGCGATAAGGAAGGAGG
TAATAGGTTTGAGTATCAATAAGTATTGTCCTATTATAACCCATTTGTTCTATGCAGATGATTGCCTTTTGTTTTTCAAAGCCTCTAATAAAGATTGCAGGTACATCAAA
CATCTTCTGGGGGTTTACGAGCGAGCCTCGGGGCAAACAATAAACTTCGAGAAATCGAATTTTATGGTTAGCCCAAATACTAACGATGATCTCGTCAGAGAGATTCAAAC
TCTTTTGCAAGTTAAGCACTCGCTCAACTTGGGTCAGTACCTTGGATTACCATCCCAGAATGCTCGAAGTAAAAAGGAGATATTCAACAACATCAAGGACAGAGTGTGGA
AAGTTTTGCAAGGATGGAAAGGAAGATTCGTCTCGGCGGCTGGAAGGGAGACTCTAGTTAAATCTGTGGCACAAGCCATTCCTAATTATGCCATGAGTTGTTTCAAGTTC
CCTATTTCTTTATGTAATGATCTGAAATATATTTGTGCTAGGTTCTGGTGGGGATCAGGCGACCAGGGTCGGAAAATCCATTGGCAAAGTTGGAAACGCCTCTGTATTAA
CAAAGCTCATGGAGGGATGAGATTTAGAGATATAACAATTTTTAATCAAGCCATGCTCGCAAAGCAAAGCTGGAGAATCATTTGCAACCCTGATAGTCTTGTAGCAAAAG
TGCTCCGAGGAAGATATTTTAAAACCGGTCAATTCCTAAAGGCTGGGTTGGGGGCTAATCCATCCTATACTTGGTGGAGTATCGTATGGGGTAGAGAGCTTTTTATAAAA
GGTTATCGGTGGCGTATTGGAAGGGATGGACTACCATGGACTATTGTGAGAGGCTTTGGAATGGGCACAGTGGAGGTTCAATGGACAAAATGGAGGAGACAAATATACAG
AGGAGCCTAA
mRNA sequenceShow/hide mRNA sequence
ATGGGGCACATAAACCGGACTTATATTGTATTAATTCCAAAGATACAAGATCCAAAAGAAATGAAGGATTTTAGGTCGATTAGCTTATGCACGGTTCTTTACAAAATAAT
TGCCAAAGTTATAGCTAATAGGCTTAAAGGGGTGTTGAATACCATCATTTCCCCGAGTCAATCCGCTTTTGTTCCTGGAAGACTTATAACAGATAATGCAATCATTGGGT
TCGAATGCATTCATGCTGTCAGAAGCAAGAGGAAAGGAAAAAATGGGACGGTTGCTCTAGAATTAGATATGAGCAAGACATACGATAGGGTTGAATGGATTTTCATTAGG
AAAGTTATGGAAAAGATGGGCTTTTGCAGTAGATGGATCAATCTGATTATGCAATGTGTGGAATCAGTTAGTTTCCAAGTTTTGTTAAATGGGGTTCCTGGAACGGAATT
CAAACCTAATCAAGGCCTGAGACAAGGCGACCCTCTATCCCCATATCTGTTTCTGATCTGTGCTGAAGGATTATCCGGCCTCTTGAATCAAGCTGCGATAAGGAAGGAGG
TAATAGGTTTGAGTATCAATAAGTATTGTCCTATTATAACCCATTTGTTCTATGCAGATGATTGCCTTTTGTTTTTCAAAGCCTCTAATAAAGATTGCAGGTACATCAAA
CATCTTCTGGGGGTTTACGAGCGAGCCTCGGGGCAAACAATAAACTTCGAGAAATCGAATTTTATGGTTAGCCCAAATACTAACGATGATCTCGTCAGAGAGATTCAAAC
TCTTTTGCAAGTTAAGCACTCGCTCAACTTGGGTCAGTACCTTGGATTACCATCCCAGAATGCTCGAAGTAAAAAGGAGATATTCAACAACATCAAGGACAGAGTGTGGA
AAGTTTTGCAAGGATGGAAAGGAAGATTCGTCTCGGCGGCTGGAAGGGAGACTCTAGTTAAATCTGTGGCACAAGCCATTCCTAATTATGCCATGAGTTGTTTCAAGTTC
CCTATTTCTTTATGTAATGATCTGAAATATATTTGTGCTAGGTTCTGGTGGGGATCAGGCGACCAGGGTCGGAAAATCCATTGGCAAAGTTGGAAACGCCTCTGTATTAA
CAAAGCTCATGGAGGGATGAGATTTAGAGATATAACAATTTTTAATCAAGCCATGCTCGCAAAGCAAAGCTGGAGAATCATTTGCAACCCTGATAGTCTTGTAGCAAAAG
TGCTCCGAGGAAGATATTTTAAAACCGGTCAATTCCTAAAGGCTGGGTTGGGGGCTAATCCATCCTATACTTGGTGGAGTATCGTATGGGGTAGAGAGCTTTTTATAAAA
GGTTATCGGTGGCGTATTGGAAGGGATGGACTACCATGGACTATTGTGAGAGGCTTTGGAATGGGCACAGTGGAGGTTCAATGGACAAAATGGAGGAGACAAATATACAG
AGGAGCCTAA
Protein sequenceShow/hide protein sequence
MGHINRTYIVLIPKIQDPKEMKDFRSISLCTVLYKIIAKVIANRLKGVLNTIISPSQSAFVPGRLITDNAIIGFECIHAVRSKRKGKNGTVALELDMSKTYDRVEWIFIR
KVMEKMGFCSRWINLIMQCVESVSFQVLLNGVPGTEFKPNQGLRQGDPLSPYLFLICAEGLSGLLNQAAIRKEVIGLSINKYCPIITHLFYADDCLLFFKASNKDCRYIK
HLLGVYERASGQTINFEKSNFMVSPNTNDDLVREIQTLLQVKHSLNLGQYLGLPSQNARSKKEIFNNIKDRVWKVLQGWKGRFVSAAGRETLVKSVAQAIPNYAMSCFKF
PISLCNDLKYICARFWWGSGDQGRKIHWQSWKRLCINKAHGGMRFRDITIFNQAMLAKQSWRIICNPDSLVAKVLRGRYFKTGQFLKAGLGANPSYTWWSIVWGRELFIK
GYRWRIGRDGLPWTIVRGFGMGTVEVQWTKWRRQIYRGA