; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0020188 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0020188
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr5:48680001..48687585
RNA-Seq ExpressionLag0020188
SyntenyLag0020188
Gene Ontology termsGO:0090304 - nucleic acid metabolic process (biological process)
GO:0003824 - catalytic activity (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR025558 - Domain of unknown function DUF4283
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
BBN69274.1 TatD related DNase [Prunus dulcis]1.7e-10353.13Show/hide
Query:  AVRIKDFRPISLTTSVYKVIAKVLAERMRKVMPNIISQAQSAFISGRQILDLVLIANEVVEEYRAKKKKGWILKLDLEKAFDRVDWDFLEYVLRLKGFCE
        +V++ DFRPISL TS+YKVI+KVLA R+R+V+ N ISQ Q AF+  RQILD VL+ANEVVEE R + +KG + K+D EKA+D V+W F++ V+  KGF  
Subjt:  AVRIKDFRPISLTTSVYKVIAKVLAERMRKVMPNIISQAQSAFISGRQILDLVLIANEVVEEYRAKKKKGWILKLDLEKAFDRVDWDFLEYVLRLKGFCE

Query:  KWIEWINGCVRNPKFSIFINGRPRGRICASRGLRQGDPLSPFLFLLVSEVLNAFIKIIHEKGVYEGFIVGMDKVHISILQFADDTLLFCKYDDAMLDSLI
        KW  WI GC+ +  FSI ING+PRG+  ASRGLRQGDPLSPFLF LVS+VL+  I+   +  +  G + G D+V +S LQFADDT+ F    +    +L+
Subjt:  KWIEWINGCVRNPKFSIFINGRPRGRICASRGLRQGDPLSPFLFLLVSEVLNAFIKIIHEKGVYEGFIVGMDKVHISILQFADDTLLFCKYDDAMLDSLI

Query:  STIGLFEWCSGKKVNWEKSALCGINIDNDKILAPSSQLNCKVESLPFLYLGLPLGGYPKKVSFWQPVIDKIHKRLDRWRRFNLSRGGRLTLCNSVLSSIP
          + LF   SG K+N  KS + GIN   + +   +    C+V   P +YLGLPLGG P+ ++FW PV+DK+ KRL RW+R  LS+GGRLTL  +VLSSIP
Subjt:  STIGLFEWCSGKKVNWEKSALCGINIDNDKILAPSSQLNCKVESLPFLYLGLPLGGYPKKVSFWQPVIDKIHKRLDRWRRFNLSRGGRLTLCNSVLSSIP

Query:  LYYMSSFLMPSKVVSKVEQLIRSFLWEGSNGSKLNHLARWEQASKPLLSGGLGIGGLKNRNLSLLAK
         YYMS F MP  V +KVEQL+R+FLWEG    K  HL RW + +K    GGLGIG L+ RN +L AK
Subjt:  LYYMSSFLMPSKVVSKVEQLIRSFLWEGSNGSKLNHLARWEQASKPLLSGGLGIGGLKNRNLSLLAK

CAN70431.1 hypothetical protein VITISV_030910 [Vitis vinifera]2.5e-10250.68Show/hide
Query:  AVRIKDFRPISLTTSVYKVIAKVLAERMRKVMPNIISQAQSAFISGRQILDLVLIANEVVEEYRAKKKKGWILKLDLEKAFDRVDWDFLEYVLRLKGFCE
        +V+I D+RPISL TS+YK+IAKVL+ R+RKV+   IS +Q AF+ GR ILD VLIANEVV+E R   ++G + K+D EKA+D VDW FL++VL+ KGF +
Subjt:  AVRIKDFRPISLTTSVYKVIAKVLAERMRKVMPNIISQAQSAFISGRQILDLVLIANEVVEEYRAKKKKGWILKLDLEKAFDRVDWDFLEYVLRLKGFCE

Query:  KWIEWINGCVRNPKFSIFINGRPRGRICASRGLRQGDPLSPFLFLLVSEVLNAFIKIIHEKGVYEGFIVGMDKVHISILQFADDTLLFCKYDDAMLDSLI
        KW  WI GC+ +  F+I +NG  +G + ASRGLRQGDPLSPFLF LV++VL+  +    E G+ EGF VG D+  +S+LQFADDT+ F K     L +L 
Subjt:  KWIEWINGCVRNPKFSIFINGRPRGRICASRGLRQGDPLSPFLFLLVSEVLNAFIKIIHEKGVYEGFIVGMDKVHISILQFADDTLLFCKYDDAMLDSLI

Query:  STIGLFEWCSGKKVNWEKSALCGINIDNDKILAPSSQLNCKVESLPFLYLGLPLGGYPKKVSFWQPVIDKIHKRLDRWRRFNLSRGGRLTLCNSVLSSIP
          + +F   SG K+N EKS + GIN   + + + +S  +C+V   P  YLGLPLGG PK + FW PV+++I +RLD W++  LS GGR+TL  S LS IP
Subjt:  STIGLFEWCSGKKVNWEKSALCGINIDNDKILAPSSQLNCKVESLPFLYLGLPLGGYPKKVSFWQPVIDKIHKRLDRWRRFNLSRGGRLTLCNSVLSSIP

Query:  LYYMSSFLMPSKVVSKVEQLIRSFLWEGSNGSKLNHLARWEQASKPLLSGGLGIGGLKNRNLSLLAK
         Y++S F +P+ V SK+E++ R+FLW G+   K +HL RWE  S+P   GGLG G +  RN++LL K
Subjt:  LYYMSSFLMPSKVVSKVEQLIRSFLWEGSNGSKLNHLARWEQASKPLLSGGLGIGGLKNRNLSLLAK

RVW64408.1 LINE-1 retrotransposable element ORF2 protein [Vitis vinifera]3.2e-10250.41Show/hide
Query:  AVRIKDFRPISLTTSVYKVIAKVLAERMRKVMPNIISQAQSAFISGRQILDLVLIANEVVEEYRAKKKKGWILKLDLEKAFDRVDWDFLEYVLRLKGFCE
        +V+I D+RPISL TS+YK+IAKVL+ R+RKV+   IS +Q AF+ GR ILD VLIANEVV+E R   ++G + K+D EKA+D VDW FL++VL+ KGF +
Subjt:  AVRIKDFRPISLTTSVYKVIAKVLAERMRKVMPNIISQAQSAFISGRQILDLVLIANEVVEEYRAKKKKGWILKLDLEKAFDRVDWDFLEYVLRLKGFCE

Query:  KWIEWINGCVRNPKFSIFINGRPRGRICASRGLRQGDPLSPFLFLLVSEVLNAFIKIIHEKGVYEGFIVGMDKVHISILQFADDTLLFCKYDDAMLDSLI
        KW  WI GC+ +  F+I +NG  +G + ASRGLRQGDPLSPFLF LV++VL+  +    E G+ EGF VG D+  +S+LQFADDT+ F K     L +L 
Subjt:  KWIEWINGCVRNPKFSIFINGRPRGRICASRGLRQGDPLSPFLFLLVSEVLNAFIKIIHEKGVYEGFIVGMDKVHISILQFADDTLLFCKYDDAMLDSLI

Query:  STIGLFEWCSGKKVNWEKSALCGINIDNDKILAPSSQLNCKVESLPFLYLGLPLGGYPKKVSFWQPVIDKIHKRLDRWRRFNLSRGGRLTLCNSVLSSIP
          + +F   SG K+N EKS + GIN   + + + +S  +C+V   P  YLGLPLGG PK + FW PV+++I +RLD W++  LS GGR+TL  S LS IP
Subjt:  STIGLFEWCSGKKVNWEKSALCGINIDNDKILAPSSQLNCKVESLPFLYLGLPLGGYPKKVSFWQPVIDKIHKRLDRWRRFNLSRGGRLTLCNSVLSSIP

Query:  LYYMSSFLMPSKVVSKVEQLIRSFLWEGSNGSKLNHLARWEQASKPLLSGGLGIGGLKNRNLSLLAK
         Y++S F +P+ + SK+E++ R+FLW G+   K +HL RWE  S+P   GGLG G +  RN++LL K
Subjt:  LYYMSSFLMPSKVVSKVEQLIRSFLWEGSNGSKLNHLARWEQASKPLLSGGLGIGGLKNRNLSLLAK

VVA20479.1 Hypothetical predicted protein, partial [Prunus dulcis]2.2e-10353.13Show/hide
Query:  AVRIKDFRPISLTTSVYKVIAKVLAERMRKVMPNIISQAQSAFISGRQILDLVLIANEVVEEYRAKKKKGWILKLDLEKAFDRVDWDFLEYVLRLKGFCE
        +V++ DFRPISL TS+YKVI+KVLA R+R+V+ N ISQ+Q AF+  RQILD VL+ANEVVEE R + +KG + K+D EKA+D V+W+F++ VL  KGF  
Subjt:  AVRIKDFRPISLTTSVYKVIAKVLAERMRKVMPNIISQAQSAFISGRQILDLVLIANEVVEEYRAKKKKGWILKLDLEKAFDRVDWDFLEYVLRLKGFCE

Query:  KWIEWINGCVRNPKFSIFINGRPRGRICASRGLRQGDPLSPFLFLLVSEVLNAFIKIIHEKGVYEGFIVGMDKVHISILQFADDTLLFCKYDDAMLDSLI
        KW  WI GC+ +  FSI ING+PRG+  ASRGLRQGDPLSPFLF LVS+VL+  I+   +  +  G + G D+V +S LQFADDT+ F    +    +L+
Subjt:  KWIEWINGCVRNPKFSIFINGRPRGRICASRGLRQGDPLSPFLFLLVSEVLNAFIKIIHEKGVYEGFIVGMDKVHISILQFADDTLLFCKYDDAMLDSLI

Query:  STIGLFEWCSGKKVNWEKSALCGINIDNDKILAPSSQLNCKVESLPFLYLGLPLGGYPKKVSFWQPVIDKIHKRLDRWRRFNLSRGGRLTLCNSVLSSIP
          + LF   SG K+N  KS + GIN   + +   +    C+V   P +YLGLPLGG P+ ++FW PV+DK+ KRL +W+R  LS+GGRLTL  +VLSSIP
Subjt:  STIGLFEWCSGKKVNWEKSALCGINIDNDKILAPSSQLNCKVESLPFLYLGLPLGGYPKKVSFWQPVIDKIHKRLDRWRRFNLSRGGRLTLCNSVLSSIP

Query:  LYYMSSFLMPSKVVSKVEQLIRSFLWEGSNGSKLNHLARWEQASKPLLSGGLGIGGLKNRNLSLLAK
         YYMS F MP  V  KVEQL+R+FLWEG    K  HL RW + +K    GGLGIG L+ RN +L AK
Subjt:  LYYMSSFLMPSKVVSKVEQLIRSFLWEGSNGSKLNHLARWEQASKPLLSGGLGIGGLKNRNLSLLAK

XP_038880332.1 uncharacterized protein LOC120071973 [Benincasa hispida]2.8e-10664.87Show/hide
Query:  LEYVLRLKGFCEKWIEWINGCVRNPKFSIFINGRPRGRICASRGLRQGDPLSPFLFLLVSEVLNAFIKIIHEKGVYEGFIVGMDKVHISILQFADDTLLF
        LE VL+ K F  +WI W+ GCV NPKFSIFI+GRPRGRI  +RG+RQGDP SPFLFLLVSEVL+  I  +HEKG YEGF+VG DKVHISI+QF  DTLLF
Subjt:  LEYVLRLKGFCEKWIEWINGCVRNPKFSIFINGRPRGRICASRGLRQGDPLSPFLFLLVSEVLNAFIKIIHEKGVYEGFIVGMDKVHISILQFADDTLLF

Query:  CKYDDAMLDSLISTIGLFEWCSGKKVNWEKSALCGINIDNDKILAPSSQLNCKVESLPFLYLGLPLGGYPKKVSFWQPVIDKIHKRLDRWRRFNLSRGGR
        CKY D M+++L  TI +FEWCS +KVNWEKSA+CGINI+ +K+L+ ++QLNCKV+ LP +YLGLPLGGYPK VSFWQPVIDK+  +LD+WRRFNLSRGG+
Subjt:  CKYDDAMLDSLISTIGLFEWCSGKKVNWEKSALCGINIDNDKILAPSSQLNCKVESLPFLYLGLPLGGYPKKVSFWQPVIDKIHKRLDRWRRFNLSRGGR

Query:  LTLCNSVLSSIPLYYMSSFLMPSKVVSKVEQLIRSFLWEGSNGSKLNHLARWEQASKPLLSGGLGIGGLKNRNLSLLAK
         TLC SV S++P YY+S FL+P KV+  +E+ +++F WEG  G K+NHL +WE  +K    GGLG+GGL+ RNL+ LAK
Subjt:  LTLCNSVLSSIPLYYMSSFLMPSKVVSKVEQLIRSFLWEGSNGSKLNHLARWEQASKPLLSGGLGIGGLKNRNLSLLAK

TrEMBL top hitse value%identityAlignment
A0A803P8A0 Uncharacterized protein1.4e-10653.15Show/hide
Query:  RIKDFRPISLTTSVYKVIAKVLAERMRKVMPNIISQAQSAFISGRQILDLVLIANEVVEEYRAKKKKGWILKLDLEKAFDRVDWDFLEYVLRLKGFCEKW
        ++KDFRPISL TSVYK+IAK LA R+R V+   IS+ QSAF+ GRQILD VL+ANE VE+YR++ KKG++LK+D EKA+DRVDW FL+ VLR KGF E+W
Subjt:  RIKDFRPISLTTSVYKVIAKVLAERMRKVMPNIISQAQSAFISGRQILDLVLIANEVVEEYRAKKKKGWILKLDLEKAFDRVDWDFLEYVLRLKGFCEKW

Query:  IEWINGCVRNPKFSIFINGRPRGRICASRGLRQGDPLSPFLFLLVSEVLNAFIKIIHEKGVYEGFIVGMDKVHISILQFADDTLLFCKYDDAMLDSLIST
         +WI GCV +  FSIF+NGR RG+   SRGLRQGDPLSPFLF LV++VL   +    E   + GF +G D + +S LQFADDTL F K +D+ L  L+  
Subjt:  IEWINGCVRNPKFSIFINGRPRGRICASRGLRQGDPLSPFLFLLVSEVLNAFIKIIHEKGVYEGFIVGMDKVHISILQFADDTLLFCKYDDAMLDSLIST

Query:  IGLFEWCSGKKVNWEKSALCGINIDNDKILAPSSQLNCKVESLPFLYLGLPLGGYPKKVSFWQPVIDKIHKRLDRWRRFNLSRGGRLTLCNSVLSSIPLY
        +  F   SG KVN  KS L GI + ++ +   ++ + C+V   P  YLG+PLGG P+K +FW+PV+DK  KR+D W+   LSRGGRLTL  SVLSS+P+Y
Subjt:  IGLFEWCSGKKVNWEKSALCGINIDNDKILAPSSQLNCKVESLPFLYLGLPLGGYPKKVSFWQPVIDKIHKRLDRWRRFNLSRGGRLTLCNSVLSSIPLY

Query:  YMSSFLMPSKVVSKVEQLIRSFLWEGSNGSKLNHLARWEQASKPLLSGGLGIGGLKNRNLSLLAK
        Y+S F +P  V+ ++E+++R F WEG + +  +HL  W++  KP   GGL IG L+ RN  LL K
Subjt:  YMSSFLMPSKVVSKVEQLIRSFLWEGSNGSKLNHLARWEQASKPLLSGGLGIGGLKNRNLSLLAK

A0A803QI00 Uncharacterized protein4.4e-10553.15Show/hide
Query:  RIKDFRPISLTTSVYKVIAKVLAERMRKVMPNIISQAQSAFISGRQILDLVLIANEVVEEYRAKKKKGWILKLDLEKAFDRVDWDFLEYVLRLKGFCEKW
        ++KDFRPISL TSVYK++AK LA R+R V+   IS+ QSAF+ GRQILD VLIANE VE++R++ KKG++ K+DLEKA+DRVDWDFL+ VL+ KGF E W
Subjt:  RIKDFRPISLTTSVYKVIAKVLAERMRKVMPNIISQAQSAFISGRQILDLVLIANEVVEEYRAKKKKGWILKLDLEKAFDRVDWDFLEYVLRLKGFCEKW

Query:  IEWINGCVRNPKFSIFINGRPRGRICASRGLRQGDPLSPFLFLLVSEVLNAFIKIIHEKGVYEGFIVGMDKVHISILQFADDTLLFCKYDDAMLDSLIST
         +WI GCV +  FS+ INGR RG+   SRGLRQGDPLSPFLF LV +VL   +    +   + GF VG D + IS LQFADDTL F K D+A L  L+  
Subjt:  IEWINGCVRNPKFSIFINGRPRGRICASRGLRQGDPLSPFLFLLVSEVLNAFIKIIHEKGVYEGFIVGMDKVHISILQFADDTLLFCKYDDAMLDSLIST

Query:  IGLFEWCSGKKVNWEKSALCGINIDNDKILAPSSQLNCKVESLPFLYLGLPLGGYPKKVSFWQPVIDKIHKRLDRWRRFNLSRGGRLTLCNSVLSSIPLY
        +  F   SG KVN  KS L GI+++ + +   +  + C+V + P  YLG+PLGG P+K +FW+PV+DK  KRLD W+   LSRGGRL L  SVLSS+P+Y
Subjt:  IGLFEWCSGKKVNWEKSALCGINIDNDKILAPSSQLNCKVESLPFLYLGLPLGGYPKKVSFWQPVIDKIHKRLDRWRRFNLSRGGRLTLCNSVLSSIPLY

Query:  YMSSFLMPSKVVSKVEQLIRSFLWEGSNGSKLNHLARWEQASKPLLSGGLGIGGLKNRNLSLLAK
        Y+S F  P  V+  +E+++R F WEG + +  +HL  W++  KP   GGL IG L+ RN  LL K
Subjt:  YMSSFLMPSKVVSKVEQLIRSFLWEGSNGSKLNHLARWEQASKPLLSGGLGIGGLKNRNLSLLAK

A0A803QQM3 Uncharacterized protein6.4e-10452.6Show/hide
Query:  RIKDFRPISLTTSVYKVIAKVLAERMRKVMPNIISQAQSAFISGRQILDLVLIANEVVEEYRAKKKKGWILKLDLEKAFDRVDWDFLEYVLRLKGFCEKW
        ++KD+RPISL TSVYK+IAK LA R+R V+   IS+ QSAF+ GRQILD VL+ANE VE+YR++ KKG +LK+D EKA+DRVDW FL+ V+R KGF E+W
Subjt:  RIKDFRPISLTTSVYKVIAKVLAERMRKVMPNIISQAQSAFISGRQILDLVLIANEVVEEYRAKKKKGWILKLDLEKAFDRVDWDFLEYVLRLKGFCEKW

Query:  IEWINGCVRNPKFSIFINGRPRGRICASRGLRQGDPLSPFLFLLVSEVLNAFIKIIHEKGVYEGFIVGMDKVHISILQFADDTLLFCKYDDAMLDSLIST
         +WI GCV    FSIFINGR RG+   SRGLRQ DPLSPFLF L+++VL   +    +     GF +G D + +S LQFADDTL F K D+A L  L+  
Subjt:  IEWINGCVRNPKFSIFINGRPRGRICASRGLRQGDPLSPFLFLLVSEVLNAFIKIIHEKGVYEGFIVGMDKVHISILQFADDTLLFCKYDDAMLDSLIST

Query:  IGLFEWCSGKKVNWEKSALCGINIDNDKILAPSSQLNCKVESLPFLYLGLPLGGYPKKVSFWQPVIDKIHKRLDRWRRFNLSRGGRLTLCNSVLSSIPLY
        +  F   SG KVN  KS L G+ +D D +   + Q+ C+V   P  YLG+PLGG P+K SFW+PV+DK   R+D W+   LSRGGRLTL  SVLSS+P+Y
Subjt:  IGLFEWCSGKKVNWEKSALCGINIDNDKILAPSSQLNCKVESLPFLYLGLPLGGYPKKVSFWQPVIDKIHKRLDRWRRFNLSRGGRLTLCNSVLSSIPLY

Query:  YMSSFLMPSKVVSKVEQLIRSFLWEGSNGSKLNHLARWEQASKPLLSGGLGIGGLKNRNLSLLAK
        ++S F  P  V+ ++E+++R F WEG + +  +HL  W++  KP   GGL IG L+ RN  LL K
Subjt:  YMSSFLMPSKVVSKVEQLIRSFLWEGSNGSKLNHLARWEQASKPLLSGGLGIGGLKNRNLSLLAK

M5WKV4 Reverse transcriptase domain-containing protein (Fragment)2.6e-10553.41Show/hide
Query:  AVRIKDFRPISLTTSVYKVIAKVLAERMRKVMPNIISQAQSAFISGRQILDLVLIANEVVEEYRAKKKKGWILKLDLEKAFDRVDWDFLEYVLRLKGFCE
        +V++ D+RPISL TS+YKVI+KVLA R+R+V+ N ISQ+Q AF+  RQILD VL+ANEVVEE R +K+KG + K+D EKA+D V+W+F++ VL  KGF  
Subjt:  AVRIKDFRPISLTTSVYKVIAKVLAERMRKVMPNIISQAQSAFISGRQILDLVLIANEVVEEYRAKKKKGWILKLDLEKAFDRVDWDFLEYVLRLKGFCE

Query:  KWIEWINGCVRNPKFSIFINGRPRGRICASRGLRQGDPLSPFLFLLVSEVLNAFIKIIHEKGVYEGFIVGMDKVHISILQFADDTLLFCKYDDAMLDSLI
        KW  WI GC+ +  FSI ING+PRG+  ASRGLRQGDPLSPFLF LVS+VL+  I+   +  +  G + G D+V +S LQFADDT+      +    +L+
Subjt:  KWIEWINGCVRNPKFSIFINGRPRGRICASRGLRQGDPLSPFLFLLVSEVLNAFIKIIHEKGVYEGFIVGMDKVHISILQFADDTLLFCKYDDAMLDSLI

Query:  STIGLFEWCSGKKVNWEKSALCGINIDNDKILAPSSQLNCKVESLPFLYLGLPLGGYPKKVSFWQPVIDKIHKRLDRWRRFNLSRGGRLTLCNSVLSSIP
          + LF   SG K+N  KS + GIN   D +   +    C+V   P +YLGLPLGG P+ ++FW PV+DK+ KRL +W+R  LS+GGRLTL  +VLSSIP
Subjt:  STIGLFEWCSGKKVNWEKSALCGINIDNDKILAPSSQLNCKVESLPFLYLGLPLGGYPKKVSFWQPVIDKIHKRLDRWRRFNLSRGGRLTLCNSVLSSIP

Query:  LYYMSSFLMPSKVVSKVEQLIRSFLWEGSNGSKLNHLARWEQASKPLLSGGLGIGGLKNRNLSLLAK
         YYMS F MP  V +KVEQL+R+FLWEG    K  HL RWE+ +K    GGLGIG L+ RN +L AK
Subjt:  LYYMSSFLMPSKVVSKVEQLIRSFLWEGSNGSKLNHLARWEQASKPLLSGGLGIGGLKNRNLSLLAK

M5XHS0 Reverse transcriptase domain-containing protein (Fragment)1.3e-10453.13Show/hide
Query:  AVRIKDFRPISLTTSVYKVIAKVLAERMRKVMPNIISQAQSAFISGRQILDLVLIANEVVEEYRAKKKKGWILKLDLEKAFDRVDWDFLEYVLRLKGFCE
        +V++ D+RPISL TS+YKVI+KVL  R+R+V+ N ISQ+Q AF+  RQILD VL+ANEVVEE R +K+KG + K+D EKA+D V+W+F++ VL  KGF  
Subjt:  AVRIKDFRPISLTTSVYKVIAKVLAERMRKVMPNIISQAQSAFISGRQILDLVLIANEVVEEYRAKKKKGWILKLDLEKAFDRVDWDFLEYVLRLKGFCE

Query:  KWIEWINGCVRNPKFSIFINGRPRGRICASRGLRQGDPLSPFLFLLVSEVLNAFIKIIHEKGVYEGFIVGMDKVHISILQFADDTLLFCKYDDAMLDSLI
        KW  WI GC+ +  FSI ING+PRG+  ASRGLRQGDPLSPFLF LVS+VL+  I+   +  +  G + G D+V +S LQFADDT+      +    +L+
Subjt:  KWIEWINGCVRNPKFSIFINGRPRGRICASRGLRQGDPLSPFLFLLVSEVLNAFIKIIHEKGVYEGFIVGMDKVHISILQFADDTLLFCKYDDAMLDSLI

Query:  STIGLFEWCSGKKVNWEKSALCGINIDNDKILAPSSQLNCKVESLPFLYLGLPLGGYPKKVSFWQPVIDKIHKRLDRWRRFNLSRGGRLTLCNSVLSSIP
          + LF   SG K+N  KS + GIN   D +   +    C+V   P +YLGLPLGG P+ ++FW PV+DK+ KRL +W+R  LS+GGRLTL  +VLSSIP
Subjt:  STIGLFEWCSGKKVNWEKSALCGINIDNDKILAPSSQLNCKVESLPFLYLGLPLGGYPKKVSFWQPVIDKIHKRLDRWRRFNLSRGGRLTLCNSVLSSIP

Query:  LYYMSSFLMPSKVVSKVEQLIRSFLWEGSNGSKLNHLARWEQASKPLLSGGLGIGGLKNRNLSLLAK
         YYMS F MP  V +KVEQL+R+FLWEG    K  HL RWE+ +K    GGLGIG L+ RN +L AK
Subjt:  LYYMSSFLMPSKVVSKVEQLIRSFLWEGSNGSKLNHLARWEQASKPLLSGGLGIGGLKNRNLSLLAK

SwissProt top hitse value%identityAlignment
O00370 LINE-1 retrotransposable element ORF2 protein8.7e-2626.91Show/hide
Query:  KDFRPISLTTSVYKVIAKVLAERMRKVMPNIISQAQSAFISGRQ-ILDLVLIANEVVEEYRAKKKKGWILKLDLEKAFDRVDWDFLEYVLRLKGFCEKWI
        ++FRPISL     K++ K+LA R+++ +  +I   Q  FI G Q   ++    N +    RAK K   I+ +D EKAFD++   F+   L   G    ++
Subjt:  KDFRPISLTTSVYKVIAKVLAERMRKVMPNIISQAQSAFISGRQ-ILDLVLIANEVVEEYRAKKKKGWILKLDLEKAFDRVDWDFLEYVLRLKGFCEKWI

Query:  EWINGCVRNPKFSIFINGRPRGRICASRGLRQGDPLSPFLFLLVSEVLNAFIKIIHEKGVYEGFIVGMDKVHISILQFADDTLLFCKYDDAMLDSLISTI
        + I      P  +I +NG+         G RQG PLSP LF +V EVL    + I ++   +G  +G ++V +S+  FADD +++ +       +L+  I
Subjt:  EWINGCVRNPKFSIFINGRPRGRICASRGLRQGDPLSPFLFLLVSEVLNAFIKIIHEKGVYEGFIVGMDKVHISILQFADDTLLFCKYDDAMLDSLISTI

Query:  GLFEWCSGKKVNWEKSALCGINIDNDKILAPSSQLNCKVESLPFLYLGLPLGGYPKKV--SFWQPVIDKIHKRLDRWRRFNLSRGGRLTLCNSVLSSIPL
          F   SG K+N +KS     N +         +L   + S    YLG+ L    K +    ++P++ +I +  ++W+    S  GR+ +    +    +
Subjt:  GLFEWCSGKKVNWEKSALCGINIDNDKILAPSSQLNCKVESLPFLYLGLPLGGYPKKV--SFWQPVIDKIHKRLDRWRRFNLSRGGRLTLCNSVLSSIPL

Query:  YYMSS--FLMPSKVVSKVEQLIRSFLW
        Y  ++    +P    +++E+    F+W
Subjt:  YYMSS--FLMPSKVVSKVEQLIRSFLW

P08548 LINE-1 reverse transcriptase homolog1.5e-2225.68Show/hide
Query:  RIKDFRPISLTTSVYKVIAKVLAERMRKVMPNIISQAQSAFISGRQ-ILDLVLIANEVVEEYRAKKKKGWILKLDLEKAFDRVDWDFLEYVLRLKGFCEK
        R +++RPISL     K++ K+L  R+++ +  II   Q  FI G Q   ++    N +    + K K   IL +D EKAFD +   F+   L+  G    
Subjt:  RIKDFRPISLTTSVYKVIAKVLAERMRKVMPNIISQAQSAFISGRQ-ILDLVLIANEVVEEYRAKKKKGWILKLDLEKAFDRVDWDFLEYVLRLKGFCEK

Query:  WIEWINGCVRNPKFSIFINGRPRGRICASRGLRQGDPLSPFLFLLVSEVLNAFIKIIHEKGVYEGFIVGMDKVHISILQFADDTLLFCKYDDAMLDSLIS
        +++ I      P  +I +NG          G RQG PLSP LF +V EVL   I+   E+   +G  +G +++ +S+  FADD +++ +        L+ 
Subjt:  WIEWINGCVRNPKFSIFINGRPRGRICASRGLRQGDPLSPFLFLLVSEVLNAFIKIIHEKGVYEGFIVGMDKVHISILQFADDTLLFCKYDDAMLDSLIS

Query:  TIGLFEWCSGKKVNWEKSALCGINIDNDKILAPSSQLNCKVESLPFLYLGLPLGGYPKKV--SFWQPVIDKIHKRLDRWRRFNLSRGGRLTLCNSVLSSI
         I  +   SG K+N  KS       +N         +   V      YLG+ L    K +    ++ +  +I + +++W+    S  GR+ +    +   
Subjt:  TIGLFEWCSGKKVNWEKSALCGINIDNDKILAPSSQLNCKVESLPFLYLGLPLGGYPKKV--SFWQPVIDKIHKRLDRWRRFNLSRGGRLTLCNSVLSSI

Query:  PLYYMSSFLM--PSKVVSKVEQLIRSFLWEGSNGSKLNHLARWEQASKPLLSGGLGIGGLKNRNLSLLAK
         +Y  ++  +  P      +E++I  F+W            +  Q +K LLS     GG+   +L L  K
Subjt:  PLYYMSSFLM--PSKVVSKVEQLIRSFLWEGSNGSKLNHLARWEQASKPLLSGGLGIGGLKNRNLSLLAK

P0C2F6 Putative ribonuclease H protein At1g657502.8e-1637.62Show/hide
Query:  VIDKIHKRLDRWRRFNLSRGGRLTLCNSVLSSIPLYYMSSFLMPSKVVSKVEQLIRSFLWEGSNGSKLNHLARWEQASKPLLSGGLGIGGLKNRNLSLLA
        +++++  R+  WR   LS  GRLTL  +VLSS+P++ MS+ L+P  ++++++QL R+FLW  +   K  HL +W +   P   GGLG+   K+ N +L++
Subjt:  VIDKIHKRLDRWRRFNLSRGGRLTLCNSVLSSIPLYYMSSFLMPSKVVSKVEQLIRSFLWEGSNGSKLNHLARWEQASKPLLSGGLGIGGLKNRNLSLLA

Query:  K
        K
Subjt:  K

P11369 LINE-1 retrotransposable element ORF2 protein8.2e-2427.27Show/hide
Query:  RIKDFRPISLTTSVYKVIAKVLAERMRKVMPNIISQAQSAFISGRQ-ILDLVLIANEVVEEYRAKKKKGWILKLDLEKAFDRVDWDFLEYVLRLKGFCEK
        +I++FRPISL     K++ K+LA R+++ +  II   Q  FI G Q   ++    N +    + K K   I+ LD EKAFD++   F+  VL   G    
Subjt:  RIKDFRPISLTTSVYKVIAKVLAERMRKVMPNIISQAQSAFISGRQ-ILDLVLIANEVVEEYRAKKKKGWILKLDLEKAFDRVDWDFLEYVLRLKGFCEK

Query:  WIEWINGCVRNPKFSIFINGRPRGRICASRGLRQGDPLSPFLFLLVSEVLNAFIKIIHEKGVYEGFIVGMDKVHISILQFADDTLLFCKYDDAMLDSLIS
        ++  I      P  +I +NG     I    G RQG PLSP+LF +V EVL    + I ++   +G  +G ++V IS+L  ADD +++          L++
Subjt:  WIEWINGCVRNPKFSIFINGRPRGRICASRGLRQGDPLSPFLFLLVSEVLNAFIKIIHEKGVYEGFIVGMDKVHISILQFADDTLLFCKYDDAMLDSLIS

Query:  TIGLFEWCSGKKVNWEKSA--LCGINIDNDKILAPSSQLNCKVESLPFLYLGLPLGGYPKKV--SFWQPVIDKIHKRLDRWRRFNLSRGGRLTLCNSVLS
         I  F    G K+N  KS   L   N   +K +  ++  +    ++   YLG+ L    K +    ++ +  +I + L RW+    S  GR+ +    + 
Subjt:  TIGLFEWCSGKKVNWEKSA--LCGINIDNDKILAPSSQLNCKVESLPFLYLGLPLGGYPKKV--SFWQPVIDKIHKRLDRWRRFNLSRGGRLTLCNSVLS

Query:  SIPLYYMSS--FLMPSKVVSKVEQLIRSFLWEGSNGSKLNHLARWEQASKPLLSGGLGIGGLK
           +Y  ++    +P++  +++E  I  F+W          L + ++      SGG+ +  LK
Subjt:  SIPLYYMSS--FLMPSKVVSKVEQLIRSFLWEGSNGSKLNHLARWEQASKPLLSGGLGIGGLK

P14381 Transposon TX1 uncharacterized 149 kDa protein3.9e-2626.97Show/hide
Query:  IKDFRPISLTTSVYKVIAKVLAERMRKVMPNIISQAQSAFISGRQILDLVLIANEVVEEYRAKKKKGWILKLDLEKAFDRVDWDFLEYVLRLKGFCEKWI
        IK++RP+SL ++ YK++AK ++ R++ V+  +I   QS  + GR I D V +  +++   R        L LD EKAFDRVD  +L   L+   F  +++
Subjt:  IKDFRPISLTTSVYKVIAKVLAERMRKVMPNIISQAQSAFISGRQILDLVLIANEVVEEYRAKKKKGWILKLDLEKAFDRVDWDFLEYVLRLKGFCEKWI

Query:  EWINGCVRNPKFSIFINGRPRGRICASRGLRQGDPLSPFLFLLVSEVLNAFIKIIHEKGVYEGFIVGMDKVHISILQFADDTLLFCKYDDAMLDSLISTI
         ++     + +  + IN      +   RG+RQG PLS  L+ L  E    F+ ++ ++    G ++    + + +  +ADD +L  + D   L+      
Subjt:  EWINGCVRNPKFSIFINGRPRGRICASRGLRQGDPLSPFLFLLVSEVLNAFIKIIHEKGVYEGFIVGMDKVHISILQFADDTLLFCKYDDAMLDSLISTI

Query:  GLFEWCSGKKVNWEKSALCGINIDNDKI-LAPSSQLNCKVESLPFLYLGLPLGG--YPKKVSFWQPVIDKIHKRLDRWRRFN--LSRGGRLTLCNSVLSS
         ++   S  ++NW KS+  G+   + K+   P +  +   ES    YLG+ L    YP   +F + + + +  RL +W+ F   LS  GR  + N +++S
Subjt:  GLFEWCSGKKVNWEKSALCGINIDNDKI-LAPSSQLNCKVESLPFLYLGLPLGG--YPKKVSFWQPVIDKIHKRLDRWRRFN--LSRGGRLTLCNSVLSS

Query:  IPLYYMSSFLMPSKVVSKVEQLIRSFLWEGSNGSKLNHLARWEQASKPLLSGGLGI
           Y +       + ++K+++ +  FLW G       H      +S PL  GG G+
Subjt:  IPLYYMSSFLMPSKVVSKVEQLIRSFLWEGSNGSKLNHLARWEQASKPLLSGGLGI

Arabidopsis top hitse value%identityAlignment
AT3G24255.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein4.9e-1636.44Show/hide
Query:  SLPFLYLGLPLGGYPKKVSFWQPVIDKIHKRLDRWRRFNLSRGGRLTLCNSVLSSIPLYYMSSFLMPSKVVSKVEQLIRSFLWEGSNGSKLNHLARWEQA
        +LP  YLGLPL       S + P+++KI  R+ +W   +LS  GRL L +SV+ S+  ++MS+F +PS  + +++ +  SFLW G   +       W   
Subjt:  SLPFLYLGLPLGGYPKKVSFWQPVIDKIHKRLDRWRRFNLSRGGRLTLCNSVLSSIPLYYMSSFLMPSKVVSKVEQLIRSFLWEGSNGSKLNHLARWEQA

Query:  SKPLLSGGLGIGGLKNRN
          P   GGLGI  LK  N
Subjt:  SKPLLSGGLGIGGLKNRN

AT4G20520.1 RNA binding;RNA-directed DNA polymerases3.9e-1347.56Show/hide
Query:  LAERMRKVMPNIISQAQSAFISGRQILDLVLIANEVVEEYRAKK-KKGW-ILKLDLEKAFDRVDWDFLEYVLRLKGFCEKWI
        + ER++ +M N+I  AQ++FI GR   D ++   E V   R KK  KGW +LKLDLEKA+DR+ WD+LE  L   GF E W+
Subjt:  LAERMRKVMPNIISQAQSAFISGRQILDLVLIANEVVEEYRAKK-KKGW-ILKLDLEKAFDRVDWDFLEYVLRLKGFCEKWI

ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase)2.8e-1150.75Show/hide
Query:  INGRPRGRICASRGLRQGDPLSPFLFLLVSEVLNAFIKIIHEKGVYEGFIVGMDKVHISILQFADDT
        ING P+G +  SRGLRQGDPLSP+LF+L +EVL+   +   E+G   G  V  +   I+ L FADDT
Subjt:  INGRPRGRICASRGLRQGDPLSPFLFLLVSEVLNAFIKIIHEKGVYEGFIVGMDKVHISILQFADDT


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATTGCACTTTCAATAGAATTCTTTATTTGCAGAATAAATCGAGATGCTCTGGGAACTCTACTGATGGTGAAGGTGGGGTATTTTAAGGAGGCAGCGGACAGGTTTTT
GTCTTCAGGAGGAACTCTACCGATGGTGAAGGTGGATATCTATCAAGCATTAGTTCATTCAGAATGTATGCTTAATCCTTTTCATGGACAGTTCCTTTTCATGGACAAAA
CACTGCTATATTCACGACATTACGTTTTGGAGGGTTCTTATGTTCAGGCTATTTGGAAGTATAAATTATCAGAGAAGGCTAAGGAGTATCTTTGGACTGGCCTTAGAAAA
TCTTTGAAGTCTCCCTCGCCTCATGCAAAGGGTTTCTTCTCAGCCCGCAAAAACTTCTTCACGAGCGTTTTGCAAAATCACAAGCGTTCTTCTCCGCTGCTTCACAGTTT
CTTCTCTGCTGCTTCACAAGGGCTTGTTATCCGACGCTGTCAACTAACCGTTGACCCAAAATCGCAGATTGTCTGCCGAATTTTTATTGTTGCTGCTTTGCTTTCTTTGG
GTGGTTCCATCCTCCTTCTCCAATGGAAGTTATCAGCTGTTGTATTCAGAATAGTTGAGATTTTGCAAAATCCTGTTTCTTCCTTCTTTCATGAGAAAATTAAGGAAGAA
TTTGGAGTCATTAGGTTGATTAAGTTCTTCTCGGATAACGAATGGTTCTTTGAATGTGTTGTTTGGCCTTCCACGGGTGGAAGAAGGATTATTCAAGTTCCAGCAGGCTT
GGATAAGAAAGGATGGTATGTTTTTTGGGAAATGATTAGGGATTTCATTCTTAAATTTCATTCTTATGAGAATCAACCTATTCGGTCATTGTCGAGCAAAGAGGAGTGTA
TTCCAGTTTTTTATAAGGTTTCAGAAGGTCAAGTCTTTCCTAATTCATATGCTGAAGTGGTAAAGCGAGGTGGTTCTTTAATAAGTTCAGTTTCATTGAATGATTCAATA
AGAAATGTCAAGGGTGTTAATGAAGAAGCTTACTGGGTTCGCAAGAATTGTGATGTGCTGGAAATAGATTTGGAAAGATCAATTGTTGTTTCTAGATTGATGGTCCATTA
TTCTTGGAAGGATGTTAAGATTGCCCTTGAGAAATTCTTTAAATCTTCTGTCTTGATTAACCCCTTCATGGATGATAAAGCTTTGATTCATGCAGCAGATGGTGGCTTGG
AATTTTCTGCAAATGGCAAGTGGAAGAAATTTGGAAACTTACATTTGAAATTGGAATTTTGGTCCTCTGAAATCCATTCACAGCCGAAGTCTATAAAAAGCTATGGAGGT
TGGGTTGCAATTAGAAATCTTCCATTGAATTTATGGCATCGTGACTCCTTTGAAGCGATTGGAAAAAACCTTGGAGGGTTGGAGGCATTTAATGAGGATTTGGTTATTTC
AAAGGAGGTCTCGGTACAAGATGAATGTATTAAGTGCAATGGTTGTATTATTCCTTCAACCAAATTGATTAATGATGATAGTTGTTTATTGAATAATGAAGATTTGAATG
GGGATTTGGTTCTTTCAAAGGATGCCTCGGTACAAGATGTAGGTATTAATTGCAGTGGTTGCTTTATTCCTTCAACAAAGATGATTAATGATGATAGATGTTTTTTGAAT
AATGAAGTACAACAGACTTTAATAGAGAGAGGCCAAGTTAATGAGATGTTGGGTTCTCCAAAAGGTGCTTCTTTGCATGACAAGAGTATTAATAATGCTGGTTGTAAAGG
TTTTAATGCCAGTATTAATGAGCCGGCTTTAGCTCTCTCTCCTTCATTAAATGTCAATGAATTTAATAAGTACAACCATCAGGAAGCCCAACAGTTTCAGGTCAAGGAAT
TTAAGGAATCTTCTATTCAAATTCCAGGGGGAAGTGATGTTTTTGTCCGAGGTATTGGTAGTTCCTTCAATCAAAGTATTCATTCCCCGGTGGATTCAGATGATGAGTCT
ATGGTTAGTGTTAGCAGTGAAGATTCTGATCAATTGTTAGATAAAGAGGATAGTGTGGAACAATTTTCAGAAGACCAAATTGTAGGAAGGCAAACAAAAGGGATGCGTAA
TTTTAATAAATTCATTGAAGACTCGGGTCTTATGGAAATTCCTTTATCAAATGGTAAATTTACATGGTCTAGGGATGATGAAGCTGCTGTCCGTATAAAAGACTTTAGGC
CTATAAGTCTTACCACTTCAGTTTATAAAGTTATTGCTAAAGTCCTTGCTGAAAGAATGAGAAAGGTAATGCCCAACATAATTTCTCAAGCTCAAAGTGCTTTTATTAGT
GGCAGACAGATTCTTGATTTAGTTCTCATTGCTAATGAAGTTGTGGAGGAATATCGAGCTAAGAAAAAAAAGGGGTGGATTTTGAAACTAGATCTTGAGAAAGCTTTTGA
TAGGGTGGATTGGGATTTCCTTGAGTATGTTCTCAGGTTGAAAGGATTTTGTGAAAAATGGATTGAATGGATAAATGGATGTGTTAGGAATCCAAAATTTTCCATTTTCA
TTAATGGTCGGCCAAGGGGGAGAATTTGTGCTTCTAGAGGTCTTAGGCAAGGAGATCCTCTTTCTCCTTTCCTATTTCTTTTAGTTAGTGAAGTATTGAATGCATTTATC
AAAATCATCCATGAGAAGGGCGTTTATGAGGGTTTCATCGTTGGCATGGATAAAGTTCATATTTCCATTCTTCAATTTGCAGATGATACTCTTTTATTTTGTAAGTATGA
TGATGCTATGCTTGATTCTCTCATTTCTACCATTGGTCTTTTTGAATGGTGCTCTGGGAAGAAAGTTAATTGGGAAAAATCTGCATTGTGTGGAATTAATATCGATAATG
ACAAGATTCTGGCTCCTTCTTCCCAGTTGAATTGCAAAGTAGAATCTCTTCCATTTTTGTATCTTGGTCTTCCATTAGGTGGTTATCCGAAAAAGGTGTCATTTTGGCAG
CCAGTGATTGATAAAATTCATAAAAGGCTTGATAGATGGAGGCGTTTTAATCTATCTAGAGGTGGCCGATTGACTTTATGCAATTCAGTTTTATCAAGCATTCCATTATA
TTATATGTCCTCGTTTCTCATGCCTTCTAAAGTTGTTTCAAAAGTGGAGCAGTTAATTAGGTCGTTTTTATGGGAAGGAAGTAATGGTTCCAAGTTAAATCACTTGGCTC
GTTGGGAGCAAGCTTCAAAACCTCTTTTGAGTGGAGGTCTCGGTATTGGTGGCTTGAAAAACAGAAACCTGTCTCTTCTTGCTAAATAA
mRNA sequenceShow/hide mRNA sequence
ATGATTGCACTTTCAATAGAATTCTTTATTTGCAGAATAAATCGAGATGCTCTGGGAACTCTACTGATGGTGAAGGTGGGGTATTTTAAGGAGGCAGCGGACAGGTTTTT
GTCTTCAGGAGGAACTCTACCGATGGTGAAGGTGGATATCTATCAAGCATTAGTTCATTCAGAATGTATGCTTAATCCTTTTCATGGACAGTTCCTTTTCATGGACAAAA
CACTGCTATATTCACGACATTACGTTTTGGAGGGTTCTTATGTTCAGGCTATTTGGAAGTATAAATTATCAGAGAAGGCTAAGGAGTATCTTTGGACTGGCCTTAGAAAA
TCTTTGAAGTCTCCCTCGCCTCATGCAAAGGGTTTCTTCTCAGCCCGCAAAAACTTCTTCACGAGCGTTTTGCAAAATCACAAGCGTTCTTCTCCGCTGCTTCACAGTTT
CTTCTCTGCTGCTTCACAAGGGCTTGTTATCCGACGCTGTCAACTAACCGTTGACCCAAAATCGCAGATTGTCTGCCGAATTTTTATTGTTGCTGCTTTGCTTTCTTTGG
GTGGTTCCATCCTCCTTCTCCAATGGAAGTTATCAGCTGTTGTATTCAGAATAGTTGAGATTTTGCAAAATCCTGTTTCTTCCTTCTTTCATGAGAAAATTAAGGAAGAA
TTTGGAGTCATTAGGTTGATTAAGTTCTTCTCGGATAACGAATGGTTCTTTGAATGTGTTGTTTGGCCTTCCACGGGTGGAAGAAGGATTATTCAAGTTCCAGCAGGCTT
GGATAAGAAAGGATGGTATGTTTTTTGGGAAATGATTAGGGATTTCATTCTTAAATTTCATTCTTATGAGAATCAACCTATTCGGTCATTGTCGAGCAAAGAGGAGTGTA
TTCCAGTTTTTTATAAGGTTTCAGAAGGTCAAGTCTTTCCTAATTCATATGCTGAAGTGGTAAAGCGAGGTGGTTCTTTAATAAGTTCAGTTTCATTGAATGATTCAATA
AGAAATGTCAAGGGTGTTAATGAAGAAGCTTACTGGGTTCGCAAGAATTGTGATGTGCTGGAAATAGATTTGGAAAGATCAATTGTTGTTTCTAGATTGATGGTCCATTA
TTCTTGGAAGGATGTTAAGATTGCCCTTGAGAAATTCTTTAAATCTTCTGTCTTGATTAACCCCTTCATGGATGATAAAGCTTTGATTCATGCAGCAGATGGTGGCTTGG
AATTTTCTGCAAATGGCAAGTGGAAGAAATTTGGAAACTTACATTTGAAATTGGAATTTTGGTCCTCTGAAATCCATTCACAGCCGAAGTCTATAAAAAGCTATGGAGGT
TGGGTTGCAATTAGAAATCTTCCATTGAATTTATGGCATCGTGACTCCTTTGAAGCGATTGGAAAAAACCTTGGAGGGTTGGAGGCATTTAATGAGGATTTGGTTATTTC
AAAGGAGGTCTCGGTACAAGATGAATGTATTAAGTGCAATGGTTGTATTATTCCTTCAACCAAATTGATTAATGATGATAGTTGTTTATTGAATAATGAAGATTTGAATG
GGGATTTGGTTCTTTCAAAGGATGCCTCGGTACAAGATGTAGGTATTAATTGCAGTGGTTGCTTTATTCCTTCAACAAAGATGATTAATGATGATAGATGTTTTTTGAAT
AATGAAGTACAACAGACTTTAATAGAGAGAGGCCAAGTTAATGAGATGTTGGGTTCTCCAAAAGGTGCTTCTTTGCATGACAAGAGTATTAATAATGCTGGTTGTAAAGG
TTTTAATGCCAGTATTAATGAGCCGGCTTTAGCTCTCTCTCCTTCATTAAATGTCAATGAATTTAATAAGTACAACCATCAGGAAGCCCAACAGTTTCAGGTCAAGGAAT
TTAAGGAATCTTCTATTCAAATTCCAGGGGGAAGTGATGTTTTTGTCCGAGGTATTGGTAGTTCCTTCAATCAAAGTATTCATTCCCCGGTGGATTCAGATGATGAGTCT
ATGGTTAGTGTTAGCAGTGAAGATTCTGATCAATTGTTAGATAAAGAGGATAGTGTGGAACAATTTTCAGAAGACCAAATTGTAGGAAGGCAAACAAAAGGGATGCGTAA
TTTTAATAAATTCATTGAAGACTCGGGTCTTATGGAAATTCCTTTATCAAATGGTAAATTTACATGGTCTAGGGATGATGAAGCTGCTGTCCGTATAAAAGACTTTAGGC
CTATAAGTCTTACCACTTCAGTTTATAAAGTTATTGCTAAAGTCCTTGCTGAAAGAATGAGAAAGGTAATGCCCAACATAATTTCTCAAGCTCAAAGTGCTTTTATTAGT
GGCAGACAGATTCTTGATTTAGTTCTCATTGCTAATGAAGTTGTGGAGGAATATCGAGCTAAGAAAAAAAAGGGGTGGATTTTGAAACTAGATCTTGAGAAAGCTTTTGA
TAGGGTGGATTGGGATTTCCTTGAGTATGTTCTCAGGTTGAAAGGATTTTGTGAAAAATGGATTGAATGGATAAATGGATGTGTTAGGAATCCAAAATTTTCCATTTTCA
TTAATGGTCGGCCAAGGGGGAGAATTTGTGCTTCTAGAGGTCTTAGGCAAGGAGATCCTCTTTCTCCTTTCCTATTTCTTTTAGTTAGTGAAGTATTGAATGCATTTATC
AAAATCATCCATGAGAAGGGCGTTTATGAGGGTTTCATCGTTGGCATGGATAAAGTTCATATTTCCATTCTTCAATTTGCAGATGATACTCTTTTATTTTGTAAGTATGA
TGATGCTATGCTTGATTCTCTCATTTCTACCATTGGTCTTTTTGAATGGTGCTCTGGGAAGAAAGTTAATTGGGAAAAATCTGCATTGTGTGGAATTAATATCGATAATG
ACAAGATTCTGGCTCCTTCTTCCCAGTTGAATTGCAAAGTAGAATCTCTTCCATTTTTGTATCTTGGTCTTCCATTAGGTGGTTATCCGAAAAAGGTGTCATTTTGGCAG
CCAGTGATTGATAAAATTCATAAAAGGCTTGATAGATGGAGGCGTTTTAATCTATCTAGAGGTGGCCGATTGACTTTATGCAATTCAGTTTTATCAAGCATTCCATTATA
TTATATGTCCTCGTTTCTCATGCCTTCTAAAGTTGTTTCAAAAGTGGAGCAGTTAATTAGGTCGTTTTTATGGGAAGGAAGTAATGGTTCCAAGTTAAATCACTTGGCTC
GTTGGGAGCAAGCTTCAAAACCTCTTTTGAGTGGAGGTCTCGGTATTGGTGGCTTGAAAAACAGAAACCTGTCTCTTCTTGCTAAATAA
Protein sequenceShow/hide protein sequence
MIALSIEFFICRINRDALGTLLMVKVGYFKEAADRFLSSGGTLPMVKVDIYQALVHSECMLNPFHGQFLFMDKTLLYSRHYVLEGSYVQAIWKYKLSEKAKEYLWTGLRK
SLKSPSPHAKGFFSARKNFFTSVLQNHKRSSPLLHSFFSAASQGLVIRRCQLTVDPKSQIVCRIFIVAALLSLGGSILLLQWKLSAVVFRIVEILQNPVSSFFHEKIKEE
FGVIRLIKFFSDNEWFFECVVWPSTGGRRIIQVPAGLDKKGWYVFWEMIRDFILKFHSYENQPIRSLSSKEECIPVFYKVSEGQVFPNSYAEVVKRGGSLISSVSLNDSI
RNVKGVNEEAYWVRKNCDVLEIDLERSIVVSRLMVHYSWKDVKIALEKFFKSSVLINPFMDDKALIHAADGGLEFSANGKWKKFGNLHLKLEFWSSEIHSQPKSIKSYGG
WVAIRNLPLNLWHRDSFEAIGKNLGGLEAFNEDLVISKEVSVQDECIKCNGCIIPSTKLINDDSCLLNNEDLNGDLVLSKDASVQDVGINCSGCFIPSTKMINDDRCFLN
NEVQQTLIERGQVNEMLGSPKGASLHDKSINNAGCKGFNASINEPALALSPSLNVNEFNKYNHQEAQQFQVKEFKESSIQIPGGSDVFVRGIGSSFNQSIHSPVDSDDES
MVSVSSEDSDQLLDKEDSVEQFSEDQIVGRQTKGMRNFNKFIEDSGLMEIPLSNGKFTWSRDDEAAVRIKDFRPISLTTSVYKVIAKVLAERMRKVMPNIISQAQSAFIS
GRQILDLVLIANEVVEEYRAKKKKGWILKLDLEKAFDRVDWDFLEYVLRLKGFCEKWIEWINGCVRNPKFSIFINGRPRGRICASRGLRQGDPLSPFLFLLVSEVLNAFI
KIIHEKGVYEGFIVGMDKVHISILQFADDTLLFCKYDDAMLDSLISTIGLFEWCSGKKVNWEKSALCGINIDNDKILAPSSQLNCKVESLPFLYLGLPLGGYPKKVSFWQ
PVIDKIHKRLDRWRRFNLSRGGRLTLCNSVLSSIPLYYMSSFLMPSKVVSKVEQLIRSFLWEGSNGSKLNHLARWEQASKPLLSGGLGIGGLKNRNLSLLAK