CuGenDBv2

Lag0038509 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0038509
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Retrovirus-related Pol polyprotein from transposon RE1
Genome location: chr2:19126431..19139896
RNA-Seq Expression: Lag0038509
Synteny: Lag0038509
Gene Ontology terms: NA
InterPro domains: IPR025724 - GAG-pre-integrase domain
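
The genome location field uses a `seqid:start..end` layout. As a minimal sketch, the string can be split into its parts and the span length computed; this assumes the coordinates are 1-based and inclusive, which the record itself does not state. The names `Location` and `parse_location` are hypothetical helpers for illustration, not part of CuGenDB.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A genomic interval as shown in the 'Genome location' field."""
    seqid: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


# Matches strings such as "chr2:19126431..19139896".
_LOCATION_RE = re.compile(r"^(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text: str) -> Location:
    """Parse a 'seqid:start..end' string into a Location."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    start, end = int(match["start"]), int(match["end"])
    if start > end:
        raise ValueError(f"start is greater than end in {text!r}")
    return Location(match["seqid"], start, end)


loc = parse_location("chr2:19126431..19139896")
print(loc.seqid, loc.start, loc.end, loc.span)
```

Under the inclusive-coordinate assumption, the interval for Lag0038509 covers 13466 positions on chr2.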


Homology
GenBank top hits (e value, % identity, alignment)
CAN61322.1 hypothetical protein VITISV_012106 [Vitis vinifera] | e value: 4.9e-57 | % identity: 34.83
Query:  LKTQLQRIKKDGLSVSQYLSKIKEIGDKFLAIGEPISYRDHFAHILDGLGSDYNAFVTYIQNRSDNPALEDVRSLLLAYEARLEKQNSVDQLNLAQASLA
        L+ +LQ  KK  +S+  Y+ KIK   D   AIGEP+S +D   ++L GLGSDYNA VT I  R D  +LE + S+LLA+E RLE+Q+S++Q++   AS +
Subjt:  LKTQLQRIKKDGLSVSQYLSKIKEIGDKFLAIGEPISYRDHFAHILDGLGSDYNAFVTYIQNRSDNPALEDVRSLLLAYEARLEKQNSVDQLNLAQASLA

Query:  TLQ---LHNNSRRPPSRVPSNNQF---------------RPPFNPFSSSP--VLSNSFSPSLLESYH---------------------------------
          +      N  R     P+NN +               R   +P S  P   L   F  +    YH                                 
Subjt:  TLQ---LHNNSRRPPSRVPSNNQF---------------RPPFNPFSSSP--VLSNSFSPSLLESYH---------------------------------

Query:  ---PDENWYLDSGATHHMTSDASSLTHSMSYGGGEHATVGDGKKVSISLIVTSFIPSLSQKPVFLDSVLHTHAITKKLLSVARLCKDNRAFVEFHSSFFL
            DE+WYLDSGA+HH+T +  +LT +  Y G +  T+G+GK +SIS I +  + S +     L  V H   I+  L+SVA+ C +N A +EFHS+ F 
Subjt:  ---PDENWYLDSGATHHMTSDASSLTHSMSYGGGEHATVGDGKKVSISLIVTSFIPSLSQKPVFLDSVLHTHAITKKLLSVARLCKDNRAFVEFHSSFFL

Query:  VKDLQTNQILLKGTLEDGLYRLVVVSSVS-----SNVPVVHSSFSATVKSPSVFLALVAAPIWHLRLGHPSDATLRQILSQLHVSFSSSAHVACGSCQMA
        VKDL T  +L +G LE+GLY+  V S++      +N    HS FS+TV++         A +WH RLGH S   + ++++  +V+        C  CQ+A
Subjt:  VKDLQTNQILLKGTLEDGLYRLVVVSSVS-----SNVPVVHSSFSATVKSPSVFLALVAAPIWHLRLGHPSDATLRQILSQLHVSFSSSAHVACGSCQMA

Query:  KSHCLLFSLAENKSCKPFKLVHSNVWGPSPQVSINGAHYFLLFID
        KSH L   L+   + KP +LV++++WGP+   S +GA YF+LF+D
Subjt:  KSHCLLFSLAENKSCKPFKLVHSNVWGPSPQVSINGAHYFLLFID

RVW36350.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera] | e value: 2.9e-57 | % identity: 34.83
Query:  LKTQLQRIKKDGLSVSQYLSKIKEIGDKFLAIGEPISYRDHFAHILDGLGSDYNAFVTYIQNRSDNPALEDVRSLLLAYEARLEKQNSVDQLNLAQASLA
        L+ +LQ  KK  +S+  Y+ KIK   D   AIGEP+S +D   ++L GLGSDYNA VT I  R D  +LE + S+LLA+E RLE+Q+S++Q++   AS +
Subjt:  LKTQLQRIKKDGLSVSQYLSKIKEIGDKFLAIGEPISYRDHFAHILDGLGSDYNAFVTYIQNRSDNPALEDVRSLLLAYEARLEKQNSVDQLNLAQASLA

Query:  TLQ---LHNNSRRPPSRVPSNNQF---------------RPPFNPFSSSP--VLSNSFSPSLLESYH---------------------------------
          +      N  R     P+NN +               R   +P S  P   L   F  +    YH                                 
Subjt:  TLQ---LHNNSRRPPSRVPSNNQF---------------RPPFNPFSSSP--VLSNSFSPSLLESYH---------------------------------

Query:  ---PDENWYLDSGATHHMTSDASSLTHSMSYGGGEHATVGDGKKVSISLIVTSFIPSLSQKPVFLDSVLHTHAITKKLLSVARLCKDNRAFVEFHSSFFL
            DE+WYLDSGA+HH+T +  +LT++  Y G +  T+G+GK +SIS I +  + S +     L  V H   I+  L+SVA+ C +N A +EFHS+ F 
Subjt:  ---PDENWYLDSGATHHMTSDASSLTHSMSYGGGEHATVGDGKKVSISLIVTSFIPSLSQKPVFLDSVLHTHAITKKLLSVARLCKDNRAFVEFHSSFFL

Query:  VKDLQTNQILLKGTLEDGLYRLVVVSSVS-----SNVPVVHSSFSATVKSPSVFLALVAAPIWHLRLGHPSDATLRQILSQLHVSFSSSAHVACGSCQMA
        VKDL T  +L +G LE+GLY+  V S++      +N    HS FS+TV++         A +WH RLGH S   + ++++  +V+        C  CQ+A
Subjt:  VKDLQTNQILLKGTLEDGLYRLVVVSSVS-----SNVPVVHSSFSATVKSPSVFLALVAAPIWHLRLGHPSDATLRQILSQLHVSFSSSAHVACGSCQMA

Query:  KSHCLLFSLAENKSCKPFKLVHSNVWGPSPQVSINGAHYFLLFID
        KSH L   L+   + KP +LV++++WGP+   S +GA YF+LF+D
Subjt:  KSHCLLFSLAENKSCKPFKLVHSNVWGPSPQVSINGAHYFLLFID

RVW69807.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera] | e value: 5.6e-77 | % identity: 43.86
Query:  VGLKTQLQRIKKDGLSVSQYLSKIKEIGDKFLAIGEPISYRDHFAHILDGLGSDYNAFVTYIQNRSDNPALEDVRSLLLAYEARLEKQNSVDQLNLAQAS
        + L +QLQRIKK  + +S+YLS++K + D+F  IGEP+SYRD    IL+GL  +Y+ FVT I NRSD P+L++V SLL  YE RL +++    LN  QA+
Subjt:  VGLKTQLQRIKKDGLSVSQYLSKIKEIGDKFLAIGEPISYRDHFAHILDGLGSDYNAFVTYIQNRSDNPALEDVRSLLLAYEARLEKQNSVDQLNLAQAS

Query:  LATLQLHNNSRRPPSRV-------------PSNNQFRPP-------FNP----FSSSPV---LSNSFSPSLLESYHPDENWYLDSGATHHMTSDASSLTH
            Q   N+  P  ++              +N  + PP       FNP     +SSP+   L+ S +P+ L S   D +WY+DSGATHH T +   +T 
Subjt:  LATLQLHNNSRRPPSRV-------------PSNNQFRPP-------FNP----FSSSPV---LSNSFSPSLLESYHPDENWYLDSGATHHMTSDASSLTH

Query:  SMSYGGGEHATVGDGKKVSISLIVTSFIPSLSQKPVFLDSVLHTHAITKKLLSVARLCKDNRAFVEFHSSFFLVKDLQTNQILLKGTLEDGLYRLVVVSS
        +M Y  G+HA VG+ K++ IS I  + + S S KP+ L+ VLHT  I+K+L+SV RL  DN+AFVEF+ +FFLVKD QT Q+LL+G LE GLY+L   ++
Subjt:  SMSYGGGEHATVGDGKKVSISLIVTSFIPSLSQKPVFLDSVLHTHAITKKLLSVARLCKDNRAFVEFHSSFFLVKDLQTNQILLKGTLEDGLYRLVVVSS

Query:  V-SSNVPVVHSSFSATVKSPSVFLALV-AAPIWHLRLGHPSDATLRQILSQLHVSFSSSAHVACGSCQMAKSHCLLFSLAENKSCKPFKLVHSNVWGPSP
        + SS+V    S+          FL+      +WH RLGHP+   + Q+L   ++ FS+S H+ C SCQ+AKSH L F L+E+++ KPF LV+S++WGPSP
Subjt:  V-SSNVPVVHSSFSATVKSPSVFLALV-AAPIWHLRLGHPSDATLRQILSQLHVSFSSSAHVACGSCQMAKSHCLLFSLAENKSCKPFKLVHSNVWGPSP

Query:  QVSINGAHYFLLFID
          S+ G  YFLLFID
Subjt:  QVSINGAHYFLLFID

RVW72303.1 Retrovirus-related Pol polyprotein from transposon RE2 [Vitis vinifera] | e value: 3.6e-60 | % identity: 39.07
Query:  LKTQLQRIKKDGLSVSQYLSKIKEIGDKFLAIGEPISYRDHFAHILDGLGSDYNAFVTYIQNRSDNPALEDVRSLLLAYEARLEKQNSVDQLNLAQASLA
        L+ +LQ  KK  LS+  Y+ K+K   D   AIGEP+S +D   ++L GLGSDYNA VT I  R D  ++E V S+LLA+E RLE+Q+S++Q +   A+ A
Subjt:  LKTQLQRIKKDGLSVSQYLSKIKEIGDKFLAIGEPISYRDHFAHILDGLGSDYNAFVTYIQNRSDNPALEDVRSLLLAYEARLEKQNSVDQLNLAQASLA

Query:  TLQLHNNSRRPPSRVPSNNQFRPPFNPFSSSPVLSNSFSPSLLESYH-PDENWYLDSGATHHMTSDASSLTHSMSYGGGEHATVGDGKKVSISLIVTSFI
        +    +NSR    R   + Q     N   S+    NS    +  S +  D+ WYLDS A+HH+T    +LT S  Y   +   +G+GK +SIS   +  +
Subjt:  TLQLHNNSRRPPSRVPSNNQFRPPFNPFSSSPVLSNSFSPSLLESYH-PDENWYLDSGATHHMTSDASSLTHSMSYGGGEHATVGDGKKVSISLIVTSFI

Query:  PSLSQKPVFLDSVLHTHAITKKLLSVARLCKDNRAFVEFHSSFFLVKDLQTNQILLKGTLEDGLYRLVVVSSVSSNVPVVHSSFSATVKSPSVFLALVAA
         S S     L  V H   I+  L+SVA+ C +N A +EF S+ F VKDL T ++L +G LE+GLYR  V++  S  V  V ++ S+T  S +  +     
Subjt:  PSLSQKPVFLDSVLHTHAITKKLLSVARLCKDNRAFVEFHSSFFLVKDLQTNQILLKGTLEDGLYRLVVVSSVSSNVPVVHSSFSATVKSPSVFLALVAA

Query:  PIWHLRLGHPSDATLRQILSQLHVSFSSSAH----VACGSCQMAKSHCLLFSLAENKSCKPFKLVHSNVWGPSPQVSINGAHYFLLFID
         +WH RLGH S   + QI+   +VSF  + +      C SCQ+AKSH L   ++ + + KP +LVH+++WGP+P  S +GA YF+LF+D
Subjt:  PIWHLRLGHPSDATLRQILSQLHVSFSSSAH----VACGSCQMAKSHCLLFSLAENKSCKPFKLVHSNVWGPSPQVSINGAHYFLLFID

RVW98057.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera] | e value: 4.9e-57 | % identity: 33.95
Query:  MISSCNLILISLHGKVGLKTQLQRIKKDGLSVSQYLSKIKEIGDKFLAIGEPISYRDHFAHILDGLGSDYNAFVTYIQNRSDNPALEDVRSLLLAYEARL
        + SSC+   I     + L+ + Q  KK  +S+  Y+ K+K   D   AIGEP+S +D   ++L GLGSDYNA VT I  R D  +LE V S+LLA+E RL
Subjt:  MISSCNLILISLHGKVGLKTQLQRIKKDGLSVSQYLSKIKEIGDKFLAIGEPISYRDHFAHILDGLGSDYNAFVTYIQNRSDNPALEDVRSLLLAYEARL

Query:  EKQNSVDQLNLAQASLATLQLHN------NSRRPPSRVPSNNQFR------------PPFNPFSSSP--VLSNSFSPSLLESYH----------------
        E+Q S++QL    A+ A+   +       N  R P+ + +N+ FR               +  S  P   L   F  ++   YH                
Subjt:  EKQNSVDQLNLAQASLATLQLHN------NSRRPPSRVPSNNQFR------------PPFNPFSSSP--VLSNSFSPSLLESYH----------------

Query:  -------------------PDENWYLDSGATHHMTSDASSLTHSMSYGGGEHATVGDGKKVSISLIVTSFIPSLSQKPVF-LDSVLHTHAITKKLLSVAR
                            D+NWYLDSGA+HH+T + ++LT++  Y G +  T+G+GK ++IS   T F    S    F L  V H   I+  L+SVA+
Subjt:  -------------------PDENWYLDSGATHHMTSDASSLTHSMSYGGGEHATVGDGKKVSISLIVTSFIPSLSQKPVF-LDSVLHTHAITKKLLSVAR

Query:  LCKDNRAFVEFHSSFFLVKDLQTNQILLKGTLEDGLYRLVVVSSVSSNVPVVHSSFSATVKSPSVFLALVAA---PIWHLRLGHPSDATLRQILSQLHVS
         C DN A +EFHS+ F VKDL T ++L +G LE+GLY+  V+S+          +    + + S F          +WH RLGH +   + +I+   +VS
Subjt:  LCKDNRAFVEFHSSFFLVKDLQTNQILLKGTLEDGLYRLVVVSSVSSNVPVVHSSFSATVKSPSVFLALVAA---PIWHLRLGHPSDATLRQILSQLHVS

Query:  FSSSAHVACGSCQMAKSHCLLFSLAENKSCKPFKLVHSNVWGPSPQVSINGAHYFLLFID---RSIKSNVPRSGSLFQ---PINGS
                C SCQ+AKSH L   L+   + KP +LV++++WGP+   S +GA YF+LF+D   RS  S +  SG L +   P N S
Subjt:  FSSSAHVACGSCQMAKSHCLLFSLAENKSCKPFKLVHSNVWGPSPQVSINGAHYFLLFID---RSIKSNVPRSGSLFQ---PINGS

TrEMBL top hits (e value, % identity, alignment)
A0A438DLM0 Retrovirus-related Pol polyprotein from transposon RE1 | e value: 1.4e-57 | % identity: 34.83
Query:  LKTQLQRIKKDGLSVSQYLSKIKEIGDKFLAIGEPISYRDHFAHILDGLGSDYNAFVTYIQNRSDNPALEDVRSLLLAYEARLEKQNSVDQLNLAQASLA
        L+ +LQ  KK  +S+  Y+ KIK   D   AIGEP+S +D   ++L GLGSDYNA VT I  R D  +LE + S+LLA+E RLE+Q+S++Q++   AS +
Subjt:  LKTQLQRIKKDGLSVSQYLSKIKEIGDKFLAIGEPISYRDHFAHILDGLGSDYNAFVTYIQNRSDNPALEDVRSLLLAYEARLEKQNSVDQLNLAQASLA

Query:  TLQ---LHNNSRRPPSRVPSNNQF---------------RPPFNPFSSSP--VLSNSFSPSLLESYH---------------------------------
          +      N  R     P+NN +               R   +P S  P   L   F  +    YH                                 
Subjt:  TLQ---LHNNSRRPPSRVPSNNQF---------------RPPFNPFSSSP--VLSNSFSPSLLESYH---------------------------------

Query:  ---PDENWYLDSGATHHMTSDASSLTHSMSYGGGEHATVGDGKKVSISLIVTSFIPSLSQKPVFLDSVLHTHAITKKLLSVARLCKDNRAFVEFHSSFFL
            DE+WYLDSGA+HH+T +  +LT++  Y G +  T+G+GK +SIS I +  + S +     L  V H   I+  L+SVA+ C +N A +EFHS+ F 
Subjt:  ---PDENWYLDSGATHHMTSDASSLTHSMSYGGGEHATVGDGKKVSISLIVTSFIPSLSQKPVFLDSVLHTHAITKKLLSVARLCKDNRAFVEFHSSFFL

Query:  VKDLQTNQILLKGTLEDGLYRLVVVSSVS-----SNVPVVHSSFSATVKSPSVFLALVAAPIWHLRLGHPSDATLRQILSQLHVSFSSSAHVACGSCQMA
        VKDL T  +L +G LE+GLY+  V S++      +N    HS FS+TV++         A +WH RLGH S   + ++++  +V+        C  CQ+A
Subjt:  VKDLQTNQILLKGTLEDGLYRLVVVSSVS-----SNVPVVHSSFSATVKSPSVFLALVAAPIWHLRLGHPSDATLRQILSQLHVSFSSSAHVACGSCQMA

Query:  KSHCLLFSLAENKSCKPFKLVHSNVWGPSPQVSINGAHYFLLFID
        KSH L   L+   + KP +LV++++WGP+   S +GA YF+LF+D
Subjt:  KSHCLLFSLAENKSCKPFKLVHSNVWGPSPQVSINGAHYFLLFID

A0A438GC62 Retrovirus-related Pol polyprotein from transposon RE1 | e value: 2.7e-77 | % identity: 43.86
Query:  VGLKTQLQRIKKDGLSVSQYLSKIKEIGDKFLAIGEPISYRDHFAHILDGLGSDYNAFVTYIQNRSDNPALEDVRSLLLAYEARLEKQNSVDQLNLAQAS
        + L +QLQRIKK  + +S+YLS++K + D+F  IGEP+SYRD    IL+GL  +Y+ FVT I NRSD P+L++V SLL  YE RL +++    LN  QA+
Subjt:  VGLKTQLQRIKKDGLSVSQYLSKIKEIGDKFLAIGEPISYRDHFAHILDGLGSDYNAFVTYIQNRSDNPALEDVRSLLLAYEARLEKQNSVDQLNLAQAS

Query:  LATLQLHNNSRRPPSRV-------------PSNNQFRPP-------FNP----FSSSPV---LSNSFSPSLLESYHPDENWYLDSGATHHMTSDASSLTH
            Q   N+  P  ++              +N  + PP       FNP     +SSP+   L+ S +P+ L S   D +WY+DSGATHH T +   +T 
Subjt:  LATLQLHNNSRRPPSRV-------------PSNNQFRPP-------FNP----FSSSPV---LSNSFSPSLLESYHPDENWYLDSGATHHMTSDASSLTH

Query:  SMSYGGGEHATVGDGKKVSISLIVTSFIPSLSQKPVFLDSVLHTHAITKKLLSVARLCKDNRAFVEFHSSFFLVKDLQTNQILLKGTLEDGLYRLVVVSS
        +M Y  G+HA VG+ K++ IS I  + + S S KP+ L+ VLHT  I+K+L+SV RL  DN+AFVEF+ +FFLVKD QT Q+LL+G LE GLY+L   ++
Subjt:  SMSYGGGEHATVGDGKKVSISLIVTSFIPSLSQKPVFLDSVLHTHAITKKLLSVARLCKDNRAFVEFHSSFFLVKDLQTNQILLKGTLEDGLYRLVVVSS

Query:  V-SSNVPVVHSSFSATVKSPSVFLALV-AAPIWHLRLGHPSDATLRQILSQLHVSFSSSAHVACGSCQMAKSHCLLFSLAENKSCKPFKLVHSNVWGPSP
        + SS+V    S+          FL+      +WH RLGHP+   + Q+L   ++ FS+S H+ C SCQ+AKSH L F L+E+++ KPF LV+S++WGPSP
Subjt:  V-SSNVPVVHSSFSATVKSPSVFLALV-AAPIWHLRLGHPSDATLRQILSQLHVSFSSSAHVACGSCQMAKSHCLLFSLAENKSCKPFKLVHSNVWGPSP

Query:  QVSINGAHYFLLFID
          S+ G  YFLLFID
Subjt:  QVSINGAHYFLLFID

A0A438GJB1 Retrovirus-related Pol polyprotein from transposon RE2 | e value: 1.8e-60 | % identity: 39.07
Query:  LKTQLQRIKKDGLSVSQYLSKIKEIGDKFLAIGEPISYRDHFAHILDGLGSDYNAFVTYIQNRSDNPALEDVRSLLLAYEARLEKQNSVDQLNLAQASLA
        L+ +LQ  KK  LS+  Y+ K+K   D   AIGEP+S +D   ++L GLGSDYNA VT I  R D  ++E V S+LLA+E RLE+Q+S++Q +   A+ A
Subjt:  LKTQLQRIKKDGLSVSQYLSKIKEIGDKFLAIGEPISYRDHFAHILDGLGSDYNAFVTYIQNRSDNPALEDVRSLLLAYEARLEKQNSVDQLNLAQASLA

Query:  TLQLHNNSRRPPSRVPSNNQFRPPFNPFSSSPVLSNSFSPSLLESYH-PDENWYLDSGATHHMTSDASSLTHSMSYGGGEHATVGDGKKVSISLIVTSFI
        +    +NSR    R   + Q     N   S+    NS    +  S +  D+ WYLDS A+HH+T    +LT S  Y   +   +G+GK +SIS   +  +
Subjt:  TLQLHNNSRRPPSRVPSNNQFRPPFNPFSSSPVLSNSFSPSLLESYH-PDENWYLDSGATHHMTSDASSLTHSMSYGGGEHATVGDGKKVSISLIVTSFI

Query:  PSLSQKPVFLDSVLHTHAITKKLLSVARLCKDNRAFVEFHSSFFLVKDLQTNQILLKGTLEDGLYRLVVVSSVSSNVPVVHSSFSATVKSPSVFLALVAA
         S S     L  V H   I+  L+SVA+ C +N A +EF S+ F VKDL T ++L +G LE+GLYR  V++  S  V  V ++ S+T  S +  +     
Subjt:  PSLSQKPVFLDSVLHTHAITKKLLSVARLCKDNRAFVEFHSSFFLVKDLQTNQILLKGTLEDGLYRLVVVSSVSSNVPVVHSSFSATVKSPSVFLALVAA

Query:  PIWHLRLGHPSDATLRQILSQLHVSFSSSAH----VACGSCQMAKSHCLLFSLAENKSCKPFKLVHSNVWGPSPQVSINGAHYFLLFID
         +WH RLGH S   + QI+   +VSF  + +      C SCQ+AKSH L   ++ + + KP +LVH+++WGP+P  S +GA YF+LF+D
Subjt:  PIWHLRLGHPSDATLRQILSQLHVSFSSSAH----VACGSCQMAKSHCLLFSLAENKSCKPFKLVHSNVWGPSPQVSINGAHYFLLFID

A0A438IG92 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | e value: 2.4e-57 | % identity: 34.83
Query:  LKTQLQRIKKDGLSVSQYLSKIKEIGDKFLAIGEPISYRDHFAHILDGLGSDYNAFVTYIQNRSDNPALEDVRSLLLAYEARLEKQNSVDQLNLAQASLA
        L+ +LQ  KK  +S+  Y+ KIK   D   AIGEP+S +D   ++L GLGSDYNA VT I  R D  +LE + S+LLA+E RLE+Q+S++Q++   AS +
Subjt:  LKTQLQRIKKDGLSVSQYLSKIKEIGDKFLAIGEPISYRDHFAHILDGLGSDYNAFVTYIQNRSDNPALEDVRSLLLAYEARLEKQNSVDQLNLAQASLA

Query:  TLQ---LHNNSRRPPSRVPSNNQF---------------RPPFNPFSSSP--VLSNSFSPSLLESYH---------------------------------
          +      N  R     P+NN +               R   +P S  P   L   F  +    YH                                 
Subjt:  TLQ---LHNNSRRPPSRVPSNNQF---------------RPPFNPFSSSP--VLSNSFSPSLLESYH---------------------------------

Query:  ---PDENWYLDSGATHHMTSDASSLTHSMSYGGGEHATVGDGKKVSISLIVTSFIPSLSQKPVFLDSVLHTHAITKKLLSVARLCKDNRAFVEFHSSFFL
            DE+WYLDSGA+HH+T +  +LT +  Y G +  T+G+GK +SIS I +  + S +     L  V H   I+  L+SVA+ C +N A +EFHS+ F 
Subjt:  ---PDENWYLDSGATHHMTSDASSLTHSMSYGGGEHATVGDGKKVSISLIVTSFIPSLSQKPVFLDSVLHTHAITKKLLSVARLCKDNRAFVEFHSSFFL

Query:  VKDLQTNQILLKGTLEDGLYRLVVVSSVS-----SNVPVVHSSFSATVKSPSVFLALVAAPIWHLRLGHPSDATLRQILSQLHVSFSSSAHVACGSCQMA
        VKDL T  +L +G LE+GLY+  V S++      +N    HS FS+TV++         A +WH RLGH S   + ++++  +V+        C  CQ+A
Subjt:  VKDLQTNQILLKGTLEDGLYRLVVVSSVS-----SNVPVVHSSFSATVKSPSVFLALVAAPIWHLRLGHPSDATLRQILSQLHVSFSSSAHVACGSCQMA

Query:  KSHCLLFSLAENKSCKPFKLVHSNVWGPSPQVSINGAHYFLLFID
        KSH L   L+   + KP +LV++++WGP+   S +GA YF+LF+D
Subjt:  KSHCLLFSLAENKSCKPFKLVHSNVWGPSPQVSINGAHYFLLFID

A5BFR8 Integrase catalytic domain-containing protein | e value: 2.4e-57 | % identity: 34.83
Query:  LKTQLQRIKKDGLSVSQYLSKIKEIGDKFLAIGEPISYRDHFAHILDGLGSDYNAFVTYIQNRSDNPALEDVRSLLLAYEARLEKQNSVDQLNLAQASLA
        L+ +LQ  KK  +S+  Y+ KIK   D   AIGEP+S +D   ++L GLGSDYNA VT I  R D  +LE + S+LLA+E RLE+Q+S++Q++   AS +
Subjt:  LKTQLQRIKKDGLSVSQYLSKIKEIGDKFLAIGEPISYRDHFAHILDGLGSDYNAFVTYIQNRSDNPALEDVRSLLLAYEARLEKQNSVDQLNLAQASLA

Query:  TLQ---LHNNSRRPPSRVPSNNQF---------------RPPFNPFSSSP--VLSNSFSPSLLESYH---------------------------------
          +      N  R     P+NN +               R   +P S  P   L   F  +    YH                                 
Subjt:  TLQ---LHNNSRRPPSRVPSNNQF---------------RPPFNPFSSSP--VLSNSFSPSLLESYH---------------------------------

Query:  ---PDENWYLDSGATHHMTSDASSLTHSMSYGGGEHATVGDGKKVSISLIVTSFIPSLSQKPVFLDSVLHTHAITKKLLSVARLCKDNRAFVEFHSSFFL
            DE+WYLDSGA+HH+T +  +LT +  Y G +  T+G+GK +SIS I +  + S +     L  V H   I+  L+SVA+ C +N A +EFHS+ F 
Subjt:  ---PDENWYLDSGATHHMTSDASSLTHSMSYGGGEHATVGDGKKVSISLIVTSFIPSLSQKPVFLDSVLHTHAITKKLLSVARLCKDNRAFVEFHSSFFL

Query:  VKDLQTNQILLKGTLEDGLYRLVVVSSVS-----SNVPVVHSSFSATVKSPSVFLALVAAPIWHLRLGHPSDATLRQILSQLHVSFSSSAHVACGSCQMA
        VKDL T  +L +G LE+GLY+  V S++      +N    HS FS+TV++         A +WH RLGH S   + ++++  +V+        C  CQ+A
Subjt:  VKDLQTNQILLKGTLEDGLYRLVVVSSVS-----SNVPVVHSSFSATVKSPSVFLALVAAPIWHLRLGHPSDATLRQILSQLHVSFSSSAHVACGSCQMA

Query:  KSHCLLFSLAENKSCKPFKLVHSNVWGPSPQVSINGAHYFLLFID
        KSH L   L+   + KP +LV++++WGP+   S +GA YF+LF+D
Subjt:  KSHCLLFSLAENKSCKPFKLVHSNVWGPSPQVSINGAHYFLLFID

SwissProt top hits (e value, % identity, alignment)
P04146 Copia protein | e value: 1.8e-06 | % identity: 22.47
Query:  SLHGKVGLKTQLQRIK-KDGLSVSQYLSKIKEIGDKFLAIGEPISYRDHFAHILDGLGSDYNAFVTYIQNRS-DNPALEDVRSLLLAYEARLEKQNSVDQ
        SL  ++ L+ +L  +K    +S+  +     E+  + LA G  I   D  +H+L  L S Y+  +T I+  S +N  L  V++ LL  E +++  +    
Subjt:  SLHGKVGLKTQLQRIK-KDGLSVSQYLSKIKEIGDKFLAIGEPISYRDHFAHILDGLGSDYNAFVTYIQNRS-DNPALEDVRSLLLAYEARLEKQNSVDQ

Query:  LNLAQASLATLQLHNNSR------------RPPSRVPSNNQFRPPFNP-----------FSSSPVLSN-----------------SFSPSLLESYHPDEN
         N     +    +HNN+             +P      N++++   +            F    +L+N                 +F    + +    +N
Subjt:  LNLAQASLATLQLHNNSR------------RPPSRVPSNNQFRPPFNP-----------FSSSPVLSN-----------------SFSPSLLESYHPDEN

Query:  --WYLDSGATHHMTSDASSLTHSMSYGGGEHATVGDGKKVSISLIVTSFIPSLSQKPVFLDSVLHTHAITKKLLSVARLCKDNRAFVEFHSSFFLVKDLQ
          + LDSGA+ H+ +D S  T S+         V    +  I       +   +   + L+ VL        L+SV RL ++    +EF  S   +    
Subjt:  --WYLDSGATHHMTSDASSLTHSMSYGGGEHATVGDGKKVSISLIVTSFIPSLSQKPVFLDSVLHTHAITKKLLSVARLCKDNRAFVEFHSSFFLVKDLQ

Query:  TNQILLKGTLEDGLYRLVVVSSVSSNVPVVH-SSFSATVKSPSVFLALVAAPIWHLRLGHPSDATLRQILSQLHVSFSSSAH------VACGSCQMAKSH
                  ++GL  +V  S + +NVPV++  ++S   K  + F       +WH R GH SD  L +I  +   S  S  +        C  C   K  
Subjt:  TNQILLKGTLEDGLYRLVVVSSVSSNVPVVH-SSFSATVKSPSVFLALVAAPIWHLRLGHPSDATLRQILSQLHVSFSSSAH------VACGSCQMAKSH

Query:  CLLFSLAENKS--CKPFKLVHSNVWGPSPQVSINGAHYFLLFIDR
         L F   ++K+   +P  +VHS+V GP   V+++  +YF++F+D+
Subjt:  CLLFSLAENKS--CKPFKLVHSNVWGPSPQVSINGAHYFLLFIDR

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | e value: 4.0e-09 | % identity: 22.58
Query:  SLHGKVGLKTQLQRI-KKDGLSVSQYLSKIKEIGDKFLAIGEPISYRDHFAHILDGLGSDYNAFVTYIQNRSDNPALEDVRSLLLAYEARLEKQNSVDQL
        +L  K+ LK QL  +   +G +   +L+    +  +   +G  I   D    +L+ L S Y+   T I +      L+DV S LL  E   +K  +  Q 
Subjt:  SLHGKVGLKTQLQRI-KKDGLSVSQYLSKIKEIGDKFLAIGEPISYRDHFAHILDGLGSDYNAFVTYIQNRSDNPALEDVRSLLLAYEARLEKQNSVDQL

Query:  NLAQASLATLQLHNNS-----------RRPPSRVPSNNQFRPP--------------------FNPFSSSPVLSNSFSPSLL-----ESYH---PDENWY
         + +    + Q  +N+            R  SRV +      P                     N  +++ ++ N+ +  L      E  H   P+  W 
Subjt:  NLAQASLATLQLHNNS-----------RRPPSRVPSNNQFRPP--------------------FNPFSSSPVLSNSFSPSLL-----ESYH---PDENWY

Query:  LDSGATHHMTSDASSLTHSMSYGGGEHATV--GDGKKVSISLIVTSFIPSLSQKPVFLDSVLHTHAITKKLLSVARLCKDNRAFVEFHSSFFLVKDLQTN
        +D+ A+HH T           Y  G+  TV  G+     I+ I    I +     + L  V H   +   L+S   L +D       +  + L K    +
Subjt:  LDSGATHHMTSDASSLTHSMSYGGGEHATV--GDGKKVSISLIVTSFIPSLSQKPVFLDSVLHTHAITKKLLSVARLCKDNRAFVEFHSSFFLVKDLQTN

Query:  QILLKGTLEDGLYRLVVVSSVSSNVPVVHSSFSATVKSPSVFLALVAAPIWHLRLGHPSDATLRQILSQLHVSFSSSAHV-ACGSCQMAKSHCLLFSLAE
         ++ KG     LYR        +N  +     +A     SV        +WH R+GH S+  L+ +  +  +S++    V  C  C   K H + F  + 
Subjt:  QILLKGTLEDGLYRLVVVSSVSSNVPVVHSSFSATVKSPSVFLALVAAPIWHLRLGHPSDATLRQILSQLHVSFSSSAHV-ACGSCQMAKSHCLLFSLAE

Query:  NKSCKPFKLVHSNVWGPSPQVSINGAHYFLLFID
         +      LV+S+V GP    S+ G  YF+ FID
Subjt:  NKSCKPFKLVHSNVWGPSPQVSINGAHYFLLFID

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 | e value: 1.0e-33 | % identity: 27.17
Query:  LKTQLQRIKKDGLSVSQYLSKIKEIGDKFLAIGEPISYRDHFAHILDGLGSDYNAFVTYIQNRSDNPALEDVRSLLLAYEARLEKQNSVDQLNLAQASLA
        L+TQL++  K   ++  Y+  +    D+   +G+P+ + +    +L+ L  +Y   +  I  +   P L ++   LL +E+++   +S   + +   +++
Subjt:  LKTQLQRIKKDGLSVSQYLSKIKEIGDKFLAIGEPISYRDHFAHILDGLGSDYNAFVTYIQNRSDNPALEDVRSLLLAYEARLEKQNSVDQLNLAQASLA

Query:  ----TLQLHNNSRRPPSRV--------------------PSNNQFRPPFNPFSSSPVLSNS----------------------FSP------SLLESYHP
            T   +NN+    +R                     P+NNQ +P         V  +S                      F+P        L S + 
Subjt:  ----TLQLHNNSRRPPSRV--------------------PSNNQFRPPFNPFSSSPVLSNS----------------------FSP------SLLESYHP

Query:  DENWYLDSGATHHMTSDASSLTHSMSYGGGEHATVGDGKKVSISLIVTSFIPSLSQKPVFLDSVLHTHAITKKLLSVARLCKDNRAFVEFHSSFFLVKDL
          NW LDSGATHH+TSD ++L+    Y GG+   V DG  + IS   ++ + S   +P+ L ++L+   I K L+SV RLC  N   VEF  + F VKDL
Subjt:  DENWYLDSGATHHMTSDASSLTHSMSYGGGEHATVGDGKKVSISLIVTSFIPSLSQKPVFLDSVLHTHAITKKLLSVARLCKDNRAFVEFHSSFFLVKDL

Query:  QTNQILLKGTLEDGLYRLVVVSSVSSNVPVVHSSFSATVKSPSVFLALVAAPIWHLRLGHPSDATLRQILSQLHVSFSSSAH--VACGSCQMAKSHCLLF
         T   LL+G  +D LY   + SS   ++    SS  AT  S            WH RLGHP+ + L  ++S   +S  + +H  ++C  C + KS+ + F
Subjt:  QTNQILLKGTLEDGLYRLVVVSSVSSNVPVVHSSFSATVKSPSVFLALVAAPIWHLRLGHPSDATLRQILSQLHVSFSSSAH--VACGSCQMAKSHCLLF

Query:  SLAENKSCKPFKLVHSNVWGPSPQVSINGAHYFLLFID
        S +   S +P + ++S+VW  SP +S +   Y+++F+D
Subjt:  SLAENKSCKPFKLVHSNVWGPSPQVSINGAHYFLLFID

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 | e value: 2.1e-34 | % identity: 28.95
Query:  DKFLAIGEPISYRDHFAHILDGLGSDYNAFVTYIQNRSDNPALEDVRSLLLAYEARLEKQNSVDQLNLAQASLAT----------------LQLHNNSRR
        D+   +G+P+ + +    +L+ L  DY   +  I  +   P+L ++   L+  E++L   NS + + +  A++ T                   +NN+ R
Subjt:  DKFLAIGEPISYRDHFAHILDGLGSDYNAFVTYIQNRSDNPALEDVRSLLLAYEARLEKQNSVDQLNLAQASLAT----------------LQLHNNSRR

Query:  PPSRVPSNNQFR--------------------------PPFNPFSSSPVLSNSFSP---------SLLESYHPDENWYLDSGATHHMTSDASSLTHSMSY
          S  PS++  R                          P  + F S+     S SP           + S +   NW LDSGATHH+TSD ++L+    Y
Subjt:  PPSRVPSNNQFR--------------------------PPFNPFSSSPVLSNSFSP---------SLLESYHPDENWYLDSGATHHMTSDASSLTHSMSY

Query:  GGGEHATVGDGKKVSISLIVTSFIPSLSQKPVFLDSVLHTHAITKKLLSVARLCKDNRAFVEFHSSFFLVKDLQTNQILLKGTLEDGLYRLVVVSSVSSN
         GG+   + DG  + I+   ++ +P+ S + + L+ VL+   I K L+SV RLC  NR  VEF  + F VKDL T   LL+G  +D LY   + SS +  
Subjt:  GGGEHATVGDGKKVSISLIVTSFIPSLSQKPVFLDSVLHTHAITKKLLSVARLCKDNRAFVEFHSSFFLVKDLQTNQILLKGTLEDGLYRLVVVSSVSSN

Query:  VPVVHSSFSATVKSPSVFLALVAAPIWHLRLGHPSDATLRQILSQLHVSFSSSAH--VACGSCQMAKSHCLLFSLAENKSCKPFKLVHSNVWGPSPQVSI
        V +  S  S    S            WH RLGHPS A L  ++S   +   + +H  ++C  C + KSH + FS +   S KP + ++S+VW  SP +SI
Subjt:  VPVVHSSFSATVKSPSVFLALVAAPIWHLRLGHPSDATLRQILSQLHVSFSSSAH--VACGSCQMAKSHCLLFSLAENKSCKPFKLVHSNVWGPSPQVSI

Query:  NGAHYFLLFID
        +   Y+++F+D
Subjt:  NGAHYFLLFID

Arabidopsis top hits (e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGATGGATGGATTCAGAGAGAGTGGGTTGGAGGGAGGGAGATCTGGAACGGGGAGGGAGCAGGGAGTCCCTCCCTCTCCGACGCCGGAATCGAAGGGGGTTTTGCCCCT
TCTAGCCGTCGCCGCTCGCGCCACCGCCTGCTCGCCGCGCCGTCCAGCCGCCTCTTTCCTCCTCGCGTCGCCCAGCCGTGCCGAGCTCCCAGCGCGCCGTCGCCTTTCCT
CTTCGCGCCGCCGTCCATCTCGTCTCTCGCTCGGTTTCGCCGGAACCCACGCCGCCGCTCTTTTTCCCCCTCTATTCCTTGCGATTCAACAAGCCAGATTCGCGTGCGTC
CAGCAGCCTGAGCCTCGCTTTTGTGCGATTTTGCCTCTGTCCAGCAAGCGATTTGGCCTCGAATCTCCTTGTCGGCGCCGTCTAAGTGTTCGATTGGGTTCGATACACTT
CAAACTCGAATACCCACTAGCCAAGGAGTATTCTAACACACTGTTCAAGTGGCCAAACTCCTCAGCACAATGCTCCAACTTGAAATTCACGTATAGCCTTTTTGCTCCAA
GAACACTACCAAAACAATACACCTACTATAACTCTTGGGACTCAAGATCCGGCTTGTGGGAACCCAAAACTAGTCTTGAAAGGGATTTACAGCAAGGTCACTCCACTAAA
GACCCACAACTGCACTCTTCTCACTGTAGAATATTTCTGTGTCCACGGATATTGACCAATTGCAAAATTCCTCTCGGGCCAGGAGAGGACAGCGCGCTTTTGTTCAAGCC
CCGGAATCAGCCCTTAAGGGAACACACATCTACTTGCCTCAATAGGGGAAGGAGTGAATTCCATCGTGTACTGTTATGTTCCCAGCCCCCATTCGGTCTTGCCCCTGATA
TGGATACCCCCACCCGCATGTCTCCTACATGGATGCTTTGGATCATTGCATCTGTATCGGATACAAGGTGGGCCGTATCACATAGTGTCACCAGGATAAGGTGTTTAAAA
TTTTGGGTTTGTCCTATGTACGCCGTTATGTTGCCGAAATTTTCGGAGCTCTCGGTAGCATGGGTGAGAGTGGCCAATACGCCGACTCAATATGCCTTCCTTTTTGGGGA
CAAGACCGAGTGGGAGGCTGGGGACATGACAACACAAGAAGGAATTCACTCCTTCCCACTTCTAGGAGAAGTAAATGAGTGTTCCCTTAAGTGGTTGTTCATTAGAGGAG
CGCTGGTACTTAAGGACACAGATGTAACACAGGGAGCAATCAAGAAAGGCGTTCTTGAAGAATTCAGAGTCGTCAACGGAGGTGCTCCCGGTGCAATGAAGGCCTGTAGT
TTTGCATCGTTGGATAAGATTTCATCTGCTTCTCTCTTCGTCTTCTATATTCTTGATGGTATCAGAGCTTTGAGACTCTTTAGACATGTCAACCGGAGCTTCTTCTTCTT
CGTCGTCTGGTGCACTCACTCCATCGGTGGTTACTCCATCAACGCCAAATACAACCCTCGTGGTAACTCCGATTACACAGAGACTTCAGACCCCCATTCAGCCTCGACAC
CTTTGGCTCTGTTCCGCCTCCACCTAAAGTTCTTGATGATCAGCAGCTGCAACCTAATCCTGATTTCACTACATGGGAAAGTGGGACTTAAAACACAATTGCAACGCATT
AAGAAAGATGGTCTTAGTGTTAGTCAGTATTTGTCTAAGATTAAAGAAATTGGTGATAAGTTCTTGGCCATTGGGGAGCCTATTTCTTATAGAGATCATTTTGCTCATAT
ATTAGATGGACTTGGGAGTGATTATAATGCTTTTGTAACTTATATTCAAAATCGATCGGATAACCCTGCCTTAGAGGATGTGCGTAGTTTGTTATTGGCTTATGAGGCTA
GATTGGAAAAACAAAATAGTGTTGATCAACTTAACCTAGCTCAGGCGAGTCTTGCTACTCTCCAACTCCACAATAACAGCCGTAGACCTCCTTCTCGTGTCCCTTCGAAT
AATCAGTTTAGACCTCCCTTCAATCCATTTTCTTCCTCCCCTGTTTTATCCAATTCATTTTCCCCTAGTCTGCTTGAATCGTACCACCCTGATGAGAATTGGTACCTTGA
CTCCGGTGCAACACATCACATGACGTCGGATGCTTCCTCTCTCACTCATTCCATGTCATATGGTGGTGGTGAACATGCCACTGTTGGGGATGGTAAGAAAGTCAGTATAT
CTCTTATTGTTACTTCTTTTATTCCTTCCTTATCTCAAAAACCTGTTTTTCTTGATTCGGTTCTTCATACCCATGCTATTACTAAAAAGTTACTTAGTGTGGCACGCCTT
TGTAAGGATAATCGTGCATTTGTTGAATTTCACTCCTCCTTTTTTCTTGTTAAGGATCTTCAAACCAACCAAATTCTGCTCAAGGGAACTCTTGAAGATGGGCTCTATCG
TTTGGTTGTTGTCTCCTCAGTTTCTTCCAATGTTCCTGTTGTCCATTCTTCTTTCTCCGCTACTGTCAAGTCCCCCTCTGTCTTCTTGGCTTTGGTTGCTGCTCCAATTT
GGCACCTGCGTTTGGGCCATCCTAGTGATGCTACTTTACGCCAAATTTTGTCCCAGCTCCATGTGTCTTTTTCGTCCTCTGCTCATGTCGCTTGTGGCTCTTGTCAAATG
GCTAAAAGCCATTGTTTACTGTTTTCTTTAGCTGAAAATAAATCTTGTAAACCTTTTAAGTTAGTGCATTCCAATGTATGGGGACCTTCCCCTCAAGTTTCTATTAACGG
AGCTCATTATTTCCTTCTTTTCATTGATCGTAGTATCAAATCTAATGTTCCAAGATCTGGAAGCCTGTTCCAACCCATAAATGGATCGATTCAGTTTACAAACTTTTTGC
TCCTGACCTTGGGATATGAACCCTTTGGGTTGAGACATAAAGATATTCTCTTCAAGATTACCATTCAGAAAGGCAGTTTTGACATCCATTTGTCATATCTCATAGTCATA
ATATGTGGCTATGGATAA
mRNA sequence
ATGATGGATGGATTCAGAGAGAGTGGGTTGGAGGGAGGGAGATCTGGAACGGGGAGGGAGCAGGGAGTCCCTCCCTCTCCGACGCCGGAATCGAAGGGGGTTTTGCCCCT
TCTAGCCGTCGCCGCTCGCGCCACCGCCTGCTCGCCGCGCCGTCCAGCCGCCTCTTTCCTCCTCGCGTCGCCCAGCCGTGCCGAGCTCCCAGCGCGCCGTCGCCTTTCCT
CTTCGCGCCGCCGTCCATCTCGTCTCTCGCTCGGTTTCGCCGGAACCCACGCCGCCGCTCTTTTTCCCCCTCTATTCCTTGCGATTCAACAAGCCAGATTCGCGTGCGTC
CAGCAGCCTGAGCCTCGCTTTTGTGCGATTTTGCCTCTGTCCAGCAAGCGATTTGGCCTCGAATCTCCTTGTCGGCGCCGTCTAAGTGTTCGATTGGGTTCGATACACTT
CAAACTCGAATACCCACTAGCCAAGGAGTATTCTAACACACTGTTCAAGTGGCCAAACTCCTCAGCACAATGCTCCAACTTGAAATTCACGTATAGCCTTTTTGCTCCAA
GAACACTACCAAAACAATACACCTACTATAACTCTTGGGACTCAAGATCCGGCTTGTGGGAACCCAAAACTAGTCTTGAAAGGGATTTACAGCAAGGTCACTCCACTAAA
GACCCACAACTGCACTCTTCTCACTGTAGAATATTTCTGTGTCCACGGATATTGACCAATTGCAAAATTCCTCTCGGGCCAGGAGAGGACAGCGCGCTTTTGTTCAAGCC
CCGGAATCAGCCCTTAAGGGAACACACATCTACTTGCCTCAATAGGGGAAGGAGTGAATTCCATCGTGTACTGTTATGTTCCCAGCCCCCATTCGGTCTTGCCCCTGATA
TGGATACCCCCACCCGCATGTCTCCTACATGGATGCTTTGGATCATTGCATCTGTATCGGATACAAGGTGGGCCGTATCACATAGTGTCACCAGGATAAGGTGTTTAAAA
TTTTGGGTTTGTCCTATGTACGCCGTTATGTTGCCGAAATTTTCGGAGCTCTCGGTAGCATGGGTGAGAGTGGCCAATACGCCGACTCAATATGCCTTCCTTTTTGGGGA
CAAGACCGAGTGGGAGGCTGGGGACATGACAACACAAGAAGGAATTCACTCCTTCCCACTTCTAGGAGAAGTAAATGAGTGTTCCCTTAAGTGGTTGTTCATTAGAGGAG
CGCTGGTACTTAAGGACACAGATGTAACACAGGGAGCAATCAAGAAAGGCGTTCTTGAAGAATTCAGAGTCGTCAACGGAGGTGCTCCCGGTGCAATGAAGGCCTGTAGT
TTTGCATCGTTGGATAAGATTTCATCTGCTTCTCTCTTCGTCTTCTATATTCTTGATGGTATCAGAGCTTTGAGACTCTTTAGACATGTCAACCGGAGCTTCTTCTTCTT
CGTCGTCTGGTGCACTCACTCCATCGGTGGTTACTCCATCAACGCCAAATACAACCCTCGTGGTAACTCCGATTACACAGAGACTTCAGACCCCCATTCAGCCTCGACAC
CTTTGGCTCTGTTCCGCCTCCACCTAAAGTTCTTGATGATCAGCAGCTGCAACCTAATCCTGATTTCACTACATGGGAAAGTGGGACTTAAAACACAATTGCAACGCATT
AAGAAAGATGGTCTTAGTGTTAGTCAGTATTTGTCTAAGATTAAAGAAATTGGTGATAAGTTCTTGGCCATTGGGGAGCCTATTTCTTATAGAGATCATTTTGCTCATAT
ATTAGATGGACTTGGGAGTGATTATAATGCTTTTGTAACTTATATTCAAAATCGATCGGATAACCCTGCCTTAGAGGATGTGCGTAGTTTGTTATTGGCTTATGAGGCTA
GATTGGAAAAACAAAATAGTGTTGATCAACTTAACCTAGCTCAGGCGAGTCTTGCTACTCTCCAACTCCACAATAACAGCCGTAGACCTCCTTCTCGTGTCCCTTCGAAT
AATCAGTTTAGACCTCCCTTCAATCCATTTTCTTCCTCCCCTGTTTTATCCAATTCATTTTCCCCTAGTCTGCTTGAATCGTACCACCCTGATGAGAATTGGTACCTTGA
CTCCGGTGCAACACATCACATGACGTCGGATGCTTCCTCTCTCACTCATTCCATGTCATATGGTGGTGGTGAACATGCCACTGTTGGGGATGGTAAGAAAGTCAGTATAT
CTCTTATTGTTACTTCTTTTATTCCTTCCTTATCTCAAAAACCTGTTTTTCTTGATTCGGTTCTTCATACCCATGCTATTACTAAAAAGTTACTTAGTGTGGCACGCCTT
TGTAAGGATAATCGTGCATTTGTTGAATTTCACTCCTCCTTTTTTCTTGTTAAGGATCTTCAAACCAACCAAATTCTGCTCAAGGGAACTCTTGAAGATGGGCTCTATCG
TTTGGTTGTTGTCTCCTCAGTTTCTTCCAATGTTCCTGTTGTCCATTCTTCTTTCTCCGCTACTGTCAAGTCCCCCTCTGTCTTCTTGGCTTTGGTTGCTGCTCCAATTT
GGCACCTGCGTTTGGGCCATCCTAGTGATGCTACTTTACGCCAAATTTTGTCCCAGCTCCATGTGTCTTTTTCGTCCTCTGCTCATGTCGCTTGTGGCTCTTGTCAAATG
GCTAAAAGCCATTGTTTACTGTTTTCTTTAGCTGAAAATAAATCTTGTAAACCTTTTAAGTTAGTGCATTCCAATGTATGGGGACCTTCCCCTCAAGTTTCTATTAACGG
AGCTCATTATTTCCTTCTTTTCATTGATCGTAGTATCAAATCTAATGTTCCAAGATCTGGAAGCCTGTTCCAACCCATAAATGGATCGATTCAGTTTACAAACTTTTTGC
TCCTGACCTTGGGATATGAACCCTTTGGGTTGAGACATAAAGATATTCTCTTCAAGATTACCATTCAGAAAGGCAGTTTTGACATCCATTTGTCATATCTCATAGTCATA
ATATGTGGCTATGGATAA
Protein sequence
MMDGFRESGLEGGRSGTGREQGVPPSPTPESKGVLPLLAVAARATACSPRRPAASFLLASPSRAELPARRRLSSSRRRPSRLSLGFAGTHAAALFPPLFLAIQQARFACV
QQPEPRFCAILPLSSKRFGLESPCRRRLSVRLGSIHFKLEYPLAKEYSNTLFKWPNSSAQCSNLKFTYSLFAPRTLPKQYTYYNSWDSRSGLWEPKTSLERDLQQGHSTK
DPQLHSSHCRIFLCPRILTNCKIPLGPGEDSALLFKPRNQPLREHTSTCLNRGRSEFHRVLLCSQPPFGLAPDMDTPTRMSPTWMLWIIASVSDTRWAVSHSVTRIRCLK
FWVCPMYAVMLPKFSELSVAWVRVANTPTQYAFLFGDKTEWEAGDMTTQEGIHSFPLLGEVNECSLKWLFIRGALVLKDTDVTQGAIKKGVLEEFRVVNGGAPGAMKACS
FASLDKISSASLFVFYILDGIRALRLFRHVNRSFFFFVVWCTHSIGGYSINAKYNPRGNSDYTETSDPHSASTPLALFRLHLKFLMISSCNLILISLHGKVGLKTQLQRI
KKDGLSVSQYLSKIKEIGDKFLAIGEPISYRDHFAHILDGLGSDYNAFVTYIQNRSDNPALEDVRSLLLAYEARLEKQNSVDQLNLAQASLATLQLHNNSRRPPSRVPSN
NQFRPPFNPFSSSPVLSNSFSPSLLESYHPDENWYLDSGATHHMTSDASSLTHSMSYGGGEHATVGDGKKVSISLIVTSFIPSLSQKPVFLDSVLHTHAITKKLLSVARL
CKDNRAFVEFHSSFFLVKDLQTNQILLKGTLEDGLYRLVVVSSVSSNVPVVHSSFSATVKSPSVFLALVAAPIWHLRLGHPSDATLRQILSQLHVSFSSSAHVACGSCQM
AKSHCLLFSLAENKSCKPFKLVHSNVWGPSPQVSINGAHYFLLFIDRSIKSNVPRSGSLFQPINGSIQFTNFLLLTLGYEPFGLRHKDILFKITIQKGSFDIHLSYLIVI
ICGYG
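
The CDS and protein records can be cross-checked by translating the nucleotide sequence codon by codon. The following is a minimal sketch, assuming the standard nuclear genetic code; the record does not say which translation table was used. `CODON_TABLE` and `translate` are hypothetical names introduced for illustration.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS into one-letter amino acids ('*' marks a stop codon).

    Whitespace and line breaks are ignored, so a sequence pasted across
    several lines can be passed in directly. Codons containing characters
    other than A, C, G, T are reported as 'X'.
    """
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return "".join(
        CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, len(cds), 3)
    )


# First five and last six codons of the CDS listed for Lag0038509.
print(translate("ATGATGGATGGATTC"))     # MMDGF
print(translate("ATATGTGGCTATGGATAA"))  # ICGYG*
```

The two fragments reproduce the start (`MMDGF`) and the end (`ICGYG`, followed by a stop codon) of the protein sequence listed for this gene.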