CuGenDBv2

Lag0039747 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0039747
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Retrovirus-related Pol polyprotein from transposon RE1
Genome location: chr2:49240494..49244502
RNA-Seq Expression: Lag0039747
Synteny: Lag0039747
Gene Ontology terms: NA
InterPro domains: IPR025724 - GAG-pre-integrase domain
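
The genome location above is written as a sequence name followed by a start..end range. A minimal sketch of splitting that string and computing the locus span follows; the function names `parse_location` and `span_length` are hypothetical, and the 1-based inclusive coordinate convention is an assumption that the page itself does not state.

```python
import re


def parse_location(loc):
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", loc)
    if match is None:
        raise ValueError(f"unrecognized location: {loc!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start, end):
    # Assumes 1-based, inclusive coordinates (an assumption, not stated on the page).
    return end - start + 1


seqid, start, end = parse_location("chr2:49240494..49244502")
print(seqid, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption, the locus spans 4009 bp on chr2.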


Homology
GenBank top hits | e value | % identity | Alignment
RVW53406.1 Retrovirus-related Pol polyprotein from transposon RE2 [Vitis vinifera] | e value: 4.0e-66 | % identity: 35.02
Query:  TLPQPLSVKLSDNNFLLWKNQLLNVVLANGLSGFLDGSISAPPKFLDDQQLQTNPEFLVWERKN----------------GRNCCFDTAADIWNSLKKSY
        +L   L +KL  NN++LWK Q+ NVV ANG   ++D +   PP+ L   +L  NP+F+ W R +                G+   F T+ D W +L K +
Subjt:  TLPQPLSVKLSDNNFLLWKNQLLNVVLANGLSGFLDGSISAPPKFLDDQQLQTNPEFLVWERKN----------------GRNCCFDTAADIWNSLKKSY

Query:  DSKTTARIMGLKTQLQKIKKDGLSVSQYLAQIKDVADKFSAIGEPISYRDHLAHILDGLGCEYNPFVTSIQNRSDNPSLEDVRSLLLAYEARLEKQTTVE
         + + ARI+ L+ + Q  KK    + +Y+ +IK ++D  +AIGEP+   DH+  +L GLG EYN  V S+  R D+ SL  V S+LL +E RL  Q T  
Subjt:  DSKTTARIMGLKTQLQKIKKDGLSVSQYLAQIKDVADKFSAIGEPISYRDHLAHILDGLGCEYNPFVTSIQNRSDNPSLEDVRSLLLAYEARLEKQTTVE

Query:  QLNLVRANLSIVQLQNSRSPSSRSPSNQFPRSPFNPPSFSPFPSQTTSSVQSILVKPQFHSNQKWHSRPS--------------PSKPQCQICGKFGHTA
              A+L  V      +PS++ P    PR     P F    S     V     +P    + + H+RP+              P++PQCQ+CGKFGHT 
Subjt:  QLNLVRANLSIVQLQNSRSPSSRSPSNQFPRSPFNPPSFSPFPSQTTSSVQSILVKPQFHSNQKWHSRPS--------------PSKPQCQICGKFGHTA

Query:  LICHHRTNLSYQIPPSPQAMLTTTSPNSNTHVPDTLSTMSTDSFHPDENWFLDSDATHHMTPEASSLYNLVPYHGGEQV------------TDLQSKQVL
        + C+HR +++YQ           TS  S++     L  M   + +  ++WF D+ ATHH++  A +L N+ PY G +QV            TD  +KQ L
Subjt:  LICHHRTNLSYQIPPSPQAMLTTTSPNSNTHVPDTLSTMSTDSFHPDENWFLDSDATHHMTPEASSLYNLVPYHGGEQV------------TDLQSKQVL

Query:  LRGTLEDGLYKLLPSSRASSCSTISYCSRLQHLNSAPAAFVSSTESINWHFRLGHPAAPVLQRVLASYQVSCS---NISDFCNSCQLAKSHRLPFNLSKS
        L+G L DGLY+       SS ST ++ S     NS  A     T    WH RLGHPAAP+L + LAS   S S   N    C  C LAKSH LP++LS S
Subjt:  LRGTLEDGLYKLLPSSRASSCSTISYCSRLQHLNSAPAAFVSSTESINWHFRLGHPAAPVLQRVLASYQVSCS---NISDFCNSCQLAKSHRLPFNLSKS

Query:  ESVAPFNLVHSDYY
         +  P  L+H+D +
Subjt:  ESVAPFNLVHSDYY

RVW66809.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera] | e value: 1.4e-63 | % identity: 33.45
Query:  YPTLPQPLSVKLSDNNFLLWKNQLLNVVLANGLSGFLDGSISAPPKFLDDQQLQTNPEFLVWERKN----------------GRNCCFDTAADIWNSLKK
        Y  L   L VKL   N++LWK+Q+ NVV ANG   F+DGS   P K L    +  NP F+ W R++                 +    +++   WN+L+K
Subjt:  YPTLPQPLSVKLSDNNFLLWKNQLLNVVLANGLSGFLDGSISAPPKFLDDQQLQTNPEFLVWERKN----------------GRNCCFDTAADIWNSLKK

Query:  SYDSKTTARIMGLKTQLQKIKKDGLSVSQYLAQIKDVADKFSAIGEPISYRDHLAHILDGLGCEYNPFVTSIQNRSDNPSLEDVRSLLLAYEARLEKQTT
        ++ S + ARIM L+ +LQ  KK  LS+  Y+ ++K  AD  +AIGEPI  +D + ++LDGLG +YN  VT+I  R D  S+E V S+LLA+E RLE+Q++
Subjt:  SYDSKTTARIMGLKTQLQKIKKDGLSVSQYLAQIKDVADKFSAIGEPISYRDHLAHILDGLGCEYNPFVTSIQNRSDNPSLEDVRSLLLAYEARLEKQTT

Query:  VEQLNLVRANLSIVQLQNSRSPSSRSPSNQFPRSPFNPPSFSPFPSQTTSSVQSILVKPQFHSNQKWHSRPSPSKPQCQICGKFGHTALICHHRTNLSYQ
        +EQ + + AN +     NSR    R       R   + P+ S +  +          +   +     H+  S  KPQCQ+CGKFGHT  IC+HR ++SYQ
Subjt:  VEQLNLVRANLSIVQLQNSRSPSSRSPSNQFPRSPFNPPSFSPFPSQTTSSVQSILVKPQFHSNQKWHSRPSPSKPQCQICGKFGHTALICHHRTNLSYQ

Query:  IPPSPQAMLTTTSPNSNTHVPDTLSTMSTDSFH-PDENWFLDSDATHHMTPEASSLYNLVPYHGGEQVTDLQSKQVLLRGTLEDGLYKLLPSSR------
           S     + TSP SN   P+++  M   S +  D+ W+LDS A+HH+T    +L +  PY G ++VT    K + +  T   G ++LL  SR      
Subjt:  IPPSPQAMLTTTSPNSNTHVPDTLSTMSTDSFH-PDENWFLDSDATHHMTPEASSLYNLVPYHGGEQVTDLQSKQVLLRGTLEDGLYKLLPSSR------

Query:  --------ASSCSTISYCS----------------------------------RLQHLNSAPAAFVSSTESIN---------------WHFRLGHPAAPV
                A+  S   +CS                                  R   LNS   AFV +T S                 WH RLGH +  +
Subjt:  --------ASSCSTISYCS----------------------------------RLQHLNSAPAAFVSSTESIN---------------WHFRLGHPAAPV

Query:  LQRVLASYQVSCSN-----ISDFCNSCQLAKSHRLPFNLSKSESVAPFNLVHSDYY
        + +++ S  VS         S  C+SCQLAKSHRLP +LS S +  P  LVH+D +
Subjt:  LQRVLASYQVSCSN-----ISDFCNSCQLAKSHRLPFNLSKSESVAPFNLVHSDYY

RVW69807.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera] | e value: 2.4e-79 | % identity: 35.83
Query:  PQPNALNTIAPNSYPNTYPNLLAPN-PYPTLPQPLSVKLSDNNFLLWKNQLLNVVLANGLSGFLDGSISAPPKFLDDQQLQTNPEFLVWERKN-------
        P P + +    N+  N  P +     P P+L Q LS+KL + N LL K+QLLNV++ANGL  F+D   S+PPK+LD    Q NPEF+ W+R N       
Subjt:  PQPNALNTIAPNSYPNTYPNLLAPN-PYPTLPQPLSVKLSDNNFLLWKNQLLNVVLANGLSGFLDGSISAPPKFLDDQQLQTNPEFLVWERKN-------

Query:  ---------GRNCCFDTAADIWNSLKKSYDSKTTARIMGLKTQLQKIKKDGLSVSQYLAQIKDVADKFSAIGEPISYRDHLAHILDGLGCEYNPFVTSIQ
                 G+   + TA DIW SL   Y+S + A +M L +QLQ+IKK  + +S+YL+++K V D+F+ IGEP+SYRD L  IL+GL  EY+ FVTSI 
Subjt:  ---------GRNCCFDTAADIWNSLKKSYDSKTTARIMGLKTQLQKIKKDGLSVSQYLAQIKDVADKFSAIGEPISYRDHLAHILDGLGCEYNPFVTSIQ

Query:  NRSDNPSLEDVRSLLLAYEARLEKQTTVEQLNLVRANLSIVQLQNSRSPSSRSPSNQFPRSPFNPPSFSPFPSQTTSSVQSILVKPQFHSNQKWHSRPSP
        NRSD PSL++V SLL  YE RL +++  + LN  +AN                     PR P                          ++N         
Subjt:  NRSDNPSLEDVRSLLLAYEARLEKQTTVEQLNLVRANLSIVQLQNSRSPSSRSPSNQFPRSPFNPPSFSPFPSQTTSSVQSILVKPQFHSNQKWHSRPSP

Query:  SKPQCQICGKFGHTALICHHRTNLSYQIPPSPQAMLTTTSPNSNTHVPDTLSTMSTDSFHP--------DENWFLDSDATHHMTPEASSLYNLVPYHGGE
        S PQCQICGK GH AL  +HRTNL+Y  P  P A     +PN        +S M T S  P        D +W++DS ATHH TPE   + + + Y  G+
Subjt:  SKPQCQICGKFGHTALICHHRTNLSYQIPPSPQAMLTTTSPNSNTHVPDTLSTMSTDSFHP--------DENWFLDSDATHHMTPEASSLYNLVPYHGGE

Query:  Q---------------------------------------------------------------VTDLQSKQVLLRGTLEDGLYKLLPSSRASSCSTISY
                                                                        V D Q+KQ+LL+G LE GLYKL   +  +S S++S 
Subjt:  Q---------------------------------------------------------------VTDLQSKQVLLRGTLEDGLYKLLPSSRASSCSTISY

Query:  CSRLQHLNSAPA-AFVS-STESINWHFRLGHPAAPVLQRVLASYQVSCSNISDFCNSCQLAKSHRLPFNLSKSESVAPFNLVHSDYYQREPPGTPNFLEY
            Q   + PA AF+S   + + WH RLGHPA  V+ +VL S  +  SN    C+SCQLAKSHRLPF LS+S ++ PF+LV+SD +   P  +   ++Y
Subjt:  CSRLQHLNSAPA-AFVS-STESINWHFRLGHPAAPVLQRVLASYQVSCSNISDFCNSCQLAKSHRLPFNLSKSESVAPFNLVHSDYYQREPPGTPNFLEY

RVW90852.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera] | e value: 4.9e-64 | % identity: 34.28
Query:  YPTLPQPLSVKLSDNNFLLWKNQLLNVVLANGLSGFLDGSISAPPKFLDDQQLQTNPEFLVWERKN----------------GRNCCFDTAADIWNSLKK
        Y  L   L VKL   N++LW++Q+ NV+ ANG   F+DG+   P K L   ++  NP F+ W R++                 +    +++   WN+L+K
Subjt:  YPTLPQPLSVKLSDNNFLLWKNQLLNVVLANGLSGFLDGSISAPPKFLDDQQLQTNPEFLVWERKN----------------GRNCCFDTAADIWNSLKK

Query:  SYDSKTTARIMGLKTQLQKIKKDGLSVSQYLAQIKDVADKFSAIGEPISYRDHLAHILDGLGCEYNPFVTSIQNRSDNPSLEDVRSLLLAYEARLEKQTT
         + S + ARIM L+ + Q  KK  +S+  Y+ ++K VAD  +AIGE +S +D + ++L GLG +YN  VT+I  R D  SLE V S+LLA+E RLE+Q +
Subjt:  SYDSKTTARIMGLKTQLQKIKKDGLSVSQYLAQIKDVADKFSAIGEPISYRDHLAHILDGLGCEYNPFVTSIQNRSDNPSLEDVRSLLLAYEARLEKQTT

Query:  VEQLNLVRANLSIVQLQNSRSPSSRSPSNQFPRSPFNPPSFSPFPSQTTSSVQSILVKPQFHSNQKWHSRPSPSKPQCQICGKFGHTALICHHRTNLSYQ
        +EQL ++ AN        + S ++R    ++  +    P+F      T S+ +      ++  + + +S  S  +PQCQ+CGKFGHT  +C+HR ++++Q
Subjt:  VEQLNLVRANLSIVQLQNSRSPSSRSPSNQFPRSPFNPPSFSPFPSQTTSSVQSILVKPQFHSNQKWHSRPSPSKPQCQICGKFGHTALICHHRTNLSYQ

Query:  IPPSPQAMLTTTSPNSNTH-VPDTLSTMSTDSFHPDENWFLDSDATHHMTPEASSLYNLVPYHGGEQVT---DLQSKQVLLRGTLEDGLYKL-LPSSRAS
           S Q   T  S +SN++ +P  +++ +  +   D+NW+LDS A+HH+T   ++L N  PY G ++VT       + VL +G LE+GLYK  + S++ +
Subjt:  IPPSPQAMLTTTSPNSNTH-VPDTLSTMSTDSFHPDENWFLDSDATHHMTPEASSLYNLVPYHGGEQVT---DLQSKQVLLRGTLEDGLYKL-LPSSRAS

Query:  SCSTISYCSRLQHLNSAPAAFVSSTESINWHFRLGHPAAPVLQRVLASYQVSCSNI-SDFCNSCQLAKSHRLPFNLSKSESVAPFNLVHSDYY
        +   I+  S  Q       + + +   + WH RLGH A  ++ R++ +  VSC    +  C+SCQLAKSHRLP +LS   +  P  LV++D +
Subjt:  SCSTISYCSRLQHLNSAPAAFVSSTESINWHFRLGHPAAPVLQRVLASYQVSCSNI-SDFCNSCQLAKSHRLPFNLSKSESVAPFNLVHSDYY

XP_022155181.1 uncharacterized protein LOC111022315 [Momordica charantia] | e value: 4.6e-107 | % identity: 57.98
Query:  PNLLAPNPYPTLPQPLSVKLSDNNFLLWKNQLLNVVLANGLSGFLDGSISAPPKFLDDQQLQTNPEFLVWERKNGRNCCF----------------DTAA
        PN  + NP+PTLPQPL+VKL+DNNFLLWKNQLLN V+ANGL G+LDG+I  PP+FLD  QLQ NP +  WER N    C+                +T  
Subjt:  PNLLAPNPYPTLPQPLSVKLSDNNFLLWKNQLLNVVLANGLSGFLDGSISAPPKFLDDQQLQTNPEFLVWERKNGRNCCF----------------DTAA

Query:  DIWNSLKKSYDSKTTARIMGLKTQLQKIKKDGLSVSQYLAQIKDVADKFSAIGEPISYRDHLAHILDGLGCEYNPFVTSIQNRSDNPSLEDVRSLLLAYE
        DIW+SL + YDSKTTARIMGLKT+LQ ++KDG SVSQYLA+IK++ADKF+A+GEP+SYRDHLAH+LDGLG EYN FVTSI NR+D+PSLEDVRSLLLAYE
Subjt:  DIWNSLKKSYDSKTTARIMGLKTQLQKIKKDGLSVSQYLAQIKDVADKFSAIGEPISYRDHLAHILDGLGCEYNPFVTSIQNRSDNPSLEDVRSLLLAYE

Query:  ARLEKQTTVEQLNLVRANLSIVQLQNSRSPSSRSPSNQFPRSPFNPPSFSPFPSQTTSSVQSILVKPQFHSNQKWHSRPSPSKPQCQICGKFGHTALICH
        ARL+KQ TV+QLN+ +ANL  + LQ+    +S+ P  +F        SF   P     S QSIL KPQ  S  KW  +PS SK QCQICGK GH+A +C+
Subjt:  ARLEKQTTVEQLNLVRANLSIVQLQNSRSPSSRSPSNQFPRSPFNPPSFSPFPSQTTSSVQSILVKPQFHSNQKWHSRPSPSKPQCQICGKFGHTALICH

Query:  HRTNLSYQIPPSPQAMLTTTSPNSNTHVPDTLSTMSTDSFHPDENWFLDSDATHHMTPEASSLYNLVPYHGGEQVT
        HRTN++Y    SPQA+     P S TH      +   +  HPDE+WF+DS ATHHMTP++S L N  PY GGEQVT
Subjt:  HRTNLSYQIPPSPQAMLTTTSPNSNTHVPDTLSTMSTDSFHPDENWFLDSDATHHMTPEASSLYNLVPYHGGEQVT

TrEMBL top hits | e value | % identity | Alignment
A0A438F0E0 Retrovirus-related Pol polyprotein from transposon RE2 | e value: 1.9e-66 | % identity: 35.02
Query:  TLPQPLSVKLSDNNFLLWKNQLLNVVLANGLSGFLDGSISAPPKFLDDQQLQTNPEFLVWERKN----------------GRNCCFDTAADIWNSLKKSY
        +L   L +KL  NN++LWK Q+ NVV ANG   ++D +   PP+ L   +L  NP+F+ W R +                G+   F T+ D W +L K +
Subjt:  TLPQPLSVKLSDNNFLLWKNQLLNVVLANGLSGFLDGSISAPPKFLDDQQLQTNPEFLVWERKN----------------GRNCCFDTAADIWNSLKKSY

Query:  DSKTTARIMGLKTQLQKIKKDGLSVSQYLAQIKDVADKFSAIGEPISYRDHLAHILDGLGCEYNPFVTSIQNRSDNPSLEDVRSLLLAYEARLEKQTTVE
         + + ARI+ L+ + Q  KK    + +Y+ +IK ++D  +AIGEP+   DH+  +L GLG EYN  V S+  R D+ SL  V S+LL +E RL  Q T  
Subjt:  DSKTTARIMGLKTQLQKIKKDGLSVSQYLAQIKDVADKFSAIGEPISYRDHLAHILDGLGCEYNPFVTSIQNRSDNPSLEDVRSLLLAYEARLEKQTTVE

Query:  QLNLVRANLSIVQLQNSRSPSSRSPSNQFPRSPFNPPSFSPFPSQTTSSVQSILVKPQFHSNQKWHSRPS--------------PSKPQCQICGKFGHTA
              A+L  V      +PS++ P    PR     P F    S     V     +P    + + H+RP+              P++PQCQ+CGKFGHT 
Subjt:  QLNLVRANLSIVQLQNSRSPSSRSPSNQFPRSPFNPPSFSPFPSQTTSSVQSILVKPQFHSNQKWHSRPS--------------PSKPQCQICGKFGHTA

Query:  LICHHRTNLSYQIPPSPQAMLTTTSPNSNTHVPDTLSTMSTDSFHPDENWFLDSDATHHMTPEASSLYNLVPYHGGEQV------------TDLQSKQVL
        + C+HR +++YQ           TS  S++     L  M   + +  ++WF D+ ATHH++  A +L N+ PY G +QV            TD  +KQ L
Subjt:  LICHHRTNLSYQIPPSPQAMLTTTSPNSNTHVPDTLSTMSTDSFHPDENWFLDSDATHHMTPEASSLYNLVPYHGGEQV------------TDLQSKQVL

Query:  LRGTLEDGLYKLLPSSRASSCSTISYCSRLQHLNSAPAAFVSSTESINWHFRLGHPAAPVLQRVLASYQVSCS---NISDFCNSCQLAKSHRLPFNLSKS
        L+G L DGLY+       SS ST ++ S     NS  A     T    WH RLGHPAAP+L + LAS   S S   N    C  C LAKSH LP++LS S
Subjt:  LRGTLEDGLYKLLPSSRASSCSTISYCSRLQHLNSAPAAFVSSTESINWHFRLGHPAAPVLQRVLASYQVSCS---NISDFCNSCQLAKSHRLPFNLSKS

Query:  ESVAPFNLVHSDYY
         +  P  L+H+D +
Subjt:  ESVAPFNLVHSDYY

A0A438G3M6 Retrovirus-related Pol polyprotein from transposon RE1 | e value: 6.9e-64 | % identity: 33.45
Query:  YPTLPQPLSVKLSDNNFLLWKNQLLNVVLANGLSGFLDGSISAPPKFLDDQQLQTNPEFLVWERKN----------------GRNCCFDTAADIWNSLKK
        Y  L   L VKL   N++LWK+Q+ NVV ANG   F+DGS   P K L    +  NP F+ W R++                 +    +++   WN+L+K
Subjt:  YPTLPQPLSVKLSDNNFLLWKNQLLNVVLANGLSGFLDGSISAPPKFLDDQQLQTNPEFLVWERKN----------------GRNCCFDTAADIWNSLKK

Query:  SYDSKTTARIMGLKTQLQKIKKDGLSVSQYLAQIKDVADKFSAIGEPISYRDHLAHILDGLGCEYNPFVTSIQNRSDNPSLEDVRSLLLAYEARLEKQTT
        ++ S + ARIM L+ +LQ  KK  LS+  Y+ ++K  AD  +AIGEPI  +D + ++LDGLG +YN  VT+I  R D  S+E V S+LLA+E RLE+Q++
Subjt:  SYDSKTTARIMGLKTQLQKIKKDGLSVSQYLAQIKDVADKFSAIGEPISYRDHLAHILDGLGCEYNPFVTSIQNRSDNPSLEDVRSLLLAYEARLEKQTT

Query:  VEQLNLVRANLSIVQLQNSRSPSSRSPSNQFPRSPFNPPSFSPFPSQTTSSVQSILVKPQFHSNQKWHSRPSPSKPQCQICGKFGHTALICHHRTNLSYQ
        +EQ + + AN +     NSR    R       R   + P+ S +  +          +   +     H+  S  KPQCQ+CGKFGHT  IC+HR ++SYQ
Subjt:  VEQLNLVRANLSIVQLQNSRSPSSRSPSNQFPRSPFNPPSFSPFPSQTTSSVQSILVKPQFHSNQKWHSRPSPSKPQCQICGKFGHTALICHHRTNLSYQ

Query:  IPPSPQAMLTTTSPNSNTHVPDTLSTMSTDSFH-PDENWFLDSDATHHMTPEASSLYNLVPYHGGEQVTDLQSKQVLLRGTLEDGLYKLLPSSR------
           S     + TSP SN   P+++  M   S +  D+ W+LDS A+HH+T    +L +  PY G ++VT    K + +  T   G ++LL  SR      
Subjt:  IPPSPQAMLTTTSPNSNTHVPDTLSTMSTDSFH-PDENWFLDSDATHHMTPEASSLYNLVPYHGGEQVTDLQSKQVLLRGTLEDGLYKLLPSSR------

Query:  --------ASSCSTISYCS----------------------------------RLQHLNSAPAAFVSSTESIN---------------WHFRLGHPAAPV
                A+  S   +CS                                  R   LNS   AFV +T S                 WH RLGH +  +
Subjt:  --------ASSCSTISYCS----------------------------------RLQHLNSAPAAFVSSTESIN---------------WHFRLGHPAAPV

Query:  LQRVLASYQVSCSN-----ISDFCNSCQLAKSHRLPFNLSKSESVAPFNLVHSDYY
        + +++ S  VS         S  C+SCQLAKSHRLP +LS S +  P  LVH+D +
Subjt:  LQRVLASYQVSCSN-----ISDFCNSCQLAKSHRLPFNLSKSESVAPFNLVHSDYY

A0A438GC62 Retrovirus-related Pol polyprotein from transposon RE1 | e value: 1.2e-79 | % identity: 35.83
Query:  PQPNALNTIAPNSYPNTYPNLLAPN-PYPTLPQPLSVKLSDNNFLLWKNQLLNVVLANGLSGFLDGSISAPPKFLDDQQLQTNPEFLVWERKN-------
        P P + +    N+  N  P +     P P+L Q LS+KL + N LL K+QLLNV++ANGL  F+D   S+PPK+LD    Q NPEF+ W+R N       
Subjt:  PQPNALNTIAPNSYPNTYPNLLAPN-PYPTLPQPLSVKLSDNNFLLWKNQLLNVVLANGLSGFLDGSISAPPKFLDDQQLQTNPEFLVWERKN-------

Query:  ---------GRNCCFDTAADIWNSLKKSYDSKTTARIMGLKTQLQKIKKDGLSVSQYLAQIKDVADKFSAIGEPISYRDHLAHILDGLGCEYNPFVTSIQ
                 G+   + TA DIW SL   Y+S + A +M L +QLQ+IKK  + +S+YL+++K V D+F+ IGEP+SYRD L  IL+GL  EY+ FVTSI 
Subjt:  ---------GRNCCFDTAADIWNSLKKSYDSKTTARIMGLKTQLQKIKKDGLSVSQYLAQIKDVADKFSAIGEPISYRDHLAHILDGLGCEYNPFVTSIQ

Query:  NRSDNPSLEDVRSLLLAYEARLEKQTTVEQLNLVRANLSIVQLQNSRSPSSRSPSNQFPRSPFNPPSFSPFPSQTTSSVQSILVKPQFHSNQKWHSRPSP
        NRSD PSL++V SLL  YE RL +++  + LN  +AN                     PR P                          ++N         
Subjt:  NRSDNPSLEDVRSLLLAYEARLEKQTTVEQLNLVRANLSIVQLQNSRSPSSRSPSNQFPRSPFNPPSFSPFPSQTTSSVQSILVKPQFHSNQKWHSRPSP

Query:  SKPQCQICGKFGHTALICHHRTNLSYQIPPSPQAMLTTTSPNSNTHVPDTLSTMSTDSFHP--------DENWFLDSDATHHMTPEASSLYNLVPYHGGE
        S PQCQICGK GH AL  +HRTNL+Y  P  P A     +PN        +S M T S  P        D +W++DS ATHH TPE   + + + Y  G+
Subjt:  SKPQCQICGKFGHTALICHHRTNLSYQIPPSPQAMLTTTSPNSNTHVPDTLSTMSTDSFHP--------DENWFLDSDATHHMTPEASSLYNLVPYHGGE

Query:  Q---------------------------------------------------------------VTDLQSKQVLLRGTLEDGLYKLLPSSRASSCSTISY
                                                                        V D Q+KQ+LL+G LE GLYKL   +  +S S++S 
Subjt:  Q---------------------------------------------------------------VTDLQSKQVLLRGTLEDGLYKLLPSSRASSCSTISY

Query:  CSRLQHLNSAPA-AFVS-STESINWHFRLGHPAAPVLQRVLASYQVSCSNISDFCNSCQLAKSHRLPFNLSKSESVAPFNLVHSDYYQREPPGTPNFLEY
            Q   + PA AF+S   + + WH RLGHPA  V+ +VL S  +  SN    C+SCQLAKSHRLPF LS+S ++ PF+LV+SD +   P  +   ++Y
Subjt:  CSRLQHLNSAPA-AFVS-STESINWHFRLGHPAAPVLQRVLASYQVSCSNISDFCNSCQLAKSHRLPFNLSKSESVAPFNLVHSDYYQREPPGTPNFLEY

A0A438I2E1 Retrovirus-related Pol polyprotein from transposon RE1 | e value: 2.4e-64 | % identity: 34.28
Query:  YPTLPQPLSVKLSDNNFLLWKNQLLNVVLANGLSGFLDGSISAPPKFLDDQQLQTNPEFLVWERKN----------------GRNCCFDTAADIWNSLKK
        Y  L   L VKL   N++LW++Q+ NV+ ANG   F+DG+   P K L   ++  NP F+ W R++                 +    +++   WN+L+K
Subjt:  YPTLPQPLSVKLSDNNFLLWKNQLLNVVLANGLSGFLDGSISAPPKFLDDQQLQTNPEFLVWERKN----------------GRNCCFDTAADIWNSLKK

Query:  SYDSKTTARIMGLKTQLQKIKKDGLSVSQYLAQIKDVADKFSAIGEPISYRDHLAHILDGLGCEYNPFVTSIQNRSDNPSLEDVRSLLLAYEARLEKQTT
         + S + ARIM L+ + Q  KK  +S+  Y+ ++K VAD  +AIGE +S +D + ++L GLG +YN  VT+I  R D  SLE V S+LLA+E RLE+Q +
Subjt:  SYDSKTTARIMGLKTQLQKIKKDGLSVSQYLAQIKDVADKFSAIGEPISYRDHLAHILDGLGCEYNPFVTSIQNRSDNPSLEDVRSLLLAYEARLEKQTT

Query:  VEQLNLVRANLSIVQLQNSRSPSSRSPSNQFPRSPFNPPSFSPFPSQTTSSVQSILVKPQFHSNQKWHSRPSPSKPQCQICGKFGHTALICHHRTNLSYQ
        +EQL ++ AN        + S ++R    ++  +    P+F      T S+ +      ++  + + +S  S  +PQCQ+CGKFGHT  +C+HR ++++Q
Subjt:  VEQLNLVRANLSIVQLQNSRSPSSRSPSNQFPRSPFNPPSFSPFPSQTTSSVQSILVKPQFHSNQKWHSRPSPSKPQCQICGKFGHTALICHHRTNLSYQ

Query:  IPPSPQAMLTTTSPNSNTH-VPDTLSTMSTDSFHPDENWFLDSDATHHMTPEASSLYNLVPYHGGEQVT---DLQSKQVLLRGTLEDGLYKL-LPSSRAS
           S Q   T  S +SN++ +P  +++ +  +   D+NW+LDS A+HH+T   ++L N  PY G ++VT       + VL +G LE+GLYK  + S++ +
Subjt:  IPPSPQAMLTTTSPNSNTH-VPDTLSTMSTDSFHPDENWFLDSDATHHMTPEASSLYNLVPYHGGEQVT---DLQSKQVLLRGTLEDGLYKL-LPSSRAS

Query:  SCSTISYCSRLQHLNSAPAAFVSSTESINWHFRLGHPAAPVLQRVLASYQVSCSNI-SDFCNSCQLAKSHRLPFNLSKSESVAPFNLVHSDYY
        +   I+  S  Q       + + +   + WH RLGH A  ++ R++ +  VSC    +  C+SCQLAKSHRLP +LS   +  P  LV++D +
Subjt:  SCSTISYCSRLQHLNSAPAAFVSSTESINWHFRLGHPAAPVLQRVLASYQVSCSNI-SDFCNSCQLAKSHRLPFNLSKSESVAPFNLVHSDYY

A0A6J1DQX7 uncharacterized protein LOC111022315 | e value: 2.3e-107 | % identity: 57.98
Query:  PNLLAPNPYPTLPQPLSVKLSDNNFLLWKNQLLNVVLANGLSGFLDGSISAPPKFLDDQQLQTNPEFLVWERKNGRNCCF----------------DTAA
        PN  + NP+PTLPQPL+VKL+DNNFLLWKNQLLN V+ANGL G+LDG+I  PP+FLD  QLQ NP +  WER N    C+                +T  
Subjt:  PNLLAPNPYPTLPQPLSVKLSDNNFLLWKNQLLNVVLANGLSGFLDGSISAPPKFLDDQQLQTNPEFLVWERKNGRNCCF----------------DTAA

Query:  DIWNSLKKSYDSKTTARIMGLKTQLQKIKKDGLSVSQYLAQIKDVADKFSAIGEPISYRDHLAHILDGLGCEYNPFVTSIQNRSDNPSLEDVRSLLLAYE
        DIW+SL + YDSKTTARIMGLKT+LQ ++KDG SVSQYLA+IK++ADKF+A+GEP+SYRDHLAH+LDGLG EYN FVTSI NR+D+PSLEDVRSLLLAYE
Subjt:  DIWNSLKKSYDSKTTARIMGLKTQLQKIKKDGLSVSQYLAQIKDVADKFSAIGEPISYRDHLAHILDGLGCEYNPFVTSIQNRSDNPSLEDVRSLLLAYE

Query:  ARLEKQTTVEQLNLVRANLSIVQLQNSRSPSSRSPSNQFPRSPFNPPSFSPFPSQTTSSVQSILVKPQFHSNQKWHSRPSPSKPQCQICGKFGHTALICH
        ARL+KQ TV+QLN+ +ANL  + LQ+    +S+ P  +F        SF   P     S QSIL KPQ  S  KW  +PS SK QCQICGK GH+A +C+
Subjt:  ARLEKQTTVEQLNLVRANLSIVQLQNSRSPSSRSPSNQFPRSPFNPPSFSPFPSQTTSSVQSILVKPQFHSNQKWHSRPSPSKPQCQICGKFGHTALICH

Query:  HRTNLSYQIPPSPQAMLTTTSPNSNTHVPDTLSTMSTDSFHPDENWFLDSDATHHMTPEASSLYNLVPYHGGEQVT
        HRTN++Y    SPQA+     P S TH      +   +  HPDE+WF+DS ATHHMTP++S L N  PY GGEQVT
Subjt:  HRTNLSYQIPPSPQAMLTTTSPNSNTHVPDTLSTMSTDSFHPDENWFLDSDATHHMTPEASSLYNLVPYHGGEQVT

SwissProt top hits | e value | % identity | Alignment
Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 | e value: 9.0e-37 | % identity: 24.86
Query:  KLSDNNFLLWKNQLLNVVLANGLSGFLDGSISAPPKFL-DDQQLQTNPEFLVWERKN----------------GRNCCFDTAADIWNSLKKSYDSKTTAR
        KL+  N+L+W  Q+  +     L+GFLDGS + PP  +  D   + NP++  W+R++                       TAA IW +L+K Y + +   
Subjt:  KLSDNNFLLWKNQLLNVVLANGLSGFLDGSISAPPKFL-DDQQLQTNPEFLVWERKN----------------GRNCCFDTAADIWNSLKKSYDSKTTAR

Query:  IMGLKTQLQKIKKDGLSVSQYLAQIKDVADKFSAIGEPISYRDHLAHILDGLGCEYNPFVTSIQNRSDNPSLEDVRSLLLAYEARLEKQTTVEQLNLVRA
        +  L+TQL++  K   ++  Y+  +    D+ + +G+P+ + + +  +L+ L  EY P +  I  +   P+L ++   LL +E+++     V    ++  
Subjt:  IMGLKTQLQKIKKDGLSVSQYLAQIKDVADKFSAIGEPISYRDHLAHILDGLGCEYNPFVTSIQNRSDNPSLEDVRSLLLAYEARLEKQTTVEQLNLVRA

Query:  NLSIVQLQNSRSPSSRSPSNQFPRSPFNPPSFSPFPSQTTSSVQSILVKPQFHSNQKWHSRPSPSKPQCQICGKFGHTALIC----HHRTNLSYQIPPSP
          + V  +N+ + ++ +  N+  R      + +  P Q +S+         FH N   +++  P   +CQICG  GH+A  C    H  ++++ Q PPSP
Subjt:  NLSIVQLQNSRSPSSRSPSNQFPRSPFNPPSFSPFPSQTTSSVQSILVKPQFHSNQKWHSRPSPSKPQCQICGKFGHTALIC----HHRTNLSYQIPPSP

Query:  QAMLTTTSPNSNTHVPDTLSTMSTDSFHPDENWFLDSDATHHMTPEASSLYNLVPYHGGEQVTDLQSKQVLLRGTLEDGLYKLLPSSRASSCSTISYCS-
            T   P +N         ++  S +   NW LDS ATHH+T + ++L    PY GG+ V       + +  T   G   L   SR  +   I Y   
Subjt:  QAMLTTTSPNSNTHVPDTLSTMSTDSFHPDENWFLDSDATHHMTPEASSLYNLVPYHGGEQVTDLQSKQVLLRGTLEDGLYKLLPSSRASSCSTISYCS-

Query:  ---------RLQHLNSA-----PAAF------------------------VSSTESI-------------NWHFRLGHPAAPVLQRVLASYQVSCSNISD
                 RL + N       PA+F                        ++S++ +             +WH RLGHPA  +L  V+++Y +S  N S 
Subjt:  ---------RLQHLNSA-----PAAF------------------------VSSTESI-------------NWHFRLGHPAAPVLQRVLASYQVSCSNISD

Query:  ---FCNSCQLAKSHRLPFNLSKSESVAPFNLVHSDYYQREPPGTPNFLEYV
            C+ C + KS+++PF+ S   S  P   ++SD +        N+  YV
Subjt:  ---FCNSCQLAKSHRLPFNLSKSESVAPFNLVHSDYYQREPPGTPNFLEYV

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 | e value: 8.4e-27 | % identity: 22.87
Query:  KLSDNNFLLWKNQLLNVVLANGLSGFLDGSISAPPKFL-DDQQLQTNPEFLVWERKN----------------GRNCCFDTAADIWNSLKKSYDSKTTAR
        KL+  N+L+W  Q+  +     L+GFLDGS   PP  +  D   + NP++  W R++                       TAA IW +L+K Y + +   
Subjt:  KLSDNNFLLWKNQLLNVVLANGLSGFLDGSISAPPKFL-DDQQLQTNPEFLVWERKN----------------GRNCCFDTAADIWNSLKKSYDSKTTAR

Query:  IMGLKTQLQKIKKDGLSVSQYLAQIKDVADKFSAIGEPISYRDHLAHILDGLGCEYNPFVTSIQNRSDNPSLEDVRSLLLAYEARLEKQTTVEQLNLVRA
        +    TQL+ I +                D+ + +G+P+ + + +  +L+ L  +Y P +  I  +   PSL ++   L+  E++L    + E + +   
Subjt:  IMGLKTQLQKIKKDGLSVSQYLAQIKDVADKFSAIGEPISYRDHLAHILDGLGCEYNPFVTSIQNRSDNPSLEDVRSLLLAYEARLEKQTTVEQLNLVRA

Query:  NLSIVQLQNSRSPSSRSPSNQFPRSPFNPPSFSPFPSQTTSSVQSILVKPQFHSNQKWHSRPSPSKPQCQICGKFGHTALIC----HHRTNLSYQIPPSP
         ++      +R+ ++R  +  +  +     S+ P  S + S                 + +P P   +CQIC   GH+A  C      ++  + Q   SP
Subjt:  NLSIVQLQNSRSPSSRSPSNQFPRSPFNPPSFSPFPSQTTSSVQSILVKPQFHSNQKWHSRPSPSKPQCQICGKFGHTALIC----HHRTNLSYQIPPSP

Query:  QAMLTTTSPNSNTHVPDTLSTMSTDSFHPDENWFLDSDATHHMTPEASSLYNLVPYHGGEQVTDLQSKQVLLRGTLEDGLYKLLPSSRASSCSTISYCS-
            T   P +N         ++ +S +   NW LDS ATHH+T + ++L    PY GG+ V       + +  T   G   L  SSR+   + + Y   
Subjt:  QAMLTTTSPNSNTHVPDTLSTMSTDSFHPDENWFLDSDATHHMTPEASSLYNLVPYHGGEQVTDLQSKQVLLRGTLEDGLYKLLPSSRASSCSTISYCS-

Query:  ---------RLQHLNSA-----PAAF------------------------VSSTESI-------------NWHFRLGHPAAPVLQRVLASYQVSCSNISD
                 RL + N       PA+F                        ++S++++             +WH RLGHP+  +L  V++++ +   N S 
Subjt:  ---------RLQHLNSA-----PAAF------------------------VSSTESI-------------NWHFRLGHPAAPVLQRVLASYQVSCSNISD

Query:  ---FCNSCQLAKSHRLPFNLSKSESVAPFNLVHSDYYQREPPGTPNFLEYV
            C+ C + KSH++PF+ S   S  P   ++SD +        N+  YV
Subjt:  ---FCNSCQLAKSHRLPFNLSKSESVAPFNLVHSDYYQREPPGTPNFLEYV

Arabidopsis top hits | e value | % identity | Alignment
AT1G34070.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162) | e value: 2.5e-10 | % identity: 21.98
Query:  PLSVKLSDNNFLLWKNQLLNVVLANGLSGFLDGSISAPPKFLDDQQLQTNPEFLVWERKN-----------------GRNCCFDTAADIWNSLKKSYDSK
        P+ + + ++N+  W+   L   L+  + G +DG++           L TN   + W++++                 G      T+ DIW  +K  + + 
Subjt:  PLSVKLSDNNFLLWKNQLLNVVLANGLSGFLDGSISAPPKFLDDQQLQTNPEFLVWERKN-----------------GRNCCFDTAADIWNSLKKSYDSK

Query:  TTARIMGLKTQLQKIKKDGLSVSQYLAQIKDVADKFSAIGEPISYRDHLAHILDGLGCEYNPFVTSIQNRSDNPSLEDVRSLLLAYEARLEKQTTVEQLN
          AR + L ++L+      + V+ Y  ++K +AD    +  P++ R+ + ++L+GL  +++  +  I++R   PS +D  ++L   E RL++       +
Subjt:  TTARIMGLKTQLQKIKKDGLSVSQYLAQIKDVADKFSAIGEPISYRDHLAHILDGLGCEYNPFVTSIQNRSDNPSLEDVRSLLLAYEARLEKQTTVEQLN

Query:  LVRANLSIVQLQNSRSPSSRSPSNQFPRSPFN
        +  ++ S V L  S +P    P   F RS  N
Subjt:  LVRANLSIVQLQNSRSPSSRSPSNQFPRSPFN

AT5G48050.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162) | e value: 1.8e-11 | % identity: 22.82
Query:  LSVKLSDNNFLLWKNQLLNVVLANGLSGFLDGSISAPPKFLDDQQLQTNPEFLVWERKNGRNCCFD-------TAADIWNSLKKSYDSKTTARIMGLKTQ
        +++ L+  N+ +W+     + L+ G+ G +DGS S P    + +  + +    +W      +   D       TA D+W SL+  +     AR +  + +
Subjt:  LSVKLSDNNFLLWKNQLLNVVLANGLSGFLDGSISAPPKFLDDQQLQTNPEFLVWERKNGRNCCFD-------TAADIWNSLKKSYDSKTTARIMGLKTQ

Query:  LQKIKKDGLSVSQYLAQIKDVADKFSAIGEPISYRDHLAHILDGLGCEYNPFVTSIQNRSDNPSLEDVRSLLLAYEARLEKQTTVEQLNLVRANLSIVQL
        L+    D LSV +Y  ++K ++D  + +  PIS R  + H+L+GL  +Y+  +  I+++S  PS  + RS+LL  E+RL  ++     +    +LS V  
Subjt:  LQKIKKDGLSVSQYLAQIKDVADKFSAIGEPISYRDHLAHILDGLGCEYNPFVTSIQNRSDNPSLEDVRSLLLAYEARLEKQTTVEQLNLVRANLSIVQL

Query:  QNSRSPSSRSPSNQFPRSPFNPPSFSPFPSQTTSSVQSILVKPQFHSNQKWHSRPSPS----KPQCQICGKFGHTALICHHRTNLSYQIP---------P
           R         ++P+   N  S          +        ++++N  W     P+     PQ       G      H +T    Q P         P
Subjt:  QNSRSPSSRSPSNQFPRSPFNPPSFSPFPSQTTSSVQSILVKPQFHSNQKWHSRPSPS----KPQCQICGKFGHTALICHHRTNLSYQIP---------P

Query:  SPQAMLTTTSPNSN--------------------THVPDTLSTMSTDSFHPDENW
        SP ++L   +P  +                    +H P T S+MS  S  P+E W
Subjt:  SPQAMLTTTSPNSN--------------------THVPDTLSTMSTDSFHPDENW


Sequences
CDS sequence
ATGGTAGGCTTATTGAGTCGGCAAACAGGCCACTCTCACCCGTACAAATCAAAGGACAATCCCTCATGGACAGGAGTCCATAATCCACTCAGGATTAAGGCCAAGTTGCC
TAGGGGAGTCTTCTTCGTCTTCTTCCTCAGCATCGTTAACCACACTGGTAGCTTCTCCCTCTACTCCGGTGACTACTCCAGTAACCACGCCGACGACTCAACGACCGATT
GCCCAACAACCAAACCATTTTCCACCTCGCCAACCAGCTCCACCACAAACTTTGATTGTTCCTCCTCAGAATACTACCCAACCAGTATTTTTCCTCAACCATATGCTCAA
CCATTTTATCCCACCACTAATCTTCTGCACCCTTCCTTTTTCCATAATTTTTATCCTAATACCTATCCTCAGCCTAATACCTTTGCCCCTTCGTCCTACCCCCAACCCAA
TGCCTTGAATACAATAGCCCCAAACTCTTACCCAAATACATACCCAAACCTCTTGGCTCCAAACCCTTACCCAACTCTTCCCCAACCCTTGTCAGTCAAGCTCTCCGACA
ACAACTTCTTACTCTGGAAAAATCAACTGTTAAATGTTGTTCTCGCAAATGGTCTCAGTGGCTTCCTTGATGGTTCAATCTCGGCTCCTCCAAAATTTCTTGATGATCAA
CAACTTCAAACCAACCCAGAGTTTCTTGTGTGGGAAAGAAAAAATGGGAGAAATTGTTGTTTTGACACTGCTGCTGATATTTGGAATTCATTAAAGAAATCATATGACTC
TAAGACTACTGCTAGGATTATGGGTCTAAAAACACAACTCCAAAAGATTAAGAAAGATGGTTTATCCGTGAGTCAATATTTAGCTCAAATTAAAGATGTGGCTGATAAGT
TTTCCGCTATTGGCGAACCTATTTCTTATCGTGATCACTTAGCGCATATCTTAGATGGTCTTGGATGTGAATACAATCCCTTTGTTACATCGATTCAAAACCGGTCTGAT
AATCCGTCCTTGGAAGATGTTAGAAGCCTTCTATTGGCATATGAAGCTAGGTTGGAAAAACAAACCACTGTGGAGCAGCTTAATCTTGTCCGAGCAAATCTATCTATTGT
TCAGCTCCAAAACAGTCGCAGCCCTTCCTCTCGGTCTCCCTCAAATCAATTCCCTAGATCTCCCTTCAATCCACCATCATTTTCCCCTTTTCCTTCTCAAACTACCTCCT
CTGTCCAAAGTATCCTAGTGAAACCGCAATTCCACTCTAACCAAAAATGGCATTCTCGTCCTTCCCCTAGTAAGCCTCAATGTCAAATTTGTGGGAAATTTGGACACACT
GCCCTCATTTGCCACCATAGGACCAACTTATCTTACCAAATCCCCCCTTCACCTCAAGCTATGTTAACCACAACTTCTCCAAATTCAAATACCCATGTGCCCGATACTCT
ATCAACCATGTCCACTGATTCTTTTCACCCAGACGAAAATTGGTTCCTTGACTCCGACGCTACTCATCATATGACACCTGAGGCTTCATCTCTTTACAATCTGGTTCCTT
ATCATGGTGGTGAGCAGGTCACGGATCTTCAATCCAAGCAAGTCCTCCTCCGGGGCACTCTTGAAGATGGATTGTACAAGCTTCTTCCTTCATCACGTGCCTCCAGCTGC
TCCACTATCTCTTATTGCTCTCGGCTTCAACATTTAAATTCTGCTCCAGCTGCTTTTGTGTCTTCGACTGAGTCCATAAATTGGCATTTTCGGCTTGGTCATCCAGCGGC
CCCTGTTTTGCAGCGAGTTTTGGCTTCCTATCAAGTGTCTTGTTCAAATATTTCTGATTTTTGTAACTCATGTCAGCTTGCTAAGAGTCATCGTCTTCCGTTTAACTTAT
CTAAGTCAGAGTCTGTTGCACCTTTCAATTTAGTACATTCTGATTATTATCAGCGGGAACCTCCCGGAACTCCGAATTTTTTGGAATATGTTTCAACTCCAATGACGTAC
TACCGATCAAGCATTAGGCAACTGGCTATCCCTAACTCAGTCACTGGCGTTCACCAATACACAATGATCTAG
mRNA sequence
ATGGTAGGCTTATTGAGTCGGCAAACAGGCCACTCTCACCCGTACAAATCAAAGGACAATCCCTCATGGACAGGAGTCCATAATCCACTCAGGATTAAGGCCAAGTTGCC
TAGGGGAGTCTTCTTCGTCTTCTTCCTCAGCATCGTTAACCACACTGGTAGCTTCTCCCTCTACTCCGGTGACTACTCCAGTAACCACGCCGACGACTCAACGACCGATT
GCCCAACAACCAAACCATTTTCCACCTCGCCAACCAGCTCCACCACAAACTTTGATTGTTCCTCCTCAGAATACTACCCAACCAGTATTTTTCCTCAACCATATGCTCAA
CCATTTTATCCCACCACTAATCTTCTGCACCCTTCCTTTTTCCATAATTTTTATCCTAATACCTATCCTCAGCCTAATACCTTTGCCCCTTCGTCCTACCCCCAACCCAA
TGCCTTGAATACAATAGCCCCAAACTCTTACCCAAATACATACCCAAACCTCTTGGCTCCAAACCCTTACCCAACTCTTCCCCAACCCTTGTCAGTCAAGCTCTCCGACA
ACAACTTCTTACTCTGGAAAAATCAACTGTTAAATGTTGTTCTCGCAAATGGTCTCAGTGGCTTCCTTGATGGTTCAATCTCGGCTCCTCCAAAATTTCTTGATGATCAA
CAACTTCAAACCAACCCAGAGTTTCTTGTGTGGGAAAGAAAAAATGGGAGAAATTGTTGTTTTGACACTGCTGCTGATATTTGGAATTCATTAAAGAAATCATATGACTC
TAAGACTACTGCTAGGATTATGGGTCTAAAAACACAACTCCAAAAGATTAAGAAAGATGGTTTATCCGTGAGTCAATATTTAGCTCAAATTAAAGATGTGGCTGATAAGT
TTTCCGCTATTGGCGAACCTATTTCTTATCGTGATCACTTAGCGCATATCTTAGATGGTCTTGGATGTGAATACAATCCCTTTGTTACATCGATTCAAAACCGGTCTGAT
AATCCGTCCTTGGAAGATGTTAGAAGCCTTCTATTGGCATATGAAGCTAGGTTGGAAAAACAAACCACTGTGGAGCAGCTTAATCTTGTCCGAGCAAATCTATCTATTGT
TCAGCTCCAAAACAGTCGCAGCCCTTCCTCTCGGTCTCCCTCAAATCAATTCCCTAGATCTCCCTTCAATCCACCATCATTTTCCCCTTTTCCTTCTCAAACTACCTCCT
CTGTCCAAAGTATCCTAGTGAAACCGCAATTCCACTCTAACCAAAAATGGCATTCTCGTCCTTCCCCTAGTAAGCCTCAATGTCAAATTTGTGGGAAATTTGGACACACT
GCCCTCATTTGCCACCATAGGACCAACTTATCTTACCAAATCCCCCCTTCACCTCAAGCTATGTTAACCACAACTTCTCCAAATTCAAATACCCATGTGCCCGATACTCT
ATCAACCATGTCCACTGATTCTTTTCACCCAGACGAAAATTGGTTCCTTGACTCCGACGCTACTCATCATATGACACCTGAGGCTTCATCTCTTTACAATCTGGTTCCTT
ATCATGGTGGTGAGCAGGTCACGGATCTTCAATCCAAGCAAGTCCTCCTCCGGGGCACTCTTGAAGATGGATTGTACAAGCTTCTTCCTTCATCACGTGCCTCCAGCTGC
TCCACTATCTCTTATTGCTCTCGGCTTCAACATTTAAATTCTGCTCCAGCTGCTTTTGTGTCTTCGACTGAGTCCATAAATTGGCATTTTCGGCTTGGTCATCCAGCGGC
CCCTGTTTTGCAGCGAGTTTTGGCTTCCTATCAAGTGTCTTGTTCAAATATTTCTGATTTTTGTAACTCATGTCAGCTTGCTAAGAGTCATCGTCTTCCGTTTAACTTAT
CTAAGTCAGAGTCTGTTGCACCTTTCAATTTAGTACATTCTGATTATTATCAGCGGGAACCTCCCGGAACTCCGAATTTTTTGGAATATGTTTCAACTCCAATGACGTAC
TACCGATCAAGCATTAGGCAACTGGCTATCCCTAACTCAGTCACTGGCGTTCACCAATACACAATGATCTAG
Protein sequence
MVGLLSRQTGHSHPYKSKDNPSWTGVHNPLRIKAKLPRGVFFVFFLSIVNHTGSFSLYSGDYSSNHADDSTTDCPTTKPFSTSPTSSTTNFDCSSSEYYPTSIFPQPYAQ
PFYPTTNLLHPSFFHNFYPNTYPQPNTFAPSSYPQPNALNTIAPNSYPNTYPNLLAPNPYPTLPQPLSVKLSDNNFLLWKNQLLNVVLANGLSGFLDGSISAPPKFLDDQ
QLQTNPEFLVWERKNGRNCCFDTAADIWNSLKKSYDSKTTARIMGLKTQLQKIKKDGLSVSQYLAQIKDVADKFSAIGEPISYRDHLAHILDGLGCEYNPFVTSIQNRSD
NPSLEDVRSLLLAYEARLEKQTTVEQLNLVRANLSIVQLQNSRSPSSRSPSNQFPRSPFNPPSFSPFPSQTTSSVQSILVKPQFHSNQKWHSRPSPSKPQCQICGKFGHT
ALICHHRTNLSYQIPPSPQAMLTTTSPNSNTHVPDTLSTMSTDSFHPDENWFLDSDATHHMTPEASSLYNLVPYHGGEQVTDLQSKQVLLRGTLEDGLYKLLPSSRASSC
STISYCSRLQHLNSAPAAFVSSTESINWHFRLGHPAAPVLQRVLASYQVSCSNISDFCNSCQLAKSHRLPFNLSKSESVAPFNLVHSDYYQREPPGTPNFLEYVSTPMTY
YRSSIRQLAIPNSVTGVHQYTMI