; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0031783 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0031783
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase
Genome locationchr11:14601560..14611869
RNA-Seq ExpressionLag0031783
SyntenyLag0031783
Gene Ontology termsNA
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR018289 - MULE transposase domain
IPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
PIN21773.1 DNA-directed DNA polymerase [Handroanthus impetiginosus]1.2e-6742.45Show/hide
Query:  MLFGLCNAPATFQRCIMTIFSDMVERTLEVFMDDFFVFGETFANCLQNLDHVLERCEETNLVLNWKKCHFMVREGTVLGHKISKNGIEVDKAKIELISKL
        M FGLCNAPATFQRC+M IF+DMVE  LE+FMDDF V+G++F  CL NL  +L+RCE+TNLVLNW+KCHFMV+EG VLGHK+S  GIEVDKAK+E I KL
Subjt:  MLFGLCNAPATFQRCIMTIFSDMVERTLEVFMDDFFVFGETFANCLQNLDHVLERCEETNLVLNWKKCHFMVREGTVLGHKISKNGIEVDKAKIELISKL

Query:  QFPKTVRDIRSPLGPTGSSFRALSHTPLSLSIALSLVLLSLSENTEGFIQKISLVLWLPQAPNATSYSRIPVNLRGGVRGDFPARSRCDCCCFWEEIEEF
          P +++ + S LG  G                            + FI+  S +                            ++  C+           
Subjt:  QFPKTVRDIRSPLGPTGSSFRALSHTPLSLSIALSLVLLSLSENTEGFIQKISLVLWLPQAPNATSYSRIPVNLRGGVRGDFPARSRCDCCCFWEEIEEF

Query:  WRNVQGFLQGAFIPSDRPFEFDEDCKQAFGTLKSALSSSPVMIEPNWTQPFELMCDASDFAVGVVLGQKRGRIVQPIYFASKTLNEAQLNYTVQVEKRLD
                    +  D PF FD+    AF  LK  L S+P++  P+W+  FELMCDASDFAVG VLGQ++ +I + IY+ASKTLN+AQLNYT   ++ L 
Subjt:  WRNVQGFLQGAFIPSDRPFEFDEDCKQAFGTLKSALSSSPVMIEPNWTQPFELMCDASDFAVGVVLGQKRGRIVQPIYFASKTLNEAQLNYTVQVEKRLD

Query:  IRESFADEQILAVRAIEIPWFVDYVNYLVSGLKPPEATAQQLKKFLKDRRK
        +  +F   +   V A ++PW+ D VNYL  G+ P + +AQQ KKFL D R+
Subjt:  IRESFADEQILAVRAIEIPWFVDYVNYLVSGLKPPEATAQQLKKFLKDRRK

PIN26668.1 DNA-directed DNA polymerase [Handroanthus impetiginosus]1.6e-6740.35Show/hide
Query:  FGLCNAPATFQRCIMTIFSDMVERTLEVFMDDFFVFGETFANCLQNLDHVLERCEETNLVLNWKKCHFMVREGTVLGHKISKNGIEVDKAKIELISKLQF
        FGLCNAPATFQRC+M IF+DMVE  LEVFMDDF V+G++F  CL NL  VL+RCE+TNLVLNWKKCHFMV+EG VL HK+S  GIEV+KAK+E I KL  
Subjt:  FGLCNAPATFQRCIMTIFSDMVERTLEVFMDDFFVFGETFANCLQNLDHVLERCEETNLVLNWKKCHFMVREGTVLGHKISKNGIEVDKAKIELISKLQF

Query:  PKTVRDIRSPLGPTGSSFRALSHTPLSLSIALSLVLLSLSENTEGFIQKISLVLWLPQAPNATSYSRIPVNLRGGVRGDFPARSRCDCCCFWEEIEEFWR
        P +V+ IRS LG  G   R                          FI+  S +                            ++  C+             
Subjt:  PKTVRDIRSPLGPTGSSFRALSHTPLSLSIALSLVLLSLSENTEGFIQKISLVLWLPQAPNATSYSRIPVNLRGGVRGDFPARSRCDCCCFWEEIEEFWR

Query:  NVQGFLQGAFIPSDRPFEFDEDCKQAFGTLKSALSSSPVMIEPNWTQPFELMCDASDFAVGVVLGQKRGRIVQPIYFASKTLNEAQLNYTVQ--------
                  +  D PF+FD  C  AF  LK  L S+P++  P+W+ PFELMCDASDFA+G VLGQ++ +I + IY+ASKTLN+ QLNYT          
Subjt:  NVQGFLQGAFIPSDRPFEFDEDCKQAFGTLKSALSSSPVMIEPNWTQPFELMCDASDFAVGVVLGQKRGRIVQPIYFASKTLNEAQLNYTVQ--------

Query:  ---------------------------VEK-----RLD----------IRESFADEQILAVRAIEIPWFVDYVNYLVSGLKPPEATAQQLKKFLKDRRK
                                   +EK     RL+          I ++F DEQ+LA+ A ++PW+ D VNYL  G+ P + +AQQ KKFL D R+
Subjt:  ---------------------------VEK-----RLD----------IRESFADEQILAVRAIEIPWFVDYVNYLVSGLKPPEATAQQLKKFLKDRRK

XP_012838027.1 PREDICTED: uncharacterized protein LOC105958568 [Erythranthe guttata]1.2e-6738.41Show/hide
Query:  MLFGLCNAPATFQRCIMTIFSDMVERTLEVFMDDFFVFGETFANCLQNLDHVLERCEETNLVLNWKKCHFMVREGTVLGHKISKNGIEVDKAKIELISKL
        M FGLCNAPATFQRC+M IF+DMVE  LE+FMDDF VFG+T+  CLQ L  VL RCEETNLVLNW+KCHFMV+EG VLGHK+SK G+EVD+AKIE I KL
Subjt:  MLFGLCNAPATFQRCIMTIFSDMVERTLEVFMDDFFVFGETFANCLQNLDHVLERCEETNLVLNWKKCHFMVREGTVLGHKISKNGIEVDKAKIELISKL

Query:  QFPKTVRDIRSPLGPTGSSFRALSHTPLSLSIALSLVLLSLSENTEGFIQKISLVLWLPQAPNATSYSRIPVNLRGGVRGDFPARSRCDCCCFWEEIEEF
          P +V+ IRS LG  G   R                          FI+  S V                            A+  C+           
Subjt:  QFPKTVRDIRSPLGPTGSSFRALSHTPLSLSIALSLVLLSLSENTEGFIQKISLVLWLPQAPNATSYSRIPVNLRGGVRGDFPARSRCDCCCFWEEIEEF

Query:  WRNVQGFLQGAFIPSDRPFEFDEDCKQAFGTLKSALSSSPVMIEPNWTQPFELMCDASDFAVGVVLGQKRGRIVQPIYFASKTLNEAQLNYT--------
                    +  D  F+F+++C +AF  LK+ L ++P+++ P+W +PFELMCDASDFAVG VLGQ++ +I   IY+ASKTLN+AQLNYT        
Subjt:  WRNVQGFLQGAFIPSDRPFEFDEDCKQAFGTLKSALSSSPVMIEPNWTQPFELMCDASDFAVGVVLGQKRGRIVQPIYFASKTLNEAQLNYT--------

Query:  -----------------------------------------------------------------------VQVEKRLDIRESFADEQILAVRAIEIPWF
                                                                                +V+  + I E+F DEQ+LA+   E+PW+
Subjt:  -----------------------------------------------------------------------VQVEKRLDIRESFADEQILAVRAIEIPWF

Query:  VDYVNYLVSGLKPPEATAQQLKKFLKD
         D+VNYLVS + PPE T  Q KKFL D
Subjt:  VDYVNYLVSGLKPPEATAQQLKKFLKD

XP_022143642.1 uncharacterized protein LOC111013502 [Momordica charantia]4.9e-6958.8Show/hide
Query:  FHPLHIQIEGTHLYGKYRGKLLIATSVDSNRHLLPLAFAIVDEECHDTWGWFLKNLRECVTHEEVCLISNRHGGIISVVNNSNDGWTGDKSHYRFCLRHV
        F P+ +QI+GTHLY KY+GKLLIATSVDSN HLLPLAFAIVDEE   TWGWF KNLR+ VTHEE+CLIS+RHGGII  VNN ++GWTG KSH+RFCLRHV
Subjt:  FHPLHIQIEGTHLYGKYRGKLLIATSVDSNRHLLPLAFAIVDEECHDTWGWFLKNLRECVTHEEVCLISNRHGGIISVVNNSNDGWTGDKSHYRFCLRHV

Query:  -RNFNKKYKSKQLKILVYRAGCQYQVQKYNKVVEEIKAINSSCFTFFNNIGMRN-GLNH----TMEGLDTVDDNKLFGVHKQSFKRRSYVANHCTRSTYL
          N N KYK  +LK LVYRAG Q+Q++KYNK VE IK +NSSC  FF NI +    L+H      E   T     + GV K++   R          T  
Subjt:  -RNFNKKYKSKQLKILVYRAGCQYQVQKYNKVVEEIKAINSSCFTFFNNIGMRN-GLNH----TMEGLDTVDDNKLFGVHKQSFKRRSYVANHCTRSTYL

Query:  LYVKYFERRRGETRRALTRNEKCTRYAHDKIIKWATRSSKHEVSHHSELV
          VKYFE+RR ETR AL+RNEK TRYA+DK+++WA RS+KH V  +  +V
Subjt:  LYVKYFERRRGETRRALTRNEKCTRYAHDKIIKWATRSSKHEVSHHSELV

XP_030505517.1 uncharacterized protein LOC115720508 [Cannabis sativa]2.6e-7040.82Show/hide
Query:  MLFGLCNAPATFQRCIMTIFSDMVERTLEVFMDDFFVFGETFANCLQNLDHVLERCEETNLVLNWKKCHFMVREGTVLGHKISKNGIEVDKAKIELISKL
        M FGLCNA ATFQRC+M IFSDM E  LE+FMDDF ++G +F  CL+NL+ VL RCEETNLVLNW+KCHFMV+EG VLGHK+S  GIE+DKAK+E+I KL
Subjt:  MLFGLCNAPATFQRCIMTIFSDMVERTLEVFMDDFFVFGETFANCLQNLDHVLERCEETNLVLNWKKCHFMVREGTVLGHKISKNGIEVDKAKIELISKL

Query:  QFPKTVRDIRSPLGPTGSSFRALSHTPLSLSIALSLVLLSLSENTEGFIQKISLVLWLPQAPNATSYSRIPVNLRGGVRGDFPARSRCDCCCFWEEIEEF
          P TV+ I S LG  G  +R                          FI+                              DF   S+  C          
Subjt:  QFPKTVRDIRSPLGPTGSSFRALSHTPLSLSIALSLVLLSLSENTEGFIQKISLVLWLPQAPNATSYSRIPVNLRGGVRGDFPARSRCDCCCFWEEIEEF

Query:  WRNVQGFLQGAFIPSDRPFEFDEDCKQAFGTLKSALSSSPVMIEPNWTQPFELMCDASDFAVGVVLGQKRGRIVQPIYFASKTLNEAQLNYTV-------
                  + +  +RPFEF  +C + F  LK AL ++PV++ P+W+ PFELMCDASDFA+G VLGQ++ +I   IY+ASKTL +AQ+NYT        
Subjt:  WRNVQGFLQGAFIPSDRPFEFDEDCKQAFGTLKSALSSSPVMIEPNWTQPFELMCDASDFAVGVVLGQKRGRIVQPIYFASKTLNEAQLNYTV-------

Query:  -----------------------------------QVEKRLDIRESFADEQILAVRAIEIPWFVDYVNYLVSGLKPPEATAQQLKKFLKDRR
                                            +E  L I++SF DEQ+L V    +PW+ D VNYLV G   P+ T QQLKKFL D +
Subjt:  -----------------------------------QVEKRLDIRESFADEQILAVRAIEIPWFVDYVNYLVSGLKPPEATAQQLKKFLKDRR

TrEMBL top hitse value%identityAlignment
A0A2G9GWN4 DNA-directed DNA polymerase2.2e-6741.22Show/hide
Query:  MLFGLCNAPATFQRCIMTIFSDMVERTLEVFMDDFFVFGETFANCLQNLDHVLERCEETNLVLNWKKCHFMVREGTVLGHKISKNGIEVDKAKIELISKL
        M FGLCNAPA FQRC+M IF+D+VE  LEVFMDDF V+G +F  CL NL  VL+RCE+TNLVLNW+KC F+V+EG VLGHKIS  GIE+DKAK+E I KL
Subjt:  MLFGLCNAPATFQRCIMTIFSDMVERTLEVFMDDFFVFGETFANCLQNLDHVLERCEETNLVLNWKKCHFMVREGTVLGHKISKNGIEVDKAKIELISKL

Query:  QFPKTVRDIRSPLGPTGSSFRALSHTPLSLSIALSLVLLSLSENTEGFIQKISLVLWLPQAPNATSYSRIPVNLRGGVRGDFPARSRCDCCCFWEEIEEF
          P +V+ +RS LG TG   R                          FI+  S +                            ++  C+           
Subjt:  QFPKTVRDIRSPLGPTGSSFRALSHTPLSLSIALSLVLLSLSENTEGFIQKISLVLWLPQAPNATSYSRIPVNLRGGVRGDFPARSRCDCCCFWEEIEEF

Query:  WRNVQGFLQGAFIPSDRPFEFDEDCKQAFGTLKSALSSSPVMIEPNWTQPFELMCDASDFAVGVVLGQKRGRIVQPIYFASKTLNEAQLNYTVQVEKRL-
                    +  D PF+FD+ C  AF  +K  L S+P++I P+W  PFELMCD SDFA+G VLGQ++ +I + IY+ASK LN AQLNYT   EK L 
Subjt:  WRNVQGFLQGAFIPSDRPFEFDEDCKQAFGTLKSALSSSPVMIEPNWTQPFELMCDASDFAVGVVLGQKRGRIVQPIYFASKTLNEAQLNYTVQVEKRL-

Query:  ------------------------DIRESFADEQILAVRAIEIPWFVDYVNYLVSGLKPPEATAQQLKKFLKDRRK
                                 I ++F DEQ+LA+ A ++PW+ D VNYL  G+ P + + QQ KK L D R+
Subjt:  ------------------------DIRESFADEQILAVRAIEIPWFVDYVNYLVSGLKPPEATAQQLKKFLKDRRK

A0A2G9HWC5 DNA-directed DNA polymerase5.8e-6842.45Show/hide
Query:  MLFGLCNAPATFQRCIMTIFSDMVERTLEVFMDDFFVFGETFANCLQNLDHVLERCEETNLVLNWKKCHFMVREGTVLGHKISKNGIEVDKAKIELISKL
        M FGLCNAPATFQRC+M IF+DMVE  LE+FMDDF V+G++F  CL NL  +L+RCE+TNLVLNW+KCHFMV+EG VLGHK+S  GIEVDKAK+E I KL
Subjt:  MLFGLCNAPATFQRCIMTIFSDMVERTLEVFMDDFFVFGETFANCLQNLDHVLERCEETNLVLNWKKCHFMVREGTVLGHKISKNGIEVDKAKIELISKL

Query:  QFPKTVRDIRSPLGPTGSSFRALSHTPLSLSIALSLVLLSLSENTEGFIQKISLVLWLPQAPNATSYSRIPVNLRGGVRGDFPARSRCDCCCFWEEIEEF
          P +++ + S LG  G                            + FI+  S +                            ++  C+           
Subjt:  QFPKTVRDIRSPLGPTGSSFRALSHTPLSLSIALSLVLLSLSENTEGFIQKISLVLWLPQAPNATSYSRIPVNLRGGVRGDFPARSRCDCCCFWEEIEEF

Query:  WRNVQGFLQGAFIPSDRPFEFDEDCKQAFGTLKSALSSSPVMIEPNWTQPFELMCDASDFAVGVVLGQKRGRIVQPIYFASKTLNEAQLNYTVQVEKRLD
                    +  D PF FD+    AF  LK  L S+P++  P+W+  FELMCDASDFAVG VLGQ++ +I + IY+ASKTLN+AQLNYT   ++ L 
Subjt:  WRNVQGFLQGAFIPSDRPFEFDEDCKQAFGTLKSALSSSPVMIEPNWTQPFELMCDASDFAVGVVLGQKRGRIVQPIYFASKTLNEAQLNYTVQVEKRLD

Query:  IRESFADEQILAVRAIEIPWFVDYVNYLVSGLKPPEATAQQLKKFLKDRRK
        +  +F   +   V A ++PW+ D VNYL  G+ P + +AQQ KKFL D R+
Subjt:  IRESFADEQILAVRAIEIPWFVDYVNYLVSGLKPPEATAQQLKKFLKDRRK

A0A2G9IA86 DNA-directed DNA polymerase7.6e-6840.35Show/hide
Query:  FGLCNAPATFQRCIMTIFSDMVERTLEVFMDDFFVFGETFANCLQNLDHVLERCEETNLVLNWKKCHFMVREGTVLGHKISKNGIEVDKAKIELISKLQF
        FGLCNAPATFQRC+M IF+DMVE  LEVFMDDF V+G++F  CL NL  VL+RCE+TNLVLNWKKCHFMV+EG VL HK+S  GIEV+KAK+E I KL  
Subjt:  FGLCNAPATFQRCIMTIFSDMVERTLEVFMDDFFVFGETFANCLQNLDHVLERCEETNLVLNWKKCHFMVREGTVLGHKISKNGIEVDKAKIELISKLQF

Query:  PKTVRDIRSPLGPTGSSFRALSHTPLSLSIALSLVLLSLSENTEGFIQKISLVLWLPQAPNATSYSRIPVNLRGGVRGDFPARSRCDCCCFWEEIEEFWR
        P +V+ IRS LG  G   R                          FI+  S +                            ++  C+             
Subjt:  PKTVRDIRSPLGPTGSSFRALSHTPLSLSIALSLVLLSLSENTEGFIQKISLVLWLPQAPNATSYSRIPVNLRGGVRGDFPARSRCDCCCFWEEIEEFWR

Query:  NVQGFLQGAFIPSDRPFEFDEDCKQAFGTLKSALSSSPVMIEPNWTQPFELMCDASDFAVGVVLGQKRGRIVQPIYFASKTLNEAQLNYTVQ--------
                  +  D PF+FD  C  AF  LK  L S+P++  P+W+ PFELMCDASDFA+G VLGQ++ +I + IY+ASKTLN+ QLNYT          
Subjt:  NVQGFLQGAFIPSDRPFEFDEDCKQAFGTLKSALSSSPVMIEPNWTQPFELMCDASDFAVGVVLGQKRGRIVQPIYFASKTLNEAQLNYTVQ--------

Query:  ---------------------------VEK-----RLD----------IRESFADEQILAVRAIEIPWFVDYVNYLVSGLKPPEATAQQLKKFLKDRRK
                                   +EK     RL+          I ++F DEQ+LA+ A ++PW+ D VNYL  G+ P + +AQQ KKFL D R+
Subjt:  ---------------------------VEK-----RLD----------IRESFADEQILAVRAIEIPWFVDYVNYLVSGLKPPEATAQQLKKFLKDRRK

A0A6J1CRF1 uncharacterized protein LOC1110135022.4e-6958.8Show/hide
Query:  FHPLHIQIEGTHLYGKYRGKLLIATSVDSNRHLLPLAFAIVDEECHDTWGWFLKNLRECVTHEEVCLISNRHGGIISVVNNSNDGWTGDKSHYRFCLRHV
        F P+ +QI+GTHLY KY+GKLLIATSVDSN HLLPLAFAIVDEE   TWGWF KNLR+ VTHEE+CLIS+RHGGII  VNN ++GWTG KSH+RFCLRHV
Subjt:  FHPLHIQIEGTHLYGKYRGKLLIATSVDSNRHLLPLAFAIVDEECHDTWGWFLKNLRECVTHEEVCLISNRHGGIISVVNNSNDGWTGDKSHYRFCLRHV

Query:  -RNFNKKYKSKQLKILVYRAGCQYQVQKYNKVVEEIKAINSSCFTFFNNIGMRN-GLNH----TMEGLDTVDDNKLFGVHKQSFKRRSYVANHCTRSTYL
          N N KYK  +LK LVYRAG Q+Q++KYNK VE IK +NSSC  FF NI +    L+H      E   T     + GV K++   R          T  
Subjt:  -RNFNKKYKSKQLKILVYRAGCQYQVQKYNKVVEEIKAINSSCFTFFNNIGMRN-GLNH----TMEGLDTVDDNKLFGVHKQSFKRRSYVANHCTRSTYL

Query:  LYVKYFERRRGETRRALTRNEKCTRYAHDKIIKWATRSSKHEVSHHSELV
          VKYFE+RR ETR AL+RNEK TRYA+DK+++WA RS+KH V  +  +V
Subjt:  LYVKYFERRRGETRRALTRNEKCTRYAHDKIIKWATRSSKHEVSHHSELV

A0A6J1DMT3 LOW QUALITY PROTEIN: uncharacterized protein LOC1110220077.6e-6838.84Show/hide
Query:  MLFGLCNAPATFQRCIMTIFSDMVERTLEVFMDDFFVFGETFANCLQNLDHVLERCEETNLVLNWKKCHFMVREGTVLGHKISKNGIEVDKAKIELISKL
        M FGLCNAPATFQRC++ IFSDM+E T+E+FMDDF VFGE+F  CL+NL++VL+RCEETNLVLNW+K HFMV EGTVLGH+IS+ GIEVD AKI++I+KL
Subjt:  MLFGLCNAPATFQRCIMTIFSDMVERTLEVFMDDFFVFGETFANCLQNLDHVLERCEETNLVLNWKKCHFMVREGTVLGHKISKNGIEVDKAKIELISKL

Query:  QFPKTVRDIRSPLGPTGSSFRALSHTPLSLSIALSLVLLSLSENTEGFIQKISLVLWLPQAPNATSYSRIPVNLRGGVRGDFPARSRCDCCCFWEEIEEF
          P TV+ +RS LG  G   R                          FI+                              DF   S+  C          
Subjt:  QFPKTVRDIRSPLGPTGSSFRALSHTPLSLSIALSLVLLSLSENTEGFIQKISLVLWLPQAPNATSYSRIPVNLRGGVRGDFPARSRCDCCCFWEEIEEF

Query:  WRNVQGFLQGAFIPSDRPFEFDEDCKQAFGTLKSALSSSPVMIEPNWTQPFELMCDASDFAVGVVLGQKRGRIVQPIYFASKTLNEAQLNYTVQVEKRL-
                    +  D+PF F E C +AF TLK ALSS+P++IE  W  P ELMCDASDFAVGV+LGQ++G+I+ PIY+ASKTLN +QLNYTV  ++ L 
Subjt:  WRNVQGFLQGAFIPSDRPFEFDEDCKQAFGTLKSALSSSPVMIEPNWTQPFELMCDASDFAVGVVLGQKRGRIVQPIYFASKTLNEAQLNYTVQVEKRL-

Query:  -------------------------------------------------------------------------------DIRESFADEQILAVRAIEIPW
                                                                                       +I+E F DE +LA+   E+PW
Subjt:  -------------------------------------------------------------------------------DIRESFADEQILAVRAIEIPW

Query:  FVDYVNYLVSGLKPPEATAQQLKKFLKDRR
        + DY NYLVS + P + +  Q+KKF  D R
Subjt:  FVDYVNYLVSGLKPPEATAQQLKKFLKDRR

SwissProt top hitse value%identityAlignment
P04323 Retrovirus-related Pol polyprotein from transposon 17.62.1e-2229.1Show/hide
Query:  MLFGLCNAPATFQRCIMTIFSDMVERTLEVFMDDFFVFGETFANCLQNLDHVLERCEETNLVLNWKKCHFMVREGTVLGHKISKNGIEVDKAKIELISKL
        M FGL NAPATFQRC+  I   ++ +   V++DD  VF  +    LQ+L  V E+  + NL L   KC F+ +E T LGH ++ +GI+ +  KIE I K 
Subjt:  MLFGLCNAPATFQRCIMTIFSDMVERTLEVFMDDFFVFGETFANCLQNLDHVLERCEETNLVLNWKKCHFMVREGTVLGHKISKNGIEVDKAKIELISKL

Query:  QFPKTVRDIRSPLGPTGSSFRALSHTPLSLSIALSLVLLSLSENTEGFIQKISLVLWLPQAPNATSYSRIPVNLRGGVRGDFPARSRCDCCCFWEEIEEF
          P   ++I++ LG T                              G+ +K          PN    ++                ++C            
Subjt:  QFPKTVRDIRSPLGPTGSSFRALSHTPLSLSIALSLVLLSLSENTEGFIQKISLVLWLPQAPNATSYSRIPVNLRGGVRGDFPARSRCDCCCFWEEIEEF

Query:  WRNVQGFLQGAFIPSDRPFEFDEDCKQAFGTLKSALSSSPVMIEPNWTQPFELMCDASDFAVGVVLGQKRGRIVQPIYFASKTLNEAQLNYTVQVEKRL
         +N++       I +  P E+D     AF  LK  +S  P++  P++T+ F L  DASD A+G VL Q       P+ + S+TLNE ++NY+  +EK L
Subjt:  WRNVQGFLQGAFIPSDRPFEFDEDCKQAFGTLKSALSSSPVMIEPNWTQPFELMCDASDFAVGVVLGQKRGRIVQPIYFASKTLNEAQLNYTVQVEKRL

P0CT41 Transposon Tf2-12 polyprotein1.1e-1224.1Show/hide
Query:  MLFGLCNAPATFQRCIMTIFSDMVERTLEVFMDDFFVFGETFANCLQNLDHVLERCEETNLVLNWKKCHFMVREGTVLGHKISKNGIEVDKAKIELISKL
        M +G+  APA FQ  I TI  +  E  +  +MDD  +  ++ +  ++++  VL++ +  NL++N  KC F   +   +G+ IS+ G    +  I+ + + 
Subjt:  MLFGLCNAPATFQRCIMTIFSDMVERTLEVFMDDFFVFGETFANCLQNLDHVLERCEETNLVLNWKKCHFMVREGTVLGHKISKNGIEVDKAKIELISKL

Query:  QFPKTVRDIRSPLGPTGSSFRALSHTPLSLSIALSLVLLSLSENTEGFIQKISLVLWLPQAPNATSYSRIPVN--LRGGVRGDFPARSRCDCCCFWEEIE
        + PK  +++R  LG                              +  +++K     ++P+    TS    P+N  L+  VR              W+   
Subjt:  QFPKTVRDIRSPLGPTGSSFRALSHTPLSLSIALSLVLLSLSENTEGFIQKISLVLWLPQAPNATSYSRIPVN--LRGGVRGDFPARSRCDCCCFWEEIE

Query:  EFWRNVQGFLQGAFIPSDRPFEFDEDCKQAFGTLKSALSSSPVMIEPNWTQPFELMCDASDFAVGVVLGQKR-GRIVQPIYFASKTLNEAQLNYTVQVEK
          W   Q                     QA   +K  L S PV+   ++++   L  DASD AVG VL QK       P+ + S  +++AQLNY+V  ++
Subjt:  EFWRNVQGFLQGAFIPSDRPFEFDEDCKQAFGTLKSALSSSPVMIEPNWTQPFELMCDASDFAVGVVLGQKR-GRIVQPIYFASKTLNEAQLNYTVQVEK

Query:  RLDIRES
         L I +S
Subjt:  RLDIRES

P10394 Retrovirus-related Pol polyprotein from transposon 4122.1e-1425.69Show/hide
Query:  FGLCNAPATFQRCIMTIFSDMVERTLEVFMDDFFVFGETFANCLQNLDHVLERCEETNLVLNWKKCHFMVREGTVLGHKISKNGIEVDKAKIELISKLQF
        FGL  AP +FQR +   FS +      ++MDD  V G +  + L+NL  V  +C E NL L+ +KC F + E T LGHK +  GI  D  K ++I     
Subjt:  FGLCNAPATFQRCIMTIFSDMVERTLEVFMDDFFVFGETFANCLQNLDHVLERCEETNLVLNWKKCHFMVREGTVLGHKISKNGIEVDKAKIELISKLQF

Query:  PKTVRDIRSPLGPTGSSFRALSHTPLSLSIALSLVLLSLSENTEGFIQKISLVLWLPQAPNATSYSRIPVNLRGGVRGDFPARSRCDCCCFWEEIEEFWR
        P      R                            ++       FI+            N   YSR    L             C             +
Subjt:  PKTVRDIRSPLGPTGSSFRALSHTPLSLSIALSLVLLSLSENTEGFIQKISLVLWLPQAPNATSYSRIPVNLRGGVRGDFPARSRCDCCCFWEEIEEFWR

Query:  NVQGFLQGAFIPSDRPFEFDEDCKQAFGTLKSALSSSPVMIEPNWTQPFELMCDASDFAVGVVLGQKRGRIVQPIYFASKTLNEAQLN
        NV             PFE+ ++C++AF  LKS L +  ++  P++++ F +  DAS  A G VL Q       P+ +AS+   + + N
Subjt:  NVQGFLQGAFIPSDRPFEFDEDCKQAFGTLKSALSSSPVMIEPNWTQPFELMCDASDFAVGVVLGQKRGRIVQPIYFASKTLNEAQLN

P20825 Retrovirus-related Pol polyprotein from transposon 2971.2e-1725.75Show/hide
Query:  MLFGLCNAPATFQRCIMTIFSDMVERTLEVFMDDFFVFGETFANCLQNLDHVLERCEETNLVLNWKKCHFMVREGTVLGHKISKNGIEVDKAKIELISKL
        M FGL NAPATFQRC+  I   ++ +   V++DD  +F  +    L ++  V  +  + NL L   KC F+ +E   LGH ++ +GI+ +  K++ I   
Subjt:  MLFGLCNAPATFQRCIMTIFSDMVERTLEVFMDDFFVFGETFANCLQNLDHVLERCEETNLVLNWKKCHFMVREGTVLGHKISKNGIEVDKAKIELISKL

Query:  QFPKTVRDIRSPLGPTGSSFRALSHTPLSLSIALSLVLLSLSENTEGFIQKISLVLWLPQAPNATSYSRIPVNLRGGVRGDFPARSRCDCCCFWEEIEEF
          P   ++IR+ LG TG   + +   P    IA  +    L + T+   QK+  +                                             
Subjt:  QFPKTVRDIRSPLGPTGSSFRALSHTPLSLSIALSLVLLSLSENTEGFIQKISLVLWLPQAPNATSYSRIPVNLRGGVRGDFPARSRCDCCCFWEEIEEF

Query:  WRNVQGFLQGAFIPSDRPFEFDEDCKQAFGTLKSALSSSPVMIEPNWTQPFELMCDASDFAVGVVLGQKRGRIVQPIYFASKTLNEAQLNYTVQVEKRL
                                  +AF  LK+ +   P++  P++ + F L  DAS+ A+G VL Q       PI F S+TLN+ +LNY+  +EK L
Subjt:  WRNVQGFLQGAFIPSDRPFEFDEDCKQAFGTLKSALSSSPVMIEPNWTQPFELMCDASDFAVGVVLGQKRGRIVQPIYFASKTLNEAQLNYTVQVEKRL

Q8I7P9 Retrovirus-related Pol polyprotein from transposon opus3.2e-2328.62Show/hide
Query:  FGLCNAPATFQRCIMTIFSDMVERTLEVFMDDFFVFGETFANCLQNLDHVLERCEETNLVLNWKKCHFMVREGTVLGHKISKNGIEVDKAKIELISKLQF
        FGL NAPA FQR I  I  + + +   V++DD  VF E +    +NL  VL    + NL +N +K HF+  +   LG+ ++ +GI+ D  K+  IS++  
Subjt:  FGLCNAPATFQRCIMTIFSDMVERTLEVFMDDFFVFGETFANCLQNLDHVLERCEETNLVLNWKKCHFMVREGTVLGHKISKNGIEVDKAKIELISKLQF

Query:  PKTVRDIRSPLGPTGSSFRALSHTPLSLSIALSLVLLSLSENTEGFIQKISLVLWLPQAPNATSYSRIPVNLRGGVRGDFPARSRCDCCCFWEEIEEFWR
        P +V++++  LG T S +R                          FIQ                Y+++   L    RG                    + 
Subjt:  PKTVRDIRSPLGPTGSSFRALSHTPLSLSIALSLVLLSLSENTEGFIQKISLVLWLPQAPNATSYSRIPVNLRGGVRGDFPARSRCDCCCFWEEIEEFWR

Query:  NVQGFLQGAFIPSDRPFEFDEDCKQAFGTLKSALSSSPVMIEPNWTQPFELMCDASDFAVGVVLGQKRGRIVQPIYFASKTLNEAQLNYTVQVEKRL
        N++         S  P   DE   Q+F  LKS L SS ++  P +T+PF L  DAS++A+G VL Q      +PI + S++LN+ + NY   +EK +
Subjt:  NVQGFLQGAFIPSDRPFEFDEDCKQAFGTLKSALSSSPVMIEPNWTQPFELMCDASDFAVGVVLGQKRGRIVQPIYFASKTLNEAQLNYTVQVEKRL

Arabidopsis top hitse value%identityAlignment
AT1G49920.1 MuDR family transposase7.9e-1730.73Show/hide
Query:  IQIEGTHLYGKYRGKLLIATSVDSNRHLLPLAFAIVDEECHDTWGWFLKNLRECVTHEE-VCLISNRHGGIISVVNNSNDGWTGDKSHYRFCLRHVRNFN
        I ++  +L GKY+ KL+IA++ D+     PLAFA+  E   D+W WFL  +RE VT  + +CLIS+    I++V+N     W    +++RFCL H+ +  
Subjt:  IQIEGTHLYGKYRGKLLIATSVDSNRHLLPLAFAIVDEECHDTWGWFLKNLRECVTHEE-VCLISNRHGGIISVVNNSNDGWTGDKSHYRFCLRHVRNFN

Query:  KKYK---SKQLKILVYRAGCQYQVQKYNKVVEEIKAINSSCFTFFNNIGMRN-GLNH---TMEGLDTVDDNKLFGVHKQ
                  +  LV  AG   Q ++++  ++EIK  N   + + +        L H      G+  +D   LF V K+
Subjt:  KKYK---SKQLKILVYRAGCQYQVQKYNKVVEEIKAINSSCFTFFNNIGMRN-GLNH---TMEGLDTVDDNKLFGVHKQ

AT1G64255.1 MuDR family transposase3.0e-1629.06Show/hide
Query:  IQIEGTHLYGKYRGKLLIATSVDSNRHLLPLAFAIVDEECHDTWGWFLKNLRECVTHEE-VCLISNRHGGIISVVNNSNDGWTGDKSHYRFCLRH-VRNF
        I ++  +L  +Y+ KL+IA+ VD+     PLAFA+  E   D W WFL  +RE VT  + +CLIS+ H  II+VVN S   W    +++RF L H    F
Subjt:  IQIEGTHLYGKYRGKLLIATSVDSNRHLLPLAFAIVDEECHDTWGWFLKNLRECVTHEE-VCLISNRHGGIISVVNNSNDGWTGDKSHYRFCLRH-VRNF

Query:  NKKYKSKQLKILVYRAGCQYQVQKYNKVVEEIKAINSSCFTFFNNIGMRNGL----NHTMEGLDTVDDNKLFGVHKQSFKRRSYVANHCTRSTYLLYVKY
        ++ + S  L   + RAG   Q  ++   + +IK  N     + +            N    G+  ++   LF V   +F++  +V    T S  LL+ + 
Subjt:  NKKYKSKQLKILVYRAGCQYQVQKYNKVVEEIKAINSSCFTFFNNIGMRNGL----NHTMEGLDTVDDNKLFGVHKQSFKRRSYVANHCTRSTYLLYVKY

Query:  ---FERRRGETRRALTRNEKCTRYAHDKIIKWAT
           F++    +R +L   +  T    DK+ ++ T
Subjt:  ---FERRRGETRRALTRNEKCTRYAHDKIIKWAT

AT1G64260.1 MuDR family transposase4.9e-1927.92Show/hide
Query:  IQIEGTHLYGKYRGKLLIATSVDSNRHLLPLAFAIVDEECHDTWGWFLKNLRECVT-HEEVCLISNRHGGIISVVNNSNDGWTGDKSHYRFCLRHVRN-F
        I ++   L GKY+ KL+IA+ VD+     PLAFA+  E   D+W WF   +RE VT  +++CLIS+    I++VVN     W    +H++FCL H+R+ F
Subjt:  IQIEGTHLYGKYRGKLLIATSVDSNRHLLPLAFAIVDEECHDTWGWFLKNLRECVT-HEEVCLISNRHGGIISVVNNSNDGWTGDKSHYRFCLRHVRN-F

Query:  NKKYKSKQLKILVYRAGCQYQVQKYNKVVEEIKAINSSCFTFFNNIGMR-------NGLNHTMEGLDTVDDNKLFGVHKQSFKRRSYVANHCTRSTYLLY
           ++   L+ LV +AG   Q ++++  + +IK  N   + + + I          +GL +   G+  +D   LF V     +   Y     T    L++
Subjt:  NKKYKSKQLKILVYRAGCQYQVQKYNKVVEEIKAINSSCFTFFNNIGMR-------NGLNHTMEGLDTVDDNKLFGVHKQSFKRRSYVANHCTRSTYLLY

Query:  VKY---FERRRGETRRALTRNEKCTRYAHDKIIKWATRSSKH---EVSHHSELVWNDGNMESYIVDVE------RQFYSVIFP
         +    F++       +L R    T    DK+ ++ T S  +   ++   S  V      E +IV +       R+F S  FP
Subjt:  VKY---FERRRGETRRALTRNEKCTRYAHDKIIKWATRSSKH---EVSHHSELVWNDGNMESYIVDVE------RQFYSVIFP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTATTTGGACTCTGTAACGCTCCAGCCACCTTTCAAAGGTGCATAATGACAATATTTTCAGATATGGTGGAGCGAACATTGGAAGTTTTTATGGATGACTTCTTCGT
CTTTGGAGAAACTTTTGCAAATTGTTTACAGAATCTTGACCATGTATTGGAAAGATGCGAAGAAACGAACCTAGTGTTAAATTGGAAAAAATGCCATTTTATGGTACGAG
AAGGTACAGTCCTTGGACACAAGATTTCCAAAAATGGAATTGAGGTAGACAAGGCAAAGATCGAGCTTATATCCAAGCTACAATTTCCTAAGACAGTACGAGACATCAGG
AGTCCATTAGGTCCCACCGGTAGCTCATTTAGGGCGTTGAGTCACACGCCTCTCTCCCTCTCTATCGCTCTCTCTCTGGTTCTGCTCTCTCTCTCTGAGAATACAGAAGG
CTTCATCCAGAAAATATCCTTAGTTCTTTGGCTTCCACAAGCTCCCAACGCAACATCCTACTCGAGAATACCGGTGAACCTTCGTGGTGGTGTTCGTGGTGATTTTCCAG
CAAGATCAAGGTGTGACTGCTGCTGTTTTTGGGAGGAAATAGAAGAATTTTGGCGAAACGTTCAAGGATTTCTTCAAGGGGCTTTTATCCCTTCAGATAGACCTTTTGAA
TTTGATGAAGATTGTAAGCAAGCTTTTGGAACTTTGAAGAGTGCACTTAGTTCATCTCCGGTCATGATTGAACCCAACTGGACTCAACCGTTTGAGCTAATGTGCGACGC
TAGTGACTTTGCAGTAGGAGTTGTGTTAGGCCAGAAAAGAGGGAGGATTGTGCAACCAATCTATTTTGCAAGCAAGACGCTTAACGAAGCACAGTTGAACTATACAGTGC
AGGTGGAGAAGAGGCTGGACATTCGAGAGTCGTTTGCAGATGAACAAATATTGGCAGTAAGGGCAATTGAGATTCCATGGTTTGTAGACTATGTGAACTACTTAGTTAGT
GGACTAAAGCCTCCGGAAGCCACAGCGCAACAACTGAAAAAGTTTCTGAAAGATCGAAGAAAACAAGGAGTCGTTGCGTTTTCGTTCGTTGGAGCATCGTTGGCGAAGAA
CAGTCAAATCTACAACGAAGGTCCTTTCGATGCGCTTTCGCTGCGTAGGGGCTTTCATCCCCTCCACATCCAAATAGAGGGTACTCATTTATATGGCAAGTATAGGGGGA
AGTTATTAATTGCAACATCAGTTGACTCTAATAGACATTTGCTACCACTTGCATTTGCCATTGTAGATGAGGAGTGTCATGACACATGGGGATGGTTTTTGAAAAACTTG
AGAGAGTGTGTTACACATGAAGAAGTTTGCTTGATTTCTAATCGACATGGAGGCATCATCTCGGTTGTTAACAATTCAAATGATGGGTGGACAGGAGACAAATCTCACTA
TAGATTTTGCTTGAGACATGTTAGAAATTTTAATAAAAAATACAAGTCCAAACAATTGAAAATTTTAGTTTATCGTGCTGGGTGTCAGTATCAGGTCCAAAAGTACAATA
AGGTTGTTGAAGAGATTAAAGCAATCAATAGCTCTTGCTTCACCTTCTTCAATAATATTGGAATGAGAAATGGACTCAATCACACGATGGAGGGGTTAGATACGGTGGAT
GACAACAAACTTTTCGGAGTGCATAAACAGAGTTTTAAAAGGAGGTCGTATGTTGCCAATCACTGCACTCGCTCAACTTACCTTTTGTATGTGAAATATTTTGAGAGGAG
AAGAGGAGAAACAAGACGTGCTTTGACTCGAAATGAGAAGTGCACGAGGTATGCTCATGATAAGATCATTAAATGGGCAACACGGTCCAGTAAGCACGAGGTGTCCCATC
ATTCAGAGCTCGTGTGGAATGATGGGAACATGGAAAGTTACATTGTAGACGTCGAGAGGCAATTCTACAGCGTAATATTCCCTTACACCGACTTATATTGCCAGCATTAC
GTGTTGTCGGACAACCAACAAGATCTTCTCTACTACCCTCGGGCCTTGGATCAGATTGTGGGATCCTTAATTGATAGGAATTATGGAAGGGAAGAGCTTGAAAGCTCAAA
TATTGAGAGCTTGAACAAGGTGTCTTTTGAGAGTGAATTTCTCCAACATTTCAGTATAAAATACCATGAATTTCCCTTCTTTGAATCCTCTATATATAGAGAAAGTACTT
TGAGCTTAGGCGATGTCGGGAGCCAACCACGCAAAATTGGTGTTAATTTGGGCCAATTATGGAGTTTTGGAGCCATCTCAGTGTTCTTGGACGGTGAGGAGAATATATTG
TTGAGCGACTTGAGGGAGCAAAATCTGTGCTGGAGCAAAGCAAGAAGCAAAACTTCCACATCACAACCCGTTAGCCAACTTAATGAACTGAATTCTGTTAAGTTATTCTT
GGGATTAAGGAGCAAGAAGAGCCATCCACGTGTCCATATTGACCTAATTAGGGCTTGCTATCACTCTATAAAAGGAGAAGTCGTCTCAAGTCAAATTAATTCCTTTCTTG
GGGAGGAACATTCCACTCTTAGGAAGTTCATTTCCACTTTTGGAAGCCGAATAGAGAAGAAAGAATAG
mRNA sequenceShow/hide mRNA sequence
ATGTTATTTGGACTCTGTAACGCTCCAGCCACCTTTCAAAGGTGCATAATGACAATATTTTCAGATATGGTGGAGCGAACATTGGAAGTTTTTATGGATGACTTCTTCGT
CTTTGGAGAAACTTTTGCAAATTGTTTACAGAATCTTGACCATGTATTGGAAAGATGCGAAGAAACGAACCTAGTGTTAAATTGGAAAAAATGCCATTTTATGGTACGAG
AAGGTACAGTCCTTGGACACAAGATTTCCAAAAATGGAATTGAGGTAGACAAGGCAAAGATCGAGCTTATATCCAAGCTACAATTTCCTAAGACAGTACGAGACATCAGG
AGTCCATTAGGTCCCACCGGTAGCTCATTTAGGGCGTTGAGTCACACGCCTCTCTCCCTCTCTATCGCTCTCTCTCTGGTTCTGCTCTCTCTCTCTGAGAATACAGAAGG
CTTCATCCAGAAAATATCCTTAGTTCTTTGGCTTCCACAAGCTCCCAACGCAACATCCTACTCGAGAATACCGGTGAACCTTCGTGGTGGTGTTCGTGGTGATTTTCCAG
CAAGATCAAGGTGTGACTGCTGCTGTTTTTGGGAGGAAATAGAAGAATTTTGGCGAAACGTTCAAGGATTTCTTCAAGGGGCTTTTATCCCTTCAGATAGACCTTTTGAA
TTTGATGAAGATTGTAAGCAAGCTTTTGGAACTTTGAAGAGTGCACTTAGTTCATCTCCGGTCATGATTGAACCCAACTGGACTCAACCGTTTGAGCTAATGTGCGACGC
TAGTGACTTTGCAGTAGGAGTTGTGTTAGGCCAGAAAAGAGGGAGGATTGTGCAACCAATCTATTTTGCAAGCAAGACGCTTAACGAAGCACAGTTGAACTATACAGTGC
AGGTGGAGAAGAGGCTGGACATTCGAGAGTCGTTTGCAGATGAACAAATATTGGCAGTAAGGGCAATTGAGATTCCATGGTTTGTAGACTATGTGAACTACTTAGTTAGT
GGACTAAAGCCTCCGGAAGCCACAGCGCAACAACTGAAAAAGTTTCTGAAAGATCGAAGAAAACAAGGAGTCGTTGCGTTTTCGTTCGTTGGAGCATCGTTGGCGAAGAA
CAGTCAAATCTACAACGAAGGTCCTTTCGATGCGCTTTCGCTGCGTAGGGGCTTTCATCCCCTCCACATCCAAATAGAGGGTACTCATTTATATGGCAAGTATAGGGGGA
AGTTATTAATTGCAACATCAGTTGACTCTAATAGACATTTGCTACCACTTGCATTTGCCATTGTAGATGAGGAGTGTCATGACACATGGGGATGGTTTTTGAAAAACTTG
AGAGAGTGTGTTACACATGAAGAAGTTTGCTTGATTTCTAATCGACATGGAGGCATCATCTCGGTTGTTAACAATTCAAATGATGGGTGGACAGGAGACAAATCTCACTA
TAGATTTTGCTTGAGACATGTTAGAAATTTTAATAAAAAATACAAGTCCAAACAATTGAAAATTTTAGTTTATCGTGCTGGGTGTCAGTATCAGGTCCAAAAGTACAATA
AGGTTGTTGAAGAGATTAAAGCAATCAATAGCTCTTGCTTCACCTTCTTCAATAATATTGGAATGAGAAATGGACTCAATCACACGATGGAGGGGTTAGATACGGTGGAT
GACAACAAACTTTTCGGAGTGCATAAACAGAGTTTTAAAAGGAGGTCGTATGTTGCCAATCACTGCACTCGCTCAACTTACCTTTTGTATGTGAAATATTTTGAGAGGAG
AAGAGGAGAAACAAGACGTGCTTTGACTCGAAATGAGAAGTGCACGAGGTATGCTCATGATAAGATCATTAAATGGGCAACACGGTCCAGTAAGCACGAGGTGTCCCATC
ATTCAGAGCTCGTGTGGAATGATGGGAACATGGAAAGTTACATTGTAGACGTCGAGAGGCAATTCTACAGCGTAATATTCCCTTACACCGACTTATATTGCCAGCATTAC
GTGTTGTCGGACAACCAACAAGATCTTCTCTACTACCCTCGGGCCTTGGATCAGATTGTGGGATCCTTAATTGATAGGAATTATGGAAGGGAAGAGCTTGAAAGCTCAAA
TATTGAGAGCTTGAACAAGGTGTCTTTTGAGAGTGAATTTCTCCAACATTTCAGTATAAAATACCATGAATTTCCCTTCTTTGAATCCTCTATATATAGAGAAAGTACTT
TGAGCTTAGGCGATGTCGGGAGCCAACCACGCAAAATTGGTGTTAATTTGGGCCAATTATGGAGTTTTGGAGCCATCTCAGTGTTCTTGGACGGTGAGGAGAATATATTG
TTGAGCGACTTGAGGGAGCAAAATCTGTGCTGGAGCAAAGCAAGAAGCAAAACTTCCACATCACAACCCGTTAGCCAACTTAATGAACTGAATTCTGTTAAGTTATTCTT
GGGATTAAGGAGCAAGAAGAGCCATCCACGTGTCCATATTGACCTAATTAGGGCTTGCTATCACTCTATAAAAGGAGAAGTCGTCTCAAGTCAAATTAATTCCTTTCTTG
GGGAGGAACATTCCACTCTTAGGAAGTTCATTTCCACTTTTGGAAGCCGAATAGAGAAGAAAGAATAG
Protein sequenceShow/hide protein sequence
MLFGLCNAPATFQRCIMTIFSDMVERTLEVFMDDFFVFGETFANCLQNLDHVLERCEETNLVLNWKKCHFMVREGTVLGHKISKNGIEVDKAKIELISKLQFPKTVRDIR
SPLGPTGSSFRALSHTPLSLSIALSLVLLSLSENTEGFIQKISLVLWLPQAPNATSYSRIPVNLRGGVRGDFPARSRCDCCCFWEEIEEFWRNVQGFLQGAFIPSDRPFE
FDEDCKQAFGTLKSALSSSPVMIEPNWTQPFELMCDASDFAVGVVLGQKRGRIVQPIYFASKTLNEAQLNYTVQVEKRLDIRESFADEQILAVRAIEIPWFVDYVNYLVS
GLKPPEATAQQLKKFLKDRRKQGVVAFSFVGASLAKNSQIYNEGPFDALSLRRGFHPLHIQIEGTHLYGKYRGKLLIATSVDSNRHLLPLAFAIVDEECHDTWGWFLKNL
RECVTHEEVCLISNRHGGIISVVNNSNDGWTGDKSHYRFCLRHVRNFNKKYKSKQLKILVYRAGCQYQVQKYNKVVEEIKAINSSCFTFFNNIGMRNGLNHTMEGLDTVD
DNKLFGVHKQSFKRRSYVANHCTRSTYLLYVKYFERRRGETRRALTRNEKCTRYAHDKIIKWATRSSKHEVSHHSELVWNDGNMESYIVDVERQFYSVIFPYTDLYCQHY
VLSDNQQDLLYYPRALDQIVGSLIDRNYGREELESSNIESLNKVSFESEFLQHFSIKYHEFPFFESSIYRESTLSLGDVGSQPRKIGVNLGQLWSFGAISVFLDGEENIL
LSDLREQNLCWSKARSKTSTSQPVSQLNELNSVKLFLGLRSKKSHPRVHIDLIRACYHSIKGEVVSSQINSFLGEEHSTLRKFISTFGSRIEKKE