CuGenDBv2

Lag0026349 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0026349
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Replication-associated protein
Genome location: chr10:35259344..35260816
RNA-Seq Expression: Lag0026349
Synteny: Lag0026349
Gene Ontology terms:
GO:0006259 - DNA metabolic process (biological process)
GO:0006260 - DNA replication (biological process)
GO:0090305 - nucleic acid phosphodiester bond hydrolysis (biological process)
GO:0042025 - host cell nucleus (cellular component)
GO:0003677 - DNA binding (molecular function)
GO:0004386 - helicase activity (molecular function)
GO:0005198 - structural molecule activity (molecular function)
GO:0005524 - ATP binding (molecular function)
GO:0016779 - nucleotidyltransferase activity (molecular function)
GO:0016888 - endodeoxyribonuclease activity, producing 5'-phosphomonoesters (molecular function)
GO:0046872 - metal ion binding (molecular function)
InterPro domains:
IPR001301 - Geminivirus AL1 replication-associated protein, CLV type
IPR002488 - Geminivirus C4 protein
IPR022692 - Geminivirus AL1 replication-associated protein, central domain
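The genome location string can be split into a chromosome name and a coordinate pair. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state the convention); `parse_location` is a hypothetical helper name, not part of any database API.

```python
import re


def parse_location(location: str):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


chrom, start, end = parse_location("chr10:35259344..35260816")

# Assumption: 1-based inclusive coordinates, so both endpoints are counted.
span = end - start + 1
print(chrom, start, end, span)
```

Under that assumption the locus spans 1473 bp on chr10.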


Homology
GenBank top hits (e value, % identity, alignment)
ABG90906.1 C1 [Merremia leaf curl virus] (e value: 2.9e-73; % identity: 73.48)
Query:  DTIEWGTFQVDGRSSRGGHQTANDACAKALNSGSADEALNIIREFRSKDFIFSFHNLQLNLERIFAKPQTPYESRFCPNSFTRVPHVILDWAAKNVCDSA
        D I+WGTFQVDGRS+RGG QTANDA A+ALN+GSA+ AL IIRE   +DFIF +HNL+ NL+RIF+ P + Y S F  +SF  VP +I DWAA+NV DSA
Subjt:  DTIEWGTFQVDGRSSRGGHQTANDACAKALNSGSADEALNIIREFRSKDFIFSFHNLQLNLERIFAKPQTPYESRFCPNSFTRVPHVILDWAAKNVCDSA

Query:  ARPDRPISIIIEGPSRVGKTVWARSLGKHNYLCGHLDLSGKVYTNDAWYNVIDDVDPRYLKHYKELMGAQKSWQSNTKYGK
        ARPDRPISI+IEGPSR+GKTVWARSLG HNYLCGHLDLS KVY+N AWYNVIDDV+P+YLKH+KE MGAQK WQSN KYGK
Subjt:  ARPDRPISIIIEGPSRVGKTVWARSLGKHNYLCGHLDLSGKVYTNDAWYNVIDDVDPRYLKHYKELMGAQKSWQSNTKYGK

AGW16201.1 replication-associated protein [Sweet potato leaf curl Georgia virus] (e value: 2.2e-73; % identity: 73.48)
Query:  DTIEWGTFQVDGRSSRGGHQTANDACAKALNSGSADEALNIIREFRSKDFIFSFHNLQLNLERIFAKPQTPYESRFCPNSFTRVPHVILDWAAKNVCDSA
        D I+WGTFQVDGRS+RGG QTANDA A+ALN+GSA+ AL IIRE   +DFIF +HNL+ NL+RIF+ P + Y S F  +SF  VP +I DWAA+NV DSA
Subjt:  DTIEWGTFQVDGRSSRGGHQTANDACAKALNSGSADEALNIIREFRSKDFIFSFHNLQLNLERIFAKPQTPYESRFCPNSFTRVPHVILDWAAKNVCDSA

Query:  ARPDRPISIIIEGPSRVGKTVWARSLGKHNYLCGHLDLSGKVYTNDAWYNVIDDVDPRYLKHYKELMGAQKSWQSNTKYGK
        ARPDRPISI+IEGPSR+GKTVWARSLG HNYLCGHLDLS KVY+N AWYNVIDDV+P+YLKH+KE MGAQK WQSN KYGK
Subjt:  ARPDRPISIIIEGPSRVGKTVWARSLGKHNYLCGHLDLSGKVYTNDAWYNVIDDVDPRYLKHYKELMGAQKSWQSNTKYGK

AQZ37080.1 replication-associated protein [Sweet potato leaf curl Georgia virus] (e value: 2.2e-73; % identity: 73.48)
Query:  DTIEWGTFQVDGRSSRGGHQTANDACAKALNSGSADEALNIIREFRSKDFIFSFHNLQLNLERIFAKPQTPYESRFCPNSFTRVPHVILDWAAKNVCDSA
        D I+WGTFQVDGRS+RGG QTANDA A+ALN+GSA+ AL IIRE   +DFIF +HNL+ NL+RIF+ P + Y S F  +SF  VP +I DWAA+NV DSA
Subjt:  DTIEWGTFQVDGRSSRGGHQTANDACAKALNSGSADEALNIIREFRSKDFIFSFHNLQLNLERIFAKPQTPYESRFCPNSFTRVPHVILDWAAKNVCDSA

Query:  ARPDRPISIIIEGPSRVGKTVWARSLGKHNYLCGHLDLSGKVYTNDAWYNVIDDVDPRYLKHYKELMGAQKSWQSNTKYGK
        ARPDRPISI+IEGPSR+GKTVWARSLG HNYLCGHLDLS KVY+N AWYNVIDDV+P+YLKH+KE MGAQK WQSN KYGK
Subjt:  ARPDRPISIIIEGPSRVGKTVWARSLGKHNYLCGHLDLSGKVYTNDAWYNVIDDVDPRYLKHYKELMGAQKSWQSNTKYGK

NP_808771.1 rep protein [Sweet potato leaf curl Georgia virus] (e value: 2.2e-73; % identity: 73.48)
Query:  DTIEWGTFQVDGRSSRGGHQTANDACAKALNSGSADEALNIIREFRSKDFIFSFHNLQLNLERIFAKPQTPYESRFCPNSFTRVPHVILDWAAKNVCDSA
        D I+WGTFQVDGRS+RGG QTANDA A+ALN+GSA+ AL IIRE   +DFIF +HNL+ NL+RIF+ P + Y S F  +SF  VP +I DWAA+NV DSA
Subjt:  DTIEWGTFQVDGRSSRGGHQTANDACAKALNSGSADEALNIIREFRSKDFIFSFHNLQLNLERIFAKPQTPYESRFCPNSFTRVPHVILDWAAKNVCDSA

Query:  ARPDRPISIIIEGPSRVGKTVWARSLGKHNYLCGHLDLSGKVYTNDAWYNVIDDVDPRYLKHYKELMGAQKSWQSNTKYGK
        ARPDRPISI+IEGPSR+GKTVWARSLG HNYLCGHLDLS KVY+N AWYNVIDDV+P+YLKH+KE MGAQK WQSN KYGK
Subjt:  ARPDRPISIIIEGPSRVGKTVWARSLGKHNYLCGHLDLSGKVYTNDAWYNVIDDVDPRYLKHYKELMGAQKSWQSNTKYGK

QCQ82601.1 replication associated protein [Sweet potato leaf curl virus] (e value: 2.2e-73; % identity: 74.03)
Query:  DTIEWGTFQVDGRSSRGGHQTANDACAKALNSGSADEALNIIREFRSKDFIFSFHNLQLNLERIFAKPQTPYESRFCPNSFTRVPHVILDWAAKNVCDSA
        D I+WG FQVDGRS+RGG QTANDA A+ALN+GSA+ AL IIRE   KDFIF +HNL+ NL+RIF+ P + Y S F  +SF  VP +I DWAA+NV DSA
Subjt:  DTIEWGTFQVDGRSSRGGHQTANDACAKALNSGSADEALNIIREFRSKDFIFSFHNLQLNLERIFAKPQTPYESRFCPNSFTRVPHVILDWAAKNVCDSA

Query:  ARPDRPISIIIEGPSRVGKTVWARSLGKHNYLCGHLDLSGKVYTNDAWYNVIDDVDPRYLKHYKELMGAQKSWQSNTKYGK
        ARPDRPISI+IEGPSR+GKTVWARSLG HNYLCGHLDLS KVY+N AWYNVIDDV+P+YLKH+KE MGAQK WQSNTKYGK
Subjt:  ARPDRPISIIIEGPSRVGKTVWARSLGKHNYLCGHLDLSGKVYTNDAWYNVIDDVDPRYLKHYKELMGAQKSWQSNTKYGK

TrEMBL top hits (e value, % identity, alignment)
A0A088BDS5 Replication-associated protein (e value: 1.1e-73; % identity: 73.48)
Query:  DTIEWGTFQVDGRSSRGGHQTANDACAKALNSGSADEALNIIREFRSKDFIFSFHNLQLNLERIFAKPQTPYESRFCPNSFTRVPHVILDWAAKNVCDSA
        D I+WGTFQVDGRS+RGG QTANDA A+ALN+GSA+ AL IIRE   +DFIF +HNL+ NL+RIF+ P + Y S F  +SF  VP +I DWAA+NV DSA
Subjt:  DTIEWGTFQVDGRSSRGGHQTANDACAKALNSGSADEALNIIREFRSKDFIFSFHNLQLNLERIFAKPQTPYESRFCPNSFTRVPHVILDWAAKNVCDSA

Query:  ARPDRPISIIIEGPSRVGKTVWARSLGKHNYLCGHLDLSGKVYTNDAWYNVIDDVDPRYLKHYKELMGAQKSWQSNTKYGK
        ARPDRPISI+IEGPSR+GKTVWARSLG HNYLCGHLDLS KVY+N AWYNVIDDV+P+YLKH+KE MGAQK WQSN KYGK
Subjt:  ARPDRPISIIIEGPSRVGKTVWARSLGKHNYLCGHLDLSGKVYTNDAWYNVIDDVDPRYLKHYKELMGAQKSWQSNTKYGK

A0A1U9Y7P8 Replication-associated protein (e value: 1.1e-73; % identity: 73.48)
Query:  DTIEWGTFQVDGRSSRGGHQTANDACAKALNSGSADEALNIIREFRSKDFIFSFHNLQLNLERIFAKPQTPYESRFCPNSFTRVPHVILDWAAKNVCDSA
        D I+WGTFQVDGRS+RGG QTANDA A+ALN+GSA+ AL IIRE   +DFIF +HNL+ NL+RIF+ P + Y S F  +SF  VP +I DWAA+NV DSA
Subjt:  DTIEWGTFQVDGRSSRGGHQTANDACAKALNSGSADEALNIIREFRSKDFIFSFHNLQLNLERIFAKPQTPYESRFCPNSFTRVPHVILDWAAKNVCDSA

Query:  ARPDRPISIIIEGPSRVGKTVWARSLGKHNYLCGHLDLSGKVYTNDAWYNVIDDVDPRYLKHYKELMGAQKSWQSNTKYGK
        ARPDRPISI+IEGPSR+GKTVWARSLG HNYLCGHLDLS KVY+N AWYNVIDDV+P+YLKH+KE MGAQK WQSN KYGK
Subjt:  ARPDRPISIIIEGPSRVGKTVWARSLGKHNYLCGHLDLSGKVYTNDAWYNVIDDVDPRYLKHYKELMGAQKSWQSNTKYGK

A0A4P8PIR9 Replication-associated protein (e value: 1.1e-73; % identity: 74.03)
Query:  DTIEWGTFQVDGRSSRGGHQTANDACAKALNSGSADEALNIIREFRSKDFIFSFHNLQLNLERIFAKPQTPYESRFCPNSFTRVPHVILDWAAKNVCDSA
        D I+WG FQVDGRS+RGG QTANDA A+ALN+GSA+ AL IIRE   KDFIF +HNL+ NL+RIF+ P + Y S F  +SF  VP +I DWAA+NV DSA
Subjt:  DTIEWGTFQVDGRSSRGGHQTANDACAKALNSGSADEALNIIREFRSKDFIFSFHNLQLNLERIFAKPQTPYESRFCPNSFTRVPHVILDWAAKNVCDSA

Query:  ARPDRPISIIIEGPSRVGKTVWARSLGKHNYLCGHLDLSGKVYTNDAWYNVIDDVDPRYLKHYKELMGAQKSWQSNTKYGK
        ARPDRPISI+IEGPSR+GKTVWARSLG HNYLCGHLDLS KVY+N AWYNVIDDV+P+YLKH+KE MGAQK WQSNTKYGK
Subjt:  ARPDRPISIIIEGPSRVGKTVWARSLGKHNYLCGHLDLSGKVYTNDAWYNVIDDVDPRYLKHYKELMGAQKSWQSNTKYGK

I6LGU5 Replication-associated protein (e value: 1.4e-73; % identity: 73.48)
Query:  DTIEWGTFQVDGRSSRGGHQTANDACAKALNSGSADEALNIIREFRSKDFIFSFHNLQLNLERIFAKPQTPYESRFCPNSFTRVPHVILDWAAKNVCDSA
        D I+WGTFQVDGRS+RGG QTANDA A+ALN+GSA+ AL IIRE   +DFIF +HNL+ NL+RIF+ P + Y S F  +SF  VP +I DWAA+NV DSA
Subjt:  DTIEWGTFQVDGRSSRGGHQTANDACAKALNSGSADEALNIIREFRSKDFIFSFHNLQLNLERIFAKPQTPYESRFCPNSFTRVPHVILDWAAKNVCDSA

Query:  ARPDRPISIIIEGPSRVGKTVWARSLGKHNYLCGHLDLSGKVYTNDAWYNVIDDVDPRYLKHYKELMGAQKSWQSNTKYGK
        ARPDRPISI+IEGPSR+GKTVWARSLG HNYLCGHLDLS KVY+N AWYNVIDDV+P+YLKH+KE MGAQK WQSN KYGK
Subjt:  ARPDRPISIIIEGPSRVGKTVWARSLGKHNYLCGHLDLSGKVYTNDAWYNVIDDVDPRYLKHYKELMGAQKSWQSNTKYGK

Q8V5Z4 Replication-associated protein (e value: 1.1e-73; % identity: 73.48)
Query:  DTIEWGTFQVDGRSSRGGHQTANDACAKALNSGSADEALNIIREFRSKDFIFSFHNLQLNLERIFAKPQTPYESRFCPNSFTRVPHVILDWAAKNVCDSA
        D I+WGTFQVDGRS+RGG QTANDA A+ALN+GSA+ AL IIRE   +DFIF +HNL+ NL+RIF+ P + Y S F  +SF  VP +I DWAA+NV DSA
Subjt:  DTIEWGTFQVDGRSSRGGHQTANDACAKALNSGSADEALNIIREFRSKDFIFSFHNLQLNLERIFAKPQTPYESRFCPNSFTRVPHVILDWAAKNVCDSA

Query:  ARPDRPISIIIEGPSRVGKTVWARSLGKHNYLCGHLDLSGKVYTNDAWYNVIDDVDPRYLKHYKELMGAQKSWQSNTKYGK
        ARPDRPISI+IEGPSR+GKTVWARSLG HNYLCGHLDLS KVY+N AWYNVIDDV+P+YLKH+KE MGAQK WQSN KYGK
Subjt:  ARPDRPISIIIEGPSRVGKTVWARSLGKHNYLCGHLDLSGKVYTNDAWYNVIDDVDPRYLKHYKELMGAQKSWQSNTKYGK

SwissProt top hits (e value, % identity, alignment)
P14972 Replication-associated protein (e value: 3.3e-72; % identity: 70.17)
Query:  DTIEWGTFQVDGRSSRGGHQTANDACAKALNSGSADEALNIIREFRSKDFIFSFHNLQLNLERIFAKPQTPYESRFCPNSFTRVPHVILDWAAKNVCDSA
        DT+EWG FQ+DGRS+RGG Q+ANDA AKALNSGS  EALN+IRE   KDF+  FHNL  NL+RIF +P  PY S F  +SF +VP  + +W A NV DSA
Subjt:  DTIEWGTFQVDGRSSRGGHQTANDACAKALNSGSADEALNIIREFRSKDFIFSFHNLQLNLERIFAKPQTPYESRFCPNSFTRVPHVILDWAAKNVCDSA

Query:  ARPDRPISIIIEGPSRVGKTVWARSLGKHNYLCGHLDLSGKVYTNDAWYNVIDDVDPRYLKHYKELMGAQKSWQSNTKYGK
        ARP RP SI+IEG SR GKT+WARSLG HNYLCGHLDLS KV+ NDAWYNVIDDVDP YLKH+KE MG+Q+ WQSNTKYGK
Subjt:  ARPDRPISIIIEGPSRVGKTVWARSLGKHNYLCGHLDLSGKVYTNDAWYNVIDDVDPRYLKHYKELMGAQKSWQSNTKYGK

P14982 Replication-associated protein (e value: 2.8e-71; % identity: 70.17)
Query:  DTIEWGTFQVDGRSSRGGHQTANDACAKALNSGSADEALNIIREFRSKDFIFSFHNLQLNLERIFAKPQTPYESRFCPNSFTRVPHVILDWAAKNVCDSA
        DT+EWG FQ+DGRS+RGG Q+ANDA AKALNSGS  EALN+IRE   KDF+  FHNL  NL+RIF +P  PY S F  +SF +VP  I +W A NV DSA
Subjt:  DTIEWGTFQVDGRSSRGGHQTANDACAKALNSGSADEALNIIREFRSKDFIFSFHNLQLNLERIFAKPQTPYESRFCPNSFTRVPHVILDWAAKNVCDSA

Query:  ARPDRPISIIIEGPSRVGKTVWARSLGKHNYLCGHLDLSGKVYTNDAWYNVIDDVDPRYLKHYKELMGAQKSWQSNTKYGK
        ARP RP SI+IEG SR GKT+WARSLG HNYLCGHLDLS KV+ N AWYNVIDDVDP YLKH+KE MG+Q+ WQSNTKYGK
Subjt:  ARPDRPISIIIEGPSRVGKTVWARSLGKHNYLCGHLDLSGKVYTNDAWYNVIDDVDPRYLKHYKELMGAQKSWQSNTKYGK

P27260 Replication-associated protein (e value: 9.0e-70; % identity: 64.64)
Query:  DTIEWGTFQVDGRSSRGGHQTANDACAKALNSGSADEALNIIREFRSKDFIFSFHNLQLNLERIFAKPQTPYESRFCPNSFTRVPHVILDWAAKNVCDSA
        D +EWGTFQ+DGRS+RGG QTANDA AKA+N+GS  +AL++I+E   +D++  FHN+  NL+++F  P  PY S F  +SF +VP  +  W ++NV D+A
Subjt:  DTIEWGTFQVDGRSSRGGHQTANDACAKALNSGSADEALNIIREFRSKDFIFSFHNLQLNLERIFAKPQTPYESRFCPNSFTRVPHVILDWAAKNVCDSA

Query:  ARPDRPISIIIEGPSRVGKTVWARSLGKHNYLCGHLDLSGKVYTNDAWYNVIDDVDPRYLKHYKELMGAQKSWQSNTKYGK
        ARP RP+SI+IEG SR GKT WARSLG HNYLCGHLDLS KVY+N+AWYNVIDDVDP YLKH+KE MGAQ+ WQSNTKYGK
Subjt:  ARPDRPISIIIEGPSRVGKTVWARSLGKHNYLCGHLDLSGKVYTNDAWYNVIDDVDPRYLKHYKELMGAQKSWQSNTKYGK

P36279 Replication-associated protein (e value: 2.5e-72; % identity: 69.61)
Query:  DTIEWGTFQVDGRSSRGGHQTANDACAKALNSGSADEALNIIREFRSKDFIFSFHNLQLNLERIFAKPQTPYESRFCPNSFTRVPHVILDWAAKNVCDSA
        DT+EWG FQ+DGRS+RGG Q+ANDA A+ALN+GS  EALN++RE   KD++  FHNL  NL+RIF  P   Y S F  +SF RVP  + +W A+NV D+A
Subjt:  DTIEWGTFQVDGRSSRGGHQTANDACAKALNSGSADEALNIIREFRSKDFIFSFHNLQLNLERIFAKPQTPYESRFCPNSFTRVPHVILDWAAKNVCDSA

Query:  ARPDRPISIIIEGPSRVGKTVWARSLGKHNYLCGHLDLSGKVYTNDAWYNVIDDVDPRYLKHYKELMGAQKSWQSNTKYGK
        ARP RPISI+IEG SR GKTVWARSLG HNYLCGHLDLS KVY+NDAWYNVIDDVDP YLKH+KE MGAQ+ WQSNTKYGK
Subjt:  ARPDRPISIIIEGPSRVGKTVWARSLGKHNYLCGHLDLSGKVYTNDAWYNVIDDVDPRYLKHYKELMGAQKSWQSNTKYGK

P38609 Replication-associated protein (e value: 5.3e-70; % identity: 49.81)
Query:  IDKDEVLAQILQINLCRIPSSLRFAGNCIKTTIPTFTSFYSLKGSSNARTNASSTSIPQPRRDSSIRTYKVQSLVPMS--KPTSTRTWDTIEWGTFQVDG
        + K+E L Q+LQ+        ++      +   P        +G  N + N     +  P R +      +Q     S  K    +  D +EWGTFQ+DG
Subjt:  IDKDEVLAQILQINLCRIPSSLRFAGNCIKTTIPTFTSFYSLKGSSNARTNASSTSIPQPRRDSSIRTYKVQSLVPMS--KPTSTRTWDTIEWGTFQVDG

Query:  RSSRGGHQTANDACAKALNSGSADEALNIIREFRSKDFIFSFHNLQLNLERIFAKPQTPYESRFCPNSFTRVPHVILDWAAKNVCDSAARPDRPISIIIE
        RS+RGG QTANDA AKA+N+GS  EAL++I+E   +D+I  FHN+  NL+R+F  P  PY S F  +SF +VP  +  W ++NV D+AARP RP+SI+IE
Subjt:  RSSRGGHQTANDACAKALNSGSADEALNIIREFRSKDFIFSFHNLQLNLERIFAKPQTPYESRFCPNSFTRVPHVILDWAAKNVCDSAARPDRPISIIIE

Query:  GPSRVGKTVWARSLGKHNYLCGHLDLSGKVYTNDAWYNVIDDVDPRYLKHYKELMGAQKSWQSNTKYGK
        G SR GKT+WARSLG HNYLCGHLDLS KVY+N+AWYNVIDDVDP YLKH+KE MG+Q+ WQSNTKYGK
Subjt:  GPSRVGKTVWARSLGKHNYLCGHLDLSGKVYTNDAWYNVIDDVDPRYLKHYKELMGAQKSWQSNTKYGK

Arabidopsis top hits (e value, % identity, alignment)
No hits found
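
In the alignments above, the middle line follows the usual BLAST convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch. A rough sketch of how a percent identity could be derived from such a midline is shown below. The function name `percent_identity` is hypothetical, the example uses only the first ten columns of the ABG90906.1 alignment, and the denominator is assumed to be the number of aligned columns, which may differ from how the database computes its figure.

```python
def percent_identity(midline: str) -> float:
    """Percent of alignment columns whose midline character is a letter."""
    if not midline:
        raise ValueError("empty midline")
    # Letters mark identical residues; '+' and ' ' are not counted.
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / len(midline)


# First ten columns of the ABG90906.1 alignment (query: DTIEWGTFQV).
midline_fragment = "D I+WGTFQV"
print(round(percent_identity(midline_fragment), 2))
```

The midline must keep its leading and internal spaces, since each space is one aligned column.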

Sequences
CDS sequence
ATGTCTGGGCAGTGGGACAGGCACAGACAGCAACAGTTGGACGTTGAAGATCTACATCCGTCGCACCATCCGGCGAATGCTGTACGCGTTAATCTACAAGTTGCTCATTG
TAAAGGCGACAACGGACAACTGCAATTAAATCCTTATCAGATCAAAGCCCAAATGTTCATTGACAAAGATGAAGTTCTTGCGCAGATCTTGCAAATAAATCTGTGTCGAA
TACCAAGTTCGTTAAGGTTTGCAGGGAATTGCATCAAGACGACAATCCCCACATTCACGTCCTTCTACAGTTTGAAGGGAAGTTCAAATGCCAGAACGAACGCCTCTTCG
ACCTCAATTCCCCAACCTCGTCGCGACAGTTCCATCCGAACATACAAAGTGCAAAGTCTAGTTCCGATGTCAAAGCCTACATCAACAAGGACATGGGATACGATCGAATG
GGGAACGTTTCAGGTGGACGGAAGAAGTTCTCGCGGAGGTCATCAAACCGCTAACGACGCTTGTGCAAAAGCTCTGAATTCTGGATCGGCTGATGAGGCGTTAAACATTA
TTAGAGAGTTTCGTTCGAAGGATTTCATTTTCAGTTTTCACAATCTACAATTAAACTTGGAAAGAATTTTTGCAAAACCACAAACGCCGTATGAATCCAGATTTTGTCCA
AATTCGTTCACTAGGGTGCCTCATGTAATCTTGGACTGGGCCGCGAAGAACGTCTGTGATTCCGCTGCGCGGCCGGATAGACCCATAAGTATAATCATAGAGGGACCAAG
CAGAGTAGGCAAAACAGTTTGGGCACGATCATTGGGTAAACATAATTACTTATGTGGCCATTTGGATTTGAGCGGTAAAGTATATACTAATGACGCATGGTACAACGTGA
TAGATGACGTGGATCCACGATATCTAAAGCACTATAAAGAGCTCATGGGGGCCCAGAAGAGCTGGCAAAGCAACACAAAGTACGGAAAGTCCGGTAATGATTACTGGCGG
TATACCCGGAATCTTCCTCTGCAATCCTGGCGAAGGGTCTAG
mRNA sequence
ATGTCTGGGCAGTGGGACAGGCACAGACAGCAACAGTTGGACGTTGAAGATCTACATCCGTCGCACCATCCGGCGAATGCTGTACGCGTTAATCTACAAGTTGCTCATTG
TAAAGGCGACAACGGACAACTGCAATTAAATCCTTATCAGATCAAAGCCCAAATGTTCATTGACAAAGATGAAGTTCTTGCGCAGATCTTGCAAATAAATCTGTGTCGAA
TACCAAGTTCGTTAAGGTTTGCAGGGAATTGCATCAAGACGACAATCCCCACATTCACGTCCTTCTACAGTTTGAAGGGAAGTTCAAATGCCAGAACGAACGCCTCTTCG
ACCTCAATTCCCCAACCTCGTCGCGACAGTTCCATCCGAACATACAAAGTGCAAAGTCTAGTTCCGATGTCAAAGCCTACATCAACAAGGACATGGGATACGATCGAATG
GGGAACGTTTCAGGTGGACGGAAGAAGTTCTCGCGGAGGTCATCAAACCGCTAACGACGCTTGTGCAAAAGCTCTGAATTCTGGATCGGCTGATGAGGCGTTAAACATTA
TTAGAGAGTTTCGTTCGAAGGATTTCATTTTCAGTTTTCACAATCTACAATTAAACTTGGAAAGAATTTTTGCAAAACCACAAACGCCGTATGAATCCAGATTTTGTCCA
AATTCGTTCACTAGGGTGCCTCATGTAATCTTGGACTGGGCCGCGAAGAACGTCTGTGATTCCGCTGCGCGGCCGGATAGACCCATAAGTATAATCATAGAGGGACCAAG
CAGAGTAGGCAAAACAGTTTGGGCACGATCATTGGGTAAACATAATTACTTATGTGGCCATTTGGATTTGAGCGGTAAAGTATATACTAATGACGCATGGTACAACGTGA
TAGATGACGTGGATCCACGATATCTAAAGCACTATAAAGAGCTCATGGGGGCCCAGAAGAGCTGGCAAAGCAACACAAAGTACGGAAAGTCCGGTAATGATTACTGGCGG
TATACCCGGAATCTTCCTCTGCAATCCTGGCGAAGGGTCTAG
Protein sequence
MSGQWDRHRQQQLDVEDLHPSHHPANAVRVNLQVAHCKGDNGQLQLNPYQIKAQMFIDKDEVLAQILQINLCRIPSSLRFAGNCIKTTIPTFTSFYSLKGSSNARTNASS
TSIPQPRRDSSIRTYKVQSLVPMSKPTSTRTWDTIEWGTFQVDGRSSRGGHQTANDACAKALNSGSADEALNIIREFRSKDFIFSFHNLQLNLERIFAKPQTPYESRFCP
NSFTRVPHVILDWAAKNVCDSAARPDRPISIIIEGPSRVGKTVWARSLGKHNYLCGHLDLSGKVYTNDAWYNVIDDVDPRYLKHYKELMGAQKSWQSNTKYGKSGNDYWR
YTRNLPLQSWRRV
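
The protein sequence can be cross-checked against the CDS by translating it. The sketch below assumes the standard genetic code (NCBI translation table 1) and reading frame 0; `translate` is a hypothetical helper written for illustration, and only the first 30 and last 42 nucleotides of the CDS are used to keep the example short.

```python
# Standard genetic code, with codons ordered by base in the order T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a nucleotide string in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


cds_start = "ATGTCTGGGCAGTGGGACAGGCACAGACAG"            # first 30 nt of the CDS
cds_end = "TATACCCGGAATCTTCCTCTGCAATCCTGGCGAAGGGTCTAG"  # last 42 nt of the CDS

print(translate(cds_start))
print(translate(cds_end))
```

The two fragments translate to the first ten and last thirteen residues of the protein sequence listed above, with the final TAG read as the stop codon.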