CuGenDBv2

Lag0026564 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0026564
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Reverse transcriptase domain-containing protein
Genome location: chr10:39091186..39092849
RNA-Seq Expression: Lag0026564
Synteny: Lag0026564
Gene Ontology terms: NA
InterPro domains: IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
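
The genome location given above uses a chrom:start..end notation. As a minimal sketch, it can be split into its parts as follows; the function names are hypothetical, and the span length assumes 1-based, inclusive coordinates, which the page does not state.

```python
import re


def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive at both ends.
    return end - start + 1


chrom, start, end = parse_location("chr10:39091186..39092849")
print(chrom, start, end, span_length(start, end))
```

Under that coordinate assumption the gene spans 1664 bp on chr10.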


Homology
GenBank top hits | e value | % identity | Alignment
KAA3475057.1 reverse transcriptase [Gossypium australe] | 7.2e-43 | 39.27
Query:  GKGVQVDGGRE------KESLDVVLRSMSRFHIDVMVKADK--GTWRFTGFYGDPEVANREHSWNLLRRLHDLYSCPWVVGGDFNELLRAEEKWGSNIRC
        G  +  +G RE      KE ++V LRS S+ HIDVMVK +     WRFTGFYG P   N+  SWNLLRRL +  S PW+V GDFNE++ + EK G +IR 
Subjt:  GKGVQVDGGRE------KESLDVVLRSMSRFHIDVMVKADK--GTWRFTGFYGDPEVANREHSWNLLRRLHDLYSCPWVVGGDFNELLRAEEKWGSNIRC

Query:  QNQMQAFRDAIDDCDLVDIGFKGSQYTWYKLKGKEVVMLERLDRGLCNSEFYAMFPATNVRNLDFCLSDHCPILIEANSSLSGQRRKGGYYFRFEEVWTQ
        + +M+AFR+ +++C L DIGF G+ +TW +    +  + ERLDR + N+++  +F   N+ +L   +SDHCPIL+   S     R     YF+FE  WT 
Subjt:  QNQMQAFRDAIDDCDLVDIGFKGSQYTWYKLKGKEVVMLERLDRGLCNSEFYAMFPATNVRNLDFCLSDHCPILIEANSSLSGQRRKGGYYFRFEEVWTQ

Query:  FDGCREVVEKGWELPP----QKQGRVQEGRVDSWRRRVKKCADELSK
         +   E ++K W        +K G++Q   +  W   +KK  + L +
Subjt:  FDGCREVVEKGWELPP----QKQGRVQEGRVDSWRRRVKKCADELSK

KAF5443558.1 hypothetical protein F2P56_036105, partial [Juglans regia] | 2.2e-47 | 40.94
Query:  KESLDVVLRSMSRFHIDVMVK-ADKGTWRFTGFYGDPEVANREHSWNLLRRLHDLYSCPWVVGGDFNELLRAEEKWGSNIRCQNQMQAFRDAIDDCDLVD
        K  L V ++S S  HID +++  D   WRFTG YG+PEV NR  +WNLLRRL+     PW+VGGDFNELL   EK G   R +NQM+AFR+ I DC L D
Subjt:  KESLDVVLRSMSRFHIDVMVK-ADKGTWRFTGFYGDPEVANREHSWNLLRRLHDLYSCPWVVGGDFNELLRAEEKWGSNIRCQNQMQAFRDAIDDCDLVD

Query:  IGFKGSQYTWYKLKGKEVVMLERLDRGLCNSEFYAMFPATNVRNLDFCLSDHCPILIEANSSLSGQRRKGGYYFRFEEVWTQFDGCREVVEKGWELPPQK
        +GF+G +YTW   +     + ERLDR L N++F A+FP   VR+     SDH P+  +  S    +R K    FRFE +W   + C +++ + W      
Subjt:  IGFKGSQYTWYKLKGKEVVMLERLDRGLCNSEFYAMFPATNVRNLDFCLSDHCPILIEANSSLSGQRRKGGYYFRFEEVWTQFDGCREVVEKGWELPPQK

Query:  QGRVQEGRVDSWRRRVKKCADELSKWGKEKKGNYAHRISLARQQLHQAVVSQRD
              GR++   R +KKC ++LS+W K+  GN + ++++A+  L Q  +  RD
Subjt:  QGRVQEGRVDSWRRRVKKCADELSKWGKEKKGNYAHRISLARQQLHQAVVSQRD

XP_023886153.1 uncharacterized protein LOC111998282 [Quercus suber] | 1.9e-43 | 38.31
Query:  KESLDVVLRSMSRFHIDVMV-KADKGTWRFTGFYGDPEVANREHSWNLLRRLHDLYSCPWVVGGDFNELLRAEEKWGSNIRCQNQMQAFRDAIDDCDLVD
        K ++D+ + + S+ HID +V K  +G WRFTGFYG+P    R  SWNLLR L+   + PW+  GDFNE+ R  EK G ++R  +QMQ FRDAID+C  +D
Subjt:  KESLDVVLRSMSRFHIDVMV-KADKGTWRFTGFYGDPEVANREHSWNLLRRLHDLYSCPWVVGGDFNELLRAEEKWGSNIRCQNQMQAFRDAIDDCDLVD

Query:  IGFKGSQYTWYKLKGKEVVMLERLDRGLCNSEFYAMFPATNVRNLDFCLSDHCPILIEANSSLSGQRRKGGYYFRFEEVWTQFDGCREVVEKGWELPPQK
        +GF GSQ+TW K       + ERLDRGL NSE+   F  T V +L    SDHCPI I   + +  +  K    F FEE+W    GC + ++  W +    
Subjt:  IGFKGSQYTWYKLKGKEVVMLERLDRGLCNSEFYAMFPATNVRNLDFCLSDHCPILIEANSSLSGQRRKGGYYFRFEEVWTQFDGCREVVEKGWELPPQK

Query:  QGRVQEGRVDSWRRRVKKCADELSKWGKEKKGNYAHRISLARQQLHQA
            +        +++++C  EL KW     G+    +   ++++ QA
Subjt:  QGRVQEGRVDSWRRRVKKCADELSKWGKEKKGNYAHRISLARQQLHQA

XP_023892689.1 uncharacterized protein LOC112004687 [Quercus suber] | 5.0e-44 | 41.63
Query:  KESLDVVLRSMSRFHIDVMVKAD-KGTWRFTGFYGDPEVANREHSWNLLRRLHDLYSCPWVVGGDFNELLRAEEKWGSNIRCQNQMQAFRDAIDDCDLVD
        K  L V + S S  HID ++  D +  WRFTGFYG+PE   R  SW+LL  LH   S PW+  GDFNE+L+  EK G   R   QMQ FRD +D+C L+D
Subjt:  KESLDVVLRSMSRFHIDVMVKAD-KGTWRFTGFYGDPEVANREHSWNLLRRLHDLYSCPWVVGGDFNELLRAEEKWGSNIRCQNQMQAFRDAIDDCDLVD

Query:  IGFKGSQYTWYKLKGKEVVMLERLDRGLCNSEFYAMFPATNVRNLDFCLSDHCPILIEANSSLSGQRRKGGYYFRFEEVWTQFDGCREVVEKGWELPPQK
        IGFKGS YTW KL      + E LDR + + E+++ +P T V ++D   SDH  + IE  S L   +RK    FRFEE+W    GC E VE  W++  ++
Subjt:  IGFKGSQYTWYKLKGKEVVMLERLDRGLCNSEFYAMFPATNVRNLDFCLSDHCPILIEANSSLSGQRRKGGYYFRFEEVWTQFDGCREVVEKGWELPPQK

Query:  QGRVQEGRVDSWRRRVKKCADELSKWGKEKKGNYAHRISLARQQL
        +G     RV    ++V+ C   L++W K+  GN   ++   R++L
Subjt:  QGRVQEGRVDSWRRRVKKCADELSKWGKEKKGNYAHRISLARQQL

XP_030967607.1 uncharacterized protein LOC115988108 [Quercus lobata] | 6.5e-44 | 41.28
Query:  KESLDVVLRSMSRFHIDVMV-KADKGTWRFTGFYGDPEVANREHSWNLLRRLHDLYSCPWVVGGDFNELLRAEEKWGSNIRCQNQMQAFRDAIDDCDLVD
        K    V + S S+FHID +V +     WRFTGFYG+P    RE +W++LR L      PW   GDFNE+L+ EEK G  IR  +QMQAFRD +DDC LVD
Subjt:  KESLDVVLRSMSRFHIDVMV-KADKGTWRFTGFYGDPEVANREHSWNLLRRLHDLYSCPWVVGGDFNELLRAEEKWGSNIRCQNQMQAFRDAIDDCDLVD

Query:  IGFKGSQYTWYKLKGKEVVMLERLDRGLCNSEFYAMFPATNVRNLDFCLSDHCPILIEANSSLSGQR--RKGGYYFRFEEVWTQFDGCREVVEKGWELPP
        +GF G ++TW+  +   ++  ERLDRG+ N ++ A FPA +VR+L    SDH PI +  + +   QR  R+    FRFEE+W    GC+++V + WE+  
Subjt:  IGFKGSQYTWYKLKGKEVVMLERLDRGLCNSEFYAMFPATNVRNLDFCLSDHCPILIEANSSLSGQR--RKGGYYFRFEEVWTQFDGCREVVEKGWELPP

Query:  QKQGRVQEGRVDSWRRRVKKCADELSKWGKEKKGN
        Q+ G      +    ++++KC   L  W K+  G+
Subjt:  QKQGRVQEGRVDSWRRRVKKCADELSKWGKEKKGN

TrEMBL top hits | e value | % identity | Alignment
A0A2N9EVR9 F-box domain-containing protein | 1.7e-45 | 40.73
Query:  KESLDVVLRSMSRFHIDVMV-KADKGTWRFTGFYGDPEVANREHSWNLLRRLHDLYSCPWVVGGDFNELLRAEEKWGSNIRCQNQMQAFRDAIDDCDLVD
        K+ +++ + S S  HID +V +A +  WRFTGFYG PE   RE SW LLRRL+   S PW   GDFNEL+R EEK G + R + QMQ FRD +D+C  VD
Subjt:  KESLDVVLRSMSRFHIDVMV-KADKGTWRFTGFYGDPEVANREHSWNLLRRLHDLYSCPWVVGGDFNELLRAEEKWGSNIRCQNQMQAFRDAIDDCDLVD

Query:  IGFKGSQYTWYKLKGKEVVMLERLDRGLCNSEFYAMFPATNVRNLDFCLSDHCPILIEANSSLSGQRRKGGYYFRFEEVWTQFDGCREVVEKGWELPPQK
        +GF G ++TW   +  ++   ERLDR +   E+   FP+T V +LD   SDH P+ +  +  +   R+     FRFEEVWT   GC EV+   W+ P   
Subjt:  IGFKGSQYTWYKLKGKEVVMLERLDRGLCNSEFYAMFPATNVRNLDFCLSDHCPILIEANSSLSGQRRKGGYYFRFEEVWTQFDGCREVVEKGWELPPQK

Query:  QGRVQEGRVDSWRRRVKKCADELSKWGKEKKGNYAHRISLARQQLHQA
           V    + S   ++  C  EL  W K+  GN   +I    ++L QA
Subjt:  QGRVQEGRVDSWRRRVKKCADELSKWGKEKKGNYAHRISLARQQLHQA

A0A2N9G258 Uncharacterized protein | 1.1e-44 | 40.73
Query:  KESLDVVLRSMSRFHIDVMV-KADKGTWRFTGFYGDPEVANREHSWNLLRRLHDLYSCPWVVGGDFNELLRAEEKWGSNIRCQNQMQAFRDAIDDCDLVD
        K+ +++ + S    HID +V +A    WRFTGFYG PE   RE SW LLRRL+   + PW   GDFNEL+R EEK G + R + QMQ FRD +D+C  VD
Subjt:  KESLDVVLRSMSRFHIDVMV-KADKGTWRFTGFYGDPEVANREHSWNLLRRLHDLYSCPWVVGGDFNELLRAEEKWGSNIRCQNQMQAFRDAIDDCDLVD

Query:  IGFKGSQYTWYKLKGKEVVMLERLDRGLCNSEFYAMFPATNVRNLDFCLSDHCPILIEANSSLSGQRRKGGYYFRFEEVWTQFDGCREVVEKGWELPPQK
        +GF G ++TW   +  ++   ERLDR +   E+   FP+T V++LD   SDH P+ +    S    RR     FRFEEVWT   GC EV+   W+ P   
Subjt:  IGFKGSQYTWYKLKGKEVVMLERLDRGLCNSEFYAMFPATNVRNLDFCLSDHCPILIEANSSLSGQRRKGGYYFRFEEVWTQFDGCREVVEKGWELPPQK

Query:  QGRVQEGRVDSWRRRVKKCADELSKWGKEKKGNYAHRISLARQQLHQA
           V    + S   ++  C  EL  W K+  GN   +I     +L QA
Subjt:  QGRVQEGRVDSWRRRVKKCADELSKWGKEKKGNYAHRISLARQQLHQA

A0A2N9G656 Reverse transcriptase domain-containing protein | 2.4e-44 | 37.5
Query:  KESLDVVLRSMSRFHIDVMV-KADKGTWRFTGFYGDPEVANREHSWNLLRRLHDLYSCPWVVGGDFNELLRAEEKWGSNIRCQNQMQAFRDAIDDCDLVD
        ++ +++ +RS S  HID ++   D   WRFTGFYG P+  +RE SWNLLR L+  Y  PW+  GDFNE+ +  EK G   R + QM+ FR+A+D+C+L+D
Subjt:  KESLDVVLRSMSRFHIDVMV-KADKGTWRFTGFYGDPEVANREHSWNLLRRLHDLYSCPWVVGGDFNELLRAEEKWGSNIRCQNQMQAFRDAIDDCDLVD

Query:  IGFKGSQYTWYKLKGKEVVMLERLDRGLCNSEFYAMFPATNVRNLDFCLSDHCPILIEANSSLSGQRRKGGYYFRFEEVWTQFDGCREVVEKGWELPPQK
        +G++G+ YTW   +     +  RLDR + +S++   F    V++L    SDHCP+LI  N+ +    +K    FRFE++WT   GC E V + W+ P   
Subjt:  IGFKGSQYTWYKLKGKEVVMLERLDRGLCNSEFYAMFPATNVRNLDFCLSDHCPILIEANSSLSGQRRKGGYYFRFEEVWTQFDGCREVVEKGWELPPQK

Query:  QGRVQEGRVDSWRRRVKKCADELSKWGKEKKGNYAHRISLARQQLHQA
        QG      +    ++ K C   LS W K + G+    +     QL QA
Subjt:  QGRVQEGRVDSWRRRVKKCADELSKWGKEKKGNYAHRISLARQQLHQA

A0A2N9G7B6 Uncharacterized protein | 1.9e-44 | 37.64
Query:  QMRKKGKGVQVDGGREKESLDVVLRSMSRFHIDVMVKADK-GTWRFTGFYGDPEVANREHSWNLLRRLHDLYSCPWVVGGDFNELLRAEEKWGSNIRCQN
        Q R KG G+ +     K+ +++ + S S  HID +V  ++   WRFTGFYG PE  NRE SWNLLRRL+  +  PW   GDFNEL+R EEK G + R + 
Subjt:  QMRKKGKGVQVDGGREKESLDVVLRSMSRFHIDVMVKADK-GTWRFTGFYGDPEVANREHSWNLLRRLHDLYSCPWVVGGDFNELLRAEEKWGSNIRCQN

Query:  QMQAFRDAIDDCDLVDIGFKGSQYTWYKLKGKEVVMLERLDRGLCNSEFYAMFPATNVRNLDFCLSDHCPILIEANSSLSGQRRKGGYYFRFEEVWTQFD
        QMQ FRD +D+C  +D+GF G ++TW   +  ++   ERLDR +   ++   FP+  V +L+   SDH P+ +    +    ++     FRFEEVWT   
Subjt:  QMQAFRDAIDDCDLVDIGFKGSQYTWYKLKGKEVVMLERLDRGLCNSEFYAMFPATNVRNLDFCLSDHCPILIEANSSLSGQRRKGGYYFRFEEVWTQFD

Query:  GCREVVEKGWELPPQKQGRVQEGRVDSWRRRVKKCADELSKWGKEKKGNYAHRISLARQQLHQAVVSQRDV
        GC  VV+  W        + +   V     R+ K A+E+S  G++      HR+SL RQ+LH  +  + ++
Subjt:  GCREVVEKGWELPPQKQGRVQEGRVDSWRRRVKKCADELSKWGKEKKGNYAHRISLARQQLHQAVVSQRDV

A0A2N9H6V4 RNase H domain-containing protein | 3.7e-45 | 39.67
Query:  VVLRSMSRFHIDVMVKADKGT-WRFTGFYGDPEVANREHSWNLLRRLHDLYSCPWVVGGDFNELLRAEEKWGSNIRCQNQMQAFRDAIDDCDLVDIGFKG
        V + S S  HID ++  D  T WRFTGFYG P V+ +  +W+LLR L   ++ PW+ GGDFNELL+AEEKWG   R  +QM+AFR  +D+C  VD+GF G
Subjt:  VVLRSMSRFHIDVMVKADKGT-WRFTGFYGDPEVANREHSWNLLRRLHDLYSCPWVVGGDFNELLRAEEKWGSNIRCQNQMQAFRDAIDDCDLVDIGFKG

Query:  SQYTWYKLKGKEVVMLERLDRGLCNSEFYAMFPATNVRNLDFCLSDHCPILIEANSSLSGQRRKGGYYFRFEEVWTQFDGCREVVEKGWELPPQKQGRVQ
        S YTW+  +     +LERLDR L  ++++  FP++ V +L    SDH P+ +E + S   + R     FRFEE+WT   GC + +++ WE   + +G   
Subjt:  SQYTWYKLKGKEVVMLERLDRGLCNSEFYAMFPATNVRNLDFCLSDHCPILIEANSSLSGQRRKGGYYFRFEEVWTQFDGCREVVEKGWELPPQKQGRVQ

Query:  EGRVDSWRRRVKKCADELSKWGKEKKGNYAHRISLARQQLHQ
           V+    ++K C + L KW   + G+    I+     L Q
Subjt:  EGRVDSWRRRVKKCADELSKWGKEKKGNYAHRISLARQQLHQ

SwissProt top hits | e value | % identity | Alignment
No hits found
Arabidopsis top hits | e value | % identity | Alignment
AT1G43760.1 DNAse I-like superfamily protein | 6.8e-07 | 26.05
Query:  VVGGDFNELLRAEEKWG---SNIRCQNQMQAFRDAIDDCDLVDIGFKGSQYTWYKLKGKEVVMLERLDRGLCNSEFYAMFPATNVRNLDFCLSDHCPILI
        ++ GDF+++    + +    ++I  +  ++ F++ + D DLVDI  +G  YTW   +    + + +LDR + N ++++ FP+         +SDH P +I
Subjt:  VVGGDFNELLRAEEKWG---SNIRCQNQMQAFRDAIDDCDLVDIGFKGSQYTWYKLKGKEVVMLERLDRGLCNSEFYAMFPATNVRNLDFCLSDHCPILI

Query:  EANSSLSGQRRKGGYYFRF
            +L  + +K   YF F
Subjt:  EANSSLSGQRRKGGYYFRF


Sequences
CDS sequence
ATGGGAATGAGGGGAGGCGATGGTAAGGTAGGGGAGGGCGATGGTATGGAGAATCCTATCGAAGAAGGGGGTGAGGGTATGGAGAGGGATCGAAGTGGAGAGGAAGAGAA
CGTGGGGAGGGGTAATGGGCTTAATGGGCAAGAAAGGATGTCAGGTGAGGTGGAGGGGGGCGAGGGGAAAGGTCTGGAAGGGGAAGGGGAGGGGCTGGGGGAAGGTGAGG
AGGTCGCCGCTACAGGTGGAAGGGTGGGGGAGGAAGGGGTGAGTGAGGTGGCTAGAAAGGGGAAAGGAAAGGAGGTGTGTGTTGAAGAAACTGTGAAGGGAAAGGGGTTT
ATCCCGAAGGGAGGTGTGTCCATCAATGAGCCAAATGGAGAGAGCGTGGGGCTGAAGGGGACGATGGAAATGAATGTAGATGGGAATTTTCCCACTTCTGTGAAGGTAAG
GGAGGGTAAGAAGTGGAAGAGGAGGGGGAGGGATGAGGTAGTGGGAATGGACTGCGAGGAAAGTACCTTATTGGGCAAGAGAGGCCCGAGTGGAGAGACAGAAGATGGGG
CAAGTGTGGGTATGAGTGAGCAGATGAGGAAGAAGGGGAAGGGTGTCCAGGTGGACGGTGGTAGGGAGAAGGAGAGCCTGGATGTAGTGTTGAGATCGATGAGTAGGTTT
CATATTGATGTGATGGTAAAAGCTGATAAAGGGACCTGGAGGTTTACGGGGTTCTATGGGGACCCGGAGGTGGCCAATAGAGAGCATTCATGGAACTTGCTTCGAAGACT
GCATGATCTTTACTCATGTCCGTGGGTGGTGGGTGGTGATTTCAACGAGTTGTTGAGGGCAGAGGAGAAGTGGGGTAGTAATATTCGTTGTCAGAATCAGATGCAGGCGT
TTCGGGATGCTATTGATGATTGTGATTTAGTTGATATAGGTTTTAAAGGCTCTCAATATACGTGGTATAAATTGAAGGGAAAAGAAGTGGTAATGCTGGAAAGGCTAGAC
AGAGGCCTATGTAACTCGGAGTTTTATGCCATGTTCCCTGCTACAAATGTCCGGAACCTAGACTTTTGTTTGTCAGATCACTGTCCAATATTGATCGAGGCAAATAGTTC
GTTGTCGGGGCAGAGGAGAAAGGGAGGTTATTACTTTCGTTTTGAGGAGGTCTGGACCCAGTTCGACGGTTGTAGAGAGGTGGTTGAGAAGGGTTGGGAGCTGCCTCCCC
AAAAGCAAGGAAGGGTTCAGGAGGGGCGAGTTGACAGTTGGAGGAGACGAGTGAAGAAGTGTGCAGACGAGTTGTCGAAGTGGGGGAAGGAGAAGAAAGGAAACTATGCT
CACAGAATTTCCTTGGCTCGCCAACAATTGCATCAAGCGGTTGTCAGTCAGAGGGATGTGAAGGGTGCTAGAAGTTTGGTGGAGTGA
mRNA sequence
ATGGGAATGAGGGGAGGCGATGGTAAGGTAGGGGAGGGCGATGGTATGGAGAATCCTATCGAAGAAGGGGGTGAGGGTATGGAGAGGGATCGAAGTGGAGAGGAAGAGAA
CGTGGGGAGGGGTAATGGGCTTAATGGGCAAGAAAGGATGTCAGGTGAGGTGGAGGGGGGCGAGGGGAAAGGTCTGGAAGGGGAAGGGGAGGGGCTGGGGGAAGGTGAGG
AGGTCGCCGCTACAGGTGGAAGGGTGGGGGAGGAAGGGGTGAGTGAGGTGGCTAGAAAGGGGAAAGGAAAGGAGGTGTGTGTTGAAGAAACTGTGAAGGGAAAGGGGTTT
ATCCCGAAGGGAGGTGTGTCCATCAATGAGCCAAATGGAGAGAGCGTGGGGCTGAAGGGGACGATGGAAATGAATGTAGATGGGAATTTTCCCACTTCTGTGAAGGTAAG
GGAGGGTAAGAAGTGGAAGAGGAGGGGGAGGGATGAGGTAGTGGGAATGGACTGCGAGGAAAGTACCTTATTGGGCAAGAGAGGCCCGAGTGGAGAGACAGAAGATGGGG
CAAGTGTGGGTATGAGTGAGCAGATGAGGAAGAAGGGGAAGGGTGTCCAGGTGGACGGTGGTAGGGAGAAGGAGAGCCTGGATGTAGTGTTGAGATCGATGAGTAGGTTT
CATATTGATGTGATGGTAAAAGCTGATAAAGGGACCTGGAGGTTTACGGGGTTCTATGGGGACCCGGAGGTGGCCAATAGAGAGCATTCATGGAACTTGCTTCGAAGACT
GCATGATCTTTACTCATGTCCGTGGGTGGTGGGTGGTGATTTCAACGAGTTGTTGAGGGCAGAGGAGAAGTGGGGTAGTAATATTCGTTGTCAGAATCAGATGCAGGCGT
TTCGGGATGCTATTGATGATTGTGATTTAGTTGATATAGGTTTTAAAGGCTCTCAATATACGTGGTATAAATTGAAGGGAAAAGAAGTGGTAATGCTGGAAAGGCTAGAC
AGAGGCCTATGTAACTCGGAGTTTTATGCCATGTTCCCTGCTACAAATGTCCGGAACCTAGACTTTTGTTTGTCAGATCACTGTCCAATATTGATCGAGGCAAATAGTTC
GTTGTCGGGGCAGAGGAGAAAGGGAGGTTATTACTTTCGTTTTGAGGAGGTCTGGACCCAGTTCGACGGTTGTAGAGAGGTGGTTGAGAAGGGTTGGGAGCTGCCTCCCC
AAAAGCAAGGAAGGGTTCAGGAGGGGCGAGTTGACAGTTGGAGGAGACGAGTGAAGAAGTGTGCAGACGAGTTGTCGAAGTGGGGGAAGGAGAAGAAAGGAAACTATGCT
CACAGAATTTCCTTGGCTCGCCAACAATTGCATCAAGCGGTTGTCAGTCAGAGGGATGTGAAGGGTGCTAGAAGTTTGGTGGAGTGA
Protein sequence
MGMRGGDGKVGEGDGMENPIEEGGEGMERDRSGEEENVGRGNGLNGQERMSGEVEGGEGKGLEGEGEGLGEGEEVAATGGRVGEEGVSEVARKGKGKEVCVEETVKGKGF
IPKGGVSINEPNGESVGLKGTMEMNVDGNFPTSVKVREGKKWKRRGRDEVVGMDCEESTLLGKRGPSGETEDGASVGMSEQMRKKGKGVQVDGGREKESLDVVLRSMSRF
HIDVMVKADKGTWRFTGFYGDPEVANREHSWNLLRRLHDLYSCPWVVGGDFNELLRAEEKWGSNIRCQNQMQAFRDAIDDCDLVDIGFKGSQYTWYKLKGKEVVMLERLD
RGLCNSEFYAMFPATNVRNLDFCLSDHCPILIEANSSLSGQRRKGGYYFRFEEVWTQFDGCREVVEKGWELPPQKQGRVQEGRVDSWRRRVKKCADELSKWGKEKKGNYA
HRISLARQQLHQAVVSQRDVKGARSLVE
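
The protein sequence above can be cross-checked against the CDS by translating it codon by codon. The sketch below is a minimal illustration, not the database's own pipeline: it assumes the standard genetic code, reading frame 0, and translation that stops at the first stop codon. The name translate is a hypothetical helper.

```python
# Standard genetic code, laid out in the conventional T, C, A, G base order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    # Remove line breaks and spaces so a wrapped sequence can be pasted as-is.
    cds = "".join(cds.split()).upper()
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        # Codons with ambiguous bases are reported as 'X'.
        amino_acid = CODON_TABLE.get(cds[pos:pos + 3], "X")
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First nine codons of the CDS listed above.
print(translate("ATGGGAATGAGGGGAGGCGATGGTAAG"))
```

Passing the full CDS text to translate and comparing the result with the protein sequence, with line breaks removed from both, gives a quick consistency check between the two records.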