CuGenDBv2

Lag0009040 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0009040
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Reverse transcriptase domain-containing protein
Genome location: chr9:34462221..34465336
RNA-Seq Expression: Lag0009040
Synteny: Lag0009040
Gene Ontology terms: GO:0016740 - transferase activity (molecular function)
InterPro domains: NA
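
The genome location above follows a `chromosome:start..end` pattern. A minimal Python sketch for splitting it into parts is shown here; the function name `parse_location` is hypothetical, and the span length assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re


def parse_location(loc):
    """Split a 'chrom:start..end' string into (chrom, start, end, span).

    Assumption: coordinates are 1-based and inclusive, so the span
    is end - start + 1. The page itself does not confirm this.
    """
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", loc.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    chrom = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    return chrom, start, end, end - start + 1


chrom, start, end, span = parse_location("chr9:34462221..34465336")
print(chrom, start, end, span)
```

Under the inclusive-coordinate assumption, the gene region for Lag0009040 spans 3116 bp on chr9.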


Homology
GenBank top hits (columns: hit, e value, % identity, alignment)
AFP55557.1 non-ltr retroelement reverse transcriptase [Rosa rugosa] | e value: 3.4e-63 | % identity: 45.98
Query:  KDFLSSLLGVRHVSDLGKYLGVPSVFSQNKSKDLSFILDRVWKAVQGWKNSFFSIAGKEILIKSIGQAIPSYVISVFKLPKHLHEDITRNFARFWWGTQS
        K+  S++L +  V    +YLG+P+V  ++K K    + DRVW  V GW+    S AGKE+LIK++ QAIP+Y +SVF+LP    + I +  ARFWWG + 
Subjt:  KDFLSSLLGVRHVSDLGKYLGVPSVFSQNKSKDLSFILDRVWKAVQGWKNSFFSIAGKEILIKSIGQAIPSYVISVFKLPKHLHEDITRNFARFWWGTQS

Query:  NKRELHWFRWKDLCLPKSLGGLNFRDIEGFNKALIAKQVWRLLINPESLVSRFLKSQYYRDSNIMEADLGRQPSYLWKSLLWGRELLSKGLRNRIGNGKN
         K  +HW RW DLC  K  GGL FRD+  FN+AL+ KQ WRL++ P+SLV+R LK++Y+   + MEA+LG  PSYLW+S LWGRELL KG+R RIG+GK 
Subjt:  NKRELHWFRWKDLCLPKSLGGLNFRDIEGFNKALIAKQVWRLLINPESLVSRFLKSQYYRDSNIMEADLGRQPSYLWKSLLWGRELLSKGLRNRIGNGKN

Query:  TFIFKDLWLPKETSFRPICINSEMYNTRVADYISESGAWDLDKLNRVVIDFDINSIKGIPI
          +F D W+P   SFRPI         RV+D +  +G W+++ LN    D +  +I  I +
Subjt:  TFIFKDLWLPKETSFRPICINSEMYNTRVADYISESGAWDLDKLNRVVIDFDINSIKGIPI

PRQ55763.1 putative RNA-directed DNA polymerase [Rosa chinensis] | e value: 5.4e-61 | % identity: 43.33
Query:  LKSLWKDFLSSLLGVRHVSDLGKYLGVPSVFSQNKSKDLSFILDRVWKAVQGWKNSFFSIAGKEILIKSIGQAIPSYVISVFKLPKHLHEDITRNFARFW
        +K   +D L++ L V  V    KYLG+P+  S +KS+  +F+ ++V K  QGW+    S AGKEI+IK++ Q+IP+YV+S F+LPKHL  ++ R  A+FW
Subjt:  LKSLWKDFLSSLLGVRHVSDLGKYLGVPSVFSQNKSKDLSFILDRVWKAVQGWKNSFFSIAGKEILIKSIGQAIPSYVISVFKLPKHLHEDITRNFARFW

Query:  WGTQSNKRELHWFRWKDLCLPKSLGGLNFRDIEGFNKALIAKQVWRLLINPESLVSRFLKSQYYRDSNIMEADLGRQPSYLWKSLLWGRELLSKGLRNRI
        WG +SN +++HW  W+ LC+PK+ GGL FR++  FN+AL+AKQ WR++ NP SL++R  K++Y+ + + M+A+     SY WKS+L+GRELL KGLR ++
Subjt:  WGTQSNKRELHWFRWKDLCLPKSLGGLNFRDIEGFNKALIAKQVWRLLINPESLVSRFLKSQYYRDSNIMEADLGRQPSYLWKSLLWGRELLSKGLRNRI

Query:  GNGKNTFIFKDLWLPKETSFRPICINSEMYNT-RVADYI-SESGAWDLDKLNRVVIDFDINSIKGIPINM
        GNG    ++ D WLP   SFRPI    E   T RV + I  E   W +  L  + +  ++N I  IP+++
Subjt:  GNGKNTFIFKDLWLPKETSFRPICINSEMYNT-RVADYI-SESGAWDLDKLNRVVIDFDINSIKGIPINM

XP_013657066.1 uncharacterized protein LOC106361809 [Brassica napus] | e value: 3.1e-61 | % identity: 42.21
Query:  LLKSSSLEGSSVFKKSWGLEVLKSLWKDFLSSLLGVRHVSDLGKYLGVPSVFSQNKSKDLSFILDRVWKAVQGWKNSFFSIAGKEILIKSIGQAIPSYVI
        L KS+   GS V     G EV     K  + ++LG+ +   +GKYLG+P  F   K +  ++I+D+V K V GWK   F+  GKE+L+KSI  A+P + +
Subjt:  LLKSSSLEGSSVFKKSWGLEVLKSLWKDFLSSLLGVRHVSDLGKYLGVPSVFSQNKSKDLSFILDRVWKAVQGWKNSFFSIAGKEILIKSIGQAIPSYVI

Query:  SVFKLPKHLHEDITRNFARFWWGTQSNKRELHWFRWKDLCLPKSLGGLNFRDIEGFNKALIAKQVWRLLINPESLVSRFLKSQYYRDSNIMEADLGRQPS
        ++F+LPK + E+I    ARFWWGT  +K  LHW+ WK +C+PK  GGL FRD+E FN+AL+ KQVWR++ NP  L++R L+++Y+ D +I++A L ++ S
Subjt:  SVFKLPKHLHEDITRNFARFWWGTQSNKRELHWFRWKDLCLPKSLGGLNFRDIEGFNKALIAKQVWRLLINPESLVSRFLKSQYYRDSNIMEADLGRQPS

Query:  YLWKSLLWGRELLSKGLRNRIGNGKNTFIFKDLWLPKETSFRPICINSEMYNTRVADYISESG-AWDLDKLNRVVIDFDINSIKGIPIN
        Y WKS+L G++L+ KG+R  IGNG++T ++ D WL       P        N++V+DY+  +G  W+LDKL   VI  DI  I  + I+
Subjt:  YLWKSLLWGRELLSKGLRNRIGNGKNTFIFKDLWLPKETSFRPICINSEMYNTRVADYISESG-AWDLDKLNRVVIDFDINSIKGIPIN

XP_022131662.1 uncharacterized protein LOC111004787 [Momordica charantia] | e value: 1.0e-67 | % identity: 52.75
Query:  VQGWKNSFFSIAGKEILIKSIGQAIPSYVISVFKLPKHLHEDITRNFARFWWGTQSNKRELHWFRWKDLCLPKSLGGLNFRDIEGFNKALIAKQVWRLLI
        +QGWK SFFS+ GKE+LIKS+GQAIP+Y +SVF+LPK   E++++ FARFWWG+ ++ ++LHW  W+ +CLPK LGGLNFRD+EGFN+AL+AKQVWR+L 
Subjt:  VQGWKNSFFSIAGKEILIKSIGQAIPSYVISVFKLPKHLHEDITRNFARFWWGTQSNKRELHWFRWKDLCLPKSLGGLNFRDIEGFNKALIAKQVWRLLI

Query:  NPESLVSRFLKSQYYRDSNIMEADLGRQPSYLWKSLLWGRELLSKGLRNRIGNGKNTFIFKDLWLPKETSFRPICINSEMYNTRVADYISESGAWDLDKL
        NP  LVSR LK++Y+ DS +++A   R  SY WK  +WGR+LL KGLR+R+GNG    IF D W+P+  SFRPI      Y+ +VAD I+ +G WD+  +
Subjt:  NPESLVSRFLKSQYYRDSNIMEADLGRQPSYLWKSLLWGRELLSKGLRNRIGNGKNTFIFKDLWLPKETSFRPICINSEMYNTRVADYISESGAWDLDKL

Query:  NRVVIDFDINSIKGIPIN
        + +  + D + I  +P++
Subjt:  NRVVIDFDINSIKGIPIN

XP_022150918.1 uncharacterized protein LOC111018954 [Momordica charantia] | e value: 2.8e-62 | % identity: 42.54
Query:  LSSLLGVRHVSDLGKYLGVPSVFSQNKSKDLSFILDRVWKAVQGWKNSFFSIAGKEILIKSIGQAIPSYVISVFKLPKHLHEDITRNFARFWWGTQSNKR
        + ++L V  V    +YLG+P+   +N+    ++I DRVWK +QGWK   FSI GKE+LIK++ QAIP Y +S F+LPK L  +     ARFWWG+    +
Subjt:  LSSLLGVRHVSDLGKYLGVPSVFSQNKSKDLSFILDRVWKAVQGWKNSFFSIAGKEILIKSIGQAIPSYVISVFKLPKHLHEDITRNFARFWWGTQSNKR

Query:  ELHWFRWKDLCLPKSLGGLNFRDIEGFNKALIAKQVWRLLINPESLVSRFLKSQYYRDSNIMEADLGRQPSYLWKSLLWGRELLSKGLRNRIGNGKNTFI
        ++HW  W  L LPK  GG+ FRD+E FNKAL+AKQ WR+L +P S++SR LK +Y++D + MEA +   PSY+W+S+LWGR+LL KGLR RIGNG + FI
Subjt:  ELHWFRWKDLCLPKSLGGLNFRDIEGFNKALIAKQVWRLLINPESLVSRFLKSQYYRDSNIMEADLGRQPSYLWKSLLWGRELLSKGLRNRIGNGKNTFI

Query:  FKDLWLPKETSFRPICINSEMYNTRVADYIS-ESGAWDLDKLNRVVIDFDINSIKGIPINMNLNDKQI
        + D W+P + + + +        +RV+  +  E G W  D +       +   I  IPI     + ++
Subjt:  FKDLWLPKETSFRPICINSEMYNTRVADYIS-ESGAWDLDKLNRVVIDFDINSIKGIPINMNLNDKQI

TrEMBL top hits (columns: hit, e value, % identity, alignment)
A0A6J1BRN0 uncharacterized protein LOC111004787 | e value: 4.9e-68 | % identity: 52.75
Query:  VQGWKNSFFSIAGKEILIKSIGQAIPSYVISVFKLPKHLHEDITRNFARFWWGTQSNKRELHWFRWKDLCLPKSLGGLNFRDIEGFNKALIAKQVWRLLI
        +QGWK SFFS+ GKE+LIKS+GQAIP+Y +SVF+LPK   E++++ FARFWWG+ ++ ++LHW  W+ +CLPK LGGLNFRD+EGFN+AL+AKQVWR+L 
Subjt:  VQGWKNSFFSIAGKEILIKSIGQAIPSYVISVFKLPKHLHEDITRNFARFWWGTQSNKRELHWFRWKDLCLPKSLGGLNFRDIEGFNKALIAKQVWRLLI

Query:  NPESLVSRFLKSQYYRDSNIMEADLGRQPSYLWKSLLWGRELLSKGLRNRIGNGKNTFIFKDLWLPKETSFRPICINSEMYNTRVADYISESGAWDLDKL
        NP  LVSR LK++Y+ DS +++A   R  SY WK  +WGR+LL KGLR+R+GNG    IF D W+P+  SFRPI      Y+ +VAD I+ +G WD+  +
Subjt:  NPESLVSRFLKSQYYRDSNIMEADLGRQPSYLWKSLLWGRELLSKGLRNRIGNGKNTFIFKDLWLPKETSFRPICINSEMYNTRVADYISESGAWDLDKL

Query:  NRVVIDFDINSIKGIPIN
        + +  + D + I  +P++
Subjt:  NRVVIDFDINSIKGIPIN

A0A6J1DAR4 uncharacterized protein LOC111018954 | e value: 1.4e-62 | % identity: 42.54
Query:  LSSLLGVRHVSDLGKYLGVPSVFSQNKSKDLSFILDRVWKAVQGWKNSFFSIAGKEILIKSIGQAIPSYVISVFKLPKHLHEDITRNFARFWWGTQSNKR
        + ++L V  V    +YLG+P+   +N+    ++I DRVWK +QGWK   FSI GKE+LIK++ QAIP Y +S F+LPK L  +     ARFWWG+    +
Subjt:  LSSLLGVRHVSDLGKYLGVPSVFSQNKSKDLSFILDRVWKAVQGWKNSFFSIAGKEILIKSIGQAIPSYVISVFKLPKHLHEDITRNFARFWWGTQSNKR

Query:  ELHWFRWKDLCLPKSLGGLNFRDIEGFNKALIAKQVWRLLINPESLVSRFLKSQYYRDSNIMEADLGRQPSYLWKSLLWGRELLSKGLRNRIGNGKNTFI
        ++HW  W  L LPK  GG+ FRD+E FNKAL+AKQ WR+L +P S++SR LK +Y++D + MEA +   PSY+W+S+LWGR+LL KGLR RIGNG + FI
Subjt:  ELHWFRWKDLCLPKSLGGLNFRDIEGFNKALIAKQVWRLLINPESLVSRFLKSQYYRDSNIMEADLGRQPSYLWKSLLWGRELLSKGLRNRIGNGKNTFI

Query:  FKDLWLPKETSFRPICINSEMYNTRVADYIS-ESGAWDLDKLNRVVIDFDINSIKGIPINMNLNDKQI
        + D W+P + + + +        +RV+  +  E G W  D +       +   I  IPI     + ++
Subjt:  FKDLWLPKETSFRPICINSEMYNTRVADYIS-ESGAWDLDKLNRVVIDFDINSIKGIPINMNLNDKQI

A0A803NXV2 Uncharacterized protein | e value: 6.8e-62 | % identity: 43.14
Query:  LLGVRHVSDLGKYLGVPSVFSQNKSKDLSFILDRVWKAVQGWKNSFFSIAGKEILIKSIGQAIPSYVISVFKLPKHLHEDITRNFARFWWGTQSNKRELH
        +LG+  V+   KYLG+P+V  +NK +  + I D+V K ++GWK SFFS  G+EILIK+I QA+P+Y++S+F+LP    + +     +FWWGT ++KR++ 
Subjt:  LLGVRHVSDLGKYLGVPSVFSQNKSKDLSFILDRVWKAVQGWKNSFFSIAGKEILIKSIGQAIPSYVISVFKLPKHLHEDITRNFARFWWGTQSNKRELH

Query:  WFRWKDLCLPKSLGGLNFRDIEGFNKALIAKQVWRLLINPESLVSRFLKSQYYRDSNIMEADLGRQPSYLWKSLLWGRELLSKGLRNRIGNGKNTFIFKD
        W +W+D+C PKS GGL F+D+  FN+A++AKQVWRL+ +P SL +R LK +Y+ +++I++A   +  S+LW SL+WG +LL  GLR  +G+G++   F+D
Subjt:  WFRWKDLCLPKSLGGLNFRDIEGFNKALIAKQVWRLLINPESLVSRFLKSQYYRDSNIMEADLGRQPSYLWKSLLWGRELLSKGLRNRIGNGKNTFIFKD

Query:  LWLPKETSFRPICINSEMYNTRVADYISESGAWDLDKLNRVVIDFDINSIKGIPI
         W+P+  SFRPI   + +    V++ I   G WD+  L++  +  DI+ I GIP+
Subjt:  LWLPKETSFRPICINSEMYNTRVADYISESGAWDLDKLNRVVIDFDINSIKGIPI

A0A803QQT2 Uncharacterized protein | e value: 8.1e-63 | % identity: 43.85
Query:  KDFLSSLLGVRHVSDLGKYLGVPSVFSQNKSKDLSFILDRVWKAVQGWKNSFFSIAGKEILIKSIGQAIPSYVISVFKLPKHLHEDITRNFARFWWGTQS
        +  L+ +LGVR V + GKYLG+PS   +NK + L  I ++VW  ++GWK S FS+AGKE+LIK I QAIP+Y +S FKLPK     + R  +RFWWG+  
Subjt:  KDFLSSLLGVRHVSDLGKYLGVPSVFSQNKSKDLSFILDRVWKAVQGWKNSFFSIAGKEILIKSIGQAIPSYVISVFKLPKHLHEDITRNFARFWWGTQS

Query:  NKRELHWFRWKDLCLPKSLGGLNFRDIEGFNKALIAKQVWRLLINPESLVSRFLKSQYYRDSNIMEADLGRQPSYLWKSLLWGRELLSKGLRNRIGNGKN
         ++++HW +W+ LC PK  GGL FRD+  FN+AL+AKQ+WR L +P+ L SR LK+ Y+    ++EA  G   S++W+SL+WG++L+ KG R R+GNG++
Subjt:  NKRELHWFRWKDLCLPKSLGGLNFRDIEGFNKALIAKQVWRLLINPESLVSRFLKSQYYRDSNIMEADLGRQPSYLWKSLLWGRELLSKGLRNRIGNGKN

Query:  TFIFKDLWLPKETSFRPICINSEMYNTRVADYISESGAWDLDKLNRVVIDFDINSIKGIP
          + +D WLP+  +F+     S   N  V D     G WD   +  +    D++ I GIP
Subjt:  TFIFKDLWLPKETSFRPICINSEMYNTRVADYISESGAWDLDKLNRVVIDFDINSIKGIP

J7G0Q7 Non-ltr retroelement reverse transcriptase | e value: 1.6e-63 | % identity: 45.98
Query:  KDFLSSLLGVRHVSDLGKYLGVPSVFSQNKSKDLSFILDRVWKAVQGWKNSFFSIAGKEILIKSIGQAIPSYVISVFKLPKHLHEDITRNFARFWWGTQS
        K+  S++L +  V    +YLG+P+V  ++K K    + DRVW  V GW+    S AGKE+LIK++ QAIP+Y +SVF+LP    + I +  ARFWWG + 
Subjt:  KDFLSSLLGVRHVSDLGKYLGVPSVFSQNKSKDLSFILDRVWKAVQGWKNSFFSIAGKEILIKSIGQAIPSYVISVFKLPKHLHEDITRNFARFWWGTQS

Query:  NKRELHWFRWKDLCLPKSLGGLNFRDIEGFNKALIAKQVWRLLINPESLVSRFLKSQYYRDSNIMEADLGRQPSYLWKSLLWGRELLSKGLRNRIGNGKN
         K  +HW RW DLC  K  GGL FRD+  FN+AL+ KQ WRL++ P+SLV+R LK++Y+   + MEA+LG  PSYLW+S LWGRELL KG+R RIG+GK 
Subjt:  NKRELHWFRWKDLCLPKSLGGLNFRDIEGFNKALIAKQVWRLLINPESLVSRFLKSQYYRDSNIMEADLGRQPSYLWKSLLWGRELLSKGLRNRIGNGKN

Query:  TFIFKDLWLPKETSFRPICINSEMYNTRVADYISESGAWDLDKLNRVVIDFDINSIKGIPI
          +F D W+P   SFRPI         RV+D +  +G W+++ LN    D +  +I  I +
Subjt:  TFIFKDLWLPKETSFRPICINSEMYNTRVADYISESGAWDLDKLNRVVIDFDINSIKGIPI

SwissProt top hits (columns: hit, e value, % identity, alignment)
P0C2F6 Putative ribonuclease H protein At1g65750 | e value: 1.2e-18 | % identity: 30.9
Query:  ILDRVWKAVQGWKNSFFSIAGKEILIKSIGQAIPSYVISVFKLPKHLHEDITRNFARFWWGTQSNKRELHWFRWKDLCLPKSLGGLNFRDIEGFNKALIA
        IL+RV   + GW+    S AG+  L K++  ++P + +S   LP+ +   + +    F WG+ + K++ H  +W  +C PK  GGL  R  +  N+ALI+
Subjt:  ILDRVWKAVQGWKNSFFSIAGKEILIKSIGQAIPSYVISVFKLPKHLHEDITRNFARFWWGTQSNKRELHWFRWKDLCLPKSLGGLNFRDIEGFNKALIA

Query:  KQVWRLLINPESLVSRFLKSQYY----RDSNIMEADLGRQPSYLWKSLLWG-RELLSKGLRNRIGNGKNTFIFKDLWL
        K  WRLL    SL +  L+ +Y+    RDS  +        S  W+S+  G R+++S G+    G+G+    + D W+
Subjt:  KQVWRLLINPESLVSRFLKSQYY----RDSNIMEADLGRQPSYLWKSLLWG-RELLSKGLRNRIGNGKNTFIFKDLWL

P93295 Uncharacterized mitochondrial protein AtMg00310 | e value: 1.5e-34 | % identity: 44.08
Query:  AIPSYVISVFKLPKHLHEDITRNFARFWWGTQSNKRELHWFRWKDLCLPK-SLGGLNFRDIEGFNKALIAKQVWRLLINPESLVSRFLKSQYYRDSNIME
        A+P Y +S F+L K L + +T     FWW +  NKR++ W  W+ LC  K   GGL FRD+  FN+AL+AKQ +R++  P +L+SR L+S+Y+  S++ME
Subjt:  AIPSYVISVFKLPKHLHEDITRNFARFWWGTQSNKRELHWFRWKDLCLPK-SLGGLNFRDIEGFNKALIAKQVWRLLINPESLVSRFLKSQYYRDSNIME

Query:  ADLGRQPSYLWKSLLWGRELLSKGLRNRIGNGKNTFIFKDLWLPKETSFRPI
          +G +PSY W+S++ GRELLS+GL   IG+G +T ++ D W+  ET   P+
Subjt:  ADLGRQPSYLWKSLLWGRELLSKGLRNRIGNGKNTFIFKDLWLPKETSFRPI

Arabidopsis top hits (columns: hit, e value, % identity, alignment)
AT3G09510.1 Ribonuclease H-like superfamily protein | e value: 2.1e-07 | % identity: 31.37
Query:  LKSQYYRDSNIMEADLGRQPSYLWKSLLWGRELLSKGLRNRIGNGKNTFIFKDLWLPKETSFRPICINSEMYNTRVADYISESGA---WDLDKLNRVVID
        +K++Y++D +I++A + +Q SY W SLL G  LL KG R+ IG+G+N  I  D  +      RP+          + +     G+   WD  K+++ V  
Subjt:  LKSQYYRDSNIMEADLGRQPSYLWKSLLWGRELLSKGLRNRIGNGKNTFIFKDLWLPKETSFRPICINSEMYNTRVADYISESGA---WDLDKLNRVVID

Query:  FD
         D
Subjt:  FD

AT4G29090.1 Ribonuclease H-like superfamily protein | e value: 1.0e-33 | % identity: 37.64
Query:  AIPSYVISVFKLPKHLHEDITRNFARFWWGTQSNKRELHWFRWKDLCLPKSLGGLNFRDIEGFNKALIAKQVWRLLINPESLVSRFLKSQYYRDSNIMEA
        A+P+Y ++ F LPK + + I    A FWW  +   + +HW  W  L   K+ GG+ F+DIE FN AL+ KQ+WR+L  PESL+++  KS+Y+  S+ + A
Subjt:  AIPSYVISVFKLPKHLHEDITRNFARFWWGTQSNKRELHWFRWKDLCLPKSLGGLNFRDIEGFNKALIAKQVWRLLINPESLVSRFLKSQYYRDSNIMEA

Query:  DLGRQPSYLWKSLLWGRELLSKGLRNRIGNGKNTFIFKDLWL---PKETSFRPICINSEMYNT-----RVADYISESG
         LG +PS++WKS+   +E+L +G R  +GNG++  I++  WL   P   + R   +  + Y +     +V+D I ESG
Subjt:  DLGRQPSYLWKSLLWGRELLSKGLRNRIGNGKNTFIFKDLWL---PKETSFRPICINSEMYNT-----RVADYISESG

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein | e value: 1.1e-35 | % identity: 44.08
Query:  AIPSYVISVFKLPKHLHEDITRNFARFWWGTQSNKRELHWFRWKDLCLPK-SLGGLNFRDIEGFNKALIAKQVWRLLINPESLVSRFLKSQYYRDSNIME
        A+P Y +S F+L K L + +T     FWW +  NKR++ W  W+ LC  K   GGL FRD+  FN+AL+AKQ +R++  P +L+SR L+S+Y+  S++ME
Subjt:  AIPSYVISVFKLPKHLHEDITRNFARFWWGTQSNKRELHWFRWKDLCLPK-SLGGLNFRDIEGFNKALIAKQVWRLLINPESLVSRFLKSQYYRDSNIME

Query:  ADLGRQPSYLWKSLLWGRELLSKGLRNRIGNGKNTFIFKDLWLPKETSFRPI
          +G +PSY W+S++ GRELLS+GL   IG+G +T ++ D W+  ET   P+
Subjt:  ADLGRQPSYLWKSLLWGRELLSKGLRNRIGNGKNTFIFKDLWLPKETSFRPI


Sequences
CDS sequence
ATGTACCCACCGGTTGAAGAAGCCGACATTGCCTGTCCTCGCCGAGGAGGATATGGGTTTCCTCTTTCATTCGCCGACAACTTCTCTCTCTCCCGCGTATCGATGGCCAG
CCGTGAGCGCCTGCTACCGTTGCAACTCGACCCATCGTCCGCCTTCAAACACATTAAAACTGGAACGCCAGCCTGCTATAGTGAGCGCCTCGCTGCAACTCGACCCATCG
TTAGCCTGCTACCATCTGGAACTCCAGCCCGAATCGAAGGTGGCGCCGAGATCTGTTCAGTCTTTGGTCTTCTAGAACTGTTGAAGTTTTCTAGTCTTGAAGGGTCTTCA
GTCTTTAAGAAGTCTTGGGGTCTTGAAGTCTTGAAATCTTGGAGAGTTTTTGGTCTTCAAGGATTGTTGAAGTCTTCTAGTCTTGAAGGGTCTTCAGTCTTTAAGAAGTC
TTGGGGTCTTGAAGTCTTGAAGAGTCTTTGGAAAGATTTCTTGAGTTCTCTGTTAGGTGTTAGGCATGTGAGTGACCTTGGTAAGTACTTAGGGGTTCCCTCTGTGTTTT
CCCAAAATAAATCAAAGGATCTTAGTTTTATCCTAGATAGGGTGTGGAAAGCTGTCCAAGGATGGAAGAATTCATTCTTCTCAATTGCTGGCAAAGAAATATTAATAAAA
AGTATTGGTCAAGCTATTCCTTCCTATGTTATAAGTGTTTTTAAATTGCCTAAACACTTACATGAAGATATAACTAGAAATTTTGCAAGGTTCTGGTGGGGAACTCAATC
CAATAAAAGAGAGCTACACTGGTTTAGATGGAAAGACTTATGCCTTCCTAAGAGTCTAGGCGGCCTTAACTTTAGAGATATTGAAGGTTTTAACAAAGCCTTAATTGCTA
AACAAGTTTGGCGTCTTCTAATTAACCCTGAGTCATTAGTTTCTAGGTTCCTAAAAAGCCAATATTATAGGGATTCTAATATTATGGAAGCTGATTTGGGGAGGCAACCT
TCTTATCTGTGGAAGAGCCTTTTATGGGGCAGAGAATTACTTAGCAAGGGCCTTCGAAATAGAATAGGAAATGGCAAGAACACATTTATTTTTAAGGACCTTTGGCTTCC
CAAAGAGACTTCTTTTAGACCAATTTGCATTAACAGTGAAATGTACAACACAAGAGTGGCTGATTACATTTCTGAATCAGGCGCTTGGGATTTAGATAAACTAAACAGGG
TTGTTATTGATTTTGACATTAATTCTATAAAGGGCATCCCTATAAACATGAACTTGAATGACAAACAAATCTGA
mRNA sequence
ATGTACCCACCGGTTGAAGAAGCCGACATTGCCTGTCCTCGCCGAGGAGGATATGGGTTTCCTCTTTCATTCGCCGACAACTTCTCTCTCTCCCGCGTATCGATGGCCAG
CCGTGAGCGCCTGCTACCGTTGCAACTCGACCCATCGTCCGCCTTCAAACACATTAAAACTGGAACGCCAGCCTGCTATAGTGAGCGCCTCGCTGCAACTCGACCCATCG
TTAGCCTGCTACCATCTGGAACTCCAGCCCGAATCGAAGGTGGCGCCGAGATCTGTTCAGTCTTTGGTCTTCTAGAACTGTTGAAGTTTTCTAGTCTTGAAGGGTCTTCA
GTCTTTAAGAAGTCTTGGGGTCTTGAAGTCTTGAAATCTTGGAGAGTTTTTGGTCTTCAAGGATTGTTGAAGTCTTCTAGTCTTGAAGGGTCTTCAGTCTTTAAGAAGTC
TTGGGGTCTTGAAGTCTTGAAGAGTCTTTGGAAAGATTTCTTGAGTTCTCTGTTAGGTGTTAGGCATGTGAGTGACCTTGGTAAGTACTTAGGGGTTCCCTCTGTGTTTT
CCCAAAATAAATCAAAGGATCTTAGTTTTATCCTAGATAGGGTGTGGAAAGCTGTCCAAGGATGGAAGAATTCATTCTTCTCAATTGCTGGCAAAGAAATATTAATAAAA
AGTATTGGTCAAGCTATTCCTTCCTATGTTATAAGTGTTTTTAAATTGCCTAAACACTTACATGAAGATATAACTAGAAATTTTGCAAGGTTCTGGTGGGGAACTCAATC
CAATAAAAGAGAGCTACACTGGTTTAGATGGAAAGACTTATGCCTTCCTAAGAGTCTAGGCGGCCTTAACTTTAGAGATATTGAAGGTTTTAACAAAGCCTTAATTGCTA
AACAAGTTTGGCGTCTTCTAATTAACCCTGAGTCATTAGTTTCTAGGTTCCTAAAAAGCCAATATTATAGGGATTCTAATATTATGGAAGCTGATTTGGGGAGGCAACCT
TCTTATCTGTGGAAGAGCCTTTTATGGGGCAGAGAATTACTTAGCAAGGGCCTTCGAAATAGAATAGGAAATGGCAAGAACACATTTATTTTTAAGGACCTTTGGCTTCC
CAAAGAGACTTCTTTTAGACCAATTTGCATTAACAGTGAAATGTACAACACAAGAGTGGCTGATTACATTTCTGAATCAGGCGCTTGGGATTTAGATAAACTAAACAGGG
TTGTTATTGATTTTGACATTAATTCTATAAAGGGCATCCCTATAAACATGAACTTGAATGACAAACAAATCTGA
Protein sequence
MYPPVEEADIACPRRGGYGFPLSFADNFSLSRVSMASRERLLPLQLDPSSAFKHIKTGTPACYSERLAATRPIVSLLPSGTPARIEGGAEICSVFGLLELLKFSSLEGSS
VFKKSWGLEVLKSWRVFGLQGLLKSSSLEGSSVFKKSWGLEVLKSLWKDFLSSLLGVRHVSDLGKYLGVPSVFSQNKSKDLSFILDRVWKAVQGWKNSFFSIAGKEILIK
SIGQAIPSYVISVFKLPKHLHEDITRNFARFWWGTQSNKRELHWFRWKDLCLPKSLGGLNFRDIEGFNKALIAKQVWRLLINPESLVSRFLKSQYYRDSNIMEADLGRQP
SYLWKSLLWGRELLSKGLRNRIGNGKNTFIFKDLWLPKETSFRPICINSEMYNTRVADYISESGAWDLDKLNRVVIDFDINSIKGIPINMNLNDKQI
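
The protein sequence above can be cross-checked against the CDS by translating it codon by codon. The following is a minimal Python sketch that assumes the standard nuclear genetic code; the `translate` helper and the table construction are illustrative and are not part of CuGenDB.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS string into a protein, stopping at the first stop codon.

    Whitespace and line breaks are removed first, so the wrapped sequence
    lines from the page can be pasted in directly. Codons containing
    characters other than A, C, G, T are emitted as 'X'.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last codons of the Lag0009040 CDS shown above.
print(translate("ATGTACCCACCGGTTGAAGAA"))
print(translate("GACAAACAAATCTGA"))
```

Run on the full CDS, the result can be compared with the protein sequence listed on this page, which begins with MYPPVEE and ends with DKQI.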