; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0038985 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0038985
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr2:32588411..32589361
RNA-Seq ExpressionLag0038985
SyntenyLag0038985
Gene Ontology termsGO:0003824 - catalytic activity (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_030479133.1 uncharacterized protein LOC115696372 [Cannabis sativa]1.5e-6444.95Show/hide
Query:  GLHQGDPLSPYLFLLCAKGLSCMLHEAEGARAITRLQIACGCPPISHLFFVDDSLLFFRAKEGEARAVHYILQCYERASGQTINFDKSIISFSPSTAMGV
        GL QGDPLSPYLFL+C++GLS +L   E    +  L ++   P ISHLFF DDSLLF +A +    A+   L  Y RASGQ +N DKS++SFSP+T + V
Subjt:  GLHQGDPLSPYLFLLCAKGLSCMLHEAEGARAITRLQIACGCPPISHLFFVDDSLLFFRAKEGEARAVHYILQCYERASGQTINFDKSIISFSPSTAMGV

Query:  QSQIQ-----------------------------------------GWNGKFFSVGGREVLLKSIVQAIPCYIMNCFRLPQKLVQDISRAMAQFWWNGNT
        Q+  Q                                          WN K FS+GG+EVLLK++VQ+IP Y M+CFRLP KL  +I   MA+FWW  ++
Subjt:  QSQIQ-----------------------------------------GWNGKFFSVGGREVLLKSIVQAIPCYIMNCFRLPQKLVQDISRAMAQFWWNGNT

Query:  EDRRIHWVSWKTLCKPKCLGRMSFRDLKTFNQDLLAKQCRRIVRYPSSFLSRVLKGRYFPSGDFLAAGVGSRPLYIWRSLIWGRELL
        ++++IHW  W+ LCK K  G M FR    FNQ LLAKQ  RI + P+S LSRVLKG YF   DF+ A  G      W+ ++WGRELL
Subjt:  EDRRIHWVSWKTLCKPKCLGRMSFRDLKTFNQDLLAKQCRRIVRYPSSFLSRVLKGRYFPSGDFLAAGVGSRPLYIWRSLIWGRELL

XP_030483769.1 uncharacterized protein LOC115700339 [Cannabis sativa]2.6e-6444.29Show/hide
Query:  GLHQGDPLSPYLFLLCAKGLSCMLHEAEGARAITRLQIACGCPPISHLFFVDDSLLFFRAKEGEARAVHYILQCYERASGQTINFDKSIISFSPSTAMGV
        GL QGDPLSPYLFL+C++GLS +L   E    +  L ++   P ISHL F DDSLLF +A +    A+   L  Y RASGQ +N DKS++SFSP+T M V
Subjt:  GLHQGDPLSPYLFLLCAKGLSCMLHEAEGARAITRLQIACGCPPISHLFFVDDSLLFFRAKEGEARAVHYILQCYERASGQTINFDKSIISFSPSTAMGV

Query:  QSQIQ-----------------------------------------GWNGKFFSVGGREVLLKSIVQAIPCYIMNCFRLPQKLVQDISRAMAQFWWNGNT
        Q+  Q                                          WN K FS+GG+EVLLK++VQ+IP Y M+CFRL +K   ++   MA+FWW   T
Subjt:  QSQIQ-----------------------------------------GWNGKFFSVGGREVLLKSIVQAIPCYIMNCFRLPQKLVQDISRAMAQFWWNGNT

Query:  EDRRIHWVSWKTLCKPKCLGRMSFRDLKTFNQDLLAKQCRRIVRYPSSFLSRVLKGRYFPSGDFLAAGVGSRPLYIWRSLIWGRELLRR
        ++++IHW  WK LCK K  G M FR    FNQ LLAKQ  RI + P+S LSRVLKGRY+P  DF+ A         W+ ++WGRELL +
Subjt:  EDRRIHWVSWKTLCKPKCLGRMSFRDLKTFNQDLLAKQCRRIVRYPSSFLSRVLKGRYFPSGDFLAAGVGSRPLYIWRSLIWGRELLRR

XP_030504959.1 uncharacterized protein LOC115719927 [Cannabis sativa]3.3e-6444.6Show/hide
Query:  GLHQGDPLSPYLFLLCAKGLSCMLHEAEGARAITRLQIACGCPPISHLFFVDDSLLFFRAKEGEARAVHYILQCYERASGQTINFDKSIISFSPSTAMGV
        GL QGDPLSPYLFL+C++GLS +L   EG   +  L++    P +SHL F DDSLLF RA    A A+H IL  Y +ASGQ +N +KS++SFSP+T M  
Subjt:  GLHQGDPLSPYLFLLCAKGLSCMLHEAEGARAITRLQIACGCPPISHLFFVDDSLLFFRAKEGEARAVHYILQCYERASGQTINFDKSIISFSPSTAMGV

Query:  QS-----------------------------------------QIQGWNGKFFSVGGREVLLKSIVQAIPCYIMNCFRLPQKLVQDISRAMAQFWWNGNT
        Q+                                          +  WN K FSVGG+EVLLK++VQ+IP Y M+CF+L  K    +   MA FWW  N 
Subjt:  QS-----------------------------------------QIQGWNGKFFSVGGREVLLKSIVQAIPCYIMNCFRLPQKLVQDISRAMAQFWWNGNT

Query:  EDRRIHWVSWKTLCKPKCLGRMSFRDLKTFNQDLLAKQCRRIVRYPSSFLSRVLKGRYFPSGDFLAAGVGSRPLYIWRSLIWGRELL
           +IHW+ W +LCK K  G M FR    FNQ LLAKQ  +I   P S LSR+LK RYF +  FL A +G  P Y W+S+ WGRELL
Subjt:  EDRRIHWVSWKTLCKPKCLGRMSFRDLKTFNQDLLAKQCRRIVRYPSSFLSRVLKGRYFPSGDFLAAGVGSRPLYIWRSLIWGRELL

XP_030508852.1 uncharacterized protein LOC115723496 [Cannabis sativa]5.7e-6444.95Show/hide
Query:  GLHQGDPLSPYLFLLCAKGLSCMLHEAEGARAITRLQIACGCPPISHLFFVDDSLLFFRAKEGEARAVHYILQCYERASGQTINFDKSIISFSPST----
        G+ QGDPLSPYLFL+CA+GLS +L   E A ++  L+I+   P +SHLFF DDS+LF RA +  ARA+H  L  Y RASGQ IN +K ++SFS +T    
Subjt:  GLHQGDPLSPYLFLLCAKGLSCMLHEAEGARAITRLQIACGCPPISHLFFVDDSLLFFRAKEGEARAVHYILQCYERASGQTINFDKSIISFSPST----

Query:  ------AMGVQSQ-------------------------------IQGWNGKFFSVGGREVLLKSIVQAIPCYIMNCFRLPQKLVQDISRAMAQFWWNGNT
               +G+  Q                               +  W    FS GG+EVLLK++VQAIP Y M+CFRLP  L   I   MA+FWW    
Subjt:  ------AMGVQSQ-------------------------------IQGWNGKFFSVGGREVLLKSIVQAIPCYIMNCFRLPQKLVQDISRAMAQFWWNGNT

Query:  EDRRIHWVSWKTLCKPKCLGRMSFRDLKTFNQDLLAKQCRRIVRYPSSFLSRVLKGRYFPSGDFLAAGVGSRPLYIWRSLIWGRELL
          + IHW +W  LCK K  G + FR+   FNQ LLAKQ  RI+ +P+S LS +L+ RYF +G++L AG+GS P   WRSL+WG+ELL
Subjt:  EDRRIHWVSWKTLCKPKCLGRMSFRDLKTFNQDLLAKQCRRIVRYPSSFLSRVLKGRYFPSGDFLAAGVGSRPLYIWRSLIWGRELL

XP_030509050.1 uncharacterized protein LOC115723712 [Cannabis sativa]1.1e-6444.25Show/hide
Query:  GLHQGDPLSPYLFLLCAKGLSCMLHEAEGARAITRLQIACGCPPISHLFFVDDSLLFFRAKEGEARAVHYILQCYERASGQTINFDKSIISFSPSTA---
        G+ QGDPLSPYLFL+C++G S +L   E   A+  L+++   PPI+HL F DDS+LF RA    ARA+H  L  Y RASGQ +N +KS++SFSP+T    
Subjt:  GLHQGDPLSPYLFLLCAKGLSCMLHEAEGARAITRLQIACGCPPISHLFFVDDSLLFFRAKEGEARAVHYILQCYERASGQTINFDKSIISFSPSTA---

Query:  ----------------------------------MGVQSQI----QGWNGKFFSVGGREVLLKSIVQAIPCYIMNCFRLPQKLVQDISRAMAQFWWNGNT
                                           G+  +I      W  + FS+GG+EVLLK++VQAIP Y M+CF+LP KL   I + M++FWW  + 
Subjt:  ----------------------------------MGVQSQI----QGWNGKFFSVGGREVLLKSIVQAIPCYIMNCFRLPQKLVQDISRAMAQFWWNGNT

Query:  EDRRIHWVSWKTLCKPKCLGRMSFRDLKTFNQDLLAKQCRRIVRYPSSFLSRVLKGRYFPSGDFLAAGVGSRPLYIWRSLIWGRELL
            IHW +WK LCK K  G M FR+   FNQ LLAKQ  RI+  PSS ++RVLK RYF +G FL A  G++P   W+S++WG+ELL
Subjt:  EDRRIHWVSWKTLCKPKCLGRMSFRDLKTFNQDLLAKQCRRIVRYPSSFLSRVLKGRYFPSGDFLAAGVGSRPLYIWRSLIWGRELL

TrEMBL top hitse value%identityAlignment
A0A2N9GRF1 Reverse transcriptase domain-containing protein2.8e-6447.84Show/hide
Query:  GLHQGDPLSPYLFLLCAKGLSCMLHEAEGARAITRLQIACGCPPISHLFFVDDSLLFFRAKEGEARAVHYILQCYERASGQTINFDKSIISFSPSTAMGV
        G+ QGD LSPY+FLLCA+GLS +LH+A  +  +  +Q + G P ISHLFF DDSLLF +A   E R +  IL  YE+ SGQ IN +K+ + FS +T    
Subjt:  GLHQGDPLSPYLFLLCAKGLSCMLHEAEGARAITRLQIACGCPPISHLFFVDDSLLFFRAKEGEARAVHYILQCYERASGQTINFDKSIISFSPSTAMGV

Query:  QSQIQ-------GWNGKFFSVGGREVLLKSIVQAIPCYIMNCFRLPQKLVQDISRAMAQFWWNGNTEDRRIHWVSWKTLCKPKCLGRMSFRDLKTFNQDL
        +  IQ       GW  KF S  GREVL+K+I Q+IP Y MNCFRLP+    DIS  ++ +WW    ++R++HWV W++LC PK  G + FR+L  FN  L
Subjt:  QSQIQ-------GWNGKFFSVGGREVLLKSIVQAIPCYIMNCFRLPQKLVQDISRAMAQFWWNGNTEDRRIHWVSWKTLCKPKCLGRMSFRDLKTFNQDL

Query:  LAKQCRRIVRYPSSFLSRVLKGRYFPSGDFLAAGVGSRPLYIWRSLIWGRELLRR
        LAKQ  R++  P S   RV K +YFP   FL A +GS P ++WRS++ GRE++R+
Subjt:  LAKQCRRIVRYPSSFLSRVLKGRYFPSGDFLAAGVGSRPLYIWRSLIWGRELLRR

A0A7N2L6Z9 Reverse transcriptase domain-containing protein3.3e-6541.64Show/hide
Query:  GLHQGDPLSPYLFLLCAKGLSCMLHEAEGARAITRLQIACGCPPISHLFFVDDSLLFFRAKEGEARAVHYILQCYERASGQTINFDKSIISFSPSTA---
        GL QGDPLSPYLFLLCA+GLS +LH+A   + +  + +  GCP I+HLFF DDSLLF +A   E   +  IL+ YE ASGQ +N DKS I FSP+T    
Subjt:  GLHQGDPLSPYLFLLCAKGLSCMLHEAEGARAITRLQIACGCPPISHLFFVDDSLLFFRAKEGEARAVHYILQCYERASGQTINFDKSIISFSPSTA---

Query:  -------------------MGVQS-------------------QIQGWNGKFFSVGGREVLLKSIVQAIPCYIMNCFRLPQKLVQDISRAMAQFWWNGNT
                           +G+ S                   ++ GW GK  S GG+E+L+K++ QAIP Y M+CF LP+ L  ++ + M  FWW    
Subjt:  -------------------MGVQS-------------------QIQGWNGKFFSVGGREVLLKSIVQAIPCYIMNCFRLPQKLVQDISRAMAQFWWNGNT

Query:  EDRRIHWVSWKTLCKPKCLGRMSFRDLKTFNQDLLAKQCRRIVRYPSSFLSRVLKGRYFPSGDFLAAGVGSRPLYIWRSLIWGRELLRRVFAGGLEMGRK
        ++ ++ W+SW+ +CKPK LG + FR+L  FN  LLAKQ  RI+  P S  +R+LK +YFP GD L A +GS P Y WRS+    E+L++     +  GR+
Subjt:  EDRRIHWVSWKTLCKPKCLGRMSFRDLKTFNQDLLAKQCRRIVRYPSSFLSRVLKGRYFPSGDFLAAGVGSRPLYIWRSLIWGRELLRRVFAGGLEMGRK

Query:  LVYMD
        +   D
Subjt:  LVYMD

A0A803PYN1 Uncharacterized protein1.6e-6444.6Show/hide
Query:  GLHQGDPLSPYLFLLCAKGLSCMLHEAEGARAITRLQIACGCPPISHLFFVDDSLLFFRAKEGEARAVHYILQCYERASGQTINFDKSIISFSPSTAMGV
        GL QGDPLSPYLFL+C++GLS +L   EG   +  L++    P +SHL F DDSLLF RA    A A+H IL  Y +ASGQ +N +KS++SFSP+T M  
Subjt:  GLHQGDPLSPYLFLLCAKGLSCMLHEAEGARAITRLQIACGCPPISHLFFVDDSLLFFRAKEGEARAVHYILQCYERASGQTINFDKSIISFSPSTAMGV

Query:  QS-----------------------------------------QIQGWNGKFFSVGGREVLLKSIVQAIPCYIMNCFRLPQKLVQDISRAMAQFWWNGNT
        Q+                                          +  WN K FSVGG+EVLLK++VQ+IP Y M+CF+L  K    +   MA FWW  N 
Subjt:  QS-----------------------------------------QIQGWNGKFFSVGGREVLLKSIVQAIPCYIMNCFRLPQKLVQDISRAMAQFWWNGNT

Query:  EDRRIHWVSWKTLCKPKCLGRMSFRDLKTFNQDLLAKQCRRIVRYPSSFLSRVLKGRYFPSGDFLAAGVGSRPLYIWRSLIWGRELL
           +IHW+ W +LCK K  G M FR    FNQ LLAKQ  +I   P S LSR+LK RYF +  FL A +G  P Y W+S+ WGRELL
Subjt:  EDRRIHWVSWKTLCKPKCLGRMSFRDLKTFNQDLLAKQCRRIVRYPSSFLSRVLKGRYFPSGDFLAAGVGSRPLYIWRSLIWGRELL

A0A803Q6Z2 Uncharacterized protein5.6e-6544.25Show/hide
Query:  GLHQGDPLSPYLFLLCAKGLSCMLHEAEGARAITRLQIACGCPPISHLFFVDDSLLFFRAKEGEARAVHYILQCYERASGQTINFDKSIISFSPSTA---
        G+ QGDPLSPYLFL+C++G S +L   E   A+  L+++   PPI+HL F DDS+LF RA    ARA+H  L  Y RASGQ +N +KS++SFSP+T    
Subjt:  GLHQGDPLSPYLFLLCAKGLSCMLHEAEGARAITRLQIACGCPPISHLFFVDDSLLFFRAKEGEARAVHYILQCYERASGQTINFDKSIISFSPSTA---

Query:  ----------------------------------MGVQSQI----QGWNGKFFSVGGREVLLKSIVQAIPCYIMNCFRLPQKLVQDISRAMAQFWWNGNT
                                           G+  +I      W  + FS+GG+EVLLK++VQAIP Y M+CF+LP KL   I + M++FWW  + 
Subjt:  ----------------------------------MGVQSQI----QGWNGKFFSVGGREVLLKSIVQAIPCYIMNCFRLPQKLVQDISRAMAQFWWNGNT

Query:  EDRRIHWVSWKTLCKPKCLGRMSFRDLKTFNQDLLAKQCRRIVRYPSSFLSRVLKGRYFPSGDFLAAGVGSRPLYIWRSLIWGRELL
            IHW +WK LCK K  G M FR+   FNQ LLAKQ  RI+  PSS ++RVLK RYF +G FL A  G++P   W+S++WG+ELL
Subjt:  EDRRIHWVSWKTLCKPKCLGRMSFRDLKTFNQDLLAKQCRRIVRYPSSFLSRVLKGRYFPSGDFLAAGVGSRPLYIWRSLIWGRELL

A0A803QGT2 Uncharacterized protein7.3e-6544.95Show/hide
Query:  GLHQGDPLSPYLFLLCAKGLSCMLHEAEGARAITRLQIACGCPPISHLFFVDDSLLFFRAKEGEARAVHYILQCYERASGQTINFDKSIISFSPSTAMGV
        GL QGDPLSPYLFL+C++GLS +L   E    +  L ++   P ISHLFF DDSLLF +A +    A+   L  Y RASGQ +N DKS++SFSP+T + V
Subjt:  GLHQGDPLSPYLFLLCAKGLSCMLHEAEGARAITRLQIACGCPPISHLFFVDDSLLFFRAKEGEARAVHYILQCYERASGQTINFDKSIISFSPSTAMGV

Query:  QSQIQ-----------------------------------------GWNGKFFSVGGREVLLKSIVQAIPCYIMNCFRLPQKLVQDISRAMAQFWWNGNT
        Q+  Q                                          WN K FS+GG+EVLLK++VQ+IP Y M+CFRLP KL  +I   MA+FWW  ++
Subjt:  QSQIQ-----------------------------------------GWNGKFFSVGGREVLLKSIVQAIPCYIMNCFRLPQKLVQDISRAMAQFWWNGNT

Query:  EDRRIHWVSWKTLCKPKCLGRMSFRDLKTFNQDLLAKQCRRIVRYPSSFLSRVLKGRYFPSGDFLAAGVGSRPLYIWRSLIWGRELL
        ++++IHW  W+ LCK K  G M FR    FNQ LLAKQ  RI + P+S LSRVLKG YF   DF+ A  G      W+ ++WGRELL
Subjt:  EDRRIHWVSWKTLCKPKCLGRMSFRDLKTFNQDLLAKQCRRIVRYPSSFLSRVLKGRYFPSGDFLAAGVGSRPLYIWRSLIWGRELL

SwissProt top hitse value%identityAlignment
P92555 Uncharacterized mitochondrial protein AtMg012503.9e-0744.44Show/hide
Query:  GLHQGDPLSPYLFLLCAKGLSCMLHEAEGARAITRLQIACGCPPISHLFFVDDS
        GL QGDPLSPYLF+LC + LS +   A+    +  ++++   P I+HL F DD+
Subjt:  GLHQGDPLSPYLFLLCAKGLSCMLHEAEGARAITRLQIACGCPPISHLFFVDDS

P93295 Uncharacterized mitochondrial protein AtMg003104.3e-3050.41Show/hide
Query:  AIPCYIMNCFRLPQKLVQDISRAMAQFWWNGNTEDRRIHWVSWKTLCKPK-CLGRMSFRDLKTFNQDLLAKQCRRIVRYPSSFLSRVLKGRYFPSGDFLA
        A+P Y M+CFRL + L + ++ AM +FWW+     R+I WV+W+ LCK K   G + FRDL  FNQ LLAKQ  RI+  P + LSR+L+ RYFP    + 
Subjt:  AIPCYIMNCFRLPQKLVQDISRAMAQFWWNGNTEDRRIHWVSWKTLCKPK-CLGRMSFRDLKTFNQDLLAKQCRRIVRYPSSFLSRVLKGRYFPSGDFLA

Query:  AGVGSRPLYIWRSLIWGRELLRR
          VG+RP Y WRS+I GRELL R
Subjt:  AGVGSRPLYIWRSLIWGRELLRR

Arabidopsis top hitse value%identityAlignment
AT4G29090.1 Ribonuclease H-like superfamily protein4.7e-2439.34Show/hide
Query:  AIPCYIMNCFRLPQKLVQDISRAMAQFWWNGNTEDRRIHWVSWKTLCKPKCLGRMSFRDLKTFNQDLLAKQCRRIVRYPSSFLSRVLKGRYFPSGDFLAA
        A+P Y M CF LP+ + + I   +A FWW    E + +HW +W  L   K  G + F+D++ FN  LL KQ  R++  P S +++V K RYF   D L A
Subjt:  AIPCYIMNCFRLPQKLVQDISRAMAQFWWNGNTEDRRIHWVSWKTLCKPKCLGRMSFRDLKTFNQDLLAKQCRRIVRYPSSFLSRVLKGRYFPSGDFLAA

Query:  GVGSRPLYIWRSLIWGRELLRR
         +GSRP ++W+S+   +E+LR+
Subjt:  GVGSRPLYIWRSLIWGRELLRR

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein3.0e-3150.41Show/hide
Query:  AIPCYIMNCFRLPQKLVQDISRAMAQFWWNGNTEDRRIHWVSWKTLCKPK-CLGRMSFRDLKTFNQDLLAKQCRRIVRYPSSFLSRVLKGRYFPSGDFLA
        A+P Y M+CFRL + L + ++ AM +FWW+     R+I WV+W+ LCK K   G + FRDL  FNQ LLAKQ  RI+  P + LSR+L+ RYFP    + 
Subjt:  AIPCYIMNCFRLPQKLVQDISRAMAQFWWNGNTEDRRIHWVSWKTLCKPK-CLGRMSFRDLKTFNQDLLAKQCRRIVRYPSSFLSRVLKGRYFPSGDFLA

Query:  AGVGSRPLYIWRSLIWGRELLRR
          VG+RP Y WRS+I GRELL R
Subjt:  AGVGSRPLYIWRSLIWGRELLRR

ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase)2.8e-0844.44Show/hide
Query:  GLHQGDPLSPYLFLLCAKGLSCMLHEAEGARAITRLQIACGCPPISHLFFVDDS
        GL QGDPLSPYLF+LC + LS +   A+    +  ++++   P I+HL F DD+
Subjt:  GLHQGDPLSPYLFLLCAKGLSCMLHEAEGARAITRLQIACGCPPISHLFFVDDS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGCCTTCACCAGGGAGATCCACTTTCCCCGTATTTGTTCTTATTGTGTGCTAAAGGGCTCTCTTGTATGTTGCATGAAGCGGAAGGGGCGAGGGCTATCACGAGGTT
GCAAATAGCGTGTGGGTGCCCTCCTATCTCCCATTTGTTTTTTGTTGATGACAGCCTCCTTTTCTTTCGGGCTAAAGAGGGTGAAGCTCGAGCTGTGCATTACATCCTCC
AATGCTATGAGCGAGCATCCGGGCAAACCATAAACTTTGACAAATCAATTATCTCCTTTAGTCCGAGCACTGCAATGGGTGTTCAATCCCAGATTCAAGGTTGGAATGGG
AAGTTTTTCTCTGTAGGGGGCAGGGAGGTGTTACTGAAGTCCATCGTGCAGGCTATCCCGTGTTACATTATGAATTGTTTCCGCTTGCCTCAAAAGCTGGTTCAGGACAT
TAGTAGGGCAATGGCGCAATTCTGGTGGAATGGGAATACAGAAGATAGAAGGATCCACTGGGTTAGTTGGAAGACGCTGTGCAAGCCAAAATGTTTGGGTCGAATGAGTT
TCAGAGATTTGAAAACTTTCAACCAAGACCTCTTGGCCAAACAGTGTAGGAGGATTGTTCGATATCCTTCCTCGTTTCTCTCCCGTGTGTTGAAGGGGCGGTATTTTCCT
AGTGGAGACTTCCTGGCTGCAGGGGTGGGCTCCCGTCCCTTGTATATTTGGAGGAGCCTGATTTGGGGGAGGGAGCTTTTGAGAAGGGTATTCGCTGGAGGATTGGAAAT
GGGAAGAAAGTTAGTGTATATGGATCTAACAAAATATTTAAGACAGGTATATGTATAG
mRNA sequenceShow/hide mRNA sequence
ATGGGCCTTCACCAGGGAGATCCACTTTCCCCGTATTTGTTCTTATTGTGTGCTAAAGGGCTCTCTTGTATGTTGCATGAAGCGGAAGGGGCGAGGGCTATCACGAGGTT
GCAAATAGCGTGTGGGTGCCCTCCTATCTCCCATTTGTTTTTTGTTGATGACAGCCTCCTTTTCTTTCGGGCTAAAGAGGGTGAAGCTCGAGCTGTGCATTACATCCTCC
AATGCTATGAGCGAGCATCCGGGCAAACCATAAACTTTGACAAATCAATTATCTCCTTTAGTCCGAGCACTGCAATGGGTGTTCAATCCCAGATTCAAGGTTGGAATGGG
AAGTTTTTCTCTGTAGGGGGCAGGGAGGTGTTACTGAAGTCCATCGTGCAGGCTATCCCGTGTTACATTATGAATTGTTTCCGCTTGCCTCAAAAGCTGGTTCAGGACAT
TAGTAGGGCAATGGCGCAATTCTGGTGGAATGGGAATACAGAAGATAGAAGGATCCACTGGGTTAGTTGGAAGACGCTGTGCAAGCCAAAATGTTTGGGTCGAATGAGTT
TCAGAGATTTGAAAACTTTCAACCAAGACCTCTTGGCCAAACAGTGTAGGAGGATTGTTCGATATCCTTCCTCGTTTCTCTCCCGTGTGTTGAAGGGGCGGTATTTTCCT
AGTGGAGACTTCCTGGCTGCAGGGGTGGGCTCCCGTCCCTTGTATATTTGGAGGAGCCTGATTTGGGGGAGGGAGCTTTTGAGAAGGGTATTCGCTGGAGGATTGGAAAT
GGGAAGAAAGTTAGTGTATATGGATCTAACAAAATATTTAAGACAGGTATATGTATAG
Protein sequenceShow/hide protein sequence
MGLHQGDPLSPYLFLLCAKGLSCMLHEAEGARAITRLQIACGCPPISHLFFVDDSLLFFRAKEGEARAVHYILQCYERASGQTINFDKSIISFSPSTAMGVQSQIQGWNG
KFFSVGGREVLLKSIVQAIPCYIMNCFRLPQKLVQDISRAMAQFWWNGNTEDRRIHWVSWKTLCKPKCLGRMSFRDLKTFNQDLLAKQCRRIVRYPSSFLSRVLKGRYFP
SGDFLAAGVGSRPLYIWRSLIWGRELLRRVFAGGLEMGRKLVYMDLTKYLRQVYV