; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0008432 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0008432
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr9:21421811..21423891
RNA-Seq ExpressionLag0008432
SyntenyLag0008432
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
ONI01138.1 hypothetical protein PRUPE_6G123900 [Prunus persica]1.9e-3834.8Show/hide
Query:  RTSTLKFIKDRVWQQIQGWKGKLFSVGGREVLLKSVVQGIPCYAMNYFRLPKKLIAKINRMMA--------------------------------RDLET
        R    + +KD++W+ I GWK KL S  G+E+L+K+V+Q IP Y+M+ FR+PK L  ++N +MA                                RDLE 
Subjt:  RTSTLKFIKDRVWQQIQGWKGKLFSVGGREVLLKSVVQGIPCYAMNYFRLPKKLIAKINRMMA--------------------------------RDLET

Query:  FNQALLAKQCWRIVSQPTSYLARVLKGWYFPSSEFLDAVVGSRPSFIWRSLMWGKKLLEKGVRWRIGNGERV----------------------------
        FNQALLAKQCWRI+  P S +AR+ +  Y PS  FL+A VG+ PSFIWRSL WGK+LL KG+RWR+G+G  +                            
Subjt:  FNQALLAKQCWRIVSQPTSYLARVLKGWYFPSSEFLDAVVGSRPSFIWRSLMWGKKLLEKGVRWRIGNGERV----------------------------

Query:  -----------------------------------ACEDAVIWHFEKSGLYLVKSGYRVG--QDPLLAGSPSS
                                           A  D +IWH+E++G+Y VKSGYR+   +   ++G PS+
Subjt:  -----------------------------------ACEDAVIWHFEKSGLYLVKSGYRVG--QDPLLAGSPSS

ONI09819.1 hypothetical protein PRUPE_4G011200 [Prunus persica]1.6e-3734.43Show/hide
Query:  RTSTLKFIKDRVWQQIQGWKGKLFSVGGREVLLKSVVQGIPCYAMNYFRLPKKLIAKINRMMA--------------------------------RDLET
        R    + +KD++W+ I GWK KL S  G+E+L+K+V+Q IP Y+M+ F++PK L  ++N +MA                                RDLE 
Subjt:  RTSTLKFIKDRVWQQIQGWKGKLFSVGGREVLLKSVVQGIPCYAMNYFRLPKKLIAKINRMMA--------------------------------RDLET

Query:  FNQALLAKQCWRIVSQPTSYLARVLKGWYFPSSEFLDAVVGSRPSFIWRSLMWGKKLLEKGVRWRIGNGERV----------------------------
        FNQALLAKQCWRI+  P S +AR+ +  Y PS  FL+A VG+ PSFIW SL WGK+LL KGVRWR+G+G  +                            
Subjt:  FNQALLAKQCWRIVSQPTSYLARVLKGWYFPSSEFLDAVVGSRPSFIWRSLMWGKKLLEKGVRWRIGNGERV----------------------------

Query:  -----------------------------------ACEDAVIWHFEKSGLYLVKSGYRVG--QDPLLAGSPSS
                                           A  D +IWH+E++G+Y VKSGYR+   +   ++G PS+
Subjt:  -----------------------------------ACEDAVIWHFEKSGLYLVKSGYRVG--QDPLLAGSPSS

VVA32947.1 PREDICTED: retrotransposon [Prunus dulcis]6.5e-3935.29Show/hide
Query:  RTSTLKFIKDRVWQQIQGWKGKLFSVGGREVLLKSVVQGIPCYAMNYFRLPKKLIAKINRMMA--------------------------------RDLET
        R    + +KD++W+ I GWK KL S  G+E+L+K+V+Q IP Y+M+ FR+PK L  ++N +MA                                RDLE 
Subjt:  RTSTLKFIKDRVWQQIQGWKGKLFSVGGREVLLKSVVQGIPCYAMNYFRLPKKLIAKINRMMA--------------------------------RDLET

Query:  FNQALLAKQCWRIVSQPTSYLARVLKGWYFPSSEFLDAVVGSRPSFIWRSLMWGKKLLEKGVRWRIGNGERV----------------------------
        FNQALLAKQCWRI+  P S +AR+ +  Y PS  FL+A VG+ PSFIWRSL WGK+LL KG+RWR+GNG  +                            
Subjt:  FNQALLAKQCWRIVSQPTSYLARVLKGWYFPSSEFLDAVVGSRPSFIWRSLMWGKKLLEKGVRWRIGNGERV----------------------------

Query:  -----------------------------------ACEDAVIWHFEKSGLYLVKSGYRVG--QDPLLAGSPS
                                           A  D +IWH+E++G+Y VKSGYR+   +   ++G PS
Subjt:  -----------------------------------ACEDAVIWHFEKSGLYLVKSGYRVG--QDPLLAGSPS

XP_022150918.1 uncharacterized protein LOC111018954 [Momordica charantia]1.6e-4537.81Show/hide
Query:  MPRCRTSTLKFIKDRVWQQIQGWKGKLFSVGGREVLLKSVVQGIPCYAMNYFRLPKKLIAKINRMMA--------------------------------R
        MPR R     +IKDRVW+ +QGWK KLFS+GG+EVL+K+V Q IPCY M+ FRLPK+LI + + + A                                R
Subjt:  MPRCRTSTLKFIKDRVWQQIQGWKGKLFSVGGREVLLKSVVQGIPCYAMNYFRLPKKLIAKINRMMA--------------------------------R

Query:  DLETFNQALLAKQCWRIVSQPTSYLARVLKGWYFPSSEFLDAVVGSRPSFIWRSLMWGKKLLEKGVRWRIGNGERV------------------------
        DLE FN+ALLAKQCWRI++ P S L+RVLKG YF    F++A +   PS+IWRS++WG+ LL+KG+RWRIGNG+ V                        
Subjt:  DLETFNQALLAKQCWRIVSQPTSYLARVLKGWYFPSSEFLDAVVGSRPSFIWRSLMWGKKLLEKGVRWRIGNGERV------------------------

Query:  ----------------------------------------ACEDAVIWHFEKSGLYLVKSGYRVG--QDPLLAGSPSSSSPEL
                                                A ED +IW++EK+G+Y V+SGY+V    +P +    SSSS E+
Subjt:  ----------------------------------------ACEDAVIWHFEKSGLYLVKSGYRVG--QDPLLAGSPSSSSPEL

XP_024039324.1 uncharacterized protein LOC112097962 [Citrus clementina]1.8e-3626.11Show/hide
Query:  RCRTSTLKFIKDRVWQQIQGWKGKLFSVGGREVLLKSVVQGIPCYAMNYFRLPKKLIAKINRMMA--------------------------------RDL
        R +TS  + +K +V  +I  W+ K+FS GG+E+L+K+V Q +P +AM+ F+LPK L  +I   +A                                RD 
Subjt:  RCRTSTLKFIKDRVWQQIQGWKGKLFSVGGREVLLKSVVQGIPCYAMNYFRLPKKLIAKINRMMA--------------------------------RDL

Query:  ETFNQALLAKQCWRIVSQPTSYLARVLKGWYFPSSEFLDAVVGSRPSFIWRSLMWGKKLLEKGVRWRIGNGE--------------------------RV
         +FNQA++AKQ WR++  P S +++VL+  YF S  FL+A  GS PSFIWRS++WG+++++KG RW IGNG                           R 
Subjt:  ETFNQALLAKQCWRIVSQPTSYLARVLKGWYFPSSEFLDAVVGSRPSFIWRSLMWGKKLLEKGVRWRIGNGE--------------------------RV

Query:  ACEDAVIWHFEKSGLYLVKSGYRVG---QDPLLAGSPSSSSPELIHGGLTRRTYVCAVVGWGSRVFTCSGIVSRRERLCGLQVLGLFCPSARQTTLNSSS
           D ++WH++K G Y V+SGY++    + P   GS  S+S                +  W +         +  +R C    +   C    +T  +   
Subjt:  ACEDAVIWHFEKSGLYLVKSGYRVG---QDPLLAGSPSSSSPELIHGGLTRRTYVCAVVGWGSRVFTCSGIVSRRERLCGLQVLGLFCPSARQTTLNSSS

Query:  DI-------------------------RNE-MRQKKKVSMANLADWVVGYLNAFRDSGRREMDLLGRGPSLVVCRWERPEVDGFKVNVDAAFCLESETTG
         I                         RN+ + ++KK++   LA      L A+  + + +   +     +V  +WE P  +  KVNVDA   +  E + 
Subjt:  DI-------------------------RNE-MRQKKKVSMANLADWVVGYLNAFRDSGRREMDLLGRGPSLVVCRWERPEVDGFKVNVDAAFCLESETTG

Query:  VAWYVEILGGRWASQLRPSMRISEMLTLRKGGGVGVGDVGGGCGEGCSSGWFIENNFTFREGNCVAHRLARLAMEDRCDRVWVEEGPSCVLGLL
         A  +E               + ++L   KG   G+  V     E       ++     R  N  AH LA+LA+      VW++  P+ +L +L
Subjt:  VAWYVEILGGRWASQLRPSMRISEMLTLRKGGGVGVGDVGGGCGEGCSSGWFIENNFTFREGNCVAHRLARLAMEDRCDRVWVEEGPSCVLGLL

TrEMBL top hitse value%identityAlignment
A0A251NPF0 Reverse transcriptase domain-containing protein9.1e-3934.8Show/hide
Query:  RTSTLKFIKDRVWQQIQGWKGKLFSVGGREVLLKSVVQGIPCYAMNYFRLPKKLIAKINRMMA--------------------------------RDLET
        R    + +KD++W+ I GWK KL S  G+E+L+K+V+Q IP Y+M+ FR+PK L  ++N +MA                                RDLE 
Subjt:  RTSTLKFIKDRVWQQIQGWKGKLFSVGGREVLLKSVVQGIPCYAMNYFRLPKKLIAKINRMMA--------------------------------RDLET

Query:  FNQALLAKQCWRIVSQPTSYLARVLKGWYFPSSEFLDAVVGSRPSFIWRSLMWGKKLLEKGVRWRIGNGERV----------------------------
        FNQALLAKQCWRI+  P S +AR+ +  Y PS  FL+A VG+ PSFIWRSL WGK+LL KG+RWR+G+G  +                            
Subjt:  FNQALLAKQCWRIVSQPTSYLARVLKGWYFPSSEFLDAVVGSRPSFIWRSLMWGKKLLEKGVRWRIGNGERV----------------------------

Query:  -----------------------------------ACEDAVIWHFEKSGLYLVKSGYRVG--QDPLLAGSPSS
                                           A  D +IWH+E++G+Y VKSGYR+   +   ++G PS+
Subjt:  -----------------------------------ACEDAVIWHFEKSGLYLVKSGYRVG--QDPLLAGSPSS

A0A5E4FZN9 PREDICTED: retrotransposon3.1e-3935.29Show/hide
Query:  RTSTLKFIKDRVWQQIQGWKGKLFSVGGREVLLKSVVQGIPCYAMNYFRLPKKLIAKINRMMA--------------------------------RDLET
        R    + +KD++W+ I GWK KL S  G+E+L+K+V+Q IP Y+M+ FR+PK L  ++N +MA                                RDLE 
Subjt:  RTSTLKFIKDRVWQQIQGWKGKLFSVGGREVLLKSVVQGIPCYAMNYFRLPKKLIAKINRMMA--------------------------------RDLET

Query:  FNQALLAKQCWRIVSQPTSYLARVLKGWYFPSSEFLDAVVGSRPSFIWRSLMWGKKLLEKGVRWRIGNGERV----------------------------
        FNQALLAKQCWRI+  P S +AR+ +  Y PS  FL+A VG+ PSFIWRSL WGK+LL KG+RWR+GNG  +                            
Subjt:  FNQALLAKQCWRIVSQPTSYLARVLKGWYFPSSEFLDAVVGSRPSFIWRSLMWGKKLLEKGVRWRIGNGERV----------------------------

Query:  -----------------------------------ACEDAVIWHFEKSGLYLVKSGYRVG--QDPLLAGSPS
                                           A  D +IWH+E++G+Y VKSGYR+   +   ++G PS
Subjt:  -----------------------------------ACEDAVIWHFEKSGLYLVKSGYRVG--QDPLLAGSPS

A0A6J1DAR4 uncharacterized protein LOC1110189547.7e-4637.81Show/hide
Query:  MPRCRTSTLKFIKDRVWQQIQGWKGKLFSVGGREVLLKSVVQGIPCYAMNYFRLPKKLIAKINRMMA--------------------------------R
        MPR R     +IKDRVW+ +QGWK KLFS+GG+EVL+K+V Q IPCY M+ FRLPK+LI + + + A                                R
Subjt:  MPRCRTSTLKFIKDRVWQQIQGWKGKLFSVGGREVLLKSVVQGIPCYAMNYFRLPKKLIAKINRMMA--------------------------------R

Query:  DLETFNQALLAKQCWRIVSQPTSYLARVLKGWYFPSSEFLDAVVGSRPSFIWRSLMWGKKLLEKGVRWRIGNGERV------------------------
        DLE FN+ALLAKQCWRI++ P S L+RVLKG YF    F++A +   PS+IWRS++WG+ LL+KG+RWRIGNG+ V                        
Subjt:  DLETFNQALLAKQCWRIVSQPTSYLARVLKGWYFPSSEFLDAVVGSRPSFIWRSLMWGKKLLEKGVRWRIGNGERV------------------------

Query:  ----------------------------------------ACEDAVIWHFEKSGLYLVKSGYRVG--QDPLLAGSPSSSSPEL
                                                A ED +IW++EK+G+Y V+SGY+V    +P +    SSSS E+
Subjt:  ----------------------------------------ACEDAVIWHFEKSGLYLVKSGYRVG--QDPLLAGSPSSSSPEL

A0A803Q0L5 Uncharacterized protein8.2e-4026.73Show/hide
Query:  RCRTSTLKFIKDRVWQQIQGWKGKLFSVGGREVLLKSVVQGIPCYAMNYFRLPKKLIAKINRMMA--------------------------------RDL
        R +   L  IK++VW +++GWK  +FS+ G+EVL+K+VVQ IP YAM+ FRL KK I+ I+RM A                                RDL
Subjt:  RCRTSTLKFIKDRVWQQIQGWKGKLFSVGGREVLLKSVVQGIPCYAMNYFRLPKKLIAKINRMMA--------------------------------RDL

Query:  ETFNQALLAKQCWRIVSQPTSYLARVLKGWYFPSSEFLDAVVGSRPSFIWRSLMWGKKLLEKGVRWRIGNGERV--------------------------
          FNQA+LAKQ WR +    +  +RVLK  YFP    L+A  G+  SF+WRSL+WGKK++  G RWR+GNGE V                          
Subjt:  ETFNQALLAKQCWRIVSQPTSYLARVLKGWYFPSSEFLDAVVGSRPSFIWRSLMWGKKLLEKGVRWRIGNGERV--------------------------

Query:  -------------------------------------ACEDAVIWHFEKSGLYLVKSGYRVGQDPLLAGSPSSSS--------------PELIHGGLTRR
                                               ED ++WH+ K+G Y VKSGY++    +     S+                P  I   + + 
Subjt:  -------------------------------------ACEDAVIWHFEKSGLYLVKSGYRVGQDPLLAGSPSSSS--------------PELIHGGLTRR

Query:  TY-------VCAVVGWGSRVFTCSGIVSRRERLCGLQVLGLFCPSARQTTLNSSS----------DIRNEMRQKKKVSMA-NLADWVVGYLNAFRDSGRR
         Y         A  G     +  SG     +RL    VL      ARQ                 +IRN +       MA  + DW   YL  F   G  
Subjt:  TY-------VCAVVGWGSRVFTCSGIVSRRERLCGLQVLGLFCPSARQTTLNSSS----------DIRNEMRQKKKVSMA-NLADWVVGYLNAFRDSGRR

Query:  EMDLLGRGPSLVVCRWERPEVDGFKVNVDAAFCLESETTGVAWYVEILGGR--WASQLRPSMRIS----EMLTLRKGGGVGVG------DVGGGC-----
         +  + R  S    +W  PE+   K+NVDA        +G+   +   GGR  +AS       ++    E+  +  G  VG+        +   C     
Subjt:  EMDLLGRGPSLVVCRWERPEVDGFKVNVDAAFCLESETTGVAWYVEILGGR--WASQLRPSMRIS----EMLTLRKGGGVGVG------DVGGGC-----

Query:  -----GEGCSSGWFIEN--------------NFTFREGNCVAHRLARLAMEDRCDRVWVEEGPSC
              EGC     + +              +F FRE N VA+ LA  A+ ++   +W+    SC
Subjt:  -----GEGCSSGWFIEN--------------NFTFREGNCVAHRLARLAMEDRCDRVWVEEGPSC

M5W5F3 Reverse transcriptase domain-containing protein (Fragment)9.1e-3934.8Show/hide
Query:  RTSTLKFIKDRVWQQIQGWKGKLFSVGGREVLLKSVVQGIPCYAMNYFRLPKKLIAKINRMMA--------------------------------RDLET
        R    + +KD++W+ I GWK KL S  G+E+L+K+V+Q IP Y+M+ FR+PK L  ++N +MA                                RDLE 
Subjt:  RTSTLKFIKDRVWQQIQGWKGKLFSVGGREVLLKSVVQGIPCYAMNYFRLPKKLIAKINRMMA--------------------------------RDLET

Query:  FNQALLAKQCWRIVSQPTSYLARVLKGWYFPSSEFLDAVVGSRPSFIWRSLMWGKKLLEKGVRWRIGNGERV----------------------------
        FNQALLAKQCWRI+  P S +AR+ +  Y PS  FL+A VG+ PSFIWRSL WGK+LL KG+RWR+G+G  +                            
Subjt:  FNQALLAKQCWRIVSQPTSYLARVLKGWYFPSSEFLDAVVGSRPSFIWRSLMWGKKLLEKGVRWRIGNGERV----------------------------

Query:  -----------------------------------ACEDAVIWHFEKSGLYLVKSGYRVG--QDPLLAGSPSS
                                           A  D +IWH+E++G+Y VKSGYR+   +   ++G PS+
Subjt:  -----------------------------------ACEDAVIWHFEKSGLYLVKSGYRVG--QDPLLAGSPSS

SwissProt top hitse value%identityAlignment
P0C2F6 Putative ribonuclease H protein At1g657501.5e-0623.03Show/hide
Query:  RCRTSTLKFIKDRVWQQIQGWKGKLFSVGGREVLLKSVVQGIPCYAMNYFRLPKKLIAKINRMM--------------------------------ARDL
        R    T   I +RV  ++ GW+ K  S  GR  L K+V+  +P ++M+   LP+ ++ +++++                                  R  
Subjt:  RCRTSTLKFIKDRVWQQIQGWKGKLFSVGGREVLLKSVVQGIPCYAMNYFRLPKKLIAKINRMM--------------------------------ARDL

Query:  ETFNQALLAKQCWRIVSQPTSYLARVLKGWYFPSSEFLDA---VVGSRPSFIWRSLMWG-KKLLEKGVRWRIGNGERV
        ++ N+AL++K  WR++ +  S    VL+  Y    E  D+   +     S  WRS+  G + ++  GV W  G+G+++
Subjt:  ETFNQALLAKQCWRIVSQPTSYLARVLKGWYFPSSEFLDA---VVGSRPSFIWRSLMWG-KKLLEKGVRWRIGNGERV

P93295 Uncharacterized mitochondrial protein AtMg003101.0e-1537.4Show/hide
Query:  IPCYAMNYFRLPKKLIAKINRMMA---------------------------------RDLETFNQALLAKQCWRIVSQPTSYLARVLKGWYFPSSEFLDA
        +P YAM+ FRL K L  K+   M                                  RDL  FNQALLAKQ +RI+ QP + L+R+L+  YFP S  ++ 
Subjt:  IPCYAMNYFRLPKKLIAKINRMMA---------------------------------RDLETFNQALLAKQCWRIVSQPTSYLARVLKGWYFPSSEFLDA

Query:  VVGSRPSFIWRSLMWGKKLLEKGVRWRIGNG
         VG+RPS+ WRS++ G++LL +G+   IG+G
Subjt:  VVGSRPSFIWRSLMWGKKLLEKGVRWRIGNG

Arabidopsis top hitse value%identityAlignment
AT4G29090.1 Ribonuclease H-like superfamily protein4.3e-1747.06Show/hide
Query:  RDLETFNQALLAKQCWRIVSQPTSYLARVLKGWYFPSSEFLDAVVGSRPSFIWRSLMWGKKLLEKGVRWRIGNGERVACEDAVIW
        +D+E FN ALL KQ WR++S+P S +A+V K  YF  S+ L+A +GSRPSF+W+S+   +++L +G R  +GNG     ED +IW
Subjt:  RDLETFNQALLAKQCWRIVSQPTSYLARVLKGWYFPSSEFLDAVVGSRPSFIWRSLMWGKKLLEKGVRWRIGNGERVACEDAVIW

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein7.4e-1737.4Show/hide
Query:  IPCYAMNYFRLPKKLIAKINRMMA---------------------------------RDLETFNQALLAKQCWRIVSQPTSYLARVLKGWYFPSSEFLDA
        +P YAM+ FRL K L  K+   M                                  RDL  FNQALLAKQ +RI+ QP + L+R+L+  YFP S  ++ 
Subjt:  IPCYAMNYFRLPKKLIAKINRMMA---------------------------------RDLETFNQALLAKQCWRIVSQPTSYLARVLKGWYFPSSEFLDA

Query:  VVGSRPSFIWRSLMWGKKLLEKGVRWRIGNG
         VG+RPS+ WRS++ G++LL +G+   IG+G
Subjt:  VVGSRPSFIWRSLMWGKKLLEKGVRWRIGNG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCTCGTTGTAGGACGAGTACGTTGAAATTCATTAAGGACCGAGTTTGGCAGCAGATTCAGGGGTGGAAGGGGAAATTGTTCTCTGTTGGAGGGAGGGAGGTCTTGCT
GAAATCTGTGGTTCAGGGCATCCCATGTTATGCTATGAACTACTTCCGGCTGCCGAAGAAACTGATTGCAAAAATAAACCGGATGATGGCGAGGGACTTGGAGACCTTTA
ACCAGGCCCTACTGGCCAAACAGTGTTGGAGGATAGTCAGTCAGCCAACGTCTTACCTGGCCCGTGTCCTGAAAGGGTGGTACTTTCCGTCTTCAGAGTTCTTGGATGCG
GTGGTAGGGAGTCGACCTTCCTTTATCTGGAGGAGCCTTATGTGGGGGAAGAAGTTGTTGGAAAAGGGGGTTCGCTGGAGGATTGGGAATGGAGAAAGGGTTGCTTGTGA
GGATGCCGTCATTTGGCACTTTGAGAAGTCGGGGCTATATTTAGTAAAGAGTGGGTATCGGGTTGGCCAAGACCCTCTTTTGGCCGGAAGCCCCTCATCTTCGTCTCCAG
AGTTAATACACGGGGGGTTGACTCGCCGAACGTATGTGTGTGCTGTGGTAGGATGGGGGAGTCGAGTCTTCACCTGTTCTGGCATTGTAAGCAGACGAGAGAGATTATGT
GGTCTGCAGGTTTTGGGGCTATTTTGTCCAAGTGCAAGGCAGACGACATTAAATTCCTCCTCCGACATACGGAATGAGATGAGGCAAAAGAAGAAGGTCTCGATGGCGAA
TCTGGCAGATTGGGTGGTCGGGTATCTGAATGCATTCCGTGATTCGGGGAGGAGGGAGATGGATCTGTTGGGGAGAGGTCCTAGTTTGGTTGTTTGCAGGTGGGAGCGGC
CAGAGGTTGACGGCTTCAAGGTGAATGTTGATGCGGCCTTTTGTCTGGAGAGCGAAACAACAGGTGTGGCGTGGTATGTTGAGATTCTTGGGGGCAGGTGGGCTTCACAA
CTACGGCCTTCTATGAGAATATCAGAGATGCTGACTTTGCGAAAGGGAGGAGGTGTCGGAGTTGGGGATGTTGGTGGAGGATGCGGTGAGGGGTGTTCCAGTGGGTGGTT
TATCGAAAACAACTTTACATTCAGGGAAGGCAACTGCGTAGCCCATCGTTTGGCAAGGTTGGCGATGGAGGACAGATGTGACCGAGTGTGGGTGGAAGAAGGCCCTTCAT
GTGTTTTGGGCTTGTTGGTTGAGGAGGGAGTGCATGAGTTTCCCCAAGCGAGGAGGGGAGGATTTTCTGGAGGAATGATATGTTTTAGGTTGGGGTGGGGATGGTGTAGG
CTAGTGGGCTTTGGTTTAAGTCTGGTTGCACCTGAGCGCCAGGCTAATTTATTATTTTACTCTTGTTATGTTGCGTCTGTTTTGCTCTGCTGGATTGAGTTGAGGCCATG
CACCTGA
mRNA sequenceShow/hide mRNA sequence
ATGCCTCGTTGTAGGACGAGTACGTTGAAATTCATTAAGGACCGAGTTTGGCAGCAGATTCAGGGGTGGAAGGGGAAATTGTTCTCTGTTGGAGGGAGGGAGGTCTTGCT
GAAATCTGTGGTTCAGGGCATCCCATGTTATGCTATGAACTACTTCCGGCTGCCGAAGAAACTGATTGCAAAAATAAACCGGATGATGGCGAGGGACTTGGAGACCTTTA
ACCAGGCCCTACTGGCCAAACAGTGTTGGAGGATAGTCAGTCAGCCAACGTCTTACCTGGCCCGTGTCCTGAAAGGGTGGTACTTTCCGTCTTCAGAGTTCTTGGATGCG
GTGGTAGGGAGTCGACCTTCCTTTATCTGGAGGAGCCTTATGTGGGGGAAGAAGTTGTTGGAAAAGGGGGTTCGCTGGAGGATTGGGAATGGAGAAAGGGTTGCTTGTGA
GGATGCCGTCATTTGGCACTTTGAGAAGTCGGGGCTATATTTAGTAAAGAGTGGGTATCGGGTTGGCCAAGACCCTCTTTTGGCCGGAAGCCCCTCATCTTCGTCTCCAG
AGTTAATACACGGGGGGTTGACTCGCCGAACGTATGTGTGTGCTGTGGTAGGATGGGGGAGTCGAGTCTTCACCTGTTCTGGCATTGTAAGCAGACGAGAGAGATTATGT
GGTCTGCAGGTTTTGGGGCTATTTTGTCCAAGTGCAAGGCAGACGACATTAAATTCCTCCTCCGACATACGGAATGAGATGAGGCAAAAGAAGAAGGTCTCGATGGCGAA
TCTGGCAGATTGGGTGGTCGGGTATCTGAATGCATTCCGTGATTCGGGGAGGAGGGAGATGGATCTGTTGGGGAGAGGTCCTAGTTTGGTTGTTTGCAGGTGGGAGCGGC
CAGAGGTTGACGGCTTCAAGGTGAATGTTGATGCGGCCTTTTGTCTGGAGAGCGAAACAACAGGTGTGGCGTGGTATGTTGAGATTCTTGGGGGCAGGTGGGCTTCACAA
CTACGGCCTTCTATGAGAATATCAGAGATGCTGACTTTGCGAAAGGGAGGAGGTGTCGGAGTTGGGGATGTTGGTGGAGGATGCGGTGAGGGGTGTTCCAGTGGGTGGTT
TATCGAAAACAACTTTACATTCAGGGAAGGCAACTGCGTAGCCCATCGTTTGGCAAGGTTGGCGATGGAGGACAGATGTGACCGAGTGTGGGTGGAAGAAGGCCCTTCAT
GTGTTTTGGGCTTGTTGGTTGAGGAGGGAGTGCATGAGTTTCCCCAAGCGAGGAGGGGAGGATTTTCTGGAGGAATGATATGTTTTAGGTTGGGGTGGGGATGGTGTAGG
CTAGTGGGCTTTGGTTTAAGTCTGGTTGCACCTGAGCGCCAGGCTAATTTATTATTTTACTCTTGTTATGTTGCGTCTGTTTTGCTCTGCTGGATTGAGTTGAGGCCATG
CACCTGA
Protein sequenceShow/hide protein sequence
MPRCRTSTLKFIKDRVWQQIQGWKGKLFSVGGREVLLKSVVQGIPCYAMNYFRLPKKLIAKINRMMARDLETFNQALLAKQCWRIVSQPTSYLARVLKGWYFPSSEFLDA
VVGSRPSFIWRSLMWGKKLLEKGVRWRIGNGERVACEDAVIWHFEKSGLYLVKSGYRVGQDPLLAGSPSSSSPELIHGGLTRRTYVCAVVGWGSRVFTCSGIVSRRERLC
GLQVLGLFCPSARQTTLNSSSDIRNEMRQKKKVSMANLADWVVGYLNAFRDSGRREMDLLGRGPSLVVCRWERPEVDGFKVNVDAAFCLESETTGVAWYVEILGGRWASQ
LRPSMRISEMLTLRKGGGVGVGDVGGGCGEGCSSGWFIENNFTFREGNCVAHRLARLAMEDRCDRVWVEEGPSCVLGLLVEEGVHEFPQARRGGFSGGMICFRLGWGWCR
LVGFGLSLVAPERQANLLFYSCYVASVLLCWIELRPCT