; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0000354 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0000354
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase
Genome locationchr4:4952580..4964499
RNA-Seq ExpressionLag0000354
SyntenyLag0000354
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022151688.1 uncharacterized protein LOC111019603 [Momordica charantia]1.6e-2835.77Show/hide
Query:  APPVLAAEALQAMLGNAF-LNNLQHVGANGAPALGEDVQFIKSFMKAKPPSFDGHSDSSEAVVEWIAALEAIFQFLGANAQQRVQGAAFMLKGHARTWWK
        A   L   ALQA++ N+      Q +    A AL  + QFI+ F +  PP+F+G S+ +  V EWI  LEA++ +LG + Q +V+GA FML+G A  WW 
Subjt:  APPVLAAEALQAMLGNAF-LNNLQHVGANGAPALGEDVQFIKSFMKAKPPSFDGHSDSSEAVVEWIAALEAIFQFLGANAQQRVQGAAFMLKGHARTWWK

Query:  VVGRTENRPENPISWSGFKGLVRDHFGCRFADVEQEAEFVSLVQGTLSVEQYVRRFEELS------------------------CRVP-GLGRPTTFAAA
        VV   E+    PI+W+  K L+ D++  +    E+E EF+ L Q TL V QY ++F E S                         + P  L RPTT+A A
Subjt:  VVGRTENRPENPISWSGFKGLVRDHFGCRFADVEQEAEFVSLVQGTLSVEQYVRRFEELS------------------------CRVP-GLGRPTTFAAA

Query:  LASARMLDRD-IPRTDRSQEVGTSSGAKKKSEVEVLAASQKVRGSP
        +  A ++D+D I +    Q+VG SSG K+K  V  +++SQ  + SP
Subjt:  LASARMLDRD-IPRTDRSQEVGTSSGAKKKSEVEVLAASQKVRGSP

XP_022155925.1 uncharacterized protein LOC111022925 [Momordica charantia]2.6e-2635.45Show/hide
Query:  PEVNSHPEANPSVPP--PAPPVLAAEALQAMLGNAFLNNLQHVGANGA----PALG----EDVQFIKSFMKAKPPSFDGHSDSSEAVVEWIAALEAIFQF
        P       A+ +VPP  P   VL AEALQ +L NA        GA GA    P+ G    E+VQFI+ F +  PP F+G S+   A  EW+  LEA++ +
Subjt:  PEVNSHPEANPSVPP--PAPPVLAAEALQAMLGNAFLNNLQHVGANGA----PALG----EDVQFIKSFMKAKPPSFDGHSDSSEAVVEWIAALEAIFQF

Query:  LGANAQQRVQGAAFMLKGHARTWWKVVGRTENRPENPISWSGFKGLVRDHFGCRFADVEQEAEFVSLVQGTLSVEQYVRRFEELS---------------
        LG +   +V+GA FML+G A  WW+ V   E+    P++W+ FK L+ +++       E+ AEF+ L Q +L V QY R+F ELS               
Subjt:  LGANAQQRVQGAAFMLKGHARTWWKVVGRTENRPENPISWSGFKGLVRDHFGCRFADVEQEAEFVSLVQGTLSVEQYVRRFEELS---------------

Query:  CRVPGLGR----------PTTFAAALASARMLDRDIPRTDRSQEVGTSSGAKKKSEVEVLAASQKVRG
          + GL R          PTT+AAA+  A ++D+ +      Q +G+SSG K+K      ++SQ  RG
Subjt:  CRVPGLGR----------PTTFAAALASARMLDRDIPRTDRSQEVGTSSGAKKKSEVEVLAASQKVRG

XP_022156326.1 uncharacterized protein LOC111023247 [Momordica charantia]6.2e-2840.72Show/hide
Query:  DVQFIKSFMKAKPPSFDGHSDSSEAVVEWIAALEAIFQFLGANAQQRVQGAAFMLKGHARTWWKVVGRTENRPENPISWSGFKGLVRDHFGCRFADVEQE
        + +FIK F +  PP+FDG S+ + AV EWI  LEA++ +LG   Q +V+GA FML+G A  WW  V   E+    PI W+ FK L+ D++        +E
Subjt:  DVQFIKSFMKAKPPSFDGHSDSSEAVVEWIAALEAIFQFLGANAQQRVQGAAFMLKGHARTWWKVVGRTENRPENPISWSGFKGLVRDHFGCRFADVEQE

Query:  AEFVSLVQGTLSVEQYVRRFEELS------------------------CRVP-GLGRPTTFAAALASARMLDRDIP-RTDRSQEVGTSSGAKKK
        AEF+ LVQGTLSV QY R+F ELS                         R P  L RPTT+A A+  A ++D+D+  +     EVG+SSG K+K
Subjt:  AEFVSLVQGTLSVEQYVRRFEELS------------------------CRVP-GLGRPTTFAAALASARMLDRDIP-RTDRSQEVGTSSGAKKK

XP_022156328.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111023249 [Momordica charantia]5.8e-2634.73Show/hide
Query:  PDVPEVNSHPEANPSVPPPAPPVLAAEALQAMLGNA-FLNNLQHVGANGAPALGEDVQFIKSFMKAKPPSFDGHSDSSEAVVEWIAALEAIFQFLGANAQ
        P VP+  + P+  P V P     L AEALQ +L NA      Q      A    ++VQFI+ F    PP F+G S+   A  EW+  LEA++ +LG +  
Subjt:  PDVPEVNSHPEANPSVPPPAPPVLAAEALQAMLGNA-FLNNLQHVGANGAPALGEDVQFIKSFMKAKPPSFDGHSDSSEAVVEWIAALEAIFQFLGANAQ

Query:  QRVQGAAFMLKGHARTWWKVVGRTENRPENPISWSGFKGLVRDHFGCRFADVEQEAEFVSLVQGTLSVEQYVRRFEELS---------------CRVPGL
         +V+GA FML+G A  WW+ V   E+    P++W+ FK L+ +++    A  E+  EF+ L QG+L+V QY R+F ELS                 + GL
Subjt:  QRVQGAAFMLKGHARTWWKVVGRTENRPENPISWSGFKGLVRDHFGCRFADVEQEAEFVSLVQGTLSVEQYVRRFEELS---------------CRVPGL

Query:  GR----------PTTFAAALASARMLDRDIPRTDRSQEVGTSSGAKKKSEVEVLAASQKVRG
         R          PTT+AAA+  A ++D+ +      Q +G++SG K+K      +ASQ  RG
Subjt:  GR----------PTTFAAALASARMLDRDIPRTDRSQEVGTSSGAKKKSEVEVLAASQKVRG

XP_022157413.1 uncharacterized protein LOC111024114 [Momordica charantia]8.9e-2734.83Show/hide
Query:  GRSGPPDVPEVNSHPEANPSVPPPAPPVLAAEALQAMLGNA-FLNNLQHVGANGAPALGEDVQFIKSFMKAKPPSFDGHSDSSEAVVEWIAALEAIFQFL
        GR  PP VP+  + P+  P V P     L AEALQ +L NA      Q      A    ++VQFI+ F +  PP F+G S+   A  EW+  LEA++ +L
Subjt:  GRSGPPDVPEVNSHPEANPSVPPPAPPVLAAEALQAMLGNA-FLNNLQHVGANGAPALGEDVQFIKSFMKAKPPSFDGHSDSSEAVVEWIAALEAIFQFL

Query:  GANAQQRVQGAAFMLKGHARTWWKVVGRTENRPENPISWSGFKGLVRDHFGCRFADVEQEAEFVSLVQGTLSVEQYVRRFEELS----------------
        G +   +V+GA FML+G A  WW+ V   E+    P++W+ FK L+ +++       E+ AEF+ L QG+L+V QY R+F ELS                
Subjt:  GANAQQRVQGAAFMLKGHARTWWKVVGRTENRPENPISWSGFKGLVRDHFGCRFADVEQEAEFVSLVQGTLSVEQYVRRFEELS----------------

Query:  ------CRVPGL---GRPTTFAAALASARMLDRDIPRTDRSQEVGTSSGAKKKSEVEVLAASQKVRG
                + GL     PTT+AAA+  A ++D+ +      Q +G+SSG K+K    + ++SQ  RG
Subjt:  ------CRVPGL---GRPTTFAAALASARMLDRDIPRTDRSQEVGTSSGAKKKSEVEVLAASQKVRG

TrEMBL top hitse value%identityAlignment
A0A6J1DCW8 uncharacterized protein LOC1110196037.9e-2935.77Show/hide
Query:  APPVLAAEALQAMLGNAF-LNNLQHVGANGAPALGEDVQFIKSFMKAKPPSFDGHSDSSEAVVEWIAALEAIFQFLGANAQQRVQGAAFMLKGHARTWWK
        A   L   ALQA++ N+      Q +    A AL  + QFI+ F +  PP+F+G S+ +  V EWI  LEA++ +LG + Q +V+GA FML+G A  WW 
Subjt:  APPVLAAEALQAMLGNAF-LNNLQHVGANGAPALGEDVQFIKSFMKAKPPSFDGHSDSSEAVVEWIAALEAIFQFLGANAQQRVQGAAFMLKGHARTWWK

Query:  VVGRTENRPENPISWSGFKGLVRDHFGCRFADVEQEAEFVSLVQGTLSVEQYVRRFEELS------------------------CRVP-GLGRPTTFAAA
        VV   E+    PI+W+  K L+ D++  +    E+E EF+ L Q TL V QY ++F E S                         + P  L RPTT+A A
Subjt:  VVGRTENRPENPISWSGFKGLVRDHFGCRFADVEQEAEFVSLVQGTLSVEQYVRRFEELS------------------------CRVP-GLGRPTTFAAA

Query:  LASARMLDRD-IPRTDRSQEVGTSSGAKKKSEVEVLAASQKVRGSP
        +  A ++D+D I +    Q+VG SSG K+K  V  +++SQ  + SP
Subjt:  LASARMLDRD-IPRTDRSQEVGTSSGAKKKSEVEVLAASQKVRGSP

A0A6J1DNV8 uncharacterized protein LOC1110229251.3e-2635.45Show/hide
Query:  PEVNSHPEANPSVPP--PAPPVLAAEALQAMLGNAFLNNLQHVGANGA----PALG----EDVQFIKSFMKAKPPSFDGHSDSSEAVVEWIAALEAIFQF
        P       A+ +VPP  P   VL AEALQ +L NA        GA GA    P+ G    E+VQFI+ F +  PP F+G S+   A  EW+  LEA++ +
Subjt:  PEVNSHPEANPSVPP--PAPPVLAAEALQAMLGNAFLNNLQHVGANGA----PALG----EDVQFIKSFMKAKPPSFDGHSDSSEAVVEWIAALEAIFQF

Query:  LGANAQQRVQGAAFMLKGHARTWWKVVGRTENRPENPISWSGFKGLVRDHFGCRFADVEQEAEFVSLVQGTLSVEQYVRRFEELS---------------
        LG +   +V+GA FML+G A  WW+ V   E+    P++W+ FK L+ +++       E+ AEF+ L Q +L V QY R+F ELS               
Subjt:  LGANAQQRVQGAAFMLKGHARTWWKVVGRTENRPENPISWSGFKGLVRDHFGCRFADVEQEAEFVSLVQGTLSVEQYVRRFEELS---------------

Query:  CRVPGLGR----------PTTFAAALASARMLDRDIPRTDRSQEVGTSSGAKKKSEVEVLAASQKVRG
          + GL R          PTT+AAA+  A ++D+ +      Q +G+SSG K+K      ++SQ  RG
Subjt:  CRVPGLGR----------PTTFAAALASARMLDRDIPRTDRSQEVGTSSGAKKKSEVEVLAASQKVRG

A0A6J1DQB9 Reverse transcriptase2.8e-2634.73Show/hide
Query:  PDVPEVNSHPEANPSVPPPAPPVLAAEALQAMLGNA-FLNNLQHVGANGAPALGEDVQFIKSFMKAKPPSFDGHSDSSEAVVEWIAALEAIFQFLGANAQ
        P VP+  + P+  P V P     L AEALQ +L NA      Q      A    ++VQFI+ F    PP F+G S+   A  EW+  LEA++ +LG +  
Subjt:  PDVPEVNSHPEANPSVPPPAPPVLAAEALQAMLGNA-FLNNLQHVGANGAPALGEDVQFIKSFMKAKPPSFDGHSDSSEAVVEWIAALEAIFQFLGANAQ

Query:  QRVQGAAFMLKGHARTWWKVVGRTENRPENPISWSGFKGLVRDHFGCRFADVEQEAEFVSLVQGTLSVEQYVRRFEELS---------------CRVPGL
         +V+GA FML+G A  WW+ V   E+    P++W+ FK L+ +++    A  E+  EF+ L QG+L+V QY R+F ELS                 + GL
Subjt:  QRVQGAAFMLKGHARTWWKVVGRTENRPENPISWSGFKGLVRDHFGCRFADVEQEAEFVSLVQGTLSVEQYVRRFEELS---------------CRVPGL

Query:  GR----------PTTFAAALASARMLDRDIPRTDRSQEVGTSSGAKKKSEVEVLAASQKVRG
         R          PTT+AAA+  A ++D+ +      Q +G++SG K+K      +ASQ  RG
Subjt:  GR----------PTTFAAALASARMLDRDIPRTDRSQEVGTSSGAKKKSEVEVLAASQKVRG

A0A6J1DTA8 uncharacterized protein LOC1110241144.3e-2734.83Show/hide
Query:  GRSGPPDVPEVNSHPEANPSVPPPAPPVLAAEALQAMLGNA-FLNNLQHVGANGAPALGEDVQFIKSFMKAKPPSFDGHSDSSEAVVEWIAALEAIFQFL
        GR  PP VP+  + P+  P V P     L AEALQ +L NA      Q      A    ++VQFI+ F +  PP F+G S+   A  EW+  LEA++ +L
Subjt:  GRSGPPDVPEVNSHPEANPSVPPPAPPVLAAEALQAMLGNA-FLNNLQHVGANGAPALGEDVQFIKSFMKAKPPSFDGHSDSSEAVVEWIAALEAIFQFL

Query:  GANAQQRVQGAAFMLKGHARTWWKVVGRTENRPENPISWSGFKGLVRDHFGCRFADVEQEAEFVSLVQGTLSVEQYVRRFEELS----------------
        G +   +V+GA FML+G A  WW+ V   E+    P++W+ FK L+ +++       E+ AEF+ L QG+L+V QY R+F ELS                
Subjt:  GANAQQRVQGAAFMLKGHARTWWKVVGRTENRPENPISWSGFKGLVRDHFGCRFADVEQEAEFVSLVQGTLSVEQYVRRFEELS----------------

Query:  ------CRVPGL---GRPTTFAAALASARMLDRDIPRTDRSQEVGTSSGAKKKSEVEVLAASQKVRG
                + GL     PTT+AAA+  A ++D+ +      Q +G+SSG K+K    + ++SQ  RG
Subjt:  ------CRVPGL---GRPTTFAAALASARMLDRDIPRTDRSQEVGTSSGAKKKSEVEVLAASQKVRG

A0A6J1DUM2 uncharacterized protein LOC1110232473.0e-2840.72Show/hide
Query:  DVQFIKSFMKAKPPSFDGHSDSSEAVVEWIAALEAIFQFLGANAQQRVQGAAFMLKGHARTWWKVVGRTENRPENPISWSGFKGLVRDHFGCRFADVEQE
        + +FIK F +  PP+FDG S+ + AV EWI  LEA++ +LG   Q +V+GA FML+G A  WW  V   E+    PI W+ FK L+ D++        +E
Subjt:  DVQFIKSFMKAKPPSFDGHSDSSEAVVEWIAALEAIFQFLGANAQQRVQGAAFMLKGHARTWWKVVGRTENRPENPISWSGFKGLVRDHFGCRFADVEQE

Query:  AEFVSLVQGTLSVEQYVRRFEELS------------------------CRVP-GLGRPTTFAAALASARMLDRDIP-RTDRSQEVGTSSGAKKK
        AEF+ LVQGTLSV QY R+F ELS                         R P  L RPTT+A A+  A ++D+D+  +     EVG+SSG K+K
Subjt:  AEFVSLVQGTLSVEQYVRRFEELS------------------------CRVP-GLGRPTTFAAALASARMLDRDIP-RTDRSQEVGTSSGAKKK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCCCAAAGTAAGGAACATGTCCCTGTACTCGTGCTAAAAGGCATGGCGGCAACACAAGTCCAAGGAACATGTCCCTATACTCATGCTGAAAAGGCATGGTGGCGACA
CAAGTCCAAGGAACATGTCCCAAAGCCAGGAATATGTCCCTGCACTCGTGCTAAAAGGCATGGCGGCAACACAAGTCCAAGGAACATGTCCCAAAGCCAGGAACATGTCC
CTGCACTCGTGCTGAAAGGGCGTGACAGCGACACAAGTCCAAGGAACATGTCCCAAAGTAAGGAACATGTCCCTATACTCATGCTGAAAAGGCATGGTGGCGACACAAGT
CCAAGGAACATGTCCCAAAGTAAAGAACATGTCCCTGTACTCATGCTGAAAAGGCGTGGTGGCGACACAAGTCCAAGGAACATGTCCCAAAGCCAGGAACATGTCCCTGC
ACTCGTGCCAAAAGGCATGGCGGCGACACAAGTCCAAGGAACATGTCCCAAAGGCATGACGGCGACACAAGTCCAAGGGACATGTCCCTATACTCATGTTGAAAAGGCAT
GGTGGCGACACAAGTTCAAGGAACATGTCCCAAAGCCAGGAACATGTCCCTACACTCGTGCTAAAAGGCATGACGGCAACACAAGTCCAAGGAACATGTCCCAAAGCCAG
GAACATGTCCCTGCATTTGTGCTGAAAGGGCGTAGCGGCGACACAAGTCCAAGGAACGTGTCCCAAAGTAAGGAACATGTCCCTATACTCATGCTGAAAAGGCGTGGTGG
CGACACAAGTCCAAGGAACATGTCCCAAAGCCAGGAACATGTCCCTGCACTCGTGCTGAAAGACGTGGCGGTGACACAAGTCCAAGGAACATGTCCTTGTACTTGTACTG
AAAGACATTGTACCTGCAAATTGAAGACAACAAGGATATGGGTTCAGAAATTCTACACTCTCACAAAGACAAGAGTTCAGAGTTTCAAAGCTCTCAAGCAGAACCAAAGA
ATTCAGAGAGACTCCACCAAGTCTGAAGACGCGGACTCTCTGCAATCCATAAGCTCAAGTGTTGAACGCTTCTTAAAGACCAAACACTCTTCAAGACTTCAACACTCCTT
GAAGACCAAATACTCTTCAAGACTTCAACACTCCTTGAAGATCAAAGACTCTTTAGGACATCAACTCCGCCAGCCCCCCTTCTCCTTCAGCTTCTTCTTCTTGCGCCGCC
AGCCACCCGTGGGCTTCTTCTCCGACGTCTCCCTCTCTCCTGTGAGCCGCGCCGCACGTTCAGCCGTCGTTTCCCTCTCTTTTCTTCATCTGCGCGTGCGTATCAGGTGT
CGCGATCGTGGGTTTTCATCAAAGCCCTTCCTCGAATTCTTGTGGTGTTCGCTGGTAGTCCCGCGACAAAAGGCTTTGACCCACGGAAGTTCAAGTTCGCGAATCTCTCT
CTCGCAGATCTCTCTCCCTCTCTGCGTCGTCGTTCTGCTCGAGTGTCGCTGCCTCCTCGACGTGTCGTCACCACTACAGCACGATCTCGCGTGCCTAGCAATTCGGAGTC
CCGCCGTCCTCGTTCCAGCCGATTTCGCTTATGTCCAGTGGTGTTCGACCCCGTTCTGGTTCATTGCGGCGTCGTTTAGCGTGATTGTGCTTAGCGCCTCTAAGCAAGCG
GTTTTAAACCTCTTTGCTTGCTGGACAGCACGTGTTCGAGGTCTTTCGGACGCTGAACGGGTCAATACGTTGCTTAGTCATCGAGGCCTTGGGTGTAAATGGTCAAGGGT
CGATGCACAGTTCGAGGCCTTGGATATAAATGGTCAAGGGTCGAATGCCGAGCTCTGTAGAGAAGTGTCGAGGCCCTGGGTAGGAGTTGCTTACCAGTACCTTAGTGTAT
TGACCCCCTCCCCTCTCTCTCCCCCCAACTACCAGACTTTGCAGGTTATGAGGACTGCGTGGACCATGAGCAGAGTTGTTCCTGTAGACTGGCCTAGGAAATCTAGGTTG
TTTGGTTGTTTAGGGTTATGGTCTTCCTCGTTCTCCTCTCCATCACCAGCGATGAGTTCGAGTAGCAATCAAGGTAGTGGACGTTCTGGCCCTCCAGACGTTCCCGAGGT
TAATTCGCATCCTGAGGCGAATCCTTCTGTTCCTCCCCCAGCGCCTCCTGTGCTGGCAGCAGAGGCCTTGCAGGCGATGCTTGGCAATGCATTCCTGAACAACCTGCAGC
ACGTTGGTGCAAATGGAGCCCCTGCTCTTGGCGAAGATGTGCAGTTTATCAAGAGTTTCATGAAGGCGAAGCCTCCTTCATTCGATGGGCACTCGGATAGTTCTGAAGCA
GTGGTAGAATGGATTGCTGCGTTGGAAGCGATATTTCAATTTCTTGGAGCTAATGCCCAACAACGAGTCCAAGGAGCTGCCTTTATGCTCAAAGGCCATGCTCGCACTTG
GTGGAAGGTTGTGGGTCGAACCGAGAACCGCCCAGAGAATCCCATTTCCTGGTCGGGGTTCAAAGGTCTTGTGCGAGACCATTTTGGCTGTCGTTTTGCTGATGTTGAGC
AAGAAGCAGAGTTTGTCTCTCTTGTTCAAGGGACCTTGTCTGTGGAACAGTACGTCAGAAGGTTCGAAGAGTTGTCTTGCCGAGTCCCGGGGCTTGGTCGACCGACCACC
TTTGCAGCAGCCCTAGCGAGCGCTCGGATGTTGGATAGGGACATCCCCAGGACGGATCGGTCCCAAGAGGTTGGCACGTCGTCTGGTGCCAAGAAGAAGAGCGAAGTGGA
AGTGCTTGCAGCTAGTCAGAAGGTTAGAGGATCTCCGTCAGGATCTAGCGCGTGCACTGAGGAGTTCTTGCCCTGTGTCACCGATGAGGAGCTCAAGGCAGAATACCCAG
AGCTTTACGATGTCGATGTTAGCCGCCAGCCTCCCTTCTCCTTCAGCTTATTCTTCTTGCGCCGCCAGCCACCCGTGGGCTTCTTCTCCGACGTCTCCCTCTCTCTTGTG
AGCCGTCGCCGCACGTTCAGCCGTCGTTTCCCTCTCTTTTCTTCATCTGCGCGTGCGTATCAGGTGTCGCGATCGTGGGTTTTCATCAAAGCCCTTCCTCGAATTCTTGT
GGTGTTCGCTGATCTCGCGTGCCTAGCAATTCGGAGTCCCGTCGTCCTCGTTCCATCTGATTTAGCTTCTGTCCAGTGGTGTTCGACCCCGTTCTGGTTCATTGCGGCGT
CGTTTAGCGTGATTGTGCTTAGCGCCTCTAAGCAAGCGGTTTTAAACCTCTTTGCTTGTTGGACAGCACGTGTTCGAGGTCTTTCGGACGCTGAACGGGTCTCGGGTATA
AATGGTCGAGGACTGATACATCACTTTGGATGTCGATGCCTCGGGTATAAATGGTCGAGGGTCGATATGCCAATGTTAGATAAAGAGGAGCATCGAGGCCTTGGGAATAA
ATGGTCAAGGGTCGGTGTGATGAGTCTCGAGGCAAGTATTGAGGCCTTGGGTATAAATGGTCCAGGGTCAATACGTTGCTTAGTCATCGAGGCCTTGGGTGTAAATGGTC
AAGGGTCGATGCACAGTTCGAGGCCTTGGGTAGGAGGTTCTTACCAGTACCTTAGTGTATTGACCCCCTCCCCTCTCTCTTCCCCCAACTACCAGACTTTGCAGGTTATG
AGGACTGCGTGGACCATGGTTATAAAATCCCGAGGCGTTACATCAACATTTCTTGAAGACCCAACACTCTTCAAGATTTCAACACTCCTTGAAGATCAAAGGGACATCAA
CACTTCTGGAAGATTGAAGACTCCTTCAAGACTGGAAGACTTCAAGCTCCAAGGATCCATTGAAGCTTCAAACTCGAGATCAAGCTTTCCAGTCCTCGAGTGA
mRNA sequenceShow/hide mRNA sequence
ATGTCCCAAAGTAAGGAACATGTCCCTGTACTCGTGCTAAAAGGCATGGCGGCAACACAAGTCCAAGGAACATGTCCCTATACTCATGCTGAAAAGGCATGGTGGCGACA
CAAGTCCAAGGAACATGTCCCAAAGCCAGGAATATGTCCCTGCACTCGTGCTAAAAGGCATGGCGGCAACACAAGTCCAAGGAACATGTCCCAAAGCCAGGAACATGTCC
CTGCACTCGTGCTGAAAGGGCGTGACAGCGACACAAGTCCAAGGAACATGTCCCAAAGTAAGGAACATGTCCCTATACTCATGCTGAAAAGGCATGGTGGCGACACAAGT
CCAAGGAACATGTCCCAAAGTAAAGAACATGTCCCTGTACTCATGCTGAAAAGGCGTGGTGGCGACACAAGTCCAAGGAACATGTCCCAAAGCCAGGAACATGTCCCTGC
ACTCGTGCCAAAAGGCATGGCGGCGACACAAGTCCAAGGAACATGTCCCAAAGGCATGACGGCGACACAAGTCCAAGGGACATGTCCCTATACTCATGTTGAAAAGGCAT
GGTGGCGACACAAGTTCAAGGAACATGTCCCAAAGCCAGGAACATGTCCCTACACTCGTGCTAAAAGGCATGACGGCAACACAAGTCCAAGGAACATGTCCCAAAGCCAG
GAACATGTCCCTGCATTTGTGCTGAAAGGGCGTAGCGGCGACACAAGTCCAAGGAACGTGTCCCAAAGTAAGGAACATGTCCCTATACTCATGCTGAAAAGGCGTGGTGG
CGACACAAGTCCAAGGAACATGTCCCAAAGCCAGGAACATGTCCCTGCACTCGTGCTGAAAGACGTGGCGGTGACACAAGTCCAAGGAACATGTCCTTGTACTTGTACTG
AAAGACATTGTACCTGCAAATTGAAGACAACAAGGATATGGGTTCAGAAATTCTACACTCTCACAAAGACAAGAGTTCAGAGTTTCAAAGCTCTCAAGCAGAACCAAAGA
ATTCAGAGAGACTCCACCAAGTCTGAAGACGCGGACTCTCTGCAATCCATAAGCTCAAGTGTTGAACGCTTCTTAAAGACCAAACACTCTTCAAGACTTCAACACTCCTT
GAAGACCAAATACTCTTCAAGACTTCAACACTCCTTGAAGATCAAAGACTCTTTAGGACATCAACTCCGCCAGCCCCCCTTCTCCTTCAGCTTCTTCTTCTTGCGCCGCC
AGCCACCCGTGGGCTTCTTCTCCGACGTCTCCCTCTCTCCTGTGAGCCGCGCCGCACGTTCAGCCGTCGTTTCCCTCTCTTTTCTTCATCTGCGCGTGCGTATCAGGTGT
CGCGATCGTGGGTTTTCATCAAAGCCCTTCCTCGAATTCTTGTGGTGTTCGCTGGTAGTCCCGCGACAAAAGGCTTTGACCCACGGAAGTTCAAGTTCGCGAATCTCTCT
CTCGCAGATCTCTCTCCCTCTCTGCGTCGTCGTTCTGCTCGAGTGTCGCTGCCTCCTCGACGTGTCGTCACCACTACAGCACGATCTCGCGTGCCTAGCAATTCGGAGTC
CCGCCGTCCTCGTTCCAGCCGATTTCGCTTATGTCCAGTGGTGTTCGACCCCGTTCTGGTTCATTGCGGCGTCGTTTAGCGTGATTGTGCTTAGCGCCTCTAAGCAAGCG
GTTTTAAACCTCTTTGCTTGCTGGACAGCACGTGTTCGAGGTCTTTCGGACGCTGAACGGGTCAATACGTTGCTTAGTCATCGAGGCCTTGGGTGTAAATGGTCAAGGGT
CGATGCACAGTTCGAGGCCTTGGATATAAATGGTCAAGGGTCGAATGCCGAGCTCTGTAGAGAAGTGTCGAGGCCCTGGGTAGGAGTTGCTTACCAGTACCTTAGTGTAT
TGACCCCCTCCCCTCTCTCTCCCCCCAACTACCAGACTTTGCAGGTTATGAGGACTGCGTGGACCATGAGCAGAGTTGTTCCTGTAGACTGGCCTAGGAAATCTAGGTTG
TTTGGTTGTTTAGGGTTATGGTCTTCCTCGTTCTCCTCTCCATCACCAGCGATGAGTTCGAGTAGCAATCAAGGTAGTGGACGTTCTGGCCCTCCAGACGTTCCCGAGGT
TAATTCGCATCCTGAGGCGAATCCTTCTGTTCCTCCCCCAGCGCCTCCTGTGCTGGCAGCAGAGGCCTTGCAGGCGATGCTTGGCAATGCATTCCTGAACAACCTGCAGC
ACGTTGGTGCAAATGGAGCCCCTGCTCTTGGCGAAGATGTGCAGTTTATCAAGAGTTTCATGAAGGCGAAGCCTCCTTCATTCGATGGGCACTCGGATAGTTCTGAAGCA
GTGGTAGAATGGATTGCTGCGTTGGAAGCGATATTTCAATTTCTTGGAGCTAATGCCCAACAACGAGTCCAAGGAGCTGCCTTTATGCTCAAAGGCCATGCTCGCACTTG
GTGGAAGGTTGTGGGTCGAACCGAGAACCGCCCAGAGAATCCCATTTCCTGGTCGGGGTTCAAAGGTCTTGTGCGAGACCATTTTGGCTGTCGTTTTGCTGATGTTGAGC
AAGAAGCAGAGTTTGTCTCTCTTGTTCAAGGGACCTTGTCTGTGGAACAGTACGTCAGAAGGTTCGAAGAGTTGTCTTGCCGAGTCCCGGGGCTTGGTCGACCGACCACC
TTTGCAGCAGCCCTAGCGAGCGCTCGGATGTTGGATAGGGACATCCCCAGGACGGATCGGTCCCAAGAGGTTGGCACGTCGTCTGGTGCCAAGAAGAAGAGCGAAGTGGA
AGTGCTTGCAGCTAGTCAGAAGGTTAGAGGATCTCCGTCAGGATCTAGCGCGTGCACTGAGGAGTTCTTGCCCTGTGTCACCGATGAGGAGCTCAAGGCAGAATACCCAG
AGCTTTACGATGTCGATGTTAGCCGCCAGCCTCCCTTCTCCTTCAGCTTATTCTTCTTGCGCCGCCAGCCACCCGTGGGCTTCTTCTCCGACGTCTCCCTCTCTCTTGTG
AGCCGTCGCCGCACGTTCAGCCGTCGTTTCCCTCTCTTTTCTTCATCTGCGCGTGCGTATCAGGTGTCGCGATCGTGGGTTTTCATCAAAGCCCTTCCTCGAATTCTTGT
GGTGTTCGCTGATCTCGCGTGCCTAGCAATTCGGAGTCCCGTCGTCCTCGTTCCATCTGATTTAGCTTCTGTCCAGTGGTGTTCGACCCCGTTCTGGTTCATTGCGGCGT
CGTTTAGCGTGATTGTGCTTAGCGCCTCTAAGCAAGCGGTTTTAAACCTCTTTGCTTGTTGGACAGCACGTGTTCGAGGTCTTTCGGACGCTGAACGGGTCTCGGGTATA
AATGGTCGAGGACTGATACATCACTTTGGATGTCGATGCCTCGGGTATAAATGGTCGAGGGTCGATATGCCAATGTTAGATAAAGAGGAGCATCGAGGCCTTGGGAATAA
ATGGTCAAGGGTCGGTGTGATGAGTCTCGAGGCAAGTATTGAGGCCTTGGGTATAAATGGTCCAGGGTCAATACGTTGCTTAGTCATCGAGGCCTTGGGTGTAAATGGTC
AAGGGTCGATGCACAGTTCGAGGCCTTGGGTAGGAGGTTCTTACCAGTACCTTAGTGTATTGACCCCCTCCCCTCTCTCTTCCCCCAACTACCAGACTTTGCAGGTTATG
AGGACTGCGTGGACCATGGTTATAAAATCCCGAGGCGTTACATCAACATTTCTTGAAGACCCAACACTCTTCAAGATTTCAACACTCCTTGAAGATCAAAGGGACATCAA
CACTTCTGGAAGATTGAAGACTCCTTCAAGACTGGAAGACTTCAAGCTCCAAGGATCCATTGAAGCTTCAAACTCGAGATCAAGCTTTCCAGTCCTCGAGTGA
Protein sequenceShow/hide protein sequence
MSQSKEHVPVLVLKGMAATQVQGTCPYTHAEKAWWRHKSKEHVPKPGICPCTRAKRHGGNTSPRNMSQSQEHVPALVLKGRDSDTSPRNMSQSKEHVPILMLKRHGGDTS
PRNMSQSKEHVPVLMLKRRGGDTSPRNMSQSQEHVPALVPKGMAATQVQGTCPKGMTATQVQGTCPYTHVEKAWWRHKFKEHVPKPGTCPYTRAKRHDGNTSPRNMSQSQ
EHVPAFVLKGRSGDTSPRNVSQSKEHVPILMLKRRGGDTSPRNMSQSQEHVPALVLKDVAVTQVQGTCPCTCTERHCTCKLKTTRIWVQKFYTLTKTRVQSFKALKQNQR
IQRDSTKSEDADSLQSISSSVERFLKTKHSSRLQHSLKTKYSSRLQHSLKIKDSLGHQLRQPPFSFSFFFLRRQPPVGFFSDVSLSPVSRAARSAVVSLSFLHLRVRIRC
RDRGFSSKPFLEFLWCSLVVPRQKALTHGSSSSRISLSQISLPLCVVVLLECRCLLDVSSPLQHDLACLAIRSPAVLVPADFAYVQWCSTPFWFIAASFSVIVLSASKQA
VLNLFACWTARVRGLSDAERVNTLLSHRGLGCKWSRVDAQFEALDINGQGSNAELCREVSRPWVGVAYQYLSVLTPSPLSPPNYQTLQVMRTAWTMSRVVPVDWPRKSRL
FGCLGLWSSSFSSPSPAMSSSSNQGSGRSGPPDVPEVNSHPEANPSVPPPAPPVLAAEALQAMLGNAFLNNLQHVGANGAPALGEDVQFIKSFMKAKPPSFDGHSDSSEA
VVEWIAALEAIFQFLGANAQQRVQGAAFMLKGHARTWWKVVGRTENRPENPISWSGFKGLVRDHFGCRFADVEQEAEFVSLVQGTLSVEQYVRRFEELSCRVPGLGRPTT
FAAALASARMLDRDIPRTDRSQEVGTSSGAKKKSEVEVLAASQKVRGSPSGSSACTEEFLPCVTDEELKAEYPELYDVDVSRQPPFSFSLFFLRRQPPVGFFSDVSLSLV
SRRRTFSRRFPLFSSSARAYQVSRSWVFIKALPRILVVFADLACLAIRSPVVLVPSDLASVQWCSTPFWFIAASFSVIVLSASKQAVLNLFACWTARVRGLSDAERVSGI
NGRGLIHHFGCRCLGYKWSRVDMPMLDKEEHRGLGNKWSRVGVMSLEASIEALGINGPGSIRCLVIEALGVNGQGSMHSSRPWVGGSYQYLSVLTPSPLSSPNYQTLQVM
RTAWTMVIKSRGVTSTFLEDPTLFKISTLLEDQRDINTSGRLKTPSRLEDFKLQGSIEASNSRSSFPVLE