CuGenDBv2

Lag0039568 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0039568
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Reverse transcriptase
Genome location: chr2:46660904..46669535
RNA-Seq Expression: Lag0039568
Synteny: Lag0039568
Gene Ontology terms: NA
InterPro domains: IPR005162 - Retrotransposon gag domain
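
The genome location field above follows a `chromosome:start..end` pattern. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not state this), the field could be parsed and the gene span computed as follows. The names `GenomeLocation` and `parse_location` are hypothetical helpers introduced for illustration; they are not part of CuGenDB.

```python
import re
from typing import NamedTuple


class GenomeLocation(NamedTuple):
    """Hypothetical container for a 'chrom:start..end' location string."""
    chrom: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive, so both ends count.
        return self.end - self.start + 1


# Matches strings such as "chr2:46660904..46669535".
_LOCATION_RE = re.compile(r"^(?P<chrom>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text: str) -> GenomeLocation:
    """Parse a location string into chromosome, start, and end."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognized location format: {text!r}")
    start, end = int(match["start"]), int(match["end"])
    if start > end:
        raise ValueError(f"start is greater than end in {text!r}")
    return GenomeLocation(match["chrom"], start, end)


loc = parse_location("chr2:46660904..46669535")
print(loc.chrom, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the gene spans 8632 bp on chr2.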


Homology
GenBank top hits (hit | e value | % identity | alignment)
XP_022151688.1 uncharacterized protein LOC111019603 [Momordica charantia] | e value: 1.2e-36 | % identity: 38.78
Query:  APPVLAAEALQAMLGNAF-LNNLQHVGANGAPALGEDVQFLKSFLKAKPPSFDGHSDSSEAVVEWIAALEAIFQFLGANAQQRVQGAAFMLKGHARTWWN
        A   L   ALQA++ N+      Q +    A AL  + QF++ F +  PP+F+G S+ +  V EWI  LEA++ +LG + Q +V+GA FML+G A  WW+
Subjt:  APPVLAAEALQAMLGNAF-LNNLQHVGANGAPALGEDVQFLKSFLKAKPPSFDGHSDSSEAVVEWIAALEAIFQFLGANAQQRVQGAAFMLKGHARTWWN

Query:  VVGQTENRPENPISWSGFKGLVRDHFGCRFAGVEQEAEFVSLVQGTLSVEQYARRFEELSCRVPGLVATEEIRINRFVNGLRAEIRGLVRLGRPATFAAA
        VV   E+    PI+W+  K L+ D++  +    E+E EF+ L Q TL V QY ++F E S     L+ TE  +I RFV GL   I+G + L RP T+A A
Subjt:  VVGQTENRPENPISWSGFKGLVRDHFGCRFAGVEQEAEFVSLVQGTLSVEQYARRFEELSCRVPGLVATEEIRINRFVNGLRAEIRGLVRLGRPATFAAA

Query:  LASARMLDRD-IPRTEQSQEVGTSSGAKKKSEVEVLAASQKVRSS
        +  A ++D+D I + +  Q+VG SSG K+K  V  +++SQ  ++S
Subjt:  LASARMLDRD-IPRTEQSQEVGTSSGAKKKSEVEVLAASQKVRSS

XP_022155925.1 uncharacterized protein LOC111022925 [Momordica charantia] | e value: 2.6e-36 | % identity: 38.58
Query:  PEVNSHPEANPPVPP--PAPPVLAAEALQAMLGNAFLNNLQHVGANGA----PALG----EDVQFLKSFLKAKPPSFDGHSDSSEAVVEWIAALEAIFQF
        P       A+  VPP  P   VL AEALQ +L NA        GA GA    P+ G    E+VQF++ F +  PP F+G S+   A  EW+  LEA++ +
Subjt:  PEVNSHPEANPPVPP--PAPPVLAAEALQAMLGNAFLNNLQHVGANGA----PALG----EDVQFLKSFLKAKPPSFDGHSDSSEAVVEWIAALEAIFQF

Query:  LGANAQQRVQGAAFMLKGHARTWWNVVGQTENRPENPISWSGFKGLVRDHFGCRFAGVEQEAEFVSLVQGTLSVEQYARRFEELSCRVPGLVATEEIRIN
        LG +   +V+GA FML+G A  WW  V   E+    P++W+ FK L+ +++       E+ AEF+ L Q +L V QY R+F ELS      + TE+++I+
Subjt:  LGANAQQRVQGAAFMLKGHARTWWNVVGQTENRPENPISWSGFKGLVRDHFGCRFAGVEQEAEFVSLVQGTLSVEQYARRFEELSCRVPGLVATEEIRIN

Query:  RFVNGLRAEIRGLVRLGRPATFAAALASARMLDRDIPRTEQSQEVGTSSGAKKK
        +F++GLR EI+GL+ L  P T+AAA+  A ++D+ +   +  Q +G+SSG K+K
Subjt:  RFVNGLRAEIRGLVRLGRPATFAAALASARMLDRDIPRTEQSQEVGTSSGAKKK

XP_022156326.1 uncharacterized protein LOC111023247 [Momordica charantia] | e value: 6.3e-38 | % identity: 45.88
Query:  DVQFLKSFLKAKPPSFDGHSDSSEAVVEWIAALEAIFQFLGANAQQRVQGAAFMLKGHARTWWNVVGQTENRPENPISWSGFKGLVRDHFGCRFAGVEQE
        + +F+K F +  PP+FDG S+ + AV EWI  LEA++ +LG   Q +V+GA FML+G A  WW+ V   E+    PI W+ FK L+ D++        +E
Subjt:  DVQFLKSFLKAKPPSFDGHSDSSEAVVEWIAALEAIFQFLGANAQQRVQGAAFMLKGHARTWWNVVGQTENRPENPISWSGFKGLVRDHFGCRFAGVEQE

Query:  AEFVSLVQGTLSVEQYARRFEELSCRVPGLVATEEIRINRFVNGLRAEIRGLVRLGRPATFAAALASARMLDRDIP-RTEQSQEVGTSSGAKKK
        AEF+ LVQGTLSV QY R+F ELS     L+ TE ++I RFV GLR  IRG V L RP T+A A+  A ++D+D+  +     EVG+SSG K+K
Subjt:  AEFVSLVQGTLSVEQYARRFEELSCRVPGLVATEEIRINRFVNGLRAEIRGLVRLGRPATFAAALASARMLDRDIP-RTEQSQEVGTSSGAKKK

XP_022156546.1 uncharacterized protein LOC111023424 [Momordica charantia] | e value: 3.5e-36 | % identity: 38.78
Query:  PEVNSHPEANPPVPPPAPPVLAAEALQAMLGNAFLNNLQHVGANGAPALGEDVQFLKSFLKAKPPSFDGHSDSSEAVVEWIAALEAIFQFLGANAQQRVQ
        P +       PP PP A   LA     A +G A     +H+    + A     QF+K F +  PP+F G S+ +    EW+  LEA++ +LG   Q +V+
Subjt:  PEVNSHPEANPPVPPPAPPVLAAEALQAMLGNAFLNNLQHVGANGAPALGEDVQFLKSFLKAKPPSFDGHSDSSEAVVEWIAALEAIFQFLGANAQQRVQ

Query:  GAAFMLKGHARTWWNVVGQTENRPENPISWSGFKGLVRDHFGCRFAGVEQEAEFVSLVQGTLSVEQYARRFEELSCRVPGLVATEEIRINRFVNGLRAEI
        GA FML+  A  WW+ V  TE+    P+ W+ FK L+ DH+        +E EF+ LVQGTL+V QY R+F ELS     L+ TE ++I RFV GL   I
Subjt:  GAAFMLKGHARTWWNVVGQTENRPENPISWSGFKGLVRDHFGCRFAGVEQEAEFVSLVQGTLSVEQYARRFEELSCRVPGLVATEEIRINRFVNGLRAEI

Query:  RGLVRLGRPATFAAALASARMLDRDIP-RTEQSQEVGTSSGAKKK
        RG V L RP T+A A+    ++D+D+  R +   EVG+S G K+K
Subjt:  RGLVRLGRPATFAAALASARMLDRDIP-RTEQSQEVGTSSGAKKK

XP_022157413.1 uncharacterized protein LOC111024114 [Momordica charantia] | e value: 3.5e-36 | % identity: 37.7
Query:  PPVPPPAPP---------VLAAEALQAMLGNA-FLNNLQHVGANGAPALGEDVQFLKSFLKAKPPSFDGHSDSSEAVVEWIAALEAIFQFLGANAQQRVQ
        PPVP  AP           L AEALQ +L NA      Q      A    ++VQF++ F +  PP F+G S+   A  EW+  LEA++ +LG +   +V+
Subjt:  PPVPPPAPP---------VLAAEALQAMLGNA-FLNNLQHVGANGAPALGEDVQFLKSFLKAKPPSFDGHSDSSEAVVEWIAALEAIFQFLGANAQQRVQ

Query:  GAAFMLKGHARTWWNVVGQTENRPENPISWSGFKGLVRDHFGCRFAGVEQEAEFVSLVQGTLSVEQYARRFEELSCRVPGLVATEEIRINRFVNGLRAEI
        GA FML+G A  WW  V   E+    P++W+ FK L+ +++       E+ AEF+ L QG+L+V QY R+F ELS      + TE+++I++F++GLR EI
Subjt:  GAAFMLKGHARTWWNVVGQTENRPENPISWSGFKGLVRDHFGCRFAGVEQEAEFVSLVQGTLSVEQYARRFEELSCRVPGLVATEEIRINRFVNGLRAEI

Query:  RGLVRLGRPATFAAALASARMLDRDIPRTEQSQEVGTSSGAKKK
        +GL+ +  P T+AAA+  A ++D+ +   +  Q +G+SSG K+K
Subjt:  RGLVRLGRPATFAAALASARMLDRDIPRTEQSQEVGTSSGAKKK

TrEMBL top hits (hit | e value | % identity | alignment)
A0A6J1DCW8 uncharacterized protein LOC111019603 | e value: 5.8e-37 | % identity: 38.78
Query:  APPVLAAEALQAMLGNAF-LNNLQHVGANGAPALGEDVQFLKSFLKAKPPSFDGHSDSSEAVVEWIAALEAIFQFLGANAQQRVQGAAFMLKGHARTWWN
        A   L   ALQA++ N+      Q +    A AL  + QF++ F +  PP+F+G S+ +  V EWI  LEA++ +LG + Q +V+GA FML+G A  WW+
Subjt:  APPVLAAEALQAMLGNAF-LNNLQHVGANGAPALGEDVQFLKSFLKAKPPSFDGHSDSSEAVVEWIAALEAIFQFLGANAQQRVQGAAFMLKGHARTWWN

Query:  VVGQTENRPENPISWSGFKGLVRDHFGCRFAGVEQEAEFVSLVQGTLSVEQYARRFEELSCRVPGLVATEEIRINRFVNGLRAEIRGLVRLGRPATFAAA
        VV   E+    PI+W+  K L+ D++  +    E+E EF+ L Q TL V QY ++F E S     L+ TE  +I RFV GL   I+G + L RP T+A A
Subjt:  VVGQTENRPENPISWSGFKGLVRDHFGCRFAGVEQEAEFVSLVQGTLSVEQYARRFEELSCRVPGLVATEEIRINRFVNGLRAEIRGLVRLGRPATFAAA

Query:  LASARMLDRD-IPRTEQSQEVGTSSGAKKKSEVEVLAASQKVRSS
        +  A ++D+D I + +  Q+VG SSG K+K  V  +++SQ  ++S
Subjt:  LASARMLDRD-IPRTEQSQEVGTSSGAKKKSEVEVLAASQKVRSS

A0A6J1DNV8 uncharacterized protein LOC111022925 | e value: 1.3e-36 | % identity: 38.58
Query:  PEVNSHPEANPPVPP--PAPPVLAAEALQAMLGNAFLNNLQHVGANGA----PALG----EDVQFLKSFLKAKPPSFDGHSDSSEAVVEWIAALEAIFQF
        P       A+  VPP  P   VL AEALQ +L NA        GA GA    P+ G    E+VQF++ F +  PP F+G S+   A  EW+  LEA++ +
Subjt:  PEVNSHPEANPPVPP--PAPPVLAAEALQAMLGNAFLNNLQHVGANGA----PALG----EDVQFLKSFLKAKPPSFDGHSDSSEAVVEWIAALEAIFQF

Query:  LGANAQQRVQGAAFMLKGHARTWWNVVGQTENRPENPISWSGFKGLVRDHFGCRFAGVEQEAEFVSLVQGTLSVEQYARRFEELSCRVPGLVATEEIRIN
        LG +   +V+GA FML+G A  WW  V   E+    P++W+ FK L+ +++       E+ AEF+ L Q +L V QY R+F ELS      + TE+++I+
Subjt:  LGANAQQRVQGAAFMLKGHARTWWNVVGQTENRPENPISWSGFKGLVRDHFGCRFAGVEQEAEFVSLVQGTLSVEQYARRFEELSCRVPGLVATEEIRIN

Query:  RFVNGLRAEIRGLVRLGRPATFAAALASARMLDRDIPRTEQSQEVGTSSGAKKK
        +F++GLR EI+GL+ L  P T+AAA+  A ++D+ +   +  Q +G+SSG K+K
Subjt:  RFVNGLRAEIRGLVRLGRPATFAAALASARMLDRDIPRTEQSQEVGTSSGAKKK

A0A6J1DTA8 uncharacterized protein LOC111024114 | e value: 1.7e-36 | % identity: 37.7
Query:  PPVPPPAPP---------VLAAEALQAMLGNA-FLNNLQHVGANGAPALGEDVQFLKSFLKAKPPSFDGHSDSSEAVVEWIAALEAIFQFLGANAQQRVQ
        PPVP  AP           L AEALQ +L NA      Q      A    ++VQF++ F +  PP F+G S+   A  EW+  LEA++ +LG +   +V+
Subjt:  PPVPPPAPP---------VLAAEALQAMLGNA-FLNNLQHVGANGAPALGEDVQFLKSFLKAKPPSFDGHSDSSEAVVEWIAALEAIFQFLGANAQQRVQ

Query:  GAAFMLKGHARTWWNVVGQTENRPENPISWSGFKGLVRDHFGCRFAGVEQEAEFVSLVQGTLSVEQYARRFEELSCRVPGLVATEEIRINRFVNGLRAEI
        GA FML+G A  WW  V   E+    P++W+ FK L+ +++       E+ AEF+ L QG+L+V QY R+F ELS      + TE+++I++F++GLR EI
Subjt:  GAAFMLKGHARTWWNVVGQTENRPENPISWSGFKGLVRDHFGCRFAGVEQEAEFVSLVQGTLSVEQYARRFEELSCRVPGLVATEEIRINRFVNGLRAEI

Query:  RGLVRLGRPATFAAALASARMLDRDIPRTEQSQEVGTSSGAKKK
        +GL+ +  P T+AAA+  A ++D+ +   +  Q +G+SSG K+K
Subjt:  RGLVRLGRPATFAAALASARMLDRDIPRTEQSQEVGTSSGAKKK

A0A6J1DUM2 uncharacterized protein LOC111023247 | e value: 3.1e-38 | % identity: 45.88
Query:  DVQFLKSFLKAKPPSFDGHSDSSEAVVEWIAALEAIFQFLGANAQQRVQGAAFMLKGHARTWWNVVGQTENRPENPISWSGFKGLVRDHFGCRFAGVEQE
        + +F+K F +  PP+FDG S+ + AV EWI  LEA++ +LG   Q +V+GA FML+G A  WW+ V   E+    PI W+ FK L+ D++        +E
Subjt:  DVQFLKSFLKAKPPSFDGHSDSSEAVVEWIAALEAIFQFLGANAQQRVQGAAFMLKGHARTWWNVVGQTENRPENPISWSGFKGLVRDHFGCRFAGVEQE

Query:  AEFVSLVQGTLSVEQYARRFEELSCRVPGLVATEEIRINRFVNGLRAEIRGLVRLGRPATFAAALASARMLDRDIP-RTEQSQEVGTSSGAKKK
        AEF+ LVQGTLSV QY R+F ELS     L+ TE ++I RFV GLR  IRG V L RP T+A A+  A ++D+D+  +     EVG+SSG K+K
Subjt:  AEFVSLVQGTLSVEQYARRFEELSCRVPGLVATEEIRINRFVNGLRAEIRGLVRLGRPATFAAALASARMLDRDIP-RTEQSQEVGTSSGAKKK

A0A6J1DVA0 uncharacterized protein LOC111023424 | e value: 1.7e-36 | % identity: 38.78
Query:  PEVNSHPEANPPVPPPAPPVLAAEALQAMLGNAFLNNLQHVGANGAPALGEDVQFLKSFLKAKPPSFDGHSDSSEAVVEWIAALEAIFQFLGANAQQRVQ
        P +       PP PP A   LA     A +G A     +H+    + A     QF+K F +  PP+F G S+ +    EW+  LEA++ +LG   Q +V+
Subjt:  PEVNSHPEANPPVPPPAPPVLAAEALQAMLGNAFLNNLQHVGANGAPALGEDVQFLKSFLKAKPPSFDGHSDSSEAVVEWIAALEAIFQFLGANAQQRVQ

Query:  GAAFMLKGHARTWWNVVGQTENRPENPISWSGFKGLVRDHFGCRFAGVEQEAEFVSLVQGTLSVEQYARRFEELSCRVPGLVATEEIRINRFVNGLRAEI
        GA FML+  A  WW+ V  TE+    P+ W+ FK L+ DH+        +E EF+ LVQGTL+V QY R+F ELS     L+ TE ++I RFV GL   I
Subjt:  GAAFMLKGHARTWWNVVGQTENRPENPISWSGFKGLVRDHFGCRFAGVEQEAEFVSLVQGTLSVEQYARRFEELSCRVPGLVATEEIRINRFVNGLRAEI

Query:  RGLVRLGRPATFAAALASARMLDRDIP-RTEQSQEVGTSSGAKKK
        RG V L RP T+A A+    ++D+D+  R +   EVG+S G K+K
Subjt:  RGLVRLGRPATFAAALASARMLDRDIP-RTEQSQEVGTSSGAKKK

SwissProt top hits (hit | e value | % identity | alignment)
No hits found

Arabidopsis top hits (hit | e value | % identity | alignment)
No hits found

Sequences
CDS sequence
ATGGTGTGGAATGGAAGGTTCAGGGATGTGGAATGCTCTCAAGGATTTCTACTTTTGGAGGAAGAAGATGGACGGGTGGGCTGCTGCATGAAGGCTATTCTGCAGCCTCC
AGCAGCCGCCCCCTGCCCGTGTGTTTCAGCCGCCAGCCCTCCCAACTGCCGCGCCGCCGCTCGTGCTCCAGCGCCGCCGTCACCGTTCCCGTCGCGCAGCCACCTCCCTT
CGCGAGCCTCTTCGCGGATCTCTCTCTCTCTCCCGTGTTCTGACGCCTACAGCAGCAGGTCCGTGACCCCTGCATCCGCGAGCGCCGGCCAGCACCCACAGCTTCAGTTT
TTCTCCTTCAAGCTCGCGTGTAAGTGGCTGGAGTCTTCGTCGTTCTGGCATGTTGGGCGCGTTCGGGCTGATTTAAGGTCCGTTTCAGCGTGTTCGGACTCTTTCGGTGT
TCGATTAAGTTCGAAACACTTCAACTTGGATAACCCACGCAAGAATCCGGCCGTTTCCGTCAGTGGGTGTGCTGTCCAGCAGCGTTTTCGCTCTGTTTGGGTCGTTACAG
CGTCACCTAACAATTCCGCTGTTCATCGAGCGTTTGATCTCATATATCGTGTTCAACGAATACCCACAACCCGAAAGACCTTGATTTTGGTTACCCATAACCCGGATTGT
TTAGGAGCGTTTGAGCTTCCGGGCCTCGGGTATAAAGGTCGGGGACTGATATATCACTATTGGTGTCGATGCCTCGGGTATAAATGGTCGAGGGTCGATATGCCAATGTT
AGATAAAGAGGAGCATCGAGGCCTTGGGTATAAATGGTCAAGGGTCGGTGTGATGAGTCCTGAGGCAAGTATTGAGGCCTTGGGTATAAATGGTCAAGGGTCAGTACGTT
GCTTTGTCATCGAGGCCTTGGGTATAAATGGCCAAGGGTCGATGCACAGTTCGAGGCCTTGGGTAGGGGCTGCTTACCAGTACCTTAGTGTACTGACCCCCTCCTCTCTC
CCCCCAACTACCAGATTTTGCAGAATGGCGGAGTCGTTTATTCTGTTAAATAGAAGTGTTTTATGCATAACAGGAGTTAGTAATGTCGCTGGGTTAGCTTCTAAAATCCT
GGGGCGTTACAGTTGGTATCAGAGCAGAGTTGTTCCTGTAGACTGGCCTAGGAAATCTAGGTTGTTTGGATGTTTAGGGTTATGGTCTTCCTCGTTCTCCTCTCCATCAC
CAGGCATAGGTTGTTTTCTTAAAGCCTGGGTTGCTGCCTTAGTTCGAGACGTTCCCGAGGTTAATTCGCATCCTGAGGCGAATCCTCCTGTTCCTCCCCCAGCGCCTCCT
GTGCTGGCAGCAGAGGCATTGCAGGCGATGCTTGGCAATGCATTCCTGAATAACCTGCAGCATGTCGGTGCAAATGGAGCCCCTGCTCTTGGCGAAGATGTGCAGTTTCT
CAAGAGTTTCTTGAAGGCGAAGCCTCCTTCATTCGATGGGCACTCGGATAGTTCTGAAGCAGTGGTAGAATGGATCGCCGCATTGGAAGCGATATTTCAATTTCTTGGAG
CTAATGCCCAACAGCGGGTCCAAGGAGCTGCCTTTATGCTCAAAGGCCACGCTCGCACTTGGTGGAACGTTGTGGGTCAAACCGAGAACCGCCCAGAGAATCCCATTTCC
TGGTCGGGGTTCAAAGGTCTTGTGCGGGACCATTTTGGTTGTCGTTTTGCTGGTGTTGAGCAAGAAGCAGAGTTTGTCTCTCTTGTTCAAGGGACCTTGTCTGTGGAGCA
GTACGCCAGAAGGTTTGAAGAGTTATCCTGCCGAGTCCCAGGGTTGGTTGCCACCGAAGAGATTAGGATCAACCGATTCGTTAATGGGCTCCGCGCAGAAATTCGAGGTT
TGGTCCGGCTTGGTCGACCGGCCACCTTTGCAGCAGCCTTAGCGAGCGCTCGGATGCTGGATAGGGACATCCCCAGGACGGAGCAGTCCCAAGAGGTTGGCACGTCATCT
GGTGCCAAGAAGAAGAGCGAAGTGGAAGTGCTTGCAGCTAGTCAGAAGGTCAGAAGTTCTTCGTCAAGATCTAGTGCGTACGCTGAGGAGTTCTTGCCCTGTGTCACCGA
TGAGGAACTCAAGGCAGAATACCCAGAGCTTTACGATGTCGACGATTCTGATGATGAAGATAGCTCCTAA
mRNA sequence
ATGGTGTGGAATGGAAGGTTCAGGGATGTGGAATGCTCTCAAGGATTTCTACTTTTGGAGGAAGAAGATGGACGGGTGGGCTGCTGCATGAAGGCTATTCTGCAGCCTCC
AGCAGCCGCCCCCTGCCCGTGTGTTTCAGCCGCCAGCCCTCCCAACTGCCGCGCCGCCGCTCGTGCTCCAGCGCCGCCGTCACCGTTCCCGTCGCGCAGCCACCTCCCTT
CGCGAGCCTCTTCGCGGATCTCTCTCTCTCTCCCGTGTTCTGACGCCTACAGCAGCAGGTCCGTGACCCCTGCATCCGCGAGCGCCGGCCAGCACCCACAGCTTCAGTTT
TTCTCCTTCAAGCTCGCGTGTAAGTGGCTGGAGTCTTCGTCGTTCTGGCATGTTGGGCGCGTTCGGGCTGATTTAAGGTCCGTTTCAGCGTGTTCGGACTCTTTCGGTGT
TCGATTAAGTTCGAAACACTTCAACTTGGATAACCCACGCAAGAATCCGGCCGTTTCCGTCAGTGGGTGTGCTGTCCAGCAGCGTTTTCGCTCTGTTTGGGTCGTTACAG
CGTCACCTAACAATTCCGCTGTTCATCGAGCGTTTGATCTCATATATCGTGTTCAACGAATACCCACAACCCGAAAGACCTTGATTTTGGTTACCCATAACCCGGATTGT
TTAGGAGCGTTTGAGCTTCCGGGCCTCGGGTATAAAGGTCGGGGACTGATATATCACTATTGGTGTCGATGCCTCGGGTATAAATGGTCGAGGGTCGATATGCCAATGTT
AGATAAAGAGGAGCATCGAGGCCTTGGGTATAAATGGTCAAGGGTCGGTGTGATGAGTCCTGAGGCAAGTATTGAGGCCTTGGGTATAAATGGTCAAGGGTCAGTACGTT
GCTTTGTCATCGAGGCCTTGGGTATAAATGGCCAAGGGTCGATGCACAGTTCGAGGCCTTGGGTAGGGGCTGCTTACCAGTACCTTAGTGTACTGACCCCCTCCTCTCTC
CCCCCAACTACCAGATTTTGCAGAATGGCGGAGTCGTTTATTCTGTTAAATAGAAGTGTTTTATGCATAACAGGAGTTAGTAATGTCGCTGGGTTAGCTTCTAAAATCCT
GGGGCGTTACAGTTGGTATCAGAGCAGAGTTGTTCCTGTAGACTGGCCTAGGAAATCTAGGTTGTTTGGATGTTTAGGGTTATGGTCTTCCTCGTTCTCCTCTCCATCAC
CAGGCATAGGTTGTTTTCTTAAAGCCTGGGTTGCTGCCTTAGTTCGAGACGTTCCCGAGGTTAATTCGCATCCTGAGGCGAATCCTCCTGTTCCTCCCCCAGCGCCTCCT
GTGCTGGCAGCAGAGGCATTGCAGGCGATGCTTGGCAATGCATTCCTGAATAACCTGCAGCATGTCGGTGCAAATGGAGCCCCTGCTCTTGGCGAAGATGTGCAGTTTCT
CAAGAGTTTCTTGAAGGCGAAGCCTCCTTCATTCGATGGGCACTCGGATAGTTCTGAAGCAGTGGTAGAATGGATCGCCGCATTGGAAGCGATATTTCAATTTCTTGGAG
CTAATGCCCAACAGCGGGTCCAAGGAGCTGCCTTTATGCTCAAAGGCCACGCTCGCACTTGGTGGAACGTTGTGGGTCAAACCGAGAACCGCCCAGAGAATCCCATTTCC
TGGTCGGGGTTCAAAGGTCTTGTGCGGGACCATTTTGGTTGTCGTTTTGCTGGTGTTGAGCAAGAAGCAGAGTTTGTCTCTCTTGTTCAAGGGACCTTGTCTGTGGAGCA
GTACGCCAGAAGGTTTGAAGAGTTATCCTGCCGAGTCCCAGGGTTGGTTGCCACCGAAGAGATTAGGATCAACCGATTCGTTAATGGGCTCCGCGCAGAAATTCGAGGTT
TGGTCCGGCTTGGTCGACCGGCCACCTTTGCAGCAGCCTTAGCGAGCGCTCGGATGCTGGATAGGGACATCCCCAGGACGGAGCAGTCCCAAGAGGTTGGCACGTCATCT
GGTGCCAAGAAGAAGAGCGAAGTGGAAGTGCTTGCAGCTAGTCAGAAGGTCAGAAGTTCTTCGTCAAGATCTAGTGCGTACGCTGAGGAGTTCTTGCCCTGTGTCACCGA
TGAGGAACTCAAGGCAGAATACCCAGAGCTTTACGATGTCGACGATTCTGATGATGAAGATAGCTCCTAA
Protein sequence
MVWNGRFRDVECSQGFLLLEEEDGRVGCCMKAILQPPAAAPCPCVSAASPPNCRAAARAPAPPSPFPSRSHLPSRASSRISLSLPCSDAYSSRSVTPASASAGQHPQLQF
FSFKLACKWLESSSFWHVGRVRADLRSVSACSDSFGVRLSSKHFNLDNPRKNPAVSVSGCAVQQRFRSVWVVTASPNNSAVHRAFDLIYRVQRIPTTRKTLILVTHNPDC
LGAFELPGLGYKGRGLIYHYWCRCLGYKWSRVDMPMLDKEEHRGLGYKWSRVGVMSPEASIEALGINGQGSVRCFVIEALGINGQGSMHSSRPWVGAAYQYLSVLTPSSL
PPTTRFCRMAESFILLNRSVLCITGVSNVAGLASKILGRYSWYQSRVVPVDWPRKSRLFGCLGLWSSSFSSPSPGIGCFLKAWVAALVRDVPEVNSHPEANPPVPPPAPP
VLAAEALQAMLGNAFLNNLQHVGANGAPALGEDVQFLKSFLKAKPPSFDGHSDSSEAVVEWIAALEAIFQFLGANAQQRVQGAAFMLKGHARTWWNVVGQTENRPENPIS
WSGFKGLVRDHFGCRFAGVEQEAEFVSLVQGTLSVEQYARRFEELSCRVPGLVATEEIRINRFVNGLRAEIRGLVRLGRPATFAAALASARMLDRDIPRTEQSQEVGTSS
GAKKKSEVEVLAASQKVRSSSSRSSAYAEEFLPCVTDEELKAEYPELYDVDDSDDEDSS