CuGenDBv2

Lag0027703 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0027703
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Reverse transcriptase
Genome location: chr8:3765826..3769068
RNA-Seq Expression: Lag0027703
Synteny: Lag0027703
Gene Ontology terms: NA
InterPro domains: IPR005162 - Retrotransposon gag domain
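
The genome location field uses a `chromosome:start..end` layout. As a minimal sketch, the string can be split into its parts and the locus length computed. The coordinates are assumed to be 1-based and inclusive, which this page does not state, and the names `parse_location` and `span_length` are hypothetical helpers introduced only for illustration.

```python
import re

# Pattern for strings such as "chr8:3765826..3769068".
LOCATION_RE = re.compile(r"^(?P<chrom>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text):
    """Split 'chrom:start..end' into (chrom, start, end)."""
    match = LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    return match["chrom"], int(match["start"]), int(match["end"])


def span_length(start, end):
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("chr8:3765826..3769068")
print(chrom, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption, the locus spans 3,243 bp.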


Homology
GenBank top hits | e value | % identity | Alignment
XP_022155925.1 uncharacterized protein LOC111022925 [Momordica charantia] | 2.6e-33 | 36.5
Query:  PEVNSHPEANPPIPP--PAPPVLAAEALQAMLGNAFLNNLQHVDAYGAPARG----EDMQFFKIFMKAKPPSFDGHLDSSEAVIEWTAALEAIFQFLGAN
        P       A+  +PP  P   VL AEALQ +L NA   N         P+RG    E++QF + F +  PP F+G  +   A  EW   LEA++ +LG +
Subjt:  PEVNSHPEANPPIPP--PAPPVLAAEALQAMLGNAFLNNLQHVDAYGAPARG----EDMQFFKIFMKAKPPSFDGHLDSSEAVIEWTAALEAIFQFLGAN

Query:  AQQRVQGAAFMLKGHARTWWKVVGQTENRPENPISWSGFKGLVRDHFGCRFADVEQEAEFVSLVQGTLSVEQYVRRFEELSCRVPGLVATEEIRTNRFVN
           +V+GA FML+G A  WW+ V   E+    P++W+ FK L+ +++       E+ AEF+ L Q +L V QY R+F ELS      + TE+++ ++F++
Subjt:  AQQRVQGAAFMLKGHARTWWKVVGQTENRPENPISWSGFKGLVRDHFGCRFADVEQEAEFVSLVQGTLSVEQYVRRFEELSCRVPGLVATEEIRTNRFVN

Query:  GLRAEIRGLVRLGRPATFAAALASARMLDRDIPRTDQSQEVGTSSGAKKKHEVEVFATSQKVR
        GLR EI+GL+ L  P T+AAA+  A ++D+ +      Q +G+SSG K+K     F++SQ  R
Subjt:  GLRAEIRGLVRLGRPATFAAALASARMLDRDIPRTDQSQEVGTSSGAKKKHEVEVFATSQKVR

XP_022156326.1 uncharacterized protein LOC111023247 [Momordica charantia] | 2.1e-35 | 44.33
Query:  DMQFFKIFMKAKPPSFDGHLDSSEAVIEWTAALEAIFQFLGANAQQRVQGAAFMLKGHARTWWKVVGQTENRPENPISWSGFKGLVRDHFGCRFADVEQE
        + +F K F +  PP+FDG  + + AV EW   LEA++ +LG   Q +V+GA FML+G A  WW  V   E+    PI W+ FK L+ D++        +E
Subjt:  DMQFFKIFMKAKPPSFDGHLDSSEAVIEWTAALEAIFQFLGANAQQRVQGAAFMLKGHARTWWKVVGQTENRPENPISWSGFKGLVRDHFGCRFADVEQE

Query:  AEFVSLVQGTLSVEQYVRRFEELSCRVPGLVATEEIRTNRFVNGLRAEIRGLVRLGRPATFAAALASARMLDRDIP-RTDQSQEVGTSSGAKKK
        AEF+ LVQGTLSV QY R+F ELS     L+ TE ++  RFV GLR  IRG V L RP T+A A+  A ++D+D+  +     EVG+SSG K+K
Subjt:  AEFVSLVQGTLSVEQYVRRFEELSCRVPGLVATEEIRTNRFVNGLRAEIRGLVRLGRPATFAAALASARMLDRDIP-RTDQSQEVGTSSGAKKK

XP_022156328.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111023249 [Momordica charantia] | 8.9e-34 | 35.29
Query:  PPIPPPAPP---------VLAAEALQAMLGNA-FLNNLQHVDAYGAPARGEDMQFFKIFMKAKPPSFDGHLDSSEAVIEWTAALEAIFQFLGANAQQRVQ
        PP+P  AP           L AEALQ +L NA      Q      A    +++QF + F    PP F+G  +   A  EW   LEA++ +LG +   +V+
Subjt:  PPIPPPAPP---------VLAAEALQAMLGNA-FLNNLQHVDAYGAPARGEDMQFFKIFMKAKPPSFDGHLDSSEAVIEWTAALEAIFQFLGANAQQRVQ

Query:  GAAFMLKGHARTWWKVVGQTENRPENPISWSGFKGLVRDHFGCRFADVEQEAEFVSLVQGTLSVEQYVRRFEELSCRVPGLVATEEIRTNRFVNGLRAEI
        GA FML+G A  WW+ V   E+    P++W+ FK L+ +++    A  E+  EF+ L QG+L+V QY R+F ELS      V TE+++ ++F++GLR EI
Subjt:  GAAFMLKGHARTWWKVVGQTENRPENPISWSGFKGLVRDHFGCRFADVEQEAEFVSLVQGTLSVEQYVRRFEELSCRVPGLVATEEIRTNRFVNGLRAEI

Query:  RGLVRLGRPATFAAALASARMLDRDIPRTDQSQEVGTSSGAKKKHEVEVFATSQKVRSSSSDLVRALRSSCP
        +GL+ L  P T+AAA+  A ++D+ +      Q +G++SG K+K     FA+    +SS      A R + P
Subjt:  RGLVRLGRPATFAAALASARMLDRDIPRTDQSQEVGTSSGAKKKHEVEVFATSQKVRSSSSDLVRALRSSCP

XP_022156546.1 uncharacterized protein LOC111023424 [Momordica charantia] | 3.1e-34 | 37.9
Query:  PDVPEVNSHPEANPPIPPPAPPVLAAEALQAMLGNAFLNNLQHVDAYGAPARGEDMQFFKIFMKAKPPSFDGHLDSSEAVIEWTAALEAIFQFLGANAQQ
        P  P +       PP PP A   LA     A +G A     +H+    + A     QF K F +  PP+F G  + +    EW   LEA++ +LG   Q 
Subjt:  PDVPEVNSHPEANPPIPPPAPPVLAAEALQAMLGNAFLNNLQHVDAYGAPARGEDMQFFKIFMKAKPPSFDGHLDSSEAVIEWTAALEAIFQFLGANAQQ

Query:  RVQGAAFMLKGHARTWWKVVGQTENRPENPISWSGFKGLVRDHFGCRFADVEQEAEFVSLVQGTLSVEQYVRRFEELSCRVPGLVATEEIRTNRFVNGLR
        +V+GA FML+  A  WW  V  TE+    P+ W+ FK L+ DH+     +  +E EF+ LVQGTL+V QY R+F ELS     L+ TE ++  RFV GL 
Subjt:  RVQGAAFMLKGHARTWWKVVGQTENRPENPISWSGFKGLVRDHFGCRFADVEQEAEFVSLVQGTLSVEQYVRRFEELSCRVPGLVATEEIRTNRFVNGLR

Query:  AEIRGLVRLGRPATFAAALASARMLDRDIP-RTDQSQEVGTSSGAKKK
          IRG V L RP T+A A+    ++D+D+  R     EVG+S G K+K
Subjt:  AEIRGLVRLGRPATFAAALASARMLDRDIP-RTDQSQEVGTSSGAKKK

XP_022157413.1 uncharacterized protein LOC111024114 [Momordica charantia] | 5.2e-34 | 35.21
Query:  GRSGPPDVPEVNSHPEANPPIPPPAPPVLAAEALQAMLGNA-FLNNLQHVDAYGAPARGEDMQFFKIFMKAKPPSFDGHLDSSEAVIEWTAALEAIFQFL
        GR  PP VP+  + P+  P + P     L AEALQ +L NA      Q      A    +++QF + F +  PP F+G  +   A  EW   LEA++ +L
Subjt:  GRSGPPDVPEVNSHPEANPPIPPPAPPVLAAEALQAMLGNA-FLNNLQHVDAYGAPARGEDMQFFKIFMKAKPPSFDGHLDSSEAVIEWTAALEAIFQFL

Query:  GANAQQRVQGAAFMLKGHARTWWKVVGQTENRPENPISWSGFKGLVRDHFGCRFADVEQEAEFVSLVQGTLSVEQYVRRFEELSCRVPGLVATEEIRTNR
        G +   +V+GA FML+G A  WW+ V   E+    P++W+ FK L+ +++       E+ AEF+ L QG+L+V QY R+F ELS      + TE+++ ++
Subjt:  GANAQQRVQGAAFMLKGHARTWWKVVGQTENRPENPISWSGFKGLVRDHFGCRFADVEQEAEFVSLVQGTLSVEQYVRRFEELSCRVPGLVATEEIRTNR

Query:  FVNGLRAEIRGLVRLGRPATFAAALASARMLDRDIPRTDQSQEVGTSSGAKKKHEVEVFATSQKVRSSSSDLVRALRSSCPVSP
        F++GLR EI+GL+ +  P T+AAA+  A ++D+ +      Q +G+SSG K+K    +F++SQ  R     + R  +++ PV P
Subjt:  FVNGLRAEIRGLVRLGRPATFAAALASARMLDRDIPRTDQSQEVGTSSGAKKKHEVEVFATSQKVRSSSSDLVRALRSSCPVSP

TrEMBL top hits | e value | % identity | Alignment
A0A6J1DCW8 uncharacterized protein LOC111019603 | 1.3e-33 | 37.14
Query:  APPVLAAEALQAMLGNAF-LNNLQHVDAYGAPARGEDMQFFKIFMKAKPPSFDGHLDSSEAVIEWTAALEAIFQFLGANAQQRVQGAAFMLKGHARTWWK
        A   L   ALQA++ N+      Q +    A A   + QF + F +  PP+F+G  + +  V EW   LEA++ +LG + Q +V+GA FML+G A  WW 
Subjt:  APPVLAAEALQAMLGNAF-LNNLQHVDAYGAPARGEDMQFFKIFMKAKPPSFDGHLDSSEAVIEWTAALEAIFQFLGANAQQRVQGAAFMLKGHARTWWK

Query:  VVGQTENRPENPISWSGFKGLVRDHFGCRFADVEQEAEFVSLVQGTLSVEQYVRRFEELSCRVPGLVATEEIRTNRFVNGLRAEIRGLVRLGRPATFAAA
        VV   E+    PI+W+  K L+ D++  +    E+E EF+ L Q TL V QY ++F E S     L+ TE  +  RFV GL   I+G + L RP T+A A
Subjt:  VVGQTENRPENPISWSGFKGLVRDHFGCRFADVEQEAEFVSLVQGTLSVEQYVRRFEELSCRVPGLVATEEIRTNRFVNGLRAEIRGLVRLGRPATFAAA

Query:  LASARMLDRD-IPRTDQSQEVGTSSGAKKKHEVEVFATSQKVRSS
        +  A ++D+D I +    Q+VG SSG K+K  V   ++SQ  ++S
Subjt:  LASARMLDRD-IPRTDQSQEVGTSSGAKKKHEVEVFATSQKVRSS

A0A6J1DQB9 Reverse transcriptase | 4.3e-34 | 35.29
Query:  PPIPPPAPP---------VLAAEALQAMLGNA-FLNNLQHVDAYGAPARGEDMQFFKIFMKAKPPSFDGHLDSSEAVIEWTAALEAIFQFLGANAQQRVQ
        PP+P  AP           L AEALQ +L NA      Q      A    +++QF + F    PP F+G  +   A  EW   LEA++ +LG +   +V+
Subjt:  PPIPPPAPP---------VLAAEALQAMLGNA-FLNNLQHVDAYGAPARGEDMQFFKIFMKAKPPSFDGHLDSSEAVIEWTAALEAIFQFLGANAQQRVQ

Query:  GAAFMLKGHARTWWKVVGQTENRPENPISWSGFKGLVRDHFGCRFADVEQEAEFVSLVQGTLSVEQYVRRFEELSCRVPGLVATEEIRTNRFVNGLRAEI
        GA FML+G A  WW+ V   E+    P++W+ FK L+ +++    A  E+  EF+ L QG+L+V QY R+F ELS      V TE+++ ++F++GLR EI
Subjt:  GAAFMLKGHARTWWKVVGQTENRPENPISWSGFKGLVRDHFGCRFADVEQEAEFVSLVQGTLSVEQYVRRFEELSCRVPGLVATEEIRTNRFVNGLRAEI

Query:  RGLVRLGRPATFAAALASARMLDRDIPRTDQSQEVGTSSGAKKKHEVEVFATSQKVRSSSSDLVRALRSSCP
        +GL+ L  P T+AAA+  A ++D+ +      Q +G++SG K+K     FA+    +SS      A R + P
Subjt:  RGLVRLGRPATFAAALASARMLDRDIPRTDQSQEVGTSSGAKKKHEVEVFATSQKVRSSSSDLVRALRSSCP

A0A6J1DTA8 uncharacterized protein LOC111024114 | 2.5e-34 | 35.21
Query:  GRSGPPDVPEVNSHPEANPPIPPPAPPVLAAEALQAMLGNA-FLNNLQHVDAYGAPARGEDMQFFKIFMKAKPPSFDGHLDSSEAVIEWTAALEAIFQFL
        GR  PP VP+  + P+  P + P     L AEALQ +L NA      Q      A    +++QF + F +  PP F+G  +   A  EW   LEA++ +L
Subjt:  GRSGPPDVPEVNSHPEANPPIPPPAPPVLAAEALQAMLGNA-FLNNLQHVDAYGAPARGEDMQFFKIFMKAKPPSFDGHLDSSEAVIEWTAALEAIFQFL

Query:  GANAQQRVQGAAFMLKGHARTWWKVVGQTENRPENPISWSGFKGLVRDHFGCRFADVEQEAEFVSLVQGTLSVEQYVRRFEELSCRVPGLVATEEIRTNR
        G +   +V+GA FML+G A  WW+ V   E+    P++W+ FK L+ +++       E+ AEF+ L QG+L+V QY R+F ELS      + TE+++ ++
Subjt:  GANAQQRVQGAAFMLKGHARTWWKVVGQTENRPENPISWSGFKGLVRDHFGCRFADVEQEAEFVSLVQGTLSVEQYVRRFEELSCRVPGLVATEEIRTNR

Query:  FVNGLRAEIRGLVRLGRPATFAAALASARMLDRDIPRTDQSQEVGTSSGAKKKHEVEVFATSQKVRSSSSDLVRALRSSCPVSP
        F++GLR EI+GL+ +  P T+AAA+  A ++D+ +      Q +G+SSG K+K    +F++SQ  R     + R  +++ PV P
Subjt:  FVNGLRAEIRGLVRLGRPATFAAALASARMLDRDIPRTDQSQEVGTSSGAKKKHEVEVFATSQKVRSSSSDLVRALRSSCPVSP

A0A6J1DUM2 uncharacterized protein LOC111023247 | 1.0e-35 | 44.33
Query:  DMQFFKIFMKAKPPSFDGHLDSSEAVIEWTAALEAIFQFLGANAQQRVQGAAFMLKGHARTWWKVVGQTENRPENPISWSGFKGLVRDHFGCRFADVEQE
        + +F K F +  PP+FDG  + + AV EW   LEA++ +LG   Q +V+GA FML+G A  WW  V   E+    PI W+ FK L+ D++        +E
Subjt:  DMQFFKIFMKAKPPSFDGHLDSSEAVIEWTAALEAIFQFLGANAQQRVQGAAFMLKGHARTWWKVVGQTENRPENPISWSGFKGLVRDHFGCRFADVEQE

Query:  AEFVSLVQGTLSVEQYVRRFEELSCRVPGLVATEEIRTNRFVNGLRAEIRGLVRLGRPATFAAALASARMLDRDIP-RTDQSQEVGTSSGAKKK
        AEF+ LVQGTLSV QY R+F ELS     L+ TE ++  RFV GLR  IRG V L RP T+A A+  A ++D+D+  +     EVG+SSG K+K
Subjt:  AEFVSLVQGTLSVEQYVRRFEELSCRVPGLVATEEIRTNRFVNGLRAEIRGLVRLGRPATFAAALASARMLDRDIP-RTDQSQEVGTSSGAKKK

A0A6J1DVA0 uncharacterized protein LOC111023424 | 1.5e-34 | 37.9
Query:  PDVPEVNSHPEANPPIPPPAPPVLAAEALQAMLGNAFLNNLQHVDAYGAPARGEDMQFFKIFMKAKPPSFDGHLDSSEAVIEWTAALEAIFQFLGANAQQ
        P  P +       PP PP A   LA     A +G A     +H+    + A     QF K F +  PP+F G  + +    EW   LEA++ +LG   Q 
Subjt:  PDVPEVNSHPEANPPIPPPAPPVLAAEALQAMLGNAFLNNLQHVDAYGAPARGEDMQFFKIFMKAKPPSFDGHLDSSEAVIEWTAALEAIFQFLGANAQQ

Query:  RVQGAAFMLKGHARTWWKVVGQTENRPENPISWSGFKGLVRDHFGCRFADVEQEAEFVSLVQGTLSVEQYVRRFEELSCRVPGLVATEEIRTNRFVNGLR
        +V+GA FML+  A  WW  V  TE+    P+ W+ FK L+ DH+     +  +E EF+ LVQGTL+V QY R+F ELS     L+ TE ++  RFV GL 
Subjt:  RVQGAAFMLKGHARTWWKVVGQTENRPENPISWSGFKGLVRDHFGCRFADVEQEAEFVSLVQGTLSVEQYVRRFEELSCRVPGLVATEEIRTNRFVNGLR

Query:  AEIRGLVRLGRPATFAAALASARMLDRDIP-RTDQSQEVGTSSGAKKK
          IRG V L RP T+A A+    ++D+D+  R     EVG+S G K+K
Subjt:  AEIRGLVRLGRPATFAAALASARMLDRDIP-RTDQSQEVGTSSGAKKK

SwissProt top hits | e value | % identity | Alignment
No hits found
Arabidopsis top hits | e value | % identity | Alignment
No hits found

Sequences
CDS sequence
ATGAGTTCCAGTAGCAGTCAAGGTAGTGGACGTTCTGGCCCTCCAGACGTACCCGAGGTTAATTCGCATCCTGAGGCGAATCCTCCCATTCCTCCCCCAGCGCCTCCTGT
GCTGGCAGCAGAGGCATTGCAGGCGATGCTTGGTAATGCATTCCTAAACAACCTGCAGCACGTCGATGCATATGGAGCCCCTGCTCGTGGCGAGGATATGCAGTTTTTCA
AGATCTTCATGAAGGCGAAGCCTCCTTCATTCGATGGGCATTTGGATAGTTCTGAAGCAGTGATAGAATGGACCGCCGCATTGGAAGCAATATTTCAATTTCTTGGAGCT
AATGCTCAACAACGGGTCCAAGGAGCTGCCTTTATGCTCAAAGGCCACGCTCGCACTTGGTGGAAGGTTGTGGGTCAAACCGAGAACCGCCCAGAGAATCCCATTTCCTG
GTCAGGGTTCAAAGGTCTTGTGCGAGACCATTTTGGCTGTCGTTTTGCTGATGTTGAGCAAGAAGCGGAGTTTGTCTCTCTTGTTCAAGGGACCTTGTCTGTGGAGCAGT
ACGTCAGAAGGTTCGAAGAGTTGTCCTGCCGAGTTCCGGGGTTGGTTGCCACCGAAGAGATTAGGACCAACCGATTCGTTAATGGGCTCCGCGCAGAAATTCGAGGTTTG
GTCAGGCTTGGTCGACCAGCCACCTTTGCAGCAGCCCTAGCGAGCGCTCGGATGTTGGATAGGGACATCCCCAGGACGGATCAATCCCAAGAGGTTGGCACGTCATCTGG
TGCCAAGAAAAAGCACGAAGTGGAAGTGTTTGCAACTAGTCAGAAGGTTAGAAGTTCTTCGTCGGATCTAGTGCGTGCGTTGAGGAGTTCTTGCCCTGTGTCACCGATGA
GGAGCTCAAGGCGGAATATCCAGAGCTTTACGATGTCGATGATTCTGATGATGAAGATAGCTCCTAAGGTGGGGAGTCAGCGTACCCCTCGCTCAAGTTTAAGAGTCAAG
GAAGTTAGCGTTCCAAGGATGGCCTTCAAAACCAAGGCTATCGTGGATGTACCTAGTAGGGCAGTAATAAGTCTCGTAGCCATTGCTGAGAAGGATGAATGCAACCGCGC
CGCCGTCGCCTGCAGCCTCGCGCCTCCGCCCGCAACCGCGCCCTCGCGTCGCGCAGCAGCCGTCGCCGCGAGCTCCCGCCGTCGCCGCCGTCTCTCTCTCCAGCCGCAGG
CAATCTCACCGCCGTCGCTCCTCCCCGTTTCCTCCTCGCCGCAGCCGTCGTCGTCGCTGTCCCTCGCCGGAAAACAAAGACCCAAGCGCCGTCGCTCTTTTTCCCCTCTT
TTCCTTGCGTTTCAACAAGGCCGATCTAAGTGTTCCTTGGTTCGATTTGTTAGATCTCGCGCCCTAGCAGCTCGAAGCCTCAGTTCCTCGCGTTTTCTCTCTGTCCAGCA
TCAATTGGCGTTTTCGGCGTCGTTTAGCTATTTCGGTGCTGAATTTTCTGATCTGCAGCGCCGTAAAAGTGATCGATTGAGTTCGGATCACTTAAGCTCGAATACCCATT
GCCCAAGGAGCGTTCTAACACGTTCTTAG
mRNA sequence
ATGAGTTCCAGTAGCAGTCAAGGTAGTGGACGTTCTGGCCCTCCAGACGTACCCGAGGTTAATTCGCATCCTGAGGCGAATCCTCCCATTCCTCCCCCAGCGCCTCCTGT
GCTGGCAGCAGAGGCATTGCAGGCGATGCTTGGTAATGCATTCCTAAACAACCTGCAGCACGTCGATGCATATGGAGCCCCTGCTCGTGGCGAGGATATGCAGTTTTTCA
AGATCTTCATGAAGGCGAAGCCTCCTTCATTCGATGGGCATTTGGATAGTTCTGAAGCAGTGATAGAATGGACCGCCGCATTGGAAGCAATATTTCAATTTCTTGGAGCT
AATGCTCAACAACGGGTCCAAGGAGCTGCCTTTATGCTCAAAGGCCACGCTCGCACTTGGTGGAAGGTTGTGGGTCAAACCGAGAACCGCCCAGAGAATCCCATTTCCTG
GTCAGGGTTCAAAGGTCTTGTGCGAGACCATTTTGGCTGTCGTTTTGCTGATGTTGAGCAAGAAGCGGAGTTTGTCTCTCTTGTTCAAGGGACCTTGTCTGTGGAGCAGT
ACGTCAGAAGGTTCGAAGAGTTGTCCTGCCGAGTTCCGGGGTTGGTTGCCACCGAAGAGATTAGGACCAACCGATTCGTTAATGGGCTCCGCGCAGAAATTCGAGGTTTG
GTCAGGCTTGGTCGACCAGCCACCTTTGCAGCAGCCCTAGCGAGCGCTCGGATGTTGGATAGGGACATCCCCAGGACGGATCAATCCCAAGAGGTTGGCACGTCATCTGG
TGCCAAGAAAAAGCACGAAGTGGAAGTGTTTGCAACTAGTCAGAAGGTTAGAAGTTCTTCGTCGGATCTAGTGCGTGCGTTGAGGAGTTCTTGCCCTGTGTCACCGATGA
GGAGCTCAAGGCGGAATATCCAGAGCTTTACGATGTCGATGATTCTGATGATGAAGATAGCTCCTAAGGTGGGGAGTCAGCGTACCCCTCGCTCAAGTTTAAGAGTCAAG
GAAGTTAGCGTTCCAAGGATGGCCTTCAAAACCAAGGCTATCGTGGATGTACCTAGTAGGGCAGTAATAAGTCTCGTAGCCATTGCTGAGAAGGATGAATGCAACCGCGC
CGCCGTCGCCTGCAGCCTCGCGCCTCCGCCCGCAACCGCGCCCTCGCGTCGCGCAGCAGCCGTCGCCGCGAGCTCCCGCCGTCGCCGCCGTCTCTCTCTCCAGCCGCAGG
CAATCTCACCGCCGTCGCTCCTCCCCGTTTCCTCCTCGCCGCAGCCGTCGTCGTCGCTGTCCCTCGCCGGAAAACAAAGACCCAAGCGCCGTCGCTCTTTTTCCCCTCTT
TTCCTTGCGTTTCAACAAGGCCGATCTAAGTGTTCCTTGGTTCGATTTGTTAGATCTCGCGCCCTAGCAGCTCGAAGCCTCAGTTCCTCGCGTTTTCTCTCTGTCCAGCA
TCAATTGGCGTTTTCGGCGTCGTTTAGCTATTTCGGTGCTGAATTTTCTGATCTGCAGCGCCGTAAAAGTGATCGATTGAGTTCGGATCACTTAAGCTCGAATACCCATT
GCCCAAGGAGCGTTCTAACACGTTCTTAG
Protein sequence
MSSSSSQGSGRSGPPDVPEVNSHPEANPPIPPPAPPVLAAEALQAMLGNAFLNNLQHVDAYGAPARGEDMQFFKIFMKAKPPSFDGHLDSSEAVIEWTAALEAIFQFLGA
NAQQRVQGAAFMLKGHARTWWKVVGQTENRPENPISWSGFKGLVRDHFGCRFADVEQEAEFVSLVQGTLSVEQYVRRFEELSCRVPGLVATEEIRTNRFVNGLRAEIRGL
VRLGRPATFAAALASARMLDRDIPRTDQSQEVGTSSGAKKKHEVEVFATSQKVRSSSSDLVRALRSSCPVSPMRSSRRNIQSFTMSMILMMKIAPKVGSQRTPRSSLRVK
EVSVPRMAFKTKAIVDVPSRAVISLVAIAEKDECNRAAVACSLAPPPATAPSRRAAAVAASSRRRRRLSLQPQAISPPSLLPVSSSPQPSSSLSLAGKQRPKRRRSFSPL
FLAFQQGRSKCSLVRFVRSRALAARSLSSSRFLSVQHQLAFSASFSYFGAEFSDLQRRKSDRLSSDHLSSNTHCPRSVLTRS
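
The protein sequence listed here appears consistent with a frame-1 translation of the CDS under the standard genetic code. The following is a minimal sketch of such a translation, not the database's own pipeline. The function name `translate` is a hypothetical helper, and the use of the standard code (NCBI translation table 1) is an assumption.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons ordered
# by the bases T, C, A, G at each position. '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    # Drop whitespace so a multi-line sequence dump can be pasted in directly.
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE.get(seq[i:i + 3], "X")  # 'X' for unknown codons
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 nucleotides of the CDS shown above.
print(translate("ATGAGTTCCAGTAGCAGTCAAGGTAGTGGA"))  # MSSSSSQGSG
```

Passing the full CDS text (line breaks included) to `translate` gives a sequence that can be compared directly with the listed protein.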