; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0039038 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0039038
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase
Genome locationchr2:34287783..34291371
RNA-Seq ExpressionLag0039038
SyntenyLag0039038
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022151688.1 uncharacterized protein LOC111019603 [Momordica charantia]5.3e-3439.34Show/hide
Query:  APPVLAAEALQAMLGNAF-LNNLQHVGANGAPALGEEVQFIKSFMKAKPPSFDGHSDSSEAVVEWTAALEAIFQFLGANAQQRVQGAAFMLKGHARTWWN
        A   L   ALQA++ N+      Q +    A AL  E QFI+ F +  PP+F+G S+ +  V EW   LEA++ +LG + Q +V+GA FML+G A  WW+
Subjt:  APPVLAAEALQAMLGNAF-LNNLQHVGANGAPALGEEVQFIKSFMKAKPPSFDGHSDSSEAVVEWTAALEAIFQFLGANAQQRVQGAAFMLKGHARTWWN

Query:  VVGQTENRPENPISWSGFKGLVWDHFGCRFADVEQEAEFVSLVQGTLSVEQYARRFEELSCRVPGLVATEEIRINRFVNGLRAEIRGLVRLCRPATFAAA
        VV   E+    PI+W+  K L++D++  +    E+E EF+ L Q TL V QY ++F E S     L+ TE  +I RFV GL   I+G + L RP T+A A
Subjt:  VVGQTENRPENPISWSGFKGLVWDHFGCRFADVEQEAEFVSLVQGTLSVEQYARRFEELSCRVPGLVATEEIRINRFVNGLRAEIRGLVRLCRPATFAAA

Query:  LASARMLDRDI
        +  A ++D+D+
Subjt:  LASARMLDRDI

XP_022156326.1 uncharacterized protein LOC111023247 [Momordica charantia]3.3e-3646.86Show/hide
Query:  EVQFIKSFMKAKPPSFDGHSDSSEAVVEWTAALEAIFQFLGANAQQRVQGAAFMLKGHARTWWNVVGQTENRPENPISWSGFKGLVWDHFGCRFADVEQE
        E +FIK F +  PP+FDG S+ + AV EW   LEA++ +LG   Q +V+GA FML+G A  WW+ V   E+    PI W+ FK L++D++        +E
Subjt:  EVQFIKSFMKAKPPSFDGHSDSSEAVVEWTAALEAIFQFLGANAQQRVQGAAFMLKGHARTWWNVVGQTENRPENPISWSGFKGLVWDHFGCRFADVEQE

Query:  AEFVSLVQGTLSVEQYARRFEELSCRVPGLVATEEIRINRFVNGLRAEIRGLVRLCRPATFAAALASARMLDRDI
        AEF+ LVQGTLSV QY R+F ELS     L+ TE ++I RFV GLR  IRG V L RP T+A A+  A ++D+D+
Subjt:  AEFVSLVQGTLSVEQYARRFEELSCRVPGLVATEEIRINRFVNGLRAEIRGLVRLCRPATFAAALASARMLDRDI

XP_022156328.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111023249 [Momordica charantia]2.0e-3338.56Show/hide
Query:  PPVPPPAPP---------VLAAEALQAMLGNA-FLNNLQHVGANGAPALGEEVQFIKSFMKAKPPSFDGHSDSSEAVVEWTAALEAIFQFLGANAQQRVQ
        PPVP  AP           L AEALQ +L NA      Q      A    +EVQFI+ F    PP F+G S+   A  EW   LEA++ +LG +   +V+
Subjt:  PPVPPPAPP---------VLAAEALQAMLGNA-FLNNLQHVGANGAPALGEEVQFIKSFMKAKPPSFDGHSDSSEAVVEWTAALEAIFQFLGANAQQRVQ

Query:  GAAFMLKGHARTWWNVVGQTENRPENPISWSGFKGLVWDHFGCRFADVEQEAEFVSLVQGTLSVEQYARRFEELSCRVPGLVATEEIRINRFVNGLRAEI
        GA FML+G A  WW  V   E+    P++W+ FK L+++++    A  E+  EF+ L QG+L+V QY R+F ELS      V TE+++I++F++GLR EI
Subjt:  GAAFMLKGHARTWWNVVGQTENRPENPISWSGFKGLVWDHFGCRFADVEQEAEFVSLVQGTLSVEQYARRFEELSCRVPGLVATEEIRINRFVNGLRAEI

Query:  RGLVRLCRPATFAAALASARMLDR--DIPRTDRSLG
        +GL+ L  P T+AAA+  A ++D+  + P++ + +G
Subjt:  RGLVRLCRPATFAAALASARMLDR--DIPRTDRSLG

XP_022156546.1 uncharacterized protein LOC111023424 [Momordica charantia]6.3e-3537.45Show/hide
Query:  PDVPEVNSHPEANPPVPPPAPPVLAAEALQAMLGNAFLNNLQHVGANGAPALGEEVQFIKSFMKAKPPSFDGHSDSSEAVVEWTAALEAIFQFLGANAQQ
        P  P +       PP PP A   LA     A +G A     +H+    +     E QFIK F +  PP+F G S+ +    EW   LEA++ +LG   Q 
Subjt:  PDVPEVNSHPEANPPVPPPAPPVLAAEALQAMLGNAFLNNLQHVGANGAPALGEEVQFIKSFMKAKPPSFDGHSDSSEAVVEWTAALEAIFQFLGANAQQ

Query:  RVQGAAFMLKGHARTWWNVVGQTENRPENPISWSGFKGLVWDHFGCRFADVEQEAEFVSLVQGTLSVEQYARRFEELSCRVPGLVATEEIRINRFVNGLR
        +V+GA FML+  A  WW+ V  TE+    P+ W+ FK L++DH+     +  +E EF+ LVQGTL+V QY R+F ELS     L+ TE ++I RFV GL 
Subjt:  RVQGAAFMLKGHARTWWNVVGQTENRPENPISWSGFKGLVWDHFGCRFADVEQEAEFVSLVQGTLSVEQYARRFEELSCRVPGLVATEEIRINRFVNGLR

Query:  AEIRGLVRLCRPATFAAALASARMLDRDIP-------RTDRSLGLARHLVP
          IRG V L RP T+A A+    ++D+D+            SLG+ R + P
Subjt:  AEIRGLVRLCRPATFAAALASARMLDRDIP-------RTDRSLGLARHLVP

XP_022157413.1 uncharacterized protein LOC111024114 [Momordica charantia]4.5e-3337.96Show/hide
Query:  GRSGPPDVPEVNSHPEANPPVPPPAPPVLAAEALQAMLGNA-FLNNLQHVGANGAPALGEEVQFIKSFMKAKPPSFDGHSDSSEAVVEWTAALEAIFQFL
        GR  PP VP+  + P+  P V P     L AEALQ +L NA      Q      A    +EVQFI+ F +  PP F+G S+   A  EW   LEA++ +L
Subjt:  GRSGPPDVPEVNSHPEANPPVPPPAPPVLAAEALQAMLGNA-FLNNLQHVGANGAPALGEEVQFIKSFMKAKPPSFDGHSDSSEAVVEWTAALEAIFQFL

Query:  GANAQQRVQGAAFMLKGHARTWWNVVGQTENRPENPISWSGFKGLVWDHFGCRFADVEQEAEFVSLVQGTLSVEQYARRFEELSCRVPGLVATEEIRINR
        G +   +V+GA FML+G A  WW  V   E+    P++W+ FK L+++++       E+ AEF+ L QG+L+V QY R+F ELS      + TE+++I++
Subjt:  GANAQQRVQGAAFMLKGHARTWWNVVGQTENRPENPISWSGFKGLVWDHFGCRFADVEQEAEFVSLVQGTLSVEQYARRFEELSCRVPGLVATEEIRINR

Query:  FVNGLRAEIRGLVRLCRPATFAAALASARMLDR--DIPRTDRSLG
        F++GLR EI+GL+ +  P T+AAA+  A ++D+  + P++ + +G
Subjt:  FVNGLRAEIRGLVRLCRPATFAAALASARMLDR--DIPRTDRSLG

TrEMBL top hitse value%identityAlignment
A0A6J1DCW8 uncharacterized protein LOC1110196032.6e-3439.34Show/hide
Query:  APPVLAAEALQAMLGNAF-LNNLQHVGANGAPALGEEVQFIKSFMKAKPPSFDGHSDSSEAVVEWTAALEAIFQFLGANAQQRVQGAAFMLKGHARTWWN
        A   L   ALQA++ N+      Q +    A AL  E QFI+ F +  PP+F+G S+ +  V EW   LEA++ +LG + Q +V+GA FML+G A  WW+
Subjt:  APPVLAAEALQAMLGNAF-LNNLQHVGANGAPALGEEVQFIKSFMKAKPPSFDGHSDSSEAVVEWTAALEAIFQFLGANAQQRVQGAAFMLKGHARTWWN

Query:  VVGQTENRPENPISWSGFKGLVWDHFGCRFADVEQEAEFVSLVQGTLSVEQYARRFEELSCRVPGLVATEEIRINRFVNGLRAEIRGLVRLCRPATFAAA
        VV   E+    PI+W+  K L++D++  +    E+E EF+ L Q TL V QY ++F E S     L+ TE  +I RFV GL   I+G + L RP T+A A
Subjt:  VVGQTENRPENPISWSGFKGLVWDHFGCRFADVEQEAEFVSLVQGTLSVEQYARRFEELSCRVPGLVATEEIRINRFVNGLRAEIRGLVRLCRPATFAAA

Query:  LASARMLDRDI
        +  A ++D+D+
Subjt:  LASARMLDRDI

A0A6J1DQB9 Reverse transcriptase9.8e-3438.56Show/hide
Query:  PPVPPPAPP---------VLAAEALQAMLGNA-FLNNLQHVGANGAPALGEEVQFIKSFMKAKPPSFDGHSDSSEAVVEWTAALEAIFQFLGANAQQRVQ
        PPVP  AP           L AEALQ +L NA      Q      A    +EVQFI+ F    PP F+G S+   A  EW   LEA++ +LG +   +V+
Subjt:  PPVPPPAPP---------VLAAEALQAMLGNA-FLNNLQHVGANGAPALGEEVQFIKSFMKAKPPSFDGHSDSSEAVVEWTAALEAIFQFLGANAQQRVQ

Query:  GAAFMLKGHARTWWNVVGQTENRPENPISWSGFKGLVWDHFGCRFADVEQEAEFVSLVQGTLSVEQYARRFEELSCRVPGLVATEEIRINRFVNGLRAEI
        GA FML+G A  WW  V   E+    P++W+ FK L+++++    A  E+  EF+ L QG+L+V QY R+F ELS      V TE+++I++F++GLR EI
Subjt:  GAAFMLKGHARTWWNVVGQTENRPENPISWSGFKGLVWDHFGCRFADVEQEAEFVSLVQGTLSVEQYARRFEELSCRVPGLVATEEIRINRFVNGLRAEI

Query:  RGLVRLCRPATFAAALASARMLDR--DIPRTDRSLG
        +GL+ L  P T+AAA+  A ++D+  + P++ + +G
Subjt:  RGLVRLCRPATFAAALASARMLDR--DIPRTDRSLG

A0A6J1DTA8 uncharacterized protein LOC1110241142.2e-3337.96Show/hide
Query:  GRSGPPDVPEVNSHPEANPPVPPPAPPVLAAEALQAMLGNA-FLNNLQHVGANGAPALGEEVQFIKSFMKAKPPSFDGHSDSSEAVVEWTAALEAIFQFL
        GR  PP VP+  + P+  P V P     L AEALQ +L NA      Q      A    +EVQFI+ F +  PP F+G S+   A  EW   LEA++ +L
Subjt:  GRSGPPDVPEVNSHPEANPPVPPPAPPVLAAEALQAMLGNA-FLNNLQHVGANGAPALGEEVQFIKSFMKAKPPSFDGHSDSSEAVVEWTAALEAIFQFL

Query:  GANAQQRVQGAAFMLKGHARTWWNVVGQTENRPENPISWSGFKGLVWDHFGCRFADVEQEAEFVSLVQGTLSVEQYARRFEELSCRVPGLVATEEIRINR
        G +   +V+GA FML+G A  WW  V   E+    P++W+ FK L+++++       E+ AEF+ L QG+L+V QY R+F ELS      + TE+++I++
Subjt:  GANAQQRVQGAAFMLKGHARTWWNVVGQTENRPENPISWSGFKGLVWDHFGCRFADVEQEAEFVSLVQGTLSVEQYARRFEELSCRVPGLVATEEIRINR

Query:  FVNGLRAEIRGLVRLCRPATFAAALASARMLDR--DIPRTDRSLG
        F++GLR EI+GL+ +  P T+AAA+  A ++D+  + P++ + +G
Subjt:  FVNGLRAEIRGLVRLCRPATFAAALASARMLDR--DIPRTDRSLG

A0A6J1DUM2 uncharacterized protein LOC1110232471.6e-3646.86Show/hide
Query:  EVQFIKSFMKAKPPSFDGHSDSSEAVVEWTAALEAIFQFLGANAQQRVQGAAFMLKGHARTWWNVVGQTENRPENPISWSGFKGLVWDHFGCRFADVEQE
        E +FIK F +  PP+FDG S+ + AV EW   LEA++ +LG   Q +V+GA FML+G A  WW+ V   E+    PI W+ FK L++D++        +E
Subjt:  EVQFIKSFMKAKPPSFDGHSDSSEAVVEWTAALEAIFQFLGANAQQRVQGAAFMLKGHARTWWNVVGQTENRPENPISWSGFKGLVWDHFGCRFADVEQE

Query:  AEFVSLVQGTLSVEQYARRFEELSCRVPGLVATEEIRINRFVNGLRAEIRGLVRLCRPATFAAALASARMLDRDI
        AEF+ LVQGTLSV QY R+F ELS     L+ TE ++I RFV GLR  IRG V L RP T+A A+  A ++D+D+
Subjt:  AEFVSLVQGTLSVEQYARRFEELSCRVPGLVATEEIRINRFVNGLRAEIRGLVRLCRPATFAAALASARMLDRDI

A0A6J1DVA0 uncharacterized protein LOC1110234243.1e-3537.45Show/hide
Query:  PDVPEVNSHPEANPPVPPPAPPVLAAEALQAMLGNAFLNNLQHVGANGAPALGEEVQFIKSFMKAKPPSFDGHSDSSEAVVEWTAALEAIFQFLGANAQQ
        P  P +       PP PP A   LA     A +G A     +H+    +     E QFIK F +  PP+F G S+ +    EW   LEA++ +LG   Q 
Subjt:  PDVPEVNSHPEANPPVPPPAPPVLAAEALQAMLGNAFLNNLQHVGANGAPALGEEVQFIKSFMKAKPPSFDGHSDSSEAVVEWTAALEAIFQFLGANAQQ

Query:  RVQGAAFMLKGHARTWWNVVGQTENRPENPISWSGFKGLVWDHFGCRFADVEQEAEFVSLVQGTLSVEQYARRFEELSCRVPGLVATEEIRINRFVNGLR
        +V+GA FML+  A  WW+ V  TE+    P+ W+ FK L++DH+     +  +E EF+ LVQGTL+V QY R+F ELS     L+ TE ++I RFV GL 
Subjt:  RVQGAAFMLKGHARTWWNVVGQTENRPENPISWSGFKGLVWDHFGCRFADVEQEAEFVSLVQGTLSVEQYARRFEELSCRVPGLVATEEIRINRFVNGLR

Query:  AEIRGLVRLCRPATFAAALASARMLDRDIP-------RTDRSLGLARHLVP
          IRG V L RP T+A A+    ++D+D+            SLG+ R + P
Subjt:  AEIRGLVRLCRPATFAAALASARMLDRDIP-------RTDRSLGLARHLVP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGGGATTGTCCTTTGATTTGTACGGGTGAGAGTGGCCAGTTCGCCGACTCAATAAGCCTACCATTTTGGGGACAAGACCGAATGGGGAGCTGGGAACGTAGTCTTAC
AAGATGGAATTCACTCCTTCCTGATATGAGGAGAATTCTCAGAAAAGAGAAAATCCAGAGAAAATCCTTAGAGTCTGTTGAGTTCCCACAAGCTCCCAACGCGTATCCTG
CTGAGAATACTGGTGAAACCACGTGGTGGTGTTCGTGGCAAACTCTTCCAGCGAAAAAGGAATTGGAAGTTGCTGTATTTTTAGGAAAAATTCAAAGAATTCGGCGAAAA
GTTCAAGGATTTCTTCAAAGTCCAATGTACGCCGTTATGCTGCCGAAATTTTCGGTGCTCACGGTTTGTTTTGGTCTAGGAGTTAGTAATGTCGCTGGGTTAGCTTTTAA
AATCCTGGGGCGTTACAGTTGGTATCAGAGCAGAGTTGTTCCTGTAGACTGGCCTAGGAAATCTAGGTTGTTTGGATGTTTAGGATTGTGGTCTTCCTCGTTCTCCTCTC
CATCACCAGCGATGAGTTCCAGTAGCAGTCAAGGTAGTGGACGTTCTGGCCCCCCAGACGTTCCCGAGGTTAATTCGCATCCTGAGGCGAATCCTCCTGTTCCTCCCCCA
GCGCCTCCTGTGCTGGCAGCAGAGGCATTGCAGGCGATGCTTGGCAATGCGTTCCTGAACAATCTGCAGCACGTCGGTGCAAATGGAGCCCCTGCTCTTGGCGAAGAAGT
GCAGTTTATCAAGAGCTTCATGAAGGCGAAGCCTCCTTCATTCGATGGGCACTCGGATAGTTCTGAAGCAGTGGTAGAATGGACCGCCGCATTGGAAGCGATATTTCAAT
TTCTTGGAGCTAATGCCCAACAGCGGGTCCAAGGAGCTGCCTTTATGCTCAAAGGCCACGCACGCACTTGGTGGAACGTTGTGGGTCAAACCGAGAACCGCCCAGAGAAT
CCCATTTCCTGGTCGGGGTTCAAAGGTCTTGTGTGGGACCATTTTGGTTGTCGTTTTGCTGATGTTGAGCAAGAAGCAGAGTTTGTCTCTCTTGTTCAAGGGACCTTGTC
TGTGGAGCAGTACGCCAGAAGGTTTGAAGAGTTATCCTGCCGAGTCCCAGGGTTGGTTGCCACCGAGGAGATTAGGATCAACCGATTCGTTAATGGGCTCCGCGCAGAAA
TTCGAGGTTTGGTCCGGCTTTGTCGACCGGCCACTTTTGCAGCAGCTCTAGCAAGCGCTCGGATGTTGGATAGGGACATCCCCAGGACGGATCGGTCCCTAGGGCTGGCA
CGTCATCTGGTGCCAAGAAGAAGAGCGAAGTGGAAGTGCTTGCAGCTAGTCAGAAGGTCAGAAGTTCTCCGTCAGGATCTAGCGCGTGCACTGAGGAGTTCTTGCCCTGT
GTCACCGATGTGGAGCTCAAGGCAGAATACCCAGAGCTTTACGATGTCGATGGTTCTGATGATGAAGATAGTTCCTAAGGTGGGGAGTCAGCATGCCCCTCGCTCAAGAT
CTTCTGTTCCTCAGTTCGTTCAAGGCTCGATTGGTGTCGTCGGGTTTCTAAGGATTTGA
mRNA sequenceShow/hide mRNA sequence
ATGAGGGATTGTCCTTTGATTTGTACGGGTGAGAGTGGCCAGTTCGCCGACTCAATAAGCCTACCATTTTGGGGACAAGACCGAATGGGGAGCTGGGAACGTAGTCTTAC
AAGATGGAATTCACTCCTTCCTGATATGAGGAGAATTCTCAGAAAAGAGAAAATCCAGAGAAAATCCTTAGAGTCTGTTGAGTTCCCACAAGCTCCCAACGCGTATCCTG
CTGAGAATACTGGTGAAACCACGTGGTGGTGTTCGTGGCAAACTCTTCCAGCGAAAAAGGAATTGGAAGTTGCTGTATTTTTAGGAAAAATTCAAAGAATTCGGCGAAAA
GTTCAAGGATTTCTTCAAAGTCCAATGTACGCCGTTATGCTGCCGAAATTTTCGGTGCTCACGGTTTGTTTTGGTCTAGGAGTTAGTAATGTCGCTGGGTTAGCTTTTAA
AATCCTGGGGCGTTACAGTTGGTATCAGAGCAGAGTTGTTCCTGTAGACTGGCCTAGGAAATCTAGGTTGTTTGGATGTTTAGGATTGTGGTCTTCCTCGTTCTCCTCTC
CATCACCAGCGATGAGTTCCAGTAGCAGTCAAGGTAGTGGACGTTCTGGCCCCCCAGACGTTCCCGAGGTTAATTCGCATCCTGAGGCGAATCCTCCTGTTCCTCCCCCA
GCGCCTCCTGTGCTGGCAGCAGAGGCATTGCAGGCGATGCTTGGCAATGCGTTCCTGAACAATCTGCAGCACGTCGGTGCAAATGGAGCCCCTGCTCTTGGCGAAGAAGT
GCAGTTTATCAAGAGCTTCATGAAGGCGAAGCCTCCTTCATTCGATGGGCACTCGGATAGTTCTGAAGCAGTGGTAGAATGGACCGCCGCATTGGAAGCGATATTTCAAT
TTCTTGGAGCTAATGCCCAACAGCGGGTCCAAGGAGCTGCCTTTATGCTCAAAGGCCACGCACGCACTTGGTGGAACGTTGTGGGTCAAACCGAGAACCGCCCAGAGAAT
CCCATTTCCTGGTCGGGGTTCAAAGGTCTTGTGTGGGACCATTTTGGTTGTCGTTTTGCTGATGTTGAGCAAGAAGCAGAGTTTGTCTCTCTTGTTCAAGGGACCTTGTC
TGTGGAGCAGTACGCCAGAAGGTTTGAAGAGTTATCCTGCCGAGTCCCAGGGTTGGTTGCCACCGAGGAGATTAGGATCAACCGATTCGTTAATGGGCTCCGCGCAGAAA
TTCGAGGTTTGGTCCGGCTTTGTCGACCGGCCACTTTTGCAGCAGCTCTAGCAAGCGCTCGGATGTTGGATAGGGACATCCCCAGGACGGATCGGTCCCTAGGGCTGGCA
CGTCATCTGGTGCCAAGAAGAAGAGCGAAGTGGAAGTGCTTGCAGCTAGTCAGAAGGTCAGAAGTTCTCCGTCAGGATCTAGCGCGTGCACTGAGGAGTTCTTGCCCTGT
GTCACCGATGTGGAGCTCAAGGCAGAATACCCAGAGCTTTACGATGTCGATGGTTCTGATGATGAAGATAGTTCCTAAGGTGGGGAGTCAGCATGCCCCTCGCTCAAGAT
CTTCTGTTCCTCAGTTCGTTCAAGGCTCGATTGGTGTCGTCGGGTTTCTAAGGATTTGA
Protein sequenceShow/hide protein sequence
MRDCPLICTGESGQFADSISLPFWGQDRMGSWERSLTRWNSLLPDMRRILRKEKIQRKSLESVEFPQAPNAYPAENTGETTWWCSWQTLPAKKELEVAVFLGKIQRIRRK
VQGFLQSPMYAVMLPKFSVLTVCFGLGVSNVAGLAFKILGRYSWYQSRVVPVDWPRKSRLFGCLGLWSSSFSSPSPAMSSSSSQGSGRSGPPDVPEVNSHPEANPPVPPP
APPVLAAEALQAMLGNAFLNNLQHVGANGAPALGEEVQFIKSFMKAKPPSFDGHSDSSEAVVEWTAALEAIFQFLGANAQQRVQGAAFMLKGHARTWWNVVGQTENRPEN
PISWSGFKGLVWDHFGCRFADVEQEAEFVSLVQGTLSVEQYARRFEELSCRVPGLVATEEIRINRFVNGLRAEIRGLVRLCRPATFAAALASARMLDRDIPRTDRSLGLA
RHLVPRRRAKWKCLQLVRRSEVLRQDLARALRSSCPVSPMWSSRQNTQSFTMSMVLMMKIVPKVGSQHAPRSRSSVPQFVQGSIGVVGFLRI